CN110627797A - 用于治疗增殖性和自身免疫疾病的非对称吡咯并苯并二氮杂卓二聚物 - Google Patents
用于治疗增殖性和自身免疫疾病的非对称吡咯并苯并二氮杂卓二聚物 Download PDFInfo
- Publication number
- CN110627797A CN110627797A CN201910920109.XA CN201910920109A CN110627797A CN 110627797 A CN110627797 A CN 110627797A CN 201910920109 A CN201910920109 A CN 201910920109A CN 110627797 A CN110627797 A CN 110627797A
- Authority
- CN
- China
- Prior art keywords
- group
- alkyl
- radical
- genbank
- attachment
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000011282 treatment Methods 0.000 title abstract description 30
- 230000002062 proliferating effect Effects 0.000 title abstract description 15
- 208000023275 Autoimmune disease Diseases 0.000 title abstract description 10
- YUOCYTRGANSSRY-UHFFFAOYSA-N pyrrolo[2,3-i][1,2]benzodiazepine Chemical class C1=CN=NC2=C3C=CN=C3C=CC2=C1 YUOCYTRGANSSRY-UHFFFAOYSA-N 0.000 title abstract description 9
- 150000001875 compounds Chemical class 0.000 claims abstract description 183
- 150000003839 salts Chemical class 0.000 claims abstract description 31
- -1 2- (trimethylsilyl) ethoxycarbonyl Chemical group 0.000 claims description 209
- 239000003814 drug Substances 0.000 claims description 114
- 229940079593 drug Drugs 0.000 claims description 106
- 238000000034 method Methods 0.000 claims description 91
- 125000005647 linker group Chemical group 0.000 claims description 86
- 239000003446 ligand Substances 0.000 claims description 83
- 125000000217 alkyl group Chemical group 0.000 claims description 75
- 125000001424 substituent group Chemical group 0.000 claims description 55
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 claims description 45
- 125000003118 aryl group Chemical group 0.000 claims description 44
- 108010016626 Dipeptides Proteins 0.000 claims description 43
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 claims description 43
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical group N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 claims description 37
- 229910052736 halogen Inorganic materials 0.000 claims description 32
- 150000002367 halogens Chemical class 0.000 claims description 32
- 125000000956 methoxy group Chemical group [H]C([H])([H])O* 0.000 claims description 29
- 125000004432 carbon atom Chemical group C* 0.000 claims description 28
- 229910052739 hydrogen Inorganic materials 0.000 claims description 28
- 229910052760 oxygen Inorganic materials 0.000 claims description 26
- 229910052717 sulfur Inorganic materials 0.000 claims description 25
- 229910052757 nitrogen Inorganic materials 0.000 claims description 23
- 125000000539 amino acid group Chemical group 0.000 claims description 22
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 claims description 20
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 claims description 19
- UHOVQNZJYSORNB-UHFFFAOYSA-N Benzene Chemical compound C1=CC=CC=C1 UHOVQNZJYSORNB-UHFFFAOYSA-N 0.000 claims description 18
- 125000001559 cyclopropyl group Chemical group [H]C1([H])C([H])([H])C1([H])* 0.000 claims description 18
- 125000000449 nitro group Chemical group [O-][N+](*)=O 0.000 claims description 18
- 150000001768 cations Chemical class 0.000 claims description 16
- OMRPLUKQNWNZAV-CONSDPRKSA-N (6as)-3-[3-[[(6as)-2-methoxy-8-(4-methoxyphenyl)-11-oxo-6a,7-dihydropyrrolo[2,1-c][1,4]benzodiazepin-3-yl]oxy]propoxy]-8-(4-aminophenyl)-2-methoxy-6a,7-dihydropyrrolo[2,1-c][1,4]benzodiazepin-11-one Chemical compound C1=CC(OC)=CC=C1C1=CN2C(=O)C3=CC(OC)=C(OCCCOC=4C(=CC=5C(=O)N6C=C(C[C@H]6C=NC=5C=4)C=4C=CC(N)=CC=4)OC)C=C3N=C[C@@H]2C1 OMRPLUKQNWNZAV-CONSDPRKSA-N 0.000 claims description 15
- JUJWROOIHBZHMG-UHFFFAOYSA-N Pyridine Chemical compound C1=CC=NC=C1 JUJWROOIHBZHMG-UHFFFAOYSA-N 0.000 claims description 15
- 125000003342 alkenyl group Chemical group 0.000 claims description 15
- 125000002947 alkylene group Chemical group 0.000 claims description 14
- 125000005842 heteroatom Chemical group 0.000 claims description 14
- 125000004076 pyridyl group Chemical group 0.000 claims description 14
- 229910052731 fluorine Inorganic materials 0.000 claims description 13
- 238000006467 substitution reaction Methods 0.000 claims description 13
- 125000004185 ester group Chemical group 0.000 claims description 12
- 125000003356 phenylsulfanyl group Chemical group [*]SC1=C([H])C([H])=C([H])C([H])=C1[H] 0.000 claims description 12
- 125000003088 (fluoren-9-ylmethoxy)carbonyl group Chemical group 0.000 claims description 11
- 125000000623 heterocyclic group Chemical group 0.000 claims description 11
- 125000000753 cycloalkyl group Chemical group 0.000 claims description 9
- 229910052727 yttrium Inorganic materials 0.000 claims description 9
- 125000000304 alkynyl group Chemical group 0.000 claims description 8
- 125000001931 aliphatic group Chemical group 0.000 claims description 7
- 229910052799 carbon Inorganic materials 0.000 claims description 7
- 125000004093 cyano group Chemical group *C#N 0.000 claims description 7
- 108010021625 Immunoglobulin Fragments Proteins 0.000 claims description 6
- 102000008394 Immunoglobulin Fragments Human genes 0.000 claims description 6
- 150000002825 nitriles Chemical group 0.000 claims description 6
- UMJSCPRVCHMLSP-UHFFFAOYSA-N pyridine Natural products COC1=CC=CN=C1 UMJSCPRVCHMLSP-UHFFFAOYSA-N 0.000 claims description 6
- 125000003545 alkoxy group Chemical group 0.000 claims description 5
- 125000005907 alkyl ester group Chemical group 0.000 claims description 5
- 125000001436 propyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])[H] 0.000 claims description 5
- 108010001267 Protein Subunits Proteins 0.000 claims description 4
- 125000001153 fluoro group Chemical group F* 0.000 claims description 4
- 125000000717 hydrazino group Chemical group [H]N([*])N([H])[H] 0.000 claims description 4
- 125000001624 naphthyl group Chemical group 0.000 claims description 4
- 125000001309 chloro group Chemical group Cl* 0.000 claims description 3
- 125000005519 fluorenylmethyloxycarbonyl group Chemical group 0.000 claims description 3
- 125000001313 C5-C10 heteroaryl group Chemical group 0.000 claims 3
- 125000003828 azulenyl group Chemical group 0.000 claims 3
- 241001255830 Thema Species 0.000 claims 1
- 125000005931 tert-butyloxycarbonyl group Chemical group [H]C([H])([H])C(OC(*)=O)(C([H])([H])[H])C([H])([H])[H] 0.000 claims 1
- 239000012453 solvate Substances 0.000 abstract description 22
- 150000003254 radicals Chemical class 0.000 description 134
- 108090000765 processed proteins & peptides Proteins 0.000 description 123
- 102000004196 processed proteins & peptides Human genes 0.000 description 110
- 229920001184 polypeptide Polymers 0.000 description 104
- 238000002360 preparation method Methods 0.000 description 90
- 210000004027 cell Anatomy 0.000 description 85
- 241000282414 Homo sapiens Species 0.000 description 84
- 239000002773 nucleotide Substances 0.000 description 81
- 125000003729 nucleotide group Chemical group 0.000 description 81
- 125000003275 alpha amino acid group Chemical group 0.000 description 69
- 239000000203 mixture Substances 0.000 description 58
- 239000011230 binding agent Substances 0.000 description 56
- 239000000562 conjugate Substances 0.000 description 51
- 206010028980 Neoplasm Diseases 0.000 description 50
- 239000000427 antigen Substances 0.000 description 46
- 108091007433 antigens Proteins 0.000 description 45
- 102000036639 antigens Human genes 0.000 description 45
- 108090000623 proteins and genes Proteins 0.000 description 45
- 125000006239 protecting group Chemical group 0.000 description 44
- 238000009739 binding Methods 0.000 description 42
- WMAXWOOEPJQXEB-UHFFFAOYSA-N 2-phenyl-5-(4-phenylphenyl)-1,3,4-oxadiazole Chemical compound C1=CC=CC=C1C1=NN=C(C=2C=CC(=CC=2)C=2C=CC=CC=2)O1 WMAXWOOEPJQXEB-UHFFFAOYSA-N 0.000 description 40
- 230000027455 binding Effects 0.000 description 39
- 238000003786 synthesis reaction Methods 0.000 description 39
- 239000000611 antibody drug conjugate Substances 0.000 description 38
- 229940049595 antibody-drug conjugate Drugs 0.000 description 38
- 238000006243 chemical reaction Methods 0.000 description 38
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 37
- 235000001014 amino acid Nutrition 0.000 description 37
- 229940024606 amino acid Drugs 0.000 description 36
- 108010047041 Complementarity Determining Regions Proteins 0.000 description 33
- DNIAPMSPPWPWGF-UHFFFAOYSA-N Propylene glycol Chemical compound CC(O)CO DNIAPMSPPWPWGF-UHFFFAOYSA-N 0.000 description 33
- 150000001413 amino acids Chemical class 0.000 description 33
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 32
- 230000015572 biosynthetic process Effects 0.000 description 29
- 201000011510 cancer Diseases 0.000 description 27
- 239000000243 solution Substances 0.000 description 27
- 229910052796 boron Inorganic materials 0.000 description 26
- 235000018102 proteins Nutrition 0.000 description 26
- 102000004169 proteins and genes Human genes 0.000 description 26
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 25
- 210000004408 hybridoma Anatomy 0.000 description 25
- KXDHJXZQYSOELW-UHFFFAOYSA-M Carbamate Chemical compound NC([O-])=O KXDHJXZQYSOELW-UHFFFAOYSA-M 0.000 description 24
- HEDRZPFGACZZDS-MICDWDOJSA-N Trichloro(2H)methane Chemical compound [2H]C(Cl)(Cl)Cl HEDRZPFGACZZDS-MICDWDOJSA-N 0.000 description 24
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 24
- 125000000524 functional group Chemical group 0.000 description 24
- 125000003277 amino group Chemical group 0.000 description 23
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 23
- YMWUJEATGCHHMB-UHFFFAOYSA-N Dichloromethane Chemical compound ClCCl YMWUJEATGCHHMB-UHFFFAOYSA-N 0.000 description 21
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 21
- 230000002829 reductive effect Effects 0.000 description 21
- 108060003951 Immunoglobulin Proteins 0.000 description 20
- 239000003153 chemical reaction reagent Substances 0.000 description 20
- 102000018358 immunoglobulin Human genes 0.000 description 20
- 125000006413 ring segment Chemical group 0.000 description 20
- 102000005600 Cathepsins Human genes 0.000 description 19
- 108010084457 Cathepsins Proteins 0.000 description 19
- 102100041003 Glutamate carboxypeptidase 2 Human genes 0.000 description 19
- 239000002904 solvent Substances 0.000 description 19
- 102100021435 Macrophage-stimulating protein receptor Human genes 0.000 description 18
- 201000010099 disease Diseases 0.000 description 17
- 239000000047 product Substances 0.000 description 17
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 16
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 16
- 101001106413 Homo sapiens Macrophage-stimulating protein receptor Proteins 0.000 description 16
- 102100034256 Mucin-1 Human genes 0.000 description 16
- 235000019253 formic acid Nutrition 0.000 description 16
- 238000011068 loading method Methods 0.000 description 16
- 239000012634 fragment Substances 0.000 description 15
- 239000001257 hydrogen Substances 0.000 description 15
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 15
- 102100033579 Trophoblast glycoprotein Human genes 0.000 description 14
- 238000005859 coupling reaction Methods 0.000 description 14
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 14
- 238000005160 1H NMR spectroscopy Methods 0.000 description 13
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 13
- 239000002253 acid Substances 0.000 description 13
- 239000000539 dimer Substances 0.000 description 13
- XKRFYHLGVUSROY-UHFFFAOYSA-N Argon Chemical compound [Ar] XKRFYHLGVUSROY-UHFFFAOYSA-N 0.000 description 12
- 102100038080 B-cell receptor CD22 Human genes 0.000 description 12
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 12
- 102000004190 Enzymes Human genes 0.000 description 12
- 108090000790 Enzymes Proteins 0.000 description 12
- 239000004606 Fillers/Extenders Substances 0.000 description 12
- WYURNTSHIVDZCO-UHFFFAOYSA-N Tetrahydrofuran Chemical compound C1CCOC1 WYURNTSHIVDZCO-UHFFFAOYSA-N 0.000 description 12
- 230000009471 action Effects 0.000 description 12
- 125000003368 amide group Chemical group 0.000 description 12
- 150000001412 amines Chemical class 0.000 description 12
- 239000003795 chemical substances by application Substances 0.000 description 12
- 230000008878 coupling Effects 0.000 description 12
- 238000010168 coupling process Methods 0.000 description 12
- 235000019439 ethyl acetate Nutrition 0.000 description 12
- 238000009472 formulation Methods 0.000 description 12
- 150000002466 imines Chemical class 0.000 description 12
- 229920006395 saturated elastomer Polymers 0.000 description 12
- 125000006850 spacer group Chemical group 0.000 description 12
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 11
- 102100034459 Hepatitis A virus cellular receptor 1 Human genes 0.000 description 11
- GDIHDSKOVXEJQQ-UHFFFAOYSA-N NC(O)=O.NC(O)=O.NC(O)=O.N Chemical group NC(O)=O.NC(O)=O.NC(O)=O.N GDIHDSKOVXEJQQ-UHFFFAOYSA-N 0.000 description 11
- 238000007792 addition Methods 0.000 description 11
- 210000004369 blood Anatomy 0.000 description 11
- 239000008280 blood Substances 0.000 description 11
- 239000012267 brine Substances 0.000 description 11
- 150000002148 esters Chemical class 0.000 description 11
- 125000005439 maleimidyl group Chemical group C1(C=CC(N1*)=O)=O 0.000 description 11
- 229920001481 poly(stearyl methacrylate) Polymers 0.000 description 11
- 239000011541 reaction mixture Substances 0.000 description 11
- 238000002390 rotary evaporation Methods 0.000 description 11
- HPALAKNZSZLMCH-UHFFFAOYSA-M sodium;chloride;hydrate Chemical compound O.[Na+].[Cl-] HPALAKNZSZLMCH-UHFFFAOYSA-M 0.000 description 11
- 239000000126 substance Substances 0.000 description 11
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 10
- 102100024423 Carbonic anhydrase 9 Human genes 0.000 description 10
- 102100022623 Hepatocyte growth factor receptor Human genes 0.000 description 10
- 101000892862 Homo sapiens Glutamate carboxypeptidase 2 Proteins 0.000 description 10
- KDLHZDBZIXYQEI-UHFFFAOYSA-N Palladium Chemical compound [Pd] KDLHZDBZIXYQEI-UHFFFAOYSA-N 0.000 description 10
- 239000003242 anti bacterial agent Substances 0.000 description 10
- 229940088710 antibiotic agent Drugs 0.000 description 10
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 10
- 102000005962 receptors Human genes 0.000 description 10
- 108020003175 receptors Proteins 0.000 description 10
- 102000001301 EGF receptor Human genes 0.000 description 9
- 102100022337 Integrin alpha-V Human genes 0.000 description 9
- 102100026878 Interleukin-2 receptor subunit alpha Human genes 0.000 description 9
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 9
- PEEHTFAAVSWFBL-UHFFFAOYSA-N Maleimide Chemical compound O=C1NC(=O)C=C1 PEEHTFAAVSWFBL-UHFFFAOYSA-N 0.000 description 9
- 102100021969 Nucleotide pyrophosphatase Human genes 0.000 description 9
- 210000001744 T-lymphocyte Anatomy 0.000 description 9
- 108010008125 Tenascin Proteins 0.000 description 9
- 102100023935 Transmembrane glycoprotein NMB Human genes 0.000 description 9
- 239000002246 antineoplastic agent Substances 0.000 description 9
- 150000005840 aryl radicals Chemical class 0.000 description 9
- 239000000969 carrier Substances 0.000 description 9
- 235000018417 cysteine Nutrition 0.000 description 9
- 150000002431 hydrogen Chemical class 0.000 description 9
- 125000004435 hydrogen atom Chemical group [H]* 0.000 description 9
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 9
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 9
- CSNNHWWHGAXBCP-UHFFFAOYSA-L magnesium sulphate Substances [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 9
- 239000000178 monomer Substances 0.000 description 9
- 239000000546 pharmaceutical excipient Substances 0.000 description 9
- 125000005420 sulfonamido group Chemical group S(=O)(=O)(N*)* 0.000 description 9
- 230000008685 targeting Effects 0.000 description 9
- 230000001225 therapeutic effect Effects 0.000 description 9
- ONACFYWVMLRFIX-UHFFFAOYSA-N 1H-1,4-diazepin-2-yl trifluoromethanesulfonate Chemical compound FC(S(=O)(=O)OC=1NC=CC=NC=1)(F)F ONACFYWVMLRFIX-UHFFFAOYSA-N 0.000 description 8
- DJQYYYCQOZMCRC-UHFFFAOYSA-N 2-aminopropane-1,3-dithiol Chemical compound SCC(N)CS DJQYYYCQOZMCRC-UHFFFAOYSA-N 0.000 description 8
- 102100024217 CAMPATH-1 antigen Human genes 0.000 description 8
- 101001046677 Homo sapiens Integrin alpha-V Proteins 0.000 description 8
- 101001133056 Homo sapiens Mucin-1 Proteins 0.000 description 8
- 101000801433 Homo sapiens Trophoblast glycoprotein Proteins 0.000 description 8
- 102000007000 Tenascin Human genes 0.000 description 8
- 102100037236 Tyrosine-protein kinase receptor UFO Human genes 0.000 description 8
- 102100038929 V-set domain-containing T-cell activation inhibitor 1 Human genes 0.000 description 8
- 210000003719 b-lymphocyte Anatomy 0.000 description 8
- 210000000349 chromosome Anatomy 0.000 description 8
- 238000003776 cleavage reaction Methods 0.000 description 8
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 8
- 229940127089 cytotoxic agent Drugs 0.000 description 8
- 238000001514 detection method Methods 0.000 description 8
- 230000000694 effects Effects 0.000 description 8
- 239000007788 liquid Substances 0.000 description 8
- 239000012071 phase Substances 0.000 description 8
- 230000008569 process Effects 0.000 description 8
- 230000007017 scission Effects 0.000 description 8
- 241000894007 species Species 0.000 description 8
- 125000000999 tert-butyl group Chemical group [H]C([H])([H])C(*)(C([H])([H])[H])C([H])([H])[H] 0.000 description 8
- 238000002560 therapeutic procedure Methods 0.000 description 8
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 8
- 102100022005 B-lymphocyte antigen CD20 Human genes 0.000 description 7
- 102100032556 C-type lectin domain family 14 member A Human genes 0.000 description 7
- 102100030074 Dickkopf-related protein 1 Human genes 0.000 description 7
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 7
- 102100022662 Guanylyl cyclase C Human genes 0.000 description 7
- 102100039813 Inactive tyrosine-protein kinase 7 Human genes 0.000 description 7
- 102100031586 Leukocyte antigen CD37 Human genes 0.000 description 7
- 108010008707 Mucin-1 Proteins 0.000 description 7
- 241001529936 Murinae Species 0.000 description 7
- 102100029198 SLAM family member 7 Human genes 0.000 description 7
- NKANXQFJJICGDU-QPLCGJKRSA-N Tamoxifen Chemical compound C=1C=CC=CC=1C(/CC)=C(C=1C=CC(OCCN(C)C)=CC=1)/C1=CC=CC=C1 NKANXQFJJICGDU-QPLCGJKRSA-N 0.000 description 7
- 102100036857 Tumor necrosis factor receptor superfamily member 8 Human genes 0.000 description 7
- 238000004458 analytical method Methods 0.000 description 7
- 230000001363 autoimmune Effects 0.000 description 7
- 230000008901 benefit Effects 0.000 description 7
- 239000012043 crude product Substances 0.000 description 7
- 125000004122 cyclic group Chemical group 0.000 description 7
- 238000011161 development Methods 0.000 description 7
- 230000018109 developmental process Effects 0.000 description 7
- 208000035475 disorder Diseases 0.000 description 7
- 230000014509 gene expression Effects 0.000 description 7
- 239000004220 glutamic acid Substances 0.000 description 7
- 230000002163 immunogen Effects 0.000 description 7
- 238000000746 purification Methods 0.000 description 7
- 239000000741 silica gel Substances 0.000 description 7
- 229910002027 silica gel Inorganic materials 0.000 description 7
- 125000003396 thiol group Chemical class [H]S* 0.000 description 7
- 102100040842 3-galactosyl-N-acetylglucosaminide 4-alpha-L-fucosyltransferase FUT3 Human genes 0.000 description 6
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 6
- 108010008014 B-Cell Maturation Antigen Proteins 0.000 description 6
- 102100025221 CD70 antigen Human genes 0.000 description 6
- KXDHJXZQYSOELW-UHFFFAOYSA-N Carbamic acid Chemical compound NC(O)=O KXDHJXZQYSOELW-UHFFFAOYSA-N 0.000 description 6
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 6
- 108060006698 EGF receptor Proteins 0.000 description 6
- 102100030340 Ephrin type-A receptor 2 Human genes 0.000 description 6
- GHASVSINZRGABV-UHFFFAOYSA-N Fluorouracil Chemical compound FC1=CNC(=O)NC1=O GHASVSINZRGABV-UHFFFAOYSA-N 0.000 description 6
- IAJILQKETJEXLJ-UHFFFAOYSA-N Galacturonsaeure Natural products O=CC(O)C(O)C(O)C(O)C(O)=O IAJILQKETJEXLJ-UHFFFAOYSA-N 0.000 description 6
- 101710183768 Glutamate carboxypeptidase 2 Proteins 0.000 description 6
- 102100030595 HLA class II histocompatibility antigen gamma chain Human genes 0.000 description 6
- 101000897405 Homo sapiens B-lymphocyte antigen CD20 Proteins 0.000 description 6
- 101000934356 Homo sapiens CD70 antigen Proteins 0.000 description 6
- 101000910338 Homo sapiens Carbonic anhydrase 9 Proteins 0.000 description 6
- 101001055144 Homo sapiens Interleukin-2 receptor subunit alpha Proteins 0.000 description 6
- 101000777628 Homo sapiens Leukocyte antigen CD37 Proteins 0.000 description 6
- 101000628535 Homo sapiens Metalloreductase STEAP2 Proteins 0.000 description 6
- 101000904724 Homo sapiens Transmembrane glycoprotein NMB Proteins 0.000 description 6
- 101000807561 Homo sapiens Tyrosine-protein kinase receptor UFO Proteins 0.000 description 6
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 6
- 239000004472 Lysine Substances 0.000 description 6
- 102100026711 Metalloreductase STEAP2 Human genes 0.000 description 6
- 241000699666 Mus <mouse, genus> Species 0.000 description 6
- 102100025243 Myeloid cell surface antigen CD33 Human genes 0.000 description 6
- GLUUGHFHXGJENI-UHFFFAOYSA-N Piperazine Chemical compound C1CNCCN1 GLUUGHFHXGJENI-UHFFFAOYSA-N 0.000 description 6
- NQRYJNQNLNOLGT-UHFFFAOYSA-N Piperidine Chemical compound C1CCNCC1 NQRYJNQNLNOLGT-UHFFFAOYSA-N 0.000 description 6
- 102100023832 Prolyl endopeptidase FAP Human genes 0.000 description 6
- 101710100969 Receptor tyrosine-protein kinase erbB-3 Proteins 0.000 description 6
- CDBYLPFSWZWCQE-UHFFFAOYSA-L Sodium Carbonate Chemical compound [Na+].[Na+].[O-]C([O-])=O CDBYLPFSWZWCQE-UHFFFAOYSA-L 0.000 description 6
- 102100035721 Syndecan-1 Human genes 0.000 description 6
- YXFVVABEGXRONW-UHFFFAOYSA-N Toluene Chemical compound CC1=CC=CC=C1 YXFVVABEGXRONW-UHFFFAOYSA-N 0.000 description 6
- ZMANZCXQSJIPKH-UHFFFAOYSA-N Triethylamine Chemical compound CCN(CC)CC ZMANZCXQSJIPKH-UHFFFAOYSA-N 0.000 description 6
- 239000007983 Tris buffer Substances 0.000 description 6
- 101710190034 Trophoblast glycoprotein Proteins 0.000 description 6
- 102100033726 Tumor necrosis factor receptor superfamily member 17 Human genes 0.000 description 6
- 102100023543 Vascular cell adhesion protein 1 Human genes 0.000 description 6
- 102100023144 Zinc transporter ZIP6 Human genes 0.000 description 6
- 239000004480 active ingredient Substances 0.000 description 6
- 229910052786 argon Inorganic materials 0.000 description 6
- 125000002915 carbonyl group Chemical group [*:2]C([*:1])=O 0.000 description 6
- 238000004587 chromatography analysis Methods 0.000 description 6
- 239000003085 diluting agent Substances 0.000 description 6
- 239000004615 ingredient Substances 0.000 description 6
- 230000001404 mediated effect Effects 0.000 description 6
- 229940087646 methanolamine Drugs 0.000 description 6
- 239000008194 pharmaceutical composition Substances 0.000 description 6
- 210000002307 prostate Anatomy 0.000 description 6
- 238000006722 reduction reaction Methods 0.000 description 6
- 239000007787 solid Substances 0.000 description 6
- 238000003756 stirring Methods 0.000 description 6
- 150000003573 thiols Chemical class 0.000 description 6
- QGZKDVFQNNGYKY-UHFFFAOYSA-O Ammonium Chemical compound [NH4+] QGZKDVFQNNGYKY-UHFFFAOYSA-O 0.000 description 5
- 102100027203 B-cell antigen receptor complex-associated protein beta chain Human genes 0.000 description 5
- 102100024222 B-lymphocyte antigen CD19 Human genes 0.000 description 5
- 102100025475 Carcinoembryonic antigen-related cell adhesion molecule 5 Human genes 0.000 description 5
- 108020004414 DNA Proteins 0.000 description 5
- 102100037241 Endoglin Human genes 0.000 description 5
- 101000942280 Homo sapiens C-type lectin domain family 14 member A Proteins 0.000 description 5
- 101001082627 Homo sapiens HLA class II histocompatibility antigen gamma chain Proteins 0.000 description 5
- 101000633784 Homo sapiens SLAM family member 7 Proteins 0.000 description 5
- 101000834948 Homo sapiens Tomoregulin-2 Proteins 0.000 description 5
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 description 5
- SIKJAQJRHWYJAI-UHFFFAOYSA-N Indole Chemical compound C1=CC=C2NC=CC2=C1 SIKJAQJRHWYJAI-UHFFFAOYSA-N 0.000 description 5
- ABLZXFCXXLZCGV-UHFFFAOYSA-N Phosphorous acid Chemical compound OP(O)=O ABLZXFCXXLZCGV-UHFFFAOYSA-N 0.000 description 5
- 102100040122 Proline-rich protein 4 Human genes 0.000 description 5
- 102100029986 Receptor tyrosine-protein kinase erbB-3 Human genes 0.000 description 5
- 102100032780 Semaphorin-5B Human genes 0.000 description 5
- 102100026160 Tomoregulin-2 Human genes 0.000 description 5
- 102100029690 Tumor necrosis factor receptor superfamily member 13C Human genes 0.000 description 5
- 125000002723 alicyclic group Chemical group 0.000 description 5
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical group [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 5
- 230000004071 biological effect Effects 0.000 description 5
- 239000000872 buffer Substances 0.000 description 5
- 125000000484 butyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 5
- 150000001721 carbon Chemical group 0.000 description 5
- 150000001767 cationic compounds Chemical class 0.000 description 5
- 229960002173 citrulline Drugs 0.000 description 5
- 231100000433 cytotoxic Toxicity 0.000 description 5
- 230000001472 cytotoxic effect Effects 0.000 description 5
- 238000010511 deprotection reaction Methods 0.000 description 5
- 125000002147 dimethylamino group Chemical group [H]C([H])([H])N(*)C([H])([H])[H] 0.000 description 5
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 5
- 238000001035 drying Methods 0.000 description 5
- 229960002949 fluorouracil Drugs 0.000 description 5
- 235000013922 glutamic acid Nutrition 0.000 description 5
- DOUHZFSGSXMPIE-UHFFFAOYSA-N hydroxidooxidosulfur(.) Chemical compound [O]SO DOUHZFSGSXMPIE-UHFFFAOYSA-N 0.000 description 5
- 239000000543 intermediate Substances 0.000 description 5
- 229910052740 iodine Inorganic materials 0.000 description 5
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 5
- 125000002183 isoquinolinyl group Chemical group C1(=NC=CC2=CC=CC=C12)* 0.000 description 5
- 208000032839 leukemia Diseases 0.000 description 5
- 238000004519 manufacturing process Methods 0.000 description 5
- XMYQHJDBLRZMLW-UHFFFAOYSA-N methanolamine Chemical compound NCO XMYQHJDBLRZMLW-UHFFFAOYSA-N 0.000 description 5
- QJGQUHMNIGDVPM-UHFFFAOYSA-N nitrogen group Chemical group [N] QJGQUHMNIGDVPM-UHFFFAOYSA-N 0.000 description 5
- 108010067588 nucleotide pyrophosphatase Proteins 0.000 description 5
- 125000002943 quinolinyl group Chemical group N1=C(C=CC2=CC=CC=C12)* 0.000 description 5
- 230000009467 reduction Effects 0.000 description 5
- 230000002441 reversible effect Effects 0.000 description 5
- 238000010898 silica gel chromatography Methods 0.000 description 5
- 239000000725 suspension Substances 0.000 description 5
- 238000001195 ultra high performance liquid chromatography Methods 0.000 description 5
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 4
- RYHBNJHYFVUHQT-UHFFFAOYSA-N 1,4-Dioxane Chemical compound C1COCCO1 RYHBNJHYFVUHQT-UHFFFAOYSA-N 0.000 description 4
- HZAXFHJVJLSVMW-UHFFFAOYSA-N 2-Aminoethan-1-ol Chemical compound NCCO HZAXFHJVJLSVMW-UHFFFAOYSA-N 0.000 description 4
- PBVAJRFEEOIAGW-UHFFFAOYSA-N 3-[bis(2-carboxyethyl)phosphanyl]propanoic acid;hydrochloride Chemical compound Cl.OC(=O)CCP(CCC(O)=O)CCC(O)=O PBVAJRFEEOIAGW-UHFFFAOYSA-N 0.000 description 4
- 108090000663 Annexin A1 Proteins 0.000 description 4
- 102000006942 B-Cell Maturation Antigen Human genes 0.000 description 4
- 101710187595 B-cell receptor CD22 Proteins 0.000 description 4
- 108700012439 CA9 Proteins 0.000 description 4
- 102000007499 CD27 Ligand Human genes 0.000 description 4
- 108010046080 CD27 Ligand Proteins 0.000 description 4
- 108010065524 CD52 Antigen Proteins 0.000 description 4
- 102000005572 Cathepsin A Human genes 0.000 description 4
- 108010059081 Cathepsin A Proteins 0.000 description 4
- 206010009944 Colon cancer Diseases 0.000 description 4
- 101710099518 Dickkopf-related protein 1 Proteins 0.000 description 4
- AOJJSUZBOXZQNB-TZSSRYMLSA-N Doxorubicin Chemical compound O([C@H]1C[C@@](O)(CC=2C(O)=C3C(=O)C=4C=CC=C(C=4C(=O)C3=C(O)C=21)OC)C(=O)CO)[C@H]1C[C@H](N)[C@H](O)[C@H](C)O1 AOJJSUZBOXZQNB-TZSSRYMLSA-N 0.000 description 4
- 102100021451 Endoplasmic reticulum chaperone BiP Human genes 0.000 description 4
- QUSNBJAOOMFDIB-UHFFFAOYSA-N Ethylamine Chemical compound CCN QUSNBJAOOMFDIB-UHFFFAOYSA-N 0.000 description 4
- 108010007712 Hepatitis A Virus Cellular Receptor 1 Proteins 0.000 description 4
- 101710184069 Hepatocyte growth factor receptor Proteins 0.000 description 4
- 101000914491 Homo sapiens B-cell antigen receptor complex-associated protein beta chain Proteins 0.000 description 4
- 101000884305 Homo sapiens B-cell receptor CD22 Proteins 0.000 description 4
- 101000980825 Homo sapiens B-lymphocyte antigen CD19 Proteins 0.000 description 4
- 101000980814 Homo sapiens CAMPATH-1 antigen Proteins 0.000 description 4
- 101000881679 Homo sapiens Endoglin Proteins 0.000 description 4
- 101000972946 Homo sapiens Hepatocyte growth factor receptor Proteins 0.000 description 4
- 101000934338 Homo sapiens Myeloid cell surface antigen CD33 Proteins 0.000 description 4
- 101000610548 Homo sapiens Proline-rich protein 4 Proteins 0.000 description 4
- 101000684208 Homo sapiens Prolyl endopeptidase FAP Proteins 0.000 description 4
- 101000654679 Homo sapiens Semaphorin-5B Proteins 0.000 description 4
- 101000955999 Homo sapiens V-set domain-containing T-cell activation inhibitor 1 Proteins 0.000 description 4
- 101000685848 Homo sapiens Zinc transporter ZIP6 Proteins 0.000 description 4
- 229930194542 Keto Natural products 0.000 description 4
- RHGKLRLOHDJJDR-BYPYZUCNSA-N L-citrulline Chemical group NC(=O)NCCC[C@H]([NH3+])C([O-])=O RHGKLRLOHDJJDR-BYPYZUCNSA-N 0.000 description 4
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 4
- 206010025323 Lymphomas Diseases 0.000 description 4
- 241001465754 Metazoa Species 0.000 description 4
- 108010063954 Mucins Proteins 0.000 description 4
- 102000015728 Mucins Human genes 0.000 description 4
- RHGKLRLOHDJJDR-UHFFFAOYSA-N Ndelta-carbamoyl-DL-ornithine Natural products OC(=O)C(N)CCCNC(N)=O RHGKLRLOHDJJDR-UHFFFAOYSA-N 0.000 description 4
- 102100027347 Neural cell adhesion molecule 1 Human genes 0.000 description 4
- MUBZPKHOEPUJKR-UHFFFAOYSA-N Oxalic acid Chemical compound OC(=O)C(O)=O MUBZPKHOEPUJKR-UHFFFAOYSA-N 0.000 description 4
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 4
- 206010060862 Prostate cancer Diseases 0.000 description 4
- 208000000236 Prostatic Neoplasms Diseases 0.000 description 4
- 108010076504 Protein Sorting Signals Proteins 0.000 description 4
- 108010029157 Sialic Acid Binding Ig-like Lectin 2 Proteins 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- 108090000058 Syndecan-1 Proteins 0.000 description 4
- 102000004357 Transferases Human genes 0.000 description 4
- 108090000992 Transferases Proteins 0.000 description 4
- 108010079206 V-Set Domain-Containing T-Cell Activation Inhibitor 1 Proteins 0.000 description 4
- 230000002776 aggregation Effects 0.000 description 4
- 238000004220 aggregation Methods 0.000 description 4
- 150000001409 amidines Chemical class 0.000 description 4
- 125000000129 anionic group Chemical group 0.000 description 4
- 239000012300 argon atmosphere Substances 0.000 description 4
- WGQKYBSKWIADBV-UHFFFAOYSA-N benzylamine Chemical compound NCC1=CC=CC=C1 WGQKYBSKWIADBV-UHFFFAOYSA-N 0.000 description 4
- SQVRNKJHWKZAKO-UHFFFAOYSA-N beta-N-Acetyl-D-neuraminic acid Natural products CC(=O)NC1C(O)CC(O)(C(O)=O)OC1C(O)C(O)CO SQVRNKJHWKZAKO-UHFFFAOYSA-N 0.000 description 4
- HQABUPZFAYXKJW-UHFFFAOYSA-N butan-1-amine Chemical compound CCCCN HQABUPZFAYXKJW-UHFFFAOYSA-N 0.000 description 4
- 229910052801 chlorine Inorganic materials 0.000 description 4
- 235000013477 citrulline Nutrition 0.000 description 4
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 4
- 150000004985 diamines Chemical class 0.000 description 4
- KPUWHANPEXNPJT-UHFFFAOYSA-N disiloxane Chemical class [SiH3]O[SiH3] KPUWHANPEXNPJT-UHFFFAOYSA-N 0.000 description 4
- 238000009826 distribution Methods 0.000 description 4
- 239000002552 dosage form Substances 0.000 description 4
- 229960004679 doxorubicin Drugs 0.000 description 4
- 238000002330 electrospray ionisation mass spectrometry Methods 0.000 description 4
- 230000006870 function Effects 0.000 description 4
- 238000004128 high performance liquid chromatography Methods 0.000 description 4
- 239000003112 inhibitor Substances 0.000 description 4
- 239000007924 injection Substances 0.000 description 4
- 238000002347 injection Methods 0.000 description 4
- 229910001411 inorganic cation Inorganic materials 0.000 description 4
- 230000000155 isotopic effect Effects 0.000 description 4
- 125000000468 ketone group Chemical group 0.000 description 4
- 125000004433 nitrogen atom Chemical group N* 0.000 description 4
- 239000012074 organic phase Substances 0.000 description 4
- 230000003647 oxidation Effects 0.000 description 4
- 238000007254 oxidation reaction Methods 0.000 description 4
- 125000001820 oxy group Chemical group [*:1]O[*:2] 0.000 description 4
- 229910052763 palladium Inorganic materials 0.000 description 4
- NFHFRUOZVGFOOS-UHFFFAOYSA-N palladium;triphenylphosphane Chemical compound [Pd].C1=CC=CC=C1P(C=1C=CC=CC=1)C1=CC=CC=C1.C1=CC=CC=C1P(C=1C=CC=CC=1)C1=CC=CC=C1.C1=CC=CC=C1P(C=1C=CC=CC=1)C1=CC=CC=C1.C1=CC=CC=C1P(C=1C=CC=CC=1)C1=CC=CC=C1 NFHFRUOZVGFOOS-UHFFFAOYSA-N 0.000 description 4
- 125000001147 pentyl group Chemical group C(CCCC)* 0.000 description 4
- 125000000843 phenylene group Chemical group C1(=C(C=CC=C1)*)* 0.000 description 4
- 230000000069 prophylactic effect Effects 0.000 description 4
- ZAHRKKWIAAJSAO-UHFFFAOYSA-N rapamycin Natural products COCC(O)C(=C/C(C)C(=O)CC(OC(=O)C1CCCCN1C(=O)C(=O)C2(O)OC(CC(OC)C(=CC=CC=CC(C)CC(C)C(=O)C)C)CCC2C)C(C)CC3CCC(O)C(C3)OC)C ZAHRKKWIAAJSAO-UHFFFAOYSA-N 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 238000000926 separation method Methods 0.000 description 4
- SQVRNKJHWKZAKO-OQPLDHBCSA-N sialic acid Chemical compound CC(=O)N[C@@H]1[C@@H](O)C[C@@](O)(C(O)=O)OC1[C@H](O)[C@H](O)CO SQVRNKJHWKZAKO-OQPLDHBCSA-N 0.000 description 4
- QFJCIRLUMZQUOT-HPLJOQBZSA-N sirolimus Chemical compound C1C[C@@H](O)[C@H](OC)C[C@@H]1C[C@@H](C)[C@H]1OC(=O)[C@@H]2CCCCN2C(=O)C(=O)[C@](O)(O2)[C@H](C)CC[C@H]2C[C@H](OC)/C(C)=C/C=C/C=C/[C@@H](C)C[C@@H](C)C(=O)[C@H](OC)[C@H](O)/C(C)=C/[C@@H](C)C(=O)C1 QFJCIRLUMZQUOT-HPLJOQBZSA-N 0.000 description 4
- 229960002930 sirolimus Drugs 0.000 description 4
- 239000003381 stabilizer Substances 0.000 description 4
- 239000013589 supplement Substances 0.000 description 4
- 239000003826 tablet Substances 0.000 description 4
- RCINICONZNJXQF-MZXODVADSA-N taxol Chemical compound O([C@@H]1[C@@]2(C[C@@H](C(C)=C(C2(C)C)[C@H](C([C@]2(C)[C@@H](O)C[C@H]3OC[C@]3([C@H]21)OC(C)=O)=O)OC(=O)C)OC(=O)[C@H](O)[C@@H](NC(=O)C=1C=CC=CC=1)C=1C=CC=CC=1)O)C(=O)C1=CC=CC=C1 RCINICONZNJXQF-MZXODVADSA-N 0.000 description 4
- CXWXQJXEFPUFDZ-UHFFFAOYSA-N tetralin Chemical compound C1=CC=C2CCCCC2=C1 CXWXQJXEFPUFDZ-UHFFFAOYSA-N 0.000 description 4
- 238000004809 thin layer chromatography Methods 0.000 description 4
- WYWHKKSPHMUBEB-UHFFFAOYSA-N tioguanine Chemical compound N1C(N)=NC(=S)C2=C1N=CN2 WYWHKKSPHMUBEB-UHFFFAOYSA-N 0.000 description 4
- 239000003039 volatile agent Substances 0.000 description 4
- 239000011701 zinc Substances 0.000 description 4
- DLFVBJFMPXGRIB-UHFFFAOYSA-N Acetamide Chemical group CC(N)=O DLFVBJFMPXGRIB-UHFFFAOYSA-N 0.000 description 3
- 108010088751 Albumins Proteins 0.000 description 3
- 102000009027 Albumins Human genes 0.000 description 3
- 102100040006 Annexin A1 Human genes 0.000 description 3
- 102000004145 Annexin A1 Human genes 0.000 description 3
- 102100025218 B-cell differentiation antigen CD72 Human genes 0.000 description 3
- LSNNMFCWUKXFEE-UHFFFAOYSA-M Bisulfite Chemical compound OS([O-])=O LSNNMFCWUKXFEE-UHFFFAOYSA-M 0.000 description 3
- 241000283690 Bos taurus Species 0.000 description 3
- 206010006187 Breast cancer Diseases 0.000 description 3
- 208000026310 Breast neoplasm Diseases 0.000 description 3
- 102100024220 CD180 antigen Human genes 0.000 description 3
- 241000282693 Cercopithecidae Species 0.000 description 3
- RGSFGYAAUTVSQA-UHFFFAOYSA-N Cyclopentane Chemical compound C1CCCC1 RGSFGYAAUTVSQA-UHFFFAOYSA-N 0.000 description 3
- CMSMOCZEIVJLDB-UHFFFAOYSA-N Cyclophosphamide Chemical compound ClCCN(CCCl)P1(=O)NCCCO1 CMSMOCZEIVJLDB-UHFFFAOYSA-N 0.000 description 3
- UHDGCWIWMRVCDJ-CCXZUQQUSA-N Cytarabine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@@H](O)[C@H](O)[C@@H](CO)O1 UHDGCWIWMRVCDJ-CCXZUQQUSA-N 0.000 description 3
- AEMOLEFTQBMNLQ-AQKNRBDQSA-N D-glucopyranuronic acid Chemical compound OC1O[C@H](C(O)=O)[C@@H](O)[C@H](O)[C@H]1O AEMOLEFTQBMNLQ-AQKNRBDQSA-N 0.000 description 3
- SHZGCJCMOBCMKK-UHFFFAOYSA-N D-mannomethylose Natural products CC1OC(O)C(O)C(O)C1O SHZGCJCMOBCMKK-UHFFFAOYSA-N 0.000 description 3
- 108010092160 Dactinomycin Proteins 0.000 description 3
- 102100020743 Dipeptidase 1 Human genes 0.000 description 3
- GKQLYSROISKDLL-UHFFFAOYSA-N EEDQ Chemical compound C1=CC=C2N(C(=O)OCC)C(OCC)C=CC2=C1 GKQLYSROISKDLL-UHFFFAOYSA-N 0.000 description 3
- 108700041152 Endoplasmic Reticulum Chaperone BiP Proteins 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 3
- 102100031511 Fc receptor-like protein 2 Human genes 0.000 description 3
- 102000010451 Folate receptor alpha Human genes 0.000 description 3
- 108050001931 Folate receptor alpha Proteins 0.000 description 3
- YLQBMQCUIZJEEH-UHFFFAOYSA-N Furan Chemical compound C=1C=COC=1 YLQBMQCUIZJEEH-UHFFFAOYSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 101710198293 Guanylyl cyclase C Proteins 0.000 description 3
- 102000018713 Histocompatibility Antigens Class II Human genes 0.000 description 3
- 101000980829 Homo sapiens CD180 antigen Proteins 0.000 description 3
- 101000899808 Homo sapiens Guanylyl cyclase C Proteins 0.000 description 3
- 101001068136 Homo sapiens Hepatitis A virus cellular receptor 1 Proteins 0.000 description 3
- 101000606465 Homo sapiens Inactive tyrosine-protein kinase 7 Proteins 0.000 description 3
- 101000897042 Homo sapiens Nucleotide pyrophosphatase Proteins 0.000 description 3
- 101001012157 Homo sapiens Receptor tyrosine-protein kinase erbB-2 Proteins 0.000 description 3
- 101000622304 Homo sapiens Vascular cell adhesion protein 1 Proteins 0.000 description 3
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 3
- 101710099452 Inactive tyrosine-protein kinase 7 Proteins 0.000 description 3
- 102100033011 Integrin beta-6 Human genes 0.000 description 3
- 208000008839 Kidney Neoplasms Diseases 0.000 description 3
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 3
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 3
- 108091054438 MHC class II family Proteins 0.000 description 3
- AFVFQIVMOAPDHO-UHFFFAOYSA-N Methanesulfonic acid Chemical compound CS(O)(=O)=O AFVFQIVMOAPDHO-UHFFFAOYSA-N 0.000 description 3
- IZTIOKUFBARREJ-NDEPHWFRSA-N NC1=CC=C(C=C1)C=1C[C@H](N(C=1)C(=O)C1=C(C=C(C(=C1)OC)O[Si](C(C)C)(C(C)C)C(C)C)[N+](=O)[O-])CO[Si](C)(C)C(C)(C)C Chemical compound NC1=CC=C(C=C1)C=1C[C@H](N(C=1)C(=O)C1=C(C=C(C(=C1)OC)O[Si](C(C)C)(C(C)C)C(C)C)[N+](=O)[O-])CO[Si](C)(C)C(C)(C)C IZTIOKUFBARREJ-NDEPHWFRSA-N 0.000 description 3
- 108091007491 NSP3 Papain-like protease domains Proteins 0.000 description 3
- 108010069196 Neural Cell Adhesion Molecules Proteins 0.000 description 3
- 102000007399 Nuclear hormone receptor Human genes 0.000 description 3
- 108020005497 Nuclear hormone receptor Proteins 0.000 description 3
- 108700020796 Oncogene Proteins 0.000 description 3
- 241000283973 Oryctolagus cuniculus Species 0.000 description 3
- 206010033128 Ovarian cancer Diseases 0.000 description 3
- 206010061535 Ovarian neoplasm Diseases 0.000 description 3
- 239000012828 PI3K inhibitor Substances 0.000 description 3
- 229930012538 Paclitaxel Natural products 0.000 description 3
- 206010061902 Pancreatic neoplasm Diseases 0.000 description 3
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 3
- RWRDLPDLKQPQOW-UHFFFAOYSA-N Pyrrolidine Chemical compound C1CCNC1 RWRDLPDLKQPQOW-UHFFFAOYSA-N 0.000 description 3
- SMWDFEZZVXVKRB-UHFFFAOYSA-N Quinoline Chemical compound N1=CC=CC2=CC=CC=C21 SMWDFEZZVXVKRB-UHFFFAOYSA-N 0.000 description 3
- 102100030086 Receptor tyrosine-protein kinase erbB-2 Human genes 0.000 description 3
- 206010038389 Renal cancer Diseases 0.000 description 3
- 241000283984 Rodentia Species 0.000 description 3
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 3
- 102100038437 Sodium-dependent phosphate transport protein 2B Human genes 0.000 description 3
- 102100038126 Tenascin Human genes 0.000 description 3
- DHXVGJBLRPWPCS-UHFFFAOYSA-N Tetrahydropyran Chemical compound C1CCOCC1 DHXVGJBLRPWPCS-UHFFFAOYSA-N 0.000 description 3
- FOCVUCIESVLUNU-UHFFFAOYSA-N Thiotepa Chemical compound C1CN1P(N1CC1)(=S)N1CC1 FOCVUCIESVLUNU-UHFFFAOYSA-N 0.000 description 3
- 101710165436 Tumor necrosis factor receptor superfamily member 8 Proteins 0.000 description 3
- 206010067584 Type 1 diabetes mellitus Diseases 0.000 description 3
- 108010000134 Vascular Cell Adhesion Molecule-1 Proteins 0.000 description 3
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 3
- DHKHKXVYLBGOIT-UHFFFAOYSA-N acetaldehyde Diethyl Acetal Natural products CCOC(C)OCC DHKHKXVYLBGOIT-UHFFFAOYSA-N 0.000 description 3
- 150000001241 acetals Chemical class 0.000 description 3
- 229960000583 acetic acid Drugs 0.000 description 3
- 150000007513 acids Chemical class 0.000 description 3
- 125000002252 acyl group Chemical group 0.000 description 3
- 125000003158 alcohol group Chemical group 0.000 description 3
- 229960000548 alemtuzumab Drugs 0.000 description 3
- 125000004450 alkenylene group Chemical group 0.000 description 3
- 125000004419 alkynylene group Chemical group 0.000 description 3
- 239000003963 antioxidant agent Substances 0.000 description 3
- 235000006708 antioxidants Nutrition 0.000 description 3
- 150000001491 aromatic compounds Chemical class 0.000 description 3
- 238000003556 assay Methods 0.000 description 3
- 125000004429 atom Chemical group 0.000 description 3
- VSRXQHXAPYXROS-UHFFFAOYSA-N azanide;cyclobutane-1,1-dicarboxylic acid;platinum(2+) Chemical compound [NH2-].[NH2-].[Pt+2].OC(=O)C1(C(O)=O)CCC1 VSRXQHXAPYXROS-UHFFFAOYSA-N 0.000 description 3
- 239000002585 base Substances 0.000 description 3
- AEMOLEFTQBMNLQ-QIUUJYRFSA-N beta-D-glucuronic acid Chemical compound O[C@@H]1O[C@H](C(O)=O)[C@@H](O)[C@H](O)[C@H]1O AEMOLEFTQBMNLQ-QIUUJYRFSA-N 0.000 description 3
- 229910052794 bromium Inorganic materials 0.000 description 3
- KVUAALJSMIVURS-ZEDZUCNESA-L calcium folinate Chemical compound [Ca+2].C1NC=2NC(N)=NC(=O)C=2N(C=O)C1CNC1=CC=C(C(=O)N[C@@H](CCC([O-])=O)C([O-])=O)C=C1 KVUAALJSMIVURS-ZEDZUCNESA-L 0.000 description 3
- 230000004663 cell proliferation Effects 0.000 description 3
- 239000003638 chemical reducing agent Substances 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- 208000029742 colonic neoplasm Diseases 0.000 description 3
- 239000002131 composite material Substances 0.000 description 3
- 229960004397 cyclophosphamide Drugs 0.000 description 3
- 229960002806 daclizumab Drugs 0.000 description 3
- 230000007123 defense Effects 0.000 description 3
- 150000002019 disulfides Chemical class 0.000 description 3
- VLCYCQAOQCDTCN-UHFFFAOYSA-N eflornithine Chemical compound NCCCC(N)(C(F)F)C(O)=O VLCYCQAOQCDTCN-UHFFFAOYSA-N 0.000 description 3
- 150000002085 enols Chemical class 0.000 description 3
- 229950009760 epratuzumab Drugs 0.000 description 3
- CCIVGXIOQKPBKL-UHFFFAOYSA-M ethanesulfonate Chemical compound CCS([O-])(=O)=O CCIVGXIOQKPBKL-UHFFFAOYSA-M 0.000 description 3
- 239000006260 foam Substances 0.000 description 3
- 102000006815 folate receptor Human genes 0.000 description 3
- 108020005243 folate receptor Proteins 0.000 description 3
- OVBPIULPVIDEAO-LBPRGKRZSA-N folic acid Chemical compound C=1N=C2NC(N)=NC(=O)C2=NC=1CNC1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 OVBPIULPVIDEAO-LBPRGKRZSA-N 0.000 description 3
- 210000004602 germ cell Anatomy 0.000 description 3
- 229940097043 glucuronic acid Drugs 0.000 description 3
- 150000002373 hemiacetals Chemical class 0.000 description 3
- DMEGYFMYUHOHGS-UHFFFAOYSA-N heptamethylene Natural products C1CCCCCC1 DMEGYFMYUHOHGS-UHFFFAOYSA-N 0.000 description 3
- 229940022353 herceptin Drugs 0.000 description 3
- 125000001072 heteroaryl group Chemical group 0.000 description 3
- 150000002430 hydrocarbons Chemical class 0.000 description 3
- 125000001841 imino group Chemical group [H]N=* 0.000 description 3
- 229940072221 immunoglobulins Drugs 0.000 description 3
- 102000006495 integrins Human genes 0.000 description 3
- 108010044426 integrins Proteins 0.000 description 3
- 108010028930 invariant chain Proteins 0.000 description 3
- UWKQSNNFCGGAFS-XIFFEERXSA-N irinotecan Chemical compound C1=C2C(CC)=C3CN(C(C4=C([C@@](C(=O)OC4)(O)CC)C=4)=O)C=4C3=NC2=CC=C1OC(=O)N(CC1)CCC1N1CCCCC1 UWKQSNNFCGGAFS-XIFFEERXSA-N 0.000 description 3
- 125000004551 isoquinolin-3-yl group Chemical group C1=NC(=CC2=CC=CC=C12)* 0.000 description 3
- 210000003734 kidney Anatomy 0.000 description 3
- 201000010982 kidney cancer Diseases 0.000 description 3
- 230000000670 limiting effect Effects 0.000 description 3
- 208000014018 liver neoplasm Diseases 0.000 description 3
- 201000005202 lung cancer Diseases 0.000 description 3
- 208000020816 lung neoplasm Diseases 0.000 description 3
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 3
- 230000036210 malignancy Effects 0.000 description 3
- 208000015486 malignant pancreatic neoplasm Diseases 0.000 description 3
- 238000004949 mass spectrometry Methods 0.000 description 3
- 229950008001 matuzumab Drugs 0.000 description 3
- GLVAUDGFNGKCSF-UHFFFAOYSA-N mercaptopurine Chemical compound S=C1NC=NC2=C1NC=N2 GLVAUDGFNGKCSF-UHFFFAOYSA-N 0.000 description 3
- 229960000485 methotrexate Drugs 0.000 description 3
- 125000000250 methylamino group Chemical group [H]N(*)C([H])([H])[H] 0.000 description 3
- GDOPTJXRTPNYNR-UHFFFAOYSA-N methylcyclopentane Chemical compound CC1CCCC1 GDOPTJXRTPNYNR-UHFFFAOYSA-N 0.000 description 3
- 239000002829 mitogen activated protein kinase inhibitor Substances 0.000 description 3
- 238000002156 mixing Methods 0.000 description 3
- 125000002950 monocyclic group Chemical group 0.000 description 3
- 125000002757 morpholinyl group Chemical group 0.000 description 3
- 238000005457 optimization Methods 0.000 description 3
- 150000002892 organic cations Chemical class 0.000 description 3
- 239000012044 organic layer Substances 0.000 description 3
- 230000001590 oxidative effect Effects 0.000 description 3
- 229960001592 paclitaxel Drugs 0.000 description 3
- 201000002528 pancreatic cancer Diseases 0.000 description 3
- 208000008443 pancreatic carcinoma Diseases 0.000 description 3
- 229960001972 panitumumab Drugs 0.000 description 3
- 229960002087 pertuzumab Drugs 0.000 description 3
- XYFCBTPGUUZFHI-UHFFFAOYSA-N phosphine group Chemical group P XYFCBTPGUUZFHI-UHFFFAOYSA-N 0.000 description 3
- 229940043441 phosphoinositide 3-kinase inhibitor Drugs 0.000 description 3
- 150000008300 phosphoramidites Chemical class 0.000 description 3
- 125000004193 piperazinyl group Chemical group 0.000 description 3
- 125000003386 piperidinyl group Chemical group 0.000 description 3
- 229920001223 polyethylene glycol Polymers 0.000 description 3
- 239000002243 precursor Substances 0.000 description 3
- 239000003755 preservative agent Substances 0.000 description 3
- 229960002429 proline Drugs 0.000 description 3
- 125000002572 propoxy group Chemical group [*]OC([H])([H])C(C([H])([H])[H])([H])[H] 0.000 description 3
- 125000004548 quinolin-3-yl group Chemical group N1=CC(=CC2=CC=CC=C12)* 0.000 description 3
- 125000004550 quinolin-6-yl group Chemical group N1=CC=CC2=CC(=CC=C12)* 0.000 description 3
- 125000002914 sec-butyl group Chemical group [H]C([H])([H])C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 3
- 229910000029 sodium carbonate Inorganic materials 0.000 description 3
- 108090000586 somatostatin receptor 2 Proteins 0.000 description 3
- 108090000680 somatostatin receptor 5 Proteins 0.000 description 3
- 206010041823 squamous cell carcinoma Diseases 0.000 description 3
- 208000017572 squamous cell neoplasm Diseases 0.000 description 3
- 125000001273 sulfonato group Chemical group [O-]S(*)(=O)=O 0.000 description 3
- 125000004354 sulfur functional group Chemical group 0.000 description 3
- 229960001603 tamoxifen Drugs 0.000 description 3
- 125000001544 thienyl group Chemical group 0.000 description 3
- 150000003568 thioethers Chemical class 0.000 description 3
- 229960000575 trastuzumab Drugs 0.000 description 3
- 208000035408 type 1 diabetes mellitus 1 Diseases 0.000 description 3
- 125000000391 vinyl group Chemical group [H]C([*])=C([H])[H] 0.000 description 3
- 229910052725 zinc Inorganic materials 0.000 description 3
- 125000006652 (C3-C12) cycloalkyl group Chemical group 0.000 description 2
- GHOKWGTUZJEAQD-ZETCQYMHSA-N (D)-(+)-Pantothenic acid Chemical compound OCC(C)(C)[C@@H](O)C(=O)NCCC(O)=O GHOKWGTUZJEAQD-ZETCQYMHSA-N 0.000 description 2
- UQVNRKBFAXNOGA-LWTNMJDUSA-N (E)-tomaymycin Chemical compound CO[C@H]1NC2=CC(O)=C(OC)C=C2C(=O)N2C\C(=C\C)C[C@@H]12 UQVNRKBFAXNOGA-LWTNMJDUSA-N 0.000 description 2
- IAKHMKGGTNLKSZ-INIZCTEOSA-N (S)-colchicine Chemical compound C1([C@@H](NC(C)=O)CC2)=CC(=O)C(OC)=CC=C1C1=C2C=C(OC)C(OC)=C1OC IAKHMKGGTNLKSZ-INIZCTEOSA-N 0.000 description 2
- AZQWKYJCGOJGHM-UHFFFAOYSA-N 1,4-benzoquinone Chemical compound O=C1C=CC(=O)C=C1 AZQWKYJCGOJGHM-UHFFFAOYSA-N 0.000 description 2
- KXMZDGSRSGHMMK-VWLOTQADSA-N 1-(6,7-dihydro-5h-benzo[2,3]cyclohepta[2,4-d]pyridazin-3-yl)-3-n-[(7s)-7-pyrrolidin-1-yl-6,7,8,9-tetrahydro-5h-benzo[7]annulen-3-yl]-1,2,4-triazole-3,5-diamine Chemical compound N1([C@H]2CCC3=CC=C(C=C3CC2)NC=2N=C(N(N=2)C=2N=NC=3C4=CC=CC=C4CCCC=3C=2)N)CCCC1 KXMZDGSRSGHMMK-VWLOTQADSA-N 0.000 description 2
- IANQTJSKSUMEQM-UHFFFAOYSA-N 1-benzofuran Chemical compound C1=CC=C2OC=CC2=C1 IANQTJSKSUMEQM-UHFFFAOYSA-N 0.000 description 2
- FCEHBMOGCRZNNI-UHFFFAOYSA-N 1-benzothiophene Chemical compound C1=CC=C2SC=CC2=C1 FCEHBMOGCRZNNI-UHFFFAOYSA-N 0.000 description 2
- CTMHWPIWNRWQEG-UHFFFAOYSA-N 1-methylcyclohexene Chemical compound CC1=CCCCC1 CTMHWPIWNRWQEG-UHFFFAOYSA-N 0.000 description 2
- YBYIRNPNPLQARY-UHFFFAOYSA-N 1H-indene Chemical compound C1=CC=C2CC=CC2=C1 YBYIRNPNPLQARY-UHFFFAOYSA-N 0.000 description 2
- LJCZNYWLQZZIOS-UHFFFAOYSA-N 2,2,2-trichlorethoxycarbonyl chloride Chemical compound ClC(=O)OCC(Cl)(Cl)Cl LJCZNYWLQZZIOS-UHFFFAOYSA-N 0.000 description 2
- VEPOHXYIFQMVHW-XOZOLZJESA-N 2,3-dihydroxybutanedioic acid (2S,3S)-3,4-dimethyl-2-phenylmorpholine Chemical compound OC(C(O)C(O)=O)C(O)=O.C[C@H]1[C@@H](OCCN1C)c1ccccc1 VEPOHXYIFQMVHW-XOZOLZJESA-N 0.000 description 2
- GAKIJEPUVBHWCK-UHFFFAOYSA-N 2-methoxy-2-(3,4,5-trimethoxyphenyl)ethanamine Chemical group COC(CN)C1=CC(OC)=C(OC)C(OC)=C1 GAKIJEPUVBHWCK-UHFFFAOYSA-N 0.000 description 2
- 125000003903 2-propenyl group Chemical group [H]C([*])([H])C([H])=C([H])[H] 0.000 description 2
- 125000001494 2-propynyl group Chemical group [H]C#CC([H])([H])* 0.000 description 2
- 125000000175 2-thienyl group Chemical group S1C([*])=C([H])C([H])=C1[H] 0.000 description 2
- 108010083651 3-galactosyl-N-acetylglucosaminide 4-alpha-L-fucosyltransferase Proteins 0.000 description 2
- IBOFVQJTBBUKMU-UHFFFAOYSA-N 4,4'-methylene-bis-(2-chloroaniline) Chemical compound C1=C(Cl)C(N)=CC=C1CC1=CC=C(N)C(Cl)=C1 IBOFVQJTBBUKMU-UHFFFAOYSA-N 0.000 description 2
- ZANPJXNYBVVNSD-UHFFFAOYSA-N 4-(4,4,5,5-tetramethyl-1,3,2-dioxaborolan-2-yl)aniline Chemical compound O1C(C)(C)C(C)(C)OB1C1=CC=C(N)C=C1 ZANPJXNYBVVNSD-UHFFFAOYSA-N 0.000 description 2
- VHYFNPMBLIVWCW-UHFFFAOYSA-N 4-Dimethylaminopyridine Chemical compound CN(C)C1=CC=NC=C1 VHYFNPMBLIVWCW-UHFFFAOYSA-N 0.000 description 2
- HVBSAKJJOYLTQU-UHFFFAOYSA-N 4-aminobenzenesulfonic acid Chemical compound NC1=CC=C(S(O)(=O)=O)C=C1 HVBSAKJJOYLTQU-UHFFFAOYSA-N 0.000 description 2
- 125000004172 4-methoxyphenyl group Chemical group [H]C1=C([H])C(OC([H])([H])[H])=C([H])C([H])=C1* 0.000 description 2
- VVIAGPKUTFNRDU-UHFFFAOYSA-N 6S-folinic acid Natural products C1NC=2NC(N)=NC(=O)C=2N(C=O)C1CNC1=CC=C(C(=O)NC(CCC(O)=O)C(O)=O)C=C1 VVIAGPKUTFNRDU-UHFFFAOYSA-N 0.000 description 2
- SDYJYMBMSYYZIC-UHFFFAOYSA-N 7,7-dimethyl-4-methylidene-3,3a,5,6,6a,8,9,10-octahydrobenzo[h][2]benzofuran-1-one Chemical compound C=C1CCC2C(C)(C)CCCC32C(=O)OCC31 SDYJYMBMSYYZIC-UHFFFAOYSA-N 0.000 description 2
- STQGQHZAVUOBTE-UHFFFAOYSA-N 7-Cyan-hept-2t-en-4,6-diinsaeure Natural products C1=2C(O)=C3C(=O)C=4C(OC)=CC=CC=4C(=O)C3=C(O)C=2CC(O)(C(C)=O)CC1OC1CC(N)C(O)C(C)O1 STQGQHZAVUOBTE-UHFFFAOYSA-N 0.000 description 2
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 2
- UJOBWOGCFQCDNV-UHFFFAOYSA-N 9H-carbazole Chemical compound C1=CC=C2C3=CC=CC=C3NC2=C1 UJOBWOGCFQCDNV-UHFFFAOYSA-N 0.000 description 2
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 2
- PAYRUJLWNCNPSJ-UHFFFAOYSA-N Aniline Chemical compound NC1=CC=CC=C1 PAYRUJLWNCNPSJ-UHFFFAOYSA-N 0.000 description 2
- 102000000412 Annexin Human genes 0.000 description 2
- 108050008874 Annexin Proteins 0.000 description 2
- 208000002267 Anti-neutrophil cytoplasmic antibody-associated vasculitis Diseases 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 description 2
- 201000001320 Atherosclerosis Diseases 0.000 description 2
- 108091008875 B cell receptors Proteins 0.000 description 2
- 108010046304 B-Cell Activation Factor Receptor Proteins 0.000 description 2
- 101710129514 B-cell differentiation antigen CD72 Proteins 0.000 description 2
- MLDQJTXFUGDVEO-UHFFFAOYSA-N BAY-43-9006 Chemical compound C1=NC(C(=O)NC)=CC(OC=2C=CC(NC(=O)NC=3C=C(C(Cl)=CC=3)C(F)(F)F)=CC=2)=C1 MLDQJTXFUGDVEO-UHFFFAOYSA-N 0.000 description 2
- 208000023328 Basedow disease Diseases 0.000 description 2
- 206010005003 Bladder cancer Diseases 0.000 description 2
- 208000003174 Brain Neoplasms Diseases 0.000 description 2
- 108010022366 Carcinoembryonic Antigen Proteins 0.000 description 2
- 102100025473 Carcinoembryonic antigen-related cell adhesion molecule 6 Human genes 0.000 description 2
- 201000009030 Carcinoma Diseases 0.000 description 2
- 206010009900 Colitis ulcerative Diseases 0.000 description 2
- 108010023729 Complement 3d Receptors Proteins 0.000 description 2
- 102000011412 Complement 3d Receptors Human genes 0.000 description 2
- WQZGKKKJIJFFOK-WHZQZERISA-N D-aldose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-WHZQZERISA-N 0.000 description 2
- RGHNJXZEOKUKBD-SQOUGZDYSA-N D-gluconic acid Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C(O)=O RGHNJXZEOKUKBD-SQOUGZDYSA-N 0.000 description 2
- XBPCUCUWBYBCDP-UHFFFAOYSA-N Dicyclohexylamine Chemical compound C1CCCCC1NC1CCCCC1 XBPCUCUWBYBCDP-UHFFFAOYSA-N 0.000 description 2
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- 238000012286 ELISA Assay Methods 0.000 description 2
- 101150076616 EPHA2 gene Proteins 0.000 description 2
- 108010036395 Endoglin Proteins 0.000 description 2
- 101710116743 Ephrin type-A receptor 2 Proteins 0.000 description 2
- 241000283073 Equus caballus Species 0.000 description 2
- IAYPIBMASNFSPL-UHFFFAOYSA-N Ethylene oxide Chemical compound C1CO1 IAYPIBMASNFSPL-UHFFFAOYSA-N 0.000 description 2
- PIICEJLVQHRZGT-UHFFFAOYSA-N Ethylenediamine Chemical compound NCCN PIICEJLVQHRZGT-UHFFFAOYSA-N 0.000 description 2
- 102100031517 Fc receptor-like protein 1 Human genes 0.000 description 2
- 101710120224 Fc receptor-like protein 1 Proteins 0.000 description 2
- MPJKWIXIYCLVCU-UHFFFAOYSA-N Folinic acid Natural products NC1=NC2=C(N(C=O)C(CNc3ccc(cc3)C(=O)NC(CCC(=O)O)CC(=O)O)CN2)C(=O)N1 MPJKWIXIYCLVCU-UHFFFAOYSA-N 0.000 description 2
- PNNNRSAQSRJVSB-SLPGGIOYSA-N Fucose Natural products C[C@H](O)[C@@H](O)[C@H](O)[C@H](O)C=O PNNNRSAQSRJVSB-SLPGGIOYSA-N 0.000 description 2
- VZCYOOQTPOCHFL-OWOJBTEDSA-N Fumaric acid Chemical compound OC(=O)\C=C\C(O)=O VZCYOOQTPOCHFL-OWOJBTEDSA-N 0.000 description 2
- 206010017993 Gastrointestinal neoplasms Diseases 0.000 description 2
- 206010018338 Glioma Diseases 0.000 description 2
- 206010018364 Glomerulonephritis Diseases 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- AEMRFAOFKBGASW-UHFFFAOYSA-N Glycolic acid Chemical compound OCC(O)=O AEMRFAOFKBGASW-UHFFFAOYSA-N 0.000 description 2
- 208000015023 Graves' disease Diseases 0.000 description 2
- 101710185991 Hepatitis A virus cellular receptor 1 homolog Proteins 0.000 description 2
- 102100028999 High mobility group protein HMGI-C Human genes 0.000 description 2
- 108010027412 Histocompatibility Antigens Class II Proteins 0.000 description 2
- 101000893701 Homo sapiens 3-galactosyl-N-acetylglucosaminide 4-alpha-L-fucosyltransferase FUT3 Proteins 0.000 description 2
- 101000690301 Homo sapiens Aldo-keto reductase family 1 member C4 Proteins 0.000 description 2
- 101000959738 Homo sapiens Annexin A1 Proteins 0.000 description 2
- 101000914324 Homo sapiens Carcinoembryonic antigen-related cell adhesion molecule 5 Proteins 0.000 description 2
- 101000914326 Homo sapiens Carcinoembryonic antigen-related cell adhesion molecule 6 Proteins 0.000 description 2
- 101000864646 Homo sapiens Dickkopf-related protein 1 Proteins 0.000 description 2
- 101000938346 Homo sapiens Ephrin type-A receptor 2 Proteins 0.000 description 2
- 101000851181 Homo sapiens Epidermal growth factor receptor Proteins 0.000 description 2
- 101000846911 Homo sapiens Fc receptor-like protein 2 Proteins 0.000 description 2
- 101000986379 Homo sapiens High mobility group protein HMGI-C Proteins 0.000 description 2
- 101001015064 Homo sapiens Integrin beta-6 Proteins 0.000 description 2
- 101001057504 Homo sapiens Interferon-stimulated gene 20 kDa protein Proteins 0.000 description 2
- 101001063456 Homo sapiens Leucine-rich repeat-containing G-protein coupled receptor 5 Proteins 0.000 description 2
- 101000623901 Homo sapiens Mucin-16 Proteins 0.000 description 2
- 101001116548 Homo sapiens Protein CBFA2T1 Proteins 0.000 description 2
- 101000604039 Homo sapiens Sodium-dependent phosphate transport protein 2B Proteins 0.000 description 2
- 101000874179 Homo sapiens Syndecan-1 Proteins 0.000 description 2
- 101000844504 Homo sapiens Transient receptor potential cation channel subfamily M member 4 Proteins 0.000 description 2
- 101000801255 Homo sapiens Tumor necrosis factor receptor superfamily member 17 Proteins 0.000 description 2
- 101000851376 Homo sapiens Tumor necrosis factor receptor superfamily member 8 Proteins 0.000 description 2
- 208000002979 Influenza in Birds Diseases 0.000 description 2
- 101710190483 Interleukin-2 receptor subunit alpha Proteins 0.000 description 2
- 108010044023 Ki-1 Antigen Proteins 0.000 description 2
- PWKSKIMOESPYIA-BYPYZUCNSA-N L-N-acetyl-Cysteine Chemical compound CC(=O)N[C@@H](CS)C(O)=O PWKSKIMOESPYIA-BYPYZUCNSA-N 0.000 description 2
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- 101710197070 Lectin-2 Proteins 0.000 description 2
- 102100031036 Leucine-rich repeat-containing G-protein coupled receptor 5 Human genes 0.000 description 2
- 206010024612 Lipoma Diseases 0.000 description 2
- 208000005777 Lupus Nephritis Diseases 0.000 description 2
- 229940124647 MEK inhibitor Drugs 0.000 description 2
- 102000043131 MHC class II family Human genes 0.000 description 2
- 101710196759 Macrophage-stimulating protein receptor Proteins 0.000 description 2
- YNAVUWVOSKDBBP-UHFFFAOYSA-N Morpholine Chemical compound C1COCCN1 YNAVUWVOSKDBBP-UHFFFAOYSA-N 0.000 description 2
- 102100023123 Mucin-16 Human genes 0.000 description 2
- 101100327295 Mus musculus Cd22 gene Proteins 0.000 description 2
- NWIBSHFKIJFRCO-WUDYKRTCSA-N Mytomycin Chemical compound C1N2C(C(C(C)=C(N)C3=O)=O)=C3[C@@H](COC(N)=O)[C@@]2(OC)[C@@H]2[C@H]1N2 NWIBSHFKIJFRCO-WUDYKRTCSA-N 0.000 description 2
- JGFZNNIVVJXRND-UHFFFAOYSA-N N,N-Diisopropylethylamine (DIPEA) Chemical compound CCN(C(C)C)C(C)C JGFZNNIVVJXRND-UHFFFAOYSA-N 0.000 description 2
- MBBZMMPHUWSWHV-BDVNFPICSA-N N-methylglucamine Chemical compound CNC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO MBBZMMPHUWSWHV-BDVNFPICSA-N 0.000 description 2
- UFWIBTONFRDIAS-UHFFFAOYSA-N Naphthalene Chemical compound C1=CC=CC2=CC=CC=C21 UFWIBTONFRDIAS-UHFFFAOYSA-N 0.000 description 2
- 101710160107 Outer membrane protein A Proteins 0.000 description 2
- 102000035195 Peptidases Human genes 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- 208000031845 Pernicious anaemia Diseases 0.000 description 2
- PCNDJXKNXGMECE-UHFFFAOYSA-N Phenazine Natural products C1=CC=CC2=NC3=CC=CC=C3N=C21 PCNDJXKNXGMECE-UHFFFAOYSA-N 0.000 description 2
- 108090001050 Phosphoric Diester Hydrolases Proteins 0.000 description 2
- 102000004861 Phosphoric Diester Hydrolases Human genes 0.000 description 2
- 102100036735 Prostate stem cell antigen Human genes 0.000 description 2
- 108010029485 Protein Isoforms Proteins 0.000 description 2
- 102000001708 Protein Isoforms Human genes 0.000 description 2
- 102000052575 Proto-Oncogene Human genes 0.000 description 2
- 108700020978 Proto-Oncogene Proteins 0.000 description 2
- 108010089836 Proto-Oncogene Proteins c-met Proteins 0.000 description 2
- 201000004681 Psoriasis Diseases 0.000 description 2
- 108010009413 Pyrophosphatases Proteins 0.000 description 2
- 102000009609 Pyrophosphatases Human genes 0.000 description 2
- LCTONWCANYUPML-UHFFFAOYSA-N Pyruvic acid Chemical compound CC(=O)C(O)=O LCTONWCANYUPML-UHFFFAOYSA-N 0.000 description 2
- 241000700159 Rattus Species 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- 239000008156 Ringer's lactate solution Substances 0.000 description 2
- 108091006207 SLC-Transporter Proteins 0.000 description 2
- 102000037054 SLC-Transporter Human genes 0.000 description 2
- 206010039491 Sarcoma Diseases 0.000 description 2
- 208000021386 Sjogren Syndrome Diseases 0.000 description 2
- 206010041067 Small cell lung cancer Diseases 0.000 description 2
- 108050001286 Somatostatin Receptor Proteins 0.000 description 2
- 102000011096 Somatostatin receptor Human genes 0.000 description 2
- 102100023802 Somatostatin receptor type 2 Human genes 0.000 description 2
- 102100023806 Somatostatin receptor type 5 Human genes 0.000 description 2
- 208000005718 Stomach Neoplasms Diseases 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- 230000006044 T cell activation Effects 0.000 description 2
- 102100033447 T-lymphocyte surface antigen Ly-9 Human genes 0.000 description 2
- 101710114141 T-lymphocyte surface antigen Ly-9 Proteins 0.000 description 2
- BPEGJWRSRHCHSN-UHFFFAOYSA-N Temozolomide Chemical compound O=C1N(C)N=NC2=C(C(N)=O)N=CN21 BPEGJWRSRHCHSN-UHFFFAOYSA-N 0.000 description 2
- 241001061127 Thione Species 0.000 description 2
- UQVNRKBFAXNOGA-IUODEOHRSA-N Tomaymycin Natural products CO[C@H]1Nc2cc(O)c(OC)cc2C(=O)N3CC(=CC)C[C@H]13 UQVNRKBFAXNOGA-IUODEOHRSA-N 0.000 description 2
- 101710187885 Tumor necrosis factor receptor superfamily member 17 Proteins 0.000 description 2
- 201000006704 Ulcerative Colitis Diseases 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- 208000007097 Urinary Bladder Neoplasms Diseases 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 101710097851 Zinc transporter ZIP6 Proteins 0.000 description 2
- CWRYPZZKDGJXCA-UHFFFAOYSA-N acenaphthene Chemical compound C1=CC(CC2)=C3C2=CC=CC3=C1 CWRYPZZKDGJXCA-UHFFFAOYSA-N 0.000 description 2
- 235000011054 acetic acid Nutrition 0.000 description 2
- 150000001242 acetic acid derivatives Chemical class 0.000 description 2
- 229960004308 acetylcysteine Drugs 0.000 description 2
- DZBUGLKDJFMEHC-UHFFFAOYSA-N acridine Chemical compound C1=CC=CC2=CC3=CC=CC=C3N=C21 DZBUGLKDJFMEHC-UHFFFAOYSA-N 0.000 description 2
- 229930183665 actinomycin Natural products 0.000 description 2
- RJURFGZVJUQBHK-UHFFFAOYSA-N actinomycin D Natural products CC1OC(=O)C(C(C)C)N(C)C(=O)CN(C)C(=O)C2CCCN2C(=O)C(C(C)C)NC(=O)C1NC(=O)C1=C(N)C(=O)C(C)=C2OC(C(C)=CC=C3C(=O)NC4C(=O)NC(C(N5CCCC5C(=O)N(C)CC(=O)N(C)C(C(C)C)C(=O)OC4C)=O)C(C)C)=C3N=C21 RJURFGZVJUQBHK-UHFFFAOYSA-N 0.000 description 2
- 239000013543 active substance Substances 0.000 description 2
- 208000021841 acute erythroid leukemia Diseases 0.000 description 2
- 125000004423 acyloxy group Chemical group 0.000 description 2
- 239000002671 adjuvant Substances 0.000 description 2
- 125000006241 alcohol protecting group Chemical group 0.000 description 2
- 229910001413 alkali metal ion Inorganic materials 0.000 description 2
- 229910052784 alkaline earth metal Inorganic materials 0.000 description 2
- HSFWRNGVRCDJHI-UHFFFAOYSA-N alpha-acetylene Natural products C#C HSFWRNGVRCDJHI-UHFFFAOYSA-N 0.000 description 2
- 230000004075 alteration Effects 0.000 description 2
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 2
- 150000001408 amides Chemical class 0.000 description 2
- MWPLVEDNUUSJAV-UHFFFAOYSA-N anthracene Chemical compound C1=CC=CC2=CC3=CC=CC=C3C=C21 MWPLVEDNUUSJAV-UHFFFAOYSA-N 0.000 description 2
- 230000000340 anti-metabolite Effects 0.000 description 2
- 230000010056 antibody-dependent cellular cytotoxicity Effects 0.000 description 2
- 229940100197 antimetabolite Drugs 0.000 description 2
- 239000002256 antimetabolite Substances 0.000 description 2
- 239000012736 aqueous medium Substances 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 206010064097 avian influenza Diseases 0.000 description 2
- 150000001541 aziridines Chemical class 0.000 description 2
- CUFNKYGDVFVPHO-UHFFFAOYSA-N azulene Chemical compound C1=CC=CC2=CC=CC2=C1 CUFNKYGDVFVPHO-UHFFFAOYSA-N 0.000 description 2
- 229950009568 bemcentinib Drugs 0.000 description 2
- WPYMKLBDIGXBTP-UHFFFAOYSA-N benzoic acid Chemical compound OC(=O)C1=CC=CC=C1 WPYMKLBDIGXBTP-UHFFFAOYSA-N 0.000 description 2
- XSCHRSMBECNVNS-UHFFFAOYSA-N benzopyrazine Natural products N1=CC=NC2=CC=CC=C21 XSCHRSMBECNVNS-UHFFFAOYSA-N 0.000 description 2
- IOJUPLGTWVMSFF-UHFFFAOYSA-N benzothiazole Chemical compound C1=CC=C2SC=NC2=C1 IOJUPLGTWVMSFF-UHFFFAOYSA-N 0.000 description 2
- UCMIRNVEIXFBKS-UHFFFAOYSA-N beta-alanine Chemical compound NCCC(O)=O UCMIRNVEIXFBKS-UHFFFAOYSA-N 0.000 description 2
- 229960000397 bevacizumab Drugs 0.000 description 2
- 229960002685 biotin Drugs 0.000 description 2
- 235000020958 biotin Nutrition 0.000 description 2
- 239000011616 biotin Substances 0.000 description 2
- 210000003995 blood forming stem cell Anatomy 0.000 description 2
- 230000037396 body weight Effects 0.000 description 2
- 210000000481 breast Anatomy 0.000 description 2
- 229940112129 campath Drugs 0.000 description 2
- 239000002775 capsule Substances 0.000 description 2
- 125000003739 carbamimidoyl group Chemical group C(N)(=N)* 0.000 description 2
- 125000003917 carbamoyl group Chemical group [H]N([H])C(*)=O 0.000 description 2
- 125000005488 carboaryl group Chemical group 0.000 description 2
- 150000001720 carbohydrates Chemical class 0.000 description 2
- 239000011203 carbon fibre reinforced carbon Substances 0.000 description 2
- SKOLWUPSYHWYAM-UHFFFAOYSA-N carbonodithioic O,S-acid Chemical compound SC(S)=O SKOLWUPSYHWYAM-UHFFFAOYSA-N 0.000 description 2
- 229960004562 carboplatin Drugs 0.000 description 2
- 239000003054 catalyst Substances 0.000 description 2
- 201000009909 cataract 6 multiple types Diseases 0.000 description 2
- 125000002091 cationic group Chemical group 0.000 description 2
- 230000024245 cell differentiation Effects 0.000 description 2
- 229960005395 cetuximab Drugs 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000002512 chemotherapy Methods 0.000 description 2
- 229960004630 chlorambucil Drugs 0.000 description 2
- JCKYGMPEJWAADB-UHFFFAOYSA-N chlorambucil Chemical compound OC(=O)CCCC1=CC=C(N(CCCl)CCCl)C=C1 JCKYGMPEJWAADB-UHFFFAOYSA-N 0.000 description 2
- OEYIOHPDSNJKLS-UHFFFAOYSA-N choline Chemical compound C[N+](C)(C)CCO OEYIOHPDSNJKLS-UHFFFAOYSA-N 0.000 description 2
- 229960001231 choline Drugs 0.000 description 2
- WCZVZNOTHYJIEI-UHFFFAOYSA-N cinnoline Chemical compound N1=NC=CC2=CC=CC=C21 WCZVZNOTHYJIEI-UHFFFAOYSA-N 0.000 description 2
- 229960004316 cisplatin Drugs 0.000 description 2
- DQLATGHUWYMOKM-UHFFFAOYSA-L cisplatin Chemical compound N[Pt](N)(Cl)Cl DQLATGHUWYMOKM-UHFFFAOYSA-L 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 230000021615 conjugation Effects 0.000 description 2
- 238000004132 cross linking Methods 0.000 description 2
- PSNOPSMXOBPNNV-VVCTWANISA-N cryptophycin 1 Chemical compound C1=C(Cl)C(OC)=CC=C1C[C@@H]1C(=O)NC[C@@H](C)C(=O)O[C@@H](CC(C)C)C(=O)O[C@H]([C@H](C)[C@@H]2[C@H](O2)C=2C=CC=CC=2)C/C=C/C(=O)N1 PSNOPSMXOBPNNV-VVCTWANISA-N 0.000 description 2
- PSNOPSMXOBPNNV-UHFFFAOYSA-N cryptophycin-327 Natural products C1=C(Cl)C(OC)=CC=C1CC1C(=O)NCC(C)C(=O)OC(CC(C)C)C(=O)OC(C(C)C2C(O2)C=2C=CC=CC=2)CC=CC(=O)N1 PSNOPSMXOBPNNV-UHFFFAOYSA-N 0.000 description 2
- IBAHLNWTOIHLKE-UHFFFAOYSA-N cyano cyanate Chemical compound N#COC#N IBAHLNWTOIHLKE-UHFFFAOYSA-N 0.000 description 2
- HGCIXCUEYOPUTN-UHFFFAOYSA-N cyclohexene Chemical compound C1CCC=CC1 HGCIXCUEYOPUTN-UHFFFAOYSA-N 0.000 description 2
- LPIQUOYDBNQMRZ-UHFFFAOYSA-N cyclopentene Chemical compound C1CC=CC1 LPIQUOYDBNQMRZ-UHFFFAOYSA-N 0.000 description 2
- 229960000684 cytarabine Drugs 0.000 description 2
- 229940104302 cytosine Drugs 0.000 description 2
- 230000001085 cytostatic effect Effects 0.000 description 2
- STQGQHZAVUOBTE-VGBVRHCVSA-N daunorubicin Chemical compound O([C@H]1C[C@@](O)(CC=2C(O)=C3C(=O)C=4C=CC=C(C=4C(=O)C3=C(O)C=21)OC)C(C)=O)[C@H]1C[C@H](N)[C@H](O)[C@H](C)O1 STQGQHZAVUOBTE-VGBVRHCVSA-N 0.000 description 2
- 229960000975 daunorubicin Drugs 0.000 description 2
- 230000002939 deleterious effect Effects 0.000 description 2
- TXCDCPKCNAJMEE-UHFFFAOYSA-N dibenzofuran Chemical compound C1=CC=C2C3=CC=CC=C3OC2=C1 TXCDCPKCNAJMEE-UHFFFAOYSA-N 0.000 description 2
- IYYZUPMFVPLQIF-UHFFFAOYSA-N dibenzothiophene Chemical compound C1=CC=C2C3=CC=CC=C3SC2=C1 IYYZUPMFVPLQIF-UHFFFAOYSA-N 0.000 description 2
- ZBCBWPMODOFKDW-UHFFFAOYSA-N diethanolamine Chemical compound OCCNCCO ZBCBWPMODOFKDW-UHFFFAOYSA-N 0.000 description 2
- HPNMFZURTQLUMO-UHFFFAOYSA-N diethylamine Chemical compound CCNCC HPNMFZURTQLUMO-UHFFFAOYSA-N 0.000 description 2
- QWHNJUXXYKPLQM-UHFFFAOYSA-N dimethyl cyclopentane Natural products CC1(C)CCCC1 QWHNJUXXYKPLQM-UHFFFAOYSA-N 0.000 description 2
- XBDQKXXYIPTUBI-UHFFFAOYSA-N dimethylselenoniopropionate Natural products CCC(O)=O XBDQKXXYIPTUBI-UHFFFAOYSA-N 0.000 description 2
- 125000002228 disulfide group Chemical group 0.000 description 2
- 239000003534 dna topoisomerase inhibitor Substances 0.000 description 2
- POULHZVOKOAJMA-UHFFFAOYSA-N dodecanoic acid Chemical compound CCCCCCCCCCCC(O)=O POULHZVOKOAJMA-UHFFFAOYSA-N 0.000 description 2
- 239000003937 drug carrier Substances 0.000 description 2
- 239000012636 effector Substances 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 239000003480 eluent Substances 0.000 description 2
- 230000002616 endonucleolytic effect Effects 0.000 description 2
- 229940011871 estrogen Drugs 0.000 description 2
- 239000000262 estrogen Substances 0.000 description 2
- 125000002534 ethynyl group Chemical group [H]C#C* 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 238000003818 flash chromatography Methods 0.000 description 2
- 239000011737 fluorine Substances 0.000 description 2
- 235000019152 folic acid Nutrition 0.000 description 2
- 239000011724 folic acid Substances 0.000 description 2
- 235000008191 folinic acid Nutrition 0.000 description 2
- 239000011672 folinic acid Substances 0.000 description 2
- 125000002541 furyl group Chemical group 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 150000008195 galaktosides Chemical class 0.000 description 2
- CHPZKNULDCNCBW-UHFFFAOYSA-N gallium nitrate Chemical compound [Ga+3].[O-][N+]([O-])=O.[O-][N+]([O-])=O.[O-][N+]([O-])=O CHPZKNULDCNCBW-UHFFFAOYSA-N 0.000 description 2
- 206010017758 gastric cancer Diseases 0.000 description 2
- 230000002496 gastric effect Effects 0.000 description 2
- 229960005277 gemcitabine Drugs 0.000 description 2
- SDUQYLNIPVEERB-QPPQHZFASA-N gemcitabine Chemical compound O=C1N=C(N)C=CN1[C@H]1C(F)(F)[C@H](O)[C@@H](CO)O1 SDUQYLNIPVEERB-QPPQHZFASA-N 0.000 description 2
- 229960000578 gemtuzumab Drugs 0.000 description 2
- 150000004676 glycans Chemical class 0.000 description 2
- 239000003102 growth factor Substances 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 206010073071 hepatocellular carcinoma Diseases 0.000 description 2
- IPCSVZSSVZVIGE-UHFFFAOYSA-N hexadecanoic acid Chemical compound CCCCCCCCCCCCCCCC(O)=O IPCSVZSSVZVIGE-UHFFFAOYSA-N 0.000 description 2
- 102000054751 human RUNX1T1 Human genes 0.000 description 2
- XMBWDFGMSWQBCA-UHFFFAOYSA-N hydrogen iodide Chemical compound I XMBWDFGMSWQBCA-UHFFFAOYSA-N 0.000 description 2
- 230000003463 hyperproliferative effect Effects 0.000 description 2
- 229960001101 ifosfamide Drugs 0.000 description 2
- HOMGKSMUEGBAAB-UHFFFAOYSA-N ifosfamide Chemical compound ClCCNP1(=O)OCCCN1CCCl HOMGKSMUEGBAAB-UHFFFAOYSA-N 0.000 description 2
- 230000005847 immunogenicity Effects 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- PQNFLJBBNBOBRQ-UHFFFAOYSA-N indane Chemical compound C1=CC=C2CCCC2=C1 PQNFLJBBNBOBRQ-UHFFFAOYSA-N 0.000 description 2
- SUMDYPCJJOFFON-UHFFFAOYSA-N isethionic acid Chemical compound OCCS(O)(=O)=O SUMDYPCJJOFFON-UHFFFAOYSA-N 0.000 description 2
- 125000000959 isobutyl group Chemical group [H]C([H])([H])C([H])(C([H])([H])[H])C([H])([H])* 0.000 description 2
- AWJUIBRHMBBTKR-UHFFFAOYSA-N isoquinoline Chemical compound C1=NC=CC2=CC=CC=C21 AWJUIBRHMBBTKR-UHFFFAOYSA-N 0.000 description 2
- CTAPFRYPJLPFDF-UHFFFAOYSA-N isoxazole Chemical compound C=1C=NOC=1 CTAPFRYPJLPFDF-UHFFFAOYSA-N 0.000 description 2
- JVTAAEKCZFNVCJ-UHFFFAOYSA-N lactic acid Chemical compound CC(O)C(O)=O JVTAAEKCZFNVCJ-UHFFFAOYSA-N 0.000 description 2
- BCFGMOOMADDAQU-UHFFFAOYSA-N lapatinib Chemical compound O1C(CNCCS(=O)(=O)C)=CC=C1C1=CC=C(N=CN=C2NC=3C=C(Cl)C(OCC=4C=C(F)C=CC=4)=CC=3)C2=C1 BCFGMOOMADDAQU-UHFFFAOYSA-N 0.000 description 2
- 229960003881 letrozole Drugs 0.000 description 2
- HPJKCIUCZWXJDR-UHFFFAOYSA-N letrozole Chemical compound C1=CC(C#N)=CC=C1C(N1N=CN=C1)C1=CC=C(C#N)C=C1 HPJKCIUCZWXJDR-UHFFFAOYSA-N 0.000 description 2
- 229960001691 leucovorin Drugs 0.000 description 2
- 229950002950 lintuzumab Drugs 0.000 description 2
- 210000004185 liver Anatomy 0.000 description 2
- 201000007270 liver cancer Diseases 0.000 description 2
- 210000004072 lung Anatomy 0.000 description 2
- 206010025135 lupus erythematosus Diseases 0.000 description 2
- MQXVYODZCMMZEM-ZYUZMQFOSA-N mannomustine Chemical compound ClCCNC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CNCCCl MQXVYODZCMMZEM-ZYUZMQFOSA-N 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 229960003194 meglumine Drugs 0.000 description 2
- 210000004379 membrane Anatomy 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 229960001428 mercaptopurine Drugs 0.000 description 2
- BDJAEZRIGNCQBZ-UHFFFAOYSA-N methylcyclobutane Chemical compound CC1CCC1 BDJAEZRIGNCQBZ-UHFFFAOYSA-N 0.000 description 2
- UAEPNZWRGJTJPN-UHFFFAOYSA-N methylcyclohexane Chemical compound CC1CCCCC1 UAEPNZWRGJTJPN-UHFFFAOYSA-N 0.000 description 2
- 125000004170 methylsulfonyl group Chemical group [H]C([H])([H])S(*)(=O)=O 0.000 description 2
- 229960001156 mitoxantrone Drugs 0.000 description 2
- KKZJGLLVHKMTCM-UHFFFAOYSA-N mitoxantrone Chemical compound O=C1C2=C(O)C=CC(O)=C2C(=O)C2=C1C(NCCNCCO)=CC=C2NCCNCCO KKZJGLLVHKMTCM-UHFFFAOYSA-N 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 229940051875 mucins Drugs 0.000 description 2
- 201000006417 multiple sclerosis Diseases 0.000 description 2
- GTWJETSWSUWSEJ-UHFFFAOYSA-N n-benzylaniline Chemical compound C=1C=CC=CC=1CNC1=CC=CC=C1 GTWJETSWSUWSEJ-UHFFFAOYSA-N 0.000 description 2
- 125000004108 n-butyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])C([H])([H])* 0.000 description 2
- 125000004123 n-propyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])* 0.000 description 2
- LQNUZADURLCDLV-UHFFFAOYSA-N nitrobenzene Chemical compound [O-][N+](=O)C1=CC=CC=C1 LQNUZADURLCDLV-UHFFFAOYSA-N 0.000 description 2
- 125000000018 nitroso group Chemical group N(=O)* 0.000 description 2
- WPHGSKGZRAQSGP-UHFFFAOYSA-N norcarane Chemical compound C1CCCC2CC21 WPHGSKGZRAQSGP-UHFFFAOYSA-N 0.000 description 2
- 230000000269 nucleophilic effect Effects 0.000 description 2
- 229960002450 ofatumumab Drugs 0.000 description 2
- 239000003921 oil Substances 0.000 description 2
- 235000019198 oils Nutrition 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 150000002891 organic anions Chemical class 0.000 description 2
- CTSLXHKWHWQRSH-UHFFFAOYSA-N oxalyl chloride Chemical compound ClC(=O)C(Cl)=O CTSLXHKWHWQRSH-UHFFFAOYSA-N 0.000 description 2
- 239000005022 packaging material Substances 0.000 description 2
- 230000036961 partial effect Effects 0.000 description 2
- 230000007170 pathology Effects 0.000 description 2
- 239000003208 petroleum Substances 0.000 description 2
- YNPNZTXNASCQKK-UHFFFAOYSA-N phenanthrene Chemical compound C1=CC=C2C3=CC=CC=C3C=CC2=C1 YNPNZTXNASCQKK-UHFFFAOYSA-N 0.000 description 2
- RDOWQLZANAYVLL-UHFFFAOYSA-N phenanthridine Chemical compound C1=CC=C2C3=CC=CC=C3C=NC2=C1 RDOWQLZANAYVLL-UHFFFAOYSA-N 0.000 description 2
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 2
- PTMHPRAIXMAOOB-UHFFFAOYSA-N phosphoramidic acid Chemical group NP(O)(O)=O PTMHPRAIXMAOOB-UHFFFAOYSA-N 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 230000002265 prevention Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 125000006308 propyl amino group Chemical group 0.000 description 2
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N puromycin Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 description 2
- BBEAQIROQSPTKN-UHFFFAOYSA-N pyrene Chemical compound C1=CC=C2C=CC3=CC=CC4=CC=C1C2=C43 BBEAQIROQSPTKN-UHFFFAOYSA-N 0.000 description 2
- 125000001453 quaternary ammonium group Chemical group 0.000 description 2
- GZUITABIAKMVPG-UHFFFAOYSA-N raloxifene Chemical compound C1=CC(O)=CC=C1C1=C(C(=O)C=2C=CC(OCCN3CCCCC3)=CC=2)C2=CC=C(O)C=C2S1 GZUITABIAKMVPG-UHFFFAOYSA-N 0.000 description 2
- 229960004622 raloxifene Drugs 0.000 description 2
- 230000009257 reactivity Effects 0.000 description 2
- 238000006268 reductive amination reaction Methods 0.000 description 2
- 238000010992 reflux Methods 0.000 description 2
- 238000004007 reversed phase HPLC Methods 0.000 description 2
- 206010039073 rheumatoid arthritis Diseases 0.000 description 2
- 229960004641 rituximab Drugs 0.000 description 2
- HXCHCVDVKSCDHU-PJKCJEBCSA-N s-[(2r,3s,4s,6s)-6-[[(2r,3s,4s,5r,6r)-5-[(2s,4s,5s)-5-(ethylamino)-4-methoxyoxan-2-yl]oxy-4-hydroxy-6-[[(2s,5z,9r,13e)-9-hydroxy-12-(methoxycarbonylamino)-13-[2-(methyltrisulfanyl)ethylidene]-11-oxo-2-bicyclo[7.3.1]trideca-1(12),5-dien-3,7-diynyl]oxy]-2-m Chemical compound C1[C@H](OC)[C@@H](NCC)CO[C@H]1O[C@H]1[C@H](O[C@@H]2C\3=C(NC(=O)OC)C(=O)C[C@@](C/3=C/CSSSC)(O)C#C\C=C/C#C2)O[C@H](C)[C@@H](NO[C@@H]2O[C@H](C)[C@@H](SC(=O)C=3C(=C(OC)C(O[C@H]4[C@@H]([C@H](OC)[C@@H](O)[C@H](C)O4)O)=C(I)C=3C)OC)[C@@H](O)C2)[C@@H]1O HXCHCVDVKSCDHU-PJKCJEBCSA-N 0.000 description 2
- YGSDEFSMJLZEOE-UHFFFAOYSA-N salicylic acid Chemical compound OC(=O)C1=CC=CC=C1O YGSDEFSMJLZEOE-UHFFFAOYSA-N 0.000 description 2
- 150000003335 secondary amines Chemical class 0.000 description 2
- 229940095743 selective estrogen receptor modulator Drugs 0.000 description 2
- 239000000333 selective estrogen receptor modulator Substances 0.000 description 2
- CYOHGALHFOKKQC-UHFFFAOYSA-N selumetinib Chemical compound OCCONC(=O)C=1C=C2N(C)C=NC2=C(F)C=1NC1=CC=C(Br)C=C1Cl CYOHGALHFOKKQC-UHFFFAOYSA-N 0.000 description 2
- NDVLTYZPCACLMA-UHFFFAOYSA-N silver oxide Chemical compound [O-2].[Ag+].[Ag+] NDVLTYZPCACLMA-UHFFFAOYSA-N 0.000 description 2
- 229940115586 simulect Drugs 0.000 description 2
- 208000000587 small cell lung carcinoma Diseases 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 239000008354 sodium chloride injection Substances 0.000 description 2
- 102000004052 somatostatin receptor 2 Human genes 0.000 description 2
- 102000004115 somatostatin receptor 5 Human genes 0.000 description 2
- 229960003787 sorafenib Drugs 0.000 description 2
- 210000000130 stem cell Anatomy 0.000 description 2
- 201000011549 stomach cancer Diseases 0.000 description 2
- PVYJZLYGTZKPJE-UHFFFAOYSA-N streptonigrin Chemical compound C=1C=C2C(=O)C(OC)=C(N)C(=O)C2=NC=1C(C=1N)=NC(C(O)=O)=C(C)C=1C1=CC=C(OC)C(OC)=C1O PVYJZLYGTZKPJE-UHFFFAOYSA-N 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-L sulfate group Chemical group S(=O)(=O)([O-])[O-] QAOWNCQODCNURD-UHFFFAOYSA-L 0.000 description 2
- 150000003871 sulfonates Chemical class 0.000 description 2
- 150000003457 sulfones Chemical class 0.000 description 2
- RWSOTUBLDIXVET-UHFFFAOYSA-O sulfonium group Chemical group [SH3+] RWSOTUBLDIXVET-UHFFFAOYSA-O 0.000 description 2
- 208000024891 symptom Diseases 0.000 description 2
- ISIJQEHRDSCQIU-UHFFFAOYSA-N tert-butyl 2,7-diazaspiro[4.5]decane-7-carboxylate Chemical compound C1N(C(=O)OC(C)(C)C)CCCC11CNCC1 ISIJQEHRDSCQIU-UHFFFAOYSA-N 0.000 description 2
- 125000001981 tert-butyldimethylsilyl group Chemical group [H]C([H])([H])[Si]([H])(C([H])([H])[H])[*]C(C([H])([H])[H])(C([H])([H])[H])C([H])([H])[H] 0.000 description 2
- CZDYPVPMEAXLPK-UHFFFAOYSA-N tetramethylsilane Chemical compound C[Si](C)(C)C CZDYPVPMEAXLPK-UHFFFAOYSA-N 0.000 description 2
- JWCVYQRPINPYQJ-UHFFFAOYSA-N thiepane Chemical compound C1CCCSCC1 JWCVYQRPINPYQJ-UHFFFAOYSA-N 0.000 description 2
- 229960001196 thiotepa Drugs 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 206010043778 thyroiditis Diseases 0.000 description 2
- 229960003087 tioguanine Drugs 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- 229940044693 topoisomerase inhibitor Drugs 0.000 description 2
- 229960005267 tositumomab Drugs 0.000 description 2
- 230000001988 toxicity Effects 0.000 description 2
- 231100000419 toxicity Toxicity 0.000 description 2
- VZCYOOQTPOCHFL-UHFFFAOYSA-N trans-butenedioic acid Natural products OC(=O)C=CC(O)=O VZCYOOQTPOCHFL-UHFFFAOYSA-N 0.000 description 2
- 230000005945 translocation Effects 0.000 description 2
- 238000011269 treatment regimen Methods 0.000 description 2
- AQRLNPVMDITEJU-UHFFFAOYSA-N triethylsilane Chemical compound CC[SiH](CC)CC AQRLNPVMDITEJU-UHFFFAOYSA-N 0.000 description 2
- QAEDZJGFFMLHHQ-UHFFFAOYSA-N trifluoroacetic anhydride Chemical compound FC(F)(F)C(=O)OC(=O)C(F)(F)F QAEDZJGFFMLHHQ-UHFFFAOYSA-N 0.000 description 2
- LWIHDJKSTIGBAC-UHFFFAOYSA-K tripotassium phosphate Chemical compound [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 2
- 229960000281 trometamol Drugs 0.000 description 2
- 201000005112 urinary bladder cancer Diseases 0.000 description 2
- 229960005486 vaccine Drugs 0.000 description 2
- NQPDZGIKBAWPEJ-UHFFFAOYSA-N valeric acid Chemical compound CCCCC(O)=O NQPDZGIKBAWPEJ-UHFFFAOYSA-N 0.000 description 2
- UHTHHESEBZOYNR-UHFFFAOYSA-N vandetanib Chemical compound COC1=CC(C(/N=CN2)=N/C=3C(=CC(Br)=CC=3)F)=C2C=C1OCC1CCN(C)CC1 UHTHHESEBZOYNR-UHFFFAOYSA-N 0.000 description 2
- 229960000241 vandetanib Drugs 0.000 description 2
- 235000015112 vegetable and seed oil Nutrition 0.000 description 2
- 239000008158 vegetable oil Substances 0.000 description 2
- 239000003981 vehicle Substances 0.000 description 2
- 229950000815 veltuzumab Drugs 0.000 description 2
- 239000003643 water by type Substances 0.000 description 2
- 229950008250 zalutumumab Drugs 0.000 description 2
- OGLCQHRZUSEXNB-UAPNVWQMSA-N (2r,3r,3ar,6ar)-2,3,6-trihydroxy-3,3a,6,6a-tetrahydro-2h-furo[3,2-b]furan-5-one Chemical compound OC1C(=O)O[C@@H]2[C@@H](O)[C@H](O)O[C@@H]21 OGLCQHRZUSEXNB-UAPNVWQMSA-N 0.000 description 1
- AGGWFDNPHKLBBV-YUMQZZPRSA-N (2s)-2-[[(2s)-2-amino-3-methylbutanoyl]amino]-5-(carbamoylamino)pentanoic acid Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=O AGGWFDNPHKLBBV-YUMQZZPRSA-N 0.000 description 1
- SVNJBEMPMKWDCO-KCHLEUMXSA-N (2s)-2-[[(2s)-3-carboxy-2-[[2-[[(2s)-5-(diaminomethylideneamino)-2-[[4-oxo-4-[[4-(4-oxo-8-phenylchromen-2-yl)morpholin-4-ium-4-yl]methoxy]butanoyl]amino]pentanoyl]amino]acetyl]amino]propanoyl]amino]-3-hydroxypropanoate Chemical compound C=1C(=O)C2=CC=CC(C=3C=CC=CC=3)=C2OC=1[N+]1(COC(=O)CCC(=O)N[C@@H](CCCNC(=N)N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C([O-])=O)CCOCC1 SVNJBEMPMKWDCO-KCHLEUMXSA-N 0.000 description 1
- FLWWDYNPWOSLEO-HQVZTVAUSA-N (2s)-2-[[4-[1-(2-amino-4-oxo-1h-pteridin-6-yl)ethyl-methylamino]benzoyl]amino]pentanedioic acid Chemical compound C=1N=C2NC(N)=NC(=O)C2=NC=1C(C)N(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FLWWDYNPWOSLEO-HQVZTVAUSA-N 0.000 description 1
- ALBODLTZUXKBGZ-JUUVMNCLSA-N (2s)-2-amino-3-phenylpropanoic acid;(2s)-2,6-diaminohexanoic acid Chemical group NCCCC[C@H](N)C(O)=O.OC(=O)[C@@H](N)CC1=CC=CC=C1 ALBODLTZUXKBGZ-JUUVMNCLSA-N 0.000 description 1
- JPSHPWJJSVEEAX-OWPBQMJCSA-N (2s)-2-amino-4-fluoranylpentanedioic acid Chemical compound OC(=O)[C@@H](N)CC([18F])C(O)=O JPSHPWJJSVEEAX-OWPBQMJCSA-N 0.000 description 1
- TVIRNGFXQVMMGB-OFWIHYRESA-N (3s,6r,10r,13e,16s)-16-[(2r,3r,4s)-4-chloro-3-hydroxy-4-phenylbutan-2-yl]-10-[(3-chloro-4-methoxyphenyl)methyl]-6-methyl-3-(2-methylpropyl)-1,4-dioxa-8,11-diazacyclohexadec-13-ene-2,5,9,12-tetrone Chemical compound C1=C(Cl)C(OC)=CC=C1C[C@@H]1C(=O)NC[C@@H](C)C(=O)O[C@@H](CC(C)C)C(=O)O[C@H]([C@H](C)[C@@H](O)[C@@H](Cl)C=2C=CC=CC=2)C/C=C/C(=O)N1 TVIRNGFXQVMMGB-OFWIHYRESA-N 0.000 description 1
- RWZVMMQNDHPRQD-SFTDATJTSA-N (6as)-3-[3-[[(6as)-2-methoxy-8-methylidene-11-oxo-7,9-dihydro-6ah-pyrrolo[2,1-c][1,4]benzodiazepin-3-yl]oxy]propoxy]-2-methoxy-8-methylidene-7,9-dihydro-6ah-pyrrolo[2,1-c][1,4]benzodiazepin-11-one Chemical compound N1=C[C@@H]2CC(=C)CN2C(=O)C(C=C2OC)=C1C=C2OCCCOC1=CC(N=C[C@H]2N(CC(=C)C2)C2=O)=C2C=C1OC RWZVMMQNDHPRQD-SFTDATJTSA-N 0.000 description 1
- SXCIMUIAZXOVIR-PUCKCBAPSA-N (7s,9s)-6,9,11-trihydroxy-9-(2-hydroxyacetyl)-7-[(2r,4s,5s,6s)-5-hydroxy-6-methyl-4-morpholin-4-yloxan-2-yl]oxy-4-methoxy-8,10-dihydro-7h-tetracene-5,12-dione Chemical compound N1([C@H]2C[C@@H](O[C@@H](C)[C@H]2O)O[C@H]2C[C@@](O)(CC=3C(O)=C4C(=O)C=5C=CC=C(C=5C(=O)C4=C(O)C=32)OC)C(=O)CO)CCOCC1 SXCIMUIAZXOVIR-PUCKCBAPSA-N 0.000 description 1
- FPVKHBSQESCIEP-UHFFFAOYSA-N (8S)-3-(2-deoxy-beta-D-erythro-pentofuranosyl)-3,6,7,8-tetrahydroimidazo[4,5-d][1,3]diazepin-8-ol Natural products C1C(O)C(CO)OC1N1C(NC=NCC2O)=C2N=C1 FPVKHBSQESCIEP-UHFFFAOYSA-N 0.000 description 1
- IEXUMDBQLIVNHZ-YOUGDJEHSA-N (8s,11r,13r,14s,17s)-11-[4-(dimethylamino)phenyl]-17-hydroxy-17-(3-hydroxypropyl)-13-methyl-1,2,6,7,8,11,12,14,15,16-decahydrocyclopenta[a]phenanthren-3-one Chemical compound C1=CC(N(C)C)=CC=C1[C@@H]1C2=C3CCC(=O)C=C3CC[C@H]2[C@H](CC[C@]2(O)CCCO)[C@@]2(C)C1 IEXUMDBQLIVNHZ-YOUGDJEHSA-N 0.000 description 1
- 125000004400 (C1-C12) alkyl group Chemical group 0.000 description 1
- 125000004178 (C1-C4) alkyl group Chemical group 0.000 description 1
- 125000006701 (C1-C7) alkyl group Chemical group 0.000 description 1
- WRIDQFICGBMAFQ-UHFFFAOYSA-N (E)-8-Octadecenoic acid Natural products CCCCCCCCCC=CCCCCCCC(O)=O WRIDQFICGBMAFQ-UHFFFAOYSA-N 0.000 description 1
- FDKXTQMXEQVLRF-ZHACJKMWSA-N (E)-dacarbazine Chemical compound CN(C)\N=N\c1[nH]cnc1C(N)=O FDKXTQMXEQVLRF-ZHACJKMWSA-N 0.000 description 1
- RXZBMPWDPOLZGW-XMRMVWPWSA-N (E)-roxithromycin Chemical compound O([C@@H]1[C@@H](C)C(=O)O[C@@H]([C@@]([C@H](O)[C@@H](C)C(=N/OCOCCOC)/[C@H](C)C[C@@](C)(O)[C@H](O[C@H]2[C@@H]([C@H](C[C@@H](C)O2)N(C)C)O)[C@H]1C)(C)O)CC)[C@H]1C[C@@](C)(OC)[C@@H](O)[C@H](C)O1 RXZBMPWDPOLZGW-XMRMVWPWSA-N 0.000 description 1
- LKJPYSCBVHEWIU-KRWDZBQOSA-N (R)-bicalutamide Chemical compound C([C@@](O)(C)C(=O)NC=1C=C(C(C#N)=CC=1)C(F)(F)F)S(=O)(=O)C1=CC=C(F)C=C1 LKJPYSCBVHEWIU-KRWDZBQOSA-N 0.000 description 1
- MIOPJNTWMNEORI-GMSGAONNSA-N (S)-camphorsulfonic acid Chemical compound C1C[C@@]2(CS(O)(=O)=O)C(=O)C[C@@H]1C2(C)C MIOPJNTWMNEORI-GMSGAONNSA-N 0.000 description 1
- BJEPYKJPYRNKOW-REOHCLBHSA-N (S)-malic acid Chemical compound OC(=O)[C@@H](O)CC(O)=O BJEPYKJPYRNKOW-REOHCLBHSA-N 0.000 description 1
- AGNGYMCLFWQVGX-AGFFZDDWSA-N (e)-1-[(2s)-2-amino-2-carboxyethoxy]-2-diazonioethenolate Chemical compound OC(=O)[C@@H](N)CO\C([O-])=C\[N+]#N AGNGYMCLFWQVGX-AGFFZDDWSA-N 0.000 description 1
- UWYVPFMHMJIBHE-OWOJBTEDSA-N (e)-2-hydroxybut-2-enedioic acid Chemical compound OC(=O)\C=C(\O)C(O)=O UWYVPFMHMJIBHE-OWOJBTEDSA-N 0.000 description 1
- WBYWAXJHAXSJNI-VOTSOKGWSA-M .beta-Phenylacrylic acid Natural products [O-]C(=O)\C=C\C1=CC=CC=C1 WBYWAXJHAXSJNI-VOTSOKGWSA-M 0.000 description 1
- OFZYBEBWCZBCPM-UHFFFAOYSA-N 1,1-dimethylcyclobutane Chemical compound CC1(C)CCC1 OFZYBEBWCZBCPM-UHFFFAOYSA-N 0.000 description 1
- PBIJFSCPEFQXBB-UHFFFAOYSA-N 1,1-dimethylcyclopropane Chemical compound CC1(C)CC1 PBIJFSCPEFQXBB-UHFFFAOYSA-N 0.000 description 1
- FNQJDLTXOVEEFB-UHFFFAOYSA-N 1,2,3-benzothiadiazole Chemical compound C1=CC=C2SN=NC2=C1 FNQJDLTXOVEEFB-UHFFFAOYSA-N 0.000 description 1
- JYEUMXHLPRZUAT-UHFFFAOYSA-N 1,2,3-triazine Chemical compound C1=CN=NN=C1 JYEUMXHLPRZUAT-UHFFFAOYSA-N 0.000 description 1
- KTZQTRPPVKQPFO-UHFFFAOYSA-N 1,2-benzoxazole Chemical compound C1=CC=C2C=NOC2=C1 KTZQTRPPVKQPFO-UHFFFAOYSA-N 0.000 description 1
- PUAKTHBSHFXVAG-UHFFFAOYSA-N 1,2-dimethylcyclobutene Chemical compound CC1=C(C)CC1 PUAKTHBSHFXVAG-UHFFFAOYSA-N 0.000 description 1
- SZZWLAZADBEDQP-UHFFFAOYSA-N 1,2-dimethylcyclopentene Chemical compound CC1=C(C)CCC1 SZZWLAZADBEDQP-UHFFFAOYSA-N 0.000 description 1
- BGJSXRVXTHVRSN-UHFFFAOYSA-N 1,3,5-trioxane Chemical compound C1OCOCO1 BGJSXRVXTHVRSN-UHFFFAOYSA-N 0.000 description 1
- BCMCBBGGLRIHSE-UHFFFAOYSA-N 1,3-benzoxazole Chemical compound C1=CC=C2OC=NC2=C1 BCMCBBGGLRIHSE-UHFFFAOYSA-N 0.000 description 1
- FONKWHRXTPJODV-DNQXCXABSA-N 1,3-bis[2-[(8s)-8-(chloromethyl)-4-hydroxy-1-methyl-7,8-dihydro-3h-pyrrolo[3,2-e]indole-6-carbonyl]-1h-indol-5-yl]urea Chemical compound C1([C@H](CCl)CN2C(=O)C=3NC4=CC=C(C=C4C=3)NC(=O)NC=3C=C4C=C(NC4=CC=3)C(=O)N3C4=CC(O)=C5NC=C(C5=C4[C@H](CCl)C3)C)=C2C=C(O)C2=C1C(C)=CN2 FONKWHRXTPJODV-DNQXCXABSA-N 0.000 description 1
- WNXJIVFYUVYPPR-UHFFFAOYSA-N 1,3-dioxolane Chemical class C1COCO1 WNXJIVFYUVYPPR-UHFFFAOYSA-N 0.000 description 1
- QJQGMOIUEXCUOH-UHFFFAOYSA-N 1,3-oxazole;1h-pyrrole Chemical compound C=1C=CNC=1.C1=COC=N1 QJQGMOIUEXCUOH-UHFFFAOYSA-N 0.000 description 1
- OGYGFUAIIOPWQD-UHFFFAOYSA-N 1,3-thiazolidine Chemical compound C1CSCN1 OGYGFUAIIOPWQD-UHFFFAOYSA-N 0.000 description 1
- YNGDWRXWKFWCJY-UHFFFAOYSA-N 1,4-Dihydropyridine Chemical compound C1C=CNC=C1 YNGDWRXWKFWCJY-UHFFFAOYSA-N 0.000 description 1
- RSWGJHLUYNHPMX-UHFFFAOYSA-N 1,4a-dimethyl-7-propan-2-yl-2,3,4,4b,5,6,10,10a-octahydrophenanthrene-1-carboxylic acid Chemical compound C12CCC(C(C)C)=CC2=CCC2C1(C)CCCC2(C)C(O)=O RSWGJHLUYNHPMX-UHFFFAOYSA-N 0.000 description 1
- VMLKTERJLVWEJJ-UHFFFAOYSA-N 1,5-naphthyridine Chemical compound C1=CC=NC2=CC=CN=C21 VMLKTERJLVWEJJ-UHFFFAOYSA-N 0.000 description 1
- FLBAYUMRQUHISI-UHFFFAOYSA-N 1,8-naphthyridine Chemical compound N1=CC=CC2=CC=CN=C21 FLBAYUMRQUHISI-UHFFFAOYSA-N 0.000 description 1
- ZGEGCLOFRBLKSE-UHFFFAOYSA-N 1-Heptene Chemical group CCCCCC=C ZGEGCLOFRBLKSE-UHFFFAOYSA-N 0.000 description 1
- TUSDEZXZIZRFGC-UHFFFAOYSA-N 1-O-galloyl-3,6-(R)-HHDP-beta-D-glucose Natural products OC1C(O2)COC(=O)C3=CC(O)=C(O)C(O)=C3C3=C(O)C(O)=C(O)C=C3C(=O)OC1C(O)C2OC(=O)C1=CC(O)=C(O)C(O)=C1 TUSDEZXZIZRFGC-UHFFFAOYSA-N 0.000 description 1
- LYLMANSTAHZWBY-UHFFFAOYSA-N 1-[6-(1-hydroxy-2,5-dioxopyrrolidin-3-yl)-6-oxohexyl]pyrrole-2,5-dione Chemical compound O=C1N(O)C(=O)CC1C(=O)CCCCCN1C(=O)C=CC1=O LYLMANSTAHZWBY-UHFFFAOYSA-N 0.000 description 1
- 125000006018 1-methyl-ethenyl group Chemical group 0.000 description 1
- AVPHQXWAMGTQPF-UHFFFAOYSA-N 1-methylcyclobutene Chemical compound CC1=CCC1 AVPHQXWAMGTQPF-UHFFFAOYSA-N 0.000 description 1
- ATQUFXWBVZUTKO-UHFFFAOYSA-N 1-methylcyclopentene Chemical compound CC1=CCCC1 ATQUFXWBVZUTKO-UHFFFAOYSA-N 0.000 description 1
- SHDPRTQPPWIEJG-UHFFFAOYSA-N 1-methylcyclopropene Chemical compound CC1=CC1 SHDPRTQPPWIEJG-UHFFFAOYSA-N 0.000 description 1
- NFGXHKASABOEEW-UHFFFAOYSA-N 1-methylethyl 11-methoxy-3,7,11-trimethyl-2,4-dodecadienoate Chemical group COC(C)(C)CCCC(C)CC=CC(C)=CC(=O)OC(C)C NFGXHKASABOEEW-UHFFFAOYSA-N 0.000 description 1
- 125000006017 1-propenyl group Chemical group 0.000 description 1
- WJFKNYWRSNBZNX-UHFFFAOYSA-N 10H-phenothiazine Chemical compound C1=CC=C2NC3=CC=CC=C3SC2=C1 WJFKNYWRSNBZNX-UHFFFAOYSA-N 0.000 description 1
- TZMSYXZUNZXBOL-UHFFFAOYSA-N 10H-phenoxazine Chemical compound C1=CC=C2NC3=CC=CC=C3OC2=C1 TZMSYXZUNZXBOL-UHFFFAOYSA-N 0.000 description 1
- HYZJCKYKOHLVJF-UHFFFAOYSA-N 1H-benzimidazole Chemical compound C1=CC=C2NC=NC2=C1 HYZJCKYKOHLVJF-UHFFFAOYSA-N 0.000 description 1
- MFJCPDOGFAYSTF-UHFFFAOYSA-N 1H-isochromene Chemical compound C1=CC=C2COC=CC2=C1 MFJCPDOGFAYSTF-UHFFFAOYSA-N 0.000 description 1
- AAQTWLBJPNLKHT-UHFFFAOYSA-N 1H-perimidine Chemical compound N1C=NC2=CC=CC3=CC=CC1=C32 AAQTWLBJPNLKHT-UHFFFAOYSA-N 0.000 description 1
- IEMAOEFPZAIMCN-UHFFFAOYSA-N 1H-pyrazole Chemical compound C=1C=NNC=1.C=1C=NNC=1 IEMAOEFPZAIMCN-UHFFFAOYSA-N 0.000 description 1
- HUEXNHSMABCRTH-UHFFFAOYSA-N 1h-imidazole Chemical compound C1=CNC=N1.C1=CNC=N1 HUEXNHSMABCRTH-UHFFFAOYSA-N 0.000 description 1
- AWBOSXFRPFZLOP-UHFFFAOYSA-N 2,1,3-benzoxadiazole Chemical compound C1=CC=CC2=NON=C21 AWBOSXFRPFZLOP-UHFFFAOYSA-N 0.000 description 1
- KWMLJOLKUYYJFJ-UHFFFAOYSA-N 2,3,4,5,6,7-Hexahydroxyheptanoic acid Chemical compound OCC(O)C(O)C(O)C(O)C(O)C(O)=O KWMLJOLKUYYJFJ-UHFFFAOYSA-N 0.000 description 1
- BTOTXLJHDSNXMW-POYBYMJQSA-N 2,3-dideoxyuridine Chemical compound O1[C@H](CO)CC[C@@H]1N1C(=O)NC(=O)C=C1 BTOTXLJHDSNXMW-POYBYMJQSA-N 0.000 description 1
- FJRPOHLDJUJARI-UHFFFAOYSA-N 2,3-dihydro-1,2-oxazole Chemical compound C1NOC=C1 FJRPOHLDJUJARI-UHFFFAOYSA-N 0.000 description 1
- ZABMHLDQFJHDSC-UHFFFAOYSA-N 2,3-dihydro-1,3-oxazole Chemical compound C1NC=CO1 ZABMHLDQFJHDSC-UHFFFAOYSA-N 0.000 description 1
- FLNPFFMWAPTGOT-UHFFFAOYSA-N 2,3-dihydro-1h-pyrazole Chemical compound C1NNC=C1.C1NNC=C1 FLNPFFMWAPTGOT-UHFFFAOYSA-N 0.000 description 1
- HABBZWBWPKGYSL-UHFFFAOYSA-N 2,3-dihydrofuran;furan Chemical compound C1CC=CO1.C=1C=COC=1 HABBZWBWPKGYSL-UHFFFAOYSA-N 0.000 description 1
- OISVCGZHLKNMSJ-UHFFFAOYSA-N 2,6-dimethyl-pyridine Natural products CC1=CC=CC(C)=N1 OISVCGZHLKNMSJ-UHFFFAOYSA-N 0.000 description 1
- LBLYYCQCTBFVLH-UHFFFAOYSA-N 2-Methylbenzenesulfonic acid Chemical compound CC1=CC=CC=C1S(O)(=O)=O LBLYYCQCTBFVLH-UHFFFAOYSA-N 0.000 description 1
- QINPEPAQOBZPOF-UHFFFAOYSA-N 2-amino-n-[3-[[3-(2-chloro-5-methoxyanilino)quinoxalin-2-yl]sulfamoyl]phenyl]-2-methylpropanamide Chemical compound COC1=CC=C(Cl)C(NC=2C(=NC3=CC=CC=C3N=2)NS(=O)(=O)C=2C=C(NC(=O)C(C)(C)N)C=CC=2)=C1 QINPEPAQOBZPOF-UHFFFAOYSA-N 0.000 description 1
- UXGVMFHEKMGWMA-UHFFFAOYSA-N 2-benzofuran Chemical compound C1=CC=CC2=COC=C21 UXGVMFHEKMGWMA-UHFFFAOYSA-N 0.000 description 1
- 125000004182 2-chlorophenyl group Chemical group [H]C1=C([H])C(Cl)=C(*)C([H])=C1[H] 0.000 description 1
- 125000002941 2-furyl group Chemical group O1C([*])=C([H])C([H])=C1[H] 0.000 description 1
- NEAQRZUHTPSBBM-UHFFFAOYSA-N 2-hydroxy-3,3-dimethyl-7-nitro-4h-isoquinolin-1-one Chemical compound C1=C([N+]([O-])=O)C=C2C(=O)N(O)C(C)(C)CC2=C1 NEAQRZUHTPSBBM-UHFFFAOYSA-N 0.000 description 1
- CTRPRMNBTVRDFH-UHFFFAOYSA-N 2-n-methyl-1,3,5-triazine-2,4,6-triamine Chemical class CNC1=NC(N)=NC(N)=N1 CTRPRMNBTVRDFH-UHFFFAOYSA-N 0.000 description 1
- 229940080296 2-naphthalenesulfonate Drugs 0.000 description 1
- WLJVXDMOQOGPHL-PPJXEINESA-N 2-phenylacetic acid Chemical compound O[14C](=O)CC1=CC=CC=C1 WLJVXDMOQOGPHL-PPJXEINESA-N 0.000 description 1
- VSWICNJIUPRZIK-UHFFFAOYSA-N 2-piperideine Chemical compound C1CNC=CC1 VSWICNJIUPRZIK-UHFFFAOYSA-N 0.000 description 1
- UUIMDJFBHNDZOW-UHFFFAOYSA-N 2-tert-butylpyridine Chemical compound CC(C)(C)C1=CC=CC=N1 UUIMDJFBHNDZOW-UHFFFAOYSA-N 0.000 description 1
- LQJBNNIYVWPHFW-UHFFFAOYSA-N 20:1omega9c fatty acid Natural products CCCCCCCCCCC=CCCCCCCCC(O)=O LQJBNNIYVWPHFW-UHFFFAOYSA-N 0.000 description 1
- BQTJMKIHKULPCZ-UHFFFAOYSA-N 2H-indene Chemical compound C1=CC=CC2=CCC=C21 BQTJMKIHKULPCZ-UHFFFAOYSA-N 0.000 description 1
- VHMICKWLTGFITH-UHFFFAOYSA-N 2H-isoindole Chemical compound C1=CC=CC2=CNC=C21 VHMICKWLTGFITH-UHFFFAOYSA-N 0.000 description 1
- MGADZUXDNSDTHW-UHFFFAOYSA-N 2H-pyran Chemical compound C1OC=CC=C1 MGADZUXDNSDTHW-UHFFFAOYSA-N 0.000 description 1
- JZIBVTUXIVIFGC-UHFFFAOYSA-N 2H-pyrrole Chemical compound C1C=CC=N1 JZIBVTUXIVIFGC-UHFFFAOYSA-N 0.000 description 1
- CMLFRMDBDNHMRA-UHFFFAOYSA-N 2h-1,2-benzoxazine Chemical compound C1=CC=C2C=CNOC2=C1 CMLFRMDBDNHMRA-UHFFFAOYSA-N 0.000 description 1
- KIUMMUBSPKGMOY-UHFFFAOYSA-N 3,3'-Dithiobis(6-nitrobenzoic acid) Chemical compound C1=C([N+]([O-])=O)C(C(=O)O)=CC(SSC=2C=C(C(=CC=2)[N+]([O-])=O)C(O)=O)=C1 KIUMMUBSPKGMOY-UHFFFAOYSA-N 0.000 description 1
- NDMPLJNOPCLANR-UHFFFAOYSA-N 3,4-dihydroxy-15-(4-hydroxy-18-methoxycarbonyl-5,18-seco-ibogamin-18-yl)-16-methoxy-1-methyl-6,7-didehydro-aspidospermidine-3-carboxylic acid methyl ester Natural products C1C(CC)(O)CC(CC2(C(=O)OC)C=3C(=CC4=C(C56C(C(C(O)C7(CC)C=CCN(C67)CC5)(O)C(=O)OC)N4C)C=3)OC)CN1CCC1=C2NC2=CC=CC=C12 NDMPLJNOPCLANR-UHFFFAOYSA-N 0.000 description 1
- PWMYMKOUNYTVQN-UHFFFAOYSA-N 3-(8,8-diethyl-2-aza-8-germaspiro[4.5]decan-2-yl)-n,n-dimethylpropan-1-amine Chemical compound C1C[Ge](CC)(CC)CCC11CN(CCCN(C)C)CC1 PWMYMKOUNYTVQN-UHFFFAOYSA-N 0.000 description 1
- FPQQSJJWHUJYPU-UHFFFAOYSA-N 3-(dimethylamino)propyliminomethylidene-ethylazanium;chloride Chemical compound Cl.CCN=C=NCCCN(C)C FPQQSJJWHUJYPU-UHFFFAOYSA-N 0.000 description 1
- VNEOHNUYPRAJMX-UHFFFAOYSA-N 3-[[2-[[2-amino-3-(1h-indol-3-yl)propanoyl]amino]-4-methylpentanoyl]amino]-4-[[1-(butoxycarbonylamino)-1-oxo-3-phenylpropan-2-yl]amino]-4-oxobutanoic acid Chemical compound C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(CC(C)C)C(=O)NC(CC(O)=O)C(=O)NC(C(=O)NC(=O)OCCCC)CC1=CC=CC=C1 VNEOHNUYPRAJMX-UHFFFAOYSA-N 0.000 description 1
- QCHPKSFMDHPSNR-UHFFFAOYSA-N 3-aminoisobutyric acid Chemical compound NCC(C)C(O)=O QCHPKSFMDHPSNR-UHFFFAOYSA-N 0.000 description 1
- BMYNFMYTOJXKLE-UHFFFAOYSA-N 3-azaniumyl-2-hydroxypropanoate Chemical compound NCC(O)C(O)=O BMYNFMYTOJXKLE-UHFFFAOYSA-N 0.000 description 1
- UOQHWNPVNXSDDO-UHFFFAOYSA-N 3-bromoimidazo[1,2-a]pyridine-6-carbonitrile Chemical compound C1=CC(C#N)=CN2C(Br)=CN=C21 UOQHWNPVNXSDDO-UHFFFAOYSA-N 0.000 description 1
- 125000004179 3-chlorophenyl group Chemical group [H]C1=C([H])C(*)=C([H])C(Cl)=C1[H] 0.000 description 1
- 101710147124 3-galactosyl-N-acetylglucosaminide 4-alpha-L-fucosyltransferase FUT3 Proteins 0.000 description 1
- 125000004207 3-methoxyphenyl group Chemical group [H]C1=C([H])C(*)=C([H])C(OC([H])([H])[H])=C1[H] 0.000 description 1
- JVQIKJMSUIMUDI-UHFFFAOYSA-N 3-pyrroline Chemical class C1NCC=C1 JVQIKJMSUIMUDI-UHFFFAOYSA-N 0.000 description 1
- 150000004066 3-pyrrolines Chemical class 0.000 description 1
- 125000001541 3-thienyl group Chemical group S1C([H])=C([*])C([H])=C1[H] 0.000 description 1
- BBQWIRWKYMBILV-UHFFFAOYSA-N 3h-thiophen-2-imine Chemical compound N=C1CC=CS1 BBQWIRWKYMBILV-UHFFFAOYSA-N 0.000 description 1
- AOJJSUZBOXZQNB-VTZDEGQISA-N 4'-epidoxorubicin Chemical compound O([C@H]1C[C@@](O)(CC=2C(O)=C3C(=O)C=4C=CC=C(C=4C(=O)C3=C(O)C=21)OC)C(=O)CO)[C@H]1C[C@H](N)[C@@H](O)[C@H](C)O1 AOJJSUZBOXZQNB-VTZDEGQISA-N 0.000 description 1
- LZPWAYBEOJRFAX-UHFFFAOYSA-N 4,4,5,5-tetramethyl-1,3,2$l^{2}-dioxaborolane Chemical compound CC1(C)O[B]OC1(C)C LZPWAYBEOJRFAX-UHFFFAOYSA-N 0.000 description 1
- ZBWXZZIIMVVCNZ-UHFFFAOYSA-N 4,5-dihydroacephenanthrylene Chemical compound C1=CC(CC2)=C3C2=CC2=CC=CC=C2C3=C1 ZBWXZZIIMVVCNZ-UHFFFAOYSA-N 0.000 description 1
- GMOGICAFJFPMNS-UHFFFAOYSA-N 4-(1,4,8,11-tetrazacyclotetradec-1-ylmethyl)benzoic acid Chemical compound C1=CC(C(=O)O)=CC=C1CN1CCNCCCNCCNCCC1 GMOGICAFJFPMNS-UHFFFAOYSA-N 0.000 description 1
- CLPFFLWZZBQMAO-UHFFFAOYSA-N 4-(5,6,7,8-tetrahydroimidazo[1,5-a]pyridin-5-yl)benzonitrile Chemical compound C1=CC(C#N)=CC=C1C1N2C=NC=C2CCC1 CLPFFLWZZBQMAO-UHFFFAOYSA-N 0.000 description 1
- DODQJNMQWMSYGS-QPLCGJKRSA-N 4-[(z)-1-[4-[2-(dimethylamino)ethoxy]phenyl]-1-phenylbut-1-en-2-yl]phenol Chemical compound C=1C=C(O)C=CC=1C(/CC)=C(C=1C=CC(OCCN(C)C)=CC=1)/C1=CC=CC=C1 DODQJNMQWMSYGS-QPLCGJKRSA-N 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- TVZGACDUOSZQKY-LBPRGKRZSA-N 4-aminofolic acid Chemical compound C1=NC2=NC(N)=NC(N)=C2N=C1CNC1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 TVZGACDUOSZQKY-LBPRGKRZSA-N 0.000 description 1
- 125000004801 4-cyanophenyl group Chemical group [H]C1=C([H])C(C#N)=C([H])C([H])=C1* 0.000 description 1
- 229960000549 4-dimethylaminophenol Drugs 0.000 description 1
- 125000001255 4-fluorophenyl group Chemical group [H]C1=C([H])C(*)=C([H])C([H])=C1F 0.000 description 1
- 125000006418 4-methylphenylsulfonyl group Chemical group 0.000 description 1
- GDRVFDDBLLKWRI-UHFFFAOYSA-N 4H-quinolizine Chemical compound C1=CC=CN2CC=CC=C21 GDRVFDDBLLKWRI-UHFFFAOYSA-N 0.000 description 1
- BBEQQKBWUHCIOU-UHFFFAOYSA-N 5-(dimethylamino)-1-naphthalenesulfonic acid(dansyl acid) Chemical compound C1=CC=C2C(N(C)C)=CC=CC2=C1S(O)(=O)=O BBEQQKBWUHCIOU-UHFFFAOYSA-N 0.000 description 1
- IDPUKCWIGUEADI-UHFFFAOYSA-N 5-[bis(2-chloroethyl)amino]uracil Chemical compound ClCCN(CCCl)C1=CNC(=O)NC1=O IDPUKCWIGUEADI-UHFFFAOYSA-N 0.000 description 1
- NMUSYJAQQFHJEW-KVTDHHQDSA-N 5-azacytidine Chemical compound O=C1N=C(N)N=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 NMUSYJAQQFHJEW-KVTDHHQDSA-N 0.000 description 1
- MCNGFYXMIRHJEO-UHFFFAOYSA-N 5-methylpyrazolo[1,5-a]pyrimidin-7-amine Chemical compound N1=C(C)C=C(N)N2N=CC=C21 MCNGFYXMIRHJEO-UHFFFAOYSA-N 0.000 description 1
- WYXSYVWAUAUWLD-SHUUEZRQSA-N 6-azauridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=N1 WYXSYVWAUAUWLD-SHUUEZRQSA-N 0.000 description 1
- NJBMMMJOXRZENQ-UHFFFAOYSA-N 6H-pyrrolo[2,3-f]quinoline Chemical compound c1cc2ccc3[nH]cccc3c2n1 NJBMMMJOXRZENQ-UHFFFAOYSA-N 0.000 description 1
- ZCYVEMRRCGMTRW-UHFFFAOYSA-N 7553-56-2 Chemical compound [I] ZCYVEMRRCGMTRW-UHFFFAOYSA-N 0.000 description 1
- ZGXJTSGNIOSYLO-UHFFFAOYSA-N 88755TAZ87 Chemical compound NCC(=O)CCC(O)=O ZGXJTSGNIOSYLO-UHFFFAOYSA-N 0.000 description 1
- QSBYPNXLFMSGKH-UHFFFAOYSA-N 9-Heptadecensaeure Natural products CCCCCCCC=CCCCCCCCC(O)=O QSBYPNXLFMSGKH-UHFFFAOYSA-N 0.000 description 1
- PQJUJGAVDBINPI-UHFFFAOYSA-N 9H-thioxanthene Chemical compound C1=CC=C2CC3=CC=CC=C3SC2=C1 PQJUJGAVDBINPI-UHFFFAOYSA-N 0.000 description 1
- GJCOSYZMQJWQCA-UHFFFAOYSA-N 9H-xanthene Chemical compound C1=CC=C2CC3=CC=CC=C3OC2=C1 GJCOSYZMQJWQCA-UHFFFAOYSA-N 0.000 description 1
- 108010005465 AC133 Antigen Proteins 0.000 description 1
- 102000005908 AC133 Antigen Human genes 0.000 description 1
- 208000030507 AIDS Diseases 0.000 description 1
- QTBSBXVTEAMEQO-UHFFFAOYSA-M Acetate Chemical compound CC([O-])=O QTBSBXVTEAMEQO-UHFFFAOYSA-M 0.000 description 1
- 239000005964 Acibenzolar-S-methyl Substances 0.000 description 1
- HRPVXLWXLXDGHG-UHFFFAOYSA-N Acrylamide Chemical compound NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 1
- 208000026872 Addison Disease Diseases 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- 208000024827 Alzheimer disease Diseases 0.000 description 1
- 206010061424 Anal cancer Diseases 0.000 description 1
- 108010049777 Ankyrins Proteins 0.000 description 1
- 102000008102 Ankyrins Human genes 0.000 description 1
- 208000003343 Antiphospholipid Syndrome Diseases 0.000 description 1
- 108020000948 Antisense Oligonucleotides Proteins 0.000 description 1
- 101001005269 Arabidopsis thaliana Ceramide synthase 1 LOH3 Proteins 0.000 description 1
- 101001005312 Arabidopsis thaliana Ceramide synthase LOH1 Proteins 0.000 description 1
- 101100067974 Arabidopsis thaliana POP2 gene Proteins 0.000 description 1
- 235000017060 Arachis glabrata Nutrition 0.000 description 1
- 244000105624 Arachis hypogaea Species 0.000 description 1
- 235000010777 Arachis hypogaea Nutrition 0.000 description 1
- 235000018262 Arachis monticola Nutrition 0.000 description 1
- BFYIZQONLCFLEV-DAELLWKTSA-N Aromasine Chemical compound O=C1C=C[C@]2(C)[C@H]3CC[C@](C)(C(CC4)=O)[C@@H]4[C@@H]3CC(=C)C2=C1 BFYIZQONLCFLEV-DAELLWKTSA-N 0.000 description 1
- 102000014654 Aromatase Human genes 0.000 description 1
- 108010078554 Aromatase Proteins 0.000 description 1
- BSYNRYMUTXBXSQ-UHFFFAOYSA-N Aspirin Chemical compound CC(=O)OC1=CC=CC=C1C(O)=O BSYNRYMUTXBXSQ-UHFFFAOYSA-N 0.000 description 1
- 206010003571 Astrocytoma Diseases 0.000 description 1
- 102100034605 Atrial natriuretic peptide receptor 3 Human genes 0.000 description 1
- 206010003827 Autoimmune hepatitis Diseases 0.000 description 1
- 241000271566 Aves Species 0.000 description 1
- NOWKCMXCCJGMRR-UHFFFAOYSA-N Aziridine Chemical class C1CN1 NOWKCMXCCJGMRR-UHFFFAOYSA-N 0.000 description 1
- 230000024704 B cell apoptotic process Effects 0.000 description 1
- 102100027205 B-cell antigen receptor complex-associated protein alpha chain Human genes 0.000 description 1
- 101710166261 B-cell antigen receptor complex-associated protein beta chain Proteins 0.000 description 1
- 230000003844 B-cell-activation Effects 0.000 description 1
- 101710117995 B-lymphocyte antigen CD19 Proteins 0.000 description 1
- 108050001413 B-lymphocyte antigen CD20 Proteins 0.000 description 1
- 108010003455 BLyS receptor Proteins 0.000 description 1
- 208000009137 Behcet syndrome Diseases 0.000 description 1
- 239000005711 Benzoic acid Substances 0.000 description 1
- VGGGPCQERPFHOB-MCIONIFRSA-N Bestatin Chemical compound CC(C)C[C@H](C(O)=O)NC(=O)[C@@H](O)[C@H](N)CC1=CC=CC=C1 VGGGPCQERPFHOB-MCIONIFRSA-N 0.000 description 1
- 208000008439 Biliary Liver Cirrhosis Diseases 0.000 description 1
- 208000033222 Biliary cirrhosis primary Diseases 0.000 description 1
- 229940122361 Bisphosphonate Drugs 0.000 description 1
- 108010006654 Bleomycin Proteins 0.000 description 1
- 208000019838 Blood disease Diseases 0.000 description 1
- 102000007350 Bone Morphogenetic Proteins Human genes 0.000 description 1
- 108010007726 Bone Morphogenetic Proteins Proteins 0.000 description 1
- 208000020084 Bone disease Diseases 0.000 description 1
- 208000011691 Burkitt lymphomas Diseases 0.000 description 1
- COVZYZSDYWQREU-UHFFFAOYSA-N Busulfan Chemical compound CS(=O)(=O)OCCCCOS(C)(=O)=O COVZYZSDYWQREU-UHFFFAOYSA-N 0.000 description 1
- CMXYHFPGMFTECU-UHFFFAOYSA-N C(=S)=S=C(C(=O)O)C(=O)O Chemical compound C(=S)=S=C(C(=O)O)C(=O)O CMXYHFPGMFTECU-UHFFFAOYSA-N 0.000 description 1
- 108090000342 C-Type Lectins Proteins 0.000 description 1
- 102000003930 C-Type Lectins Human genes 0.000 description 1
- 102100031658 C-X-C chemokine receptor type 5 Human genes 0.000 description 1
- 102100025277 C-X-C motif chemokine 13 Human genes 0.000 description 1
- 101710124023 C-type lectin domain family 14 member A Proteins 0.000 description 1
- 101710112758 CAMPATH-1 antigen Proteins 0.000 description 1
- 108091016585 CD44 antigen Proteins 0.000 description 1
- 102000013135 CD52 Antigen Human genes 0.000 description 1
- 108010001445 CD79 Antigens Proteins 0.000 description 1
- 102000000796 CD79 Antigens Human genes 0.000 description 1
- 101150089397 CEG1 gene Proteins 0.000 description 1
- AUJXLBOHYWTPFV-BLWRDSOESA-N CS[C@H]1SC[C@H]2N(C)C(=O)[C@@H](C)NC(=O)[C@H](COC(=O)[C@@H](C(C)C)N(C)C(=O)[C@@H]1N(C)C(=O)[C@@H](C)NC(=O)[C@H](COC(=O)[C@@H](C(C)C)N(C)C2=O)NC(=O)c1cnc2ccccc2n1)NC(=O)c1cnc2ccccc2n1 Chemical compound CS[C@H]1SC[C@H]2N(C)C(=O)[C@@H](C)NC(=O)[C@H](COC(=O)[C@@H](C(C)C)N(C)C(=O)[C@@H]1N(C)C(=O)[C@@H](C)NC(=O)[C@H](COC(=O)[C@@H](C(C)C)N(C)C2=O)NC(=O)c1cnc2ccccc2n1)NC(=O)c1cnc2ccccc2n1 AUJXLBOHYWTPFV-BLWRDSOESA-N 0.000 description 1
- 102000007590 Calpain Human genes 0.000 description 1
- 108010032088 Calpain Proteins 0.000 description 1
- 102000003900 Calpain-2 Human genes 0.000 description 1
- 108090000232 Calpain-2 Proteins 0.000 description 1
- 241000282836 Camelus dromedarius Species 0.000 description 1
- KLWPJMFMVPTNCC-UHFFFAOYSA-N Camptothecin Natural products CCC1(O)C(=O)OCC2=C1C=C3C4Nc5ccccc5C=C4CN3C2=O KLWPJMFMVPTNCC-UHFFFAOYSA-N 0.000 description 1
- 241000282465 Canis Species 0.000 description 1
- GAGWJHPBXLXJQN-UORFTKCHSA-N Capecitabine Chemical compound C1=C(F)C(NC(=O)OCCCCC)=NC(=O)N1[C@H]1[C@H](O)[C@H](O)[C@@H](C)O1 GAGWJHPBXLXJQN-UORFTKCHSA-N 0.000 description 1
- GAGWJHPBXLXJQN-UHFFFAOYSA-N Capecitabine Natural products C1=C(F)C(NC(=O)OCCCCC)=NC(=O)N1C1C(O)C(O)C(C)O1 GAGWJHPBXLXJQN-UHFFFAOYSA-N 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- SHHKQEUPHAENFK-UHFFFAOYSA-N Carboquone Chemical compound O=C1C(C)=C(N2CC2)C(=O)C(C(COC(N)=O)OC)=C1N1CC1 SHHKQEUPHAENFK-UHFFFAOYSA-N 0.000 description 1
- 229920002134 Carboxymethyl cellulose Polymers 0.000 description 1
- AOCCBINRVIKJHY-UHFFFAOYSA-N Carmofur Chemical compound CCCCCCNC(=O)N1C=C(F)C(=O)NC1=O AOCCBINRVIKJHY-UHFFFAOYSA-N 0.000 description 1
- DLGOEMSEDOSKAD-UHFFFAOYSA-N Carmustine Chemical compound ClCCNC(=O)N(N=O)CCCl DLGOEMSEDOSKAD-UHFFFAOYSA-N 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 102000053642 Catalytic RNA Human genes 0.000 description 1
- 108090000994 Catalytic RNA Proteins 0.000 description 1
- 102000004225 Cathepsin B Human genes 0.000 description 1
- 108090000712 Cathepsin B Proteins 0.000 description 1
- 241000700199 Cavia porcellus Species 0.000 description 1
- 102000016289 Cell Adhesion Molecules Human genes 0.000 description 1
- 108010067225 Cell Adhesion Molecules Proteins 0.000 description 1
- 102000000844 Cell Surface Receptors Human genes 0.000 description 1
- 108010001857 Cell Surface Receptors Proteins 0.000 description 1
- 102100025828 Centromere protein C Human genes 0.000 description 1
- 206010008342 Cervix carcinoma Diseases 0.000 description 1
- 229930184471 Chicamycin Natural products 0.000 description 1
- GHOKWGTUZJEAQD-UHFFFAOYSA-N Chick antidermatitis factor Natural products OCC(C)(C)C(O)C(=O)NCCC(O)=O GHOKWGTUZJEAQD-UHFFFAOYSA-N 0.000 description 1
- JWBOIMRXGHLCPP-UHFFFAOYSA-N Chloditan Chemical compound C=1C=CC=C(Cl)C=1C(C(Cl)Cl)C1=CC=C(Cl)C=C1 JWBOIMRXGHLCPP-UHFFFAOYSA-N 0.000 description 1
- VEXZGXHMUGYJMC-UHFFFAOYSA-M Chloride anion Chemical compound [Cl-] VEXZGXHMUGYJMC-UHFFFAOYSA-M 0.000 description 1
- 206010008609 Cholangitis sclerosing Diseases 0.000 description 1
- 208000006344 Churg-Strauss Syndrome Diseases 0.000 description 1
- WBYWAXJHAXSJNI-SREVYHEPSA-N Cinnamic acid Chemical compound OC(=O)\C=C/C1=CC=CC=C1 WBYWAXJHAXSJNI-SREVYHEPSA-N 0.000 description 1
- 208000015943 Coeliac disease Diseases 0.000 description 1
- 208000001333 Colorectal Neoplasms Diseases 0.000 description 1
- 102100032768 Complement receptor type 2 Human genes 0.000 description 1
- 241000699800 Cricetinae Species 0.000 description 1
- 208000011231 Crohn disease Diseases 0.000 description 1
- 229930188224 Cryptophycin Natural products 0.000 description 1
- 108010069514 Cyclic Peptides Proteins 0.000 description 1
- 102000001189 Cyclic Peptides Human genes 0.000 description 1
- PMPVIKIVABFJJI-UHFFFAOYSA-N Cyclobutane Chemical compound C1CCC1 PMPVIKIVABFJJI-UHFFFAOYSA-N 0.000 description 1
- XDTMQSROBMDMFD-UHFFFAOYSA-N Cyclohexane Chemical compound C1CCCCC1 XDTMQSROBMDMFD-UHFFFAOYSA-N 0.000 description 1
- LVZWSLJZHVFIQJ-UHFFFAOYSA-N Cyclopropane Chemical compound C1CC1 LVZWSLJZHVFIQJ-UHFFFAOYSA-N 0.000 description 1
- WQZGKKKJIJFFOK-CBPJZXOFSA-N D-Gulose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@H](O)[C@H]1O WQZGKKKJIJFFOK-CBPJZXOFSA-N 0.000 description 1
- WQZGKKKJIJFFOK-IVMDWMLBSA-N D-allopyranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@H](O)[C@@H]1O WQZGKKKJIJFFOK-IVMDWMLBSA-N 0.000 description 1
- RGHNJXZEOKUKBD-UHFFFAOYSA-N D-gluconic acid Natural products OCC(O)C(O)C(O)C(O)C(O)=O RGHNJXZEOKUKBD-UHFFFAOYSA-N 0.000 description 1
- HMFHBZSHGGEWLO-AGQMPKSLSA-N D-lyxofuranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@H]1O HMFHBZSHGGEWLO-AGQMPKSLSA-N 0.000 description 1
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 1
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 1
- HMFHBZSHGGEWLO-IOVATXLUSA-N D-xylofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@H]1O HMFHBZSHGGEWLO-IOVATXLUSA-N 0.000 description 1
- 108010013198 Daptomycin Proteins 0.000 description 1
- 206010011878 Deafness Diseases 0.000 description 1
- SBJKKFFYIZUCET-JLAZNSOCSA-N Dehydro-L-ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(=O)C1=O SBJKKFFYIZUCET-JLAZNSOCSA-N 0.000 description 1
- SBJKKFFYIZUCET-UHFFFAOYSA-N Dehydroascorbic acid Natural products OCC(O)C1OC(=O)C(=O)C1=O SBJKKFFYIZUCET-UHFFFAOYSA-N 0.000 description 1
- 238000006646 Dess-Martin oxidation reaction Methods 0.000 description 1
- FEWJPZIEWOKRBE-JCYAYHJZSA-N Dextrotartaric acid Chemical compound OC(=O)[C@H](O)[C@@H](O)C(O)=O FEWJPZIEWOKRBE-JCYAYHJZSA-N 0.000 description 1
- ZMJOVJSTYLQINE-UHFFFAOYSA-N Dichloroacetylene Chemical compound ClC#CCl ZMJOVJSTYLQINE-UHFFFAOYSA-N 0.000 description 1
- 102100037981 Dickkopf-like protein 1 Human genes 0.000 description 1
- 101710125833 Dickkopf-like protein 1 Proteins 0.000 description 1
- 101001098806 Dictyostelium discoideum cGMP-specific 3',5'-cGMP phosphodiesterase 3 Proteins 0.000 description 1
- BUDQDWGNQVEFAC-UHFFFAOYSA-N Dihydropyran Chemical compound C1COC=CC1 BUDQDWGNQVEFAC-UHFFFAOYSA-N 0.000 description 1
- LCGLNKUTAGEVQW-UHFFFAOYSA-N Dimethyl ether Chemical compound COC LCGLNKUTAGEVQW-UHFFFAOYSA-N 0.000 description 1
- 108090000204 Dipeptidase 1 Proteins 0.000 description 1
- ZQZFYGIXNQKOAV-OCEACIFDSA-N Droloxifene Chemical compound C=1C=CC=CC=1C(/CC)=C(C=1C=C(O)C=CC=1)\C1=CC=C(OCCN(C)C)C=C1 ZQZFYGIXNQKOAV-OCEACIFDSA-N 0.000 description 1
- 229930193152 Dynemicin Natural products 0.000 description 1
- 102100026245 E3 ubiquitin-protein ligase RNF43 Human genes 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 102000012545 EGF-like domains Human genes 0.000 description 1
- 108050002150 EGF-like domains Proteins 0.000 description 1
- 108010009858 Echinomycin Proteins 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 208000017701 Endocrine disease Diseases 0.000 description 1
- 102000012085 Endoglin Human genes 0.000 description 1
- 206010014733 Endometrial cancer Diseases 0.000 description 1
- 206010014759 Endometrial neoplasm Diseases 0.000 description 1
- 108050009340 Endothelin Proteins 0.000 description 1
- 102000002045 Endothelin Human genes 0.000 description 1
- AFMYMMXSQGUCBK-UHFFFAOYSA-N Endynamicin A Natural products C1#CC=CC#CC2NC(C=3C(=O)C4=C(O)C=CC(O)=C4C(=O)C=3C(O)=C3)=C3C34OC32C(C)C(C(O)=O)=C(OC)C41 AFMYMMXSQGUCBK-UHFFFAOYSA-N 0.000 description 1
- SAMRUMKYXPVKPA-VFKOLLTISA-N Enocitabine Chemical compound O=C1N=C(NC(=O)CCCCCCCCCCCCCCCCCCCCC)C=CN1[C@H]1[C@@H](O)[C@H](O)[C@@H](CO)O1 SAMRUMKYXPVKPA-VFKOLLTISA-N 0.000 description 1
- 108010001687 Enterotoxin Receptors Proteins 0.000 description 1
- 208000018428 Eosinophilic granulomatosis with polyangiitis Diseases 0.000 description 1
- HTIJFSOGRVMCQR-UHFFFAOYSA-N Epirubicin Natural products COc1cccc2C(=O)c3c(O)c4CC(O)(CC(OC5CC(N)C(=O)C(C)O5)c4c(O)c3C(=O)c12)C(=O)CO HTIJFSOGRVMCQR-UHFFFAOYSA-N 0.000 description 1
- 102000056372 ErbB-3 Receptor Human genes 0.000 description 1
- 108090000371 Esterases Proteins 0.000 description 1
- 241000289695 Eutheria Species 0.000 description 1
- 102000010834 Extracellular Matrix Proteins Human genes 0.000 description 1
- 108010037362 Extracellular Matrix Proteins Proteins 0.000 description 1
- 239000001263 FEMA 3042 Substances 0.000 description 1
- 108091008794 FGF receptors Proteins 0.000 description 1
- 102100031507 Fc receptor-like protein 5 Human genes 0.000 description 1
- 108010087819 Fc receptors Proteins 0.000 description 1
- 102000009109 Fc receptors Human genes 0.000 description 1
- 108091006020 Fc-tagged proteins Proteins 0.000 description 1
- 241000282324 Felis Species 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 102400001063 Fibrinopeptide B Human genes 0.000 description 1
- 101800003778 Fibrinopeptide B Proteins 0.000 description 1
- 102000044168 Fibroblast Growth Factor Receptor Human genes 0.000 description 1
- 102100026535 Fibronectin type III domain-containing protein 5 Human genes 0.000 description 1
- 206010053717 Fibrous histiocytoma Diseases 0.000 description 1
- 102100035139 Folate receptor alpha Human genes 0.000 description 1
- 102000016970 Follistatin Human genes 0.000 description 1
- 108010014612 Follistatin Proteins 0.000 description 1
- 244000182067 Fraxinus ornus Species 0.000 description 1
- VWUXBMIQPBEWFH-WCCTWKNTSA-N Fulvestrant Chemical compound OC1=CC=C2[C@H]3CC[C@](C)([C@H](CC4)O)[C@@H]4[C@@H]3[C@H](CCCCCCCCCS(=O)CCCC(F)(F)C(F)(F)F)CC2=C1 VWUXBMIQPBEWFH-WCCTWKNTSA-N 0.000 description 1
- 102000003688 G-Protein-Coupled Receptors Human genes 0.000 description 1
- 108090000045 G-Protein-Coupled Receptors Proteins 0.000 description 1
- DSLZVSRJTYRBFB-UHFFFAOYSA-N Galactaric acid Natural products OC(=O)C(O)C(O)C(O)C(O)C(O)=O DSLZVSRJTYRBFB-UHFFFAOYSA-N 0.000 description 1
- 208000018522 Gastrointestinal disease Diseases 0.000 description 1
- 108010010803 Gelatin Proteins 0.000 description 1
- 229930191978 Gibberellin Natural products 0.000 description 1
- 208000032612 Glial tumor Diseases 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- 102000053187 Glucuronidase Human genes 0.000 description 1
- 108010060309 Glucuronidase Proteins 0.000 description 1
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 1
- 208000024869 Goodpasture syndrome Diseases 0.000 description 1
- 241000282575 Gorilla Species 0.000 description 1
- BLCLNMBMMGCOAS-URPVMXJPSA-N Goserelin Chemical compound C([C@@H](C(=O)N[C@H](COC(C)(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(=O)NNC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H]1NC(=O)CC1)C1=CC=C(O)C=C1 BLCLNMBMMGCOAS-URPVMXJPSA-N 0.000 description 1
- 108010069236 Goserelin Proteins 0.000 description 1
- 206010072579 Granulomatosis with polyangiitis Diseases 0.000 description 1
- 108091008603 HGF receptors Proteins 0.000 description 1
- 208000031886 HIV Infections Diseases 0.000 description 1
- 101710119866 HLA class II histocompatibility antigen gamma chain Proteins 0.000 description 1
- 102100031546 HLA class II histocompatibility antigen, DO beta chain Human genes 0.000 description 1
- 102100029966 HLA class II histocompatibility antigen, DP alpha 1 chain Human genes 0.000 description 1
- 102000006354 HLA-DR Antigens Human genes 0.000 description 1
- 108010058597 HLA-DR Antigens Proteins 0.000 description 1
- 101150112743 HSPA5 gene Proteins 0.000 description 1
- 208000032041 Hearing impaired Diseases 0.000 description 1
- 208000035186 Hemolytic Autoimmune Anemia Diseases 0.000 description 1
- 102000008055 Heparan Sulfate Proteoglycans Human genes 0.000 description 1
- 229920002971 Heparan sulfate Polymers 0.000 description 1
- 108090000100 Hepatocyte Growth Factor Proteins 0.000 description 1
- 102100021866 Hepatocyte growth factor Human genes 0.000 description 1
- 241000709721 Hepatovirus A Species 0.000 description 1
- 101100274557 Heterodera glycines CLE1 gene Proteins 0.000 description 1
- 102100038009 High affinity immunoglobulin epsilon receptor subunit beta Human genes 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101000924488 Homo sapiens Atrial natriuretic peptide receptor 3 Proteins 0.000 description 1
- 101000914489 Homo sapiens B-cell antigen receptor complex-associated protein alpha chain Proteins 0.000 description 1
- 101000934359 Homo sapiens B-cell differentiation antigen CD72 Proteins 0.000 description 1
- 101000922405 Homo sapiens C-X-C chemokine receptor type 5 Proteins 0.000 description 1
- 101000858064 Homo sapiens C-X-C motif chemokine 13 Proteins 0.000 description 1
- 101100439859 Homo sapiens CLEC14A gene Proteins 0.000 description 1
- 101000914321 Homo sapiens Carcinoembryonic antigen-related cell adhesion molecule 7 Proteins 0.000 description 1
- 101000914241 Homo sapiens Centromere protein C Proteins 0.000 description 1
- 101000941929 Homo sapiens Complement receptor type 2 Proteins 0.000 description 1
- 101000927579 Homo sapiens D-dopachrome decarboxylase Proteins 0.000 description 1
- 101100223941 Homo sapiens DKK1 gene Proteins 0.000 description 1
- 101000932213 Homo sapiens Dipeptidase 1 Proteins 0.000 description 1
- 101000692702 Homo sapiens E3 ubiquitin-protein ligase RNF43 Proteins 0.000 description 1
- 101100118549 Homo sapiens EGFR gene Proteins 0.000 description 1
- 101001010541 Homo sapiens Electron transfer flavoprotein subunit alpha, mitochondrial Proteins 0.000 description 1
- 101000899240 Homo sapiens Endoplasmic reticulum chaperone BiP Proteins 0.000 description 1
- 101100119857 Homo sapiens FCRL2 gene Proteins 0.000 description 1
- 101100227587 Homo sapiens FOLH1 gene Proteins 0.000 description 1
- 101100391538 Homo sapiens FUT3 gene Proteins 0.000 description 1
- 101000846908 Homo sapiens Fc receptor-like protein 5 Proteins 0.000 description 1
- 101000935587 Homo sapiens Flavin reductase (NADPH) Proteins 0.000 description 1
- 101001023230 Homo sapiens Folate receptor alpha Proteins 0.000 description 1
- 101100229898 Homo sapiens GPNMB gene Proteins 0.000 description 1
- 101001058904 Homo sapiens Gamma-tubulin complex component 2 Proteins 0.000 description 1
- 101001014636 Homo sapiens Golgin subfamily A member 4 Proteins 0.000 description 1
- 101000866281 Homo sapiens HLA class II histocompatibility antigen, DO beta chain Proteins 0.000 description 1
- 101000864089 Homo sapiens HLA class II histocompatibility antigen, DP alpha 1 chain Proteins 0.000 description 1
- 101000930802 Homo sapiens HLA class II histocompatibility antigen, DQ alpha 1 chain Proteins 0.000 description 1
- 101000968032 Homo sapiens HLA class II histocompatibility antigen, DR beta 3 chain Proteins 0.000 description 1
- 101000878594 Homo sapiens High affinity immunoglobulin epsilon receptor subunit beta Proteins 0.000 description 1
- 101001044893 Homo sapiens Interleukin-20 receptor subunit alpha Proteins 0.000 description 1
- 101001034314 Homo sapiens Lactadherin Proteins 0.000 description 1
- 101000917826 Homo sapiens Low affinity immunoglobulin gamma Fc region receptor II-a Proteins 0.000 description 1
- 101000917824 Homo sapiens Low affinity immunoglobulin gamma Fc region receptor II-b Proteins 0.000 description 1
- 101000576802 Homo sapiens Mesothelin Proteins 0.000 description 1
- 101001023705 Homo sapiens Nectin-4 Proteins 0.000 description 1
- 101000581981 Homo sapiens Neural cell adhesion molecule 1 Proteins 0.000 description 1
- 101000604168 Homo sapiens Neuromedin-B Proteins 0.000 description 1
- 101001024605 Homo sapiens Next to BRCA1 gene 1 protein Proteins 0.000 description 1
- 101000617725 Homo sapiens Pregnancy-specific beta-1-glycoprotein 2 Proteins 0.000 description 1
- 101000831286 Homo sapiens Protein timeless homolog Proteins 0.000 description 1
- 101001010819 Homo sapiens Receptor tyrosine-protein kinase erbB-3 Proteins 0.000 description 1
- 101000752245 Homo sapiens Rho guanine nucleotide exchange factor 5 Proteins 0.000 description 1
- 101100042692 Homo sapiens SLAMF7 gene Proteins 0.000 description 1
- 101000829138 Homo sapiens Somatostatin receptor type 3 Proteins 0.000 description 1
- 102000004157 Hydrolases Human genes 0.000 description 1
- 108090000604 Hydrolases Proteins 0.000 description 1
- VSNHCAURESNICA-UHFFFAOYSA-N Hydroxyurea Chemical compound NC(=O)NO VSNHCAURESNICA-UHFFFAOYSA-N 0.000 description 1
- 241000282620 Hylobates sp. Species 0.000 description 1
- 206010020853 Hypertonic bladder Diseases 0.000 description 1
- WQZGKKKJIJFFOK-YIDFTEPTSA-N IDOSE Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@H](O)[C@H]1O WQZGKKKJIJFFOK-YIDFTEPTSA-N 0.000 description 1
- XDXDZDZNSLXDNA-TZNDIEGXSA-N Idarubicin Chemical compound C1[C@H](N)[C@H](O)[C@H](C)O[C@H]1O[C@@H]1C2=C(O)C(C(=O)C3=CC=CC=C3C3=O)=C3C(O)=C2C[C@@](O)(C(C)=O)C1 XDXDZDZNSLXDNA-TZNDIEGXSA-N 0.000 description 1
- XDXDZDZNSLXDNA-UHFFFAOYSA-N Idarubicin Natural products C1C(N)C(O)C(C)OC1OC1C2=C(O)C(C(=O)C3=CC=CC=C3C3=O)=C3C(O)=C2CC(O)(C(C)=O)C1 XDXDZDZNSLXDNA-UHFFFAOYSA-N 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- WRYCSMQKUKOKBP-UHFFFAOYSA-N Imidazolidine Chemical compound C1CNCN1 WRYCSMQKUKOKBP-UHFFFAOYSA-N 0.000 description 1
- 102000009786 Immunoglobulin Constant Regions Human genes 0.000 description 1
- 108010009817 Immunoglobulin Constant Regions Proteins 0.000 description 1
- 208000022559 Inflammatory bowel disease Diseases 0.000 description 1
- 102100024065 Inhibitor of growth protein 1 Human genes 0.000 description 1
- 208000027601 Inner ear disease Diseases 0.000 description 1
- 108010040765 Integrin alphaV Proteins 0.000 description 1
- 108010047852 Integrin alphaVbeta3 Proteins 0.000 description 1
- 108010032774 Interleukin-2 Receptor alpha Subunit Proteins 0.000 description 1
- 102000007351 Interleukin-2 Receptor alpha Subunit Human genes 0.000 description 1
- 102100022706 Interleukin-20 receptor subunit alpha Human genes 0.000 description 1
- 208000005016 Intestinal Neoplasms Diseases 0.000 description 1
- 102000004310 Ion Channels Human genes 0.000 description 1
- 108090000862 Ion Channels Proteins 0.000 description 1
- 101800001026 Irisin Proteins 0.000 description 1
- 208000007766 Kaposi sarcoma Diseases 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- WQZGKKKJIJFFOK-VSOAQEOCSA-N L-altropyranose Chemical compound OC[C@@H]1OC(O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-VSOAQEOCSA-N 0.000 description 1
- HMFHBZSHGGEWLO-HWQSCIPKSA-N L-arabinofuranose Chemical compound OC[C@@H]1OC(O)[C@H](O)[C@H]1O HMFHBZSHGGEWLO-HWQSCIPKSA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- 239000005411 L01XE02 - Gefitinib Substances 0.000 description 1
- 239000005551 L01XE03 - Erlotinib Substances 0.000 description 1
- 239000005511 L01XE05 - Sorafenib Substances 0.000 description 1
- 239000002136 L01XE07 - Lapatinib Substances 0.000 description 1
- 239000003798 L01XE11 - Pazopanib Substances 0.000 description 1
- 239000002118 L01XE12 - Vandetanib Substances 0.000 description 1
- JLERVPBPJHKRBJ-UHFFFAOYSA-N LY 117018 Chemical compound C1=CC(O)=CC=C1C1=C(C(=O)C=2C=CC(OCCN3CCCC3)=CC=2)C2=CC=C(O)C=C2S1 JLERVPBPJHKRBJ-UHFFFAOYSA-N 0.000 description 1
- 208000017119 Labyrinth disease Diseases 0.000 description 1
- 102100039648 Lactadherin Human genes 0.000 description 1
- 241000283953 Lagomorpha Species 0.000 description 1
- 102000052922 Large Neutral Amino Acid-Transporter 1 Human genes 0.000 description 1
- 239000005639 Lauric acid Substances 0.000 description 1
- 101710197063 Lectin-3 Proteins 0.000 description 1
- 102000004856 Lectins Human genes 0.000 description 1
- 108090001090 Lectins Proteins 0.000 description 1
- 229920001491 Lentinan Polymers 0.000 description 1
- 208000035036 Lethal congenital contracture syndrome type 2 Diseases 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 101710146560 Leukocyte antigen CD37 Proteins 0.000 description 1
- 108010000817 Leuprolide Proteins 0.000 description 1
- 108090000543 Ligand-Gated Ion Channels Proteins 0.000 description 1
- 102000004086 Ligand-Gated Ion Channels Human genes 0.000 description 1
- 229940123917 Lipid kinase inhibitor Drugs 0.000 description 1
- GQYIWUVLTXOXAJ-UHFFFAOYSA-N Lomustine Chemical compound ClCCN(N=O)C(=O)NC1CCCCC1 GQYIWUVLTXOXAJ-UHFFFAOYSA-N 0.000 description 1
- 102100029204 Low affinity immunoglobulin gamma Fc region receptor II-a Human genes 0.000 description 1
- 101000929342 Lytechinus pictus Actin, cytoskeletal 1 Proteins 0.000 description 1
- 102100025069 MARVEL domain-containing protein 1 Human genes 0.000 description 1
- 241000289581 Macropus sp. Species 0.000 description 1
- 108700018351 Major Histocompatibility Complex Proteins 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 229930126263 Maytansine Natural products 0.000 description 1
- 201000005505 Measles Diseases 0.000 description 1
- 102000018697 Membrane Proteins Human genes 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- 102000003735 Mesothelin Human genes 0.000 description 1
- 108090000015 Mesothelin Proteins 0.000 description 1
- 102100025096 Mesothelin Human genes 0.000 description 1
- 108090000192 Methionyl aminopeptidases Proteins 0.000 description 1
- VFKZTMPDYBFSTM-KVTDHHQDSA-N Mitobronitol Chemical compound BrC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CBr VFKZTMPDYBFSTM-KVTDHHQDSA-N 0.000 description 1
- 229930192392 Mitomycin Natural products 0.000 description 1
- 101000623899 Mus musculus Mucin-13 Proteins 0.000 description 1
- 101100189356 Mus musculus Papolb gene Proteins 0.000 description 1
- 101000866339 Mus musculus Transcription factor E2F6 Proteins 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 208000002033 Myoclonus Diseases 0.000 description 1
- FXHOOIRPVKKKFG-UHFFFAOYSA-N N,N-Dimethylacetamide Chemical compound CN(C)C(C)=O FXHOOIRPVKKKFG-UHFFFAOYSA-N 0.000 description 1
- QPCDCPDFJACHGM-UHFFFAOYSA-N N,N-bis{2-[bis(carboxymethyl)amino]ethyl}glycine Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(=O)O)CCN(CC(O)=O)CC(O)=O QPCDCPDFJACHGM-UHFFFAOYSA-N 0.000 description 1
- OVBPIULPVIDEAO-UHFFFAOYSA-N N-Pteroyl-L-glutaminsaeure Natural products C=1N=C2NC(N)=NC(=O)C2=NC=1CNC1=CC=C(C(=O)NC(CCC(O)=O)C(O)=O)C=C1 OVBPIULPVIDEAO-UHFFFAOYSA-N 0.000 description 1
- KJAXFLHZRHZTEK-UHFFFAOYSA-N NN.C1CNNC1 Chemical compound NN.C1CNNC1 KJAXFLHZRHZTEK-UHFFFAOYSA-N 0.000 description 1
- 239000007832 Na2SO4 Substances 0.000 description 1
- 208000012902 Nervous system disease Diseases 0.000 description 1
- 108050003738 Neural cell adhesion molecule 1 Proteins 0.000 description 1
- 102100023616 Neural cell adhesion molecule L1-like protein Human genes 0.000 description 1
- 102400000058 Neuregulin-1 Human genes 0.000 description 1
- 108090000556 Neuregulin-1 Proteins 0.000 description 1
- 208000025966 Neurological disease Diseases 0.000 description 1
- GRYLNZFGIOXLOG-UHFFFAOYSA-N Nitric acid Chemical compound O[N+]([O-])=O GRYLNZFGIOXLOG-UHFFFAOYSA-N 0.000 description 1
- 102000004459 Nitroreductase Human genes 0.000 description 1
- IOVCWXUNBOPUCH-UHFFFAOYSA-N Nitrous acid Chemical compound ON=O IOVCWXUNBOPUCH-UHFFFAOYSA-N 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 108090000304 Occludin Proteins 0.000 description 1
- 239000005642 Oleic acid Substances 0.000 description 1
- ZQPPMHVWECSIRJ-UHFFFAOYSA-N Oleic acid Natural products CCCCCCCCC=CCCCCCCCC(O)=O ZQPPMHVWECSIRJ-UHFFFAOYSA-N 0.000 description 1
- 229930187135 Olivomycin Natural products 0.000 description 1
- ZCQWOFVYLHDMMC-UHFFFAOYSA-N Oxazole Chemical compound C1=COC=N1 ZCQWOFVYLHDMMC-UHFFFAOYSA-N 0.000 description 1
- WYNCHZVNFNFDNH-UHFFFAOYSA-N Oxazolidine Chemical compound C1COCN1 WYNCHZVNFNFDNH-UHFFFAOYSA-N 0.000 description 1
- 102100037603 P2X purinoceptor 5 Human genes 0.000 description 1
- 101710189969 P2X purinoceptor 5 Proteins 0.000 description 1
- SUDAHWBOROXANE-SECBINFHSA-N PD 0325901 Chemical compound OC[C@@H](O)CONC(=O)C1=CC=C(F)C(F)=C1NC1=CC=C(I)C=C1F SUDAHWBOROXANE-SECBINFHSA-N 0.000 description 1
- SUDAHWBOROXANE-VIFPVBQESA-N PD 0325901-Cl Chemical compound OC[C@H](O)CONC(=O)C1=CC=C(F)C(F)=C1NC1=CC=C(I)C=C1F SUDAHWBOROXANE-VIFPVBQESA-N 0.000 description 1
- ZJOKWAWPAPMNIM-UHFFFAOYSA-N PD-153035 hydrochloride Chemical compound Cl.C=12C=C(OC)C(OC)=CC2=NC=NC=1NC1=CC=CC(Br)=C1 ZJOKWAWPAPMNIM-UHFFFAOYSA-N 0.000 description 1
- 108010014865 PLIalpha Proteins 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 102100026459 POU domain, class 3, transcription factor 2 Human genes 0.000 description 1
- 101710133394 POU domain, class 3, transcription factor 2 Proteins 0.000 description 1
- 235000021314 Palmitic acid Nutrition 0.000 description 1
- 241000282577 Pan troglodytes Species 0.000 description 1
- 241001504519 Papio ursinus Species 0.000 description 1
- 208000018737 Parkinson disease Diseases 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 206010034277 Pemphigoid Diseases 0.000 description 1
- 201000011152 Pemphigus Diseases 0.000 description 1
- LRBQNJMCXXYXIU-PPKXGCFTSA-N Penta-digallate-beta-D-glucose Natural products OC1=C(O)C(O)=CC(C(=O)OC=2C(=C(O)C=C(C=2)C(=O)OC[C@@H]2[C@H]([C@H](OC(=O)C=3C=C(OC(=O)C=4C=C(O)C(O)=C(O)C=4)C(O)=C(O)C=3)[C@@H](OC(=O)C=3C=C(OC(=O)C=4C=C(O)C(O)=C(O)C=4)C(O)=C(O)C=3)[C@H](OC(=O)C=3C=C(OC(=O)C=4C=C(O)C(O)=C(O)C=4)C(O)=C(O)C=3)O2)OC(=O)C=2C=C(OC(=O)C=3C=C(O)C(O)=C(O)C=3)C(O)=C(O)C=2)O)=C1 LRBQNJMCXXYXIU-PPKXGCFTSA-N 0.000 description 1
- 241000405070 Percophidae Species 0.000 description 1
- 206010057249 Phagocytosis Diseases 0.000 description 1
- 241001520299 Phascolarctos cinereus Species 0.000 description 1
- 108091007643 Phosphate carriers Proteins 0.000 description 1
- 108010030678 Phosphatidylethanolamine N-Methyltransferase Proteins 0.000 description 1
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 1
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- KMSKQZKKOZQFFG-HSUXVGOQSA-N Pirarubicin Chemical compound O([C@H]1[C@@H](N)C[C@@H](O[C@H]1C)O[C@H]1C[C@@](O)(CC=2C(O)=C3C(=O)C=4C=CC=C(C=4C(=O)C3=C(O)C=21)OC)C(=O)CO)[C@H]1CCCCO1 KMSKQZKKOZQFFG-HSUXVGOQSA-N 0.000 description 1
- 206010035226 Plasma cell myeloma Diseases 0.000 description 1
- 239000010103 Podophyllin Substances 0.000 description 1
- 241000276498 Pollachius virens Species 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 206010036105 Polyneuropathy Diseases 0.000 description 1
- 241000282405 Pongo abelii Species 0.000 description 1
- HFVNWDWLWUCIHC-GUPDPFMOSA-N Prednimustine Chemical compound O=C([C@@]1(O)CC[C@H]2[C@H]3[C@@H]([C@]4(C=CC(=O)C=C4CC3)C)[C@@H](O)C[C@@]21C)COC(=O)CCCC1=CC=C(N(CCCl)CCCl)C=C1 HFVNWDWLWUCIHC-GUPDPFMOSA-N 0.000 description 1
- 208000012654 Primary biliary cholangitis Diseases 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- 102100033237 Pro-epidermal growth factor Human genes 0.000 description 1
- 101710124304 Proline-rich protein 4 Proteins 0.000 description 1
- 101710129873 Prolyl endopeptidase FAP Proteins 0.000 description 1
- OFOBLEOULBTSOW-UHFFFAOYSA-N Propanedioic acid Natural products OC(=O)CC(O)=O OFOBLEOULBTSOW-UHFFFAOYSA-N 0.000 description 1
- XBDQKXXYIPTUBI-UHFFFAOYSA-M Propionate Chemical compound CCC([O-])=O XBDQKXXYIPTUBI-UHFFFAOYSA-M 0.000 description 1
- 101710120463 Prostate stem cell antigen Proteins 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 102000004022 Protein-Tyrosine Kinases Human genes 0.000 description 1
- 108090000412 Protein-Tyrosine Kinases Proteins 0.000 description 1
- 102000016611 Proteoglycans Human genes 0.000 description 1
- 108010067787 Proteoglycans Proteins 0.000 description 1
- 201000001263 Psoriatic Arthritis Diseases 0.000 description 1
- 208000036824 Psoriatic arthropathy Diseases 0.000 description 1
- 241000567363 Puccinellia maritima Species 0.000 description 1
- 102000000033 Purinergic Receptors Human genes 0.000 description 1
- 108010080192 Purinergic Receptors Proteins 0.000 description 1
- AHHFEZNOXOZZQA-ZEBDFXRSSA-N Ranimustine Chemical compound CO[C@H]1O[C@H](CNC(=O)N(CCCl)N=O)[C@@H](O)[C@H](O)[C@H]1O AHHFEZNOXOZZQA-ZEBDFXRSSA-N 0.000 description 1
- 101001039269 Rattus norvegicus Glycine N-methyltransferase Proteins 0.000 description 1
- 208000003782 Raynaud disease Diseases 0.000 description 1
- 208000012322 Raynaud phenomenon Diseases 0.000 description 1
- 102000004278 Receptor Protein-Tyrosine Kinases Human genes 0.000 description 1
- 108090000873 Receptor Protein-Tyrosine Kinases Proteins 0.000 description 1
- 208000015634 Rectal Neoplasms Diseases 0.000 description 1
- 208000025747 Rheumatic disease Diseases 0.000 description 1
- 235000004443 Ricinus communis Nutrition 0.000 description 1
- URWAJWIAIPFPJE-UHFFFAOYSA-N Rickamicin Natural products O1CC(O)(C)C(NC)C(O)C1OC1C(O)C(OC2C(CC=C(CN)O2)N)C(N)CC1N URWAJWIAIPFPJE-UHFFFAOYSA-N 0.000 description 1
- 108010040181 SF 1126 Proteins 0.000 description 1
- 108050003189 SH2B adapter protein 1 Proteins 0.000 description 1
- 101710083287 SLAM family member 7 Proteins 0.000 description 1
- 108091006576 SLC34A2 Proteins 0.000 description 1
- 108091006232 SLC7A5 Proteins 0.000 description 1
- 229910006069 SO3H Inorganic materials 0.000 description 1
- 101100123851 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) HER1 gene Proteins 0.000 description 1
- 101100111629 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) KAR2 gene Proteins 0.000 description 1
- 206010061934 Salivary gland cancer Diseases 0.000 description 1
- 206010039710 Scleroderma Diseases 0.000 description 1
- 101710107921 Secreted protein BARF1 Proteins 0.000 description 1
- 102000014105 Semaphorin Human genes 0.000 description 1
- 108050003978 Semaphorin Proteins 0.000 description 1
- 101710199399 Semaphorin-5B Proteins 0.000 description 1
- 108010029180 Sialic Acid Binding Ig-like Lectin 3 Proteins 0.000 description 1
- 229930192786 Sisomicin Natural products 0.000 description 1
- 229940124269 Somatostatin receptor 2 agonist Drugs 0.000 description 1
- 102100029329 Somatostatin receptor type 1 Human genes 0.000 description 1
- 101710192613 Somatostatin receptor type 2 Proteins 0.000 description 1
- 102100023803 Somatostatin receptor type 3 Human genes 0.000 description 1
- 102100023801 Somatostatin receptor type 4 Human genes 0.000 description 1
- 101710192646 Somatostatin receptor type 5 Proteins 0.000 description 1
- 101000668858 Spinacia oleracea 30S ribosomal protein S1, chloroplastic Proteins 0.000 description 1
- 231100000632 Spindle poison Toxicity 0.000 description 1
- 235000021355 Stearic acid Nutrition 0.000 description 1
- 238000006619 Stille reaction Methods 0.000 description 1
- 208000004350 Strabismus Diseases 0.000 description 1
- 101000898746 Streptomyces clavuligerus Clavaminate synthase 1 Proteins 0.000 description 1
- ZSJLQEPLLKMAKR-UHFFFAOYSA-N Streptozotocin Natural products O=NN(C)C(=O)NC1C(O)OC(CO)C(O)C1O ZSJLQEPLLKMAKR-UHFFFAOYSA-N 0.000 description 1
- KDYFGRWQOYBRFD-UHFFFAOYSA-N Succinic acid Natural products OC(=O)CCC(O)=O KDYFGRWQOYBRFD-UHFFFAOYSA-N 0.000 description 1
- LSNNMFCWUKXFEE-UHFFFAOYSA-N Sulfurous acid Chemical compound OS(O)=O LSNNMFCWUKXFEE-UHFFFAOYSA-N 0.000 description 1
- UCKMPCXJQFINFW-UHFFFAOYSA-N Sulphide Chemical compound [S-2] UCKMPCXJQFINFW-UHFFFAOYSA-N 0.000 description 1
- 238000006069 Suzuki reaction reaction Methods 0.000 description 1
- 108050006774 Syndecan Proteins 0.000 description 1
- 102000003705 Syndecan-1 Human genes 0.000 description 1
- 108090000054 Syndecan-2 Proteins 0.000 description 1
- BXFOFFBJRFZBQZ-QYWOHJEZSA-N T-2 toxin Chemical compound C([C@@]12[C@]3(C)[C@H](OC(C)=O)[C@@H](O)[C@H]1O[C@H]1[C@]3(COC(C)=O)C[C@@H](C(=C1)C)OC(=O)CC(C)C)O2 BXFOFFBJRFZBQZ-QYWOHJEZSA-N 0.000 description 1
- 102000003618 TRPM4 Human genes 0.000 description 1
- FEWJPZIEWOKRBE-UHFFFAOYSA-N Tartaric acid Natural products [H+].[H+].[O-]C(=O)C(O)C(O)C([O-])=O FEWJPZIEWOKRBE-UHFFFAOYSA-N 0.000 description 1
- 208000024313 Testicular Neoplasms Diseases 0.000 description 1
- 206010057644 Testis cancer Diseases 0.000 description 1
- 108010012944 Tetragastrin Proteins 0.000 description 1
- XBDYBAVJXHJMNQ-UHFFFAOYSA-N Tetrahydroanthracene Natural products C1=CC=C2C=C(CCCC3)C3=CC2=C1 XBDYBAVJXHJMNQ-UHFFFAOYSA-N 0.000 description 1
- YPWFISCTZQNZAU-UHFFFAOYSA-N Thiane Chemical compound C1CCSCC1 YPWFISCTZQNZAU-UHFFFAOYSA-N 0.000 description 1
- FZWLAAWBMGSTSO-UHFFFAOYSA-N Thiazole Chemical compound C1=CSC=N1 FZWLAAWBMGSTSO-UHFFFAOYSA-N 0.000 description 1
- 206010043561 Thrombocytopenic purpura Diseases 0.000 description 1
- 108060008245 Thrombospondin Proteins 0.000 description 1
- 102000002938 Thrombospondin Human genes 0.000 description 1
- 201000007023 Thrombotic Thrombocytopenic Purpura Diseases 0.000 description 1
- 208000024770 Thyroid neoplasm Diseases 0.000 description 1
- 239000000365 Topoisomerase I Inhibitor Substances 0.000 description 1
- IWEQQRMGNVVKQW-OQKDUQJOSA-N Toremifene citrate Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O.C1=CC(OCCN(C)C)=CC=C1C(\C=1C=CC=CC=1)=C(\CCCl)C1=CC=CC=C1 IWEQQRMGNVVKQW-OQKDUQJOSA-N 0.000 description 1
- 208000003441 Transfusion reaction Diseases 0.000 description 1
- 108010037150 Transient Receptor Potential Channels Proteins 0.000 description 1
- 102000011753 Transient Receptor Potential Channels Human genes 0.000 description 1
- 102100031228 Transient receptor potential cation channel subfamily M member 4 Human genes 0.000 description 1
- 101710170091 Transmembrane glycoprotein NMB Proteins 0.000 description 1
- 101710149792 Triosephosphate isomerase, chloroplastic Proteins 0.000 description 1
- 101710195516 Triosephosphate isomerase, glycosomal Proteins 0.000 description 1
- 108700025716 Tumor Suppressor Genes Proteins 0.000 description 1
- 102000044209 Tumor Suppressor Genes Human genes 0.000 description 1
- 108010040002 Tumor Suppressor Proteins Proteins 0.000 description 1
- 102000001742 Tumor Suppressor Proteins Human genes 0.000 description 1
- 101710178300 Tumor necrosis factor receptor superfamily member 13C Proteins 0.000 description 1
- 101710192735 Tyrosine-protein kinase receptor UFO Proteins 0.000 description 1
- 208000024780 Urticaria Diseases 0.000 description 1
- 208000006105 Uterine Cervical Neoplasms Diseases 0.000 description 1
- 206010046851 Uveitis Diseases 0.000 description 1
- 102000005789 Vascular Endothelial Growth Factors Human genes 0.000 description 1
- 108010019530 Vascular Endothelial Growth Factors Proteins 0.000 description 1
- 101710160666 Vascular cell adhesion protein 1 Proteins 0.000 description 1
- 206010047115 Vasculitis Diseases 0.000 description 1
- JXLYSJRDGCGARV-WWYNWVTFSA-N Vinblastine Natural products O=C(O[C@H]1[C@](O)(C(=O)OC)[C@@H]2N(C)c3c(cc(c(OC)c3)[C@]3(C(=O)OC)c4[nH]c5c(c4CCN4C[C@](O)(CC)C[C@H](C3)C4)cccc5)[C@@]32[C@H]2[C@@]1(CC)C=CCN2CC3)C JXLYSJRDGCGARV-WWYNWVTFSA-N 0.000 description 1
- 108010066342 Virus Receptors Proteins 0.000 description 1
- 102000018265 Virus Receptors Human genes 0.000 description 1
- 108010048673 Vitronectin Receptors Proteins 0.000 description 1
- 206010047741 Vulval cancer Diseases 0.000 description 1
- DGEZNRSVGBDHLK-UHFFFAOYSA-N [1,10]phenanthroline Chemical compound C1=CN=C2C3=NC=CC=C3C=CC2=C1 DGEZNRSVGBDHLK-UHFFFAOYSA-N 0.000 description 1
- USDJGQLNFPZEON-UHFFFAOYSA-N [[4,6-bis(hydroxymethylamino)-1,3,5-triazin-2-yl]amino]methanol Chemical compound OCNC1=NC(NCO)=NC(NCO)=N1 USDJGQLNFPZEON-UHFFFAOYSA-N 0.000 description 1
- GXVKHKJETWAWRR-UHFFFAOYSA-N a805143 Chemical compound C1CCNC1.C1CCNC1 GXVKHKJETWAWRR-UHFFFAOYSA-N 0.000 description 1
- 230000001594 aberrant effect Effects 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 125000004036 acetal group Chemical group 0.000 description 1
- HXGDTGSAIMULJN-UHFFFAOYSA-N acetnaphthylene Natural products C1=CC(C=C2)=C3C2=CC=CC3=C1 HXGDTGSAIMULJN-UHFFFAOYSA-N 0.000 description 1
- 125000002777 acetyl group Chemical group [H]C([H])([H])C(*)=O 0.000 description 1
- 125000003668 acetyloxy group Chemical group [H]C([H])([H])C(=O)O[*] 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 229930188522 aclacinomycin Natural products 0.000 description 1
- USZYSDMBJDPRIF-SVEJIMAYSA-N aclacinomycin A Chemical compound O([C@H]1[C@@H](O)C[C@@H](O[C@H]1C)O[C@H]1[C@H](C[C@@H](O[C@H]1C)O[C@H]1C[C@]([C@@H](C2=CC=3C(=O)C4=CC=CC(O)=C4C(=O)C=3C(O)=C21)C(=O)OC)(O)CC)N(C)C)[C@H]1CCC(=O)[C@H](C)O1 USZYSDMBJDPRIF-SVEJIMAYSA-N 0.000 description 1
- 229960004176 aclarubicin Drugs 0.000 description 1
- 150000001252 acrylic acid derivatives Chemical class 0.000 description 1
- RJURFGZVJUQBHK-IIXSONLDSA-N actinomycin D Chemical compound C[C@H]1OC(=O)[C@H](C(C)C)N(C)C(=O)CN(C)C(=O)[C@@H]2CCCN2C(=O)[C@@H](C(C)C)NC(=O)[C@H]1NC(=O)C1=C(N)C(=O)C(C)=C2OC(C(C)=CC=C3C(=O)N[C@@H]4C(=O)N[C@@H](C(N5CCC[C@H]5C(=O)N(C)CC(=O)N(C)[C@@H](C(C)C)C(=O)O[C@@H]4C)=O)C(C)C)=C3N=C21 RJURFGZVJUQBHK-IIXSONLDSA-N 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- 125000004442 acylamino group Chemical group 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 210000004100 adrenal gland Anatomy 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 150000001298 alcohols Chemical class 0.000 description 1
- 229930013930 alkaloid Natural products 0.000 description 1
- 150000001336 alkenes Chemical class 0.000 description 1
- 150000001345 alkine derivatives Chemical class 0.000 description 1
- 150000005215 alkyl ethers Chemical class 0.000 description 1
- 125000004414 alkyl thio group Chemical group 0.000 description 1
- 229940100198 alkylating agent Drugs 0.000 description 1
- 239000002168 alkylating agent Substances 0.000 description 1
- 230000002152 alkylating effect Effects 0.000 description 1
- SHGAZHPCJJPHSC-YCNIQYBTSA-N all-trans-retinoic acid Chemical compound OC(=O)\C=C(/C)\C=C\C=C(/C)\C=C\C1=C(C)CCCC1(C)C SHGAZHPCJJPHSC-YCNIQYBTSA-N 0.000 description 1
- 208000026935 allergic disease Diseases 0.000 description 1
- WQZGKKKJIJFFOK-PHYPRBDBSA-N alpha-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 description 1
- BJEPYKJPYRNKOW-UHFFFAOYSA-N alpha-hydroxysuccinic acid Natural products OC(=O)C(O)CC(O)=O BJEPYKJPYRNKOW-UHFFFAOYSA-N 0.000 description 1
- WYTGDNHDOZPMIW-RCBQFDQVSA-N alstonine Natural products C1=CC2=C3C=CC=CC3=NC2=C2N1C[C@H]1[C@H](C)OC=C(C(=O)OC)[C@H]1C2 WYTGDNHDOZPMIW-RCBQFDQVSA-N 0.000 description 1
- 229960000473 altretamine Drugs 0.000 description 1
- 229910052782 aluminium Inorganic materials 0.000 description 1
- XAGFODPZIPBFFR-UHFFFAOYSA-N aluminium Chemical compound [Al] XAGFODPZIPBFFR-UHFFFAOYSA-N 0.000 description 1
- 125000006242 amine protecting group Chemical group 0.000 description 1
- 230000006229 amino acid addition Effects 0.000 description 1
- 229960003437 aminoglutethimide Drugs 0.000 description 1
- ROBVIMPUHSLWNV-UHFFFAOYSA-N aminoglutethimide Chemical compound C=1C=C(N)C=CC=1C1(CC)CCC(=O)NC1=O ROBVIMPUHSLWNV-UHFFFAOYSA-N 0.000 description 1
- 229960002749 aminolevulinic acid Drugs 0.000 description 1
- 229960003896 aminopterin Drugs 0.000 description 1
- 125000004397 aminosulfonyl group Chemical group NS(=O)(=O)* 0.000 description 1
- XCPGHVQEEXUHNC-UHFFFAOYSA-N amsacrine Chemical compound COC1=CC(NS(C)(=O)=O)=CC=C1NC1=C(C=CC=C2)C2=NC2=CC=CC=C12 XCPGHVQEEXUHNC-UHFFFAOYSA-N 0.000 description 1
- 229960001220 amsacrine Drugs 0.000 description 1
- 201000007538 anal carcinoma Diseases 0.000 description 1
- 229960002932 anastrozole Drugs 0.000 description 1
- YBBLVLTVTVSKRW-UHFFFAOYSA-N anastrozole Chemical compound N#CC(C)(C)C1=CC(C(C)(C#N)C)=CC(CN2N=CN=C2)=C1 YBBLVLTVTVSKRW-UHFFFAOYSA-N 0.000 description 1
- BBDAGFIXKZCXAH-CCXZUQQUSA-N ancitabine Chemical compound N=C1C=CN2[C@@H]3O[C@H](CO)[C@@H](O)[C@@H]3OC2=N1 BBDAGFIXKZCXAH-CCXZUQQUSA-N 0.000 description 1
- 229950000242 ancitabine Drugs 0.000 description 1
- 239000003098 androgen Substances 0.000 description 1
- 229940030486 androgens Drugs 0.000 description 1
- 239000004037 angiogenesis inhibitor Substances 0.000 description 1
- 230000002491 angiogenic effect Effects 0.000 description 1
- 125000002490 anilino group Chemical group [H]N(*)C1=C([H])C([H])=C([H])C([H])=C1[H] 0.000 description 1
- 239000010775 animal oil Substances 0.000 description 1
- 150000001449 anionic compounds Chemical class 0.000 description 1
- 150000001450 anions Chemical class 0.000 description 1
- 239000003817 anthracycline antibiotic agent Substances 0.000 description 1
- 230000002280 anti-androgenic effect Effects 0.000 description 1
- 230000001093 anti-cancer Effects 0.000 description 1
- 230000002494 anti-cea effect Effects 0.000 description 1
- 229940046836 anti-estrogen Drugs 0.000 description 1
- 230000001833 anti-estrogenic effect Effects 0.000 description 1
- 230000000259 anti-tumor effect Effects 0.000 description 1
- 239000000051 antiandrogen Substances 0.000 description 1
- 229940030495 antiandrogen sex hormone and modulator of the genital system Drugs 0.000 description 1
- 229940125644 antibody drug Drugs 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 239000013059 antihormonal agent Substances 0.000 description 1
- 229940045687 antimetabolites folic acid analogs Drugs 0.000 description 1
- 229940045719 antineoplastic alkylating agent nitrosoureas Drugs 0.000 description 1
- 239000003972 antineoplastic antibiotic Substances 0.000 description 1
- 229940045988 antineoplastic drug protein kinase inhibitors Drugs 0.000 description 1
- 239000000074 antisense oligonucleotide Substances 0.000 description 1
- 238000012230 antisense oligonucleotides Methods 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 239000003886 aromatase inhibitor Substances 0.000 description 1
- 229940046844 aromatase inhibitors Drugs 0.000 description 1
- 239000011668 ascorbic acid Substances 0.000 description 1
- 235000010323 ascorbic acid Nutrition 0.000 description 1
- 229960005070 ascorbic acid Drugs 0.000 description 1
- 229950002882 aselizumab Drugs 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 229960005261 aspartic acid Drugs 0.000 description 1
- 230000003140 astrocytic effect Effects 0.000 description 1
- 238000011914 asymmetric synthesis Methods 0.000 description 1
- 239000012298 atmosphere Substances 0.000 description 1
- 201000005000 autoimmune gastritis Diseases 0.000 description 1
- 201000000448 autoimmune hemolytic anemia Diseases 0.000 description 1
- 208000010928 autoimmune thyroid disease Diseases 0.000 description 1
- 229940120638 avastin Drugs 0.000 description 1
- 108010023337 axl receptor tyrosine kinase Proteins 0.000 description 1
- 229960002756 azacitidine Drugs 0.000 description 1
- 229950011321 azaserine Drugs 0.000 description 1
- XYOVOXDWRFGKEX-UHFFFAOYSA-N azepine Chemical compound N1C=CC=CC=C1 XYOVOXDWRFGKEX-UHFFFAOYSA-N 0.000 description 1
- HONIICLYMWZJFZ-UHFFFAOYSA-N azetidine Chemical compound C1CNC1 HONIICLYMWZJFZ-UHFFFAOYSA-N 0.000 description 1
- 125000002393 azetidinyl group Chemical group 0.000 description 1
- 150000001540 azides Chemical class 0.000 description 1
- 125000000852 azido group Chemical group *N=[N+]=[N-] 0.000 description 1
- 125000004069 aziridinyl group Chemical group 0.000 description 1
- 229950001863 bapineuzumab Drugs 0.000 description 1
- 229960004669 basiliximab Drugs 0.000 description 1
- SRSXLGNVWSONIS-UHFFFAOYSA-N benzenesulfonic acid Chemical compound OS(=O)(=O)C1=CC=CC=C1 SRSXLGNVWSONIS-UHFFFAOYSA-N 0.000 description 1
- 229940049706 benzodiazepine Drugs 0.000 description 1
- 150000001557 benzodiazepines Chemical class 0.000 description 1
- BNBQRQQYDMDJAH-UHFFFAOYSA-N benzodioxan Chemical compound C1=CC=C2OCCOC2=C1 BNBQRQQYDMDJAH-UHFFFAOYSA-N 0.000 description 1
- 235000010233 benzoic acid Nutrition 0.000 description 1
- 229960004365 benzoic acid Drugs 0.000 description 1
- 150000001558 benzoic acid derivatives Chemical class 0.000 description 1
- WXNOJTUTEXAZLD-UHFFFAOYSA-L benzonitrile;dichloropalladium Chemical compound Cl[Pd]Cl.N#CC1=CC=CC=C1.N#CC1=CC=CC=C1 WXNOJTUTEXAZLD-UHFFFAOYSA-L 0.000 description 1
- QRUDEWIWKLJBPS-UHFFFAOYSA-N benzotriazole Chemical compound C1=CC=C2N[N][N]C2=C1 QRUDEWIWKLJBPS-UHFFFAOYSA-N 0.000 description 1
- 239000012964 benzotriazole Substances 0.000 description 1
- 125000003236 benzoyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C(*)=O 0.000 description 1
- HSDAJNMJOMSNEV-UHFFFAOYSA-N benzyl chloroformate Chemical compound ClC(=O)OCC1=CC=CC=C1 HSDAJNMJOMSNEV-UHFFFAOYSA-N 0.000 description 1
- 125000001797 benzyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])* 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 229940000635 beta-alanine Drugs 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 229960000997 bicalutamide Drugs 0.000 description 1
- SHOMMGQAMRXRRK-UHFFFAOYSA-N bicyclo[3.1.1]heptane Chemical compound C1C2CC1CCC2 SHOMMGQAMRXRRK-UHFFFAOYSA-N 0.000 description 1
- 230000001588 bifunctional effect Effects 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 239000012620 biological material Substances 0.000 description 1
- 229940126587 biotherapeutics Drugs 0.000 description 1
- 150000004663 bisphosphonates Chemical class 0.000 description 1
- 229950006844 bizelesin Drugs 0.000 description 1
- 210000004952 blastocoel Anatomy 0.000 description 1
- 201000000053 blastoma Diseases 0.000 description 1
- 229960001561 bleomycin Drugs 0.000 description 1
- OYVAGSVQBOHSSS-UAPAGMARSA-O bleomycin A2 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC=C(N=1)C=1SC=C(N=1)C(=O)NCCC[S+](C)C)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C OYVAGSVQBOHSSS-UAPAGMARSA-O 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 210000002798 bone marrow cell Anatomy 0.000 description 1
- 229940112869 bone morphogenetic protein Drugs 0.000 description 1
- 229910021538 borax Inorganic materials 0.000 description 1
- 150000001638 boron Chemical class 0.000 description 1
- GXJABQQUPOEUTA-RDJZCZTQSA-N bortezomib Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)B(O)O)NC(=O)C=1N=CC=NC=1)C1=CC=CC=C1 GXJABQQUPOEUTA-RDJZCZTQSA-N 0.000 description 1
- 229960001467 bortezomib Drugs 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 229960005520 bryostatin Drugs 0.000 description 1
- MJQUEDHRCUIRLF-TVIXENOKSA-N bryostatin 1 Chemical compound C([C@@H]1CC(/[C@@H]([C@@](C(C)(C)/C=C/2)(O)O1)OC(=O)/C=C/C=C/CCC)=C\C(=O)OC)[C@H]([C@@H](C)O)OC(=O)C[C@H](O)C[C@@H](O1)C[C@H](OC(C)=O)C(C)(C)[C@]1(O)C[C@@H]1C\C(=C\C(=O)OC)C[C@H]\2O1 MJQUEDHRCUIRLF-TVIXENOKSA-N 0.000 description 1
- MUIWQCKLQMOUAT-AKUNNTHJSA-N bryostatin 20 Natural products COC(=O)C=C1C[C@@]2(C)C[C@]3(O)O[C@](C)(C[C@@H](O)CC(=O)O[C@](C)(C[C@@]4(C)O[C@](O)(CC5=CC(=O)O[C@]45C)C(C)(C)C=C[C@@](C)(C1)O2)[C@@H](C)O)C[C@H](OC(=O)C(C)(C)C)C3(C)C MUIWQCKLQMOUAT-AKUNNTHJSA-N 0.000 description 1
- 208000000594 bullous pemphigoid Diseases 0.000 description 1
- 229960002092 busulfan Drugs 0.000 description 1
- KDYFGRWQOYBRFD-NUQCWPJISA-N butanedioic acid Chemical compound O[14C](=O)CC[14C](O)=O KDYFGRWQOYBRFD-NUQCWPJISA-N 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 235000008207 calcium folinate Nutrition 0.000 description 1
- 239000011687 calcium folinate Substances 0.000 description 1
- BPKIGYQJPYCAOW-FFJTTWKXSA-I calcium;potassium;disodium;(2s)-2-hydroxypropanoate;dichloride;dihydroxide;hydrate Chemical compound O.[OH-].[OH-].[Na+].[Na+].[Cl-].[Cl-].[K+].[Ca+2].C[C@H](O)C([O-])=O BPKIGYQJPYCAOW-FFJTTWKXSA-I 0.000 description 1
- BMLSTPRTEKLIPM-UHFFFAOYSA-I calcium;potassium;disodium;hydrogen carbonate;dichloride;dihydroxide;hydrate Chemical compound O.[OH-].[OH-].[Na+].[Na+].[Cl-].[Cl-].[K+].[Ca+2].OC([O-])=O BMLSTPRTEKLIPM-UHFFFAOYSA-I 0.000 description 1
- ZEWYCNBZMPELPF-UHFFFAOYSA-J calcium;potassium;sodium;2-hydroxypropanoic acid;sodium;tetrachloride Chemical compound [Na].[Na+].[Cl-].[Cl-].[Cl-].[Cl-].[K+].[Ca+2].CC(O)C(O)=O ZEWYCNBZMPELPF-UHFFFAOYSA-J 0.000 description 1
- 229930195731 calicheamicin Natural products 0.000 description 1
- VSJKWCGYPAHWDS-FQEVSTJZSA-N camptothecin Chemical compound C1=CC=C2C=C(CN3C4=CC5=C(C3=O)COC(=O)[C@]5(O)CC)C4=NC2=C1 VSJKWCGYPAHWDS-FQEVSTJZSA-N 0.000 description 1
- 229940127093 camptothecin Drugs 0.000 description 1
- 229960004117 capecitabine Drugs 0.000 description 1
- 125000001951 carbamoylamino group Chemical group C(N)(=O)N* 0.000 description 1
- 125000002837 carbocyclic group Chemical group 0.000 description 1
- 238000001460 carbon-13 nuclear magnetic resonance spectrum Methods 0.000 description 1
- 150000004649 carbonic acid derivatives Chemical class 0.000 description 1
- 229960002115 carboquone Drugs 0.000 description 1
- 150000003857 carboxamides Chemical class 0.000 description 1
- 239000001768 carboxy methyl cellulose Substances 0.000 description 1
- 235000010948 carboxy methyl cellulose Nutrition 0.000 description 1
- 150000007942 carboxylates Chemical group 0.000 description 1
- 150000001244 carboxylic acid anhydrides Chemical class 0.000 description 1
- 150000001732 carboxylic acid derivatives Chemical class 0.000 description 1
- 125000003262 carboxylic acid ester group Chemical group [H]C([H])([*:2])OC(=O)C([H])([H])[*:1] 0.000 description 1
- 150000001733 carboxylic acid esters Chemical class 0.000 description 1
- 239000008112 carboxymethyl-cellulose Substances 0.000 description 1
- XREUEWVEMYWFFA-CSKJXFQVSA-N carminomycin Chemical compound C1[C@H](N)[C@H](O)[C@H](C)O[C@H]1O[C@@H]1C2=C(O)C(C(=O)C3=C(O)C=CC=C3C3=O)=C3C(O)=C2C[C@@](O)(C(C)=O)C1 XREUEWVEMYWFFA-CSKJXFQVSA-N 0.000 description 1
- 229930188550 carminomycin Natural products 0.000 description 1
- XREUEWVEMYWFFA-UHFFFAOYSA-N carminomycin I Natural products C1C(N)C(O)C(C)OC1OC1C2=C(O)C(C(=O)C3=C(O)C=CC=C3C3=O)=C3C(O)=C2CC(O)(C(C)=O)C1 XREUEWVEMYWFFA-UHFFFAOYSA-N 0.000 description 1
- 229960003261 carmofur Drugs 0.000 description 1
- 229960005243 carmustine Drugs 0.000 description 1
- 229950001725 carubicin Drugs 0.000 description 1
- 239000005018 casein Substances 0.000 description 1
- BECPQYXYKAMYBN-UHFFFAOYSA-N casein, tech. Chemical compound NCCCCC(C(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(CC(C)C)N=C(O)C(CCC(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(C(C)O)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(COP(O)(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(N)CC1=CC=CC=C1 BECPQYXYKAMYBN-UHFFFAOYSA-N 0.000 description 1
- 235000021240 caseins Nutrition 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000011712 cell development Effects 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 230000022534 cell killing Effects 0.000 description 1
- 230000011748 cell maturation Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 201000010881 cervical cancer Diseases 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- MVPPADPHJFYWMZ-UHFFFAOYSA-N chlorobenzene Chemical compound ClC1=CC=CC=C1 MVPPADPHJFYWMZ-UHFFFAOYSA-N 0.000 description 1
- VZWXIQHBIQLMPN-UHFFFAOYSA-N chromane Chemical compound C1=CC=C2CCCOC2=C1 VZWXIQHBIQLMPN-UHFFFAOYSA-N 0.000 description 1
- QZHPTGXQGDFGEN-UHFFFAOYSA-N chromene Chemical compound C1=CC=C2C=C[CH]OC2=C1 QZHPTGXQGDFGEN-UHFFFAOYSA-N 0.000 description 1
- ZYVSOIYQKUDENJ-WKSBCEQHSA-N chromomycin A3 Chemical compound O([C@@H]1C[C@@H](O[C@H](C)[C@@H]1OC(C)=O)OC=1C=C2C=C3C[C@H]([C@@H](C(=O)C3=C(O)C2=C(O)C=1C)O[C@@H]1O[C@H](C)[C@@H](O)[C@H](O[C@@H]2O[C@H](C)[C@@H](O)[C@H](O[C@@H]3O[C@@H](C)[C@H](OC(C)=O)[C@@](C)(O)C3)C2)C1)[C@H](OC)C(=O)[C@@H](O)[C@@H](C)O)[C@@H]1C[C@@H](O)[C@@H](OC)[C@@H](C)O1 ZYVSOIYQKUDENJ-WKSBCEQHSA-N 0.000 description 1
- 208000025302 chronic primary adrenal insufficiency Diseases 0.000 description 1
- 235000013985 cinnamic acid Nutrition 0.000 description 1
- 229930016911 cinnamic acid Natural products 0.000 description 1
- 229960002286 clodronic acid Drugs 0.000 description 1
- ACSIXWWBWUQEHA-UHFFFAOYSA-L clondronate(2-) Chemical compound OP([O-])(=O)C(Cl)(Cl)P(O)([O-])=O ACSIXWWBWUQEHA-UHFFFAOYSA-L 0.000 description 1
- 229960002271 cobimetinib Drugs 0.000 description 1
- BSMCAPRUBJMWDF-KRWDZBQOSA-N cobimetinib Chemical compound C1C(O)([C@H]2NCCCC2)CN1C(=O)C1=CC=C(F)C(F)=C1NC1=CC=C(I)C=C1F BSMCAPRUBJMWDF-KRWDZBQOSA-N 0.000 description 1
- 229960001338 colchicine Drugs 0.000 description 1
- 210000001072 colon Anatomy 0.000 description 1
- 239000003086 colorant Substances 0.000 description 1
- 230000004540 complement-dependent cytotoxicity Effects 0.000 description 1
- 210000002808 connective tissue Anatomy 0.000 description 1
- 238000011254 conventional chemotherapy Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 239000006071 cream Substances 0.000 description 1
- 239000013058 crude material Substances 0.000 description 1
- 239000012045 crude solution Substances 0.000 description 1
- 201000003278 cryoglobulinemia Diseases 0.000 description 1
- 108010006226 cryptophycin Proteins 0.000 description 1
- 108010089438 cryptophycin 1 Proteins 0.000 description 1
- 108010090203 cryptophycin 8 Proteins 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 208000004921 cutaneous lupus erythematosus Diseases 0.000 description 1
- 125000001651 cyanato group Chemical group [*]OC#N 0.000 description 1
- 125000002993 cycloalkylene group Chemical group 0.000 description 1
- CFBGXYDUODCMNS-UHFFFAOYSA-N cyclobutene Chemical compound C1CC=C1 CFBGXYDUODCMNS-UHFFFAOYSA-N 0.000 description 1
- 125000001995 cyclobutyl group Chemical group [H]C1([H])C([H])([H])C([H])(*)C1([H])[H] 0.000 description 1
- 125000005725 cyclohexenylene group Chemical group 0.000 description 1
- 125000000113 cyclohexyl group Chemical group [H]C1([H])C([H])([H])C([H])([H])C([H])(*)C([H])([H])C1([H])[H] 0.000 description 1
- 125000004956 cyclohexylene group Chemical group 0.000 description 1
- 125000001511 cyclopentyl group Chemical group [H]C1([H])C([H])([H])C([H])([H])C([H])(*)C1([H])[H] 0.000 description 1
- 125000004979 cyclopentylene group Chemical group 0.000 description 1
- OOXWYYGXTJLWHA-UHFFFAOYSA-N cyclopropene Chemical compound C1C=C1 OOXWYYGXTJLWHA-UHFFFAOYSA-N 0.000 description 1
- WLVKDFJTYKELLQ-UHFFFAOYSA-N cyclopropylboronic acid Chemical compound OB(O)C1CC1 WLVKDFJTYKELLQ-UHFFFAOYSA-N 0.000 description 1
- 102000003675 cytokine receptors Human genes 0.000 description 1
- 108010057085 cytokine receptors Proteins 0.000 description 1
- 239000000824 cytostatic agent Substances 0.000 description 1
- 229960003901 dacarbazine Drugs 0.000 description 1
- 229960000640 dactinomycin Drugs 0.000 description 1
- 229950006418 dactolisib Drugs 0.000 description 1
- JOGKUKXHTYWRGZ-UHFFFAOYSA-N dactolisib Chemical compound O=C1N(C)C2=CN=C3C=CC(C=4C=C5C=CC=CC5=NC=4)=CC3=C2N1C1=CC=C(C(C)(C)C#N)C=C1 JOGKUKXHTYWRGZ-UHFFFAOYSA-N 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 125000001295 dansyl group Chemical group [H]C1=C([H])C(N(C([H])([H])[H])C([H])([H])[H])=C2C([H])=C([H])C([H])=C(C2=C1[H])S(*)(=O)=O 0.000 description 1
- DOAKLVKFURWEDJ-QCMAZARJSA-N daptomycin Chemical compound C([C@H]1C(=O)O[C@H](C)[C@@H](C(NCC(=O)N[C@@H](CCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@H](CO)C(=O)N[C@H](C(=O)N1)[C@H](C)CC(O)=O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](CC(N)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)CCCCCCCCC)C(=O)C1=CC=CC=C1N DOAKLVKFURWEDJ-QCMAZARJSA-N 0.000 description 1
- 229960005484 daptomycin Drugs 0.000 description 1
- 235000020960 dehydroascorbic acid Nutrition 0.000 description 1
- 239000011615 dehydroascorbic acid Substances 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000003831 deregulation Effects 0.000 description 1
- 201000001981 dermatomyositis Diseases 0.000 description 1
- 238000011033 desalting Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- NKLCNNUWBJBICK-UHFFFAOYSA-N dess–martin periodinane Chemical compound C1=CC=C2I(OC(=O)C)(OC(C)=O)(OC(C)=O)OC(=O)C2=C1 NKLCNNUWBJBICK-UHFFFAOYSA-N 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 206010012601 diabetes mellitus Diseases 0.000 description 1
- MHDVGSVTJDSBDK-UHFFFAOYSA-N dibenzyl ether Chemical class C=1C=CC=CC=1COCC1=CC=CC=C1 MHDVGSVTJDSBDK-UHFFFAOYSA-N 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 150000004683 dihydrates Chemical class 0.000 description 1
- 125000000118 dimethyl group Chemical group [H]C([H])([H])* 0.000 description 1
- HGGNZMUHOHGHBJ-UHFFFAOYSA-N dioxepane Chemical compound C1CCOOCC1 HGGNZMUHOHGHBJ-UHFFFAOYSA-N 0.000 description 1
- 150000004862 dioxolanes Chemical class 0.000 description 1
- 238000004090 dissolution Methods 0.000 description 1
- VSJKWCGYPAHWDS-UHFFFAOYSA-N dl-camptothecin Natural products C1=CC=C2C=C(CN3C4=CC5=C(C3=O)COC(=O)C5(O)CC)C4=NC2=C1 VSJKWCGYPAHWDS-UHFFFAOYSA-N 0.000 description 1
- AMRJKAQTDDKMCE-UHFFFAOYSA-N dolastatin Chemical compound CC(C)C(N(C)C)C(=O)NC(C(C)C)C(=O)N(C)C(C(C)C)C(OC)CC(=O)N1CCCC1C(OC)C(C)C(=O)NC(C=1SC=CN=1)CC1=CC=CC=C1 AMRJKAQTDDKMCE-UHFFFAOYSA-N 0.000 description 1
- 229930188854 dolastatin Natural products 0.000 description 1
- 230000003828 downregulation Effects 0.000 description 1
- ZWAOHEXOSAUJHY-ZIYNGMLESA-N doxifluridine Chemical compound O[C@@H]1[C@H](O)[C@@H](C)O[C@H]1N1C(=O)NC(=O)C(F)=C1 ZWAOHEXOSAUJHY-ZIYNGMLESA-N 0.000 description 1
- 229950005454 doxifluridine Drugs 0.000 description 1
- 229950004203 droloxifene Drugs 0.000 description 1
- 239000003596 drug target Substances 0.000 description 1
- 229960005501 duocarmycin Drugs 0.000 description 1
- VQNATVDKACXKTF-XELLLNAOSA-N duocarmycin Chemical compound COC1=C(OC)C(OC)=C2NC(C(=O)N3C4=CC(=O)C5=C([C@@]64C[C@@H]6C3)C=C(N5)C(=O)OC)=CC2=C1 VQNATVDKACXKTF-XELLLNAOSA-N 0.000 description 1
- 229930184221 duocarmycin Natural products 0.000 description 1
- 239000000428 dust Substances 0.000 description 1
- AFMYMMXSQGUCBK-AKMKHHNQSA-N dynemicin a Chemical compound C1#C\C=C/C#C[C@@H]2NC(C=3C(=O)C4=C(O)C=CC(O)=C4C(=O)C=3C(O)=C3)=C3[C@@]34O[C@]32[C@@H](C)C(C(O)=O)=C(OC)[C@H]41 AFMYMMXSQGUCBK-AKMKHHNQSA-N 0.000 description 1
- 229960002224 eculizumab Drugs 0.000 description 1
- FSIRXIHZBIXHKT-MHTVFEQDSA-N edatrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CC(CC)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FSIRXIHZBIXHKT-MHTVFEQDSA-N 0.000 description 1
- 229950006700 edatrexate Drugs 0.000 description 1
- 230000002500 effect on skin Effects 0.000 description 1
- 229960002759 eflornithine Drugs 0.000 description 1
- 229920001971 elastomer Polymers 0.000 description 1
- 238000000132 electrospray ionisation Methods 0.000 description 1
- XOPYFXBZMVTEJF-PDACKIITSA-N eleutherobin Chemical compound C(/[C@H]1[C@H](C(=CC[C@@H]1C(C)C)C)C[C@@H]([C@@]1(C)O[C@@]2(C=C1)OC)OC(=O)\C=C\C=1N=CN(C)C=1)=C2\CO[C@@H]1OC[C@@H](O)[C@@H](O)[C@@H]1OC(C)=O XOPYFXBZMVTEJF-PDACKIITSA-N 0.000 description 1
- XOPYFXBZMVTEJF-UHFFFAOYSA-N eleutherobin Natural products C1=CC2(OC)OC1(C)C(OC(=O)C=CC=1N=CN(C)C=1)CC(C(=CCC1C(C)C)C)C1C=C2COC1OCC(O)C(O)C1OC(C)=O XOPYFXBZMVTEJF-UHFFFAOYSA-N 0.000 description 1
- 229960004137 elotuzumab Drugs 0.000 description 1
- 201000008184 embryoma Diseases 0.000 description 1
- 150000002081 enamines Chemical class 0.000 description 1
- 201000003914 endometrial carcinoma Diseases 0.000 description 1
- 230000002357 endometrial effect Effects 0.000 description 1
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 1
- ZUBDGKVDJUIMQQ-UBFCDGJISA-N endothelin-1 Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)NC(=O)[C@H]1NC(=O)[C@H](CC=2C=CC=CC=2)NC(=O)[C@@H](CC=2C=CC(O)=CC=2)NC(=O)[C@H](C(C)C)NC(=O)[C@H]2CSSC[C@@H](C(N[C@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N2)=O)NC(=O)[C@@H](CO)NC(=O)[C@H](N)CSSC1)C1=CNC=N1 ZUBDGKVDJUIMQQ-UBFCDGJISA-N 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- JOZGNYDSEBIJDH-UHFFFAOYSA-N eniluracil Chemical compound O=C1NC=C(C#C)C(=O)N1 JOZGNYDSEBIJDH-UHFFFAOYSA-N 0.000 description 1
- 229950010213 eniluracil Drugs 0.000 description 1
- 229950011487 enocitabine Drugs 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 230000007071 enzymatic hydrolysis Effects 0.000 description 1
- 108010087914 epidermal growth factor receptor VIII Proteins 0.000 description 1
- 229960001904 epirubicin Drugs 0.000 description 1
- 210000002919 epithelial cell Anatomy 0.000 description 1
- 210000004955 epithelial membrane Anatomy 0.000 description 1
- 229930013356 epothilone Natural products 0.000 description 1
- 150000003883 epothilone derivatives Chemical class 0.000 description 1
- 229940082789 erbitux Drugs 0.000 description 1
- AAKJLRGGTJKAMG-UHFFFAOYSA-N erlotinib Chemical compound C=12C=C(OCCOC)C(OCCOC)=CC2=NC=NC=1NC1=CC=CC(C#C)=C1 AAKJLRGGTJKAMG-UHFFFAOYSA-N 0.000 description 1
- 229960001433 erlotinib Drugs 0.000 description 1
- 239000000328 estrogen antagonist Substances 0.000 description 1
- AFAXGSQYZLGZPG-UHFFFAOYSA-N ethanedisulfonic acid Chemical compound OS(=O)(=O)CCS(O)(=O)=O AFAXGSQYZLGZPG-UHFFFAOYSA-N 0.000 description 1
- RQIFXTOWUNAUJC-UHFFFAOYSA-N ethanesulfinic acid Chemical compound CCS(O)=O RQIFXTOWUNAUJC-UHFFFAOYSA-N 0.000 description 1
- 125000001033 ether group Chemical group 0.000 description 1
- 125000001301 ethoxy group Chemical group [H]C([H])([H])C([H])([H])O* 0.000 description 1
- 125000004494 ethyl ester group Chemical group 0.000 description 1
- PQVSTLUFSYVLTO-UHFFFAOYSA-N ethyl n-ethoxycarbonylcarbamate Chemical compound CCOC(=O)NC(=O)OCC PQVSTLUFSYVLTO-UHFFFAOYSA-N 0.000 description 1
- WHRIKZCFRVTHJH-UHFFFAOYSA-N ethylhydrazine Chemical compound CCNN WHRIKZCFRVTHJH-UHFFFAOYSA-N 0.000 description 1
- 125000006125 ethylsulfonyl group Chemical group 0.000 description 1
- 125000004705 ethylthio group Chemical group C(C)S* 0.000 description 1
- VJJPUSNTGOMMGY-MRVIYFEKSA-N etoposide Chemical compound COC1=C(O)C(OC)=CC([C@@H]2C3=CC=4OCOC=4C=C3[C@@H](O[C@H]3[C@@H]([C@@H](O)[C@@H]4O[C@H](C)OC[C@H]4O3)O)[C@@H]3[C@@H]2C(OC3)=O)=C1 VJJPUSNTGOMMGY-MRVIYFEKSA-N 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 230000029142 excretion Effects 0.000 description 1
- 229960000255 exemestane Drugs 0.000 description 1
- 210000002744 extracellular matrix Anatomy 0.000 description 1
- 229950011548 fadrozole Drugs 0.000 description 1
- 210000003608 fece Anatomy 0.000 description 1
- 230000001605 fetal effect Effects 0.000 description 1
- 210000003754 fetus Anatomy 0.000 description 1
- MYRIFIVQGRMHRF-OECXYHNASA-N fibrinopeptide b Chemical compound N([C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C(=O)CNC(=O)[C@@H]1CCC(=O)N1 MYRIFIVQGRMHRF-OECXYHNASA-N 0.000 description 1
- 108010072257 fibroblast activation protein alpha Proteins 0.000 description 1
- 230000000893 fibroproliferative effect Effects 0.000 description 1
- 239000000945 filler Substances 0.000 description 1
- 239000012065 filter cake Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 239000000796 flavoring agent Substances 0.000 description 1
- ODKNJVUHOIMIIZ-RRKCRQDMSA-N floxuridine Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(F)=C1 ODKNJVUHOIMIIZ-RRKCRQDMSA-N 0.000 description 1
- 229960000961 floxuridine Drugs 0.000 description 1
- 229960000390 fludarabine Drugs 0.000 description 1
- GIUYCYHIANZCFB-FJFJXFQQSA-N fludarabine phosphate Chemical compound C1=NC=2C(N)=NC(F)=NC=2N1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)[C@@H]1O GIUYCYHIANZCFB-FJFJXFQQSA-N 0.000 description 1
- GVEPBJHOBDJJJI-UHFFFAOYSA-N fluoranthrene Natural products C1=CC(C2=CC=CC=C22)=C3C2=CC=CC3=C1 GVEPBJHOBDJJJI-UHFFFAOYSA-N 0.000 description 1
- RMBPEFMHABBEKP-UHFFFAOYSA-N fluorene Chemical compound C1=CC=C2C3=C[CH]C=CC3=CC2=C1 RMBPEFMHABBEKP-UHFFFAOYSA-N 0.000 description 1
- IRXSLJNXXZKURP-UHFFFAOYSA-N fluorenylmethyloxycarbonyl chloride Chemical compound C1=CC=C2C(COC(=O)Cl)C3=CC=CC=C3C2=C1 IRXSLJNXXZKURP-UHFFFAOYSA-N 0.000 description 1
- 239000003269 fluorescent indicator Substances 0.000 description 1
- MKXKFYHWDHIYRV-UHFFFAOYSA-N flutamide Chemical compound CC(C)C(=O)NC1=CC=C([N+]([O-])=O)C(C(F)(F)F)=C1 MKXKFYHWDHIYRV-UHFFFAOYSA-N 0.000 description 1
- 229960002074 flutamide Drugs 0.000 description 1
- 229940014144 folate Drugs 0.000 description 1
- 229960000304 folic acid Drugs 0.000 description 1
- 150000002224 folic acids Chemical class 0.000 description 1
- 235000013355 food flavoring agent Nutrition 0.000 description 1
- 235000003599 food sweetener Nutrition 0.000 description 1
- 229960004421 formestane Drugs 0.000 description 1
- OSVMTWJCGUFAOD-KZQROQTASA-N formestane Chemical compound O=C1CC[C@]2(C)[C@H]3CC[C@](C)(C(CC4)=O)[C@@H]4[C@@H]3CCC2=C1O OSVMTWJCGUFAOD-KZQROQTASA-N 0.000 description 1
- 125000002485 formyl group Chemical group [H]C(*)=O 0.000 description 1
- 229960004783 fotemustine Drugs 0.000 description 1
- YAKWPXVTIGTRJH-UHFFFAOYSA-N fotemustine Chemical compound CCOP(=O)(OCC)C(C)NC(=O)N(CCCl)N=O YAKWPXVTIGTRJH-UHFFFAOYSA-N 0.000 description 1
- 238000001640 fractional crystallisation Methods 0.000 description 1
- 238000004108 freeze drying Methods 0.000 description 1
- 125000002446 fucosyl group Chemical group C1([C@@H](O)[C@H](O)[C@H](O)[C@@H](O1)C)* 0.000 description 1
- 229960002258 fulvestrant Drugs 0.000 description 1
- 239000001530 fumaric acid Substances 0.000 description 1
- 150000002243 furanoses Chemical class 0.000 description 1
- DSLZVSRJTYRBFB-DUHBMQHGSA-N galactaric acid Chemical compound OC(=O)[C@H](O)[C@@H](O)[C@@H](O)[C@H](O)C(O)=O DSLZVSRJTYRBFB-DUHBMQHGSA-N 0.000 description 1
- 229930182830 galactose Natural products 0.000 description 1
- 229940044658 gallium nitrate Drugs 0.000 description 1
- 210000001035 gastrointestinal tract Anatomy 0.000 description 1
- XGALLCVXEZPNRQ-UHFFFAOYSA-N gefitinib Chemical compound C=12C=C(OCCCN3CCOCC3)C(OC)=CC2=NC=NC=1NC1=CC=C(F)C(Cl)=C1 XGALLCVXEZPNRQ-UHFFFAOYSA-N 0.000 description 1
- 229960002584 gefitinib Drugs 0.000 description 1
- 239000008273 gelatin Substances 0.000 description 1
- 229920000159 gelatin Polymers 0.000 description 1
- 235000019322 gelatine Nutrition 0.000 description 1
- 235000011852 gelatine desserts Nutrition 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 238000001415 gene therapy Methods 0.000 description 1
- 239000003448 gibberellin Substances 0.000 description 1
- IXORZMNAPKEEDV-OBDJNFEBSA-N gibberellin A3 Chemical class C([C@@]1(O)C(=C)C[C@@]2(C1)[C@H]1C(O)=O)C[C@H]2[C@]2(C=C[C@@H]3O)[C@H]1[C@]3(C)C(=O)O2 IXORZMNAPKEEDV-OBDJNFEBSA-N 0.000 description 1
- 239000012362 glacial acetic acid Substances 0.000 description 1
- 210000004907 gland Anatomy 0.000 description 1
- 230000000762 glandular Effects 0.000 description 1
- 230000002518 glial effect Effects 0.000 description 1
- 208000005017 glioblastoma Diseases 0.000 description 1
- 239000000174 gluconic acid Substances 0.000 description 1
- 235000012208 gluconic acid Nutrition 0.000 description 1
- 229950002441 glucurolactone Drugs 0.000 description 1
- 229930182480 glucuronide Natural products 0.000 description 1
- 125000000291 glutamic acid group Chemical group N[C@@H](CCC(O)=O)C(=O)* 0.000 description 1
- 150000002334 glycols Chemical class 0.000 description 1
- 229930182470 glycoside Natural products 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229960002913 goserelin Drugs 0.000 description 1
- 239000008187 granular material Substances 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 101150028578 grp78 gene Proteins 0.000 description 1
- 125000002795 guanidino group Chemical group C(N)(=N)N* 0.000 description 1
- 229940093915 gynecological organic acid Drugs 0.000 description 1
- 201000010536 head and neck cancer Diseases 0.000 description 1
- 208000014829 head and neck neoplasm Diseases 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 230000010370 hearing loss Effects 0.000 description 1
- 231100000888 hearing loss Toxicity 0.000 description 1
- 208000016354 hearing loss disease Diseases 0.000 description 1
- 208000014951 hematologic disease Diseases 0.000 description 1
- 230000002489 hematologic effect Effects 0.000 description 1
- 210000005096 hematological system Anatomy 0.000 description 1
- 125000001976 hemiacetal group Chemical group 0.000 description 1
- 230000002440 hepatic effect Effects 0.000 description 1
- 125000003187 heptyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- 150000002391 heterocyclic compounds Chemical class 0.000 description 1
- UUVWYPNAQBNQJQ-UHFFFAOYSA-N hexamethylmelamine Chemical compound CN(C)C1=NC(N(C)C)=NC(N(C)C)=N1 UUVWYPNAQBNQJQ-UHFFFAOYSA-N 0.000 description 1
- 125000006038 hexenyl group Chemical group 0.000 description 1
- 125000004051 hexyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])* 0.000 description 1
- 201000000284 histiocytoma Diseases 0.000 description 1
- 108010077157 histocompatibility antigen 37 Proteins 0.000 description 1
- 230000003054 hormonal effect Effects 0.000 description 1
- 231100000508 hormonal effect Toxicity 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940071870 hydroiodic acid Drugs 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 229960001330 hydroxycarbamide Drugs 0.000 description 1
- 125000004029 hydroxymethyl group Chemical group [H]OC([H])([H])* 0.000 description 1
- 230000002267 hypothalamic effect Effects 0.000 description 1
- 229960005236 ibandronic acid Drugs 0.000 description 1
- 229960000908 idarubicin Drugs 0.000 description 1
- 150000002453 idose derivatives Chemical class 0.000 description 1
- 229960003685 imatinib mesylate Drugs 0.000 description 1
- YLMAHDNUQAMNNX-UHFFFAOYSA-N imatinib methanesulfonate Chemical compound CS(O)(=O)=O.C1CN(C)CCN1CC1=CC=C(C(=O)NC=2C=C(NC=3N=C(C=CN=3)C=3C=NC=CC=3)C(C)=CC=2)C=C1 YLMAHDNUQAMNNX-UHFFFAOYSA-N 0.000 description 1
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Substances C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 1
- MTNDZQHUAFNZQY-UHFFFAOYSA-N imidazoline Chemical compound C1CN=CN1 MTNDZQHUAFNZQY-UHFFFAOYSA-N 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 102000027596 immune receptors Human genes 0.000 description 1
- 108091008915 immune receptors Proteins 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 238000002649 immunization Methods 0.000 description 1
- DBIGHPPNXATHOF-UHFFFAOYSA-N improsulfan Chemical compound CS(=O)(=O)OCCCNCCCOS(C)(=O)=O DBIGHPPNXATHOF-UHFFFAOYSA-N 0.000 description 1
- 229950008097 improsulfan Drugs 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000011065 in-situ storage Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 150000002468 indanes Chemical class 0.000 description 1
- 150000002473 indoazoles Chemical class 0.000 description 1
- PZOUSPYUWWUPPK-UHFFFAOYSA-N indole Natural products CC1=CC=CC2=C1C=CN2 PZOUSPYUWWUPPK-UHFFFAOYSA-N 0.000 description 1
- RKJUIXBNRJVNHR-UHFFFAOYSA-N indolenine Natural products C1=CC=C2CC=NC2=C1 RKJUIXBNRJVNHR-UHFFFAOYSA-N 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 230000002757 inflammatory effect Effects 0.000 description 1
- 238000002329 infrared spectrum Methods 0.000 description 1
- 238000001802 infusion Methods 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 229910001412 inorganic anion Inorganic materials 0.000 description 1
- 108010021309 integrin beta6 Proteins 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 108040006849 interleukin-2 receptor activity proteins Proteins 0.000 description 1
- 201000002313 intestinal cancer Diseases 0.000 description 1
- 210000000936 intestine Anatomy 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 238000010253 intravenous injection Methods 0.000 description 1
- 239000011630 iodine Substances 0.000 description 1
- LNQCUTNLHUQZLR-OZJWLQQPSA-N iridin Chemical compound OC1=C(OC)C(OC)=CC(C=2C(C3=C(O)C(OC)=C(O[C@H]4[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O4)O)C=C3OC=2)=O)=C1 LNQCUTNLHUQZLR-OZJWLQQPSA-N 0.000 description 1
- 229960004768 irinotecan Drugs 0.000 description 1
- 230000007794 irritation Effects 0.000 description 1
- 229940045996 isethionic acid Drugs 0.000 description 1
- 125000002510 isobutoxy group Chemical group [H]C([H])([H])C([H])(C([H])([H])[H])C([H])([H])O* 0.000 description 1
- 125000002462 isocyano group Chemical group *[N+]#[C-] 0.000 description 1
- GWVMLCQWXVFZCN-UHFFFAOYSA-N isoindoline Chemical compound C1=CC=C2CNCC2=C1 GWVMLCQWXVFZCN-UHFFFAOYSA-N 0.000 description 1
- QXJSBBXBKPUZAA-UHFFFAOYSA-N isooleic acid Natural products CCCCCCCC=CCCCCCCCCC(O)=O QXJSBBXBKPUZAA-UHFFFAOYSA-N 0.000 description 1
- 125000001972 isopentyl group Chemical group [H]C([H])([H])C([H])(C([H])([H])[H])C([H])([H])C([H])([H])* 0.000 description 1
- 125000000555 isopropenyl group Chemical group [H]\C([H])=C(\*)C([H])([H])[H] 0.000 description 1
- 125000003253 isopropoxy group Chemical group [H]C([H])([H])C([H])(O*)C([H])([H])[H] 0.000 description 1
- 125000004254 isoquinolin-1-yl group Chemical group [H]C1=C([H])C2=C([H])C([H])=C([H])C([H])=C2C(*)=N1 0.000 description 1
- 125000004552 isoquinolin-4-yl group Chemical group C1=NC=C(C2=CC=CC=C12)* 0.000 description 1
- ZLTPDFXIESTBQG-UHFFFAOYSA-N isothiazole Chemical compound C=1C=NSC=1 ZLTPDFXIESTBQG-UHFFFAOYSA-N 0.000 description 1
- 150000002576 ketones Chemical class 0.000 description 1
- 208000017169 kidney disease Diseases 0.000 description 1
- 229940043355 kinase inhibitor Drugs 0.000 description 1
- 210000004561 lacrimal apparatus Anatomy 0.000 description 1
- 239000004310 lactic acid Substances 0.000 description 1
- 235000014655 lactic acid Nutrition 0.000 description 1
- 229960000448 lactic acid Drugs 0.000 description 1
- 229940099563 lactobionic acid Drugs 0.000 description 1
- 150000002596 lactones Chemical class 0.000 description 1
- 229960004891 lapatinib Drugs 0.000 description 1
- 229940033355 lauric acid Drugs 0.000 description 1
- 239000010410 layer Substances 0.000 description 1
- 239000002523 lectin Substances 0.000 description 1
- 229940115286 lentinan Drugs 0.000 description 1
- 230000003902 lesion Effects 0.000 description 1
- 201000004803 lethal congenital contracture syndrome 2 Diseases 0.000 description 1
- 210000000265 leukocyte Anatomy 0.000 description 1
- GFIJNRVAKGFPGQ-LIJARHBVSA-N leuprolide Chemical compound CCNC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC=1N=CNC=1)NC(=O)[C@H]1NC(=O)CC1)CC1=CC=C(O)C=C1 GFIJNRVAKGFPGQ-LIJARHBVSA-N 0.000 description 1
- 229960004338 leuprorelin Drugs 0.000 description 1
- QDLAGTHXVHQKRE-UHFFFAOYSA-N lichenxanthone Natural products COC1=CC(O)=C2C(=O)C3=C(C)C=C(OC)C=C3OC2=C1 QDLAGTHXVHQKRE-UHFFFAOYSA-N 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 238000004811 liquid chromatography Methods 0.000 description 1
- 229940040692 lithium hydroxide monohydrate Drugs 0.000 description 1
- GLXDVVHUTZTUQK-UHFFFAOYSA-M lithium hydroxide monohydrate Substances [Li+].O.[OH-] GLXDVVHUTZTUQK-UHFFFAOYSA-M 0.000 description 1
- 208000019423 liver disease Diseases 0.000 description 1
- 229960002247 lomustine Drugs 0.000 description 1
- DHMTURDWPRKSOA-RUZDIDTESA-N lonafarnib Chemical compound C1CN(C(=O)N)CCC1CC(=O)N1CCC([C@@H]2C3=C(Br)C=C(Cl)C=C3CCC3=CC(Br)=CN=C32)CC1 DHMTURDWPRKSOA-RUZDIDTESA-N 0.000 description 1
- 239000000314 lubricant Substances 0.000 description 1
- 201000005249 lung adenocarcinoma Diseases 0.000 description 1
- 201000005296 lung carcinoma Diseases 0.000 description 1
- 210000004324 lymphatic system Anatomy 0.000 description 1
- 210000004698 lymphocyte Anatomy 0.000 description 1
- 230000002132 lysosomal effect Effects 0.000 description 1
- 210000002540 macrophage Anatomy 0.000 description 1
- VZCYOOQTPOCHFL-UPHRSURJSA-N maleic acid Chemical compound OC(=O)\C=C/C(O)=O VZCYOOQTPOCHFL-UPHRSURJSA-N 0.000 description 1
- 239000011976 maleic acid Substances 0.000 description 1
- 229940098895 maleic acid Drugs 0.000 description 1
- 239000001630 malic acid Substances 0.000 description 1
- 235000011090 malic acid Nutrition 0.000 description 1
- 229940099690 malic acid Drugs 0.000 description 1
- 230000003211 malignant effect Effects 0.000 description 1
- 229950008612 mannomustine Drugs 0.000 description 1
- 241001515942 marmosets Species 0.000 description 1
- 230000000873 masking effect Effects 0.000 description 1
- 230000008774 maternal effect Effects 0.000 description 1
- WKPWGQKGSOKKOO-RSFHAFMBSA-N maytansine Chemical compound CO[C@@H]([C@@]1(O)C[C@](OC(=O)N1)([C@H]([C@@H]1O[C@@]1(C)[C@@H](OC(=O)[C@H](C)N(C)C(C)=O)CC(=O)N1C)C)[H])\C=C\C=C(C)\CC2=CC(OC)=C(Cl)C1=C2 WKPWGQKGSOKKOO-RSFHAFMBSA-N 0.000 description 1
- 230000010534 mechanism of action Effects 0.000 description 1
- QZIQJVCYUQZDIR-UHFFFAOYSA-N mechlorethamine hydrochloride Chemical compound Cl.ClCCN(C)CCCl QZIQJVCYUQZDIR-UHFFFAOYSA-N 0.000 description 1
- 229960002868 mechlorethamine hydrochloride Drugs 0.000 description 1
- 210000003593 megakaryocyte Anatomy 0.000 description 1
- RQZAXGRLVPAYTJ-GQFGMJRRSA-N megestrol acetate Chemical compound C1=C(C)C2=CC(=O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@@](C(C)=O)(OC(=O)C)[C@@]1(C)CC2 RQZAXGRLVPAYTJ-GQFGMJRRSA-N 0.000 description 1
- 229960004296 megestrol acetate Drugs 0.000 description 1
- 201000001441 melanoma Diseases 0.000 description 1
- 229960001924 melphalan Drugs 0.000 description 1
- SGDBTWWWUNNDEQ-LBPRGKRZSA-N melphalan Chemical compound OC(=O)[C@@H](N)CC1=CC=C(N(CCCl)CCCl)C=C1 SGDBTWWWUNNDEQ-LBPRGKRZSA-N 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- XNEFVTBPCXGIRX-UHFFFAOYSA-N methanesulfinic acid Chemical compound CS(O)=O XNEFVTBPCXGIRX-UHFFFAOYSA-N 0.000 description 1
- 229940098779 methanesulfonic acid Drugs 0.000 description 1
- WSFSSNUMVMOOMR-NJFSPNSNSA-N methanone Chemical compound O=[14CH2] WSFSSNUMVMOOMR-NJFSPNSNSA-N 0.000 description 1
- QRMNENFZDDYDEF-GOSISDBHSA-N methyl (8s)-8-(bromomethyl)-2-methyl-4-(4-methylpiperazine-1-carbonyl)oxy-6-(5,6,7-trimethoxy-1h-indole-2-carbonyl)-7,8-dihydro-3h-pyrrolo[3,2-e]indole-1-carboxylate Chemical compound C1([C@H](CBr)CN(C1=C1)C(=O)C=2NC3=C(OC)C(OC)=C(OC)C=C3C=2)=C2C(C(=O)OC)=C(C)NC2=C1OC(=O)N1CCN(C)CC1 QRMNENFZDDYDEF-GOSISDBHSA-N 0.000 description 1
- 150000005217 methyl ethers Chemical class 0.000 description 1
- WBYWAXJHAXSJNI-UHFFFAOYSA-N methyl p-hydroxycinnamate Natural products OC(=O)C=CC1=CC=CC=C1 WBYWAXJHAXSJNI-UHFFFAOYSA-N 0.000 description 1
- GYNNXHKOJHMOHS-UHFFFAOYSA-N methyl-cycloheptane Natural products CC1CCCCCC1 GYNNXHKOJHMOHS-UHFFFAOYSA-N 0.000 description 1
- VNXBKJFUJUWOCW-UHFFFAOYSA-N methylcyclopropane Chemical compound CC1CC1 VNXBKJFUJUWOCW-UHFFFAOYSA-N 0.000 description 1
- HPNSFSBZBAHARI-UHFFFAOYSA-N micophenolic acid Natural products OC1=C(CC=C(C)CCC(O)=O)C(OC)=C(C)C2=C1C(=O)OC2 HPNSFSBZBAHARI-UHFFFAOYSA-N 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 239000011859 microparticle Substances 0.000 description 1
- 230000005012 migration Effects 0.000 description 1
- 238000013508 migration Methods 0.000 description 1
- 229950003734 milatuzumab Drugs 0.000 description 1
- 239000002480 mineral oil Substances 0.000 description 1
- 235000010446 mineral oil Nutrition 0.000 description 1
- 150000007522 mineralic acids Chemical class 0.000 description 1
- 229960005485 mitobronitol Drugs 0.000 description 1
- MXWHMTNPTTVWDM-NXOFHUPFSA-N mitoguazone Chemical compound NC(N)=N\N=C(/C)\C=N\N=C(N)N MXWHMTNPTTVWDM-NXOFHUPFSA-N 0.000 description 1
- 229960003539 mitoguazone Drugs 0.000 description 1
- VFKZTMPDYBFSTM-GUCUJZIJSA-N mitolactol Chemical compound BrC[C@H](O)[C@@H](O)[C@@H](O)[C@H](O)CBr VFKZTMPDYBFSTM-GUCUJZIJSA-N 0.000 description 1
- 229950010913 mitolactol Drugs 0.000 description 1
- 229960004857 mitomycin Drugs 0.000 description 1
- 229960000350 mitotane Drugs 0.000 description 1
- 239000003607 modifier Substances 0.000 description 1
- 230000009149 molecular binding Effects 0.000 description 1
- 150000004682 monohydrates Chemical class 0.000 description 1
- 125000004573 morpholin-4-yl group Chemical group N1(CCOCC1)* 0.000 description 1
- 125000006518 morpholino carbonyl group Chemical group [H]C1([H])OC([H])([H])C([H])([H])N(C(*)=O)C1([H])[H] 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 206010028417 myasthenia gravis Diseases 0.000 description 1
- HPNSFSBZBAHARI-RUDMXATFSA-N mycophenolic acid Chemical compound OC1=C(C\C=C(/C)CCC(O)=O)C(OC)=C(C)C2=C1C(=O)OC2 HPNSFSBZBAHARI-RUDMXATFSA-N 0.000 description 1
- 229960000951 mycophenolic acid Drugs 0.000 description 1
- 201000000050 myeloid neoplasm Diseases 0.000 description 1
- WQEPLUUGTLDZJY-UHFFFAOYSA-N n-Pentadecanoic acid Natural products CCCCCCCCCCCCCCC(O)=O WQEPLUUGTLDZJY-UHFFFAOYSA-N 0.000 description 1
- BRZOTEHEMOQUOY-UHFFFAOYSA-N n-[bis(aziridin-1-yl)phosphoryl]benzamide Chemical compound C=1C=CC=CC=1C(=O)NP(=O)(N1CC1)N1CC1 BRZOTEHEMOQUOY-UHFFFAOYSA-N 0.000 description 1
- ZOGSYTPLAROSTP-UHFFFAOYSA-N n-[bis(ethylamino)phosphoryl]ethanamine Chemical compound CCNP(=O)(NCC)NCC ZOGSYTPLAROSTP-UHFFFAOYSA-N 0.000 description 1
- 125000006606 n-butoxy group Chemical group 0.000 description 1
- DIOQZVSQGTUSAI-UHFFFAOYSA-N n-butylhexane Natural products CCCCCCCCCC DIOQZVSQGTUSAI-UHFFFAOYSA-N 0.000 description 1
- 125000003136 n-heptyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])* 0.000 description 1
- 125000000740 n-pentyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])* 0.000 description 1
- 125000003506 n-propoxy group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])O* 0.000 description 1
- KVBGVZZKJNLNJU-UHFFFAOYSA-M naphthalene-2-sulfonate Chemical compound C1=CC=CC2=CC(S(=O)(=O)[O-])=CC=C21 KVBGVZZKJNLNJU-UHFFFAOYSA-M 0.000 description 1
- 229960005027 natalizumab Drugs 0.000 description 1
- 229930014626 natural product Natural products 0.000 description 1
- CTMCWCONSULRHO-UHQPFXKFSA-N nemorubicin Chemical compound C1CO[C@H](OC)CN1[C@@H]1[C@H](O)[C@H](C)O[C@@H](O[C@@H]2C3=C(O)C=4C(=O)C5=C(OC)C=CC=C5C(=O)C=4C(O)=C3C[C@](O)(C2)C(=O)CO)C1 CTMCWCONSULRHO-UHQPFXKFSA-N 0.000 description 1
- 229950010159 nemorubicin Drugs 0.000 description 1
- 125000001971 neopentyl group Chemical group [H]C([*])([H])C(C([H])([H])[H])(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 230000004766 neurogenesis Effects 0.000 description 1
- 208000008795 neuromyelitis optica Diseases 0.000 description 1
- XWXYUMMDTVBTOU-UHFFFAOYSA-N nilutamide Chemical compound O=C1C(C)(C)NC(=O)N1C1=CC=C([N+]([O-])=O)C(C(F)(F)F)=C1 XWXYUMMDTVBTOU-UHFFFAOYSA-N 0.000 description 1
- 229960002653 nilutamide Drugs 0.000 description 1
- VFEDRRNHLBGPNN-UHFFFAOYSA-N nimustine Chemical compound CC1=NC=C(CNC(=O)N(CCCl)N=O)C(N)=N1 VFEDRRNHLBGPNN-UHFFFAOYSA-N 0.000 description 1
- 229960001420 nimustine Drugs 0.000 description 1
- IHRSXGONVFFQQF-SDXDJHTJSA-N nitrazine Chemical compound OS(=O)(=O)C1=CC2=CC(S(O)(=O)=O)=CC=C2C(=O)\C1=N/NC1=CC=C([N+]([O-])=O)C=C1[N+]([O-])=O IHRSXGONVFFQQF-SDXDJHTJSA-N 0.000 description 1
- 229910017604 nitric acid Inorganic materials 0.000 description 1
- 239000012299 nitrogen atmosphere Substances 0.000 description 1
- 108020001162 nitroreductase Proteins 0.000 description 1
- 208000002154 non-small cell lung carcinoma Diseases 0.000 description 1
- 125000005151 nonafluorobutanesulfonyl group Chemical group FC(C(C(S(=O)(=O)*)(F)F)(F)F)(C(F)(F)F)F 0.000 description 1
- 231100000252 nontoxic Toxicity 0.000 description 1
- 230000003000 nontoxic effect Effects 0.000 description 1
- UMRZSTCPUPJPOJ-KNVOCYPGSA-N norbornane Chemical compound C1C[C@H]2CC[C@@H]1C2 UMRZSTCPUPJPOJ-KNVOCYPGSA-N 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 150000007523 nucleic acids Chemical class 0.000 description 1
- 239000012038 nucleophile Substances 0.000 description 1
- 108010027581 nucleoside triphosphate pyrophosphatase Proteins 0.000 description 1
- NIHNNTQXNPWCJQ-UHFFFAOYSA-N o-biphenylenemethane Natural products C1=CC=C2CC3=CC=CC=C3C2=C1 NIHNNTQXNPWCJQ-UHFFFAOYSA-N 0.000 description 1
- QIQXTHQIDYTFRH-UHFFFAOYSA-N octadecanoic acid Chemical compound CCCCCCCCCCCCCCCCCC(O)=O QIQXTHQIDYTFRH-UHFFFAOYSA-N 0.000 description 1
- OQCDKBAXFALNLD-UHFFFAOYSA-N octadecanoic acid Natural products CCCCCCCC(C)CCCCCCCCC(O)=O OQCDKBAXFALNLD-UHFFFAOYSA-N 0.000 description 1
- ZQPPMHVWECSIRJ-KTKRTIGZSA-N oleic acid Chemical compound CCCCCCCC\C=C/CCCCCCCC(O)=O ZQPPMHVWECSIRJ-KTKRTIGZSA-N 0.000 description 1
- 235000021313 oleic acid Nutrition 0.000 description 1
- CZDBNBLGZNWKMC-MWQNXGTOSA-N olivomycin Chemical compound O([C@@H]1C[C@@H](O[C@H](C)[C@@H]1O)OC=1C=C2C=C3C[C@H]([C@@H](C(=O)C3=C(O)C2=C(O)C=1)O[C@H]1O[C@@H](C)[C@H](O)[C@@H](OC2O[C@@H](C)[C@H](O)[C@@H](O)C2)C1)[C@H](OC)C(=O)[C@@H](O)[C@@H](C)O)[C@H]1C[C@H](O)[C@H](OC)[C@H](C)O1 CZDBNBLGZNWKMC-MWQNXGTOSA-N 0.000 description 1
- 229950005848 olivomycin Drugs 0.000 description 1
- 229950011093 onapristone Drugs 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 150000007524 organic acids Chemical class 0.000 description 1
- 235000005985 organic acids Nutrition 0.000 description 1
- 239000003960 organic solvent Substances 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 201000008482 osteoarthritis Diseases 0.000 description 1
- 208000008798 osteoma Diseases 0.000 description 1
- 201000008968 osteosarcoma Diseases 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 150000005063 oxadiazines Chemical class 0.000 description 1
- GHCAUEMXBSLMGU-UHFFFAOYSA-N oxadiazole;1,2,5-oxadiazole Chemical compound C=1C=NON=1.C1=CON=N1 GHCAUEMXBSLMGU-UHFFFAOYSA-N 0.000 description 1
- 235000006408 oxalic acid Nutrition 0.000 description 1
- DWAFYCQODLXJNR-BNTLRKBRSA-L oxaliplatin Chemical compound O1C(=O)C(=O)O[Pt]11N[C@@H]2CCCC[C@H]2N1 DWAFYCQODLXJNR-BNTLRKBRSA-L 0.000 description 1
- 229960001756 oxaliplatin Drugs 0.000 description 1
- IVMHDOBGNQOUHO-UHFFFAOYSA-N oxathiane Chemical compound C1CCSOC1 IVMHDOBGNQOUHO-UHFFFAOYSA-N 0.000 description 1
- AZHVQJLDOFKHPZ-UHFFFAOYSA-N oxathiazine Chemical class O1SN=CC=C1 AZHVQJLDOFKHPZ-UHFFFAOYSA-N 0.000 description 1
- CQDAMYNQINDRQC-UHFFFAOYSA-N oxatriazole Chemical compound C1=NN=NO1 CQDAMYNQINDRQC-UHFFFAOYSA-N 0.000 description 1
- SFJGCXYXEFWEBK-UHFFFAOYSA-N oxazepine Chemical compound O1C=CC=CC=N1 SFJGCXYXEFWEBK-UHFFFAOYSA-N 0.000 description 1
- OZQGLZFAWYKKLQ-UHFFFAOYSA-N oxazinane Chemical class C1CCONC1 OZQGLZFAWYKKLQ-UHFFFAOYSA-N 0.000 description 1
- 150000004893 oxazines Chemical class 0.000 description 1
- AHHWIHXENZJRFG-UHFFFAOYSA-N oxetane Chemical compound C1COC1 AHHWIHXENZJRFG-UHFFFAOYSA-N 0.000 description 1
- 150000002923 oximes Chemical class 0.000 description 1
- AUONHKJOIZSQGR-UHFFFAOYSA-N oxophosphane Chemical compound P=O AUONHKJOIZSQGR-UHFFFAOYSA-N 0.000 description 1
- 125000005740 oxycarbonyl group Chemical group [*:1]OC([*:2])=O 0.000 description 1
- 239000001301 oxygen Substances 0.000 description 1
- 125000004430 oxygen atom Chemical group O* 0.000 description 1
- 125000003854 p-chlorophenyl group Chemical group [H]C1=C([H])C(*)=C([H])C([H])=C1Cl 0.000 description 1
- 125000000636 p-nitrophenyl group Chemical group [H]C1=C([H])C(=C([H])C([H])=C1*)[N+]([O-])=O 0.000 description 1
- 229960000402 palivizumab Drugs 0.000 description 1
- WLJNZVDCPSBLRP-UHFFFAOYSA-N pamoic acid Chemical compound C1=CC=C2C(CC=3C4=CC=CC=C4C=C(C=3O)C(=O)O)=C(O)C(C(O)=O)=CC2=C1 WLJNZVDCPSBLRP-UHFFFAOYSA-N 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 239000011713 pantothenic acid Substances 0.000 description 1
- 229940055726 pantothenic acid Drugs 0.000 description 1
- 235000019161 pantothenic acid Nutrition 0.000 description 1
- FJKROLUGYXJWQN-UHFFFAOYSA-N papa-hydroxy-benzoic acid Natural products OC(=O)C1=CC=C(O)C=C1 FJKROLUGYXJWQN-UHFFFAOYSA-N 0.000 description 1
- 238000007911 parenteral administration Methods 0.000 description 1
- 230000001575 pathological effect Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- CUIHSIWYWATEQL-UHFFFAOYSA-N pazopanib Chemical compound C1=CC2=C(C)N(C)N=C2C=C1N(C)C(N=1)=CC=NC=1NC1=CC=C(C)C(S(N)(=O)=O)=C1 CUIHSIWYWATEQL-UHFFFAOYSA-N 0.000 description 1
- 229960000639 pazopanib Drugs 0.000 description 1
- 235000020232 peanut Nutrition 0.000 description 1
- 201000001976 pemphigus vulgaris Diseases 0.000 description 1
- 208000030940 penile carcinoma Diseases 0.000 description 1
- 201000008174 penis carcinoma Diseases 0.000 description 1
- 125000004817 pentamethylene group Chemical group [H]C([H])([*:2])C([H])([H])C([H])([H])C([H])([H])C([H])([H])[*:1] 0.000 description 1
- 125000002255 pentenyl group Chemical group C(=CCCC)* 0.000 description 1
- 229960002340 pentostatin Drugs 0.000 description 1
- FPVKHBSQESCIEP-JQCXWYLXSA-N pentostatin Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(N=CNC[C@H]2O)=C2N=C1 FPVKHBSQESCIEP-JQCXWYLXSA-N 0.000 description 1
- 201000002628 peritoneum cancer Diseases 0.000 description 1
- 230000035699 permeability Effects 0.000 description 1
- 230000008782 phagocytosis Effects 0.000 description 1
- 239000000825 pharmaceutical preparation Substances 0.000 description 1
- 229940127557 pharmaceutical product Drugs 0.000 description 1
- 238000009521 phase II clinical trial Methods 0.000 description 1
- NQFOGDIWKQWFMN-UHFFFAOYSA-N phenalene Chemical compound C1=CC([CH]C=C2)=C3C2=CC=CC3=C1 NQFOGDIWKQWFMN-UHFFFAOYSA-N 0.000 description 1
- 229950000688 phenothiazine Drugs 0.000 description 1
- GJSGGHOYGKMUPT-UHFFFAOYSA-N phenoxathiine Chemical compound C1=CC=C2OC3=CC=CC=C3SC2=C1 GJSGGHOYGKMUPT-UHFFFAOYSA-N 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N phenylalanine group Chemical group N[C@@H](CC1=CC=CC=C1)C(=O)O COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- 125000003170 phenylsulfonyl group Chemical group C1(=CC=CC=C1)S(=O)(=O)* 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 150000008301 phosphite esters Chemical class 0.000 description 1
- UEZVMMHDMIWARA-UHFFFAOYSA-M phosphonate Chemical compound [O-]P(=O)=O UEZVMMHDMIWARA-UHFFFAOYSA-M 0.000 description 1
- 125000001476 phosphono group Chemical group [H]OP(*)(=O)O[H] 0.000 description 1
- 229910000073 phosphorus hydride Inorganic materials 0.000 description 1
- LFGREXWGYUGZLY-UHFFFAOYSA-N phosphoryl Chemical group [P]=O LFGREXWGYUGZLY-UHFFFAOYSA-N 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 239000003757 phosphotransferase inhibitor Substances 0.000 description 1
- 239000003504 photosensitizing agent Substances 0.000 description 1
- LFSXCDWNBUNEEM-UHFFFAOYSA-N phthalazine Chemical compound C1=NN=CC2=CC=CC=C21 LFSXCDWNBUNEEM-UHFFFAOYSA-N 0.000 description 1
- 125000005545 phthalimidyl group Chemical group 0.000 description 1
- 239000002504 physiological saline solution Substances 0.000 description 1
- 229960000952 pipobroman Drugs 0.000 description 1
- NJBFOOCLYDNZJN-UHFFFAOYSA-N pipobroman Chemical compound BrCCC(=O)N1CCN(C(=O)CCBr)CC1 NJBFOOCLYDNZJN-UHFFFAOYSA-N 0.000 description 1
- NUKCGLDCWQXYOQ-UHFFFAOYSA-N piposulfan Chemical compound CS(=O)(=O)OCCC(=O)N1CCN(C(=O)CCOS(C)(=O)=O)CC1 NUKCGLDCWQXYOQ-UHFFFAOYSA-N 0.000 description 1
- 229950001100 piposulfan Drugs 0.000 description 1
- 229960001221 pirarubicin Drugs 0.000 description 1
- 150000003057 platinum Chemical class 0.000 description 1
- CLSUSRZJUQMOHH-UHFFFAOYSA-L platinum dichloride Chemical compound Cl[Pt]Cl CLSUSRZJUQMOHH-UHFFFAOYSA-L 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 208000005987 polymyositis Diseases 0.000 description 1
- 230000007824 polyneuropathy Effects 0.000 description 1
- 229920005862 polyol Polymers 0.000 description 1
- 150000003077 polyols Chemical class 0.000 description 1
- 229920001282 polysaccharide Chemical class 0.000 description 1
- 239000005017 polysaccharide Chemical class 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 229960004694 prednimustine Drugs 0.000 description 1
- 238000002953 preparative HPLC Methods 0.000 description 1
- 201000000742 primary sclerosing cholangitis Diseases 0.000 description 1
- CPTBDICYNRMXFX-UHFFFAOYSA-N procarbazine Chemical class CNNCC1=CC=C(C(=O)NC(C)C)C=C1 CPTBDICYNRMXFX-UHFFFAOYSA-N 0.000 description 1
- 229960000624 procarbazine Drugs 0.000 description 1
- 229940002612 prodrug Drugs 0.000 description 1
- 239000000651 prodrug Substances 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- 125000004368 propenyl group Chemical group C(=CC)* 0.000 description 1
- 238000011321 prophylaxis Methods 0.000 description 1
- 235000019260 propionic acid Nutrition 0.000 description 1
- 229940095574 propionic acid Drugs 0.000 description 1
- 125000001501 propionyl group Chemical group O=C([*])C([H])([H])C([H])([H])[H] 0.000 description 1
- QQONPFPTGQHPMA-UHFFFAOYSA-N propylene Natural products CC=C QQONPFPTGQHPMA-UHFFFAOYSA-N 0.000 description 1
- 125000004805 propylene group Chemical group [H]C([H])([H])C([H])([*:1])C([H])([H])[*:2] 0.000 description 1
- 208000023958 prostate neoplasm Diseases 0.000 description 1
- 235000019833 protease Nutrition 0.000 description 1
- 239000003909 protein kinase inhibitor Substances 0.000 description 1
- 238000000425 proton nuclear magnetic resonance spectrum Methods 0.000 description 1
- CPNGPNLZQNNVQM-UHFFFAOYSA-N pteridine Chemical compound N1=CN=CC2=NC=CN=C21 CPNGPNLZQNNVQM-UHFFFAOYSA-N 0.000 description 1
- 125000001107 pteroylglutamyl group Chemical group 0.000 description 1
- WOLQREOUPKZMEX-UHFFFAOYSA-N pteroyltriglutamic acid Chemical compound C=1N=C2NC(N)=NC(=O)C2=NC=1CNC1=CC=C(C(=O)NC(CCC(=O)NC(CCC(=O)NC(CCC(O)=O)C(O)=O)C(O)=O)C(O)=O)C=C1 WOLQREOUPKZMEX-UHFFFAOYSA-N 0.000 description 1
- 150000003212 purines Chemical class 0.000 description 1
- 229950010131 puromycin Drugs 0.000 description 1
- 150000003214 pyranose derivatives Chemical class 0.000 description 1
- CRTBNOWPBHJICM-UHFFFAOYSA-N pyrazine Chemical compound C1=CN=CC=N1.C1=CN=CC=N1 CRTBNOWPBHJICM-UHFFFAOYSA-N 0.000 description 1
- IOXGEAHHEGTLMQ-UHFFFAOYSA-N pyridazine Chemical compound C1=CC=NN=C1.C1=CC=NN=C1 IOXGEAHHEGTLMQ-UHFFFAOYSA-N 0.000 description 1
- YMXFJTUQQVLJEN-UHFFFAOYSA-N pyrimidine Chemical compound C1=CN=CN=C1.C1=CN=CN=C1 YMXFJTUQQVLJEN-UHFFFAOYSA-N 0.000 description 1
- 150000003230 pyrimidines Chemical class 0.000 description 1
- 125000000719 pyrrolidinyl group Chemical group 0.000 description 1
- 150000003236 pyrrolines Chemical class 0.000 description 1
- 229940107700 pyruvic acid Drugs 0.000 description 1
- JWVCLYRUEFBMGU-UHFFFAOYSA-N quinazoline Chemical compound N1=CN=CC2=CC=CC=C21 JWVCLYRUEFBMGU-UHFFFAOYSA-N 0.000 description 1
- IUVKMZGDUIUOCP-BTNSXGMBSA-N quinbolone Chemical compound O([C@H]1CC[C@H]2[C@H]3[C@@H]([C@]4(C=CC(=O)C=C4CC3)C)CC[C@@]21C)C1=CCCC1 IUVKMZGDUIUOCP-BTNSXGMBSA-N 0.000 description 1
- 125000004159 quinolin-2-yl group Chemical group [H]C1=C([H])C([H])=C2C([H])=C([H])C(*)=NC2=C1[H] 0.000 description 1
- 125000004549 quinolin-4-yl group Chemical group N1=CC=C(C2=CC=CC=C12)* 0.000 description 1
- AUJXLBOHYWTPFV-UHFFFAOYSA-N quinomycin A Natural products CN1C(=O)C(C)NC(=O)C(NC(=O)C=2N=C3C=CC=CC3=NC=2)COC(=O)C(C(C)C)N(C)C(=O)C2N(C)C(=O)C(C)NC(=O)C(NC(=O)C=3N=C4C=CC=CC4=NC=3)COC(=O)C(C(C)C)N(C)C(=O)C1CSC2SC AUJXLBOHYWTPFV-UHFFFAOYSA-N 0.000 description 1
- MKWQJYNEKZKCSA-UHFFFAOYSA-N quinoxaline Chemical compound N1=C=C=NC2=CC=CC=C21 MKWQJYNEKZKCSA-UHFFFAOYSA-N 0.000 description 1
- UOWVMDUEMSNCAV-WYENRQIDSA-N rachelmycin Chemical compound C1([C@]23C[C@@H]2CN1C(=O)C=1NC=2C(OC)=C(O)C4=C(C=2C=1)CCN4C(=O)C1=CC=2C=4CCN(C=4C(O)=C(C=2N1)OC)C(N)=O)=CC(=O)C1=C3C(C)=CN1 UOWVMDUEMSNCAV-WYENRQIDSA-N 0.000 description 1
- 230000005855 radiation Effects 0.000 description 1
- 238000001959 radiotherapy Methods 0.000 description 1
- 229960003876 ranibizumab Drugs 0.000 description 1
- 229960002185 ranimustine Drugs 0.000 description 1
- 230000035484 reaction time Effects 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 206010038038 rectal cancer Diseases 0.000 description 1
- 201000001275 rectum cancer Diseases 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 125000006853 reporter group Chemical group 0.000 description 1
- QEVHRUUCFGRFIF-MDEJGZGSSA-N reserpine Chemical compound O([C@H]1[C@@H]([C@H]([C@H]2C[C@@H]3C4=C(C5=CC=C(OC)C=C5N4)CCN3C[C@H]2C1)C(=O)OC)OC)C(=O)C1=CC(OC)=C(OC)C(OC)=C1 QEVHRUUCFGRFIF-MDEJGZGSSA-N 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 229930002330 retinoic acid Natural products 0.000 description 1
- 238000004366 reverse phase liquid chromatography Methods 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 238000006798 ring closing metathesis reaction Methods 0.000 description 1
- 229960005224 roxithromycin Drugs 0.000 description 1
- VHXNKPBCCMUMSW-FQEVSTJZSA-N rubitecan Chemical compound C1=CC([N+]([O-])=O)=C2C=C(CN3C4=CC5=C(C3=O)COC(=O)[C@]5(O)CC)C4=NC2=C1 VHXNKPBCCMUMSW-FQEVSTJZSA-N 0.000 description 1
- 229960004889 salicylic acid Drugs 0.000 description 1
- 201000003804 salivary gland carcinoma Diseases 0.000 description 1
- 229930182947 sarcodictyin Natural products 0.000 description 1
- 208000010157 sclerosing cholangitis Diseases 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 125000005920 sec-butoxy group Chemical group 0.000 description 1
- 125000000467 secondary amino group Chemical group [H]N([*:1])[*:2] 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 229950008834 seribantumab Drugs 0.000 description 1
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 1
- 238000007493 shaping process Methods 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 229910001923 silver oxide Inorganic materials 0.000 description 1
- 229960005456 sisomicin Drugs 0.000 description 1
- URWAJWIAIPFPJE-YFMIWBNJSA-N sisomycin Chemical compound O1C[C@@](O)(C)[C@H](NC)[C@@H](O)[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@@H](CC=C(CN)O2)N)[C@@H](N)C[C@H]1N URWAJWIAIPFPJE-YFMIWBNJSA-N 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 238000001542 size-exclusion chromatography Methods 0.000 description 1
- 210000003491 skin Anatomy 0.000 description 1
- 208000017520 skin disease Diseases 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 229910052938 sodium sulfate Inorganic materials 0.000 description 1
- 235000010339 sodium tetraborate Nutrition 0.000 description 1
- VBDRTGFACFYFCT-UHFFFAOYSA-M sodium;hydroxy-[(1r)-1-hydroxy-3-[methyl(pentyl)amino]-1-phosphonopropyl]phosphinate;hydrate Chemical compound O.[Na+].CCCCCN(C)CCC(O)(P(O)(O)=O)P(O)([O-])=O VBDRTGFACFYFCT-UHFFFAOYSA-M 0.000 description 1
- 239000011877 solvent mixture Substances 0.000 description 1
- 108010064556 somatostatin receptor subtype-4 Proteins 0.000 description 1
- 108010082379 somatostatin receptor type 1 Proteins 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 238000010183 spectrum analysis Methods 0.000 description 1
- 229950006315 spirogermanium Drugs 0.000 description 1
- ICXJVZHDZFXYQC-UHFFFAOYSA-N spongistatin 1 Natural products OC1C(O2)(O)CC(O)C(C)C2CCCC=CC(O2)CC(O)CC2(O2)CC(OC)CC2CC(=O)C(C)C(OC(C)=O)C(C)C(=C)CC(O2)CC(C)(O)CC2(O2)CC(OC(C)=O)CC2CC(=O)OC2C(O)C(CC(=C)CC(O)C=CC(Cl)=C)OC1C2C ICXJVZHDZFXYQC-UHFFFAOYSA-N 0.000 description 1
- 239000008117 stearic acid Substances 0.000 description 1
- 229960004274 stearic acid Drugs 0.000 description 1
- ZSJLQEPLLKMAKR-GKHCUFPYSA-N streptozocin Chemical compound O=NN(C)C(=O)N[C@H]1[C@@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O ZSJLQEPLLKMAKR-GKHCUFPYSA-N 0.000 description 1
- 229960001052 streptozocin Drugs 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 238000010254 subcutaneous injection Methods 0.000 description 1
- 239000007929 subcutaneous injection Substances 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 229960002317 succinimide Drugs 0.000 description 1
- 229950000244 sulfanilic acid Drugs 0.000 description 1
- BUUPQKDIAURBJP-UHFFFAOYSA-N sulfinic acid Chemical compound OS=O BUUPQKDIAURBJP-UHFFFAOYSA-N 0.000 description 1
- 125000001010 sulfinic acid amide group Chemical group 0.000 description 1
- 125000000213 sulfino group Chemical group [H]OS(*)=O 0.000 description 1
- 125000000475 sulfinyl group Chemical group [*:2]S([*:1])=O 0.000 description 1
- 125000000020 sulfo group Chemical group O=S(=O)([*])O[H] 0.000 description 1
- 229940124530 sulfonamide Drugs 0.000 description 1
- 125000000565 sulfonamide group Chemical group 0.000 description 1
- 150000003456 sulfonamides Chemical class 0.000 description 1
- BDHFUVZGWQCTTF-UHFFFAOYSA-M sulfonate Chemical compound [O-]S(=O)=O BDHFUVZGWQCTTF-UHFFFAOYSA-M 0.000 description 1
- 125000001174 sulfone group Chemical group 0.000 description 1
- 150000003459 sulfonic acid esters Chemical class 0.000 description 1
- 125000000472 sulfonyl group Chemical group *S(*)(=O)=O 0.000 description 1
- 150000003462 sulfoxides Chemical class 0.000 description 1
- 125000004434 sulfur atom Chemical group 0.000 description 1
- WINHZLLDWRZWRT-ATVHPVEESA-N sunitinib Chemical compound CCN(CC)CCNC(=O)C1=C(C)NC(\C=C/2C3=CC(F)=CC=C3NC\2=O)=C1C WINHZLLDWRZWRT-ATVHPVEESA-N 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 230000020382 suppression by virus of host antigen processing and presentation of peptide antigen via MHC class I Effects 0.000 description 1
- 239000004094 surface-active agent Substances 0.000 description 1
- 238000001356 surgical procedure Methods 0.000 description 1
- 208000032649 susceptibility to 9 autism Diseases 0.000 description 1
- 239000000375 suspending agent Substances 0.000 description 1
- 239000003765 sweetening agent Substances 0.000 description 1
- 210000000225 synapse Anatomy 0.000 description 1
- 230000005062 synaptic transmission Effects 0.000 description 1
- 208000011580 syndromic disease Diseases 0.000 description 1
- 238000010189 synthetic method Methods 0.000 description 1
- 201000000596 systemic lupus erythematosus Diseases 0.000 description 1
- FQZYTYWMLGAPFJ-OQKDUQJOSA-N tamoxifen citrate Chemical compound [H+].[H+].[H+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O.C=1C=CC=CC=1C(/CC)=C(C=1C=CC(OCCN(C)C)=CC=1)/C1=CC=CC=C1 FQZYTYWMLGAPFJ-OQKDUQJOSA-N 0.000 description 1
- 229960003454 tamoxifen citrate Drugs 0.000 description 1
- LRBQNJMCXXYXIU-NRMVVENXSA-N tannic acid Chemical compound OC1=C(O)C(O)=CC(C(=O)OC=2C(=C(O)C=C(C=2)C(=O)OC[C@@H]2[C@H]([C@H](OC(=O)C=3C=C(OC(=O)C=4C=C(O)C(O)=C(O)C=4)C(O)=C(O)C=3)[C@@H](OC(=O)C=3C=C(OC(=O)C=4C=C(O)C(O)=C(O)C=4)C(O)=C(O)C=3)[C@@H](OC(=O)C=3C=C(OC(=O)C=4C=C(O)C(O)=C(O)C=4)C(O)=C(O)C=3)O2)OC(=O)C=2C=C(OC(=O)C=3C=C(O)C(O)=C(O)C=3)C(O)=C(O)C=2)O)=C1 LRBQNJMCXXYXIU-NRMVVENXSA-N 0.000 description 1
- 229920002258 tannic acid Polymers 0.000 description 1
- 235000015523 tannic acid Nutrition 0.000 description 1
- 229940033123 tannic acid Drugs 0.000 description 1
- 238000002626 targeted therapy Methods 0.000 description 1
- 235000002906 tartaric acid Nutrition 0.000 description 1
- 239000011975 tartaric acid Substances 0.000 description 1
- 229960001367 tartaric acid Drugs 0.000 description 1
- 125000003666 tauryl group Chemical group [H]N([H])C([H])([H])C([H])([H])S(*)(=O)=O 0.000 description 1
- 229960004964 temozolomide Drugs 0.000 description 1
- 210000002435 tendon Anatomy 0.000 description 1
- NRUKOCRGYNPUPR-QBPJDGROSA-N teniposide Chemical compound COC1=C(O)C(OC)=CC([C@@H]2C3=CC=4OCOC=4C=C3[C@@H](O[C@H]3[C@@H]([C@@H](O)[C@@H]4O[C@@H](OC[C@H]4O3)C=3SC=CC=3)O)[C@@H]3[C@@H]2C(OC3)=O)=C1 NRUKOCRGYNPUPR-QBPJDGROSA-N 0.000 description 1
- 229960001278 teniposide Drugs 0.000 description 1
- 125000004213 tert-butoxy group Chemical group [H]C([H])([H])C(O*)(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 125000000037 tert-butyldiphenylsilyl group Chemical group [H]C1=C([H])C([H])=C([H])C([H])=C1[Si]([H])([*]C(C([H])([H])[H])(C([H])([H])[H])C([H])([H])[H])C1=C([H])C([H])=C([H])C([H])=C1[H] 0.000 description 1
- 150000003512 tertiary amines Chemical class 0.000 description 1
- 125000001302 tertiary amino group Chemical group 0.000 description 1
- 201000003120 testicular cancer Diseases 0.000 description 1
- 210000001550 testis Anatomy 0.000 description 1
- BPEWUONYVDABNZ-DZBHQSCQSA-N testolactone Chemical compound O=C1C=C[C@]2(C)[C@H]3CC[C@](C)(OC(=O)CC4)[C@@H]4[C@@H]3CCC2=C1 BPEWUONYVDABNZ-DZBHQSCQSA-N 0.000 description 1
- 229960005353 testolactone Drugs 0.000 description 1
- IFLREYGFSNHWGE-UHFFFAOYSA-N tetracene Chemical compound C1=CC=CC2=CC3=CC4=CC=CC=C4C=C3C=C21 IFLREYGFSNHWGE-UHFFFAOYSA-N 0.000 description 1
- RGYLYUZOGHTBRF-BIHRQFPBSA-N tetragastrin Chemical compound C([C@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CCSC)C(N)=O)C1=CC=CC=C1 RGYLYUZOGHTBRF-BIHRQFPBSA-N 0.000 description 1
- WHRNULOCNSKMGB-UHFFFAOYSA-N tetrahydrofuran thf Chemical compound C1CCOC1.C1CCOC1 WHRNULOCNSKMGB-UHFFFAOYSA-N 0.000 description 1
- RAOIDOHSFRTOEL-UHFFFAOYSA-N tetrahydrothiophene Chemical compound C1CCSC1 RAOIDOHSFRTOEL-UHFFFAOYSA-N 0.000 description 1
- 125000000383 tetramethylene group Chemical group [H]C([H])([*:1])C([H])([H])C([H])([H])C([H])([H])[*:2] 0.000 description 1
- 150000003536 tetrazoles Chemical class 0.000 description 1
- 125000003831 tetrazolyl group Chemical group 0.000 description 1
- GVIJJXMXTUZIOD-UHFFFAOYSA-N thianthrene Chemical compound C1=CC=C2SC3=CC=CC=C3SC2=C1 GVIJJXMXTUZIOD-UHFFFAOYSA-N 0.000 description 1
- CBDKQYKMCICBOF-UHFFFAOYSA-N thiazoline Chemical compound C1CN=CS1 CBDKQYKMCICBOF-UHFFFAOYSA-N 0.000 description 1
- 239000002562 thickening agent Substances 0.000 description 1
- XSROQCDVUIHRSI-UHFFFAOYSA-N thietane Chemical compound C1CSC1 XSROQCDVUIHRSI-UHFFFAOYSA-N 0.000 description 1
- 150000003566 thiocarboxylic acids Chemical class 0.000 description 1
- 125000000858 thiocyanato group Chemical group *SC#N 0.000 description 1
- 125000005031 thiocyano group Chemical group S(C#N)* 0.000 description 1
- ATGUDZODTABURZ-UHFFFAOYSA-N thiolan-2-ylideneazanium;chloride Chemical compound Cl.N=C1CCCS1 ATGUDZODTABURZ-UHFFFAOYSA-N 0.000 description 1
- BRNULMACUQOKMR-UHFFFAOYSA-N thiomorpholine Chemical compound C1CSCCN1 BRNULMACUQOKMR-UHFFFAOYSA-N 0.000 description 1
- 125000005505 thiomorpholino group Chemical group 0.000 description 1
- 125000004568 thiomorpholinyl group Chemical group 0.000 description 1
- WEMNATFLVGEPEW-UHFFFAOYSA-N thiophene Chemical compound C=1C=CSC=1.C=1C=CSC=1 WEMNATFLVGEPEW-UHFFFAOYSA-N 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 201000002510 thyroid cancer Diseases 0.000 description 1
- PLHJCIYEEKOWNM-HHHXNRCGSA-N tipifarnib Chemical compound CN1C=NC=C1[C@](N)(C=1C=C2C(C=3C=C(Cl)C=CC=3)=CC(=O)N(C)C2=CC=1)C1=CC=C(Cl)C=C1 PLHJCIYEEKOWNM-HHHXNRCGSA-N 0.000 description 1
- 229960003989 tocilizumab Drugs 0.000 description 1
- 229960004167 toremifene citrate Drugs 0.000 description 1
- 125000002088 tosyl group Chemical group [H]C1=C([H])C(=C([H])C([H])=C1C([H])([H])[H])S(*)(=O)=O 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 108700012359 toxins Proteins 0.000 description 1
- 230000002463 transducing effect Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000002054 transplantation Methods 0.000 description 1
- IUCJMVBFZDHPDX-UHFFFAOYSA-N tretamine Chemical compound C1CN1C1=NC(N2CC2)=NC(N2CC2)=N1 IUCJMVBFZDHPDX-UHFFFAOYSA-N 0.000 description 1
- 229950001353 tretamine Drugs 0.000 description 1
- 229960001727 tretinoin Drugs 0.000 description 1
- 150000003852 triazoles Chemical class 0.000 description 1
- 125000002306 tributylsilyl group Chemical group C(CCC)[Si](CCCC)(CCCC)* 0.000 description 1
- 229930013292 trichothecene Natural products 0.000 description 1
- 150000003327 trichothecene derivatives Chemical class 0.000 description 1
- ITMCEJHCFYSIIV-UHFFFAOYSA-M triflate Chemical compound [O-]S(=O)(=O)C(F)(F)F ITMCEJHCFYSIIV-UHFFFAOYSA-M 0.000 description 1
- 125000001889 triflyl group Chemical group FC(F)(F)S(*)(=O)=O 0.000 description 1
- 150000004684 trihydrates Chemical class 0.000 description 1
- NOYPYLRCIDNJJB-UHFFFAOYSA-O trimetrexate Chemical compound COC1=C(OC)C(OC)=CC(NCC=2C(=C3C(N)=[NH+]C(N)=NC3=CC=2)C)=C1 NOYPYLRCIDNJJB-UHFFFAOYSA-O 0.000 description 1
- 229960001099 trimetrexate Drugs 0.000 description 1
- BPLUKJNHPBNVQL-UHFFFAOYSA-N triphenylarsine Chemical compound C1=CC=CC=C1[As](C=1C=CC=CC=1)C1=CC=CC=C1 BPLUKJNHPBNVQL-UHFFFAOYSA-N 0.000 description 1
- RIOQSEWOXXDEQQ-UHFFFAOYSA-N triphenylphosphine Chemical compound C1=CC=CC=C1P(C=1C=CC=CC=1)C1=CC=CC=C1 RIOQSEWOXXDEQQ-UHFFFAOYSA-N 0.000 description 1
- 229910000404 tripotassium phosphate Inorganic materials 0.000 description 1
- 235000019798 tripotassium phosphate Nutrition 0.000 description 1
- BSVBQGMMJUBVOD-UHFFFAOYSA-N trisodium borate Chemical compound [Na+].[Na+].[Na+].[O-]B([O-])[O-] BSVBQGMMJUBVOD-UHFFFAOYSA-N 0.000 description 1
- 229950010147 troxacitabine Drugs 0.000 description 1
- RXRGZNYSEHTMHC-BQBZGAKWSA-N troxacitabine Chemical compound O=C1N=C(N)C=CN1[C@H]1O[C@@H](CO)OC1 RXRGZNYSEHTMHC-BQBZGAKWSA-N 0.000 description 1
- 229960001005 tuberculin Drugs 0.000 description 1
- 208000029729 tumor suppressor gene on chromosome 11 Diseases 0.000 description 1
- 239000000225 tumor suppressor protein Substances 0.000 description 1
- 201000007912 type 1 diabetes mellitus 10 Diseases 0.000 description 1
- GFNNBHLJANVSQV-UHFFFAOYSA-N tyrphostin AG 1478 Chemical compound C=12C=C(OC)C(OC)=CC2=NC=NC=1NC1=CC=CC(Cl)=C1 GFNNBHLJANVSQV-UHFFFAOYSA-N 0.000 description 1
- 229950009811 ubenimex Drugs 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 229960001055 uracil mustard Drugs 0.000 description 1
- XSQUKJJJFZCRTK-UHFFFAOYSA-N urea group Chemical group NC(=O)N XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 1
- 210000003932 urinary bladder Anatomy 0.000 description 1
- 206010046766 uterine cancer Diseases 0.000 description 1
- 208000012991 uterine carcinoma Diseases 0.000 description 1
- 229940005605 valeric acid Drugs 0.000 description 1
- 229950000578 vatalanib Drugs 0.000 description 1
- YCOYDOIWSSHVCK-UHFFFAOYSA-N vatalanib Chemical compound C1=CC(Cl)=CC=C1NC(C1=CC=CC=C11)=NN=C1CC1=CC=NC=C1 YCOYDOIWSSHVCK-UHFFFAOYSA-N 0.000 description 1
- 229960003048 vinblastine Drugs 0.000 description 1
- JXLYSJRDGCGARV-XQKSVPLYSA-N vincaleukoblastine Chemical compound C([C@@H](C[C@]1(C(=O)OC)C=2C(=CC3=C([C@]45[C@H]([C@@]([C@H](OC(C)=O)[C@]6(CC)C=CCN([C@H]56)CC4)(O)C(=O)OC)N3C)C=2)OC)C[C@@](C2)(O)CC)N2CCC2=C1NC1=CC=CC=C21 JXLYSJRDGCGARV-XQKSVPLYSA-N 0.000 description 1
- OGWKCGZFUXNPDA-XQKSVPLYSA-N vincristine Chemical compound C([N@]1C[C@@H](C[C@]2(C(=O)OC)C=3C(=CC4=C([C@]56[C@H]([C@@]([C@H](OC(C)=O)[C@]7(CC)C=CCN([C@H]67)CC5)(O)C(=O)OC)N4C=O)C=3)OC)C[C@@](C1)(O)CC)CC1=C2NC2=CC=CC=C12 OGWKCGZFUXNPDA-XQKSVPLYSA-N 0.000 description 1
- 229960004528 vincristine Drugs 0.000 description 1
- OGWKCGZFUXNPDA-UHFFFAOYSA-N vincristine Natural products C1C(CC)(O)CC(CC2(C(=O)OC)C=3C(=CC4=C(C56C(C(C(OC(C)=O)C7(CC)C=CCN(C67)CC5)(O)C(=O)OC)N4C=O)C=3)OC)CN1CCC1=C2NC2=CC=CC=C12 OGWKCGZFUXNPDA-UHFFFAOYSA-N 0.000 description 1
- UGGWPQSBPIFKDZ-KOTLKJBCSA-N vindesine Chemical compound C([C@@H](C[C@]1(C(=O)OC)C=2C(=CC3=C([C@]45[C@H]([C@@]([C@H](O)[C@]6(CC)C=CCN([C@H]56)CC4)(O)C(N)=O)N3C)C=2)OC)C[C@@](C2)(O)CC)N2CCC2=C1N=C1[C]2C=CC=C1 UGGWPQSBPIFKDZ-KOTLKJBCSA-N 0.000 description 1
- 229960004355 vindesine Drugs 0.000 description 1
- 229920002554 vinyl polymer Polymers 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 201000005102 vulva cancer Diseases 0.000 description 1
- 238000010792 warming Methods 0.000 description 1
- 239000008215 water for injection Substances 0.000 description 1
- 230000003442 weekly effect Effects 0.000 description 1
- 239000000080 wetting agent Substances 0.000 description 1
- FBTUMDXHSRTGRV-ALTNURHMSA-N zorubicin Chemical compound O([C@H]1C[C@@](O)(CC=2C(O)=C3C(=O)C=4C=CC=C(C=4C(=O)C3=C(O)C=21)OC)C(\C)=N\NC(=O)C=1C=CC=CC=1)[C@H]1C[C@H](N)[C@H](O)[C@H](C)O1 FBTUMDXHSRTGRV-ALTNURHMSA-N 0.000 description 1
- 229960000641 zorubicin Drugs 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07D—HETEROCYCLIC COMPOUNDS
- C07D487/00—Heterocyclic compounds containing nitrogen atoms as the only ring hetero atoms in the condensed system, not provided for by groups C07D451/00 - C07D477/00
- C07D487/02—Heterocyclic compounds containing nitrogen atoms as the only ring hetero atoms in the condensed system, not provided for by groups C07D451/00 - C07D477/00 in which the condensed system contains two hetero rings
- C07D487/04—Ortho-condensed systems
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K47/00—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient
- A61K47/06—Organic compounds, e.g. natural or synthetic hydrocarbons, polyolefins, mineral oil, petrolatum or ozokerite
- A61K47/22—Heterocyclic compounds, e.g. ascorbic acid, tocopherol or pyrrolidones
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P19/00—Drugs for skeletal disorders
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P35/00—Antineoplastic agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P35/00—Antineoplastic agents
- A61P35/02—Antineoplastic agents specific for leukemia
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P37/00—Drugs for immunological or allergic disorders
- A61P37/02—Immunomodulators
- A61P37/06—Immunosuppressants, e.g. drugs for graft rejection
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P43/00—Drugs for specific purposes, not provided for in groups A61P1/00-A61P41/00
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K5/00—Peptides containing up to four amino acids in a fully defined sequence; Derivatives thereof
- C07K5/04—Peptides containing up to four amino acids in a fully defined sequence; Derivatives thereof containing only normal peptide links
- C07K5/06—Dipeptides
- C07K5/06008—Dipeptides with the first amino acid being neutral
- C07K5/06017—Dipeptides with the first amino acid being neutral and aliphatic
- C07K5/06026—Dipeptides with the first amino acid being neutral and aliphatic the side chain containing 0 or 1 carbon atom, i.e. Gly or Ala
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- General Chemical & Material Sciences (AREA)
- Pharmacology & Pharmacy (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Engineering & Computer Science (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Immunology (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Molecular Biology (AREA)
- Genetics & Genomics (AREA)
- Oil, Petroleum & Natural Gas (AREA)
- Epidemiology (AREA)
- Biochemistry (AREA)
- Physical Education & Sports Medicine (AREA)
- Oncology (AREA)
- Transplantation (AREA)
- Hematology (AREA)
- Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)
- Medicinal Preparation (AREA)
- Peptides Or Proteins (AREA)
- Nitrogen And Oxygen Or Sulfur-Condensed Heterocyclic Ring Systems (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Nitrogen Condensed Heterocyclic Rings (AREA)
Abstract
本发明涉及一种用于治疗增殖性和自身免疫疾病的非对称吡咯并苯并二氮杂卓二聚物。具体地,本发明涉及一种式(I)的化合物或其药用盐或溶剂化物。
Description
本申请为申请日为2013年12月20日,申请号为201380073646.6,发明名称为“用于治疗增殖性和自身免疫疾病的非对称吡咯并苯并二氮杂卓二聚物”的发明专利申请的分案申请。
技术领域
本发明涉及吡咯并苯并二氮杂卓(吡咯并苯并二氮杂pyrrolobenzodiazepine,PBD)及其包含在靶向结合物中。本发明的PBD是混合的二聚物形式,其中,一个PBD部分包含亚胺或等价基团,而另一部分包含胺或酰氨基基团。PBD通过C2位置上的取代基连接至细胞结合剂。
背景技术
一些吡咯并苯并二氮杂卓(PBD)具有识别并结合特定DNA序列的能力;优选的序列是PuGPu。于1965年发现了第一种PBD抗肿瘤抗生素,安曲霉素(Leimgruber,et al.,J.Am.Chem.Soc.,87,5793-5795(1965);Leimgruber,et al.,J.Am.Chem.Soc.,87,5791-5793(1965))。其后,已报道了许多天然存在的PBD,并且针对各种类似物已开发多种合成途径(Thurston,et al.,Chem.Rev.1994,433-465(1994);Antonow,D.和Thurston,D.E.,Chem.Rev.2011 111(4),2815-2864)。家族成员包括赤霉素(abbeymycin)(Hochlowski,etal.,J.Antibiotics,40,145-148(1987))、契卡霉素(chicamycin)(Konishi,et al.,J.Antibiotics,37,200-206(1984))、DC-81(日本专利58-180487;Thurston,et al.,Chem.Brit.,26,767-772(1990);Bose,et al.,Tetrahedron,48,751-758(1992))、甲基氨茴霉素(Kuminoto,et al.,J.Antibiotics,33,665-667(1980))、新茴霉素(neothramycin)A和B(Takeuchi,et al.,J.Antibiotics,29,93-96(1976))、普罗茴霉素(porothramycin)(Tsunakawa,et al.,J.Antibiotics,41,1366-1373(1988))、普拉斯卡素(prothracarcin)(Shimizu,et al,J.Antibiotics,29,2492-2503(1982);Langley和Thurston,J.Org.Chem.,52,91-97(1987))、西班米星(sibanomicin)(DC-102)(Hara,et al.,J.Antibiotics,41,702-704(1988);Itoh,et al.,J.Antibiotics,41,1281-1284(1988))、西伯利亚霉素(sibiromycin)(Leber,et al.,J.Am.Chem.Soc.,110,2992-2993(1988))和托马霉素(tomamycin)(Arima,et al.,J.Antibiotics,25,437-444(1972))。PBD是以下一般结构:
它们的区别在于取代基的数量、类型和位置,在于它们的芳香族A环和吡咯并C环两者,以及在于C环的饱和度。在B环中,在N10-C11位置(其是负责使DNA烷基化的亲电中心)处存在亚胺(N=C)、甲醇胺(NH-CH(OH))或甲醇胺甲醚(NH-CH(OMe))。所有已知的天然产物都在手性C11a位置处具有(S)-构型,当从C环向A环观察时,该构型为它们提供右手扭转。这给予它们用于与B型DNA的小沟的等螺旋性(isohelicity)的适当的三维形状,获得在结合位点处的滑动配合(Kohn,InAntibiotics III.Springer-Verlag,New York,pp.3-11(1975);Hurley和Needham-VanDevanter,Acc.Chem.Res.,19,230-237(1986))。它们在小沟中形成加合物的能力使它们能够干扰DNA的加工,因此它们可用作抗肿瘤剂。
之前公开了经由柔性的亚烷基接头通过它们的C8/C’-羟基官能团将两个PBD单元结合在一起可以加强这些分子的生物活性(Bose,D.S.,et al.,J.Am.Chem.Soc.,114,4939-4941(1992);Thurston,D.E.,et al.,J.Org.Chem.,61,8141-8147(1996))。认为PBD二聚物形成序列选择性的DNA损伤,如回文结构5’-Pu-GATC-Py-3’链间交联(Smellie,M.,et al.,Biochemistry,42,8232-8239(2003);Martin,C.,et al.,Biochemistry,44,4135-4147),认为这是它们的生物活性的主要原因。PBD二聚物的一个实例,SG2000(SJG-136):
近年来已经进入肿瘤学领域的II期临床试验(Gregson,S.,et al.,J.Med.Chem.,44,737-748(2001);Alley,M.C.,et al.,Cancer Research,64,6700-6706(2004);Hartley,J.A.,et al.,Cancer Research,64,6693-6699(2004))。
WO 2010/043880公开了在每个单体的C2位置带有芳基基团的非对称二聚PBD化合物,其中,这些芳基基团中的一个带有设计为提供连接化合物至另一部分的锚状物的取代基。WO 2011/130613公开了包含在靶向结合物中的这些PBD二聚物。WO 2011/130616公开了在一个单体的C2位置带有芳基基团的非对称二聚PBD化合物,该芳基基团带有设计为提供连接化合物至另一部分的锚状物的取代基,另一单体在C2位置带有非芳香族基团。还公开了包含在靶向结合物中的这些化合物。2012年10月12日提交的共同未决国际申请PCT/EP2012/070233公开了在一个单体的C2位置带有丙烯基基团的非对称二聚PBD化合物,该丙烯基基团带有设计为提供连接化合物至另一部分的锚状物的取代基,另一单体在C2位置带有芳香族或非芳香族基团。
在2002年,Kamal描述了在一个PBD中具有亚胺键且在另一PBD中具有酰胺基团的PBD二聚物的合成和评估(Kamal,A,et al.,J.Med.Chem.,2002,4679-4688),如:
在2004年,它描述了在一个PBD中具有亚胺键且在另一PBD中具有胺键的PBD二聚物的合成和评估(Kamal,A,et al.,Bioorg.Med.Chem.,12(2004)5427-5436),如:
这些化合物不能交联DNA,但是示出具有一些细胞毒性。
发明内容
本发明的一个方面包括具有式I的化合物:
或其药用盐或溶剂化物,
其中:
R2是式IIa、式IIb或式IIc:
(a)
其中,A是C5-7芳基基团,并且以下两者之一:
(i)Q1是单键,并且Q2选自单键和-Z-(CH2)n-,其中Z选自单键、O、S和NH,并且n是1至3;或
(ii)Q1是-CH=CH-,并且Q2是单键;
(b)
其中:
RC1、RC2和RC3独立地选自H和未取代的C1-2烷基;
(c)
其中,Q选自OH、SH和NRN,并且RN选自H、甲基和乙基;
X选自包括以下的组:OH、SH、CO2H、COH、N=C=O、NHNH2、CONHNH2、NHRN,其中,RN选自包括H和C1-4烷基的组;
并且以下两者之一:
当C2’和C3’之间存在双键时,R12选自由以下组成的组:
(ia)C5-10芳基基团,可选地被选自包括以下的组中的一个或多个取代基取代:卤素、硝基、氰基、醚、羧基、酯、C1-7烷基、C3-7杂环基和双-氧基-C1-3亚烷基;
(ib)C1-5饱和脂肪族烷基;
(ic)C3-6饱和环烷基;
(id)其中,R21、R22和R23中的每个独立地选自H、C1-3饱和烷基、C2-3烯基、C2-3炔基和环丙基,其中,R12基团中碳原子的总数不大于5;
(ie)其中,R25a和R25b中的一个是H,并且另一个选自:苯基,该苯基可选地被选自卤素、甲基、甲氧基的基团取代;吡啶基;和苯硫基;和
(if)其中,R24选自:H;C1-3饱和烷基;C2-3烯基;C2-3炔基;环丙基;苯基,该苯基可选地被选自卤素、甲基、甲氧基的基团取代;吡啶基;和苯硫基;
当C2’和C3’之间存在单键时,
R12是H或其中,R26a和R26b独立地选自H、F、C1-4饱和烷基、C2-3烯基,该烷基和烯基基团可选地被选自C1-4烷基酰氨基和C1-4烷基酯的基团取代;或,当R26a和R26b中的一个是H时,另一个选自腈和C1-4烷基酯;
R6和R9独立地选自H、R、OH、OR、SH、SR、NH2、NHR、NRR′、硝基、Me3Sn和卤素;
其中,R和R’独立地选自可选取代的C1-12烷基、C3-20杂环基和C5-20芳基基团;
R7选自H、R、OH、OR、SH、SR、NH2、NHR、NHRR’、硝基、Me3Sn和卤素;
R″是C3-12亚烷基基团,该链可以被一个或多个以下项所中断:杂原子,例如O、S、NRN2(其中,RN2是H或C1-4烷基),和/或芳环,例如苯或吡啶;
Y和Y′选自O、S、或NH;
R6’、R7’、R9’分别选自与R6、R7和R9相同的基团;
以下两者之一:
(A)R20是H或Me且R21a和R21b都是H或一起形成=O,并且以下各项之一:
(i)R10是H,R11a是H且R11b是OH或ORA,其中,RA是C1-4烷基;或
(ii)R10和R11b形成它们结合的氮和碳原子之间的氮-碳双键且R11a是H;或
(iii)R10是H,R11a是H且R11b是SOzM,其中,z是2或3且M是单价药用阳离子;或者
(B)R10是H或Me且R11a和R11b都是H或一起形成=O,并且以下各项之一:
(i)R20是H,R21a是H且R21b是OH或ORA,其中,RA是C1-4烷基;或
(ii)R20和R21b形成它们连接的氮和碳原子之间的氮-碳双键且R11a是H;或
(iii)R20是H,R21a是H且R21b是SOzM,其中,z是2或3且M是单价药用阳离子。
因此,以上选项(A)和(B)可以产生下式(IA-a、IA-b、IB-a、IB-b)的化合物:
本发明的二聚物因此在一个单体中具有亚胺键,该亚胺键可以作为甲醇胺、甲醇胺醚或亚硫酸氢盐形式存在,并且在另一单体中具有仲/叔胺或(甲基)酰氨基官能团。
本发明的第二方面提供了本发明的第一方面的化合物在制备用于治疗增殖性(增生性)疾病的药剂中的用途。第二方面还提供了在增殖性疾病的治疗中使用的本发明的第一方面的化合物。
本领域普通技术人员能够容易地确定候选化合物是否治疗任何特定细胞类型的增殖性病症。例如,在以下的实例中描述了测定法,它们可以方便地用于评价特定化合物所提供的活性。
本发明的第三方面提供了制备本发明的第一方面的化合物的方法,包括以下列出的方法步骤中的至少一个。
本发明的第四方面提供了以下式III的化合物:
或其药用盐或溶剂化物,
其中:
R22选自:
(a)式IVa:
其中,A、Q1、Q2如本发明的第一方面所定义;
(b)式IVb:
其中,RC1、RC2和RC3如本发明的第一方面所定义;
(c)式IVc:
L4选自单键和以下组:
(a):
其中,n是0至3;
(b)
其中,n如以下定义;
(c)
其中,n如以上定义;和
(d)
其中,n如以上定义,E是O、S或NR,D是N、CH或CR,并且F是N、CH或CR;
L3是:
其中,X使得L3是氨基酸残基、二肽残基或三肽残基;Prot选自Fmoc(芴甲氧羰基)、Teoc(2-(三甲基硅烷基)乙氧基羰基)、Boc(叔丁氧基羰基)和Alloc(烯丙氧基羰基);
且R6、R7、R9、R6’、R7’、R9’、R12、R″、Y、Y’、R10、R11a、R11b、R20、R21a和R21b如本发明的第一方面所定义。
在第五方面,本发明涉及包含连接至靶向剂的PBD的二聚物的结合物或其药用盐或溶剂化物(同上),其中,PBD二聚物是式I。
在一些实施方式中,结合物具有下式V:
L-(LU-D)p (V),
或其药用盐或溶剂化物,其中,L是配体单元(即,靶向剂),LU是接头单元以及D是药物单元,该药物单元是PBD二聚物(参见以下)。下标p是1至20的整数。因此,结合物包含通过接头单元共价连接至至少一个药物单元的配体单元。配体单元是结合至靶向部分的靶向剂,下文进行了更全面的描述。例如,配体单元可以特异性结合至细胞组分(细胞结合剂)或结合至其他感兴趣的靶分子。因此,本发明还提供了用于治疗例如各种癌症和自身免疫疾病的方法。这些方法包括结合物的使用,其中,配体单元是特异性结合至靶分子的靶向剂。配体单元可以是例如蛋白质、多肽或肽如抗体、抗体的抗原结合片段或其他结合剂,如Fc融合蛋白。
在本发明的结合物中,PBD二聚物D是式I或其药用盐或溶剂化物,除了X选自包括以下的组:*-O-+、*-S-+、*-CO2-+、*-CO-+、*-NH(C=O)-+、*-NHNH-+、*-CONHNH-+、其中,RN选自包括H和C1-4烷基的组,且星号表示附接至药物单元的剩余部分的点以及波浪线或+表示附接至接头单元的点。
载药量(drug loading)由p代表,每配体单元(例如,抗体)药物分子的数目。载药量可以在每配体单元(例如,Ab或mAb)1至20个药物单元(D)的范围内。对于组合物,p代表该组合物中结合物的平均载药量,并且p在1至20的范围内。
本发明的第六方面提供了本发明的第五方面的结合物在制备用于治疗增殖性疾病的药剂中的用途。第六方面还提供了在增殖性疾病的治疗中使用的本发明的第五方面的结合物。
本领域普通技术人员能够容易地确定候选结合物是否治疗任何特定细胞类型的增殖性病症。例如,在以下的实例中描述了测定法,它们可以方便地用于评价特定化合物所提供的活性。
在第七方面,本发明涉及包含连接至连接单元的PBD的二聚物(参见以上)的接头-药物化合物(即,药物-接头)。这些药物-接头可以用作合成包含连接至靶向剂的PBD的二聚物的结合物的中间体。
这些药物-接头具有下式VI:
LU-D (VI),
或其药用盐或溶剂化物,其中,LU是接头单元以及D是药物单元,该药物单元是PBD二聚物。
在本发明的药物-接头中,PBD二聚物D是式I或其药用盐或溶剂化物,除了X选自包括以下的组:*-O-q、*-S-q、*-CO2-q、*-CO-q、*-NH(C=O)-q、*-NHNH-q、*-CONHNH-q、其中,RN选自包括H和C1-4烷基的组,且星号表示附接至药物单元的剩余部分的点以及波浪线或q表示附接至接头单元的点。
在一些实施方式中,药物接头是以上所定义的式III。
具体实施方式
定义
药用阳离子
在Berge,et al.,J.Pharm.Sci.,66,1-19(1977)中讨论了药用单价和二价阳离子的实例,通过引用将其结合于此。
药用阳离子可以是无机阳离子或有机阳离子。
药用单价无机阳离子的实例包括但不限于碱金属离子如Na+和K+。药用二价无机阳离子的实例包括但不限于碱土金属阳离子如Ca2+和Mg2+。药用有机阳离子的实例包括但不限于铵离子(即,NH4 +)和取代的铵离子(例如,NH3R+、NH2R2 +、NHR3 +、NR4 +)。一些合适的取代的铵离子的实例是源自以下的那些:乙胺、二乙胺、二环己胺、三乙胺、丁胺、乙二胺、乙醇胺、二乙醇胺、哌嗪、苄胺、苯基苄胺、胆碱、葡甲胺和氨丁三醇,以及氨基酸,如赖氨酸和精氨酸。常见的季铵离子的实例是N(CH3)4 +。
取代基
如本文所使用的,短语“可选取代的”涉及其可以是未取代的或其可以是取代的母基团。
除非另外指定,否则本文所用的术语“取代的”涉及带有一个或多个取代基的母基团。术语“取代基”在本文中是在常规意义上使用并且是指这样的化学部分,其共价连接于,或者如果合适的话,稠合于母基团。各种取代基是众所周知的,并且用于它们的形成和引入至各种母基团的方法也是众所周知的。
下文中更详细地描述取代基的实例。
C1-12烷基:如本文所使用的,术语“C1-12烷基”涉及通过从具有1至12个碳原子的烃化合物的碳原子除去氢原子所得到的单价部分,其可以是脂肪族或脂环族,并且其可以是饱和或不饱和的(例如部分不饱和的、完全不饱和的)。如本文所使用的,术语“C1-4烷基”涉及通过从具有1至4个碳原子的烃化合物的碳原子除去氢原子所得到的单价部分,其可以是脂肪族或脂环族,并且其可以是饱和或不饱和的(例如部分不饱和的、完全不饱和的)。因此,术语“烷基”包括以下讨论的亚类:烯基、炔基、环烷基等。
饱和烷基基团的实例包括但不限于甲基(C1)、乙基(C2)、丙基(C3)、丁基(C4)、戊基(C5)、己基(C6)和庚基(C7)。
饱和线性烷基基团的实例包括但不限于甲基(C1)、乙基(C2)、正丙基(C3)、正丁基(C4)、正戊基(戊基)(C5)、正己基(C6)和正庚基(C7)。
饱和支链烷基基团的实例包括异丙基(C3)、异丁基(C4)、仲丁基(C4)、叔丁基(C4)、异戊基(C5)和新戊基(C5)。
C2-12烯基:本文所使用的,术语“C2-12烯基”涉及具有一个或多个碳-碳双键的烷基基团。
不饱和烯基基团的实例包括但不限于乙烯基(ethenyl)(乙烯基(vinyl),-CH=CH2)、1-丙烯基(-CH=CH-CH3)、2-丙烯基(烯丙基,-CH-CH=CH2)、异丙烯基(1-甲基乙烯基,-C(CH3)=CH2)、丁烯基(C4)、戊烯基(C5)和己烯基(C6)。
C2-12炔基:如本文所使用的,术语“C2-12炔基”涉及具有一个或多个碳-碳三键的烷基基团。
不饱和炔基基团的实例包括但不限于乙炔基(-C≡CH)和2-丙炔基(炔丙基,-CH2-C≡CH)。
C3-12环烷基:如本文所使用的,术语“C3-12环烷基”涉及烷基基团,其也是环基基团;即,通过从环烃(碳环)化合物的脂环原子除去氢原子得到的单价部分,该单价部分具有3至7个碳原子,包括3至7个环原子。
环烷基基团的实例包括但不限于源自以下的那些:
饱和单环烃化合物:
环丙烷(C3)、环丁烷(C4)、环戊烷(C5)、环己烷(C6)、环庚烷(C7)、甲基环丙烷(C4)、二甲基环丙烷(C5)、甲基环丁烷(C5)、二甲基环丁烷(C6)、甲基环戊烷(C6)、二甲基环戊烷(C7)、和甲基环己烷(C7);
不饱和单环烃化合物:
环丙烯(C3)、环丁烯(C4)、环戊烯(C5)、环己烯(C6)、甲基环丙烯(C4)、二甲基环丙烯(C5)、甲基环丁烯(C5)、二甲基环丁烯(C6)、甲基环戊烯(C6)、二甲基环戊烯(C7)和甲基环己烯(C7);以及
饱和多环烃化合物:
降蒈烷(C7)、降蒎烷(C7)、降莰烷(C7)。
C3-20杂环基:如本文所使用的,术语“C3-20杂环基”涉及通过从杂环化合物的环原子上除去氢原子得到的单价部分,该部分具有3至20个环原子,其中1至10个是环杂原子。优选地,每个环具有3至7个环原子,其中1至4个是环杂原子。
在该上下文中,前缀(例如C3-20、C3-7、C5-6等)表示环原子的数目或环原子的数目范围,而不管是碳原子或杂原子。例如,如本文所使用的,术语“C5-6杂环基”涉及具有5或6个环原子的杂环基基团。
单环杂环基基团的实例包括但不限于源自以下的那些:
N1:氮杂环丙烷(C3)、氮杂环丁烷(C4)、吡咯烷(四氢化吡咯)(C5)、吡咯啉(例如,3-吡咯啉、2,5-二氢吡咯)(C5)、2H-吡咯或3H-吡咯(异吡咯、异噁唑)(C5)、哌啶(C6)、二氢吡啶(C6)、四氢吡啶(C6)、氮杂卓(C7);
O1:氧杂环丙烷(C3)、氧杂环丁烷(C4)、氧杂环戊烷(四氢呋喃)(C5)、氧杂环戊二烯(二氢呋喃)(C5)、环氧己烷(四氢吡喃)(C6)、二氢吡喃(C6)、吡喃(C6)、氧杂卓((C7);
S1:硫杂环丙烷(C3)、硫杂环丁烷(C4)、硫杂环戊烷(四氢噻吩)(C5)、硫杂环己烷(四氢噻喃)(C6)、硫杂环庚烷(thiepane)(C7);
O2:二氧杂环戊烷(C5)、二氧杂环己烷(C6)和二氧杂环庚烷(C7);
O3:三氧杂环己烷(C6);
N2:咪唑烷(C5)、吡唑烷(二偶氮烷)(C5)、咪唑啉(C5)、吡唑啉(二氢吡唑)(C5)、哌嗪(C6);
N1O1:四氢噁唑(C5)、二氢噁唑(C5)、四氢异噁唑(C5)、二氢异噁唑(C5)、吗啉(C6)、四氢噁嗪(C6)、二氢噁嗪(C6)、噁嗪(C6);
N1S1:噻唑啉(C5)、噻唑烷(C5)、硫代吗啉(C6);
N2O1:噁二嗪(C6);
O1S1:氧硫杂环戊二烯(oxathiole)(C5)和氧硫杂环己烷(C6);和
N1O1S1:噁噻嗪(C6)。
取代的单环杂环基基团的实例包括源自环形式的糖类的那些,例如,呋喃糖(C5)如阿拉伯呋喃糖、来苏呋喃糖、呋喃核糖和呋喃木糖,和吡喃糖(C6)如吡喃阿洛糖、吡喃阿卓糖、吡喃葡萄糖、吡喃甘露糖、吡喃古洛糖、吡喃艾杜糖(idopyranose)、吡喃半乳糖和吡喃塔罗糖(talopyranose)。
C5-20芳基:如本文所使用的,术语“C5-20芳基”涉及通过从芳香族化合物的芳环原子上除去氢原子得到的单价部分,该部分具有3至20个环原子。如本文所用的,术语“C5-7芳基”涉及从芳香族化合物的芳环原子除去氢原子得到的单价部分,该部分具有5至7个环原子,以及如本文使用的,术语“C5-10芳基”涉及从芳香族化合物的芳环原子除去氢原子得到的单价部分,该部分具有5至10个环原子。优选地,每个环具有5至7个环原子。
在该上下文中,前缀(例如C3-20、C5-7、C5-6、C5-10等)表示环原子(碳原子或杂原子)的数目或环原子的数目范围。例如,如本文所使用的,术语“C5-6芳基”涉及具有5或6个环原子的芳基基团。
如在“碳芳基基团”中,环原子可以全部是碳原子。
碳芳基基团的实例包括但不限于源自以下的那些:苯(即,苯基)(C6)、萘(C10)、薁(C10)、蒽(C14)、菲(C14)、并四苯(C18)和芘(C16)。
包含稠环(其中的至少一个是芳环)的芳基基团的实例包括但不限于源自以下的基团:茚满(例如,2,3-二氢-1H-茚)(C9)、茚(C9)、异茚(C9)、四氢化萘(1,2,3,4-四氢化萘)(C10)、苊(C12)、芴(C13)、非那烯(C13)、醋菲(C15)和醋蒽(C16)。
可替代地,如在“杂芳基基团”中,环原子可以包括一个或多个杂原子。单环杂芳基基团的实例包括但不限于源自以下的那些:
N1:吡咯(唑)(C5)、吡啶(吖嗪)(C6);
O1:呋喃(氧杂茂,oxole)(C5);
S1:噻吩(thiophene)(噻吩,thiole)(C5);
N1O1:噁唑(C5)、异噁唑(C5)、异噁嗪(C6);
N2O1:噁二唑(呋咱)(C5);
N3O1:噁三唑(C5);
N1S1:噻唑(C5)、异噻唑(C5);
N2:咪唑(1,3-二唑)(C5)、吡唑(1,2-二唑)(C5)、哒嗪(1,2-二嗪)(C6)、嘧啶(1,3-二嗪)(C6)(例如,胞嘧啶、胸腺嘧啶、尿嘧啶)、吡嗪(1,4-二嗪)(C6);
N3:三唑(C5)、三嗪(C6);和
N4:四唑(C5)。
包含稠环的杂芳基的实例包括但不限于:
源自苯并呋喃(O1)、异苯并呋喃(O1)、吲哚(N1)、异吲哚(N1)、氮茚(N1)、吲哚啉(N1)、异吲哚啉(N1)、嘌呤(N4)(例如,腺嘌呤、鸟嘌呤)、苯并咪唑(N2)、吲唑(N2)、苯并噁唑(N1O1)、苯并异噁唑(N1O1)、苯并二噁茂(O2)、苯并呋咱(N2O1)、苯并三唑(N3)、苯并噻吩(S1)、苯并噻唑(N1S1)、苯并噻二唑((N2S)的C9(具有2个稠环);
源自色烯(O1)、异色烯(O1)、色满(O1)、异色满(O1)、苯并二噁烷(O2)、喹啉(N1)、异喹啉(N1)、喹嗪(N1)、苯并噁嗪(N1O1)、苯并二嗪(N2)、吡啶并吡啶(N2)、喹喔啉(N2)、喹唑啉(N2)、噌啉(N2)、酞嗪(N2)、萘啶(N2)、蝶啶(N4)的C10(具有2个稠环);
源自苯并二氮杂卓(N2)的C11(具有2个稠环);
源自咔唑(N1)、二苯并呋喃(O1)、二苯并噻吩(S1)、咔啉(N2)、呸啶(N2)、吡啶并吲哚(N2)的C13(具有3个稠环);和
源自吖啶(N1)、呫吨(O1)、噻吨(S1)、氧杂呫吨(O2)、吩噁噻(O1S1)、吩嗪(N2)、吩噁嗪(N1O1)、吩噻嗪(N1S1)、噻蒽(S2)、菲啶(N1)、菲绕啉(N2)、吩嗪(N2)的C14(具有3个稠环)。
以上基团,无论单独或作为另一取代基的一部分,其自身可以可选地被选自自身和一个或多个以下所列的另外的取代基所取代。
卤素:-F、-Cl、-Br和-I。
羟基:-OH。
醚:-OR,其中,R是醚取代基,例如,C1-7烷基基团(也称为C1-7烷氧基基团,在以下讨论)、C3-20杂环基基团(也称为C3-20杂环氧基基团)或C5-20芳基基团(也称为C5-20芳氧基基团),优选地是C1-7烷基基团。
烷氧基:-OR,其中,R是烷基基团,例如,C1-7烷基基团。C1-7烷基基团的实例包括但不限于-OMe(甲氧基)、-OEt(乙氧基)、-O(nPr)(正丙氧基)、-O(iPr)(异丙氧基)、-O(nBu)(正丁氧基)、-O(sBu)(仲丁氧基)、-O(iBu)(异丁氧基)和-O(tBu)(叔-丁氧基)。
缩醛:-CH(OR1)(OR2),其中,R1和R2独立地是缩醛取代基,例如,C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是C1-7烷基基团,或在“环”缩醛基团的情况中,R1和R2与它们附接的两个氧原子和它们附接的碳原子一起形成具有4至8个环原子的杂环。缩醛基团的实例包括但不限于-CH(OMe)2、-CH(OEt)2和-CH(OMe)(OEt)。
半缩醛:-CH(OH)(OR1),其中,R1是半缩醛取代基,例如,C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是C1-7烷基基团。半缩醛基团的实例包括但不限于-CH(OH)(OMe)和-CH(OH)(OEt)。
缩酮:-CR(OR1)(OR2),其中,R1和R2如对缩醛定义的,并且R是除氢之外的缩酮取代基,例如,C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是C1-7烷基基团。缩酮基团的实例包括但不限于-C(Me)(OMe)2、-C(Me)(OEt)2、-C(Me)(OMe)(OEt)、-C(Et)(OMe)2、-C(Et)(OEt)2和-C(Et)(OMe)(OEt)。
半缩酮:-CR(OH)(OR1),其中,R1如对半缩醛定义的,并且R是除氢之外的半缩酮取代基,例如,C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是C1-7烷基基团。半缩酮基团的实例包括但不限于-C(Me)(OH)(OMe)、-C(Et)(OH)(OMe)、-C(Me)(OH)(OEt)和-C(Et)(OH)(OEt)。
氧代(酮基、-酮):=O。
硫酮(Thione)(硫代酮(thioketone)):=S。
亚氨基(亚胺):=NR,其中,R是亚氨基取代基,例如,氢、C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是氢或C1-7烷基基团。酯基团的实例包括但不限于=NH、=NMe、=NEt和=NPh。
甲酰基(甲醛(carbaldehyde),甲醛基(carboxaldehyde)):-C(=O)H。
酰基(酮基):-C(=O)R,其中,R是酰基取代基,例如,C1-7烷基基团(也称为C1-7烷酰基基团或C1-7烷酰基(alkanoyl))、C3-20杂环基基团(也称为C3-20杂环酰基基团)或C5-20芳基基团(也称为C5-20芳酰基基团),优选地是C1-7烷基基团。酰基基团的实例包括但不限于-C(=O)CH3(乙酰基)、-C(=O)CH2CH3(丙酰基)、-C(=O)C(CH3)3(叔-丁酰基)和-C(=O)Ph(苯甲酰基,酰苯)。
羧基(羧酸):-C(=O)OH。
硫代羧基(硫代羧酸):-C(=S)SH。
一硫代羧基(一硫代羧酸):-C(=O)SH。
硫羰羧基(硫羰羧酸):-C(=S)OH。
亚胺酸:-C(=NH)OH。
异羟肟酸:-C(=NOH)OH。
酯(羧酸酯基(carboxylate),羧酸酯(carboxylic acid ester),氧羰基):-C(=O)OR,其中,R是酯取代基,例如,C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是C1-7烷基基团。酯基团的实例包括但不限于-C(=O)OCH3、-C(=O)OCH2CH3、-C(=O)OC(CH3)3和-C(=O)OPh。
酰氧基(反向酯):-OC(=O)R,其中,R是酰氧基取代基,例如,C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是C1-7烷基基团。酰氧基基团的实例包括但不限于-OC(=O)CH3(乙酰氧基)、-OC(=O)CH2CH3、-OC(=O)C(CH3)3、-OC(=O)Ph和-OC(=O)CH2Ph。
氧基羰氧基(oxycarboyloxy):-OC(=O)OR,其中,R是酯取代基,例如,C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是C1-7烷基基团。酯基团的实例包括但不限于-OC(=O)OCH3、-OC(=O)OCH2CH3、-OC(=O)OC(CH3)3和-OC(=O)OPh。
氨基:-NR1R2,其中,R1和R2独立地是氨基取代基,例如,氢、C1-7烷基基团(也称为C1-7烷基氨基或二-C1-7烷基氨基)、C3-20杂环基基团或C5-20芳基基团,优选地是H或C1-7烷基基团,或在“环”氨基基团的情况中,R1和R2与它们附接的氮原子一起形成具有4至8个环原子的杂环。氨基基团可以是伯氨基(-NH2)、仲氨基(-NHR1)、或叔氨基(-NHR1R2),以及以阳离子形式,可以是季氨基(-+NR1R2R3)。氨基基团的实例包括但不限于-NH2、-NHCH3、-NHC(CH3)2、-N(CH3)2、-N(CH2CH3)2和-NHPh。环氨基基团的实例包括但不限于氮杂环丙基、氮杂环丁基、吡咯烷基、哌啶基、哌嗪基、吗啉代和硫代吗啉代。
酰氨基(氨基甲酰基,氨甲酰基,氨基羰基,甲酰胺):-C(=O)NR1R2,其中,R1和R2独立地是如对氨基基团定义的氨基取代基。酰氨基基团的实例包括但不限于-C(=O)NH2、-C(=O)NHCH3、-C(=O)N(CH3)2、-C(=O)NHCH2CH3和-C(=O)N(CH2CH3)2、以及氨基基团,其中,R1和R2与它们附接的氮原子一起形成杂环结构,例如,哌啶羰基、吗啉代羰基、硫代吗啉代羰基和哌嗪羰基。
硫代酰氨基(硫代氨基甲酰基):-C(=S)NR1R2,其中,R1和R2独立地是如对氨基基团定义的氨基取代基。酰氨基基团的实例包括但不限于-C(=S)NH2、-C(=S)NHCH3、-C(=S)N(CH3)2和-C(=S)NHCH2CH3。
酰基酰氨基(酰氨基):-NR1C(=O)R2,其中,R1是酰胺取代基,例如,氢、C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是氢或C1-7烷基基团,以及R2是酰基取代基,例如,C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是氢或C1-7烷基基团。酰胺基团的实例包括但不限于-NHC(=O)CH3、-NHC(=O)CH2CH3和-NHC(=O)Ph。R1和R2可以一起形成环结构,例如,在琥珀酰亚胺基、马来酰亚胺基和酞酰亚胺基中:
氨基羰氧基:-OC(=O)NR1R2,其中,R1和R2独立地是如对氨基基团定义的氨基取代基。氨基羰氧基基团的实例包括但不限于-OC(=O)NH2、-OC(=O)NHMe、-OC(=O)NMe2和-OC(=O)NEt2。
脲基:-N(R1)CONR2R3,其中,R2和R3独立地是如对氨基基团定义的氨基取代基,并且R1是脲基取代基,例如,氢、C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是氢或C1-7烷基基团。脲基基团的实例包括但不限于-NHCONH2、-NHCONHMe、-NHCONHEt、-NHCONMe2、-NHCONEt2、-NMeCONH2、-NMeCONHMe、-NMeCONHEt、-NMeCONMe2和-NMeCONEt2。
胍基:-NH-C(=NH)NH2。
四唑基:具有4个氮原子和1个碳原子的五元芳环,
亚氨基:=NR,其中,R是亚氨基取代基,例如,氢、C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是H或C1-7烷基基团。亚氨基基团的实例包括但不限于=NH、=NMe和=NEt。
脒(脒基):-C(=NR)NR2,其中,每个R是脒取代基,例如,氢、C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是H或C1-7烷基基团。脒基团的实例包括但不限于-C(=NH)NH2、-C(=NH)NMe2和-C(=NMe)NMe2。
硝基:-NO2。
亚硝基:-NO。
叠氮基:-N3。
氰基(腈(nitrile),腈(carbonitrile)):-CN。
异氰基:-NC。
氰酰基:-OCN。
异氰酰基:-NCO。
硫代氰基(硫氰酰基):-SCN。
异硫代氰基(异硫氰基):-NCS。
巯基(sulfhydryl)(硫醇(thiol),巯基(mercapto)):-SH。
硫醚(硫化物):-SR,其中,R是硫醚取代基,例如,C1-7烷基基团(也称为C1-7烷硫基基团)、C3-20杂环基基团或C5-20芳基基团,优选地是C1-7烷基基团。C1-7烷硫基基团的实例包括但不限于-SCH3和-SCH2CH3。
二硫化物:-SS-R,其中,R是二硫化物取代基,例如,C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是C1-7烷基基团(在本文中也称为C1-7烷基二硫化物)。C1-7烷基二硫化物基团的实例包括但不限于-SSCH3和-SSCH2CH3。
锍化物(亚硫酰基,亚砜):-S(=O)R,其中,R是锍化物取代基,例如,C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是C1-7烷基基团。锍化物基团的实例包括但不限于-S(=O)CH3和-S(=O)CH2CH3。
砜(磺酰基):-S(=O)2R,其中,R是砜取代基,例如,C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是C1-7烷基基团,包括,例如,氟化或全氟化C1-7烷基基团。砜基团的实例包括但不限于-S(=O)2CH3(甲磺酰基(methanesulfonyl),甲磺酰基(mesyl))、-S(=O)2CF3(三氟甲磺酰基)、-S(=O)2CH2CH3(乙磺酰基)、-S(=O)2C4F9(九氟丁磺酰基)、-S(=O)2CH2CF3(三氟乙磺酰基)、-S(=O)2CH2CH2NH2(牛磺酰基)、-S(=O)2Ph(苯磺酰基,苯磺酰基)、4-甲基苯磺酰基(甲苯磺酰基)、4-氯苯磺酰基(氯苯磺酰基)、4-溴苯磺酰基(溴苯磺酰基)、4-硝基苯基(硝基苯磺酰基)、2-萘磺酸酯(萘磺酰基)和5-二甲基氨基-萘-1-基磺酸酯(丹磺酰基)。
亚磺酸(亚磺基):-S(=O)OH、-SO2H。
磺酸(磺基):-S(=O)2OH、-SO3H。
亚磺酸酯基(sulfinate)(亚磺酸酯,sulfinic acid ester):-S(=O)OR;其中,R是亚磺酸酯取代基,例如,C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是C1-7烷基基团。亚磺酸酯基团的实例包括但不限于-S(=O)OCH3(甲氧基亚硫酰基;亚磺酸甲酯)和-S(=O)OCH2CH3(乙氧基亚硫酰基;亚磺酸乙酯)。
磺酸酯(sulfonate)(磺酸酯(sulfonic acid ester)):-S(=O)2OR;其中,R是磺酸酯取代基,例如,C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是C1-7烷基基团。磺酸酯基团的实例包括但不限于-S(=O)2OCH3(甲氧基磺酰基;磺酸甲酯)和-S(=O)2OCH2CH3(乙氧基磺酰基;磺酸乙酯)。
亚磺酰氧基:-OS(=O)R,其中,R是亚磺酰氧基取代基,例如,C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是C1-7烷基基团。亚磺酰氧基基团的实例包括但不限于-OS(=O)CH3和-OS(=O)CH2CH3。
磺酰氧基:-OS(=O)2R,其中,R是磺酰氧基取代基,例如,C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是C1-7烷基基团。磺酰氧基基团的实例包括但不限于-OS(=O)2CH3(甲磺酸酯)和-OS(=O)2CH2CH3(乙磺酸酯)。
硫酸酯:-OS(=O)2OR;其中,R是硫酸酯取代基,例如,C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是C1-7烷基基团。硫酸酯基团的实例包括但不限于-OS(=O)2OCH3和-SO(=O)2OCH2CH3。
氨磺酰(氨磺酰基;亚磺酸酰胺;亚磺酰胺):-S(=O)NR1R2,其中,R1和R2独立地是如对氨基基团定义的氨基取代基。氨磺酰基团的实例包括但不限于-S(=O)NH2、-S(=O)NH(CH3)、-S(=O)N(CH3)2、-S(=O)NH(CH2CH3)、-S(=O)N(CH2CH3)2和-S(=O)NHPh。
亚磺酰氨基(亚氨磺酰基;磺酸酰胺;氨磺酰):-S(=O)2NR1R2,其中,R1和R2独立地是如对氨基基团定义的氨基取代基。亚磺酰氨基基团的实例包括但不限于-S(=O)2NH2、-S(=O)2NH(CH3)、-S(=O)2N(CH3)2、-S(=O)2NH(CH2CH3)、-S(=O)2N(CH2CH3)2和-S(=O)2NHPh。
磺氨基:-NR1S(=O)2OH,其中,R1是如对氨基基团定义的氨基取代基。磺氨基基团的实例包括但不限于-NHS(=O)2OH和-N(CH3)S(=O)2OH。
磺酰氨基:-NR1S(=O)2R,其中,R1是如对氨基基团定义的氨基取代基,并且R是磺酰氨基取代基,例如,C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是C1-7烷基基团。磺酰氨基基团的实例包括但不限于-NHS(=O)2CH3和-N(CH3)S(=O)2C6H5。
亚磺酰氨基:-NR1S(=O)R,其中,R1是如对氨基基团定义的氨基取代基,并且R是亚磺酰氨基取代基,例如,C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是C1-7烷基基团。亚磺酰氨基基团的实例包括但不限于-NHS(=O)CH3和-N(CH3)S(=O)C6H5。
膦基(膦):-PR2;其中,R是膦基取代基,例如,-H、C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是-H、C1-7烷基基团或C5-20芳基基团。膦基基团的实例包括但不限于-PH2、-P(CH3)2、-P(CH2CH3)2、-P(t-Bu)2和-P(Ph)2。
二氧膦基:-P(=O)2。
磷酰基(氧化膦):-P(=O)R2;其中,R是磷酰基取代基,例如,C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是C1-7烷基基团或C5-20芳基基团。磷酰基基团的实例包括但不限于-P(=O)(CH3)2、-P(=O)(CH2CH3)2、-P(=O)(t-Bu)2和-P(=O)(Ph)2。
膦酸(膦酰基):-P(=O)(OH)2。
膦酸酯(膦酰基酯):-P(=O)(OR)2;其中,R是膦酸酯取代基,例如,-H、C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是-H、C1-7烷基基团或C5-20芳基基团。膦酸酯基团的实例包括但不限于-P(=O)(OCH3)2、-P(=O)(OCH2CH3)2、-P(=O)(O-t-Bu)2和-P(=O)(OPh)2。
磷酸(膦酰氧基):-OP(=O)(OH)2。
磷酸酯(膦酰氧基酯):-OP(=O)(OR)2;其中,R是磷酸酯取代基,例如,-H、C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是-H、C1-7烷基基团或C5-20芳基基团。磷酸酯基团的实例包括但不限于-OP(=O)(OCH3)2、-OP(=O)(OCH2CH3)2、-OP(=O)(O-t-Bu)2和-OP(=O)(OPh)2。
亚磷酸:-OP(OH)2。
亚磷酸酯:-OP(OR)2;其中,R是亚磷酸酯取代基,例如,-H、C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是-H、C1-7烷基基团或C5-20芳基基团。亚磷酸酯基团的实例包括但不限于-OP(OCH3)2、-OP(OCH2CH3)2、-OP(O-t-Bu)2和-OP(OPh)2。
亚磷酰胺:-OP(OR1)-NR2 2;其中,R1和R2是亚磷酰胺取代基,例如,-H、(可选取代的)C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是-H、C1-7烷基基团或C5-20芳基基团。亚磷酰胺基团的实例包括但不限于-OP(OCH2CH3)-N(CH3)2、-OP(OCH2CH3)-N(i-Pr)2和-OP(OCH2CH2CN)-N(i-Pr)2。
氨基磷酸酯:-OP(=O)(OR1)-NR2 2;其中,R1和R2是氨基磷酸酯取代基,例如,-H、(可选取代的)C1-7烷基基团、C3-20杂环基基团或C5-20芳基基团,优选地是-H、C1-7烷基基团或C5-20芳基基团。氨基磷酸酯基团的实例包括但不限于-OP(=O)(OCH2CH3)-N(CH3)2、-OP(=O)(OCH2CH3)-N(i-Pr)2和-OP(=O)(OCH2CH2CN)-N(i-Pr)2。
亚烷基
C3-12亚烷基:如本文所使用的,术语“C3-12亚烷基”涉及通过从具有3至12个碳原子(除非另外指定)的烃化合物的相同碳原子除去两个氢原子或从两个不同碳原子各除去一个氢原子得到的二齿部分,其可以是脂肪族或脂环族的并且可以是饱和、部分不饱和或完全不饱和的。因此,术语“亚烷基”包括以下讨论的亚类:亚烯基、亚炔基、环亚烷基等。
直链饱和C3-12亚烷基基团的实例包括但不限于-(CH2)n-,其中,n是3至12的整数,例如,-CH2CH2CH2-(亚丙基)、-CH2CH2CH2CH2-(亚丁基)、-CH2CH2CH2CH2CH2-(亚戊基)和-CH2CH2CH2CH2CH2CH2CH2-(亚庚基)。
支链饱和C3-12亚烷基基团的实例包括但不限于-CH(CH3)CH2-、-CH(CH3)CH2CH2-、-CH(CH3)CH2CH2CH2-、-CH2CH(CH3)CH2-、-CH2CH(CH3)CH2CH2-、-CH(CH2CH3)-、-CH(CH2CH3)CH2-和-CH2CH(CH2CH3)CH2-。
直链部分不饱和C3-12亚烷基基团(C3-12亚烯基和亚炔基基团)的实例包括但不限于-CH=CH-CH2-、-CH2-CH=CH2-、-CH=CH-CH2-CH2-、-CH=CH-CH2-CH2-CH2-、-CH=CH-CH=CH-、-CH=CH-CH=CH-CH2-、-CH=CH-CH=CH-CH2-CH2-、-CH=CH-CH2-CH=CH-、-CH=CH-CH2-CH2-CH=CH-和-CH2-C≡C-CH2-。
支链部分不饱和C3-12亚烷基基团(C3-12亚烯基和亚炔基基团)的实例包括但不限于-C(CH3)=CH-、-C(CH3)=CH-CH2-、-CH=CH-CH(CH3)-和-C≡C-CH(CH3)-。
脂环族饱和C3-12亚烷基基团(C3-12环亚烷基)的实例包括但不限于亚环戊基(例如,环戊基-1,3-亚基)和亚环己基(例如,环己基-1,4-亚基)。
脂环族部分不饱和C3-12亚烷基基团(C3-12环亚烷基)的实例包括但不限于亚环戊烯基(例如,4-环戊烯-1,3-亚基)、亚环己烯基(例如,2-环己烯-1,4-亚基;3-环己烯-1,2-亚基;2,5-环己二烯-1,4-亚基)。
氧保护基团:术语“氧保护基团”是指掩蔽羟基基团的部分,并且这些在本领域中是众所周知的。大量的合适基团描述于Greene,T.W.和Wuts,G.M.,Protective Groups inOrganic Synthesis,第3版,John Wiley&Sons,Inc.,1999的第23至200页,通过引用将其结合于此。特别感兴趣的类别包括甲硅烷基醚(例如,TMS、TBDMS)、取代的甲硅烷基醚(例如,THP)和酯(例如,乙酸酯)。
氨基甲酸酯氮保护基团:术语“氨基甲酸酯氮保护基团”涉及掩蔽亚胺键中的氮的部分,并且它们在本领域中是众所周知的。这些基团具有以下结构:
其中,R’10是如以上定义的R。大量的合适基团描述于Greene,T.W.和Wuts,G.M.,Protective Groups in Organic Synthesis,第3版,John Wiley&Sons,Inc.,1999的第503至549页,通过引用将其结合于此。
半缩醛胺氮保护基团:术语“半缩醛胺氮保护基团”涉及具有以下结构的基团:
其中,R’10是如以上定义的R。大量的作为酰胺保护基团的合适基团描述于Greene,T.W.和Wuts,G.M.,Protective Groups in Organic Synthesis,第3版,John Wiley&Sons,Inc.,1999的第633至647页,通过引用将其结合于此。
结合物
本发明提供了包含通过接头单元连接至配体单元的PBD二聚物的结合物。在一个实施方式中,接头单元包含延伸子单元(A)、特异性单元(L1)和间隔子单元(L2)。接头单元在一端连接至配体单元(L)并在另一端连接至PBD二聚物化合物(D)。
一方面,以下在式Va中示出了这种结合物:
L-(A1 a-L1 s-L2 y-D)p (Va),
或其药用盐或溶剂化物,其中:
L是配体单元;并且
-A1 a-L1 s-L2 y-是接头单元(LU),其中:
-A1-是延伸子单元,
a是1或2,
-L1-是特异性单元,
s是0至12范围的整数,
-L2-是间隔子单元,
y是0、1或2;
-D是PBD二聚物;并且
p是1至20。
在另一方面,以下在式Vb中示出了这种结合物:
还示出为:
L-(A1 a-L2 y(-L1 s)-D)p (Vb),
或其药用盐或溶剂化物,其中:
L是配体单元;并且
-A1 a-L1 s(L2 y)-是接头单元(LU),其中:
-A1-是连接至间隔子单元(L2)的延伸子单元,
a是1或2,
-L1-是连接至间隔子单元(L2)的特异性单元,
s是0至12范围的整数,
-L2-是间隔子单元,
y是0、1或2;
-D是PBD二聚物;并且
p是1至20。
优选项
以下优选项可以适用于以上描述的发明的所有方面或可涉及单一方面。该优选项可以以任何组合组合在一起。
在一个实施方式中,结合物具有下式:
L-(A1 a-L1 s-L2 y-D)p、
L-(A1 a-Ls 1-D)p、
L-(A1-L1-D)p或
L-(A1-D)p,
或它们的药用盐或溶剂化物,其中,L、A1、a、L1、s、L2、D、y和p如以上所描述。
本发明适用于提供PBD化合物至受试者中优选的位点的用途。在优选的实施方式中,结合物允许释放不保留任何接头部分的活性PBD化合物。不存在可以影响PBD化合物的反应活性的残留(stub)。
在某些实施方式中,本发明提供了包含PBD二聚物基团的结合物,该PBD二聚物基团具有连接至细胞结合剂的接头。本发明人在本文中描述了合成方法,其能够制备这种二聚物结合物。
接头通过共价键将配体单元(L),例如抗体,附接至PBD药物部分D。接头是双官能的或多官能的部分,可被用于连接一个或多个药物部分(D)和抗体单元(Ab)以形成抗体-药物结合物(ADC)。接头在细胞外侧(即,细胞外)可以是稳定的,或它是通过酶活性、水解或其他新陈代谢条件可切割的。使用具有用于结合至药物部分和抗体的反应官能团的接头可以方便地制备抗体-药物结合物(ADC)。抗体(Ab)的半胱氨酸硫醇或胺,例如N端或氨基酸侧链(如赖氨酸),可以与接头或间隔子试剂、PBD药物部分(D)或药物-接头试剂(D-RL)的官能团形成键。
ADC的接头优选地防止ADC分子的聚集,并且保持ADC易溶于水性介质以及将其保持在单体状态。
ADC的接头优选地在细胞外稳定。在输送或递送至细胞之前,抗体-药物结合物(ADC)优选地是稳定的并且保持完整,即,抗体保持连接至药物部分。接头在靶细胞外是稳定的,并且在细胞内可以以有效速率切割。有效的接头将:(i)保持抗体的特异性结合性质;(ii)允许结合物或药物部分的细胞内递送;(iii)保持稳定和完整,即,直到结合物被递送或输送至它的靶向位点才切割;以及(iv)保持PBD药物部分的细胞毒性作用、细胞杀死作用或细胞抑制作用。可以通过标准分析技术如质谱、HPLC和分离/分析技术LC/MS测量ADC的稳定性。
抗体和药物部分的共价附接需要接头具有两个反应性官能团,即,反应意义上的二价。可用于连接两个或多个官能部分或生物活性部分如肽、核酸、药物、毒素、抗体、半抗原和报告基团的二价接头是已知的,并且已经有方法描述了它们得到的结合物(Hermanson,G.T.(1996)Bioconjugate Techniques;Academic Press:New York,p 234-242)。
在另一个实施方式中,接头可以由调节聚集、溶解度或反应性的基团取代。例如,磺酸酯取代基可以增加试剂的水溶解度并且促进接头试剂与抗体或药物部分的偶联反应,或促进Ab-L与D或D-L与Ab的偶联反应,这取决于制备ADC使用的合成途径。
在一个实施方式中,配体单元(L)是细胞结合剂(CBA),其特异性地结合至靶细胞的表面上的靶分子。示例性的式示出在以下:
其中,星号表示附接至药物单元(D)的点,CBA是细胞结合剂,L1是特异性单元,A1是连接L1至细胞结合剂的延伸子单元,L2是间隔子单元,其是共价键、自分解的基团或与-OC(=O)-一起形成自分解的基团,且L2是可选的。视情况,可以将OC(=O)-视为L1或L2的部分。
在另一个实施方式中,配体单元(L)是细胞结合剂(CBA),其特异性地结合至靶细胞的表面上的靶分子。示例性的式示出在以下:
CBA–A1 a–L1 s–L2 y–*,
其中,星号表示附接至药物单元(D)的点,CBA是细胞结合剂,L1是特异性单元,A1是将L1连接至细胞结合剂的延伸子单元,L2是间隔子单元,其是共价键或自分解的基团,并且a是1或2,s是0、1或2,并且y是0或1或2。
在以上示出的实施方式中,L1可以是可切割的特异性单元,并且当存在一个或多个自分解的基团时,当切割活化自分解的基团(或多个自分解的基团)L2时,L1可以称为“触发器”。当特异性单元L1被切割或当L1和L2之间的连接键(即,共价键)被切割时,自分解的基团释放药物单元(D)。
在另一个实施方式中,配体单元(L)是细胞结合剂(CBA),其特异性地结合至靶细胞的表面上的靶分子。示例性的式示出在以下:
其中,星号表示附接至药物(D)的点,CBA是细胞结合剂,L1是连接至L2的特异性单元,A1是将L2连接至细胞结合剂的延伸子单元,L2是自分解的基团,并且a是1或2,s是1或2,且y是1或2。
在本文讨论的各个实施方式中,L1和L2的性质可以广泛地变化。这些基团是基于它们的特征来选择的,其可以部分地由递送结合物的位点的条件来决定。在特异性单元L1是可切割的情况下,选择L1的结构和/或序列以便其由存在于靶位点(例如,靶细胞)的酶的作用切割。还可以使用可通过pH的改变(例如,酸或碱的不稳定性)、温度的改变或通过辐射(例如,光不稳定性)而切割的L1单元。在还原条件或氧化条件下可切割的L1单元也可以在结合物中使用。
在一些实施方式中,L1可以包含一个氨基酸或氨基酸的连续序列。氨基酸序列可以是酶的靶底物。
在一个实施方式中,L1可由酶的作用下切割。在一个实施方式中,酶是酯酶或肽酶。例如,L1可由溶酶体蛋白酶如组织蛋白酶切割。
在一个实施方式中,存在L2并且其与-C(=O)O-一起形成一个或多个自分解的基团。在一些实施方式中,-C(=O)O-也是自分解的基团。
在一个实施方式中,在L1可由酶的作用切割且存在L2的情况下,酶切割L1和L2之间的键,由此自分解的基团释放药物单元。
L1和L2(当存在时)可以通过选自以下的键连接:
-C(=O)NH-、
-C(=O)O-、
-NHC(=O)-、
-OC(=O)-、
-OC(=O)O-、
-NHC(=O)O-、
-OC(=O)NH-、
-NHC(=O)NH和
-O-(糖苷键)。
连接至L2的L1的氨基基团可以是氨基酸的N端,或可以源自氨基酸侧链(例如赖氨酸的氨基酸侧链)的氨基基团。
连接至L2的L1的羧基基团可以是氨基酸的C端,或可以源自氨基酸侧链(例如谷氨酸的氨基酸侧链)的羧基基团。
连接至L2的L1的羟基基团可以源自氨基酸侧链(例如丝氨酸的氨基酸侧链)的羟基基团。
在一个实施方式中,-C(=O)O-和L2一起形成以下基团:
其中,星号表示附接至药物单元的点,波浪线表示附接至L1的点,Y是-N(H)-、-O-、-C(=O)N(H)-或-C(=O)O-,并且n是0至3。亚苯基环可选地被本文所描述的一个、两个或三个取代基取代。
在一个实施方式中,Y是NH。
在一个实施方式中,n是0或1。优选地,n是0。
在Y是NH并且n是0的情况下,自分解的接头可以被称为对氨基苄基羰基接头(PABC)。
当远端位点被活化时,自分解的接头将允许释放药物单元(即,非对称PBD),其沿以下所示的路线进行(n=0):
其中,星号表示至药物的附接,L*是接头剩余部分的活化形式,且释放的药物单元未示出。这些基团具有将活化的位点与药物分离的优点。
在另一个实施方式中,-C(=O)O-和L2一起形成选自以下的基团:
其中,星号、波形线、Y和n如以上所定义。每个亚苯基环可选地被本文所描述的一个、两个或三个取代基取代。在一个实施方式中,具有Y取代基的亚苯基环是可选取代的并且不具有Y取代基的亚苯基环是未取代的。
在另一个实施方式中,-C(=O)O-和L2一起形成选自以下的基团:
其中,星号、波形线、Y和n如以上所定义的,E是O、S或NR,D是N、CH或CR,并且F是N、CH或CR。
在一个实施方式中,D是N。
在一个实施方式中,D是CH。
在一个实施方式中,E是O或S。
在一个实施方式中,F是CH。
在一个优选的实施方式中,L1和L2之间的共价键是组织蛋白酶不稳定(即,可切割)的键。
在一个实施方式中,L1包含二肽。二肽中的氨基酸可以是天然氨基酸和非天然氨基酸的任意组合。在一些实施方式中,二肽包含天然氨基酸。在接头是组织蛋白酶不稳定的接头的情况下,二肽是组织蛋白酶介导的切割的作用位点。然后二肽是组织蛋白酶的识别位点。
在一个实施方式中,二肽-NH-X1-X2-CO-中的基团-X1-X2-选自:
-Phe-Lys-、
-Val-Ala-、
-Val-Lys-、
-Ala-Lys-、
-Val-Cit-、
-Phe-Cit-、
-Leu-Cit-、
-Ile-Cit-、
-Phe-Arg-和
-Trp-Cit-;
其中,Cit是瓜氨酸。在这种二肽中,-NH-是X1的氨基基团,并且CO是X2的羰基基团。
优选地,二肽-NH-X1-X2-CO-中的基团-X1-X2-选自:
-Phe-Lys-、
-Val-Ala-、
-Val-Lys-、
-Ala-Lys-和
-Val-Cit-。
最优选地,二肽-NH-X1-X2-CO-中的基团-X1-X2-是-Phe-Lys-、Val-Cit或-Val-Ala-。
其它感兴趣的二肽组合包括:
-Gly-Gly-、
-Pro-Pro-和
-Val-Glu-。
可以使用其它二肽组合,包括Dubowchik等人描述的那些,通过引用将其结合于此。
在一个实施方式中,在适当情况下,氨基酸侧链是受化学保护的。侧链保护基团可以是以下讨论的基团。受保护的氨基酸序列可以由酶切割。例如,包含Boc侧链保护的Lys残基的二肽序列可以被组织蛋白酶切割。
用于氨基酸侧链的保护基团在本领域中是众所周知的并且描述在Novabiochem目录中。在Greene和Wuts的Protective groups in Organic Synthesis中示出了其它保护基团策略。
对于具有反应性侧链官能团的那些氨基酸,以下示出了可能的侧链保护基团:
Arg:Z、Mtr、Tos;
Asn:Trt、Xan;
Asp:Bzl、t-Bu;
Cys:Acm、Bzl、Bzl-OMe、Bzl-Me、Trt;
Glu:Bzl、t-Bu;
Gln:Trt、Xan;
His:Boc、Dnp、Tos、Trt;
Lys:Boc、Z-Cl、Fmoc、Z;
Ser:Bzl、TBDMS、TBDPS;
Thr:Bz;
Trp:Boc;
Tyr:Bzl、Z、Z-Br。
在一个实施方式中,-X2-间接连接至药物单元。在这种实施方式中,存在间隔子单元L2。
在一个实施方式中,将二肽与一个或多个自分解的基团(间隔子单元)组合使用。自分解的基团可以连接至-X2-。
在存在自分解的基团的情况下,-X2-直接连接至自分解的基团。在一个实施方式中,-X2-连接至自分解的基团的基团Y。优选地,基团-X2-CO-连接至Y,此处Y是NH。
在一个实施方式中,-X1-直接连接至A1。优选地,基团NH-X1-(X1的氨基端)连接至A1。A1可以包含官能团-CO-,从而形成与-X1-的酰胺键。
在一个实施方式中,L1和L2与-OC(=O)-一起包含基团-X1-X2-PABC-。PABC基团直接连接至药物单元。在一个实施例中,自分解的基团和二肽一起形成基团-Phe-Lys-PABC-,其在以下示出:
其中,星号表示附接至药物单元的点,且波浪线表示附接至L1的剩余部分的点或附接至A1的点。优选地,波浪线表示附接至A1的点。
可替代地,自分解的基团和二肽一起形成基团-Val-Ala-PABC-,其在以下示出:
其中,星号和波形线如以上所定义。
在一个实施方式中,L1和L2以及-OC(=O)-一起代表:
其中,星号表示附接至药物单元的点,波浪线表示附接至A1的点,Y是共价键或官能团,且E是易于切割从而活化自分解的基团的基团。
选择E使得基团易由例如光或酶的作用切割。E可以是-NO2或葡萄糖醛酸(例如,β-葡萄糖醛酸)。前者可能易受硝基还原酶的作用,后者可能易受β-葡萄糖醛酸酶的作用。
基团Y可以是共价键。
基团Y可以是选自以下的官能团:
-C(=O)-、
-NH-、
-O-、
-C(=O)NH-、
-C(=O)O-、
-NHC(=O)-、
-OC(=O)-、
-OC(=O)O-、
-NHC(=O)O-、
-OC(=O)NH-、
-NHC(=O)NH-、
-NHC(=O)NH、
-C(=O)NHC(=O)-、
SO2和
-S-。
基团Y优选地是-NH-、-CH2-、-O-和-S-。
在一些实施方式中,L1和L2与-OC(=O)-一起代表:
其中,星号表示附接至药物单元的点,波浪线表示附接至A的点,Y是共价键或官能团且E是葡萄糖醛酸(例如,β-葡萄糖醛酸)。Y优选地是选自-NH-的官能团。
在一些实施方式中,L1和L2一起代表:
其中,星号表示附接至L2的剩余部分或药物单元的点,波浪线表示附接至A1的点,Y是共价键或官能团且E是葡萄糖醛酸(例如,β-葡萄糖醛酸)。Y优选地是选自-NH-、-CH2-、-O-和-S-的官能团。
在一些进一步的实施方式中,Y是以上示出的官能团,该官能团连接至氨基酸,且该氨基酸连接至延伸子单元A1。在一些实施方式中,氨基酸是β-丙氨酸。在这种实施方式中,等效地认为氨基酸是延伸子单元的一部分。
特异性单元L1和配体单元通过延伸子单元间接地连接。
L1和A1可以通过选自以下的键连接:
-C(=O)NH-、
-C(=O)O-、
-NHC(=O)-、
-OC(=O)-、
-OC(=O)O-、
-NHC(=O)O-、
-OC(=O)NH-和
-NHC(=O)NH-。
在一个实施方式中,基团A1是:
其中,星号表示附接至L1的点,波浪线表示附接至配体单元的点,并且n是0至6。在一个实施方式中,n是5。
在一个实施方式中,基团A1是:
其中,星号表示附接至L1的点,波浪线表示附接至配体单元的点,并且n是0至6。在一个实施方式中,n是5。
在一个实施方式中,基团A1是:
其中,星号表示附接至L1的点,波浪线表示附接至配体单元的点,n是0或1,并且m是0至30。在一个优选的实施方式中,n是1并且m是0至10、1至8、优选地4至8、最优选地4或8。
在一个实施方式中,基团A1是:
其中,星号表示附接至L1的点,波浪线表示附接至配体单元的点,n是0或1,并且m是0至30。在一个优选的实施方式中,n是1并且m是0至10、1至8、优选地4至8、最优选地4或8。
在一个实施方式中,基团A1是:
其中,星号表示附接至L1的点,波浪线表示附接至配体单元的点,并且n是0至6。在一个实施方式中,n是5。
在一个实施方式中,基团A1是:
其中,星号表示附接至L1的点,波浪线表示附接至配体单元的点,并且n是0至6。在一个实施方式中,n是5。
在一个实施方式中,基团A1是:
其中,星号表示附接至L1的点,波浪线表示附接至配体单元的点,n是0或1,并且m是0至30。在一个优选的实施方式中,n是1并且m是0至10、1至8、优选地4至8、最优选地4或8。
在一个实施方式中,基团A1是:
其中,星号表示附接至L1的点,波浪线表示附接至配体单元的点,n是0或1,并且m是0至30。在一个优选的实施方式中,n是1并且m是0至10、1至8、优选地4至8、最优选地4或8。
在一个实施方式中,配体单元和A1之间通过配体单元的硫醇残基和A1的马来酰亚胺基团连接。
在一个实施方式中,配体单元和A1之间的连接是:
其中,星号表示附接至A1、L1、L2或D的剩余部分的点,且波浪线表示附接至配体单元的剩余部分的点。在这个实施方式中,S原子通常源自配体单元。
在以上实施方式的每一个中,可以使用可替代的官能团代替以下示出的源自马来酰亚胺的基团:
其中,波浪线像以前那样表示附接至配体单元的点,并且星号表示至A1基团的剩余部分或至L1、L2或D的键。
在一个实施方式中,用以下基团代替源自马来酰亚胺的基团:
其中,波浪线表示附接至配体单元的点,并且星号表示至A1基团的剩余部分或至L1、L2或D的键。
在一个实施方式中,源自马来酰亚胺的基团被基团代替,该基团可选地与配体单元(例如,细胞结合剂)一起选自:
-C(=O)NH-、
-C(=O)O-、
-NHC(=O)-、
-OC(=O)-、
-OC(=O)O-、
-NHC(=O)O-、
-OC(=O)NH-、
-NHC(=O)NH-、
-NHC(=O)NH、
-C(=O)NHC(=O)-、
-S-、
-S-S-、
-CH2C(=O)-、
-C(=O)CH2-、
=N-NH-和
-NH-N=。
在这些中,尤其当羰基基团结合至-NH-时,-C(=O)CH2-可以是优选的。
在一个实施方式中,源自马来酰亚胺的基团被基团代替,该基团可选地与配体单元一起选自:
其中,波浪线表示附接至配体单元或结合至A1基团的剩余部分的点,并且星号表示附接至配体单元或结合至A1基团的剩余部分的另一个点。
WO 2005/082023中描述了适用于将L1连接至细胞结合剂的其他基团。
在一个实施方式中,存在延伸子单元A1,存在特异性单元L1,而不存在间隔子单元L2。因此,L1和药物单元通过键直接连接。等同于在该实施方式中,L2是键。
L1和D可以通过选自以下的键连接:
-C(=O)N<,
-C(=O)O-、
-NHC(=O)-、
-OC(=O)-、
-OC(=O)O-、
-NHC(=O)O-、
-OC(=O)N<和
-NHC(=O)N<,
其中,N<或O-是D的一部分。
在一个实施方式中,L1和D通过选自以下的键连接:
-C(=O)N<和
-NHC(=O)-。
在一个实施方式中,L1包含二肽并且二肽的一端连接至D。如以上所描述的,二肽中的氨基酸可以是天然氨基酸和非天然氨基酸的任意组合。在一些实施方式中,二肽包含天然氨基酸。在接头是组织蛋白酶不稳定的接头的情况下,二肽是组织蛋白酶介导的切割的作用位点。然后二肽是组织蛋白酶的识别位点。
在一个实施方式中,二肽-NH-X1-X2-CO-中的基团-X1-X2-选自:
-Phe-Lys-、
-Val-Ala-、
-Val-Lys-、
-Ala-Lys-、
-Val-Cit-、
-Phe-Cit-、
-Leu-Cit-、
-Ile-Cit-、
-Phe-Arg-和
-Trp-Cit-;
其中,Cit是瓜氨酸。在这种二肽中,-NH-是X1的氨基基团,并且CO是X2的羰基基团。
优选地,二肽-NH-X1-X2-CO-中的基团-X1-X2-选自:
-Phe-Lys-、
-Val-Ala-、
-Val-Lys-、
-Ala-Lys-和
-Val-Cit-。
更优选地,二肽-NH-X1-X2-CO-中的基团-X1-X2-是-Phe-Lys-或-Val-Ala-。
其它感兴趣的二肽组合包括:
-Gly-Gly-、
-Pro-Pro-和
-Val-Glu-。
可以使用其它二肽组合,包括以上描述的那些。
在一个实施方式中,L1-D是:
其中,-NH-X1-X2-CO是二肽,-N<是药物单元的一部分,星号表示附接至药物单元的剩余部分的点,并且波浪线表示附接至L1的剩余部分的点或附接至A1的点。优选地,波浪线表示附接至A1的点。
在一个实施方式中,二肽是缬氨酸-丙氨酸并且L1-D是:
其中星号、-N<和波浪线如以上所定义。
在一个实施方式中,二肽是苯丙氨酸-赖氨酸并且L1-D是:
其中星号、-N<和波浪线如以上所定义。
在一个实施方式中,二肽是缬氨酸-瓜氨酸。
在一个实施方式中,基团A1-L1是:
其中,星号表示附接至L2或D的点,波浪线表示附接至配体单元的点,并且n是0至6。在一个实施方式中,n是5。
在一个实施方式中,基团A1-L1是:
其中,星号表示附接至L2或D的点,波浪线表示附接至配体单元的点,并且n是0至6。在一个实施方式中,n是5。
在一个实施方式中,基团A1-L1是:
其中,星号表示附接至L2或D的点,波浪线表示附接至配体单元的点,n是0或1,并且m是0至30。在一个优选的实施方式中,n是1并且m是0至10、1至8、优选地4至8、最优选地4或8。
在一个实施方式中,基团A1-L1是:
其中,星号表示附接至L2或D的点,波浪线表示附接至配体单元的点,n是0或1,并且m是0至30。在一个优选的实施方式中,n是1并且m是0至10、1至7、优选地3至7、最优选地3或7。
在一个实施方式中,基团A1-L1是:
其中,星号表示附接至L2或D的点,波浪线表示附接至配体单元的点,并且n是0至6。在一个实施方式中,n是5。
在一个实施方式中,基团A1-L1是:
其中,星号表示附接至L2或D的点,波浪线表示附接至配体单元的点,并且n是0至6。在一个实施方式中,n是5。
在一个实施方式中,基团A1-L1是:
其中,星号表示附接至L2或D的点,波浪线表示附接至配体单元的点,n是0或1,并且m是0至30。在一个优选的实施方式中,n是1并且m是0至10、1至8、优选地4至8、最优选地4或8。
在一个实施方式中,基团A1-L1是:
其中,星号表示附接至L2或D的点,波浪线表示附接至配体单元的点,n是0或1,并且m是0至30。在一个优选的实施方式中,n是1并且m是0至10、1至8、优选地4至8、最优选地4或8。
在一个实施方式中,基团L-A1-L1是:
其中,星号表示附接至L2或D的点,S是配体单元的硫基团,波浪线表示附接至配体单元的其余部分的点,并且n是0至6。在一个实施方式中,n是5。
在一个实施方式中,基团L-A1-L1是:
其中,星号表示附接至L2或D的点,S是配体单元的硫基团,波浪线表示附接至配体单元的剩余部分的点,并且n是0至6。在一个实施方式中,n是5。
在一个实施方式中,基团L-A1-L1是:
其中,星号表示附接至L2或D的点,S是配体单元的硫基团,波浪线表示附接至配体单元的剩余部分的点,n是0或1,并且m是0至30。在一个优选的实施方式中,n是1并且m是0至10、1至8、优选地4至8、最优选地4或8。
在一个实施方式中,基团L-A1-L1是:
其中,星号表示附接至L2或D的点,波浪线表示附接至配体单元的点,n是0或1,并且m是0至30。在一个优选的实施方式中,n是1并且m是0至10、1至7、优选地4至8、最优选地4或8。
在一个实施方式中,基团L-A1-L1是:
其中,星号表示附接至L2或D的点,波浪线表示附接至配体单元的剩余部分的点,并且n是0至6。在一个实施方式中,n是5。
在一个实施方式中,基团L-A1-L1是:
其中,星号表示附接至L2或D的点,波浪线表示附接至配体单元的剩余部分的点,并且n是0至6。在一个实施方式中,n是5。
在一个实施方式中,基团L-A1-L1是:
其中,星号表示附接至L2或D的点,波浪线表示附接至配体单元的剩余部分的点,n是0或1,并且m是0至30。在一个优选的实施方式中,n是1并且m是0至10、1至8、优选地4至8、最优选地4或8。
在一个实施方式中,基团L-A1-L1是:
其中,星号表示附接至L2或D的点,波浪线表示附接至配体单元的剩余部分的点,n是0或1,并且m是0至30。在一个优选的实施方式中,n是1并且m是0至10、1至8、优选地4至8、最优选地4或8。
在一个实施方式中,延伸子单元是具有下式的乙酰胺单元:
其中,星号表示附接至延伸子单元的剩余部分、L1或D的点,并且波浪线表示附接至配体单元的点。
接头-药物
在其它实施方式中,提供了用于结合至配体单元的接头-药物化合物。在一个实施方式中,将接头-药物化合物设计为用于连接细胞结合剂。
在一个实施方式中,药物接头化合物具有下式:
其中,星号表示附接至药物单元(D,如以上定义)的点,G1是形成与配体单元的连接的延伸子基团(A1),L1是特异性单元,L2(间隔子单元)是共价键或与-OC(=O)-一起形成自分解的基团。
在另一个实施方式中,药物接头化合物具有下式:
G1-L1-L2-*,
其中,星号表示附接至药物单元(D)的点,G1是形成与配体单元的连接的延伸子单元(A1),L1是特异性单元,L2(间隔子单元)是共价键或自分解的基团。
L1和L2如以上所定义。提及至A1的连接在这里可以解释为提及至G1的连接。
在一个实施方式中,在L1包含氨基酸的情况下,该氨基酸的侧链可以是受保护的。可以使用任何合适的保护基团。在一个实施方式中,如果存在,可用该化合物中的其它保护基团移除侧链保护基团。在其它实施方式中,如果存在,保护基团与分子中的其它保护基团可以是正交的(相邻的,垂直的,orthogonal)。
氨基酸侧链的合适的保护基团包括描述于Novabiochem目录2006/2007中的那些。在Dubowchik等人中也讨论了在组织蛋白酶不稳定接头中使用的保护基团。
在本发明的某些实施方式中,基团L1包含Lys氨基酸残基。可以用Boc或Alloc保护基团保护该氨基酸的侧链。Boc保护基团是最优选的。
官能团G1一旦与配体单元(例如,细胞结合剂)反应就形成连接基团。
在一个实施方式中,官能团G1是或包含氨基、羧酸、羟基、硫醇或马来酰亚胺基团,以与配体单元上的适合的基团反应。在一个优选的实施方式中,G1包含马来酰亚胺基团。
在一个实施方式中,基团G1是烷基马来酰亚胺基团。该基团适用于与存在于细胞结合剂(例如,存在于抗体)中的硫醇基团反应,具体地与半胱氨酸硫醇基团反应。
在一个实施方式中,基团G1是:
其中,星号表示附接至L1、L2或D的点,并且n是0至6。在一个实施方式中,n是5。
在一个实施方式中,基团G1是:
其中,星号表示附接至L1、L2或D的点,并且n是0至6。在一个实施方式中,n是5。
在一个实施方式中,基团G1是:
其中,星号表示附接至L1的点,n是0或1,并且m是0至30。在一个优选的实施方式中,n是1并且m是0至10、1至2、优选地4至8、以及最优选地4或8。
在一个实施方式中,基团G1是:
其中,星号表示附接至L1的点,n是0或1,并且m是0至30。在一个优选的实施方式中,n是1并且m是0至10、1至8、优选地4至8、以及最优选地4或8。
在一个实施方式中,基团G1是:
其中,星号表示附接至L1、L2或D的点,并且n是0至6。在一个实施方式中,n是5。
在一个实施方式中,基团G1是:
其中,星号表示附接至L1、L2或D的点,并且n是0至6。在一个实施方式中,n是5。
在一个实施方式中,基团G1是:
其中,星号表示附接至L1的点,n是0或1,并且m是0至30。在一个优选的实施方式中,n是1并且m是0至10、1至2、优选地4至8、以及最优选地4或8。
在一个实施方式中,基团G1是:
其中,星号表示附接至L1的点,n是0或1,并且m是0至30。在一个优选的实施方式中,n是1并且m是0至10、1至8、优选地4至8、以及最优选地4或8。
在以上实施方式的每一个中,可以使用可替代的官能团来替代以下所示的马来酰亚胺基团:
其中,星号表示与G基团的剩余部分的键。
在一个实施方式中,用以下基团代替源自马来酰亚胺的基团:
其中,星号表示与G基团的剩余部分的键。
在一个实施方式中,用选自以下的基团代替马来酰亚胺基团:
-C(=O)OH、
-OH、
-NH2、
-SH、
-C(=O)CH2X(其中,X是Cl、Br或I)、
-CHO、
-NHNH2、
-C≡CH和
-N3(叠氮化物)。
在这些中,尤其当羰基基团结合至-NH-时,-C(=O)CH2X可以是优选的。
在一个实施方式中,存在L1,并且G1是-NH2、-NHMe、-COOH、-OH或-SH。
在一个实施方式中,在存在L1的情况下,G1是-NH2或-NHMe。每个基团可以是L1氨基酸序列的N端。
在一个实施方式中,存在L1并且G1是-NH2,且L1是以上定义的氨基酸序列-X1-X2-。
在一个实施方式中,存在L1且G1是COOH。该基团可以是L1氨基酸序列的C端。
在一个实施方式中,存在L1且G1是OH。
在一个实施方式中,存在L1且G1是SH。
基团G1可以是从一个官能团向另一个官能团可转化的。在一个实施方式中,存在L1且G1是-NH2。该基团是向包含马来酰亚胺基团的另一个基团G1可转化的。例如,基团-NH2可以与包含以上所示的马来酰亚胺的那些G1基团的酸或活化的酸(例如,N-琥珀酰亚胺形式)反应。
因此,可以将基团G1转化为更适合与配体单元反应的官能团。
如以上所提及的,在一个实施方式中,存在L1,并且G1是-NH2、-NHMe、-COOH、-OH或-SH。在一个进一步的实施方式中,以化学受保护形式提供这些基团。因此,化学受保护形式是提供有官能团的接头的前体。
在一个实施方式中,G1是化学受保护形式的-NH2。可以用氨基甲酸酯保护基团保护该基团。氨基甲酸酯保护基团可以选自由以下组成的组:
Alloc、Fmoc、Boc、Troc、Teoc、Cbz和PNZ。
优选地,在G1是-NH2的情况下,其用Alloc或Fmoc基团保护。
在一个实施方式中,在G1是-NH2的情况下,其用Fmoc基团保护。
在一个实施方式中,保护基团与封端基团(capping group)的氨基甲酸酯保护基团相同。
在一个实施方式中,保护基团与封端基团的氨基甲酸酯保护基团不同。在这个实施方式中,优选的是保护基团在不除去封端基团的氨基甲酸酯保护基团的条件下是可除去的。
可以除去化学保护基团以提供与配体单元形成连接的官能团。可选地,该官能团然后可转化为以上所描述的另一种官能团。
在一个实施方式中,活性基团是胺。该胺优选地是肽的N端,并可以是本发明优选的二肽的N端。
可以使活性基团反应以产生旨在与配体单元形成连接的官能团。
在其它实施方式中,接头单元是具有活性基团的接头单元的前体。在这个实施方式中,接头单元包含活性基团,该活性基团受保护基团保护。可以除去该保护基团以提供具有活性基团的接头单元。
在活性基团是胺的情况下,保护基团可以是胺保护基团,如Green和Wuts中描述的那些。
保护基团优选地与接头单元中的其它保护基团(如果存在)是正交的。
在一个实施方式中,保护基团与封端基团正交。因此,活性基团保护基团在保留封端基团时是可以除去的。在其它实施方式中,在与用于除去封端基团的那些条件相同的条件下,可以除去保护基团和封端基团。
在一个实施方式中,接头单元是:
其中,星号表示附接至药物单元的点,并且波浪线表示附接至接头单元的剩余部分的点或如果合适是附接至G1的点。优选地,波浪线表示附接至G1的点。
在一个实施方式中,接头单元是:
其中,星号和波形线如以上所定义。
在WO 2005/082023中描述了适用于形成L1和细胞结合剂之间的连接的其它官能团。
配体单元
配体单元可以是任何类型的,并且包括蛋白质、多肽、肽以及特异性结合至靶分子的非肽试剂。在一些实施方式中,配体单元可以是蛋白质、多肽或肽。在一些实施方式中,配体单元可以是环状的多肽。这些配体单元可以包括抗体或抗体的片段,其含有至少一个靶分子-结合位点、淋巴因子、激素、生长因子或可以特异性结合至靶的任何其它细胞结合分子或物质。该配体单元在本文中也称为“结合剂”或“靶向剂”。
术语“特异性地结合”和“特异性结合”是指抗体或其它蛋白质、多肽或肽与预定分子(例如,抗原)的结合。通常,抗体或其他分子以至少约1x107 M-1的亲和力结合,并且以比它结合至除预定分子或紧密相关分子之外的非特异性分子(例如,BSA、酪蛋白)的亲和力高至少两倍的亲和力结合至预定的分子。
配体单元的实例包括在WO 2007/085930中描述使用的那些试剂,通过引用将其结合于此。
在一些实施方式中,配体单元是结合细胞上的细胞外靶的细胞结合剂。这种细胞结合剂可以是蛋白质、多肽、肽或非肽试剂。在一些实施方式中,细胞结合剂可以是蛋白质、多肽或肽。在一些实施方式中,细胞结合剂可以是环状多肽。细胞结合剂还可以是抗体或抗体的抗原结合片段。因此,在一个实施方式中,本发明提供了抗体-药物结合物(ADC)。
肽
在一个实施方式中,细胞结合剂是包含4-30、优选地6-20个连续氨基酸残基的直链或环状肽。在该实施方式中,优选的是细胞结合剂连接至一个单体或二聚物吡咯并苯并二氮杂卓化合物。
在一个实施方式中,细胞结合剂包含结合整合蛋白ανβ6的肽。该肽通过XYS可能对ανβ6是有选择性的。
在一个实施方式中,细胞结合剂包含A20FMDV-Cys多肽。A20FMDV-Cys具有序列:NAVPNLRGDLQVLAQKVARTC。可替代地,可以使用A20FMDV-Cys序列的变体,其中,一个、两个、三个、四个、五个、六个、七个、八个、九个或十个氨基酸残基被另一氨基酸残基取代。此外,多肽可以具有序列NAVXXXXXXXXXXXXXXXRTC。
抗体
本文的术语“抗体”是在最广义上使用并且确切地覆盖单克隆抗体、多克隆抗体、二聚物、多聚物、多特异性抗体(例如,双特异性抗体)和抗体片段,只要它们表现出期望的生物活性(Miller et al(2003)Jour.of Immunology 170:4854-4861)。抗体可以是鼠科抗体、人类抗体、人源化抗体、嵌合抗体或来自其他物种的抗体。抗体是能够识别和结合至特异性抗原的由免疫系统产生的蛋白质。(Janeway,C.,Travers,P.,Walport,M.,Shlomchik(2001)Immuno Biology,5th Ed.,Garland Publishing,New York)。靶抗原通常具有许多结合位点,也称为表位,其由在多种抗体上的CDR所识别。特异性结合至不同表位的每种抗体具有不同的结构。因此,一种抗原可以具有一种以上的对应抗体。抗体包括全长免疫球蛋白分子或全长免疫球蛋白分子的免疫活性部分,即,包含免疫特异性结合感兴趣的靶的抗原或其部分的抗原结合位点的分子,这种靶包括但不限于癌细胞或产生与自身免疫疾病相关的自身免疫性抗体的细胞。免疫球蛋白可以是免疫球蛋白分子的任何类型(例如IgG、IgE、IgM、IgD和IgA)、类别(例如IgG1、IgG2、IgG3、IgG4、IgA1和IgA2)或子类别。免疫球蛋白可以源自任何物种,包括人源、鼠源或兔源。
"抗体片段"包括全长抗体的一部分,通常是其抗原结合区或可变区。抗体片段的实例包括Fab、Fab'、F(ab')2和scFv片段;双体;线性抗体;Fab表达文库产生的片段,抗独特型(抗-Id)抗体,CDR(互补决定区)和任何以上免疫特异性结合至癌细胞抗原、病毒抗原或微生物抗原的表位结合片段,单链抗体分子以及由抗体片段形成的多特异性抗体。
如本文中所使用的,术语“单克隆抗体”是指由基本上同源抗体的群体得到的抗体,即,除可以以少量存在的可能的天然存在的突变之外,包括该群体的单个抗体是相同的。单克隆抗体针对单个抗原位点是高度特异性的。此外,相对于多克隆抗体制剂,其包括针对不同决定簇(表位)的不同抗体,每个单克隆抗体针对抗原上的单个决定簇。除它们的特异性以外,单克隆抗体是有利的,因为它们可以被合成而不被其他抗体污染。修饰语“单克隆”表示由抗体的基本上同源群体得到的抗体的特性,并且不应当解释为需要通过任何特定方法来生产抗体。例如,通过Kohler et al(1975)Nature 256:495首次描述的杂交瘤方法可以制备,或可以通过DNA重组方法(参见US 4816567)制备根据本发明使用的单克隆抗体。也可以使用Clackson et al(1991)Nature,352:624-628;Marks et al(1991)J.Mol.Biol.,222:581-597中描述的技术从噬菌体抗体库或从携带完整人类免疫球蛋白系统的转基因小鼠(Lonberg(2008)Curr.Opinion 20(4):450-459)分离单克隆抗体。
本文的单克隆抗体确切地包括“嵌合”抗体以及这种抗体的片段,只要它们表现出期望的生物活性,其中,重链和/或轻链部分与源自特定物种的抗体中的对应序列相同或同源或属于特定特定抗体类别或子类别,而链的剩余部分与源自另一物种的抗体中的对应序列相同或同源或属于另一抗体类别或子类别(US 4816567;和Morrison et al(1984)Proc.Natl.Acad.Sci.USA,81:6851-6855)。嵌合抗体包括包含源自非人类灵长目动物(例如,旧大陆猴或猿猴)的可变结构域抗原结合序列和人类恒定区域序列的“灵长目化的(primatized)”抗体。
本文的“完整抗体”是包含VL和VH结构域以及轻链恒定结构域(CL)和重链恒定结构域CH1、CH2和CH3的抗体。恒定结构域可以是天然的序列恒定结构域(例如,人类天然序列恒定结构域)或它的氨基酸序列变体。完整抗体可以具有一种或多种“效应子功能”,其是指由抗体的Fc区域(天然序列Fc区域或氨基酸序列变体Fc区域)引起的生物活性。抗体效应子功能的实例包括C1q结合;补体依赖的细胞毒;Fc受体结合;抗体依赖的细胞介导的细胞毒(ADCC);吞噬作用;和细胞表面受体(如B细胞受体和BCR)的下调。
根据它们的重链的恒定结构域的氨基酸序列,可以将完整抗体分为不同“类别”。存在完整抗体的5种主要类别:IgA、IgD、IgE、IgG和IgM,并且这些中的一些可以进一步分为“子类别”(同型),例如,IgG1、IgG2、IgG3、IgG4、IgA和IgA2。对应于抗体的不同类别的重链恒定结构域分别称为α、δ、ε、γ和μ。免疫球蛋白的不同类别的子单元结构和三种尺寸构型是众所周知的。
人源化
降低非人类抗体或抗体片段的体内免疫原性的技术包括称作“人源化”的那些。
“人源化抗体”是指包含人类抗体的至少部分修饰的可变区域的多肽,其中,部分可变区域,优选地明显小于完整的人类可变结构域的部分,已经被来自非人类物种的对应序列置换,并且其中,修饰的可变区域连接至另一蛋白质的至少另一部分,优选地连接至人类抗体的恒定区。表达“人源化抗体”包括人类抗体,其中,一个或多个互补决定区(“CDR”)氨基酸残基和/或一个或多个框架区(“FW”或“FR”)氨基酸残基被来自啮齿动物或其他非人类抗体中的同功(analogous)位点置换。表达“人源化抗体”还包含免疫球蛋白氨基酸序列或它的片段,其包括基本具有人类免疫球蛋白的氨基酸序列的FR和基本具有非人类免疫球蛋白的氨基酸序列的CDR。
“人源化”形式的非人类(例如,鼠科)抗体是包含源自非人类免疫球蛋白的最小序列的嵌合抗体。或以另一种方式看,人源化抗体是还包含代替人类序列、来自非人类(例如,鼠科)抗体的选定序列的人类抗体。人源化抗体还包括不显著改变它的结合和/或生物活性的来自相同或不同物种的保守性氨基酸置换或非天然残基。这种抗体是包含源自非人类免疫球蛋白的最小序列的嵌合抗体。
存在大量人源化技术,包括‘CDR移植’、‘定向选择’、‘去免疫’、‘表面重建’(还称为‘外饰’(veneering))、‘复合抗体’、‘人类链含量优化(Human String ContentOptimisation)’和框架混合。
CDR移植
在该技术中,人源化抗体是人类免疫球蛋白(受体抗体),其中,来自受体抗体的互补决定区(CDR)的残基被具有期望性质的来自非人类物种如小鼠、大鼠、骆驼、牛、山羊或兔的CDR(供体抗体)取代(实际上,非人类CDR被‘移’至人类框架上)。在一些实施例中,人类免疫球蛋白的框架区(FR)残基被对应的非人类残基取代(例如,这可以发生在当特定的FR残基对结合抗原具有显著作用时)。
此外,人源化抗体可以包含在受体抗体或植入的CDR或框架序列中都不能找到的残基。进行这些修饰以进一步改善和最大化抗体性能。因此,一般来说,人源化抗体将包含至少一种以及在一方面中两种的全部可变结构域,其中,所有的高可变区环或全部的高可变区环对应于非人类免疫球蛋白的那些,并且所有或基本所有的FR区是人类免疫球蛋白序列的那些。人源化抗体可选地还将包含免疫球蛋白恒定区(Fc)的至少一部分或人类免疫球蛋白的至少一部分。
定向选择
该方法由以下组成:将对特定表位特异性的给定非人类抗体的VH或VL结构域与人类VH或VL文库组合并且针对感兴趣的抗原选择特异的人类V结构域。然后将该选定的人类VH与VL文库组合以生成完整的人类VHxVL组合。Nature Biotechnology(N.Y.)12,(1994)899-903中描述了该方法。
复合抗体
在该方法中,将来自人类抗体的氨基酸序列的两个或多个区段(segment)组合在最终的抗体分子内。它们通过将多个人类VH和VL序列区段组合在组合体中而构建,该组合体在最终的复合抗体V区中限制或避开人类T细胞表位。当需要时,通过用避开T细胞表位的可替代区段替换引起或编码T细胞表位的V区区段来限制或避开T细胞表位。US 2008/0206239 A1中描述了该方法。
去免疫
该方法涉及从治疗性抗体(或其他分子)的V区除去人类(或其他第二物种)T细胞表位。例如,通过与MHC结合基序(如,存储在www.wehi.edu.au中的“基序”数据库)比较,分析治疗性抗体V区序列中MHC类II结合基序的存在。可替代地,可以使用如Altuvia et al.(J.Mol.Biol.249 244-250(1995))设计的计算线程方法识别MHC类II结合基序;在这些方法中,针对来自V区序列的连续的重叠肽测试了它们与MHC类II蛋白质的结合能。然后将该数据与其他序列特征上的信息,该信息涉及成功地呈递肽,如两亲性、Rothbard基序和组织蛋白酶B和其他加工酶的切割位点。
当已经识别可能的第二物种(例如,人类)的T细胞表位时,通过改变一种或多种氨基酸消除它们。修饰的氨基酸通常在T细胞表位自身内,但是也可以临近蛋白质的初级或二级结构的表位(因此,在初级结构中可能是不临近的)。最常见地,改变是通过置换的方式,但是在一些情况中,氨基酸添加或删除更适当。
通过DNA重组技术可以实现所有改变,使得可以通过使用良好建立的方法(如定向诱变)由重组宿主的表达制备最终的分子。然而,也可以使用蛋白质化学或任何其他分子改变方式。
表面重建
该方法涉及:
(a)通过构建非人类抗体可变区的三维模型,确定非人类(例如,啮齿动物)抗体(或它的片段)的构象结构;
(b)使用相对可达到分布由足量的非人类和人类抗体可变区重链和轻链的x-射线晶体结构生成序列对齐,以给出一组重链和轻链框架位置,其中,对齐位置在98%足量的非人类抗体重链和轻链中是相同的;
(c)使用步骤(b)中生成的框架位置组定义待人源化的非人类抗体,一组重链和轻链表面暴露的氨基酸残基;
(d)由人类抗体氨基酸序列识别一组重链和轻链表面暴露的氨基酸残基,该氨基酸残基与步骤(c)中定义的一组表面暴露的氨基酸残基最为相似,其中,来自人类抗体的重链和轻链自然成对或不能自然成对;
(e)在待人源化的非人类抗体的氨基酸序列中,用步骤(d)中识别的一组重链和轻链表面暴露的氨基酸残基置换步骤(c)中定义的一组重链和轻链表面暴露的氨基酸残基;
(f)构建由步骤(e)指定的置换得到的非人类抗体的可变区的三维模型;
(g)通过比较步骤(a)和(f)中构建的三维模型,识别步骤(c)或(d)中识别的组的任何氨基酸残基,该氨基酸残基在待人源化非人类抗体的互补决定区的任何残基的任何原子的5埃内;以及
(h)将步骤(g)中识别的任何残基从人类氨基酸残基变化为原始的非人类氨基酸残基,从而定义表面暴露的氨基酸残基的非人类抗体人源化组;条件是不需要首先进行步骤(a),但是必须在步骤(g)之前进行步骤(a)。
超人源化
该方法比较非人类序列与功能性人类生殖种系基因谱系。选择编码与非人类序列相同或密切相关的典型(canonical)结构的那些人类基因。将在CDR内具有最高同源性的这些选定的基因选作FR供体。最终,非人类CDR被移植到这些人类FR上。专利WO 2005/079479A2中描述了该方法。
人类链含量优化
该方法比较非人类(例如,小鼠)序列与人类生殖系基因的谱系,并且将差异划分为在可能的MHC/T细胞表位程度上定量序列的人类链含量(HSC)。然后通过使靶序列的HSC最大化而不是使用全局同一性测量来将靶序列人源化以生成多种不同的人源化变体(在Molecular Immunology,44,(2007)1986-1998中描述)。
框架混合(shuffling)
将非人类抗体的CDR框内融合至包含所有已知的重链和轻链人类生殖系基因框架的cDNA池中。然后通过例如筛选噬菌体展示的抗体文库选择人源化抗体。这在Methods 36,43-60(2005)中被描述。
细胞结合剂的实例包括在WO 2007/085930中描述使用的那些试剂,通过引用将其结合于此。
用于本发明的实施方式的肿瘤相关抗原和同源抗体在以下列出。
肿瘤相关抗原和同源抗体
(1)BMPR1B(骨形态发生蛋白受体-IB型)
核苷酸
Genbank登录号NM_001203
Genbank版本号NM_001203.2 GI:169790809
Genbank记录更新日期:2012年9月23日,02:06PM
多肽
Genbank登录号NP_001194
Genbank版本号NP_001194.1 GI:4502431
Genbank记录更新日期:2012年9月23日,02:06PM
交叉引用
ten Dijke,P.,et al Science 264(5155):101-104(1994),Oncogene 14 10(11):1377-1382(1997));WO2004/063362(权利要求2);WO2003/042661(权利要求12);
US2003/134790-A1(第38-39页);WO2002/102235(权利要求13;第296页);WO2003/055443(第91-92页);WO2002/99122(实施例2;第528-530页);WO2003/029421(权利要求6);WO2003/024392(权利要求2;图112);WO2002/98358(权利要求1;第183页);WO2002/54940(第100-101页);WO2002/59377(第349-350页);WO2002/30268(权利要求27;第376页);15WO2001/48204(实施例;图4);NP_001194骨形态发生蛋白受体,1B型/pid=NP_001194.1.;MIM:603248;AY065994
(2)E16(LAT1,SLC7A5)
核苷酸
Genbank登录号NM_003486
Genbank版本号NM_003486.5 GI:71979931
Genbank记录更新日期:2012年6月27日,12:06PM
多肽
Genbank登录号NP_003477
Genbank版本号NP_003477.4 GI:71979932
Genbank记录更新日期:2012年6月27日,12:06PM
交叉引用
Biochem.Biophys.Res.
Commun.255(2),283-288(1999),Nature 395(6699):288-291(1998),Gaugitsch,H.W.,et20 al(1992)J.Biol.Chem.267(16):11267-11273);WO2004/048938(实施例2);WO2004/032842(实施例IV);WO2003/042661(权利要求12);WO2003/016475(权利要求1);WO2002/78524(实施例2);WO2002/99074(权利要求19;第127-129页);WO2002/86443(权利要求27;第222、393页);WO2003/003906(权利要求10;第293页);WO2002/64798(权利要求33;第93-95页);WO2000/14228(权利要求5;第133-136页);US2003/224454(图3);25WO2003/025138(权利要求12;第150页);NP_003477溶质载体家族7(阳离子氨基酸运载体,y+系统),成员5/pid=NP_003477.3-智人;MIM:600182;NM_015923。
(3)STEAP1(前列腺的六种跨膜上皮抗原)
核苷酸
Genbank登录号NM_012449
Genbank版本号NM_012449.2 GI:22027487
Genbank记录更新日期:2012年9月9日,02:57PM
多肽
Genbank登录号NP_036581
Genbank版本号NP_036581.1 GI:9558759
Genbank记录更新日期:2012年9月9日,02:57PM
交叉引用
Cancer Res.61(15),5857-5860(2001),Hubert,R.S.,et al(1999)Proc.Natl.Acad.Sci.U.S.A.96(25):14523-14528);WO2004/065577(权利要求6);WO2004/027049(图1L);EP1394274(实施例11);WO2004/016225(权利要求2);WO2003/042661(权利要求12);US2003/157089(实施例5);US2003/185830(实施例5);US2003/064397(图2);WO2002/89747(实施例5;第618-619页);WO2003/022995(实施例9;图13A、35;实施例53;第173页,实施例2;图2A);前列腺的六种跨膜上皮抗原;MIM:604415。
(4)0772P(CA125,MUC16)
核苷酸
Genbank登录号AF361486
Genbank版本号AF361486.3 GI:34501466
Genbank记录更新日期:2010年3月11日,07:56AM
多肽
Genbank登录号AAK74120
Genbank版本号AAK74120.3 GI:34501467
Genbank记录更新日期:2010年3月11日,07:56AM
交叉引用
J.Biol.Chem.276(29):27371-27375(2001));WO2004/045553(权利要求14);WO2002/92836(权利要求6;图12);WO2002/83866(权利要求15;第116-121页);US2003/124140(实施例16);GI:34501467;
(5)MPF(MPF,MSLN,SMR,巨核细胞强化因子,间皮素)
核苷酸
Genbank登录号NM_005823
Genbank版本号NM_005823.5 GI:293651528
Genbank记录更新日期:2012年9月2日,01:47PM
多肽
Genbank登录号NP_005814
Genbank版本号NP_005814.2 GI:53988378
Genbank记录更新日期:2012年9月2日,01:47PM
交叉引用
Yamaguchi,N.,et al Biol.Chem.269(2),805-808(1994),Proc.Natl.Acad.Sci.U.S.A.96(20):11531-11536(1999),Proc.Natl.Acad.Sci.U.S.A.9310(1):136-140(1996),J.Biol.Chem.270(37):21984-21990(1995));WO2003/101283(权利要求14);WO2002/102235(权利要求13;第287-288页);WO2002/101075(权利要求4;第308-309页);WO2002/71928(第320-321页);WO94/10312(第52-57页);IM:601051。
(6)Napi3b(NAPI-3B,NPTIIb,SLC34A2,溶质载体家族34(磷酸钠),成员2,II型钠依赖型磷酸盐运载体3b
核苷酸
Genbank登录号NM_006424
Genbank版本号NM_006424.2 GI:110611905
Genbank记录更新日期:2012年7月22日,03:39PM
多肽
Genbank登录号NP_006415
Genbank版本号NP_006415.2 GI:110611906
Genbank记录更新日期:2012年7月22日,03:39PM
交叉引用
J.Biol.Chem.277(22):19665-19672(2002),Genomics 62(2):281-284(1999),Feild,J.A.,et al(1999)Biochem.Biophys.Res.Commun.258(3):578-582);WO2004/022778(权利要求2);EP1394274(实施例11);WO2002/102235(权利要求13;第20 326页);EP0875569(权利要求1;第17-19页);WO2001/57188(权利要求20;第329页);WO2004/032842(实施例IV);WO2001/75177(权利要求24;第139-140页);MIM:604217。
(7)Sema 5b(FLJ10372,KIAA1445,Mm.42015,SEMA5B,SEMAG,脑信号蛋白5b Hlog,25sema结构域,七种血小板反应蛋白重复(1型和类1型),跨膜结构域(TM)和短细胞质结构域,(脑信号蛋白)5B)
核苷酸
Genbank登录号AB040878
Genbank版本号AB040878.1 GI:7959148
Genbank记录更新日期:2006年8月2日,05:40PM
多肽
Genbank登录号BAA95969
Genbank版本号BAA95969.1 GI:7959149
Genbank记录更新日期:2006年8月2日,05:40PM
交叉引用
Nagase T.,et al(2000)DNA Res.7(2):143-150);WO2004/000997(权利要求1);WO2003/003984(权利要求1);WO2002/06339(权利要求1;第50页);WO2001/88133(权利要求1;第41-43、48-58页);WO2003/054152(权利要求20);WO2003/101400(权利要求11);登录:30Q9P283;Genew;HGNC:10737
(8)PSCA hlg(2700050C12Rik,C530008O16Rik,RIKEN cDNA2700050C12,RIKENcDNA
2700050C12基因)
核苷酸
Genbank登录号AY358628
Genbank版本号AY358628.1 GI:37182377
Genbank记录更新日期:2009年12月1日,04:15AM
多肽
Genbank登录号AAQ88991
Genbank版本号AAQ88991.1 GI:37182378
Genbank记录更新日期:2009年12月1日,04:15AM
交叉引用
Ross et al(2002)Cancer Res.62:2546-2553;US2003/129192(权利要求2);US2004/044180(权利要求12);US2004/044179 35(权利要求11);US2003/096961(权利要求11);US2003/232056(实施例5);WO2003/10575816(权利要求12);US2003/206918(实施例5);EP1347046(权利要求1);WO2003/025148(权利要求20);GI:37182378。
(9)ETBR(内皮素B型受体)
核苷酸
Genbank登录号AY275463
Genbank版本号AY275463.1 GI:30526094
Genbank记录更新日期:2010年3月11日,02:26AM
多肽
Genbank登录号AAP32295
Genbank版本号AAP32295.1 GI:30526095
Genbank记录更新日期:2010年3月11日,02:26AM
交叉引用
Nakamuta M.,et al Biochem.Biophys.Res.Commun.177,34-39,1991;Ogawa Y.,et al Biochem.Biophys.Res.Commun.178,248-255,1991;Arai H.,et alJpn.Circ.J.56,1303-1307,1992;Arai H.,et al J.Biol.Chem.268,3463-3470,1993;Sakamoto A.,Yanagisawa M.,et al Biochem.Biophys.Res.Commun.178,656-663,1991;Elshourbagy N.A.,et al J.Biol.Chem.268,3873-3879,1993;Haendler B.,et alJ.Cardiovasc.Pharmacol.20,s1-S4,1992;Tsutsumi M.,et al Gene 228,43-49,1999;Strausberg R.L.,et al Proc.Natl.Acad.Sci.U.S.A.99,16899-16903,2002;BourgeoisC.,et al J.Clin.Endocrinol.Metab.82,3116-3123,1997;
Okamoto Y.,et al Biol.Chem.272,21589-21596,1997;Verheij J.B.,et alAm.J.Med.
Genet.108,223-225,2002;Hofstra R.M.W.,et al Eur.J.Hum.Genet.5,180-185,1997;
Puffenberger E.G.,et al Cell 79,1257-1266,1994;Attie T.,et al,Hum.Mol.Genet.4,2407-
15 2409,1995;Auricchio A.,et al Hum.Mol.Genet.5:351-354,1996;AmielJ.,et al Hum.Mol.
Genet.5,355-357,1996;Hofstra R.M.W.,et al Nat.Genet.12,445-447,1996;Svensson
P.J.,et al Hum.Genet.103,145-148,1998;Fuchs S.,et al Mol.Med.7,115-124,2001;
Pingault V.,et al(2002)Hum.Genet.111,198-206;WO2004/045516(权利要求1);WO2004/048938(实施例2);WO2004/040000(权利要求151);WO2003/087768(权利要求1);20WO2003/016475(权利要求1);WO2003/016475(权利要求1);WO2002/61087(图1);WO2003/016494(图6);WO2003/025138(权利要求12;第144页);WO2001/98351(权利要求1;
第124-125页);EP0522868(权利要求8;第2页);WO2001/77172(权利要求1;第297-299页);
US2003/109676;US6518404(图3);US5773223(权利要求1a;Col 31-34);WO2004/001004。
(10)MSG783(RNF124,假定蛋白质FLJ20315)
核苷酸
Genbank登录号NM_017763
Genbank版本号NM_017763.4 GI:167830482
Genbank记录更新日期:2012年7月22日,12:34AM
多肽
Genbank登录号NP_060233
Genbank版本号NP_060233.3 GI:56711322
Genbank记录更新日期:2012年7月22日,12:34AM
交叉引用
WO2003/104275(权利要求1);WO2004/046342(实施例2);WO2003/042661(权利要求12);WO2003/083074(权利要求14;第61页);WO2003/018621(权利要求1);WO2003/024392(权利要求2;图93);WO2001/66689(实施例6);LocusID:54894。
(11)STEAP2(HGNC_8639,IPCA-1,PCANAP1,STAMP1,STEAP2,STMP,前列腺癌
相关基因1,前列腺癌相关蛋白质1,前列腺2的六种跨膜上皮抗原,六种跨膜前列腺蛋白质)
核苷酸
Genbank登录号AF455138
Genbank版本号AF455138.1 GI:22655487
Genbank记录更新日期:2010年3月11日,01:54AM
多肽
Genbank登录号AAN04080
Genbank版本号AAN04080.1 GI:22655488
Genbank记录更新日期:2010年3月11日,1:54AM
交叉引用
Lab.Invest.82(11):1573-1582(2002));WO2003/087306;US2003/064397(权利要求1;图1);WO2002/72596(权利要求13;第54-55页);WO2001/72962(权利要求1;图4B);35WO2003/104270(权利要求11);WO2003/104270(权利要求16);US2004/005598(权利要求22);WO2003/042661(权利要求12);US2003/060612(权利要求12;图10);WO2002/26822(权利要求23;图2);WO2002/16429(权利要求12;图10)GI:22655488。
(12)TrpM4(BR22450,FLJ20041,TRPM4,TRPM4B,瞬时受体电位阳离子通道5,亚科M,成员4)
核苷酸
Genbank登录号NM_017636
Genbank版本号NM_017636.3 GI:304766649
Genbank记录更新日期:2012年6月29日,11:27AM
多肽
Genbank登录号NP_060106
Genbank版本号NP_060106.2 GI:21314671
Genbank记录更新日期:2012年6月29日,11:27AM
交叉引用
Xu,X.Z.,et al Proc.Natl.Acad.Sci.U.S.A.98(19):10692-10697(2001),Cell109(3):397-407(2002),J.Biol.Chem.278(33):30813-30820(2003));US2003/143557(权利要求4);WO2000/40614(权利要求14;第100-103页);WO2002/10382(权利要求1;图9A);WO2003/042661(权利要求12);WO2002/30268(权利要求27;第391页);US2003/219806(权利要求4);WO2001/62794(权利要求10 14;图1A-D);MIM:606936。
(13)CRIPTO(CR,CR1,CRGF,CRIPTO,TDGF1,畸胎瘤衍化生长因子)
核苷酸
Genbank登录号NM_003212
Genbank版本号NM_003212.3 GI:292494881
Genbank记录更新日期:2012年9月23日,02:27PM
多肽
Genbank登录号NP_003203
Genbank版本号NP_003203.1 GI:4507425
Genbank记录更新日期:2012年9月23日,02:27PM
交叉引用
Ciccodicola,A.,et al EMBO J.8(7):1987-1991(1989),Am.J.Hum.Genet.49(3):555-565(1991));US2003/224411(权利要求1);WO2003/083041(实施例1);WO2003/034984(权利要求12);WO2002/88170(权利要求2;第52-53页);WO2003/024392(权利要求2;图58);WO2002/16413(权利要求1;第94-95、105页);WO2002/22808(权利要求2;图1);US5854399(实施例2;Col 17-18);US5792616(图2);MIM:187395。
(14)CD21(CR2(补体受体2)或C3DR(C3d/EB病毒受体)或Hs.73792)
核苷酸
Genbank登录号M26004
Genbank版本号M26004.1 GI:181939
Genbank记录更新日期:2010年6月23日,08:47AM
多肽
Genbank登录号AAA35786
Genbank版本号AAA35786.1 GI:181940
Genbank记录更新日期:2010年6月23日,08:47AM
交叉引用
Fujisaku et al(1989)J.Biol.Chem.264(4):2118-2125);Weis J.J.,et alJ.Exp.Med.167,1047-1066,1988;Moore M.,et al Proc.Natl.Acad.Sci.U.S.A.84,9194-9198,1987;Barel M.,et al Mol.Immunol.35,1025-1031,1998;Weis J.J.,et alProc.Natl.Acad.Sci.U.S.A.83,5639-5643,1986;Sinha S.K.,et al(1993)J.Immunol.150,5311-5320;WO2004/045520(实施例4);US2004/005538(实施例1);WO2003/062401(权利要求9);WO2004/045520(实施例4);WO91/02536(图9.1-9.9);WO2004/020595(权利要求1);登录:P20023;Q13866;Q14212;EMBL;M26004;AAA35786.1。
(15)CD79b(CD79B,CD79β,IGb(免疫球蛋白相关β),B29)
核苷酸
Genbank登录号NM_000626
Genbank版本号NM_000626.2 GI:90193589
Genbank记录更新日期:2012年6月26日,01:53PM
多肽
Genbank登录号NP_000617
Genbank版本号NP_000617.1 GI:11038674
Genbank记录更新日期:2012年6月26日,01:53PM
交叉引用
Proc.Natl.Acad.Sci.U.S.A.(2003)100(7):4126-4131,Blood(2002)100(9):3068-3076,Muller et al(1992)Eur.J.Immunol.22(6):1621-1625);WO2004/016225(权利要求2,图140);WO2003/087768、US2004/101874(权利要求1,第102页);WO2003/062401(权利要求9);WO2002/78524(实施例2);US2002/150573(权利要求35 5,第15页);US5644033;WO2003/048202(权利要求1,第306和309页);WO 99/58658、US6534482(权利要求13,图17A/B);WO2000/55351(权利要求11,第1145-1146页);MIM:147245
(16)FcRH2(IFGP4,IRTA4,SPAP1A(包含磷酸酶锚蛋白质5 1a的SH2),SPAP1B,SPAP1C)
核苷酸
Genbank登录号NM_030764
Genbank版本号NM_030764.3 GI:227430280
Genbank记录更新日期:2012年6月30日,12:30AM
多肽
Genbank登录号NP_110391
Genbank版本号NP_110391.2 GI:19923629
Genbank记录更新日期:2012年6月30日,12:30AM
交叉引用
AY358130);Genome Res.13(10):2265-2270(2003),Immunogenetics 54(2):87-95(2002),Blood 99(8):2662-2669(2002),Proc.Natl.Acad.Sci.U.S.A.98(17):9772-9777(2001),Xu,M.J.,et al(2001)Biochem.Biophys.Res.Commun.280(3):768-775;WO2004/016225(权利要求2);WO2003/077836;WO2001/38490(权利要求5;图18D-1-18D-2);WO2003/097803(权利要求12);
10 WO2003/089624(权利要求25);:MIM:606509。
(17)HER2(ErbB2)
核苷酸
Genbank登录号M11730
Genbank版本号M11730.1 GI:183986
Genbank记录更新日期:2010年6月23日,08:47AM
多肽
Genbank登录号AAA75493
Genbank版本号AAA75493.1 GI:306840
Genbank记录更新日期:2010年6月23日,08:47AM
交叉引用
Coussens L.,et al Science(1985)230(4730):1132-1139);Yamamoto T.,et alNature 319,230-234,1986;Semba K.,et al Proc.Natl.Acad.Sci.U.S.A.82,6497-6501,1985;Swiercz J.M.,et al J.Cell Biol.165,869-15 880,2004;Kuhns J.J.,et alJ.Biol.Chem.274,36422-36427,1999;Cho H.-S.,et al Nature 421,756-760,2003;Ehsani A.,et al(1993)Genomics 15,426-429;WO2004/048938(实施例2);WO2004/027049(图1I);WO2004/009622;WO2003/081210;
WO2003/089904(权利要求9);WO2003/016475(权利要求1);US2003/118592;WO2003/008537(权利要求1);WO2003/055439(权利要求29);图1A-B);WO2003/025228(权利要求37;图5C);20WO2002/22636(实施例13;第95-107页);WO2002/12341(权利要求68;图7);WO2002/13847(第71-74页);WO2002/14503(第114-117页);WO2001/53463(权利要求2;第41-46页);WO2001/41787(第15页);WO2000/44899(权利要求52;图7);
WO2000/20579
(权利要求3;图2);US5869445(权利要求3;Col 31-38);WO9630514
(权利要求2;第56-61页);EP1439393(权利要求7);WO2004/043361
(权利要求7);WO2004/022709;WO2001/00244
25(实施例3;图4);登录:P04626;EMBL;M11767;AAA35808.1.EMBL;M11761;AAA35808.1
抗体
Abbott:US20110177095
例如,包含CDR的抗体,该CDR整体至少80%的序列与具有SEQ ID NO:3(CDR-H1)、SEQ ID NO:4(CDR-H2)、SEQ ID NO:5(CDR-H3)、SEQ ID NO:104和/或SEQ ID NO:6(CDR-L1)、SEQ ID NO:7(CDR-L2)以及SEQ ID NO:8(CDR-L3)的氨基酸序列的CDR相同,其中,抗-HER2抗体或抗HER2结合片段与具有SEQ ID NO:1的VH和SEQ ID NO:2的VL的抗体相比降低的免疫原性。
Biogen:US20100119511
例如,ATCC登录号:PTA-10355、PTA-10356、PTA-10357、PTA-10358
例如,纯化的抗体分子,其结合至包含来自抗体的所有六种CDR或包含与所述CDR相同或具有所述CDR的不多于两种改变的CDR的HER2,所述所有六种CDR选自由BIIB71F10(SEQ ID NO:11,13)、BIIB69A09(SEQ ID NO:15,17);BIIB67F10(SEQ ID NO:19,21);BIIB67F11(SEQ ID NO:23,25)、BIIB66A12(SEQ ID NO:27,29)、BIIB66C01(SEQ ID NO:31,33)、BIIB65C10(SEQ ID NO:35,37)、BIIB65H09(SEQ ID NO:39,41)和BIIB65B03(SEQ IDNO:43,45)组成的组。
赫赛汀(Genentech)-US6,054,297;ATCC登录号CRL-10463(Genentech)帕妥珠单抗(Genentech)
US20110117097
例如,参见SEQ ID No.15&16、SEQ ID No.17&18、SEQ ID No.23&24&ATCC登录号HB-12215、HB-12216、CRL 10463、HB-12697。
US20090285837
US20090202546
例如,ATCC登录号:HB-12215、HB-12216、CRL 10463、HB-12698。
US20060088523
-例如,ATCC登录号:HB-12215、HB-12216
-例如,分别包含SEQ ID No.3和4中的可变轻氨基酸序列和可变重氨基酸序列的抗体。
-例如,包含选自SEQ ID No.15和23的轻链氨基酸序列以及选自SEQ ID No.16和24的重链氨基酸序列的抗体
US20060018899
-例如,ATCC登录号:(7C2)HB-12215、(7F3)HB-12216、(4D5)CRL-10463、(2C4)HB-12697。
-例如,包含SEQ ID No.23中的氨基酸序列或其去酰胺基和/或氧化变体的抗体。
US2011/0159014
-例如,具有包含SEQ ID NO:1”的高变异区的轻链可变结构域的抗体。
-例如,具有包含SEQ ID NO:2的高变异区的重链可变结构域的抗体。
US20090187007
Glycotope:TrasGEX抗体http://www.glycotope.com/pipeline
例如,参见国际关节癌研究院(International Joint Cancer Institute)和长海医院癌症中心(Changhai Hospital Cancer Cent):HMTI-Fc Ab-Gao J.,et al BMBRep.2009年10月31日;42(10):636-41。
Symphogen:US20110217305
中国联合干细胞和基因工程(Union Stem Cell&Gene Engineering,China)-LiuHQ.,et al Xi Bao Yu Fen Zi Mian Yi Xue Za Zhi.2010May;26(5):456-8。
(18)NCA(CEACAM6)
核苷酸
Genbank登录号M18728
Genbank版本号M18728.1 GI:189084
Genbank记录更新日期:2010年6月23日,08:48AM
多肽
Genbank登录号AAA59907
Genbank版本号AAA59907.1 GI:189085
Genbank记录更新日期:2010年6月23日,08:48AM
交叉引用
Barnett T.,et al Genomics 3,59-66,1988;Tawaragi Y.,et alBiochem.Biophys.Res.Commun.150,89-96,1988;Strausberg R.L.,et alProc.Natl.Acad.Sci.U.S.A.99:16899-16903,2002;WO2004/063709;EP1439393(权利要求7);WO2004/044178(实施例4);WO2004/031238;WO2003/042661(权利要求12);WO2002/78524(实施例2);WO2002/86443(权利要求27;第427页);WO2002/60317(权利要求2);登录:P40199;Q14920;EMBL;M29541;AAA59915.1。
EMBL;M18728.
(19)MDP(DPEP1)
核苷酸
Genbank登录号BC017023
Genbank版本号BC017023.1 GI:16877538
Genbank记录更新日期:2012年3月6日,01:00PM
多肽
Genbank登录号AAH17023
Genbank版本号AAH17023.1 GI:16877539
Genbank记录更新日期:2012年3月6日,01:00PM
交叉引用
Proc.Natl.Acad.Sci.U.S.A.99(26):16899-16903(2002));WO2003/016475(权利要求1);WO2002/64798(权利要求33;第85-87页);JP05003790(图6-8);WO99/46284(图9);MIM:179780。
(20)IL20R-α(IL20Ra,ZCYTOR7)
核苷酸
Genbank登录号AF184971
Genbank版本号AF184971.1 GI:6013324
Genbank记录更新日期:2010年3月10日,10:00PM
多肽
Genbank登录号AAF01320
Genbank版本号AAF01320.1 GI:6013325
Genbank记录更新日期:2010年3月10日,10:00PM
交叉引用
Clark H.F.,et al Genome Res.13,2265-2270,2003;Mungall A.J.,et alNature 425,805-811,2003;Blumberg H.,et al Cell 104,9-19,2001;Dumoutier L.,etal J.Immunol.167,3545-3549,2001;Parrish-Novak J.,et al J.Biol.Chem.277,47517-47523,2002;Pletnev S.,et al(2003)10Biochemistry 42:12617-12624;Sheikh F.,etal(2004)J.Immunol.172,2006-2010;EP1394274(实施例11);US2004/005320(实施例5);WO2003/029262(第74-75页);WO2003/002717(权利要求2;第63页);WO2002/22153(第45-47页);US2002/042366(第20-21页);WO2001/46261(第57-59页);WO2001/46232(第63-65页);WO98/37193(权利要求1;第55-59页);登录:Q9UHF4;Q6UWA9;Q96SH8;EMBL;AF184971;AAF01320.1。
(21)短蛋白聚糖(BCAN,BEHAB)
核苷酸
Genbank登录号AF229053
Genbank版本号AF229053.1 GI:10798902
Genbank记录更新日期:2010年3月11日,12:58AM
多肽
Genbank登录号AAG23135
Genbank版本号AAG23135.1 GI:10798903
Genbank记录更新日期:2010年3月11日,12:58AM
交叉引用
Gary S.C.,et al Gene 256,139-147,2000;Clark H.F.,et al Genome Res.13,2265-2270,2003;Strausberg R.L.,et al Proc.Natl.Acad.Sci.U.S.A.99,16899-16903,2002;US2003/186372(权利要求11);US2003/186373(权利要求11);US2003/119131(权利要求1;图52);US2003/119122(权利要求1;20图52);US2003/119126(权利要求1);US2003/119121(权利要求1;图52);US2003/119129(权利要求1);US2003/119130(权利要求1);US2003/119128(权利要求1;图52);US2003/119125(权利要求1);WO2003/016475(权利要求1);WO2002/02634(权利要求1)
(22)EphB2R(DRT,ERK,Hek5,EPHT3,Tyro5)
核苷酸
Genbank登录号NM_004442
Genbank版本号NM_004442.6 GI:111118979
Genbank记录更新日期:2012年9月8日,04:43PM
多肽
Genbank登录号NP_004433
Genbank版本号NP_004433.2 GI:21396504
Genbank记录更新日期:2012年9月8日,04:43PM
交叉引用
Chan,J.and Watt,V.M.,Oncogene 6(6),1057-1061(1991)Oncogene 10(5):897-905(1995),Annu.Rev.Neurosci.21:309-345(1998),Int.Rev.Cytol.196:177-244(2000));WO2003042661(权利要求12);WO200053216(权利要求1;第41页);WO2004065576(权利要求1);WO2004020583(权利要求9);WO2003004529(第128-132页);WO200053216(权利要求1;第42页);MIM:600997。
(23)ASLG659(B7h)
核苷酸
Genbank登录号AX092328
Genbank版本号AX092328.1 GI:13444478
Genbank记录更新日期:2011年1月26日,07:37AM
交叉引用
US2004/0101899(权利要求2);WO2003104399(权利要求11);WO2004000221(图3);US2003/165504(权利要求1);US2003/124140(实施例2);US2003/065143(图60);WO2002/102235(权利要求13;第299页)US2003/091580(实施例2);WO2002/10187(权利要求6;图10);WO2001/94641(权利要求12;图7b);WO2002/02624(权利要求13;图1A-1B);US2002/034749(权利要求54;第45-46页);WO2002/06317(实施例2;第320-321页,权利要求34;第321-322页);WO2002/71928(第468-469页);WO2002/02587(实施例1;图1);WO2001/40269(实施例3;第190-192页);WO2000/36107(实施例2;第205-207页);WO2004/053079(权利要求12);WO2003/004989(权利要求1);WO2002/71928(第233-234,452-453页);WO 01/16318。
(24)PSCA(前列腺干细胞抗原前体)
核苷酸
Genbank登录号AJ297436
Genbank版本号AJ297436.1 GI:9367211
Genbank记录更新日期:2011年2月1日,11:25AM
多肽
Genbank登录号CAB97347
Genbank版本号CAB97347.1 GI:9367212
Genbank记录更新日期:2011年2月1日,11:25AM
交叉引用
Reiter R.E.,et al Proc.Natl.Acad.Sci.U.S.A.95,1735-1740,1998;Gu Z.,etal Oncogene 19,
1288-1296,2000;Biochem.Biophys.Res.Commun.(2000)275(3):783-788;WO2004/022709;EP1394274(实施例11);US2004/018553(权利要求17);WO2003/008537(权利要求1);WO2002/81646(权利要求1;第164页);WO2003/003906(权利要求10;第288页);WO2001/40309(实施例1;图17);US2001/055751(实施例1;图1b);WO2000/32752(权利要求18;图1);WO98/51805(权利要求17;第97页);WO98/51824(权利要求10;第94页);WO98/40403(权利要求2;图1B);登录:O43653;EMBL;AF043498;AAC39607.1。
(25)GEDA
核苷酸
Genbank登录号AY260763
Genbank版本号AY260763.1 GI:30102448
Genbank记录更新日期:2010年3月11日,02:24AM
多肽
Genbank登录号AAP14954
Genbank版本号AAP14954.1 GI:30102449
Genbank记录更新日期:2010年3月11日,02:24AM
交叉引用
AP14954脂肪瘤HMGIC融合-伴侣样蛋白质/pid=AAP14954.1-智人(人类)(AP14954 lipoma HMGIC fusion-partnerlike protein/pid=AAP14954.1-Homo sapiens(human));WO2003/054152(权利要求20);WO2003/000842(权利要求1);WO2003/023013(实施例3,权利要求20);US2003/194704(权利要求45);GI:30102449;
(26)BAFF-R(B细胞激活因子受体,BLyS受体3,BR3)
核苷酸
Genbank登录号AF116456
Genbank版本号AF116456.1 GI:4585274
Genbank记录更新日期:2010年3月10日,09:44PM
多肽
Genbank登录号AAD25356
Genbank版本号AAD25356.1 GI:4585275
Genbank记录更新日期:2010年3月10日,09:44PM
交叉引用
BAFF受体/pid=NP_443177.1-智人(BAFF receptor/pid=NP_443177.1-Homosapiens):Thompson,J.S.,et al Science 293(5537),2108-2111(2001);WO2004/058309;WO2004/011611;WO2003/045422(实施例;第32-33页);WO2003/014294(权利要求35;图6B);WO2003/035846(权利要求70;第615-616页);WO2002/94852(Col 136-137);WO2002/3876625(权利要求3;第133页);WO2002/24909(实施例3;图3);MIM:606269;NP_443177.1;NM_052945_1;AF132600
(27)CD22(B细胞受体CD22-B同种型,BL-CAM,Lyb-8,Lyb8,SIGLEC-2,FLJ22814)
核苷酸
Genbank登录号AK026467
Genbank版本号AK026467.1 GI:10439337
Genbank记录更新日期:2006年9月11日,11:24PM
多肽
Genbank登录号BAB15489
Genbank版本号BAB15489.1 GI:10439338
Genbank记录更新日期:2006年9月11日,11:24PM
交叉引用
Wilson et al(1991)J.Exp.Med.173:137-146;30WO2003/072036(权利要求1;图1);IM:107266;NP_001762.1;NM_001771_1。
(27a)CD22(CD22分子)
核苷酸
Genbank登录号X52785
Genbank版本号X52785.1 GI:29778
Genbank记录更新日期:2011年2月2日,10:09AM
多肽
Genbank登录号CAA36988
Genbank版本号CAA36988.1 GI:29779
Genbank记录更新日期:2011年2月2日,10:09AM
交叉引用
Stamenkovic I.et al.,Nature 345(6270),74-77(1990)??
其他信息
官方符号(官方符号,Official Symbol):CD22
其他别名:SIGLEC-2,SIGLEC2
其他命名:B细胞受体CD22;B淋巴细胞粘附分子;BL-CAM;CD22抗原;T细胞表面抗原Leu-14;唾液酸结合Ig样凝集素2(sialic acid binding Ig-like lectin 2);唾液酸结合Ig样凝集素2
抗体
G5/44(伊珠单抗(Inotuzumab)):DiJoseph JF.,et al Cancer ImmunolImmunother.2005年1月;54(1):11-24.
依帕珠单抗(Epratuzumab)-Goldenberg DM.,et al Expert Rev AnticancerTher.6(10):1341-53,2006.
(28)CD79a(CD79A,CD79α),免疫球蛋白相关α,与Igβ(CD79B)共价相互作用并在表面上与Ig M 35分子形成络合物的B细胞特异性蛋白质,转导参与B细胞分化的信号),pI:4.84,MW:25028 TM:2
[P]基因染色体:19q13.2).
核苷酸
Genbank登录号NM_001783
Genbank版本号NM_001783.3 GI:90193587
Genbank记录更新日期:2012年6月26日01:48PM
多肽
Genbank登录号NP_001774
Genbank版本号.NP_001774.1 GI:4502685
Genbank记录更新日期:2012年6月26日01:48PM
交叉引用
WO2003/088808、US2003/0228319;WO2003/062401(权利要求9);US2002/150573(权利要求4,第13-14页);WO99/58658(权利要求13,图16);WO92/07574(图1);US5644033;Ha et al(1992)J.Immunol.148(5):1526-1531;Müller et al(1992)Eur.J.Immunol..22:1621-1625;Hashimoto et al(1994)Immunogenetics 40(4):287-295;Preud’homme et al(1992)Clin.Exp.
5 Immunol.90(1):141-146;Yu et al(1992)J.Immunol.148(2)633-637;Sakaguchi et al(1988)
EMBO J.7(11):3457-3464
(29)CXCR5(Burkitt's淋巴瘤受体1,由CXCL13激活的G蛋白偶联型受体,在淋巴细胞迁移和体液防御中起作用,在HIV-2感染中发挥10作用并可能发展AIDS、淋巴瘤、骨髓瘤和白血病);372 aa,pI:8.54 MW:41959 TM:7[P]基因染色体:11q23.3,
核苷酸
Genbank登录号NM_001716
Genbank版本号NM_001716.4 GI:342307092
Genbank记录更新日期:2012年9月30日,01:49PM
多肽
Genbank登录号NP_001707
Genbank版本号NP_001707.1 GI:4502415
Genbank记录更新日期:2012年9月30日,01:49PM
交叉引用
WO2004/040000;WO2004/015426;US2003/105292(实施例2);US6555339(实施例2);WO2002/61087(图1);WO2001/57188(权利要求20,第269页);WO2001/72830(第12-13页);WO2000/22129(实施例1,第152-153页,15实施例2,第254-256页)WO99/28468(权利要求1,第38页);US5440021(实施例2,col 49-52);WO94/28931(第56-58页);WO92/17497(权利要求7,图5);Dobner et al(1992)Eur.J.Immunol.22:2795-2799;Barella et al(1995)Biochem.J.309:773-779。
(30)HLA-DOB(结合肽并将它们20呈现至CD4+T淋巴细胞的MHC II类分子(Ia抗原)的子单元,273 aa,pI:56,MW:30820.TM:1[P]基因染色体:6p21.3)
核苷酸
Genbank登录号NM_002120
Genbank版本号NM_002120.3 GI:118402587
Genbank记录更新日期:2012年9月8日,04:46PM
多肽
Genbank登录号NP_002111
Genbank版本号NP_002111.1 GI:4504403
Genbank记录更新日期:2012年9月8日,04:46PM
交叉引用
Tonnelle et al(1985)EMBO J.4(11):2839-2847;Jonsson et al(1989)Immunogenetics 29(6):411-413;Beck et al(1992)J.Mol.Biol.228:433-441;Strausberg et al(2002)Proc.Natl.Acad.Sci USA 99:16899-16903;Servenius et al(1987)J.Biol.Chem.262:8759-8766;Beck et al(1996)J.Mol.Biol.25255:1-13;Naruseet al(2002)Tissue Antigens 59:512-519;WO99/58658(权利要求13,图15);US6153408(Col 35-38);US5976551(col 168-170);US6011146(col 145-146);Kasahara et al(1989)Immunogenetics 30(1):66-68;Larhammar et al(1985)J.Biol.Chem.260(26):14111-14119。
(31)P2X5(嘌呤受体P2X配体门控离子通道5,由细胞外ATP门控的离子通道,可以涉及突触传递和神经发生,缺乏可能导致特发性逼尿肌不稳定的病理);422 aa),pI:7.63,MW:47206 TM:1[P]基因染色体:17p13.3)。
核苷酸
Genbank登录号NM_002561
Genbank版本号NM_002561.3 GI:325197202
Genbank记录更新日期:2012年6月27日,12:41AM
多肽
Genbank登录号NP_002552
Genbank版本号.NP_002552.2 GI:28416933
Genbank记录更新日期:2012年6月27日,12:41AM
交叉引用
Le et al(1997)FEBS Lett.418(1-2):195-199;WO2004/047749;WO2003/072035(权利要求10)Touchman et al(2000)Genome Res.10:165-173;WO2002/22660(权利要求20);WO2003/093444(权利要求1);WO2003/087768(权利要求1);WO2003/029277(第82页)。
(32)CD72(B细胞分化抗原CD72,Lyb-2);359 aa,pI:8.66,MW:40225,TM:1 5[P]基因染色体:9p13.3)。
核苷酸
Genbank登录号NM_001782
Genbank版本号NM_001782.2 GI:194018444
Genbank记录更新日期:2012年6月26日,01:43PM
多肽
Genbank登录号NP_001773
Genbank版本号NP_001773.1 GI:4502683
Genbank记录更新日期:2012年6月26日,01:43PM
交叉引用
WO2004042346(权利要求65);WO2003/026493(第51-52,57-58页);WO2000/75655(第105-106页);Von Hoegen et al(1990)J.Immunol.144(12):4870-4877;Strausberg etal(2002)Proc.Natl.Acad.Sci USA 99:16899-16903。
(33)LY64(淋巴细胞抗原64(RP105),I型富含亮氨酸的膜蛋白重复(LRR)家族,调节B细胞的活化和细胞凋亡,功能丧失与患者中系统性红斑狼疮的病情加重相关);661 aa,pI:6.20,MW:74147 TM:1[P]基因染色体:5q12)。
核苷酸
Genbank登录号NM_005582
Genbank版本号NM_005582.2 GI:167555126
Genbank记录更新日期:2012年9月2日,01:50PM
多肽
Genbank登录号NP_005573
Genbank版本号NP_005573.2 GI:167555127
Genbank记录更新日期:2012年9月2日,01:50PM
交叉引用
US2002/193567;WO97/07198(权利要求11,第39-42页);Miura et al(1996)15Genomics 38(3):299-304;Miura et al(1998)Blood 92:2815-2822;WO2003/083047;WO97/44452(权利要求8,第57-61页);WO2000/12130(第24-26页)。
(34)FcRH1(Fc受体样蛋白质1,用于包含C2型Ig样和ITAM结构域的免疫球蛋白Fc结构域的假定受体,可能在B淋巴细胞20分化中发挥作用);429 aa,pI:5.28,MW:46925 TM:1[P]基因染色体:1q21-1q22)
核苷酸
Genbank登录号NM_052938
Genbank版本号NM_052938.4 GI:226958543
Genbank记录更新日期:2012年9月2日,01:43PM
多肽
Genbank登录号NP_443170
Genbank版本号NP_443170.1 GI:16418419
Genbank记录更新日期:2012年9月2日,01:43PM
交叉引用
WO2003/077836;WO2001/38490(权利要求6,图18E-1-18-E-2);Davis et al(2001)Proc.Natl.Acad.Sci USA 98(17):9772-9777;WO2003/089624(权利要求8);EP1347046(权利要求1);WO2003/089624(权利要求7)。
(35)IRTA2(免疫球蛋白超家族受体易位相关2,在B细胞发育和淋巴瘤生成中可能起作用的免疫受体;由易位引起的基因放松管制发生在一些B细胞恶性肿瘤中);977 aa,pI:6.88,MW:106468,TM:1[P]基因染色体:1q21)
核苷酸
Genbank登录号AF343662
Genbank版本号AF343662.1 GI:13591709
Genbank记录更新日期:2010年3月11日,01:16AM
多肽
Genbank登录号AAK31325
Genbank版本号AAK31325.1 GI:13591710
Genbank记录更新日期:2010年3月11日,01:16AM
交叉引用
AF343663、AF343664、AF343665、AF369794、AF397453、AK090423、AK090475、AL834187、AY358085;小鼠:AK089756、AY158090、AY506558;NP_112571.1;WO2003/024392(权利要求2,图97);Nakayama et al(2000)Biochem.Biophys.Res.Commun.277(1):124-127;WO2003/077836;WO2001/38490(权利要求3,图18B-1-18B-2)。
(36)TENB2(TMEFF2,脑肿瘤抑癌蛋白(tomoregulin),TPEF,HPP1,TR,假定的跨膜35蛋白聚糖,涉及生长因子和卵泡抑素的EGF/调蛋白家族);374 aa)
核苷酸
Genbank登录号AF179274
Genbank版本号AF179274.2 GI:12280939
Genbank记录更新日期:2010年3月11日,01:05AM
多肽
Genbank登录号AAD55776
Genbank版本号AAD55776.2 GI:12280940
Genbank记录更新日期:2010年3月11日,01:05AM
交叉引用
NCBI登录:AAD55776、AAF91397、AAG49451、NCBI参考序列:NP_057276;NCBI Gene:23671;OMIM:605734;SwissProt Q9UIK5;AY358907、CAF85723、CQ782436;WO2004/074320;JP2004113151;WO2003/042661;WO2003/009814;EP1295944(第69-70页);WO2002/30268(第329页);WO2001/90304;US2004/249130;US2004/022727;WO2004/063355;US2004/197325;US2003/232350;5US2004/005563;US2003/124579;Horie et al(2000)Genomics 67:146-152;Uchida et al(1999)Biochem.Biophys.Res.Commun.266:593-602;Liang et al(2000)Cancer Res.60:4907-12;Glynne-Jones et al(2001)Int J Cancer.10月15日;94(2):178-84.
(37)PSMA–FOLH1(叶酸水解酶(前列腺特异性膜抗原)1)
核苷酸
Genbank登录号M99487
Genbank版本号M99487.1 GI:190663
Genbank记录更新日期:2010年6月23日,08:48AM
多肽
Genbank登录号AAA60209
Genbank版本号AAA60209.1 GI:190664
Genbank记录更新日期:2010年6月23日,08:48AM
交叉引用
Israeli R.S.,et al Cancer Res.53(2),227-230(1993)
其他信息
官方符号:FOLH1
其他别名:GIG27、FGCP、FOLH、GCP2、GCPII、NAALAD1、NAALAdase、PSM、PSMA、mGCP
其他命名:N-乙酰化α连接的酸性二肽酶1;N-乙酰化-α-连接的酸性二肽酶I;NAALADase I;细胞生长抑制基因27蛋白质;叶酰聚-γ-谷氨酸羧肽酶;谷氨酸羧化酶II;谷氨酸羧肽酶2;谷氨酸羧肽酶II;膜谷氨酸羧肽酶;前列腺特异性膜抗原变体F;蝶酰聚-γ-谷氨酸羧肽酶抗体
US 7,666,425:
由杂交瘤细胞(Hybridomas)生产的具有以下ATCC参考的抗体:ATCC登录号HB-12101、ATCC登录号HB-12109、ATCC登录号HB-12127和ATCC登录号HB-12126。
Proscan:选自由以下组成的组的单克隆抗体:8H12、3E11、17G1、29B4、30C1和20F2(US 7,811,564;Moffett S.,et al Hybridoma(Larchmt).2007Dec;26(6):363-72)。
Cytogen:单克隆抗体7E11-C5(ATCC登录号HB 10494)和9H10-A4(ATCC登录号HB11430)–US 5,763,202
GlycoMimetics:NUH2-ATCC登录号HB 9762(US 7,135,301)
人类基因组科学(Human Genome Science):HPRAJ70-ATCC登录号97131(US 6,824,993);作为美国模式培养物保藏所(American Type Culture Collection)("ATCC")保藏号97131保藏的由cDNA克隆(HPRAJ70)编码的氨基酸序列
Medarex:缺乏岩藻糖基残基的抗-PSMA抗体-US 7,875,278
小鼠抗PSMA抗体包括3F5.4G6、3D7.1.1、4E10-1.14、3E11、4D8、3E6、3C9、2C7、1G3、3C4、3C6、4D4、1G9、5C8B9、3G6、4C8B9和单克隆抗体。杂交瘤细胞分泌3F5.4G6、3D7.1.1、4E10-1.14、3E11、4D8、3E6、3C9、2C7、1G3、3C4、3C6、4D4、1G9、5C8B9、3G6或4C8B9已经公开保藏并描述于美国专利号6,159,508中。相关的杂交瘤细胞已经公开保藏并描述于美国专利号6,107,090中。此外,人源化的抗PSMA抗体,包括J591的人源化版本,进一步详细地描述于PCT公开WO 02/098897中。本领域中已经描述了其他的小鼠抗人类PSMA抗体,如mAb 107-1A4(Wang,S.et al.(2001)Int.J.Cancer 92:871-876)和mAb 2C9(Kato,K.et al.(2003)Int.J.Urol.10:439-444)。
人类抗PSMA单克隆抗体的实例包括如2005年2月18日提交的题目为“HumanMonoclonal Antibodies to Prostate Specific Membrane Antigen(PSMA)”的PCT公开WO01/09192和WO 03/064606以及美国临时申请序列号60/654,125最初描述的分离和结构表征的4A3、7F12、8C12、8A11、16F9、2A10、2C6、2F5和1C3抗体。4A3、7F12、8C12、8A11、16F9、2A10、2C6、2F5和1C3的V.sub.H氨基酸序列分别示出于SEQ ID NO:1-9中。4A3、7F12、8C12、8A11、16F9、2A10、2C6、2F5和1C3的V.sub.L氨基酸序列分别示出于SEQ ID NO:10-18中。
其他的人类抗PSMA抗体包括公开于PCT公开WO 03/034903和美国申请号2004/0033229中的抗体。
NW Biotherapeutics:选自由以下组成的组的杂交瘤细胞系:具有ATCC登录号HB12060的3F5.4G6、具有ATCC登录号HB12309的3D7-1.I.、具有ATCC登录号HB12310的4E10-1.14、3E11(ATCC HB12488)、4D8(ATCC HB12487)、3E6(ATCC HB12486)、3C9(ATCCHB12484)、2C7(ATCC HB12490)、1G3(ATCC HB12489)、3C4(ATCC HB12494)、3C6(ATCCHB12491)、4D4(ATCC HB12493)、1G9(ATCC HB12495)、5C8B9(ATCC HB12492)和3G6(ATCCHB12485)-参见US 6,150,508
PSMA Development Company/Progenics/Cytogen–Seattle Genetics:由保藏在ATCC登录号PTA-3258下的杂交瘤细胞产生的mAb 3.9或由保藏在ATCC登录号PTA-3347下的杂交瘤细胞产生的mAb 10.3-US 7,850,971PSMA Development Company–PSMA抗体的组成(US 20080286284,表1)该申请是2003年3月21日提交的美国专利申请序列号10/395,894的分案(US 7,850,971)。
University Hospital Freiburg,Germany-mAbs 3/A12、3/E7和3/F11(Wolf P.,et al Prostate.2010年4月1日;70(5):562-9)。
(38)SST(生长抑素受体;注意存在5种亚型)
(38.1)SSTR2(生长抑素受体2)
核苷酸
Genbank登录号NM_001050
Genbank版本号NM_001050.2 GI:44890054
Genbank记录更新日期:2012年8月19日,01:37PM
多肽
Genbank登录号NP_001041
Genbank版本号NP_001041.1 GI:4557859
Genbank记录更新日期:2012年8月19日,01:37PM
交叉引用
Yamada Y.,et al Proc.Natl.Acad.Sci.U.S.A.89(1),251-255(1992);SusiniC.,et al Ann Oncol.2006年12月;17(12):1733-42
其他信息
官方符号:SSTR2
其他命名:SRIF-1;SS2R;生长抑素受体2型
(38.2)SSTR5(生长抑素受体5)
核苷酸
Genbank登录号D16827
Genbank版本号D16827.1 GI:487683
Genbank记录更新日期:2006年8月1日,12:45PM
多肽
Genbank登录号BAA04107
Genbank版本号BAA04107.1 GI:487684
Genbank记录更新日期:2006年8月1日,12:45PM
交叉引用
Yamada,Y.,et al Biochem.Biophys.Res.Commun.195(2),844-852(1993)
其他信息
官方符号:SSTR5
其他别名:SS-5-R
其他命名:生长抑素受体亚型5;生长抑素受体5型
(38.3)SSTR1
(38.4)SSTR3
(38.5)SSTR4
AvB6-两种子单元(39+40)
(39)ITGAV(整合蛋白,αV;
核苷酸
Genbank登录号M14648 J02826 M18365
Genbank版本号M14648.1 GI:340306
Genbank记录更新日期:2010年6月23日,08:56AM
多肽
Genbank登录号AAA36808
Genbank版本号AAA36808.1 GI:340307
Genbank记录更新日期:2010年6月23日,08:56AM
交叉引用
Suzuki S.,et al Proc.Natl.Acad.Sci.U.S.A.83(22),8614-8618(1986)
其他信息
官方符号:ITGAV
其他别名:CD51、MSK8、VNRA、VTNR
其他命名:由单克隆抗体L230识别的抗原;整合蛋白αV;整合蛋白αVβ3;整合蛋白,αV(玻连蛋白受体,α多肽,抗原CD51);玻连蛋白受体α子单元
(40)ITGB6(整合蛋白,β6)
核苷酸
Genbank登录号NM_000888
Genbank版本号NM_000888.3 GI:9966771
Genbank记录更新日期:2012年6月27日,12:46AM
多肽
Genbank登录号NP_000879
Genbank版本号NP_000879.2 GI:9625002
Genbank记录更新日期:2012年6月27日,12:46AM
交叉引用
Sheppard D.J.,et al Biol.Chem.265(20),11502-11507(1990)
其他信息
官方符号:ITGB6
其他命名:整合蛋白β6
抗体
Biogen:US 7,943,742-由ATCC保藏的杂交瘤细胞克隆体6.3G9和6.8G6,登录号分别为ATCC PTA-3649和-3645。
Biogen:US7,465,449-在一些实施方式中,抗体包含与由杂交瘤细胞6.1A8、6.3G9、6.8G6、6.2B1、6.2B10、6.2A1、6.2E5、7.1G10、7.7G5或7.1C5产生的抗体相同的重链和轻链多肽序列。
Centocor(J&J):US7,550,142;US7,163,681
例如,在US 7,550,142中-具有包含示出于SEQ ID NO:7和SEQ ID NO:8中的氨基酸序列的人类重链和人类轻链可变区域的抗体。
西雅图基因(Seattle Genetics):15H3(Ryan MC.,et al Cancer Res 2012年4月15日;72(8增刊):4630)
(41)CEACAM5(癌胚抗原相关的细胞粘附分子5)
核苷酸
Genbank登录号M17303
Genbank版本号M17303.1 GI:178676
Genbank记录更新日期:2010年6月23日,08:47AM
多肽
Genbank登录号AAB59513
Genbank版本号.AAB59513.1 GI:178677
Genbank记录更新日期:2010年6月23日,08:47AM
交叉引用
Beauchemin N.,et al Mol.Cell.Biol.7(9),3221-3230(1987)
其他信息
官方符号:CEACAM5
其他别名:CD66e,CEA
其他命名:胎便抗原100
抗体
AstraZeneca-MedImmune:US 20100330103;US20080057063;US20020142359
-例如具有互补决定区(CDR)与以下序列的抗体:重链;CDR1-DNYMH、CDR2-WIDPENGDTE YAPKFRG、CDR3-LIYAGYLAMDY;和轻链CDR1-SASSSVTYMH、CDR2-STSNLAS、CDR3-QQRSTYPLT。
-保藏为欧洲细胞培养物保藏所(European Collection of Cell Cultures,ECACC)的杂交瘤细胞806.077,保藏号96022936。
Research Corporation Technologies,Inc.:US5,047,507
Bayer Corporation:US6,013,772
BioAlliance:US7,982,017;US7,674,605
●US 7,674,605
-包含来自SEQ ID NO:1的氨基酸序列的重链可变区序列和来自SEQ ID NO:2的氨基酸序列的轻链可变区序列的抗体。
-包含来自SEQ ID NO:5的氨基酸序列的重链可变区序列和来自SEQ ID NO:6的氨基酸序列的轻链可变区序列的抗体。
Celltech Therapeutics Limited:US5,877,293
The Dow Chemical Company:US5,472,693;US6,417,337;US6,333,405
US5,472,693–例如,ATCC号CRL-11215
US6,417,337–例如,ATCC CRL-12208
US6,333,405–例如,ATCC CRL-12208
Immunomedics,Inc:US7,534,431;US7,230,084;US7,300,644;US6,730,300;US20110189085
-具有轻链可变区的CDR的抗体包括:CDR1包括KASQDVGTSVA(SEQ ID NO:20);CDR2包括WTSTRHT(SEQ ID NO:21)以及CDR3包括QQYSLYRS(SEQ ID NO:22);
-以及所述抗CEA抗体的重链可变区的CDR包括:CDR1包括TYWMS(SEQ ID NO:23);CDR2包括EIHPDSSTINYAPSLKD(SEQ ID NO:24);以及CDR3包括LYFGFPWFAY(SEQ ID NO:25)。
US20100221175;US20090092598;US20070202044;US20110064653;US20090185974;US20080069775。
(42)MET(met原癌基因;肝细胞生长因子受体)
核苷酸
Genbank登录号M35073
Genbank版本号M35073.1 GI:187553
Genbank记录更新日期:2012年3月6日,11:12AM
多肽
Genbank登录号AAA59589
Genbank版本号AAA59589.1 GI:553531
Genbank记录更新日期:2012年3月6日,11:12AM
交叉引用
Dean M.,et al Nature 318(6044),385-388(1985)
其他信息
官方符号:MET
其他别名:AUTS9,HGFR,RCCP2,c-Met
其他命名:HGF受体;HGF/SF受体;SF受体;肝细胞生长因子受体;met原癌基因酪氨酸激酶;原癌基因c-Met;肝细胞生长因子;酪氨酸-蛋白质激酶Met
抗体
Abgenix/Pfizer:US20100040629
例如,由具有美国模式培养物保藏所(ATCC)登录号PTA-5026的杂交瘤细胞13.3.2产生的抗体;由具有ATCC登录号PTA-5027的杂交瘤细胞9.1.2产生的抗体;由具有ATCC登录号PTA-5028的杂交瘤细胞8.70.2产生的抗体;由具有ATCC登录号PTA-5029的杂交瘤细胞6.90.3产生的抗体。
Amgen/Pfizer:US20050054019
例如,包含具有SEQ ID NO:2中示出的氨基酸序列的重链和具有SEQ ID NO:4中示出的氨基酸序列的轻链而没有信号序列的抗体,其中,X2是谷氨酸以及X4是丝氨酸,X8是丙氨酸;包含具有SEQ ID NO:6中示出的氨基酸序列的重链和具有SEQ ID NO:8中示出的氨基酸序列的轻链而没有信号序列的抗体;包含具有SEQ ID NO:10中示出的氨基酸序列的重链和具有SEQ ID NO:12中示出的氨基酸序列的轻链而没有信号序列的抗体;包含具有SEQ IDNO:14中示出的氨基酸序列的重链和具有SEQ ID NO:16中示出的氨基酸序列的轻链而没有信号序列的抗体。
Agouron Pharmaceuticals(现名Pfizer):US20060035907
Eli Lilly:US20100129369
Genentech:US5,686,292;US20100028337;US20100016241;US20070129301;US20070098707;US20070092520、US20060270594;US20060134104;US20060035278;US20050233960;US20050037431
US 5,686,292–例如,ATCC HB-11894和ATCC HB-11895
US 20100016241–例如,ATCC HB-11894(杂交瘤细胞1A3.3.13)或HB-11895(杂交瘤细胞5D5.11.6)
台湾国防医学院(National Defense Medical Center,Taiwan):Lu RM.,et alBiomaterials.2011年4月;32(12):3265-74.
Novartis:US20090175860
-例如,包含重链4687的CDR1、CDR2和CDR3的序列和轻链5097的CDR1、CDR2和CDR3的序列的抗体,其中,重链4687的CDR1、CDR2和CDR3的序列分别是SEQ ID NO:58的残基26-35、50-65和98-102;其中,轻链5097的CDR1、CDR2和CDR3的序列是SEQ ID NO:37的残基24-39、55-61和94-100。
Pharmacia Corporation:US20040166544
Pierre Fabre:US20110239316,US20110097262,US20100115639
Sumsung:US 20110129481–例如,由具有登录号KCLRF-BP-00219或登录号KCLRF-BP-00223的杂交瘤细胞产生的单克隆抗体。
Samsung:US 20110104176–例如,由具有登录号KCLRF-BP-00220的杂交瘤细胞产生的抗体。
都灵大学医学院(University of Turin Medical School):DN-30Pacchiana G.,et al J Biol Chem.2010年11月12日;285(46):36149-57
安德尔研究所(Van Andel Research Institute):Jiao Y.,et alMolBiotechnol.2005年9月;31(1):41-54.
(43)MUC1(细胞表面相关的粘蛋白1)
核苷酸
Genbank登录号J05581
Genbank版本号J05581.1 GI:188869
Genbank记录更新日期:2010年6月23日,08:48AM
多肽
Genbank登录号AAA59876
Genbank版本号AAA59876.1 GI:188870
Genbank记录更新日期:2010年6月23日,08:48AM
交叉引用
Gendler S.J.,et al J.Biol.Chem.265(25),15286-15293(1990)
其他信息
官方符号:MUC1
其他别名:RP11-263K19.2,CD227,EMA,H23AG,KL-6,MAM6,MUC-1,MUC-1/SEC,MUC-1/X,MUC1/ZD,PEM,PEMT,PUM其他命名:DF3抗原;H23抗原;乳腺癌相关抗原DF3;癌相关粘蛋白;上皮膜抗原(episialin);肺泡细胞表面抗原6;粘蛋白1,跨膜;粘蛋白-1;花生反应性尿粘蛋白;多形上皮粘蛋白;肿瘤相关的上皮粘蛋白;肿瘤相关的上皮膜抗原;肿瘤相关的粘蛋白
抗体
AltaRex-Quest Pharma Tech:US 6,716,966–例如,由杂交瘤细胞ATCC号PTA-975产生的Alt-1抗体。
AltaRex-Quest Pharma Tech:US7,147,850
CRT:5E5-AL.,et al Glycobiology第16卷,第2期,第96–107页,2006;HMFG2–Burchell J.,et al Cancer Res.,47,5476–5482(1987)
Glycotope GT-MAB:GT-MAB 2.5-GEX(网址:http://www.glycotope.com/pipeline/pankomab-gex)
Immunogen:US7,202,346
-例如,抗体MJ-170:杂交瘤细胞系MJ-170,ATCC登录号PTA-5286;单克隆抗体MJ-171:杂交瘤细胞系MJ-171,ATCC登录号PTA-5287;单克隆抗体MJ-172:杂交瘤细胞系MJ-172,ATCC登录号PTA-5288;或单克隆抗体MJ-173:杂交瘤细胞系MJ-173,ATCC登录号PTA-5302
Immunomedics:US 6,653,104
Ramot Tel Aviv Uni:US7,897,351
Regents Uni.CA:US 7,183,388;US20040005647;US20030077676。
Roche GlycArt:US8,021,856
俄罗斯国家癌症研究中心(Russian National Cancer Research Center):Imuteran-Ivanov PK.,et al Biotechnol J.2007年7月;2(7):863-70Technische UnivBraunschweig:(IIB6,HT186-B7,HT186-D11,HT186-G2,HT200-3A-C1,HT220-M-D1,HT220-M-G8)-Thie H.,et al PLoS One.2011年1月14日;6(1):e15921
(44)CA9(羧酸酐酶IX)
核苷酸
Genbank登录号X66839
Genbank版本号X66839.1 GI:1000701
Genbank记录更新日期:2011年2月2日,10:15AM
多肽
Genbank登录号CAA47315
Genbank版本号CAA47315.1 GI:1000702
Genbank记录更新日期:2011年2月2日,10:15AM
交叉引用
Pastorek J.,et al Oncogene 9(10),2877-2888(1994)
其他信息
官方符号:CA9
其他别名:CAIX,MN
其他命名:CA-IX;P54/58N;RCC相关抗原G250;RCC相关蛋白质G250;碳酸盐脱水酶IX;碳酸酐酶9;碳酸水解酶;膜抗原MN;pMW1;肾细胞癌相关抗原G250
抗体
Abgenix/Amgen:US20040018198
Affibody:抗CAIX亲合体(Anti-CAIX Affibody)分子
(http://www.affibody.com/en/Product-Portfolio/Pipeline/)
Bayer:US7,462,696
Bayer/Morphosys:3ee9 mAb-Petrul HM.,et al Mol Cancer Ther.201年2月;11(2):340-9
哈弗医学院(Harvard Medical School):抗体G10,G36,G37,G39,G45,G57,G106,G119,G6,G27,G40和G125。Xu C.,et al PLoS One.2010年3月10日;5(3):e9625
病毒学研究院,斯洛伐克科学院(Institute of Virology,Slovak Academy ofSciences)(Bayer)-US5,955,075
-例如,M75-ATCC登录号HB 11128或MN12–ATCC登录号HB 11647
病毒学研究院,斯洛伐克科学院:US7,816,493
-例如,由杂交瘤细胞VU-M75分泌的M75单克隆抗体,该VU-M75在ATCC号HB 11128下保藏在美国模式培养物保藏所;或由杂交瘤细胞V/10-VU分泌的V/10单克隆抗体,在V/10-VU登录号LMBP 6009CB下保藏在Gent,Belgium的Universeit Gent的Laboratoriumvoor Moleculaire Bioloqie-Plasmidencollectie(LMBP)的比利时微生物协作保藏中心(Belgian Coordinated Collection of Microorganisms,BCCM)的国际保藏机构(International Depository Authority)。
病毒学研究院,斯洛伐克科学院US20080177046;US20080176310;US20080176258;US20050031623
Novartis:US20090252738
Wilex:US7,691,375–例如,由杂交瘤细胞系DSM ASC 2526产生的抗体。Wilex:US20110123537;Rencarex:Kennett RH.,et al Curr Opin Mol Ther.2003年2月;5(1):70-5
Xencor:US20090162382
(45)EGFRvIII(表皮生长因子受体(EGFR),转录变体3,
核苷酸
Genbank登录号NM_201283
Genbank版本号NM_201283.1 GI:41327733
Genbank记录更新日期:2012年9月30日,01:47PM
多肽
Genbank登录号NP_958440
Genbank版本号.NP_958440.1 GI:41327734
Genbank记录更新日期:2012年9月30日,01:47PM
交叉引用
Batra SK.,et al Cell Growth Differ 1995;6:1251–1259.
抗体:
US7,628,986和US7,736,644(Amgen)
例如,选自由SEQ ID NO:142和变体组成的组的重链可变区氨基酸序列以及选自由SEQ ID NO:144和变体组成的组的轻链可变区氨基酸序列。US20100111979(Amgen)
例如,包含含有以下的重链氨基酸序列的抗体:
由选自由用于抗体13.1.2(SEQ ID NO:138)、131(SEQ ID NO:2)、170(SEQ ID NO:4)、150(SEQ ID NO:5)、095(SEQ ID NO:7)、250(SEQ ID NO:9)、139(SEQ ID NO:10)、211(SEQ ID NO:12)、124(SEQ ID NO:13)、318(SEQ ID NO:15)、342(SEQ ID NO:16)和333(SEQID NO:17)的CDR1区的氨基酸序列组成的组的序列组成的CDR1;
选自由用于抗体13.1.2(SEQ ID NO:138)、131(SEQ ID NO:2)、170(SEQ ID NO:4)、150(SEQ ID NO:5)、095(SEQ ID NO:7)、250(SEQ ID NO:9)、139(SEQ ID NO:10)、211(SEQ ID NO:12)、124(SEQ ID NO:13)、318(SEQ ID NO:15)、342(SEQ ID NO:16)和333(SEQID NO:17)的CDR2区的氨基酸序列组成的组的序列组成的CDR2;以及
选自由用于抗体13.1.2(SEQ ID NO:138)、131(SEQ ID NO:2)、170(SEQ ID NO:4)、150(SEQ ID NO:5)、095(SEQ ID NO:7)、250(SEQ ID NO:9)、139(SEQ ID NO:10)、211(SEQ ID NO:12)、124(SEQ ID NO:13)、318(SEQ ID NO:15)、342(SEQ ID NO:16)和333(SEQID NO:17)的CDR3区的氨基酸序列组成的组的序列组成的CDR3。
US20090240038(Amgen)
例如,具有包含与选自由以下组成的组中的氨基酸序列至少90%相同的氨基酸序列的重链多肽或轻链多肽中的至少一种的抗体:SEQ ID NO:2、SEQ ID NO:19、SEQ ID NO:142、SEQ ID NO:144和它们的任意组合。
US20090175887(Amgen)
例如,具有选自由以下抗体的重链氨基酸序列组成的组的重链氨基酸序列的抗体:13.1.2(SEQ ID NO:138)、131(SEQ ID NO:2)、170(SEQ ID NO:4)、150(SEQ ID NO:5)、095(SEQ ID NO:7)、250(SEQ ID NO:9)、139(SEQ ID NO:10)、211(SEQ ID NO:12)、124(SEQID NO:13)、318(SEQ ID NO:15)、342(SEQ ID NO:16)和333(SEQ ID NO:17)。
US20090156790(Amgen)
例如,具有重链多肽和轻链多肽的抗体,其中,重链或轻链多肽中的至少一种包含与选自由以下组成的组中的氨基酸序列至少90%相同的氨基酸序列:SEQ ID NO:2、SEQ IDNO:19、SEQ ID NO:142、SEQ ID NO:144和它们的任意组合。
US20090155282,US20050059087和US20050053608(Amgen)
例如,具有选自由以下抗体的重链氨基酸序列组成的组中的重链氨基酸序列的抗体:13.1.2(SEQ ID NO:138)、131(SEQ ID NO:2)、170(SEQ ID NO:4)、150(SEQ ID NO:5)、095(SEQ ID NO:7)、250(SEQ ID NO:9)、139(SEQ ID NO:10)、211(SEQ ID NO:12)、124(SEQID NO:13)、318(SEQ ID NO:15)、342(SEQ ID NO:16)和333(SEQ ID NO:17)。
MR1-1(US7,129,332;Duke)
例如,具有SEQ ID NO.18的序列的变体抗体,其中,CDR3 VH中置换了S98P-T99Y以及CDR3 VL中置换了F92W。
L8A4,H10,Y10(Wikstrand CJ.,et al Cancer Res.1995年7月15日;55(14):3140-8;Duke)
US20090311803(哈弗大学)
例如,用于抗体重链可变区的SEQ ID NO:9以及用于轻链可变区氨基酸序列的SEQID NO:3。
US20070274991(EMD72000,也称为马妥珠单抗;哈弗大学)
例如,分别用于轻链和重链的SEQ ID NO:3和9
US6,129,915(Schering)
例如,SEQ.ID NO:1、2、3、4、5和6。
mAb CH12-Wang H.,et al FASEB J.2012 Jan;26(1):73-80(上海市肿瘤研究所)。
RAbDMvIII-Gupta P.,et al BMC Biotechnol.2010 Oct 7;10:72(斯坦福大学医学中心)。
mAb Ua30-Ohman L.,et al Tumour Biol.2002年3月-4月;23(2):61-9(乌普萨拉大学)。
Han DG.,et al Nan Fang Yi Ke Da Xue Xue Bao.2010 Jan;30(1):25-9(西安交通大学)。
(46)CD33(CD33分子)
核苷酸
Genbank登录号M_23197
Genbank版本号NM_23197.1 GI:180097
Genbank记录更新日期:2010年6月23日,08:47AM
多肽
Genbank登录号AAA51948
Genbank版本号AAA51948.1 GI:188098
Genbank记录更新日期:2010年6月23日,08:47AM
交叉引用
Simmons D.,et al J.Immunol.141(8),2797-2800(1988)
其他信息
官方符号:CD33
其他别名:SIGLEC-3,SIGLEC3,p67
其他命名:CD33抗原(gp67);gp67;骨髓细胞表面抗原CD33;唾液酸Ig样凝集素3;唾液酸Ig样凝集素
抗体
H195(林妥珠单抗)-Raza A.,et al Leuk Lymphoma.2009年8月;50(8):1336-44;US6,759,045(Seattle Genetics/Immunomedics)
mAb OKT9:Sutherland,D.R.et al.Proc Natl Acad Sci USA 78(7):4515-45191981,Schneider,C.,et al J Biol Chem 257,8516-8522(1982)
mAb E6:Hoogenboom,H.R.,et al J Immunol 144,3211-3217(1990)
US6,590,088(人类基因组科学(Human Genome Sciences))
例如,SEQ ID NO:1和2以及ATCC登录号97521
US7,557,189(Immunogen)
例如,包含重链可变区和轻链可变区的抗体或它们的片段,该重链可变区包含具有SEQ ID NO:1-3的氨基酸序列的三种CDR,该轻链可变区包含具有SEQ ID NO:4-6的氨基酸序列的三种CDR。
(47)CD19(CD19分子)
核苷酸
Genbank登录号NM_001178098
Genbank版本号NM_001178098.1 GI:296010920
Genbank记录更新日期:2012年9月10日,12:43AM
多肽
Genbank登录号NP_001171569
Genbank版本号NP_001171569.1 GI:296010921
Genbank记录更新日期:2012年9月10日,12:43AM
交叉引用
Tedder TF.,et al J.Immunol.143(2):712-7(1989)
其他信息
官方符号:CD19
其他别名:B4,CVID3
其他命名:B淋巴细胞CD19;B淋巴细胞表面抗原B4;T细胞表面抗原Leu-12;分化抗原CD19
抗体
Immunogen:HuB4-Al-Katib AM.,et al Clin Cancer Res.2009年6月15日;15(12):4038-45.
4G7:Kügler M.,et al Protein Eng Des Sel.2009年3月;22(3):135-47
例如,Knappik,A.et al.J Mol Biol 2000年2月;296(1):57-86中的图3中的序列
AstraZeneca/MedImmune:MEDI-551-Herbst R.,et al J Pharmacol ExpTher.2010年10月;335(1):213-22
Glenmark Pharmaceuticals:GBR-401-Hou S.,et al Mol Cancer Ther 2011年11月10日(会议文摘增刊(Meeting Abstract Supplement))C164
US7,109,304(Immunomedics)
例如,包含hA19Vk(SEQ ID NO:7)的序列和hA19VH(SEQ ID NO:10)的序列的抗体
US7,902,338(Immunomedics)
例如,包含轻链互补决定区CDR序列和重链CDR序列,而且还包含一个或多个框架区氨基酸残基被母体鼠科动物抗体的对应框架区序列置换的人类抗体框架(FR)和恒定区序列的抗体或它们的抗原结合片段,该轻链互补决定区CDR序列包括SEQ ID NO:16(KASQSVDYDGDSYLN)的CDR1;SEQ ID NO:17(DASNLVS)的CDR2;和SEQ ID NO:18(QQSTEDPWT)的CDR3;该重链CDR序列包括SEQ ID NO:19(SYWMN)的CDR1;SEQ ID NO:20(QIWPGDGDTNYNGKFKG)的CDR2;和SEQ ID NO:21(RETTTVGRYYYAMDY)的CDR3,并且其中,所述置换的FR残基包括在重链可变区的Kabat残基91处丝氨酸对苯丙氨酸的置换。
Medarex:MDX-1342–Cardarelli PM.,et al Cancer Immunol Immunother.2010年2月;59(2):257-65.
MorphoSys/Xencor:MOR-208/XmAb-5574-Zalevsky J.,et al Blood.2009年4月16日;113(16):3735-43
US7,968,687(Seattle Genetics)
包含重链可变结构域和轻链可变结构域的抗体或抗原结合片段,该重链可变结构域包含SEQ ID NO:9的氨基酸序列,该轻链可变结构域包含SEQ ID NO:24的氨基酸序列。
4G7 chim-Lang P.,et al Blood.2004年5月15日;103(10):3982-5(Tübingen大学)
例如,US20120082664的图6和SEQ ID No:80
浙江大学医学院(Zhejiang University School of Medicine):2E8-Zhang J.,et al J Drug Target.2010年11月;18(9):675-8
(48)IL2RA(白介素2受体,α);NCBI参考序列:NM_000417.2);
核苷酸
Genbank登录号NM_000417
Genbank版本号NM_000417.2 GI:269973860
Genbank记录更新日期:2012年9月9日,04:59PM
多肽
Genbank登录号NP_000408
Genbank版本号NP_000408.1 GI:4557667
Genbank记录更新日期:2012年9月9日,04:59PM
交叉引用
Kuziel W.A.,et al J.Invest.Dermatol.94(6SUPPL),27S-32S(1990)
其他信息
官方符号:IL2RA
其他别名:RP11-536K7.1,CD25,IDDM10,IL2R,TCGFR
其他命名:FIL-2受体α子单元;IL-2-RA;IL-2Rα子单元;IL2-RA;TAC抗原;白介素-2受体α子单元;p55
抗体
US6,383,487(Novartis/UCL:巴克斯单抗(Baxilisimab)[Simulect])
US6,521,230(Novartis/UCL:巴克斯单抗(Baxilisimab)[Simulect])
例如,具有抗原结合位点的抗体包含至少一种结构域,该结构域包含具有SEQ.ID.NO:7中的氨基酸序列的CDR1、具有SEQ.ID.NO:8中的氨基酸序列的CDR2和具有SEQ.ID.NO:9中的氨基酸序列的CDR3;或所述CDR1、CDR2和CDR3的序列整体包含与SEQ.ID.NO:7、8和9的整体序列至少90%相同的氨基酸序列。
达利珠单抗(Daclizumab)–Rech AJ.,et al Ann N Y Acad Sci.2009年9月;1174:99-106(Roche)
(49)AXL(AXL受体酪氨酸激酶)
核苷酸
Genbank登录号M76125
Genbank版本号M76125.1 GI:292869
Genbank记录更新日期:2010年6月23日,08:53AM
多肽
Genbank登录号AAA61243
Genbank版本号AAA61243.1 GI:29870
Genbank记录更新日期:2010年6月23日,08:53AM
交叉引用
O'Bryan J.P.,et al Mol.Cell.Biol.11(10),5016-5031(1991);BergsagelP.L.,etal J.Immunol.148(2),590-596(1992)
其他信息
官方符号:AXL
其他别名:JTK11,UFO
其他命名:AXL原癌基因;AXL转化序列/基因;原癌基因AXL;酪氨酸-蛋白质激酶受体UFO
抗体
YW327.6S2-Ye X.,et al Oncogene.2010年9月23日;29(38):5254-64.(Genentech)
BergenBio:BGB324(http://www.bergenbio.com/BGB324)
(50)CD30-TNFRSF8(肿瘤坏死因子受体超家族,成员8)
核苷酸
Genbank登录号M83554
Genbank版本号M83554.1 GI:180095
Genbank记录更新日期:2010年6月23日,08:53AM
多肽
Genbank登录号AAA51947
Genbank版本号AAA51947.1 GI:180096
Genbank记录更新日期:2010年6月23日,08:53AM
交叉引用
Durkop H.,et al Cell 68(3),421-427(1992)
其他信息
官方符号:TNFRSF8
其他别名:CD30,D1S166E,Ki-1
其他命名:CD30L受体;Ki-1抗原;细胞因子受体CD30;淋巴细胞活化抗原CD30;肿瘤坏死因子受体超家族成员8
(51)BCMA(B细胞成熟抗原)-TNFRSF17(肿瘤坏死因子受体超家族,成员17)
核苷酸
Genbank登录号Z29574
Genbank版本号Z29574.1 GI:471244
Genbank记录更新日期:2011年2月2日,10:40AM
多肽
Genbank登录号CAA82690
Genbank版本号CAA82690.1 GI:471245
Genbank记录更新日期:2011年2月2日,10:40AM
交叉引用
Laabi Y.,et al Nucleic Acids Res.22(7),1147-1154(1994)
其他信息
官方符号:TNFRSF17
其他别名:BCM,BCMA,CD269
其他命名:B细胞成熟抗原;B细胞成熟因子;B细胞成熟蛋白质;肿瘤坏死因子受体超家族成员17
(52)CT Ags–CTA(肿瘤睾丸抗原)
交叉引用
Fratta E.,et al.Mol Oncol.2011年4月;5(2):164-82;Lim SH.,at al Am JBlood Res.2012;2(1):29-35.
(53)CD174(Lewis Y)-FUT3(岩藻糖转移酶3(半乳糖苷3(4)-L-岩藻糖转移酶,Lewis血型)
核苷酸
Genbank登录号NM000149
Genbank版本号NM000149.3 GI:148277008
Genbank记录更新日期:2012年6月26日,04:49PM
多肽
Genbank登录号NP_000140
Genbank版本号NP_000140.1 GI:4503809
Genbank记录更新日期:2012年6月26日,04:49PM
交叉引用
Kukowska-Latallo,J.F.,et al Genes Dev.4(8),1288-1303(1990)
其他信息
官方符号:FUT3
其他别名:CD174,FT3B,FucT-III,LE,Les
其他命名:Lewis FT;α-(1,3/1,4)-岩藻糖转移酶;血型Lewisα-4-岩藻糖转移酶;岩藻糖转移酶III;半乳糖苷3(4)-L-岩藻糖转移酶
(54)CLEC14A(C型凝集素家族,成员A;Genbank登录号NM175060)
核苷酸
Genbank登录号NM175060
Genbank版本号NM175060.2 GI:371123930
Genbank记录更新日期:2012年4月1日,03:34PM
多肽
Genbank登录号NP_778230
Genbank版本号NP_778230.1 GI:28269707
Genbank记录更新日期:2012年4月1日,03:34PM
其他信息
官方符号:CLEC14A
其他别名:UNQ236/PRO269,C14orf27,CEG1,EGFR-5
其他命名:C型凝集素结构域家族14成员A;包含蛋白质的ClECT和EGF样结构域;表皮生长因子受体5
(55)GRP78–HSPA5(热休克70kDa蛋白质5(葡萄糖调节的蛋白质,78kDa)
核苷酸
Genbank登录号NM005347
Genbank版本号NM005347.4 GI:305855105
Genbank记录更新日期:2012年9月30日,01:42PM
多肽
Genbank登录号NP_005338
Genbank版本号NP_005338.1 GI:16507237
Genbank记录更新日期:2012年9月30日,01:42PM
交叉引用
Ting J.,et al DNA 7(4),275-286(1988)
其他信息
官方符号:HSPA5
其他别名:BIP,GRP78,MIF2
其他命名:78kDa葡萄糖调节的蛋白质;内质网腔Ca(2+)结合蛋白质grp78;免疫球蛋白重链结合蛋白质
(56)CD70(CD70分子)L08096
核苷酸
Genbank登录号L08096
Genbank版本号L08096.1 GI:307127
Genbank记录更新日期:2012年6月23日,08:54AM
多肽
Genbank登录号AAA36175
Genbank版本号AAA36175.1 GI:307128
Genbank记录更新日期:2012年6月23日,08:54AM
交叉引用
Goodwin R.G.,et al Cell 73(3),447-456(1993)
其他信息
官方符号:CD70
其他别名:CD27L,CD27LG,TNFSF7
其他命名:CD27配体;CD27-L;CD70抗原;Ki-24抗原;表面抗原CD70;肿瘤坏死因子(配体)超家族,成员7;肿瘤坏死因子配体超家族成员7抗体针对CD70的MDX-1411(Medarex)
h1F6(Oflazoglu,E.,et al,Clin Cancer Res.2008年10月1日;14(19):6171-80;Seattle Genetics)
例如,参见US20060083736SEQ ID NO:1、2、11和12以及图1。
(57)干细胞特异性抗原。例如:
●5T4(参见以下条目(63))
●CD25(参见以上条目(48))
●CD32
○多肽
■Genbank登录号ABK42161
■Genbank版本号ABK42161.1 GI:117616286
■Genbank记录更新日期:2007年6月25日,03:00PM
●LGR5/GPR49
○核苷酸
■Genbank登录号NM_003667
■Genbank版本号NM_003667.2 GI:24475886
■Genbank记录更新日期:2012年6月22日,03:38PM
○多肽
■Genbank登录号NP_003658
■Genbank版本号NP_003658.1 GI:4504379
■Genbank记录更新日期:2012年7月22日,03:38PM
●Prominin/CD133
○核苷酸
■Genbank登录号NM_006017
■Genbank版本号NM_006017.2 GI:224994187
■Genbank记录更新日期:2012年9月30日,01:47PM
○多肽
■Genbank登录号NP_006008
■Genbank版本号NP_006008.1 GI:5174387
■Genbank记录更新日期:2012年9月30日,01:47PM
(58)ASG-5
交叉引用
(Smith L.M.,et.al AACR 2010 Annual Meeting(摘要#2590);Gudas J.M.,et.al.AACR 2010 Annual Meeting(摘要#4393)
抗体
抗AGS-5抗体:M6.131(Smith,L.M.,et.al AACR 2010 Annual Meeting(摘要#2590))
(59)ENPP3(核苷酸内焦磷酸酶/磷酸二酯酶3)
核苷酸
Genbank登录号AF005632
Genbank版本号AF005632.2 GI:4432589
Genbank记录更新日期:2010年3月10日,09:41PM
多肽
Genbank登录号AAC51813
Genbank版本号AAC51813.1 GI:2465540
Genbank记录更新日期:2010年3月10日,09:41PM
交叉引用
Jin-Hua P.,et al Genomics 45(2),412-415(1997)
其他信息
官方符号:ENPP3
其他别名:RP5-988G15.3,B10,CD203c,NPP3,PD-IBETA,PDNP3其他命名:E-NPP 3;dJ1005H11.3(磷酸二酯酶I/核苷酸焦磷酸酶3);dJ914N13.3(磷酸二酯酶/核苷酸焦磷酸酶3);核苷酸内焦磷酸酶/磷酸二酯酶家族成员3;gp130RB13-6;β磷酸二酯酶I;磷酸二酯酶I/核苷酸焦磷酸酶3;β磷酸二酯酶-I
(60)PRR4(富含脯氨酸4(泪腺))
核苷酸
Genbank登录号NM_007244
Genbank版本号NM_007244.2 GI:154448885
Genbank记录更新日期:2012年6月28日,12:39PM
多肽
Genbank登录号NP_009175
Genbank版本号NP_009175.2 GI:154448886
Genbank记录更新日期:2012年6月28日,12:39PM
交叉引用
Dickinson D.P.,et al Invest.Ophthalmol.Vis.Sci.36(10),2020-2031(1995)
其他信息
官方符号:PRR4
其他别名:LPRP,PROL4
其他命名:富含泪腺脯氨酸的蛋白质;鼻咽癌相关的富含脯氨酸的蛋白质4;富含脯氨酸的多肽4;富含脯氨酸的蛋白质4
(61)GCC–GUCY2C(鸟苷酸环化酶2C(热稳定的肠毒素受体)
核苷酸
Genbank登录号NM_004963
Genbank版本号NM_004963.3 GI:222080082
Genbank记录更新日期:2012年9月2日,01:50PM
多肽
Genbank登录号NP_004954
Genbank版本号NP_004954.2 GI:222080083
Genbank记录更新日期:2012年9月2日,01:50PM
交叉引用
De Sauvage F.J.,et al J.Biol.Chem.266(27),17912-17918(1991);Singh S.,et al Biochem.Biophys.Res.Commun.179(3),1455-1463(1991)
其他信息
官方符号:GUCY2C
其他别名:DIAR6,GUC2C,MUCIL,STAR
其他命名:GC-C;STA受体;鸟苷酸环化酶C;hSTAR;热稳定的肠毒素受体;肠道鸟苷酸环化酶
(62)Liv-1–SLC39A6(溶质载体家族39(锌运载体),成员6)
核苷酸
Genbank登录号U41060
Genbank版本号U41060.2 GI:12711792
Genbank记录更新日期:2009年11月30日,04:35PM
多肽
Genbank登录号AAA96258
Genbank版本号AAA96258.2 GI:12711793
Genbank记录更新日期:2009年11月30日,04:35PM
交叉引用
Taylor KM.,et al Biochim Biophys Acta.2003年4月1日;1611(1-2):16-30
其他信息
官方符号:SLC39A6
其他别名:LIV-1
其他命名:雌激素调节的LIV-1蛋白质;ZIP-6;雌激素调节的蛋白质LIV-1;溶质载体家族39(金属离子运载体),成员6;溶质载体家族39成员6;锌运载体ZIP6;zrt和Irt样蛋白质6
(63)5T4,滋养层糖蛋白,TPBG–TPBG(滋养层糖蛋白)
核苷酸
Genbank登录号AJ012159
Genbank版本号AJ012159.1 GI:3805946
Genbank记录更新日期:2011年2月1日,10:27AM
多肽
Genbank登录号CAA09930
Genbank版本号CAA09930.1 GI:3805947
Genbank记录更新日期:2011年2月1日,10:27AM
交叉引用
King K.W.,et al Biochim.Biophys.Acta 1445(3),257-270(1999)
其他信息
●官方符号:TPBG
●其他别名:5T4,5T4AG,M6P1
●其他命名:5T4癌胚抗原;5T4癌胚滋养层糖蛋白(5T4 oncofetal trophoblastglycoprotein);5T4癌胚滋养层糖蛋白(5T4 oncotrophoblast glycoprotein)
(64)CD56–NCMA1(神经细胞粘附分子1)
核苷酸
Genbank登录号NM_000615
Genbank版本号NM_000615.6 GI:336285433
Genbank记录更新日期:2012年9月23日,02:32PM
多肽
Genbank登录号NP_000606
Genbank版本号NP_000606.3 GI:94420689
Genbank记录更新日期:2012年9月23日,02:32PM
交叉引用
Dickson,G.,et al,Cell 50(7),1119-1130(1987)
其他信息
官方符号:NCAM1
其他别名:CD56,MSK39,NCAM
其他命名:由单克隆抗体5.1H11识别的抗原;神经细胞粘附分子,NCAM抗体
Immunogen:HuN901(Smith SV.,et al Curr Opin Mol Ther.2005年8月;7(4):394-401)
例如,参见人源化得鼠科动物N901抗体。参见Roguska,M.A.,et al.Proc NatlAcad Sci USA 1994年2月;91:969-973的图1b和1e。
(65)CanAg(肿瘤相关抗原CA242)
交叉引用
Haglund C.,et al Br J Cancer 60:845-851,1989;Baeckstrom D.,et al JBiol Chem 266:21537-21547,1991
抗体
huC242(Tolcher AW et al.,J Clin Oncol.2003年1月15日;21(2):211-22;Immunogen)
例如,参见US20080138898A1 SEQ ID NO:1和2
(66)FOLR1(叶酸受体1)
核苷酸
Genbank登录号J05013
Genbank版本号J05013.1 GI:182417
Genbank记录更新日期:2010年6月23日,08:47AM
多肽
Genbank登录号AAA35823
Genbank版本号AAA35823.1 GI:182418
Genbank记录更新日期:2010年6月23日,08:47AM
交叉引用
Elwood P.C.,et al J.Biol.Chem.264(25),14893-14901(1989)
其他信息
官方符号:FOLR1
其他别名:FBP,FOLR
其他命名:FR-α;KB细胞FBP;成人叶酸结合蛋白质;叶酸结合蛋白质;叶酸α受体;成人叶酸受体;卵巢肿瘤相关抗原MOv18
抗体
M9346A-Whiteman KR.,et al Cancer Res,2012年4月15日;72(8增刊):4628(Immunogen)
(67)GPNMB(糖蛋白(跨膜)nmb)
核苷酸
Genbank登录号X76534
Genbank版本号X76534.1 GI:666042
Genbank记录更新日期:2011年2月2日,10:10AM
多肽
Genbank登录号CAA54044
Genbank版本号CAA54044.1 GI:666043
Genbank记录更新日期:2011年2月2日,10:10AM
交叉引用
Weterman M.A.,et al Int.J.Cancer 60(1),73-81(1995)
其他信息
官方符号:GPNMB
其他别名:UNQ1725/PRO9925,HGFIN,NMB
其他命名:糖蛋白NMB;糖蛋白nmb样蛋白质;osteoactivin;跨膜糖蛋白HGFIN;跨膜糖蛋白NMB
抗体
Celldex Therapeutics:CR011(Tse KF.,et al Clin Cancer Res.2006年2月15日;12(4):1373-82)
例如,参见EP1827492B1 SEQ ID NO:22、24、26、31、33和35
(68)TIM-1–HAVCR1(甲型肝炎病毒细胞受体1)
核苷酸
Genbank登录号AF043724
Genbank版本号AF043724.1 GI:2827453
Genbank记录更新日期:2010年3月10日,06:24PM
多肽
Genbank登录号AAC39862
Genbank版本号AAC39862.1 GI:2827454
Genbank记录更新日期:2010年3月10日,06:24PM
交叉引用
Feigelstock D.,et al J.Virol.72(8),6621-6628(1998)
其他信息
官方符号:HAVCR1
其他别名:HAVCR,HAVCR-1,KIM-1,KIM1,TIM,TIM-1,TIM1,TIMD-1,TIMD1
其他命名:T细胞免疫球蛋白结构域和粘蛋白结构域蛋白质1;T细胞膜蛋白质1;肾脏损伤分子1
(69)RG-1/前列腺肿瘤靶向Mindin–Mindin/RG-1
交叉引用
Parry R.,et al Cancer Res.2005年9月15日;65(18):8397-405
(70)B7-H4–VTCN1(包含T细胞活化抑制剂1的V-set结构域
核苷酸
Genbank登录号BX648021
Genbank版本号BX648021.1 GI:34367180
Genbank记录更新日期:2011年2月2日,08:40AM
交叉引用
Sica GL.,et al Immunity.2003年1月;18(6):849-61
其他信息
官方符号:VTCN1
其他别名:RP11-229A19.4,B7-H4,B7H4,B7S1,B7X,B7h.5,PRO1291,VCTN1
其他命名:B7家族成员H4;B7超家族成员1;T细胞共刺激分子B7x;T-细胞共刺激分子B7x;包含T细胞活化抑制剂1的V-set结构域;免疫共刺激蛋白质B7-H4
(71)PTK7(PTK7蛋白质酪氨酸激酶7)
核苷酸
Genbank登录号AF447176
Genbank版本号AF447176.1 GI:17432420
Genbank记录更新日期:2008年11月28日,01:51PM
多肽
Genbank登录号AAL39062
Genbank版本号AAL39062.1 GI:17432421
Genbank记录更新日期:2008年11月28日,01:51PM
交叉引用
Park S.K.,et al J.Biochem.119(2),235-239(1996)
其他信息
官方符号:PTK7
其他别名:CCK-4,CCK4
其他命名:结肠癌激酶4;不活跃的酪氨酸-蛋白质激酶7;伪酪氨酸激酶受体7;酪氨酸-蛋白质激酶样7
(72)CD37(CD37分子)
核苷酸
Genbank登录号NM_001040031
Genbank版本号NM_001040031.1 GI:91807109
Genbank记录更新日期:2012年7月29日,02:08PM
多肽
Genbank登录号NP_001035120
Genbank版本号NP_001035120.1 GI:91807110
Genbank记录更新日期:2012年7月29日,02:08PM
交叉引用
Schwartz-Albiez R.,et al J.Immunol.140(3),905-914(1988)
其他信息
官方符号:CD37
其他别名:GP52-40,TSPAN26
其他命名:CD37抗原;细胞分化抗原37;白细胞抗原CD37;白细胞表面抗原CD37;四次穿膜蛋白26;tspan-26
抗体
Boehringer Ingelheim:mAb 37.1(Heider KH.,et al Blood.2011年10月13日;118(15):4159-68)
Trubion:CD37-SMIP(G28-1 scFv-Ig)((Zhao X.,et al Blood.2007;110:2569-2577)
例如,参见US20110171208A1 SEQ ID NO:253
Immunogen:K7153A(Deckert J.,et al Cancer Res,2012年4月15日;72(8增刊):4625)
(73)CD138–SDC1(多配体聚糖1)
核苷酸
Genbank登录号AJ551176
Genbank版本号AJ551176.1 GI:29243141
Genbank记录更新日期:2011年2月1日,12:09PM
多肽
Genbank登录号CAD80245
Genbank版本号CAD80245.1 GI:29243142
Genbank记录更新日期:2011年2月1日,12:09PM
交叉引用
O'Connell FP.,et al Am J Clin Pathol.2004年2月;121(2):254-63
其他信息
官方符号:SDC1
其他别名:CD138,SDC,SYND1,多配体聚糖其他命名:CD138抗原;硫酸乙酰肝素蛋白多糖成纤维细胞生长因子受体;多配体聚糖蛋白多糖1;多配体聚糖1
抗体
Biotest:嵌合MAb(nBT062)-(Jagannath S.,et al Poster ASH#3060,2010;WIPO专利申请WO/2010/128087)
例如,参见US20090232810SEQ ID NO:1和2
Immunogen:B-B4(Tassone P.,et al Blood 104_3688-3696)
例如,参见US20090175863A1 SEQ ID NO:1和2
(74)CD74(CD74分子,主要的组织相容性复合物,II类不变链)
核苷酸
Genbank登录号NM_004355
Genbank版本号NM_004355.1 GI:343403784
Genbank记录更新日期:2012年9月23日,02:30PM
多肽
Genbank登录号NP_004346
Genbank版本号NP_004346.1 GI:10835071
Genbank记录更新日期:2012年9月23日,02:30PM
交叉引用
Kudo,J.,et al Nucleic Acids Res.13(24),8827-8841(1985)
其他信息
官方符号:CD74
其他别名:DHLAG,HLADG,II,Ia-GAMMA
其他命名:CD74抗原(II类抗原相关的主要的组织相容性复合物的不变多肽);HLAII类组织相容性抗原γ链;HLA-DR抗原相关的不变链;HLA-DR-γ;Ia相关的不变链;MHCHLA-DRγ链;II类抗原的γ链;p33抗体
Immunomedics:hLL1(米拉珠单抗(Milatuzumab))-Berkova Z.,et al ExpertOpin Investig Drugs.2010年1月;19(1):141-9)
例如,参见US20040115193SEQ ID NO:19、20、21、22、23和24Genmab:HuMax-CD74(参见网址)
(75)Claudins–CLs(闭合蛋白)
交叉引用
Offner S.,et al Cancer Immunol Immunother.2005年5月;54(5):431-45,Suzuki H.,et al Ann N Y Acad Sci.2012年7月;1258:65-70)
在人类中,描述了家族的24成员–参见参考文献
(76)EGFR(表皮生长因子受体)
核苷酸
Genbank登录号NM_005228
Genbank版本号NM_005228.3 GI:41927737
Genbank记录更新日期:2012年9月30日,01:47PM
多肽
Genbank登录号NP_005219
Genbank版本号NP_005219.2 GI:29725609
Genbank记录更新日期:2012年9月30日,01:47PM
交叉引用
Dhomen NS.,et al Crit Rev Oncog.2012;17(1):31-50
其他信息
官方符号:EGFR
其他别名:ERBB,ERBB1,HER1,PIG61,mENA
其他命名:禽流感成红细胞白血病病毒(v-erb-b)致癌基因同源体;细胞生长抑制蛋白质40;细胞增殖诱导蛋白质61;原癌基因c-ErbB-1;受体酪氨酸-蛋白质激酶erbB-1
抗体
BMS:西妥昔单抗(Erbitux)-Broadbridge VT.,et al Expert Rev AnticancerTher.2012年5月;12(5):555-65.
例如,参见US6217866–ATTC保藏号9764。
Amgen:帕尼单抗(Vectibix)-Argiles G.,et al Future Oncol.2012年4月;8(4):373-89
例如,参见US6235883SEQ ID NO:23-38.
Genmab:扎鲁目单抗(Zalutumumab)-Rivera F.,et al Expert Opin BiolTher.2009年5月;9(5):667-74.
YM Biosciences:尼妥珠单抗-Ramakrishnan MS.,et al MAbs.2009年1-2月;1(1):41-8.
例如,参见US5891996SEQ ID NO:27-34.
(77)Her3(ErbB3)–ERBB3(v-erb-b2成红细胞白血病病毒致癌基因同源体(禽流感))
核苷酸
Genbank登录号M34309
Genbank版本号M34309.1 GI:183990
Genbank记录更新日期:2010年6月23日,08:47PM
多肽
Genbank登录号AAA35979
Genbank版本号AAA35979.1 GI:306841
Genbank记录更新日期:2010年6月23日,08:47PM
交叉引用
Plowman,G.D.,et al.,Proc.Natl.Acad.Sci.U.S.A.87(13),4905-4909(1990)
其他信息
官方符号:ERBB3
其他别名:ErbB-3,HER3,LCCS2,MDA-BF-1,c-erbB-3,c-erbB3,erbB3-S,p180-ErbB3,p45-sErbB3,p85-sErbB3
其他命名:原癌基因样蛋白质c-ErbB-3;受体酪氨酸-蛋白质激酶erbB-3;酪氨酸激酶型细胞表面受体HER3
抗体
Merimack Pharma:MM-121(Schoeberl B.,et al Cancer Res.2010年3月15;70(6):2485-2494)
例如,参见US2011028129SEQ ID NO:1、2、3、4、5、6、7和8。
(78)RON-MST1R(巨噬细胞刺激1受体(c-met相关酪氨酸激酶))
核苷酸
Genbank登录号X70040
Genbank版本号X70040.1 GI:36109
Genbank记录更新日期:2011年2月2日,10:17PM
多肽
Genbank登录号CCA49634
Genbank版本号CCA49634.1 GI:36110
Genbank记录更新日期:2011年2月2日,10:17PM
交叉引用
Ronsin C.,et al Oncogene 8(5),1195-1202(1993)
其他信息
官方符号:MST1R
其他别名:CD136,CDw136,PTK8,RON
其他命名:MSP受体;MST1R变体RON30;MST1R变体RON62;PTK8蛋白质酪氨酸激酶8;RON变体E2E3;c-met-相关酪氨酸激酶;巨噬细胞刺激蛋白质受体;p185-Ron;溶质RON变体1;溶质RON变体2;溶质RON变体3;溶质RON变体4
(79)EPHA2(EPH受体A2)
核苷酸
Genbank登录号BC037166
Genbank版本号BC037166.2 GI:33879863
Genbank记录更新日期:2012年3月6日,01:59PM
多肽
Genbank登录号AAH37166
Genbank版本号AAH37166.1 GI:22713539
Genbank记录更新日期:2012年3月6日,01:59PM
交叉引用
Strausberg R.L.,et al Proc.Natl.Acad.Sci.U.S.A.99(26),16899-16903(2002)
其他信息
官方符号:EPHA2
其他别名:ARCC2,CTPA,CTPP1,ECK
其他命名:肝配蛋白型A受体2;上皮细胞受体蛋白质酪氨酸激酶;溶质EPHA2变体1;酪氨酸-蛋白质激酶受体ECK
抗体
Medimmune:1C1(Lee JW.,et al Clin Cancer Res.2010年5月1日;16(9):2562-2570)
例如,参见US20090304721A1图7和8。
(80)CD20–MS4A1(跨膜4-结构域,子家族A,成员1)
核苷酸
Genbank登录号M27394
Genbank版本号M27394.1 GI:179307
Genbank记录更新日期:2009年11月30日,11:16AM
多肽
Genbank登录号AAA35581
Genbank版本号AAA35581.1 GI:179308
Genbank记录更新日期:2009年11月30日,11:16AM
交叉引用
Tedder T.F.,et al Proc.Natl.Acad.Sci.U.S.A.85(1),208-212(1988)
其他信息
官方符号:MS4A1
其他别名:B1,Bp35,CD20,CVID5,LEU-16,MS4A2,S7其他命名:B淋巴细胞抗原CD20;B淋巴细胞表面抗原B1;CD20抗原;CD20受体;白细胞表面抗原Leu-16
抗体
Genentech/Roche:利妥昔单抗-Abdulla NE.,et al BioDrugs.2012年4月1日;26(2):71-82.
例如,参见US5736137,ATCC保藏号HB-69119。
GSK/Genmab:奥法木单抗(Ofatumumab)-Nightingale G.,et al AnnPharmacother.2011年10月;45(10):1248-55.
例如,参见US20090169550A1 SEQ ID NO:2、4和5。
Immunomedics:维妥珠单抗(Veltuzumab)-Goldenberg DM.,et al LeukLymphoma.2010年5月;51(5):747-55.
例如,参见US7919273B2 SEQ ID NO:1、2、3、4、5和6。
(81)腱生蛋白C–TNC(腱生蛋白C)
核苷酸
Genbank登录号NM_002160
Genbank版本号NM_002160.3 GI:340745336
Genbank记录更新日期:2012年9月23日,02:33PM
多肽
Genbank登录号NP_002151
Genbank版本号NP_002151.2 GI:153946395
Genbank记录更新日期:2012年9月23日,02:33PM
交叉引用
Nies D.E.,et al J.Biol.Chem.266(5),2818-2823(1991);Siri A.,et alNucleic Acids Res.19(3),525-531(1991)
其他信息
官方符号:TNC
其他别名:150-225,GMEM,GP,HXB,JI,TN,TN-C
其他命名:GP 150-225;腱生蛋白(cytotactin);神经胶质瘤相关的细胞外基质抗原;肌腱蛋白(hexabrachion)(腱生蛋白);肌腱抗原;神经粘连蛋白;腱生蛋白;腱生蛋白C同种型14/AD1/16
抗体
Philogen:G11(von Lukowicz T.,et al J Nucl Med.2007年4月;48(4):582-7)和F16(Pedretti M.,et al Lung Cancer.2009年4月;64(1):28-33)
例如,参见US7968685SEQ ID NO:29、35、45和47。
(82)FAP(成纤维细胞活化蛋白质,α)
核苷酸
Genbank登录号U09278
Genbank版本号U09278.1 GI:1888315
Genbank记录更新日期:2010年6月23日,09:22AM
多肽
Genbank登录号AAB49652
Genbank版本号AAB49652.1 GI:1888316
Genbank记录更新日期:2010年6月23日,09:22AM
交叉引用
Scanlan,M.J.,et al Proc.Natl.Acad.Sci.U.S.A.91(12),5657-5661(1994)
其他信息
官方符号:FAP
其他别名:DPPIV,FAPA
其他命名:170kDa黑色素瘤膜结合的白明胶酶;膜内在丝氨酸蛋白酶;seprase
(83)DKK-1(Dickkopf 1同源体(非洲爪蛙(Xenopus laevis))
核苷酸
Genbank登录号NM_012242
Genbank版本号NM_012242.2 GI:61676924
Genbank记录更新日期:2012年9月30日,01:48PM
多肽
Genbank登录号NP_036374
Genbank版本号NP_036374.1 GI:7110719
Genbank记录更新日期:2012年9月30日,01:48PM
交叉引用
Fedi P.et al J.Biol.Chem.274(27),19465-19472(1999)
其他信息
官方符号:DKK1
其他别名:UNQ492/PRO1008,DKK-1,SK
其他命名:dickkopf相关蛋白质1;dickkopf-1样;dickkopf样蛋白质1;dickkopf-相关蛋白质1;hDkk-1
抗体
Novartis:BHQ880(Fulciniti M.,et al Blood.2009年7月9日;114(2):371-379)例如,参见US20120052070A1 SEQ ID NO:100和108。
(84)CD52(CD52分子)
核苷酸
Genbank登录号NM_001803
Genbank版本号NM_001803.2 GI:68342029
Genbank记录更新日期:2012年9月30日,01:48PM
多肽
Genbank登录号NP_001794
Genbank版本号NP_001794.2 GI:68342030
Genbank记录更新日期:2012年9月30日,01:48PM
交叉引用
Xia M.Q.,et al Eur.J.Immunol.21(7),1677-1684(1991)
其他信息
官方符号:CD52
其他别名:CDW52
其他命名:CAMPATH-1抗原;CD52抗原(CAMPATH-1抗原);CDW52抗原(CAMPATH-1抗原);剑桥(cambridge)病理学1抗原;附睾分泌蛋白E5;he5;人类附睾特异性蛋白5
抗体
阿伦单抗(Alemtuzumab)(Campath)-Skoetz N.,et al Cochrane Database SystRev.2012年2月15日;2:CD008078.
例如,参见Drugbank登录号DB00087(BIOD00109,BTD00109)
(85)CS1-SLAMF7(SLAM家族成员7)
核苷酸
Genbank登录号NM_021181
Genbank版本号NM_021181.3 GI:1993571
Genbank记录更新日期:2012年6月29日,11:24AM
多肽
Genbank登录号NP_067004
Genbank版本号NP_067004.3 GI:19923572
Genbank记录更新日期:2012年6月29日,11:24AM
交叉引用
Boles K.S.,et al Immunogenetics 52(3-4),302-307(2001)
其他信息
官方符号:SLAMF7
其他别名:UNQ576/PRO1138,19A,CD319,CRACC,CS1
其他命名:19A24蛋白质;CD2子集1;CD2样受体激活细胞毒性细胞;CD2样受体-激活细胞毒性细胞;膜蛋白FOAP-12;新型LY9(淋巴细胞抗原9)样蛋白质;蛋白质19A
抗体
BMS:埃罗妥珠单抗(elotuzumab)/HuLuc63(Benson DM.,et al J ClinOncol.2012年6月1日;30(16):2013-2015)
例如,参见US20110206701SEQ ID NO:9、10、11、12、13、14、15和16。
(86)内皮糖蛋白–ENG(内皮糖蛋白)
核苷酸
Genbank登录号AF035753
Genbank版本号AF035753.1 GI:3452260
Genbank记录更新日期:2010年3月10日,06:36PM
多肽
Genbank登录号AAC32802
Genbank版本号AAC32802.1 GI:3452261
Genbank记录更新日期:2010年3月10日,06:36PM
交叉引用
Rius C.,et al Blood 92(12),4677-4690(1998)
官方符号:ENG
其他信息
其他别名:RP11-228B15.2,CD105,END,HHT1,ORW,ORW1其他命名:CD105抗原
(87)膜联蛋白A1–ANXA1(膜联蛋白)
核苷酸
Genbank登录号X05908
Genbank版本号X05908.1 GI:34387
Genbank记录更新日期:2011年2月2日,10:02AM
多肽
Genbank登录号CCA29338
Genbank版本号CCA29338.1 GI:34388
Genbank记录更新日期:2011年2月2日,10:02AM
交叉引用
Wallner B.P.,et al Nature 320(6057),77-81(1986)
其他信息
官方符号:ANXA1
其他别名:RP11-71A24.1,ANX1,LPC1
其他命名:膜联蛋白I(脂皮质蛋白I);膜联蛋白-1;依钙蛋白II;依钙蛋白-2;嗜铬粒结合蛋白-9;脂皮质蛋白I;p35;磷脂酶A2抑制蛋白
(88)V-CAM(CD106)-(VCAM1(血管细胞粘附分子1)
核苷酸
Genbank登录号M60335
Genbank版本号M60335.1 GI:340193
Genbank记录更新日期:2010年6月23日,08:56AM
多肽
Genbank登录号AAA61269
Genbank版本号AAA61269.1 GI:340194
Genbank记录更新日期:2010年6月23日,08:56AM
交叉引用
Hession C.,et al J.Biol.Chem.266(11),6682-6685(1991)
其他信息
官方符号:VCAM1
其他别名:CD106,INCAM-100
其他命名:CD106抗原;血管细胞粘附蛋白1
抗体序列
抗整合素αvβ6
RHAB6.2
RHCB6.2
RHF
RHFB6
RHAY100bP
RKF
RKFL36L50
RKC
抗-CD33
CD33 Hum195 VH
CD33 Hum195 VK
抗-CD19
CD19 B4表面重建的VH
CD19 B4表面重建的VK
抗Her2
赫塞汀VH链
赫塞汀VL链
抗CD25
舒莱VK(也称为巴利昔单抗)
舒莱VH
抗PSMA
去免疫的VH‘1
去免疫的VK‘1
去免疫的VH1‘5
去免疫的VH2‘5
去免疫的VH3‘5
去免疫的VH4‘5
去免疫的VK1‘5
去免疫的VK2‘5
去免疫的VK3‘5
去免疫的VK4‘5
去免疫的VK
DI‘5
去免疫的VH
DI‘5
人源化RHA‘5
人源化RHB‘5
人源化RHC‘5
人源化RHD‘5
人源化RHE‘5
人源化RHF‘5
人源化RHG‘5
人源化RKA‘5
人源化RKB‘5
人源化RKC‘5
人源化RKD‘5
人源化RKE‘5
人源化RKF‘5
人源化RKG‘5
母源抗体也可以是包含白蛋白结合肽(ABP)序列的融合蛋白质(Dennis et al.(2002)“Albumin Binding As A General Strategy For Improving ThePharmacokinetics Of Proteins”J Biol Chem.277:35035-35043;WO 01/45746)。本发明的抗体包含以下教导的具有ABP序列的融合蛋白质:(i)Dennis et al(2002)J BiolChem.277:35035-35043在表III和IV,第35038页;(ii)US 2004/0001827在[0076];以及(iii)WO 01/45746在第12-13页,且通过引用将其所有结合于此。
在一个实施方式中,抗体已经提升到特异性靶向肿瘤相关抗原ανβ6。
可以标记细胞结合剂,例如有助于在作为结合物结合之前或作为结合物的一部分的试剂的检测或纯化。标记可以是生物素标记。在另一个实施方式中,可以用放射性同位素标记细胞结合剂。
细胞结合剂连接至接头。在一个实施方式中,细胞结合剂连接至接头的A(如果存在)。
在一个实施方式中,细胞结合剂和接头之间通过硫醚键连接。
在一个实施方式中,细胞结合剂和接头之间通过二硫键连接。
在一个实施方式中,细胞结合剂和接头之间通过酰胺键连接。
在一个实施方式中,细胞结合剂和接头之间通过酯键连接。
在一个实施方式中,细胞结合剂和接头之间的连接在该细胞结合剂的半胱氨酸残基的硫醇基团和该接头的马来酰亚胺基团之间形成。
细胞结合剂的半胱氨酸残基可用于与RL的官能团反应形成连接。在其他实施方式中,例如在细胞结合剂是抗体的情况下,该抗体的硫醇基团可以参与链内二硫键。在与RL的官能团反应之前,可以将这些链内键转化为游离的硫醇基团,例如通过用DTT处理抗体。
可以标记细胞结合剂,例如有助于在作为结合物结合之前或作为结合物的一部分的试剂的检测或纯化。标记可以是生物素标记。在另一个实施方式中,可以用放射性同位素标记细胞结合剂。
载药量
载药量是每细胞结合剂(例如,抗体)PBD药物的平均数目。当本发明的化合物结合至半胱氨酸时,载药量可以在每细胞结合剂1至8个药物(D)的范围内,即,1、2、3、4、5、6、7和8个药物部分共价附接至细胞结合剂。结合物的组合物包括结合有1至8个范围内药物的细胞结合剂(例如,抗体)的集合。当本发明的化合物结合至赖氨酸时,载药量可以在每细胞结合剂1至80个药物(D)的范围内,尽管40、20、10或8个的上限可以是优选的。结合物的组合物包括结合有1至80、1至40、1至20、1至10或1至8个范围内药物的细胞结合剂(例如,抗体)的集合。
在来自结合反应的ADC制备品中,每抗体药物的平均数目可以由常规方式如UV、反向HPLC、HIC、质谱、ELISA测定和电泳表征。也可以确定用p表示的ADC的定量分布。通过ELISA,可以确定在特定的ADC的制备品中p的平均值(Hamblett et al(2004)Clin.CancerRes.10:7063-7070;Sanderson et al(2005)Clin.Cancer Res.11:843-852)。然而,p(药物)值的分布不可通过抗体-抗原结合和ELISA的检测限识别。另外,用于检测抗体-药物结合物的ELISA测定不能确定药物部分在什么位置附接至抗体,如重链或轻链片段或特定的氨基酸残基。在某些情况下,可以通过如反相HPLC或电泳的方式来实现均相ADC(其中,p是特定的值)与具有其它载药量的ADC的分离、纯化和表征。这种技术也适用于其他类型的结合物。
对于某些抗体-药物结合物,p可以被抗体上附接位点的数目限制。例如,抗体可以具有仅一个或具有多个半胱氨酸硫醇基团,或可以具有仅一个或具有多个充分反应性的硫醇基团,通过该硫醇基团可以连接接头。较高的载药量,例如p>5,可以引起某些抗体-药物结合物的聚集、不溶、毒性或细胞通透性丧失。
通常,将少于理论最大量的药物部分通过结合反应结合至抗体。例如,抗体可以包含许多不与药物-接头中间体(D-L)或接头试剂反应的赖氨酸残基。仅最具反应性的赖氨酸基团可以与胺反应性接头试剂反应。另外,仅最具反应性的半胱氨酸基团可以与硫醇反应性接头试剂反应。通常,抗体不包含许多(如果有的话)游离和反应性的可以连接至药物部分的半胱氨酸硫醇基团。化合物的抗体中的大多数半胱氨酸硫醇残基作为二硫桥存在,并且必须在部分或全部还原条件下用还原剂(如二硫苏糖醇(DTT)或TCEP)将其还原。可以以多种方式控制ADC的载药量(药物/抗体比),包括:(i)限制药物-接头中间体(D-L)或接头试剂相对于抗体的摩尔过量,(ii)限制结合反应时间或温度,以及(iii)用于半胱氨酸硫醇修饰的部分或限制性还原条件。
某些抗体具有可还原的链内二硫化物,即,半胱氨酸桥。对于与接头试剂的结合,通过用还原剂如DTT(二硫苏糖醇)处理可以使抗体变得具有反应性。因此,理论上每个半胱氨酸桥将形成两个反应性硫醇亲核体。通过得到胺至硫醇的转化的赖氨酸与2-亚氨基噻吩(Traut氏试剂)的反应,可以将另外的亲核基团引入至抗体中。通过设计1、2、3、4或更多半胱氨酸残基(例如,制备包含一个或多个非天然半胱氨酸残基的突变抗体)可以将硫醇基团引入至抗体(或它们的片段)。US 7521541教导通过引入反应性半胱氨酸氨基酸设计抗体。
可以在抗体中的反应位点处设计半胱氨酸残基并且其不形成链内或分子内二硫键(Junutula,et al.,2008b Nature Biotech.,26(8):925-932;Dornan et al(2009)Blood 114(13):2721-2729;US 7521541;US 7723485;WO2009/052249)。设计的半胱氨酸硫醇可以与具有硫醇反应性、亲电基团(如马来酰亚胺或α-卤代酰胺)的接头试剂或本发明的药物-接头试剂反应,以形成具有设计的半胱氨酸的抗体和PBD药物部分的ADC。因此,可以设计、控制以及已知药物部分的位置。由于设计的半胱氨酸硫醇基团通常以高产率与硫醇反应性接头试剂或药物-接头试剂反应,所有可以控制载药量。通过在重链或轻链的单一位点置换来设计IgG抗体以引入半胱氨酸氨基酸在对称抗体上产生两个新的半胱氨酸。通过结合产物ADC的接近同质性,可以实现接近2的载药量。
当抗体的多于一个亲核或亲电基团与药物-接头中间体或药物部分试剂之前的接头试剂反应时,那么得到的产物是具有连接至抗体的药物部分(例如,1、2、3个等)的分布的ADC化合物的混合物。液相色谱法,如聚合物反相(PLRP)和疏水相互作用(HIC)可以通过载药量值分离混合物中的化合物。可以分离具有单一载药量值(p)的ADC的制备品,然而,这些单一载药量值ADC仍然可以是多种的混合物,因为药物部分可以经由接头附接在抗体的不同位点上。
因此,本发明的抗体-药物结合组合物包括抗体-药物结合物化合物的混合物,其中,抗体具有一个或多个PBD药物部分,并且其中,药物部分可以在不同氨基酸残基处附接至抗体。
在一个实施方式中,每细胞结合剂中二聚物吡咯并苯并二氮杂卓基团的平均数目在1至20的范围内。在一些实施方式中,范围选自1至8、2至8、2至6、2至4和4至8。
在一些实施方式中,每细胞结合剂中存在一个二聚物吡咯并苯并二氮杂卓。
用途
所述化合物和结合物可以用来治疗增殖性疾病和自身免疫疾病。术语“增殖性疾病”涉及不需要的过度或异常细胞的不希望或失控的细胞增殖,如肿瘤或增生性生长,无论是在体外或体内。
增殖性病症的实例包括但不限于良性、癌前和恶性细胞的增殖,包括但不限于瘤和肿瘤(例如,组织细胞瘤、神经胶质瘤、星形细胞瘤、骨瘤)、癌症(例如,肺癌、小细胞肺癌、胃肠癌、肠癌、结肠癌、乳腺癌、卵巢癌、前列腺癌、睾丸癌、肝癌、肾癌、膀胱癌、胰腺癌、脑癌、肉瘤、骨肉瘤、卡波西氏肉瘤、黑色素瘤)、白血病、银屑病、骨疾病、纤维增殖性疾病(例如,结缔组织的)、和动脉粥样硬化。特别感兴趣的癌症包括但不限于白血病和卵巢癌。
可以治疗任何类型的细胞,包括但不限于肺、胃肠道(包括,例如,肠、结肠)、乳腺(乳房)、卵巢、前列腺、肝脏(肝)、肾脏(肾)、膀胱、胰腺、脑和皮肤。
在一个实施方式中,该治疗是治疗胰腺癌。
在一个实施方式中,该治疗是治疗在细胞表面上具有ανβ6整合蛋白的肿瘤。
预期本发明的抗体-药物结合物(ADC)可被用于治疗例如由肿瘤抗原的过表达表征的多种疾病或紊乱。示例性病症或过度增殖性紊乱包括良性或恶性肿瘤;白血病、血液和淋巴系统恶性肿瘤。其他包括以下的紊乱:神经元的、神经胶质细胞的、星形细胞的、下丘脑的、腺体的、巨噬细胞的、上皮的、基质的、囊胚腔的、炎症性的、血管产生的和免疫学的,包括自身免疫性紊乱。
通常,治疗的疾病或紊乱是过度增殖性疾病,如癌症。本文中治疗的癌症的实例包括但不限于癌、淋巴瘤、母细胞瘤、肉瘤和白血病或淋巴恶性肿瘤。这种癌症的更具体的实例包括鳞状细胞癌(例如,上皮鳞状细胞癌)、肺癌(包括小细胞肺癌、非小细胞肺癌)、肺的鳞状细胞癌和肺的腺癌、腹膜癌、肝细胞癌、胃癌或胃部癌(包括胃肠癌)、胰腺癌、成胶质细胞瘤、宫颈癌、卵巢癌、肝癌、膀胱癌、肝细胞瘤、乳腺癌、结肠癌、直肠癌、结肠直肠癌、子宫内膜癌或子宫癌、唾液腺癌、肾癌或肾脏癌、前列腺癌、外阴癌、甲状腺癌、肝肿瘤、肛门癌、阴茎癌以及头颈癌。
在治疗中可以使用ADC的自身免疫疾病包括风湿性疾病(如例如,类风湿关节炎、干燥综合征、硬皮病、狼疮如SLE和狼疮肾炎、多肌炎/皮肌炎、冷球蛋白血症、抗磷脂抗体综合征和银屑病关节炎),骨关节炎、自身免疫性胃肠和肝紊乱(如例如,炎症性肠疾病(例如,溃疡性结肠炎和克罗恩病)、自身免疫性胃炎和恶性贫血、自身免疫性肝炎、原发性胆汁性肝硬化、原发性硬化性胆管炎和乳糜泻、血管炎(如例如,ANCA相关的血管炎,包括丘-施血管炎、韦格纳肉芽肿病和多动脉炎)、自身免疫性神经紊乱(如,例如,多发性硬化、斜视眼肌阵挛综合征、重症肌无力、视神经脊髓炎、帕金森氏病、阿尔茨海默氏病和自身免疫性多神经病)、肾脏疾病(如例如,肾小球肾炎、古德帕斯彻氏综合征和贝格尔氏病)、自身免疫性皮肤病紊乱(如例如,牛皮癣、荨麻疹、麻疹、寻常天疱疮、大疱性类天疱疮和皮肤红斑狼疮)、血液学紊乱(如例如,血小板减少性紫癜、血栓性血小板减少性紫癜、输血后紫癜和自身免疫性溶血性贫血)、动脉粥样硬化、葡萄膜炎、自身免疫性听觉紊乱(如例如,内耳病和听力损失)、贝切特氏病、雷诺氏综合征、器官移植和自身免疫性内分泌紊乱(如例如,糖尿病相关自身免疫病诸如胰岛素依赖性糖尿病(IDDM)、阿狄森氏病和自身免疫性甲状腺病(例如,格雷夫斯病和甲状腺炎))。更优选的这种疾病包括例如风湿性关节炎、溃疡性结肠炎、ANCA相关的血管炎、狼疮、多发性硬化症、干燥综合征、格雷夫斯病、IDDM、恶性贫血、甲状腺炎和肾小球肾炎。
治疗方法
本发明的结合物可以在治疗方法中使用。还提供的是一种治疗的方法,包括向需要的受试者给予治疗有效量的本发明的结合物化合物。术语“治疗有效量”是足以示出对患者有益处的量。这种益处可以是至少缓解至少一种症状。给予的实际量以及给予的速率和时程将取决于待治疗的性质和严重性。治疗处方(例如,用量决定)由全科医生和其他医生负责。
可以单独给予或与其他治疗组合给予本发明的化合物,同时给予还是顺序给予取决于待治疗的病症。治疗和疗法的实例包括但不限于化学疗法(给予活性剂,包括例如药物,如化疗);手术;和放射疗法。
不考虑作用机制,“化疗剂”是在癌症治疗中有用的化学化合物。化疗剂的类别包括但不限于烷化试剂、抗代谢物、纺锤体毒植物生物碱、细胞毒性/抗肿瘤抗生素、拓扑异构酶抑制剂、抗体、光敏剂和激酶抑制剂。化疗剂包括在“靶向疗法”和常规化疗中使用的化合物。
化疗剂的实例包括:厄洛替尼(Genentech/OSI Pharm.)、多西他赛(Sanofi-Aventis)、5-FU(氟尿嘧啶,5-氟尿嘧啶,CAS号51-21-8)、吉西他滨(Lilly)、PD-0325901(CAS号391210-10-9,Pfizer)、顺铂(顺-二胺,二氯铂(II),CAS号15663-27-1)、卡铂(CAS号41575-94-4)、紫杉醇(Bristol-Myers Squibb Oncology,Princeton,N.J.)、曲妥珠单抗(Genentech)、替莫唑胺(4-甲基-5-氧基-2,3,4,6,8-五氮杂双环[4.3.0]壬-2,7,9-三烯-9-甲酰胺,CAS号85622-93-1,Schering Plough)、他莫昔芬((Z)-2-[4-(1,2-二苯基丁-1-烯基)苯氧基]-N,N-二甲基乙胺、)和多柔比星Akti-1/2、HPPD和雷帕霉素。
化疗剂的更多实例包括:奥沙利铂(Sanofi)、硼替佐米(Millennium Pharm.)、索坦(SU11248,Pfizer)、来曲唑(Novartis)、甲磺酸伊马替尼(Novartis)、XL-518(Mek抑制剂,Exelixis,WO 2007/044515)、ARRY-886(Mek抑制剂,AZD6244,Array BioPharma,AstraZeneca)、SF-1126(PI3K抑制剂,Semafore Pharmaceuticals)、BEZ-235(PI3K抑制剂,Novartis)、XL-147(PI3K抑制剂,Exelixis)、PTK787/ZK 222584(Novartis)、氟维司群(AstraZeneca)、亚叶酸钙(亚叶酸)、雷帕霉素(西罗莫司,Wyeth)、拉帕替尼(GSK572016,Glaxo Smith Kline)、洛那法尼(SARASARTM,SCH 66336,Schering Plough)、索拉非尼(BAY43-9006,Bayer Labs)、吉非替尼(AstraZeneca)、伊立替康(CPT-11,Pfizer)、替吡法尼(ZARNESTRATM,Johnson&Johnson)、ABRAXANETM(不含聚氧乙烯蓖麻油)、紫杉醇的白蛋白纳米工程制剂(American Pharmaceutical Partners,Schaumberg,Il)、凡德他尼(rINN,ZD6474,AstraZeneca)、苯丁酸氮芥、AG1478、AG1571(SU 5271;Sugen)、西罗莫司(Wyeth)、帕唑帕尼(GlaxoSmithKline)、莰佛(Telik)、塞替派和环磷酰胺烷基磺酸酯如白消安、英丙舒凡和哌泊舒凡;氮丙啶类例如苯佐替哌(benzodopa)、卡波醌、美妥替哌(meturedopa)和乌瑞替哌(uredopa);乙撑亚胺和甲基蜜胺(methylamelamines)(包括六甲蜜胺、三乙撑蜜胺、三乙撑磷酰胺、三乙撑硫代磷酰胺和三羟甲基蜜胺(trimethylolomelamine);内酯(尤其是布拉它辛和布拉它辛酮);喜树碱(包括合成的类拓扑替康);苔藓抑素;callystatin;CC-1065(包括它的阿多来新、卡折来新、和比折来新合成类似物);隐藻素(特别是隐藻素1和的隐藻素8);多拉司他汀;倍癌霉素(包括合成类似物,KW-2189和CB1-TM1);艾榴素(eleutherobin);水鬼蕉碱;sarcodictyin;海绵抑制素(spongistatin);氮芥如苯丁酸氮芥、萘氮芥、环磷酰胺、雌莫司汀、异环磷酰胺、甲二氯二乙胺、氮芥氧化物盐酸盐、美法仑、新恩比兴、苯芥胆甾醇、泼尼氮芥、氯乙环磷酰胺、尿嘧啶氮芥;亚硝基脲如卡莫司汀、氯脲霉素、福莫司汀、洛莫司汀、尼莫司汀和雷尼司汀(ranimnustine);抗生素如烯二炔抗生素(例如刺孢霉素、加利车霉素γ1I、卡奇霉素ΩI1(Angew Chem.Intl.Ed.Engl.(1994)33:183-186);蒽环类抗生素(dynemicin)、达内霉素(dynemicin A);二膦酸盐,如氯膦酸盐;埃斯培拉霉素;以及新制癌菌素生色团和相关色蛋白烯二炔抗生素生色团)、阿克拉霉素,放线菌素、氨茴霉素(authramycin)、重氮丝氨酸、博来霉素、放线菌素、卡柔比星(carabicin)、洋红霉素、嗜癌菌素、色霉素(chromomycinis)、更生霉素、柔红霉素、地托比星、6-重氮-5-氧基-L-正亮氨酸、吗啉基-阿霉素、氰基吗啉基-阿霉素、2-吡咯啉基-阿霉素和脱氧阿霉素、表柔比星、依索比星、伊达比星、奈莫柔比星、麻西罗霉素、丝裂霉素如丝裂霉素C、霉酚酸、诺加霉素、橄榄霉素类、培洛霉素、紫菜霉素、嘌呤霉素、三铁阿霉素、罗多比星、链黑菌素、链脲菌素、结核菌素、乌苯美司、新制癌菌素、佐柔比星;抗代谢物,如甲氨蝶呤和5-氟尿嘧啶(5-FU);叶酸类似物如二甲叶酸、甲氨蝶呤、蝶罗呤、三甲曲沙;嘌呤类似物,如氟达拉滨、6-巯基嘌呤、硫咪嘌呤、硫鸟嘌呤;嘧啶类似物,如安西他滨、阿扎胞苷、6-氮尿苷、卡莫氟、阿糖胞苷、二脱氧尿苷、去氧氟尿苷、依诺他滨、氟尿苷;雄激素,如卡普睾酮、屈他雄酮丙酸酯、环硫雄醇、美雄烷、睾内酯;抗肾上腺,如鲁米特、米托坦、曲洛司坦;叶酸补充剂,如亚叶酸;醋葡醛内酯;醛磷酰胺糖苷;氨基乙酰丙酸;恩尿嘧啶;安吖啶;bestrabucil;蒽双咪腙;依达曲沙(edatraxate);地磷酰胺(defofamine);秋水仙胺;亚丝醌;依氟鸟氨酸(elfornithine);依利醋铵;埃博霉素;依托格鲁;硝酸镓;羟基脲;香菇多糖;lonidainine;美登素类,如美登素和安丝菌素;米托胍腙;米托蒽醌;莫哌达醇(mopidanmol);硝嗪;喷司他丁;苯来美特;吡柔比星;洛索;鬼臼树脂酸;2-乙基肼;甲基苄肼;多糖复合物(JHSNatural Products,Eugene,OR);雷佐生;根霉素;西佐喃;螺环锗;细交链孢菌酮酸;三亚胺醌;2,2’,2”-三氯三乙胺;单端孢(尤其是T-2毒素、疣孢菌素(verracurin)A、杆孢菌素A和蛇形菌素);乌拉坦;长春地辛;达卡巴嗪;甘露醇氮芥;二溴甘露醇;二溴卫矛醇;哌泊溴烷;gacytosine;阿糖胞苷(“Ara-C”);环磷酰胺;塞替派;6-硫鸟嘌呤;巯基嘌呤;甲氨蝶呤;铂类似物,如顺铂和卡铂;长春碱;依托泊苷(VP-16);异环磷酰胺;米托蒽醌;长春新碱;长春瑞滨诺肖林;替尼泊苷;依达曲沙;柔红霉素;氨基蝶呤;卡培他滨(Roche);伊班膦酸钠;CPT-11;拓扑异构酶抑制剂RFS2000;二氟甲基鸟氨酸(DFMO);类视黄酸,如视黄酸;以上任一种的药用盐,酸和衍生物。
还包括在“化疗剂”的定义中的是:(i)作用于调节或抑制肿瘤上的激素作用的抗激素试剂如抗雌激素和选择性雌激素受体调节剂(SERM),例如包括它莫西芬(包括枸橼酸它莫西芬)、雷洛昔芬、屈洛昔芬、4-羟基它莫西芬、曲沃昔芬、那洛昔芬(keoxifene)、LY117018、奥那司酮和(枸橼酸托瑞米芬);(ii)抑制酶芳香酶的芳香酶抑制剂,其调节产生于肾上腺的雌激素如例如4(5)-咪唑、氨鲁米特、(醋酸甲地孕酮)、(依西美坦;Pfizer)、福美司坦(formestanie)、法倔唑、(伏氯唑)、(来曲唑;Novartis)、和(阿那曲唑;AstraZeneca);(iii)抗雄激素如氟他米特、尼鲁米特、比卡鲁胺、亮丙瑞林和戈舍瑞林;以及曲沙他滨(1,3-二氧戊环核苷酸胞嘧啶类似物);(iv)蛋白质激酶抑制剂如MEK抑制剂(WO 2007/044515);(v)脂质激酶抑制剂;(vi)反义寡核苷酸,特别是抑制畸变细胞增殖中受影响的信号通路的基因表达的那些,例如PKC-α、Raf和H-Ras,如奥利默森(Genta Inc.);(vii)核糖酶如VEGF表达抑制剂(例如,)和HER2表达抑制剂;(viii)疫苗如基因疗法疫苗,例如和rIL-2;拓扑异构酶1抑制剂,如rmRH;(ix)抗血管生成剂如安维汀(Genentech)和以上任一种的药用盐、酸和衍生物。
还包括在“化疗剂”的定义中的是治疗性抗体如阿仑单抗(Campath)、贝伐单抗(Genentech);西妥昔单抗(Imclone);帕尼单抗(Amgen)、利妥昔单抗(Genentech/Biogen Idec)、帕妥珠单抗(OMNITARGTM,2C4,Genentech)、曲妥单抗(Genentech)、托西莫单抗(Bexxar,Corixia)和抗体药物结合物、吉妥单抗(Wyeth)。
具有与本发明的结合物组合作为化疗剂的治疗潜力的人源化单克隆抗体包括:阿仑单抗、阿泊珠单抗、阿塞珠单抗、托珠单抗(atlizumab)、巴品珠单抗(bapineuzumab)、贝伐单抗、莫比伐珠单抗、莫坎妥珠单抗、西利珠单抗、赛妥珠单抗、cidfusituzumab、cidtuzumab、达利珠单抗、依库丽单抗、依法利珠单抗、依帕珠单抗、厄利珠单抗、非维珠单抗、芳妥珠单抗(fontolizumab)、吉妥珠单抗、英妥珠单抗(inotuzumab ozogamicin)、伊匹单抗、拉贝珠单抗、林妥珠单抗、马妥珠单抗、美泊利单抗、莫维珠单抗、motovizumab、那他珠单抗、尼妥珠单抗、nolovizumab、numavizumab、奥瑞珠单抗(ocrelizumab)、马珠单抗、帕利珠单抗、帕考珠单抗、pecfusituzumab、pectuzumab、帕妥珠单抗、培克珠单抗、ralivizumab、雷珠单抗、reslivizumab、瑞利珠单抗、resyvizumab、rovelizumab、鲁利单抗、西罗珠单抗、西利珠单抗(siplizumab)、索土珠单抗、他珠单抗(tacatuzumabtetraxetan)、他度珠单抗、他利珠单抗、特非珠单抗、塔西单抗、托利珠单抗(toralizumab)、曲妥单抗、tucotuzumab西莫白介素(celmoleukin)、tucusituzumab、umavizumab、乌珠单抗和维西珠单抗(visilizumab)。
按照本发明的和按照本发明使用的药物组合物,除活性成分(即,结合物化合物)以外,还可以包含药用赋形剂、载体、缓冲剂、稳定剂或本领域技术人员众所周知的其他物质。这样的物质应是无毒的,并且不应干扰活性成分的效力。载体或其他物质的确切性质将取决于给予途径,该给予途径可以是口服或通过注射(例如皮肤、皮下或静脉内)。
口服给予的药物组合物可以是片剂、胶囊剂、散剂或液体形式。片剂可以包含固体载体或佐剂。液体药物组合物通常包含液体载体如水、石油(petroleum)、动物或植物油、矿物油或合成油。可以包括生理盐水溶液、右旋糖或其它糖类溶液或二醇类如乙二醇、丙二醇或聚乙二醇。胶囊剂可以包含固体载体如明胶。
对于静脉注射、皮肤或皮下注射、或在痛苦部位的注射,活性成分可以是肠胃外可接受的水溶液的形式,该水溶液是无热原的并且具有合适的pH、等渗性和稳定性。本领域相关技术人员完全能够制备合适的溶液,例如使用等渗载体如氯化钠注射液、林格注射液、乳酸盐林格注射液。根据需要可以包括防腐剂、稳定剂、缓冲剂、抗氧化剂和/或其它添加剂。
制剂
尽管可以单独使用(例如,给予)结合物化合物,但是往往优选的是使其作为组合物或制剂存在。
在一个实施方式中,组合物是包含本文所描述的结合物化合物和药用载体、稀释剂或赋形剂的药物组合物(例如,制剂、配制品、药品)。
在一个实施方式中,组合物是包含至少一种本文所描述的结合物化合物与一种或多种本领域技术人员众所周知的药用成分的药物组合物,该成分包括但不限于药用载体、稀释剂、赋形剂、佐剂、填料、缓冲液、防腐剂、抗氧化剂、润滑剂、稳定剂、增溶剂、表面活性剂(例如,润湿剂)、掩蔽剂、着色剂、调味剂和甜味剂。
在一个实施方式中,组合物进一步包含其他活性剂,例如其他治疗剂或预防剂。
在标准药品文本中可以找到合适的载体、稀释剂、赋形剂等。参见,例如,Handbook of Pharmaceutical Additives,第2版(eds.M.Ash and I.Ash),2001(SynapseInformation Resources,Inc.,Endicott,New York,USA),Remington's Pharmaceutical Sciences,第20版,pub.Lippincott,Williams&Wilkins,2000;和Handbook of Pharmaceutical Excipients,第2版,1994。
本发明的另一方面涉及制作药物组合物的方法,包括将本文定义的至少一种[11C]-放射性同位素示踪的结合物或结合类化合物与一种或多种本领域技术人员众所周知的其他药用成分混合在一起,例如,载体、稀释剂、赋形剂等。如果配制为离散单元(例如,片剂等),那么每个单元包含预定量(剂量)的活性化合物。
如本文所使用的,术语“药用”涉及化合物、成分、材料、组合物、剂型等,其在合理的医学判断范围内、适用于与所讨论的受试者(例如,人类)的组织接触而没有过度的毒性、刺激、过敏反应或其他问题或并发症,并与合理的利益/风险比相称。每种载体、稀释剂、赋形剂等在与制剂的其他成分相容的意义上也必须是“可接受的”。
该制剂可通过药学领域众所周知的任何方法来制备。这样的方法包括使活性化合物与构成一种或多种辅助成分的载体结合的步骤。一般而言,制剂通过使活性化合物与载体(例如,液体载体、精细固体载体等)均匀和紧密地结合的,然后如必要的话使产品成形来制备。
可以将制剂制备为快速或慢速释放;立即、延迟、即时、或缓慢释放;或它们的组合。
适于胃肠外给予(例如,通过注射)的制剂包括水性或非水性的、等渗、无热原的、无菌液体(例如溶液,悬浮液),其中活性成分被溶解、悬浮或以其他方式提供(例如,在脂质体或其他微粒中)。这种液体可以另外含有其他药用成分,如抗氧化剂、缓冲剂、防腐剂、稳定剂、抑菌剂、助悬剂、增稠剂以及使制剂与预期受体的血液(或其他相关体液)等渗的溶质。赋形剂的实例包括,例如,水、醇、多元醇、甘油、植物油等。用于此类制剂的合适等渗载体的实例包括氯化钠注射液、林格氏液(Ringer'sSolution)或乳酸林格氏注射液。通常,液体中活性成分的浓度是约1ng/ml至约10μg/ml,例如约10ng/ml至约1μg/ml。该制剂可存在于单位剂量或多剂量密封容器,例如安瓿和小药瓶中,并且可在冷冻干燥(冻干)条件下储藏,仅需在临用前加入无菌液体载体例如注射用水。可以由无菌粉末、颗粒和片剂当场制备注射溶液和悬浮液。
剂量
本领域技术人员应当理解的是,结合物化合物和包含该结合物化合物的组合物的适当剂量可因患者而异。确定最佳剂量一般将牵涉使治疗益处的水平与任何风险或有害副作用平衡。选择的剂量水平取决于多个因素,包括但不限于特定化合物的活性、给予途径、给予时间、化合物的排泄时间、治疗的持续时间、其他药物、化合物、和/或组合使用的物质、病症的严重性和物种、性别、年龄、体重、病症、总体健康状况和患者的病史。化合物的量和给予途径最终将由医师、兽医或临床医师慎重决定,但是一般而言,将选择剂量以在作用部位达到实现所期望的作用而不会造成实质性损害或有害副作用的局部浓度。
在整个治疗过程中,可以单剂量、连续或间歇地(例如,以适当时间间隔分剂量)实现给予。确定最有效的给予手段和剂量的方法是本领域技术人员众所周知的,并且将随用于疗法的制剂、疗法目的、所治疗的(多种)靶细胞和所治疗的受试者的不同而改变。可以用治疗医师、兽医或临床医师选择的剂量水平和模式进行单次或多次给予。
一般而言,活性化合物的合适剂量是在每千克受试者体重每天约100μg至约25mg(更通常地约1μg至约10mg)的范围内。在该化合物是盐、酯、酰胺、前药等的情况下,基于母体化合物计算给予量,所以所用的实际重量成比例增加。
在一个实施方式中,根据以下剂量方案将活性化合物给予至人类患者:约100mg,每天3次。
在一个实施方式中,根据以下剂量方案将活性化合物给予至人类患者:约150mg,每天2次。
在一个实施方式中,根据以下剂量方案将活性化合物给予至人类患者:约200mg,每天2次。
然而,在一个实施方式中,根据以下剂量方案将结合物化合物给予至人类患者:约50或约75mg,每天3或4次。
在一个实施方式中,根据以下剂量方案将结合物化合物给予至人类患者:约100或约125mg,每天2次。
以上描述的剂量可以适用于结合物(包含PBD部分和至抗体的接头)或适用于提供的PBD化合物的有效量,例如,接头切割后释放的化合物的量。
对于预防或治疗疾病,本发明的ADC的适当剂量将取决于待治疗的以上定义的疾病类型、疾病的严重度和情况(无论给予分子是用于预防还是治疗目的)、先前治疗、患者的临床病史和对抗体的反应以及主治医生的判断。一次或通过一系列治疗将分子合适地给予至患者。根据疾病类型和严重程度,无论,例如通过一次或多次单独给予或通过连续输注,约1μg/kg至15mg/kg(例如,0.1-20mg/kg)分子是用于给予至患者的初始候选剂量。通常的每日剂量可以在约1μg/kg至100mg/kg以上的范围内,这取决于上述的因素。给予至患者的ADC的示例性剂量在患者重量的约0.1至约10mg/kg范围内。对于几天或更长的重复给予,根据病症,治疗持续到实现疾病症状的期望抑制。示例性剂量方案包括给予约4mg/kg初始载药量的情况,随后每周、每两周、或每三周添加另外剂量的ADC。其他剂量方案可以是有用的。该疗法的进展易于通过常规技术和测定法监测。
治疗
在治疗病症的情况下本文所使用的术语“治疗”通常涉及无论人类或动物(例如,兽医应用)的治疗和疗法,其中,其中实现了一些期望的疗效,例如,对病症发展的抑制,并且包括发展速度降低、发展速度减半、病症恢复、病症改善和病症康复。还包括作为预防性措施(即,预防、防止)的治疗。
如本文所使用的,术语“治疗有效量”涉及活性化合物或包含活性化合物的物质、组合物或剂型的量,当根据期望的治疗方案给予时,其对于产生一些期望的疗效是有效的、与合理的利益/风险比相当。
类似地,如本文所使用的,术语“预防有效量”涉及活性化合物或包含活性化合物的物质、组合物或剂型的量,当根据期望的治疗方案给予时,其对于产生一些期望的预防效果是有效的、与合理的利益/风险比相当。
受试者/患者
受试者/患者可以是动物、哺乳动物、胎盘哺乳动物、有袋动物(例如,袋鼠、袋熊)、单孔目动物(例如,鸭嘴兽)、啮齿动物(例如,豚鼠、仓鼠、大鼠、小鼠)、鼠科动物(例如,小鼠)、兔类动物(例如,家兔)、鸟类(例如,鸟)、犬科动物(例如,狗)、猫科动物(例如,猫)、马科动物(例如,马)、猪科动物(例如,猪)、绵羊类(例如,绵羊)、牛科动物(例如,奶牛)、灵长类动物、猿类(例如,猴子或类人猿)、猴(例如,狨猴、狒狒)、类人猿(例如,大猩猩、黑猩猩、猩猩、长臂猿)或人类。
此外,受试者/患者可是其发育的任何形式,例如胎儿。在一个优选的实施方式中,受试者/患者是人。
在一个实施方式中,患者是其中每个患者具有在细胞表面上具有ανβ6整合蛋白的肿瘤的群体。
包括其它形式
除非另外指定,否则上文包括这些取代基的众所周知的离子、盐、溶剂化物和受保护形式。例如,提及羧酸(-COOH)还包括其阴离子(羧酸根)形式(-COO-)、盐或溶剂化物,以及常规的受保护形式。类似地,提及氨基基团包括氨基基团的质子化形式(-N+HR1R2)、盐或溶剂化物(例如,盐酸盐)以及氨基基团的常规受保护形式。类似地,提及羟基基团还包括其阴离子形式(-O-)、盐或溶剂化物,以及常规的受保护形式。
盐
可以方便的或期望是制备、纯化和/或处理活性化合物的相应盐,例如,药用盐。药用盐的实例讨论于Berge,et al.,J.Pharm.Sci.,66,1-19(1977)。
例如,如果化合物是阴离子或具有可以是阴离子的官能团(例如,-COOH可以是-COO-),那么可以与合适的阳离子形成盐。合适的无机阳离子的实例包括但不限于碱金属离子如Na+和K+,碱土金属阳离子如Ca2+和Mg2+,以及其他阳离子如Al+3。合适的有机阳离子的实例包括但不限于铵离子(即,NH4 +)和取代的铵离子(例如,NH3R+、NH2R2 +、NHR3 +、NR4 +)。一些合适的取代的铵离子的实例是源自以下的那些:乙胺、二乙胺、二环己胺、三乙胺、丁胺、乙二胺、乙醇胺、二乙醇胺、哌嗪、苄胺、苯基苄胺、胆碱、葡甲胺和氨丁三醇,以及氨基酸,如赖氨酸和精氨酸。常见的季铵离子的实例是N(CH3)4 +。
如果化合物是阳离子化合物或具有可以是阳离子的官能团(例如,-NH2可以是-NH3 +),那么可以与合适的阴离子形成盐。合适的无机阴离子的实例包括但不限于源自以下无机酸的那些:盐酸、氢溴酸、氢碘酸、硫酸、亚硫酸、硝酸、亚硝酸、磷酸和亚磷酸。
合适的有机阴离子的实例包括但不限于源自以下有机酸的那些:2-乙酰氧基苯甲酸、乙酸、抗坏血酸、天冬氨酸、苯甲酸、樟脑磺酸、肉桂酸、柠檬酸、乙二胺四乙酸、乙二磺酸、乙磺酸、富马酸、庚糖酸、葡萄糖酸、谷氨酸、乙醇酸、羟基马来酸、萘酚羧酸、羟乙基磺酸、乳酸、乳糖酸、月桂酸、马来酸、苹果酸、甲基磺酸、粘酸、油酸、草酸、棕榈酸、帕莫酸、泛酸、苯乙酸、苯基磺酸、丙酸、丙酮酸、水杨酸、硬脂酸、琥珀酸、对氨基苯磺酸、酒石酸、甲苯磺酸和缬草酸。合适的聚合物有机阴离子的实例包括但不限于源自以下聚合物酸的那些:丹宁酸、羧甲基纤维素。
溶剂化物
可以方便的或期望的是制备、纯化和/或处理活性化合物的相应溶剂化物。术语“溶剂化物”在本文中以常规含义使用,是指溶质(例如,活性化合物、活性化合物的盐)与溶剂的复合物。如果溶剂是水,则该溶剂化物可以方便地称为水合物,例如,一水合物、二水合物、三水合物等。
醇胺
本发明包括这样的化合物,其中溶剂通过PBD部分的亚胺键加成,见下示出,其中该溶剂是水或醇(RAOH,其中RA是C1-4烷基):
这些形式可以称之为PBD的甲醇胺和甲醇胺醚的形式。这些等式的平衡取决于发现化合物的条件以及该部分本身的属性。
例如,可以通过冷冻干燥以固体形式来分离这些特定化合物。
异构体
本发明的某些化合物可以以一种或多种特定的几何形式、光学形式、对映体形式、非对映体形式、差向异构体形式、异位异构体(atropic)形式、立体异构体形式、互变异构体形式、构象形式或异头异构体(anomeric)形式,包括但不限于顺式和反式;E型和Z型;c式、t式和r式;内形式和外形式;R型、S型和内消旋型;D型和L型;d型和l型;(+)和(-)形式;酮式、烯醇式和烯醇化物形式;顺式和反式;向斜形式和背斜形式;α形式和β形式;轴向形式和平伏形式;船式、椅式、扭式、信封式和半椅式;并且它们的组合,在下文中统称为“异构体”(或“异构体形式”)。
要注意的是,除非如下文针对互变异构体形式所讨论的,从术语“异构体”(如在本文中所使用的)明确排除结构(或构造)异构体(即,其差异在于在原子之间的连接而不仅在于原子在空间中的位置的异构体)。例如,提及甲氧基基团(-OCH3)并不被解释为提及它的结构异构体(羟甲基基团,-CH2OH)。类似地,提及邻氯苯基并不被解释为提及它的结构异构体(间氯苯基)。然而,提及一类结构可以包括落在上述类型的范围之内的结构异构体形式(例如,C1-7烷基包括正丙基和异丙基;丁基包括正丁基、异丁基、仲丁基和叔丁基;甲氧基苯基包括邻甲氧基苯基、间甲氧基苯基和对甲氧基苯基)。
上述排除并不涉及互变异构体形式,如在,例如,以下互变异构体对中:酮/烯醇(如下图所示)、亚胺/烯胺、酰胺/亚氨基醇、脒/脒、亚硝基/肟、硫酮/烯硫醇、N-亚硝基/羟基偶氮和硝基/异硝基中的例如,酮式、烯醇式和烯醇化物形式。
要注意的是,术语“异构体”特别包括具有一个或多个同位素取代的化合物。例如,H可以以任何同位素形式,包括1H、2H(D)、和3H(T);C可以以任何同位素形式,包括12C、13C、和14C;O可以以任何同位素形式,包括16O和18O;等。
除非另外指定,否则提到的特定化合物包括全部这些异构体形式,包括其(全部或部分地)外消旋和其它混合物。用于这种异构体形式的制备(例如,不对称合成)和分离(例如,分步结晶和色谱法)的方法是本领域已知的,或通过已知的方式调整本文所教导的方法或已知的方法是容易得到的。
一般合成途径
在以下参考文献中广泛讨论了包含两个亚胺部分的PBD化合物的合成,通过引用将其中的讨论结合于此:
a)WO 00/12508(第14至30页);
b)WO 2005/023814(第3至10页);
c)WO 2004/043963(第28至29页);
d)WO 2005/085251(第30至39页);
e)WO 2010/043880(第26至29页);
f)WO 2011/130613(第56至59页)和
g)WO 2011/130616(第57至61页)。
合成途径
其中R10和R11b或R20和R21b形成它们连接的氮和碳原子之间的氮-碳双键的式I的化合物可以由以下式2的化合物合成:
其中,R2、R6、R7、R9、R6’、R7’、R9’、R12、X、X’和R”如以上针对式I的化合物所定义,R11和ProtN1以及R21和ProtN2对中的一对是用于合成的OProtO和氨基甲酸酯氮保护基团且另一对选自:
(a)用于合成的=O和半-缩醛胺氮保护基团;
(b)用于合成的H和氨基甲酸酯氮保护基团,
通过施加必要的条件除去保护基团。
可以将式2的化合物直接用于制作药物-接头和本发明的结合物,且因此式2的化合物可以是本发明的进一步方面。可以添加连接基团的一部分(例如,以形成式III)的受保护化合物),随后可以进行以上讨论的脱保护。
在以上描述的合成步骤中可以保护式2中的基团X或Q(R2的一部分),在该情况下,可以除去其以给出式2的期望化合物。
可以通过偶联式3a和4a的化合物或式3b和4b的化合物合成式2的化合物:
其中,Hal选自I、Cl和Br。
例如在回流的丙酮与碱(如K2CO3)中可以实现偶联。
可以通过使式3a和4b的化合物分别与式5的化合物反应合成式3b和4a的化合物:
Hal-R”-Q 式5,
其中,Q选自I、Cl和Br。例如在回流的丙酮与碱(如K2CO3)中可以实现该反应。需要过量的式5的化合物来实现期望的产物。
可以以与2012年10月12日提交的共同未决PCT申请PCT/EP2012/070232中描述的方式类似的方式合成包含亚胺或等价基团的单体,通过引用将其结合于此。以下关于式3a的化合物描述了该方法,其中,R11和ProtN1是用于合成(式3a-I)的OProtO和氨基甲酸酯氮保护基团,但是该方法同样适合于式4b的化合物,其中,R21和ProtN2是用于合成的OProtO和氨基甲酸酯氮保护基团。
ProtN-carb代表用于合成的氨基甲酸酯氮保护基团。
式3a-I的化合物可以由式6-I的化合物合成:
其中,ProtY是与化合物中的其他保护基团正交的用于Y的保护基团。在标准条件下,通过对Y脱保护实现合成。
通过在非外消旋条件下用ProtO保护OH基团,式6-I的化合物可以由式7-I的化合物合成:
通过氧化,式7-I的化合物可以由式8-I的化合物合成:
例如可以用Dess-Martin高价碘化物(或可替代的TPAP/NMO、TFAA/DMSO、SO3、吡啶络合物/DMSO、PDC、PCC、BAIB/TEMPO或在Swern条件下)进行氧化。
通过在标准条件下对OH基团脱保护,式8-I的化合物可以由式9-I的化合物合成:
通过在标准条件下由ProtN-carb保护胺基团,式9-I的化合物可以由式10-I的化合物合成:
通过还原硝基基团,式10-I的化合物可以由式11-I的化合物合成:
例如可以在甲醇中用Zn灰尘与5%甲酸通过标准方式实现还原。
通过包含-R2的适当化合物的钯介导的偶联,式11-I的化合物可以由式12-I的化合物合成:
这种偶联包括但不限于:与适当的硼衍生物的Suzuki偶联;与烯烃包括丙烯酰胺和丙烯酸酯的Heck偶联;与有机锡试剂如烷基锡试剂的Stille偶联;与炔烃的Sonagishira偶联;和使用三乙基硅烷的氢转移。
通过在-35℃以下的温度下在干燥的有机溶剂中,在惰性气氛下使用三氟甲磺酸酐和无水2,6-二甲基砒啶或无水2,6-叔丁基吡啶三氟甲磺酰化,式12-I的化合物可以由式13-I的化合物合成:
在C2’和C3’之间不存在双键的式4b的化合物的合成中,可以在该阶段引入相关的R12。
如果其他单体包含胺基团,那么可以以与以上描述的方式类似的方式合成它。以下关于式3a的化合物描述了该方法,其中,R11和ProtN1是用于合成(式3a-II)的H和氨基甲酸酯氮保护基团,但是该方法同样适合于式4b的化合物,其中,R21和ProtN2是用于合成的H和氨基甲酸酯氮保护基团。
式3a-II的化合物可以由式6-II的化合物合成:
其中,ProtY是与化合物中的其他保护基团正交的用于Y的保护基团。在标准条件下,通过对Y脱保护实现合成。
通过在标准条件下用ProtN-carb保护NH基团,式6-II的化合物可以由式7-II的化合物合成:
通过还原性胺化,式7-II的化合物可以由式8-II的化合物合成:
通过醇的氧化,式8-II的化合物可以由式9-II的化合物合成:
通过在标准条件下对OH基团脱保护,式9-II的化合物可以由式11-I的化合物合成:
通过包含-R2(如以上所描述的)的适当化合物的钯介导的偶联,式6-II的化合物可以可替代地由式12的化合物合成:
通过三氟甲磺酰化,式12的化合物可以由式13的化合物合成:
这可以在以上所描述的条件或标准条件下进行。
通过使用标准条件氧化醇基团,式13的化合物可以由式14的化合物合成:
通过除去ProtO基团,式14的化合物可以由式15的化合物合成:
该ProtO基团是与化合物中的其他基团正交的醇保护基团。
通过用氨基甲酸酯氮保护基团保护胺,式15的化合物可以由式16的化合物合成:
通过还原性胺化,式16的化合物可以由式17的化合物合成:
通过氧化未保护的醇基团,式17的化合物可以由式18的化合物合成:
式18的化合物可以用于合成式13-I的化合物。
通过还原酯官能团,式18的化合物可以由式19的化合物合成:
在胺偶联条件下,式19的化合物可以由式20和21的偶联化合物合成:
如果其他单体包含酰胺基团,那么可以以与以上描述的方式类似的方式合成它。以下关于式3a的化合物描述了该方法,其中,R11和ProtN1是用于合成(式3a-III)的=O和半-缩醛胺氮保护基团,但是该方法同样适合于式4b的化合物,其中,R21和ProtN2是用于合成的=O和半-缩醛胺氮保护基团。
其中,ProtN-amin代表用于合成的半-缩醛胺氮保护基团。
通过在标准条件下对Y脱保护,式3a-III的化合物可以由式22的化合物合成:
通过包含-R2(如以上所描述的)的适当化合物的钯介导的偶联,式22的化合物可以由式23的化合物合成:
通过三氟甲磺酰化,式23的化合物可以由式24的化合物合成:
这可以在以上所描述的条件或标准条件下进行。
通过氧化醇基团,式24的化合物可以由式25的化合物合成:
通过除去ProtO基团,式25的化合物可以由式26的化合物合成:
该ProtO基团是与化合物中的其他基团正交的醇保护基团。
通过用半-缩醛胺氮保护基团保护胺,式26的化合物可以由式27的化合物合成:
通过用氢和Pd/C还原酯官能团实现闭环,可以由式19的化合物合成式27的化合物。
以下实施例1中示出了可替代的合成策略。
用于合成的氮保护基团
用于合成的氮保护基团是本领域众所周知的在本发明中,特别感兴趣的保护基团是氨基甲酸酯氮保护基团和半缩醛胺氮保护基团。
氨基甲酸酯氮保护基团具有以下结构:
其中,R’10是以上定义的R。大量合适基团描述于Greene,T.W.和Wuts,G.M.,Protective Groups in Organic Synthesis,第3版,John Wiley&Sons,Inc.,1999的第503至549页,通过引用将其结合于此。
特别优选的保护基团包括Troc、Teoc、Fmoc、BOC、Doc、Hoc、TcBOC、1-Adoc和2-Adoc。
其它可能的基团是硝基苄氧基羰基(例如,4-硝基苄氧基羰基)和2-(苯基磺酰基)乙氧基羰基。
这些可以用钯催化剂除去的保护基团是非优选的,例如Alloc。
半-缩醛胺氮保护基团具有以下结构:
其中,R’10是以上定义的R。大量的作为酰胺保护基团的合适基团描述于Greene,T.W.和Wuts,G.M.,Protective Groups in Organic Synthesis,第3版,John Wiley&Sons,Inc.,1999的第633至647页,通过引用将其结合于此。本文公开的基团可以适用于本发明的化合物。这些基团包括但不限于SEM、MOM、MTM、MEM、BOM、硝基或甲氧基取代的BOM、Cl3CCH2OCH2-。
用于合成的氧保护基团
用于合成的氧保护基团是本领域众所周知的。大量合适的氧保护基团描述于Greene,T.W.和Wuts,G.M.,Protective Groups in Organic Synthesis,第3版,JohnWiley&Sons,Inc.,1999的第23至200页,通过引用将其结合于此。
特别感兴趣的类别包括甲硅烷基醚、甲基醚、烷基醚、苄醚、酯、乙酸酯、苯甲酸酯、碳酸酯和磺酸酯。
优选的氧保护基团包括乙酸酯类、TBS和THP。
合成药物结合物
可以如之前所描述的制备结合物。可以如美国专利号6,214,345中所述的来制备具有马来酰亚胺基团(A)、肽基团(L1)和自分解基团(L2)的接头,通过引用将其结合于此。可以如WO 2009/0117531中所述的来制备具有马来酰亚胺基团(A)和肽基团(L1)的接头,通过引用将其结合于此。其它接头可以根据本文所引用的参考文献或本领域技术人员已知的来制备。
可以根据本领域中已知的方法来制备接头-药物化合物。可以根据通常描述于美国专利号6,214,345和7,498,298;以及WO 2009-0117531中的方法或技术人员已知的方法来进行(PDB二聚物药物单元的)胺类X取代基和接头单元的活性基团的连接。
可以如Doronina et al.,Nature Biotechnology,2003,21,778-784)中所描述的将抗体结合至接头-药物化合物。简要地,在37℃下,用三(羧乙基)膦盐酸盐(TCEP)还原在pH 7.4含有50mM硼酸钠的PBS中的抗体(4-5mg/mL)。还原链内二硫化物的该反应的进程是通过与5,5'-二硫代双(2-硝基苯甲酸)的反应来监控的并且允许其进行直至达到该水平的硫醇/mAb。然后将还原的抗体冷却至0℃并用1.5当量的马来酰亚胺药物-接头/抗体硫醇烷基化。1小时之后,通过添加5当量的N-乙酰半胱氨酸淬灭(冷激,quench)反应。在PD-10柱上通过凝胶过滤除去淬灭的药物-接头。然后通过0.22μm针筒过滤器无菌过滤ADC。通过分别在280nm和329nm的频谱分析,用在280nm的药物吸光度的贡献进行校正,来确定蛋白质浓度。可以使用尺寸排阻色谱法来确定抗体聚集的程度,并且可以使用RP-HPLC来确定残留NAC-淬灭的药物-接头的水平。
如在国际专利公开WO2008/070593中或如下所描述的,可以将具有引入的半胱氨酸残基的抗体结合于接头-药物化合物,将其结合于于此。通过添加10当量的TCEP和1mMEDTA并且用1M Tris缓冲液(pH 9.0)将pH调节为7.4,将在重链中包含引入的半胱氨酸残基的抗体完全还原。在37℃下温育1小时之后,将反应物冷却至22℃并且添加30当量的脱氢抗坏血酸以选择性地再氧化天然二硫化物,同时将引入的半胱氨酸保持在还原状态。用1MTris缓冲液(pH 3.7)将pH调节为6.5并且在22℃下使反应进行1小时。然后通过添加1MTris缓冲液(pH 9.0)将溶液的pH再次升高至7.4。将3.5当量在DMSO中的PBD药物接头放置在合适的容器中,在添加至反应之前用丙二醇稀释。为了维持PBD药物接头的溶解性,首先用丙二醇将抗体本身稀释至33%的最终浓度(例如,如果抗体溶液是60mL反应体积,那么添加30mL丙二醇)。向PBD药物接头中添加相同体积的丙二醇(在本实例中是30mL)作为稀释剂。混合后,将PBD药物接头在丙二醇中的溶液添加至抗体溶液中以实现结合;丙二醇的最终浓度是50%。使反应进行30分钟,然后添加5当量的N-乙酰半胱氨酸来淬灭反应。通过30kD的膜超滤来纯化ADC。(需要注意的是,对于任何特定的PBD,都可以降低在反应中使用的丙二醇的浓度,因为其唯一目的是维持药物接头在含水介质中的溶解性)。
对于卤代-乙酰胺类接头-药物化合物,通常可以如下来进行结合。向还原且再氧化的抗体(在重链中具有引入的半胱氨酸)的10mM Tris(pH7.4)、50mM NaCl和2mM DTPA溶液中,添加0.5体积的丙二醇。在结合之间,立即制备10mM乙酰胺类接头-药物化合物在二甲基乙酰胺中的溶液。将添加至抗体溶液的当量丙二醇添加至6倍摩尔过量的接头-药物化合物中。将释稀的接头-药物溶液添加至抗体溶液中,并且使用1M Tris(pH 9)将pH调节为8-8.5。在37℃下使反应进行45分钟。通过还原和变性反相PLRP-S色谱证实结合。用QuadrasilMP树脂除去过量的接头-药物化合物,并且使用PD-10脱盐柱将缓冲液转换为10mM Tris(pH7.4)、50mM NaCl和5%丙二醇。
药物接头的说明性合成方案
以下方案是用于合成药物接头的说明性途径,其中,PBD代表本发明的式I的化合物,其中,X是NH2,其可以在本发明的范围内变化。
方案A
可以在-78℃下,在二氯甲烷中使用双光气处理葡糖苷酸接头中间体S1(参考:Jeffrey et al.,Bioconjugate Chemistry,2006,17,831-840)以提供葡糖苷酸氯甲酸酯,然后通过逐滴添加使其与溶解于CH2Cl2中的PBD二聚物S2反应。在2小时内将反应升温至0℃,然后萃取产生化合物S3。用氢氧化锂一水合物处理S3在MeOH、四氢呋喃和水的等量溶剂混合物中的溶液(冷却至0℃)4小时,随后与冰乙酸反应产生化合物S4。向S4在DMF中的溶液中添加马来酰亚胺基己酰基NHS酯,随后添加二异丙基乙胺并且在氮气下在室温下搅拌2小时,产生期望的药物接头S5。
方案B
在无水二氯甲烷中,在EEDQ的存在下,可以将马来酰亚胺接头S6(其可以通过使马来酰亚胺基己酰基N-羟基琥珀酰亚胺与H-Val-Ala-OH反应合成)连接至示例性的化合物S2。
方案C
在5%甲醇/二氯甲烷中,在EEDQ的存在下,可以将接头S8连接至示例性的化合物S2。可以在无水二氯甲烷中,用Ph3P、吡咯烷和钯四(tetrakis palladium)进行S9的脱保护。在DMF中在DIPEA的存在下,通过添加马来酰亚胺己酰基-NHS酯将S10转化为期望的产物。
进一步的优选项
以下优选可以适用于以上描述的发明的所有方面或可涉及单一方面。该优选项可以以任何组合组合在一起。
在一些实施方式中,R6’、R7’、R9’和Y’优选地分别与R6、R7、R9和Y相同。
二聚物连接
Y和Y′优选地是O。
R”优选地是不具有取代基的C3-7亚烷基基团。更优选地,R”是C3、C5或C7亚烷基。最优选地,R”是C3或C5亚烷基。
R6至R9
R9优选地是H。
R6优选地选自H、OH、OR、SH、NH2、硝基和卤素,并且更优选地选自H或卤素,以及最优选地是H。
R7优选地选自H、OH、OR、SH、SR、NH2、NHR、NRR’和卤素,并且更优选地独立地选自H、OH和OR,其中,R优选地选自可选取代的C1-7烷基、C3-10杂环基和C5-10芳基基团。可以更优选的是C1-4烷基基团,其可以或不可以被取代。感兴趣的取代基是C5-6芳基基团(例如,苯基)。在7位置处的特别优选的取代基是OMe和OCH2Ph。特别感兴趣的其他取代基是二甲基氨基(即,-NMe2);-(OC2H4)qOMe,其中,q是0至2;含氮的C6杂环基,包括吗啉基、哌啶基和N-甲基-哌嗪基。
这些优选项分别适用于R9’、R6’和R7’。
R2
在一些实施方式中,R2是式IIa。
R2中的A,当它是式IIa时,可以是苯基基团或C5-7杂芳基基团,例如呋喃基、硫苯基和吡啶基。在一些实施方式中,A优选是苯基。
Q2-X可以在C5-7芳基基团的任何可用的环原子上,但优选地在与化合物的剩余部分的键不相邻的环原子上,即优选地是在化合物的剩余部分的键的β或γ位。因此,当C5-7芳基基团(A)是苯基时,取代基(Q2-X)优选地在间位或对位,并且更优选地在对位。
在一些实施方式中,Q1是单键。在这些实施方式中,Q2选自单键和-Z-(CH2)n-,其中,Z选自单键、O、S和NH,并且n是1至3。在这些实施方式的一些中,Q2是单键。在其他实施方式中,Q2是-Z-(CH2)n-。在这些实施方式中,Z可以是O或S,并且n可以是1或n可以是2。在这些实施方式的其他的中,Z可以是单键,并且n可以是1。
在其它实施方式中,Q1是-CH=CH-。
在其他实施方式中,R2是式IIb。在这些实施方式中,RC1、RC2和RC3独立地选自H和未取代的C1-2烷基。在一些优选的实施方式中,RC1、RC2和RC3都是H。在其他实施方式中,RC1、RC2和RC3都是甲基。在某些实施方式中,RC1、RC2和RC3独立地选自H和甲基。
X选自包含以下的列表的基团:OH、SH、CO2H、COH、N=C=O、NHNH2、CONHNH2、和NHRN,其中,RN选自包括H和C1-4烷基的组。X可以优选地是:OH、SH、CO2H、-N=C=O或NHRN,并且可以更优选地是:OH、SH、CO2H、-N=C=O或NH2。特别优选的基团包括:OH、SH和NH2,NH2是最优选的基团。
在一些实施方式中,R2是式IIc。在这些实施方式中,优选的是Q是NRN。在其他实施方式中,Q是OH。在进一步的实施方式中,R是SH。RN优选地选自H和甲基。在一些实施方式中,RN是H。在其他实施方式中,RN是甲基。
在一些实施方式中,R2可以是-A-CH2-X和-A-X。在这些实施方式中,X可以是OH、SH、CO2H、COH和NH2。在特别优选的实施方式中,X可以是NH2。
R12
当C2’和C3’之间存在双键时,R12选自:
(a)C5-10芳基基团,可选地被选自包含以下的组中的一个或多个取代基取代:卤素、硝基、氰基、醚、C1-7烷基、C3-7杂环基和双-氧基-C1-3亚烷基;
(b)C1-5饱和脂肪族烷基;
(c)C3-6饱和环烷基;
(d)其中,R21、R22和R23中的每个独立地选自H、C1-3饱和烷基、C2-3烯基、C2-3炔基和环丙基,其中,R12基团中碳原子的总数不大于5;
(e)其中,R25a和R25b中的一个是H,并且另一个选自:苯基,该苯基可选地被选自卤素、甲基、甲氧基的基团取代;吡啶基;和苯硫基;和
(f)其中,R24选自:H;C1-3饱和烷基;C2-3烯基;C2-3炔基;环丙基;苯基,该苯基可选地被选自卤素、甲基、甲氧基的基团取代;吡啶基;和苯硫基。
当R12是C5-10芳基基团时,它可以是C5-7芳基基团。C5-7芳基基团可以是苯基基团或C5-7杂芳基基团,例如呋喃基、苯硫基和吡啶基。在一些实施方式中,R12优选地是苯基。在其它实施方式中,R12优选地是苯硫基,例如噻吩-2-基和噻吩-3-基。
当R12是C5-10芳基基团时,它可以是C8-10芳基,例如,喹啉基或异喹啉基基团。喹啉基或异喹啉基基团可以通过任何可用的环位置而结合至PBD核。例如,喹啉基可以是喹啉-2-基,喹啉-3-基,喹啉-4-基,喹啉-5-基,喹啉-6-基,喹啉-7-基和喹啉-8-基。在这些中,喹啉-3-基和喹啉-6-基可以是优选的。异喹啉基可以是异喹啉-1-基,异喹啉-3-基,异喹啉-4-基,异喹啉-5-基,异喹啉-6-基,异喹啉-7-基和异喹啉-8-基。在这些中,异喹啉-3-基和异喹啉-6-基可以是优选的。
当R12是C5-10芳基基团时,它可以带有任何数量的取代基。优选地,它带有1至3个取代基,其中,1和2个取代基是更优选的,并且单取代基团是最优选的。取代基可以在任何位置。
在R12是C5-7芳基基团的情况下,单取代基优选是在环原子上,其并不相邻于与化合物的剩余部分的键,即,它优选是与化合物的剩余部分的键的β或γ位。因此,在C5-7芳基基团是苯基的情况下,取代基优选地在间位或对位,并且更优选地在对位。
在R12是C8-10芳基基团,例如喹啉基或异喹啉基的情况下,其可以在喹啉或异喹啉环的任何位置处带有任何数目的取代基。在一些实施方式中,它带有一个、两个或三个取代基,并且这些取代基可以是在近侧或远侧环或两者上(如果存在一个以上的取代基)。
R12取代基,当R12是C5-10芳基基团时
当R12是C5-10芳基基团时,如果R12上的取代基是卤素,那么它优选地是F或Cl,更优选地是Cl。
当R12是C5-10芳基基团时,如果R12上的取代基是醚,那么在一些实施方式中它可以是烷氧基基团,例如C1-7烷氧基基团(例如甲氧基、乙氧基),或在一些实施方式中它可以是C5-7芳氧基基团(例如苯氧基、吡啶氧基、呋喃氧基)。烷氧基本身可以被进一步取代,例如被氨基基团(例如二甲基氨基)取代。
当R12是C5-10芳基基团时,如果R12上的取代基是C1-7烷基,那么它可以优选地是C1-4烷基基团(例如,甲基,乙基、丙基、丁基)。
当R12是C5-10芳基基团时,如果R12上的取代基是C3-7杂环基,那么在一些实施方式中,它可以是C6含氮杂环基基团,例如,吗啉基、硫代吗啉基、哌啶基、哌嗪基。这些基团可以经由氮原子结合至PBD部分的剩余部分。这些基团可以被进一步取代,例如,由C1-4烷基基团。如果C6含氮杂环基基团是哌嗪基,那么所述进一步的取代基可以在第二个氮环原子上。
当R12是C5-10芳基基团时,如果R12上的取代基是双-氧基-C1-3亚烷基,那么它优选地是双-氧基-亚甲基或双-氧基-亚乙基。
当R12是C5-10芳基基团时,如果R12上的取代基是酯,那么它优选地是甲基酯或乙基酯。
当R12是C5-10芳基基团时,特别优选的取代基包括甲氧基、乙氧基、氟、氯、氰基、双-氧基-亚甲基、甲基-哌嗪基、吗啉基和甲基-苯硫基。用于R12的其他特别优选的取代基是二甲基氨基丙氧基和羧基。
当R12是C5-10芳基基团时,特别优选的取代的R12基团包括但不限于4-甲氧基-苯基、3-甲氧基苯基、4-乙氧基-苯基、3-乙氧基-苯基、4-氟-苯基、4-氯-苯基、3,4-二氧亚甲基-苯基、4-甲基苯硫基、4-氰基苯基、4-苯氧基苯基、喹啉-3-基和喹啉-6-基、异喹啉-3-基和异喹啉-6-基、2-噻吩基、2-呋喃基、甲氧基萘基和萘基。另一种可能的取代的R12基团是4-硝基苯基。特别感兴趣的R12基团包括4-(4-甲基哌嗪-1-基)苯基和3,4-二氧亚甲基-苯基。
当R12是C1-5饱和脂族烷基时,它可以是甲基、乙基、丙基、丁基或戊基。在一些实施方式中,它可以是甲基、乙基或丙基(正戊基或异丙基)。在这些实施方式的一些中,它可以是甲基。在其他实施方式中,它可以是丁基或戊基,其可以是直链或支链的。
当R12是C3-6饱和环烷基时,它可以是环丙基、环丁基、环戊基或环己基。在一些实施方式中,它可以是环丙基。
当R12是时,R21、R22和R23中的每个独立地选自H、C1-3饱和烷基、C2-3烯基、C2-3炔基和环丙基,其中,R12基团中碳原子的总数不大于5。在一些实施方式中,R12基团中的碳原子的总数不大于4或不大于3。
在一些实施方式中,R21、R22和R23中的一个是H,另外两个基团选自H、C1-3饱和烷基、C2-3烯基、C2-3炔基和环丙基。
在其他实施方式中,R21、R22和R23中的两个是H,另外一个基团选自H、C1-3饱和烷基、C2-3烯基、C2-3炔基和环丙基。
在一些实施方式中,不是H的基团选自甲基和乙基。在这些实施方式的一些中,不是H的基团是甲基。
在一些实施方式中,R21是H。
在一些实施方式中,R22是H。
在一些实施方式中,R23是H。
在一些实施方式中,R21和R22是H。
在一些实施方式中,R21和R23是H。
在一些实施方式中,R22和R23是H。
特别感兴趣的R12基团是:
当R12是时,其中,R25a和R25b中的一个是H,并且另一个选自:苯基,该苯基可选地被选自卤素、甲基、甲氧基的基团取代;吡啶基;和苯硫基。在一些实施方式中,不是H的基团是可选取代的苯基。如果苯基可选的取代基是卤素,那么它优选地是氟。在一些实施方式中,苯基基团是未取代的。
当R12是时,R24选自:H;C1-3饱和烷基;C2-3烯基;C2-3炔基;环丙基;苯基,该苯基可选地被选自卤素、甲基、甲氧基的基团取代;吡啶基;和苯硫基。如果苯基可选的取代基是卤素,那么它优选地是氟。在一些实施方式中,苯基基团是未取代的。
在一些实施方式中,R24选自H、甲基、乙基、乙烯基和乙炔基。在这些实施方式的一些中,R24选自H和甲基。
当C2’和C3’之间存在单键时,
R12是H或其中,R26a和R26b独立地选自H、F、C1-4饱和烷基、C2-3烯基,该烷基和烯基基团可选地被选自C1-4烷基氨基和C1-4烷基酯的基团取代;或,当R26a和R26b中的一个是H时,另一个选自腈和C1-4烷基酯;
在一些实施方式中,R12是H。
在一些实施方式中,R12是
在一些实施方式中,优选的是R26a和R26b都是H。
在其他实施方式中,优选的是R26a和R26b都是甲基。
在进一步的实施方式中,优选的是R26a和R26b中的一个是H而另一个选自C1-4饱和烷基、C2-3烯基,该烷基和烯基基团是可选取代的。在这些进一步的实施方式中,可以进一步优选的是不是H的基团选自甲基和乙基。
R10、R11a、R11b、R20、R21a、R21b
在一些实施方式中,R20是H并且R21a和R21b都是H。可替代地,当R21a和R21b都是H时,R20可以是Me。
在一些实施方式中,R20是H并且R21a和R21b一起形成=O。可替代地,当R21a和R21b一起形成=O时,R20可以是Me。
在这些组的实施方式的任一个中,可能优选的是R10和R11b形成它们结合的氮和碳原子之间的氮-碳双键,并且R11a是H。可能可替代优选的是R10是H,R11a是H并且R11b是OH。可能进一步可替代优选的是R10是H,R11a是H且R11b是SOzM,其中,z是2或3且M是单价药用阳离子。
在一些实施方式中,R10是H并且R11a和R11b都是H。可替代地,当R11a和R11b都是H时,R10可以是Me。
在一些实施方式中,R10是H并且R11a和R11b一起形成=O。可替代地,当R11a和R11b一起形成=O时,R10可以是Me。
在这些组的实施方式的任一个中,可能优选的是R20和R21b形成它们连接的氮和碳原子之间的氮-碳双键,并且R21a是H。可能可替代优选的是R20是H,R21a是H并且R21b是OH。可能进一步可替代优选的是R20是H,R21a是H且R21b是SOzM,其中,z是2或3且M是单价药用阳离子。
M和z
优选地,M是单价药用阳离子,并且更优选地是Na+。
z优选地是3。
第四方面
L4
在一些实施方式中,L4是单键。
在一些实施方式中,L4是:
其中,n是0至3。在这些实施方式中,n可以是0、1、2或3。n=0和n=1可以是优选的
在一些实施方式中,L4是:
其中,n是0至3。在这些实施方式中,n可以是0、1、2或3。n=0和n=1可以是优选的
在一些实施方式中,L4是:
其中,n是0至3。在这些实施方式中,n可以是0、1、2或3。n=0和n=1可以是优选的。
在一些实施方式中,L4是:
其中,n是0至3。在这些实施方式中,n可以是0、1、2或3。n=0和n=1可以是优选的。在这些实施方式的一个中,D是N。在这些实施方式的其他的中,D是CH。在这些实施方式的一个中,E是O或S。在这些实施方式的其他的中,F是CH。
L3
在一个实施方式中,L3是氨基酸残基。氨基酸可以是天然氨基酸或非天然氨基酸。
在一个实施方式中,L3选自:Phe、Lys、Val、Ala、Cit、Leu、Ile、Arg和Trp,其中,Cit是瓜氨酸。
在一个实施方式中,L3包含二肽残基。二肽中的氨基酸可以是天然氨基酸和非天然氨基酸的任意组合。在一些实施方式中,二肽包含天然氨基酸。在接头是组织蛋白酶不稳定的接头的情况下,二肽是组织蛋白酶介导的切割的作用位点。然后二肽是组织蛋白酶的识别位点。
在一个实施方式中,L3选自:
Prot-Phe-Lys-L4、
Prot-Val-Ala-L4、
Prot-Val-Lys-L4、
Prot-Ala-Lys-L4、
Prot-Val-Cit-L4、
Prot-Phe-Cit-L4、
Prot-Leu-Cit-L4、
Prot-Ile-Cit-L4、
Prot-Phe-Arg-L4和
Prot-Trp-Cit-L4,
其中,Cit是瓜氨酸。
优选地,L3选自:
Prot-Phe-Lys-L4、
Prot-Val-Ala-L4、
Prot-Val-Lys-L4、
Prot-Ala-Lys-L4和
Prot-Val-Cit-L4。
最优选地,L3选自Prot-Phe-Lys-L4、Prot-Val-Cit-L4或Prot-Val-Ala-L4。
其它感兴趣的二肽组合包括:
Prot-Gly-Gly-L4、
Prot-Pro-Pro-L4和
Prot-Val-Glu-L4。
可以使用其它二肽组合,其包括Dubowchik et al.,Bioconjugate Chemistry,2002,13,855-869描述的那些,通过引用将其结合于此。
在一些实施方式中,L3是三肽残基。三肽中的氨基酸可以是天然氨基酸和非天然氨基酸的任意组合。在一些实施方式中,三肽包含天然氨基酸。在接头是组织蛋白酶不稳定的接头的情况下,三肽是组织蛋白酶介导的切割的作用位点。然后三肽是组织蛋白酶的识别位点。
在一个实施方式中,在适当情况下,氨基酸侧链是受化学保护的。侧链保护基团可以是以下讨论的基团。受保护的氨基酸序列可以由酶切割。例如,包含Boc侧链保护的Lys残基的二肽序列可以被组织蛋白酶切割。
用于氨基酸侧链的保护基团在本领域中是众所周知的并且描述在Novabiochem目录中且如以上所描述。
Prot
Prot选自Fmoc(芴甲氧羰基)、Teoc(2-(三甲基硅烷基)乙氧基羰基)、Boc(叔丁氧基羰基)和Alloc(烯丙氧基羰基)。在一些实施方式中,Prot选自Fmoc和Teoc。
在一些实施方式中,Prot是Fmoc。
在一些实施方式中,Prot是Teoc。
本发明的第一方面的特别优选的化合物是以下式Ia-1、Ia-2、Ia-3或Ia-4:
其中,R12a选自:
(a)
(b)
(c)
(d)
(e)
(f)
(g)和
(h)氨基基团在苯基基团的间位或对位。
本发明的第一方面进一步特别优选的化合物是以下式Ib-1、Ib-2、Ib-3或Ib-4:
其中,
n是1或3;
R1a是甲基或苯基;
R12a选自:
(a)
(b)
(c)
(d)
(e)
(f)
(g)和
(h)
本发明的第一方面进一步特别优选的化合物是以下式Ic-1、Ic-2、Ic-3或Ic-4:
其中,
n是1或3;
R1a是甲基或苯基;
R12a选自:
(a)
(b)
(c)
(d)
(e)
(f)
(g)和
(h)
第五方面
视情况,式I化合物的优选项适用于本发明第五方面中的D。。例如,在第五方面,PBD二聚物是本文描述的式I的任何化合物或其药用盐或溶剂化物,除了被代替,被代替,以及*-NHRN被代替,其中,波浪线表示附接至接头单元的点。
因此,本发明的结合物包括具有下式(V)的那些:
L-(LU-D)p (V),
或其药用盐或溶剂化物,其中,L是配体单元(即,靶向剂),LU是接头单元和PBD二聚物D。D是本文描述的式I的任何化合物或其药用盐或溶剂化物,除了被代替,被代替,以及*-NHRN被代替,其中,波浪线表示附接至接头单元的点。
(a)本发明的结合物包括例如以下式的那些:
CBA–A1–L1–*,
其中,星号表示附接至PBD二聚物(D)的点,CBA是细胞结合剂,L1是通过酶的作用可切割的特异性单元,并且A1是将L1连接至细胞结合剂的延伸子单元。
(b)本发明的结合物包括例如以下式的那些:
CBA–A1–L1–*,
其中,星号表示附接至PBD二聚物(D)的点,CBA是细胞结合剂,A1是将L1连接至细胞结合剂的延伸子单元并且L1是通过组织蛋白酶的作用可切割的特异性单元,L1是二肽,L1是通过组织蛋白酶的作用可切割的二肽或L1是选自-Phe-Lys-、-Val-Ala-、-Val-Lys-、-Ala-Lys-和-Val-Cit-的二肽。
本发明优选的结合物包括在(a)和(b)中描述的那些的任一种,其中,A1是:
其中,星号表示附接至L1的点,波浪线表示附接至CBA的点,且n是0至6(优选n是5)。
说明性的合成方案
以下方案示出了制作本发明的某些化合物的方式,其中,一般作为R、R’和R2示出某些基团。应当根据本发明的公开理解其中这些形成一部分的基团。在明确描述了保护基团的方案中,这些还可以在本发明的范围内变化。
方案1a-胺构建嵌段的合成
方案1b-受保护的氨基甲酸酯构建嵌段的合成
方案1c-附接有接头的二聚物(药物-接头)的合成
方案2-仲胺构建嵌段的可替代合成
方案3a-二内酰胺构建嵌段的合成
方案3b-约束下受保护的氨基甲酸酯构建嵌段的合成
方案3c-附接有接头的二聚物(药物-接头)的合成
在以上方案中,保护基团可以彼此正交以提供合成灵活性。
实施例
通用实验方法
通过使用MeRCk Kieselgel 60 F254硅胶的薄层色谱法(TLC)在铝板上用荧光指示剂监测反应进程。除非另外指出,否则TLC的显影用UV光或碘蒸气实现。使用MeRCkKieselgel 60 F254硅胶进行快速色谱法。取出来自Fisher Scientific,U.K的萃取和色谱溶剂并在没有进一步纯化的情况下使用。所有化学品是购自Aldrich、Lancaster或BDH。
在Bruker Avance 400光谱仪上得到1H和13C NMR光谱。耦合常数以赫兹(Hz)为单位报告。将化学位移记录为从四甲基硅烷向低场的每百万份数(ppm)。将自旋多重性描述为s(单峰)、bs(宽单峰)、d(双峰)、t(三重峰)、q(四重峰)、p(五重峰)和m(多重峰)。通过施加样品于氯仿溶液中,使用ATR“golden gate”系统在Perkin-Elmer FT/IR paragon 1000分光光度计上记录IR光谱。在室温下使用Bellingham和Stanley ADP 220偏振计测量旋光度。在来自Thermo Electron的ThermoQuest Navigator上进行质谱法。在20至30V下得到点喷射(ES)光谱。使用Micromass Q-TOF总体串联进行精确的质量测量。在电喷射离子化模式下使用水中的50%乙腈和0.1%甲酸作为溶剂运行所有样品。在W模式下运行样品,其在FWHH下给出通常的19000的分辨率。在测量之前,立即用[Glu]-血纤维蛋白肽B校正仪器。
通常的LC/MS条件:
方法1(默认方法,除非另外指出,否则使用)
使用水(A)(甲酸0.1%)和乙腈(B)(甲酸0.1%)的流动相进行HPLC(WatersAlliance 2695)。梯度:初始组成5%B,持续1.0分钟,然后,在3分钟时间段内,从5%B升高至95%B。将组成保持在95%B下0.1min,然后在0.03分钟内回到5%B并保持0.87min。总梯度运行时间等于5分钟。
流速3.0mL/min,经由零死容积T形管(其通入质谱仪)来分开400μL。波长检测范围:220至400nm。功能类型:二极管阵列(535次扫描)。柱:Phenomenex Onyx MonolithicC18 50×4.60mm。
反相快速纯化条件如下:使用水(A)和乙腈(B)流动相进行快速纯化系统(Varian971-Fp)。梯度:在20C.V(柱体积)下,初始组成5%B,然后在60C.V内从5%B至70%B。将95%B的组成保持在15C.V.,然后在5C.V.内回到5%B,并保持在5%B持续10C.V.。总梯度运行时间等于120C.V。流速:6.0mL/min波长检测范围:254nm。柱:Agilent AX1372-1 SF10-5.5gC8。
快速甲酸:
使用Shimazu LCMS-2020(单个四极质谱仪)进行正向模式的电喷射质谱法(ESI-MS)。使用的流动相是水(A)(甲酸0.1%)和乙腈(B)(甲酸0.1%)。梯度:初始组成5%B,保持0.25min,然后在2min时间段内从5%B升高至100%B。将组成保持在100%B下0.50min,然后在0.05分钟内回到5%B并保持0.05min。总梯度运行时间等于3分钟。流速0.8mL/min。波长检测范围:220至400nm。柱:Waters Acquity UPLC BEH Shield RP18 1.7μm 2.1x50mm。
C18 15min甲酸:
使用Shimazu LCMS-2020(单个四极质谱仪)进行正向模式的电喷射质谱法(ESI-MS)。烘箱温度50℃。使用的流动相是水(A)(甲酸0.1%)和乙腈(B)(甲酸0.1%)。梯度:初始组成5%B,保持1min,然后在9min时间段内,从5%B升高至100%B。将组成保持在100%B下2min,然后在0.10分钟内回到5%B并保持在此2.90min。总梯度运行时间等于15分钟。流速0.6mL/min。波长检测范围:220至400nm。柱:Gemini-NX UPLC C18 3μm 2x100mm.
制备HPLC:在以下尺寸的Phenomenex Gemini NX 5μC-18柱上进行反相超高性能液相色谱法(UPLC):150x 4.6mm用于分析,以及150x 21.20mm用于准备工作。通过以下梯度条件进行所有UPLC实验:初始固定组成15min内13%B至75%B,保持在75%B下2.0min,然后在0.10min内从75%B至13%B,保持在13%2.90min。运行的总梯度持续时间是20.00min。洗脱液是溶剂A(H2O与0.1%甲酸)和溶剂B(CH3CN与0.1%甲酸)。使用的流速是1.0ml/min用于分析,和20.0ml/min用于制备HPLC。在254nm和280nm处检测。
实施例1
(a)(R)-2-((R)-2-((((9H-芴-9-基)甲氧基)羰基)氨基)-3-甲基丁氨基)丙酸(2)将HO-Ala-Val-H 1(350mg,1.86mmol)和Na2CO3(493mg,4.65mmol)溶解在蒸馏的H2O(15mL)中,并在添加二氧六环(15mL)之前将混合物冷却至0℃(发生氨基酸盐的部分沉淀)。在剧烈搅拌下,经10分钟,逐滴添加Fmoc-Cl(504mg,1.95mmol)在二氧六环(15mL)中的溶液。在除去冰浴以前,在0℃下搅拌得到的混合物2小时,然后持续搅拌16小时。在减压下通过旋转蒸发除去溶剂,并将残留溶解在水(150mL)中。用1N HCl将pH从9调节至2,随后用EtOAc(3x100mL)萃取水层。用盐水(100mL)洗涤合并的有机物,用MgSO4干燥,过滤并在减压下通过旋转蒸发除去挥发物,以提供纯的HO-Ala-Val-Fmoc 2(746mg,97%产率)。LC/MS2.85min(ES+)m/z(相对强度)410.60:1H-NMR(400MHz,CDCl3)δ7.79(d,J=7.77Hz.2H),7.60(d,J=7.77Hz,2H),7.43(d,J=7.5Hz,2H),7.34(d,J=7.5Hz,2H),6.30(bs,1H),5.30(bs,1H),4.71-7.56(m,1H),4.54-4.36(m,2H),4.08-3.91(m,1H),2.21-2.07(m,1H),1.50(d,J=7.1Hz,3H),1.06-0.90(m,6H).
(b)(9H-芴-9-基)甲基((S)-3-甲基-1-氧代-1-(((S)-1-氧代-1-((4-(4,4,5,5-四甲基-1,3,2-二氧代硼戊烷-2-基)苯基)氨基)丙烷-2-基)氨基)丁烷-2-基)氨基甲酸酯(3)
在室温下,将4-氨基苯基硼酸频哪醇酯(146.9mg,0.67mmol)添加至先前已搅拌30分钟的用氩气冲洗的烧瓶中的HO-Ala-Val-Fmoc 2(330mg,0.8mmol)、DCC(166mg,0.8mmol)和DMAP(5mg,催化剂)在干燥DCM(8mL)中的溶液中。然后允许在室温下搅拌反应混合物过夜。反应后伴随着LCMS和TLC。用CH2Cl2稀释反应混合物,并在用MgSO4干燥之前用H2O和盐水洗涤有机物,过滤且在减压下通过旋转蒸发除去溶剂。将粗产物干燥负载在硅胶色谱柱(己烷/EtOAc,6∶4)上,并且以88%产率(360mg)分离白色固体纯产物3。
(c)8-(3-((2-(4-((S)-2-((S)-2-((((9H-芴-9-基)甲氧基)羰基)氨基)-3-甲基丁氨基)丙氨基)苯基)-7-甲氧基-5,11-二氧代-10-((2-(三甲基甲硅烷基)乙氧基)甲基)-5,10,11,11a-四氢-1H-苯并[e]吡咯并[1,2-a][1,4]二氮杂卓-8-基)氧基)丙氧基)-7-甲氧基-5,11-二氧代-10-((2-(三甲基甲硅烷基)乙氧基)甲基)-5,10,11,11a-四氢-1H-苯并[e]吡咯并[1,2-a][1,4]二氮杂卓-2-基三氟甲磺酸酯(5)
将1,1’-[[(丙烷-1,3-二基)二氧基]双(11aS)-7-甲氧基-2-[[(三氟甲基)磺酰基]氧基]-10-((2-(三甲基硅烷基)乙氧基)甲基)-1,10,11,11a-四氢-5H-吡咯并[2,1-c][1,4]-苯并二氮杂卓-5,11-二酮]4(2.03g,1.81mmol)、硼酸频哪醇酯(1g,1.63mmol)和Na2CO3(881mg,8.31mmol)溶解在2∶1∶1(40mL)的甲苯/MeOH/H2O的混合物中。在添加四(三苯基膦)钯(0)(41mg,0.035mmol)之前,用氩气吹扫和填充反应烧瓶,并且将反应混合物整夜加热至30℃。在减压下除去溶剂,以及将残留提取在H2O(100mL)中,且用EtOAc(3x100mL)萃取。用盐水(100mL)洗涤合并的有机物,用MgSO4干燥,过滤并在减压下通过旋转蒸发除去挥发物。通过硅胶色谱柱(己烷/EtOAc,8∶2至25∶75)纯化粗产物,以提供33%产率(885mg)的纯5。LC/MS 3.85min(ES+)m/z(相对强度)1452.90;1H NMR(400MHz,CDCl3)δ7.78-7.16(m,17H),7.13(s,1H),6.51-6.24(m,1H),5.51(dd,J=10.0,5.1Hz,2H),5.36-5.11(m,1H),4.74(dd,J=10.1,4.4Hz,2H),4.70-4.53(m,2H),4.47(d,J=6.4Hz,1H),4.37(d,J=7.2Hz,1H),4.27(m,4H),4.20-4.14(m,1H),3.90(s,3H),3.89(s,3H),3.77(ddd,J=16.7,9.0,6.4Hz,3H),3.71-3.61(m,2H),3.24-2.91(m,3H),2.55-2.33(m,2H),2.22-2.07(m,1H),1.52-1.37(m,3H),1.04-0.86(m,10H),0.00(s,18H).
(d)(9H-芴-9-基)甲基((2S)-1-(((2S)-1-((4-(8-(3-((2-环丙基-7-甲氧基-5,11-二氧代-10-((2-(三甲基甲硅烷基)乙氧基)甲基)-5,10,11,11a-四氢-1H-苯并[e]吡咯并[1,2-a][1,4]二氮杂卓-8-基)氧基)丙氧基)-7-甲氧基-5,11-二氧代-10-((2-(三甲基甲硅烷基)乙氧基)甲基-5,10,11,11a-四氢-1H-苯并[e]吡咯并[1,2-a][1,4]二氮杂卓-2-基)苯基)氨基)-1-氧代丙烷-2-基)氨基)-3-甲基-1-氧代丁烷-2-基)氨基甲酸酯(6)
在氩气气氛下,将三苯胂(42mg,0.137mmol)添加至PBD-三氟甲磺酸酯5(250mg,0.172mmol)、环丙基硼酸(73.9mg,0.86mmol)、氧化银(159mg,0.688mmol)和磷酸三钾(438mg,2.06mmol)在干燥二氧六环(10mL)中的混合物中。用氩气冲洗反应3次,并添加双(苯甲腈)氯化钯(II)(13.2mg,0.034mmol)。在将反应升温至75℃之前,用氩气冲洗反应3次以上,并搅拌10分钟。通过C盐滤板过滤反应混合物,随后用乙酸乙酯漂洗。在减压下通过旋转蒸发除去溶剂。使得到的残留进行快速柱色谱(硅胶;1%甲醇/氯仿)。收集并合并纯化部分,并在减压下通过旋转蒸发除去过量洗脱液以提供期望的产物22(132mg,50%产率)。LC/MS3.83min(ES+)m/z(相对强度)1345.91;1H NMR(400MHz,CDCl3)δ7.88-7.14(m,17H),6.69(s,1H),6.45-6.25(m,1H),5.57-5.41(m,2H),5.34-5.14(m,1H),4.78-4.67(m,2H),4.62-4.55(m,1H),4.50-4.45(m,2H),4.51-4.44(m,1H),4.31-4.21(m,4H),4.16(m,1H),3.92(s,3H),3.86(s,3H),3.82-3.71(m,2H),3.66(m,3H),3.40-3.28(m,1H),3.07(m,1H),2.70-2.57(m,1H),2.47-2.36(m,2H),2.15(m,1H),1.51-1.40(m,3H),1.03-0.87(m,11H),0.77-0.71(m,2H),0.60-0.54(m,2H),0.00(t,J=3.0Hz,18H).
(e)(9H-芴-9-基)甲基((2S)-1-(((2S)-1-((4-(8-(3-((2-环丙基-7-甲氧基-5-氧代-5,11a-二氢-1H-苯并[e]吡咯并[1,2-a][1,4]二氮杂卓-8-基)氧基)丙氧基)-7-甲氧基-5-氧代-5,11a-二氢-1H-苯并[e]吡咯并[1,2-a][1,4]二氮杂卓-2-基)苯基)氨基)-1-氧代丙烷-2-基)氨基)-3-甲基-1-氧代丁烷-2-基)氨基甲酸酯(7)
在氩气气氛下,在-78℃下将Super-(0.5mL,在THF中1M)逐滴添加至SEM双内酰胺6(265mg g,0.19mmol)在THF(10mL)中的溶液中。在5分钟内完成添加,以保持反应混合物的内部温度恒定。20分钟后,用水淬灭等分反应,用于LC/MS分析,该分析显示反应完成。添加水(20mL)至反应混合物中并除去冷浴。用EtOAc(3x30mL)萃取有机层,并且用盐水(50mL)洗涤合并的有机物,用MgSO4干燥,过滤并在减压下通过旋转蒸发除去溶剂。将粗产物溶解在MeOH(12mL)、CH2Cl2(6mL)和水(2mL)以及足够的硅胶中以形成稠密的搅拌悬浮液。5天后,通过烧结漏斗过滤悬浮液,并且用CH2Cl2/MeOH(9∶1)(200mL)洗涤直到产物被完全洗脱。用盐水(2x70mL)洗涤有机层,用MgSO4干燥,过滤并在减压下通过旋转蒸发除去溶剂。通过硅胶柱色谱(100%CHCl3至96%CHCl3/4%MeOH)纯化提供了黄色固体产物23(162mg,78%)。LC/MS 3.02min(ES+)m/z(相对强度)1052.37。
(f)亚胺还原
在氩气气氛下,在-78℃下将Super-(95mL,1当量,在THF中1M)逐滴添加至双亚胺7(100mg,0.095mmol)在THF(10mL)中的溶液中。20分钟后,用水淬灭等分反应,用于LC/MS分析,该分析反映反应完成,一些过度还原为双胺10。(观察的LC:双亚胺7 19%,亚胺-胺8+9 36%,双胺10 45%;对于1eq还原剂的理论目标是7 25%,8+9 50%,1025%)。添加水(20mL)至反应混合物中并除去冷浴。用氯仿(40mL)萃取有机层,并且用水(1x40mL)、盐水(50mL)洗涤合并的有机物,用Na2SO4干燥,过滤并在减压下通过旋转蒸发除去溶剂。通过硅胶柱色谱(100%CHCl3至96%CHCl3/4%MeOH)纯化提供了产物8+9至:7 25%,8+950%,10 25%(20mg,25%混合物)。
LC/MS(快速甲酸,2.5min系统)双亚胺7 1.66min(ES+)m/z(相对强度)1052.15;混合的胺-亚胺8+9(在2.5min系统上没有分离)1.71min(ES+)m/z(相对强度)1054.45;双胺101.66min(ES+)m/z(相对强度)1056.95,以1/2/1比。
(g)Fmoc脱保护
添加过量哌啶(0.1mL,1mmol)至Fmoc保护的7、8+9和10(20mg,0.019mmol)在DMF(1mL)的1/2/1混合物中。允许在室温下搅拌混合物20分钟,此时,反应进行至完全(通过LC/MS监控)。用CH2Cl2(30mL)释稀反应混合物,并且使用H2O(2x30 mL)洗涤有机相直至完全除去哌啶。用MgSO4干燥有机相,过滤并在减压下通过旋转蒸发除去过量溶剂,以提供粗产物11、12+13和14(1/2/1比),直接在下一步骤中使用该粗产物。
LC/MS(快速甲酸,2.5min系统)双亚胺11 1.12min(ES+)m/z(相对强度)830.45;混合的胺-亚胺12+13(没有分离)1.15min(ES+)m/z(相对强度)832.35;双胺14 1.19min(ES+)m/z(相对强度)834.35,以1/2/1比。
实施例2
PEG马来酰亚胺偶联
在氩气气氛下,将EDCI盐酸盐(5.46mg,0.028mmol,1.5eq)添加至马来酰亚胺-PEG8-酸(11.3mg,0.019mmol,1eq)在氯仿(5mL)中的悬浮液中。在添加PBD混合物11+12+13+14(20mg,0.019mmol,1eq)之前,在室温下搅拌混合物1小时。保持搅拌直到反应完成(通常是2小时)。用CH2Cl2稀释反应,并在用MgSO4干燥之前用H2O和盐水洗涤有机相,过滤并在减压下通过旋转蒸发除去过量的溶剂。通过反相色谱(参见以下方法)单独纯化并分离四种产物。分离出双亚胺15(2.0mg,7.5%),随后是可分离的胺-亚胺混合物16(3.8mg,14.2%)和17(2.7mg,10.1%)以及双胺18(2.2mg,8.2%)。1H NMR分析明确识别16和17。一个关键特征是亚胺质子(d,J=4.0Hz,1H),其在环丙基侧是7.78ppm以及在分子的芳香族侧是7.88ppm。
LC/MS C18 15min甲酸:
双亚胺15 5.23min(ES+)m/z(相对强度)703.20(100,(M+2H)/2),1404.55(10,M+H);1H NMR(400MHz,CDCl3)δ8.96(s,1H),7.89(d,J=4.0Hz,1H),7.78(d,J=4.0Hz,1H),7.77-7.64(m,2H),7.53-7.47(m,2H),7.44(s,1H),7.33(d,J=8.7Hz,2H),7.12(s,1H),6.89-6.81(m,2H),6.74(s,1H),6.69(d,J=1.6Hz,2H),6.50(s,1H),4.72-4.57(m,1H),4.43-4.14(m,7H),4.11-4.03(m,1H),3.93(d,J=4.6Hz,6H),3.83(t,J=7.2Hz,2H),3.80-3.71(m,2H),3.68-3.56(m,28H),3.55-3.48(m,3H),3.44-3.34(m,3H),3.16-3.04(m,1H),2.95-2.84(m,1H),2.58-2.47(m,4H),2.47-2.37(m,2H),2.33-2.16(m,1H),1.51-1.41(m,4H),1.05-0.94(m,6H),0.77(dt,J=5.5,4.8Hz,2H),0.55(dd,J=9.3,4.9Hz,2H).
混合的胺-亚胺16 5.48min(ES+)m/z(相对强度)704.20(100,(M+2H)/2),1406.70(5,M+H);1H NMR(400MHz,CDCl3)δ8.97(s,1H),7.88(d,J=3.9Hz,1H),7.78-7.64(m,2H),7.55-7.48(m,2H),7.43(s,1H),7.36-7.29(m,2H),7.12(s,1H),6.93-6.80(m,3H),6.69(d,J=1.7Hz,2H),6.49(s,1H),6.10(s,1H),4.72-4.58(m,1H),4.41-4.32(m,1H),4.32-4.24(m,2H),4.24-4.16(m,2H),4.15-4.02(m,2H),3.94(s,3H),3.87-3.80(m,5H),3.79-3.71(m,2H),3.69-3.55(m,28H),3.55-3.49(m,2H),3.46-3.34(m,4H),2.93-2.81(m,2H),2.60-2.46(m,4H),2.42-2.34(m,2H),2.24(dd,J=14.0,6.5Hz,2H),1.52-1.38(m,4H),1.07-0.92(m,6H),0.75-0.66(m,2H),0.53-0.44(m,2H).
混合的亚胺-胺17 5.41min(ES+)m/z(相对强度)704.25(100,(M+2H)/2),1406.45(3,M+H);1H NMR(400MHz,CDCl3)δ8.90(s,1H),7.78(d,J=4.0Hz,1H),7.74-7.58(m,2H),7.56(s,1H),7.53-7.49(m,2H),7.36-7.27(m,2H),7.17-7.04(m,1H),6.95-6.86(m,1H),6.83(s,1H),6.74(s,1H),6.71-6.65(m,2H),6.53(s,1H),6.12(s,1H),4.74-4.57(m,2H),4.40-4.14(m,7H),4.13-4.05(m,1H),3.93(s,3H),3.88-3.79(m,5H),3.79-3.68(m,2H),3.69-3.55(m,28H),3.55-3.48(m,3H),3.46-3.29(m,3H),3.16-3.03(m,1H),2.91(s,1H),2.71(s,1H),2.60-2.45(m,4H),2.42-2.33(m,2H),2.29-2.19(m,1H),1.51-1.39(m,4H),1.08-0.92(m,6H),0.81-0.74(m,2H),0.55(dd,J=9.3,5.4Hz,2H).
双胺18 5.72min(ES+)m/z(相对强度)705.15(100,(M+2H)/2),1408.45(3,M+H);1H NMR(400MHz,CDCl3)δ8.91(s,1H),7.73-7.59(m,2H),7.57-7.49(m,3H),7.29(d,J=2.5Hz,2H),7.13(s,1H),6.91(s,1H),6.84(s,1H),6.68(d,J=2.0Hz,2H),6.49(s,1H),6.08(d,J=3.9Hz,3H),4.72-4.57(m,2H),4.19(dd,J=10.6,4.3Hz,5H),4.09(dd,J=12.6,5.8Hz,2H),3.88-3.80(m,8H),3.79-3.69(m,2H),3.68-3.56(m,28H),3.54-3.46(m,2H),3.44-3.26(m,4H),2.85(dd,J=15.9,10.4Hz,2H),2.73(dd,J=16.2,4.7Hz,2H),2.61-2.43(m,4H),2.38-2.16(m,5H),1.51-1.38(m,4H),1.05-0.93(m,6H),0.69(dt,J=4.9,4.3Hz,2H),0.49(dd,J=5.1,3.3Hz,2H).
实施例3
(a)叔丁基(11S)-2-(4-((S)-2-((S)-2-((((9H-芴-9-基)甲氧基)羰基)氨基)-3-
甲基丁氨基)丙氨基)苯基)-11-((叔丁基二甲基甲硅烷基)氧基)-8-羟基-7-甲氧基-5-氧
代-11,11a-二氢-1H-苯并[e]吡咯并[1,2-a][1,4]二氮杂卓-10(5H)-羧酸酯(27)
(i)(S)-(4-(4-氨基苯基)-2-(((叔丁基二甲基甲硅烷基)氧基)甲基)-2,3-二氢-1H-吡咯-1-基)(5-甲氧基-2-硝基-4-((三异丙基甲硅烷基)氧基)苯基)甲酮(20)
将Pd(PPh3)4(609mg,0.52mmol)添加至三氟甲磺酸酯19(18.8g,26.3mmol)、4-氨基苯基硼酸频哪醇酯(8.64g,39.4mmol)、Na2CO3(12.78g,120mmol)、MeOH(80mL)、甲苯(160mL)和水(80mL)的搅拌混合物中。允许在氮气气氛下在30℃下搅拌反应混合物24小时,该时间之后,所有硼酸酯被耗尽。然后在将残留取出至EtOAc(100mL)之前,蒸发反应混合物至干燥,并且用H2O(100mL)、盐水(100mL)洗涤,干燥(MgSO4)、过滤并在减压下蒸发,以提供粗产物。通过硅色谱(己烷/EtOAc;100%至70:30)纯化提供了黄色泡沫状产物20(11.06g,64%)。1H-NMR(400MHz,CDCl3)δ7.74(s,1H),7.00(d,J=8.3Hz,2H),6.81(s,1H),6.58(d,J=8.3Hz,2H),6.06(s,1H),4.77(bm,1H),3.91(d,J=6.7Hz,3H),3.68(bs,2H),3.13(bm,1H),2.97(d,J=14.5Hz,1H),1.36-1.21(m,3H),1.12(d,J=7.3Hz,18H),0.89(s,10H),0.10(s,6H).);ES+=2.27min,m/z698[M+CH3CN]+.
(ii)(9H-芴-9-基)甲基((S)-1-(((S)-1-((4-((S)-5-(((叔丁基二甲基甲硅烷基)氧基)甲基)-1-(5-甲氧基-2-硝基-4-((三异丙基甲硅烷基)氧基)苯甲酰基)-4,5-二氢-1H-吡咯-3-基)苯基)氨基)-1-氧代丙烷-2-基)氨基)-3-甲基-1-氧代丁烷-2-基)氨基甲酸酯(21)
添加苯胺20(10.05g,15.3mmol)、二肽(6.3g,15.3mmol)和干燥CH2Cl2(500mL)至之前用氩气冲洗的干燥圆底烧瓶中。在添加EEDQ(3.79g,15.3mmol)之前用氩气吹扫烧瓶三次,并在室温下搅拌混合物。反应之后是LCMS,并且在3.5小时之后,反应完成。用H2O(200mL)淬灭反应,并用CH2Cl2(250mL)萃取两次。用盐水(150mL)洗涤合并的有机物,用MgSO4干燥,真空过滤除去溶剂。通过硅胶色谱(己烷/EtOAc;100%至55∶45)纯化粗产物,以提供纯产物21(13.821g,86%)。1H-NMR(400MHz,CDCl3)δ8.26(s,1H),7.64(s+d,J=4.9Hz,3H),7.43(t,J=7.3Hz,1H),7.36(d,J=7.3Hz,1H),7.28(t,J=7.3Hz,1H),7.19(d,J=7.7Hz,1H),6.99(d,J=7.9Hz,1H),6.71(s,1H),6.27(d,J=6.3Hz,1H),6.08(s,1H),5.11(d,J=6.6Hz,1H),4.69(bs,1H),4.52(bm,1H),4.36(d,J=6.5Hz,2H),4.08(t,J=5.9Hz,1H),3.89(m,1H),3.80(s,3H),3.11-2.97(bm,1H),2.88(bd,J=15.2Hz,1H),2.03(bs,1H),1.33(d,J=6.9Hz,3H),1.24-1.11(m,3H),1.01(d,J=7.4Hz,18H),0.86-0.79(m,6H),0.77(s,9H),0.00(s,6H);ES+=2.37min,无质量。
(iii)(9H-芴-9-基)甲基((S)-1-(((S)-1-((4-((S)-1-(2-氨基-5-甲氧基-4-((三异丙基甲硅烷基)氧基)苯甲酰基)-5-(((叔丁基二甲基甲硅烷基)氧基)甲基)-4,5-二氢-1H-吡咯-3-基)苯基)氨基)-1-氧代丙烷-2-基)氨基)-3-甲基-1-氧代丁烷-2-基)氨基甲酸酯(22)
在之前用氩气冲洗并固定有温度计的干燥两颈圆底烧瓶中,将硝基苯21(2.97g,2.8mmol)溶解在甲醇中5%甲酸的溶液(50mL)中。快速将锌(1.85g,28mmol)倒入溶液中。温度瞬时升高至40℃并缓慢冷却回到室温,在该点,反应完成(约15分钟,由LCMS监控反应)。然后通过C盐过滤反应混合物并用EtOAc(2x150mL)进一步洗涤滤饼。随后在用MgSO4干燥之前,用饱和NaHCO3(溶液)(100mL)、H2O(100mL)和盐水(100mL)洗涤合并的有机物,过滤并真空除去挥发物。用硅胶色谱(己烷/EtOAC 75∶25至50∶50)纯化粗材料,并分离淡黄色油状纯产物22(2.291g,79%产率)。1H-NMR(400MHz,CDCl3)δ8.37(s,1H),7.74(s+d,J=4.9Hz,3H),7.53(t,J=7.4Hz,2H),7.46(d,J=11.3Hz,2H),7.39(t,J=7.3Hz,2H),7.28(t,J=11.3Hz,2H),7.09(d,J=7.9Hz,2H),6.38(d,J=6.3Hz,1H),6.18(s,1H),5.21(d,J=2.9Hz,1H),4.81(bs,1H),4.72-4.57(m,1H),4.47(d,J=6.5Hz,2H),4.19(t,J=5.0Hz,1H),4.00-3.94(m,1H),3.91(s,3H),3.23-3.07(m,1H),2.98(d,J=16.8Hz,1H),2.15(s,1H),1.43(d,J=6.9Hz,3H),1.36-1.18(m,3H),1.12(d,J=7.4Hz,18H),0.97-0.89(m,6H),0.88(s,9H),0.10(s,6H)。ES+=2.37min,m/z无质量。
(iv)(9H-芴-9-基)甲基((S)-1-(((S)-1-((4-((S)-1-(2-((叔丁氧基羰基)氨基)-5-甲氧基-4-((三异丙基甲硅烷基)氧基)苯甲酰基)-5-(((叔丁基二甲基甲硅烷基)氧基)甲基)-4,5-二氢-1H-吡咯-3-基)苯基)氨基)-1-氧代丙烷-2-基)氨基)-3-甲基-1-氧代丁烷-2-基)氨基甲酸酯(23)
在70℃下于圆底烧瓶中同时加热胺22(14.913g,14.6mmol)和Boc2O(3.83g,17.5mmol)。为了帮助溶解,添加CHCl3(25mL)并搅拌混合物直到反应完成(随后是LCMS)。在直接负载在硅胶色谱柱(己烷/EtOAc;100%至65:35)上之前,将稠的粗溶液冷却至室温。分离膏体泡沫状产物23(13.2g,80%产率)。1H-NMR(400MHz,CDCl3)δ8.40(s,1H),8.21(s,1H),7.74(d,J=7.8Hz,3H),7.54(t,J=7.0Hz,2H),7.48(d,J=7.7Hz,2H),7.38(t,J=7.4Hz,2H),7.31-7.25(m,3H),7.14(d,J=6.7Hz,2H),6.84(bs,1H),6.80(s,1H),6.50(d,J=6.4Hz,1H),5.28(d,J=6.0Hz,1H),4.77(d,J=2.6Hz,1H),4.70-4.58(m,1H),4.47(t,J=5.7Hz,2H),4.19(t,J=6.1Hz,1H),4.00(m,2H),3.88(bs,1H),3.73(s,3H),3.05(m,1H),2.98(dd,J=15.4,3.3Hz,1H),2.15(bm,1H),1.46(s,9H),1.43(d,J=11.7Hz,3H),1.36-1.22(m,3H),1.12(d,J=7.4Hz,18H),1.00-0.89(m,6H),0.84(s,9H),0.050.05(d,J=6.0Hz,6H));ES+=2.53min,无质量。
(v)(9H-芴-9-基)甲基((S)-1-(((S)-1-((4-((S)-1-(2-((叔丁氧基羰基)氨基)-5-甲氧基-4-((三异丙基甲硅烷基)氧基)苯甲酰基)-5-(羟甲基)-4,5-二氢-1H-吡咯-3-基)苯基)氨基)-1-氧代丙烷-2-基)氨基)-3-甲基-1-氧代丁烷-2-基)氨基甲酸酯(24)
将甲硅烷基醚23(13.2g,11.8mmol)溶解在AcOH/H2O/MeOH/THF的7∶2∶1∶1混合物(220mL)中,并在室温下搅拌混合物直到反应完成(过夜)。真空除去挥发物并将残留取出至EtOAc(400mL)中。在用MgSO4干燥之前,用饱和NaHCO3(溶液)(200mL)、H2O(200mL)和盐水(10mL)洗涤合并的有机相,真空过滤并浓缩。通过硅胶色谱(己烷/EtOAC;50∶50至0∶100)纯化粗材料,并分离浅黄色泡沫状纯产物24(11.168g,94%产率)。1H-NMR(400MHz,CDCl3)δ8.45(s,1H),7.93(s,1H),7.74(d,J=7.4Hz,2H),7.64(s,1H),7.52(dd,J=17.9,8.9Hz,4H),7.39(t,J=7.4Hz,2H),7.33-7.26(m,3H),7.13(d,J=7.4Hz,2H),6.81(s,1H),6.45(s,1H),5.26(s,1H),4.84(s,1H),4.69-4.58(m,1H),4.47(d,J=6.2Hz,2H),4.43(s,1H),4.17(d,J=14.2Hz,1H),3.99(s,1H),3.89(s,2H),3.74(s,3H),3.30-3.17(m,1H),2.64(d,J=16.9Hz,1H),2.23-2.09(m,1H),1.44(s,9H),1.44(d,J=10.9Hz,2H),1.29(ddd,J=14.3,13.0,7.4Hz,3H),1.12(d,J=7.4Hz,18H),0.92(m,6H);ES+=2.23min,无质量。
(vi)叔丁基(11S)-2-(4-((S)-2-((S)-2-((((9H-芴-9-基)甲氧基)羰基)氨基)-3-甲基丁氨基)丙氨基)苯基)-11-羟基-7-甲氧基-5-氧代-8-((三异丙基甲硅烷基)氧基)-11,11a-二氢-1H-苯并[e]吡咯并[1,2-a][1,4]二氮杂卓-10(5H)-羧酸酯(25)
在-78℃下将DMSO(1.55L,21.9mmol)添加至草酰氯(0.89mL,10.5mmol)在CH2Cl2(50mL)中的冷却溶液中。15分钟之后,将醇24(8.8mg,8.76mmol)在CH2Cl2(100mL)中的溶液逐滴添加至氧化混合物中。在添加NEt3(6.11mL,43.8mmol)之前,在-78℃下搅拌反应1小时,并使混合物升温至室温。完成时,用CH2Cl2(100mL)稀释反应混合物,并用0.1M HCl(溶液)(250mL)、H2O(250mL)、饱和NaHCO3(溶液)(250mL)和盐水(200mL)洗涤溶液。用MgSO4干燥有机物、过滤并真空除去挥发物。通过硅胶色谱(CH2Cl2/EtOAc;100%至50∶50)纯化粗材料以提供黄色油状纯25(8.8mg,100%)。1H-NMR(400MHz,CDCl3)δ8.71(s,1H),7.74(t,J=8.4Hz,3H),7.52(d,J=7.4Hz,5H),7.43-7.33(m,4H),7.23-7.17(m,2H),6.69(s,1H),6.42(d,J=7.9Hz,1H),5.78(d,J=7.8Hz,1H),5.62(s,1H),5.23(d,J=7.7Hz,1H),4.84-4.69(m,1H),4.65(d,J=22.5Hz,1H),4.45-4.29(m,2H),3.91(dd,J=11.3,8.1Hz,1H),3.86(s,3H),3.28(q,J=11.9Hz,1H),2.98(t,J=12.6Hz,1H),2.14(dd,J=12.9,10.0Hz,1H),1.52-1.42(m,3H),1.38(s,9H),1.26(m,3H),1.16-1.05(m,18H),0.93(d,J=6.0Hz,6H);ES+=2.19min,无质量。
(vii)叔丁基(11S)-2-(4-((S)-2-((S)-2-((((9H-芴-9-基)甲氧基)羰基)氨基)-3-甲基丁氨基)丙氨基)苯基)-11-((叔丁基二甲基甲硅烷基)氧基)-7-甲氧基-5-氧代-8-((三异丙基甲硅烷基)氧基)-11,11a-二氢-1H-苯并[e]吡咯并[1,2-a][1,4]二氮杂卓-10(5H)-羧酸酯(26)
在之前用氩气冲洗三次的密封圆底烧瓶中,将醇25(8.8g,8.78mmol)溶解在干燥的CH2Cl2(150mL)中。将溶液冷却至0℃,随后添加二甲基吡啶(4mL,35.1mmol)和TBS-OTf(6mL,26.3mmol)。将反应混合物升温至室温并搅拌直到完成(由LCMS监控)。完成时,用CH2Cl2(100mL)稀释,用饱和NH4Cl(溶液)、H2O(100mL)、饱和NaHCO3(溶液)(100mL)和盐水(100mL)洗涤溶液。用MgSO4干燥有机物,过滤并真空除去挥发物。通过硅胶色谱(己烷/EtOAc:100%至80∶20)纯化粗材料以提供无色油状纯26(6.18mg,70%)。1H-NMR(400MHz,CDCl3)δ8.40(s,1H),7.76(d,J=7.5Hz,2H),7.55(dd,J=13.0,6.7Hz,4H),7.40(t,J=7.3Hz,4H),7.33-7.27(m,3H),7.21(s,1H),6.67(s,1H),6.49(s,1H),5.87(d,J=8.8Hz,1H),5.30(d,J=5.7Hz,1H),4.71-4.59(m,1H),4.48(d,J=6.8Hz,2H),4.20(t,J=6.7Hz,1H),4.04-3.96(m,1H),3.86(s,3H),3.84-3.77(m,1H),3.25(m,1H),2.79(d,J=1.5Hz,1H),2.26-2.11(m,1H),1.46(d,J=6.9Hz,3H),1.33(s,9H),1.27(dd,J=17.1,9.7Hz,3H),1.11(dd,J=7.4,4.0Hz,18H),0.93(s,6H),0.89(s,9H),0.27(s,3H),0.22(s,3H);ES+=2.55min,m/z116.30[M+H]+.
(viii)叔丁基(11S)-2-(4-((S)-2-((S)-2-((((9H-芴-9-基)甲氧基)羰基)氨基)-3-甲基丁氨基)丙氨基)苯基)-11-((叔丁基二甲基甲硅烷基)氧基)-8-羟基-7-甲氧基-5-氧代-11,11a-二氢-1H-苯并[e]吡咯并[1,2-a][1,4]二氮杂卓-10(5H)-羧酸酯(27)
在添加LiOAc(91mg,0.89mmol)之前,将单体26(1g,0.89mmol)溶解在湿润的DMF(5mL+0.5mL H2O)中,并在室温下搅拌混合物直到完成(约3h,随后是LCMS)。随后用EtOAc(50mL)稀释混合物,用柠檬酸(溶液)(pH=3,40mL)淬灭,然后用H2O(50mL)和盐水(50mL)洗涤。用MgSO4干燥有机层,过滤并真空除去挥发物。通过硅胶色谱(己烷/EtOAc/MeOH;60∶40∶0至60∶30∶10)纯化粗产物,并分离膏状固体纯产物27(675mg,78%产率)。1H-NMR(400MHz,CDCl3)δ8.36(s,1H),7.76(d,J=7.6Hz,2H),7.55(dd,J=16.0,7.5Hz,4H),7.40(t,J=7.4Hz,4H),7.30(ddd,J=14.7,7.4,1.1Hz,3H),7.24(s,1H),6.72(s,1H),6.38(d,J=5.3Hz,1H),5.87(s,1H),5.23(d,J=6.2Hz,1H),4.69-4.57(m,1H),4.49(d,J=6.6Hz,2H),4.20(t,J=5.3Hz,1H),4.04-3.96(m,1H),3.96(s,3H),3.87(dd,J=10.1,3.5Hz,1H),3.29(dd,J=18.0,8.5Hz,1H),2.80(d,J=19.4Hz,1H),2.24-2.08(m,1H),1.46(d,J=10.5Hz,3H),1.33(s,9H),1.00-0.91(m,6H),0.90(s,9H),0.25(d,J=8.6Hz,6H).;ES+=2.08min,m/z 960.35[M+H]+.
(b)(S)-8-((5-碘代戊烷基)氧基)-7-甲氧基-1,2,3,11a-四氢-5H-吡咯并[2,1-
c][1,4]苯并二氮杂卓-5,11(10H)-二酮(33)
(i)(S)-8-(苄氧基)-7-甲氧基-1,2,3,11a-四氢-5H-吡咯并[2,1-c][1,4]苯并二氮杂卓-5,11(10H)-二酮(30)
在密封的小瓶中,在150℃下在微波辐射下加热苄基靛红酸酐28(1.34g,4.48mmol,1.0eq.)和L-脯氨酸29(0.705g,6.12mmol,1.36eq.)在无水DMSO(20mL)中的悬浮液,并搅拌12分钟。将得到的黄色溶液冷却至室温,并倒在冰上。通过过滤收集沉淀产物,将其溶解在DCM(200mL)中并用饱和NaCl溶液(200mL)洗涤溶液,干燥(MgSO4)以及在减压下蒸发以产生黄色固体产物30(1.35g,85%)。分析数据:RT 1.39min;MS(ES+)m/z(相对强度)353([M+H]+.,100)。
(ii)(S)-8-羟基-7-甲氧基-1,2,3,11a-四氢-5H-吡咯并[2,1-c][1,4]苯并二氮杂卓-5,11(10H)-二酮(31)
将碳(0.27g,20wt.%)上的10%钯在乙酸乙酯(10mL)中的浆料添加至苄基二内酰胺30(1.35g,3.8mmol)在乙醇(60mL)、乙酸乙酯(40mL)和DMF(5mL)的混合物中的悬浮液中。在45psi下氢化混合物2小时。通过C盐过滤反应混合物并在减压下蒸发溶剂以产生粘性胶体。用二乙醚(50mL)超声处理胶体并通过过滤收集得到的产物。这产生期望的米色粉末产物31(0.86g,85%)。分析数据:RT 1.02min;MS(ES+)m/z(相对强度)263([M+H]+.,100)。
(iii)(S)-8-((5-碘代戊烷基)氧基)-7-甲氧基-1,2,3,11a-四氢-5H-吡咯并[2,1-c][1,4]苯并二氮杂卓-5,11(10H)-二酮(33)
在用氩气吹扫的烧瓶中,将K2CO3(320mg,1.5mmol)和1,5-二碘代戊烷32(1.1mL,7.6mmol)添加至31(400mg,1.5mmol)在无水DMF(4mL)中的溶液中。将反应混合物加热至60℃直到完成(30分钟)。用CH2Cl2(50mL)稀释溶液,并在用MgSO4干燥有机物之前用H2O(50mL)和盐水(50mL)洗涤,过滤并真空除去挥发物。通过硅胶柱色谱(己烷/EtOAc;100%至3:7)纯化粗材料以提供浅棕色泡沫状胶体纯产物33(611mg,87%产率)。分析数据:RT 1.51min;MS(ES+)m/z(相对强度)458.95([M+H]+.,100)。
(c)1-(3-(2,5-二氧代-2,5-二氢-1H-吡咯-1-基)丙氨基)-N-((S)-1-(((S)-1-
((4-((S)-7-甲氧基-8-((5-(((S)-7-甲氧基-5,11-二氧代-2,3,5,10,11,11a-六氢-1H-吡
咯并[2,1-c][1,4]苯并二氮杂卓-8-基)氧基)戊烷基)氧基)-5-氧代-5,11a-二氢-1H-吡咯
并[2,1-c][1,4]苯并二氮杂卓-2-基)苯基)氨基)-1-氧代丙烷-2-基)氨基)-3-甲基-1-氧
代丁烷-2-基)-3,6,9,12,15,18,21-七氧杂二十四烷-24-酰胺(36)
(i)叔丁基(11S,11aS)-2-(4-((S)-2-((S)-2-氨基-3-甲基丁氨基)丙氨基)苯基)-11-((叔丁基二甲基甲硅烷基)氧基)-7-甲氧基-8-((5-(((S)-7-甲氧基-5,11-二氧代-2,3,5,10,11,11a-六氢-1H-吡咯并[2,1-c][1,4]苯并二氮杂卓-8-基)氧基)戊烷基)氧基)-5-氧代-11,11a-二氢-1H-吡咯并[2,1-c][1,4]苯并二氮杂卓-10(5H)-羧酸酯(34)
在用氩气吹扫的烧瓶中,将K2CO3(115mg,0.545mmol)添加至33(250mg,0.545mmol)和27(570mg,0.6mmol)在无水DMF(4mL)的溶液中,并将混合物加热至60℃直到反应完成(45分钟)。用CH2Cl2(50mL)稀释溶液,并在用MgSO4干燥有机物之前用H2O(50mL)和盐水(50mL)洗涤,过滤并真空除去挥发物。通过硅胶柱色谱(CHCl3/MeOH;100%至95:5)纯化粗材料以提供白色泡沫状纯产物34(337mg,58%产率)。分析数据:RT 1.42min;MS(ES+)m/z(相对强度)1069.05([M+H]+.,80)。
(ii)叔丁基(11S)-11-((叔丁基二甲基甲硅烷基)氧基)-2-(4-((2S,5S)-37-(2,5-二氧代-2,5-二氢-1H-吡咯-1-基)-5-异丙基-2-甲基-4,7,35-三氧代-10,13,16,19,22,25,28,31-八氧杂-3,6,34-三氮杂七三十烷氨基)苯基)-7-甲氧基-8-((5-(((S)-7-甲氧基-5,11-二氧代-2,3,5,10,11,11a-六氢-1H-苯并[e]吡咯并[1,2-a][1,4]二氮杂卓-8-基)氧基)戊烷基)氧基)-5-氧代-11,11a-二氢-1H-苯并[e]吡咯并[1,2-a][1,4]二氮杂卓-10(5H)-羧酸酯(35)
将PEG部分(186mg,0.31mmol)和EDCI.HCl(60mg,0.31mmol)添加至34(337mg,0.31mmol)在干燥CH2Cl2(5mL)的溶液中。在氩气气氛下在室温下搅拌混合物直到完成。随后用CH2Cl2(50mL)稀释混合物,并在真空除去挥发物之前用H2O(50mL)和盐水(50mL)洗涤。。通过硅胶柱色谱(CHCl3/MeOH,100%至95:5)纯化粗材料以提供浅黄色泡沫状纯35(408.8mg,58%产率)。分析数据:RT 1.75min;MS(ES+)m/z(相对强度)1643.15([M+H]+.,10)822.25([M+2H]2+,100)。
(iii)1-(3-(2,5-二氧代-2,5-二氢-1H-吡咯-1-基)丙氨基)-N-((S)-1-(((S)-1-((4-((S)-7-甲氧基-8-((5-(((S)-7-甲氧基-5,11-二氧代-2,3,5,10,11,11a-1H-吡咯并[2,1-c][1,4]苯并二氮杂卓-8-基)氧基)戊烷基)氧基)-5-氧代-5,11a-二氢-1H-吡咯并[2,1-c][1,4]苯并二氮杂卓-2-基)苯基)氨基)-1-氧代丙烷-2-基)氨基)-3-甲基-1-氧代丁烷-2-基)-3,6,9,12,15,18,21-七氧杂四十烷-24-酰胺(36)
随后添加H2O(160μL)和TFA(3.5mL)至冷却至0℃的包含35(400mg,0.24mmol)的烧瓶中。在用冰冷的NaHCO3(50mL)淬灭之前搅拌混合物直到完成,以及用CH2Cl2(50mL+25mL)萃取。然后用盐水(25mL)洗涤有机物,用MgSO4干燥,过滤并真空除去挥发物以产生无进一步纯化的粗产物36。分析数据:RT 1.40min;MS(ES+)m/z(相对强度)1410.60([M+H]+.,5)706.10([M+2H]2+,100)。
序列表
<110> 麦迪穆有限责任公司
<120> 吡咯并苯并二氮杂卓
<130> RJW/FP6955306
<140> PCT/EP2013/077695
<141> 2013-12-20
<150> US 61/740,605
<151> 2012-12-21
<160> 233
<170> PatentIn version 3.3
<210> 1
<211> 21
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: A20FMDV-Cys 多肽
<400> 1
Asn Ala Val Pro Asn Leu Arg Gly Asp Leu Gln Val Leu Ala Gln Lys
1 5 10 15
Val Ala Arg Thr Cys
20
<210> 2
<211> 21
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: A20FMDV-Cys 多肽
<220>
<221> VARIANT
<222> (4)..(18)
<223> Xaa是任何氨基酸残基
<400> 2
Asn Ala Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
1 5 10 15
Xaa Xaa Arg Thr Cys
20
<210> 3
<211> 137
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗整合蛋白 ανβ6, RHAB6.2
<400> 3
Gln Val Gln Leu Val Gln Ser Gly Ser Glu Leu Lys Lys Pro Gly Ala
1 5 10 15
Ser Val Lys Ile Ser Cys Lys Ala Ser Gly Phe Ala Phe Thr Asp Ser
20 25 30
Tyr Met His Trp Val Arg Gln Ala Pro Gly Gln Gly Leu Glu Trp Met
35 40 45
Gly Trp Ile Asp Pro Glu Asn Gly Asp Thr Glu Tyr Ala Pro Lys Phe
50 55 60
Gln Gly Arg Phe Val Phe Ser Leu Asp Thr Ser Val Ser Thr Ala Tyr
65 70 75 80
Leu Gln Ile Ser Ser Leu Lys Ala Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Thr Arg Gly Thr Pro Thr Ala Val Pro Asn Leu Arg Gly Asp Leu Gln
100 105 110
Val Leu Ala Gln Lys Val Ala Gly Pro Tyr Pro Phe Asp Tyr Trp Gly
115 120 125
Gln Gly Thr Leu Val Thr Val Ser Ser
130 135
<210> 4
<211> 137
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗整合蛋白 ανβ6, RHCB6.2
<400> 4
Gln Val Gln Leu Val Gln Ser Gly Ala Glu Val Lys Lys Pro Gly Ala
1 5 10 15
Ser Val Lys Val Ser Cys Lys Ala Ser Gly Tyr Thr Phe Ile Asp Ser
20 25 30
Tyr Met His Trp Val Arg Gln Ala Pro Gly Gln Arg Leu Glu Trp Met
35 40 45
Gly Trp Ile Asp Pro Glu Asn Gly Asp Thr Glu Tyr Ala Pro Lys Phe
50 55 60
Gln Gly Arg Val Thr Ile Thr Thr Asp Thr Ser Ala Ser Thr Ala Tyr
65 70 75 80
Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Ala Arg Gly Thr Pro Thr Ala Val Pro Asn Leu Arg Gly Asp Leu Gln
100 105 110
Val Leu Ala Gln Lys Val Ala Gly Pro Tyr Pro Phe Asp Tyr Trp Gly
115 120 125
Gln Gly Thr Leu Val Thr Val Ser Ser
130 135
<210> 5
<211> 120
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗整合蛋白 ανβ6, RHF
<400> 5
Gln Val Gln Leu Val Gln Ser Gly Ala Glu Val Lys Lys Pro Gly Ala
1 5 10 15
Ser Val Lys Val Ser Cys Lys Ala Ser Gly Phe Asn Phe Ile Asp Ser
20 25 30
Tyr Met His Trp Val Arg Gln Ala Pro Gly Gln Arg Leu Glu Trp Met
35 40 45
Gly Trp Ile Asp Pro Glu Asn Gly Asp Thr Glu Tyr Ala Pro Lys Phe
50 55 60
Gln Gly Arg Val Thr Phe Thr Thr Asp Thr Ser Ala Ser Thr Ala Tyr
65 70 75 80
Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Asn Glu Gly Thr Pro Thr Gly Pro Tyr Tyr Phe Asp Tyr Trp Gly Gln
100 105 110
Gly Thr Leu Val Thr Val Ser Ser
115 120
<210> 6
<211> 137
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗整合蛋白 ανβ6, RHFB6
<400> 6
Gln Val Gln Leu Val Gln Ser Gly Ala Glu Val Lys Lys Pro Gly Ala
1 5 10 15
Ser Val Lys Val Ser Cys Lys Ala Ser Gly Phe Asn Phe Ile Asp Ser
20 25 30
Tyr Met His Trp Val Arg Gln Ala Pro Gly Gln Arg Leu Glu Trp Met
35 40 45
Gly Trp Ile Asp Pro Glu Asn Gly Asp Thr Glu Tyr Ala Pro Lys Phe
50 55 60
Gln Gly Arg Val Thr Phe Thr Thr Asp Thr Ser Ala Ser Thr Ala Tyr
65 70 75 80
Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Asn Glu Gly Thr Pro Thr Ala Val Pro Asn Leu Arg Gly Asp Leu Gln
100 105 110
Val Leu Ala Gln Lys Val Ala Gly Pro Tyr Tyr Phe Asp Tyr Trp Gly
115 120 125
Gln Gly Thr Leu Val Thr Val Ser Ser
130 135
<210> 7
<211> 120
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗整合蛋白 ανβ6, RHAY100bP
<400> 7
Gln Val Gln Leu Val Gln Ser Gly Ser Glu Leu Lys Lys Pro Gly Ala
1 5 10 15
Ser Val Lys Ile Ser Cys Lys Ala Ser Gly Phe Ala Phe Thr Asp Ser
20 25 30
Tyr Met His Trp Val Arg Gln Ala Pro Gly Gln Gly Leu Glu Trp Met
35 40 45
Gly Trp Ile Asp Pro Glu Asn Gly Asp Thr Glu Tyr Ala Pro Lys Phe
50 55 60
Gln Gly Arg Phe Val Phe Ser Leu Asp Thr Ser Val Ser Thr Ala Tyr
65 70 75 80
Leu Gln Ile Ser Ser Leu Lys Ala Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Thr Arg Gly Thr Pro Thr Gly Pro Tyr Pro Phe Asp Tyr Trp Gly Gln
100 105 110
Gly Thr Leu Val Thr Val Ser Ser
115 120
<210> 8
<211> 106
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗整合蛋白 ανβ6, RKF
<400> 8
Glu Asn Val Leu Thr Gln Ser Pro Gly Thr Leu Ser Leu Ser Pro Gly
1 5 10 15
Glu Arg Ala Thr Leu Ser Cys Ser Ala Ser Ser Ser Val Ser Tyr Met
20 25 30
His Trp Phe Gln Gln Lys Pro Gly Gln Ala Pro Arg Leu Leu Ile Tyr
35 40 45
Ser Thr Ser Asn Leu Ala Ser Gly Ile Pro Asp Arg Phe Ser Gly Ser
50 55 60
Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Arg Leu Glu Pro Glu
65 70 75 80
Asp Phe Ala Val Tyr Tyr Cys Gln Gln Arg Ser Ser Tyr Pro Leu Thr
85 90 95
Phe Gly Gly Gly Thr Lys Val Glu Ile Lys
100 105
<210> 9
<211> 106
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗整合蛋白 ανβ6, RKFL36L50
<400> 9
Glu Asn Val Leu Thr Gln Ser Pro Gly Thr Leu Ser Leu Ser Pro Gly
1 5 10 15
Glu Arg Ala Thr Leu Ser Cys Ser Ala Ser Ser Ser Val Ser Tyr Met
20 25 30
His Trp Leu Gln Gln Lys Pro Gly Gln Ala Pro Arg Leu Leu Ile Tyr
35 40 45
Leu Thr Ser Asn Leu Ala Ser Gly Ile Pro Asp Arg Phe Ser Gly Ser
50 55 60
Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Arg Leu Glu Pro Glu
65 70 75 80
Asp Phe Ala Val Tyr Tyr Cys Gln Gln Arg Ser Ser Tyr Pro Leu Thr
85 90 95
Phe Gly Gly Gly Thr Lys Val Glu Ile Lys
100 105
<210> 10
<211> 106
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗整合蛋白 ανβ6, RKC
<400> 10
Glu Ile Val Leu Thr Gln Ser Pro Gly Thr Leu Ser Leu Ser Pro Gly
1 5 10 15
Glu Arg Ala Thr Leu Ser Cys Ser Ala Ser Ser Ser Val Ser Tyr Met
20 25 30
His Trp Phe Gln Gln Lys Pro Gly Gln Ala Pro Arg Leu Leu Ile Tyr
35 40 45
Ser Thr Ser Asn Leu Ala Ser Gly Ile Pro Asp Arg Phe Ser Gly Ser
50 55 60
Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Arg Leu Glu Pro Glu
65 70 75 80
Asp Phe Ala Val Tyr Tyr Cys Gln Gln Arg Ser Ser Tyr Pro Leu Thr
85 90 95
Phe Gly Gly Gly Thr Lys Val Glu Ile Lys
100 105
<210> 11
<211> 116
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗CD33, CD33 Hum195 VH
<400> 11
Gln Val Gln Leu Val Gln Ser Gly Ala Glu Val Lys Lys Pro Gly Ser
1 5 10 15
Ser Val Lys Val Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Asp Tyr
20 25 30
Asn Met His Trp Val Arg Gln Ala Pro Gly Gln Gly Leu Glu Trp Ile
35 40 45
Gly Tyr Ile Tyr Pro Tyr Asn Gly Gly Thr Gly Tyr Asn Gln Lys Phe
50 55 60
Lys Ser Lys Ala Thr Ile Thr Ala Asp Glu Ser Thr Asn Thr Ala Tyr
65 70 75 80
Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Ala Arg Gly Arg Pro Ala Met Asp Tyr Trp Gly Gln Gly Thr Leu Val
100 105 110
Thr Val Ser Ser
115
<210> 12
<211> 111
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗CD33, CD33 Hum195 VK
<400> 12
Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly
1 5 10 15
Asp Arg Val Thr Ile Thr Cys Arg Ala Ser Glu Ser Val Asp Asn Tyr
20 25 30
Gly Ile Ser Phe Met Asn Trp Phe Gln Gln Lys Pro Gly Lys Ala Pro
35 40 45
Lys Leu Leu Ile Tyr Ala Ala Ser Asn Gln Gly Ser Gly Val Pro Ser
50 55 60
Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser
65 70 75 80
Ser Leu Gln Pro Asp Asp Phe Ala Thr Tyr Tyr Cys Gln Gln Ser Lys
85 90 95
Glu Val Pro Trp Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys
100 105 110
<210> 13
<211> 120
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗CD19, CD19 B4表面重建的VH
<400> 13
Gln Val Gln Leu Val Gln Pro Gly Ala Glu Val Val Lys Pro Gly Ala
1 5 10 15
Ser Val Lys Leu Ser Cys Lys Thr Ser Gly Tyr Thr Phe Thr Ser Asn
20 25 30
Trp Met His Trp Val Lys Gln Arg Pro Gly Gln Gly Leu Glu Trp Ile
35 40 45
Gly Glu Ile Asp Pro Ser Asp Ser Tyr Thr Asn Tyr Asn Gln Asn Phe
50 55 60
Lys Gly Lys Ala Lys Leu Thr Val Asp Lys Ser Thr Ser Thr Ala Tyr
65 70 75 80
Met Glu Val Ser Ser Leu Arg Ser Asp Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Ala Arg Gly Ser Asn Pro Tyr Tyr Tyr Ala Met Asp Tyr Trp Gly Gln
100 105 110
Gly Thr Ser Val Thr Val Ser Ser
115 120
<210> 14
<211> 104
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗CD19, CD19 B4表面重建的VK
<400> 14
Glu Ile Val Leu Thr Gln Ser Pro Ala Ile Met Ser Ala Ser Pro Gly
1 5 10 15
Glu Arg Val Thr Met Thr Cys Ser Ala Ser Ser Gly Val Asn Tyr Met
20 25 30
His Trp Tyr Gln Gln Lys Pro Gly Thr Ser Pro Arg Arg Trp Ile Tyr
35 40 45
Asp Thr Ser Lys Leu Ala Ser Gly Val Pro Ala Arg Phe Ser Gly Ser
50 55 60
Gly Ser Gly Thr Ser Tyr Ser Leu Thr Ile Ser Ser Met Glu Pro Glu
65 70 75 80
Asp Ala Ala Thr Tyr Tyr Cys His Gln Arg Gly Ser Tyr Thr Phe Gly
85 90 95
Gly Gly Thr Lys Leu Glu Ile Lys
100
<210> 15
<211> 120
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗Her2, 赫塞汀 VH 链
<400> 15
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Asn Ile Lys Asp Thr
20 25 30
Tyr Ile His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45
Ala Arg Ile Tyr Pro Thr Asn Gly Tyr Thr Arg Tyr Ala Asp Ser Val
50 55 60
Lys Gly Arg Phe Thr Ile Ser Ala Asp Thr Ser Lys Asn Thr Ala Tyr
65 70 75 80
Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Ser Arg Trp Gly Gly Asp Gly Phe Tyr Ala Met Asp Tyr Trp Gly Gln
100 105 110
Gly Thr Leu Val Thr Val Ser Ser
115 120
<210> 16
<211> 107
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗Her2, 赫塞汀 VL链
<400> 16
Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly
1 5 10 15
Asp Arg Val Thr Ile Thr Cys Arg Ala Ser Gln Asp Val Asn Thr Ala
20 25 30
Val Ala Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu Ile
35 40 45
Tyr Ser Ala Ser Phe Leu Tyr Ser Gly Val Pro Ser Arg Phe Ser Gly
50 55 60
Ser Arg Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser Leu Gln Pro
65 70 75 80
Glu Asp Phe Ala Thr Tyr Tyr Cys Gln Gln His Tyr Thr Thr Pro Pro
85 90 95
Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys
100 105
<210> 17
<211> 104
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗CD25, 舒莱 VK (也称为
巴利昔单抗)
<400> 17
Gln Ile Val Ser Thr Gln Ser Pro Ala Ile Met Ser Ala Ser Pro Gly
1 5 10 15
Glu Lys Val Thr Met Thr Cys Ser Ala Ser Ser Ser Arg Ser Tyr Met
20 25 30
Gln Trp Tyr Gln Gln Lys Pro Gly Thr Ser Pro Lys Arg Trp Ile Tyr
35 40 45
Asp Thr Ser Lys Leu Ala Ser Gly Val Pro Ala Arg Phe Ser Gly Ser
50 55 60
Gly Ser Gly Thr Ser Tyr Ser Leu Thr Ile Ser Ser Met Glu Ala Glu
65 70 75 80
Asp Ala Ala Thr Tyr Tyr Cys His Gln Arg Ser Ser Tyr Thr Phe Gly
85 90 95
Gly Gly Thr Lys Leu Glu Ile Lys
100
<210> 18
<211> 115
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗CD25, 舒莱 VH
<400> 18
Gln Leu Gln Gln Ser Gly Thr Val Leu Ala Arg Pro Gly Ala Ser Val
1 5 10 15
Lys Met Ser Cys Lys Ala Ser Gly Tyr Ser Phe Thr Arg Tyr Trp Met
20 25 30
His Trp Ile Lys Gln Arg Pro Gly Gln Gly Leu Glu Trp Ile Gly Ala
35 40 45
Ile Tyr Pro Gly Asn Ser Asp Thr Ser Tyr Asn Gln Lys Phe Glu Gly
50 55 60
Lys Ala Lys Leu Thr Ala Val Thr Ser Ala Ser Thr Ala Tyr Met Glu
65 70 75 80
Leu Ser Ser Leu Thr His Glu Asp Ser Ala Val Tyr Tyr Cys Ser Arg
85 90 95
Asp Tyr Gly Tyr Tyr Phe Asp Phe Trp Gly Gln Gly Thr Thr Leu Thr
100 105 110
Val Ser Ser
115
<210> 19
<211> 115
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 去免疫的VH '1
<400> 19
Glu Val Gln Leu Val Gln Ser Gly Pro Glu Val Lys Lys Pro Gly Ala
1 5 10 15
Thr Val Lys Ile Ser Cys Lys Thr Ser Gly Tyr Thr Phe Thr Glu Tyr
20 25 30
Thr Ile His Trp Val Lys Gln Ala Pro Gly Lys Gly Leu Glu Trp Ile
35 40 45
Gly Asn Ile Asn Pro Asn Asn Gly Gly Thr Thr Tyr Asn Gln Lys Phe
50 55 60
Glu Asp Lys Ala Thr Leu Thr Val Asp Lys Ser Thr Asp Thr Ala Tyr
65 70 75 80
Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Ala Ala Gly Trp Asn Phe Asp Tyr Trp Gly Gln Gly Thr Leu Leu Thr
100 105 110
Val Ser Ser
115
<210> 20
<211> 107
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 去免疫的VK '1
<400> 20
Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Thr Ser Val Gly
1 5 10 15
Asp Arg Val Thr Leu Thr Cys Lys Ala Ser Gln Asp Val Gly Thr Ala
20 25 30
Val Asp Trp Tyr Gln Gln Lys Pro Gly Pro Ser Pro Lys Leu Leu Ile
35 40 45
Tyr Trp Ala Ser Thr Arg His Thr Gly Ile Pro Ser Arg Phe Ser Gly
50 55 60
Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser Leu Gln Pro
65 70 75 80
Glu Asp Phe Ala Asp Tyr Tyr Cys Gln Gln Tyr Asn Ser Tyr Pro Leu
85 90 95
Thr Phe Gly Pro Gly Thr Lys Val Asp Ile Lys
100 105
<210> 21
<211> 116
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 去免疫的VH1 '5
<400> 21
Glu Val Lys Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Met Lys Leu Ser Cys Val Ala Ser Gly Phe Thr Phe Ser Asn Tyr
20 25 30
Trp Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45
Ala Glu Ile Arg Ser Gln Ser Asn Asn Phe Ala Thr His Tyr Ala Glu
50 55 60
Ser Val Lys Gly Arg Val Thr Ile Ser Arg Asp Asp Ser Lys Ser Ile
65 70 75 80
Val Tyr Leu Gln Met Asn Asn Leu Arg Ala Glu Asp Thr Gly Val Tyr
85 90 95
Tyr Cys Thr Arg Arg Trp Asn Asn Phe Trp Gly Gln Gly Thr Thr Val
100 105 110
Thr Val Ser Ser
115
<210> 22
<211> 116
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 去免疫的VH2 '5
<400> 22
Glu Val Lys Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Leu Lys Leu Ser Cys Val Ala Ser Gly Phe Thr Phe Ser Asn Tyr
20 25 30
Trp Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45
Ala Glu Ile Arg Ser Gln Ser Asn Asn Phe Ala Thr His Tyr Ala Glu
50 55 60
Ser Val Lys Gly Arg Val Thr Ile Ser Arg Asp Asp Ser Lys Ser Ile
65 70 75 80
Val Tyr Leu Gln Met Asn Asn Leu Arg Ala Glu Asp Thr Ala Val Tyr
85 90 95
Tyr Cys Thr Arg Arg Trp Asn Asn Phe Trp Gly Gln Gly Thr Thr Val
100 105 110
Thr Val Ser Ser
115
<210> 23
<211> 116
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 去免疫的VH3 '5
<400> 23
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Leu Lys Leu Ser Cys Val Ala Ser Gly Phe Thr Phe Ser Asn Tyr
20 25 30
Trp Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45
Ala Glu Ile Arg Ser Gln Ser Asn Asn Phe Ala Thr His Tyr Ala Glu
50 55 60
Ser Val Lys Gly Arg Val Thr Ile Ser Arg Asp Asp Ser Lys Ser Ile
65 70 75 80
Val Tyr Leu Gln Met Asn Asn Leu Arg Ala Glu Asp Thr Ala Val Tyr
85 90 95
Tyr Cys Thr Arg Arg Trp Asn Asn Phe Trp Gly Gln Gly Thr Thr Val
100 105 110
Thr Val Ser Ser
115
<210> 24
<211> 116
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 去免疫的VH4 '5
<400> 24
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Leu Lys Leu Ser Cys Val Ala Ser Gly Phe Thr Phe Ser Asn Tyr
20 25 30
Trp Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45
Ala Glu Ile Arg Ser Gln Ser Asn Asn Phe Ala Thr His Tyr Ala Glu
50 55 60
Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asp Ser Lys Ser Ile
65 70 75 80
Val Tyr Leu Gln Met Asn Asn Leu Arg Ala Glu Asp Thr Ala Val Tyr
85 90 95
Tyr Cys Thr Arg Arg Trp Asn Asn Phe Trp Gly Gln Gly Thr Thr Val
100 105 110
Thr Val Ser Ser
115
<210> 25
<211> 107
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 去免疫的VK1 '5
<400> 25
Asn Ile Val Met Thr Gln Phe Pro Ser Ser Met Ser Ala Ser Val Gly
1 5 10 15
Asp Arg Val Thr Ile Thr Cys Lys Ala Ser Glu Asn Val Gly Thr Tyr
20 25 30
Val Ser Trp Tyr Gln Gln Lys Pro Asp Gln Ser Pro Lys Met Leu Ile
35 40 45
Tyr Gly Ala Ser Asn Arg Phe Thr Gly Val Pro Asp Arg Phe Thr Gly
50 55 60
Ser Gly Ser Ala Thr Asp Phe Thr Leu Thr Ile Ser Ser Leu Gln Thr
65 70 75 80
Glu Asp Leu Ala Asp Tyr Tyr Cys Gly Gln Ser Tyr Thr Phe Pro Tyr
85 90 95
Thr Phe Gly Gln Gly Thr Lys Leu Glu Met Lys
100 105
<210> 26
<211> 107
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 去免疫的VK2 '5
<400> 26
Asn Ile Val Met Thr Gln Phe Pro Ser Ser Met Ser Ala Ser Val Gly
1 5 10 15
Asp Arg Val Thr Ile Thr Cys Lys Ala Ser Glu Asn Val Gly Thr Tyr
20 25 30
Val Ser Trp Tyr Gln Gln Lys Pro Asp Gln Ser Pro Lys Met Leu Ile
35 40 45
Tyr Gly Ala Ser Asn Arg Phe Thr Gly Val Pro Asp Arg Phe Ser Gly
50 55 60
Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser Leu Gln Ala
65 70 75 80
Glu Asp Leu Ala Asp Tyr Tyr Cys Gly Gln Ser Tyr Thr Phe Pro Tyr
85 90 95
Thr Phe Gly Gln Gly Thr Lys Leu Glu Ile Lys
100 105
<210> 27
<211> 107
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 去免疫的VK3 '5
<400> 27
Asn Ile Gln Met Thr Gln Phe Pro Ser Ala Met Ser Ala Ser Val Gly
1 5 10 15
Asp Arg Val Thr Ile Thr Cys Lys Ala Ser Glu Asn Val Gly Thr Tyr
20 25 30
Val Ser Trp Tyr Gln Gln Lys Pro Asp Gln Ser Pro Lys Met Leu Ile
35 40 45
Tyr Gly Ala Ser Asn Arg Phe Thr Gly Val Pro Asp Arg Phe Ser Gly
50 55 60
Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser Leu Gln Ala
65 70 75 80
Glu Asp Leu Ala Asp Tyr Tyr Cys Gly Gln Ser Tyr Thr Phe Pro Tyr
85 90 95
Thr Phe Gly Gln Gly Thr Lys Leu Glu Ile Lys
100 105
<210> 28
<211> 107
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 去免疫的VK4 '5
<400> 28
Asn Ile Gln Met Thr Gln Phe Pro Ser Ala Met Ser Ala Ser Val Gly
1 5 10 15
Asp Arg Val Thr Ile Thr Cys Lys Ala Ser Glu Asn Val Gly Thr Tyr
20 25 30
Val Ser Trp Tyr Gln Gln Lys Pro Asp Gln Ser Pro Lys Met Leu Ile
35 40 45
Tyr Gly Ala Ser Asn Arg Phe Thr Gly Val Pro Asp Arg Phe Ser Gly
50 55 60
Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser Leu Gln Ala
65 70 75 80
Glu Asp Glu Ala Asp Tyr Tyr Cys Gly Gln Ser Tyr Thr Phe Pro Tyr
85 90 95
Thr Phe Gly Gln Gly Thr Lys Leu Glu Ile Lys
100 105
<210> 29
<211> 107
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 去免疫的VK DI '5
<400> 29
Asn Ile Val Met Thr Gln Phe Pro Lys Ser Met Ser Ala Ser Ala Gly
1 5 10 15
Glu Arg Met Thr Leu Thr Cys Lys Ala Ser Glu Asn Val Gly Thr Tyr
20 25 30
Val Ser Trp Tyr Gln Gln Lys Pro Thr Gln Ser Pro Lys Met Leu Ile
35 40 45
Tyr Gly Ala Ser Asn Arg Phe Thr Gly Val Pro Asp Arg Phe Ser Gly
50 55 60
Ser Gly Ser Gly Thr Asp Phe Ile Leu Thr Ile Ser Ser Val Gln Ala
65 70 75 80
Glu Asp Leu Val Asp Tyr Tyr Cys Gly Gln Ser Tyr Thr Phe Pro Tyr
85 90 95
Thr Phe Gly Gly Gly Thr Lys Leu Glu Met Lys
100 105
<210> 30
<211> 116
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 去免疫的VH DI '5
<400> 30
Glu Val Lys Leu Glu Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Met Lys Ile Ser Cys Val Ala Ser Gly Phe Thr Phe Ser Asn Tyr
20 25 30
Trp Met Asn Trp Val Arg Gln Ser Pro Glu Lys Gly Leu Glu Trp Val
35 40 45
Ala Glu Ile Arg Ser Gln Ser Asn Asn Phe Ala Thr His Tyr Ala Glu
50 55 60
Ser Val Lys Gly Arg Val Ile Ile Ser Arg Asp Asp Ser Lys Ser Ser
65 70 75 80
Val Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr
85 90 95
Tyr Cys Thr Arg Arg Trp Asn Asn Phe Trp Gly Gln Gly Thr Thr Val
100 105 110
Thr Val Ser Ser
115
<210> 31
<211> 116
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 人源化RHA '5
<400> 31
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser Asn Tyr
20 25 30
Trp Met Asn Trp Val Arg Gln Ala Ser Gly Lys Gly Leu Glu Trp Val
35 40 45
Gly Glu Ile Arg Ser Gln Ser Asn Asn Phe Ala Thr His Tyr Ala Glu
50 55 60
Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asp Ser Lys Asn Thr
65 70 75 80
Ala Tyr Leu Gln Met Asn Ser Leu Lys Thr Glu Asp Thr Ala Val Tyr
85 90 95
Tyr Cys Thr Arg Arg Trp Asn Asn Phe Trp Gly Gln Gly Thr Thr Val
100 105 110
Thr Val Ser Ser
115
<210> 32
<211> 116
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 人源化RHB '5
<400> 32
Glu Val Lys Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser Asn Tyr
20 25 30
Trp Met Asn Trp Val Arg Gln Ala Ser Gly Lys Gly Leu Glu Trp Val
35 40 45
Ala Glu Ile Arg Ser Gln Ser Asn Asn Phe Ala Thr His Tyr Ala Glu
50 55 60
Ser Val Lys Gly Arg Val Ile Ile Ser Arg Asp Asp Ser Lys Asn Thr
65 70 75 80
Val Tyr Leu Gln Met Asn Ser Leu Arg Thr Glu Asp Thr Ala Val Tyr
85 90 95
Tyr Cys Thr Arg Arg Trp Asn Asn Phe Trp Gly Gln Gly Thr Thr Val
100 105 110
Thr Val Ser Ser
115
<210> 33
<211> 116
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 人源化RHC '5
<400> 33
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser Asn Tyr
20 25 30
Trp Met Asn Trp Val Arg Gln Ala Ser Gly Lys Gly Leu Glu Trp Val
35 40 45
Ala Glu Ile Arg Ser Gln Ser Asn Asn Phe Ala Thr His Tyr Ala Glu
50 55 60
Ser Val Lys Gly Arg Val Ile Ile Ser Arg Asp Asp Ser Lys Asn Thr
65 70 75 80
Val Tyr Leu Gln Met Asn Ser Leu Arg Thr Glu Asp Thr Ala Val Tyr
85 90 95
Tyr Cys Thr Arg Arg Trp Asn Asn Phe Trp Gly Gln Gly Thr Thr Val
100 105 110
Thr Val Ser Ser
115
<210> 34
<211> 116
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 人源化RHD '5
<400> 34
Glu Val Lys Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser Asn Tyr
20 25 30
Trp Met Asn Trp Val Arg Gln Ala Ser Gly Lys Gly Leu Glu Trp Val
35 40 45
Gly Glu Ile Arg Ser Gln Ser Asn Asn Phe Ala Thr His Tyr Ala Glu
50 55 60
Ser Val Lys Gly Arg Val Ile Ile Ser Arg Asp Asp Ser Lys Asn Thr
65 70 75 80
Val Tyr Leu Gln Met Asn Ser Leu Arg Thr Glu Asp Thr Ala Val Tyr
85 90 95
Tyr Cys Thr Arg Arg Trp Asn Asn Phe Trp Gly Gln Gly Thr Thr Val
100 105 110
Thr Val Ser Ser
115
<210> 35
<211> 116
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 人源化RHE '5
<400> 35
Glu Val Lys Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser Asn Tyr
20 25 30
Trp Met Asn Trp Val Arg Gln Ala Ser Gly Lys Gly Leu Glu Trp Val
35 40 45
Ala Glu Ile Arg Ser Gln Ser Asn Asn Phe Ala Thr His Tyr Ala Glu
50 55 60
Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asp Ser Lys Asn Thr
65 70 75 80
Val Tyr Leu Gln Met Asn Ser Leu Arg Thr Glu Asp Thr Ala Val Tyr
85 90 95
Tyr Cys Thr Arg Arg Trp Asn Asn Phe Trp Gly Gln Gly Thr Thr Val
100 105 110
Thr Val Ser Ser
115
<210> 36
<211> 116
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 人源化RHF '5
<400> 36
Glu Val Lys Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser Asn Tyr
20 25 30
Trp Met Asn Trp Val Arg Gln Ala Ser Gly Lys Gly Leu Glu Trp Val
35 40 45
Ala Glu Ile Arg Ser Gln Ser Asn Asn Phe Ala Thr His Tyr Ala Glu
50 55 60
Ser Val Lys Gly Arg Val Ile Ile Ser Arg Asp Asp Ser Lys Asn Thr
65 70 75 80
Ala Tyr Leu Gln Met Asn Ser Leu Arg Thr Glu Asp Thr Ala Val Tyr
85 90 95
Tyr Cys Thr Arg Arg Trp Asn Asn Phe Trp Gly Gln Gly Thr Thr Val
100 105 110
Thr Val Ser Ser
115
<210> 37
<211> 116
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 人源化RHG '5
<400> 37
Glu Val Lys Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser Asn Tyr
20 25 30
Trp Met Asn Trp Val Arg Gln Ala Ser Gly Lys Gly Leu Glu Trp Val
35 40 45
Ala Glu Ile Arg Ser Gln Ser Asn Asn Phe Ala Thr His Tyr Ala Glu
50 55 60
Ser Val Lys Gly Arg Val Ile Ile Ser Arg Asp Asp Ser Lys Asn Thr
65 70 75 80
Ala Tyr Leu Gln Met Asn Ser Leu Arg Thr Glu Asp Thr Ala Val Tyr
85 90 95
Tyr Cys Thr Arg Arg Trp Asn Asn Phe Trp Gly Gln Gly Thr Thr Val
100 105 110
Thr Val Ser Ser
115
<210> 38
<211> 107
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 人源化RKA '5
<400> 38
Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Val Ser Ala Ser Val Gly
1 5 10 15
Asp Arg Val Thr Ile Thr Cys Lys Ala Ser Glu Asn Val Gly Thr Tyr
20 25 30
Val Ser Trp Tyr Gln Gln Lys Pro Gly Thr Ala Pro Lys Leu Leu Ile
35 40 45
Tyr Gly Ala Ser Asn Arg Phe Thr Gly Val Pro Ser Arg Phe Ser Gly
50 55 60
Ser Gly Ser Ala Thr Asp Phe Thr Leu Thr Ile Asn Asn Leu Gln Pro
65 70 75 80
Glu Asp Phe Ala Thr Tyr Tyr Cys Gly Gln Ser Tyr Thr Phe Pro Tyr
85 90 95
Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys
100 105
<210> 39
<211> 107
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 人源化RKB '5
<400> 39
Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Val Ser Ala Ser Val Gly
1 5 10 15
Asp Arg Val Thr Ile Thr Cys Lys Ala Ser Glu Asn Val Gly Thr Tyr
20 25 30
Val Ser Trp Tyr Gln Gln Lys Pro Gly Thr Ala Pro Lys Leu Leu Ile
35 40 45
Tyr Gly Ala Ser Asn Arg Phe Thr Gly Val Pro Ser Arg Phe Ser Gly
50 55 60
Ser Gly Ser Ala Thr Asp Phe Thr Leu Thr Ile Asn Asn Leu Gln Pro
65 70 75 80
Glu Asp Phe Ala Thr Tyr Tyr Cys Gly Gln Ser Tyr Thr Phe Pro Tyr
85 90 95
Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys
100 105
<210> 40
<211> 107
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 人源化RKC '5
<400> 40
Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Val Ser Ala Ser Val Gly
1 5 10 15
Asp Arg Val Thr Ile Thr Cys Lys Ala Ser Glu Asn Val Gly Thr Tyr
20 25 30
Val Ser Trp Tyr Gln Gln Lys Pro Gly Thr Ala Pro Lys Met Leu Ile
35 40 45
Tyr Gly Ala Ser Asn Arg Phe Thr Gly Val Pro Ser Arg Phe Ser Gly
50 55 60
Ser Gly Ser Ala Thr Asp Phe Thr Leu Thr Ile Asn Asn Leu Gln Pro
65 70 75 80
Glu Asp Phe Ala Thr Tyr Tyr Cys Gly Gln Ser Tyr Thr Phe Pro Tyr
85 90 95
Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys
100 105
<210> 41
<211> 107
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 人源化RKD '5
<400> 41
Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Val Ser Ala Ser Val Gly
1 5 10 15
Asp Arg Val Thr Ile Thr Cys Lys Ala Ser Glu Asn Val Gly Thr Tyr
20 25 30
Val Ser Trp Tyr Gln Gln Lys Pro Gly Thr Ala Pro Lys Met Leu Ile
35 40 45
Tyr Gly Ala Ser Asn Arg Phe Thr Gly Val Pro Ser Arg Phe Ser Gly
50 55 60
Ser Gly Ser Ala Thr Asp Phe Thr Leu Thr Ile Asn Asn Leu Gln Pro
65 70 75 80
Glu Asp Phe Ala Thr Tyr Tyr Cys Gly Gln Ser Tyr Thr Phe Pro Tyr
85 90 95
Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys
100 105
<210> 42
<211> 107
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 人源化RKE '5
<400> 42
Asn Ile Val Met Thr Gln Ser Pro Ser Ser Val Ser Ala Ser Val Gly
1 5 10 15
Asp Arg Val Thr Ile Thr Cys Lys Ala Ser Glu Asn Val Gly Thr Tyr
20 25 30
Val Ser Trp Tyr Gln Gln Lys Pro Gly Thr Ala Pro Lys Leu Leu Ile
35 40 45
Tyr Gly Ala Ser Asn Arg Phe Thr Gly Val Pro Asp Arg Phe Thr Gly
50 55 60
Ser Gly Ser Ala Thr Asp Phe Ile Leu Thr Ile Asn Asn Leu Gln Pro
65 70 75 80
Glu Asp Phe Ala Thr Tyr Tyr Cys Gly Gln Ser Tyr Thr Phe Pro Tyr
85 90 95
Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys
100 105
<210> 43
<211> 107
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 人源化RKF '5
<400> 43
Asn Ile Val Met Thr Gln Ser Pro Ser Ser Val Ser Ala Ser Val Gly
1 5 10 15
Asp Arg Val Thr Ile Thr Cys Lys Ala Ser Glu Asn Val Gly Thr Tyr
20 25 30
Val Ser Trp Tyr Gln Gln Lys Pro Gly Thr Ala Pro Lys Met Leu Ile
35 40 45
Tyr Gly Ala Ser Asn Arg Phe Thr Gly Val Pro Ser Arg Phe Ser Gly
50 55 60
Ser Gly Ser Ala Thr Asp Phe Ile Leu Thr Ile Asn Asn Leu Gln Pro
65 70 75 80
Glu Asp Phe Ala Thr Tyr Tyr Cys Gly Gln Ser Tyr Thr Phe Pro Tyr
85 90 95
Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys
100 105
<210> 44
<211> 107
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 抗PSMA, 人源化RKG '5
<400> 44
Asn Ile Val Met Thr Gln Ser Pro Ser Ser Val Ser Ala Ser Val Gly
1 5 10 15
Asp Arg Val Thr Ile Thr Cys Lys Ala Ser Glu Asn Val Gly Thr Tyr
20 25 30
Val Ser Trp Tyr Gln Gln Lys Pro Gly Thr Ala Pro Lys Met Leu Ile
35 40 45
Tyr Gly Ala Ser Asn Arg Phe Thr Gly Val Pro Asp Arg Phe Thr Gly
50 55 60
Ser Gly Ser Ala Thr Asp Phe Thr Leu Thr Ile Asn Asn Leu Gln Pro
65 70 75 80
Glu Asp Phe Ala Thr Tyr Tyr Cys Gly Gln Ser Tyr Thr Phe Pro Tyr
85 90 95
Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys
100 105
<210> 45
<211> 5560
<212> DNA
<213> 智人
<400> 45
cgcggggcgc ggagtcggcg gggcctcgcg ggacgccggg cagtgcggag accgcggcgc 60
tgaggacgcg ggagccggga gcgcagccgc ggggtggagt tcagcctact ctttcttaga 120
tgtgaaagga aaggaagatc atttcatgcc ttgttgataa aggttcagac ttctgctgat 180
tcataaccat ttggctctga gctatgacaa gagaggaaac aaaaagttaa acaagcaagc 240
ctgccataag tgagaagcaa acttccttga taacatgctt ttgcgaagtg caggaaaatt 300
aaatgtgggc accaagaaag aggatggtga gagtacagcc cccacccccc gtccaaaggt 360
cttgcgttgt aaatgccacc accattgtcc agaagactca gtcaacaata tttgcagcac 420
agacggatat tgtttcacga tgatagaaga ggatgactct gggttgcctg tggtcacttc 480
tggttgccta ggactagaag gctcagattt tcagtgtcgg gacactccca ttcctcatca 540
aagaagatca attgaatgct gcacagaaag gaacgaatgt aataaagacc tacaccctac 600
actgcctcca ttgaaaaaca gagattttgt tgatggacct atacaccaca gggctttact 660
tatatctgtg actgtctgta gtttgctctt ggtccttatc atattatttt gttacttccg 720
gtataaaaga caagaaacca gacctcgata cagcattggg ttagaacagg atgaaactta 780
cattcctcct ggagaatccc tgagagactt aattgagcag tctcagagct caggaagtgg 840
atcaggcctc cctctgctgg tccaaaggac tatagctaag cagattcaga tggtgaaaca 900
gattggaaaa ggtcgctatg gggaagtttg gatgggaaag tggcgtggcg aaaaggtagc 960
tgtgaaagtg ttcttcacca cagaggaagc cagctggttc agagagacag aaatatatca 1020
gacagtgttg atgaggcatg aaaacatttt gggtttcatt gctgcagata tcaaagggac 1080
agggtcctgg acccagttgt acctaatcac agactatcat gaaaatggtt ccctttatga 1140
ttatctgaag tccaccaccc tagacgctaa atcaatgctg aagttagcct actcttctgt 1200
cagtggctta tgtcatttac acacagaaat ctttagtact caaggcaaac cagcaattgc 1260
ccatcgagat ctgaaaagta aaaacattct ggtgaagaaa aatggaactt gctgtattgc 1320
tgacctgggc ctggctgtta aatttattag tgatacaaat gaagttgaca taccacctaa 1380
cactcgagtt ggcaccaaac gctatatgcc tccagaagtg ttggacgaga gcttgaacag 1440
aaatcacttc cagtcttaca tcatggctga catgtatagt tttggcctca tcctttggga 1500
ggttgctagg agatgtgtat caggaggtat agtggaagaa taccagcttc cttatcatga 1560
cctagtgccc agtgacccct cttatgagga catgagggag attgtgtgca tcaagaagtt 1620
acgcccctca ttcccaaacc ggtggagcag tgatgagtgt ctaaggcaga tgggaaaact 1680
catgacagaa tgctgggctc acaatcctgc atcaaggctg acagccctgc gggttaagaa 1740
aacacttgcc aaaatgtcag agtcccagga cattaaactc tgataggaga ggaaaagtaa 1800
gcatctctgc agaaagccaa caggtactct tctgtttgtg ggcagagcaa aagacatcaa 1860
ataagcatcc acagtacaag ccttgaacat cgtcctgctt cccagtgggt tcagacctca 1920
cctctcaggg agcgacctgg gcaaagacag agaagctccc agaaggagag attgatccat 1980
gtctgtttgt aggacggaga aaccgcttgg gtaacttgtt caagatatga tgcatgttgc 2040
tttctaagaa agccctgtat tttgtgattg cctttttttt tttttaagat gctttcattt 2100
tgccaaaata aaacagataa tgtggatggt ttaagggtta tagtattata gtttaaataa 2160
taacaacaaa attcttccca ggaactctgc tggaaggtaa attaaaatac ttgtttttcc 2220
attggtaaaa tattgttgca ctctgtgaac caaaagacag tctaagttgg aggacataga 2280
acggaactca tcttaaacat actccccacc ccgtcttggc ctcctcagac cactttggcc 2340
atccctgcat ttggggccgc tatggtaatg tgaatgcact gggtacaaac accgcctgtc 2400
taggaccaca tttggaattc ctgcaggtgg ccttttgcag cttcaggcaa tatggaacaa 2460
atgaaggttt atgtgactct aatagaagta attgttgata ggtgttcttc agatccactt 2520
ctgtttctga ttgagttagg catctctttc atggtaaaac ccttttcatt aaacacaaag 2580
aaagcttttt tttttttttt tttttttttt tttttttaat gtgcagagga ttgacctgtg 2640
catgcttttg atctctcatt caaaggatca atattaaata aaattgtcat gagctgtgtt 2700
gaagacaggg tgctttcaaa tagaggtaat ttgctcttgt gttgtaagag gaacatgtca 2760
acaaagatag gaaatgaggg tgatcgtgca gatggcttgt atcttatata tgcaaaggag 2820
ccaatctcag aagcacaaag aaaaaagtgt gcatacctta ttttgtacag ataaagatga 2880
tgtctttttg ttattgtctg tctgttttgt atgtgtctga gataagggat agagaggaaa 2940
catccgtcag gctaatttaa ctacatttta ttttaaaaat agagaaacat aacctctaga 3000
tgggacagca gaggacagtt agtagaggcc acaaactgtt atgggctgct gtgttttgtt 3060
ctaaaatcaa tatggttgga gcatgtatat cttaggtgat catttcacat cttaggaatg 3120
cctactcatt ttattttatt ctagtgatgc tcaattcact atttaattta ttatattttc 3180
tcttctgtgg cacttataca aaatatctct tcacctactt agttctacag ggttttaact 3240
ttggagcaac atgaataaaa tcatcgagaa ggccaatatt gtttagcaac atgaatacaa 3300
tacagtttaa agttgtacac atcctgctca actttattca tatacatttc ctttctgtgg 3360
ttttcttttg cttcttagaa attctgttag tggttagtaa agaatttgaa agtactttct 3420
ccttgctgtt tttttttttt tttaagacat tcctcccaga atactccagg gggcagtgtt 3480
ttataacaca ttttccccac tgggtgattg aaggatggag gatttttgaa aatttgacag 3540
ctacatgaaa catgagaaaa cattttcctc acttctgaag tcggtttgca gctggtaact 3600
tgttcatcca gaaaacattc taaagcaatg agactttgtg agctgtgctt acagtttggg 3660
agaatcatga agattctttc tatattttgc atttacttcc cagtgcttca tagctgcatt 3720
ttgtttgtaa ctaagacaga agaatttcgt aatccttgaa attgaaaaaa aaaaaattgt 3780
gtttttaaag agtgaaaaca gttagaaaac aagtagaact gtaatcagaa cgctgcttca 3840
attgatatta aaaataacct caataataat gtaaaggttc ctttctcttg tgtcagttat 3900
attcttaggg atagcctaga aggaatatat ggttagaact aagtgtgact aatcatctga 3960
gccttgaaga gaaacttcag tgcctctaaa cagatcatct acaaaacaac aggtaaacat 4020
ttatgccagt taagtgggtc atgtttttgt ttcttgggtt tttcctaaat ttaagtgagg 4080
ttgggcttac cttgtagata aaattatgtt ttctttttgg taaatacttg aacgtggata 4140
acgtcaaatc agaatatttt gtgaggaggt gatgatttga aattaagcta gatttctagg 4200
gaggtgttgg ttccaatgaa ggatgggaag aaattaaaat agtcttcaaa cttcttcctt 4260
attatatttg gttgctttgg aaaagattgg tcctatcctc aatctaattt attcactatt 4320
aatattttaa aaacattcct gagatactta aaaagaccca cttagcgatt atagttgctc 4380
aatgaaacaa gaatttattt atgcatagat ttttctctgt atcttaccaa aatccacttt 4440
acttagataa cactaaattg ttcttaaaga ctactcattt cccaataatc ctttatgatt 4500
tcaaaatttc tagtggctca gaagtgaatt ttattttatt tgtctttcac ttgaataaat 4560
gagaacccag aaattaataa tgttgtttat tgcttactgt caggactatt tcaaagacta 4620
agaagagttt cttctaaccc ctccctctca aaggaatcct aaattattag ttgttagata 4680
agttttgtat gctaagatat tcaggtttat agtttatgta tgtgtgtata tatataaata 4740
tatatgtata tataaatatt atgttcagtt tggagtctgg cacaactcca ttatgtggat 4800
tagagagtaa gatattatgg atgataaagt actaaatgaa acataatatt tatttataaa 4860
agtgtgtaga ttgttaaatc acaaaaagag tgctatgacc attatgtatg aggaaacagg 4920
cctttgacct cctggaaagc actgctcaaa agtcattagt gcccattttt gaattcccca 4980
aacagaaagc ttcttagaaa acatgctgag attttattta cagggaattc tttgacacat 5040
ttcaattggt gtgtagtcaa gtatagcaag tacttaataa tgactgaatt tcatgttcct 5100
acagtcatac atattcatta gaagttttat gttgttggtc tgatctgatt cttctttgtt 5160
tgtgggtgga acggcactga gagaagtata gttttttaaa cttgaacatg ttcagtagtt 5220
acattgcctt agaaaaccca gacacatagc agtggaaatg aaagaaatgg catcagaagt 5280
gacttaattt agcaattgtg attcctcttg taaaacaaaa caaaaaaaca atgccatatt 5340
ttttggagaa aagttggcaa tataggggtt tcgttgtctg tttcacaaga agactcattt 5400
gttcttttgg gggaaccagt gccttacaga ttttgtatat actgtaatta ttcaggacta 5460
gggaacaaac aattgtattg tatttgttac agattgtata tggctttgtt ttaacattcc 5520
cctaaataaa atggcttcat tctccccttg gaaaaaaaca 5560
<210> 46
<211> 502
<212> PRT
<213> 智人
<400> 46
Met Leu Leu Arg Ser Ala Gly Lys Leu Asn Val Gly Thr Lys Lys Glu
1 5 10 15
Asp Gly Glu Ser Thr Ala Pro Thr Pro Arg Pro Lys Val Leu Arg Cys
20 25 30
Lys Cys His His His Cys Pro Glu Asp Ser Val Asn Asn Ile Cys Ser
35 40 45
Thr Asp Gly Tyr Cys Phe Thr Met Ile Glu Glu Asp Asp Ser Gly Leu
50 55 60
Pro Val Val Thr Ser Gly Cys Leu Gly Leu Glu Gly Ser Asp Phe Gln
65 70 75 80
Cys Arg Asp Thr Pro Ile Pro His Gln Arg Arg Ser Ile Glu Cys Cys
85 90 95
Thr Glu Arg Asn Glu Cys Asn Lys Asp Leu His Pro Thr Leu Pro Pro
100 105 110
Leu Lys Asn Arg Asp Phe Val Asp Gly Pro Ile His His Arg Ala Leu
115 120 125
Leu Ile Ser Val Thr Val Cys Ser Leu Leu Leu Val Leu Ile Ile Leu
130 135 140
Phe Cys Tyr Phe Arg Tyr Lys Arg Gln Glu Thr Arg Pro Arg Tyr Ser
145 150 155 160
Ile Gly Leu Glu Gln Asp Glu Thr Tyr Ile Pro Pro Gly Glu Ser Leu
165 170 175
Arg Asp Leu Ile Glu Gln Ser Gln Ser Ser Gly Ser Gly Ser Gly Leu
180 185 190
Pro Leu Leu Val Gln Arg Thr Ile Ala Lys Gln Ile Gln Met Val Lys
195 200 205
Gln Ile Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp Arg
210 215 220
Gly Glu Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala Ser
225 230 235 240
Trp Phe Arg Glu Thr Glu Ile Tyr Gln Thr Val Leu Met Arg His Glu
245 250 255
Asn Ile Leu Gly Phe Ile Ala Ala Asp Ile Lys Gly Thr Gly Ser Trp
260 265 270
Thr Gln Leu Tyr Leu Ile Thr Asp Tyr His Glu Asn Gly Ser Leu Tyr
275 280 285
Asp Tyr Leu Lys Ser Thr Thr Leu Asp Ala Lys Ser Met Leu Lys Leu
290 295 300
Ala Tyr Ser Ser Val Ser Gly Leu Cys His Leu His Thr Glu Ile Phe
305 310 315 320
Ser Thr Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser Lys
325 330 335
Asn Ile Leu Val Lys Lys Asn Gly Thr Cys Cys Ile Ala Asp Leu Gly
340 345 350
Leu Ala Val Lys Phe Ile Ser Asp Thr Asn Glu Val Asp Ile Pro Pro
355 360 365
Asn Thr Arg Val Gly Thr Lys Arg Tyr Met Pro Pro Glu Val Leu Asp
370 375 380
Glu Ser Leu Asn Arg Asn His Phe Gln Ser Tyr Ile Met Ala Asp Met
385 390 395 400
Tyr Ser Phe Gly Leu Ile Leu Trp Glu Val Ala Arg Arg Cys Val Ser
405 410 415
Gly Gly Ile Val Glu Glu Tyr Gln Leu Pro Tyr His Asp Leu Val Pro
420 425 430
Ser Asp Pro Ser Tyr Glu Asp Met Arg Glu Ile Val Cys Ile Lys Lys
435 440 445
Leu Arg Pro Ser Phe Pro Asn Arg Trp Ser Ser Asp Glu Cys Leu Arg
450 455 460
Gln Met Gly Lys Leu Met Thr Glu Cys Trp Ala His Asn Pro Ala Ser
465 470 475 480
Arg Leu Thr Ala Leu Arg Val Lys Lys Thr Leu Ala Lys Met Ser Glu
485 490 495
Ser Gln Asp Ile Lys Leu
500
<210> 47
<211> 4543
<212> DNA
<213> 智人
<400> 47
cggcgggcgg cgcgcacact gctcgctggg ccgcggctcc cgggtgtccc aggcccggcc 60
ggtgcgcaga gcatggcggg tgcgggcccg aagcggcgcg cgctagcggc gccggcggcc 120
gaggagaagg aagaggcgcg ggagaagatg ctggccgcca agagcgcgga cggctcggcg 180
ccggcaggcg agggcgaggg cgtgaccctg cagcggaaca tcacgctgct caacggcgtg 240
gccatcatcg tggggaccat tatcggctcg ggcatcttcg tgacgcccac gggcgtgctc 300
aaggaggcag gctcgccggg gctggcgctg gtggtgtggg ccgcgtgcgg cgtcttctcc 360
atcgtgggcg cgctctgcta cgcggagctc ggcaccacca tctccaaatc gggcggcgac 420
tacgcctaca tgctggaggt ctacggctcg ctgcccgcct tcctcaagct ctggatcgag 480
ctgctcatca tccggccttc atcgcagtac atcgtggccc tggtcttcgc cacctacctg 540
ctcaagccgc tcttccccac ctgcccggtg cccgaggagg cagccaagct cgtggcctgc 600
ctctgcgtgc tgctgctcac ggccgtgaac tgctacagcg tgaaggccgc cacccgggtc 660
caggatgcct ttgccgccgc caagctcctg gccctggccc tgatcatcct gctgggcttc 720
gtccagatcg ggaagggtga tgtgtccaat ctagatccca acttctcatt tgaaggcacc 780
aaactggatg tggggaacat tgtgctggca ttatacagcg gcctctttgc ctatggagga 840
tggaattact tgaatttcgt cacagaggaa atgatcaacc cctacagaaa cctgcccctg 900
gccatcatca tctccctgcc catcgtgacg ctggtgtacg tgctgaccaa cctggcctac 960
ttcaccaccc tgtccaccga gcagatgctg tcgtccgagg ccgtggccgt ggacttcggg 1020
aactatcacc tgggcgtcat gtcctggatc atccccgtct tcgtgggcct gtcctgcttc 1080
ggctccgtca atgggtccct gttcacatcc tccaggctct tcttcgtggg gtcccgggaa 1140
ggccacctgc cctccatcct ctccatgatc cacccacagc tcctcacccc cgtgccgtcc 1200
ctcgtgttca cgtgtgtgat gacgctgctc tacgccttct ccaaggacat cttctccgtc 1260
atcaacttct tcagcttctt caactggctc tgcgtggccc tggccatcat cggcatgatc 1320
tggctgcgcc acagaaagcc tgagcttgag cggcccatca aggtgaacct ggccctgcct 1380
gtgttcttca tcctggcctg cctcttcctg atcgccgtct ccttctggaa gacacccgtg 1440
gagtgtggca tcggcttcac catcatcctc agcgggctgc ccgtctactt cttcggggtc 1500
tggtggaaaa acaagcccaa gtggctcctc cagggcatct tctccacgac cgtcctgtgt 1560
cagaagctca tgcaggtggt cccccaggag acatagccag gaggccgagt ggctgccgga 1620
ggagcatgcg cagaggccag ttaaagtaga tcacctcctc gaacccactc cggttccccg 1680
caacccacag ctcagctgcc catcccagtc cctcgccgtc cctcccaggt cgggcagtgg 1740
aggctgctgt gaaaactctg gtacgaatct catccctcaa ctgagggcca gggacccagg 1800
tgtgcctgtg ctcctgccca ggagcagctt ttggtctcct tgggcccttt ttcccttccc 1860
tcctttgttt acttatatat atattttttt taaacttaaa ttttgggtca acttgacacc 1920
actaagatga ttttttaagg agctggggga aggcaggagc cttcctttct cctgccccaa 1980
gggcccagac cctgggcaaa cagagctact gagacttgga acctcattgc taccacagac 2040
ttgcactgaa gccggacagc tgcccagaca catgggcttg tgacattcgt gaaaaccaac 2100
cctgtgggct tatgtctctg ccttagggtt tgcagagtgg aaactcagcc gtagggtggc 2160
actgggaggg ggtgggggat ctgggcaagg tgggtgattc ctcccaggag gtgcttgagg 2220
ccccgatgga ctcctgacca taatcctagc cccgagacac catcctgagc cagggaacag 2280
ccccagggtt ggggggtgcc ggcatctccc ctagctcacc aggcctggcc tctgggcagt 2340
gtggcctctt ggctatttct gtgtccagtt ttggaggctg agttctggtt catgcagaca 2400
aagccctgtc cttcagtctt ctagaaacag agacaagaaa ggcagacaca ccgcggccag 2460
gcacccatgt gggcgcccac cctgggctcc acacagcagt gtcccctgcc ccagaggtcg 2520
cagctaccct cagcctccaa tgcattggcc tctgtaccgc ccggcagccc cttctggccg 2580
gtgctgggtt cccactcccg gcctaggcac ctccccgctc tccctgtcac gctcatgtcc 2640
tgtcctggtc ctgatgcccg ttgtctagga gacagagcca agcactgctc acgtctctgc 2700
cgcctgcgtt tggaggcccc tgggctctca cccagtcccc acccgcctgc agagagggaa 2760
ctagggcacc ccttgtttct gttgttcccg tgaatttttt tcgctatggg aggcagccga 2820
ggcctggcca atgcggccca ctttcctgag ctgtcgctgc ctccatggca gcagccaggg 2880
acccccagaa caagaagacc ccgcaggatc cctcctgagc tcggggggct ctgccttctc 2940
aggccccggg cttcccttct ccccagccag aggtggagcc aagtggtcca gcgtcactcc 3000
agtgctcagc tgtggctgga ggagctggcc tgtggcacag ccctgagtgt cccaagccgg 3060
gagccaacga agccggacac ggcttcactg accagcggct gctcaagccg caagctctca 3120
gcaagtgccc agtggagcct gccgcccccg cctgggcacc gggaccccct caccatccag 3180
tgggcccgga gaaacctgat gaacagtttg gggactcagg accagatgtc cgtctctctt 3240
gcttgaggaa tgaagacctt tattcacccc tgccccgttg cttcccgctg cacatggaca 3300
gacttcacag cgtctgctca taggacctgc atccttcctg gggacgaatt ccactcgtcc 3360
aagggacagc ccacggtctg gaggccgagg accaccagca ggcaggtgga ctgactgtgt 3420
tgggcaagac ctcttccctc tgggcctgtt ctcttggctg caaataagga cagcagctgg 3480
tgccccacct gcctggtgca ttgctgtgtg aatccaggag gcagtggaca tcgtaggcag 3540
ccacggcccc gggtccagga gaagtgctcc ctggaggcac gcaccactgc ttcccactgg 3600
ggccggcggg gcccacgcac gacgtcagcc tcttaccttc ccgcctcggc taggggtcct 3660
cgggatgccg ttctgttcca acctcctgct ctgggacgtg gacatgcctc aaggatacag 3720
ggagccggcg gcctctcgac ggcacgcact tgcctgttgg ctgctgcggc tgtgggcgag 3780
catgggggct gccagcgtct gttgtggaaa gtagctgcta gtgaaatggc tggggccgct 3840
ggggtccgtc ttcacactgc gcaggtctct tctgggcgtc tgagctgggg tgggagctcc 3900
tccgcagaag gttggtgggg ggtccagtct gtgatccttg gtgctgtgtg ccccactcca 3960
gcctggggac cccacttcag aaggtagggg ccgtgtcccg cggtgctgac tgaggcctgc 4020
ttccccctcc ccctcctgct gtgctggaat tccacaggga ccagggccac cgcaggggac 4080
tgtctcagaa gacttgattt ttccgtccct ttttctccac actccactga caaacgtccc 4140
cagcggtttc cacttgtggg cttcaggtgt tttcaagcac aacccaccac aacaagcaag 4200
tgcattttca gtcgttgtgc ttttttgttt tgtgctaacg tcttactaat ttaaagatgc 4260
tgtcggcacc atgtttattt atttccagtg gtcatgctca gccttgctgc tctgcgtggc 4320
gcaggtgcca tgcctgctcc ctgtctgtgt cccagccacg cagggccatc cactgtgacg 4380
tcggccgacc aggctggaca ccctctgccg agtaatgacg tgtgtggctg ggaccttctt 4440
tattctgtgt taatggctaa cctgttacac tgggctgggt tgggtagggt gttctggctt 4500
ttttgtgggg tttttatttt taaagaaaca ctcaatcatc cta 4543
<210> 48
<211> 507
<212> PRT
<213> 智人
<400> 48
Met Ala Gly Ala Gly Pro Lys Arg Arg Ala Leu Ala Ala Pro Ala Ala
1 5 10 15
Glu Glu Lys Glu Glu Ala Arg Glu Lys Met Leu Ala Ala Lys Ser Ala
20 25 30
Asp Gly Ser Ala Pro Ala Gly Glu Gly Glu Gly Val Thr Leu Gln Arg
35 40 45
Asn Ile Thr Leu Leu Asn Gly Val Ala Ile Ile Val Gly Thr Ile Ile
50 55 60
Gly Ser Gly Ile Phe Val Thr Pro Thr Gly Val Leu Lys Glu Ala Gly
65 70 75 80
Ser Pro Gly Leu Ala Leu Val Val Trp Ala Ala Cys Gly Val Phe Ser
85 90 95
Ile Val Gly Ala Leu Cys Tyr Ala Glu Leu Gly Thr Thr Ile Ser Lys
100 105 110
Ser Gly Gly Asp Tyr Ala Tyr Met Leu Glu Val Tyr Gly Ser Leu Pro
115 120 125
Ala Phe Leu Lys Leu Trp Ile Glu Leu Leu Ile Ile Arg Pro Ser Ser
130 135 140
Gln Tyr Ile Val Ala Leu Val Phe Ala Thr Tyr Leu Leu Lys Pro Leu
145 150 155 160
Phe Pro Thr Cys Pro Val Pro Glu Glu Ala Ala Lys Leu Val Ala Cys
165 170 175
Leu Cys Val Leu Leu Leu Thr Ala Val Asn Cys Tyr Ser Val Lys Ala
180 185 190
Ala Thr Arg Val Gln Asp Ala Phe Ala Ala Ala Lys Leu Leu Ala Leu
195 200 205
Ala Leu Ile Ile Leu Leu Gly Phe Val Gln Ile Gly Lys Gly Asp Val
210 215 220
Ser Asn Leu Asp Pro Asn Phe Ser Phe Glu Gly Thr Lys Leu Asp Val
225 230 235 240
Gly Asn Ile Val Leu Ala Leu Tyr Ser Gly Leu Phe Ala Tyr Gly Gly
245 250 255
Trp Asn Tyr Leu Asn Phe Val Thr Glu Glu Met Ile Asn Pro Tyr Arg
260 265 270
Asn Leu Pro Leu Ala Ile Ile Ile Ser Leu Pro Ile Val Thr Leu Val
275 280 285
Tyr Val Leu Thr Asn Leu Ala Tyr Phe Thr Thr Leu Ser Thr Glu Gln
290 295 300
Met Leu Ser Ser Glu Ala Val Ala Val Asp Phe Gly Asn Tyr His Leu
305 310 315 320
Gly Val Met Ser Trp Ile Ile Pro Val Phe Val Gly Leu Ser Cys Phe
325 330 335
Gly Ser Val Asn Gly Ser Leu Phe Thr Ser Ser Arg Leu Phe Phe Val
340 345 350
Gly Ser Arg Glu Gly His Leu Pro Ser Ile Leu Ser Met Ile His Pro
355 360 365
Gln Leu Leu Thr Pro Val Pro Ser Leu Val Phe Thr Cys Val Met Thr
370 375 380
Leu Leu Tyr Ala Phe Ser Lys Asp Ile Phe Ser Val Ile Asn Phe Phe
385 390 395 400
Ser Phe Phe Asn Trp Leu Cys Val Ala Leu Ala Ile Ile Gly Met Ile
405 410 415
Trp Leu Arg His Arg Lys Pro Glu Leu Glu Arg Pro Ile Lys Val Asn
420 425 430
Leu Ala Leu Pro Val Phe Phe Ile Leu Ala Cys Leu Phe Leu Ile Ala
435 440 445
Val Ser Phe Trp Lys Thr Pro Val Glu Cys Gly Ile Gly Phe Thr Ile
450 455 460
Ile Leu Ser Gly Leu Pro Val Tyr Phe Phe Gly Val Trp Trp Lys Asn
465 470 475 480
Lys Pro Lys Trp Leu Leu Gln Gly Ile Phe Ser Thr Thr Val Leu Cys
485 490 495
Gln Lys Leu Met Gln Val Val Pro Gln Glu Thr
500 505
<210> 49
<211> 1330
<212> DNA
<213> 智人
<400> 49
gcggacgcgg ggcgccagca ggtggcgctg gacgcgcaac ggacaaggag gcggggcctg 60
cagctggctt ggaggctccg cgctctggag gctcaggcgc cgcgtggggc ccgcacctct 120
gggcagcagc ggcagccgag actcacggtc aagctaaggc gaagagtggg tggctgaagc 180
catactattt tatagaatta atggaaagca gaaaagacat cacaaaccaa gaagaacttt 240
ggaaaatgaa gcctaggaga aatttagaag aagacgatta tttgcataag gacacgggag 300
agaccagcat gctaaaaaga cctgtgcttt tgcatttgca ccaaacagcc catgctgatg 360
aatttgactg cccttcagaa cttcagcaca cacaggaact ctttccacag tggcacttgc 420
caattaaaat agctgctatt atagcatctc tgacttttct ttacactctt ctgagggaag 480
taattcaccc tttagcaact tcccatcaac aatattttta taaaattcca atcctggtca 540
tcaacaaagt cttgccaatg gtttccatca ctctcttggc attggtttac ctgccaggtg 600
tgatagcagc aattgtccaa cttcataatg gaaccaagta taagaagttt ccacattggt 660
tggataagtg gatgttaaca agaaagcagt ttgggcttct cagtttcttt tttgctgtac 720
tgcatgcaat ttatagtctg tcttacccaa tgaggcgatc ctacagatac aagttgctaa 780
actgggcata tcaacaggtc caacaaaata aagaagatgc ctggattgag catgatgttt 840
ggagaatgga gatttatgtg tctctgggaa ttgtgggatt ggcaatactg gctctgttgg 900
ctgtgacatc tattccatct gtgagtgact ctttgacatg gagagaattt cactatattc 960
agagcaagct aggaattgtt tcccttctac tgggcacaat acacgcattg atttttgcct 1020
ggaataagtg gatagatata aaacaatttg tatggtatac acctccaact tttatgatag 1080
ctgttttcct tccaattgtt gtcctgatat ttaaaagcat actattcctg ccatgcttga 1140
ggaagaagat actgaagatt agacatggtt gggaagacgt caccaaaatt aacaaaactg 1200
agatatgttc ccagttgtag aattactgtt tacacacatt tttgttcaat attgatatat 1260
tttatcacca acatttcaag tttgtatttg ttaataaaat gattattcaa ggaaaaaaaa 1320
aaaaaaaaaa 1330
<210> 50
<211> 339
<212> PRT
<213> 智人
<400> 50
Met Glu Ser Arg Lys Asp Ile Thr Asn Gln Glu Glu Leu Trp Lys Met
1 5 10 15
Lys Pro Arg Arg Asn Leu Glu Glu Asp Asp Tyr Leu His Lys Asp Thr
20 25 30
Gly Glu Thr Ser Met Leu Lys Arg Pro Val Leu Leu His Leu His Gln
35 40 45
Thr Ala His Ala Asp Glu Phe Asp Cys Pro Ser Glu Leu Gln His Thr
50 55 60
Gln Glu Leu Phe Pro Gln Trp His Leu Pro Ile Lys Ile Ala Ala Ile
65 70 75 80
Ile Ala Ser Leu Thr Phe Leu Tyr Thr Leu Leu Arg Glu Val Ile His
85 90 95
Pro Leu Ala Thr Ser His Gln Gln Tyr Phe Tyr Lys Ile Pro Ile Leu
100 105 110
Val Ile Asn Lys Val Leu Pro Met Val Ser Ile Thr Leu Leu Ala Leu
115 120 125
Val Tyr Leu Pro Gly Val Ile Ala Ala Ile Val Gln Leu His Asn Gly
130 135 140
Thr Lys Tyr Lys Lys Phe Pro His Trp Leu Asp Lys Trp Met Leu Thr
145 150 155 160
Arg Lys Gln Phe Gly Leu Leu Ser Phe Phe Phe Ala Val Leu His Ala
165 170 175
Ile Tyr Ser Leu Ser Tyr Pro Met Arg Arg Ser Tyr Arg Tyr Lys Leu
180 185 190
Leu Asn Trp Ala Tyr Gln Gln Val Gln Gln Asn Lys Glu Asp Ala Trp
195 200 205
Ile Glu His Asp Val Trp Arg Met Glu Ile Tyr Val Ser Leu Gly Ile
210 215 220
Val Gly Leu Ala Ile Leu Ala Leu Leu Ala Val Thr Ser Ile Pro Ser
225 230 235 240
Val Ser Asp Ser Leu Thr Trp Arg Glu Phe His Tyr Ile Gln Ser Lys
245 250 255
Leu Gly Ile Val Ser Leu Leu Leu Gly Thr Ile His Ala Leu Ile Phe
260 265 270
Ala Trp Asn Lys Trp Ile Asp Ile Lys Gln Phe Val Trp Tyr Thr Pro
275 280 285
Pro Thr Phe Met Ile Ala Val Phe Leu Pro Ile Val Val Leu Ile Phe
290 295 300
Lys Ser Ile Leu Phe Leu Pro Cys Leu Arg Lys Lys Ile Leu Lys Ile
305 310 315 320
Arg His Gly Trp Glu Asp Val Thr Lys Ile Asn Lys Thr Glu Ile Cys
325 330 335
Ser Gln Leu
<210> 51
<211> 21112
<212> DNA
<213> 智人
<400> 51
cctgtgactt ctcttctcac ccctggcctg gtgataacca cagacaggat gggcataagc 60
agagaacctg gaaccagttc cacttcaaat ttgagcagca cctcccatga gagactgacc 120
actttggaag acactgtaga tacagaagcc atgcagcctt ccacacacac agcagtgacc 180
aacgtgagga cctccatttc tggacatgaa tcacaatctt ctgtcctatc tgactcagag 240
acacccaaag ccacatctcc aatgggtacc acctacacca tgggggaaac gagtgtttcc 300
atatccactt ctgacttctt tgagaccagc agaattcaga tagaaccaac atcctccctg 360
acttctggat tgagggagac cagcagctct gagaggatca gctcagccac agagggaagc 420
actgtccttt ctgaagtgcc cagtggtgct accactgagg tctccaggac agaagtgata 480
tcctctaggg gaacatccat gtcagggcct gatcagttca ccatatcacc agacatctct 540
actgaagcga tcaccaggct ttctacttcc cccattatga cagaatcagc agaaagtgcc 600
atcactattg agacaggttc tcctggggct acatcagagg gtaccctcac cttggacacc 660
tcaacaacaa ccttttggtc agggacccac tcaactgcat ctccaggatt ttcacactca 720
gagatgacca ctcttatgag tagaactcct ggagatgtgc catggccgag ccttccctct 780
gtggaagaag ccagctctgt ctcttcctca ctgtcttcac ctgccatgac ctcaacttct 840
tttttctcca cattaccaga gagcatctcc tcctctcctc atcctgtgac tgcacttctc 900
acccttggcc cagtgaagac cacagacatg ttgcgcacaa gctcagaacc tgaaaccagt 960
tcacctccaa atttgagcag cacctcagct gaaatattag ccacgtctga agtcaccaaa 1020
gatagagaga aaattcatcc ctcctcaaac acacctgtag tcaatgtagg gactgtgatt 1080
tataaacatc tatccccttc ctctgttttg gctgacttag tgacaacaaa acccacatct 1140
ccaatggcta ccacctccac tctggggaat acaagtgttt ccacatcaac tcctgccttc 1200
ccagaaacta tgatgacaca gccaacttcc tccctgactt ctggattaag ggagatcagt 1260
acctctcaag agaccagctc agcaacagag agaagtgctt ctctttctgg aatgcccact 1320
ggtgctacta ctaaggtctc cagaacagaa gccctctcct taggcagaac atccacccca 1380
ggtcctgctc aatccacaat atcaccagaa atctccacgg aaaccatcac tagaatttct 1440
actcccctca ccacgacagg atcagcagaa atgaccatca cccccaaaac aggtcattct 1500
ggggcatcct cacaaggtac ctttaccttg gacacatcaa gcagagcctc ctggccagga 1560
actcactcag ctgcaactca cagatctcca cactcaggga tgaccactcc tatgagcaga 1620
ggtcctgagg atgtgtcatg gccaagccgc ccatcagtgg aaaaaactag ccctccatct 1680
tccctggtgt ctttatctgc agtaacctca ccttcgccac tttattccac accatctgag 1740
agtagccact cgtctcctct ccgggtgact tctcttttca cccctgtcat gatgaagacc 1800
acagacatgt tggacacaag cttggaacct gtgaccactt cacctcccag tatgaatatc 1860
acctcagatg agagtctggc cacttctaaa gccaccatgg agacagaggc aattcagctt 1920
tcagaaaaca cagctgtgac tcagatgggc accatcagtg ctagacaaga attctattcc 1980
tcttatccag gcctcccaga gccatccaaa gtgacatctc cagtggtcac ctcttccacc 2040
ataaaagaca ttgtttctac aaccatacct gcttcctctg agataacaag aattgagatg 2100
gagtcaacat ccaccctgac ccccacacca agggagacca gcacctccca ggagatccac 2160
tcagccacaa agccaagcac tgttccttac aaggcactca ctagtgccac gattgaggac 2220
tccatgacac aagtcatgtc ctctagcaga ggacctagcc ctgatcagtc cacaatgtca 2280
caagacatat ccactgaagt gatcaccagg ctctctacct cccccatcaa gacagaatct 2340
acagaaatga ccattaccac ccaaacaggt tctcctgggg ctacatcaag gggtaccctt 2400
accttggaca cttcaacaac ttttatgtca gggacccatt caactgcatc tcaaggattt 2460
tcacactcac agatgaccgc tcttatgagt agaactcctg gagaggtgcc atggctaagc 2520
catccctctg tggaagaagc cagctctgcc tctttctcac tgtcttcacc tgtcatgacc 2580
tcatcttctc ccgtttcttc cacattacca gacagcatcc actcttcttc gcttcctgtg 2640
acatcacttc tcacctcagg gctggtgaag accacagagc tgttgggcac aagctcagaa 2700
cctgaaacca gttcaccccc aaatttgagc agcacctcag ctgaaatact ggccaccact 2760
gaagtcacta cagatacaga gaaactggag atgaccaatg tggtaacctc aggttataca 2820
catgaatctc cttcctctgt cctagctgac tcagtgacaa caaaggccac atcttcaatg 2880
ggtatcacct accccacagg agatacaaat gttctcacat caacccctgc cttctctgac 2940
accagtagga ttcaaacaaa gtcaaagctc tcactgactc ctgggttgat ggagaccagc 3000
atctctgaag agaccagctc tgccacagaa aaaagcactg tcctttctag tgtgcccact 3060
ggtgctacta ctgaggtctc caggacagaa gccatctctt ctagcagaac atccatccca 3120
ggccctgctc aatccacaat gtcatcagac acctccatgg aaaccatcac tagaatttct 3180
acccccctca caaggaaaga atcaacagac atggccatca cccccaaaac aggtccttct 3240
ggggctacct cgcagggtac ctttaccttg gactcatcaa gcacagcctc ctggccagga 3300
actcactcag ctacaactca gagatttcca cggtcagtgg tgacaactcc tatgagcaga 3360
ggtcctgagg atgtgtcatg gccaagcccg ctgtctgtgg aaaaaaacag ccctccatct 3420
tccctggtat cttcatcttc agtaacctca ccttcgccac tttattccac accatctggg 3480
agtagccact cctctcctgt ccctgtcact tctcttttca cctctatcat gatgaaggcc 3540
acagacatgt tggatgcaag tttggaacct gagaccactt cagctcccaa tatgaatatc 3600
acctcagatg agagtctggc cgcttctaaa gccaccacgg agacagaggc aattcacgtt 3660
tttgaaaata cagcagcgtc ccatgtggaa accaccagtg ctacagagga actctattcc 3720
tcttccccag gcttctcaga gccaacaaaa gtgatatctc cagtggtcac ctcttcctct 3780
ataagagaca acatggtttc cacaacaatg cctggctcct ctggcattac aaggattgag 3840
atagagtcaa tgtcatctct gacccctgga ctgagggaga ccagaacctc ccaggacatc 3900
acctcatcca cagagacaag cactgtcctt tacaagatgc cctctggtgc cactcctgag 3960
gtctccagga cagaagttat gccctctagc agaacatcca ttcctggccc tgctcagtcc 4020
acaatgtcac tagacatctc cgatgaagtt gtcaccaggc tgtctacctc tcccatcatg 4080
acagaatctg cagaaataac catcaccacc caaacaggtt attctctggc tacatcccag 4140
gttacccttc ccttgggcac ctcaatgacc tttttgtcag ggacccactc aactatgtct 4200
caaggacttt cacactcaga gatgaccaat cttatgagca ggggtcctga aagtctgtca 4260
tggacgagcc ctcgctttgt ggaaacaact agatcttcct cttctctgac atcattacct 4320
ctcacgacct cactttctcc tgtgtcctcc acattactag acagtagccc ctcctctcct 4380
cttcctgtga cttcacttat cctcccaggc ctggtgaaga ctacagaagt gttggataca 4440
agctcagagc ctaaaaccag ttcatctcca aatttgagca gcacctcagt tgaaataccg 4500
gccacctctg aaatcatgac agatacagag aaaattcatc cttcctcaaa cacagcggtg 4560
gccaaagtga ggacctccag ttctgttcat gaatctcatt cctctgtcct agctgactca 4620
gaaacaacca taaccatacc ttcaatgggt atcacctccg ctgtggagga taccactgtt 4680
ttcacatcaa atcctgcctt ctctgagact aggaggattc cgacagagcc aacattctca 4740
ttgactcctg gattcaggga gactagcacc tctgaagaga ccacctcaat cacagaaaca 4800
agtgcagtcc tttttggagt gcccactagt gctactactg aagtctccat gacagaaata 4860
atgtcctcta atagaacaca catccctgac tctgatcagt ccacgatgtc tccagacatc 4920
atcactgaag tgatcaccag gctctcttcc tcatccatga tgtcagaatc aacacaaatg 4980
accatcacca cccaaaaaag ttctcctggg gctacagcac agagtactct taccttggcc 5040
acaacaacag cccccttggc aaggacccac tcaactgttc ctcctagatt tttacactca 5100
gagatgacaa ctcttatgag taggagtcct gaaaatccat catggaagag ctctcccttt 5160
gtggaaaaaa ctagctcttc atcttctctg ttgtccttac ctgtcacgac ctcaccttct 5220
gtttcttcca cattaccgca gagtatccct tcctcctctt tttctgtgac ttcactcctc 5280
accccaggca tggtgaagac tacagacaca agcacagaac ctggaaccag tttatctcca 5340
aatctgagtg gcacctcagt tgaaatactg gctgcctctg aagtcaccac agatacagag 5400
aaaattcatc cttcttcaag catggcagtg accaatgtgg gaaccaccag ttctggacat 5460
gaactatatt cctctgtttc aatccactcg gagccatcca aggctacata cccagtgggt 5520
actccctctt ccatggctga aacctctatt tccacatcaa tgcctgctaa ttttgagacc 5580
acaggatttg aggctgagcc attttctcat ttgacttctg gacttaggaa gaccaacatg 5640
tccctggaca ccagctcagt cacaccaaca aatacacctt cttctcctgg gtccactcac 5700
cttttacaga gttccaagac tgatttcacc tcttctgcaa aaacatcatc cccagactgg 5760
cctccagcct cacagtatac tgaaattcca gtggacataa tcaccccctt taatgcttct 5820
ccatctatta cggagtccac tgggataacc tccttcccag aatccaggtt tactatgtct 5880
gtaacagaaa gtactcatca tctgagtaca gatttgctgc cttcagctga gactatttcc 5940
actggcacag tgatgccttc tctatcagag gccatgactt catttgccac cactggagtt 6000
ccacgagcca tctcaggttc aggtagtcca ttctctagga cagagtcagg ccctggggat 6060
gctactctgt ccaccattgc agagagcctg ccttcatcca ctcctgtgcc attctcctct 6120
tcaaccttca ctaccactga ttcttcaacc atcccagccc tccatgagat aacttcctct 6180
tcagctaccc catatagagt ggacaccagt cttgggacag agagcagcac tactgaagga 6240
cgcttggtta tggtcagtac tttggacact tcaagccaac caggcaggac atcttcatca 6300
cccattttgg ataccagaat gacagagagc gttgagctgg gaacagtgac aagtgcttat 6360
caagttcctt cactctcaac acggttgaca agaactgatg gcattatgga acacatcaca 6420
aaaataccca atgaagcagc acacagaggt accataagac cagtcaaagg ccctcagaca 6480
tccacttcgc ctgccagtcc taaaggacta cacacaggag ggacaaaaag aatggagacc 6540
accaccacag ctctgaagac caccaccaca gctctgaaga ccacttccag agccaccttg 6600
accaccagtg tctatactcc cactttggga acactgactc ccctcaatgc atcaatgcaa 6660
atggccagca caatccccac agaaatgatg atcacaaccc catatgtttt ccctgatgtt 6720
ccagaaacga catcctcatt ggctaccagc ctgggagcag aaaccagcac agctcttccc 6780
aggacaaccc catctgtttt caatagagaa tcagagacca cagcctcact ggtctctcgt 6840
tctggggcag agagaagtcc ggttattcaa actctagatg tttcttctag tgagccagat 6900
acaacagctt catgggttat ccatcctgca gagaccatcc caactgtttc caagacaacc 6960
cccaattttt tccacagtga attagacact gtatcttcca cagccaccag tcatggggca 7020
gacgtcagct cagccattcc aacaaatatc tcacctagtg aactagatgc actgacccca 7080
ctggtcacta tttcggggac agatactagt acaacattcc caacactgac taagtcccca 7140
catgaaacag agacaagaac cacatggctc actcatcctg cagagaccag ctcaactatt 7200
cccagaacaa tccccaattt ttctcatcat gaatcagatg ccacaccttc aatagccacc 7260
agtcctgggg cagaaaccag ttcagctatt ccaattatga ctgtctcacc tggtgcagaa 7320
gatctggtga cctcacaggt cactagttct ggcacagaca gaaatatgac tattccaact 7380
ttgactcttt ctcctggtga accaaagacc atagcctcat tagtcaccca tcctgaagca 7440
cagacaagtt cggccattcc aacttcaact atctcgcctg ctgtatcacg gttggtgacc 7500
tcaatggtca ccagtttggc ggcaaagaca agtacaacta atcgagctct gacaaactcc 7560
cctggtgaac cagctacaac agtttcattg gtcacgcatt ctgcacagac cagcccaaca 7620
gttccctgga caacttccat ttttttccat agtaaatcag acaccacacc ttcaatgacc 7680
accagtcatg gggcagaatc cagttcagct gttccaactc caactgtttc aactgaggta 7740
ccaggagtag tgaccccttt ggtcaccagt tctagggcag tgatcagtac aactattcca 7800
attctgactc tttctcctgg tgaaccagag accacacctt caatggccac cagtcatggg 7860
gaagaagcca gttctgctat tccaactcca actgtttcac ctggggtacc aggagtggtg 7920
acctctctgg tcactagttc tagggcagtg actagtacaa ctattccaat tctgactttt 7980
tctcttggtg aaccagagac cacaccttca atggccacca gtcatgggac agaagctggc 8040
tcagctgttc caactgtttt acctgaggta ccaggaatgg tgacctctct ggttgctagt 8100
tctagggcag taaccagtac aactcttcca actctgactc tttctcctgg tgaaccagag 8160
accacacctt caatggccac cagtcatggg gcagaagcca gctcaactgt tccaactgtt 8220
tcacctgagg taccaggagt ggtgacctct ctggtcacta gttctagtgg agtaaacagt 8280
acaagtattc caactctgat tctttctcct ggtgaactag aaaccacacc ttcaatggcc 8340
accagtcatg gggcagaagc cagctcagct gttccaactc caactgtttc acctggggta 8400
tcaggagtgg tgacccctct ggtcactagt tccagggcag tgaccagtac aactattcca 8460
attctaactc tttcttctag tgagccagag accacacctt caatggccac cagtcatggg 8520
gtagaagcca gctcagctgt tctaactgtt tcacctgagg taccaggaat ggtgaccttt 8580
ctggtcacta gttctagagc agtaaccagt acaactattc caactctgac tatttcttct 8640
gatgaaccag agaccacaac ttcattggtc acccattctg aggcaaagat gatttcagcc 8700
attccaactt taggtgtctc ccctactgta caagggctgg tgacttcact ggtcactagt 8760
tctgggtcag agaccagtgc gttttcaaat ctaactgttg cctcaagtca accagagacc 8820
atagactcat gggtcgctca tcctgggaca gaagcaagtt ctgttgttcc aactttgact 8880
gtctccactg gtgagccgtt tacaaatatc tcattggtca cccatcctgc agagagtagc 8940
tcaactcttc ccaggacaac ctcaaggttt tcccacagtg aattagacac tatgccttct 9000
acagtcacca gtcctgaggc agaatccagc tcagccattt caacaactat ttcacctggt 9060
ataccaggtg tgctgacatc actggtcact agctctggga gagacatcag tgcaactttt 9120
ccaacagtgc ctgagtcccc acatgaatca gaggcaacag cctcatgggt tactcatcct 9180
gcagtcacca gcacaacagt tcccaggaca acccctaatt attctcatag tgaaccagac 9240
accacaccat caatagccac cagtcctggg gcagaagcca cttcagattt tccaacaata 9300
actgtctcac ctgatgtacc agatatggta acctcacagg tcactagttc tgggacagac 9360
accagtataa ctattccaac tctgactctt tcttctggtg agccagagac cacaacctca 9420
tttatcacct attctgagac acatacaagt tcagccattc caactctccc tgtctcccct 9480
gatgcatcaa agatgctgac ctcactggtc atcagttctg ggacagacag cactacaact 9540
ttcccaacac tgacggagac cccatatgaa ccagagacaa cagccataca gctcattcat 9600
cctgcagaga ccaacacaat ggttcccagg acaactccca agttttccca tagtaagtca 9660
gacaccacac tcccagtagc catcaccagt cctgggccag aagccagttc agctgtttca 9720
acgacaacta tctcacctga tatgtcagat ctggtgacct cactggtccc tagttctggg 9780
acagacacca gtacaacctt cccaacattg agtgagaccc catatgaacc agagactaca 9840
gccacgtggc tcactcatcc tgcagaaacc agcacaacgg tttctgggac aattcccaac 9900
ttttcccata ggggatcaga cactgcaccc tcaatggtca ccagtcctgg agtagacacg 9960
aggtcaggtg ttccaactac aaccatccca cccagtatac caggggtagt gacctcacag 10020
gtcactagtt ctgcaacaga cactagtaca gctattccaa ctttgactcc ttctcctggt 10080
gaaccagaga ccacagcctc atcagctacc catcctggga cacagactgg cttcactgtt 10140
ccaattcgga ctgttccctc tagtgagcca gatacaatgg cttcctgggt cactcatcct 10200
ccacagacca gcacacctgt ttccagaaca acctccagtt tttcccatag tagtccagat 10260
gccacacctg taatggccac cagtcctagg acagaagcca gttcagctgt actgacaaca 10320
atctcacctg gtgcaccaga gatggtgact tcacagatca ctagttctgg ggcagcaacc 10380
agtacaactg ttccaacttt gactcattct cctggtatgc cagagaccac agccttattg 10440
agcacccatc ccagaacaga gacaagtaaa acatttcctg cttcaactgt gtttcctcaa 10500
gtatcagaga ccacagcctc actcaccatt agacctggtg cagagactag cacagctctc 10560
ccaactcaga caacatcctc tctcttcacc ctacttgtaa ctggaaccag cagagttgat 10620
ctaagtccaa ctgcttcacc tggtgtttct gcaaaaacag ccccactttc cacccatcca 10680
gggacagaaa ccagcacaat gattccaact tcaactcttt cccttggttt actagagact 10740
acaggcttac tggccaccag ctcttcagca gagaccagca cgagtactct aactctgact 10800
gtttcccctg ctgtctctgg gctttccagt gcctctataa caactgataa gccccaaact 10860
gtgacctcct ggaacacaga aacctcacca tctgtaactt cagttggacc cccagaattt 10920
tccaggactg tcacaggcac cactatgacc ttgataccat cagagatgcc aacaccacct 10980
aaaaccagtc atggagaagg agtgagtcca accactatct tgagaactac aatggttgaa 11040
gccactaatt tagctaccac aggttccagt cccactgtgg ccaagacaac aaccaccttc 11100
aatacactgg ctggaagcct ctttactcct ctgaccacac ctgggatgtc caccttggcc 11160
tctgagagtg tgacctcaag aacaagttat aaccatcggt cctggatctc caccaccagc 11220
agttataacc gtcggtactg gacccctgcc accagcactc cagtgacttc tacattctcc 11280
ccagggattt ccacatcctc catccccagc tccacagcag ccacagtccc attcatggtg 11340
ccattcaccc tcaacttcac catcaccaac ctgcagtacg aggaggacat gcggcaccct 11400
ggttcaagga agttcaacgc cacagagaga gaactgcagg gtctgctcaa acccttgttc 11460
aggaatagca gtctggaata cctctattca ggctgcagac tagcctcact caggccagag 11520
aaggatagct cagccacggc agtggatgcc atctgcacac atcgccctga ccctgaagac 11580
ctcggactgg acagagagcg actgtactgg gagctgagca atctgacaaa tggcatccag 11640
gagctgggcc cttacaccct ggaccggaac agtctctatg tcaatggttt cacccatcga 11700
agctctatgc ccaccaccag cactcctggg acctccacag tggatgtggg aacctcaggg 11760
actccatcct ccagccccag ccccacgact gctggccctc tcctgatgcc gttcaccctc 11820
aacttcacca tcaccaacct gcagtacgag gaggacatgc gtcgcactgg ctccaggaag 11880
ttcaacacca tggagagtgt cctgcagggt ctgctcaagc cattgttcaa gaacaccagt 11940
gttggccctt tgtactctgg ctgcagattg accttgctca ggcccgagaa agatggggca 12000
gccactggag tggatgccat ctgcacccac cgccttgacc ccaaaagccc tggactcaac 12060
agggagcagc tgtactggga gctaagcaaa ctgaccaatg acattgaaga gctgggcccc 12120
tacaccctgg acaggaacag tctctatgtc aatggtttca cccatcagag ctctgtgtcc 12180
accaccagca ctcctgggac ctccacagtg gatctcagaa cctcagggac tccatcctcc 12240
ctctccagcc ccacaattat ggctgctggc cctctcctgg taccattcac cctcaacttc 12300
accatcacca acctgcagta tggggaggac atgggtcacc ctggctccag gaagttcaac 12360
accacagaga gggtcctgca gggtctgctt ggtcccatat tcaagaacac cagtgttggc 12420
cctctgtact ctggctgcag actgacctct ctcaggtccg agaaggatgg agcagccact 12480
ggagtggatg ccatctgcat ccatcatctt gaccccaaaa gccctggact caacagagag 12540
cggctgtact gggagctgag ccaactgacc aatggcatca aagagctggg cccctacacc 12600
ctggacagga acagtctcta tgtcaatggt ttcacccatc ggacctctgt gcccaccacc 12660
agcactcctg ggacctccac agtggacctt ggaacctcag ggactccatt ctccctccca 12720
agccccgcaa ctgctggccc tctcctggtg ctgttcaccc tcaacttcac catcaccaac 12780
ctgaagtatg aggaggacat gcatcgccct ggctccagga agttcaacac cactgagagg 12840
gtcctgcaga ccctggttgg tcctatgttc aagaacacca gtgttggcct tctgtactct 12900
ggctgcagac tgaccttgct caggtccgag aaggatggag cagccactgg agtggatgcc 12960
atctgcaccc accgtcttga ccccaaaagc cctggagtgg acagggagca gctatactgg 13020
gagctgagcc aactgaccaa tggcatcaaa gagctgggcc cctacaccct ggacaggaac 13080
agtctctatg tcaatggttt cacccattgg atccctgtgc ccaccagcag cacccctggg 13140
acctccacag tggaccttgg gtcagggact ccatcctccc tccccagccc cacaagtgct 13200
actgctggcc ctctcctggt gccgttcacc ctcaacttca ccatcaccaa cctgaagtac 13260
gaggaggaca tgcattgccc tggctccagg aagttcaaca ccacagagag agtcctgcag 13320
agtctgcttg gtcccatgtt caagaacacc agtgttggcc ctctgtactc tggctgcaga 13380
ctgaccttgc tcaggtccga gaaggatgga gcagccactg gagtggatgc catctgcacc 13440
caccgtcttg accccaaaag ccctggagtg gacagggagc agctatactg ggagctgagc 13500
cagctgacca atggcatcaa agagctgggt ccctacaccc tggacagaaa cagtctctat 13560
gtcaatggtt tcacccatca gacctctgcg cccaacacca gcactcctgg gacctccaca 13620
gtggaccttg ggacctcagg gactccatcc tccctcccca gccctacatc tgctggccct 13680
ctcctggtgc cattcaccct caacttcacc atcaccaacc tgcagtacga ggaggacatg 13740
catcacccag gctccaggaa gttcaacacc acggagcggg tcctgcaggg tctgcttggt 13800
cccatgttca agaacaccag tgtcggcctt ctgtactctg gctgcagact gaccttgctc 13860
aggcctgaga agaatggggc agccactgga atggatgcca tctgcagcca ccgtcttgac 13920
cccaaaagcc ctggactcaa cagagagcag ctgtactggg agctgagcca gctgacccat 13980
ggcatcaaag agctgggccc ctacaccctg gacaggaaca gtctctatgt caatggtttc 14040
acccatcgga gctctgtggc ccccaccagc actcctggga cctccacagt ggaccttggg 14100
acctcaggga ctccatcctc cctccccagc cccacaacag ctgttcctct cctggtgccg 14160
ttcaccctca actttaccat caccaatctg cagtatgggg aggacatgcg tcaccctggc 14220
tccaggaagt tcaacaccac agagagggtc ctgcagggtc tgcttggtcc cttgttcaag 14280
aactccagtg tcggccctct gtactctggc tgcagactga tctctctcag gtctgagaag 14340
gatggggcag ccactggagt ggatgccatc tgcacccacc accttaaccc tcaaagccct 14400
ggactggaca gggagcagct gtactggcag ctgagccaga tgaccaatgg catcaaagag 14460
ctgggcccct acaccctgga ccggaacagt ctctacgtca atggtttcac ccatcggagc 14520
tctgggctca ccaccagcac tccttggact tccacagttg accttggaac ctcagggact 14580
ccatcccccg tccccagccc cacaactgct ggccctctcc tggtgccatt caccctaaac 14640
ttcaccatca ccaacctgca gtatgaggag gacatgcatc gccctggatc taggaagttc 14700
aacgccacag agagggtcct gcagggtctg cttagtccca tattcaagaa ctccagtgtt 14760
ggccctctgt actctggctg cagactgacc tctctcaggc ccgagaagga tggggcagca 14820
actggaatgg atgctgtctg cctctaccac cctaatccca aaagacctgg gctggacaga 14880
gagcagctgt actgggagct aagccagctg acccacaaca tcactgagct gggcccctac 14940
agcctggaca gggacagtct ctatgtcaat ggtttcaccc atcagaactc tgtgcccacc 15000
accagtactc ctgggacctc cacagtgtac tgggcaacca ctgggactcc atcctccttc 15060
cccggccaca cagagcctgg ccctctcctg ataccattca ctttcaactt taccatcacc 15120
aacctgcatt atgaggaaaa catgcaacac cctggttcca ggaagttcaa caccacggag 15180
agggttctgc agggtctgct caagcccttg ttcaagaaca ccagtgttgg ccctctgtac 15240
tctggctgca gactgacctt gctcagacct gagaagcagg aggcagccac tggagtggac 15300
accatctgta cccaccgcgt tgatcccatc ggacctggac tggacagaga gcggctatac 15360
tgggagctga gccagctgac caacagcatc acagagctgg gaccctacac cctggatagg 15420
gacagtctct atgtcaatgg cttcaaccct tggagctctg tgccaaccac cagcactcct 15480
gggacctcca cagtgcacct ggcaacctct gggactccat cctccctgcc tggccacaca 15540
gcccctgtcc ctctcttgat accattcacc ctcaacttta ccatcaccaa cctgcattat 15600
gaagaaaaca tgcaacaccc tggttccagg aagttcaaca ccacggagag ggttctgcag 15660
ggtctgctca agcccttgtt caagagcacc agcgttggcc ctctgtactc tggctgcaga 15720
ctgaccttgc tcagacctga gaaacatggg gcagccactg gagtggacgc catctgcacc 15780
ctccgccttg atcccactgg tcctggactg gacagagagc ggctatactg ggagctgagc 15840
cagctgacca acagcgttac agagctgggc ccctacaccc tggacaggga cagtctctat 15900
gtcaatggct tcacccatcg gagctctgtg ccaaccacca gtattcctgg gacctctgca 15960
gtgcacctgg aaacctctgg gactccagcc tccctccctg gccacacagc ccctggccct 16020
ctcctggtgc cattcaccct caacttcact atcaccaacc tgcagtatga ggaggacatg 16080
cgtcaccctg gttccaggaa gttcaacacc acggagagag tcctgcaggg tctgctcaag 16140
cccttgttca agagcaccag tgttggccct ctgtactctg gctgcagact gaccttgctc 16200
aggcctgaaa aacgtggggc agccaccggc gtggacacca tctgcactca ccgccttgac 16260
cctctaaacc ctggactgga cagagagcag ctatactggg agctgagcaa actgacccgt 16320
ggcatcatcg agctgggccc ctacctcctg gacagaggca gtctctatgt caatggtttc 16380
acccatcgga actttgtgcc catcaccagc actcctggga cctccacagt acacctagga 16440
acctctgaaa ctccatcctc cctacctaga cccatagtgc ctggccctct cctggtgcca 16500
ttcaccctca acttcaccat caccaacttg cagtatgagg aggccatgcg acaccctggc 16560
tccaggaagt tcaataccac ggagagggtc ctacagggtc tgctcaggcc cttgttcaag 16620
aataccagta tcggccctct gtactccagc tgcagactga ccttgctcag gccagagaag 16680
gacaaggcag ccaccagagt ggatgccatc tgtacccacc accctgaccc tcaaagccct 16740
ggactgaaca gagagcagct gtactgggag ctgagccagc tgacccacgg catcactgag 16800
ctgggcccct acaccctgga cagggacagt ctctatgtcg atggtttcac tcattggagc 16860
cccataccaa ccaccagcac tcctgggacc tccatagtga acctgggaac ctctgggatc 16920
ccaccttccc tccctgaaac tacagccacc ggccctctcc tggtgccatt cacactcaac 16980
ttcaccatca ctaacctaca gtatgaggag aacatgggtc accctggctc caggaagttc 17040
aacatcacgg agagtgttct gcagggtctg ctcaagccct tgttcaagag caccagtgtt 17100
ggccctctgt attctggctg cagactgacc ttgctcaggc ctgagaagga cggagtagcc 17160
accagagtgg acgccatctg cacccaccgc cctgacccca aaatccctgg gctagacaga 17220
cagcagctat actgggagct gagccagctg acccacagca tcactgagct gggaccctac 17280
accctggata gggacagtct ctatgtcaat ggtttcaccc agcggagctc tgtgcccacc 17340
accagcactc ctgggacttt cacagtacag ccggaaacct ctgagactcc atcatccctc 17400
cctggcccca cagccactgg ccctgtcctg ctgccattca ccctcaattt taccatcatt 17460
aacctgcagt atgaggagga catgcatcgc cctggctcca ggaagttcaa caccacggag 17520
agggtccttc agggtctgct tatgcccttg ttcaagaaca ccagtgtcag ctctctgtac 17580
tctggttgca gactgacctt gctcaggcct gagaaggatg gggcagccac cagagtggat 17640
gctgtctgca cccatcgtcc tgaccccaaa agccctggac tggacagaga gcggctgtac 17700
tggaagctga gccagctgac ccacggcatc actgagctgg gcccctacac cctggacagg 17760
cacagtctct atgtcaatgg tttcacccat cagagctcta tgacgaccac cagaactcct 17820
gatacctcca caatgcacct ggcaacctcg agaactccag cctccctgtc tggacctacg 17880
accgccagcc ctctcctggt gctattcaca attaacttca ccatcactaa cctgcggtat 17940
gaggagaaca tgcatcaccc tggctctaga aagtttaaca ccacggagag agtccttcag 18000
ggtctgctca ggcctgtgtt caagaacacc agtgttggcc ctctgtactc tggctgcaga 18060
ctgaccttgc tcaggcccaa gaaggatggg gcagccacca aagtggatgc catctgcacc 18120
taccgccctg atcccaaaag ccctggactg gacagagagc agctatactg ggagctgagc 18180
cagctaaccc acagcatcac tgagctgggc ccctacaccc tggacaggga cagtctctat 18240
gtcaatggtt tcacacagcg gagctctgtg cccaccacta gcattcctgg gacccccaca 18300
gtggacctgg gaacatctgg gactccagtt tctaaacctg gtccctcggc tgccagccct 18360
ctcctggtgc tattcactct caacttcacc atcaccaacc tgcggtatga ggagaacatg 18420
cagcaccctg gctccaggaa gttcaacacc acggagaggg tccttcaggg cctgctcagg 18480
tccctgttca agagcaccag tgttggccct ctgtactctg gctgcagact gactttgctc 18540
aggcctgaaa aggatgggac agccactgga gtggatgcca tctgcaccca ccaccctgac 18600
cccaaaagcc ctaggctgga cagagagcag ctgtattggg agctgagcca gctgacccac 18660
aatatcactg agctgggccc ctatgccctg gacaacgaca gcctctttgt caatggtttc 18720
actcatcgga gctctgtgtc caccaccagc actcctggga cccccacagt gtatctggga 18780
gcatctaaga ctccagcctc gatatttggc ccttcagctg ccagccatct cctgatacta 18840
ttcaccctca acttcaccat cactaacctg cggtatgagg agaacatgtg gcctggctcc 18900
aggaagttca acactacaga gagggtcctt cagggcctgc taaggccctt gttcaagaac 18960
accagtgttg gccctctgta ctctggctgc aggctgacct tgctcaggcc agagaaagat 19020
ggggaagcca ccggagtgga tgccatctgc acccaccgcc ctgaccccac aggccctggg 19080
ctggacagag agcagctgta tttggagctg agccagctga cccacagcat cactgagctg 19140
ggcccctaca cactggacag ggacagtctc tatgtcaatg gtttcaccca tcggagctct 19200
gtacccacca ccagcaccgg ggtggtcagc gaggagccat tcacactgaa cttcaccatc 19260
aacaacctgc gctacatggc ggacatgggc caacccggct ccctcaagtt caacatcaca 19320
gacaacgtca tgcagcacct gctcagtcct ttgttccaga ggagcagcct gggtgcacgg 19380
tacacaggct gcagggtcat cgcactaagg tctgtgaaga acggtgctga gacacgggtg 19440
gacctcctct gcacctacct gcagcccctc agcggcccag gtctgcctat caagcaggtg 19500
ttccatgagc tgagccagca gacccatggc atcacccggc tgggccccta ctctctggac 19560
aaagacagcc tctaccttaa cggttacaat gaacctggtc cagatgagcc tcctacaact 19620
cccaagccag ccaccacatt cctgcctcct ctgtcagaag ccacaacagc catggggtac 19680
cacctgaaga ccctcacact caacttcacc atctccaatc tccagtattc accagatatg 19740
ggcaagggct cagctacatt caactccacc gagggggtcc ttcagcacct gctcagaccc 19800
ttgttccaga agagcagcat gggccccttc tacttgggtt gccaactgat ctccctcagg 19860
cctgagaagg atggggcagc cactggtgtg gacaccacct gcacctacca ccctgaccct 19920
gtgggccccg ggctggacat acagcagctt tactgggagc tgagtcagct gacccatggt 19980
gtcacccaac tgggcttcta tgtcctggac agggatagcc tcttcatcaa tggctatgca 20040
ccccagaatt tatcaatccg gggcgagtac cagataaatt tccacattgt caactggaac 20100
ctcagtaatc cagaccccac atcctcagag tacatcaccc tgctgaggga catccaggac 20160
aaggtcacca cactctacaa aggcagtcaa ctacatgaca cattccgctt ctgcctggtc 20220
accaacttga cgatggactc cgtgttggtc actgtcaagg cattgttctc ctccaatttg 20280
gaccccagcc tggtggagca agtctttcta gataagaccc tgaatgcctc attccattgg 20340
ctgggctcca cctaccagtt ggtggacatc catgtgacag aaatggagtc atcagtttat 20400
caaccaacaa gcagctccag cacccagcac ttctacctga atttcaccat caccaaccta 20460
ccatattccc aggacaaagc ccagccaggc accaccaatt accagaggaa caaaaggaat 20520
attgaggatg cgctcaacca actcttccga aacagcagca tcaagagtta tttttctgac 20580
tgtcaagttt caacattcag gtctgtcccc aacaggcacc acaccggggt ggactccctg 20640
tgtaacttct cgccactggc tcggagagta gacagagttg ccatctatga ggaatttctg 20700
cggatgaccc ggaatggtac ccagctgcag aacttcaccc tggacaggag cagtgtcctt 20760
gtggatgggt attctcccaa cagaaatgag cccttaactg ggaattctga ccttcccttc 20820
tgggctgtca tcctcatcgg cttggcagga ctcctgggac tcatcacatg cctgatctgc 20880
ggtgtcctgg tgaccacccg ccggcggaag aaggaaggag aatacaacgt ccagcaacag 20940
tgcccaggct actaccagtc acacctagac ctggaggatc tgcaatgact ggaacttgcc 21000
ggtgcctggg gtgcctttcc cccagccagg gtccaaagaa gcttggctgg ggcagaaata 21060
aaccatattg gtcggaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 21112
<210> 52
<211> 6995
<212> PRT
<213> 智人
<400> 52
Pro Val Thr Ser Leu Leu Thr Pro Gly Leu Val Ile Thr Thr Asp Arg
1 5 10 15
Met Gly Ile Ser Arg Glu Pro Gly Thr Ser Ser Thr Ser Asn Leu Ser
20 25 30
Ser Thr Ser His Glu Arg Leu Thr Thr Leu Glu Asp Thr Val Asp Thr
35 40 45
Glu Ala Met Gln Pro Ser Thr His Thr Ala Val Thr Asn Val Arg Thr
50 55 60
Ser Ile Ser Gly His Glu Ser Gln Ser Ser Val Leu Ser Asp Ser Glu
65 70 75 80
Thr Pro Lys Ala Thr Ser Pro Met Gly Thr Thr Tyr Thr Met Gly Glu
85 90 95
Thr Ser Val Ser Ile Ser Thr Ser Asp Phe Phe Glu Thr Ser Arg Ile
100 105 110
Gln Ile Glu Pro Thr Ser Ser Leu Thr Ser Gly Leu Arg Glu Thr Ser
115 120 125
Ser Ser Glu Arg Ile Ser Ser Ala Thr Glu Gly Ser Thr Val Leu Ser
130 135 140
Glu Val Pro Ser Gly Ala Thr Thr Glu Val Ser Arg Thr Glu Val Ile
145 150 155 160
Ser Ser Arg Gly Thr Ser Met Ser Gly Pro Asp Gln Phe Thr Ile Ser
165 170 175
Pro Asp Ile Ser Thr Glu Ala Ile Thr Arg Leu Ser Thr Ser Pro Ile
180 185 190
Met Thr Glu Ser Ala Glu Ser Ala Ile Thr Ile Glu Thr Gly Ser Pro
195 200 205
Gly Ala Thr Ser Glu Gly Thr Leu Thr Leu Asp Thr Ser Thr Thr Thr
210 215 220
Phe Trp Ser Gly Thr His Ser Thr Ala Ser Pro Gly Phe Ser His Ser
225 230 235 240
Glu Met Thr Thr Leu Met Ser Arg Thr Pro Gly Asp Val Pro Trp Pro
245 250 255
Ser Leu Pro Ser Val Glu Glu Ala Ser Ser Val Ser Ser Ser Leu Ser
260 265 270
Ser Pro Ala Met Thr Ser Thr Ser Phe Phe Ser Thr Leu Pro Glu Ser
275 280 285
Ile Ser Ser Ser Pro His Pro Val Thr Ala Leu Leu Thr Leu Gly Pro
290 295 300
Val Lys Thr Thr Asp Met Leu Arg Thr Ser Ser Glu Pro Glu Thr Ser
305 310 315 320
Ser Pro Pro Asn Leu Ser Ser Thr Ser Ala Glu Ile Leu Ala Thr Ser
325 330 335
Glu Val Thr Lys Asp Arg Glu Lys Ile His Pro Ser Ser Asn Thr Pro
340 345 350
Val Val Asn Val Gly Thr Val Ile Tyr Lys His Leu Ser Pro Ser Ser
355 360 365
Val Leu Ala Asp Leu Val Thr Thr Lys Pro Thr Ser Pro Met Ala Thr
370 375 380
Thr Ser Thr Leu Gly Asn Thr Ser Val Ser Thr Ser Thr Pro Ala Phe
385 390 395 400
Pro Glu Thr Met Met Thr Gln Pro Thr Ser Ser Leu Thr Ser Gly Leu
405 410 415
Arg Glu Ile Ser Thr Ser Gln Glu Thr Ser Ser Ala Thr Glu Arg Ser
420 425 430
Ala Ser Leu Ser Gly Met Pro Thr Gly Ala Thr Thr Lys Val Ser Arg
435 440 445
Thr Glu Ala Leu Ser Leu Gly Arg Thr Ser Thr Pro Gly Pro Ala Gln
450 455 460
Ser Thr Ile Ser Pro Glu Ile Ser Thr Glu Thr Ile Thr Arg Ile Ser
465 470 475 480
Thr Pro Leu Thr Thr Thr Gly Ser Ala Glu Met Thr Ile Thr Pro Lys
485 490 495
Thr Gly His Ser Gly Ala Ser Ser Gln Gly Thr Phe Thr Leu Asp Thr
500 505 510
Ser Ser Arg Ala Ser Trp Pro Gly Thr His Ser Ala Ala Thr His Arg
515 520 525
Ser Pro His Ser Gly Met Thr Thr Pro Met Ser Arg Gly Pro Glu Asp
530 535 540
Val Ser Trp Pro Ser Arg Pro Ser Val Glu Lys Thr Ser Pro Pro Ser
545 550 555 560
Ser Leu Val Ser Leu Ser Ala Val Thr Ser Pro Ser Pro Leu Tyr Ser
565 570 575
Thr Pro Ser Glu Ser Ser His Ser Ser Pro Leu Arg Val Thr Ser Leu
580 585 590
Phe Thr Pro Val Met Met Lys Thr Thr Asp Met Leu Asp Thr Ser Leu
595 600 605
Glu Pro Val Thr Thr Ser Pro Pro Ser Met Asn Ile Thr Ser Asp Glu
610 615 620
Ser Leu Ala Thr Ser Lys Ala Thr Met Glu Thr Glu Ala Ile Gln Leu
625 630 635 640
Ser Glu Asn Thr Ala Val Thr Gln Met Gly Thr Ile Ser Ala Arg Gln
645 650 655
Glu Phe Tyr Ser Ser Tyr Pro Gly Leu Pro Glu Pro Ser Lys Val Thr
660 665 670
Ser Pro Val Val Thr Ser Ser Thr Ile Lys Asp Ile Val Ser Thr Thr
675 680 685
Ile Pro Ala Ser Ser Glu Ile Thr Arg Ile Glu Met Glu Ser Thr Ser
690 695 700
Thr Leu Thr Pro Thr Pro Arg Glu Thr Ser Thr Ser Gln Glu Ile His
705 710 715 720
Ser Ala Thr Lys Pro Ser Thr Val Pro Tyr Lys Ala Leu Thr Ser Ala
725 730 735
Thr Ile Glu Asp Ser Met Thr Gln Val Met Ser Ser Ser Arg Gly Pro
740 745 750
Ser Pro Asp Gln Ser Thr Met Ser Gln Asp Ile Ser Thr Glu Val Ile
755 760 765
Thr Arg Leu Ser Thr Ser Pro Ile Lys Thr Glu Ser Thr Glu Met Thr
770 775 780
Ile Thr Thr Gln Thr Gly Ser Pro Gly Ala Thr Ser Arg Gly Thr Leu
785 790 795 800
Thr Leu Asp Thr Ser Thr Thr Phe Met Ser Gly Thr His Ser Thr Ala
805 810 815
Ser Gln Gly Phe Ser His Ser Gln Met Thr Ala Leu Met Ser Arg Thr
820 825 830
Pro Gly Glu Val Pro Trp Leu Ser His Pro Ser Val Glu Glu Ala Ser
835 840 845
Ser Ala Ser Phe Ser Leu Ser Ser Pro Val Met Thr Ser Ser Ser Pro
850 855 860
Val Ser Ser Thr Leu Pro Asp Ser Ile His Ser Ser Ser Leu Pro Val
865 870 875 880
Thr Ser Leu Leu Thr Ser Gly Leu Val Lys Thr Thr Glu Leu Leu Gly
885 890 895
Thr Ser Ser Glu Pro Glu Thr Ser Ser Pro Pro Asn Leu Ser Ser Thr
900 905 910
Ser Ala Glu Ile Leu Ala Thr Thr Glu Val Thr Thr Asp Thr Glu Lys
915 920 925
Leu Glu Met Thr Asn Val Val Thr Ser Gly Tyr Thr His Glu Ser Pro
930 935 940
Ser Ser Val Leu Ala Asp Ser Val Thr Thr Lys Ala Thr Ser Ser Met
945 950 955 960
Gly Ile Thr Tyr Pro Thr Gly Asp Thr Asn Val Leu Thr Ser Thr Pro
965 970 975
Ala Phe Ser Asp Thr Ser Arg Ile Gln Thr Lys Ser Lys Leu Ser Leu
980 985 990
Thr Pro Gly Leu Met Glu Thr Ser Ile Ser Glu Glu Thr Ser Ser Ala
995 1000 1005
Thr Glu Lys Ser Thr Val Leu Ser Ser Val Pro Thr Gly Ala Thr
1010 1015 1020
Thr Glu Val Ser Arg Thr Glu Ala Ile Ser Ser Ser Arg Thr Ser
1025 1030 1035
Ile Pro Gly Pro Ala Gln Ser Thr Met Ser Ser Asp Thr Ser Met
1040 1045 1050
Glu Thr Ile Thr Arg Ile Ser Thr Pro Leu Thr Arg Lys Glu Ser
1055 1060 1065
Thr Asp Met Ala Ile Thr Pro Lys Thr Gly Pro Ser Gly Ala Thr
1070 1075 1080
Ser Gln Gly Thr Phe Thr Leu Asp Ser Ser Ser Thr Ala Ser Trp
1085 1090 1095
Pro Gly Thr His Ser Ala Thr Thr Gln Arg Phe Pro Arg Ser Val
1100 1105 1110
Val Thr Thr Pro Met Ser Arg Gly Pro Glu Asp Val Ser Trp Pro
1115 1120 1125
Ser Pro Leu Ser Val Glu Lys Asn Ser Pro Pro Ser Ser Leu Val
1130 1135 1140
Ser Ser Ser Ser Val Thr Ser Pro Ser Pro Leu Tyr Ser Thr Pro
1145 1150 1155
Ser Gly Ser Ser His Ser Ser Pro Val Pro Val Thr Ser Leu Phe
1160 1165 1170
Thr Ser Ile Met Met Lys Ala Thr Asp Met Leu Asp Ala Ser Leu
1175 1180 1185
Glu Pro Glu Thr Thr Ser Ala Pro Asn Met Asn Ile Thr Ser Asp
1190 1195 1200
Glu Ser Leu Ala Ala Ser Lys Ala Thr Thr Glu Thr Glu Ala Ile
1205 1210 1215
His Val Phe Glu Asn Thr Ala Ala Ser His Val Glu Thr Thr Ser
1220 1225 1230
Ala Thr Glu Glu Leu Tyr Ser Ser Ser Pro Gly Phe Ser Glu Pro
1235 1240 1245
Thr Lys Val Ile Ser Pro Val Val Thr Ser Ser Ser Ile Arg Asp
1250 1255 1260
Asn Met Val Ser Thr Thr Met Pro Gly Ser Ser Gly Ile Thr Arg
1265 1270 1275
Ile Glu Ile Glu Ser Met Ser Ser Leu Thr Pro Gly Leu Arg Glu
1280 1285 1290
Thr Arg Thr Ser Gln Asp Ile Thr Ser Ser Thr Glu Thr Ser Thr
1295 1300 1305
Val Leu Tyr Lys Met Pro Ser Gly Ala Thr Pro Glu Val Ser Arg
1310 1315 1320
Thr Glu Val Met Pro Ser Ser Arg Thr Ser Ile Pro Gly Pro Ala
1325 1330 1335
Gln Ser Thr Met Ser Leu Asp Ile Ser Asp Glu Val Val Thr Arg
1340 1345 1350
Leu Ser Thr Ser Pro Ile Met Thr Glu Ser Ala Glu Ile Thr Ile
1355 1360 1365
Thr Thr Gln Thr Gly Tyr Ser Leu Ala Thr Ser Gln Val Thr Leu
1370 1375 1380
Pro Leu Gly Thr Ser Met Thr Phe Leu Ser Gly Thr His Ser Thr
1385 1390 1395
Met Ser Gln Gly Leu Ser His Ser Glu Met Thr Asn Leu Met Ser
1400 1405 1410
Arg Gly Pro Glu Ser Leu Ser Trp Thr Ser Pro Arg Phe Val Glu
1415 1420 1425
Thr Thr Arg Ser Ser Ser Ser Leu Thr Ser Leu Pro Leu Thr Thr
1430 1435 1440
Ser Leu Ser Pro Val Ser Ser Thr Leu Leu Asp Ser Ser Pro Ser
1445 1450 1455
Ser Pro Leu Pro Val Thr Ser Leu Ile Leu Pro Gly Leu Val Lys
1460 1465 1470
Thr Thr Glu Val Leu Asp Thr Ser Ser Glu Pro Lys Thr Ser Ser
1475 1480 1485
Ser Pro Asn Leu Ser Ser Thr Ser Val Glu Ile Pro Ala Thr Ser
1490 1495 1500
Glu Ile Met Thr Asp Thr Glu Lys Ile His Pro Ser Ser Asn Thr
1505 1510 1515
Ala Val Ala Lys Val Arg Thr Ser Ser Ser Val His Glu Ser His
1520 1525 1530
Ser Ser Val Leu Ala Asp Ser Glu Thr Thr Ile Thr Ile Pro Ser
1535 1540 1545
Met Gly Ile Thr Ser Ala Val Glu Asp Thr Thr Val Phe Thr Ser
1550 1555 1560
Asn Pro Ala Phe Ser Glu Thr Arg Arg Ile Pro Thr Glu Pro Thr
1565 1570 1575
Phe Ser Leu Thr Pro Gly Phe Arg Glu Thr Ser Thr Ser Glu Glu
1580 1585 1590
Thr Thr Ser Ile Thr Glu Thr Ser Ala Val Leu Phe Gly Val Pro
1595 1600 1605
Thr Ser Ala Thr Thr Glu Val Ser Met Thr Glu Ile Met Ser Ser
1610 1615 1620
Asn Arg Thr His Ile Pro Asp Ser Asp Gln Ser Thr Met Ser Pro
1625 1630 1635
Asp Ile Ile Thr Glu Val Ile Thr Arg Leu Ser Ser Ser Ser Met
1640 1645 1650
Met Ser Glu Ser Thr Gln Met Thr Ile Thr Thr Gln Lys Ser Ser
1655 1660 1665
Pro Gly Ala Thr Ala Gln Ser Thr Leu Thr Leu Ala Thr Thr Thr
1670 1675 1680
Ala Pro Leu Ala Arg Thr His Ser Thr Val Pro Pro Arg Phe Leu
1685 1690 1695
His Ser Glu Met Thr Thr Leu Met Ser Arg Ser Pro Glu Asn Pro
1700 1705 1710
Ser Trp Lys Ser Ser Pro Phe Val Glu Lys Thr Ser Ser Ser Ser
1715 1720 1725
Ser Leu Leu Ser Leu Pro Val Thr Thr Ser Pro Ser Val Ser Ser
1730 1735 1740
Thr Leu Pro Gln Ser Ile Pro Ser Ser Ser Phe Ser Val Thr Ser
1745 1750 1755
Leu Leu Thr Pro Gly Met Val Lys Thr Thr Asp Thr Ser Thr Glu
1760 1765 1770
Pro Gly Thr Ser Leu Ser Pro Asn Leu Ser Gly Thr Ser Val Glu
1775 1780 1785
Ile Leu Ala Ala Ser Glu Val Thr Thr Asp Thr Glu Lys Ile His
1790 1795 1800
Pro Ser Ser Ser Met Ala Val Thr Asn Val Gly Thr Thr Ser Ser
1805 1810 1815
Gly His Glu Leu Tyr Ser Ser Val Ser Ile His Ser Glu Pro Ser
1820 1825 1830
Lys Ala Thr Tyr Pro Val Gly Thr Pro Ser Ser Met Ala Glu Thr
1835 1840 1845
Ser Ile Ser Thr Ser Met Pro Ala Asn Phe Glu Thr Thr Gly Phe
1850 1855 1860
Glu Ala Glu Pro Phe Ser His Leu Thr Ser Gly Leu Arg Lys Thr
1865 1870 1875
Asn Met Ser Leu Asp Thr Ser Ser Val Thr Pro Thr Asn Thr Pro
1880 1885 1890
Ser Ser Pro Gly Ser Thr His Leu Leu Gln Ser Ser Lys Thr Asp
1895 1900 1905
Phe Thr Ser Ser Ala Lys Thr Ser Ser Pro Asp Trp Pro Pro Ala
1910 1915 1920
Ser Gln Tyr Thr Glu Ile Pro Val Asp Ile Ile Thr Pro Phe Asn
1925 1930 1935
Ala Ser Pro Ser Ile Thr Glu Ser Thr Gly Ile Thr Ser Phe Pro
1940 1945 1950
Glu Ser Arg Phe Thr Met Ser Val Thr Glu Ser Thr His His Leu
1955 1960 1965
Ser Thr Asp Leu Leu Pro Ser Ala Glu Thr Ile Ser Thr Gly Thr
1970 1975 1980
Val Met Pro Ser Leu Ser Glu Ala Met Thr Ser Phe Ala Thr Thr
1985 1990 1995
Gly Val Pro Arg Ala Ile Ser Gly Ser Gly Ser Pro Phe Ser Arg
2000 2005 2010
Thr Glu Ser Gly Pro Gly Asp Ala Thr Leu Ser Thr Ile Ala Glu
2015 2020 2025
Ser Leu Pro Ser Ser Thr Pro Val Pro Phe Ser Ser Ser Thr Phe
2030 2035 2040
Thr Thr Thr Asp Ser Ser Thr Ile Pro Ala Leu His Glu Ile Thr
2045 2050 2055
Ser Ser Ser Ala Thr Pro Tyr Arg Val Asp Thr Ser Leu Gly Thr
2060 2065 2070
Glu Ser Ser Thr Thr Glu Gly Arg Leu Val Met Val Ser Thr Leu
2075 2080 2085
Asp Thr Ser Ser Gln Pro Gly Arg Thr Ser Ser Ser Pro Ile Leu
2090 2095 2100
Asp Thr Arg Met Thr Glu Ser Val Glu Leu Gly Thr Val Thr Ser
2105 2110 2115
Ala Tyr Gln Val Pro Ser Leu Ser Thr Arg Leu Thr Arg Thr Asp
2120 2125 2130
Gly Ile Met Glu His Ile Thr Lys Ile Pro Asn Glu Ala Ala His
2135 2140 2145
Arg Gly Thr Ile Arg Pro Val Lys Gly Pro Gln Thr Ser Thr Ser
2150 2155 2160
Pro Ala Ser Pro Lys Gly Leu His Thr Gly Gly Thr Lys Arg Met
2165 2170 2175
Glu Thr Thr Thr Thr Ala Leu Lys Thr Thr Thr Thr Ala Leu Lys
2180 2185 2190
Thr Thr Ser Arg Ala Thr Leu Thr Thr Ser Val Tyr Thr Pro Thr
2195 2200 2205
Leu Gly Thr Leu Thr Pro Leu Asn Ala Ser Met Gln Met Ala Ser
2210 2215 2220
Thr Ile Pro Thr Glu Met Met Ile Thr Thr Pro Tyr Val Phe Pro
2225 2230 2235
Asp Val Pro Glu Thr Thr Ser Ser Leu Ala Thr Ser Leu Gly Ala
2240 2245 2250
Glu Thr Ser Thr Ala Leu Pro Arg Thr Thr Pro Ser Val Phe Asn
2255 2260 2265
Arg Glu Ser Glu Thr Thr Ala Ser Leu Val Ser Arg Ser Gly Ala
2270 2275 2280
Glu Arg Ser Pro Val Ile Gln Thr Leu Asp Val Ser Ser Ser Glu
2285 2290 2295
Pro Asp Thr Thr Ala Ser Trp Val Ile His Pro Ala Glu Thr Ile
2300 2305 2310
Pro Thr Val Ser Lys Thr Thr Pro Asn Phe Phe His Ser Glu Leu
2315 2320 2325
Asp Thr Val Ser Ser Thr Ala Thr Ser His Gly Ala Asp Val Ser
2330 2335 2340
Ser Ala Ile Pro Thr Asn Ile Ser Pro Ser Glu Leu Asp Ala Leu
2345 2350 2355
Thr Pro Leu Val Thr Ile Ser Gly Thr Asp Thr Ser Thr Thr Phe
2360 2365 2370
Pro Thr Leu Thr Lys Ser Pro His Glu Thr Glu Thr Arg Thr Thr
2375 2380 2385
Trp Leu Thr His Pro Ala Glu Thr Ser Ser Thr Ile Pro Arg Thr
2390 2395 2400
Ile Pro Asn Phe Ser His His Glu Ser Asp Ala Thr Pro Ser Ile
2405 2410 2415
Ala Thr Ser Pro Gly Ala Glu Thr Ser Ser Ala Ile Pro Ile Met
2420 2425 2430
Thr Val Ser Pro Gly Ala Glu Asp Leu Val Thr Ser Gln Val Thr
2435 2440 2445
Ser Ser Gly Thr Asp Arg Asn Met Thr Ile Pro Thr Leu Thr Leu
2450 2455 2460
Ser Pro Gly Glu Pro Lys Thr Ile Ala Ser Leu Val Thr His Pro
2465 2470 2475
Glu Ala Gln Thr Ser Ser Ala Ile Pro Thr Ser Thr Ile Ser Pro
2480 2485 2490
Ala Val Ser Arg Leu Val Thr Ser Met Val Thr Ser Leu Ala Ala
2495 2500 2505
Lys Thr Ser Thr Thr Asn Arg Ala Leu Thr Asn Ser Pro Gly Glu
2510 2515 2520
Pro Ala Thr Thr Val Ser Leu Val Thr His Ser Ala Gln Thr Ser
2525 2530 2535
Pro Thr Val Pro Trp Thr Thr Ser Ile Phe Phe His Ser Lys Ser
2540 2545 2550
Asp Thr Thr Pro Ser Met Thr Thr Ser His Gly Ala Glu Ser Ser
2555 2560 2565
Ser Ala Val Pro Thr Pro Thr Val Ser Thr Glu Val Pro Gly Val
2570 2575 2580
Val Thr Pro Leu Val Thr Ser Ser Arg Ala Val Ile Ser Thr Thr
2585 2590 2595
Ile Pro Ile Leu Thr Leu Ser Pro Gly Glu Pro Glu Thr Thr Pro
2600 2605 2610
Ser Met Ala Thr Ser His Gly Glu Glu Ala Ser Ser Ala Ile Pro
2615 2620 2625
Thr Pro Thr Val Ser Pro Gly Val Pro Gly Val Val Thr Ser Leu
2630 2635 2640
Val Thr Ser Ser Arg Ala Val Thr Ser Thr Thr Ile Pro Ile Leu
2645 2650 2655
Thr Phe Ser Leu Gly Glu Pro Glu Thr Thr Pro Ser Met Ala Thr
2660 2665 2670
Ser His Gly Thr Glu Ala Gly Ser Ala Val Pro Thr Val Leu Pro
2675 2680 2685
Glu Val Pro Gly Met Val Thr Ser Leu Val Ala Ser Ser Arg Ala
2690 2695 2700
Val Thr Ser Thr Thr Leu Pro Thr Leu Thr Leu Ser Pro Gly Glu
2705 2710 2715
Pro Glu Thr Thr Pro Ser Met Ala Thr Ser His Gly Ala Glu Ala
2720 2725 2730
Ser Ser Thr Val Pro Thr Val Ser Pro Glu Val Pro Gly Val Val
2735 2740 2745
Thr Ser Leu Val Thr Ser Ser Ser Gly Val Asn Ser Thr Ser Ile
2750 2755 2760
Pro Thr Leu Ile Leu Ser Pro Gly Glu Leu Glu Thr Thr Pro Ser
2765 2770 2775
Met Ala Thr Ser His Gly Ala Glu Ala Ser Ser Ala Val Pro Thr
2780 2785 2790
Pro Thr Val Ser Pro Gly Val Ser Gly Val Val Thr Pro Leu Val
2795 2800 2805
Thr Ser Ser Arg Ala Val Thr Ser Thr Thr Ile Pro Ile Leu Thr
2810 2815 2820
Leu Ser Ser Ser Glu Pro Glu Thr Thr Pro Ser Met Ala Thr Ser
2825 2830 2835
His Gly Val Glu Ala Ser Ser Ala Val Leu Thr Val Ser Pro Glu
2840 2845 2850
Val Pro Gly Met Val Thr Phe Leu Val Thr Ser Ser Arg Ala Val
2855 2860 2865
Thr Ser Thr Thr Ile Pro Thr Leu Thr Ile Ser Ser Asp Glu Pro
2870 2875 2880
Glu Thr Thr Thr Ser Leu Val Thr His Ser Glu Ala Lys Met Ile
2885 2890 2895
Ser Ala Ile Pro Thr Leu Gly Val Ser Pro Thr Val Gln Gly Leu
2900 2905 2910
Val Thr Ser Leu Val Thr Ser Ser Gly Ser Glu Thr Ser Ala Phe
2915 2920 2925
Ser Asn Leu Thr Val Ala Ser Ser Gln Pro Glu Thr Ile Asp Ser
2930 2935 2940
Trp Val Ala His Pro Gly Thr Glu Ala Ser Ser Val Val Pro Thr
2945 2950 2955
Leu Thr Val Ser Thr Gly Glu Pro Phe Thr Asn Ile Ser Leu Val
2960 2965 2970
Thr His Pro Ala Glu Ser Ser Ser Thr Leu Pro Arg Thr Thr Ser
2975 2980 2985
Arg Phe Ser His Ser Glu Leu Asp Thr Met Pro Ser Thr Val Thr
2990 2995 3000
Ser Pro Glu Ala Glu Ser Ser Ser Ala Ile Ser Thr Thr Ile Ser
3005 3010 3015
Pro Gly Ile Pro Gly Val Leu Thr Ser Leu Val Thr Ser Ser Gly
3020 3025 3030
Arg Asp Ile Ser Ala Thr Phe Pro Thr Val Pro Glu Ser Pro His
3035 3040 3045
Glu Ser Glu Ala Thr Ala Ser Trp Val Thr His Pro Ala Val Thr
3050 3055 3060
Ser Thr Thr Val Pro Arg Thr Thr Pro Asn Tyr Ser His Ser Glu
3065 3070 3075
Pro Asp Thr Thr Pro Ser Ile Ala Thr Ser Pro Gly Ala Glu Ala
3080 3085 3090
Thr Ser Asp Phe Pro Thr Ile Thr Val Ser Pro Asp Val Pro Asp
3095 3100 3105
Met Val Thr Ser Gln Val Thr Ser Ser Gly Thr Asp Thr Ser Ile
3110 3115 3120
Thr Ile Pro Thr Leu Thr Leu Ser Ser Gly Glu Pro Glu Thr Thr
3125 3130 3135
Thr Ser Phe Ile Thr Tyr Ser Glu Thr His Thr Ser Ser Ala Ile
3140 3145 3150
Pro Thr Leu Pro Val Ser Pro Asp Ala Ser Lys Met Leu Thr Ser
3155 3160 3165
Leu Val Ile Ser Ser Gly Thr Asp Ser Thr Thr Thr Phe Pro Thr
3170 3175 3180
Leu Thr Glu Thr Pro Tyr Glu Pro Glu Thr Thr Ala Ile Gln Leu
3185 3190 3195
Ile His Pro Ala Glu Thr Asn Thr Met Val Pro Arg Thr Thr Pro
3200 3205 3210
Lys Phe Ser His Ser Lys Ser Asp Thr Thr Leu Pro Val Ala Ile
3215 3220 3225
Thr Ser Pro Gly Pro Glu Ala Ser Ser Ala Val Ser Thr Thr Thr
3230 3235 3240
Ile Ser Pro Asp Met Ser Asp Leu Val Thr Ser Leu Val Pro Ser
3245 3250 3255
Ser Gly Thr Asp Thr Ser Thr Thr Phe Pro Thr Leu Ser Glu Thr
3260 3265 3270
Pro Tyr Glu Pro Glu Thr Thr Ala Thr Trp Leu Thr His Pro Ala
3275 3280 3285
Glu Thr Ser Thr Thr Val Ser Gly Thr Ile Pro Asn Phe Ser His
3290 3295 3300
Arg Gly Ser Asp Thr Ala Pro Ser Met Val Thr Ser Pro Gly Val
3305 3310 3315
Asp Thr Arg Ser Gly Val Pro Thr Thr Thr Ile Pro Pro Ser Ile
3320 3325 3330
Pro Gly Val Val Thr Ser Gln Val Thr Ser Ser Ala Thr Asp Thr
3335 3340 3345
Ser Thr Ala Ile Pro Thr Leu Thr Pro Ser Pro Gly Glu Pro Glu
3350 3355 3360
Thr Thr Ala Ser Ser Ala Thr His Pro Gly Thr Gln Thr Gly Phe
3365 3370 3375
Thr Val Pro Ile Arg Thr Val Pro Ser Ser Glu Pro Asp Thr Met
3380 3385 3390
Ala Ser Trp Val Thr His Pro Pro Gln Thr Ser Thr Pro Val Ser
3395 3400 3405
Arg Thr Thr Ser Ser Phe Ser His Ser Ser Pro Asp Ala Thr Pro
3410 3415 3420
Val Met Ala Thr Ser Pro Arg Thr Glu Ala Ser Ser Ala Val Leu
3425 3430 3435
Thr Thr Ile Ser Pro Gly Ala Pro Glu Met Val Thr Ser Gln Ile
3440 3445 3450
Thr Ser Ser Gly Ala Ala Thr Ser Thr Thr Val Pro Thr Leu Thr
3455 3460 3465
His Ser Pro Gly Met Pro Glu Thr Thr Ala Leu Leu Ser Thr His
3470 3475 3480
Pro Arg Thr Glu Thr Ser Lys Thr Phe Pro Ala Ser Thr Val Phe
3485 3490 3495
Pro Gln Val Ser Glu Thr Thr Ala Ser Leu Thr Ile Arg Pro Gly
3500 3505 3510
Ala Glu Thr Ser Thr Ala Leu Pro Thr Gln Thr Thr Ser Ser Leu
3515 3520 3525
Phe Thr Leu Leu Val Thr Gly Thr Ser Arg Val Asp Leu Ser Pro
3530 3535 3540
Thr Ala Ser Pro Gly Val Ser Ala Lys Thr Ala Pro Leu Ser Thr
3545 3550 3555
His Pro Gly Thr Glu Thr Ser Thr Met Ile Pro Thr Ser Thr Leu
3560 3565 3570
Ser Leu Gly Leu Leu Glu Thr Thr Gly Leu Leu Ala Thr Ser Ser
3575 3580 3585
Ser Ala Glu Thr Ser Thr Ser Thr Leu Thr Leu Thr Val Ser Pro
3590 3595 3600
Ala Val Ser Gly Leu Ser Ser Ala Ser Ile Thr Thr Asp Lys Pro
3605 3610 3615
Gln Thr Val Thr Ser Trp Asn Thr Glu Thr Ser Pro Ser Val Thr
3620 3625 3630
Ser Val Gly Pro Pro Glu Phe Ser Arg Thr Val Thr Gly Thr Thr
3635 3640 3645
Met Thr Leu Ile Pro Ser Glu Met Pro Thr Pro Pro Lys Thr Ser
3650 3655 3660
His Gly Glu Gly Val Ser Pro Thr Thr Ile Leu Arg Thr Thr Met
3665 3670 3675
Val Glu Ala Thr Asn Leu Ala Thr Thr Gly Ser Ser Pro Thr Val
3680 3685 3690
Ala Lys Thr Thr Thr Thr Phe Asn Thr Leu Ala Gly Ser Leu Phe
3695 3700 3705
Thr Pro Leu Thr Thr Pro Gly Met Ser Thr Leu Ala Ser Glu Ser
3710 3715 3720
Val Thr Ser Arg Thr Ser Tyr Asn His Arg Ser Trp Ile Ser Thr
3725 3730 3735
Thr Ser Ser Tyr Asn Arg Arg Tyr Trp Thr Pro Ala Thr Ser Thr
3740 3745 3750
Pro Val Thr Ser Thr Phe Ser Pro Gly Ile Ser Thr Ser Ser Ile
3755 3760 3765
Pro Ser Ser Thr Ala Ala Thr Val Pro Phe Met Val Pro Phe Thr
3770 3775 3780
Leu Asn Phe Thr Ile Thr Asn Leu Gln Tyr Glu Glu Asp Met Arg
3785 3790 3795
His Pro Gly Ser Arg Lys Phe Asn Ala Thr Glu Arg Glu Leu Gln
3800 3805 3810
Gly Leu Leu Lys Pro Leu Phe Arg Asn Ser Ser Leu Glu Tyr Leu
3815 3820 3825
Tyr Ser Gly Cys Arg Leu Ala Ser Leu Arg Pro Glu Lys Asp Ser
3830 3835 3840
Ser Ala Thr Ala Val Asp Ala Ile Cys Thr His Arg Pro Asp Pro
3845 3850 3855
Glu Asp Leu Gly Leu Asp Arg Glu Arg Leu Tyr Trp Glu Leu Ser
3860 3865 3870
Asn Leu Thr Asn Gly Ile Gln Glu Leu Gly Pro Tyr Thr Leu Asp
3875 3880 3885
Arg Asn Ser Leu Tyr Val Asn Gly Phe Thr His Arg Ser Ser Met
3890 3895 3900
Pro Thr Thr Ser Thr Pro Gly Thr Ser Thr Val Asp Val Gly Thr
3905 3910 3915
Ser Gly Thr Pro Ser Ser Ser Pro Ser Pro Thr Thr Ala Gly Pro
3920 3925 3930
Leu Leu Met Pro Phe Thr Leu Asn Phe Thr Ile Thr Asn Leu Gln
3935 3940 3945
Tyr Glu Glu Asp Met Arg Arg Thr Gly Ser Arg Lys Phe Asn Thr
3950 3955 3960
Met Glu Ser Val Leu Gln Gly Leu Leu Lys Pro Leu Phe Lys Asn
3965 3970 3975
Thr Ser Val Gly Pro Leu Tyr Ser Gly Cys Arg Leu Thr Leu Leu
3980 3985 3990
Arg Pro Glu Lys Asp Gly Ala Ala Thr Gly Val Asp Ala Ile Cys
3995 4000 4005
Thr His Arg Leu Asp Pro Lys Ser Pro Gly Leu Asn Arg Glu Gln
4010 4015 4020
Leu Tyr Trp Glu Leu Ser Lys Leu Thr Asn Asp Ile Glu Glu Leu
4025 4030 4035
Gly Pro Tyr Thr Leu Asp Arg Asn Ser Leu Tyr Val Asn Gly Phe
4040 4045 4050
Thr His Gln Ser Ser Val Ser Thr Thr Ser Thr Pro Gly Thr Ser
4055 4060 4065
Thr Val Asp Leu Arg Thr Ser Gly Thr Pro Ser Ser Leu Ser Ser
4070 4075 4080
Pro Thr Ile Met Ala Ala Gly Pro Leu Leu Val Pro Phe Thr Leu
4085 4090 4095
Asn Phe Thr Ile Thr Asn Leu Gln Tyr Gly Glu Asp Met Gly His
4100 4105 4110
Pro Gly Ser Arg Lys Phe Asn Thr Thr Glu Arg Val Leu Gln Gly
4115 4120 4125
Leu Leu Gly Pro Ile Phe Lys Asn Thr Ser Val Gly Pro Leu Tyr
4130 4135 4140
Ser Gly Cys Arg Leu Thr Ser Leu Arg Ser Glu Lys Asp Gly Ala
4145 4150 4155
Ala Thr Gly Val Asp Ala Ile Cys Ile His His Leu Asp Pro Lys
4160 4165 4170
Ser Pro Gly Leu Asn Arg Glu Arg Leu Tyr Trp Glu Leu Ser Gln
4175 4180 4185
Leu Thr Asn Gly Ile Lys Glu Leu Gly Pro Tyr Thr Leu Asp Arg
4190 4195 4200
Asn Ser Leu Tyr Val Asn Gly Phe Thr His Arg Thr Ser Val Pro
4205 4210 4215
Thr Thr Ser Thr Pro Gly Thr Ser Thr Val Asp Leu Gly Thr Ser
4220 4225 4230
Gly Thr Pro Phe Ser Leu Pro Ser Pro Ala Thr Ala Gly Pro Leu
4235 4240 4245
Leu Val Leu Phe Thr Leu Asn Phe Thr Ile Thr Asn Leu Lys Tyr
4250 4255 4260
Glu Glu Asp Met His Arg Pro Gly Ser Arg Lys Phe Asn Thr Thr
4265 4270 4275
Glu Arg Val Leu Gln Thr Leu Val Gly Pro Met Phe Lys Asn Thr
4280 4285 4290
Ser Val Gly Leu Leu Tyr Ser Gly Cys Arg Leu Thr Leu Leu Arg
4295 4300 4305
Ser Glu Lys Asp Gly Ala Ala Thr Gly Val Asp Ala Ile Cys Thr
4310 4315 4320
His Arg Leu Asp Pro Lys Ser Pro Gly Val Asp Arg Glu Gln Leu
4325 4330 4335
Tyr Trp Glu Leu Ser Gln Leu Thr Asn Gly Ile Lys Glu Leu Gly
4340 4345 4350
Pro Tyr Thr Leu Asp Arg Asn Ser Leu Tyr Val Asn Gly Phe Thr
4355 4360 4365
His Trp Ile Pro Val Pro Thr Ser Ser Thr Pro Gly Thr Ser Thr
4370 4375 4380
Val Asp Leu Gly Ser Gly Thr Pro Ser Ser Leu Pro Ser Pro Thr
4385 4390 4395
Ser Ala Thr Ala Gly Pro Leu Leu Val Pro Phe Thr Leu Asn Phe
4400 4405 4410
Thr Ile Thr Asn Leu Lys Tyr Glu Glu Asp Met His Cys Pro Gly
4415 4420 4425
Ser Arg Lys Phe Asn Thr Thr Glu Arg Val Leu Gln Ser Leu Leu
4430 4435 4440
Gly Pro Met Phe Lys Asn Thr Ser Val Gly Pro Leu Tyr Ser Gly
4445 4450 4455
Cys Arg Leu Thr Leu Leu Arg Ser Glu Lys Asp Gly Ala Ala Thr
4460 4465 4470
Gly Val Asp Ala Ile Cys Thr His Arg Leu Asp Pro Lys Ser Pro
4475 4480 4485
Gly Val Asp Arg Glu Gln Leu Tyr Trp Glu Leu Ser Gln Leu Thr
4490 4495 4500
Asn Gly Ile Lys Glu Leu Gly Pro Tyr Thr Leu Asp Arg Asn Ser
4505 4510 4515
Leu Tyr Val Asn Gly Phe Thr His Gln Thr Ser Ala Pro Asn Thr
4520 4525 4530
Ser Thr Pro Gly Thr Ser Thr Val Asp Leu Gly Thr Ser Gly Thr
4535 4540 4545
Pro Ser Ser Leu Pro Ser Pro Thr Ser Ala Gly Pro Leu Leu Val
4550 4555 4560
Pro Phe Thr Leu Asn Phe Thr Ile Thr Asn Leu Gln Tyr Glu Glu
4565 4570 4575
Asp Met His His Pro Gly Ser Arg Lys Phe Asn Thr Thr Glu Arg
4580 4585 4590
Val Leu Gln Gly Leu Leu Gly Pro Met Phe Lys Asn Thr Ser Val
4595 4600 4605
Gly Leu Leu Tyr Ser Gly Cys Arg Leu Thr Leu Leu Arg Pro Glu
4610 4615 4620
Lys Asn Gly Ala Ala Thr Gly Met Asp Ala Ile Cys Ser His Arg
4625 4630 4635
Leu Asp Pro Lys Ser Pro Gly Leu Asn Arg Glu Gln Leu Tyr Trp
4640 4645 4650
Glu Leu Ser Gln Leu Thr His Gly Ile Lys Glu Leu Gly Pro Tyr
4655 4660 4665
Thr Leu Asp Arg Asn Ser Leu Tyr Val Asn Gly Phe Thr His Arg
4670 4675 4680
Ser Ser Val Ala Pro Thr Ser Thr Pro Gly Thr Ser Thr Val Asp
4685 4690 4695
Leu Gly Thr Ser Gly Thr Pro Ser Ser Leu Pro Ser Pro Thr Thr
4700 4705 4710
Ala Val Pro Leu Leu Val Pro Phe Thr Leu Asn Phe Thr Ile Thr
4715 4720 4725
Asn Leu Gln Tyr Gly Glu Asp Met Arg His Pro Gly Ser Arg Lys
4730 4735 4740
Phe Asn Thr Thr Glu Arg Val Leu Gln Gly Leu Leu Gly Pro Leu
4745 4750 4755
Phe Lys Asn Ser Ser Val Gly Pro Leu Tyr Ser Gly Cys Arg Leu
4760 4765 4770
Ile Ser Leu Arg Ser Glu Lys Asp Gly Ala Ala Thr Gly Val Asp
4775 4780 4785
Ala Ile Cys Thr His His Leu Asn Pro Gln Ser Pro Gly Leu Asp
4790 4795 4800
Arg Glu Gln Leu Tyr Trp Gln Leu Ser Gln Met Thr Asn Gly Ile
4805 4810 4815
Lys Glu Leu Gly Pro Tyr Thr Leu Asp Arg Asn Ser Leu Tyr Val
4820 4825 4830
Asn Gly Phe Thr His Arg Ser Ser Gly Leu Thr Thr Ser Thr Pro
4835 4840 4845
Trp Thr Ser Thr Val Asp Leu Gly Thr Ser Gly Thr Pro Ser Pro
4850 4855 4860
Val Pro Ser Pro Thr Thr Ala Gly Pro Leu Leu Val Pro Phe Thr
4865 4870 4875
Leu Asn Phe Thr Ile Thr Asn Leu Gln Tyr Glu Glu Asp Met His
4880 4885 4890
Arg Pro Gly Ser Arg Lys Phe Asn Ala Thr Glu Arg Val Leu Gln
4895 4900 4905
Gly Leu Leu Ser Pro Ile Phe Lys Asn Ser Ser Val Gly Pro Leu
4910 4915 4920
Tyr Ser Gly Cys Arg Leu Thr Ser Leu Arg Pro Glu Lys Asp Gly
4925 4930 4935
Ala Ala Thr Gly Met Asp Ala Val Cys Leu Tyr His Pro Asn Pro
4940 4945 4950
Lys Arg Pro Gly Leu Asp Arg Glu Gln Leu Tyr Trp Glu Leu Ser
4955 4960 4965
Gln Leu Thr His Asn Ile Thr Glu Leu Gly Pro Tyr Ser Leu Asp
4970 4975 4980
Arg Asp Ser Leu Tyr Val Asn Gly Phe Thr His Gln Asn Ser Val
4985 4990 4995
Pro Thr Thr Ser Thr Pro Gly Thr Ser Thr Val Tyr Trp Ala Thr
5000 5005 5010
Thr Gly Thr Pro Ser Ser Phe Pro Gly His Thr Glu Pro Gly Pro
5015 5020 5025
Leu Leu Ile Pro Phe Thr Phe Asn Phe Thr Ile Thr Asn Leu His
5030 5035 5040
Tyr Glu Glu Asn Met Gln His Pro Gly Ser Arg Lys Phe Asn Thr
5045 5050 5055
Thr Glu Arg Val Leu Gln Gly Leu Leu Lys Pro Leu Phe Lys Asn
5060 5065 5070
Thr Ser Val Gly Pro Leu Tyr Ser Gly Cys Arg Leu Thr Leu Leu
5075 5080 5085
Arg Pro Glu Lys Gln Glu Ala Ala Thr Gly Val Asp Thr Ile Cys
5090 5095 5100
Thr His Arg Val Asp Pro Ile Gly Pro Gly Leu Asp Arg Glu Arg
5105 5110 5115
Leu Tyr Trp Glu Leu Ser Gln Leu Thr Asn Ser Ile Thr Glu Leu
5120 5125 5130
Gly Pro Tyr Thr Leu Asp Arg Asp Ser Leu Tyr Val Asn Gly Phe
5135 5140 5145
Asn Pro Trp Ser Ser Val Pro Thr Thr Ser Thr Pro Gly Thr Ser
5150 5155 5160
Thr Val His Leu Ala Thr Ser Gly Thr Pro Ser Ser Leu Pro Gly
5165 5170 5175
His Thr Ala Pro Val Pro Leu Leu Ile Pro Phe Thr Leu Asn Phe
5180 5185 5190
Thr Ile Thr Asn Leu His Tyr Glu Glu Asn Met Gln His Pro Gly
5195 5200 5205
Ser Arg Lys Phe Asn Thr Thr Glu Arg Val Leu Gln Gly Leu Leu
5210 5215 5220
Lys Pro Leu Phe Lys Ser Thr Ser Val Gly Pro Leu Tyr Ser Gly
5225 5230 5235
Cys Arg Leu Thr Leu Leu Arg Pro Glu Lys His Gly Ala Ala Thr
5240 5245 5250
Gly Val Asp Ala Ile Cys Thr Leu Arg Leu Asp Pro Thr Gly Pro
5255 5260 5265
Gly Leu Asp Arg Glu Arg Leu Tyr Trp Glu Leu Ser Gln Leu Thr
5270 5275 5280
Asn Ser Val Thr Glu Leu Gly Pro Tyr Thr Leu Asp Arg Asp Ser
5285 5290 5295
Leu Tyr Val Asn Gly Phe Thr His Arg Ser Ser Val Pro Thr Thr
5300 5305 5310
Ser Ile Pro Gly Thr Ser Ala Val His Leu Glu Thr Ser Gly Thr
5315 5320 5325
Pro Ala Ser Leu Pro Gly His Thr Ala Pro Gly Pro Leu Leu Val
5330 5335 5340
Pro Phe Thr Leu Asn Phe Thr Ile Thr Asn Leu Gln Tyr Glu Glu
5345 5350 5355
Asp Met Arg His Pro Gly Ser Arg Lys Phe Asn Thr Thr Glu Arg
5360 5365 5370
Val Leu Gln Gly Leu Leu Lys Pro Leu Phe Lys Ser Thr Ser Val
5375 5380 5385
Gly Pro Leu Tyr Ser Gly Cys Arg Leu Thr Leu Leu Arg Pro Glu
5390 5395 5400
Lys Arg Gly Ala Ala Thr Gly Val Asp Thr Ile Cys Thr His Arg
5405 5410 5415
Leu Asp Pro Leu Asn Pro Gly Leu Asp Arg Glu Gln Leu Tyr Trp
5420 5425 5430
Glu Leu Ser Lys Leu Thr Arg Gly Ile Ile Glu Leu Gly Pro Tyr
5435 5440 5445
Leu Leu Asp Arg Gly Ser Leu Tyr Val Asn Gly Phe Thr His Arg
5450 5455 5460
Asn Phe Val Pro Ile Thr Ser Thr Pro Gly Thr Ser Thr Val His
5465 5470 5475
Leu Gly Thr Ser Glu Thr Pro Ser Ser Leu Pro Arg Pro Ile Val
5480 5485 5490
Pro Gly Pro Leu Leu Val Pro Phe Thr Leu Asn Phe Thr Ile Thr
5495 5500 5505
Asn Leu Gln Tyr Glu Glu Ala Met Arg His Pro Gly Ser Arg Lys
5510 5515 5520
Phe Asn Thr Thr Glu Arg Val Leu Gln Gly Leu Leu Arg Pro Leu
5525 5530 5535
Phe Lys Asn Thr Ser Ile Gly Pro Leu Tyr Ser Ser Cys Arg Leu
5540 5545 5550
Thr Leu Leu Arg Pro Glu Lys Asp Lys Ala Ala Thr Arg Val Asp
5555 5560 5565
Ala Ile Cys Thr His His Pro Asp Pro Gln Ser Pro Gly Leu Asn
5570 5575 5580
Arg Glu Gln Leu Tyr Trp Glu Leu Ser Gln Leu Thr His Gly Ile
5585 5590 5595
Thr Glu Leu Gly Pro Tyr Thr Leu Asp Arg Asp Ser Leu Tyr Val
5600 5605 5610
Asp Gly Phe Thr His Trp Ser Pro Ile Pro Thr Thr Ser Thr Pro
5615 5620 5625
Gly Thr Ser Ile Val Asn Leu Gly Thr Ser Gly Ile Pro Pro Ser
5630 5635 5640
Leu Pro Glu Thr Thr Ala Thr Gly Pro Leu Leu Val Pro Phe Thr
5645 5650 5655
Leu Asn Phe Thr Ile Thr Asn Leu Gln Tyr Glu Glu Asn Met Gly
5660 5665 5670
His Pro Gly Ser Arg Lys Phe Asn Ile Thr Glu Ser Val Leu Gln
5675 5680 5685
Gly Leu Leu Lys Pro Leu Phe Lys Ser Thr Ser Val Gly Pro Leu
5690 5695 5700
Tyr Ser Gly Cys Arg Leu Thr Leu Leu Arg Pro Glu Lys Asp Gly
5705 5710 5715
Val Ala Thr Arg Val Asp Ala Ile Cys Thr His Arg Pro Asp Pro
5720 5725 5730
Lys Ile Pro Gly Leu Asp Arg Gln Gln Leu Tyr Trp Glu Leu Ser
5735 5740 5745
Gln Leu Thr His Ser Ile Thr Glu Leu Gly Pro Tyr Thr Leu Asp
5750 5755 5760
Arg Asp Ser Leu Tyr Val Asn Gly Phe Thr Gln Arg Ser Ser Val
5765 5770 5775
Pro Thr Thr Ser Thr Pro Gly Thr Phe Thr Val Gln Pro Glu Thr
5780 5785 5790
Ser Glu Thr Pro Ser Ser Leu Pro Gly Pro Thr Ala Thr Gly Pro
5795 5800 5805
Val Leu Leu Pro Phe Thr Leu Asn Phe Thr Ile Ile Asn Leu Gln
5810 5815 5820
Tyr Glu Glu Asp Met His Arg Pro Gly Ser Arg Lys Phe Asn Thr
5825 5830 5835
Thr Glu Arg Val Leu Gln Gly Leu Leu Met Pro Leu Phe Lys Asn
5840 5845 5850
Thr Ser Val Ser Ser Leu Tyr Ser Gly Cys Arg Leu Thr Leu Leu
5855 5860 5865
Arg Pro Glu Lys Asp Gly Ala Ala Thr Arg Val Asp Ala Val Cys
5870 5875 5880
Thr His Arg Pro Asp Pro Lys Ser Pro Gly Leu Asp Arg Glu Arg
5885 5890 5895
Leu Tyr Trp Lys Leu Ser Gln Leu Thr His Gly Ile Thr Glu Leu
5900 5905 5910
Gly Pro Tyr Thr Leu Asp Arg His Ser Leu Tyr Val Asn Gly Phe
5915 5920 5925
Thr His Gln Ser Ser Met Thr Thr Thr Arg Thr Pro Asp Thr Ser
5930 5935 5940
Thr Met His Leu Ala Thr Ser Arg Thr Pro Ala Ser Leu Ser Gly
5945 5950 5955
Pro Thr Thr Ala Ser Pro Leu Leu Val Leu Phe Thr Ile Asn Phe
5960 5965 5970
Thr Ile Thr Asn Leu Arg Tyr Glu Glu Asn Met His His Pro Gly
5975 5980 5985
Ser Arg Lys Phe Asn Thr Thr Glu Arg Val Leu Gln Gly Leu Leu
5990 5995 6000
Arg Pro Val Phe Lys Asn Thr Ser Val Gly Pro Leu Tyr Ser Gly
6005 6010 6015
Cys Arg Leu Thr Leu Leu Arg Pro Lys Lys Asp Gly Ala Ala Thr
6020 6025 6030
Lys Val Asp Ala Ile Cys Thr Tyr Arg Pro Asp Pro Lys Ser Pro
6035 6040 6045
Gly Leu Asp Arg Glu Gln Leu Tyr Trp Glu Leu Ser Gln Leu Thr
6050 6055 6060
His Ser Ile Thr Glu Leu Gly Pro Tyr Thr Leu Asp Arg Asp Ser
6065 6070 6075
Leu Tyr Val Asn Gly Phe Thr Gln Arg Ser Ser Val Pro Thr Thr
6080 6085 6090
Ser Ile Pro Gly Thr Pro Thr Val Asp Leu Gly Thr Ser Gly Thr
6095 6100 6105
Pro Val Ser Lys Pro Gly Pro Ser Ala Ala Ser Pro Leu Leu Val
6110 6115 6120
Leu Phe Thr Leu Asn Phe Thr Ile Thr Asn Leu Arg Tyr Glu Glu
6125 6130 6135
Asn Met Gln His Pro Gly Ser Arg Lys Phe Asn Thr Thr Glu Arg
6140 6145 6150
Val Leu Gln Gly Leu Leu Arg Ser Leu Phe Lys Ser Thr Ser Val
6155 6160 6165
Gly Pro Leu Tyr Ser Gly Cys Arg Leu Thr Leu Leu Arg Pro Glu
6170 6175 6180
Lys Asp Gly Thr Ala Thr Gly Val Asp Ala Ile Cys Thr His His
6185 6190 6195
Pro Asp Pro Lys Ser Pro Arg Leu Asp Arg Glu Gln Leu Tyr Trp
6200 6205 6210
Glu Leu Ser Gln Leu Thr His Asn Ile Thr Glu Leu Gly Pro Tyr
6215 6220 6225
Ala Leu Asp Asn Asp Ser Leu Phe Val Asn Gly Phe Thr His Arg
6230 6235 6240
Ser Ser Val Ser Thr Thr Ser Thr Pro Gly Thr Pro Thr Val Tyr
6245 6250 6255
Leu Gly Ala Ser Lys Thr Pro Ala Ser Ile Phe Gly Pro Ser Ala
6260 6265 6270
Ala Ser His Leu Leu Ile Leu Phe Thr Leu Asn Phe Thr Ile Thr
6275 6280 6285
Asn Leu Arg Tyr Glu Glu Asn Met Trp Pro Gly Ser Arg Lys Phe
6290 6295 6300
Asn Thr Thr Glu Arg Val Leu Gln Gly Leu Leu Arg Pro Leu Phe
6305 6310 6315
Lys Asn Thr Ser Val Gly Pro Leu Tyr Ser Gly Cys Arg Leu Thr
6320 6325 6330
Leu Leu Arg Pro Glu Lys Asp Gly Glu Ala Thr Gly Val Asp Ala
6335 6340 6345
Ile Cys Thr His Arg Pro Asp Pro Thr Gly Pro Gly Leu Asp Arg
6350 6355 6360
Glu Gln Leu Tyr Leu Glu Leu Ser Gln Leu Thr His Ser Ile Thr
6365 6370 6375
Glu Leu Gly Pro Tyr Thr Leu Asp Arg Asp Ser Leu Tyr Val Asn
6380 6385 6390
Gly Phe Thr His Arg Ser Ser Val Pro Thr Thr Ser Thr Gly Val
6395 6400 6405
Val Ser Glu Glu Pro Phe Thr Leu Asn Phe Thr Ile Asn Asn Leu
6410 6415 6420
Arg Tyr Met Ala Asp Met Gly Gln Pro Gly Ser Leu Lys Phe Asn
6425 6430 6435
Ile Thr Asp Asn Val Met Gln His Leu Leu Ser Pro Leu Phe Gln
6440 6445 6450
Arg Ser Ser Leu Gly Ala Arg Tyr Thr Gly Cys Arg Val Ile Ala
6455 6460 6465
Leu Arg Ser Val Lys Asn Gly Ala Glu Thr Arg Val Asp Leu Leu
6470 6475 6480
Cys Thr Tyr Leu Gln Pro Leu Ser Gly Pro Gly Leu Pro Ile Lys
6485 6490 6495
Gln Val Phe His Glu Leu Ser Gln Gln Thr His Gly Ile Thr Arg
6500 6505 6510
Leu Gly Pro Tyr Ser Leu Asp Lys Asp Ser Leu Tyr Leu Asn Gly
6515 6520 6525
Tyr Asn Glu Pro Gly Pro Asp Glu Pro Pro Thr Thr Pro Lys Pro
6530 6535 6540
Ala Thr Thr Phe Leu Pro Pro Leu Ser Glu Ala Thr Thr Ala Met
6545 6550 6555
Gly Tyr His Leu Lys Thr Leu Thr Leu Asn Phe Thr Ile Ser Asn
6560 6565 6570
Leu Gln Tyr Ser Pro Asp Met Gly Lys Gly Ser Ala Thr Phe Asn
6575 6580 6585
Ser Thr Glu Gly Val Leu Gln His Leu Leu Arg Pro Leu Phe Gln
6590 6595 6600
Lys Ser Ser Met Gly Pro Phe Tyr Leu Gly Cys Gln Leu Ile Ser
6605 6610 6615
Leu Arg Pro Glu Lys Asp Gly Ala Ala Thr Gly Val Asp Thr Thr
6620 6625 6630
Cys Thr Tyr His Pro Asp Pro Val Gly Pro Gly Leu Asp Ile Gln
6635 6640 6645
Gln Leu Tyr Trp Glu Leu Ser Gln Leu Thr His Gly Val Thr Gln
6650 6655 6660
Leu Gly Phe Tyr Val Leu Asp Arg Asp Ser Leu Phe Ile Asn Gly
6665 6670 6675
Tyr Ala Pro Gln Asn Leu Ser Ile Arg Gly Glu Tyr Gln Ile Asn
6680 6685 6690
Phe His Ile Val Asn Trp Asn Leu Ser Asn Pro Asp Pro Thr Ser
6695 6700 6705
Ser Glu Tyr Ile Thr Leu Leu Arg Asp Ile Gln Asp Lys Val Thr
6710 6715 6720
Thr Leu Tyr Lys Gly Ser Gln Leu His Asp Thr Phe Arg Phe Cys
6725 6730 6735
Leu Val Thr Asn Leu Thr Met Asp Ser Val Leu Val Thr Val Lys
6740 6745 6750
Ala Leu Phe Ser Ser Asn Leu Asp Pro Ser Leu Val Glu Gln Val
6755 6760 6765
Phe Leu Asp Lys Thr Leu Asn Ala Ser Phe His Trp Leu Gly Ser
6770 6775 6780
Thr Tyr Gln Leu Val Asp Ile His Val Thr Glu Met Glu Ser Ser
6785 6790 6795
Val Tyr Gln Pro Thr Ser Ser Ser Ser Thr Gln His Phe Tyr Leu
6800 6805 6810
Asn Phe Thr Ile Thr Asn Leu Pro Tyr Ser Gln Asp Lys Ala Gln
6815 6820 6825
Pro Gly Thr Thr Asn Tyr Gln Arg Asn Lys Arg Asn Ile Glu Asp
6830 6835 6840
Ala Leu Asn Gln Leu Phe Arg Asn Ser Ser Ile Lys Ser Tyr Phe
6845 6850 6855
Ser Asp Cys Gln Val Ser Thr Phe Arg Ser Val Pro Asn Arg His
6860 6865 6870
His Thr Gly Val Asp Ser Leu Cys Asn Phe Ser Pro Leu Ala Arg
6875 6880 6885
Arg Val Asp Arg Val Ala Ile Tyr Glu Glu Phe Leu Arg Met Thr
6890 6895 6900
Arg Asn Gly Thr Gln Leu Gln Asn Phe Thr Leu Asp Arg Ser Ser
6905 6910 6915
Val Leu Val Asp Gly Tyr Ser Pro Asn Arg Asn Glu Pro Leu Thr
6920 6925 6930
Gly Asn Ser Asp Leu Pro Phe Trp Ala Val Ile Leu Ile Gly Leu
6935 6940 6945
Ala Gly Leu Leu Gly Leu Ile Thr Cys Leu Ile Cys Gly Val Leu
6950 6955 6960
Val Thr Thr Arg Arg Arg Lys Lys Glu Gly Glu Tyr Asn Val Gln
6965 6970 6975
Gln Gln Cys Pro Gly Tyr Tyr Gln Ser His Leu Asp Leu Glu Asp
6980 6985 6990
Leu Gln
6995
<210> 53
<211> 2187
<212> DNA
<213> 智人
<400> 53
tgccaggctc tccaccccca cttcccaatt gaggaaaccg aggcagagga ggctcagcgc 60
cacgcactcc tctttctgcc tggccggcca ctcccgtctg ctgtgacgcg cggacagaga 120
gctaccggtg gacccacggt gcctccctcc ctgggatcta cacagaccat ggccttgcca 180
acggctcgac ccctgttggg gtcctgtggg acccccgccc tcggcagcct cctgttcctg 240
ctcttcagcc tcggatgggt gcagccctcg aggaccctgg ctggagagac agggcaggag 300
gctgcgcccc tggacggagt cctggccaac ccacctaaca tttccagcct ctcccctcgc 360
caactccttg gcttcccgtg tgcggaggtg tccggcctga gcacggagcg tgtccgggag 420
ctggctgtgg ccttggcaca gaagaatgtc aagctctcaa cagagcagct gcgctgtctg 480
gctcaccggc tctctgagcc ccccgaggac ctggacgccc tcccattgga cctgctgcta 540
ttcctcaacc cagatgcgtt ctcggggccc caggcctgca cccgtttctt ctcccgcatc 600
acgaaggcca atgtggacct gctcccgagg ggggctcccg agcgacagcg gctgctgcct 660
gcggctctgg cctgctgggg tgtgcggggg tctctgctga gcgaggctga tgtgcgggct 720
ctgggaggcc tggcttgcga cctgcctggg cgctttgtgg ccgagtcggc cgaagtgctg 780
ctaccccggc tggtgagctg cccgggaccc ctggaccagg accagcagga ggcagccagg 840
gcggctctgc agggcggggg acccccctac ggccccccgt cgacatggtc tgtctccacg 900
atggacgctc tgcggggcct gctgcccgtg ctgggccagc ccatcatccg cagcatcccg 960
cagggcatcg tggccgcgtg gcggcaacgc tcctctcggg acccatcctg gcggcagcct 1020
gaacggacca tcctccggcc gcggttccgg cgggaagtgg agaagacagc ctgtccttca 1080
ggcaagaagg cccgcgagat agacgagagc ctcatcttct acaagaagtg ggagctggaa 1140
gcctgcgtgg atgcggccct gctggccacc cagatggacc gcgtgaacgc catccccttc 1200
acctacgagc agctggacgt cctaaagcat aaactggatg agctctaccc acaaggttac 1260
cccgagtctg tgatccagca cctgggctac ctcttcctca agatgagccc tgaggacatt 1320
cgcaagtgga atgtgacgtc cctggagacc ctgaaggctt tgcttgaagt caacaaaggg 1380
cacgaaatga gtcctcaggt ggccaccctg atcgaccgct ttgtgaaggg aaggggccag 1440
ctagacaaag acaccctaga caccctgacc gccttctacc ctgggtacct gtgctccctc 1500
agccccgagg agctgagctc cgtgcccccc agcagcatct gggcggtcag gccccaggac 1560
ctggacacgt gtgacccaag gcagctggac gtcctctatc ccaaggcccg ccttgctttc 1620
cagaacatga acgggtccga atacttcgtg aagatccagt ccttcctggg tggggccccc 1680
acggaggatt tgaaggcgct cagtcagcag aatgtgagca tggacttggc cacgttcatg 1740
aagctgcgga cggatgcggt gctgccgttg actgtggctg aggtgcagaa acttctggga 1800
ccccacgtgg agggcctgaa ggcggaggag cggcaccgcc cggtgcggga ctggatccta 1860
cggcagcggc aggacgacct ggacacgctg gggctggggc tacagggcgg catccccaac 1920
ggctacctgg tcctagacct cagcatgcaa gaggccctct cggggacgcc ctgcctccta 1980
ggacctggac ctgttctcac cgtcctggca ctgctcctag cctccaccct ggcctgaggg 2040
ccccactccc ttgctggccc cagccctgct ggggatcccc gcctggccag gagcaggcac 2100
gggtggtccc cgttccaccc caagagaact cgcgctcagt aaacgggaac atgccccctg 2160
cagacacgta aaaaaaaaaa aaaaaaa 2187
<210> 54
<211> 622
<212> PRT
<213> 智人
<400> 54
Met Ala Leu Pro Thr Ala Arg Pro Leu Leu Gly Ser Cys Gly Thr Pro
1 5 10 15
Ala Leu Gly Ser Leu Leu Phe Leu Leu Phe Ser Leu Gly Trp Val Gln
20 25 30
Pro Ser Arg Thr Leu Ala Gly Glu Thr Gly Gln Glu Ala Ala Pro Leu
35 40 45
Asp Gly Val Leu Ala Asn Pro Pro Asn Ile Ser Ser Leu Ser Pro Arg
50 55 60
Gln Leu Leu Gly Phe Pro Cys Ala Glu Val Ser Gly Leu Ser Thr Glu
65 70 75 80
Arg Val Arg Glu Leu Ala Val Ala Leu Ala Gln Lys Asn Val Lys Leu
85 90 95
Ser Thr Glu Gln Leu Arg Cys Leu Ala His Arg Leu Ser Glu Pro Pro
100 105 110
Glu Asp Leu Asp Ala Leu Pro Leu Asp Leu Leu Leu Phe Leu Asn Pro
115 120 125
Asp Ala Phe Ser Gly Pro Gln Ala Cys Thr Arg Phe Phe Ser Arg Ile
130 135 140
Thr Lys Ala Asn Val Asp Leu Leu Pro Arg Gly Ala Pro Glu Arg Gln
145 150 155 160
Arg Leu Leu Pro Ala Ala Leu Ala Cys Trp Gly Val Arg Gly Ser Leu
165 170 175
Leu Ser Glu Ala Asp Val Arg Ala Leu Gly Gly Leu Ala Cys Asp Leu
180 185 190
Pro Gly Arg Phe Val Ala Glu Ser Ala Glu Val Leu Leu Pro Arg Leu
195 200 205
Val Ser Cys Pro Gly Pro Leu Asp Gln Asp Gln Gln Glu Ala Ala Arg
210 215 220
Ala Ala Leu Gln Gly Gly Gly Pro Pro Tyr Gly Pro Pro Ser Thr Trp
225 230 235 240
Ser Val Ser Thr Met Asp Ala Leu Arg Gly Leu Leu Pro Val Leu Gly
245 250 255
Gln Pro Ile Ile Arg Ser Ile Pro Gln Gly Ile Val Ala Ala Trp Arg
260 265 270
Gln Arg Ser Ser Arg Asp Pro Ser Trp Arg Gln Pro Glu Arg Thr Ile
275 280 285
Leu Arg Pro Arg Phe Arg Arg Glu Val Glu Lys Thr Ala Cys Pro Ser
290 295 300
Gly Lys Lys Ala Arg Glu Ile Asp Glu Ser Leu Ile Phe Tyr Lys Lys
305 310 315 320
Trp Glu Leu Glu Ala Cys Val Asp Ala Ala Leu Leu Ala Thr Gln Met
325 330 335
Asp Arg Val Asn Ala Ile Pro Phe Thr Tyr Glu Gln Leu Asp Val Leu
340 345 350
Lys His Lys Leu Asp Glu Leu Tyr Pro Gln Gly Tyr Pro Glu Ser Val
355 360 365
Ile Gln His Leu Gly Tyr Leu Phe Leu Lys Met Ser Pro Glu Asp Ile
370 375 380
Arg Lys Trp Asn Val Thr Ser Leu Glu Thr Leu Lys Ala Leu Leu Glu
385 390 395 400
Val Asn Lys Gly His Glu Met Ser Pro Gln Val Ala Thr Leu Ile Asp
405 410 415
Arg Phe Val Lys Gly Arg Gly Gln Leu Asp Lys Asp Thr Leu Asp Thr
420 425 430
Leu Thr Ala Phe Tyr Pro Gly Tyr Leu Cys Ser Leu Ser Pro Glu Glu
435 440 445
Leu Ser Ser Val Pro Pro Ser Ser Ile Trp Ala Val Arg Pro Gln Asp
450 455 460
Leu Asp Thr Cys Asp Pro Arg Gln Leu Asp Val Leu Tyr Pro Lys Ala
465 470 475 480
Arg Leu Ala Phe Gln Asn Met Asn Gly Ser Glu Tyr Phe Val Lys Ile
485 490 495
Gln Ser Phe Leu Gly Gly Ala Pro Thr Glu Asp Leu Lys Ala Leu Ser
500 505 510
Gln Gln Asn Val Ser Met Asp Leu Ala Thr Phe Met Lys Leu Arg Thr
515 520 525
Asp Ala Val Leu Pro Leu Thr Val Ala Glu Val Gln Lys Leu Leu Gly
530 535 540
Pro His Val Glu Gly Leu Lys Ala Glu Glu Arg His Arg Pro Val Arg
545 550 555 560
Asp Trp Ile Leu Arg Gln Arg Gln Asp Asp Leu Asp Thr Leu Gly Leu
565 570 575
Gly Leu Gln Gly Gly Ile Pro Asn Gly Tyr Leu Val Leu Asp Leu Ser
580 585 590
Met Gln Glu Ala Leu Ser Gly Thr Pro Cys Leu Leu Gly Pro Gly Pro
595 600 605
Val Leu Thr Val Leu Ala Leu Leu Leu Ala Ser Thr Leu Ala
610 615 620
<210> 55
<211> 4167
<212> DNA
<213> 智人
<400> 55
accttcgcca tatatacccg gggcgctgcg ctccacctgg ccgccgcctc cagcccagca 60
cctgcggagg gagcgctgac catggctccc tggcctgaat tgggagatgc ccagcccaac 120
cccgataagt acctcgaagg ggccgcaggt cagcagccca ctgcccctga taaaagcaaa 180
gagaccaaca aaacagataa cactgaggca cctgtaacca agattgaact tctgccgtcc 240
tactccacgg ctacactgat agatgagccc actgaggtgg atgacccctg gaacctaccc 300
actcttcagg actcggggat caagtggtca gagagagaca ccaaagggaa gattctctgt 360
ttcttccaag ggattgggag attgatttta cttctcggat ttctctactt tttcgtgtgc 420
tccctggata ttcttagtag cgccttccag ctggttggag gaaaaatggc aggacagttc 480
ttcagcaaca gctctattat gtccaaccct ttgttggggc tggtgatcgg ggtgctggtg 540
accgtcttgg tgcagagctc cagcacctca acgtccatcg ttgtcagcat ggtgtcctct 600
tcattgctca ctgttcgggc tgccatcccc attatcatgg gggccaacat tggaacgtca 660
atcaccaaca ctattgttgc gctcatgcag gtgggagatc ggagtgagtt cagaagagct 720
tttgcaggag ccactgtcca tgacttcttc aactggctgt ccgtgttggt gctcttgccc 780
gtggaggtgg ccacccatta cctcgagatc ataacccagc ttatagtgga gagcttccac 840
ttcaagaatg gagaagatgc cccagatctt ctgaaagtca tcactaagcc cttcacaaag 900
ctcattgtcc agctggataa aaaagttatc agccaaattg caatgaacga tgaaaaagcg 960
aaaaacaaga gtcttgtcaa gatttggtgc aaaactttta ccaacaagac ccagattaac 1020
gtcactgttc cctcgactgc taactgcacc tccccttccc tctgttggac ggatggcatc 1080
caaaactgga ccatgaagaa tgtgacctac aaggagaaca tcgccaaatg ccagcatatc 1140
tttgtgaatt tccacctccc ggatcttgct gtgggcacca tcttgctcat actctccctg 1200
ctggtcctct gtggttgcct gatcatgatt gtcaagatcc tgggctctgt gctcaagggg 1260
caggtcgcca ctgtcatcaa gaagaccatc aacactgatt tcccctttcc ctttgcatgg 1320
ttgactggct acctggccat cctcgtcggg gcaggcatga ccttcatcgt acagagcagc 1380
tctgtgttca cgtcggcctt gacccccctg attggaatcg gcgtgataac cattgagagg 1440
gcttatccac tcacgctggg ctccaacatc ggcaccacca ccaccgccat cctggccgcc 1500
ttagccagcc ctggcaatgc attgaggagt tcactccaga tcgccctgtg ccactttttc 1560
ttcaacatct ccggcatctt gctgtggtac ccgatcccgt tcactcgcct gcccatccgc 1620
atggccaagg ggctgggcaa catctctgcc aagtatcgct ggttcgccgt cttctacctg 1680
atcatcttct tcttcctgat cccgctgacg gtgtttggcc tctcgctggc cggctggcgg 1740
gtgctggttg gtgtcggggt tcccgtcgtc ttcatcatca tcctggtact gtgcctccga 1800
ctcctgcagt ctcgctgccc acgcgtcctg ccgaagaaac tccagaactg gaacttcctg 1860
ccgctgtgga tgcgctcgct gaagccctgg gatgccgtcg tctccaagtt caccggctgc 1920
ttccagatgc gctgctgctg ctgctgccgc gtgtgctgcc gcgcgtgctg cttgctgtgt 1980
ggctgcccca agtgctgccg ctgcagcaag tgctgcgagg acttggagga ggcgcaggag 2040
gggcaggatg tccctgtcaa ggctcctgag acctttgata acataaccat tagcagagag 2100
gctcagggtg aggtccctgc ctcggactca aagaccgaat gcacggcctt gtaggggacg 2160
ccccagattg tcagggatgg ggggatggtc cttgagtttt gcatgctctc ctccctccca 2220
cttctgcacc ctttcaccac ctcgaggaga tttgctcccc attagcgaat gaaattgatg 2280
cagtcctacc taactcgatt ccctttggct tggtggtagg cctgcagggc acttttattc 2340
caacccctgg tcactcagta atcttttact ccaggaaggc acaggatggt acctaaagag 2400
aattagagaa tgaacctggc gggacggatg tctaatcctg cgcctagctg ggttggtcag 2460
tagaacctat tttcagactc aaaaaccatc ttcagaaaga aaaggcccag ggaaggaatg 2520
tatgagaggc tctcccagat gaggaagtgt actctctatg actatcaagc tcaggcctct 2580
cccttttttt aaaccaaagt ctggcaacca agagcagcag ctccatggcc tccttgcccc 2640
agatcagcct gggtcagggg acatagtgtc attgtttgga aactgcagac cacaaggtgt 2700
gggtctatcc cacttcctag tgctccccac attccccatc agggcttcct cacgtggaca 2760
ggtgtgctag tccaggcagt tcacttgcag tttccttgtc ctcatgcttc ggggatggga 2820
gccacgcctg aactagagtt caggctggat acatgtgctc acctgctgct cttgtcttcc 2880
taagagacag agagtggggc agatggagga gaagaaagtg aggaatgagt agcatagcat 2940
tctgccaaaa gggccccaga ttcttaattt agcaaactaa gaagcccaat tcaaaagcat 3000
tgtggctaaa gtctaacgct cctctcttgg tcagataaca aaagccctcc ctgttggatc 3060
ttttgaaata aaacgtgcaa gttatccagg ctcgtagcct gcatgctgcc accttgaatc 3120
ccagggagta tctgcacctg gaatagctct ccacccctct ctgcctcctt actttctgtg 3180
caagatgact tcctgggtta acttccttct ttccatccac ccacccactg gaatctcttt 3240
ccaaacattt ttccattttc ccacagatgg gctttgatta gctgtcctct ctccatgcct 3300
gcaaagctcc agatttttgg ggaaagctgt acccaactgg actgcccagt gaactgggat 3360
cattaagtac agtcgagcac acgtgtgtgc atgggtcaaa ggggtgtgtt ccttctcatc 3420
ctagatgcct tctctgtgcc ttccacagcc tcctgcctga ttacaccact gcccccgccc 3480
caccctcagc catcccaatt cttcctggcc agtgcgctcc agccttatct aggaaaggag 3540
gagtgggtgt agccgtgcag caagattggg gcctccccca tcccagcttc tccaccatcc 3600
cagcaagtca ggatatcaga cagtcctccc ctgaccctcc cccttgtaga tatcaattcc 3660
caaacagagc caaatactct atatctatag tcacagccct gtacagcatt tttcataagt 3720
tatatagtaa atggtctgca tgatttgtgc ttctagtgct ctcatttgga aatgaggcag 3780
gcttcttcta tgaaatgtaa agaaagaaac cactttgtat attttgtaat accacctctg 3840
tggccatgcc tgccccgccc actctgtata tatgtaagtt aaacccgggc aggggctgtg 3900
gccgtctttg tactctggtg atttttaaaa attgaatctt tgtacttgca ttgattgtat 3960
aataattttg agaccaggtc tcgctgtgtt gctcaggctg gtctcaaact cctgagatca 4020
agcaatccgc ccacctcagc ctcccaaagt gctgagatca caggcgtgag ccaccaccag 4080
gcctgattgt aatttttttt tttttttttt tactggttat gggaagggag aaataaaatc 4140
atcaaaccca aaaaaaaaaa aaaaaaa 4167
<210> 56
<211> 690
<212> PRT
<213> 智人
<400> 56
Met Ala Pro Trp Pro Glu Leu Gly Asp Ala Gln Pro Asn Pro Asp Lys
1 5 10 15
Tyr Leu Glu Gly Ala Ala Gly Gln Gln Pro Thr Ala Pro Asp Lys Ser
20 25 30
Lys Glu Thr Asn Lys Thr Asp Asn Thr Glu Ala Pro Val Thr Lys Ile
35 40 45
Glu Leu Leu Pro Ser Tyr Ser Thr Ala Thr Leu Ile Asp Glu Pro Thr
50 55 60
Glu Val Asp Asp Pro Trp Asn Leu Pro Thr Leu Gln Asp Ser Gly Ile
65 70 75 80
Lys Trp Ser Glu Arg Asp Thr Lys Gly Lys Ile Leu Cys Phe Phe Gln
85 90 95
Gly Ile Gly Arg Leu Ile Leu Leu Leu Gly Phe Leu Tyr Phe Phe Val
100 105 110
Cys Ser Leu Asp Ile Leu Ser Ser Ala Phe Gln Leu Val Gly Gly Lys
115 120 125
Met Ala Gly Gln Phe Phe Ser Asn Ser Ser Ile Met Ser Asn Pro Leu
130 135 140
Leu Gly Leu Val Ile Gly Val Leu Val Thr Val Leu Val Gln Ser Ser
145 150 155 160
Ser Thr Ser Thr Ser Ile Val Val Ser Met Val Ser Ser Ser Leu Leu
165 170 175
Thr Val Arg Ala Ala Ile Pro Ile Ile Met Gly Ala Asn Ile Gly Thr
180 185 190
Ser Ile Thr Asn Thr Ile Val Ala Leu Met Gln Val Gly Asp Arg Ser
195 200 205
Glu Phe Arg Arg Ala Phe Ala Gly Ala Thr Val His Asp Phe Phe Asn
210 215 220
Trp Leu Ser Val Leu Val Leu Leu Pro Val Glu Val Ala Thr His Tyr
225 230 235 240
Leu Glu Ile Ile Thr Gln Leu Ile Val Glu Ser Phe His Phe Lys Asn
245 250 255
Gly Glu Asp Ala Pro Asp Leu Leu Lys Val Ile Thr Lys Pro Phe Thr
260 265 270
Lys Leu Ile Val Gln Leu Asp Lys Lys Val Ile Ser Gln Ile Ala Met
275 280 285
Asn Asp Glu Lys Ala Lys Asn Lys Ser Leu Val Lys Ile Trp Cys Lys
290 295 300
Thr Phe Thr Asn Lys Thr Gln Ile Asn Val Thr Val Pro Ser Thr Ala
305 310 315 320
Asn Cys Thr Ser Pro Ser Leu Cys Trp Thr Asp Gly Ile Gln Asn Trp
325 330 335
Thr Met Lys Asn Val Thr Tyr Lys Glu Asn Ile Ala Lys Cys Gln His
340 345 350
Ile Phe Val Asn Phe His Leu Pro Asp Leu Ala Val Gly Thr Ile Leu
355 360 365
Leu Ile Leu Ser Leu Leu Val Leu Cys Gly Cys Leu Ile Met Ile Val
370 375 380
Lys Ile Leu Gly Ser Val Leu Lys Gly Gln Val Ala Thr Val Ile Lys
385 390 395 400
Lys Thr Ile Asn Thr Asp Phe Pro Phe Pro Phe Ala Trp Leu Thr Gly
405 410 415
Tyr Leu Ala Ile Leu Val Gly Ala Gly Met Thr Phe Ile Val Gln Ser
420 425 430
Ser Ser Val Phe Thr Ser Ala Leu Thr Pro Leu Ile Gly Ile Gly Val
435 440 445
Ile Thr Ile Glu Arg Ala Tyr Pro Leu Thr Leu Gly Ser Asn Ile Gly
450 455 460
Thr Thr Thr Thr Ala Ile Leu Ala Ala Leu Ala Ser Pro Gly Asn Ala
465 470 475 480
Leu Arg Ser Ser Leu Gln Ile Ala Leu Cys His Phe Phe Phe Asn Ile
485 490 495
Ser Gly Ile Leu Leu Trp Tyr Pro Ile Pro Phe Thr Arg Leu Pro Ile
500 505 510
Arg Met Ala Lys Gly Leu Gly Asn Ile Ser Ala Lys Tyr Arg Trp Phe
515 520 525
Ala Val Phe Tyr Leu Ile Ile Phe Phe Phe Leu Ile Pro Leu Thr Val
530 535 540
Phe Gly Leu Ser Leu Ala Gly Trp Arg Val Leu Val Gly Val Gly Val
545 550 555 560
Pro Val Val Phe Ile Ile Ile Leu Val Leu Cys Leu Arg Leu Leu Gln
565 570 575
Ser Arg Cys Pro Arg Val Leu Pro Lys Lys Leu Gln Asn Trp Asn Phe
580 585 590
Leu Pro Leu Trp Met Arg Ser Leu Lys Pro Trp Asp Ala Val Val Ser
595 600 605
Lys Phe Thr Gly Cys Phe Gln Met Arg Cys Cys Cys Cys Cys Arg Val
610 615 620
Cys Cys Arg Ala Cys Cys Leu Leu Cys Gly Cys Pro Lys Cys Cys Arg
625 630 635 640
Cys Ser Lys Cys Cys Glu Asp Leu Glu Glu Ala Gln Glu Gly Gln Asp
645 650 655
Val Pro Val Lys Ala Pro Glu Thr Phe Asp Asn Ile Thr Ile Ser Arg
660 665 670
Glu Ala Gln Gly Glu Val Pro Ala Ser Asp Ser Lys Thr Glu Cys Thr
675 680 685
Ala Leu
690
<210> 57
<211> 4559
<212> DNA
<213> 智人
<400> 57
gcggccgccc cattcccaga ccggccgcca gcccatctgg ttagctcccg ccgctccgcg 60
ccgcccggga gtcgggagcc gcggggaacc gggcacctgc acccgcctct gggagtgagt 120
ggttccagct ggtgcctggc ctgtgtctct tggatgccct gtggcttcag tccgtctcct 180
gttgcccacc acctcgtccc tgggccgcct gataccccag cccaacagct aaggtgtgga 240
tggacagtag ggggctggct tctctcactg gtcaggggtc ttctcccctg tctgcctccc 300
ggagctagga ctgcagaggg gcctatcatg gtgcttgcag gccccctggc tgtctcgctg 360
ttgctgccca gcctcacact gctggtgtcc cacctctcca gctcccagga tgtctccagt 420
gagcccagca gtgagcagca gctgtgcgcc cttagcaagc accccaccgt ggcctttgaa 480
gacctgcagc cgtgggtctc taacttcacc taccctggag cccgggattt ctcccagctg 540
gctttggacc cctccgggaa ccagctcatc gtgggagcca ggaactacct cttcagactc 600
agccttgcca atgtctctct tcttcaggcc acagagtggg cctccagtga ggacacgcgc 660
cgctcctgcc aaagcaaagg gaagactgag gaggagtgtc agaactacgt gcgagtcctg 720
atcgtcgccg gccggaaggt gttcatgtgt ggaaccaatg ccttttcccc catgtgcacc 780
agcagacagg tggggaacct cagccggact actgagaaga tcaatggtgt ggcccgctgc 840
ccctatgacc cacgccacaa ctccacagct gtcatctcct cccaggggga gctctatgca 900
gccacggtca tcgacttctc aggtcgggac cctgccatct accgcagcct gggcagtggg 960
ccaccgcttc gcactgccca atataactcc aagtggctta atgagccaaa cttcgtggca 1020
gcctatgata ttgggctgtt tgcatacttc ttcctgcggg agaacgcagt ggagcacgac 1080
tgtggacgca ccgtgtactc tcgcgtggcc cgcgtgtgca agaatgacgt ggggggccga 1140
ttcctgctgg aggacacatg gaccacattc atgaaggccc ggctcaactg ctcccgcccg 1200
ggcgaggtcc ccttctacta taacgagctg cagagtgcct tccacttgcc agagcaggac 1260
ctcatctatg gagttttcac aaccaacgta aacagcatcg cggcttctgc tgtctgcgcc 1320
ttcaacctca gtgctatctc ccaggctttc aatggcccat ttcgctacca ggagaacccc 1380
agggctgcct ggctccccat agccaacccc atccccaatt tccagtgtgg caccctgcct 1440
gagaccggtc ccaacgagaa cctgacggag cgcagcctgc aggacgcgca gcgcctcttc 1500
ctgatgagcg aggccgtgca gccggtgaca cccgagccct gtgtcaccca ggacagcgtg 1560
cgcttctcac acctcgtggt ggacctggtg caggctaaag acacgctcta ccatgtactc 1620
tacattggca ccgagtcggg caccatcctg aaggcgctgt ccacggcgag ccgcagcctc 1680
cacggctgct acctggagga gctgcacgtg ctgccccccg ggcgccgcga gcccctgcgc 1740
agcctgcgca tcctgcacag cgcccgcgcg ctcttcgtgg ggctgagaga cggcgtcctg 1800
cgggtcccac tggagaggtg cgccgcctac cgcagccagg gggcatgcct gggggcccgg 1860
gacccgtact gtggctggga cgggaagcag caacgttgca gcacactcga ggacagctcc 1920
aacatgagcc tctggaccca gaacatcacc gcctgtcctg tgcggaatgt gacacgggat 1980
gggggcttcg gcccatggtc accatggcaa ccatgtgagc acttggatgg ggacaactca 2040
ggctcttgcc tgtgtcgagc tcgatcctgt gattcccctc gaccccgctg tgggggcctt 2100
gactgcctgg ggccagccat ccacatcgcc aactgctcca ggaatggggc gtggaccccg 2160
tggtcatcgt gggcgctgtg cagcacgtcc tgtggcatcg gcttccaggt ccgccagcga 2220
agttgcagca accctgctcc ccgccacggg ggccgcatct gcgtgggcaa gagccgggag 2280
gaacggttct gtaatgagaa cacgccttgc ccggtgccca tcttctgggc ttcctggggc 2340
tcctggagca agtgcagcag caactgtgga gggggcatgc agtcgcggcg tcgggcctgc 2400
gagaacggca actcctgcct gggctgcggc gtggagttca agacgtgcaa ccccgagggc 2460
tgccccgaag tgcggcgcaa caccccctgg acgccgtggc tgcccgtgaa cgtgacgcag 2520
ggcggggcac ggcaggagca gcggttccgc ttcacctgcc gcgcgcccct tgcagacccg 2580
cacggcctgc agttcggcag gagaaggacc gagacgagga cctgtcccgc ggacggctcc 2640
ggctcctgcg acaccgacgc cctggtggag gacctcctgc gcagcgggag cacctccccg 2700
cacacggtga gcgggggctg ggccgcctgg ggcccgtggt cgtcctgctc ccgggactgc 2760
gagctgggct tccgcgtccg caagagaacg tgcactaacc cggagccccg caacgggggc 2820
ctgccctgcg tgggcgatgc tgccgagtac caggactgca acccccaggc ttgcccagtt 2880
cggggtgctt ggtcctgctg gacctcatgg tctccatgct cagcttcctg tggtgggggt 2940
cactatcaac gcacccgttc ctgcaccagc cccgcaccct ccccaggtga ggacatctgt 3000
ctcgggctgc acacggagga ggcactatgt gccacacagg cctgcccaga aggctggtcg 3060
ccctggtctg agtggagtaa gtgcactgac gacggagccc agagccgaag ccggcactgt 3120
gaggagctcc tcccagggtc cagcgcctgt gctggaaaca gcagccagag ccgcccctgc 3180
ccctacagcg agattcccgt catcctgcca gcctccagca tggaggaggc caccggctgt 3240
gcagggttca atctcatcca cttggtggcc acgggcatct cctgcttctt gggctctggg 3300
ctcctgaccc tagcagtgta cctgtcttgc cagcactgcc agcgtcagtc ccaggagtcc 3360
acactggtcc atcctgccac ccccaaccat ttgcactaca agggcggagg caccccgaag 3420
aatgaaaagt acacacccat ggaattcaag accctgaaca agaataactt gatccctgat 3480
gacagagcca acttctaccc attgcagcag accaatgtgt acacgactac ttactaccca 3540
agccccctga acaaacacag cttccggccc gaggcctcac ctggacaacg gtgcttcccc 3600
aacagctgat accgccgtcc tggggacttg ggcttcttgc cttcataagg cacagagcag 3660
atggagatgg gacagtggag ccagtttggt tttctccctc tgcactaggc caagaacttg 3720
ctgccttgcc tgtggggggt cccatccggc ttcagagagc tctggctggc attgaccatg 3780
ggggaaaggg ctggtttcag gctgacatat ggccgcaggt ccagttcagc ccaggtctct 3840
catggttatc ttccaaccca ctgtcacgct gacactatgc tgccatgcct gggctgtgga 3900
cctactgggc atttgaggaa ctggagaatg gagatggcaa gagggcaggc ttttaagttt 3960
gggttggaga caacttcctg tggcccccac aagctgagtc tggccttctc cagctggccc 4020
caaaaaaggc ctttgctaca tcctgattat ctctgaaagt aatcaatcaa gtggctccag 4080
tagctctgga ttttctgcca gggctgggcc attgtggtgc tgccccagta tgacatggga 4140
ccaaggccag cgcaggttat ccacctctgc ctggaagtct atactctacc cagggcatcc 4200
ctctggtcag aggcagtgag tactgggaac tggaggctga cctgtgctta gaagtccttt 4260
aatctgggct ggtacaggcc tcagccttgc cctcaatgca cgaaaggtgg cccaggagag 4320
aggatcaatg ccacaggagg cagaagtctg gcctctgtgc ctctatggag actatcttcc 4380
agttgctgct caacagagtt gttggctgag acctgcttgg gagtctctgc tggcccttca 4440
tctgttcagg aacacacaca cacacacact cacacacgca cacacaatca caatttgcta 4500
cagcaacaaa aaagacattg ggctgtggca ttattaatta aagatgatat ccagtctcc 4559
<210> 58
<211> 1202
<212> PRT
<213> 智人
<400> 58
Ala Ala Ala Pro Phe Pro Asp Arg Pro Pro Ala His Leu Val Ser Ser
1 5 10 15
Arg Arg Ser Ala Pro Pro Gly Ser Arg Glu Pro Arg Gly Thr Gly His
20 25 30
Leu His Pro Pro Leu Gly Val Ser Gly Ser Ser Trp Cys Leu Ala Cys
35 40 45
Val Ser Trp Met Pro Cys Gly Phe Ser Pro Ser Pro Val Ala His His
50 55 60
Leu Val Pro Gly Pro Pro Asp Thr Pro Ala Gln Gln Leu Arg Cys Gly
65 70 75 80
Trp Thr Val Gly Gly Trp Leu Leu Ser Leu Val Arg Gly Leu Leu Pro
85 90 95
Cys Leu Pro Pro Gly Ala Arg Thr Ala Glu Gly Pro Ile Met Val Leu
100 105 110
Ala Gly Pro Leu Ala Val Ser Leu Leu Leu Pro Ser Leu Thr Leu Leu
115 120 125
Val Ser His Leu Ser Ser Ser Gln Asp Val Ser Ser Glu Pro Ser Ser
130 135 140
Glu Gln Gln Leu Cys Ala Leu Ser Lys His Pro Thr Val Ala Phe Glu
145 150 155 160
Asp Leu Gln Pro Trp Val Ser Asn Phe Thr Tyr Pro Gly Ala Arg Asp
165 170 175
Phe Ser Gln Leu Ala Leu Asp Pro Ser Gly Asn Gln Leu Ile Val Gly
180 185 190
Ala Arg Asn Tyr Leu Phe Arg Leu Ser Leu Ala Asn Val Ser Leu Leu
195 200 205
Gln Ala Thr Glu Trp Ala Ser Ser Glu Asp Thr Arg Arg Ser Cys Gln
210 215 220
Ser Lys Gly Lys Thr Glu Glu Glu Cys Gln Asn Tyr Val Arg Val Leu
225 230 235 240
Ile Val Ala Gly Arg Lys Val Phe Met Cys Gly Thr Asn Ala Phe Ser
245 250 255
Pro Met Cys Thr Ser Arg Gln Val Gly Asn Leu Ser Arg Thr Thr Glu
260 265 270
Lys Ile Asn Gly Val Ala Arg Cys Pro Tyr Asp Pro Arg His Asn Ser
275 280 285
Thr Ala Val Ile Ser Ser Gln Gly Glu Leu Tyr Ala Ala Thr Val Ile
290 295 300
Asp Phe Ser Gly Arg Asp Pro Ala Ile Tyr Arg Ser Leu Gly Ser Gly
305 310 315 320
Pro Pro Leu Arg Thr Ala Gln Tyr Asn Ser Lys Trp Leu Asn Glu Pro
325 330 335
Asn Phe Val Ala Ala Tyr Asp Ile Gly Leu Phe Ala Tyr Phe Phe Leu
340 345 350
Arg Glu Asn Ala Val Glu His Asp Cys Gly Arg Thr Val Tyr Ser Arg
355 360 365
Val Ala Arg Val Cys Lys Asn Asp Val Gly Gly Arg Phe Leu Leu Glu
370 375 380
Asp Thr Trp Thr Thr Phe Met Lys Ala Arg Leu Asn Cys Ser Arg Pro
385 390 395 400
Gly Glu Val Pro Phe Tyr Tyr Asn Glu Leu Gln Ser Ala Phe His Leu
405 410 415
Pro Glu Gln Asp Leu Ile Tyr Gly Val Phe Thr Thr Asn Val Asn Ser
420 425 430
Ile Ala Ala Ser Ala Val Cys Ala Phe Asn Leu Ser Ala Ile Ser Gln
435 440 445
Ala Phe Asn Gly Pro Phe Arg Tyr Gln Glu Asn Pro Arg Ala Ala Trp
450 455 460
Leu Pro Ile Ala Asn Pro Ile Pro Asn Phe Gln Cys Gly Thr Leu Pro
465 470 475 480
Glu Thr Gly Pro Asn Glu Asn Leu Thr Glu Arg Ser Leu Gln Asp Ala
485 490 495
Gln Arg Leu Phe Leu Met Ser Glu Ala Val Gln Pro Val Thr Pro Glu
500 505 510
Pro Cys Val Thr Gln Asp Ser Val Arg Phe Ser His Leu Val Val Asp
515 520 525
Leu Val Gln Ala Lys Asp Thr Leu Tyr His Val Leu Tyr Ile Gly Thr
530 535 540
Glu Ser Gly Thr Ile Leu Lys Ala Leu Ser Thr Ala Ser Arg Ser Leu
545 550 555 560
His Gly Cys Tyr Leu Glu Glu Leu His Val Leu Pro Pro Gly Arg Arg
565 570 575
Glu Pro Leu Arg Ser Leu Arg Ile Leu His Ser Ala Arg Ala Leu Phe
580 585 590
Val Gly Leu Arg Asp Gly Val Leu Arg Val Pro Leu Glu Arg Cys Ala
595 600 605
Ala Tyr Arg Ser Gln Gly Ala Cys Leu Gly Ala Arg Asp Pro Tyr Cys
610 615 620
Gly Trp Asp Gly Lys Gln Gln Arg Cys Ser Thr Leu Glu Asp Ser Ser
625 630 635 640
Asn Met Ser Leu Trp Thr Gln Asn Ile Thr Ala Cys Pro Val Arg Asn
645 650 655
Val Thr Arg Asp Gly Gly Phe Gly Pro Trp Ser Pro Trp Gln Pro Cys
660 665 670
Glu His Leu Asp Gly Asp Asn Ser Gly Ser Cys Leu Cys Arg Ala Arg
675 680 685
Ser Cys Asp Ser Pro Arg Pro Arg Cys Gly Gly Leu Asp Cys Leu Gly
690 695 700
Pro Ala Ile His Ile Ala Asn Cys Ser Arg Asn Gly Ala Trp Thr Pro
705 710 715 720
Trp Ser Ser Trp Ala Leu Cys Ser Thr Ser Cys Gly Ile Gly Phe Gln
725 730 735
Val Arg Gln Arg Ser Cys Ser Asn Pro Ala Pro Arg His Gly Gly Arg
740 745 750
Ile Cys Val Gly Lys Ser Arg Glu Glu Arg Phe Cys Asn Glu Asn Thr
755 760 765
Pro Cys Pro Val Pro Ile Phe Trp Ala Ser Trp Gly Ser Trp Ser Lys
770 775 780
Cys Ser Ser Asn Cys Gly Gly Gly Met Gln Ser Arg Arg Arg Ala Cys
785 790 795 800
Glu Asn Gly Asn Ser Cys Leu Gly Cys Gly Val Glu Phe Lys Thr Cys
805 810 815
Asn Pro Glu Gly Cys Pro Glu Val Arg Arg Asn Thr Pro Trp Thr Pro
820 825 830
Trp Leu Pro Val Asn Val Thr Gln Gly Gly Ala Arg Gln Glu Gln Arg
835 840 845
Phe Arg Phe Thr Cys Arg Ala Pro Leu Ala Asp Pro His Gly Leu Gln
850 855 860
Phe Gly Arg Arg Arg Thr Glu Thr Arg Thr Cys Pro Ala Asp Gly Ser
865 870 875 880
Gly Ser Cys Asp Thr Asp Ala Leu Val Glu Asp Leu Leu Arg Ser Gly
885 890 895
Ser Thr Ser Pro His Thr Val Ser Gly Gly Trp Ala Ala Trp Gly Pro
900 905 910
Trp Ser Ser Cys Ser Arg Asp Cys Glu Leu Gly Phe Arg Val Arg Lys
915 920 925
Arg Thr Cys Thr Asn Pro Glu Pro Arg Asn Gly Gly Leu Pro Cys Val
930 935 940
Gly Asp Ala Ala Glu Tyr Gln Asp Cys Asn Pro Gln Ala Cys Pro Val
945 950 955 960
Arg Gly Ala Trp Ser Cys Trp Thr Ser Trp Ser Pro Cys Ser Ala Ser
965 970 975
Cys Gly Gly Gly His Tyr Gln Arg Thr Arg Ser Cys Thr Ser Pro Ala
980 985 990
Pro Ser Pro Gly Glu Asp Ile Cys Leu Gly Leu His Thr Glu Glu Ala
995 1000 1005
Leu Cys Ala Thr Gln Ala Cys Pro Glu Gly Trp Ser Pro Trp Ser
1010 1015 1020
Glu Trp Ser Lys Cys Thr Asp Asp Gly Ala Gln Ser Arg Ser Arg
1025 1030 1035
His Cys Glu Glu Leu Leu Pro Gly Ser Ser Ala Cys Ala Gly Asn
1040 1045 1050
Ser Ser Gln Ser Arg Pro Cys Pro Tyr Ser Glu Ile Pro Val Ile
1055 1060 1065
Leu Pro Ala Ser Ser Met Glu Glu Ala Thr Gly Cys Ala Gly Phe
1070 1075 1080
Asn Leu Ile His Leu Val Ala Thr Gly Ile Ser Cys Phe Leu Gly
1085 1090 1095
Ser Gly Leu Leu Thr Leu Ala Val Tyr Leu Ser Cys Gln His Cys
1100 1105 1110
Gln Arg Gln Ser Gln Glu Ser Thr Leu Val His Pro Ala Thr Pro
1115 1120 1125
Asn His Leu His Tyr Lys Gly Gly Gly Thr Pro Lys Asn Glu Lys
1130 1135 1140
Tyr Thr Pro Met Glu Phe Lys Thr Leu Asn Lys Asn Asn Leu Ile
1145 1150 1155
Pro Asp Asp Arg Ala Asn Phe Tyr Pro Leu Gln Gln Thr Asn Val
1160 1165 1170
Tyr Thr Thr Thr Tyr Tyr Pro Ser Pro Leu Asn Lys His Ser Phe
1175 1180 1185
Arg Pro Glu Ala Ser Pro Gly Gln Arg Cys Phe Pro Asn Ser
1190 1195 1200
<210> 59
<211> 1524
<212> DNA
<213> 智人
<400> 59
gcggcagcag cgcgggcccc agcagcctcg gcagccacag ccgctgcagc cggggcagcc 60
tccgctgctg tcgcctcctc tgatgcgctt gccctctccc ggccccggga ctccgggaga 120
atgtgggtcc taggcatcgc ggcaactttt tgcggattgt tcttgcttcc aggctttgcg 180
ctgcaaatcc agtgctacca gtgtgaagaa ttccagctga acaacgactg ctcctccccc 240
gagttcattg tgaattgcac ggtgaacgtt caagacatgt gtcagaaaga agtgatggag 300
caaagtgccg ggatcatgta ccgcaagtcc tgtgcatcat cagcggcctg tctcatcgcc 360
tctgccgggt accagtcctt ctgctcccca gggaaactga actcagtttg catcagctgc 420
tgcaacaccc ctctttgtaa cgggccaagg cccaagaaaa ggggaagttc tgcctcggcc 480
ctcaggccag ggctccgcac caccatcctg ttcctcaaat tagccctctt ctcggcacac 540
tgctgaagct gaaggagatg ccaccccctc ctgcattgtt cttccagccc tcgcccccaa 600
ccccccacct ccctgagtga gtttcttctg ggtgtccttt tattctgggt agggagcggg 660
agtccgtgtt ctcttttgtt cctgtgcaaa taatgaaaga gctcggtaaa gcattctgaa 720
taaattcagc ctgactgaat tttcagtatg tacttgaagg aaggaggtgg agtgaaagtt 780
cacccccatg tctgtgtaac cggagtcaag gccaggctgg cagagtcagt ccttagaagt 840
cactgaggtg ggcatctgcc ttttgtaaag cctccagtgt ccattccatc cctgatgggg 900
gcatagtttg agactgcaga gtgagagtga cgttttctta gggctggagg gccagttccc 960
actcaaggct ccctcgcttg acattcaaac ttcatgctcc tgaaaaccat tctctgcagc 1020
agaattggct ggtttcgcgc ctgagttggg ctctagtgac tcgagactca atgactggga 1080
cttagactgg ggctcggcct cgctctgaaa agtgcttaag aaaatcttct cagttctcct 1140
tgcagaggac tggcgccggg acgcgaagag caacgggcgc tgcacaaagc gggcgctgtc 1200
ggtggtggag tgcgcatgta cgcgcaggcg cttctcgtgg ttggcgtgct gcagcgacag 1260
gcggcagcac agcacctgca cgaacacccg ccgaaactgc tgcgaggaca ccgtgtacag 1320
gagcgggttg atgaccgagc tgaggtagaa aaacgtctcc gagaagggga ggaggatcat 1380
gtacgcccgg aagtaggacc tcgtccagtc gtgcttgggt ttggccgcag ccatgatcct 1440
ccgaatctgg ttgggcatcc agcatacggc caatgtcaca acaatcagcc ctgggcagac 1500
acgagcagga gggagagaca gaga 1524
<210> 60
<211> 141
<212> PRT
<213> 智人
<400> 60
Met Trp Val Leu Gly Ile Ala Ala Thr Phe Cys Gly Leu Phe Leu Leu
1 5 10 15
Pro Gly Phe Ala Leu Gln Ile Gln Cys Tyr Gln Cys Glu Glu Phe Gln
20 25 30
Leu Asn Asn Asp Cys Ser Ser Pro Glu Phe Ile Val Asn Cys Thr Val
35 40 45
Asn Val Gln Asp Met Cys Gln Lys Glu Val Met Glu Gln Ser Ala Gly
50 55 60
Ile Met Tyr Arg Lys Ser Cys Ala Ser Ser Ala Ala Cys Leu Ile Ala
65 70 75 80
Ser Ala Gly Tyr Gln Ser Phe Cys Ser Pro Gly Lys Leu Asn Ser Val
85 90 95
Cys Ile Ser Cys Cys Asn Thr Pro Leu Cys Asn Gly Pro Arg Pro Lys
100 105 110
Lys Arg Gly Ser Ser Ala Ser Ala Leu Arg Pro Gly Leu Arg Thr Thr
115 120 125
Ile Leu Phe Leu Lys Leu Ala Leu Phe Ser Ala His Cys
130 135 140
<210> 61
<211> 1329
<212> DNA
<213> 智人
<400> 61
atgcagccgc ctccaagtct gtgcggacgc gccctggttg cgctggttct tgcctgcggc 60
ctgtcgcgga tctggggaga ggagagaggc ttcccgcctg acagggccac tccgcttttg 120
caaaccgcag agataatgac gccacccact aagaccttat ggcccaaggg ttccaacgcc 180
agtctggcgc ggtcgttggc acctgcggag gtgcctaaag gagacaggac ggcaggatct 240
ccgccacgca ccatctcccc tcccccgtgc caaggaccca tcgagatcaa ggagactttc 300
aaatacatca acacggttgt gtcctgcctt gtgttcgtgc tggggatcat cgggaactcc 360
acacttctga gaattatcta caagaacaag tgcatgcgaa acggtcccaa tatcttgatc 420
gccagcttgg ctctgggaga cctgctgcac atcgtcattg acatccctat caatgtctac 480
aagctgctgg cagaggactg gccatttgga gctgagatgt gtaagctggt gcctttcata 540
cagaaagcct ccgtgggaat cactgtgctg agtctatgtg ctctgagtat tgacagatat 600
cgagctgttg cttcttggag tagaattaaa ggaattgggg ttccaaaatg gacagcagta 660
gaaattgttt tgatttgggt ggtctctgtg gttctggctg tccctgaagc cataggtttt 720
gatataatta cgatggacta caaaggaagt tatctgcgaa tctgcttgct tcatcccgtt 780
cagaagacag ctttcatgca gttttacaag acagcaaaag attggtggct attcagtttc 840
tatttctgct tgccattggc catcactgca tttttttata cactaatgac ctgtgaaatg 900
ttgagaaaga aaagtggcat gcagattgct ttaaatgatc acctaaagca gagacgggaa 960
gtggccaaaa ccgtcttttg cctggtcctt gtctttgccc tctgctggct tccccttcac 1020
ctcagcagga ttctgaagct cactctttat aatcagaatg atcccaatag atgtgaactt 1080
ttgagctttc tgttggtatt ggactatatt ggtatcaaca tggcttcact gaattcctgc 1140
attaacccaa ttgctctgta tttggtgagc aaaagattca aaaactgctt taagtcatgc 1200
ttatgctgct ggtgccagtc atttgaagaa aaacagtcct tggaggaaaa gcagtcgtgc 1260
ttaaagttca aagctaatga tcacggatat gacaacttcc gttccagtaa taaatacagc 1320
tcatcttga 1329
<210> 62
<211> 442
<212> PRT
<213> 智人
<400> 62
Met Gln Pro Pro Pro Ser Leu Cys Gly Arg Ala Leu Val Ala Leu Val
1 5 10 15
Leu Ala Cys Gly Leu Ser Arg Ile Trp Gly Glu Glu Arg Gly Phe Pro
20 25 30
Pro Asp Arg Ala Thr Pro Leu Leu Gln Thr Ala Glu Ile Met Thr Pro
35 40 45
Pro Thr Lys Thr Leu Trp Pro Lys Gly Ser Asn Ala Ser Leu Ala Arg
50 55 60
Ser Leu Ala Pro Ala Glu Val Pro Lys Gly Asp Arg Thr Ala Gly Ser
65 70 75 80
Pro Pro Arg Thr Ile Ser Pro Pro Pro Cys Gln Gly Pro Ile Glu Ile
85 90 95
Lys Glu Thr Phe Lys Tyr Ile Asn Thr Val Val Ser Cys Leu Val Phe
100 105 110
Val Leu Gly Ile Ile Gly Asn Ser Thr Leu Leu Arg Ile Ile Tyr Lys
115 120 125
Asn Lys Cys Met Arg Asn Gly Pro Asn Ile Leu Ile Ala Ser Leu Ala
130 135 140
Leu Gly Asp Leu Leu His Ile Val Ile Asp Ile Pro Ile Asn Val Tyr
145 150 155 160
Lys Leu Leu Ala Glu Asp Trp Pro Phe Gly Ala Glu Met Cys Lys Leu
165 170 175
Val Pro Phe Ile Gln Lys Ala Ser Val Gly Ile Thr Val Leu Ser Leu
180 185 190
Cys Ala Leu Ser Ile Asp Arg Tyr Arg Ala Val Ala Ser Trp Ser Arg
195 200 205
Ile Lys Gly Ile Gly Val Pro Lys Trp Thr Ala Val Glu Ile Val Leu
210 215 220
Ile Trp Val Val Ser Val Val Leu Ala Val Pro Glu Ala Ile Gly Phe
225 230 235 240
Asp Ile Ile Thr Met Asp Tyr Lys Gly Ser Tyr Leu Arg Ile Cys Leu
245 250 255
Leu His Pro Val Gln Lys Thr Ala Phe Met Gln Phe Tyr Lys Thr Ala
260 265 270
Lys Asp Trp Trp Leu Phe Ser Phe Tyr Phe Cys Leu Pro Leu Ala Ile
275 280 285
Thr Ala Phe Phe Tyr Thr Leu Met Thr Cys Glu Met Leu Arg Lys Lys
290 295 300
Ser Gly Met Gln Ile Ala Leu Asn Asp His Leu Lys Gln Arg Arg Glu
305 310 315 320
Val Ala Lys Thr Val Phe Cys Leu Val Leu Val Phe Ala Leu Cys Trp
325 330 335
Leu Pro Leu His Leu Ser Arg Ile Leu Lys Leu Thr Leu Tyr Asn Gln
340 345 350
Asn Asp Pro Asn Arg Cys Glu Leu Leu Ser Phe Leu Leu Val Leu Asp
355 360 365
Tyr Ile Gly Ile Asn Met Ala Ser Leu Asn Ser Cys Ile Asn Pro Ile
370 375 380
Ala Leu Tyr Leu Val Ser Lys Arg Phe Lys Asn Cys Phe Lys Ser Cys
385 390 395 400
Leu Cys Cys Trp Cys Gln Ser Phe Glu Glu Lys Gln Ser Leu Glu Glu
405 410 415
Lys Gln Ser Cys Leu Lys Phe Lys Ala Asn Asp His Gly Tyr Asp Asn
420 425 430
Phe Arg Ser Ser Asn Lys Tyr Ser Ser Ser
435 440
<210> 63
<211> 4573
<212> DNA
<213> 智人
<400> 63
agaacaccaa ttacaaacca caggcttcct gctctaggga gttgatccag aattgtcttt 60
ctgaaaggaa gcactcggaa tccttccgaa ctttccaagt ccatccatga ttcagagata 120
ctgccttctc tctctctggg attttatgtg tttctgatag tgaattgttg atgtatttgc 180
tactttgctt cttttctctt tcaagacttg atcattttat atgctgtttg gagaaaaaaa 240
gaacttttgt tagcaaggag gtttcagaaa tgattttgga ttttctgtaa gtgtttaatt 300
tagttctagg ggacagcatc tctcatcccg gagtaaattt ctgcctttga cctgcatgga 360
ttattttttc aggctgcgga atttctcggc acctacctgt agtatggggc acttggtttg 420
gttgcagagt aagaaggtgg aagaatgagc tgtacttggt taagcagttg aaaccttttt 480
tgagcaggat ctgtaaaagc ataattgaat ttgtttcacc cccgtggatt ccagtgggcc 540
cgacagcgca acagtgcctg gcaacttgat gcatatggaa gagcaatgcc aagtgatctg 600
acataataca aattcacgaa gtgacattca atcacaagca aagttggaaa ttccaaagag 660
aagtggtgag atctttacta gtcacagtga agatgggaga aaatgacata cctgcagcag 720
atgtgggctg aaaatatcct cttctctgcc caatcaggaa tgctacctgt ttttgggaat 780
aaactttaga gaaaggaagg gccaaaacta cgacttggct ttctgaaacg gaagcataaa 840
tgttcttttc ctccatttgt ctggatctga gaacctgcat ttggtattag ctagtggaag 900
cagtatgtat ggttgaagtg cattgctgca gctggtagca tgagtggtgg ccaccagctg 960
cagctggctg ccctctggcc ctggctgctg atggctaccc tgcaggcagg ctttggacgc 1020
acaggactgg tactggcagc agcggtggag tctgaaagat cagcagaaca gaaagctatt 1080
atcagagtga tccccttgaa aatggacccc acaggaaaac tgaatctcac tttggaaggt 1140
gtgtttgctg gtgttgctga aataactcca gcagaaggaa aattaatgca gtcccacccg 1200
ctgtacctgt gcaatgccag tgatgacgac aatctggagc ctggattcat cagcatcgtc 1260
aagctggaga gtcctcgacg ggccccccgc ccctgcctgt cactggctag caaggctcgg 1320
atggcgggtg agcgaggagc cagtgctgtc ctctttgaca tcactgagga tcgagctgct 1380
gctgagcagc tgcagcagcc gctggggctg acctggccag tggtgttgat ctggggtaat 1440
gacgctgaga agctgatgga gtttgtgtac aagaaccaaa aggcccatgt gaggattgag 1500
ctgaaggagc ccccggcctg gccagattat gatgtgtgga tcctaatgac agtggtgggc 1560
accatctttg tgatcatcct ggcttcggtg ctgcgcatcc ggtgccgccc ccgccacagc 1620
aggccggatc cgcttcagca gagaacagcc tgggccatca gccagctggc caccaggagg 1680
taccaggcca gctgcaggca ggcccggggt gagtggccag actcagggag cagctgcagc 1740
tcagcccctg tgtgtgccat ctgtctggag gagttctctg aggggcagga gctacgggtc 1800
atttcctgcc tccatgagtt ccatcgtaac tgtgtggacc cctggttaca tcagcatcgg 1860
acttgccccc tctgcatgtt caacatcaca gagggagatt cattttccca gtccctggga 1920
ccctctcgat cttaccaaga accaggtcga agactccacc tcattcgcca gcatcccggc 1980
catgcccact accacctccc tgctgcctac ctgttgggcc cttcccggag tgcagtggct 2040
cggcccccac gacctggtcc cttcctgcca tcccaggagc caggcatggg ccctcggcat 2100
caccgcttcc ccagagctgc acatccccgg gctccaggag agcagcagcg cctggcagga 2160
gcccagcacc cctatgcaca aggctgggga ctgagccacc tccaatccac ctcacagcac 2220
cctgctgctt gcccagtgcc cctacgccgg gccaggcccc ctgacagcag tggatctgga 2280
gaaagctatt gcacagaacg cagtgggtac ctggcagatg ggccagccag tgactccagc 2340
tcagggccct gtcatggctc ttccagtgac tctgtggtca actgcacgga catcagccta 2400
cagggggtcc atggcagcag ttctactttc tgcagctccc taagcagtga ctttgacccc 2460
ctagtgtact gcagccctaa aggggatccc cagcgagtgg acatgcagcc tagtgtgacc 2520
tctcggcctc gttccttgga ctcggtggtg cccacagggg aaacccaggt ttccagccat 2580
gtccactacc accgccaccg gcaccaccac tacaaaaagc ggttccagtg gcatggcagg 2640
aagcctggcc cagaaaccgg agtcccccag tccaggcctc ctattcctcg gacacagccc 2700
cagccagagc caccttctcc tgatcagcaa gtcaccagat ccaactcagc agccccttcg 2760
gggcggctct ctaacccaca gtgccccagg gccctccctg agccagcccc tggcccagtt 2820
gacgcctcca gcatctgccc cagtaccagc agtctgttca acttgcaaaa atccagcctc 2880
tctgcccgac acccacagag gaaaaggcgg gggggtccct ccgagcccac ccctggctct 2940
cggccccagg atgcaactgt gcacccagct tgccagattt ttccccatta cacccccagt 3000
gtggcatatc cttggtcccc agaggcacac cccttgatct gtggacctcc aggcctggac 3060
aagaggctgc taccagaaac cccaggcccc tgttactcaa attcacagcc agtgtggttg 3120
tgcctgactc ctcgccagcc cctggaacca catccacctg gggaggggcc ttctgaatgg 3180
agttctgaca ccgcagaggg caggccatgc ccttatccgc actgccaggt gctgtcggcc 3240
cagcctggct cagaggagga actcgaggag ctgtgtgaac aggctgtgtg agatgttcag 3300
gcctagctcc aaccaagagt gtgctccaga tgtgtttggg ccctacctgg cacagagtcc 3360
tgctcctggg aaaggaaagg accacagcaa acaccattct ttttgccgta cttcctagaa 3420
gcactggaag aggactggtg atggtggagg gtgagagggt gccgtttcct gctccagctc 3480
cagaccttgt ctgcagaaaa catctgcagt gcagcaaatc catgtccagc caggcaacca 3540
gctgctgcct gtggcgtgtg tgggctggat cccttgaagg ctgagttttt gagggcagaa 3600
agctagctat gggtagccag gtgttacaaa ggtgctgctc cttctccaac ccctacttgg 3660
tttccctcac cccaagcctc atgttcatac cagccagtgg gttcagcaga acgcatgaca 3720
ccttatcacc tccctccttg ggtgagctct gaacaccagc tttggcccct ccacagtaag 3780
gctgctacat caggggcaac cctggctcta tcattttcct tttttgccaa aaggaccagt 3840
agcataggtg agccctgagc actaaaagga ggggtccctg aagctttccc actatagtgt 3900
ggagttctgt ccctgaggtg ggtacagcag ccttggttcc tctgggggtt gagaataaga 3960
atagtgggga gggaaaaact cctccttgaa gatttcctgt ctcagagtcc cagagaggta 4020
gaaaggagga atttctgctg gactttatct gggcagagga aggatggaat gaaggtagaa 4080
aaggcagaat tacagctgag cggggacaac aaagagttct tctctgggaa aagttttgtc 4140
ttagagcaag gatggaaaat ggggacaaca aaggaaaagc aaagtgtgac ccttgggttt 4200
ggacagccca gaggcccagc tccccagtat aagccataca ggccagggac ccacaggaga 4260
gtggattaga gcacaagtct ggcctcactg agtggacaag agctgatggg cctcatcagg 4320
gtgacattca ccccagggca gcctgaccac tcttggcccc tcaggcatta tcccatttgg 4380
aatgtgaatg tggtggcaaa gtgggcagag gaccccacct gggaaccttt ttccctcagt 4440
tagtggggag actagcacct aggtacccac atgggtattt atatctgaac cagacagacg 4500
cttgaatcag gcactatgtt aagaaatata tttatttgct aatatattta tccacaaaaa 4560
aaaaaaaaaa aaa 4573
<210> 64
<211> 783
<212> PRT
<213> 智人
<400> 64
Met Ser Gly Gly His Gln Leu Gln Leu Ala Ala Leu Trp Pro Trp Leu
1 5 10 15
Leu Met Ala Thr Leu Gln Ala Gly Phe Gly Arg Thr Gly Leu Val Leu
20 25 30
Ala Ala Ala Val Glu Ser Glu Arg Ser Ala Glu Gln Lys Ala Ile Ile
35 40 45
Arg Val Ile Pro Leu Lys Met Asp Pro Thr Gly Lys Leu Asn Leu Thr
50 55 60
Leu Glu Gly Val Phe Ala Gly Val Ala Glu Ile Thr Pro Ala Glu Gly
65 70 75 80
Lys Leu Met Gln Ser His Pro Leu Tyr Leu Cys Asn Ala Ser Asp Asp
85 90 95
Asp Asn Leu Glu Pro Gly Phe Ile Ser Ile Val Lys Leu Glu Ser Pro
100 105 110
Arg Arg Ala Pro Arg Pro Cys Leu Ser Leu Ala Ser Lys Ala Arg Met
115 120 125
Ala Gly Glu Arg Gly Ala Ser Ala Val Leu Phe Asp Ile Thr Glu Asp
130 135 140
Arg Ala Ala Ala Glu Gln Leu Gln Gln Pro Leu Gly Leu Thr Trp Pro
145 150 155 160
Val Val Leu Ile Trp Gly Asn Asp Ala Glu Lys Leu Met Glu Phe Val
165 170 175
Tyr Lys Asn Gln Lys Ala His Val Arg Ile Glu Leu Lys Glu Pro Pro
180 185 190
Ala Trp Pro Asp Tyr Asp Val Trp Ile Leu Met Thr Val Val Gly Thr
195 200 205
Ile Phe Val Ile Ile Leu Ala Ser Val Leu Arg Ile Arg Cys Arg Pro
210 215 220
Arg His Ser Arg Pro Asp Pro Leu Gln Gln Arg Thr Ala Trp Ala Ile
225 230 235 240
Ser Gln Leu Ala Thr Arg Arg Tyr Gln Ala Ser Cys Arg Gln Ala Arg
245 250 255
Gly Glu Trp Pro Asp Ser Gly Ser Ser Cys Ser Ser Ala Pro Val Cys
260 265 270
Ala Ile Cys Leu Glu Glu Phe Ser Glu Gly Gln Glu Leu Arg Val Ile
275 280 285
Ser Cys Leu His Glu Phe His Arg Asn Cys Val Asp Pro Trp Leu His
290 295 300
Gln His Arg Thr Cys Pro Leu Cys Met Phe Asn Ile Thr Glu Gly Asp
305 310 315 320
Ser Phe Ser Gln Ser Leu Gly Pro Ser Arg Ser Tyr Gln Glu Pro Gly
325 330 335
Arg Arg Leu His Leu Ile Arg Gln His Pro Gly His Ala His Tyr His
340 345 350
Leu Pro Ala Ala Tyr Leu Leu Gly Pro Ser Arg Ser Ala Val Ala Arg
355 360 365
Pro Pro Arg Pro Gly Pro Phe Leu Pro Ser Gln Glu Pro Gly Met Gly
370 375 380
Pro Arg His His Arg Phe Pro Arg Ala Ala His Pro Arg Ala Pro Gly
385 390 395 400
Glu Gln Gln Arg Leu Ala Gly Ala Gln His Pro Tyr Ala Gln Gly Trp
405 410 415
Gly Leu Ser His Leu Gln Ser Thr Ser Gln His Pro Ala Ala Cys Pro
420 425 430
Val Pro Leu Arg Arg Ala Arg Pro Pro Asp Ser Ser Gly Ser Gly Glu
435 440 445
Ser Tyr Cys Thr Glu Arg Ser Gly Tyr Leu Ala Asp Gly Pro Ala Ser
450 455 460
Asp Ser Ser Ser Gly Pro Cys His Gly Ser Ser Ser Asp Ser Val Val
465 470 475 480
Asn Cys Thr Asp Ile Ser Leu Gln Gly Val His Gly Ser Ser Ser Thr
485 490 495
Phe Cys Ser Ser Leu Ser Ser Asp Phe Asp Pro Leu Val Tyr Cys Ser
500 505 510
Pro Lys Gly Asp Pro Gln Arg Val Asp Met Gln Pro Ser Val Thr Ser
515 520 525
Arg Pro Arg Ser Leu Asp Ser Val Val Pro Thr Gly Glu Thr Gln Val
530 535 540
Ser Ser His Val His Tyr His Arg His Arg His His His Tyr Lys Lys
545 550 555 560
Arg Phe Gln Trp His Gly Arg Lys Pro Gly Pro Glu Thr Gly Val Pro
565 570 575
Gln Ser Arg Pro Pro Ile Pro Arg Thr Gln Pro Gln Pro Glu Pro Pro
580 585 590
Ser Pro Asp Gln Gln Val Thr Arg Ser Asn Ser Ala Ala Pro Ser Gly
595 600 605
Arg Leu Ser Asn Pro Gln Cys Pro Arg Ala Leu Pro Glu Pro Ala Pro
610 615 620
Gly Pro Val Asp Ala Ser Ser Ile Cys Pro Ser Thr Ser Ser Leu Phe
625 630 635 640
Asn Leu Gln Lys Ser Ser Leu Ser Ala Arg His Pro Gln Arg Lys Arg
645 650 655
Arg Gly Gly Pro Ser Glu Pro Thr Pro Gly Ser Arg Pro Gln Asp Ala
660 665 670
Thr Val His Pro Ala Cys Gln Ile Phe Pro His Tyr Thr Pro Ser Val
675 680 685
Ala Tyr Pro Trp Ser Pro Glu Ala His Pro Leu Ile Cys Gly Pro Pro
690 695 700
Gly Leu Asp Lys Arg Leu Leu Pro Glu Thr Pro Gly Pro Cys Tyr Ser
705 710 715 720
Asn Ser Gln Pro Val Trp Leu Cys Leu Thr Pro Arg Gln Pro Leu Glu
725 730 735
Pro His Pro Pro Gly Glu Gly Pro Ser Glu Trp Ser Ser Asp Thr Ala
740 745 750
Glu Gly Arg Pro Cys Pro Tyr Pro His Cys Gln Val Leu Ser Ala Gln
755 760 765
Pro Gly Ser Glu Glu Glu Leu Glu Glu Leu Cys Glu Gln Ala Val
770 775 780
<210> 65
<211> 6857
<212> DNA
<213> 智人
<400> 65
gccccctccg agctccccga ctcctccccg cgctccacgg ctcttcccga ctccagtcag 60
cgttcctcgg gccctcggcg ccacaagctg tccgggcacg cagcccctag cggcgcgtcg 120
ctgccaagcc ggcctccgcg cgcctccctc cttccttctc ccctggctgt tcgcgatcca 180
gcttgggtag gcggggaagc agctggagtg cgaccgccac ggcagccacc ctgcaaccgc 240
cagtcggagg tgcagtccgt aggccctggc ccccgggtgg gcccttgggg agtcggcgcc 300
gctcccgagg agctgcaagg ctcgcccctg cccggcgtgg agggcgcggg gggcgcggag 360
gatattcttg gtgatcttgg aagtgtccgt atcatggaat caatctctat gatgggaagc 420
cctaagagcc ttagtgaaac tgttttacct aatggcataa atggtatcaa agatgcaagg 480
aaggtcactg taggtgtgat tggaagtgga gattttgcca aatccttgac cattcgactt 540
attagatgcg gctatcatgt ggtcatagga agtagaaatc ctaagtttgc ttctgaattt 600
tttcctcatg tggtagatgt cactcatcat gaagatgctc tcacaaaaac aaatataata 660
tttgttgcta tacacagaga acattatacc tccctgtggg acctgagaca tctgcttgtg 720
ggtaaaatcc tgattgatgt gagcaataac atgaggataa accagtaccc agaatccaat 780
gctgaatatt tggcttcatt attcccagat tctttgattg tcaaaggatt taatgttgtc 840
tcagcttggg cacttcagtt aggacctaag gatgccagcc ggcaggttta tatatgcagc 900
aacaatattc aagcgcgaca acaggttatt gaacttgccc gccagttgaa tttcattccc 960
attgacttgg gatccttatc atcagccaga gagattgaaa atttacccct acgactcttt 1020
actctctgga gagggccagt ggtggtagct ataagcttgg ccacattttt tttcctttat 1080
tcctttgtca gagatgtgat tcatccatat gctagaaacc aacagagtga cttttacaaa 1140
attcctatag agattgtgaa taaaacctta cctatagttg ccattacttt gctctcccta 1200
gtataccttg caggtcttct ggcagctgct tatcaacttt attacggcac caagtatagg 1260
agatttccac cttggttgga aacctggtta cagtgtagaa aacagcttgg attactaagt 1320
tttttcttcg ctatggtcca tgttgcctac agcctctgct taccgatgag aaggtcagag 1380
agatatttgt ttctcaacat ggcttatcag caggttcatg caaatattga aaactcttgg 1440
aatgaggaag aagtttggag aattgaaatg tatatctcct ttggcataat gagccttggc 1500
ttactttccc tcctggcagt cacttctatc ccttcagtga gcaatgcttt aaactggaga 1560
gaattcagtt ttattcagtc tacacttgga tatgtcgctc tgctcataag tactttccat 1620
gttttaattt atggatggaa acgagctttt gaggaagagt actacagatt ttatacacca 1680
ccaaactttg ttcttgctct tgttttgccc tcaattgtaa ttctgggtaa gattatttta 1740
ttccttccat gtataagcca aaagctaaaa cgaattaaaa aaggctggga aaagagccaa 1800
tttctggaag aaggtattgg aggaacaatt cctcatgtct ccccggagag ggtcacagta 1860
atgtgatgat aaatggtgtt cacagctgcc atataaagtt ctactcatgc cattattttt 1920
atgacttcta cgttcagtta caagtatgct gtcaaattat cgtgggttga aacttgttaa 1980
atgagatttc aactgactta gtgatagagt tttcttcaag ttaattttca caaatgtcat 2040
gtttgccaat atgaattttt ctagtcaaca tattattgta atttaggtat gttttgtttt 2100
gttttgcaca actgtaaccc tgttgttact ttatatttca taatcaggca aaaatactta 2160
cagttaataa tatagatata atgttaaaaa caatttgcaa accagcagaa ttttaagctt 2220
ttaaaataat tcaatggata tacatttttt tctgaagatt aagattttaa ttattcaact 2280
taaaaagtag aaatgcatta ttatacattt ttttaagaaa ggacacgtta tgttagcatc 2340
taggtaaggc tgcatgatag cattcctata tttctctcat aaaataggat ttgaaggatg 2400
aaattaattg tatgaagcaa tgtgattata tgaagagaca caaattaaaa agacaaatta 2460
aacctgaaat tatatttaaa atatatttga gacatgaaat acatactgat aatacatacc 2520
tcatgaaaga ttttattctt tattgtgtta cagagcagtt tcattttcat attaatatac 2580
tgatcaggaa gaggattcag taacatttgg cctccaaaac tgctatctct aatacggtac 2640
caatcctagg aactgtatac tagttcctac ttagaacaaa agtatcaagt ttgcacacaa 2700
gtaatctgcc agctgacctt tgtcgcacct taaccagtca ccacttgcta tggtatagga 2760
ttatactgat gttctttgag ggattctgat gtgctaggca tggttctaag tactttactt 2820
gtattatccc atttaatact tagaacaacc ccgtgagata agtagttatt atcctcattt 2880
tacacatgag ggaccgaagg atagaaaagt tatttttcaa aggtcatgca gttaataaat 2940
ggcagagtga gcattcaagt ccaggtagtc atattccaga ggccacggtt ttaaccacta 3000
ggctctagag ctcccgccgc gcccctatgc attatgttca caatgccaat ctagatgctt 3060
cctcttttgt ataaagtcac tgacattctt tagagtgggt tgggtgcatc caaaaatgta 3120
taaaaatatt attataataa acttattact gcttgtaggg taattcacag ttacttaccc 3180
tattcttgct tggaacatga gcctggagac ccatggcagt ccatatgcct ccctatgcag 3240
tgaagggccc tagcagtgtt aacaaattgc tgagatccca cggagtcttt caaaaatctc 3300
tgtagagtta gtcttctcct tttctcttcc tgagaagttc tcctgcctgc ataaccattc 3360
attagggagt actttacaag catgaaggat attagggtaa gtggctaatt ataaatctac 3420
tctagagaca tataatcata cagattattc ataaaatttt tcagtgctgt ccttccacat 3480
ttaattgcat tttgctcaaa ctgtagaatg ccctacattc cccccacccc aatttgctat 3540
ttccttatta aaatagaaaa ttataggcaa gatacaatta tatgcgttcc tcttcctgaa 3600
attataacat ttctaaactt acccacgtag gtactactga atccaactgc caacaataaa 3660
aagactttta tttagtagag gctacctttc ccaccagtga ctctttttct acaactgcct 3720
tgtcagtttg gtaattcact tatgattttc taatgttctc ttggtgaatt ttattatctt 3780
gtaccctctt tttttttttt ttttttttta aagacagagt cttgctctgt cacccaggct 3840
ggagtgcagt ggcacgatct cggctcactg caagctctgc ctcccgggtt cacgccattc 3900
tcctgcctca gcctcccgag tagctgggac tacaggtgcc cgccaccatg cccggctgat 3960
ttctttttgt atttttagta gagacggagt ttcaccgtgt tagccaggat ggtctcgatc 4020
tcctgacctc gtgatccgcc cgccttggcc tccaaagtgc tgggattaca ggtgtgagct 4080
accgcgcccg gcctattatc ttgtactttc taactgagcc ctctattttc tttattttaa 4140
taatatttct ccctacttga gaatcacttg ttagttcttg gtaggaattc agttgggcaa 4200
tgataacttt tatgggcaaa aacattctat tatagtgaac aaatgaaaat aacagcgtat 4260
tttcaatatt ttcttattcc ttaaattcca ctcttttaac actatgctta accacttaat 4320
gtgatgaaat attcctaaaa gttaaatgac tattaaagca tatattgttg catgtatata 4380
ttaagtagcc gatactctaa ataaaaatac cactgttaca gataaatggg gcctttaaaa 4440
atatgaaaaa caaacttgtg aaaatgtata aaagatgcat ctgttgtttc aaatggcact 4500
atcttctttt cagtactaca aaaacagaat aattttgaag ttttagaata aatgtaatat 4560
atttactata attctaaatg tttaaatgct tttctaaaaa tgcaaaacta tgatgtttag 4620
ttgctttatt ttacctctat gtgattattt ttcttaattg ttatttttta taatcattat 4680
ttttctgaac cattcttctg gcctcagaag taggactgaa ttctactatt gctaggtgtg 4740
agaaagtggt ggtgagaacc ttagagcagt ggagatttgc tacctggtct gtgttttgag 4800
aagtgcccct tagaaagtta aaagaatgta gaaaagatac tcagtcttaa tcctatgcaa 4860
aaaaaaaaat caagtaattg ttttcctatg aggaaaataa ccatgagctg tatcatgcta 4920
cttagctttt atgtaaatat ttcttatgtc tcctctatta agagtattta aaatcatatt 4980
taaatatgaa tctattcatg ctaacattat ttttcaaaac atacatggaa atttagccca 5040
gattgtctac atataaggtt tttatttgaa ttgtaaaata tttaaaagta tgaataaaat 5100
atatttatag gtatttatca gagatgatta ttttgtgcta catacaggtt ggctaatgag 5160
ctctagtgtt aaactacctg attaatttct tataaagcag cataaccttg gcttgattaa 5220
ggaattctac tttcaaaaat taatctgata atagtaacaa ggtatattat actttcatta 5280
caatcaaatt atagaaatta cttgtgtaaa agggcttcaa gaatatatcc aatttttaaa 5340
tattttaata tatctcctat ctgataactt aattcttcta aattaccact tgccattaag 5400
ctatttcata ataaattctg tacagtttcc ccccaaaaaa gagatttatt tatgaaatat 5460
ttaaagtttc taatgtggta ttttaaataa agtatcataa atgtaataag taaatattta 5520
tttaggaata ctgtgaacac tgaactaatt attcctgtgt cagtctatga aatccctgtt 5580
ttgaaatacg taaacagcct aaaatgtgtt gaaattattt tgtaaatcca tgacttaaaa 5640
caagatacat acatagtata acacacctca cagtgttaag atttatattg tgaaatgaga 5700
caccctacct tcaattgttc atcagtgggt aaaacaaatt ctgatgtaca ttcaggacaa 5760
atgattagcc ctaaatgaaa ctgtaataat ttcagtggaa actcaatctg tttttacctt 5820
taaacagtga attttacatg aatgaatggg ttcttcactt tttttttagt atgagaaaat 5880
tatacagtgc ttaattttca gagattcttt ccatatgtta ctaaaaaatg ttttgttcag 5940
cctaacatac tgagtttttt ttaactttct aaattattga atttccatca tgcattcatc 6000
caaaattaag gcagactgtt tggattcttc cagtggccag atgagctaaa ttaaatcaca 6060
aaagcagatg cttttgtatg atctccaaat tgccaacttt aaggaaatat tctcttgaaa 6120
ttgtctttaa agatcttttg cagctttgca gatacccaga ctgagctgga actggaattt 6180
gtcttcctat tgactctact tctttaaaag cggctgccca ttacattcct cagctgtcct 6240
tgcagttagg tgtacatgtg actgagtgtt ggccagtgag atgaagtctc ctcaaaggaa 6300
ggcagcatgt gtcctttttc atcccttcat cttgctgctg ggattgtgga tataacagga 6360
gccctggcag ctgtctccag aggatcaaag ccacacccaa agagtaaggc agattagaga 6420
ccagaaagac cttgactact tccctacttc cactgctttt tcctgcattt aagccattgt 6480
aaatctgggt gtgttacatg aagtgaaaat taattctttc tgcccttcag ttctttatcc 6540
tgataccatt taacactgtc tgaattaact agactgcaat aattctttct tttgaaagct 6600
tttaaaggat aatgtgcaat tcacattaaa attgattttc cattgtcaat tagttatact 6660
cattttcctg ccttgatctt tcattagata ttttgtatct gcttggaata tattatcttc 6720
tttttaactg tgtaattggt aattactaaa actctgtaat ctccaaaata ttgctatcaa 6780
attacacacc atgttttcta tcattctcat agatctgcct tataaacatt taaataaaaa 6840
gtactattta atgattt 6857
<210> 66
<211> 490
<212> PRT
<213> 智人
<400> 66
Met Glu Ser Ile Ser Met Met Gly Ser Pro Lys Ser Leu Ser Glu Thr
1 5 10 15
Val Leu Pro Asn Gly Ile Asn Gly Ile Lys Asp Ala Arg Lys Val Thr
20 25 30
Val Gly Val Ile Gly Ser Gly Asp Phe Ala Lys Ser Leu Thr Ile Arg
35 40 45
Leu Ile Arg Cys Gly Tyr His Val Val Ile Gly Ser Arg Asn Pro Lys
50 55 60
Phe Ala Ser Glu Phe Phe Pro His Val Val Asp Val Thr His His Glu
65 70 75 80
Asp Ala Leu Thr Lys Thr Asn Ile Ile Phe Val Ala Ile His Arg Glu
85 90 95
His Tyr Thr Ser Leu Trp Asp Leu Arg His Leu Leu Val Gly Lys Ile
100 105 110
Leu Ile Asp Val Ser Asn Asn Met Arg Ile Asn Gln Tyr Pro Glu Ser
115 120 125
Asn Ala Glu Tyr Leu Ala Ser Leu Phe Pro Asp Ser Leu Ile Val Lys
130 135 140
Gly Phe Asn Val Val Ser Ala Trp Ala Leu Gln Leu Gly Pro Lys Asp
145 150 155 160
Ala Ser Arg Gln Val Tyr Ile Cys Ser Asn Asn Ile Gln Ala Arg Gln
165 170 175
Gln Val Ile Glu Leu Ala Arg Gln Leu Asn Phe Ile Pro Ile Asp Leu
180 185 190
Gly Ser Leu Ser Ser Ala Arg Glu Ile Glu Asn Leu Pro Leu Arg Leu
195 200 205
Phe Thr Leu Trp Arg Gly Pro Val Val Val Ala Ile Ser Leu Ala Thr
210 215 220
Phe Phe Phe Leu Tyr Ser Phe Val Arg Asp Val Ile His Pro Tyr Ala
225 230 235 240
Arg Asn Gln Gln Ser Asp Phe Tyr Lys Ile Pro Ile Glu Ile Val Asn
245 250 255
Lys Thr Leu Pro Ile Val Ala Ile Thr Leu Leu Ser Leu Val Tyr Leu
260 265 270
Ala Gly Leu Leu Ala Ala Ala Tyr Gln Leu Tyr Tyr Gly Thr Lys Tyr
275 280 285
Arg Arg Phe Pro Pro Trp Leu Glu Thr Trp Leu Gln Cys Arg Lys Gln
290 295 300
Leu Gly Leu Leu Ser Phe Phe Phe Ala Met Val His Val Ala Tyr Ser
305 310 315 320
Leu Cys Leu Pro Met Arg Arg Ser Glu Arg Tyr Leu Phe Leu Asn Met
325 330 335
Ala Tyr Gln Gln Val His Ala Asn Ile Glu Asn Ser Trp Asn Glu Glu
340 345 350
Glu Val Trp Arg Ile Glu Met Tyr Ile Ser Phe Gly Ile Met Ser Leu
355 360 365
Gly Leu Leu Ser Leu Leu Ala Val Thr Ser Ile Pro Ser Val Ser Asn
370 375 380
Ala Leu Asn Trp Arg Glu Phe Ser Phe Ile Gln Ser Thr Leu Gly Tyr
385 390 395 400
Val Ala Leu Leu Ile Ser Thr Phe His Val Leu Ile Tyr Gly Trp Lys
405 410 415
Arg Ala Phe Glu Glu Glu Tyr Tyr Arg Phe Tyr Thr Pro Pro Asn Phe
420 425 430
Val Leu Ala Leu Val Leu Pro Ser Ile Val Ile Leu Gly Lys Ile Ile
435 440 445
Leu Phe Leu Pro Cys Ile Ser Gln Lys Leu Lys Arg Ile Lys Lys Gly
450 455 460
Trp Glu Lys Ser Gln Phe Leu Glu Glu Gly Ile Gly Gly Thr Ile Pro
465 470 475 480
His Val Ser Pro Glu Arg Val Thr Val Met
485 490
<210> 67
<211> 4103
<212> DNA
<213> 智人
<400> 67
aggctggaaa gtggaggatc cggtttgctc tgggcgggtc tggaagcaga gccggcggag 60
ggagcgccgg ggccctgggc tgcaggaggt tgcggcggcc gcggcagcat ggtggtgccg 120
gagaaggagc agagctggat ccccaagatc ttcaagaaga agacctgcac gacgttcata 180
gttgactcca cagatccggg agggaccttg tgccagtgtg ggcgcccccg gaccgcccac 240
cccgcagtgg ccatggagga tgccttcggg gcagccgtgg tgaccgtgtg ggacagcgat 300
gcacacacca cggagaagcc caccgatgcc tacggagagc tggacttcac gggggccggc 360
cgcaagcaca gcaatttcct ccggctctct gaccgaacgg atccagctgc agtttatagt 420
ctggtcacac gcacatgggg cttccgtgcc ccgaacctgg tggtgtcagt gctgggggga 480
tcggggggcc ccgtcctcca gacctggctg caggacctgc tgcgtcgtgg gctggtgcgg 540
gctgcccaga gcacaggagc ctggattgtc actgggggtc tgcacacggg catcggccgg 600
catgttggtg tggctgtacg ggaccatcag atggccagca ctgggggcac caaggtggtg 660
gccatgggtg tggccccctg gggtgtggtc cggaatagag acaccctcat caaccccaag 720
ggctcgttcc ctgcgaggta ccggtggcgc ggtgacccgg aggacggggt ccagtttccc 780
ctggactaca actactcggc cttcttcctg gtggacgacg gcacacacgg ctgcctgggg 840
ggcgagaacc gcttccgctt gcgcctggag tcctacatct cacagcagaa gacgggcgtg 900
ggagggactg gaattgacat ccctgtcctg ctcctcctga ttgatggtga tgagaagatg 960
ttgacgcgaa tagagaacgc cacccaggct cagctcccat gtctcctcgt ggctggctca 1020
gggggagctg cggactgcct ggcggagacc ctggaagaca ctctggcccc agggagtggg 1080
ggagccaggc aaggcgaagc ccgagatcga atcaggcgtt tctttcccaa aggggacctt 1140
gaggtcctgc aggcccaggt ggagaggatt atgacccgga aggagctcct gacagtctat 1200
tcttctgagg atgggtctga ggaattcgag accatagttt tgaaggccct tgtgaaggcc 1260
tgtgggagct cggaggcctc agcctacctg gatgagctgc gtttggctgt ggcttggaac 1320
cgcgtggaca ttgcccagag tgaactcttt cggggggaca tccaatggcg gtccttccat 1380
ctcgaagctt ccctcatgga cgccctgctg aatgaccggc ctgagttcgt gcgcttgctc 1440
atttcccacg gcctcagcct gggccacttc ctgaccccga tgcgcctggc ccaactctac 1500
agcgcggcgc cctccaactc gctcatccgc aaccttttgg accaggcgtc ccacagcgca 1560
ggcaccaaag ccccagccct aaaaggggga gctgcggagc tccggccccc tgacgtgggg 1620
catgtgctga ggatgctgct ggggaagatg tgcgcgccga ggtacccctc cgggggcgcc 1680
tgggaccctc acccaggcca gggcttcggg gagagcatgt atctgctctc ggacaaggcc 1740
acctcgccgc tctcgctgga tgctggcctc gggcaggccc cctggagcga cctgcttctt 1800
tgggcactgt tgctgaacag ggcacagatg gccatgtact tctgggagat gggttccaat 1860
gcagtttcct cagctcttgg ggcctgtttg ctgctccggg tgatggcacg cctggagcct 1920
gacgctgagg aggcagcacg gaggaaagac ctggcgttca agtttgaggg gatgggcgtt 1980
gacctctttg gcgagtgcta tcgcagcagt gaggtgaggg ctgcccgcct cctcctccgt 2040
cgctgcccgc tctgggggga tgccacttgc ctccagctgg ccatgcaagc tgacgcccgt 2100
gccttctttg cccaggatgg ggtacagtct ctgctgacac agaagtggtg gggagatatg 2160
gccagcacta cacccatctg ggccctggtt ctcgccttct tttgccctcc actcatctac 2220
acccgcctca tcaccttcag gaaatcagaa gaggagccca cacgggagga gctagagttt 2280
gacatggata gtgtcattaa tggggaaggg cctgtcggga cggcggaccc agccgagaag 2340
acgccgctgg gggtcccgcg ccagtcgggc cgtccgggtt gctgcggggg ccgctgcggg 2400
gggcgccggt gcctacgccg ctggttccac ttctggggcg cgccggtgac catcttcatg 2460
ggcaacgtgg tcagctacct gctgttcctg ctgcttttct cgcgggtgct gctcgtggat 2520
ttccagccgg cgccgcccgg ctccctggag ctgctgctct atttctgggc tttcacgctg 2580
ctgtgcgagg aactgcgcca gggcctgagc ggaggcgggg gcagcctcgc cagcgggggc 2640
cccgggcctg gccatgcctc actgagccag cgcctgcgcc tctacctcgc cgacagctgg 2700
aaccagtgcg acctagtggc tctcacctgc ttcctcctgg gcgtgggctg ccggctgacc 2760
ccgggtttgt accacctggg ccgcactgtc ctctgcatcg acttcatggt tttcacggtg 2820
cggctgcttc acatcttcac ggtcaacaaa cagctggggc ccaagatcgt catcgtgagc 2880
aagatgatga aggacgtgtt cttcttcctc ttcttcctcg gcgtgtggct ggtagcctat 2940
ggcgtggcca cggaggggct cctgaggcca cgggacagtg acttcccaag tatcctgcgc 3000
cgcgtcttct accgtcccta cctgcagatc ttcgggcaga ttccccagga ggacatggac 3060
gtggccctca tggagcacag caactgctcg tcggagcccg gcttctgggc acaccctcct 3120
ggggcccagg cgggcacctg cgtctcccag tatgccaact ggctggtggt gctgctcctc 3180
gtcatcttcc tgctcgtggc caacatcctg ctggtcaact tgctcattgc catgttcagt 3240
tacacattcg gcaaagtaca gggcaacagc gatctctact ggaaggcgca gcgttaccgc 3300
ctcatccggg aattccactc tcggcccgcg ctggccccgc cctttatcgt catctcccac 3360
ttgcgcctcc tgctcaggca attgtgcagg cgaccccgga gcccccagcc gtcctccccg 3420
gccctcgagc atttccgggt ttacctttct aaggaagccg agcggaagct gctaacgtgg 3480
gaatcggtgc ataaggagaa ctttctgctg gcacgcgcta gggacaagcg ggagagcgac 3540
tccgagcgtc tgaagcgcac gtcccagaag gtggacttgg cactgaaaca gctgggacac 3600
atccgcgagt acgaacagcg cctgaaagtg ctggagcggg aggtccagca gtgtagccgc 3660
gtcctggggt gggtggccga ggccctgagc cgctctgcct tgctgccccc aggtgggccg 3720
ccaccccctg acctgcctgg gtccaaagac tgagccctgc tggcggactt caaggagaag 3780
cccccacagg ggattttgct cctagagtaa ggctcatctg ggcctcggcc cccgcacctg 3840
gtggccttgt ccttgaggtg agccccatgt ccatctgggc cactgtcagg accacctttg 3900
ggagtgtcat ccttacaaac cacagcatgc ccggctcctc ccagaaccag tcccagcctg 3960
ggaggatcaa ggcctggatc ccgggccgtt atccatctgg aggctgcagg gtccttgggg 4020
taacagggac cacagacccc tcaccactca cagattcctc acactgggga aataaagcca 4080
tttcagagga atcgtgaaaa aaa 4103
<210> 68
<211> 1214
<212> PRT
<213> 智人
<400> 68
Met Val Val Pro Glu Lys Glu Gln Ser Trp Ile Pro Lys Ile Phe Lys
1 5 10 15
Lys Lys Thr Cys Thr Thr Phe Ile Val Asp Ser Thr Asp Pro Gly Gly
20 25 30
Thr Leu Cys Gln Cys Gly Arg Pro Arg Thr Ala His Pro Ala Val Ala
35 40 45
Met Glu Asp Ala Phe Gly Ala Ala Val Val Thr Val Trp Asp Ser Asp
50 55 60
Ala His Thr Thr Glu Lys Pro Thr Asp Ala Tyr Gly Glu Leu Asp Phe
65 70 75 80
Thr Gly Ala Gly Arg Lys His Ser Asn Phe Leu Arg Leu Ser Asp Arg
85 90 95
Thr Asp Pro Ala Ala Val Tyr Ser Leu Val Thr Arg Thr Trp Gly Phe
100 105 110
Arg Ala Pro Asn Leu Val Val Ser Val Leu Gly Gly Ser Gly Gly Pro
115 120 125
Val Leu Gln Thr Trp Leu Gln Asp Leu Leu Arg Arg Gly Leu Val Arg
130 135 140
Ala Ala Gln Ser Thr Gly Ala Trp Ile Val Thr Gly Gly Leu His Thr
145 150 155 160
Gly Ile Gly Arg His Val Gly Val Ala Val Arg Asp His Gln Met Ala
165 170 175
Ser Thr Gly Gly Thr Lys Val Val Ala Met Gly Val Ala Pro Trp Gly
180 185 190
Val Val Arg Asn Arg Asp Thr Leu Ile Asn Pro Lys Gly Ser Phe Pro
195 200 205
Ala Arg Tyr Arg Trp Arg Gly Asp Pro Glu Asp Gly Val Gln Phe Pro
210 215 220
Leu Asp Tyr Asn Tyr Ser Ala Phe Phe Leu Val Asp Asp Gly Thr His
225 230 235 240
Gly Cys Leu Gly Gly Glu Asn Arg Phe Arg Leu Arg Leu Glu Ser Tyr
245 250 255
Ile Ser Gln Gln Lys Thr Gly Val Gly Gly Thr Gly Ile Asp Ile Pro
260 265 270
Val Leu Leu Leu Leu Ile Asp Gly Asp Glu Lys Met Leu Thr Arg Ile
275 280 285
Glu Asn Ala Thr Gln Ala Gln Leu Pro Cys Leu Leu Val Ala Gly Ser
290 295 300
Gly Gly Ala Ala Asp Cys Leu Ala Glu Thr Leu Glu Asp Thr Leu Ala
305 310 315 320
Pro Gly Ser Gly Gly Ala Arg Gln Gly Glu Ala Arg Asp Arg Ile Arg
325 330 335
Arg Phe Phe Pro Lys Gly Asp Leu Glu Val Leu Gln Ala Gln Val Glu
340 345 350
Arg Ile Met Thr Arg Lys Glu Leu Leu Thr Val Tyr Ser Ser Glu Asp
355 360 365
Gly Ser Glu Glu Phe Glu Thr Ile Val Leu Lys Ala Leu Val Lys Ala
370 375 380
Cys Gly Ser Ser Glu Ala Ser Ala Tyr Leu Asp Glu Leu Arg Leu Ala
385 390 395 400
Val Ala Trp Asn Arg Val Asp Ile Ala Gln Ser Glu Leu Phe Arg Gly
405 410 415
Asp Ile Gln Trp Arg Ser Phe His Leu Glu Ala Ser Leu Met Asp Ala
420 425 430
Leu Leu Asn Asp Arg Pro Glu Phe Val Arg Leu Leu Ile Ser His Gly
435 440 445
Leu Ser Leu Gly His Phe Leu Thr Pro Met Arg Leu Ala Gln Leu Tyr
450 455 460
Ser Ala Ala Pro Ser Asn Ser Leu Ile Arg Asn Leu Leu Asp Gln Ala
465 470 475 480
Ser His Ser Ala Gly Thr Lys Ala Pro Ala Leu Lys Gly Gly Ala Ala
485 490 495
Glu Leu Arg Pro Pro Asp Val Gly His Val Leu Arg Met Leu Leu Gly
500 505 510
Lys Met Cys Ala Pro Arg Tyr Pro Ser Gly Gly Ala Trp Asp Pro His
515 520 525
Pro Gly Gln Gly Phe Gly Glu Ser Met Tyr Leu Leu Ser Asp Lys Ala
530 535 540
Thr Ser Pro Leu Ser Leu Asp Ala Gly Leu Gly Gln Ala Pro Trp Ser
545 550 555 560
Asp Leu Leu Leu Trp Ala Leu Leu Leu Asn Arg Ala Gln Met Ala Met
565 570 575
Tyr Phe Trp Glu Met Gly Ser Asn Ala Val Ser Ser Ala Leu Gly Ala
580 585 590
Cys Leu Leu Leu Arg Val Met Ala Arg Leu Glu Pro Asp Ala Glu Glu
595 600 605
Ala Ala Arg Arg Lys Asp Leu Ala Phe Lys Phe Glu Gly Met Gly Val
610 615 620
Asp Leu Phe Gly Glu Cys Tyr Arg Ser Ser Glu Val Arg Ala Ala Arg
625 630 635 640
Leu Leu Leu Arg Arg Cys Pro Leu Trp Gly Asp Ala Thr Cys Leu Gln
645 650 655
Leu Ala Met Gln Ala Asp Ala Arg Ala Phe Phe Ala Gln Asp Gly Val
660 665 670
Gln Ser Leu Leu Thr Gln Lys Trp Trp Gly Asp Met Ala Ser Thr Thr
675 680 685
Pro Ile Trp Ala Leu Val Leu Ala Phe Phe Cys Pro Pro Leu Ile Tyr
690 695 700
Thr Arg Leu Ile Thr Phe Arg Lys Ser Glu Glu Glu Pro Thr Arg Glu
705 710 715 720
Glu Leu Glu Phe Asp Met Asp Ser Val Ile Asn Gly Glu Gly Pro Val
725 730 735
Gly Thr Ala Asp Pro Ala Glu Lys Thr Pro Leu Gly Val Pro Arg Gln
740 745 750
Ser Gly Arg Pro Gly Cys Cys Gly Gly Arg Cys Gly Gly Arg Arg Cys
755 760 765
Leu Arg Arg Trp Phe His Phe Trp Gly Ala Pro Val Thr Ile Phe Met
770 775 780
Gly Asn Val Val Ser Tyr Leu Leu Phe Leu Leu Leu Phe Ser Arg Val
785 790 795 800
Leu Leu Val Asp Phe Gln Pro Ala Pro Pro Gly Ser Leu Glu Leu Leu
805 810 815
Leu Tyr Phe Trp Ala Phe Thr Leu Leu Cys Glu Glu Leu Arg Gln Gly
820 825 830
Leu Ser Gly Gly Gly Gly Ser Leu Ala Ser Gly Gly Pro Gly Pro Gly
835 840 845
His Ala Ser Leu Ser Gln Arg Leu Arg Leu Tyr Leu Ala Asp Ser Trp
850 855 860
Asn Gln Cys Asp Leu Val Ala Leu Thr Cys Phe Leu Leu Gly Val Gly
865 870 875 880
Cys Arg Leu Thr Pro Gly Leu Tyr His Leu Gly Arg Thr Val Leu Cys
885 890 895
Ile Asp Phe Met Val Phe Thr Val Arg Leu Leu His Ile Phe Thr Val
900 905 910
Asn Lys Gln Leu Gly Pro Lys Ile Val Ile Val Ser Lys Met Met Lys
915 920 925
Asp Val Phe Phe Phe Leu Phe Phe Leu Gly Val Trp Leu Val Ala Tyr
930 935 940
Gly Val Ala Thr Glu Gly Leu Leu Arg Pro Arg Asp Ser Asp Phe Pro
945 950 955 960
Ser Ile Leu Arg Arg Val Phe Tyr Arg Pro Tyr Leu Gln Ile Phe Gly
965 970 975
Gln Ile Pro Gln Glu Asp Met Asp Val Ala Leu Met Glu His Ser Asn
980 985 990
Cys Ser Ser Glu Pro Gly Phe Trp Ala His Pro Pro Gly Ala Gln Ala
995 1000 1005
Gly Thr Cys Val Ser Gln Tyr Ala Asn Trp Leu Val Val Leu Leu
1010 1015 1020
Leu Val Ile Phe Leu Leu Val Ala Asn Ile Leu Leu Val Asn Leu
1025 1030 1035
Leu Ile Ala Met Phe Ser Tyr Thr Phe Gly Lys Val Gln Gly Asn
1040 1045 1050
Ser Asp Leu Tyr Trp Lys Ala Gln Arg Tyr Arg Leu Ile Arg Glu
1055 1060 1065
Phe His Ser Arg Pro Ala Leu Ala Pro Pro Phe Ile Val Ile Ser
1070 1075 1080
His Leu Arg Leu Leu Leu Arg Gln Leu Cys Arg Arg Pro Arg Ser
1085 1090 1095
Pro Gln Pro Ser Ser Pro Ala Leu Glu His Phe Arg Val Tyr Leu
1100 1105 1110
Ser Lys Glu Ala Glu Arg Lys Leu Leu Thr Trp Glu Ser Val His
1115 1120 1125
Lys Glu Asn Phe Leu Leu Ala Arg Ala Arg Asp Lys Arg Glu Ser
1130 1135 1140
Asp Ser Glu Arg Leu Lys Arg Thr Ser Gln Lys Val Asp Leu Ala
1145 1150 1155
Leu Lys Gln Leu Gly His Ile Arg Glu Tyr Glu Gln Arg Leu Lys
1160 1165 1170
Val Leu Glu Arg Glu Val Gln Gln Cys Ser Arg Val Leu Gly Trp
1175 1180 1185
Val Ala Glu Ala Leu Ser Arg Ser Ala Leu Leu Pro Pro Gly Gly
1190 1195 1200
Pro Pro Pro Pro Asp Leu Pro Gly Ser Lys Asp
1205 1210
<210> 69
<211> 2174
<212> DNA
<213> 智人
<400> 69
aatgatagag atattagggc tagttaacca cagttttaca agactcctct tcccgcgtgt 60
gggccattgt catgctggtg ggcgtcccgc ccacctgaaa ggtctccccg ccccgactgg 120
ggtttgttgt tgaagaagga gaatccccgg aaaggctgag tctccagctc aaggtcaaaa 180
cgtccaaggc cgaaagccct ccagtttccc ctggacgcct tgctcctgct tctgctacga 240
ccttctgggg aaaacgaatt tctcattttc ttcttaaatt gccattttcg ctttaggaga 300
tgaatgtttt cctttggctg ttttggcaat gactctgaat taaagcgatg ctaacgcctc 360
ttttccccct aattgttaaa agctatggac tgcaggaaga tggcccgctt ctcttacagt 420
gtgatttgga tcatggccat ttctaaagtc tttgaactgg gattagttgc cgggctgggc 480
catcaggaat ttgctcgtcc atctcgggga tacctggcct tcagagatga cagcatttgg 540
ccccaggagg agcctgcaat tcggcctcgg tcttcccagc gtgtgccgcc catggggata 600
cagcacagta aggagctaaa cagaacctgc tgcctgaatg ggggaacctg catgctgggg 660
tccttttgtg cctgccctcc ctccttctac ggacggaact gtgagcacga tgtgcgcaaa 720
gagaactgtg ggtctgtgcc ccatgacacc tggctgccca agaagtgttc cctgtgtaaa 780
tgctggcacg gtcagctccg ctgctttcct caggcatttc tacccggctg tgatggcctt 840
gtgatggatg agcacctcgt ggcttccagg actccagaac taccaccgtc tgcacgtact 900
accactttta tgctagttgg catctgcctt tctatacaaa gctactatta atcgacattg 960
acctatttcc agaaatacaa ttttagatat catgcaaatt tcatgaccag taaaggctgc 1020
tgctacaatg tcctaactga aagatgatca tttgtagttg ccttaaaata atgaatacat 1080
ttccaaaatg gtctctaaca tttccttaca gaactacttc ttacttcttt gccctgccct 1140
ctcccaaaaa actacttctt ttttcaaaag aaagtcagcc atatctccat tgtgcctaag 1200
tccagtgttt cttttttttt ttttttttga gacggagtct cactctgtca cccaggctgg 1260
actgcaatga cgcgatcttg gttcactgca acctccgcat ccggggttca agccattctc 1320
ctgcctcagc ctcccaagta actgggatta caggcatgtg tcaccatgcc cagctaattt 1380
ttttgtattt ttagtagaga tgggggtttc accatattgg ccagtctggt ctcgaactcc 1440
tgaccttgtg atccactcgc ctcagcctct cgaagtgctg agattacaca cgtgagcaac 1500
tgtgcaaggc ctggtgtttc ttgatacatg taattctacc aaggtcttct taatatgttc 1560
ttttaaatga ttgaattata tgttcagatt attggagact aattctaatg tggaccttag 1620
aatacagttt tgagtagagt tgatcaaaat caattaaaat agtctcttta aaaggaaaga 1680
aaacatcttt aaggggagga accagagtgc tgaaggaatg gaagtccatc tgcgtgtgtg 1740
cagggagact gggtaggaaa gaggaagcaa atagaagaga gaggttgaaa aacaaaatgg 1800
gttacttgat tggtgattag gtggtggtag agaagcaagt aaaaaggcta aatggaaggg 1860
caagtttcca tcatctatag aaagctatat aagacaagaa ctcccctttt tttcccaaag 1920
gcattataaa aagaatgaag cctccttaga aaaaaaatta tacctcaatg tccccaacaa 1980
gattgcttaa taaattgtgt ttcctccaag ctattcaatt cttttaactg ttgtagaaga 2040
caaaatgttc acaatatatt tagttgtaaa ccaagtgatc aaactacata ttgtaaagcc 2100
catttttaaa atacattgta tatatgtgta tgcacagtaa aaatggaaac tatattgacc 2160
taaaaaaaaa aaaa 2174
<210> 70
<211> 188
<212> PRT
<213> 智人
<400> 70
Met Asp Cys Arg Lys Met Ala Arg Phe Ser Tyr Ser Val Ile Trp Ile
1 5 10 15
Met Ala Ile Ser Lys Val Phe Glu Leu Gly Leu Val Ala Gly Leu Gly
20 25 30
His Gln Glu Phe Ala Arg Pro Ser Arg Gly Tyr Leu Ala Phe Arg Asp
35 40 45
Asp Ser Ile Trp Pro Gln Glu Glu Pro Ala Ile Arg Pro Arg Ser Ser
50 55 60
Gln Arg Val Pro Pro Met Gly Ile Gln His Ser Lys Glu Leu Asn Arg
65 70 75 80
Thr Cys Cys Leu Asn Gly Gly Thr Cys Met Leu Gly Ser Phe Cys Ala
85 90 95
Cys Pro Pro Ser Phe Tyr Gly Arg Asn Cys Glu His Asp Val Arg Lys
100 105 110
Glu Asn Cys Gly Ser Val Pro His Asp Thr Trp Leu Pro Lys Lys Cys
115 120 125
Ser Leu Cys Lys Cys Trp His Gly Gln Leu Arg Cys Phe Pro Gln Ala
130 135 140
Phe Leu Pro Gly Cys Asp Gly Leu Val Met Asp Glu His Leu Val Ala
145 150 155 160
Ser Arg Thr Pro Glu Leu Pro Pro Ser Ala Arg Thr Thr Thr Phe Met
165 170 175
Leu Val Gly Ile Cys Leu Ser Ile Gln Ser Tyr Tyr
180 185
<210> 71
<211> 3934
<212> DNA
<213> 智人
<400> 71
gccctcccag agctgccgga cgctcgcggg tctcggaacg catcccgccg cgggggcttc 60
ggccgtggca tgggcgccgc gggcctgctc ggggttttct tggctctcgt cgcaccgggg 120
gtcctcggga tttcttgtgg ctctcctccg cctatcctaa atggccggat tagttattat 180
tctaccccca ttgctgttgg taccgtgata aggtacagtt gttcaggtac cttccgcctc 240
attggagaaa aaagtctatt atgcataact aaagacaaag tggatggaac ctgggataaa 300
cctgctccta aatgtgaata tttcaataaa tattcttctt gccctgagcc catagtacca 360
ggaggataca aaattagagg ctctacaccc tacagacatg gtgattctgt gacatttgcc 420
tgtaaaacca acttctccat gaacggaaac aagtctgttt ggtgtcaagc aaataatatg 480
tgggggccga cacgactacc aacctgtgta agtgttttcc ctctcgagtg tccagcactt 540
cctatgatcc acaatggaca tcacacaagt gagaatgttg gctccattgc tccaggattg 600
tctgtgactt acagctgtga atctggttac ttgcttgttg gagaaaagat cattaactgt 660
ttgtcttcgg gaaaatggag tgctgtcccc cccacatgtg aagaggcacg ctgtaaatct 720
ctaggacgat ttcccaatgg gaaggtaaag gagcctccaa ttctccgggt tggtgtaact 780
gcaaactttt tctgtgatga agggtatcga ctgcaaggcc caccttctag tcggtgtgta 840
attgctggac agggagttgc ttggaccaaa atgccagtat gtgaagaaat tttttgccca 900
tcacctcccc ctattctcaa tggaagacat ataggcaact cactagcaaa tgtctcatat 960
ggaagcatag tcacttacac ttgtgacccg gacccagagg aaggagtgaa cttcatcctt 1020
attggagaga gcactctccg ttgtacagtt gatagtcaga agactgggac ctggagtggc 1080
cctgccccac gctgtgaact ttctacttct gcggttcagt gtccacatcc ccagatccta 1140
agaggccgaa tggtatctgg gcagaaagat cgatatacct ataacgacac tgtgatattt 1200
gcttgcatgt ttggcttcac cttgaagggc agcaagcaaa tccgatgcaa tgcccaaggc 1260
acatgggagc catctgcacc agtctgtgaa aaggaatgcc aggcccctcc taacatcctc 1320
aatgggcaaa aggaagatag acacatggtc cgctttgacc ctggaacatc tataaaatat 1380
agctgtaacc ctggctatgt gctggtggga gaagaatcca tacagtgtac ctctgagggg 1440
gtgtggacac cccctgtacc ccaatgcaaa gtggcagcgt gtgaagctac aggaaggcaa 1500
ctcttgacaa aaccccagca ccaatttgtt agaccagatg tcaactcttc ttgtggtgaa 1560
gggtacaagt taagtgggag tgtttatcag gagtgtcaag gcacaattcc ttggtttatg 1620
gagattcgtc tttgtaaaga aatcacctgc ccaccacccc ctgttatcta caatggggca 1680
cacaccggga gttccttaga agattttcca tatggaacca cggtcactta cacatgtaac 1740
cctgggccag aaagaggagt ggaattcagc ctcattggag agagcaccat ccgttgtaca 1800
agcaatgatc aagaaagagg cacctggagt ggccctgctc ccctatgtaa actttccctc 1860
cttgctgtcc agtgctcaca tgtccatatt gcaaatggat acaagatatc tggcaaggaa 1920
gccccatatt tctacaatga cactgtgaca ttcaagtgtt atagtggatt tactttgaag 1980
ggcagtagtc agattcgttg caaagctgat aacacctggg atcctgaaat accagtttgt 2040
gaaaaagaaa catgccagca tgtgagacag agtcttcaag aacttccagc tggttcacgt 2100
gtggagctag ttaatacgtc ctgccaagat gggtaccagt tgactggaca tgcttatcag 2160
atgtgtcaag atgctgaaaa tggaatttgg ttcaaaaaga ttccactttg taaagttatt 2220
cactgtcacc ctccaccagt gattgtcaat gggaagcaca cagggatgat ggcagaaaac 2280
tttctatatg gaaatgaagt ctcttatgaa tgtgaccaag gattctatct cctgggagag 2340
aaaaaattgc agtgcagaag tgattctaaa ggacatggat cttggagcgg gccttcccca 2400
cagtgcttac gatctcctcc tgtgactcgc tgccctaatc cagaagtcaa acatgggtac 2460
aagctcaata aaacacattc tgcatattcc cacaatgaca tagtgtatgt tgactgcaat 2520
cctggcttca tcatgaatgg tagtcgcgtg attaggtgtc atactgataa cacatgggtg 2580
ccaggtgtgc caacttgtat gaaaaaagcc ttcatagggt gtccacctcc gcctaagacc 2640
cctaacggga accatactgg tggaaacata gctcgatttt ctcctggaat gtcaatcctg 2700
tacagctgtg accaaggcta cctgctggtg ggagaggcac tccttctttg cacacatgag 2760
ggaacctgga gccaacctgc ccctcattgt aaagaggtaa actgtagctc accagcagat 2820
atggatggaa tccagaaagg gctggaacca aggaaaatgt atcagtatgg agctgttgta 2880
actctggagt gtgaagatgg gtatatgctg gaaggcagtc cccagagcca gtgccaatcg 2940
gatcaccaat ggaaccctcc cctggcggtt tgcagatccc gttcacttgc tcctgtcctt 3000
tgtggtattg ctgcaggttt gatacttctt accttcttga ttgtcattac cttatacgtg 3060
atatcaaaac acagagaacg caattattat acagatacaa gccagaaaga agcttttcat 3120
ttagaagcac gagaagtata ttctgttgat ccatacaacc cagccagctg atcagaagac 3180
aaactggtgt gtgcctcatt gcttggaatt cagcggaata ttgattagaa agaaactgct 3240
ctaatatcag caagtctctt tatatggcct caagatcaat gaaatgatgt cataagcgat 3300
cacttcctat atgcacttat tctcaagaag aacatcttta tggtaaagat gggagcccag 3360
tttcactgcc atatactctt caaggacttt ctgaagcctc acttatgaga tgcctgaagc 3420
caggccatgg ctataaacaa ttacatggct ctaaaaagtt ttgccctttt taaggaaggc 3480
actaaaaaga gctgtcctgg tatctagacc catcttcttt ttgaaatcag catactcaat 3540
gttactatct gcttttggtt ataatgtgtt tttaattatc taaagtatga agcattttct 3600
ggggttatga tggccttacc tttattagga agtatggttt tattttgata gtagcttcct 3660
cctctggtgg tgttaatcat ttcattttta cccttactgt ttgagtttct ctcacattac 3720
tgtatatact ttgcctttcc ataatcactc agtgattgca atttgcacaa gtttttttaa 3780
attatgggaa tcaagattta atcctagaga tttggtgtac aattcaggct ttggatgttt 3840
ctttagcagt tttgtgataa gttctagttg cttgtaaaat ttcacttaat aatgtgtaca 3900
ttagtcattc aataaattgt aattgtaaag aaaa 3934
<210> 72
<211> 1033
<212> PRT
<213> 智人
<400> 72
Met Gly Ala Ala Gly Leu Leu Gly Val Phe Leu Ala Leu Val Ala Pro
1 5 10 15
Gly Val Leu Gly Ile Ser Cys Gly Ser Pro Pro Pro Ile Leu Asn Gly
20 25 30
Arg Ile Ser Tyr Tyr Ser Thr Pro Ile Ala Val Gly Thr Val Ile Arg
35 40 45
Tyr Ser Cys Ser Gly Thr Phe Arg Leu Ile Gly Glu Lys Ser Leu Leu
50 55 60
Cys Ile Thr Lys Asp Lys Val Asp Gly Thr Trp Asp Lys Pro Ala Pro
65 70 75 80
Lys Cys Glu Tyr Phe Asn Lys Tyr Ser Ser Cys Pro Glu Pro Ile Val
85 90 95
Pro Gly Gly Tyr Lys Ile Arg Gly Ser Thr Pro Tyr Arg His Gly Asp
100 105 110
Ser Val Thr Phe Ala Cys Lys Thr Asn Phe Ser Met Asn Gly Asn Lys
115 120 125
Ser Val Trp Cys Gln Ala Asn Asn Met Trp Gly Pro Thr Arg Leu Pro
130 135 140
Thr Cys Val Ser Val Phe Pro Leu Glu Cys Pro Ala Leu Pro Met Ile
145 150 155 160
His Asn Gly His His Thr Ser Glu Asn Val Gly Ser Ile Ala Pro Gly
165 170 175
Leu Ser Val Thr Tyr Ser Cys Glu Ser Gly Tyr Leu Leu Val Gly Glu
180 185 190
Lys Ile Ile Asn Cys Leu Ser Ser Gly Lys Trp Ser Ala Val Pro Pro
195 200 205
Thr Cys Glu Glu Ala Arg Cys Lys Ser Leu Gly Arg Phe Pro Asn Gly
210 215 220
Lys Val Lys Glu Pro Pro Ile Leu Arg Val Gly Val Thr Ala Asn Phe
225 230 235 240
Phe Cys Asp Glu Gly Tyr Arg Leu Gln Gly Pro Pro Ser Ser Arg Cys
245 250 255
Val Ile Ala Gly Gln Gly Val Ala Trp Thr Lys Met Pro Val Cys Glu
260 265 270
Glu Ile Phe Cys Pro Ser Pro Pro Pro Ile Leu Asn Gly Arg His Ile
275 280 285
Gly Asn Ser Leu Ala Asn Val Ser Tyr Gly Ser Ile Val Thr Tyr Thr
290 295 300
Cys Asp Pro Asp Pro Glu Glu Gly Val Asn Phe Ile Leu Ile Gly Glu
305 310 315 320
Ser Thr Leu Arg Cys Thr Val Asp Ser Gln Lys Thr Gly Thr Trp Ser
325 330 335
Gly Pro Ala Pro Arg Cys Glu Leu Ser Thr Ser Ala Val Gln Cys Pro
340 345 350
His Pro Gln Ile Leu Arg Gly Arg Met Val Ser Gly Gln Lys Asp Arg
355 360 365
Tyr Thr Tyr Asn Asp Thr Val Ile Phe Ala Cys Met Phe Gly Phe Thr
370 375 380
Leu Lys Gly Ser Lys Gln Ile Arg Cys Asn Ala Gln Gly Thr Trp Glu
385 390 395 400
Pro Ser Ala Pro Val Cys Glu Lys Glu Cys Gln Ala Pro Pro Asn Ile
405 410 415
Leu Asn Gly Gln Lys Glu Asp Arg His Met Val Arg Phe Asp Pro Gly
420 425 430
Thr Ser Ile Lys Tyr Ser Cys Asn Pro Gly Tyr Val Leu Val Gly Glu
435 440 445
Glu Ser Ile Gln Cys Thr Ser Glu Gly Val Trp Thr Pro Pro Val Pro
450 455 460
Gln Cys Lys Val Ala Ala Cys Glu Ala Thr Gly Arg Gln Leu Leu Thr
465 470 475 480
Lys Pro Gln His Gln Phe Val Arg Pro Asp Val Asn Ser Ser Cys Gly
485 490 495
Glu Gly Tyr Lys Leu Ser Gly Ser Val Tyr Gln Glu Cys Gln Gly Thr
500 505 510
Ile Pro Trp Phe Met Glu Ile Arg Leu Cys Lys Glu Ile Thr Cys Pro
515 520 525
Pro Pro Pro Val Ile Tyr Asn Gly Ala His Thr Gly Ser Ser Leu Glu
530 535 540
Asp Phe Pro Tyr Gly Thr Thr Val Thr Tyr Thr Cys Asn Pro Gly Pro
545 550 555 560
Glu Arg Gly Val Glu Phe Ser Leu Ile Gly Glu Ser Thr Ile Arg Cys
565 570 575
Thr Ser Asn Asp Gln Glu Arg Gly Thr Trp Ser Gly Pro Ala Pro Leu
580 585 590
Cys Lys Leu Ser Leu Leu Ala Val Gln Cys Ser His Val His Ile Ala
595 600 605
Asn Gly Tyr Lys Ile Ser Gly Lys Glu Ala Pro Tyr Phe Tyr Asn Asp
610 615 620
Thr Val Thr Phe Lys Cys Tyr Ser Gly Phe Thr Leu Lys Gly Ser Ser
625 630 635 640
Gln Ile Arg Cys Lys Ala Asp Asn Thr Trp Asp Pro Glu Ile Pro Val
645 650 655
Cys Glu Lys Glu Thr Cys Gln His Val Arg Gln Ser Leu Gln Glu Leu
660 665 670
Pro Ala Gly Ser Arg Val Glu Leu Val Asn Thr Ser Cys Gln Asp Gly
675 680 685
Tyr Gln Leu Thr Gly His Ala Tyr Gln Met Cys Gln Asp Ala Glu Asn
690 695 700
Gly Ile Trp Phe Lys Lys Ile Pro Leu Cys Lys Val Ile His Cys His
705 710 715 720
Pro Pro Pro Val Ile Val Asn Gly Lys His Thr Gly Met Met Ala Glu
725 730 735
Asn Phe Leu Tyr Gly Asn Glu Val Ser Tyr Glu Cys Asp Gln Gly Phe
740 745 750
Tyr Leu Leu Gly Glu Lys Lys Leu Gln Cys Arg Ser Asp Ser Lys Gly
755 760 765
His Gly Ser Trp Ser Gly Pro Ser Pro Gln Cys Leu Arg Ser Pro Pro
770 775 780
Val Thr Arg Cys Pro Asn Pro Glu Val Lys His Gly Tyr Lys Leu Asn
785 790 795 800
Lys Thr His Ser Ala Tyr Ser His Asn Asp Ile Val Tyr Val Asp Cys
805 810 815
Asn Pro Gly Phe Ile Met Asn Gly Ser Arg Val Ile Arg Cys His Thr
820 825 830
Asp Asn Thr Trp Val Pro Gly Val Pro Thr Cys Met Lys Lys Ala Phe
835 840 845
Ile Gly Cys Pro Pro Pro Pro Lys Thr Pro Asn Gly Asn His Thr Gly
850 855 860
Gly Asn Ile Ala Arg Phe Ser Pro Gly Met Ser Ile Leu Tyr Ser Cys
865 870 875 880
Asp Gln Gly Tyr Leu Leu Val Gly Glu Ala Leu Leu Leu Cys Thr His
885 890 895
Glu Gly Thr Trp Ser Gln Pro Ala Pro His Cys Lys Glu Val Asn Cys
900 905 910
Ser Ser Pro Ala Asp Met Asp Gly Ile Gln Lys Gly Leu Glu Pro Arg
915 920 925
Lys Met Tyr Gln Tyr Gly Ala Val Val Thr Leu Glu Cys Glu Asp Gly
930 935 940
Tyr Met Leu Glu Gly Ser Pro Gln Ser Gln Cys Gln Ser Asp His Gln
945 950 955 960
Trp Asn Pro Pro Leu Ala Val Cys Arg Ser Arg Ser Leu Ala Pro Val
965 970 975
Leu Cys Gly Ile Ala Ala Gly Leu Ile Leu Leu Thr Phe Leu Ile Val
980 985 990
Ile Thr Leu Tyr Val Ile Ser Lys His Arg Glu Arg Asn Tyr Tyr Thr
995 1000 1005
Asp Thr Ser Gln Lys Glu Ala Phe His Leu Glu Ala Arg Glu Val
1010 1015 1020
Tyr Ser Val Asp Pro Tyr Asn Pro Ala Ser
1025 1030
<210> 73
<211> 1300
<212> DNA
<213> 智人
<400> 73
ctgcagccgg tgcagttaca cgttttcctc caaggagcct cggacgttgt cacgggtttg 60
gggtcgggga cagagcggtg accatggcca ggctggcgtt gtctcctgtg cccagccact 120
ggatggtggc gttgctgctg ctgctctcag ctgagccagt accagcagcc agatcggagg 180
accggtaccg gaatcccaaa ggtagtgctt gttcgcggat ctggcagagc ccacgtttca 240
tagccaggaa acggggcttc acggtgaaaa tgcactgcta catgaacagc gcctccggca 300
atgtgagctg gctctggaag caggagatgg acgagaatcc ccagcagctg aagctggaaa 360
agggccgcat ggaagagtcc cagaacgaat ctctcgccac cctcaccatc caaggcatcc 420
ggtttgagga caatggcatc tacttctgtc agcagaagtg caacaacacc tcggaggtct 480
accagggctg cggcacagag ctgcgagtca tgggattcag caccttggca cagctgaagc 540
agaggaacac gctgaaggat ggtatcatca tgatccagac gctgctgatc atcctcttca 600
tcatcgtgcc tatcttcctg ctgctggaca aggatgacag caaggctggc atggaggaag 660
atcacaccta cgagggcctg gacattgacc agacagccac ctatgaggac atagtgacgc 720
tgcggacagg ggaagtgaag tggtctgtag gtgagcaccc aggccaggag tgagagccag 780
gtcgccccat gacctgggtg caggctccct ggcctcagtg actgcttcgg agctgcctgg 840
ctcatggccc aacccctttc ctggaccccc cagctggcct ctgaagctgg cccaccagag 900
ctgccatttg tctccagccc ctggtcccca gctcttgcca aagggcctgg agtagaagga 960
caacagggca gcaacttgga gggagttctc tggggatgga cgggacccag ccttctgggg 1020
gtgctatgag gtgatccgtc cccacacatg ggatggggga ggcagagact ggtccagagc 1080
ccgcaaatgg actcggagcc gagggcctcc cagcagagct tgggaagggc catggaccca 1140
actgggcccc agaagagcca caggaacatc attcctctcc cgcaaccact cccaccccag 1200
ggaggccctg gcctccagtg ccttcccccg tggaataaac ggtgtgtcct gagaaaccac 1260
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1300
<210> 74
<211> 229
<212> PRT
<213> 智人
<400> 74
Met Ala Arg Leu Ala Leu Ser Pro Val Pro Ser His Trp Met Val Ala
1 5 10 15
Leu Leu Leu Leu Leu Ser Ala Glu Pro Val Pro Ala Ala Arg Ser Glu
20 25 30
Asp Arg Tyr Arg Asn Pro Lys Gly Ser Ala Cys Ser Arg Ile Trp Gln
35 40 45
Ser Pro Arg Phe Ile Ala Arg Lys Arg Gly Phe Thr Val Lys Met His
50 55 60
Cys Tyr Met Asn Ser Ala Ser Gly Asn Val Ser Trp Leu Trp Lys Gln
65 70 75 80
Glu Met Asp Glu Asn Pro Gln Gln Leu Lys Leu Glu Lys Gly Arg Met
85 90 95
Glu Glu Ser Gln Asn Glu Ser Leu Ala Thr Leu Thr Ile Gln Gly Ile
100 105 110
Arg Phe Glu Asp Asn Gly Ile Tyr Phe Cys Gln Gln Lys Cys Asn Asn
115 120 125
Thr Ser Glu Val Tyr Gln Gly Cys Gly Thr Glu Leu Arg Val Met Gly
130 135 140
Phe Ser Thr Leu Ala Gln Leu Lys Gln Arg Asn Thr Leu Lys Asp Gly
145 150 155 160
Ile Ile Met Ile Gln Thr Leu Leu Ile Ile Leu Phe Ile Ile Val Pro
165 170 175
Ile Phe Leu Leu Leu Asp Lys Asp Asp Ser Lys Ala Gly Met Glu Glu
180 185 190
Asp His Thr Tyr Glu Gly Leu Asp Ile Asp Gln Thr Ala Thr Tyr Glu
195 200 205
Asp Ile Val Thr Leu Arg Thr Gly Glu Val Lys Trp Ser Val Gly Glu
210 215 220
His Pro Gly Gln Glu
225
<210> 75
<211> 2592
<212> DNA
<213> 智人
<400> 75
agagtctcca gttggtgacc aagagtacat ctcttttcaa atagctggat taggtcctca 60
tgctgctgtg gtcattgctg gtcatctttg atgcagtcac tgaacaggca gattcgctga 120
cccttgtggc gccctcttct gtcttcgaag gagacagcat cgttctgaaa tgccagggag 180
aacagaactg gaaaattcag aagatggctt accataagga taacaaagag ttatctgttt 240
tcaaaaaatt ctcagatttc cttatccaaa gtgcagtttt aagtgacagt ggtaactatt 300
tctgtagtac caaaggacaa ctctttctct gggataaaac ttcaaatata gtaaagataa 360
aagtccaaga gctctttcaa cgtcctgtgc tgactgccag ctccttccag cccatcgaag 420
ggggtccagt gagcctgaaa tgtgagaccc ggctctctcc acagaggttg gatgttcaac 480
tccagttctg cttcttcaga gaaaaccagg tcctggggtc aggctggagc agctctccgg 540
agctccagat ttctgccgtg tggagtgaag acacagggtc ttactggtgc aaggcagaaa 600
cggtgactca caggatcaga aaacagagcc tccaatccca gattcacgtg cagagaatcc 660
ccatctctaa tgtaagcttg gagatccggg cccccggggg acaggtgact gaaggacaaa 720
aactgatcct gctctgctca gtggctgggg gtacaggaaa tgtcacattc tcctggtaca 780
gagaggccac aggaaccagt atgggaaaga aaacccagcg ttccctgtca gcagagctgg 840
agatcccagc tgtgaaagag agtgatgccg gcaaatatta ctgtagagct gacaacggcc 900
atgtgcctat ccagagcaag gtggtgaata tccctgtgag aattccagtg tctcgccctg 960
tcctcaccct caggtctcct ggggcccagg ctgcagtggg ggacctgctg gagcttcact 1020
gtgaggccct gagaggctct cccccaatct tgtaccaatt ttatcatgag gatgtcaccc 1080
ttgggaacag ctcggccccc tctggaggag gggcctcctt caacctctct ttgactgcag 1140
aacattctgg aaactactcc tgtgaggcca acaacggcct gggggcccag tgcagtgagg 1200
cagtgccagt ctccatctca ggacctgatg gctatagaag agacctcatg acagctggag 1260
ttctctgggg actgtttggt gtccttggtt tcactggtgt tgctttgctg ttgtatgcct 1320
tgttccacaa gatatcagga gaaagttctg ccactaatga acccagaggg gcttccaggc 1380
caaatcctca agagttcacc tattcaagcc caaccccaga catggaggag ctgcagccag 1440
tgtatgtcaa tgtgggctct gtagatgtgg atgtggttta ttctcaggtc tggagcatgc 1500
agcagccaga aagctcagca aacatcagga cacttctgga gaacaaggac tcccaagtca 1560
tctactcttc tgtgaagaaa tcataacact tggaggaatc agaagggaag atcaacagca 1620
aggatggggc atcattaaga cttgctataa aaccttatga aaatgcttga ggcttatcac 1680
ctgccacagc cagaacgtgc ctcaggaggc acctcctgtc atttttgtcc tgatgatgtt 1740
tcttctccaa tatcttcttt tacctatcaa tattcattga actgctgcta catccagaca 1800
ctgtgcaaat aaattatttc tgctaccttc tcttaagcaa tcagtgtgta aagatttgag 1860
ggaagaatga ataagagata caaggtctca ccttcatcta ctgtgaagtg atgagaacag 1920
gacttgatag tggtgtatta acttatttat gtgctgctgg atacagtttg ctaatatttt 1980
gttgagaatt tttgcaaata tgttcattgg gaatattggc ctgaaatttt cttttccact 2040
gtgtctctgc cagaatgttt gtatcaggct gatgctggct tcatagaatg agttaggcag 2100
gagcccttcc tccttgattt tttggcatag tttcagcagg attggtacca gttattcttt 2160
ctgcatcttg tagaattcag ctatgaatcc atctggtcta gggcttttgt gttggttggt 2220
aagtttttta ttactaattc aacttcagcg cttgatattg gtctaggagg ggtttctgtc 2280
tcttcctggt tcaatcttgg gagattgtgt gtttccagga atttagccgt ttcctccaga 2340
ttttcttctt tatgtgcatc gacttgagtg taaacataac ttatatgcac tgggaaacca 2400
aaaaatctgt gtgacttgct ttattgcagc atttgtttta ttttggtagt ctggaactga 2460
acctgcaata tcaccaaagt atgcatatag ttgcaaaaat gtgatttttg acatagtaaa 2520
tatgagtatt tgcaataaac tatgatatta cttttgtaag tatatagaat aaaatgtaaa 2580
taatctataa aa 2592
<210> 76
<211> 508
<212> PRT
<213> 智人
<400> 76
Met Leu Leu Trp Ser Leu Leu Val Ile Phe Asp Ala Val Thr Glu Gln
1 5 10 15
Ala Asp Ser Leu Thr Leu Val Ala Pro Ser Ser Val Phe Glu Gly Asp
20 25 30
Ser Ile Val Leu Lys Cys Gln Gly Glu Gln Asn Trp Lys Ile Gln Lys
35 40 45
Met Ala Tyr His Lys Asp Asn Lys Glu Leu Ser Val Phe Lys Lys Phe
50 55 60
Ser Asp Phe Leu Ile Gln Ser Ala Val Leu Ser Asp Ser Gly Asn Tyr
65 70 75 80
Phe Cys Ser Thr Lys Gly Gln Leu Phe Leu Trp Asp Lys Thr Ser Asn
85 90 95
Ile Val Lys Ile Lys Val Gln Glu Leu Phe Gln Arg Pro Val Leu Thr
100 105 110
Ala Ser Ser Phe Gln Pro Ile Glu Gly Gly Pro Val Ser Leu Lys Cys
115 120 125
Glu Thr Arg Leu Ser Pro Gln Arg Leu Asp Val Gln Leu Gln Phe Cys
130 135 140
Phe Phe Arg Glu Asn Gln Val Leu Gly Ser Gly Trp Ser Ser Ser Pro
145 150 155 160
Glu Leu Gln Ile Ser Ala Val Trp Ser Glu Asp Thr Gly Ser Tyr Trp
165 170 175
Cys Lys Ala Glu Thr Val Thr His Arg Ile Arg Lys Gln Ser Leu Gln
180 185 190
Ser Gln Ile His Val Gln Arg Ile Pro Ile Ser Asn Val Ser Leu Glu
195 200 205
Ile Arg Ala Pro Gly Gly Gln Val Thr Glu Gly Gln Lys Leu Ile Leu
210 215 220
Leu Cys Ser Val Ala Gly Gly Thr Gly Asn Val Thr Phe Ser Trp Tyr
225 230 235 240
Arg Glu Ala Thr Gly Thr Ser Met Gly Lys Lys Thr Gln Arg Ser Leu
245 250 255
Ser Ala Glu Leu Glu Ile Pro Ala Val Lys Glu Ser Asp Ala Gly Lys
260 265 270
Tyr Tyr Cys Arg Ala Asp Asn Gly His Val Pro Ile Gln Ser Lys Val
275 280 285
Val Asn Ile Pro Val Arg Ile Pro Val Ser Arg Pro Val Leu Thr Leu
290 295 300
Arg Ser Pro Gly Ala Gln Ala Ala Val Gly Asp Leu Leu Glu Leu His
305 310 315 320
Cys Glu Ala Leu Arg Gly Ser Pro Pro Ile Leu Tyr Gln Phe Tyr His
325 330 335
Glu Asp Val Thr Leu Gly Asn Ser Ser Ala Pro Ser Gly Gly Gly Ala
340 345 350
Ser Phe Asn Leu Ser Leu Thr Ala Glu His Ser Gly Asn Tyr Ser Cys
355 360 365
Glu Ala Asn Asn Gly Leu Gly Ala Gln Cys Ser Glu Ala Val Pro Val
370 375 380
Ser Ile Ser Gly Pro Asp Gly Tyr Arg Arg Asp Leu Met Thr Ala Gly
385 390 395 400
Val Leu Trp Gly Leu Phe Gly Val Leu Gly Phe Thr Gly Val Ala Leu
405 410 415
Leu Leu Tyr Ala Leu Phe His Lys Ile Ser Gly Glu Ser Ser Ala Thr
420 425 430
Asn Glu Pro Arg Gly Ala Ser Arg Pro Asn Pro Gln Glu Phe Thr Tyr
435 440 445
Ser Ser Pro Thr Pro Asp Met Glu Glu Leu Gln Pro Val Tyr Val Asn
450 455 460
Val Gly Ser Val Asp Val Asp Val Val Tyr Ser Gln Val Trp Ser Met
465 470 475 480
Gln Gln Pro Glu Ser Ser Ala Asn Ile Arg Thr Leu Leu Glu Asn Lys
485 490 495
Asp Ser Gln Val Ile Tyr Ser Ser Val Lys Lys Ser
500 505
<210> 77
<211> 4530
<212> DNA
<213> 智人
<400> 77
aattctcgag ctcgtcgacc ggtcgacgag ctcgagggtc gacgagctcg agggcgcgcg 60
cccggccccc acccctcgca gcaccccgcg ccccgcgccc tcccagccgg gtccagccgg 120
agccatgggg ccggagccgc agtgagcacc atggagctgg cggccttgtg ccgctggggg 180
ctcctcctcg ccctcttgcc ccccggagcc gcgagcaccc aagtgtgcac cggcacagac 240
atgaagctgc ggctccctgc cagtcccgag acccacctgg acatgctccg ccacctctac 300
cagggctgcc aggtggtgca gggaaacctg gaactcacct acctgcccac caatgccagc 360
ctgtccttcc tgcaggatat ccaggaggtg cagggctacg tgctcatcgc tcacaaccaa 420
gtgaggcagg tcccactgca gaggctgcgg attgtgcgag gcacccagct ctttgaggac 480
aactatgccc tggccgtgct agacaatgga gacccgctga acaataccac ccctgtcaca 540
ggggcctccc caggaggcct gcgggagctg cagcttcgaa gcctcacaga gatcttgaaa 600
ggaggggtct tgatccagcg gaacccccag ctctgctacc aggacacgat tttgtggaag 660
gacatcttcc acaagaacaa ccagctggct ctcacactga tagacaccaa ccgctctcgg 720
gcctgccacc cctgttctcc gatgtgtaag ggctcccgct gctggggaga gagttctgag 780
gattgtcaga gcctgacgcg cactgtctgt gccggtggct gtgcccgctg caaggggcca 840
ctgcccactg actgctgcca tgagcagtgt gctgccggct gcacgggccc caagcactct 900
gactgcctgg cctgcctcca cttcaaccac agtggcatct gtgagctgca ctgcccagcc 960
ctggtcacct acaacacaga cacgtttgag tccatgccca atcccgaggg ccggtataca 1020
ttcggcgcca gctgtgtgac tgcctgtccc tacaactacc tttctacgga cgtgggatcc 1080
tgcaccctcg tctgccccct gcacaaccaa gaggtgacag cagaggatgg aacacagcgg 1140
tgtgagaagt gcagcaagcc ctgtgcccga gtgtgctatg gtctgggcat ggagcacttg 1200
cgagaggtga gggcagttac cagtgccaat atccaggagt ttgctggctg caagaagatc 1260
tttgggagcc tggcatttct gccggagagc tttgatgggg acccagcctc caacactgcc 1320
ccgctccagc cagagcagct ccaagtgttt gagactctgg aagagatcac aggttaccta 1380
tacatctcag catggccgga cagcctgcct gacctcagcg tcttccagaa cctgcaagta 1440
atccggggac gaattctgca caatggcgcc tactcgctga ccctgcaagg gctgggcatc 1500
agctggctgg ggctgcgctc actgagggaa ctgggcagtg gactggccct catccaccat 1560
aacacccacc tctgcttcgt gcacacggtg ccctgggacc agctctttcg gaacccgcac 1620
caagctctgc tccacactgc caaccggcca gaggacgagt gtgtgggcga gggcctggcc 1680
tgccaccagc tgtgcgcccg agggcactgc tggggtccag ggcccaccca gtgtgtcaac 1740
tgcagccagt tccttcgggg ccaggagtgc gtggaggaat gccgagtact gcaggggctc 1800
cccagggagt atgtgaatgc caggcactgt ttgccgtgcc accctgagtg tcagccccag 1860
aatggctcag tgacctgttt tggaccggag gctgaccagt gtgtggcctg tgcccactat 1920
aaggaccctc ccttctgcgt ggcccgctgc cccagcggtg tgaaacctga cctctcctac 1980
atgcccatct ggaagtttcc agatgaggag ggcgcatgcc agccttgccc catcaactgc 2040
acccactcct gtgtggacct ggatgacaag ggctgccccg ccgagcagag agccagccct 2100
ctgacgtcca tcgtctctgc ggtggttggc attctgctgg tcgtggtctt gggggtggtc 2160
tttgggatcc tcatcaagcg acggcagcag aagatccgga agtacacgat gcggagactg 2220
ctgcaggaaa cggagctggt ggagccgctg acacctagcg gagcgatgcc caaccaggcg 2280
cagatgcgga tcctgaaaga gacggagctg aggaaggtga aggtgcttgg atctggcgct 2340
tttggcacag tctacaaggg catctggatc cctgatgggg agaatgtgaa aattccagtg 2400
gccatcaaag tgttgaggga aaacacatcc cccaaagcca acaaagaaat cttagacgaa 2460
gcatacgtga tggctggtgt gggctcccca tatgtctccc gccttctggg catctgcctg 2520
acatccacgg tgcagctggt gacacagctt atgccctatg gctgcctctt agaccatgtc 2580
cgggaaaacc gcggacgcct gggctcccag gacctgctga actggtgtat gcagattgcc 2640
aaggggatga gctacctgga ggatgtgcgg ctcgtacaca gggacttggc cgctcggaac 2700
gtgctggtca agagtcccaa ccatgtcaaa attacagact tcgggctggc tcggctgctg 2760
gacattgacg agacagagta ccatgcagat gggggcaagg tgcccatcaa gtggatggcg 2820
ctggagtcca ttctccgccg gcggttcacc caccagagtg atgtgtggag ttatggtgtg 2880
actgtgtggg agctgatgac ttttggggcc aaaccttacg atgggatccc agcccgggag 2940
atccctgacc tgctggaaaa gggggagcgg ctgccccagc cccccatctg caccattgat 3000
gtctacatga tcatggtcaa atgttggatg attgactctg aatgtcggcc aagattccgg 3060
gagttggtgt ctgaattctc ccgcatggcc agggaccccc agcgctttgt ggtcatccag 3120
aatgaggact tgggcccagc cagtcccttg gacagcacct tctaccgctc actgctggag 3180
gacgatgaca tgggggacct ggtggatgct gaggagtatc tggtacccca gcagggcttc 3240
ttctgtccag accctgcccc gggcgctggg ggcatggtcc accacaggca ccgcagctca 3300
tctaccagga gtggcggtgg ggacctgaca ctagggctgg agccctctga agaggaggcc 3360
cccaggtctc cactggcacc ctccgaaggg gctggctccg atgtatttga tggtgacctg 3420
ggaatggggg cagccaaggg gctgcaaagc ctccccacac atgaccccag ccctctacag 3480
cggtacagtg aggaccccac agtacccctg ccctctgaga ctgatggcta cgttgccccc 3540
ctgacctgca gcccccagcc tgaatatgtg aaccagccag atgttcggcc ccagccccct 3600
tcgccccgag agggccctct gcctgctgcc cgacctgctg gtgccactct ggaaagggcc 3660
aagactctct ccccagggaa gaatggggtc gtcaaagacg tttttgcctt tgggggtgcc 3720
gtggagaacc ccgagtactt gacaccccag ggaggagctg cccctcagcc ccaccctcct 3780
cctgccttca gcccagcctt cgacaacctc tattactggg accaggaccc accagagcgg 3840
ggggctccac ccagcacctt caaagggaca cctacggcag agaacccaga gtacctgggt 3900
ctggacgtgc cagtgtgaac cagaaggcca agtccgcaga agccctgatg tgtcctcagg 3960
gagcagggaa ggcctgactt ctgctggcat caagaggtgg gagggccctc cgaccacttc 4020
caggggaacc tgccatgcca ggaacctgtc ctaaggaacc ttccttcctg cttgagttcc 4080
cagatggctg gaaggggtcc agcctcgttg gaagaggaac agcactgggg agtctttgtg 4140
gattctgagg ccctgcccaa tgagactcta gggtccagtg gatgccacag cccagcttgg 4200
ccctttcctt ccagatcctg ggtactgaaa gccttaggga agctggcctg agaggggaag 4260
cggccctaag ggagtgtcta agaacaaaag cgacccattc agagactgtc cctgaaacct 4320
agtactgccc cccatgagga aggaacagca atggtgtcag tatccaggct ttgtacagag 4380
tgcttttctg tttagttttt actttttttg ttttgttttt ttaaagacga aataaagacc 4440
caggggagaa tgggtgttgt atggggaggc aagtgtgggg ggtccttctc cacacccact 4500
ttgtccattt gcaaatatat tttggaaaac 4530
<210> 78
<211> 1255
<212> PRT
<213> 智人
<400> 78
Met Glu Leu Ala Ala Leu Cys Arg Trp Gly Leu Leu Leu Ala Leu Leu
1 5 10 15
Pro Pro Gly Ala Ala Ser Thr Gln Val Cys Thr Gly Thr Asp Met Lys
20 25 30
Leu Arg Leu Pro Ala Ser Pro Glu Thr His Leu Asp Met Leu Arg His
35 40 45
Leu Tyr Gln Gly Cys Gln Val Val Gln Gly Asn Leu Glu Leu Thr Tyr
50 55 60
Leu Pro Thr Asn Ala Ser Leu Ser Phe Leu Gln Asp Ile Gln Glu Val
65 70 75 80
Gln Gly Tyr Val Leu Ile Ala His Asn Gln Val Arg Gln Val Pro Leu
85 90 95
Gln Arg Leu Arg Ile Val Arg Gly Thr Gln Leu Phe Glu Asp Asn Tyr
100 105 110
Ala Leu Ala Val Leu Asp Asn Gly Asp Pro Leu Asn Asn Thr Thr Pro
115 120 125
Val Thr Gly Ala Ser Pro Gly Gly Leu Arg Glu Leu Gln Leu Arg Ser
130 135 140
Leu Thr Glu Ile Leu Lys Gly Gly Val Leu Ile Gln Arg Asn Pro Gln
145 150 155 160
Leu Cys Tyr Gln Asp Thr Ile Leu Trp Lys Asp Ile Phe His Lys Asn
165 170 175
Asn Gln Leu Ala Leu Thr Leu Ile Asp Thr Asn Arg Ser Arg Ala Cys
180 185 190
His Pro Cys Ser Pro Met Cys Lys Gly Ser Arg Cys Trp Gly Glu Ser
195 200 205
Ser Glu Asp Cys Gln Ser Leu Thr Arg Thr Val Cys Ala Gly Gly Cys
210 215 220
Ala Arg Cys Lys Gly Pro Leu Pro Thr Asp Cys Cys His Glu Gln Cys
225 230 235 240
Ala Ala Gly Cys Thr Gly Pro Lys His Ser Asp Cys Leu Ala Cys Leu
245 250 255
His Phe Asn His Ser Gly Ile Cys Glu Leu His Cys Pro Ala Leu Val
260 265 270
Thr Tyr Asn Thr Asp Thr Phe Glu Ser Met Pro Asn Pro Glu Gly Arg
275 280 285
Tyr Thr Phe Gly Ala Ser Cys Val Thr Ala Cys Pro Tyr Asn Tyr Leu
290 295 300
Ser Thr Asp Val Gly Ser Cys Thr Leu Val Cys Pro Leu His Asn Gln
305 310 315 320
Glu Val Thr Ala Glu Asp Gly Thr Gln Arg Cys Glu Lys Cys Ser Lys
325 330 335
Pro Cys Ala Arg Val Cys Tyr Gly Leu Gly Met Glu His Leu Arg Glu
340 345 350
Val Arg Ala Val Thr Ser Ala Asn Ile Gln Glu Phe Ala Gly Cys Lys
355 360 365
Lys Ile Phe Gly Ser Leu Ala Phe Leu Pro Glu Ser Phe Asp Gly Asp
370 375 380
Pro Ala Ser Asn Thr Ala Pro Leu Gln Pro Glu Gln Leu Gln Val Phe
385 390 395 400
Glu Thr Leu Glu Glu Ile Thr Gly Tyr Leu Tyr Ile Ser Ala Trp Pro
405 410 415
Asp Ser Leu Pro Asp Leu Ser Val Phe Gln Asn Leu Gln Val Ile Arg
420 425 430
Gly Arg Ile Leu His Asn Gly Ala Tyr Ser Leu Thr Leu Gln Gly Leu
435 440 445
Gly Ile Ser Trp Leu Gly Leu Arg Ser Leu Arg Glu Leu Gly Ser Gly
450 455 460
Leu Ala Leu Ile His His Asn Thr His Leu Cys Phe Val His Thr Val
465 470 475 480
Pro Trp Asp Gln Leu Phe Arg Asn Pro His Gln Ala Leu Leu His Thr
485 490 495
Ala Asn Arg Pro Glu Asp Glu Cys Val Gly Glu Gly Leu Ala Cys His
500 505 510
Gln Leu Cys Ala Arg Gly His Cys Trp Gly Pro Gly Pro Thr Gln Cys
515 520 525
Val Asn Cys Ser Gln Phe Leu Arg Gly Gln Glu Cys Val Glu Glu Cys
530 535 540
Arg Val Leu Gln Gly Leu Pro Arg Glu Tyr Val Asn Ala Arg His Cys
545 550 555 560
Leu Pro Cys His Pro Glu Cys Gln Pro Gln Asn Gly Ser Val Thr Cys
565 570 575
Phe Gly Pro Glu Ala Asp Gln Cys Val Ala Cys Ala His Tyr Lys Asp
580 585 590
Pro Pro Phe Cys Val Ala Arg Cys Pro Ser Gly Val Lys Pro Asp Leu
595 600 605
Ser Tyr Met Pro Ile Trp Lys Phe Pro Asp Glu Glu Gly Ala Cys Gln
610 615 620
Pro Cys Pro Ile Asn Cys Thr His Ser Cys Val Asp Leu Asp Asp Lys
625 630 635 640
Gly Cys Pro Ala Glu Gln Arg Ala Ser Pro Leu Thr Ser Ile Val Ser
645 650 655
Ala Val Val Gly Ile Leu Leu Val Val Val Leu Gly Val Val Phe Gly
660 665 670
Ile Leu Ile Lys Arg Arg Gln Gln Lys Ile Arg Lys Tyr Thr Met Arg
675 680 685
Arg Leu Leu Gln Glu Thr Glu Leu Val Glu Pro Leu Thr Pro Ser Gly
690 695 700
Ala Met Pro Asn Gln Ala Gln Met Arg Ile Leu Lys Glu Thr Glu Leu
705 710 715 720
Arg Lys Val Lys Val Leu Gly Ser Gly Ala Phe Gly Thr Val Tyr Lys
725 730 735
Gly Ile Trp Ile Pro Asp Gly Glu Asn Val Lys Ile Pro Val Ala Ile
740 745 750
Lys Val Leu Arg Glu Asn Thr Ser Pro Lys Ala Asn Lys Glu Ile Leu
755 760 765
Asp Glu Ala Tyr Val Met Ala Gly Val Gly Ser Pro Tyr Val Ser Arg
770 775 780
Leu Leu Gly Ile Cys Leu Thr Ser Thr Val Gln Leu Val Thr Gln Leu
785 790 795 800
Met Pro Tyr Gly Cys Leu Leu Asp His Val Arg Glu Asn Arg Gly Arg
805 810 815
Leu Gly Ser Gln Asp Leu Leu Asn Trp Cys Met Gln Ile Ala Lys Gly
820 825 830
Met Ser Tyr Leu Glu Asp Val Arg Leu Val His Arg Asp Leu Ala Ala
835 840 845
Arg Asn Val Leu Val Lys Ser Pro Asn His Val Lys Ile Thr Asp Phe
850 855 860
Gly Leu Ala Arg Leu Leu Asp Ile Asp Glu Thr Glu Tyr His Ala Asp
865 870 875 880
Gly Gly Lys Val Pro Ile Lys Trp Met Ala Leu Glu Ser Ile Leu Arg
885 890 895
Arg Arg Phe Thr His Gln Ser Asp Val Trp Ser Tyr Gly Val Thr Val
900 905 910
Trp Glu Leu Met Thr Phe Gly Ala Lys Pro Tyr Asp Gly Ile Pro Ala
915 920 925
Arg Glu Ile Pro Asp Leu Leu Glu Lys Gly Glu Arg Leu Pro Gln Pro
930 935 940
Pro Ile Cys Thr Ile Asp Val Tyr Met Ile Met Val Lys Cys Trp Met
945 950 955 960
Ile Asp Ser Glu Cys Arg Pro Arg Phe Arg Glu Leu Val Ser Glu Phe
965 970 975
Ser Arg Met Ala Arg Asp Pro Gln Arg Phe Val Val Ile Gln Asn Glu
980 985 990
Asp Leu Gly Pro Ala Ser Pro Leu Asp Ser Thr Phe Tyr Arg Ser Leu
995 1000 1005
Leu Glu Asp Asp Asp Met Gly Asp Leu Val Asp Ala Glu Glu Tyr
1010 1015 1020
Leu Val Pro Gln Gln Gly Phe Phe Cys Pro Asp Pro Ala Pro Gly
1025 1030 1035
Ala Gly Gly Met Val His His Arg His Arg Ser Ser Ser Thr Arg
1040 1045 1050
Ser Gly Gly Gly Asp Leu Thr Leu Gly Leu Glu Pro Ser Glu Glu
1055 1060 1065
Glu Ala Pro Arg Ser Pro Leu Ala Pro Ser Glu Gly Ala Gly Ser
1070 1075 1080
Asp Val Phe Asp Gly Asp Leu Gly Met Gly Ala Ala Lys Gly Leu
1085 1090 1095
Gln Ser Leu Pro Thr His Asp Pro Ser Pro Leu Gln Arg Tyr Ser
1100 1105 1110
Glu Asp Pro Thr Val Pro Leu Pro Ser Glu Thr Asp Gly Tyr Val
1115 1120 1125
Ala Pro Leu Thr Cys Ser Pro Gln Pro Glu Tyr Val Asn Gln Pro
1130 1135 1140
Asp Val Arg Pro Gln Pro Pro Ser Pro Arg Glu Gly Pro Leu Pro
1145 1150 1155
Ala Ala Arg Pro Ala Gly Ala Thr Leu Glu Arg Ala Lys Thr Leu
1160 1165 1170
Ser Pro Gly Lys Asn Gly Val Val Lys Asp Val Phe Ala Phe Gly
1175 1180 1185
Gly Ala Val Glu Asn Pro Glu Tyr Leu Thr Pro Gln Gly Gly Ala
1190 1195 1200
Ala Pro Gln Pro His Pro Pro Pro Ala Phe Ser Pro Ala Phe Asp
1205 1210 1215
Asn Leu Tyr Tyr Trp Asp Gln Asp Pro Pro Glu Arg Gly Ala Pro
1220 1225 1230
Pro Ser Thr Phe Lys Gly Thr Pro Thr Ala Glu Asn Pro Glu Tyr
1235 1240 1245
Leu Gly Leu Asp Val Pro Val
1250 1255
<210> 79
<211> 2533
<212> DNA
<213> 智人
<400> 79
ggagctcaag ctcctctaca aagaggtgga cagagaagac agcagagacc atgggacccc 60
cctcagcccc tccctgcaga ttgcatgtcc cctggaagga ggtcctgctc acagcctcac 120
ttctaacctt ctggaaccca cccaccactg ccaagctcac tattgaatcc acgccattca 180
atgtcgcaga ggggaaggag gttcttctac tcgcccacaa cctgccccag aatcgtattg 240
gttacagctg gtacaaaggc gaaagagtgg atggcaacag tctaattgta ggatatgtaa 300
taggaactca acaagctacc ccagggcccg catacagtgg tcgagagaca atatacccca 360
atgcatccct gctgatccag aacgtcaccc agaatgacac aggattctat accctacaag 420
tcataaagtc agatcttgtg aatgaagaag caaccggaca gttccatgta tacccggagc 480
tgcccaagcc ctccatctcc agcaacaact ccaaccccgt ggaggacaag gatgctgtgg 540
ccttcacctg tgaacctgag gttcagaaca caacctacct gtggtgggta aatggtcaga 600
gcctcccggt cagtcccagg ctgcagctgt ccaatggcaa catgaccctc actctactca 660
gcgtcaaaag gaacgatgca ggatcctatg aatgtgaaat acagaaccca gcgagtgcca 720
accgcagtga cccagtcacc ctgaatgtcc tctatggccc agatgtcccc accatttccc 780
cctcaaaggc caattaccgt ccaggggaaa atctgaacct ctcctgccac gcagcctcta 840
acccacctgc acagtactct tggtttatca atgggacgtt ccagcaatcc acacaagagc 900
tctttatccc caacatcact gtgaataata gcggatccta tatgtgccaa gcccataact 960
cagccactgg cctcaatagg accacagtca cgatgatcac agtctctgga agtgctcctg 1020
tcctctcagc tgtggccacc gtcggcatca cgattggagt gctggccagg gtggctctga 1080
tatagcagcc ctggtgtatt ttcgatattt caggaagact ggcagattgg accagaccct 1140
gaattcttct agctcctcca atcccatttt atcccatgga accactaaaa acaaggtctg 1200
ctctgctcct gaagccctat atgctggaga tggacaactc aatgaaaatt taaagggaaa 1260
accctcaggc ctgaggtgtg tgccactcag agacttcacc taactagaga cagtcaaact 1320
gcaaaccatg gtgagaaatt gacgacttca cactatggac agcttttccc aagatgtcaa 1380
aacaagactc ctcatcatga taaggctctt accccctttt aatttgtcct tgcttatgcc 1440
tgcctctttc gcttggcagg atgatgctgt cattagtatt tcacaagaag tagcttcaga 1500
gggtaactta acagagtgtc agatctatct tgtcaatccc aacgttttac ataaaataag 1560
agatccttta gtgcacccag tgactgacat tagcagcatc tttaacacag ccgtgtgttc 1620
aaatgtacag tggtcctttt cagagttgga cttctagact cacctgttct cactccctgt 1680
tttaattcaa cccagccatg caatgccaaa taatagaatt gctccctacc agctgaacag 1740
ggaggagtct gtgcagtttc tgacacttgt tgttgaacat ggctaaatac aatgggtatc 1800
gctgagacta agttgtagaa attaacaaat gtgctgcttg gttaaaatgg ctacactcat 1860
ctgactcatt ctttattcta ttttagttgg tttgtatctt gcctaaggtg cgtagtccaa 1920
ctcttggtat taccctccta atagtcatac tagtagtcat actccctggt gtagtgtatt 1980
ctctaaaagc tttaaatgtc tgcatgcagc cagccatcaa atagtgaatg gtctctcttt 2040
ggctggaatt acaaaactca gagaaatgtg tcatcaggag aacatcataa cccatgaagg 2100
ataaaagccc caaatggtgg taactgataa tagcactaat gctttaagat ttggtcacac 2160
tctcacctag gtgagcgcat tgagccagtg gtgctaaatg ctacatactc caactgaaat 2220
gttaaggaag aagatagatc caattaaaaa aaattaaaac caatttaaaa aaaaaaaaga 2280
acacaggaga ttccagtcta cttgagttag cataatacag aagtcccctc tactttaact 2340
tttacaaaaa agtaacctga actaatctga tgttaaccaa tgtatttatt tctgtggttc 2400
tgtttccttg ttccaatttg acaaaaccca ctgttcttgt attgtattgc ccagggggag 2460
ctatcactgt acttgtagag tggtgctgct ttaattcata aatcacaaat aaaagccaat 2520
tagctctata act 2533
<210> 80
<211> 344
<212> PRT
<213> 智人
<400> 80
Met Gly Pro Pro Ser Ala Pro Pro Cys Arg Leu His Val Pro Trp Lys
1 5 10 15
Glu Val Leu Leu Thr Ala Ser Leu Leu Thr Phe Trp Asn Pro Pro Thr
20 25 30
Thr Ala Lys Leu Thr Ile Glu Ser Thr Pro Phe Asn Val Ala Glu Gly
35 40 45
Lys Glu Val Leu Leu Leu Ala His Asn Leu Pro Gln Asn Arg Ile Gly
50 55 60
Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly Asn Ser Leu Ile Val
65 70 75 80
Gly Tyr Val Ile Gly Thr Gln Gln Ala Thr Pro Gly Pro Ala Tyr Ser
85 90 95
Gly Arg Glu Thr Ile Tyr Pro Asn Ala Ser Leu Leu Ile Gln Asn Val
100 105 110
Thr Gln Asn Asp Thr Gly Phe Tyr Thr Leu Gln Val Ile Lys Ser Asp
115 120 125
Leu Val Asn Glu Glu Ala Thr Gly Gln Phe His Val Tyr Pro Glu Leu
130 135 140
Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser Asn Pro Val Glu Asp Lys
145 150 155 160
Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Val Gln Asn Thr Thr Tyr
165 170 175
Leu Trp Trp Val Asn Gly Gln Ser Leu Pro Val Ser Pro Arg Leu Gln
180 185 190
Leu Ser Asn Gly Asn Met Thr Leu Thr Leu Leu Ser Val Lys Arg Asn
195 200 205
Asp Ala Gly Ser Tyr Glu Cys Glu Ile Gln Asn Pro Ala Ser Ala Asn
210 215 220
Arg Ser Asp Pro Val Thr Leu Asn Val Leu Tyr Gly Pro Asp Val Pro
225 230 235 240
Thr Ile Ser Pro Ser Lys Ala Asn Tyr Arg Pro Gly Glu Asn Leu Asn
245 250 255
Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr Ser Trp Phe
260 265 270
Ile Asn Gly Thr Phe Gln Gln Ser Thr Gln Glu Leu Phe Ile Pro Asn
275 280 285
Ile Thr Val Asn Asn Ser Gly Ser Tyr Met Cys Gln Ala His Asn Ser
290 295 300
Ala Thr Gly Leu Asn Arg Thr Thr Val Thr Met Ile Thr Val Ser Gly
305 310 315 320
Ser Ala Pro Val Leu Ser Ala Val Ala Thr Val Gly Ile Thr Ile Gly
325 330 335
Val Leu Ala Arg Val Ala Leu Ile
340
<210> 81
<211> 1583
<212> DNA
<213> 智人
<400> 81
gtgcctctcc tggcaggcag agtggctcct cacagcctga agctcatcct tctgcacggg 60
ccagccaggc cagcacagag gcaccagggc agcagtgcac acaggtcccc ggggacccca 120
ccatgtggag cggatggtgg ctgtggcccc ttgtggccgt ctgcactgca gacttctttc 180
gggacgaggc agagaggatc atgagggact cccctgtcat tgatgggcac aatgacctcc 240
cctggcagct gctggatatg ttcaacaacc ggctgcagga cgagagggcc aacctgacca 300
ccttggccgg cacacacacc aacatcccca agctgagggc cggctttgtg ggaggccagt 360
tctggtccgt gtacacgccc tgcgacaccc agaacaaaga cgccgtgcgg aggacgctgg 420
agcagatgga cgtggtccac cgcatgtgcc ggatgtaccc ggagaccttc ctgtatgtca 480
ccagcagtgc aggcattcgg caggccttcc gggaagggaa ggtggccagc ctgatcggcg 540
tggagggcgg ccactccatt gacagcagtt tgggcgtcct gcgggcactc tatcagctgg 600
gcatgcggta cctgaccctc acccacagct gcaacacgcc ctgggctgac aactggctgg 660
tggacacggg agacagcgag ccccagagcc aaggcttgtc accctttggg cagcgtgtgg 720
tgaaggagct gaaccgtctg ggggtcctca tcgacttggc tcacgtgtct gtggccacca 780
tgaaggccac cctgcagctg tccagagccc cggtcatctt cagccactcc tcggcctaca 840
gcgtgtgcgc aagccggcgc aacgtgcctg acgacgtcct gaggctggtg aaacagacag 900
acagcctggt gatggtgaac ttctacaaca attacatttc ctgcaccaac aaggccaacc 960
tgtcccaagt ggccgaccat ctggatcaca tcaaggaggt ggcaggagcc agagccgtgg 1020
gttttggtgg ggactttgat ggtgttccaa gggtccctga ggggctggag gacgtctcca 1080
agtatccaga cctgatcgct gagctgctca ggaggaactg gacggaggcg gaggtcaagg 1140
gcgcactggc tgacaacctg ctgagggtct tcgaggctgt ggaacaggcc agcaacctca 1200
cacaggctcc cgaggaggag cccatcccgc tggaccagct gggtggctcc tgcaggaccc 1260
attacggcta ctcctctggg gcttccagcc tccatcgcca ctgggggctc ctgctggcct 1320
ccctcgctcc cctggtcctc tgtctgtctc tcctgtgaaa cctgggagac cagagtcccc 1380
tttagggttc ccggagctcc gggaagaccc gcccatccca ggactccaga tgccaggagc 1440
cctgctgccc acatgcaagg accagcatct cctgagagga cgcctgggct tacctggggg 1500
gcaggatgcc tggggacagt tcaggacaca cacacagtag gcccgcaata aaagcaacac 1560
cccttcaaaa aaaaaaaaaa aaa 1583
<210> 82
<211> 411
<212> PRT
<213> 智人
<400> 82
Met Trp Ser Gly Trp Trp Leu Trp Pro Leu Val Ala Val Cys Thr Ala
1 5 10 15
Asp Phe Phe Arg Asp Glu Ala Glu Arg Ile Met Arg Asp Ser Pro Val
20 25 30
Ile Asp Gly His Asn Asp Leu Pro Trp Gln Leu Leu Asp Met Phe Asn
35 40 45
Asn Arg Leu Gln Asp Glu Arg Ala Asn Leu Thr Thr Leu Ala Gly Thr
50 55 60
His Thr Asn Ile Pro Lys Leu Arg Ala Gly Phe Val Gly Gly Gln Phe
65 70 75 80
Trp Ser Val Tyr Thr Pro Cys Asp Thr Gln Asn Lys Asp Ala Val Arg
85 90 95
Arg Thr Leu Glu Gln Met Asp Val Val His Arg Met Cys Arg Met Tyr
100 105 110
Pro Glu Thr Phe Leu Tyr Val Thr Ser Ser Ala Gly Ile Arg Gln Ala
115 120 125
Phe Arg Glu Gly Lys Val Ala Ser Leu Ile Gly Val Glu Gly Gly His
130 135 140
Ser Ile Asp Ser Ser Leu Gly Val Leu Arg Ala Leu Tyr Gln Leu Gly
145 150 155 160
Met Arg Tyr Leu Thr Leu Thr His Ser Cys Asn Thr Pro Trp Ala Asp
165 170 175
Asn Trp Leu Val Asp Thr Gly Asp Ser Glu Pro Gln Ser Gln Gly Leu
180 185 190
Ser Pro Phe Gly Gln Arg Val Val Lys Glu Leu Asn Arg Leu Gly Val
195 200 205
Leu Ile Asp Leu Ala His Val Ser Val Ala Thr Met Lys Ala Thr Leu
210 215 220
Gln Leu Ser Arg Ala Pro Val Ile Phe Ser His Ser Ser Ala Tyr Ser
225 230 235 240
Val Cys Ala Ser Arg Arg Asn Val Pro Asp Asp Val Leu Arg Leu Val
245 250 255
Lys Gln Thr Asp Ser Leu Val Met Val Asn Phe Tyr Asn Asn Tyr Ile
260 265 270
Ser Cys Thr Asn Lys Ala Asn Leu Ser Gln Val Ala Asp His Leu Asp
275 280 285
His Ile Lys Glu Val Ala Gly Ala Arg Ala Val Gly Phe Gly Gly Asp
290 295 300
Phe Asp Gly Val Pro Arg Val Pro Glu Gly Leu Glu Asp Val Ser Lys
305 310 315 320
Tyr Pro Asp Leu Ile Ala Glu Leu Leu Arg Arg Asn Trp Thr Glu Ala
325 330 335
Glu Val Lys Gly Ala Leu Ala Asp Asn Leu Leu Arg Val Phe Glu Ala
340 345 350
Val Glu Gln Ala Ser Asn Leu Thr Gln Ala Pro Glu Glu Glu Pro Ile
355 360 365
Pro Leu Asp Gln Leu Gly Gly Ser Cys Arg Thr His Tyr Gly Tyr Ser
370 375 380
Ser Gly Ala Ser Ser Leu His Arg His Trp Gly Leu Leu Leu Ala Ser
385 390 395 400
Leu Ala Pro Leu Val Leu Cys Leu Ser Leu Leu
405 410
<210> 83
<211> 3485
<212> DNA
<213> 智人
<400> 83
tccagctggg tagccggggg agcgcgcgtg ggggctccgc gagtcgctcg cccttggttt 60
ctggggaagc ctgggggacg cggctgtggc ggaggcgccc tgggactcag gtcgcctgga 120
gcgtggcacg cagagcccca ggcgcggagc tgaggccgcg cggccgcgct tggccccagc 180
gggcgtggga ctgagcagtc tgctgccccc cgacatgtga cccagccccg ccgcccatgc 240
gggctcccgg ccgcccggcc ctgcggccgc tgccgctgcc gccgctgctg ctgttgctcc 300
tggcggcgcc ttggggacgg gcagttccct gtgtctctgg tggtttgcct aaacctgcaa 360
acatcacctt cttatccatc aacatgaaga atgtcctaca atggactcca ccagagggtc 420
ttcaaggagt taaagttact tacactgtgc agtatttcat atatgggcaa aagaaatggc 480
tgaataaatc agaatgcaga aatatcaata gaacctactg tgatctttct gctgaaactt 540
ctgactacga acaccagtat tatgccaaag ttaaggccat ttggggaaca aagtgttcca 600
aatgggctga aagtggacgg ttctatcctt ttttagaaac acaaattggc ccaccagagg 660
tggcactgac tacagatgag aagtccattt ctgttgtcct gacagctcca gagaagtgga 720
agagaaatcc agaagacctt cctgtttcca tgcaacaaat atactccaat ctgaagtata 780
acgtgtctgt gttgaatact aaatcaaaca gaacgtggtc ccagtgtgtg accaaccaca 840
cgctggtgct cacctggctg gagccgaaca ctctttactg cgtacacgtg gagtccttcg 900
tcccagggcc ccctcgccgt gctcagcctt ctgagaagca gtgtgccagg actttgaaag 960
atcaatcatc agagttcaag gctaaaatca tcttctggta tgttttgccc atatctatta 1020
ccgtgtttct tttttctgtg atgggctatt ccatctaccg atatatccac gttggcaaag 1080
agaaacaccc agcaaatttg attttgattt atggaaatga atttgacaaa agattctttg 1140
tgcctgctga aaaaatcgtg attaacttta tcaccctcaa tatctcggat gattctaaaa 1200
tttctcatca ggatatgagt ttactgggaa aaagcagtga tgtatccagc cttaatgatc 1260
ctcagcccag cgggaacctg aggccccctc aggaggaaga ggaggtgaaa catttagggt 1320
atgcttcgca tttgatggaa attttttgtg actctgaaga aaacacggaa ggtacttctt 1380
tcacccagca agagtccctc agcagaacaa tacccccgga taaaacagtc attgaatatg 1440
aatatgatgt cagaaccact gacatttgtg cggggcctga agagcaggag ctcagtttgc 1500
aggaggaggt gtccacacaa ggaacattat tggagtcgca ggcagcgttg gcagtcttgg 1560
gcccgcaaac gttacagtac tcatacaccc ctcagctcca agacttagac cccctggcgc 1620
aggagcacac agactcggag gaggggccgg aggaagagcc atcgacgacc ctggtcgact 1680
gggatcccca aactggcagg ctgtgtattc cttcgctgtc cagcttcgac caggattcag 1740
agggctgcga gccttctgag ggggatgggc tcggagagga gggtcttcta tctagactct 1800
atgaggagcc ggctccagac aggccaccag gagaaaatga aacctatctc atgcaattca 1860
tggaggaatg ggggttatat gtgcagatgg aaaactgatg ccaacacttc cttttgcctt 1920
ttgtttcctg tgcaaacaag tgagtcaccc ctttgatccc agccataaag tacctgggat 1980
gaaagaagtt ttttccagtt tgtcagtgtc tgtgagaatt acttatttct tttctctatt 2040
ctcatagcac gtgtgtgatt ggttcatgca tgtaggtctc ttaacaatga tggtgggcct 2100
ctggagtcca ggggctggcc ggttgttcta tgcagagaaa gcagtcaata aatgtttgcc 2160
agactgggtg cagaatttat tcaggtgggt gtactctggc ctcttggttc attattttca 2220
aacaagcaca cttgtacaat tattttctgg gtacttccca tatgcacata gcactgtaaa 2280
aaatatttcc caaagatcac tcattttata aataccactt tttcagaatt gggtttattg 2340
cgagcaggag gagatactta aaacatgcac atataccagg ttggtggtaa gttggtcaca 2400
tgtgaaaacc tcaactattt aatcatcatg attcatattt tgagtgaata catcaggcac 2460
agaccttcat gatatcacac actcttggct actttaagag gccatcttta atactttatg 2520
agtagttctg gagtgtaaac ataaacgagt attcttttgt agtcagaaaa gtgtcctctc 2580
aataatttag taggggctta ttgtctctca aaactaacct aaaagaaaat gacacatttt 2640
ataatagaat attacattta tttctggaag tgtgttttca aaaagatatt tacatagtct 2700
gtaaactaga aagtgttagg taaagctcta ggttactgtg ttactattat aatattaaac 2760
attcgaatag gcagtcgttc aaagactctt tggaatatct atgaatgaat atcctctatt 2820
cttataatat taaaacccat aagtaaatat aggacataca agagaaatga gttaaatgac 2880
tatgtaaggg agagtttatt aaaatttgat gaaatttact gtaggaacta aactatgcca 2940
taaaacaata gctttctagt tcatttccag taactgttcc catctccttt accacttgtt 3000
aagaaaatta aattcttcag tcacgctgct ttaaaatggg acaaaatcta ttaagttgaa 3060
ccatatataa ttgtggatat ttggctgttt ttaatctgac aagcagtaac ttcatatggt 3120
ttgccttaat atatatttgt tttagtcatg aactcataat ccattgatgc tctttcatga 3180
gaagagatat gacccatatt tccttattga tattattggt acaggcagac aaccctggta 3240
ggagagatgg attctggggt catgaccttt cgtgattatc cgcaaatgca aacagtttca 3300
gatctaatgg tttaatttag ggagtaatta tattaatcag agtgttctgt tattctcaat 3360
ctttatagaa acgattctgc tggttttgaa gaacagatgt attacactaa ctgtaaaagt 3420
agttcaagag tgagaaagaa taaattgtta ttaagagcaa aagaaaaata aagtgattga 3480
tgata 3485
<210> 84
<211> 553
<212> PRT
<213> 智人
<400> 84
Met Arg Ala Pro Gly Arg Pro Ala Leu Arg Pro Leu Pro Leu Pro Pro
1 5 10 15
Leu Leu Leu Leu Leu Leu Ala Ala Pro Trp Gly Arg Ala Val Pro Cys
20 25 30
Val Ser Gly Gly Leu Pro Lys Pro Ala Asn Ile Thr Phe Leu Ser Ile
35 40 45
Asn Met Lys Asn Val Leu Gln Trp Thr Pro Pro Glu Gly Leu Gln Gly
50 55 60
Val Lys Val Thr Tyr Thr Val Gln Tyr Phe Ile Tyr Gly Gln Lys Lys
65 70 75 80
Trp Leu Asn Lys Ser Glu Cys Arg Asn Ile Asn Arg Thr Tyr Cys Asp
85 90 95
Leu Ser Ala Glu Thr Ser Asp Tyr Glu His Gln Tyr Tyr Ala Lys Val
100 105 110
Lys Ala Ile Trp Gly Thr Lys Cys Ser Lys Trp Ala Glu Ser Gly Arg
115 120 125
Phe Tyr Pro Phe Leu Glu Thr Gln Ile Gly Pro Pro Glu Val Ala Leu
130 135 140
Thr Thr Asp Glu Lys Ser Ile Ser Val Val Leu Thr Ala Pro Glu Lys
145 150 155 160
Trp Lys Arg Asn Pro Glu Asp Leu Pro Val Ser Met Gln Gln Ile Tyr
165 170 175
Ser Asn Leu Lys Tyr Asn Val Ser Val Leu Asn Thr Lys Ser Asn Arg
180 185 190
Thr Trp Ser Gln Cys Val Thr Asn His Thr Leu Val Leu Thr Trp Leu
195 200 205
Glu Pro Asn Thr Leu Tyr Cys Val His Val Glu Ser Phe Val Pro Gly
210 215 220
Pro Pro Arg Arg Ala Gln Pro Ser Glu Lys Gln Cys Ala Arg Thr Leu
225 230 235 240
Lys Asp Gln Ser Ser Glu Phe Lys Ala Lys Ile Ile Phe Trp Tyr Val
245 250 255
Leu Pro Ile Ser Ile Thr Val Phe Leu Phe Ser Val Met Gly Tyr Ser
260 265 270
Ile Tyr Arg Tyr Ile His Val Gly Lys Glu Lys His Pro Ala Asn Leu
275 280 285
Ile Leu Ile Tyr Gly Asn Glu Phe Asp Lys Arg Phe Phe Val Pro Ala
290 295 300
Glu Lys Ile Val Ile Asn Phe Ile Thr Leu Asn Ile Ser Asp Asp Ser
305 310 315 320
Lys Ile Ser His Gln Asp Met Ser Leu Leu Gly Lys Ser Ser Asp Val
325 330 335
Ser Ser Leu Asn Asp Pro Gln Pro Ser Gly Asn Leu Arg Pro Pro Gln
340 345 350
Glu Glu Glu Glu Val Lys His Leu Gly Tyr Ala Ser His Leu Met Glu
355 360 365
Ile Phe Cys Asp Ser Glu Glu Asn Thr Glu Gly Thr Ser Phe Thr Gln
370 375 380
Gln Glu Ser Leu Ser Arg Thr Ile Pro Pro Asp Lys Thr Val Ile Glu
385 390 395 400
Tyr Glu Tyr Asp Val Arg Thr Thr Asp Ile Cys Ala Gly Pro Glu Glu
405 410 415
Gln Glu Leu Ser Leu Gln Glu Glu Val Ser Thr Gln Gly Thr Leu Leu
420 425 430
Glu Ser Gln Ala Ala Leu Ala Val Leu Gly Pro Gln Thr Leu Gln Tyr
435 440 445
Ser Tyr Thr Pro Gln Leu Gln Asp Leu Asp Pro Leu Ala Gln Glu His
450 455 460
Thr Asp Ser Glu Glu Gly Pro Glu Glu Glu Pro Ser Thr Thr Leu Val
465 470 475 480
Asp Trp Asp Pro Gln Thr Gly Arg Leu Cys Ile Pro Ser Leu Ser Ser
485 490 495
Phe Asp Gln Asp Ser Glu Gly Cys Glu Pro Ser Glu Gly Asp Gly Leu
500 505 510
Gly Glu Glu Gly Leu Leu Ser Arg Leu Tyr Glu Glu Pro Ala Pro Asp
515 520 525
Arg Pro Pro Gly Glu Asn Glu Thr Tyr Leu Met Gln Phe Met Glu Glu
530 535 540
Trp Gly Leu Tyr Val Gln Met Glu Asn
545 550
<210> 85
<211> 2558
<212> DNA
<213> 智人
<400> 85
tgtggcactg cctgcgtacc caaccccagc cctgggtagc ctgcagcatg gcccagctgt 60
tcctgcccct gctggcagcc ctggtcctgg cccaggctcc tgcagcttta gcagatgttc 120
tggaaggaga cagctcagag gaccgcgctt ttcgcgtgcg catcgcgggc gacgcgccac 180
tgcagggcgt gctcggcggc gccctcacca tcccttgcca cgtccactac ctgcggccac 240
cgccgagccg ccgggctgtg ctgggctctc cgcgggtcaa gtggactttc ctgtcccggg 300
gccgggaggc agaggtgctg gtggcgcggg gagtgcgcgt caaggtgaac gaggcctacc 360
ggttccgcgt ggcactgcct gcgtacccag cgtcgctcac cgacgtctcc ctggcgctga 420
gcgagctgcg ccccaacgac tcaggtatct atcgctgtga ggtccagcac ggcatcgatg 480
acagcagcga cgctgtggag gtcaaggtca aaggggtcgt ctttctctac cgagagggct 540
ctgcccgcta tgctttctcc ttttctgggg cccaggaggc ctgtgcccgc attggagccc 600
acatcgccac cccggagcag ctctatgccg cctaccttgg gggctatgag caatgtgatg 660
ctggctggct gtcggatcag accgtgaggt atcccatcca gaccccacga gaggcctgtt 720
acggagacat ggatggcttc cccggggtcc ggaactatgg tgtggtggac ccggatgacc 780
tctatgatgt gtactgttat gctgaagacc taaatggaga attgttcctg ggtgaccctc 840
cagagaagct gacattggag gaagcacggg cgtactgcca ggagcggggt gcagagattg 900
ccaccacggg ccaactgtat gcagcctggg atggtggcct ggaccactgc agcccagggt 960
ggctagctga tggcagtgtg cgctacccca tcgtcacacc cagccagcgc tgtggtgggg 1020
gcttgcctgg tgtcaagact ctcttcctct tccccaacca gactggcttc cccaataagc 1080
acagccgctt caacgtctac tgcttccgag actcggccca gccttctgcc atccctgagg 1140
cctccaaccc agcctccaac ccagcctctg atggactaga ggctatcgtc acagtgacag 1200
agaccctgga ggaactgcag ctgcctcagg aagccacaga gagtgaatcc cgtggggcca 1260
tctactccat ccccatcatg gaggacggag gaggtggaag ctccactcca gaagacccag 1320
cagaggcccc taggacgctc ctagaatttg aaacacaatc catggtaccg cccacggggt 1380
tctcagaaga ggaaggtaag gcattggagg aagaagagaa atatgaagat gaagaagaga 1440
aagaggagga agaagaagag gaggaggtgg aggatgaggc tctgtgggca tggcccagcg 1500
agctcagcag cccgggccct gaggcctctc tccccactga gccagcagcc caggaggagt 1560
cactctccca ggcgccagca agggcagtcc tgcagcctgg tgcatcacca cttcctgatg 1620
gagagtcaga agcttccagg cctccaaggg tccatggacc acctactgag actctgccca 1680
ctcccaggga gaggaaccta gcatccccat caccttccac tctggttgag gcaagagagg 1740
tgggggaggc aactggtggt cctgagctat ctggggtccc tcgaggagag agcgaggaga 1800
caggaagctc cgagggtgcc ccttccctgc ttccagccac acgggcccct gagggtacca 1860
gggagctgga ggccccctct gaagataatt ctggaagaac tgccccagca gggacctcag 1920
tgcaggccca gccagtgctg cccactgaca gcgccagccg aggtggagtg gccgtggtcc 1980
ccgcatcagg taattctgcc caaggctcaa ctgccctctc tatcctactc cttttcttcc 2040
ccctgcagct ctgggtcacc tgacctgtag tcctttaacc caccatcatc ccaaactctc 2100
ctgtcctttg ccttcattct cttacccacc tctacctatg ggtctccaat ctcggatatc 2160
caccttgtgg gtatctcagc tctccgcgtc tttaccctgt gatcccagcc ccgccactga 2220
ccatctgtga cccttccctg ccattgggcc ctccacctgt ggctcacatc tcgccagccc 2280
cacagagcat cctcaggcct ctccaagggt cctcatcacc tattgcagcc ttcagggctc 2340
ggcctatttt ccactactcc cttcatccgc ctgtgtgccg tcccctttag ctgcctccta 2400
ttgatctcag ggaagcctgg gagtcccttc tcacccctca acctccggag tccaggagaa 2460
cccgtacccc cacagagcct taagcaacta cttctgtgaa gtattttttg actgtttcat 2520
ggaaaacaag ccttggaaat aaatctctat taaaccgc 2558
<210> 86
<211> 671
<212> PRT
<213> 智人
<400> 86
Met Ala Gln Leu Phe Leu Pro Leu Leu Ala Ala Leu Val Leu Ala Gln
1 5 10 15
Ala Pro Ala Ala Leu Ala Asp Val Leu Glu Gly Asp Ser Ser Glu Asp
20 25 30
Arg Ala Phe Arg Val Arg Ile Ala Gly Asp Ala Pro Leu Gln Gly Val
35 40 45
Leu Gly Gly Ala Leu Thr Ile Pro Cys His Val His Tyr Leu Arg Pro
50 55 60
Pro Pro Ser Arg Arg Ala Val Leu Gly Ser Pro Arg Val Lys Trp Thr
65 70 75 80
Phe Leu Ser Arg Gly Arg Glu Ala Glu Val Leu Val Ala Arg Gly Val
85 90 95
Arg Val Lys Val Asn Glu Ala Tyr Arg Phe Arg Val Ala Leu Pro Ala
100 105 110
Tyr Pro Ala Ser Leu Thr Asp Val Ser Leu Ala Leu Ser Glu Leu Arg
115 120 125
Pro Asn Asp Ser Gly Ile Tyr Arg Cys Glu Val Gln His Gly Ile Asp
130 135 140
Asp Ser Ser Asp Ala Val Glu Val Lys Val Lys Gly Val Val Phe Leu
145 150 155 160
Tyr Arg Glu Gly Ser Ala Arg Tyr Ala Phe Ser Phe Ser Gly Ala Gln
165 170 175
Glu Ala Cys Ala Arg Ile Gly Ala His Ile Ala Thr Pro Glu Gln Leu
180 185 190
Tyr Ala Ala Tyr Leu Gly Gly Tyr Glu Gln Cys Asp Ala Gly Trp Leu
195 200 205
Ser Asp Gln Thr Val Arg Tyr Pro Ile Gln Thr Pro Arg Glu Ala Cys
210 215 220
Tyr Gly Asp Met Asp Gly Phe Pro Gly Val Arg Asn Tyr Gly Val Val
225 230 235 240
Asp Pro Asp Asp Leu Tyr Asp Val Tyr Cys Tyr Ala Glu Asp Leu Asn
245 250 255
Gly Glu Leu Phe Leu Gly Asp Pro Pro Glu Lys Leu Thr Leu Glu Glu
260 265 270
Ala Arg Ala Tyr Cys Gln Glu Arg Gly Ala Glu Ile Ala Thr Thr Gly
275 280 285
Gln Leu Tyr Ala Ala Trp Asp Gly Gly Leu Asp His Cys Ser Pro Gly
290 295 300
Trp Leu Ala Asp Gly Ser Val Arg Tyr Pro Ile Val Thr Pro Ser Gln
305 310 315 320
Arg Cys Gly Gly Gly Leu Pro Gly Val Lys Thr Leu Phe Leu Phe Pro
325 330 335
Asn Gln Thr Gly Phe Pro Asn Lys His Ser Arg Phe Asn Val Tyr Cys
340 345 350
Phe Arg Asp Ser Ala Gln Pro Ser Ala Ile Pro Glu Ala Ser Asn Pro
355 360 365
Ala Ser Asn Pro Ala Ser Asp Gly Leu Glu Ala Ile Val Thr Val Thr
370 375 380
Glu Thr Leu Glu Glu Leu Gln Leu Pro Gln Glu Ala Thr Glu Ser Glu
385 390 395 400
Ser Arg Gly Ala Ile Tyr Ser Ile Pro Ile Met Glu Asp Gly Gly Gly
405 410 415
Gly Ser Ser Thr Pro Glu Asp Pro Ala Glu Ala Pro Arg Thr Leu Leu
420 425 430
Glu Phe Glu Thr Gln Ser Met Val Pro Pro Thr Gly Phe Ser Glu Glu
435 440 445
Glu Gly Lys Ala Leu Glu Glu Glu Glu Lys Tyr Glu Asp Glu Glu Glu
450 455 460
Lys Glu Glu Glu Glu Glu Glu Glu Glu Val Glu Asp Glu Ala Leu Trp
465 470 475 480
Ala Trp Pro Ser Glu Leu Ser Ser Pro Gly Pro Glu Ala Ser Leu Pro
485 490 495
Thr Glu Pro Ala Ala Gln Glu Glu Ser Leu Ser Gln Ala Pro Ala Arg
500 505 510
Ala Val Leu Gln Pro Gly Ala Ser Pro Leu Pro Asp Gly Glu Ser Glu
515 520 525
Ala Ser Arg Pro Pro Arg Val His Gly Pro Pro Thr Glu Thr Leu Pro
530 535 540
Thr Pro Arg Glu Arg Asn Leu Ala Ser Pro Ser Pro Ser Thr Leu Val
545 550 555 560
Glu Ala Arg Glu Val Gly Glu Ala Thr Gly Gly Pro Glu Leu Ser Gly
565 570 575
Val Pro Arg Gly Glu Ser Glu Glu Thr Gly Ser Ser Glu Gly Ala Pro
580 585 590
Ser Leu Leu Pro Ala Thr Arg Ala Pro Glu Gly Thr Arg Glu Leu Glu
595 600 605
Ala Pro Ser Glu Asp Asn Ser Gly Arg Thr Ala Pro Ala Gly Thr Ser
610 615 620
Val Gln Ala Gln Pro Val Leu Pro Thr Asp Ser Ala Ser Arg Gly Gly
625 630 635 640
Val Ala Val Val Pro Ala Ser Gly Asn Ser Ala Gln Gly Ser Thr Ala
645 650 655
Leu Ser Ile Leu Leu Leu Phe Phe Pro Leu Gln Leu Trp Val Thr
660 665 670
<210> 87
<211> 4869
<212> DNA
<213> 智人
<400> 87
cattctgctg gctgcgcggt ggcggcggct gtgtgtgcgc cgcgccttgc cgccccccct 60
ggccccccga gcccggggcg cgcgctcccg cccgggccgt ccgggccccg cggcgccgcg 120
gcccgaggcc ccgggaagcg cagccatggc tctgcggagg ctgggggccg cgctgctgct 180
gctgccgctg ctcgccgccg tggaagaaac gctaatggac tccactacag cgactgctga 240
gctgggctgg atggtgcatc ctccatcagg gtgggaagag gtgagtggct acgatgagaa 300
catgaacacg atccgcacgt accaggtgtg caacgtgttt gagtcaagcc agaacaactg 360
gctacggacc aagtttatcc ggcgccgtgg cgcccaccgc atccacgtgg agatgaagtt 420
ttcggtgcgt gactgcagca gcatccccag cgtgcctggc tcctgcaagg agaccttcaa 480
cctctattac tatgaggctg actttgactc ggccaccaag accttcccca actggatgga 540
gaatccatgg gtgaaggtgg ataccattgc agccgacgag agcttctccc aggtggacct 600
gggtggccgc gtcatgaaaa tcaacaccga ggtgcggagc ttcggacctg tgtcccgcag 660
cggcttctac ctggccttcc aggactatgg cggctgcatg tccctcatcg ccgtgcgtgt 720
cttctaccgc aagtgccccc gcatcatcca gaatggcgcc atcttccagg aaaccctgtc 780
gggggctgag agcacatcgc tggtggctgc ccggggcagc tgcatcgcca atgcggaaga 840
ggtggatgta cccatcaagc tctactgtaa cggggacggc gagtggctgg tgcccatcgg 900
gcgctgcatg tgcaaagcag gcttcgaggc cgttgagaat ggcaccgtct gccgaggttg 960
tccatctggg actttcaagg ccaaccaagg ggatgaggcc tgtacccact gtcccatcaa 1020
cagccggacc acttctgaag gggccaccaa ctgtgtctgc cgcaatggct actacagagc 1080
agacctggac cccctggaca tgccctgcac aaccatcccc tccgcgcccc aggctgtgat 1140
ttccagtgtc aatgagacct ccctcatgct ggagtggacc cctccccgcg actccggagg 1200
ccgagaggac ctcgtctaca acatcatctg caagagctgt ggctcgggcc ggggtgcctg 1260
cacccgctgc ggggacaatg tacagtacgc accacgccag ctaggcctga ccgagccacg 1320
catttacatc agtgacctgc tggcccacac ccagtacacc ttcgagatcc aggctgtgaa 1380
cggcgttact gaccagagcc ccttctcgcc tcagttcgcc tctgtgaaca tcaccaccaa 1440
ccaggcagct ccatcggcag tgtccatcat gcatcaggtg agccgcaccg tggacagcat 1500
taccctgtcg tggtcccagc cggaccagcc caatggcgtg atcctggact atgagctgca 1560
gtactatgag aaggagctca gtgagtacaa cgccacagcc ataaaaagcc ccaccaacac 1620
ggtcaccgtg cagggcctca aagccggcgc catctatgtc ttccaggtgc gggcacgcac 1680
cgtggcaggc tacgggcgct acagcggcaa gatgtacttc cagaccatga cagaagccga 1740
gtaccagaca agcatccagg agaagttgcc actcatcatc ggctcctcgg ccgctggcct 1800
ggtcttcctc attgctgtgg ttgtcatcgc catcgtgtgt aacagaagac gggggtttga 1860
gcgtgctgac tcggagtaca cggacaagct gcaacactac accagtggcc acatgacccc 1920
aggcatgaag atctacatcg atcctttcac ctacgaggac cccaacgagg cagtgcggga 1980
gtttgccaag gaaattgaca tctcctgtgt caaaattgag caggtgatcg gagcagggga 2040
gtttggcgag gtctgcagtg gccacctgaa gctgccaggc aagagagaga tctttgtggc 2100
catcaagacg ctcaagtcgg gctacacgga gaagcagcgc cgggacttcc tgagcgaagc 2160
ctccatcatg ggccagttcg accatcccaa cgtcatccac ctggagggtg tcgtgaccaa 2220
gagcacacct gtgatgatca tcaccgagtt catggagaat ggctccctgg actcctttct 2280
ccggcaaaac gatgggcagt tcacagtcat ccagctggtg ggcatgcttc ggggcatcgc 2340
agctggcatg aagtacctgg cagacatgaa ctatgttcac cgtgacctgg ctgcccgcaa 2400
catcctcgtc aacagcaacc tggtctgcaa ggtgtcggac tttgggctct cacgctttct 2460
agaggacgat acctcagacc ccacctacac cagtgccctg ggcggaaaga tccccatccg 2520
ctggacagcc ccggaagcca tccagtaccg gaagttcacc tcggccagtg atgtgtggag 2580
ctacggcatt gtcatgtggg aggtgatgtc ctatggggag cggccctact gggacatgac 2640
caaccaggat gtaatcaatg ccattgagca ggactatcgg ctgccaccgc ccatggactg 2700
cccgagcgcc ctgcaccaac tcatgctgga ctgttggcag aaggaccgca accaccggcc 2760
caagttcggc caaattgtca acacgctaga caagatgatc cgcaatccca acagcctcaa 2820
agccatggcg cccctctcct ctggcatcaa cctgccgctg ctggaccgca cgatccccga 2880
ctacaccagc tttaacacgg tggacgagtg gctggaggcc atcaagatgg ggcagtacaa 2940
ggagagcttc gccaatgccg gcttcacctc ctttgacgtc gtgtctcaga tgatgatgga 3000
ggacattctc cgggttgggg tcactttggc tggccaccag aaaaaaatcc tgaacagtat 3060
ccaggtgatg cgggcgcaga tgaaccagat tcagtctgtg gaggtttgac attcacctgc 3120
ctcggctcac ctcttcctcc aagccccgcc ccctctgccc cacgtgccgg ccctcctggt 3180
gctctatcca ctgcagggcc agccactcgc caggaggcca cgggccacgg gaagaaccaa 3240
gcggtgccag ccacgagacg tcaccaagaa aacatgcaac tcaaacgacg gaaaaaaaaa 3300
gggaatggga aaaaagaaaa cagatcctgg gagggggcgg gaaatacaag gaatattttt 3360
taaagaggat tctcataagg aaagcaatga ctgttcttgc gggggataaa aaagggcttg 3420
ggagattcat gcgatgtgtc caatcggaga caaaagcagt ttctctccaa ctccctctgg 3480
gaaggtgacc tggccagagc caagaaacac tttcagaaaa acaaatgtga aggggagaga 3540
caggggccgc ccttggctcc tgtccctgct gctcctctag gcctcactca acaaccaagc 3600
gcctggagga cgggacagat ggacagacag ccaccctgag aacccctctg ggaaaatcta 3660
ttcctgccac cactgggcaa acagaagaat ttttctgtct ttggagagta ttttagaaac 3720
tccaatgaaa gacactgttt ctcctgttgg ctcacagggc tgaaaggggc ttttgtcctc 3780
ctgggtcagg gagaacgcgg ggaccccaga aaggtcagcc ttcctgagga tgggcaaccc 3840
ccaggtctgc agctccaggt acatatcacg cgcacagcct ggcagcctgg ccctcctggt 3900
gcccactccc gccagcccct gcctcgagga ctgatactgc agtgactgcc gtcagctccg 3960
actgccgctg agaagggttg atcctgcatc tgggtttgtt tacagcaatt cctggactcg 4020
ggggtatttt ggtcacaggg tggttttggt ttagggggtt tgtttgttgg gttgtttttt 4080
gttttttggt tttttttaat gacaatgaag tgacactttg acatttccta ccttttgagg 4140
acttgatcct tctccaggaa gaaggtgctt tctgcttact gacttaggca atacaccaag 4200
ggcgagattt tatatgcaca tttctggatt tttttatacg gttttcattg acactcttcc 4260
ctcctcccac ctgccaccag gcctcaccaa agcccactgc catggggcca tctgggccat 4320
tcagagactg gagtgagatt tgggtgtgga gggggaggcg ccaaggtgga ggagcttccc 4380
actccaggac tgttgatgaa agggacagat tgaggaggaa gtgggctctg aggctgcagg 4440
gctggaagtc cttgcccact tcccactctc ctgccccaat ctatctagta cttcccaggc 4500
aaataggccc ctttgaggct cctgagtgcc ctcagatggt caaaacccag ttttccctct 4560
gggagcctaa accaggctgc atcggaggcc aggacccgga tcattcactg tgataccctg 4620
ccctccagag ggtgcgctca gagacacggg caagcatgcc tcttcccttc cctggagaga 4680
aagtgtgtga tttctctccc acctccttcc ccccaccaga cctttgctgg gcctaaaggt 4740
cttggccatg gggacgccct cagtctaggg atctggccac agactccctc ctgtgaacca 4800
acacagacac ccaagcagag caatcagtta gtgaattgaa tggaaataaa cgctttagtt 4860
ataatatga 4869
<210> 88
<211> 987
<212> PRT
<213> 智人
<400> 88
Met Ala Leu Arg Arg Leu Gly Ala Ala Leu Leu Leu Leu Pro Leu Leu
1 5 10 15
Ala Ala Val Glu Glu Thr Leu Met Asp Ser Thr Thr Ala Thr Ala Glu
20 25 30
Leu Gly Trp Met Val His Pro Pro Ser Gly Trp Glu Glu Val Ser Gly
35 40 45
Tyr Asp Glu Asn Met Asn Thr Ile Arg Thr Tyr Gln Val Cys Asn Val
50 55 60
Phe Glu Ser Ser Gln Asn Asn Trp Leu Arg Thr Lys Phe Ile Arg Arg
65 70 75 80
Arg Gly Ala His Arg Ile His Val Glu Met Lys Phe Ser Val Arg Asp
85 90 95
Cys Ser Ser Ile Pro Ser Val Pro Gly Ser Cys Lys Glu Thr Phe Asn
100 105 110
Leu Tyr Tyr Tyr Glu Ala Asp Phe Asp Ser Ala Thr Lys Thr Phe Pro
115 120 125
Asn Trp Met Glu Asn Pro Trp Val Lys Val Asp Thr Ile Ala Ala Asp
130 135 140
Glu Ser Phe Ser Gln Val Asp Leu Gly Gly Arg Val Met Lys Ile Asn
145 150 155 160
Thr Glu Val Arg Ser Phe Gly Pro Val Ser Arg Ser Gly Phe Tyr Leu
165 170 175
Ala Phe Gln Asp Tyr Gly Gly Cys Met Ser Leu Ile Ala Val Arg Val
180 185 190
Phe Tyr Arg Lys Cys Pro Arg Ile Ile Gln Asn Gly Ala Ile Phe Gln
195 200 205
Glu Thr Leu Ser Gly Ala Glu Ser Thr Ser Leu Val Ala Ala Arg Gly
210 215 220
Ser Cys Ile Ala Asn Ala Glu Glu Val Asp Val Pro Ile Lys Leu Tyr
225 230 235 240
Cys Asn Gly Asp Gly Glu Trp Leu Val Pro Ile Gly Arg Cys Met Cys
245 250 255
Lys Ala Gly Phe Glu Ala Val Glu Asn Gly Thr Val Cys Arg Gly Cys
260 265 270
Pro Ser Gly Thr Phe Lys Ala Asn Gln Gly Asp Glu Ala Cys Thr His
275 280 285
Cys Pro Ile Asn Ser Arg Thr Thr Ser Glu Gly Ala Thr Asn Cys Val
290 295 300
Cys Arg Asn Gly Tyr Tyr Arg Ala Asp Leu Asp Pro Leu Asp Met Pro
305 310 315 320
Cys Thr Thr Ile Pro Ser Ala Pro Gln Ala Val Ile Ser Ser Val Asn
325 330 335
Glu Thr Ser Leu Met Leu Glu Trp Thr Pro Pro Arg Asp Ser Gly Gly
340 345 350
Arg Glu Asp Leu Val Tyr Asn Ile Ile Cys Lys Ser Cys Gly Ser Gly
355 360 365
Arg Gly Ala Cys Thr Arg Cys Gly Asp Asn Val Gln Tyr Ala Pro Arg
370 375 380
Gln Leu Gly Leu Thr Glu Pro Arg Ile Tyr Ile Ser Asp Leu Leu Ala
385 390 395 400
His Thr Gln Tyr Thr Phe Glu Ile Gln Ala Val Asn Gly Val Thr Asp
405 410 415
Gln Ser Pro Phe Ser Pro Gln Phe Ala Ser Val Asn Ile Thr Thr Asn
420 425 430
Gln Ala Ala Pro Ser Ala Val Ser Ile Met His Gln Val Ser Arg Thr
435 440 445
Val Asp Ser Ile Thr Leu Ser Trp Ser Gln Pro Asp Gln Pro Asn Gly
450 455 460
Val Ile Leu Asp Tyr Glu Leu Gln Tyr Tyr Glu Lys Glu Leu Ser Glu
465 470 475 480
Tyr Asn Ala Thr Ala Ile Lys Ser Pro Thr Asn Thr Val Thr Val Gln
485 490 495
Gly Leu Lys Ala Gly Ala Ile Tyr Val Phe Gln Val Arg Ala Arg Thr
500 505 510
Val Ala Gly Tyr Gly Arg Tyr Ser Gly Lys Met Tyr Phe Gln Thr Met
515 520 525
Thr Glu Ala Glu Tyr Gln Thr Ser Ile Gln Glu Lys Leu Pro Leu Ile
530 535 540
Ile Gly Ser Ser Ala Ala Gly Leu Val Phe Leu Ile Ala Val Val Val
545 550 555 560
Ile Ala Ile Val Cys Asn Arg Arg Arg Gly Phe Glu Arg Ala Asp Ser
565 570 575
Glu Tyr Thr Asp Lys Leu Gln His Tyr Thr Ser Gly His Met Thr Pro
580 585 590
Gly Met Lys Ile Tyr Ile Asp Pro Phe Thr Tyr Glu Asp Pro Asn Glu
595 600 605
Ala Val Arg Glu Phe Ala Lys Glu Ile Asp Ile Ser Cys Val Lys Ile
610 615 620
Glu Gln Val Ile Gly Ala Gly Glu Phe Gly Glu Val Cys Ser Gly His
625 630 635 640
Leu Lys Leu Pro Gly Lys Arg Glu Ile Phe Val Ala Ile Lys Thr Leu
645 650 655
Lys Ser Gly Tyr Thr Glu Lys Gln Arg Arg Asp Phe Leu Ser Glu Ala
660 665 670
Ser Ile Met Gly Gln Phe Asp His Pro Asn Val Ile His Leu Glu Gly
675 680 685
Val Val Thr Lys Ser Thr Pro Val Met Ile Ile Thr Glu Phe Met Glu
690 695 700
Asn Gly Ser Leu Asp Ser Phe Leu Arg Gln Asn Asp Gly Gln Phe Thr
705 710 715 720
Val Ile Gln Leu Val Gly Met Leu Arg Gly Ile Ala Ala Gly Met Lys
725 730 735
Tyr Leu Ala Asp Met Asn Tyr Val His Arg Asp Leu Ala Ala Arg Asn
740 745 750
Ile Leu Val Asn Ser Asn Leu Val Cys Lys Val Ser Asp Phe Gly Leu
755 760 765
Ser Arg Phe Leu Glu Asp Asp Thr Ser Asp Pro Thr Tyr Thr Ser Ala
770 775 780
Leu Gly Gly Lys Ile Pro Ile Arg Trp Thr Ala Pro Glu Ala Ile Gln
785 790 795 800
Tyr Arg Lys Phe Thr Ser Ala Ser Asp Val Trp Ser Tyr Gly Ile Val
805 810 815
Met Trp Glu Val Met Ser Tyr Gly Glu Arg Pro Tyr Trp Asp Met Thr
820 825 830
Asn Gln Asp Val Ile Asn Ala Ile Glu Gln Asp Tyr Arg Leu Pro Pro
835 840 845
Pro Met Asp Cys Pro Ser Ala Leu His Gln Leu Met Leu Asp Cys Trp
850 855 860
Gln Lys Asp Arg Asn His Arg Pro Lys Phe Gly Gln Ile Val Asn Thr
865 870 875 880
Leu Asp Lys Met Ile Arg Asn Pro Asn Ser Leu Lys Ala Met Ala Pro
885 890 895
Leu Ser Ser Gly Ile Asn Leu Pro Leu Leu Asp Arg Thr Ile Pro Asp
900 905 910
Tyr Thr Ser Phe Asn Thr Val Asp Glu Trp Leu Glu Ala Ile Lys Met
915 920 925
Gly Gln Tyr Lys Glu Ser Phe Ala Asn Ala Gly Phe Thr Ser Phe Asp
930 935 940
Val Val Ser Gln Met Met Met Glu Asp Ile Leu Arg Val Gly Val Thr
945 950 955 960
Leu Ala Gly His Gln Lys Lys Ile Leu Asn Ser Ile Gln Val Met Arg
965 970 975
Ala Gln Met Asn Gln Ile Gln Ser Val Glu Val
980 985
<210> 89
<211> 1658
<212> DNA
<213> 智人
<400> 89
ggaaggcagc ggcagctcca ctcagccagt acccagatac gctgggaacc ttccccagcc 60
atggcttccc tggggcagat cctcttctgg agcataatta gcatcatcat tattctggct 120
ggagcaattg cactcatcat tggctttggt atttcaggga gacactccat cacagtcact 180
actgtcgcct cagctgggaa cattggggag gatggaatcc tgagctgcac ttttgaacct 240
gacatcaaac tttctgatat cgtgatacaa tggctgaagg aaggtgtttt aggcttggtc 300
catgagttca aagaaggcaa agatgagctg tcggagcagg atgaaatgtt cagaggccgg 360
acagcagtgt ttgctgatca agtgatagtt ggcaatgcct ctttgcggct gaaaaacgtg 420
caactcacag atgctggcac ctacaaatgt tatatcatca cttctaaagg caaggggaat 480
gctaaccttg agtataaaac tggagccttc agcatgccgg aagtgaatgt ggactataat 540
gccagctcag agaccttgcg gtgtgaggct ccccgatggt tcccccagcc cacagtggtc 600
tgggcatccc aagttgacca gggagccaac ttctcggaag tctccaatac cagctttgag 660
ctgaactctg agaatgtgac catgaaggtt gtgtctgtgc tctacaatgt tacgatcaac 720
aacacatact cctgtatgat tgaaaatgac attgccaaag caacagggga tatcaaagtg 780
acagaatcgg agatcaaaag gcggagtcac ctacagctgc taaactcaaa ggcttctctg 840
tgtgtctctt ctttctttgc catcagctgg gcacttctgc ctctcagccc ttacctgatg 900
ctaaaataat gtgccttggc cacaaaaaag catgcaaagt cattgttaca acagggatct 960
acagaactat ttcaccacca gatatgacct agttttatat ttctgggagg aaatgaattc 1020
atatctagaa gtctggagtg agcaaacaag agcaagaaac aaaaagaagc caaaagcaga 1080
aggctccaat atgaacaaga taaatctatc ttcaaagaca tattagaagt tgggaaaata 1140
attcatgtga actagacaag tgtgttaaga gtgataagta aaatgcacgt ggagacaagt 1200
gcatccccag atctcaggga cctccccctg cctgtcacct ggggagtgag aggacaggat 1260
agtgcatgtt ctttgtctct gaatttttag ttatatgtgc tgtaatgttg ctctgaggaa 1320
gcccctggaa agtctatccc aacatatcca catcttatat tccacaaatt aagctgtagt 1380
atgtacccta agacgctgct aattgactgc cacttcgcaa ctcaggggcg gctgcatttt 1440
agtaatgggt caaatgattc actttttatg atgcttccaa aggtgccttg gcttctcttc 1500
ccaactgaca aatgccaaag ttgagaaaaa tgatcataat tttagcataa acagagcagt 1560
cggggacacc gattttataa ataaactgag caccttcttt ttaaacaaaa aaaaaaaaaa 1620
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa 1658
<210> 90
<211> 946
<212> DNA
<213> 智人
<400> 90
tgacagccca ccagtgacca tgaaggctgt gctgcttgcc ctgttgatgg caggcttggc 60
cctgcagcca ggcactgccc tgctgtgcta ctcctgcaaa gcccaggtga gcaacgagga 120
ctgcctgcag gtggagaact gcacccagct gggggagcag tgctggaccg cgcgcatccg 180
cgcagttggc ctcctgaccg tcatcagcaa aggctgcagc ttgaactgcg tggatgactc 240
acaggactac tacgtgggca agaagaacat cacgtgctgt gacaccgact tgtgcaacgc 300
cagcggggcc catgccctgc agccggctgc cgccatcctt gcgctgctcc ctgcactcgg 360
cctgctgctc tggggacccg gccagctata ggctctgggg ggccccgctg cagcccacac 420
tgggtgtggt gccccaggcc tctgtgccac tcctcacaga cctggcccag tgggagcctg 480
tcctggttcc tgaggcacat cctaacgcaa gtctgaccat gtatgtttgc acccctgtcc 540
cccaccctga ccctcccatg gccctctcca ggactcccac ccggcagatc agctctagtg 600
acacagatcc gcctgcagat ggcccctcca accctctctg ctgctgtttc catggcccag 660
cattctccac ccttaaccct gtgctcaggc acctcttccc ccaggaagcc ttccctgccc 720
accccatcta tgacttgagc caggtctggt ccgtggtgtc ccccgcaccc agcaggggac 780
aggcactcag gagggcccag taaaggctga gatgaagtgg actgagtaga actggaggac 840
aagagtcgac gtgagttcct gggagtctcc agagatgggg cctggaggcc tggaggaagg 900
ggccaggcct cacattcgtg gggctccctg aatggcagcc tgagca 946
<210> 91
<211> 123
<212> PRT
<213> 智人
<400> 91
Met Lys Ala Val Leu Leu Ala Leu Leu Met Ala Gly Leu Ala Leu Gln
1 5 10 15
Pro Gly Thr Ala Leu Leu Cys Tyr Ser Cys Lys Ala Gln Val Ser Asn
20 25 30
Glu Asp Cys Leu Gln Val Glu Asn Cys Thr Gln Leu Gly Glu Gln Cys
35 40 45
Trp Thr Ala Arg Ile Arg Ala Val Gly Leu Leu Thr Val Ile Ser Lys
50 55 60
Gly Cys Ser Leu Asn Cys Val Asp Asp Ser Gln Asp Tyr Tyr Val Gly
65 70 75 80
Lys Lys Asn Ile Thr Cys Cys Asp Thr Asp Leu Cys Asn Ala Ser Gly
85 90 95
Ala His Ala Leu Gln Pro Ala Ala Ala Ile Leu Ala Leu Leu Pro Ala
100 105 110
Leu Gly Leu Leu Leu Trp Gly Pro Gly Gln Leu
115 120
<210> 92
<211> 1852
<212> DNA
<213> 智人
<400> 92
ctgggctgag gcggaggcag gggagttgca gcgcgcgagg ctccgtgagt gtgtctcctg 60
cgcgctgaga ggcgggggga ggcggaggac caggaggagg aggaggagga ggaggagggg 120
gagaatgccc ggagccgccg ccgctgccgc cgccgccgcc gccgcgatgc tcccggctca 180
ggaggctgcc aagctgtacc acaccaacta tgtgcggaac tcgcgggcca tcggcgtgct 240
gtgggccatc ttcaccatct gctttgccat cgtcaacgtg gtgtgcttca tccagcccta 300
ctggataggc gacggcgtgg acaccccgca agccggctat ttcgggctct tccactactg 360
catcggcaac ggcttctccc gggagctgac ctgcaggggc agcttcacgg acttctccac 420
gctgccctcg ggcgccttca aagccgcctc cttctttatc ggcctctcca tgatgctcat 480
cattgcctgc atcatttgct ttaccctctt cttcttctgc aacacggcca ctgtgtacaa 540
gatatgtgcc tggatgcagc tcacctccgc tgcctgcctt gtgcttggct gtatgatttt 600
ccctgatggc tgggactcag atgaagtaaa acggatgtgt ggagaaaaga cagacaagta 660
cactcttggg gcttgctcag tccgctgggc atacatcctg gctattattg gaattttgga 720
tgccctgatc ctctcatttc tagcatttgt gcttggtaat cgacaagaca gcttgatggc 780
agaggaactg aaggcagaaa acaaagttct gctaagccaa tattctctag aatgagcaca 840
aaacaaatcg aataacagct aaacaaatcg aataacagct aaacgaatcg aataacagct 900
tttgtacatc aacatcaaga aggaatacgc ctgagagaga tcagagtata tagatgaata 960
tgaacaagaa tggaacattc acttgtcaac gcactttcta aatctagatc agcagagatg 1020
ggagtgattt tctggaaaga gatgtgatca tggattaaac accagctcat tggaaactca 1080
ttggatgaga tcagaaaacg ttcatgaaaa atcatattca ggaaataagg aagaggaata 1140
taaatgctct agagttaaca tgtaaaatat atacgtactg aggtttgtaa actgtccttt 1200
ttaaatcaaa ctgaaaacaa aaagctttaa cctttcaaca gaatttttaa aaaggcagtt 1260
agttctaaat tattcctatc tcaatagcca agaggctgat caagcgtcat ttattgagga 1320
agcatcttag aaaatgcctc tgaatgtttt cataggagcc gtgacctttg gttcttcatc 1380
tctaccattc atttacttca ctgtgtaatt agttacaacc actcagttat taagagacgt 1440
aacgcttcaa actttttacc aagtctgtgt tctgtttaat ctgtccatac aagttattac 1500
tgagaaagtg tttatgccat atactattac tccatcaagc tgtatattac aggaagtaca 1560
tctttacatc ataggttccc aagcaacata gatttcccta tctttcagga aacagcatca 1620
aggaactctg aaaaatatag aaaaagttca ttttcacctt ggaagctcac gtgtaatatt 1680
ataggctact atcaaataaa cacttttttt ctaattctcc ctagtatatg cataggaatt 1740
taatatactt tataaataag tatctaaaat gtctcctact tttttcctat ttctttgcca 1800
tacatgttat cagaaatcca tgtcttctat ttcccttact gatgggcact ca 1852
<210> 93
<211> 236
<212> PRT
<213> 智人
<400> 93
Met Pro Gly Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Met Leu
1 5 10 15
Pro Ala Gln Glu Ala Ala Lys Leu Tyr His Thr Asn Tyr Val Arg Asn
20 25 30
Ser Arg Ala Ile Gly Val Leu Trp Ala Ile Phe Thr Ile Cys Phe Ala
35 40 45
Ile Val Asn Val Val Cys Phe Ile Gln Pro Tyr Trp Ile Gly Asp Gly
50 55 60
Val Asp Thr Pro Gln Ala Gly Tyr Phe Gly Leu Phe His Tyr Cys Ile
65 70 75 80
Gly Asn Gly Phe Ser Arg Glu Leu Thr Cys Arg Gly Ser Phe Thr Asp
85 90 95
Phe Ser Thr Leu Pro Ser Gly Ala Phe Lys Ala Ala Ser Phe Phe Ile
100 105 110
Gly Leu Ser Met Met Leu Ile Ile Ala Cys Ile Ile Cys Phe Thr Leu
115 120 125
Phe Phe Phe Cys Asn Thr Ala Thr Val Tyr Lys Ile Cys Ala Trp Met
130 135 140
Gln Leu Thr Ser Ala Ala Cys Leu Val Leu Gly Cys Met Ile Phe Pro
145 150 155 160
Asp Gly Trp Asp Ser Asp Glu Val Lys Arg Met Cys Gly Glu Lys Thr
165 170 175
Asp Lys Tyr Thr Leu Gly Ala Cys Ser Val Arg Trp Ala Tyr Ile Leu
180 185 190
Ala Ile Ile Gly Ile Leu Asp Ala Leu Ile Leu Ser Phe Leu Ala Phe
195 200 205
Val Leu Gly Asn Arg Gln Asp Ser Leu Met Ala Glu Glu Leu Lys Ala
210 215 220
Glu Asn Lys Val Leu Leu Ser Gln Tyr Ser Leu Glu
225 230 235
<210> 94
<211> 1052
<212> DNA
<213> 智人
<400> 94
tgccaagccc tgccatgtag tgcacgcagg acatcaacaa acacagataa caggaaatga 60
tccattccct gtggtcactt attctaaagg ccccaacctt caaagttcaa gtagtgatat 120
ggatgactcc acagaaaggg agcagtcacg ccttacttct tgccttaaga aaagagaaga 180
aatgaaactg aaggagtgtg tttccatcct cccacggaag gaaagcccct ctgtccgatc 240
ctccaaagac ggaaagctgc tggctgcaac cttgctgctg gcactgctgt cttgctgcct 300
cacggtggtg tctttctacc aggtggccgc cctgcaaggg gacctggcca gcctccgggc 360
agagctgcag ggccaccacg cggagaagct gccagcagga gcaggagccc ccaaggccgg 420
cctggaggaa gctccagctg tcaccgcggg actgaaaatc tttgaaccac cagctccagg 480
agaaggcaac tccagtcaga acagcagaaa taagcgtgcc gttcagggtc cagaagaaac 540
agtcactcaa gactgcttgc aactgattgc agacagtgaa acaccaacta tacaaaaagg 600
atcttacaca tttgttccat ggcttctcag ctttaaaagg ggaagtgccc tagaagaaaa 660
agagaataaa atattggtca aagaaactgg ttactttttt atatatggtc aggttttata 720
tactgataag acctacgcca tgggacatct aattcagagg aagaaggtcc atgtctttgg 780
ggatgaattg agtctggtga ctttgtttcg atgtattcaa aatatgcctg aaacactacc 840
caataattcc tgctattcag ctggcattgc aaaactggaa gaaggagatg aactccaact 900
tgcaatacca agagaaaatg cacaaatatc actggatgga gatgtcacat tttttggtgc 960
attgaaactg ctgtgaccta cttacaccat gtctgtagct attttcctcc ctttctctgt 1020
acctctaaga agaaagaatc taactgaaaa ta 1052
<210> 95
<211> 285
<212> PRT
<213> 智人
<400> 95
Met Asp Asp Ser Thr Glu Arg Glu Gln Ser Arg Leu Thr Ser Cys Leu
1 5 10 15
Lys Lys Arg Glu Glu Met Lys Leu Lys Glu Cys Val Ser Ile Leu Pro
20 25 30
Arg Lys Glu Ser Pro Ser Val Arg Ser Ser Lys Asp Gly Lys Leu Leu
35 40 45
Ala Ala Thr Leu Leu Leu Ala Leu Leu Ser Cys Cys Leu Thr Val Val
50 55 60
Ser Phe Tyr Gln Val Ala Ala Leu Gln Gly Asp Leu Ala Ser Leu Arg
65 70 75 80
Ala Glu Leu Gln Gly His His Ala Glu Lys Leu Pro Ala Gly Ala Gly
85 90 95
Ala Pro Lys Ala Gly Leu Glu Glu Ala Pro Ala Val Thr Ala Gly Leu
100 105 110
Lys Ile Phe Glu Pro Pro Ala Pro Gly Glu Gly Asn Ser Ser Gln Asn
115 120 125
Ser Arg Asn Lys Arg Ala Val Gln Gly Pro Glu Glu Thr Val Thr Gln
130 135 140
Asp Cys Leu Gln Leu Ile Ala Asp Ser Glu Thr Pro Thr Ile Gln Lys
145 150 155 160
Gly Ser Tyr Thr Phe Val Pro Trp Leu Leu Ser Phe Lys Arg Gly Ser
165 170 175
Ala Leu Glu Glu Lys Glu Asn Lys Ile Leu Val Lys Glu Thr Gly Tyr
180 185 190
Phe Phe Ile Tyr Gly Gln Val Leu Tyr Thr Asp Lys Thr Tyr Ala Met
195 200 205
Gly His Leu Ile Gln Arg Lys Lys Val His Val Phe Gly Asp Glu Leu
210 215 220
Ser Leu Val Thr Leu Phe Arg Cys Ile Gln Asn Met Pro Glu Thr Leu
225 230 235 240
Pro Asn Asn Ser Cys Tyr Ser Ala Gly Ile Ala Lys Leu Glu Glu Gly
245 250 255
Asp Glu Leu Gln Leu Ala Ile Pro Arg Glu Asn Ala Gln Ile Ser Leu
260 265 270
Asp Gly Asp Val Thr Phe Phe Gly Ala Leu Lys Leu Leu
275 280 285
<210> 96
<211> 2544
<212> DNA
<213> 智人
<400> 96
agatgctgcc agggtccctg aagagggaag acacgcggaa acaggtaaaa atcattttgc 60
ttttattttg cattcaacaa gcaagttatt acggaacagc agttatgggc caggcatacc 120
tcccagagct gggaacacag tggggacctc cctggctctc tcttaccggt gttacaacag 180
gttgtagaca gacccctgtc ttgagcatcc tccttgccag gcctgctgag tcttctgaga 240
gtagggtagg ttattggatg cccaggaggg aagaaggagc cagggaggtc agccccaagg 300
ttctgcaagg ccctcaacag gcctggactg aggaggtctg gacagcatgg ccctgtcctg 360
agcctctgtg cataataact gctgtcccta acctccaccc caccctcagc cttccaattc 420
ccgggcctgg ggccctactc ctgtgctcca gagactcctg gagctccttg aggcagcaca 480
cagtcctgct ctggaggcgc ccatctccca ctcatgctgg gatgctccag cccgtcccag 540
agcaggttgt ggctggaggg tgctggcaga ggagggacaa tggcccggct cctggaggca 600
agtgttggct gcagggaacg gagtctagtc cttgccacag cccttgttac cccttaggta 660
accttaaggg gatttcaaag aactctggct ctgcaaccct gctaagtttt ttatggaaca 720
tgtaaaatag atcccatggc caaagaagta tggacaatgt attatactat actctaatcc 780
ccatgtctag agattaatgg tgtagataga gtttactgaa aggtttttaa agtcctgcaa 840
taaagaatct tacttaagcc aggtgcggtg gctcacgcct gtcatctcag cactttggga 900
ggccaaggcg ggaggatcac ttgaggtcag gagttcgaga ccagcctggc cagcatggcg 960
aaaccctgtc tctactagaa atacagaaaa attagctggg tgtggtggtg ggcacctgta 1020
atcccagcta ctcgggaggc tgaggcagga ggatcacttg aactggggag gtagaggtta 1080
cagcgagcca agatcgcgcc actgcactcc agcctgggtg acagagggaa actccatctc 1140
aaaaaaacaa caacacaaca acaacaacaa taacaacaaa aaaacaaagc aggactggag 1200
agaggtggaa tgaagtggca aggggttcct gaggggtgat ttgggacagg acatctaaag 1260
ccaggtgtac gctcacgtcc tcagtccccc aggctcctgc acgggctctg ttcttttgca 1320
gaaaggcctt ttccacctca tatccagctc cctccagaaa ttcaagagtc ccaggaagtc 1380
actctgacct gcttgctgaa tttctcctgc tatgggtatc cgatccaatt gcagtggctc 1440
ctagaggggg ttccaatgag gcaggctgct gtcacctcga cctccttgac catcaagtct 1500
gtcttcaccc ggagcgagct caagttctcc ccacagtgga gtcaccatgg gaagattgtg 1560
acctgccagc ttcaggatgc agatgggaag ttcctctcca atgacacggt gcagctgaac 1620
gtgaagcgtg agtctccccg gcatgcctgt gggaagggca aggtctgtgt caccttctcc 1680
ccagccccgc agggggcatg cacccagggc agggggaagc ctgcacagac ggcggcatcc 1740
tccagccctg gtcacgccgc cttgtcagcc ctggtgtttc gggaaaaaga tttgctctag 1800
cctaacagaa taaaatggtc caccctcaag ccatgacatg aattggggat tatctggtta 1860
ggtctttttg ttcccctctt ggtggggatt tttttcgcat cattatcttg tgcctcattc 1920
attcaataaa tacgtatcat gaacctacta ggtaccaggc cctattacgg ctgccaatgg 1980
ggggcatggg gcggtgggca gggtgcagca gtgagcaaaa ctcttgcccc acgcggagcc 2040
agcgctgcag tgaaagagac agacaacaaa tggattacca aagaaataca gagcatgagc 2100
caagacatta gaatctggaa caaagcaatg ttaacaaaga aatatacaac actattgtag 2160
gtagtgatat gtgtgttagg aaaaaaataa ggccgagaga ggggagtgat ggagagagac 2220
ctctctaaga aggtgagcac ttaggccggg tgcggtggct cacgcctgta atcctagcac 2280
tttggaaggc cgaggcgggg ggatcacaag gtcaggagat cgagaccatc ctggctaaca 2340
tggtgaaacc ccatctctag taaaaataca aaaaattagc caggcatgat ggcaggcgcc 2400
tgtagtccca gctacttggg aggccaaggc aggagaatga catgaaccca ggaggcggag 2460
cttgcagtga gctgagatcg caccactgca ctccaacctg ggtgacgagt gagactccat 2520
ctcaaaaaaa aaaaaaaaaa aaaa 2544
<210> 97
<211> 240
<212> PRT
<213> 智人
<400> 97
Met Leu Pro Gly Ser Leu Lys Arg Glu Asp Thr Arg Lys Gln Val Lys
1 5 10 15
Ile Ile Leu Leu Leu Phe Cys Ile Gln Gln Ala Ser Tyr Tyr Gly Thr
20 25 30
Ala Val Met Gly Gln Ala Tyr Leu Pro Glu Leu Gly Thr Gln Trp Gly
35 40 45
Pro Pro Trp Leu Ser Leu Thr Gly Val Thr Thr Gly Cys Arg Gln Thr
50 55 60
Pro Val Leu Ser Ile Leu Leu Ala Arg Pro Ala Glu Ser Ser Glu Ser
65 70 75 80
Arg Val Gly Tyr Trp Met Pro Arg Arg Glu Glu Gly Ala Arg Glu Val
85 90 95
Ser Pro Lys Val Leu Gln Gly Pro Gln Gln Ala Trp Thr Glu Glu Val
100 105 110
Trp Thr Ala Trp Pro Cys Pro Glu Pro Leu Cys Ile Ile Thr Ala Val
115 120 125
Pro Asn Leu His Pro Thr Leu Ser Leu Pro Ile Pro Gly Pro Gly Ala
130 135 140
Leu Leu Leu Cys Ser Arg Asp Ser Trp Ser Ser Leu Arg Gln His Thr
145 150 155 160
Val Leu Leu Trp Arg Arg Pro Ser Pro Thr His Ala Gly Met Leu Gln
165 170 175
Pro Val Pro Glu Gln Val Val Ala Gly Gly Cys Trp Gln Arg Arg Asp
180 185 190
Asn Gly Pro Ala Pro Gly Gly Lys Cys Trp Leu Gln Gly Thr Glu Ser
195 200 205
Ser Pro Cys His Ser Pro Cys Tyr Pro Leu Gly Asn Leu Lys Gly Ile
210 215 220
Ser Lys Asn Ser Gly Ser Ala Thr Leu Leu Ser Phe Leu Trp Asn Met
225 230 235 240
<210> 98
<211> 2116
<212> DNA
<213> 智人
<400> 98
acgcggaaac aggcttgcac ccagacacga caccatgcat ctcctcggcc cctggctcct 60
gctcctggtt ctagaatact tggctttctc tgactcaagt aaatgggttt ttgagcaccc 120
tgaaaccctc tacgcctggg agggggcctg cgtctggatc ccctgcacct acagagccct 180
agatggtgac ctggaaagct tcatcctgtt ccacaatcct gagtataaca agaacacctc 240
gaagtttgat gggacaagac tctatgaaag cacaaaggat gggaaggttc cttctgagca 300
gaaaagggtg caattcctgg gagacaagaa taagaactgc acactgagta tccacccggt 360
gcacctcaat gacagtggtc agctggggct gaggatggag tccaagactg agaaatggat 420
ggaacgaata cacctcaatg tctctgaaag gccttttcca cctcatatcc agctccctcc 480
agaaattcaa gagtcccagg aagtcactct gacctgcttg ctgaatttct cctgctatgg 540
gtatccgatc caattgcagt ggctcctaga gggggttcca atgaggcagg ctgctgtcac 600
ctcgacctcc ttgaccatca agtctgtctt cacccggagc gagctcaagt tctccccaca 660
gtggagtcac catgggaaga ttgtgacctg ccagcttcag gatgcagatg ggaagttcct 720
ctccaatgac acggtgcagc tgaacgtgaa gcatcctccc aagaaggtga ccacagtgat 780
tcaaaacccc atgccgattc gagaaggaga cacagtgacc ctttcctgta actacaattc 840
cagtaacccc agtgttaccc ggtatgaatg gaaaccccat ggcgcctggg aggagccatc 900
gcttggggtg ctgaagatcc aaaacgttgg ctgggacaac acaaccatcg cctgcgcagc 960
ttgtaatagt tggtgctcgt gggcctcccc tgtcgccctg aatgtccagt atgccccccg 1020
agacgtgagg gtccggaaaa tcaagcccct ttccgagatt cactctggaa actcggtcag 1080
cctccaatgt gacttctcaa gcagccaccc caaagaagtc cagttcttct gggagaaaaa 1140
tggcaggctt ctggggaaag aaagccagct gaattttgac tccatctccc cagaagatgc 1200
tgggagttac agctgctggg tgaacaactc cataggacag acagcgtcca aggcctggac 1260
acttgaagtg ctgtatgcac ccaggaggct gcgtgtgtcc atgagcccgg gggaccaagt 1320
gatggagggg aagagtgcaa ccctgacctg tgagagcgac gccaaccctc ccgtctccca 1380
ctacacctgg tttgactgga ataaccaaag cctcccctac cacagccaga agctgagatt 1440
ggagccggtg aaggtccagc actcgggtgc ctactggtgc caggggacca acagtgtggg 1500
caagggccgt tcgcctctca gcaccctcac cgtctactat agcccggaga ccatcggcag 1560
gcgagtggct gtgggactcg ggtcctgcct cgccatcctc atcctggcaa tctgtgggct 1620
caagctccag cgacgttgga agaggacaca gagccagcag gggcttcagg agaattccag 1680
cggccagagc ttctttgtga ggaataaaaa ggttagaagg gcccccctct ctgaaggccc 1740
ccactccctg ggatgctaca atccaatgat ggaagatggc attagctaca ccaccctgcg 1800
ctttcccgag atgaacatac cacgaactgg agatgcagag tcctcagaga tgcagagacc 1860
tcccccggac tgcgatgaca cggtcactta ttcagcattg cacaagcgcc aagtgggcac 1920
tatgagaacg tcattccaga ttttccagaa gatgagggga ttcattactc agagctgatc 1980
cagtttgggg tcggggagcg gcctcaggca caagaaaatg tggactatgt gatcctcaaa 2040
cattgacact ggatgggctg cagcagaggc actgggggca gcgggggcca gggaagtccc 2100
cgagttttcc ccagac 2116
<210> 99
<211> 647
<212> PRT
<213> 智人
<400> 99
Met His Leu Leu Gly Pro Trp Leu Leu Leu Leu Val Leu Glu Tyr Leu
1 5 10 15
Ala Phe Ser Asp Ser Ser Lys Trp Val Phe Glu His Pro Glu Thr Leu
20 25 30
Tyr Ala Trp Glu Gly Ala Cys Val Trp Ile Pro Cys Thr Tyr Arg Ala
35 40 45
Leu Asp Gly Asp Leu Glu Ser Phe Ile Leu Phe His Asn Pro Glu Tyr
50 55 60
Asn Lys Asn Thr Ser Lys Phe Asp Gly Thr Arg Leu Tyr Glu Ser Thr
65 70 75 80
Lys Asp Gly Lys Val Pro Ser Glu Gln Lys Arg Val Gln Phe Leu Gly
85 90 95
Asp Lys Asn Lys Asn Cys Thr Leu Ser Ile His Pro Val His Leu Asn
100 105 110
Asp Ser Gly Gln Leu Gly Leu Arg Met Glu Ser Lys Thr Glu Lys Trp
115 120 125
Met Glu Arg Ile His Leu Asn Val Ser Glu Arg Pro Phe Pro Pro His
130 135 140
Ile Gln Leu Pro Pro Glu Ile Gln Glu Ser Gln Glu Val Thr Leu Thr
145 150 155 160
Cys Leu Leu Asn Phe Ser Cys Tyr Gly Tyr Pro Ile Gln Leu Gln Trp
165 170 175
Leu Leu Glu Gly Val Pro Met Arg Gln Ala Ala Val Thr Ser Thr Ser
180 185 190
Leu Thr Ile Lys Ser Val Phe Thr Arg Ser Glu Leu Lys Phe Ser Pro
195 200 205
Gln Trp Ser His His Gly Lys Ile Val Thr Cys Gln Leu Gln Asp Ala
210 215 220
Asp Gly Lys Phe Leu Ser Asn Asp Thr Val Gln Leu Asn Val Lys His
225 230 235 240
Pro Pro Lys Lys Val Thr Thr Val Ile Gln Asn Pro Met Pro Ile Arg
245 250 255
Glu Gly Asp Thr Val Thr Leu Ser Cys Asn Tyr Asn Ser Ser Asn Pro
260 265 270
Ser Val Thr Arg Tyr Glu Trp Lys Pro His Gly Ala Trp Glu Glu Pro
275 280 285
Ser Leu Gly Val Leu Lys Ile Gln Asn Val Gly Trp Asp Asn Thr Thr
290 295 300
Ile Ala Cys Ala Ala Cys Asn Ser Trp Cys Ser Trp Ala Ser Pro Val
305 310 315 320
Ala Leu Asn Val Gln Tyr Ala Pro Arg Asp Val Arg Val Arg Lys Ile
325 330 335
Lys Pro Leu Ser Glu Ile His Ser Gly Asn Ser Val Ser Leu Gln Cys
340 345 350
Asp Phe Ser Ser Ser His Pro Lys Glu Val Gln Phe Phe Trp Glu Lys
355 360 365
Asn Gly Arg Leu Leu Gly Lys Glu Ser Gln Leu Asn Phe Asp Ser Ile
370 375 380
Ser Pro Glu Asp Ala Gly Ser Tyr Ser Cys Trp Val Asn Asn Ser Ile
385 390 395 400
Gly Gln Thr Ala Ser Lys Ala Trp Thr Leu Glu Val Leu Tyr Ala Pro
405 410 415
Arg Arg Leu Arg Val Ser Met Ser Pro Gly Asp Gln Val Met Glu Gly
420 425 430
Lys Ser Ala Thr Leu Thr Cys Glu Ser Asp Ala Asn Pro Pro Val Ser
435 440 445
His Tyr Thr Trp Phe Asp Trp Asn Asn Gln Ser Leu Pro Tyr His Ser
450 455 460
Gln Lys Leu Arg Leu Glu Pro Val Lys Val Gln His Ser Gly Ala Tyr
465 470 475 480
Trp Cys Gln Gly Thr Asn Ser Val Gly Lys Gly Arg Ser Pro Leu Ser
485 490 495
Thr Leu Thr Val Tyr Tyr Ser Pro Glu Thr Ile Gly Arg Arg Val Ala
500 505 510
Val Gly Leu Gly Ser Cys Leu Ala Ile Leu Ile Leu Ala Ile Cys Gly
515 520 525
Leu Lys Leu Gln Arg Arg Trp Lys Arg Thr Gln Ser Gln Gln Gly Leu
530 535 540
Gln Glu Asn Ser Ser Gly Gln Ser Phe Phe Val Arg Asn Lys Lys Val
545 550 555 560
Arg Arg Ala Pro Leu Ser Glu Gly Pro His Ser Leu Gly Cys Tyr Asn
565 570 575
Pro Met Met Glu Asp Gly Ile Ser Tyr Thr Thr Leu Arg Phe Pro Glu
580 585 590
Met Asn Ile Pro Arg Thr Gly Asp Ala Glu Ser Ser Glu Met Gln Arg
595 600 605
Pro Pro Pro Asp Cys Asp Asp Thr Val Thr Tyr Ser Ala Leu His Lys
610 615 620
Arg Gln Val Gly Thr Met Arg Thr Ser Phe Gln Ile Phe Gln Lys Met
625 630 635 640
Arg Gly Phe Ile Thr Gln Ser
645
<210> 100
<211> 1258
<212> DNA
<213> 智人
<400> 100
tccactcaca gcctgaagca tacccggcag gggctgtccc caggcccaac aagcaaaggg 60
cccagtagcg agggccactg gagcccatct ccggggggct gggcaggaag tagggtgggg 120
tttggggtag ggatctggta ccctgggact gctgcaactc aaactaacca acccactggg 180
agaagatgcc tgggggtcca ggagtcctcc aagctctgcc tgccaccatc ttcctcctct 240
tcctgctgtc tgctgtctac ctgggccctg ggtgccaggc cctgtggatg cacaaggtcc 300
cagcatcatt gatggtgagc ctgggggaag acgcccactt ccaatgcccg cacaatagca 360
gcaacaacgc caacgtcacc tggtggcgcg tcctccatgg caactacacg tggccccctg 420
agttcttggg cccgggcgag gaccccaatg gtacgctgat catccagaat gtgaacaaga 480
gccatggggg catatacgtg tgccgggtcc aggagggcaa cgagtcatac cagcagtcct 540
gcggcaccta cctccgcgtg cgccagccgc cccccaggcc cttcctggac atgggggagg 600
gcaccaagaa ccgaatcatc acagccgagg ggatcatcct cctgttctgc gcggtggtgc 660
ctgggacgct gctgctgttc aggaaacgat ggcagaacga gaagctcggg ttggatgccg 720
gggatgaata tgaagatgaa aacctttatg aaggcctgaa cctggacgac tgctccatgt 780
atgaggacat ctcccggggc ctccagggca cctaccagga tgtgggcagc ctcaacatag 840
gagatgtcca gctggagaag ccgtgacacc cctactcctg ccaggctgcc cccgcctgct 900
gtgcacccag ctccagtgtc tcagctcact tccctgggac attctccttt cagcccttct 960
gggggcttcc ttagtcatat tcccccagtg gggggtggga gggtaacctc actcttctcc 1020
aggccaggcc tccttggact cccctggggg tgtcccactc ttcttccctc taaactgccc 1080
cacctcctaa cctaatcccc ccgccccgct gcctttccca ggctcccctc accccagcgg 1140
gtaatgagcc cttaatcgct gcctctaggg gagctgattg tagcagcctc gttagtgtca 1200
ccccctcctc cctgatctgt cagggccact tagtgataat aaattcttcc caactgca 1258
<210> 101
<211> 226
<212> PRT
<213> 智人
<400> 101
Met Pro Gly Gly Pro Gly Val Leu Gln Ala Leu Pro Ala Thr Ile Phe
1 5 10 15
Leu Leu Phe Leu Leu Ser Ala Val Tyr Leu Gly Pro Gly Cys Gln Ala
20 25 30
Leu Trp Met His Lys Val Pro Ala Ser Leu Met Val Ser Leu Gly Glu
35 40 45
Asp Ala His Phe Gln Cys Pro His Asn Ser Ser Asn Asn Ala Asn Val
50 55 60
Thr Trp Trp Arg Val Leu His Gly Asn Tyr Thr Trp Pro Pro Glu Phe
65 70 75 80
Leu Gly Pro Gly Glu Asp Pro Asn Gly Thr Leu Ile Ile Gln Asn Val
85 90 95
Asn Lys Ser His Gly Gly Ile Tyr Val Cys Arg Val Gln Glu Gly Asn
100 105 110
Glu Ser Tyr Gln Gln Ser Cys Gly Thr Tyr Leu Arg Val Arg Gln Pro
115 120 125
Pro Pro Arg Pro Phe Leu Asp Met Gly Glu Gly Thr Lys Asn Arg Ile
130 135 140
Ile Thr Ala Glu Gly Ile Ile Leu Leu Phe Cys Ala Val Val Pro Gly
145 150 155 160
Thr Leu Leu Leu Phe Arg Lys Arg Trp Gln Asn Glu Lys Leu Gly Leu
165 170 175
Asp Ala Gly Asp Glu Tyr Glu Asp Glu Asn Leu Tyr Glu Gly Leu Asn
180 185 190
Leu Asp Asp Cys Ser Met Tyr Glu Asp Ile Ser Arg Gly Leu Gln Gly
195 200 205
Thr Tyr Gln Asp Val Gly Ser Leu Asn Ile Gly Asp Val Gln Leu Glu
210 215 220
Lys Pro
225
<210> 102
<211> 2919
<212> DNA
<213> 智人
<400> 102
aaaaaaaaaa agtgatgagt tgtgaggcag gtcgcggccc tactgcctca ggagacgatg 60
cgcagctcat ttgcttaaat ttgcagctga cggctgccac ctctctagag gcacctggcg 120
gggagcctct caacataaga cagtgaccag tctggtgact cacagccggc acagccatga 180
actacccgct aacgctggaa atggacctcg agaacctgga ggacctgttc tgggaactgg 240
acagattgga caactataac gacacctccc tggtggaaaa tcatctctgc cctgccacag 300
aggggcccct catggcctcc ttcaaggccg tgttcgtgcc cgtggcctac agcctcatct 360
tcctcctggg cgtgatcggc aacgtcctgg tgctggtgat cctggagcgg caccggcaga 420
cacgcagttc cacggagacc ttcctgttcc acctggccgt ggccgacctc ctgctggtct 480
tcatcttgcc ctttgccgtg gccgagggct ctgtgggctg ggtcctgggg accttcctct 540
gcaaaactgt gattgccctg cacaaagtca acttctactg cagcagcctg ctcctggcct 600
gcatcgccgt ggaccgctac ctggccattg tccacgccgt ccatgcctac cgccaccgcc 660
gcctcctctc catccacatc acctgtggga ccatctggct ggtgggcttc ctccttgcct 720
tgccagagat tctcttcgcc aaagtcagcc aaggccatca caacaactcc ctgccacgtt 780
gcaccttctc ccaagagaac caagcagaaa cgcatgcctg gttcacctcc cgattcctct 840
accatgtggc gggattcctg ctgcccatgc tggtgatggg ctggtgctac gtgggggtag 900
tgcacaggtt gcgccaggcc cagcggcgcc ctcagcggca gaaggcagtc agggtggcca 960
tcctggtgac aagcatcttc ttcctctgct ggtcacccta ccacatcgtc atcttcctgg 1020
acaccctggc gaggctgaag gccgtggaca atacctgcaa gctgaatggc tctctccccg 1080
tggccatcac catgtgtgag ttcctgggcc tggcccactg ctgcctcaac cccatgctct 1140
acactttcgc cggcgtgaag ttccgcagtg acctgtcgcg gctcctgacg aagctgggct 1200
gtaccggccc tgcctccctg tgccagctct tccctagctg gcgcaggagc agtctctctg 1260
agtcagagaa tgccacctct ctcaccacgt tctaggtccc agtgtcccct tttattgctg 1320
cttttccttg gggcaggcag tgatgctgga tgctccttcc aacaggagct gggatcctaa 1380
gggctcaccg tggctaagag tgtcctagga gtatcctcat ttggggtagc tagaggaacc 1440
aacccccatt tctagaacat ccctgccagc tcttctgccg gccctggggc taggctggag 1500
cccagggagc ggaaagcagc tcaaaggcac agtgaaggct gtccttaccc atctgcaccc 1560
ccctgggctg agagaacctc acgcacctcc catcctaatc atccaatgct caagaaacaa 1620
cttctacttc tgcccttgcc aacggagagc gcctgcccct cccagaacac actccatcag 1680
cttaggggct gctgacctcc acagcttccc ctctctcctc ctgcccacct gtcaaacaaa 1740
gccagaagct gagcaccagg ggatgagtgg aggttaaggc tgaggaaagg ccagctggca 1800
gcagagtgtg gccttcggac aactcagtcc ctaaaaacac agacattctg ccaggccccc 1860
aagcctgcag tcatcttgac caagcaggaa gctcagactg gttgagttca ggtagctgcc 1920
cctggctctg accgaaacag cgctgggtcc accccatgtc accggatcct gggtggtctg 1980
caggcagggc tgactctagg tgcccttgga ggccagccag tgacctgagg aagcgtgaag 2040
gccgagaagc aagaaagaaa cccgacagag ggaagaaaag agctttcttc ccgaacccca 2100
aggagggaga tggatcaatc aaacccggcg gtcccctccg ccaggcgaga tggggtgggg 2160
tggagaactc ctagggtggc tgggtccagg ggatgggagg ttgtgggcat tgatggggaa 2220
ggaggctggc ttgtcccctc ctcactccct tcccataagc tatagacccg aggaaactca 2280
gagtcggaac ggagaaaggt ggactggaag gggcccgtgg gagtcatctc aaccatcccc 2340
tccgtggcat caccttaggc agggaagtgt aagaaacaca ctgaggcagg gaagtcccca 2400
ggccccagga agccgtgccc tgcccccgtg aggatgtcac tcagatggaa ccgcaggaag 2460
ctgctccgtg cttgtttgct cacctggggt gtgggaggcc cgtccggcag ttctgggtgc 2520
tccctaccac ctccccagcc tttgatcagg tggggagtca gggacccctg cccttgtccc 2580
actcaagcca agcagccaag ctccttggga ggccccactg gggaaataac agctgtggct 2640
cacgtgagag tgtcttcacg gcaggacaac gaggaagccc taagacgtcc cttttttctc 2700
tgagtatctc ctcgcaagct gggtaatcga tgggggagtc tgaagcagat gcaaagaggc 2760
aagaggctgg attttgaatt ttctttttaa taaaaaggca cctataaaac aggtcaatac 2820
agtacaggca gcacagagac ccccggaaca agcctaaaaa ttgtttcaaa ataaaaacca 2880
agaagatgtc ttcacatatt gtaaaaaaaa aaaaaaaaa 2919
<210> 103
<211> 372
<212> PRT
<213> 智人
<400> 103
Met Asn Tyr Pro Leu Thr Leu Glu Met Asp Leu Glu Asn Leu Glu Asp
1 5 10 15
Leu Phe Trp Glu Leu Asp Arg Leu Asp Asn Tyr Asn Asp Thr Ser Leu
20 25 30
Val Glu Asn His Leu Cys Pro Ala Thr Glu Gly Pro Leu Met Ala Ser
35 40 45
Phe Lys Ala Val Phe Val Pro Val Ala Tyr Ser Leu Ile Phe Leu Leu
50 55 60
Gly Val Ile Gly Asn Val Leu Val Leu Val Ile Leu Glu Arg His Arg
65 70 75 80
Gln Thr Arg Ser Ser Thr Glu Thr Phe Leu Phe His Leu Ala Val Ala
85 90 95
Asp Leu Leu Leu Val Phe Ile Leu Pro Phe Ala Val Ala Glu Gly Ser
100 105 110
Val Gly Trp Val Leu Gly Thr Phe Leu Cys Lys Thr Val Ile Ala Leu
115 120 125
His Lys Val Asn Phe Tyr Cys Ser Ser Leu Leu Leu Ala Cys Ile Ala
130 135 140
Val Asp Arg Tyr Leu Ala Ile Val His Ala Val His Ala Tyr Arg His
145 150 155 160
Arg Arg Leu Leu Ser Ile His Ile Thr Cys Gly Thr Ile Trp Leu Val
165 170 175
Gly Phe Leu Leu Ala Leu Pro Glu Ile Leu Phe Ala Lys Val Ser Gln
180 185 190
Gly His His Asn Asn Ser Leu Pro Arg Cys Thr Phe Ser Gln Glu Asn
195 200 205
Gln Ala Glu Thr His Ala Trp Phe Thr Ser Arg Phe Leu Tyr His Val
210 215 220
Ala Gly Phe Leu Leu Pro Met Leu Val Met Gly Trp Cys Tyr Val Gly
225 230 235 240
Val Val His Arg Leu Arg Gln Ala Gln Arg Arg Pro Gln Arg Gln Lys
245 250 255
Ala Val Arg Val Ala Ile Leu Val Thr Ser Ile Phe Phe Leu Cys Trp
260 265 270
Ser Pro Tyr His Ile Val Ile Phe Leu Asp Thr Leu Ala Arg Leu Lys
275 280 285
Ala Val Asp Asn Thr Cys Lys Leu Asn Gly Ser Leu Pro Val Ala Ile
290 295 300
Thr Met Cys Glu Phe Leu Gly Leu Ala His Cys Cys Leu Asn Pro Met
305 310 315 320
Leu Tyr Thr Phe Ala Gly Val Lys Phe Arg Ser Asp Leu Ser Arg Leu
325 330 335
Leu Thr Lys Leu Gly Cys Thr Gly Pro Ala Ser Leu Cys Gln Leu Phe
340 345 350
Pro Ser Trp Arg Arg Ser Ser Leu Ser Glu Ser Glu Asn Ala Thr Ser
355 360 365
Leu Thr Thr Phe
370
<210> 104
<211> 1388
<212> DNA
<213> 智人
<400> 104
gatttatact cttaatgggt actttctgac tgaattttat gagctcattc tgaagaggct 60
gacgatttta ctatctcatt tttttccttt ctccagaatg ggttctgggt gggtcccctg 120
ggtggtggct ctgctagtga atctgacccg actggattcc tccatgactc aaggcacaga 180
ctctccagaa gattttgtga ttcaggcaaa ggctgactgt tacttcacca acgggacaga 240
aaaggtgcag tttgtggtca gattcatctt taacttggag gagtatgtac gtttcgacag 300
tgatgtgggg atgtttgtgg cattgaccaa gctggggcag ccagatgctg agcagtggaa 360
cagccggctg gatctcttgg agaggagcag acaggccgtg gatggggtct gtagacacaa 420
ctacaggctg ggcgcaccct tcactgtggg gagaaaagtg caaccagagg tgacagtgta 480
cccagagagg accccactcc tgcaccagca taatctgctg cactgctctg tgacaggctt 540
ctatccaggg gatatcaaga tcaagtggtt cctgaatggg caggaggaga gagctggggt 600
catgtccact ggccctatca ggaatggaga ctggaccttt cagactgtgg tgatgctaga 660
aatgactcct gaacttggac atgtctacac ctgccttgtc gatcactcca gcctgctgag 720
ccctgtttct gtggagtgga gagctcagtc tgaatattct tggagaaaga tgctgagtgg 780
cattgcagcc ttcctacttg ggctaatctt ccttctggtg ggaatcgtca tccagctaag 840
ggctcagaaa ggatatgtga ggacgcagat gtctggtaat gaggtctcaa gagctgttct 900
gctccctcag tcatgctaag gtcctcactg aagcttctct ctctggagcc tgaagtagtg 960
atgagtagtc tgggccctgg gtgaggtaaa ggacattcat gaggtcaatg ttctgggaat 1020
aactctcttc cctgatcctt ggaggagccc gaactgattc tggagctctg tgttctgaga 1080
tcatgcatct cccacccatc tgcccttctc ccttctacgt gtacatcatt aatccccatt 1140
gccaagggca ttgtccagaa actcccctga gaccttactc cttccagccc caaatcattt 1200
acttttctgt ggtccagccc tactcctata agtcatgatc tccaaagctt tctgtcttcc 1260
aactgcagtc tccacagtct tcagaagaca aatgctcagg tagtcactgt ttccttttca 1320
ctgtttttaa aaacctttta ttgtcaaata aaatggagat acaaaaaatg taaaaaaaaa 1380
aaaaaaaa 1388
<210> 105
<211> 273
<212> PRT
<213> 智人
<400> 105
Met Gly Ser Gly Trp Val Pro Trp Val Val Ala Leu Leu Val Asn Leu
1 5 10 15
Thr Arg Leu Asp Ser Ser Met Thr Gln Gly Thr Asp Ser Pro Glu Asp
20 25 30
Phe Val Ile Gln Ala Lys Ala Asp Cys Tyr Phe Thr Asn Gly Thr Glu
35 40 45
Lys Val Gln Phe Val Val Arg Phe Ile Phe Asn Leu Glu Glu Tyr Val
50 55 60
Arg Phe Asp Ser Asp Val Gly Met Phe Val Ala Leu Thr Lys Leu Gly
65 70 75 80
Gln Pro Asp Ala Glu Gln Trp Asn Ser Arg Leu Asp Leu Leu Glu Arg
85 90 95
Ser Arg Gln Ala Val Asp Gly Val Cys Arg His Asn Tyr Arg Leu Gly
100 105 110
Ala Pro Phe Thr Val Gly Arg Lys Val Gln Pro Glu Val Thr Val Tyr
115 120 125
Pro Glu Arg Thr Pro Leu Leu His Gln His Asn Leu Leu His Cys Ser
130 135 140
Val Thr Gly Phe Tyr Pro Gly Asp Ile Lys Ile Lys Trp Phe Leu Asn
145 150 155 160
Gly Gln Glu Glu Arg Ala Gly Val Met Ser Thr Gly Pro Ile Arg Asn
165 170 175
Gly Asp Trp Thr Phe Gln Thr Val Val Met Leu Glu Met Thr Pro Glu
180 185 190
Leu Gly His Val Tyr Thr Cys Leu Val Asp His Ser Ser Leu Leu Ser
195 200 205
Pro Val Ser Val Glu Trp Arg Ala Gln Ser Glu Tyr Ser Trp Arg Lys
210 215 220
Met Leu Ser Gly Ile Ala Ala Phe Leu Leu Gly Leu Ile Phe Leu Leu
225 230 235 240
Val Gly Ile Val Ile Gln Leu Arg Ala Gln Lys Gly Tyr Val Arg Thr
245 250 255
Gln Met Ser Gly Asn Glu Val Ser Arg Ala Val Leu Leu Pro Gln Ser
260 265 270
Cys
<210> 106
<211> 2341
<212> DNA
<213> 智人
<400> 106
tccgcgatcg ctgctttcag ctgggcaaca gcacagggcg gtcctgcgca tccggggggc 60
tgcgctccca gaccggtggc tggggagtag gttgcagggg tgtcactgtg tcaggggagg 120
aggaggagga ggaggaagaa gaggtagcgg caaggtcgcg gtctgaggtt ccggtgctcg 180
ccgccgccca gctcccagcc gaggctttct caaccgcgtc aataaaaggc cgccccgacc 240
cgcccccgcg ccccgcagcc ctgccggaca ccccgggctg cagctgagcg ggcgcagacg 300
ggccgaggcg ggcgccgggc gcgcagggac cgagggaccg agtgctcccc atgagcgcac 360
gtgggccggg cggtccgcaa gcccggctga gagcgcgcca tggggcaggc gggctgcaag 420
gggctctgcc tgtcgctgtt cgactacaag accgagaagt atgtcatcgc caagaacaag 480
aaggtgggcc tgctgtaccg gctgctgcag gcctccatcc tggcgtacct ggtcgtatgg 540
gtgttcctga taaagaaggg ttaccaagac gtcgacacct ccctgcagag tgctgtcatc 600
accaaagtca agggcgtggc cttcaccaac acctcggatc ttgggcagcg gatctgggat 660
gtcgccgact acgtcattcc agcccaggga gagaacgtct tttttgtggt caccaacctg 720
attgtgaccc ccaaccagcg gcagaacgtc tgtgctgaga atgaaggcat tcctgatggc 780
gcgtgctcca aggacagcga ctgccacgct ggggaagcgg ttacagctgg aaacggagtg 840
aagaccggcc gctgcctgcg gagagagaac ttggccaggg gcacctgtga gatctttgcc 900
tggtgcccgt tggagacaag ctccaggccg gaggagccat tcctgaagga ggccgaagac 960
ttcaccattt tcataaagaa ccacatccgt ttccccaaat tcaacttctc caaaagcaat 1020
gtgatggacg tcaaggacag atctttcctg aaatcatgcc actttggccc caagaaccac 1080
tactgcccca tcttccgact gggctccgtg atccgctggg ccgggagcga cttccaggat 1140
atagccctgg agggtggcgt gataggaatt aatattgaat ggaactgtga tcttgataaa 1200
gctgcctctg agtgccaccc tcactattct tttagccgtc tggacaataa actttcaaag 1260
tctgtctcct ccgggtacaa cttcagattt gccagatatt accgagacgc agccggggtg 1320
gagttccgca ccctgatgaa agcctacggg atccgctttg acgtgatggt gaacggcaag 1380
ggtgctttct tctgcgacct ggtactcatc tacctcatca aaaagagaga gttttaccgt 1440
gacaagaagt acgaggaagt gaggggccta gaagacagtt cccaggaggc cgaggacgag 1500
gcatcggggc tggggctatc tgagcagctc acatctgggc cagggctgct ggggatgccg 1560
gagcagcagg agctgcagga gccacccgag gcgaagcgtg gaagcagcag tcagaagggg 1620
aacggatctg tgtgcccaca gctcctggag ccccacagga gcacgtgaat tgcctctgct 1680
tacgttcagg ccctgtccta aacccagccg tctagcaccc agtgatccca tgcctttggg 1740
aatcccagga tgctgcccaa cgggaaattt gtacattggg tgctatcaat gccacatcac 1800
agggaccagc catcacagag caaagtgacc tccacgtctg atgctggggt catcaggacg 1860
gacccatcat ggctgtcttt ttgccccacc ccctgccgtc agttcttcct ttctccgtgg 1920
ctggcttccc gcactaggga acgggttgta aatggggaac atgacttcct tccggagtcc 1980
ttgagcacct cagctaagga ccgcagtgcc ctgtagagtt cctagattac ctcactggga 2040
atagcattgt gcgtgtccgg aaaagggctc catttggttc cagcccactc ccctctgcaa 2100
gtgccgcagc ttccctcaga gcatactctc cagtggatcc aagtactctc tctcctaaag 2160
acaccacctt cctgccagct gtttgccctt aggccagtac acagaattaa agtgggggag 2220
atggcagacg ctttctggga cctgcccaag atatgtattc tctgacactc ttatttggtc 2280
ataaaacaat aaatggtgtc aatttcaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2340
a 2341
<210> 107
<211> 422
<212> PRT
<213> 智人
<400> 107
Met Gly Gln Ala Gly Cys Lys Gly Leu Cys Leu Ser Leu Phe Asp Tyr
1 5 10 15
Lys Thr Glu Lys Tyr Val Ile Ala Lys Asn Lys Lys Val Gly Leu Leu
20 25 30
Tyr Arg Leu Leu Gln Ala Ser Ile Leu Ala Tyr Leu Val Val Trp Val
35 40 45
Phe Leu Ile Lys Lys Gly Tyr Gln Asp Val Asp Thr Ser Leu Gln Ser
50 55 60
Ala Val Ile Thr Lys Val Lys Gly Val Ala Phe Thr Asn Thr Ser Asp
65 70 75 80
Leu Gly Gln Arg Ile Trp Asp Val Ala Asp Tyr Val Ile Pro Ala Gln
85 90 95
Gly Glu Asn Val Phe Phe Val Val Thr Asn Leu Ile Val Thr Pro Asn
100 105 110
Gln Arg Gln Asn Val Cys Ala Glu Asn Glu Gly Ile Pro Asp Gly Ala
115 120 125
Cys Ser Lys Asp Ser Asp Cys His Ala Gly Glu Ala Val Thr Ala Gly
130 135 140
Asn Gly Val Lys Thr Gly Arg Cys Leu Arg Arg Glu Asn Leu Ala Arg
145 150 155 160
Gly Thr Cys Glu Ile Phe Ala Trp Cys Pro Leu Glu Thr Ser Ser Arg
165 170 175
Pro Glu Glu Pro Phe Leu Lys Glu Ala Glu Asp Phe Thr Ile Phe Ile
180 185 190
Lys Asn His Ile Arg Phe Pro Lys Phe Asn Phe Ser Lys Ser Asn Val
195 200 205
Met Asp Val Lys Asp Arg Ser Phe Leu Lys Ser Cys His Phe Gly Pro
210 215 220
Lys Asn His Tyr Cys Pro Ile Phe Arg Leu Gly Ser Val Ile Arg Trp
225 230 235 240
Ala Gly Ser Asp Phe Gln Asp Ile Ala Leu Glu Gly Gly Val Ile Gly
245 250 255
Ile Asn Ile Glu Trp Asn Cys Asp Leu Asp Lys Ala Ala Ser Glu Cys
260 265 270
His Pro His Tyr Ser Phe Ser Arg Leu Asp Asn Lys Leu Ser Lys Ser
275 280 285
Val Ser Ser Gly Tyr Asn Phe Arg Phe Ala Arg Tyr Tyr Arg Asp Ala
290 295 300
Ala Gly Val Glu Phe Arg Thr Leu Met Lys Ala Tyr Gly Ile Arg Phe
305 310 315 320
Asp Val Met Val Asn Gly Lys Gly Ala Phe Phe Cys Asp Leu Val Leu
325 330 335
Ile Tyr Leu Ile Lys Lys Arg Glu Phe Tyr Arg Asp Lys Lys Tyr Glu
340 345 350
Glu Val Arg Gly Leu Glu Asp Ser Ser Gln Glu Ala Glu Asp Glu Ala
355 360 365
Ser Gly Leu Gly Leu Ser Glu Gln Leu Thr Ser Gly Pro Gly Leu Leu
370 375 380
Gly Met Pro Glu Gln Gln Glu Leu Gln Glu Pro Pro Glu Ala Lys Arg
385 390 395 400
Gly Ser Ser Ser Gln Lys Gly Asn Gly Ser Val Cys Pro Gln Leu Leu
405 410 415
Glu Pro His Arg Ser Thr
420
<210> 108
<211> 1548
<212> DNA
<213> 智人
<400> 108
aattgctaag ccgtgcagtc acagagggaa cacagagcct agttgtaaac ggacagagac 60
gagaggggca agggaggaca gtggatgaca gggaagacga gtgggggcag agctgctcag 120
gaccatggct gaggccatca cctatgcaga tctgaggttt gtgaaggctc ccctgaagaa 180
gagcatctcc agccggttag gacaggaccc aggggctgat gatgatgggg aaatcaccta 240
cgagaatgtt caagtgcccg cagtcctagg ggtgccctca agcttggctt cttctgtact 300
aggggacaaa gcagcggtca agtcggagca gccaactgcg tcctggagag ccgtgacgtc 360
accagctgtc gggcggattc tcccctgccg cacaacctgc ctgcgatacc tcctgctcgg 420
cctgctcctc acctgcctgc tgttaggagt gaccgccatc tgcctgggag tgcgctatct 480
gcaggtgtct cagcagctcc agcagacgaa cagggttctg gaagtcacta acagcagcct 540
gaggcagcag ctccgcctca agataacgca gctgggacag agtgcagagg atctgcaggg 600
gtccaggaga gagctggcgc agagtcagga agcactacag gtggaacaga gggctcatca 660
ggcggccgaa gggcagctac aggcctgcca ggcagacaga cagaagacga aggagacctt 720
gcaaagtgag gagcaacaga ggagggcctt ggagcagaag ctgagcaaca tggagaacag 780
actgaagccc ttcttcacat gcggctcagc agacacctgc tgtccgtcgg gatggataat 840
gcatcagaaa agctgctttt acatctcact tacttcaaaa aattggcagg agagccaaaa 900
acaatgtgaa actctgtctt ccaagctggc cacattcagt gaaatttatc cacaatcaca 960
ctcttactac ttcttaaatt cactgttgcc aaatggtggt tcagggaatt catattggac 1020
tggcctcagc tctaacaagg attggaagtt gactgatgat acacaacgca ctaggactta 1080
tgctcaaagc tcaaaatgta acaaggtaca taaaacttgg tcatggtgga cactggagtc 1140
agagtcatgt agaagttctc ttccctacat ctgtgagatg acagctttca ggtttccaga 1200
ttaggacagt cctttgcact gagttgacac tcatgccaac aagaacctgt gcccctcctt 1260
cctaacctga ggcctggggt tcctcagacc atctccttca ttctgggcag tgcccagcca 1320
ccggctgacc cacacctgac acttccagcc agtctgctgc ctgctccctc ttcctgaaac 1380
tggactgttc ctgggaaaag ggtgaagcca cctctagaag ggactttggc ctccccccaa 1440
gaacttccca tggtagaatg gggtggggga ggagggcgca cgggctgagc ggataggggc 1500
ggcccggagc cagccaggca gttttattga aatcttttta aataattg 1548
<210> 109
<211> 359
<212> PRT
<213> 智人
<400> 109
Met Ala Glu Ala Ile Thr Tyr Ala Asp Leu Arg Phe Val Lys Ala Pro
1 5 10 15
Leu Lys Lys Ser Ile Ser Ser Arg Leu Gly Gln Asp Pro Gly Ala Asp
20 25 30
Asp Asp Gly Glu Ile Thr Tyr Glu Asn Val Gln Val Pro Ala Val Leu
35 40 45
Gly Val Pro Ser Ser Leu Ala Ser Ser Val Leu Gly Asp Lys Ala Ala
50 55 60
Val Lys Ser Glu Gln Pro Thr Ala Ser Trp Arg Ala Val Thr Ser Pro
65 70 75 80
Ala Val Gly Arg Ile Leu Pro Cys Arg Thr Thr Cys Leu Arg Tyr Leu
85 90 95
Leu Leu Gly Leu Leu Leu Thr Cys Leu Leu Leu Gly Val Thr Ala Ile
100 105 110
Cys Leu Gly Val Arg Tyr Leu Gln Val Ser Gln Gln Leu Gln Gln Thr
115 120 125
Asn Arg Val Leu Glu Val Thr Asn Ser Ser Leu Arg Gln Gln Leu Arg
130 135 140
Leu Lys Ile Thr Gln Leu Gly Gln Ser Ala Glu Asp Leu Gln Gly Ser
145 150 155 160
Arg Arg Glu Leu Ala Gln Ser Gln Glu Ala Leu Gln Val Glu Gln Arg
165 170 175
Ala His Gln Ala Ala Glu Gly Gln Leu Gln Ala Cys Gln Ala Asp Arg
180 185 190
Gln Lys Thr Lys Glu Thr Leu Gln Ser Glu Glu Gln Gln Arg Arg Ala
195 200 205
Leu Glu Gln Lys Leu Ser Asn Met Glu Asn Arg Leu Lys Pro Phe Phe
210 215 220
Thr Cys Gly Ser Ala Asp Thr Cys Cys Pro Ser Gly Trp Ile Met His
225 230 235 240
Gln Lys Ser Cys Phe Tyr Ile Ser Leu Thr Ser Lys Asn Trp Gln Glu
245 250 255
Ser Gln Lys Gln Cys Glu Thr Leu Ser Ser Lys Leu Ala Thr Phe Ser
260 265 270
Glu Ile Tyr Pro Gln Ser His Ser Tyr Tyr Phe Leu Asn Ser Leu Leu
275 280 285
Pro Asn Gly Gly Ser Gly Asn Ser Tyr Trp Thr Gly Leu Ser Ser Asn
290 295 300
Lys Asp Trp Lys Leu Thr Asp Asp Thr Gln Arg Thr Arg Thr Tyr Ala
305 310 315 320
Gln Ser Ser Lys Cys Asn Lys Val His Lys Thr Trp Ser Trp Trp Thr
325 330 335
Leu Glu Ser Glu Ser Cys Arg Ser Ser Leu Pro Tyr Ile Cys Glu Met
340 345 350
Thr Ala Phe Arg Phe Pro Asp
355
<210> 110
<211> 2725
<212> DNA
<213> 智人
<400> 110
agaatgctga gcagtcaaca gcatttcttg ttccaagatc acccttctga gtacctctct 60
ggctgccaaa ttgccagggc cttcacagtt tgattccatt tctcagctcc aagcattagg 120
taaacccacc aagcaatcct agcctgtgat ggcgtttgac gtcagctgct tcttttgggt 180
ggtgctgttt tctgccggct gtaaagtcat cacctcctgg gatcagatgt gcattgagaa 240
agaagccaac aaaacatata actgtgaaaa tttaggtctc agtgaaatcc ctgacactct 300
accaaacaca acagaatttt tggaattcag ctttaatttt ttgcctacaa ttcacaatag 360
aaccttcagc agactcatga atcttacctt tttggattta actaggtgcc agattaactg 420
gatacatgaa gacacttttc aaagccatca tcaattaagc acacttgtgt taactggaaa 480
tcccctgata ttcatggcag aaacatcgct taatgggccc aagtcactga agcatctttt 540
cttaatccaa acgggaatat ccaatctcga gtttattcca gtgcacaatc tggaaaactt 600
ggaaagcttg tatcttggaa gcaaccatat ttcctccatt aagttcccca aagacttccc 660
agcacggaat ctgaaagtac tggattttca gaataatgct atacactaca tctctagaga 720
agacatgagg tctctggagc aggccatcaa cctaagcctg aacttcaatg gcaataatgt 780
taaaggtatt gagcttgggg cttttgattc aacgatcttc caaagtttga actttggagg 840
aactccaaat ttgtctgtta tattcaatgg tctgcagaac tctactactc agtctctctg 900
gctgggaaca tttgaggaca ttgatgacga agatattagt tcagccatgc tcaagggact 960
ctgtgaaatg tctgttgaga gcctcaacct gcaggaacac cgcttctctg acatctcatc 1020
caccacattt cagtgcttca cccaactcca agaattggat ctgacagcaa ctcacttgaa 1080
agggttaccc tctgggatga agggtctgaa cttgctcaag aaattagttc tcagtgtaaa 1140
tcatttcgat caattgtgtc aaatcagtgc tgccaatttc ccctccctta cacacctcta 1200
catcagaggc aacgtgaaga aacttcacct tggtgttggc tgcttggaga aactaggaaa 1260
ccttcagaca cttgatttaa gccataatga catagaggct tctgactgct gcagtctgca 1320
actcaaaaac ctgtcccact tgcaaacctt aaacctgagc cacaatgagc ctcttggtct 1380
ccagagtcag gcattcaaag aatgtcctca gctagaactc ctcgatttgg catttacccg 1440
cttacacatt aatgctccac aaagtccctt ccaaaacctc catttccttc aggttctgaa 1500
tctcacttac tgcttccttg ataccagcaa tcagcatctt ctagcaggcc taccagttct 1560
ccggcatctc aacttaaaag ggaatcactt tcaagatggg actatcacga agaccaacct 1620
acttcagacc gtgggcagct tggaggttct gattttgtcc tcttgtggtc tcctctctat 1680
agaccagcaa gcattccaca gcttgggaaa aatgagccat gtagacttaa gccacaacag 1740
cctgacatgc gacagcattg attctcttag ccatcttaag ggaatctacc tcaatctggc 1800
tgccaacagc attaacatca tctcaccccg tctcctccct atcttgtccc agcagagcac 1860
cattaattta agtcataacc ccctggactg cacttgctcg aatattcatt tcttaacatg 1920
gtacaaagaa aacctgcaca aacttgaagg ctcggaggag accacgtgtg caaacccgcc 1980
atctctaagg ggagttaagc tatctgatgt caagctttcc tgtgggatta cagccatagg 2040
cattttcttt ctcatagtat ttctattatt gttggctatt ctgctatttt ttgcagttaa 2100
ataccttctc aggtggaaat accaacacat ttagtgctga aggtttccag agaaagcaaa 2160
taagtgtgct tagcaaaatt gctctaagtg aaagaactgt catctgctgg tgaccagacc 2220
agacttttca gattgcttcc tggaactggg cagggactca ctgtgctttt ctgagcttct 2280
tactcctgtg agtcccagag ctaaagaacc ttctaggcaa gtacaccgaa tgactcagtc 2340
cagagggtca gatgctgctg tgagaggcac agagcccttt ccgcatgtgg aagagtggga 2400
ggaagcagag ggagggactg ggcagggact gccggccccg gagtctccca cagggaggcc 2460
attccccttc tactcaccga catccctccc agcaccacac accccgcccc tgaaaggaga 2520
tcatcagccc ccacaatttg tcagagctga agccagccca ctacccaccc ccactacagc 2580
attgtgcttg ggtctgggtt ctcagtaatg tagccatttg agaaacttac ttggggacaa 2640
agtctcaatc cttattttaa atgaaaaaag aaaagaaaag cataataaat ttaaaagaaa 2700
aggctgagaa atgaaaaaaa aaaaa 2725
<210> 111
<211> 661
<212> PRT
<213> 智人
<400> 111
Met Ala Phe Asp Val Ser Cys Phe Phe Trp Val Val Leu Phe Ser Ala
1 5 10 15
Gly Cys Lys Val Ile Thr Ser Trp Asp Gln Met Cys Ile Glu Lys Glu
20 25 30
Ala Asn Lys Thr Tyr Asn Cys Glu Asn Leu Gly Leu Ser Glu Ile Pro
35 40 45
Asp Thr Leu Pro Asn Thr Thr Glu Phe Leu Glu Phe Ser Phe Asn Phe
50 55 60
Leu Pro Thr Ile His Asn Arg Thr Phe Ser Arg Leu Met Asn Leu Thr
65 70 75 80
Phe Leu Asp Leu Thr Arg Cys Gln Ile Asn Trp Ile His Glu Asp Thr
85 90 95
Phe Gln Ser His His Gln Leu Ser Thr Leu Val Leu Thr Gly Asn Pro
100 105 110
Leu Ile Phe Met Ala Glu Thr Ser Leu Asn Gly Pro Lys Ser Leu Lys
115 120 125
His Leu Phe Leu Ile Gln Thr Gly Ile Ser Asn Leu Glu Phe Ile Pro
130 135 140
Val His Asn Leu Glu Asn Leu Glu Ser Leu Tyr Leu Gly Ser Asn His
145 150 155 160
Ile Ser Ser Ile Lys Phe Pro Lys Asp Phe Pro Ala Arg Asn Leu Lys
165 170 175
Val Leu Asp Phe Gln Asn Asn Ala Ile His Tyr Ile Ser Arg Glu Asp
180 185 190
Met Arg Ser Leu Glu Gln Ala Ile Asn Leu Ser Leu Asn Phe Asn Gly
195 200 205
Asn Asn Val Lys Gly Ile Glu Leu Gly Ala Phe Asp Ser Thr Ile Phe
210 215 220
Gln Ser Leu Asn Phe Gly Gly Thr Pro Asn Leu Ser Val Ile Phe Asn
225 230 235 240
Gly Leu Gln Asn Ser Thr Thr Gln Ser Leu Trp Leu Gly Thr Phe Glu
245 250 255
Asp Ile Asp Asp Glu Asp Ile Ser Ser Ala Met Leu Lys Gly Leu Cys
260 265 270
Glu Met Ser Val Glu Ser Leu Asn Leu Gln Glu His Arg Phe Ser Asp
275 280 285
Ile Ser Ser Thr Thr Phe Gln Cys Phe Thr Gln Leu Gln Glu Leu Asp
290 295 300
Leu Thr Ala Thr His Leu Lys Gly Leu Pro Ser Gly Met Lys Gly Leu
305 310 315 320
Asn Leu Leu Lys Lys Leu Val Leu Ser Val Asn His Phe Asp Gln Leu
325 330 335
Cys Gln Ile Ser Ala Ala Asn Phe Pro Ser Leu Thr His Leu Tyr Ile
340 345 350
Arg Gly Asn Val Lys Lys Leu His Leu Gly Val Gly Cys Leu Glu Lys
355 360 365
Leu Gly Asn Leu Gln Thr Leu Asp Leu Ser His Asn Asp Ile Glu Ala
370 375 380
Ser Asp Cys Cys Ser Leu Gln Leu Lys Asn Leu Ser His Leu Gln Thr
385 390 395 400
Leu Asn Leu Ser His Asn Glu Pro Leu Gly Leu Gln Ser Gln Ala Phe
405 410 415
Lys Glu Cys Pro Gln Leu Glu Leu Leu Asp Leu Ala Phe Thr Arg Leu
420 425 430
His Ile Asn Ala Pro Gln Ser Pro Phe Gln Asn Leu His Phe Leu Gln
435 440 445
Val Leu Asn Leu Thr Tyr Cys Phe Leu Asp Thr Ser Asn Gln His Leu
450 455 460
Leu Ala Gly Leu Pro Val Leu Arg His Leu Asn Leu Lys Gly Asn His
465 470 475 480
Phe Gln Asp Gly Thr Ile Thr Lys Thr Asn Leu Leu Gln Thr Val Gly
485 490 495
Ser Leu Glu Val Leu Ile Leu Ser Ser Cys Gly Leu Leu Ser Ile Asp
500 505 510
Gln Gln Ala Phe His Ser Leu Gly Lys Met Ser His Val Asp Leu Ser
515 520 525
His Asn Ser Leu Thr Cys Asp Ser Ile Asp Ser Leu Ser His Leu Lys
530 535 540
Gly Ile Tyr Leu Asn Leu Ala Ala Asn Ser Ile Asn Ile Ile Ser Pro
545 550 555 560
Arg Leu Leu Pro Ile Leu Ser Gln Gln Ser Thr Ile Asn Leu Ser His
565 570 575
Asn Pro Leu Asp Cys Thr Cys Ser Asn Ile His Phe Leu Thr Trp Tyr
580 585 590
Lys Glu Asn Leu His Lys Leu Glu Gly Ser Glu Glu Thr Thr Cys Ala
595 600 605
Asn Pro Pro Ser Leu Arg Gly Val Lys Leu Ser Asp Val Lys Leu Ser
610 615 620
Cys Gly Ile Thr Ala Ile Gly Ile Phe Phe Leu Ile Val Phe Leu Leu
625 630 635 640
Leu Leu Ala Ile Leu Leu Phe Phe Ala Val Lys Tyr Leu Leu Arg Trp
645 650 655
Lys Tyr Gln His Ile
660
<210> 112
<211> 3116
<212> DNA
<213> 智人
<400> 112
aacttccgat atcaacttcc tcaaacctct gatgagctgc tgctgctcga ctctgaggtg 60
cattcttttt ttgatgagag gcatctctag gtaccatccc tgacctggtc ctcatgctgc 120
cgaggctgtt gctgttgatc tgtgctccac tctgtgaacc tgccgagctg tttttgatag 180
ccagcccctc ccatcccaca gaggggagcc cagtgaccct gacgtgtaag atgccctttc 240
tacagagttc agatgcccag ttccagttct gctttttcag agacacccgg gccttgggcc 300
caggctggag cagctccccc aagctccaga tcgctgccat gtggaaagaa gacacagggt 360
catactggtg cgaggcacag acaatggcgt ccaaagtctt gaggagcagg agatcccaga 420
taaatgtgca cagggtccct gtcgctgatg tgagcttgga gactcagccc ccaggaggac 480
aggtgatgga gggagacagg ctggtcctca tctgctcagt tgctatgggc acaggagaca 540
tcaccttcct ttggtacaaa ggggctgtag gtttaaacct tcagtcaaag acccagcgtt 600
cactgacagc agagtatgag attccttcag tgagggagag tgatgctgag caatattact 660
gtgtagctga aaatggctat ggtcccagcc ccagtgggct ggtgagcatc actgtcagaa 720
tcccggtgtc tcgcccaatc ctcatgctca gggctcccag ggcccaggct gcagtggagg 780
atgtgctgga gcttcactgt gaggccctga gaggctctcc tccgatcctg tactggtttt 840
atcacgagga tatcaccctg gggagcaggt cggccccctc tggaggagga gcctccttca 900
acctttccct gactgaagaa cattctggaa actactcctg tgaggccaac aatggcctgg 960
gggcccagcg cagtgaggcg gtgacactca acttcacagt gcctactggg gccagaagca 1020
atcatcttac ctcaggagtc attgaggggc tgctcagcac ccttggtcca gccaccgtgg 1080
ccttattatt ttgctacggc ctcaaaagaa aaataggaag acgttcagcc agggatccac 1140
tcaggagcct tcccagccct ctaccccaag agttcaccta cctcaactca cctaccccag 1200
ggcagctaca gcctatatat gaaaatgtga atgttgtaag tggggatgag gtttattcac 1260
tggcgtacta taaccagccg gagcaggaat cagtagcagc agaaaccctg gggacacata 1320
tggaggacaa ggtttcctta gacatctatt ccaggctgag gaaagcaaac attacagatg 1380
tggactatga agatgctatg taaggttatg gaagattctg ctctttgaaa accatccatg 1440
accccaagcc tcaggcctga tatgttcttc agagatcctg gggcattagc tttccagtat 1500
acctcttctg gatgccattc tccatggcac tattccttca tctactgtga agtgaagttg 1560
gcgcagccct gaagaaacta cctaggagaa ctaatagaca caggagtgac agggactttg 1620
ttatcagaac cagattcctg ccggctcctt tgaaaacagg tcatattgtg ctcttctgtt 1680
tacaagagga aacaagatgg aataaaagaa attgggatct tgggttggag ggacagtgaa 1740
gcttagagca catgaactca aggttagtga ctctgcagga cttcacagag agagctgtgc 1800
ccatcattca gtccaagtgc tttctctgcc cagacagcac agaactccag ccccgctact 1860
tacatggatc atcgagtttc cacctaaaat atgattctat ttattttgag tcactgttac 1920
caaattagaa ctaaaacaaa gttacataaa aagttattgt gactccactt aattttagtg 1980
acgtattttt gtatatatag gccaacctat accacatcca aaattatgta tctattacag 2040
cccctagaag ctttataaat acagtgtgtc ttcttttatt cacaaaattt ttgaaatcgt 2100
ggtaatatgg tttgaaacct gtatcttaat tatttttttt ttaaattgag acagggtctc 2160
actctgtcac tcaatctgga atgcagtggc acaatcttgc ctcactgcaa cgcctgcctc 2220
tcaggctcaa gcaaacctct cacctcagcc tgctgagtag ctgggactac aggcacatgc 2280
caccaaactt ggccattttt tgtcttacgt agagacaaga tttcaccgtt ttgcccaggc 2340
tggtctcaaa ctcctgggct caagcaatgt attgaatttt aaaataacca ggcactcact 2400
cttatgaatt aataaacatt tggaggtata taaagtaaaa agttaaagtc tttcctgtaa 2460
gttaacacaa atgttaacta ttgttaaaaa ctttacaggt agctctctag atatttttct 2520
atttttgtat gtatacttat gcatacatgt aagtatataa acatttagaa gtgtacctat 2580
ctaacaaact attatgaaat actttcaaat ctgtaaatag atctattata ctattttaaa 2640
agtctctata gtagtgtgtt atatagataa atcataactt ttttcttttt ttattgtagt 2700
aaatatgcac aacataaaat tgatcatttt aaccattttt aagtgtacaa ttcagtggca 2760
ttaagtacta tcataatata ttttaatcct tctcatcact ggtggacatt aaggagactc 2820
tcaaaaaatt catattataa aaacaaagtt caaacaaatg tctttgtact agcatattat 2880
ggcactcctg ctggattatc tgaaggataa atttgtaaat ctagtattgc tagattatgc 2940
atattaaata ttcttgttaa atagtcttca atgtctctca ggtaaggctg tatcaattta 3000
tatcttcacc aacaacgtct gggaaatcag tttgtggggt gtattactta gttttcacat 3060
tgctaataaa gacatatcca agactgggta atttataaaa aaaaaaaaaa aaaaaa 3116
<210> 113
<211> 429
<212> PRT
<213> 智人
<400> 113
Met Leu Pro Arg Leu Leu Leu Leu Ile Cys Ala Pro Leu Cys Glu Pro
1 5 10 15
Ala Glu Leu Phe Leu Ile Ala Ser Pro Ser His Pro Thr Glu Gly Ser
20 25 30
Pro Val Thr Leu Thr Cys Lys Met Pro Phe Leu Gln Ser Ser Asp Ala
35 40 45
Gln Phe Gln Phe Cys Phe Phe Arg Asp Thr Arg Ala Leu Gly Pro Gly
50 55 60
Trp Ser Ser Ser Pro Lys Leu Gln Ile Ala Ala Met Trp Lys Glu Asp
65 70 75 80
Thr Gly Ser Tyr Trp Cys Glu Ala Gln Thr Met Ala Ser Lys Val Leu
85 90 95
Arg Ser Arg Arg Ser Gln Ile Asn Val His Arg Val Pro Val Ala Asp
100 105 110
Val Ser Leu Glu Thr Gln Pro Pro Gly Gly Gln Val Met Glu Gly Asp
115 120 125
Arg Leu Val Leu Ile Cys Ser Val Ala Met Gly Thr Gly Asp Ile Thr
130 135 140
Phe Leu Trp Tyr Lys Gly Ala Val Gly Leu Asn Leu Gln Ser Lys Thr
145 150 155 160
Gln Arg Ser Leu Thr Ala Glu Tyr Glu Ile Pro Ser Val Arg Glu Ser
165 170 175
Asp Ala Glu Gln Tyr Tyr Cys Val Ala Glu Asn Gly Tyr Gly Pro Ser
180 185 190
Pro Ser Gly Leu Val Ser Ile Thr Val Arg Ile Pro Val Ser Arg Pro
195 200 205
Ile Leu Met Leu Arg Ala Pro Arg Ala Gln Ala Ala Val Glu Asp Val
210 215 220
Leu Glu Leu His Cys Glu Ala Leu Arg Gly Ser Pro Pro Ile Leu Tyr
225 230 235 240
Trp Phe Tyr His Glu Asp Ile Thr Leu Gly Ser Arg Ser Ala Pro Ser
245 250 255
Gly Gly Gly Ala Ser Phe Asn Leu Ser Leu Thr Glu Glu His Ser Gly
260 265 270
Asn Tyr Ser Cys Glu Ala Asn Asn Gly Leu Gly Ala Gln Arg Ser Glu
275 280 285
Ala Val Thr Leu Asn Phe Thr Val Pro Thr Gly Ala Arg Ser Asn His
290 295 300
Leu Thr Ser Gly Val Ile Glu Gly Leu Leu Ser Thr Leu Gly Pro Ala
305 310 315 320
Thr Val Ala Leu Leu Phe Cys Tyr Gly Leu Lys Arg Lys Ile Gly Arg
325 330 335
Arg Ser Ala Arg Asp Pro Leu Arg Ser Leu Pro Ser Pro Leu Pro Gln
340 345 350
Glu Phe Thr Tyr Leu Asn Ser Pro Thr Pro Gly Gln Leu Gln Pro Ile
355 360 365
Tyr Glu Asn Val Asn Val Val Ser Gly Asp Glu Val Tyr Ser Leu Ala
370 375 380
Tyr Tyr Asn Gln Pro Glu Gln Glu Ser Val Ala Ala Glu Thr Leu Gly
385 390 395 400
Thr His Met Glu Asp Lys Val Ser Leu Asp Ile Tyr Ser Arg Leu Arg
405 410 415
Lys Ala Asn Ile Thr Asp Val Asp Tyr Glu Asp Ala Met
420 425
<210> 114
<211> 2797
<212> DNA
<213> 智人
<400> 114
gtgcagtgtc ctgactgtaa gatcaagtcc aaacctgttt tggaattgag gaaacttctc 60
ttttgatctc agcccttggt ggtccaggtc ttcatgctgc tgtgggtgat attactggtc 120
ctggctcctg tcagtggaca gtttgcaagg acacccaggc ccattatttt cctccagcct 180
ccatggacca cagtcttcca aggagagaga gtgaccctca cttgcaaggg atttcgcttc 240
tactcaccac agaaaacaaa atggtaccat cggtaccttg ggaaagaaat actaagagaa 300
accccagaca atatccttga ggttcaggaa tctggagagt acagatgcca ggcccagggc 360
tcccctctca gtagccctgt gcacttggat ttttcttcag cttcgctgat cctgcaagct 420
ccactttctg tgtttgaagg agactctgtg gttctgaggt gccgggcaaa ggcggaagta 480
acactgaata atactattta caagaatgat aatgtcctgg cattccttaa taaaagaact 540
gacttccata ttcctcatgc atgtctcaag gacaatggtg catatcgctg tactggatat 600
aaggaaagtt gttgccctgt ttcttccaat acagtcaaaa tccaagtcca agagccattt 660
acacgtccag tgctgagagc cagctccttc cagcccatca gcgggaaccc agtgaccctg 720
acctgtgaga cccagctctc tctagagagg tcagatgtcc cgctccggtt ccgcttcttc 780
agagatgacc agaccctggg attaggctgg agtctctccc cgaatttcca gattactgcc 840
atgtggagta aagattcagg gttctactgg tgtaaggcag caacaatgcc tcacagcgtc 900
atatctgaca gcccgagatc ctggatacag gtgcagatcc ctgcatctca tcctgtcctc 960
actctcagcc ctgaaaaggc tctgaatttt gagggaacca aggtgacact tcactgtgaa 1020
acccaggaag attctctgcg cactttgtac aggttttatc atgagggtgt ccccctgagg 1080
cacaagtcag tccgctgtga aaggggagca tccatcagct tctcactgac tacagagaat 1140
tcagggaact actactgcac agctgacaat ggccttggcg ccaagcccag taaggctgtg 1200
agcctctcag tcactgttcc cgtgtctcat cctgtcctca acctcagctc tcctgaggac 1260
ctgatttttg agggagccaa ggtgacactt cactgtgaag cccagagagg ttcactcccc 1320
atcctgtacc agtttcatca tgaggatgct gccctggagc gtaggtcggc caactctgca 1380
ggaggagtgg ccatcagctt ctctctgact gcagagcatt cagggaacta ctactgcaca 1440
gctgacaatg gctttggccc ccagcgcagt aaggcggtga gcctctccat cactgtccct 1500
gtgtctcatc ctgtcctcac cctcagctct gctgaggccc tgacttttga aggagccact 1560
gtgacacttc actgtgaagt ccagagaggt tccccacaaa tcctatacca gttttatcat 1620
gaggacatgc ccctgtggag cagctcaaca ccctctgtgg gaagagtgtc cttcagcttc 1680
tctctgactg aaggacattc agggaattac tactgcacag ctgacaatgg ctttggtccc 1740
cagcgcagtg aagtggtgag cctttttgtc actgttccag tgtctcgccc catcctcacc 1800
ctcagggttc ccagggccca ggctgtggtg ggggacctgc tggagcttca ctgtgaggcc 1860
ccgagaggct ctcccccaat cctgtactgg ttttatcatg aggatgtcac cctggggagc 1920
agctcagccc cctctggagg agaagcttct ttcaacctct ctctgactgc agaacattct 1980
ggaaactact catgtgaggc caacaatggc ctagtggccc agcacagtga cacaatatca 2040
ctcagtgtta tagttccagt atctcgtccc atcctcacct tcagggctcc cagggcccag 2100
gctgtggtgg gggacctgct ggagcttcac tgtgaggccc tgagaggctc ctccccaatc 2160
ctgtactggt tttatcatga agatgtcacc ctgggtaaga tctcagcccc ctctggagga 2220
ggggcctcct tcaacctctc tctgactaca gaacattctg gaatctactc ctgtgaggca 2280
gacaatggtc tggaggccca gcgcagtgag atggtgacac tgaaagttgc aggtgagtgg 2340
gccctgccca ccagcagcac atctgagaac tgactgtgcc tgttctccct gcagctgaaa 2400
atggagccac agagctcctc agggctgttt gcttgtgtgg catcccagca cacttcctgc 2460
ctgcagaacc tccctgtgaa agtctcggat cctttgtggt atggttccag gaatctgatg 2520
tttcccagca gtcttcttga agatgatcaa agcacctcac taaaaatgca aataagactt 2580
ttttagaaca taaactatat tctgaactga aattattaca tgaaaatgaa accaaagaat 2640
tctgagcata tgtttctctg ccgtagaaag gattaagctg tttcttgtcc ggattcttct 2700
ctcattgact tctaagaagc ctctactctt gagtctcttt cattactggg gatgtaaatg 2760
ttccttacat ttccacatta aaaatcctat gttaacg 2797
<210> 115
<211> 759
<212> PRT
<213> 智人
<400> 115
Met Leu Leu Trp Val Ile Leu Leu Val Leu Ala Pro Val Ser Gly Gln
1 5 10 15
Phe Ala Arg Thr Pro Arg Pro Ile Ile Phe Leu Gln Pro Pro Trp Thr
20 25 30
Thr Val Phe Gln Gly Glu Arg Val Thr Leu Thr Cys Lys Gly Phe Arg
35 40 45
Phe Tyr Ser Pro Gln Lys Thr Lys Trp Tyr His Arg Tyr Leu Gly Lys
50 55 60
Glu Ile Leu Arg Glu Thr Pro Asp Asn Ile Leu Glu Val Gln Glu Ser
65 70 75 80
Gly Glu Tyr Arg Cys Gln Ala Gln Gly Ser Pro Leu Ser Ser Pro Val
85 90 95
His Leu Asp Phe Ser Ser Ala Ser Leu Ile Leu Gln Ala Pro Leu Ser
100 105 110
Val Phe Glu Gly Asp Ser Val Val Leu Arg Cys Arg Ala Lys Ala Glu
115 120 125
Val Thr Leu Asn Asn Thr Ile Tyr Lys Asn Asp Asn Val Leu Ala Phe
130 135 140
Leu Asn Lys Arg Thr Asp Phe His Ile Pro His Ala Cys Leu Lys Asp
145 150 155 160
Asn Gly Ala Tyr Arg Cys Thr Gly Tyr Lys Glu Ser Cys Cys Pro Val
165 170 175
Ser Ser Asn Thr Val Lys Ile Gln Val Gln Glu Pro Phe Thr Arg Pro
180 185 190
Val Leu Arg Ala Ser Ser Phe Gln Pro Ile Ser Gly Asn Pro Val Thr
195 200 205
Leu Thr Cys Glu Thr Gln Leu Ser Leu Glu Arg Ser Asp Val Pro Leu
210 215 220
Arg Phe Arg Phe Phe Arg Asp Asp Gln Thr Leu Gly Leu Gly Trp Ser
225 230 235 240
Leu Ser Pro Asn Phe Gln Ile Thr Ala Met Trp Ser Lys Asp Ser Gly
245 250 255
Phe Tyr Trp Cys Lys Ala Ala Thr Met Pro His Ser Val Ile Ser Asp
260 265 270
Ser Pro Arg Ser Trp Ile Gln Val Gln Ile Pro Ala Ser His Pro Val
275 280 285
Leu Thr Leu Ser Pro Glu Lys Ala Leu Asn Phe Glu Gly Thr Lys Val
290 295 300
Thr Leu His Cys Glu Thr Gln Glu Asp Ser Leu Arg Thr Leu Tyr Arg
305 310 315 320
Phe Tyr His Glu Gly Val Pro Leu Arg His Lys Ser Val Arg Cys Glu
325 330 335
Arg Gly Ala Ser Ile Ser Phe Ser Leu Thr Thr Glu Asn Ser Gly Asn
340 345 350
Tyr Tyr Cys Thr Ala Asp Asn Gly Leu Gly Ala Lys Pro Ser Lys Ala
355 360 365
Val Ser Leu Ser Val Thr Val Pro Val Ser His Pro Val Leu Asn Leu
370 375 380
Ser Ser Pro Glu Asp Leu Ile Phe Glu Gly Ala Lys Val Thr Leu His
385 390 395 400
Cys Glu Ala Gln Arg Gly Ser Leu Pro Ile Leu Tyr Gln Phe His His
405 410 415
Glu Asp Ala Ala Leu Glu Arg Arg Ser Ala Asn Ser Ala Gly Gly Val
420 425 430
Ala Ile Ser Phe Ser Leu Thr Ala Glu His Ser Gly Asn Tyr Tyr Cys
435 440 445
Thr Ala Asp Asn Gly Phe Gly Pro Gln Arg Ser Lys Ala Val Ser Leu
450 455 460
Ser Ile Thr Val Pro Val Ser His Pro Val Leu Thr Leu Ser Ser Ala
465 470 475 480
Glu Ala Leu Thr Phe Glu Gly Ala Thr Val Thr Leu His Cys Glu Val
485 490 495
Gln Arg Gly Ser Pro Gln Ile Leu Tyr Gln Phe Tyr His Glu Asp Met
500 505 510
Pro Leu Trp Ser Ser Ser Thr Pro Ser Val Gly Arg Val Ser Phe Ser
515 520 525
Phe Ser Leu Thr Glu Gly His Ser Gly Asn Tyr Tyr Cys Thr Ala Asp
530 535 540
Asn Gly Phe Gly Pro Gln Arg Ser Glu Val Val Ser Leu Phe Val Thr
545 550 555 560
Val Pro Val Ser Arg Pro Ile Leu Thr Leu Arg Val Pro Arg Ala Gln
565 570 575
Ala Val Val Gly Asp Leu Leu Glu Leu His Cys Glu Ala Pro Arg Gly
580 585 590
Ser Pro Pro Ile Leu Tyr Trp Phe Tyr His Glu Asp Val Thr Leu Gly
595 600 605
Ser Ser Ser Ala Pro Ser Gly Gly Glu Ala Ser Phe Asn Leu Ser Leu
610 615 620
Thr Ala Glu His Ser Gly Asn Tyr Ser Cys Glu Ala Asn Asn Gly Leu
625 630 635 640
Val Ala Gln His Ser Asp Thr Ile Ser Leu Ser Val Ile Val Pro Val
645 650 655
Ser Arg Pro Ile Leu Thr Phe Arg Ala Pro Arg Ala Gln Ala Val Val
660 665 670
Gly Asp Leu Leu Glu Leu His Cys Glu Ala Leu Arg Gly Ser Ser Pro
675 680 685
Ile Leu Tyr Trp Phe Tyr His Glu Asp Val Thr Leu Gly Lys Ile Ser
690 695 700
Ala Pro Ser Gly Gly Gly Ala Ser Phe Asn Leu Ser Leu Thr Thr Glu
705 710 715 720
His Ser Gly Ile Tyr Ser Cys Glu Ala Asp Asn Gly Leu Glu Ala Gln
725 730 735
Arg Ser Glu Met Val Thr Leu Lys Val Ala Gly Glu Trp Ala Leu Pro
740 745 750
Thr Ser Ser Thr Ser Glu Asn
755
<210> 116
<211> 1814
<212> DNA
<213> 智人
<400> 116
gcagagctcg agaggcggct gccgggctgc ggggcgcctt gactctccct ccaccctgcc 60
tcctcgggct ccactcgtct gcccctggac tcccgtctcc tcctgtcctc cggcttccca 120
gagctccctc cttatggcag cagcttcccg cgtctccggc gcagcttctc agcggacgac 180
cctctcgctc cggggctgag cccagtccct ggatgttgct gaaactctcg agatcatgcg 240
cgggtttggc tgctgcttcc ccgccgggtg ccactgccac cgccgccgcc tctgctgccg 300
ccgtccgcgg gatgctcagt agcccgctgc ccggcccccg cgatcctgtg ttcctcggaa 360
gccgtttgct gctgcagagt tgcacgaact agtcatggtg ctgtgggagt ccccgcggca 420
gtgcagcagc tggacacttt gcgagggctt ttgctggctg ctgctgctgc ccgtcatgct 480
actcatcgta gcccgcccgg tgaagctcgc tgctttccct acctccttaa gtgactgcca 540
aacgcccacc ggctggaatt gctctggtta tgatgacaga gaaaatgatc tcttcctctg 600
tgacaccaac acctgtaaat ttgatgggga atgtttaaga attggagaca ctgtgacttg 660
cgtctgtcag ttcaagtgca acaatgacta tgtgcctgtg tgtggctcca atggggagag 720
ctaccagaat gagtgttacc tgcgacaggc tgcatgcaaa cagcagagtg agatacttgt 780
ggtgtcagaa ggatcatgtg ccacagatgc aggatcagga tctggagatg gagtccatga 840
aggctctgga gaaactagtc aaaaggagac atccacctgt gatatttgcc agtttggtgc 900
agaatgtgac gaagatgccg aggatgtctg gtgtgtgtgt aatattgact gttctcaaac 960
caacttcaat cccctctgcg cttctgatgg gaaatcttat gataatgcat gccaaatcaa 1020
agaagcatcg tgtcagaaac aggagaaaat tgaagtcatg tctttgggtc gatgtcaaga 1080
taacacaact acaactacta agtctgaaga tgggcattat gcaagaacag attatgcaga 1140
gaatgctaac aaattagaag aaagtgccag agaacaccac ataccttgtc cggaacatta 1200
caatggcttc tgcatgcatg ggaagtgtga gcattctatc aatatgcagg agccatcttg 1260
caggtgtgat gctggttata ctggacaaca ctgtgaaaaa aaggactaca gtgttctata 1320
cgttgttccc ggtcctgtac gatttcagta tgtcttaatc gcagctgtga ttggaacaat 1380
tcagattgct gtcatctgtg tggtggtcct ctgcatcaca aggaaatgcc ccagaagcaa 1440
cagaattcac agacagaagc aaaatacagg gcactacagt tcagacaata caacaagagc 1500
gtccacgagg ttaatctaaa gggagcatgt ttcacagtgg ctggactacc gagagcttgg 1560
actacacaat acagtattat agacaaaaga ataagacaag agatctacac atgttgcctt 1620
gcatttgtgg taatctacac caatgaaaac atgtactaca gctatatttg attatgtatg 1680
gatatatttg aaatagtata cattgtcttg atgttttttc tgtaatgtaa ataaactatt 1740
tatatcacac aatatagttt tttctttccc atgtatttgt tatatataat aaatactcag 1800
tgatgagaaa aaaa 1814
<210> 117
<211> 374
<212> PRT
<213> 智人
<400> 117
Met Val Leu Trp Glu Ser Pro Arg Gln Cys Ser Ser Trp Thr Leu Cys
1 5 10 15
Glu Gly Phe Cys Trp Leu Leu Leu Leu Pro Val Met Leu Leu Ile Val
20 25 30
Ala Arg Pro Val Lys Leu Ala Ala Phe Pro Thr Ser Leu Ser Asp Cys
35 40 45
Gln Thr Pro Thr Gly Trp Asn Cys Ser Gly Tyr Asp Asp Arg Glu Asn
50 55 60
Asp Leu Phe Leu Cys Asp Thr Asn Thr Cys Lys Phe Asp Gly Glu Cys
65 70 75 80
Leu Arg Ile Gly Asp Thr Val Thr Cys Val Cys Gln Phe Lys Cys Asn
85 90 95
Asn Asp Tyr Val Pro Val Cys Gly Ser Asn Gly Glu Ser Tyr Gln Asn
100 105 110
Glu Cys Tyr Leu Arg Gln Ala Ala Cys Lys Gln Gln Ser Glu Ile Leu
115 120 125
Val Val Ser Glu Gly Ser Cys Ala Thr Asp Ala Gly Ser Gly Ser Gly
130 135 140
Asp Gly Val His Glu Gly Ser Gly Glu Thr Ser Gln Lys Glu Thr Ser
145 150 155 160
Thr Cys Asp Ile Cys Gln Phe Gly Ala Glu Cys Asp Glu Asp Ala Glu
165 170 175
Asp Val Trp Cys Val Cys Asn Ile Asp Cys Ser Gln Thr Asn Phe Asn
180 185 190
Pro Leu Cys Ala Ser Asp Gly Lys Ser Tyr Asp Asn Ala Cys Gln Ile
195 200 205
Lys Glu Ala Ser Cys Gln Lys Gln Glu Lys Ile Glu Val Met Ser Leu
210 215 220
Gly Arg Cys Gln Asp Asn Thr Thr Thr Thr Thr Lys Ser Glu Asp Gly
225 230 235 240
His Tyr Ala Arg Thr Asp Tyr Ala Glu Asn Ala Asn Lys Leu Glu Glu
245 250 255
Ser Ala Arg Glu His His Ile Pro Cys Pro Glu His Tyr Asn Gly Phe
260 265 270
Cys Met His Gly Lys Cys Glu His Ser Ile Asn Met Gln Glu Pro Ser
275 280 285
Cys Arg Cys Asp Ala Gly Tyr Thr Gly Gln His Cys Glu Lys Lys Asp
290 295 300
Tyr Ser Val Leu Tyr Val Val Pro Gly Pro Val Arg Phe Gln Tyr Val
305 310 315 320
Leu Ile Ala Ala Val Ile Gly Thr Ile Gln Ile Ala Val Ile Cys Val
325 330 335
Val Val Leu Cys Ile Thr Arg Lys Cys Pro Arg Ser Asn Arg Ile His
340 345 350
Arg Gln Lys Gln Asn Thr Gly His Tyr Ser Ser Asp Asn Thr Thr Arg
355 360 365
Ala Ser Thr Arg Leu Ile
370
<210> 118
<211> 2653
<212> DNA
<213> 智人
<400> 118
ctcaaaaggg gccggatttc cttctcctgg aggcagatgt tgcctctctc tctcgctcgg 60
attggttcag tgcactctag aaacactgct gtggtggaga aactggaccc caggtctgga 120
gcgaattcca gcctgcaggg ctgataagcg aggcattagt gagattgaga gagactttac 180
cccgccgtgg tggttggagg gcgcgcagta gagcagcagc acaggcgcgg gtcccgggag 240
gccggctctg ctcgcgccga gatgtggaat ctccttcacg aaaccgactc ggctgtggcc 300
accgcgcgcc gcccgcgctg gctgtgcgct ggggcgctgg tgctggcggg tggcttcttt 360
ctcctcggct tcctcttcgg gtggtttata aaatcctcca atgaagctac taacattact 420
ccaaagcata atatgaaagc atttttggat gaattgaaag ctgagaacat caagaagttc 480
ttatataatt ttacacagat accacattta gcaggaacag aacaaaactt tcagcttgca 540
aagcaaattc aatcccagtg gaaagaattt ggcctggatt ctgttgagct agcacattat 600
gatgtcctgt tgtcctaccc aaataagact catcccaact acatctcaat aattaatgaa 660
gatggaaatg agattttcaa cacatcatta tttgaaccac ctcctccagg atatgaaaat 720
gtttcggata ttgtaccacc tttcagtgct ttctctcctc aaggaatgcc agagggcgat 780
ctagtgtatg ttaactatgc acgaactgaa gacttcttta aattggaacg ggacatgaaa 840
atcaattgct ctgggaaaat tgtaattgcc agatatggga aagttttcag aggaaataag 900
gttaaaaatg cccagctggc aggggccaaa ggagtcattc tctactccga ccctgctgac 960
tactttgctc ctggggtgaa gtcctatcca gatggttgga atcttcctgg aggtggtgtc 1020
cagcgtggaa atatcctaaa tctgaatggt gcaggagacc ctctcacacc aggttaccca 1080
gcaaatgaat atgcttatag gcgtggaatt gcagaggctg ttggtcttcc aagtattcct 1140
gttcatccaa ttggatacta tgatgcacag aagctcctag aaaaaatggg tggctcagca 1200
ccaccagata gcagctggag aggaagtctc aaagtgccct acaatgttgg acctggcttt 1260
actggaaact tttctacaca aaaagtcaag atgcacatcc actctaccaa tgaagtgaca 1320
agaatttaca atgtgatagg tactctcaga ggagcagtgg aaccagacag atatgtcatt 1380
ctgggaggtc accgggactc atgggtgttt ggtggtattg accctcagag tggagcagct 1440
gttgttcatg aaattgtgag gagctttgga acactgaaaa aggaagggtg gagacctaga 1500
agaacaattt tgtttgcaag ctgggatgca gaagaatttg gtcttcttgg ttctactgag 1560
tgggcagagg agaattcaag actccttcaa gagcgtggcg tggcttatat taatgctgac 1620
tcatctatag aaggaaacta cactctgaga gttgattgta caccgctgat gtacagcttg 1680
gtacacaacc taacaaaaga gctgaaaagc cctgatgaag gctttgaagg caaatctctt 1740
tatgaaagtt ggactaaaaa aagtccttcc ccagagttca gtggcatgcc caggataagc 1800
aaattgggat ctggaaatga ttttgaggtg ttcttccaac gacttggaat tgcttcaggc 1860
agagcacggt atactaaaaa ttgggaaaca aacaaattca gcggctatcc actgtatcac 1920
agtgtctatg aaacatatga gttggtggaa aagttttatg atccaatgtt taaatatcac 1980
ctcactgtgg cccaggttcg aggagggatg gtgtttgagc tagccaattc catagtgctc 2040
ccttttgatt gtcgagatta tgctgtagtt ttaagaaagt atgctgacaa aatctacagt 2100
atttctatga aacatccaca ggaaatgaag acatacagtg tatcatttga ttcacttttt 2160
tctgcagtaa agaattttac agaaattgct tccaagttca gtgagagact ccaggacttt 2220
gacaaaagca acccaatagt attaagaatg atgaatgatc aactcatgtt tctggaaaga 2280
gcatttattg atccattagg gttaccagac aggccttttt ataggcatgt catctatgct 2340
ccaagcagcc acaacaagta tgcaggggag tcattcccag gaatttatga tgctctgttt 2400
gatattgaaa gcaaagtgga cccttccaag gcctggggag aagtgaagag acagatttat 2460
gttgcagcct tcacagtgca ggcagctgca gagactttga gtgaagtagc ctaagaggat 2520
tctttagaga atccgtattg aatttgtgtg gtatgtcact cagaaagaat cgtaatgggt 2580
atattgataa attttaaaat tggtatattt gaaataaagt tgaatattat atataaaaaa 2640
aaaaaaaaaa aaa 2653
<210> 119
<211> 750
<212> PRT
<213> 智人
<400> 119
Met Trp Asn Leu Leu His Glu Thr Asp Ser Ala Val Ala Thr Ala Arg
1 5 10 15
Arg Pro Arg Trp Leu Cys Ala Gly Ala Leu Val Leu Ala Gly Gly Phe
20 25 30
Phe Leu Leu Gly Phe Leu Phe Gly Trp Phe Ile Lys Ser Ser Asn Glu
35 40 45
Ala Thr Asn Ile Thr Pro Lys His Asn Met Lys Ala Phe Leu Asp Glu
50 55 60
Leu Lys Ala Glu Asn Ile Lys Lys Phe Leu Tyr Asn Phe Thr Gln Ile
65 70 75 80
Pro His Leu Ala Gly Thr Glu Gln Asn Phe Gln Leu Ala Lys Gln Ile
85 90 95
Gln Ser Gln Trp Lys Glu Phe Gly Leu Asp Ser Val Glu Leu Ala His
100 105 110
Tyr Asp Val Leu Leu Ser Tyr Pro Asn Lys Thr His Pro Asn Tyr Ile
115 120 125
Ser Ile Ile Asn Glu Asp Gly Asn Glu Ile Phe Asn Thr Ser Leu Phe
130 135 140
Glu Pro Pro Pro Pro Gly Tyr Glu Asn Val Ser Asp Ile Val Pro Pro
145 150 155 160
Phe Ser Ala Phe Ser Pro Gln Gly Met Pro Glu Gly Asp Leu Val Tyr
165 170 175
Val Asn Tyr Ala Arg Thr Glu Asp Phe Phe Lys Leu Glu Arg Asp Met
180 185 190
Lys Ile Asn Cys Ser Gly Lys Ile Val Ile Ala Arg Tyr Gly Lys Val
195 200 205
Phe Arg Gly Asn Lys Val Lys Asn Ala Gln Leu Ala Gly Ala Lys Gly
210 215 220
Val Ile Leu Tyr Ser Asp Pro Ala Asp Tyr Phe Ala Pro Gly Val Lys
225 230 235 240
Ser Tyr Pro Asp Gly Trp Asn Leu Pro Gly Gly Gly Val Gln Arg Gly
245 250 255
Asn Ile Leu Asn Leu Asn Gly Ala Gly Asp Pro Leu Thr Pro Gly Tyr
260 265 270
Pro Ala Asn Glu Tyr Ala Tyr Arg Arg Gly Ile Ala Glu Ala Val Gly
275 280 285
Leu Pro Ser Ile Pro Val His Pro Ile Gly Tyr Tyr Asp Ala Gln Lys
290 295 300
Leu Leu Glu Lys Met Gly Gly Ser Ala Pro Pro Asp Ser Ser Trp Arg
305 310 315 320
Gly Ser Leu Lys Val Pro Tyr Asn Val Gly Pro Gly Phe Thr Gly Asn
325 330 335
Phe Ser Thr Gln Lys Val Lys Met His Ile His Ser Thr Asn Glu Val
340 345 350
Thr Arg Ile Tyr Asn Val Ile Gly Thr Leu Arg Gly Ala Val Glu Pro
355 360 365
Asp Arg Tyr Val Ile Leu Gly Gly His Arg Asp Ser Trp Val Phe Gly
370 375 380
Gly Ile Asp Pro Gln Ser Gly Ala Ala Val Val His Glu Ile Val Arg
385 390 395 400
Ser Phe Gly Thr Leu Lys Lys Glu Gly Trp Arg Pro Arg Arg Thr Ile
405 410 415
Leu Phe Ala Ser Trp Asp Ala Glu Glu Phe Gly Leu Leu Gly Ser Thr
420 425 430
Glu Trp Ala Glu Glu Asn Ser Arg Leu Leu Gln Glu Arg Gly Val Ala
435 440 445
Tyr Ile Asn Ala Asp Ser Ser Ile Glu Gly Asn Tyr Thr Leu Arg Val
450 455 460
Asp Cys Thr Pro Leu Met Tyr Ser Leu Val His Asn Leu Thr Lys Glu
465 470 475 480
Leu Lys Ser Pro Asp Glu Gly Phe Glu Gly Lys Ser Leu Tyr Glu Ser
485 490 495
Trp Thr Lys Lys Ser Pro Ser Pro Glu Phe Ser Gly Met Pro Arg Ile
500 505 510
Ser Lys Leu Gly Ser Gly Asn Asp Phe Glu Val Phe Phe Gln Arg Leu
515 520 525
Gly Ile Ala Ser Gly Arg Ala Arg Tyr Thr Lys Asn Trp Glu Thr Asn
530 535 540
Lys Phe Ser Gly Tyr Pro Leu Tyr His Ser Val Tyr Glu Thr Tyr Glu
545 550 555 560
Leu Val Glu Lys Phe Tyr Asp Pro Met Phe Lys Tyr His Leu Thr Val
565 570 575
Ala Gln Val Arg Gly Gly Met Val Phe Glu Leu Ala Asn Ser Ile Val
580 585 590
Leu Pro Phe Asp Cys Arg Asp Tyr Ala Val Val Leu Arg Lys Tyr Ala
595 600 605
Asp Lys Ile Tyr Ser Ile Ser Met Lys His Pro Gln Glu Met Lys Thr
610 615 620
Tyr Ser Val Ser Phe Asp Ser Leu Phe Ser Ala Val Lys Asn Phe Thr
625 630 635 640
Glu Ile Ala Ser Lys Phe Ser Glu Arg Leu Gln Asp Phe Asp Lys Ser
645 650 655
Asn Pro Ile Val Leu Arg Met Met Asn Asp Gln Leu Met Phe Leu Glu
660 665 670
Arg Ala Phe Ile Asp Pro Leu Gly Leu Pro Asp Arg Pro Phe Tyr Arg
675 680 685
His Val Ile Tyr Ala Pro Ser Ser His Asn Lys Tyr Ala Gly Glu Ser
690 695 700
Phe Pro Gly Ile Tyr Asp Ala Leu Phe Asp Ile Glu Ser Lys Val Asp
705 710 715 720
Pro Ser Lys Ala Trp Gly Glu Val Lys Arg Gln Ile Tyr Val Ala Ala
725 730 735
Phe Thr Val Gln Ala Ala Ala Glu Thr Leu Ser Glu Val Ala
740 745 750
<210> 120
<211> 2996
<212> DNA
<213> 智人
<400> 120
cgcagccacc catgcgcgcg cgctcgcaag accaccagcg cccagagccc cagtctgagg 60
cttggcgccg ggggtctgcg ggcgagggga gctctctacg tgcgaggggc tagcgggagc 120
cggcacaaga gggtcgagga gccaggaacc ccaaacgtcc ggcgccaggc gctagccaag 180
ctgctgcgcg ccccggcgcc cagctggctc ggggacagcc gctgggtgtc ggagaccgga 240
gctagcggat tgcagcggaa aagcaaagat gtcacactgg atccttggcc tccagggtcc 300
attaaggtga gaataagatc tctgggctgg ctggaactag cctaagactg aaaagcagcc 360
atggacatgg cggatgagcc actcaatgga agccacacat ggctatccat tccatttgac 420
ctcaatggct ctgtggtgtc aaccaacacc tcaaaccaga cagagccgta ctatgacctg 480
acaagcaatg cagtcctcac attcatctat tttgtggtct gcatcattgg gttgtgtggc 540
aacacacttg tcatttatgt catcctccgc tatgccaaga tgaagaccat caccaacatt 600
tacatcctca acctggccat cgcagatgag ctcttcatgc tgggtctgcc tttcttggct 660
atgcaggtgg ctctggtcca ctggcccttt ggcaaggcca tttgccgggt ggtcatgact 720
gtggatggca tcaatcagtt caccagcatc ttctgcctga cagtcatgag catcgaccga 780
tacctggctg tggtccaccc catcaagtcg gccaagtgga ggagaccccg gacggccaag 840
atgatcacca tggctgtgtg gggagtctct ctgctggtca tcttgcccat catgatatat 900
gctgggctcc ggagcaacca gtgggggaga agcagctgca ccatcaactg gccaggtgaa 960
tctggggctt ggtacacagg gttcatcatc tacactttca ttctggggtt cctggtaccc 1020
ctcaccatca tctgtctttg ctacctgttc attatcatca aggtgaagtc ctctggaatc 1080
cgagtgggct cctctaagag gaagaagtct gagaagaagg tcacccgaat ggtgtccatc 1140
gtggtggctg tcttcatctt ctgctggctt cccttctaca tattcaacgt ttcttccgtc 1200
tccatggcca tcagccccac cccagccctt aaaggcatgt ttgactttgt ggtggtcctc 1260
acctatgcta acagctgtgc caaccctatc ctatatgcct tcttgtctga caacttcaag 1320
aagagcttcc agaatgtcct ctgcttggtc aaggtgagcg gcacagatga tggggagcgg 1380
agtgacagta agcaggacaa atcccggctg aatgagacca cggagaccca gaggaccctc 1440
ctcaatggag acctccaaac cagtatctga actgcttggg gggtgggaaa gaaccaagcc 1500
atgctctgtc tactggcaat gggctcccta cccacactgg cttcctgcct cccacccctc 1560
acacctggct tctagaatag aggattgctc agcatgagtc caattcagag aacggtgttt 1620
gagtcagctt gtctgattga atgataatgt gctaaattga ttacctcccc cttaaagcga 1680
acactgaaat gcaggtagac aattcaaagt ctggagaaga gggatcatgc ctggatatga 1740
tctttagaaa caacaaaaat agaaaaaaat aagtatctgt gtgtttgtgt attgaaaact 1800
caatatgtaa tcttgtgttt ttatatgtat acttgtatat tcctatttat tctctgtata 1860
ggcattacct acgttcctgt gtttacatac acaagtagca aattcgagta tgcatagtgt 1920
agatggacat ttgccacaac acactgcccg cagaaatgga cttaccgtga agccaataaa 1980
gttcaagctt cagggatctc tcttgcacgg gccttgccaa ggcccaggag ggacttgggc 2040
agtatgttca tgtggtcata tgtttttgta aaaaattgtg aaagtaagat atgtttgtat 2100
tgtttttctt aaagaggaac ctcgtataag cttcaagcct cacaaacctt ctagcctctg 2160
cccttgggga tttgcttcat taatttcagg caagtgaggt caatgtaaga agggaaaggg 2220
agaagatatt tgaagaacca gaatgtaaat tcatgtgttt ccacttctca gatatagtca 2280
gagaattatt catttgccca aaaggactta agtggttgtg gtcatccatc attgtattta 2340
tcaagacaaa gccaactttg ttataagatt gcattttttt cttttcaaat tgctttagtt 2400
tttcttaggg agctatgagg gggaaaaatc actaacatga aaggcaaaaa atggactatg 2460
attcctgtgg ggaaacaatt tcattctctc catcgtgaaa ataagtgaat aagagtgaag 2520
caaaattaca cctttatgag aaaccataaa attgttttta tttttcaggc cagacatagc 2580
ttcctaatga aagaaaatgg aaatgtaatt cgacgactcc tcaaagggga ctttagagga 2640
cttcatacaa agctgggcat taagaaaacc acaatgcatg gccgggcgtg gtggcttaca 2700
cctgtaatcc cagcactttg ggaggccgag gtgggtggat cacccgaggt caggagttcg 2760
agaccagcct ggccaacatg gtgaaacccc atcactacta aaaatatgta aattagtcgg 2820
gcgtggtgtc acgtgcctgt aatcctagct gctcgggagg ctgaggcagg agaatcactt 2880
gaacttggga ggtggaggtt gcagtaagct gagattgtgc cactgcactc tagcctgagc 2940
aacaagagca aaactcagtc tcaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 2996
<210> 121
<211> 369
<212> PRT
<213> 智人
<400> 121
Met Asp Met Ala Asp Glu Pro Leu Asn Gly Ser His Thr Trp Leu Ser
1 5 10 15
Ile Pro Phe Asp Leu Asn Gly Ser Val Val Ser Thr Asn Thr Ser Asn
20 25 30
Gln Thr Glu Pro Tyr Tyr Asp Leu Thr Ser Asn Ala Val Leu Thr Phe
35 40 45
Ile Tyr Phe Val Val Cys Ile Ile Gly Leu Cys Gly Asn Thr Leu Val
50 55 60
Ile Tyr Val Ile Leu Arg Tyr Ala Lys Met Lys Thr Ile Thr Asn Ile
65 70 75 80
Tyr Ile Leu Asn Leu Ala Ile Ala Asp Glu Leu Phe Met Leu Gly Leu
85 90 95
Pro Phe Leu Ala Met Gln Val Ala Leu Val His Trp Pro Phe Gly Lys
100 105 110
Ala Ile Cys Arg Val Val Met Thr Val Asp Gly Ile Asn Gln Phe Thr
115 120 125
Ser Ile Phe Cys Leu Thr Val Met Ser Ile Asp Arg Tyr Leu Ala Val
130 135 140
Val His Pro Ile Lys Ser Ala Lys Trp Arg Arg Pro Arg Thr Ala Lys
145 150 155 160
Met Ile Thr Met Ala Val Trp Gly Val Ser Leu Leu Val Ile Leu Pro
165 170 175
Ile Met Ile Tyr Ala Gly Leu Arg Ser Asn Gln Trp Gly Arg Ser Ser
180 185 190
Cys Thr Ile Asn Trp Pro Gly Glu Ser Gly Ala Trp Tyr Thr Gly Phe
195 200 205
Ile Ile Tyr Thr Phe Ile Leu Gly Phe Leu Val Pro Leu Thr Ile Ile
210 215 220
Cys Leu Cys Tyr Leu Phe Ile Ile Ile Lys Val Lys Ser Ser Gly Ile
225 230 235 240
Arg Val Gly Ser Ser Lys Arg Lys Lys Ser Glu Lys Lys Val Thr Arg
245 250 255
Met Val Ser Ile Val Val Ala Val Phe Ile Phe Cys Trp Leu Pro Phe
260 265 270
Tyr Ile Phe Asn Val Ser Ser Val Ser Met Ala Ile Ser Pro Thr Pro
275 280 285
Ala Leu Lys Gly Met Phe Asp Phe Val Val Val Leu Thr Tyr Ala Asn
290 295 300
Ser Cys Ala Asn Pro Ile Leu Tyr Ala Phe Leu Ser Asp Asn Phe Lys
305 310 315 320
Lys Ser Phe Gln Asn Val Leu Cys Leu Val Lys Val Ser Gly Thr Asp
325 330 335
Asp Gly Glu Arg Ser Asp Ser Lys Gln Asp Lys Ser Arg Leu Asn Glu
340 345 350
Thr Thr Glu Thr Gln Arg Thr Leu Leu Asn Gly Asp Leu Gln Thr Ser
355 360 365
Ile
<210> 122
<211> 1245
<212> DNA
<213> 智人
<400> 122
cgtcctccct tcttctcttg cagagcctga cgcaccccag ggctgccgcc atggagcccc 60
tgttcccagc ctccacgccc agctggaacg cctcctcccc gggggctgcc tctggaggcg 120
gtgacaacag gacgctggtg gggccggcgc cctcggcagg ggcccgggcg gtgctggtgc 180
ccgtgctgta cctgctggtg tgtgcggccg ggctgggcgg gaacacgctg gtcatctacg 240
tggtgctgcg cttcgccaag atgaagaccg tcaccaacat ctacattctc aacctggcag 300
tggccgacgt cctgtacatg ctggggctgc ctttcctggc cacgcagaac gccgcgtcct 360
tctggccctt cggccccgtc ctgtgccgcc tggtcatgac gctggacggc gtcaaccagt 420
tcaccagtgt cttctgcctg acagtcatga gcgtggaccg ctacctggca gtggtgcacc 480
cgctgagctc ggcccgctgg cgccgcccgc gtgtggccaa gctggcgagc gccgcggcct 540
gggtcctgtc tctgtgcatg tcgctgccgc tcctggtgtt cgcggacgtg caggagggcg 600
gtacctgcaa cgccagctgg ccggagcccg tggggctgtg gggcgccgtc ttcatcatct 660
acacggccgt gctgggcttc ttcgcgccgc tgctggtcat ctgcctgtgc tacctgctca 720
tcgtggtgaa ggtgagggcg gcgggcgtgc gcgtgggctg cgtgcggcgg cgctcggagc 780
ggaaggtgac gcgcatggtg ttggtggtgg tgctggtgtt tgcgggatgt tggctgccct 840
tcttcaccgt caacatcgtc aacctggccg tggcgctgcc ccaggagccc gcctccgccg 900
gcctctactt cttcgtggtc atcctctcct acgccaacag ctgtgccaac cccgtcctct 960
acggcttcct ctctgacaac ttccgccaga gcttccagaa ggttctgtgc ctccgcaagg 1020
gctctggtgc caaggacgct gacgccacgg agccgcgtcc agacaggatc cggcagcagc 1080
aggaggccac gccgcccgcg caccgcgccg cagccaacgg gcttatgcag accagcaagc 1140
tgtgagagtg caggcggggg gtgggcggcc ccgtgtcacc cccaggagcg gaggttgcac 1200
tgcggtgacc cccacccatg acctgccagt caggatgctc cccgg 1245
<210> 123
<211> 364
<212> PRT
<213> 智人
<400> 123
Met Glu Pro Leu Phe Pro Ala Ser Thr Pro Ser Trp Asn Ala Ser Ser
1 5 10 15
Pro Gly Ala Ala Ser Gly Gly Gly Asp Asn Arg Thr Leu Val Gly Pro
20 25 30
Ala Pro Ser Ala Gly Ala Arg Ala Val Leu Val Pro Val Leu Tyr Leu
35 40 45
Leu Val Cys Ala Ala Gly Leu Gly Gly Asn Thr Leu Val Ile Tyr Val
50 55 60
Val Leu Arg Phe Ala Lys Met Lys Thr Val Thr Asn Ile Tyr Ile Leu
65 70 75 80
Asn Leu Ala Val Ala Asp Val Leu Tyr Met Leu Gly Leu Pro Phe Leu
85 90 95
Ala Thr Gln Asn Ala Ala Ser Phe Trp Pro Phe Gly Pro Val Leu Cys
100 105 110
Arg Leu Val Met Thr Leu Asp Gly Val Asn Gln Phe Thr Ser Val Phe
115 120 125
Cys Leu Thr Val Met Ser Val Asp Arg Tyr Leu Ala Val Val His Pro
130 135 140
Leu Ser Ser Ala Arg Trp Arg Arg Pro Arg Val Ala Lys Leu Ala Ser
145 150 155 160
Ala Ala Ala Trp Val Leu Ser Leu Cys Met Ser Leu Pro Leu Leu Val
165 170 175
Phe Ala Asp Val Gln Glu Gly Gly Thr Cys Asn Ala Ser Trp Pro Glu
180 185 190
Pro Val Gly Leu Trp Gly Ala Val Phe Ile Ile Tyr Thr Ala Val Leu
195 200 205
Gly Phe Phe Ala Pro Leu Leu Val Ile Cys Leu Cys Tyr Leu Leu Ile
210 215 220
Val Val Lys Val Arg Ala Ala Gly Val Arg Val Gly Cys Val Arg Arg
225 230 235 240
Arg Ser Glu Arg Lys Val Thr Arg Met Val Leu Val Val Val Leu Val
245 250 255
Phe Ala Gly Cys Trp Leu Pro Phe Phe Thr Val Asn Ile Val Asn Leu
260 265 270
Ala Val Ala Leu Pro Gln Glu Pro Ala Ser Ala Gly Leu Tyr Phe Phe
275 280 285
Val Val Ile Leu Ser Tyr Ala Asn Ser Cys Ala Asn Pro Val Leu Tyr
290 295 300
Gly Phe Leu Ser Asp Asn Phe Arg Gln Ser Phe Gln Lys Val Leu Cys
305 310 315 320
Leu Arg Lys Gly Ser Gly Ala Lys Asp Ala Asp Ala Thr Glu Pro Arg
325 330 335
Pro Asp Arg Ile Arg Gln Gln Gln Glu Ala Thr Pro Pro Ala His Arg
340 345 350
Ala Ala Ala Asn Gly Leu Met Gln Thr Ser Lys Leu
355 360
<210> 124
<211> 5717
<212> DNA
<213> 智人
<400> 124
ggctaccgct cccggcttgg cgtcccgcgc gcacttcggc gatggctttt ccgccgcggc 60
gacggctgcg cctcggtccc cgcggcctcc cgcttcttct ctcgggactc ctgctacctc 120
tgtgccgcgc cttcaaccta gacgtggaca gtcctgccga gtactctggc cccgagggaa 180
gttacttcgg cttcgccgtg gatttcttcg tgcccagcgc gtcttcccgg atgtttcttc 240
tcgtgggagc tcccaaagca aacaccaccc agcctgggat tgtggaagga gggcaggtcc 300
tcaaatgtga ctggtcttct acccgccggt gccagccaat tgaatttgat gcaacaggca 360
atagagatta tgccaaggat gatccattgg aatttaagtc ccatcagtgg tttggagcat 420
ctgtgaggtc gaaacaggat aaaattttgg cctgtgcccc attgtaccat tggagaactg 480
agatgaaaca ggagcgagag cctgttggaa catgctttct tcaagatgga acaaagactg 540
ttgagtatgc tccatgtaga tcacaagata ttgatgctga tggacaggga ttttgtcaag 600
gaggattcag cattgatttt actaaagctg acagagtact tcttggtggt cctggtagct 660
tttattggca aggtcagctt atttcggatc aagtggcaga aatcgtatct aaatacgacc 720
ccaatgttta cagcatcaag tataataacc aattagcaac tcggactgca caagctattt 780
ttgatgacag ctatttgggt tattctgtgg ctgtcggaga tttcaatggt gatggcatag 840
atgactttgt ttcaggagtt ccaagagcag caaggacttt gggaatggtt tatatttatg 900
atgggaagaa catgtcctcc ttatacaatt ttactggcga gcagatggct gcatatttcg 960
gattttctgt agctgccact gacattaatg gagatgatta tgcagatgtg tttattggag 1020
cacctctctt catggatcgt ggctctgatg gcaaactcca agaggtgggg caggtctcag 1080
tgtctctaca gagagcttca ggagacttcc agacgacaaa gctgaatgga tttgaggtct 1140
ttgcacggtt tggcagtgcc atagctcctt tgggagatct ggaccaggat ggtttcaatg 1200
atattgcaat tgctgctcca tatgggggtg aagataaaaa aggaattgtt tatatcttca 1260
atggaagatc aacaggcttg aacgcagtcc catctcaaat ccttgaaggg cagtgggctg 1320
ctcgaagcat gccaccaagc tttggctatt caatgaaagg agccacagat atagacaaaa 1380
atggatatcc agacttaatt gtaggagctt ttggtgtaga tcgagctatc ttatacaggg 1440
ccagaccagt tatcactgta aatgctggtc ttgaagtgta ccctagcatt ttaaatcaag 1500
acaataaaac ctgctcactg cctggaacag ctctcaaagt ttcctgtttt aatgttaggt 1560
tctgcttaaa ggcagatggc aaaggagtac ttcccaggaa acttaatttc caggtggaac 1620
ttcttttgga taaactcaag caaaagggag caattcgacg agcactgttt ctctacagca 1680
ggtccccaag tcactccaag aacatgacta tttcaagggg gggactgatg cagtgtgagg 1740
aattgatagc gtatctgcgg gatgaatctg aatttagaga caaactcact ccaattacta 1800
tttttatgga atatcggttg gattatagaa cagctgctga tacaacaggc ttgcaaccca 1860
ttcttaacca gttcacgcct gctaacatta gtcgacaggc tcacattcta cttgactgtg 1920
gtgaagacaa tgtctgtaaa cccaagctgg aagtttctgt agatagtgat caaaagaaga 1980
tctatattgg ggatgacaac cctctgacat tgattgttaa ggctcagaat caaggagaag 2040
gtgcctacga agctgagctc atcgtttcca ttccactgca ggctgatttc atcggggttg 2100
tccgaaacaa tgaagcctta gcaagacttt cctgtgcatt taagacagaa aaccaaactc 2160
gccaggtggt atgtgacctt ggaaacccaa tgaaggctgg aactcaactc ttagctggtc 2220
ttcgtttcag tgtgcaccag cagtcagaga tggatacttc tgtgaaattt gacttacaaa 2280
tccaaagctc aaatctattt gacaaagtaa gcccagttgt atctcacaaa gttgatcttg 2340
ctgttttagc tgcagttgag ataagaggag tctcgagtcc tgatcatatc tttcttccga 2400
ttccaaactg ggagcacaag gagaaccctg agactgaaga agatgttggg ccagttgttc 2460
agcacatcta tgagctgaga aacaatggtc caagttcatt cagcaaggca atgctccatc 2520
ttcagtggcc ttacaaatat aataataaca ctctgttgta tatccttcat tatgatattg 2580
atggaccaat gaactgcact tcagatatgg agatcaaccc tttgagaatt aagatctcat 2640
ctttgcaaac aactgaaaag aatgacacgg ttgccgggca aggtgagcgg gaccatctca 2700
tcactaagcg ggatcttgcc ctcagtgaag gagatattca cactttgggt tgtggagttg 2760
ctcagtgctt gaagattgtc tgccaagttg ggagattaga cagaggaaag agtgcaatct 2820
tgtacgtaaa gtcattactg tggactgaga cttttatgaa taaagaaaat cagaatcatt 2880
cctattctct gaagtcgtct gcttcattta atgtcataga gtttccttat aagaatcttc 2940
caattgagga tatcaccaac tccacattgg ttaccactaa tgtcacctgg ggcattcagc 3000
cagcgcccat gcctgtgcct gtgtgggtga tcattttagc agttctagca ggattgttgc 3060
tactggctgt tttggtattt gtaatgtaca ggatgggctt ttttaaacgg gtccggccac 3120
ctcaagaaga acaagaaagg gagcagcttc aacctcatga aaatggtgaa ggaaactcag 3180
aaacttaact gcagttttta agttatgcta catcttgacc cactagaatt agcaacttta 3240
ttatagattt aaactttctt catgaggagt aaaaatccaa ggctttactg ctgatagtgc 3300
taattggcat taaccacaaa atgagaatta tatttgtcaa ccttctcctt ataaataagt 3360
tcagacatac atttaataac atagggtgac ttgtgttttt aggtatttaa ataataaaat 3420
ttcaagggat agtttttatt caatgtatat aagacaggta gtgcctgatt tactacttta 3480
tataaaatag tacctccttc agttactgtt tctgatttaa tgtacggaac tttatttgtt 3540
gttgttgttg ttgttgttgt tgttgtttta aagcagtcca aatttggacc ttagcaatca 3600
tgtcttttgt ataggtactt aatgttaata catattacac tacagtttac ttttcagaat 3660
actaaagact ttataactgc atgaacttgg atttttttaa tcactcatat ggtagaattt 3720
tataaacaca tacatgatac catccaaatt cttgctttta ataacaaagg tacaatattt 3780
tgttttagta tgaaaatctg gtagatccta ttacacttct gtttatatta aatccacaat 3840
attttattac atttttaact tgtataaatt ttaggtcaaa tccttcaagc caacctatac 3900
taaaaattag ttccataatc acaaatggct cttttgtgta attgtttaat ttcacctgaa 3960
tatcataatg cttaaagcca tatggagttg gaaattattt ccaaagcata tttattccat 4020
tgttttagtc tggctattta cagtataaaa aaagcatttt attaaaatac tgtgtagttc 4080
tttgagatag ttgcttatgc atatagtaag tattacattc ttagagtaga gcagagtttt 4140
tagttagtat taatttattt tcctccattc atgtactttt ccttatattt ccaaaactgt 4200
tactgagaat gggtcaagat cagtgagaaa tctttacagt tgacaggaac ctggacccct 4260
taccccaact ttatgagtaa tgcttggaat aaaaaactct taaggcaact cactgattta 4320
cttctagcaa tagcatgatg ttacaggaat attacctctg tttaagcaag gtaatgtgta 4380
aaatcagtct cggctgtcag aataacttct aaaaggtatt tttataagca gttcaagtta 4440
ctgaaaacct tttaaacctt tctgaagttc gttagtataa attacttttc taggattatt 4500
aataaaagcc acataggtgg caagttgtag ttttatatgg ctctgtagag tggtgaacct 4560
tctagaggaa tatatgattt attcacagtt cctcaaggcc tggggatgat gatcagttat 4620
acctattttt gtgcaattac atcatgttgt acattagaaa tggagagttt aatagctctt 4680
taactgctgt cctcattagg taatgataaa tatttccctt aaataattga ctattttgct 4740
gtgttttaaa aatgattgaa atttatcttg ccatatctca taatttcatg cacaagttga 4800
ctgagctaat cttgagaata tattcgtaaa ataggagcac atttagttga ggtatacaag 4860
gtaggactct agacaaaacc ttctatttta gctttagtga atttcaaaag taatgggtct 4920
tggagtatag atttttatta gtagcttgaa agagcttaat catatgcagt aagtattttt 4980
attaccaata aatttaaaat tttttaagaa aaatattttt atcctagggc caagtgttgc 5040
ctgccaccaa tcagtaagtt agtctataac aaattttacc ctaacagttt taccacctag 5100
caacagtcat ttctgaaaat atgttggata gaaagtcact ctttggcaaa agtgttagaa 5160
tttgcttttg tgccatctat tccttttatg gcatctatct tgaaagtaat cttgtattgg 5220
agattgaaag atgctgtaat ttagaaatta acatgatatc ttaaattacc tttatgaaat 5280
atagttttgt ataatagcat agattttcct tcaaaaaatg aacatttata tatctacaaa 5340
aatatggaga agagcaattt gaaagcctac tttctgaaga aaatggtggg attttttttt 5400
atcatgatta aatatcaaaa aattgcccta tgaaaacttt aaatctctaa aacatttgaa 5460
atactaccat atttgtgatt tattgagaat aaaaatccat tttgaaatgt aaaattttta 5520
tgatctgatt cagttttaag aaaacatgaa tgaactagaa gatattaaaa acatttgaca 5580
ttggtaagaa atattgatac tgatattgat ttttatatag gtatttattt cagaattgat 5640
attttgagaa aaatacatgt gagtcatttt ttctgtttct cttttctctt aacgattatc 5700
actgtaattc tgaatct 5717
<210> 125
<211> 1048
<212> PRT
<213> 智人
<400> 125
Met Ala Phe Pro Pro Arg Arg Arg Leu Arg Leu Gly Pro Arg Gly Leu
1 5 10 15
Pro Leu Leu Leu Ser Gly Leu Leu Leu Pro Leu Cys Arg Ala Phe Asn
20 25 30
Leu Asp Val Asp Ser Pro Ala Glu Tyr Ser Gly Pro Glu Gly Ser Tyr
35 40 45
Phe Gly Phe Ala Val Asp Phe Phe Val Pro Ser Ala Ser Ser Arg Met
50 55 60
Phe Leu Leu Val Gly Ala Pro Lys Ala Asn Thr Thr Gln Pro Gly Ile
65 70 75 80
Val Glu Gly Gly Gln Val Leu Lys Cys Asp Trp Ser Ser Thr Arg Arg
85 90 95
Cys Gln Pro Ile Glu Phe Asp Ala Thr Gly Asn Arg Asp Tyr Ala Lys
100 105 110
Asp Asp Pro Leu Glu Phe Lys Ser His Gln Trp Phe Gly Ala Ser Val
115 120 125
Arg Ser Lys Gln Asp Lys Ile Leu Ala Cys Ala Pro Leu Tyr His Trp
130 135 140
Arg Thr Glu Met Lys Gln Glu Arg Glu Pro Val Gly Thr Cys Phe Leu
145 150 155 160
Gln Asp Gly Thr Lys Thr Val Glu Tyr Ala Pro Cys Arg Ser Gln Asp
165 170 175
Ile Asp Ala Asp Gly Gln Gly Phe Cys Gln Gly Gly Phe Ser Ile Asp
180 185 190
Phe Thr Lys Ala Asp Arg Val Leu Leu Gly Gly Pro Gly Ser Phe Tyr
195 200 205
Trp Gln Gly Gln Leu Ile Ser Asp Gln Val Ala Glu Ile Val Ser Lys
210 215 220
Tyr Asp Pro Asn Val Tyr Ser Ile Lys Tyr Asn Asn Gln Leu Ala Thr
225 230 235 240
Arg Thr Ala Gln Ala Ile Phe Asp Asp Ser Tyr Leu Gly Tyr Ser Val
245 250 255
Ala Val Gly Asp Phe Asn Gly Asp Gly Ile Asp Asp Phe Val Ser Gly
260 265 270
Val Pro Arg Ala Ala Arg Thr Leu Gly Met Val Tyr Ile Tyr Asp Gly
275 280 285
Lys Asn Met Ser Ser Leu Tyr Asn Phe Thr Gly Glu Gln Met Ala Ala
290 295 300
Tyr Phe Gly Phe Ser Val Ala Ala Thr Asp Ile Asn Gly Asp Asp Tyr
305 310 315 320
Ala Asp Val Phe Ile Gly Ala Pro Leu Phe Met Asp Arg Gly Ser Asp
325 330 335
Gly Lys Leu Gln Glu Val Gly Gln Val Ser Val Ser Leu Gln Arg Ala
340 345 350
Ser Gly Asp Phe Gln Thr Thr Lys Leu Asn Gly Phe Glu Val Phe Ala
355 360 365
Arg Phe Gly Ser Ala Ile Ala Pro Leu Gly Asp Leu Asp Gln Asp Gly
370 375 380
Phe Asn Asp Ile Ala Ile Ala Ala Pro Tyr Gly Gly Glu Asp Lys Lys
385 390 395 400
Gly Ile Val Tyr Ile Phe Asn Gly Arg Ser Thr Gly Leu Asn Ala Val
405 410 415
Pro Ser Gln Ile Leu Glu Gly Gln Trp Ala Ala Arg Ser Met Pro Pro
420 425 430
Ser Phe Gly Tyr Ser Met Lys Gly Ala Thr Asp Ile Asp Lys Asn Gly
435 440 445
Tyr Pro Asp Leu Ile Val Gly Ala Phe Gly Val Asp Arg Ala Ile Leu
450 455 460
Tyr Arg Ala Arg Pro Val Ile Thr Val Asn Ala Gly Leu Glu Val Tyr
465 470 475 480
Pro Ser Ile Leu Asn Gln Asp Asn Lys Thr Cys Ser Leu Pro Gly Thr
485 490 495
Ala Leu Lys Val Ser Cys Phe Asn Val Arg Phe Cys Leu Lys Ala Asp
500 505 510
Gly Lys Gly Val Leu Pro Arg Lys Leu Asn Phe Gln Val Glu Leu Leu
515 520 525
Leu Asp Lys Leu Lys Gln Lys Gly Ala Ile Arg Arg Ala Leu Phe Leu
530 535 540
Tyr Ser Arg Ser Pro Ser His Ser Lys Asn Met Thr Ile Ser Arg Gly
545 550 555 560
Gly Leu Met Gln Cys Glu Glu Leu Ile Ala Tyr Leu Arg Asp Glu Ser
565 570 575
Glu Phe Arg Asp Lys Leu Thr Pro Ile Thr Ile Phe Met Glu Tyr Arg
580 585 590
Leu Asp Tyr Arg Thr Ala Ala Asp Thr Thr Gly Leu Gln Pro Ile Leu
595 600 605
Asn Gln Phe Thr Pro Ala Asn Ile Ser Arg Gln Ala His Ile Leu Leu
610 615 620
Asp Cys Gly Glu Asp Asn Val Cys Lys Pro Lys Leu Glu Val Ser Val
625 630 635 640
Asp Ser Asp Gln Lys Lys Ile Tyr Ile Gly Asp Asp Asn Pro Leu Thr
645 650 655
Leu Ile Val Lys Ala Gln Asn Gln Gly Glu Gly Ala Tyr Glu Ala Glu
660 665 670
Leu Ile Val Ser Ile Pro Leu Gln Ala Asp Phe Ile Gly Val Val Arg
675 680 685
Asn Asn Glu Ala Leu Ala Arg Leu Ser Cys Ala Phe Lys Thr Glu Asn
690 695 700
Gln Thr Arg Gln Val Val Cys Asp Leu Gly Asn Pro Met Lys Ala Gly
705 710 715 720
Thr Gln Leu Leu Ala Gly Leu Arg Phe Ser Val His Gln Gln Ser Glu
725 730 735
Met Asp Thr Ser Val Lys Phe Asp Leu Gln Ile Gln Ser Ser Asn Leu
740 745 750
Phe Asp Lys Val Ser Pro Val Val Ser His Lys Val Asp Leu Ala Val
755 760 765
Leu Ala Ala Val Glu Ile Arg Gly Val Ser Ser Pro Asp His Ile Phe
770 775 780
Leu Pro Ile Pro Asn Trp Glu His Lys Glu Asn Pro Glu Thr Glu Glu
785 790 795 800
Asp Val Gly Pro Val Val Gln His Ile Tyr Glu Leu Arg Asn Asn Gly
805 810 815
Pro Ser Ser Phe Ser Lys Ala Met Leu His Leu Gln Trp Pro Tyr Lys
820 825 830
Tyr Asn Asn Asn Thr Leu Leu Tyr Ile Leu His Tyr Asp Ile Asp Gly
835 840 845
Pro Met Asn Cys Thr Ser Asp Met Glu Ile Asn Pro Leu Arg Ile Lys
850 855 860
Ile Ser Ser Leu Gln Thr Thr Glu Lys Asn Asp Thr Val Ala Gly Gln
865 870 875 880
Gly Glu Arg Asp His Leu Ile Thr Lys Arg Asp Leu Ala Leu Ser Glu
885 890 895
Gly Asp Ile His Thr Leu Gly Cys Gly Val Ala Gln Cys Leu Lys Ile
900 905 910
Val Cys Gln Val Gly Arg Leu Asp Arg Gly Lys Ser Ala Ile Leu Tyr
915 920 925
Val Lys Ser Leu Leu Trp Thr Glu Thr Phe Met Asn Lys Glu Asn Gln
930 935 940
Asn His Ser Tyr Ser Leu Lys Ser Ser Ala Ser Phe Asn Val Ile Glu
945 950 955 960
Phe Pro Tyr Lys Asn Leu Pro Ile Glu Asp Ile Thr Asn Ser Thr Leu
965 970 975
Val Thr Thr Asn Val Thr Trp Gly Ile Gln Pro Ala Pro Met Pro Val
980 985 990
Pro Val Trp Val Ile Ile Leu Ala Val Leu Ala Gly Leu Leu Leu Leu
995 1000 1005
Ala Val Leu Val Phe Val Met Tyr Arg Met Gly Phe Phe Lys Arg
1010 1015 1020
Val Arg Pro Pro Gln Glu Glu Gln Glu Arg Glu Gln Leu Gln Pro
1025 1030 1035
His Glu Asn Gly Glu Gly Asn Ser Glu Thr
1040 1045
<210> 126
<211> 2397
<212> DNA
<213> 智人
<400> 126
gcaagaactg aaacgaatgg ggattgaact gctttgcctg ttctttctat ttctaggaag 60
gaatgatcac gtacaaggtg gctgtgccct gggaggtgca gaaacctgtg aagactgcct 120
gcttattgga cctcagtgtg cctggtgtgc tcaggagaat tttactcatc catctggagt 180
tggcgaaagg tgtgataccc cagcaaacct tttagctaaa ggatgtcaat taaacttcat 240
cgaaaaccct gtctcccaag tagaaatact taaaaataag cctctcagtg taggcagaca 300
gaaaaatagt tctgacattg ttcagattgc gcctcaaagc ttgatcctta agttgagacc 360
aggtggtgcg cagactctgc aggtgcatgt ccgccagact gaggactacc cggtggattt 420
gtattacctc atggacctct ccgcctccat ggatgacgac ctcaacacaa taaaggagct 480
gggctcccgg ctttccaaag agatgtctaa attaaccagc aactttagac tgggcttcgg 540
atcttttgtg gaaaaacctg tatccccttt cgtgaaaaca acaccagaag aaattgccaa 600
cccttgcagt agtattccat acttctgttt acctacattt ggattcaagc acattttgcc 660
attgacaaat gatgctgaaa gattcaatga aattgtgaag aatcagaaaa tttctgctaa 720
tattgacaca cccgaaggtg gatttgatgc aattatgcaa gctgctgtgt gtaaggaaaa 780
aattggctgg cggaatgact ccctccacct cctggtcttt gtgagtgatg ctgattctca 840
ttttggaatg gacagcaaac tagcaggcat cgtcattcct aatgacgggc tctgtcactt 900
ggacagcaag aatgaatact ccatgtcaac tgtcttggaa tatccaacaa ttggacaact 960
cattgataaa ctggtacaaa acaacgtgtt attgatcttc gctgtaaccc aagaacaagt 1020
tcatttatat gagaattacg caaaacttat tcctggagct acagtaggtc tacttcagaa 1080
ggactccgga aacattctcc agctgatcat ctcagcttat gaagaactgc ggtctgaggt 1140
ggaactggaa gtattaggag acactgaagg actcaacttg tcatttacag ccatctgtaa 1200
caacggtacc ctcttccaac accaaaagaa atgctctcac atgaaagtgg gagacacagc 1260
ttccttcagc gtgactgtga atatcccaca ctgcgagaga agaagcaggc acattatcat 1320
aaagcctgtg gggctggggg atgccctgga attacttgtc agcccagaat gcaactgcga 1380
ctgtcagaaa gaagtggaag tgaacagctc caaatgtcac cacgggaacg gctctttcca 1440
gtgtggggtg tgtgcctgcc accctggcca catggggcct cgctgtgagt gtggcgagga 1500
catgctgagc acagattcct gcaaggaggc cccagatcat ccctcctgca gcggaagggg 1560
tgactgctac tgtgggcagt gtatctgcca cttgtctccc tatggaaaca tttatgggcc 1620
ttattgccag tgtgacaatt tctcctgcgt gagacacaaa gggctgctct gcggaggtaa 1680
cggcgactgt gactgtggtg aatgtgtgtg caggagcggc tggactggcg agtactgcaa 1740
ctgcaccacc agcacggact cctgcgtctc tgaagatgga gtgctctgca gcgggcgcgg 1800
ggactgtgtt tgtggcaagt gtgtttgcac aaaccctgga gcctcaggac caacctgtga 1860
acgatgtcct acctgtggtg acccctgtaa ctctaaacgg agctgcattg agtgccacct 1920
gtcagcagct ggccaagccc gagaagaatg tgtggacaag tgcaaactag ctggtgcgac 1980
catcagtgaa gaagaagatt tctcaaagga tggttctgtt tcctgctctc tgcaaggaga 2040
aaatgaatgt cttattacat tcctaataac tacagataat gaggggaaaa ccatcattca 2100
cagcatcaat gaaaaagatt gtccgaagcc tccaaacatt cccatgatca tgttaggggt 2160
ttccctggct attcttctca tcggggttgt cctactgtgc atctggaagc tactggtgtc 2220
atttcatgat cgtaaagaag ttgccaaatt tgaagcagaa cgatcaaaag ccaagtggca 2280
aacgggaacc aatccactct acagaggatc cacaagtact tttaaaaatg taacttataa 2340
acacagggaa aaacaaaagg tagacctttc cacagattgc tagaactact ttatgca 2397
<210> 127
<211> 788
<212> PRT
<213> 智人
<400> 127
Met Gly Ile Glu Leu Leu Cys Leu Phe Phe Leu Phe Leu Gly Arg Asn
1 5 10 15
Asp His Val Gln Gly Gly Cys Ala Leu Gly Gly Ala Glu Thr Cys Glu
20 25 30
Asp Cys Leu Leu Ile Gly Pro Gln Cys Ala Trp Cys Ala Gln Glu Asn
35 40 45
Phe Thr His Pro Ser Gly Val Gly Glu Arg Cys Asp Thr Pro Ala Asn
50 55 60
Leu Leu Ala Lys Gly Cys Gln Leu Asn Phe Ile Glu Asn Pro Val Ser
65 70 75 80
Gln Val Glu Ile Leu Lys Asn Lys Pro Leu Ser Val Gly Arg Gln Lys
85 90 95
Asn Ser Ser Asp Ile Val Gln Ile Ala Pro Gln Ser Leu Ile Leu Lys
100 105 110
Leu Arg Pro Gly Gly Ala Gln Thr Leu Gln Val His Val Arg Gln Thr
115 120 125
Glu Asp Tyr Pro Val Asp Leu Tyr Tyr Leu Met Asp Leu Ser Ala Ser
130 135 140
Met Asp Asp Asp Leu Asn Thr Ile Lys Glu Leu Gly Ser Arg Leu Ser
145 150 155 160
Lys Glu Met Ser Lys Leu Thr Ser Asn Phe Arg Leu Gly Phe Gly Ser
165 170 175
Phe Val Glu Lys Pro Val Ser Pro Phe Val Lys Thr Thr Pro Glu Glu
180 185 190
Ile Ala Asn Pro Cys Ser Ser Ile Pro Tyr Phe Cys Leu Pro Thr Phe
195 200 205
Gly Phe Lys His Ile Leu Pro Leu Thr Asn Asp Ala Glu Arg Phe Asn
210 215 220
Glu Ile Val Lys Asn Gln Lys Ile Ser Ala Asn Ile Asp Thr Pro Glu
225 230 235 240
Gly Gly Phe Asp Ala Ile Met Gln Ala Ala Val Cys Lys Glu Lys Ile
245 250 255
Gly Trp Arg Asn Asp Ser Leu His Leu Leu Val Phe Val Ser Asp Ala
260 265 270
Asp Ser His Phe Gly Met Asp Ser Lys Leu Ala Gly Ile Val Ile Pro
275 280 285
Asn Asp Gly Leu Cys His Leu Asp Ser Lys Asn Glu Tyr Ser Met Ser
290 295 300
Thr Val Leu Glu Tyr Pro Thr Ile Gly Gln Leu Ile Asp Lys Leu Val
305 310 315 320
Gln Asn Asn Val Leu Leu Ile Phe Ala Val Thr Gln Glu Gln Val His
325 330 335
Leu Tyr Glu Asn Tyr Ala Lys Leu Ile Pro Gly Ala Thr Val Gly Leu
340 345 350
Leu Gln Lys Asp Ser Gly Asn Ile Leu Gln Leu Ile Ile Ser Ala Tyr
355 360 365
Glu Glu Leu Arg Ser Glu Val Glu Leu Glu Val Leu Gly Asp Thr Glu
370 375 380
Gly Leu Asn Leu Ser Phe Thr Ala Ile Cys Asn Asn Gly Thr Leu Phe
385 390 395 400
Gln His Gln Lys Lys Cys Ser His Met Lys Val Gly Asp Thr Ala Ser
405 410 415
Phe Ser Val Thr Val Asn Ile Pro His Cys Glu Arg Arg Ser Arg His
420 425 430
Ile Ile Ile Lys Pro Val Gly Leu Gly Asp Ala Leu Glu Leu Leu Val
435 440 445
Ser Pro Glu Cys Asn Cys Asp Cys Gln Lys Glu Val Glu Val Asn Ser
450 455 460
Ser Lys Cys His His Gly Asn Gly Ser Phe Gln Cys Gly Val Cys Ala
465 470 475 480
Cys His Pro Gly His Met Gly Pro Arg Cys Glu Cys Gly Glu Asp Met
485 490 495
Leu Ser Thr Asp Ser Cys Lys Glu Ala Pro Asp His Pro Ser Cys Ser
500 505 510
Gly Arg Gly Asp Cys Tyr Cys Gly Gln Cys Ile Cys His Leu Ser Pro
515 520 525
Tyr Gly Asn Ile Tyr Gly Pro Tyr Cys Gln Cys Asp Asn Phe Ser Cys
530 535 540
Val Arg His Lys Gly Leu Leu Cys Gly Gly Asn Gly Asp Cys Asp Cys
545 550 555 560
Gly Glu Cys Val Cys Arg Ser Gly Trp Thr Gly Glu Tyr Cys Asn Cys
565 570 575
Thr Thr Ser Thr Asp Ser Cys Val Ser Glu Asp Gly Val Leu Cys Ser
580 585 590
Gly Arg Gly Asp Cys Val Cys Gly Lys Cys Val Cys Thr Asn Pro Gly
595 600 605
Ala Ser Gly Pro Thr Cys Glu Arg Cys Pro Thr Cys Gly Asp Pro Cys
610 615 620
Asn Ser Lys Arg Ser Cys Ile Glu Cys His Leu Ser Ala Ala Gly Gln
625 630 635 640
Ala Arg Glu Glu Cys Val Asp Lys Cys Lys Leu Ala Gly Ala Thr Ile
645 650 655
Ser Glu Glu Glu Asp Phe Ser Lys Asp Gly Ser Val Ser Cys Ser Leu
660 665 670
Gln Gly Glu Asn Glu Cys Leu Ile Thr Phe Leu Ile Thr Thr Asp Asn
675 680 685
Glu Gly Lys Thr Ile Ile His Ser Ile Asn Glu Lys Asp Cys Pro Lys
690 695 700
Pro Pro Asn Ile Pro Met Ile Met Leu Gly Val Ser Leu Ala Ile Leu
705 710 715 720
Leu Ile Gly Val Val Leu Leu Cys Ile Trp Lys Leu Leu Val Ser Phe
725 730 735
His Asp Arg Lys Glu Val Ala Lys Phe Glu Ala Glu Arg Ser Lys Ala
740 745 750
Lys Trp Gln Thr Gly Thr Asn Pro Leu Tyr Arg Gly Ser Thr Ser Thr
755 760 765
Phe Lys Asn Val Thr Tyr Lys His Arg Glu Lys Gln Lys Val Asp Leu
770 775 780
Ser Thr Asp Cys
785
<210> 128
<211> 3036
<212> DNA
<213> 智人
<400> 128
cgaccagcag accagacagt cacagcagcc ttgacaaaac gttcctggaa ctcaagcact 60
tctccacaga ggaggacaga gcagacagca gagaccatgg agtctccctc ggcccctccc 120
cacagatggt gcatcccctg gcagaggctc ctgctcacag cctcacttct aaccttctgg 180
aacccgccca ccactgccaa gctcactatt gaatccacgc cgttcaatgt cgcagagggg 240
aaggaggtgc ttctacttgt ccacaatctg ccccagcatc tttttggcta cagctggtac 300
aaaggtgaaa gagtggatgg caaccgtcaa attataggat atgtaatagg aactcaacaa 360
gctaccccag ggcccgcata cagtggtcga gagataatat accccaatgc atccctgctg 420
atccagaaca tcatccagaa tgacacagga ttctacaccc tacacgtcat aaagtcagat 480
cttgtgaatg aagaagcaac tggccagttc cgggtatacc cggagctgcc caagccctcc 540
atctccagca acaactccaa acccgtggag gacaaggatg ctgtggcctt cacctgtgaa 600
cctgagactc aggacgcaac ctacctgtgg tgggtaaaca atcagagcct cccggtcagt 660
cccaggctgc agctgtccaa tggcaacagg accctcactc tattcaatgt cacaagaaat 720
gacacagcaa gctacaaatg tgaaacccag aacccagtga gtgccaggcg cagtgattca 780
gtcatcctga atgtcctcta tggcccggat gcccccacca tttcccctct aaacacatct 840
tacagatcag gggaaaatct gaacctctcc tgccatgcag cctctaaccc acctgcacag 900
tactcttggt ttgtcaatgg gactttccag caatccaccc aagagctctt tatccccaac 960
atcactgtga ataatagtgg atcctatacg tgccaagccc ataactcaga cactggcctc 1020
aataggacca cagtcacgac gatcacagtc tatgcagagc cacccaaacc cttcatcacc 1080
agcaacaact ccaaccccgt ggaggatgag gatgctgtag ccttaacctg tgaacctgag 1140
attcagaaca caacctacct gtggtgggta aataatcaga gcctcccggt cagtcccagg 1200
ctgcagctgt ccaatgacaa caggaccctc actctactca gtgtcacaag gaatgatgta 1260
ggaccctatg agtgtggaat ccagaacgaa ttaagtgttg accacagcga cccagtcatc 1320
ctgaatgtcc tctatggccc agacgacccc accatttccc cctcatacac ctattaccgt 1380
ccaggggtga acctcagcct ctcctgccat gcagcctcta acccacctgc acagtattct 1440
tggctgattg atgggaacat ccagcaacac acacaagagc tctttatctc caacatcact 1500
gagaagaaca gcggactcta tacctgccag gccaataact cagccagtgg ccacagcagg 1560
actacagtca agacaatcac agtctctgcg gagctgccca agccctccat ctccagcaac 1620
aactccaaac ccgtggagga caaggatgct gtggccttca cctgtgaacc tgaggctcag 1680
aacacaacct acctgtggtg ggtaaatggt cagagcctcc cagtcagtcc caggctgcag 1740
ctgtccaatg gcaacaggac cctcactcta ttcaatgtca caagaaatga cgcaagagcc 1800
tatgtatgtg gaatccagaa ctcagtgagt gcaaaccgca gtgacccagt caccctggat 1860
gtcctctatg ggccggacac ccccatcatt tcccccccag actcgtctta cctttcggga 1920
gcgaacctca acctctcctg ccactcggcc tctaacccat ccccgcagta ttcttggcgt 1980
atcaatggga taccgcagca acacacacaa gttctcttta tcgccaaaat cacgccaaat 2040
aataacggga cctatgcctg ttttgtctct aacttggcta ctggccgcaa taattccata 2100
gtcaagagca tcacagtctc tgcatctgga acttctcctg gtctctcagc tggggccact 2160
gtcggcatca tgattggagt gctggttggg gttgctctga tatagcagcc ctggtgtagt 2220
ttcttcattt caggaagact gacagttgtt ttgcttcttc cttaaagcat ttgcaacagc 2280
tacagtctaa aattgcttct ttaccaagga tatttacaga aaagactctg accagagatc 2340
gagaccatcc tagccaacat cgtgaaaccc catctctact aaaaatacaa aaatgagctg 2400
ggcttggtgg cgcgcacctg tagtcccagt tactcgggag gctgaggcag gagaatcgct 2460
tgaacccggg aggtggagat tgcagtgagc ccagatcgca ccactgcact ccagtctggc 2520
aacagagcaa gactccatct caaaaagaaa agaaaagaag actctgacct gtactcttga 2580
atacaagttt ctgataccac tgcactgtct gagaatttcc aaaactttaa tgaactaact 2640
gacagcttca tgaaactgtc caccaagatc aagcagagaa aataattaat ttcatgggac 2700
taaatgaact aatgaggata atattttcat aattttttat ttgaaatttt gctgattctt 2760
taaatgtctt gtttcccaga tttcaggaaa ctttttttct tttaagctat ccacagctta 2820
cagcaatttg ataaaatata cttttgtgaa caaaaattga gacatttaca ttttctccct 2880
atgtggtcgc tccagacttg ggaaactatt catgaatatt tatattgtat ggtaatatag 2940
ttattgcaca agttcaataa aaatctgctc tttgtatgac agaatacatt tgaaaacatt 3000
ggttatatta ccaagacttt gactagaatg tcgtat 3036
<210> 129
<211> 702
<212> PRT
<213> 智人
<400> 129
Met Glu Ser Pro Ser Ala Pro Pro His Arg Trp Cys Ile Pro Trp Gln
1 5 10 15
Arg Leu Leu Leu Thr Ala Ser Leu Leu Thr Phe Trp Asn Pro Pro Thr
20 25 30
Thr Ala Lys Leu Thr Ile Glu Ser Thr Pro Phe Asn Val Ala Glu Gly
35 40 45
Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gln His Leu Phe Gly
50 55 60
Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly Asn Arg Gln Ile Ile
65 70 75 80
Gly Tyr Val Ile Gly Thr Gln Gln Ala Thr Pro Gly Pro Ala Tyr Ser
85 90 95
Gly Arg Glu Ile Ile Tyr Pro Asn Ala Ser Leu Leu Ile Gln Asn Ile
100 105 110
Ile Gln Asn Asp Thr Gly Phe Tyr Thr Leu His Val Ile Lys Ser Asp
115 120 125
Leu Val Asn Glu Glu Ala Thr Gly Gln Phe Arg Val Tyr Pro Glu Leu
130 135 140
Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser Lys Pro Val Glu Asp Lys
145 150 155 160
Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Thr Gln Asp Ala Thr Tyr
165 170 175
Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser Pro Arg Leu Gln
180 185 190
Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn Val Thr Arg Asn
195 200 205
Asp Thr Ala Ser Tyr Lys Cys Glu Thr Gln Asn Pro Val Ser Ala Arg
210 215 220
Arg Ser Asp Ser Val Ile Leu Asn Val Leu Tyr Gly Pro Asp Ala Pro
225 230 235 240
Thr Ile Ser Pro Leu Asn Thr Ser Tyr Arg Ser Gly Glu Asn Leu Asn
245 250 255
Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr Ser Trp Phe
260 265 270
Val Asn Gly Thr Phe Gln Gln Ser Thr Gln Glu Leu Phe Ile Pro Asn
275 280 285
Ile Thr Val Asn Asn Ser Gly Ser Tyr Thr Cys Gln Ala His Asn Ser
290 295 300
Asp Thr Gly Leu Asn Arg Thr Thr Val Thr Thr Ile Thr Val Tyr Ala
305 310 315 320
Glu Pro Pro Lys Pro Phe Ile Thr Ser Asn Asn Ser Asn Pro Val Glu
325 330 335
Asp Glu Asp Ala Val Ala Leu Thr Cys Glu Pro Glu Ile Gln Asn Thr
340 345 350
Thr Tyr Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser Pro Arg
355 360 365
Leu Gln Leu Ser Asn Asp Asn Arg Thr Leu Thr Leu Leu Ser Val Thr
370 375 380
Arg Asn Asp Val Gly Pro Tyr Glu Cys Gly Ile Gln Asn Glu Leu Ser
385 390 395 400
Val Asp His Ser Asp Pro Val Ile Leu Asn Val Leu Tyr Gly Pro Asp
405 410 415
Asp Pro Thr Ile Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val Asn
420 425 430
Leu Ser Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr Ser
435 440 445
Trp Leu Ile Asp Gly Asn Ile Gln Gln His Thr Gln Glu Leu Phe Ile
450 455 460
Ser Asn Ile Thr Glu Lys Asn Ser Gly Leu Tyr Thr Cys Gln Ala Asn
465 470 475 480
Asn Ser Ala Ser Gly His Ser Arg Thr Thr Val Lys Thr Ile Thr Val
485 490 495
Ser Ala Glu Leu Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser Lys Pro
500 505 510
Val Glu Asp Lys Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Ala Gln
515 520 525
Asn Thr Thr Tyr Leu Trp Trp Val Asn Gly Gln Ser Leu Pro Val Ser
530 535 540
Pro Arg Leu Gln Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn
545 550 555 560
Val Thr Arg Asn Asp Ala Arg Ala Tyr Val Cys Gly Ile Gln Asn Ser
565 570 575
Val Ser Ala Asn Arg Ser Asp Pro Val Thr Leu Asp Val Leu Tyr Gly
580 585 590
Pro Asp Thr Pro Ile Ile Ser Pro Pro Asp Ser Ser Tyr Leu Ser Gly
595 600 605
Ala Asn Leu Asn Leu Ser Cys His Ser Ala Ser Asn Pro Ser Pro Gln
610 615 620
Tyr Ser Trp Arg Ile Asn Gly Ile Pro Gln Gln His Thr Gln Val Leu
625 630 635 640
Phe Ile Ala Lys Ile Thr Pro Asn Asn Asn Gly Thr Tyr Ala Cys Phe
645 650 655
Val Ser Asn Leu Ala Thr Gly Arg Asn Asn Ser Ile Val Lys Ser Ile
660 665 670
Thr Val Ser Ala Ser Gly Thr Ser Pro Gly Leu Ser Ala Gly Ala Thr
675 680 685
Val Gly Ile Met Ile Gly Val Leu Val Gly Val Ala Leu Ile
690 695 700
<210> 130
<211> 90
<212> DNA
<213> 智人
<400> 130
ttggctttgg tcttcaagta gccaaagcga tgaaatatct tgcaagcaaa aagtttgtcc 60
acagagactt ggctgcaaga aactgtatgt 90
<210> 131
<211> 22
<212> PRT
<213> 智人
<400> 131
Lys Ala Met Lys Tyr Leu Ala Ser Lys Lys Phe Val His Arg Asp Leu
1 5 10 15
Ala Ala Arg Asn Cys Met
20
<210> 132
<211> 1804
<212> DNA
<213> 智人
<400> 132
cgctccacct ctcaagcagc cagcgcctgc ctgaatctgt tctgccccct ccccacccat 60
ttcaccacca ccatgacacc gggcacccag tctcctttct tcctgctgct gctcctcaca 120
gtgcttacag ttgttacagg ttctggtcat gcaagctcta ccccaggtgg agaaaaggag 180
acttcggcta cccagagaag ttcagtgccc agctctactg agaagaatgc tgtgagtatg 240
accagcagcg tactctccag ccacagcccc ggttcaggct cctccaccac tcagggacag 300
gatgtcactc tggccccggc cacggaacca gcttcaggtt cagctgccac ctggggacag 360
gatgtcacct cggtcccagt caccaggcca gccctgggct ccaccacccc gccagcccac 420
gatgtcacct cagccccgga caacaagcca gccccgggct ccaccgcccc cccagcccac 480
ggtgtcacct cggccccgga caccaggccg gccccgggct ccaccgcccc cccagcccat 540
ggtgtcacct cggccccgga caacaggccc gccttgggct ccaccgcccc tccagtccac 600
aatgtcacct cggcctcagg ctctgcatca ggctcagctt ctactctggt gcacaacggc 660
acctctgcca gggctaccac aaccccagcc agcaagagca ctccattctc aattcccagc 720
caccactctg atactcctac cacccttgcc agccatagca ccaagactga tgccagtagc 780
actcaccata gcacggtacc tcctctcacc tcctccaatc acagcacttc tccccagttg 840
tctactgggg tctctttctt tttcctgtct tttcacattt caaacctcca gtttaattcc 900
tctctggaag atcccagcac cgactactac caagagctgc agagagacat ttctgaaatg 960
tttttgcaga tttataaaca agggggtttt ctgggcctct ccaatattaa gttcaggcca 1020
ggatctgtgg tggtacaatt gactctggcc ttccgagaag gtaccatcaa tgtccacgac 1080
gtggagacac agttcaatca gtataaaacg gaagcagcct ctcgatataa cctgacgatc 1140
tcagacgtca gcgtgagtga tgtgccattt cctttctctg cccagtctgg ggctggggtg 1200
ccaggctggg gcatcgcgct gctggtgctg gtctgtgttc tggttgcgct ggccattgtc 1260
tatctcattg ccttggctgt ctgtcagtgc cgccgaaaga actacgggca gctggacatc 1320
tttccagccc gggataccta ccatcctatg agcgagtacc ccacctacca cacccatggg 1380
cgctatgtgc cccctagcag taccgatcgt agcccctatg agaaggtttc tgcaggtaat 1440
ggtggcagca gcctctctta cacaaaccca gcagtggcag ccacttctgc caacttgtag 1500
gggcacgtcg cccgctgagc tgagtggcca gccagtgcca ttccactcca ctcaggttct 1560
tcagggccag agcccctgca ccctgtttgg gctggtgagc tgggagttca ggtgggctgc 1620
tcacaccgtc cttcagaggc cccaccaatt tctcggacac ttctcagtgt gtggaagctc 1680
atgtgggccc ctgaggctca tgcctgggaa gtgttgtggt gggggctccc aggaggactg 1740
gcccagagag ccctgagata gcggggatcc tgaactggac tgaataaaac gtggtctccc 1800
actg 1804
<210> 133
<211> 475
<212> PRT
<213> 智人
<400> 133
Met Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr
1 5 10 15
Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr Pro Gly
20 25 30
Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro Ser Ser
35 40 45
Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser Ser His
50 55 60
Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val Thr Leu
65 70 75 80
Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp Gly Gln
85 90 95
Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser Thr Thr
100 105 110
Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro Ala Pro
115 120 125
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
130 135 140
Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser
145 150 155 160
Ala Pro Asp Asn Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His
165 170 175
Asn Val Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu
180 185 190
Val His Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys
195 200 205
Ser Thr Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr
210 215 220
Leu Ala Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser
225 230 235 240
Thr Val Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu
245 250 255
Ser Thr Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu
260 265 270
Gln Phe Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu
275 280 285
Leu Gln Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly
290 295 300
Gly Phe Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val
305 310 315 320
Val Gln Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp
325 330 335
Val Glu Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr
340 345 350
Asn Leu Thr Ile Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe
355 360 365
Ser Ala Gln Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu
370 375 380
Val Leu Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala
385 390 395 400
Leu Ala Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile
405 410 415
Phe Pro Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr
420 425 430
His Thr His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro
435 440 445
Tyr Glu Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr
450 455 460
Asn Pro Ala Val Ala Ala Thr Ser Ala Asn Leu
465 470 475
<210> 134
<211> 1552
<212> DNA
<213> 智人
<400> 134
gcccgtacac accgtgtgct gggacacccc acagtcagcc gcatggctcc cctgtgcccc 60
agcccctggc tccctctgtt gatcccggcc cctgctccag gcctcactgt gcaactgctg 120
ctgtcactgc tgcttctgat gcctgtccat ccccagaggt tgccccggat gcaggaggat 180
tcccccttgg gaggaggctc ttctggggaa gatgacccac tgggcgagga ggatctgccc 240
agtgaagagg attcacccag agaggaggat ccacccggag aggaggatct acctggagag 300
gaggatctac ctggagagga ggatctacct gaagttaagc ctaaatcaga agaagagggc 360
tccctgaagt tagaggatct acctactgtt gaggctcctg gagatcctca agaaccccag 420
aataatgccc acagggacaa agaaggggat gaccagagtc attggcgcta tggaggcgac 480
ccgccctggc cccgggtgtc cccagcctgc gcgggccgct tccagtcccc ggtggatatc 540
cgcccccagc tcgccgcctt ctgcccggcc ctgcgccccc tggaactcct gggcttccag 600
ctcccgccgc tcccagaact gcgcctgcgc aacaatggcc acagtgtgca actgaccctg 660
cctcctgggc tagagatggc tctgggtccc gggcgggagt accgggctct gcagctgcat 720
ctgcactggg gggctgcagg tcgtccgggc tcggagcaca ctgtggaagg ccaccgtttc 780
cctgccgaga tccacgtggt tcacctcagc accgcctttg ccagagttga cgaggccttg 840
gggcgcccgg gaggcctggc cgtgttggcc gcctttctgg aggagggccc ggaagaaaac 900
agtgcctatg agcagttgct gtctcgcttg gaagaaatcg ctgaggaagg ctcagagact 960
caggtcccag gactggacat atctgcactc ctgccctctg acttcagccg ctacttccaa 1020
tatgaggggt ctctgactac accgccctgt gcccagggtg tcatctggac tgtgtttaac 1080
cagacagtga tgctgagtgc taagcagctc cacaccctct ctgacaccct gtggggacct 1140
ggtgactctc ggctacagct gaacttccga gcgacgcagc ctttgaatgg gcgagtgatt 1200
gaggcctcct tccctgctgg agtggacagc agtcctcggg ctgctgagcc agtccagctg 1260
aattcctgcc tggctgctgg tgacatccta gccctggttt ttggcctcct ttttgctgtc 1320
accagcgtcg cgttccttgt gcagatgaga aggcagcaca gaaggggaac caaagggggt 1380
gtgagctacc gcccagcaga ggtagccgag actggagcct agaggctgga tcttggagaa 1440
tgtgagaagc cagccagagg catctgaggg ggagccggta actgtcctgt cctgctcatt 1500
atgccacttc cttttaactg ccaagaaatt ttttaaaata aatatttata at 1552
<210> 135
<211> 459
<212> PRT
<213> 智人
<400> 135
Met Ala Pro Leu Cys Pro Ser Pro Trp Leu Pro Leu Leu Ile Pro Ala
1 5 10 15
Pro Ala Pro Gly Leu Thr Val Gln Leu Leu Leu Ser Leu Leu Leu Leu
20 25 30
Met Pro Val His Pro Gln Arg Leu Pro Arg Met Gln Glu Asp Ser Pro
35 40 45
Leu Gly Gly Gly Ser Ser Gly Glu Asp Asp Pro Leu Gly Glu Glu Asp
50 55 60
Leu Pro Ser Glu Glu Asp Ser Pro Arg Glu Glu Asp Pro Pro Gly Glu
65 70 75 80
Glu Asp Leu Pro Gly Glu Glu Asp Leu Pro Gly Glu Glu Asp Leu Pro
85 90 95
Glu Val Lys Pro Lys Ser Glu Glu Glu Gly Ser Leu Lys Leu Glu Asp
100 105 110
Leu Pro Thr Val Glu Ala Pro Gly Asp Pro Gln Glu Pro Gln Asn Asn
115 120 125
Ala His Arg Asp Lys Glu Gly Asp Asp Gln Ser His Trp Arg Tyr Gly
130 135 140
Gly Asp Pro Pro Trp Pro Arg Val Ser Pro Ala Cys Ala Gly Arg Phe
145 150 155 160
Gln Ser Pro Val Asp Ile Arg Pro Gln Leu Ala Ala Phe Cys Pro Ala
165 170 175
Leu Arg Pro Leu Glu Leu Leu Gly Phe Gln Leu Pro Pro Leu Pro Glu
180 185 190
Leu Arg Leu Arg Asn Asn Gly His Ser Val Gln Leu Thr Leu Pro Pro
195 200 205
Gly Leu Glu Met Ala Leu Gly Pro Gly Arg Glu Tyr Arg Ala Leu Gln
210 215 220
Leu His Leu His Trp Gly Ala Ala Gly Arg Pro Gly Ser Glu His Thr
225 230 235 240
Val Glu Gly His Arg Phe Pro Ala Glu Ile His Val Val His Leu Ser
245 250 255
Thr Ala Phe Ala Arg Val Asp Glu Ala Leu Gly Arg Pro Gly Gly Leu
260 265 270
Ala Val Leu Ala Ala Phe Leu Glu Glu Gly Pro Glu Glu Asn Ser Ala
275 280 285
Tyr Glu Gln Leu Leu Ser Arg Leu Glu Glu Ile Ala Glu Glu Gly Ser
290 295 300
Glu Thr Gln Val Pro Gly Leu Asp Ile Ser Ala Leu Leu Pro Ser Asp
305 310 315 320
Phe Ser Arg Tyr Phe Gln Tyr Glu Gly Ser Leu Thr Thr Pro Pro Cys
325 330 335
Ala Gln Gly Val Ile Trp Thr Val Phe Asn Gln Thr Val Met Leu Ser
340 345 350
Ala Lys Gln Leu His Thr Leu Ser Asp Thr Leu Trp Gly Pro Gly Asp
355 360 365
Ser Arg Leu Gln Leu Asn Phe Arg Ala Thr Gln Pro Leu Asn Gly Arg
370 375 380
Val Ile Glu Ala Ser Phe Pro Ala Gly Val Asp Ser Ser Pro Arg Ala
385 390 395 400
Ala Glu Pro Val Gln Leu Asn Ser Cys Leu Ala Ala Gly Asp Ile Leu
405 410 415
Ala Leu Val Phe Gly Leu Leu Phe Ala Val Thr Ser Val Ala Phe Leu
420 425 430
Val Gln Met Arg Arg Gln His Arg Arg Gly Thr Lys Gly Gly Val Ser
435 440 445
Tyr Arg Pro Ala Glu Val Ala Glu Thr Gly Ala
450 455
<210> 136
<211> 1595
<212> DNA
<213> 智人
<400> 136
ccccggcgca gcgcggccgc agcagcctcc gccccccgca cggtgtgagc gcccgacgcg 60
gccgaggcgg ccggagtccc gagctagccc cggcggccgc cgccgcccag accggacgac 120
aggccacctc gtcggcgtcc gcccgagtcc ccgcctcgcc gccaacgcca caaccaccgc 180
gcacggcccc ctgactccgt ccagtattga tcgggagagc cggagcgagc tcttcgggga 240
gcagcgatgc gaccctccgg gacggccggg gcagcgctcc tggcgctgct ggctgcgctc 300
tgcccggcga gtcgggctct ggaggaaaag aaagtttgcc aaggcacgag taacaagctc 360
acgcagttgg gcacttttga agatcatttt ctcagcctcc agaggatgtt caataactgt 420
gaggtggtcc ttgggaattt ggaaattacc tatgtgcaga ggaattatga tctttccttc 480
ttaaagacca tccaggaggt ggctggttat gtcctcattg ccctcaacac agtggagcga 540
attcctttgg aaaacctgca gatcatcaga ggaaatatgt actacgaaaa ttcctatgcc 600
ttagcagtct tatctaacta tgatgcaaat aaaaccggac tgaaggagct gcccatgaga 660
aatttacagg aaatcctgca tggcgccgtg cggttcagca acaaccctgc cctgtgcaac 720
gtggagagca tccagtggcg ggacatagtc agcagtgact ttctcagcaa catgtcgatg 780
gacttccaga accacctggg cagctgccaa aagtgtgatc caagctgtcc caatgggagc 840
tgctggggtg caggagagga gaactgccag aaactgacca aaatcatctg tgcccagcag 900
tgctccgggc gctgccgtgg caagtccccc agtgactgct gccacaacca gtgtgctgca 960
ggctgcacag gcccccggga gagcgactgc ctggtctgcc gcaaattccg agacgaagcc 1020
acgtgcaagg acacctgccc cccactcatg ctctacaacc ccaccacgta ccagatggat 1080
gtgaaccccg agggcaaata cagctttggt gccacctgcg tgaagaagtg tccccgtaat 1140
tatgtggtga cagatcacgg ctcgtgcgtc cgagcctgtg gggccgacag ctatgagatg 1200
gaggaagacg gcgtccgcaa gtgtaagaag tgcgaagggc cttgccgcaa agtgtgtaac 1260
ggaataggta ttggtgaatt taaagactca ctctccataa atgctacgaa tattaaacac 1320
ttcaaaaact gcacctccat cagtggcgat ctccacatcc tgccggtggc atttaggggt 1380
gactccttca cacatactcc tcctctggat ccacaggaac tggatattct gaaaaccgta 1440
aaggaaatca caggtttgag ctgaattatc acatgaatat aaatgggaaa tcagtgtttt 1500
agagagagaa cttttcgaca tatttcctgt tcccttggaa taaaaacatt tcttctgaaa 1560
ttttaccgtt aaaaaaaaaa aaaaaaaaaa aaaaa 1595
<210> 137
<211> 405
<212> PRT
<213> 智人
<400> 137
Met Arg Pro Ser Gly Thr Ala Gly Ala Ala Leu Leu Ala Leu Leu Ala
1 5 10 15
Ala Leu Cys Pro Ala Ser Arg Ala Leu Glu Glu Lys Lys Val Cys Gln
20 25 30
Gly Thr Ser Asn Lys Leu Thr Gln Leu Gly Thr Phe Glu Asp His Phe
35 40 45
Leu Ser Leu Gln Arg Met Phe Asn Asn Cys Glu Val Val Leu Gly Asn
50 55 60
Leu Glu Ile Thr Tyr Val Gln Arg Asn Tyr Asp Leu Ser Phe Leu Lys
65 70 75 80
Thr Ile Gln Glu Val Ala Gly Tyr Val Leu Ile Ala Leu Asn Thr Val
85 90 95
Glu Arg Ile Pro Leu Glu Asn Leu Gln Ile Ile Arg Gly Asn Met Tyr
100 105 110
Tyr Glu Asn Ser Tyr Ala Leu Ala Val Leu Ser Asn Tyr Asp Ala Asn
115 120 125
Lys Thr Gly Leu Lys Glu Leu Pro Met Arg Asn Leu Gln Glu Ile Leu
130 135 140
His Gly Ala Val Arg Phe Ser Asn Asn Pro Ala Leu Cys Asn Val Glu
145 150 155 160
Ser Ile Gln Trp Arg Asp Ile Val Ser Ser Asp Phe Leu Ser Asn Met
165 170 175
Ser Met Asp Phe Gln Asn His Leu Gly Ser Cys Gln Lys Cys Asp Pro
180 185 190
Ser Cys Pro Asn Gly Ser Cys Trp Gly Ala Gly Glu Glu Asn Cys Gln
195 200 205
Lys Leu Thr Lys Ile Ile Cys Ala Gln Gln Cys Ser Gly Arg Cys Arg
210 215 220
Gly Lys Ser Pro Ser Asp Cys Cys His Asn Gln Cys Ala Ala Gly Cys
225 230 235 240
Thr Gly Pro Arg Glu Ser Asp Cys Leu Val Cys Arg Lys Phe Arg Asp
245 250 255
Glu Ala Thr Cys Lys Asp Thr Cys Pro Pro Leu Met Leu Tyr Asn Pro
260 265 270
Thr Thr Tyr Gln Met Asp Val Asn Pro Glu Gly Lys Tyr Ser Phe Gly
275 280 285
Ala Thr Cys Val Lys Lys Cys Pro Arg Asn Tyr Val Val Thr Asp His
290 295 300
Gly Ser Cys Val Arg Ala Cys Gly Ala Asp Ser Tyr Glu Met Glu Glu
305 310 315 320
Asp Gly Val Arg Lys Cys Lys Lys Cys Glu Gly Pro Cys Arg Lys Val
325 330 335
Cys Asn Gly Ile Gly Ile Gly Glu Phe Lys Asp Ser Leu Ser Ile Asn
340 345 350
Ala Thr Asn Ile Lys His Phe Lys Asn Cys Thr Ser Ile Ser Gly Asp
355 360 365
Leu His Ile Leu Pro Val Ala Phe Arg Gly Asp Ser Phe Thr His Thr
370 375 380
Pro Pro Leu Asp Pro Gln Glu Leu Asp Ile Leu Lys Thr Val Lys Glu
385 390 395 400
Ile Thr Gly Leu Ser
405
<210> 138
<211> 1437
<212> DNA
<213> 智人
<400> 138
gcttcctcag acatgccgct gctgctactg ctgcccctgc tgtgggcagg ggccctggct 60
atggatccaa atttctggct gcaagtgcag gagtcagtga cggtacagga gggtttgtgc 120
gtcctcgtgc cctgcacttt cttccatccc ataccctact acgacaagaa ctccccagtt 180
catggttact ggttccggga aggagccatt atatccgggg actctccagt ggccacaaac 240
aagctagatc aagaagtaca ggaggagact cagggcagat tccgcctcct tggggatccc 300
agtaggaaca actgctccct gagcatcgta gacgccagga ggagggataa tggttcatac 360
ttctttcgga tggagagagg aagtaccaaa tacagttaca aatctcccca gctctctgtg 420
catgtgacag acttgaccca caggcccaaa atcctcatcc ctggcactct agaacccggc 480
cactccaaaa accttacctg ctctgtgtcc tgggcctgtg agcagggaac acccccgatc 540
ttctcctggt tgtcagctgc ccccacctcc ctgggcccca ggactactca ctcctcggtg 600
ctcataatca ccccacggcc ccaggaccac ggcaccaacc tgacctgtca ggtgaagttc 660
gctggagctg gtgtgactac ggagagaacc atccagctca acgtcaccta tgttccacag 720
aacccaacaa ctggtatctt tccaggagat ggctcaggga aacaagagac cagagcagga 780
ctggttcatg gggccattgg aggagctggt gttacagccc tgctcgctct ttgtctctgc 840
ctcatcttct tcatagtgaa gacccacagg aggaaagcag ccaggacagc agtgggcagc 900
aatgacaccc accctaccac agggtcagcc tccccgaaac accagaagaa ctccaagtta 960
catggcccca ctgaaacctc aagctgttca ggtgccgccc ctactgtgga gatggatgag 1020
gagctgcatt atgcttccct caactttcat gggatgaatc cttccaagga cacctccacc 1080
gaatactcag aggtcaggac ccagtgagga accctcaaga gcatcaggct cagctagaag 1140
atccacatcc tctacaggtc ggggaccaaa ggctgattct tggagattta actccccaca 1200
ggcaatgggt ttatagacat tatgtgagtt tcctgctata ttaacatcat cttgagactt 1260
tgcaagcaga gagtcgtgga atcaaatctg tgctctttca tttgctaagt gtatgatgtc 1320
acacaagctc cttaaccttc catgtctcca ttttcttctc tgtgaagtag gtataagaag 1380
tcctatctca tagggatgct gtgagcatta aataaaggta cacatggaaa acaccag 1437
<210> 139
<211> 364
<212> PRT
<213> 智人
<400> 139
Met Pro Leu Leu Leu Leu Leu Pro Leu Leu Trp Ala Gly Ala Leu Ala
1 5 10 15
Met Asp Pro Asn Phe Trp Leu Gln Val Gln Glu Ser Val Thr Val Gln
20 25 30
Glu Gly Leu Cys Val Leu Val Pro Cys Thr Phe Phe His Pro Ile Pro
35 40 45
Tyr Tyr Asp Lys Asn Ser Pro Val His Gly Tyr Trp Phe Arg Glu Gly
50 55 60
Ala Ile Ile Ser Gly Asp Ser Pro Val Ala Thr Asn Lys Leu Asp Gln
65 70 75 80
Glu Val Gln Glu Glu Thr Gln Gly Arg Phe Arg Leu Leu Gly Asp Pro
85 90 95
Ser Arg Asn Asn Cys Ser Leu Ser Ile Val Asp Ala Arg Arg Arg Asp
100 105 110
Asn Gly Ser Tyr Phe Phe Arg Met Glu Arg Gly Ser Thr Lys Tyr Ser
115 120 125
Tyr Lys Ser Pro Gln Leu Ser Val His Val Thr Asp Leu Thr His Arg
130 135 140
Pro Lys Ile Leu Ile Pro Gly Thr Leu Glu Pro Gly His Ser Lys Asn
145 150 155 160
Leu Thr Cys Ser Val Ser Trp Ala Cys Glu Gln Gly Thr Pro Pro Ile
165 170 175
Phe Ser Trp Leu Ser Ala Ala Pro Thr Ser Leu Gly Pro Arg Thr Thr
180 185 190
His Ser Ser Val Leu Ile Ile Thr Pro Arg Pro Gln Asp His Gly Thr
195 200 205
Asn Leu Thr Cys Gln Val Lys Phe Ala Gly Ala Gly Val Thr Thr Glu
210 215 220
Arg Thr Ile Gln Leu Asn Val Thr Tyr Val Pro Gln Asn Pro Thr Thr
225 230 235 240
Gly Ile Phe Pro Gly Asp Gly Ser Gly Lys Gln Glu Thr Arg Ala Gly
245 250 255
Leu Val His Gly Ala Ile Gly Gly Ala Gly Val Thr Ala Leu Leu Ala
260 265 270
Leu Cys Leu Cys Leu Ile Phe Phe Ile Val Lys Thr His Arg Arg Lys
275 280 285
Ala Ala Arg Thr Ala Val Gly Ser Asn Asp Thr His Pro Thr Thr Gly
290 295 300
Ser Ala Ser Pro Lys His Gln Lys Asn Ser Lys Leu His Gly Pro Thr
305 310 315 320
Glu Thr Ser Ser Cys Ser Gly Ala Ala Pro Thr Val Glu Met Asp Glu
325 330 335
Glu Leu His Tyr Ala Ser Leu Asn Phe His Gly Met Asn Pro Ser Lys
340 345 350
Asp Thr Ser Thr Glu Tyr Ser Glu Val Arg Thr Gln
355 360
<210> 140
<211> 1968
<212> DNA
<213> 智人
<400> 140
aggcccctgc ctgccccagc atcccctgcg cgaagctggg tgccccggag agtctgacca 60
ccatgccacc tcctcgcctc ctcttcttcc tcctcttcct cacccccatg gaagtcaggc 120
ccgaggaacc tctagtggtg aaggtggaag agggagataa cgctgtgctg cagtgcctca 180
aggggacctc agatggcccc actcagcagc tgacctggtc tcgggagtcc ccgcttaaac 240
ccttcttaaa actcagcctg gggctgccag gcctgggaat ccacatgagg cccctggcca 300
tctggctttt catcttcaac gtctctcaac agatgggggg cttctacctg tgccagccgg 360
ggcccccctc tgagaaggcc tggcagcctg gctggacagt caatgtggag ggcagcgggg 420
agctgttccg gtggaatgtt tcggacctag gtggcctggg ctgtggcctg aagaacaggt 480
cctcagaggg ccccagctcc ccttccggga agctcatgag ccccaagctg tatgtgtggg 540
ccaaagaccg ccctgagatc tgggagggag agcctccgtg tctcccaccg agggacagcc 600
tgaaccagag cctcagccag gacctcacca tggcccctgg ctccacactc tggctgtcct 660
gtggggtacc ccctgactct gtgtccaggg gccccctctc ctggacccat gtgcacccca 720
aggggcctaa gtcattgctg agcctagagc tgaaggacga tcgcccggcc agagatatgt 780
gggtaatgga gacgggtctg ttgttgcccc gggccacagc tcaagacgct ggaaagtatt 840
attgtcaccg tggcaacctg accatgtcat tccacctgga gatcactgct cggccagtac 900
tatggcactg gctgctgagg actggtggct ggaaggtctc agctgtgact ttggcttatc 960
tgatcttctg cctgtgttcc cttgtgggca ttcttcatct tcaaagagcc ctggtcctga 1020
ggaggaaaag aaagcgaatg actgacccca ccaggagatt cttcaaagtg acgcctcccc 1080
caggaagcgg gccccagaac cagtacggga acgtgctgtc tctccccaca cccacctcag 1140
gcctcggacg cgcccagcgt tgggccgcag gcctgggggg cactgccccg tcttatggaa 1200
acccgagcag cgacgtccag gcggatggag ccttggggtc ccggagcccg ccgggagtgg 1260
gcccagaaga agaggaaggg gagggctatg aggaacctga cagtgaggag gactccgagt 1320
tctatgagaa cgactccaac cttgggcagg accagctctc ccaggatggc agcggctacg 1380
agaaccctga ggatgagccc ctgggtcctg aggatgaaga ctccttctcc aacgctgagt 1440
cttatgagaa cgaggatgaa gagctgaccc agccggtcgc caggacaatg gacttcctga 1500
gccctcatgg gtcagcctgg gaccccagcc gggaagcaac ctccctggca gggtcccagt 1560
cctatgagga tatgagagga atcctgtatg cagcccccca gctccgctcc attcggggcc 1620
agcctggacc caatcatgag gaagatgcag actcttatga gaacatggat aatcccgatg 1680
ggccagaccc agcctgggga ggagggggcc gcatgggcac ctggagcacc aggtgatcct 1740
caggtggcca gcctggatct cctcaagtcc ccaagattca cacctgactc tgaaatctga 1800
agacctcgag cagatgatgc caacctctgg agcaatgttg cttaggatgt gtgcatgtgt 1860
gtaagtgtgt gtgtgtgtgt gtgtgtgtat acatgccagt gacacttcca gtcccctttg 1920
tattccttaa ataaactcaa tgagctcttc caatcctaaa aaaaaaaa 1968
<210> 141
<211> 557
<212> PRT
<213> 智人
<400> 141
Met Pro Pro Pro Arg Leu Leu Phe Phe Leu Leu Phe Leu Thr Pro Met
1 5 10 15
Glu Val Arg Pro Glu Glu Pro Leu Val Val Lys Val Glu Glu Gly Asp
20 25 30
Asn Ala Val Leu Gln Cys Leu Lys Gly Thr Ser Asp Gly Pro Thr Gln
35 40 45
Gln Leu Thr Trp Ser Arg Glu Ser Pro Leu Lys Pro Phe Leu Lys Leu
50 55 60
Ser Leu Gly Leu Pro Gly Leu Gly Ile His Met Arg Pro Leu Ala Ile
65 70 75 80
Trp Leu Phe Ile Phe Asn Val Ser Gln Gln Met Gly Gly Phe Tyr Leu
85 90 95
Cys Gln Pro Gly Pro Pro Ser Glu Lys Ala Trp Gln Pro Gly Trp Thr
100 105 110
Val Asn Val Glu Gly Ser Gly Glu Leu Phe Arg Trp Asn Val Ser Asp
115 120 125
Leu Gly Gly Leu Gly Cys Gly Leu Lys Asn Arg Ser Ser Glu Gly Pro
130 135 140
Ser Ser Pro Ser Gly Lys Leu Met Ser Pro Lys Leu Tyr Val Trp Ala
145 150 155 160
Lys Asp Arg Pro Glu Ile Trp Glu Gly Glu Pro Pro Cys Leu Pro Pro
165 170 175
Arg Asp Ser Leu Asn Gln Ser Leu Ser Gln Asp Leu Thr Met Ala Pro
180 185 190
Gly Ser Thr Leu Trp Leu Ser Cys Gly Val Pro Pro Asp Ser Val Ser
195 200 205
Arg Gly Pro Leu Ser Trp Thr His Val His Pro Lys Gly Pro Lys Ser
210 215 220
Leu Leu Ser Leu Glu Leu Lys Asp Asp Arg Pro Ala Arg Asp Met Trp
225 230 235 240
Val Met Glu Thr Gly Leu Leu Leu Pro Arg Ala Thr Ala Gln Asp Ala
245 250 255
Gly Lys Tyr Tyr Cys His Arg Gly Asn Leu Thr Met Ser Phe His Leu
260 265 270
Glu Ile Thr Ala Arg Pro Val Leu Trp His Trp Leu Leu Arg Thr Gly
275 280 285
Gly Trp Lys Val Ser Ala Val Thr Leu Ala Tyr Leu Ile Phe Cys Leu
290 295 300
Cys Ser Leu Val Gly Ile Leu His Leu Gln Arg Ala Leu Val Leu Arg
305 310 315 320
Arg Lys Arg Lys Arg Met Thr Asp Pro Thr Arg Arg Phe Phe Lys Val
325 330 335
Thr Pro Pro Pro Gly Ser Gly Pro Gln Asn Gln Tyr Gly Asn Val Leu
340 345 350
Ser Leu Pro Thr Pro Thr Ser Gly Leu Gly Arg Ala Gln Arg Trp Ala
355 360 365
Ala Gly Leu Gly Gly Thr Ala Pro Ser Tyr Gly Asn Pro Ser Ser Asp
370 375 380
Val Gln Ala Asp Gly Ala Leu Gly Ser Arg Ser Pro Pro Gly Val Gly
385 390 395 400
Pro Glu Glu Glu Glu Gly Glu Gly Tyr Glu Glu Pro Asp Ser Glu Glu
405 410 415
Asp Ser Glu Phe Tyr Glu Asn Asp Ser Asn Leu Gly Gln Asp Gln Leu
420 425 430
Ser Gln Asp Gly Ser Gly Tyr Glu Asn Pro Glu Asp Glu Pro Leu Gly
435 440 445
Pro Glu Asp Glu Asp Ser Phe Ser Asn Ala Glu Ser Tyr Glu Asn Glu
450 455 460
Asp Glu Glu Leu Thr Gln Pro Val Ala Arg Thr Met Asp Phe Leu Ser
465 470 475 480
Pro His Gly Ser Ala Trp Asp Pro Ser Arg Glu Ala Thr Ser Leu Ala
485 490 495
Gly Ser Gln Ser Tyr Glu Asp Met Arg Gly Ile Leu Tyr Ala Ala Pro
500 505 510
Gln Leu Arg Ser Ile Arg Gly Gln Pro Gly Pro Asn His Glu Glu Asp
515 520 525
Ala Asp Ser Tyr Glu Asn Met Asp Asn Pro Asp Gly Pro Asp Pro Ala
530 535 540
Trp Gly Gly Gly Gly Arg Met Gly Thr Trp Ser Thr Arg
545 550 555
<210> 142
<211> 3216
<212> DNA
<213> 智人
<400> 142
ggcagtttcc tggctgaaca cgccagccca atacttaaag agagcaactc ctgactccga 60
tagagactgg atggacccac aagggtgaca gcccaggcgg accgatcttc ccatcccaca 120
tcctccggcg cgatgccaaa aagaggctga cggcaactgg gccttctgca gagaaagacc 180
tccgcttcac tgccccggct ggtcccaagg gtcaggaaga tggattcata cctgctgatg 240
tggggactgc tcacgttcat catggtgcct ggctgccagg cagagctctg tgacgatgac 300
ccgccagaga tcccacacgc cacattcaaa gccatggcct acaaggaagg aaccatgttg 360
aactgtgaat gcaagagagg tttccgcaga ataaaaagcg ggtcactcta tatgctctgt 420
acaggaaact ctagccactc gtcctgggac aaccaatgtc aatgcacaag ctctgccact 480
cggaacacaa cgaaacaagt gacacctcaa cctgaagaac agaaagaaag gaaaaccaca 540
gaaatgcaaa gtccaatgca gccagtggac caagcgagcc ttccaggtca ctgcagggaa 600
cctccaccat gggaaaatga agccacagag agaatttatc atttcgtggt ggggcagatg 660
gtttattatc agtgcgtcca gggatacagg gctctacaca gaggtcctgc tgagagcgtc 720
tgcaaaatga cccacgggaa gacaaggtgg acccagcccc agctcatatg cacaggtgaa 780
atggagacca gtcagtttcc aggtgaagag aagcctcagg caagccccga aggccgtcct 840
gagagtgaga cttcctgcct cgtcacaaca acagattttc aaatacagac agaaatggct 900
gcaaccatgg agacgtccat atttacaaca gagtaccagg tagcagtggc cggctgtgtt 960
ttcctgctga tcagcgtcct cctcctgagt gggctcacct ggcagcggag acagaggaag 1020
agtagaagaa caatctagaa aaccaaaaga acaagaattt cttggtaaga agccgggaac 1080
agacaacaga agtcatgaag cccaagtgaa atcaaaggtg ctaaatggtc gcccaggaga 1140
catccgttgt gcttgcctgc gttttggaag ctctgaagtc acatcacagg acacggggca 1200
gtggcaacct tgtctctatg ccagctcagt cccatcagag agcgagcgct acccacttct 1260
aaatagcaat ttcgccgttg aagaggaagg gcaaaaccac tagaactctc catcttattt 1320
tcatgtatat gtgttcatta aagcatgaat ggtatggaac tctctccacc ctatatgtag 1380
tataaagaaa agtaggttta cattcatctc attccaactt cccagttcag gagtcccaag 1440
gaaagcccca gcactaacgt aaatacacaa cacacacact ctaccctata caactggaca 1500
ttgtctgcgt ggttcctttc tcagccgctt ctgactgctg attctcccgt tcacgttgcc 1560
taataaacat ccttcaagaa ctctgggctg ctacccagaa atcattttac ccttggctca 1620
atcctctaag ctaaccccct tctactgagc cttcagtctt gaatttctaa aaaacagagg 1680
ccatggcaga ataatctttg ggtaacttca aaacggggca gccaaaccca tgaggcaatg 1740
tcaggaacag aaggatgaat gaggtcccag gcagagaatc atacttagca aagttttacc 1800
tgtgcgttac taattggcct ctttaagagt tagtttcttt gggattgcta tgaatgatac 1860
cctgaatttg gcctgcacta atttgatgtt tacaggtgga cacacaaggt gcaaatcaat 1920
gcgtacgttt cctgagaagt gtctaaaaac accaaaaagg gatccgtaca ttcaatgttt 1980
atgcaaggaa ggaaagaaag aaggaagtga agagggagaa gggatggagg tcacactggt 2040
agaacgtaac cacggaaaag agcgcatcag gcctggcacg gtggctcagg cctataaccc 2100
cagctcccta ggagaccaag gcgggagcat ctcttgaggc caggagtttg agaccagcct 2160
gggcagcata gcaagacaca tccctacaaa aaattagaaa ttggctggat gtggtggcat 2220
acgcctgtag tcctagccac tcaggaggct gaggcaggag gattgcttga gcccaggagt 2280
tcgaggctgc agtcagtcat gatggcacca ctgcactcca gcctgggcaa cagagcaaga 2340
tcctgtcttt aaggaaaaaa agacaagatg agcataccag cagtccttga acattatcaa 2400
aaagttcagc atattagaat caccgggagg ccttgttaaa agagttcgct gggcccatct 2460
tcagagtctc tgagttgttg gtctggaata gagccaaatg ttttgtgtgt ctaacaattc 2520
ccaggtgctg ttgctgctgc tactattcca ggaacacact ttgagaacca ttgtgttatt 2580
gctctgcacg cccacccact ctcaactccc acgaaaaaaa tcaacttcca gagctaagat 2640
ttcggtggaa gtcctggttc catatctggt gcaagatctc ccctcacgaa tcagttgagt 2700
caacattcta gctcaacaac atcacacgat taacattaac gaaaattatt catttgggaa 2760
actatcagcc agttttcact tctgaagggg caggagagtg ttatgagaaa tcacggcagt 2820
tttcagcagg gtccagattc agattaaata actattttct gtcatttctg tgaccaacca 2880
catacaaaca gactcatctg tgcactctcc ccctccccct tcaggtatat gttttctgag 2940
taaagttgaa aagaatctca gaccagaaaa tatagatata tatttaaatc ttacttgagt 3000
agaactgatt acgacttttg ggtgttgagg ggtctataag atcaaaactt ttccatgata 3060
atactaagat gttatcgacc atttatctgt ccttctctca aaagtgtatg gtggaatttt 3120
ccagaagcta tgtgatacgt gatgatgtca tcactctgct gttaacatat aataaattta 3180
ttgctattgt ttataaaaga ataaatgata tttttt 3216
<210> 143
<211> 272
<212> PRT
<213> 智人
<400> 143
Met Asp Ser Tyr Leu Leu Met Trp Gly Leu Leu Thr Phe Ile Met Val
1 5 10 15
Pro Gly Cys Gln Ala Glu Leu Cys Asp Asp Asp Pro Pro Glu Ile Pro
20 25 30
His Ala Thr Phe Lys Ala Met Ala Tyr Lys Glu Gly Thr Met Leu Asn
35 40 45
Cys Glu Cys Lys Arg Gly Phe Arg Arg Ile Lys Ser Gly Ser Leu Tyr
50 55 60
Met Leu Cys Thr Gly Asn Ser Ser His Ser Ser Trp Asp Asn Gln Cys
65 70 75 80
Gln Cys Thr Ser Ser Ala Thr Arg Asn Thr Thr Lys Gln Val Thr Pro
85 90 95
Gln Pro Glu Glu Gln Lys Glu Arg Lys Thr Thr Glu Met Gln Ser Pro
100 105 110
Met Gln Pro Val Asp Gln Ala Ser Leu Pro Gly His Cys Arg Glu Pro
115 120 125
Pro Pro Trp Glu Asn Glu Ala Thr Glu Arg Ile Tyr His Phe Val Val
130 135 140
Gly Gln Met Val Tyr Tyr Gln Cys Val Gln Gly Tyr Arg Ala Leu His
145 150 155 160
Arg Gly Pro Ala Glu Ser Val Cys Lys Met Thr His Gly Lys Thr Arg
165 170 175
Trp Thr Gln Pro Gln Leu Ile Cys Thr Gly Glu Met Glu Thr Ser Gln
180 185 190
Phe Pro Gly Glu Glu Lys Pro Gln Ala Ser Pro Glu Gly Arg Pro Glu
195 200 205
Ser Glu Thr Ser Cys Leu Val Thr Thr Thr Asp Phe Gln Ile Gln Thr
210 215 220
Glu Met Ala Ala Thr Met Glu Thr Ser Ile Phe Thr Thr Glu Tyr Gln
225 230 235 240
Val Ala Val Ala Gly Cys Val Phe Leu Leu Ile Ser Val Leu Leu Leu
245 250 255
Ser Gly Leu Thr Trp Gln Arg Arg Gln Arg Lys Ser Arg Arg Thr Ile
260 265 270
<210> 144
<211> 3227
<212> DNA
<213> 智人
<400> 144
gctgggcaaa gccggtggca agggcctccc ctgccgctgt gccaggcagg cagtgccaaa 60
tccggggagc ctggagctgg ggggagggcc ggggacagcc cggccctgcc ccctcccccg 120
ctgggagccc agcaacttct gaggaaagtt tggcacccat ggcgtggcgg tgccccagga 180
tgggcagggt cccgctggcc tggtgcttgg cgctgtgcgg ctgggcgtgc atggccccca 240
ggggcacgca ggctgaagaa agtcccttcg tgggcaaccc agggaatatc acaggtgccc 300
ggggactcac gggcaccctt cggtgtcagc tccaggttca gggagagccc cccgaggtac 360
attggcttcg ggatggacag atcctggagc tcgcggacag cacccagacc caggtgcccc 420
tgggtgagga tgaacaggat gactggatag tggtcagcca gctcagaatc acctccctgc 480
agctttccga cacgggacag taccagtgtt tggtgtttct gggacatcag accttcgtgt 540
cccagcctgg ctatgttggg ctggagggct tgccttactt cctggaggag cccgaagaca 600
ggactgtggc cgccaacacc cccttcaacc tgagctgcca agctcaggga cccccagagc 660
ccgtggacct actctggctc caggatgctg tccccctggc cacggctcca ggtcacggcc 720
cccagcgcag cctgcatgtt ccagggctga acaagacatc ctctttctcc tgcgaagccc 780
ataacgccaa gggggtcacc acatcccgca cagccaccat cacagtgctc ccccagcagc 840
cccgtaacct ccacctggtc tcccgccaac ccacggagct ggaggtggct tggactccag 900
gcctgagcgg catctacccc ctgacccact gcaccctgca ggctgtgctg tcagacgatg 960
ggatgggcat ccaggcggga gaaccagacc ccccagagga gcccctcacc tcgcaagcat 1020
ccgtgccccc ccatcagctt cggctaggca gcctccatcc tcacacccct tatcacatcc 1080
gcgtggcatg caccagcagc cagggcccct catcctggac ccactggctt cctgtggaga 1140
cgccggaggg agtgcccctg ggccccccta agaacattag tgctacgcgg aatgggagcc 1200
aggccttcgt gcattggcaa gagccccggg cgcccctgca gggtaccctg ttagggtacc 1260
ggctggcgta tcaaggccag gacaccccag aggtgctaat ggacataggg ctaaggcaag 1320
aggtgaccct ggagctgcag ggggacgggt ctgtgtccaa tctgacagtg tgtgtggcag 1380
cctacactgc tgctggggat ggaccctgga gcctcccagt acccctggag gcctggcgcc 1440
cagtgaagga accttcaact cctgccttct cgtggccctg gtggtatgta ctgctaggag 1500
cagtcgtggc cgctgcctgt gtcctcatct tggctctctt ccttgtccac cggcgaaaga 1560
aggagacccg ttatggagaa gtgtttgaac caacagtgga aagaggtgaa ctggtagtca 1620
ggtaccgcgt gcgcaagtcc tacagtcgtc ggaccactga agctaccttg aacagcctgg 1680
gcatcagtga agagctgaag gagaagctgc gggatgtgat ggtggaccgg cacaaggtgg 1740
ccctggggaa gactctggga gagggagagt ttggagctgt gatggaaggc cagctcaacc 1800
aggacgactc catcctcaag gtggctgtga agacgatgaa gattgccatc tgcacgaggt 1860
cagagctgga ggatttcctg agtgaagcgg tctgcatgaa ggaatttgac catcccaacg 1920
tcatgaggct catcggtgtc tgtttccagg gttctgaacg agagagcttc ccagcacctg 1980
tggtcatctt acctttcatg aaacatggag acctacacag cttcctcctc tattcccggc 2040
tcggggacca gccagtgtac ctgcccactc agatgctagt gaagttcatg gcagacatcg 2100
ccagtggcat ggagtatctg agtaccaaga gattcataca ccgggacctg gcggccagga 2160
actgcatgct gaatgagaac atgtccgtgt gtgtggcgga cttcgggctc tccaagaaga 2220
tctacaatgg ggactactac cgccagggac gtatcgccaa gatgccagtc aagtggattg 2280
ccattgagag tctagctgac cgtgtctaca ccagcaagag cgatgtgtgg tccttcgggg 2340
tgacaatgtg ggagattgcc acaagaggcc aaaccccata tccgggcgtg gagaacagcg 2400
agatttatga ctatctgcgc cagggaaatc gcctgaagca gcctgcggac tgtctggatg 2460
gactgtatgc cttgatgtcg cggtgctggg agctaaatcc ccaggaccgg ccaagtttta 2520
cagagctgcg ggaagatttg gagaacacac tgaaggcctt gcctcctgcc caggagcctg 2580
acgaaatcct ctatgtcaac atggatgagg gtggaggtta tcctgaaccc cctggagctg 2640
caggaggagc tgacccccca acccagccag accctaagga ttcctgtagc tgcctcactg 2700
cggctgaggt ccatcctgct ggacgctatg tcctctgccc ttccacaacc cctagccccg 2760
ctcagcctgc tgataggggc tccccagcag ccccagggca ggaggatggt gcctgagaca 2820
accctccacc tggtactccc tctcaggatc caagctaagc actgccactg gggaaaactc 2880
caccttccca cttttccacc ccacgcctta tccccacttg cagccctgtc ttcctaccta 2940
tcccacctcc atcccagaca ggtccctccc cttctctgtg cagtagcatc accttgaaag 3000
cagtagcatc accatctgta aaaggaaggg gttggattgc aatatctgaa gccctcccag 3060
gtgttaacat tccaagactc tagagtccaa ggtttaaaga gtctagattc aaaggttcta 3120
ggtttcaaag atgctgtgag tctttggttc taaggacctg aaattccaaa gtctctaatt 3180
ctattaaagt gctaaggttc taaggcaaaa aaaaaaaaaa aaaaaaa 3227
<210> 145
<211> 885
<212> PRT
<213> 智人
<400> 145
Met Ala Trp Arg Cys Pro Arg Met Gly Arg Val Pro Leu Ala Trp Cys
1 5 10 15
Leu Ala Leu Cys Gly Trp Ala Cys Met Ala Pro Arg Gly Thr Gln Ala
20 25 30
Glu Glu Ser Pro Phe Val Gly Asn Pro Gly Asn Ile Thr Gly Ala Arg
35 40 45
Gly Leu Thr Gly Thr Leu Arg Cys Gln Leu Gln Val Gln Gly Glu Pro
50 55 60
Pro Glu Val His Trp Leu Arg Asp Gly Gln Ile Leu Glu Leu Ala Asp
65 70 75 80
Ser Thr Gln Thr Gln Val Pro Leu Gly Glu Asp Glu Gln Asp Asp Trp
85 90 95
Ile Val Val Ser Gln Leu Arg Ile Thr Ser Leu Gln Leu Ser Asp Thr
100 105 110
Gly Gln Tyr Gln Cys Leu Val Phe Leu Gly His Gln Thr Phe Val Ser
115 120 125
Gln Pro Gly Tyr Val Gly Leu Glu Gly Leu Pro Tyr Phe Leu Glu Glu
130 135 140
Pro Glu Asp Arg Thr Val Ala Ala Asn Thr Pro Phe Asn Leu Ser Cys
145 150 155 160
Gln Ala Gln Gly Pro Pro Glu Pro Val Asp Leu Leu Trp Leu Gln Asp
165 170 175
Ala Val Pro Leu Ala Thr Ala Pro Gly His Gly Pro Gln Arg Ser Leu
180 185 190
His Val Pro Gly Leu Asn Lys Thr Ser Ser Phe Ser Cys Glu Ala His
195 200 205
Asn Ala Lys Gly Val Thr Thr Ser Arg Thr Ala Thr Ile Thr Val Leu
210 215 220
Pro Gln Gln Pro Arg Asn Leu His Leu Val Ser Arg Gln Pro Thr Glu
225 230 235 240
Leu Glu Val Ala Trp Thr Pro Gly Leu Ser Gly Ile Tyr Pro Leu Thr
245 250 255
His Cys Thr Leu Gln Ala Val Leu Ser Asp Asp Gly Met Gly Ile Gln
260 265 270
Ala Gly Glu Pro Asp Pro Pro Glu Glu Pro Leu Thr Ser Gln Ala Ser
275 280 285
Val Pro Pro His Gln Leu Arg Leu Gly Ser Leu His Pro His Thr Pro
290 295 300
Tyr His Ile Arg Val Ala Cys Thr Ser Ser Gln Gly Pro Ser Ser Trp
305 310 315 320
Thr His Trp Leu Pro Val Glu Thr Pro Glu Gly Val Pro Leu Gly Pro
325 330 335
Pro Lys Asn Ile Ser Ala Thr Arg Asn Gly Ser Gln Ala Phe Val His
340 345 350
Trp Gln Glu Pro Arg Ala Pro Leu Gln Gly Thr Leu Leu Gly Tyr Arg
355 360 365
Leu Ala Tyr Gln Gly Gln Asp Thr Pro Glu Val Leu Met Asp Ile Gly
370 375 380
Leu Arg Gln Glu Val Thr Leu Glu Leu Gln Gly Asp Gly Ser Val Ser
385 390 395 400
Asn Leu Thr Val Cys Val Ala Ala Tyr Thr Ala Ala Gly Asp Gly Pro
405 410 415
Trp Ser Leu Pro Val Pro Leu Glu Ala Trp Arg Pro Val Lys Glu Pro
420 425 430
Ser Thr Pro Ala Phe Ser Trp Pro Trp Trp Tyr Val Leu Leu Gly Ala
435 440 445
Val Val Ala Ala Ala Cys Val Leu Ile Leu Ala Leu Phe Leu Val His
450 455 460
Arg Arg Lys Lys Glu Thr Arg Tyr Gly Glu Val Phe Glu Pro Thr Val
465 470 475 480
Glu Arg Gly Glu Leu Val Val Arg Tyr Arg Val Arg Lys Ser Tyr Ser
485 490 495
Arg Arg Thr Thr Glu Ala Thr Leu Asn Ser Leu Gly Ile Ser Glu Glu
500 505 510
Leu Lys Glu Lys Leu Arg Asp Val Met Val Asp Arg His Lys Val Ala
515 520 525
Leu Gly Lys Thr Leu Gly Glu Gly Glu Phe Gly Ala Val Met Glu Gly
530 535 540
Gln Leu Asn Gln Asp Asp Ser Ile Leu Lys Val Ala Val Lys Thr Met
545 550 555 560
Lys Ile Ala Ile Cys Thr Arg Ser Glu Leu Glu Asp Phe Leu Ser Glu
565 570 575
Ala Val Cys Met Lys Glu Phe Asp His Pro Asn Val Met Arg Leu Ile
580 585 590
Gly Val Cys Phe Gln Gly Ser Glu Arg Glu Ser Phe Pro Ala Pro Val
595 600 605
Val Ile Leu Pro Phe Met Lys His Gly Asp Leu His Ser Phe Leu Leu
610 615 620
Tyr Ser Arg Leu Gly Asp Gln Pro Val Tyr Leu Pro Thr Gln Met Leu
625 630 635 640
Val Lys Phe Met Ala Asp Ile Ala Ser Gly Met Glu Tyr Leu Ser Thr
645 650 655
Lys Arg Phe Ile His Arg Asp Leu Ala Ala Arg Asn Cys Met Leu Asn
660 665 670
Glu Asn Met Ser Val Cys Val Ala Asp Phe Gly Leu Ser Lys Lys Ile
675 680 685
Tyr Asn Gly Asp Tyr Tyr Arg Gln Gly Arg Ile Ala Lys Met Pro Val
690 695 700
Lys Trp Ile Ala Ile Glu Ser Leu Ala Asp Arg Val Tyr Thr Ser Lys
705 710 715 720
Ser Asp Val Trp Ser Phe Gly Val Thr Met Trp Glu Ile Ala Thr Arg
725 730 735
Gly Gln Thr Pro Tyr Pro Gly Val Glu Asn Ser Glu Ile Tyr Asp Tyr
740 745 750
Leu Arg Gln Gly Asn Arg Leu Lys Gln Pro Ala Asp Cys Leu Asp Gly
755 760 765
Leu Tyr Ala Leu Met Ser Arg Cys Trp Glu Leu Asn Pro Gln Asp Arg
770 775 780
Pro Ser Phe Thr Glu Leu Arg Glu Asp Leu Glu Asn Thr Leu Lys Ala
785 790 795 800
Leu Pro Pro Ala Gln Glu Pro Asp Glu Ile Leu Tyr Val Asn Met Asp
805 810 815
Glu Gly Gly Gly Tyr Pro Glu Pro Pro Gly Ala Ala Gly Gly Ala Asp
820 825 830
Pro Pro Thr Gln Pro Asp Pro Lys Asp Ser Cys Ser Cys Leu Thr Ala
835 840 845
Ala Glu Val His Pro Ala Gly Arg Tyr Val Leu Cys Pro Ser Thr Thr
850 855 860
Pro Ser Pro Ala Gln Pro Ala Asp Arg Gly Ser Pro Ala Ala Pro Gly
865 870 875 880
Gln Glu Asp Gly Ala
885
<210> 146
<211> 3630
<212> DNA
<213> 智人
<400> 146
atacgggaga actaaggctg aaacctcgga ggaacaacca cttttgaagt gacttcgcgg 60
cgtgcgttgg gtgcggacta ggtggccccg gcgggagtgt gctggagcct gaagtccacg 120
cgcgcggctg agaaccgccg ggaccgcacg tgggcgccgc gcgcttcccc cgcttcccag 180
gtgggcgccg gccgccaggc cacctcacgt ccggccccgg ggatgcgcgt cctcctcgcc 240
gcgctgggac tgctgttcct gggggcgcta cgagccttcc cacaggatcg acccttcgag 300
gacacctgtc atggaaaccc cagccactac tatgacaagg ctgtcaggag gtgctgttac 360
cgctgcccca tggggctgtt cccgacacag cagtgcccac agaggcctac tgactgcagg 420
aagcagtgtg agcctgacta ctacctggat gaggccgacc gctgtacagc ctgcgtgact 480
tgttctcgag atgacctcgt ggagaagacg ccgtgtgcat ggaactcctc ccgtgtctgc 540
gaatgtcgac ccggcatgtt ctgttccacg tctgccgtca actcctgtgc ccgctgcttc 600
ttccattctg tctgtccggc agggatgatt gtcaagttcc caggcacggc gcagaagaac 660
acggtctgtg agccggcttc cccaggggtc agccctgcct gtgccagccc agagaactgc 720
aaggaaccct ccagtggcac catcccccag gccaagccca ccccggtgtc cccagcaacc 780
tccagtgcca gcaccatgcc tgtaagaggg ggcacccgcc tcgcccagga agctgcttct 840
aaactgacga gggctcccga ctctccctcc tctgtgggaa ggcctagttc agatccaggt 900
ctgtccccaa cacagccatg cccagagggg tctggtgatt gcagaaagca gtgtgagccc 960
gactactacc tggacgaggc cggccgctgc acagcctgcg tgagctgttc tcgagatgac 1020
cttgtggaga agacgccatg tgcatggaac tcctcccgca cctgcgaatg tcgacctggc 1080
atgatctgtg ccacatcagc caccaactcc tgtgcccgct gtgtccccta cccaatctgt 1140
gcagcagaga cggtcaccaa gccccaggat atggctgaga aggacaccac ctttgaggcg 1200
ccacccctgg ggacccagcc ggactgcaac cccaccccag agaatggcga ggcgcctgcc 1260
agcaccagcc ccactcagag cttgctggtg gactcccagg ccagtaagac gctgcccatc 1320
ccaaccagcg ctcccgtcgc tctctcctcc acggggaagc ccgttctgga tgcagggcca 1380
gtgctcttct gggtgatcct ggtgttggtt gtggtggtcg gctccagcgc cttcctcctg 1440
tgccaccgga gggcctgcag gaagcgaatt cggcagaagc tccacctgtg ctacccggtc 1500
cagacctccc agcccaagct agagcttgtg gattccagac ccaggaggag ctcaacgcag 1560
ctgaggagtg gtgcgtcggt gacagaaccc gtcgcggaag agcgagggtt aatgagccag 1620
ccactgatgg agacctgcca cagcgtgggg gcagcctacc tggagagcct gccgctgcag 1680
gatgccagcc cggccggggg cccctcgtcc cccagggacc ttcctgagcc ccgggtgtcc 1740
acggagcaca ccaataacaa gattgagaaa atctacatca tgaaggctga caccgtgatc 1800
gtggggaccg tgaaggctga gctgccggag ggccggggcc tggcggggcc agcagagccc 1860
gagttggagg aggagctgga ggcggaccat accccccact accccgagca ggagacagaa 1920
ccgcctctgg gcagctgcag cgatgtcatg ctctcagtgg aagaggaagg gaaagaagac 1980
cccttgccca cagctgcctc tggaaagtga ggcctgggct gggctggggc taggagggca 2040
gcagggtggc ctctgggagg ccaggatggc actgttggca ccgaggttgg gggcagaggc 2100
ccatctggcc tgaactgagg ctccagcatc tagtggtgga ccggccggtc actgcagggg 2160
tctggtggtc tctgcttgca tccccaactt agctgtcccc tgacccagag cctaggggat 2220
ccggggcttg tacagaagag acagtccaag gggactggat cccagcagtg atgttggttg 2280
aggcagcaaa cagatggcag gatgggcact gccgagaaca gcattggtcc cagagccctg 2340
ggcatcagac cttaaccacc aggcccacag cccagcgagg gagaggtcgt gaggccagct 2400
cccggggccc ctgtaaccct actctcctct ctccctggac ctcagaggtg acacccattg 2460
ggcccttccg gcatgccccc agttactgta aatgtggccc ccagtgggca tggagccagt 2520
gcctgtggtt gtttctccag agtcaaaagg gaagtcgagg gatggggcgt cgtcagctgg 2580
cactgtctct gctgcagcgg ccacactgta ctctgcactg gtgtgagggc ccctgcctgg 2640
actgtgggac cctcctggtg ctgcccacct tccctgtcct gtagccccct cggtgggccc 2700
agggcctagg ggcccaggat caagtcactc atctcagaat gtccccacca atccccgcca 2760
cagcaggcgc ctcgggtccc agatgtctgc agccctcagc agctgcagac cgcccctcac 2820
caacccagag aacctgcttt actttgccca gggacttcct ccccatgtga acatggggaa 2880
cttcgggccc tgcctggagt ccttgaccgc tctctgtggg ccccacccac tctgtcctgg 2940
gaaatgaaga agcatcttcc ttaggtctgc cctgcttgca aatccactag caccgacccc 3000
accacctggt tccggctctg cacgctttgg ggtgtggatg tcgagaggca ccacggcctc 3060
acccaggcat ctgctttact ctggaccata ggaaacaaga ccgtttggag gtttcatcag 3120
gattttgggt ttttcacatt tcacgctaag gagtagtggc cctgacttcc ggtcggctgg 3180
ccagctgact ccctagggcc ttcagacgtg tatgcaaatg agtgatggat aaggatgagt 3240
cttggagttg cgggcagcct ggagactcgt ggacttaccg cctggaggca ggcccgggaa 3300
ggctgctgtt tactcatcgg gcagccacgt gctctctgga ggaagtgata gtttctgaaa 3360
ccgctcagat gttttgggga aagttggaga agccgtggcc ttgcgagagg tggttacacc 3420
agaacctgga cattggccag aagaagctta agtgggcaga cactgtttgc ccagtgtttg 3480
tgcaaggatg gagtgggtgt ctctgcatca cccacagccg cagctgtaag gcacgctgga 3540
aggcacacgc ctgccaggca gggcagtctg gcgcccatga tgggagggat tgacatgttt 3600
caacaaaata atgcacttcc ttaaaaaaaa 3630
<210> 147
<211> 595
<212> PRT
<213> 智人
<400> 147
Met Arg Val Leu Leu Ala Ala Leu Gly Leu Leu Phe Leu Gly Ala Leu
1 5 10 15
Arg Ala Phe Pro Gln Asp Arg Pro Phe Glu Asp Thr Cys His Gly Asn
20 25 30
Pro Ser His Tyr Tyr Asp Lys Ala Val Arg Arg Cys Cys Tyr Arg Cys
35 40 45
Pro Met Gly Leu Phe Pro Thr Gln Gln Cys Pro Gln Arg Pro Thr Asp
50 55 60
Cys Arg Lys Gln Cys Glu Pro Asp Tyr Tyr Leu Asp Glu Ala Asp Arg
65 70 75 80
Cys Thr Ala Cys Val Thr Cys Ser Arg Asp Asp Leu Val Glu Lys Thr
85 90 95
Pro Cys Ala Trp Asn Ser Ser Arg Val Cys Glu Cys Arg Pro Gly Met
100 105 110
Phe Cys Ser Thr Ser Ala Val Asn Ser Cys Ala Arg Cys Phe Phe His
115 120 125
Ser Val Cys Pro Ala Gly Met Ile Val Lys Phe Pro Gly Thr Ala Gln
130 135 140
Lys Asn Thr Val Cys Glu Pro Ala Ser Pro Gly Val Ser Pro Ala Cys
145 150 155 160
Ala Ser Pro Glu Asn Cys Lys Glu Pro Ser Ser Gly Thr Ile Pro Gln
165 170 175
Ala Lys Pro Thr Pro Val Ser Pro Ala Thr Ser Ser Ala Ser Thr Met
180 185 190
Pro Val Arg Gly Gly Thr Arg Leu Ala Gln Glu Ala Ala Ser Lys Leu
195 200 205
Thr Arg Ala Pro Asp Ser Pro Ser Ser Val Gly Arg Pro Ser Ser Asp
210 215 220
Pro Gly Leu Ser Pro Thr Gln Pro Cys Pro Glu Gly Ser Gly Asp Cys
225 230 235 240
Arg Lys Gln Cys Glu Pro Asp Tyr Tyr Leu Asp Glu Ala Gly Arg Cys
245 250 255
Thr Ala Cys Val Ser Cys Ser Arg Asp Asp Leu Val Glu Lys Thr Pro
260 265 270
Cys Ala Trp Asn Ser Ser Arg Thr Cys Glu Cys Arg Pro Gly Met Ile
275 280 285
Cys Ala Thr Ser Ala Thr Asn Ser Cys Ala Arg Cys Val Pro Tyr Pro
290 295 300
Ile Cys Ala Ala Glu Thr Val Thr Lys Pro Gln Asp Met Ala Glu Lys
305 310 315 320
Asp Thr Thr Phe Glu Ala Pro Pro Leu Gly Thr Gln Pro Asp Cys Asn
325 330 335
Pro Thr Pro Glu Asn Gly Glu Ala Pro Ala Ser Thr Ser Pro Thr Gln
340 345 350
Ser Leu Leu Val Asp Ser Gln Ala Ser Lys Thr Leu Pro Ile Pro Thr
355 360 365
Ser Ala Pro Val Ala Leu Ser Ser Thr Gly Lys Pro Val Leu Asp Ala
370 375 380
Gly Pro Val Leu Phe Trp Val Ile Leu Val Leu Val Val Val Val Gly
385 390 395 400
Ser Ser Ala Phe Leu Leu Cys His Arg Arg Ala Cys Arg Lys Arg Ile
405 410 415
Arg Gln Lys Leu His Leu Cys Tyr Pro Val Gln Thr Ser Gln Pro Lys
420 425 430
Leu Glu Leu Val Asp Ser Arg Pro Arg Arg Ser Ser Thr Gln Leu Arg
435 440 445
Ser Gly Ala Ser Val Thr Glu Pro Val Ala Glu Glu Arg Gly Leu Met
450 455 460
Ser Gln Pro Leu Met Glu Thr Cys His Ser Val Gly Ala Ala Tyr Leu
465 470 475 480
Glu Ser Leu Pro Leu Gln Asp Ala Ser Pro Ala Gly Gly Pro Ser Ser
485 490 495
Pro Arg Asp Leu Pro Glu Pro Arg Val Ser Thr Glu His Thr Asn Asn
500 505 510
Lys Ile Glu Lys Ile Tyr Ile Met Lys Ala Asp Thr Val Ile Val Gly
515 520 525
Thr Val Lys Ala Glu Leu Pro Glu Gly Arg Gly Leu Ala Gly Pro Ala
530 535 540
Glu Pro Glu Leu Glu Glu Glu Leu Glu Ala Asp His Thr Pro His Tyr
545 550 555 560
Pro Glu Gln Glu Thr Glu Pro Pro Leu Gly Ser Cys Ser Asp Val Met
565 570 575
Leu Ser Val Glu Glu Glu Gly Lys Glu Asp Pro Leu Pro Thr Ala Ala
580 585 590
Ser Gly Lys
595
<210> 148
<211> 3802
<212> DNA
<213> 智人
<220>
<221> misc_feature
<222> (1064)..(1065)
<223> n 是 a, c, g, 或 t
<220>
<221> misc_feature
<222> (1240)..(1240)
<223> n 是 a, c, g, 或 t
<220>
<221> misc_feature
<222> (1862)..(1862)
<223> n 是 a, c, g, 或 t
<220>
<221> misc_feature
<222> (1878)..(1878)
<223> n 是 a, c, g, 或 t
<220>
<221> misc_feature
<222> (1899)..(1900)
<223> n 是 a, c, g, 或 t
<220>
<221> misc_feature
<222> (1906)..(1906)
<223> n 是 a, c, g, 或 t
<220>
<221> misc_feature
<222> (1917)..(1917)
<223> n 是 a, c, g, 或 t
<220>
<221> misc_feature
<222> (1928)..(1929)
<223> n 是 a, c, g, 或 t
<220>
<221> misc_feature
<222> (2395)..(2395)
<223> n 是 a, c, g, 或 t
<400> 148
cccctgcagc tggcctcaat gttaagatct taaggggcac agcacagacc ttgtcttgtc 60
tatctccctg gcacctctca cagagtacag cttattaaca gatgttccag aaactgttac 120
tgaacaatcg gcttgatgct gtgggcttgt ctgcatcttg caactgtcac ctggctgaga 180
aatttcttct ataaataagc agtttctgtt tcagatgtga tatgccctga tatttacacc 240
ctgtctctta ccccatccaa gactcaaact tagaaacttg aattagatgt ggtattcaaa 300
tccttacgtg ccgcgaagac acagacagcc cccgtaagaa cccacgaagc aggcgaagtt 360
cattgttctc aacattctag ctgctcttgc tgcatttgct ctggaattct tgtagagata 420
ttacttgtcc ttccaggctg ttctttctgt agctcccttg ttttcttttt gtgatcatgt 480
tgcagatggc tgggcagtgc tcccaaaatg aatattttga cagtttgttg catgcttgca 540
taccttgtca acttcgatgt tcttctaata ctcctcctct aacatgtcag cgttattgta 600
atgcaagtaa gtaatattgc ttgaacgatt attcattggt gtgaactatt ctgtctatat 660
ggactgctta ttcagagaat caacataatg ggcatgatgg tgagttttct tgaatcaaaa 720
agagaaagga agcaaggcag tgattttaat gtttatggaa acaaagtaat tatttggaac 780
tgaacttgat atgattcagc actattagca acatagattt tttttaaaaa tcagctcttc 840
taattaagtg atatttagaa tttaaaagtc aatgttcatt aattaaggtg attgaatgga 900
aataatccat acttgattat tttgctatca aaacaatcca taattcatta tttttagcaa 960
aataatcaag tatgacagcc gggtgcggtg gatggctcac acctgtaatc ccagcacttt 1020
gggaggccga gatgggtgaa tcacctgagg tcggcagttt gagnncagcc tggccaacct 1080
ggtgaaaccc tgtctctact aaaaatacaa aaaattagct gggcatggtg gcacaggtct 1140
gtaatcccag ctactcggga ggctgaggca ggagaatcgt ttgaacttgg gaggtggagg 1200
ttgcagtgag ccgagatcgc gccactgcaa ctctagcctn ggcaacagag caagactttg 1260
tctcaaaata aataaataaa taataacaat aaagtatgtg aatattatgt tatcagctca 1320
ttatctgtct gatgttcttt tcataaaggt gtgaccaatt cagtgaaagg aacgaatgcg 1380
attctctgga cctgtttggg actgagctta ataatttctt tggcagtttt cgtgctaatg 1440
tttttgctaa ggaagataag ctctgaacca ttaaaggacg agtttaaaaa cacaggttgg 1500
tttgatggtg aatctttgaa atctatttcc aggggatggc tattgtgagt ttcagttcct 1560
tttctttttt tagcgttgac tatttcactt cgttacagcc ctttcgaatg tgttagaaca 1620
ttgttacatt aaatgaactt ggtagaggtg agccatcttc attctgattt tgacaccttg 1680
gcagattttc tacaatgtca gtcttctcca ggattcttcc actgttaatt actctattga 1740
aagtactaag gctttcttgg gaaacatcag tctctttgac taaagttagc acatcgatta 1800
aatgccacat tatcagaaac tcctagccag gtctgctact gtcaggaaaa gcatatttgt 1860
cnagatctat ggtcakgntt ttatacaaat ataggtgtnn yttgcntgag tgaacanttt 1920
actactgnna aaatgttaga aatgaataac cagttgctcc tgaattattt gaggaatcat 1980
ctaaaaaata attattttta agcaatagag aaccagtccc agaaaaatga atgttctact 2040
taagtgcctc ttaagataaa aaatacttct gcagcacctt tgctcatgat tggattccca 2100
agcatgtaca gccactgccc tatttctgta tgcatttatt tatttattta tttatttatt 2160
tatttagaga tggagtctcg ctctgtggcc caaggctgga gtgcagtggc gtgatctcaa 2220
ctcactgcag cctctgcctc ttgggttcaa acaattctcc catctcagcc tcctgagtaa 2280
ctggactata ggtatgtgcc atcacctccg actaattttt gtactttttg gtagagacag 2340
ggtttcatca tgttggccag gatggtctca agctcctgac cacaagtgat ctgcncgcct 2400
cagcttccca aagtgctggg attacaggcg tgagccacag ggcccagcac atactcattc 2460
ttttttactg aaaagatctg tttcaagctg ggtgttggtg gctatggagc tgtagtccga 2520
ctgctctgta ggctaacgtg ggaggattgc ttgagcccag agtttgaatg cagcctgggc 2580
aacacagtaa gaccccacct ctaaaaaatg aaaaaatctc tctcacattg ctttgagtcc 2640
cgatgtgtac tgctaagact ctcatgacca cattctctgt gaagtttggg ttaagttccg 2700
ttctacataa ttaggatcag gtctcctggg catggctaac attgacctgg aaaagagcag 2760
gactggtgat gaaattattc ttccgagagg cctcgagtac acggtggaag aatgcacctg 2820
tgaagactgc atcaagagca aaccgaaggt cgactctgac cattgctttc cactcccagc 2880
tatggaggaa ggcgcaacca ttcttgtcac cacgaaaacg aatgactatt gcaagagcct 2940
gccagctgct ttgagtgcta cggagataga gaaatcaatt tctgctaggt aattaaccat 3000
ttcgactcga gcagtgccac tttaaaaatc ttttgtcaga atagatgatg tgtcagatct 3060
ctttaggatg actgtatttt tcagttgccg atacagcttt ttgtcctcta actgtggaaa 3120
ctctttatgt tagatatatt tctctaggtt actgttggga gcttaatggt agaaacttcc 3180
ttggtttcat gattaaagtc tttttttttc ctgacatcta agtttttatt aacgtgagtt 3240
tttaaaaaca agcatgtata ccagtgtggg gggtgagggt gggagagaaa ggtgggaggg 3300
ggaaagaatt ctaacctatt gataataaag ctccagtttt ggccaggcgc ggtgctcatg 3360
cctgtaatcc cagcactttg aaaggccgag gcgggcagat tacctgaggt caggaatttg 3420
agaccagcct ggccaacatg gtgaaaccct gtctttacta aaaatacaaa aattagctgg 3480
gcatggtggt aggcacctgt aatcccagct actcaggagg ctgaggcagg agaatcgctt 3540
gaacctggga ggtggaggtt gcaatgagct gagatagcat ccctgcactc cagcctgggc 3600
aagagggtga gactccgtct caaaacaaaa caacacaaac aaacaaaaag tacctccagc 3660
ttcatcttct gctggatttt atagcgcccc caaagatatg tggtccttaa aaattgtata 3720
ccacttattc aggagtcttg ttcctgaaag ggttgttctt gttacagccc tagtctgggc 3780
tgtaatcagc ttcttaaggt cc 3802
<210> 149
<211> 184
<212> PRT
<213> 智人
<400> 149
Met Leu Gln Met Ala Gly Gln Cys Ser Gln Asn Glu Tyr Phe Asp Ser
1 5 10 15
Leu Leu His Ala Cys Ile Pro Cys Gln Leu Arg Cys Ser Ser Asn Thr
20 25 30
Pro Pro Leu Thr Cys Gln Arg Tyr Cys Asn Ala Ser Val Thr Asn Ser
35 40 45
Val Lys Gly Thr Asn Ala Ile Leu Trp Thr Cys Leu Gly Leu Ser Leu
50 55 60
Ile Ile Ser Leu Ala Val Phe Val Leu Met Phe Leu Leu Arg Lys Ile
65 70 75 80
Ser Ser Glu Pro Leu Lys Asp Glu Phe Lys Asn Thr Gly Ser Gly Leu
85 90 95
Leu Gly Met Ala Asn Ile Asp Leu Glu Lys Ser Arg Thr Gly Asp Glu
100 105 110
Ile Ile Leu Pro Arg Gly Leu Glu Tyr Thr Val Glu Glu Cys Thr Cys
115 120 125
Glu Asp Cys Ile Lys Ser Lys Pro Lys Val Asp Ser Asp His Cys Phe
130 135 140
Pro Leu Pro Ala Met Glu Glu Gly Ala Thr Ile Leu Val Thr Thr Lys
145 150 155 160
Thr Asn Asp Tyr Cys Lys Ser Leu Pro Ala Ala Leu Ser Ala Thr Glu
165 170 175
Ile Glu Lys Ser Ile Ser Ala Arg
180
<210> 150
<211> 2607
<212> DNA
<213> 智人
<400> 150
tctgttccca cttcctcccc gccccaggaa acctgccatg gcctcctggt gagctgtcct 60
catccactgc tcgctgcctc tccagatctt cagttgcttc aggccatttg aacgtatatg 120
agccggtcgt aggggatatg atggcttagc ttgggctcag aggcctgaaa atcgccccca 180
ccaatcacct gtttccccca atctaccctc ctgaaggtca ctgacaaaga cttcattgtc 240
tcctaggaga ggctgccata tatccagggc tgacgtaatt ccatcttaat atcagttaca 300
ttataaaaat ttacctcgtg cctgaggccc cagagcccaa gggtgcaaag cagtaattgg 360
tcaaagttca acttccctcc cactctgggc tcaggctgtc ccctgagggc ctgtgttttg 420
agtctctttc cagaaccttg gtgtgaactt aggtcttggc gtcgggatcc cttttcgtca 480
cactcaggtg acctacaggg agctccgctc gacactgcaa ggcttagacc agttcggtcc 540
aacagagaaa gcaggcaacc accatgtcat ttgaaaacag tttcatcggg atataattcg 600
caacccatac agtgaatcca tttaagatac tctgacccat ggatcccctg ggtgcagcca 660
agccacaatg gccatggcgc cgctgtctgg ccgcactgct atttcagctg ctggtggctg 720
tgtgtttctt ctcctacctg cgtgtgtccc gagacgatgc cactggatcc cctagggctc 780
ccagtgggtc ctcccgacag gacaccactc ccacccgccc caccctcctg atcctgctat 840
ggacatggcc tttccacatc cctgtggctc tgtcccgctg ttcagagatg gtgcccggca 900
cagccgactg ccacatcact gccgaccgca aggtgtaccc acaggcagac acggtcatcg 960
tgcaccactg ggatatcatg tccaacccta agtcacgcct cccaccttcc ccgaggccgc 1020
aggggcagcg ctggatctgg ttcaacttgg agccaccccc taactgccag cacctggaag 1080
ccctggacag atacttcaat ctcaccatgt cctaccgcag cgactccgac atcttcacgc 1140
cctacggctg gctggagccg tggtccggcc agcctgccca cccaccgctc aacctctcgg 1200
ccaagaccga gctggtggcc tgggcggtgt ccaactggaa gccggactca gccagggtgc 1260
gctactacca gagcctgcag gctcatctca aggtggacgt gtacggacgc tcccacaagc 1320
ccctgcccaa ggggaccatg atggagacgc tgtcccggta caagttctac ctggccttcg 1380
agaactcctt gcaccccgac tacatcaccg agaagctgtg gaggaacgcc ctggaggcct 1440
gggccgtgcc cgtggtgctg ggccccagca gaagcaacta cgagaggttc ctgccacccg 1500
acgccttcat ccacgtggac gacttccaga gccccaagga cctggcccgg tacctgcagg 1560
agctggacaa ggaccacgcc cgctacctga gctactttcg ctggcgggag acgctgcggc 1620
ctcgctcctt cagctgggca ctggatttct gcaaggcctg ctggaaactg cagcaggaat 1680
ccaggtacca gacggtgcgc agcatagcgg cttggttcac ctgagaggcc ggcatggtgc 1740
ctgggctgcc gggaacctca tctgcctggg gcctcacctg ctggagtcct ttgtggccaa 1800
ccctctctct tacctgggac ctcacacgct gggcttcacg gctgccagga gcctctcccc 1860
tccagaagac ttgcctgcta gggacctcgc ctgctgggga cctcgcctgt tggggacctc 1920
acctgctggg gacctcacct gctggggacc ttggctgctg gaggctgcac ctactgagga 1980
tgtcggcggt cggggacttt acctgctggg acctgctccc agagaccttg ccacactgaa 2040
tctcacctgc tggggacctc accctggagg gccctgggcc ctggggaact ggcttacttg 2100
gggccccacc cgggagtgat ggttctggct gatttgtttg tgatgttgtt agccgcctgt 2160
gaggggtgca gagagatcat cacggcacgg tttccagatg taatactgca aggaaaaatg 2220
atgacgtgtc tcctcactct agaggggttg gtcccatggg ttaagagctc accccaggtt 2280
ctcacctcag gggttaagag ctcagagttc agacaggtcc aagttcaagc ccaggaccac 2340
cacttatagg gtacaggtgg gatcgactgt aaatgaggac ttctggaaca ttccaaatat 2400
tctggggttg agggaaattg ctgctgtcta caaaatgcca agggtggaca ggcgctgtgg 2460
ctcacgcctg taatcccagc actttgggag gctgaggtag gaggattgat tgaggccaag 2520
agttaaagac cagcctggtc aatatagcaa gaccacgtct ctaaataaaa aataataggc 2580
cggccaggca aaaaaaaaaa aaaaaaa 2607
<210> 151
<211> 361
<212> PRT
<213> 智人
<400> 151
Met Asp Pro Leu Gly Ala Ala Lys Pro Gln Trp Pro Trp Arg Arg Cys
1 5 10 15
Leu Ala Ala Leu Leu Phe Gln Leu Leu Val Ala Val Cys Phe Phe Ser
20 25 30
Tyr Leu Arg Val Ser Arg Asp Asp Ala Thr Gly Ser Pro Arg Ala Pro
35 40 45
Ser Gly Ser Ser Arg Gln Asp Thr Thr Pro Thr Arg Pro Thr Leu Leu
50 55 60
Ile Leu Leu Trp Thr Trp Pro Phe His Ile Pro Val Ala Leu Ser Arg
65 70 75 80
Cys Ser Glu Met Val Pro Gly Thr Ala Asp Cys His Ile Thr Ala Asp
85 90 95
Arg Lys Val Tyr Pro Gln Ala Asp Thr Val Ile Val His His Trp Asp
100 105 110
Ile Met Ser Asn Pro Lys Ser Arg Leu Pro Pro Ser Pro Arg Pro Gln
115 120 125
Gly Gln Arg Trp Ile Trp Phe Asn Leu Glu Pro Pro Pro Asn Cys Gln
130 135 140
His Leu Glu Ala Leu Asp Arg Tyr Phe Asn Leu Thr Met Ser Tyr Arg
145 150 155 160
Ser Asp Ser Asp Ile Phe Thr Pro Tyr Gly Trp Leu Glu Pro Trp Ser
165 170 175
Gly Gln Pro Ala His Pro Pro Leu Asn Leu Ser Ala Lys Thr Glu Leu
180 185 190
Val Ala Trp Ala Val Ser Asn Trp Lys Pro Asp Ser Ala Arg Val Arg
195 200 205
Tyr Tyr Gln Ser Leu Gln Ala His Leu Lys Val Asp Val Tyr Gly Arg
210 215 220
Ser His Lys Pro Leu Pro Lys Gly Thr Met Met Glu Thr Leu Ser Arg
225 230 235 240
Tyr Lys Phe Tyr Leu Ala Phe Glu Asn Ser Leu His Pro Asp Tyr Ile
245 250 255
Thr Glu Lys Leu Trp Arg Asn Ala Leu Glu Ala Trp Ala Val Pro Val
260 265 270
Val Leu Gly Pro Ser Arg Ser Asn Tyr Glu Arg Phe Leu Pro Pro Asp
275 280 285
Ala Phe Ile His Val Asp Asp Phe Gln Ser Pro Lys Asp Leu Ala Arg
290 295 300
Tyr Leu Gln Glu Leu Asp Lys Asp His Ala Arg Tyr Leu Ser Tyr Phe
305 310 315 320
Arg Trp Arg Glu Thr Leu Arg Pro Arg Ser Phe Ser Trp Ala Leu Asp
325 330 335
Phe Cys Lys Ala Cys Trp Lys Leu Gln Gln Glu Ser Arg Tyr Gln Thr
340 345 350
Val Arg Ser Ile Ala Ala Trp Phe Thr
355 360
<210> 152
<211> 2371
<212> DNA
<213> 智人
<400> 152
tctcctcttg ctctaagcag ggtgtttgac cttctagtcg actgcgtccc ctgtacccgg 60
cgccagctgt gttcctgacc ccagaataac tcagggctgc accgggcctg gcagcgctcc 120
gcacacattt cctgtcgcgg cctaagggaa actgttggcc gctgggcccg cggggggatt 180
cttggcagtt ggggggtccg tcgggagcga gggcggaggg gaagggaggg ggaaccgggt 240
tggggaagcc agctgtagag ggcggtgacc gcgctccaga cacagctctg cgtcctcgag 300
cgggacagat ccaagttggg agcagctctg cgtgcggggc ctcagagaat gaggccggcg 360
ttcgccctgt gcctcctctg gcaggcgctc tggcccgggc cgggcggcgg cgaacacccc 420
actgccgacc gtgctggctg ctcggcctcg ggggcctgct acagcctgca ccacgctacc 480
atgaagcggc aggcggccga ggaggcctgc atcctgcgag gtggggcgct cagcaccgtg 540
cgtgcgggcg ccgagctgcg cgctgtgctc gcgctcctgc gggcaggccc agggcccgga 600
gggggctcca aagacctgct gttctgggtc gcactggagc gcaggcgttc ccactgcacc 660
ctggagaacg agcctttgcg gggtttctcc tggctgtcct ccgaccccgg cggtctcgaa 720
agcgacacgc tgcagtgggt ggaggagccc caacgctcct gcaccgcgcg gagatgcgcg 780
gtactccagg ccaccggtgg ggtcgagccc gcaggctgga aggagatgcg atgccacctg 840
cgcgccaacg gctacctgtg caagtaccag tttgaggtct tgtgtcctgc gccgcgcccc 900
ggggccgcct ctaacttgag ctatcgcgcg cccttccagc tgcacagcgc cgctctggac 960
ttcagtccac ctgggaccga ggtgagtgcg ctctgccggg gacagctccc gatctcagtt 1020
acttgcatcg cggacgaaat cggcgctcgc tgggacaaac tctcgggcga tgtgttgtgt 1080
ccctgccccg ggaggtacct ccgtgctggc aaatgcgcag agctccctaa ctgcctagac 1140
gacttgggag gctttgcctg cgaatgtgct acgggcttcg agctggggaa ggacggccgc 1200
tcttgtgtga ccagtgggga aggacagccg acccttgggg ggaccggggt gcccaccagg 1260
cgcccgccgg ccactgcaac cagccccgtg ccgcagagaa catggccaat cagggtcgac 1320
gagaagctgg gagagacacc acttgtccct gaacaagaca attcagtaac atctattcct 1380
gagattcctc gatggggatc acagagcacg atgtctaccc ttcaaatgtc ccttcaagcc 1440
gagtcaaagg ccactatcac cccatcaggg agcgtgattt ccaagtttaa ttctacgact 1500
tcctctgcca ctcctcaggc tttcgactcc tcctctgccg tggtcttcat atttgtgagc 1560
acagcagtag tagtgttggt gatcttgacc atgacagtac tggggcttgt caagctctgc 1620
tttcacgaaa gcccctcttc ccagccaagg aaggagtcta tgggcccgcc gggcctggag 1680
agtgatcctg agcccgctgc tttgggctcc agttctgcac attgcacaaa caatggggtg 1740
aaagtcgggg actgtgatct gcgggacaga gcagagggtg ccttgctggc ggagtcccct 1800
cttggctcta gtgatgcata gggaaacagg ggacatgggc actcctgtga acagtttttc 1860
acttttgatg aaacggggaa ccaagaggaa cttacttgtg taactgacaa tttctgcaga 1920
aatccccctt cctctaaatt ccctttactc cactgaggag ctaaatcaga actgcacact 1980
ccttccctga tgatagagga agtggaagtg cctttaggat ggtgatactg ggggaccggg 2040
tagtgctggg gagagatatt ttcttatgtt tattcggaga atttggagaa gtgattgaac 2100
ttttcaagac attggaaaca aatagaacac aatataattt acattaaaaa ataatttcta 2160
ccaaaatgga aaggaaatgt tctatgttgt tcaggctagg agtatattgg ttcgaaatcc 2220
cagggaaaaa aataaaaata aaaaattaaa ggattgttga taacccagac tcaaatatca 2280
ttgccttcct ccaggagtaa ttaggaacag ctgagggcat gctgggagta agcagcaaga 2340
gtgcattctg cttttagatt gagggagagg t 2371
<210> 153
<211> 490
<212> PRT
<213> 智人
<400> 153
Met Arg Pro Ala Phe Ala Leu Cys Leu Leu Trp Gln Ala Leu Trp Pro
1 5 10 15
Gly Pro Gly Gly Gly Glu His Pro Thr Ala Asp Arg Ala Gly Cys Ser
20 25 30
Ala Ser Gly Ala Cys Tyr Ser Leu His His Ala Thr Met Lys Arg Gln
35 40 45
Ala Ala Glu Glu Ala Cys Ile Leu Arg Gly Gly Ala Leu Ser Thr Val
50 55 60
Arg Ala Gly Ala Glu Leu Arg Ala Val Leu Ala Leu Leu Arg Ala Gly
65 70 75 80
Pro Gly Pro Gly Gly Gly Ser Lys Asp Leu Leu Phe Trp Val Ala Leu
85 90 95
Glu Arg Arg Arg Ser His Cys Thr Leu Glu Asn Glu Pro Leu Arg Gly
100 105 110
Phe Ser Trp Leu Ser Ser Asp Pro Gly Gly Leu Glu Ser Asp Thr Leu
115 120 125
Gln Trp Val Glu Glu Pro Gln Arg Ser Cys Thr Ala Arg Arg Cys Ala
130 135 140
Val Leu Gln Ala Thr Gly Gly Val Glu Pro Ala Gly Trp Lys Glu Met
145 150 155 160
Arg Cys His Leu Arg Ala Asn Gly Tyr Leu Cys Lys Tyr Gln Phe Glu
165 170 175
Val Leu Cys Pro Ala Pro Arg Pro Gly Ala Ala Ser Asn Leu Ser Tyr
180 185 190
Arg Ala Pro Phe Gln Leu His Ser Ala Ala Leu Asp Phe Ser Pro Pro
195 200 205
Gly Thr Glu Val Ser Ala Leu Cys Arg Gly Gln Leu Pro Ile Ser Val
210 215 220
Thr Cys Ile Ala Asp Glu Ile Gly Ala Arg Trp Asp Lys Leu Ser Gly
225 230 235 240
Asp Val Leu Cys Pro Cys Pro Gly Arg Tyr Leu Arg Ala Gly Lys Cys
245 250 255
Ala Glu Leu Pro Asn Cys Leu Asp Asp Leu Gly Gly Phe Ala Cys Glu
260 265 270
Cys Ala Thr Gly Phe Glu Leu Gly Lys Asp Gly Arg Ser Cys Val Thr
275 280 285
Ser Gly Glu Gly Gln Pro Thr Leu Gly Gly Thr Gly Val Pro Thr Arg
290 295 300
Arg Pro Pro Ala Thr Ala Thr Ser Pro Val Pro Gln Arg Thr Trp Pro
305 310 315 320
Ile Arg Val Asp Glu Lys Leu Gly Glu Thr Pro Leu Val Pro Glu Gln
325 330 335
Asp Asn Ser Val Thr Ser Ile Pro Glu Ile Pro Arg Trp Gly Ser Gln
340 345 350
Ser Thr Met Ser Thr Leu Gln Met Ser Leu Gln Ala Glu Ser Lys Ala
355 360 365
Thr Ile Thr Pro Ser Gly Ser Val Ile Ser Lys Phe Asn Ser Thr Thr
370 375 380
Ser Ser Ala Thr Pro Gln Ala Phe Asp Ser Ser Ser Ala Val Val Phe
385 390 395 400
Ile Phe Val Ser Thr Ala Val Val Val Leu Val Ile Leu Thr Met Thr
405 410 415
Val Leu Gly Leu Val Lys Leu Cys Phe His Glu Ser Pro Ser Ser Gln
420 425 430
Pro Arg Lys Glu Ser Met Gly Pro Pro Gly Leu Glu Ser Asp Pro Glu
435 440 445
Pro Ala Ala Leu Gly Ser Ser Ser Ala His Cys Thr Asn Asn Gly Val
450 455 460
Lys Val Gly Asp Cys Asp Leu Arg Asp Arg Ala Glu Gly Ala Leu Leu
465 470 475 480
Ala Glu Ser Pro Leu Gly Ser Ser Asp Ala
485 490
<210> 154
<211> 3973
<212> DNA
<213> 智人
<400> 154
gggctggggg agggtatata agccgagtag gcgacggtga ggtcgacgcc ggccaagaca 60
gcacagacag attgacctat tggggtgttt cgcgagtgtg agagggaagc gccgcggcct 120
gtatttctag acctgccctt cgcctggttc gtggcgcctt gtgaccccgg gcccctgccg 180
cctgcaagtc ggaaattgcg ctgtgctcct gtgctacggc ctgtggctgg actgcctgct 240
gctgcccaac tggctggcaa gatgaagctc tccctggtgg ccgcgatgct gctgctgctc 300
agcgcggcgc gggccgagga ggaggacaag aaggaggacg tgggcacggt ggtcggcatc 360
gacctgggga ccacctactc ctgcgtcggc gtgttcaaga acggccgcgt ggagatcatc 420
gccaacgatc agggcaaccg catcacgccg tcctatgtcg ccttcactcc tgaaggggaa 480
cgtctgattg gcgatgccgc caagaaccag ctcacctcca accccgagaa cacggtcttt 540
gacgccaagc ggctcatcgg ccgcacgtgg aatgacccgt ctgtgcagca ggacatcaag 600
ttcttgccgt tcaaggtggt tgaaaagaaa actaaaccat acattcaagt tgatattgga 660
ggtgggcaaa caaagacatt tgctcctgaa gaaatttctg ccatggttct cactaaaatg 720
aaagaaaccg ctgaggctta tttgggaaag aaggttaccc atgcagttgt tactgtacca 780
gcctatttta atgatgccca acgccaagca accaaagacg ctggaactat tgctggccta 840
aatgttatga ggatcatcaa cgagcctacg gcagctgcta ttgcttatgg cctggataag 900
agggaggggg agaagaacat cctggtgttt gacctgggtg gcggaacctt cgatgtgtct 960
cttctcacca ttgacaatgg tgtcttcgaa gttgtggcca ctaatggaga tactcatctg 1020
ggtggagaag actttgacca gcgtgtcatg gaacacttca tcaaactgta caaaaagaag 1080
acgggcaaag atgtcaggaa agacaataga gctgtgcaga aactccggcg cgaggtagaa 1140
aaggccaaac gggccctgtc ttctcagcat caagcaagaa ttgaaattga gtccttctat 1200
gaaggagaag acttttctga gaccctgact cgggccaaat ttgaagagct caacatggat 1260
ctgttccggt ctactatgaa gcccgtccag aaagtgttgg aagattctga tttgaagaag 1320
tctgatattg atgaaattgt tcttgttggt ggctcgactc gaattccaaa gattcagcaa 1380
ctggttaaag agttcttcaa tggcaaggaa ccatcccgtg gcataaaccc agatgaagct 1440
gtagcgtatg gtgctgctgt ccaggctggt gtgctctctg gtgatcaaga tacaggtgac 1500
ctggtactgc ttgatgtatg tccccttaca cttggtattg aaactgtggg aggtgtcatg 1560
accaaactga ttccaaggaa cacagtggtg cctaccaaga agtctcagat cttttctaca 1620
gcttctgata atcaaccaac tgttacaatc aaggtctatg aaggtgaaag acccctgaca 1680
aaagacaatc atcttctggg tacatttgat ctgactggaa ttcctcctgc tcctcgtggg 1740
gtcccacaga ttgaagtcac ctttgagata gatgtgaatg gtattcttcg agtgacagct 1800
gaagacaagg gtacagggaa caaaaataag atcacaatca ccaatgacca gaatcgcctg 1860
acacctgaag aaatcgaaag gatggttaat gatgctgaga agtttgctga ggaagacaaa 1920
aagctcaagg agcgcattga tactagaaat gagttggaaa gctatgccta ttctctaaag 1980
aatcagattg gagataaaga aaagctggga ggtaaacttt cctctgaaga taaggagacc 2040
atggaaaaag ctgtagaaga aaagattgaa tggctggaaa gccaccaaga tgctgacatt 2100
gaagacttca aagctaagaa gaaggaactg gaagaaattg ttcaaccaat tatcagcaaa 2160
ctctatggaa gtgcaggccc tcccccaact ggtgaagagg atacagcaga aaaagatgag 2220
ttgtagacac tgatctgcta gtgctgtaat attgtaaata ctggactcag gaacttttgt 2280
taggaaaaaa ttgaaagaac ttaagtctcg aatgtaattg gaatcttcac ctcagagtgg 2340
agttgaaact gctatagcct aagcggctgt ttactgcttt tcattagcag ttgctcacat 2400
gtctttgggt gggggggaga agaagaattg gccatcttaa aaagcgggta aaaaacctgg 2460
gttagggtgt gtgttcacct tcaaaatgtt ctatttaaca actgggtcat gtgcatctgg 2520
tgtaggaagt tttttctacc ataagtgaca ccaataaatg tttgttattt acactggtct 2580
aatgtttgtg agaagcttct aattagatca attacttatt ttaggaaatt taagactaga 2640
tactcgtgtg tggggtgagg ggagggagta tttggtatgt tgggataagg aaacacttct 2700
atttaatgct tccagggatt tttttttttt tttttaaccc tcctgggccc aagtgatcct 2760
tccacctcag tctcccagct aattgagacc acaggcttgt taccaccatg ctcggctttt 2820
gcattaatct aagaaaaggg gagagaagtt aatccacatc tttactcagg caaggggcat 2880
ttcacagtgc ccaagagtgg ggttttcttg aacatacttg gtttcctatt tccccttatc 2940
tttctaaaac tgcctttctg gtggcttttt ttaaaattat tactaatgat gcttttatag 3000
ctgcttggat tctctgagaa atgatgggga gtgagtgatc actggtatta actttataca 3060
cttggatttc atttgtaact ttaggatgta aaggtatatt gtgaacccta gctgtgtcag 3120
aatctccatc cctgaaattt ctcattagtg gtactggggt gggatcttgg atggtgacat 3180
tgaaactaca ctaaatcccc tcactatgaa tgggttgtta aaggcaatgg tttgtgtcaa 3240
aactggttta ggattactta gattgtgttc ctgaagaaaa gagtccaggt aaatggtatg 3300
atcaataaag gacaggctgg tgctaacata aaatccaata ttgtaatcct agcactttgg 3360
gaggccaagg cgggtggatc acaaggtcaa gagatagaga ccatctttgc caacatggtg 3420
aaactccatc tctactgaaa atacaaaaat tagctgggcg tggtagtgca agctgaaggc 3480
tgaggcagga gaatcactcg aacccgggag gcagaggttg cagtgagccg agatcacacc 3540
actgtactcc agcccggcac tccagcctgg cgacaagagt gagactccac ctcaaaaaaa 3600
aaaaaaagaa tccaatactg cccaaggata ggtattttat agatgggcaa ctggctgaaa 3660
ggttaattct ctagggctag tagaactgga tcccaacacc aaactcttaa ttagacctag 3720
gcctcagctg cactgcccga aaagcatttg ggcagaccct gagcagaata ctggtctcag 3780
gccaagccca atacagccat taaagatgac ctacagtgct gtgtaccctg gggcaatagg 3840
gttaaatggt agttagcaac tagggctagt cttcccttac ctcaaaggct ctcactaccg 3900
tggaccacct agtctgtaac tctttctgag gagctgttac tgaatattaa aaagatagac 3960
ttcaactatg aaa 3973
<210> 155
<211> 654
<212> PRT
<213> 智人
<400> 155
Met Lys Leu Ser Leu Val Ala Ala Met Leu Leu Leu Leu Ser Ala Ala
1 5 10 15
Arg Ala Glu Glu Glu Asp Lys Lys Glu Asp Val Gly Thr Val Val Gly
20 25 30
Ile Asp Leu Gly Thr Thr Tyr Ser Cys Val Gly Val Phe Lys Asn Gly
35 40 45
Arg Val Glu Ile Ile Ala Asn Asp Gln Gly Asn Arg Ile Thr Pro Ser
50 55 60
Tyr Val Ala Phe Thr Pro Glu Gly Glu Arg Leu Ile Gly Asp Ala Ala
65 70 75 80
Lys Asn Gln Leu Thr Ser Asn Pro Glu Asn Thr Val Phe Asp Ala Lys
85 90 95
Arg Leu Ile Gly Arg Thr Trp Asn Asp Pro Ser Val Gln Gln Asp Ile
100 105 110
Lys Phe Leu Pro Phe Lys Val Val Glu Lys Lys Thr Lys Pro Tyr Ile
115 120 125
Gln Val Asp Ile Gly Gly Gly Gln Thr Lys Thr Phe Ala Pro Glu Glu
130 135 140
Ile Ser Ala Met Val Leu Thr Lys Met Lys Glu Thr Ala Glu Ala Tyr
145 150 155 160
Leu Gly Lys Lys Val Thr His Ala Val Val Thr Val Pro Ala Tyr Phe
165 170 175
Asn Asp Ala Gln Arg Gln Ala Thr Lys Asp Ala Gly Thr Ile Ala Gly
180 185 190
Leu Asn Val Met Arg Ile Ile Asn Glu Pro Thr Ala Ala Ala Ile Ala
195 200 205
Tyr Gly Leu Asp Lys Arg Glu Gly Glu Lys Asn Ile Leu Val Phe Asp
210 215 220
Leu Gly Gly Gly Thr Phe Asp Val Ser Leu Leu Thr Ile Asp Asn Gly
225 230 235 240
Val Phe Glu Val Val Ala Thr Asn Gly Asp Thr His Leu Gly Gly Glu
245 250 255
Asp Phe Asp Gln Arg Val Met Glu His Phe Ile Lys Leu Tyr Lys Lys
260 265 270
Lys Thr Gly Lys Asp Val Arg Lys Asp Asn Arg Ala Val Gln Lys Leu
275 280 285
Arg Arg Glu Val Glu Lys Ala Lys Arg Ala Leu Ser Ser Gln His Gln
290 295 300
Ala Arg Ile Glu Ile Glu Ser Phe Tyr Glu Gly Glu Asp Phe Ser Glu
305 310 315 320
Thr Leu Thr Arg Ala Lys Phe Glu Glu Leu Asn Met Asp Leu Phe Arg
325 330 335
Ser Thr Met Lys Pro Val Gln Lys Val Leu Glu Asp Ser Asp Leu Lys
340 345 350
Lys Ser Asp Ile Asp Glu Ile Val Leu Val Gly Gly Ser Thr Arg Ile
355 360 365
Pro Lys Ile Gln Gln Leu Val Lys Glu Phe Phe Asn Gly Lys Glu Pro
370 375 380
Ser Arg Gly Ile Asn Pro Asp Glu Ala Val Ala Tyr Gly Ala Ala Val
385 390 395 400
Gln Ala Gly Val Leu Ser Gly Asp Gln Asp Thr Gly Asp Leu Val Leu
405 410 415
Leu Asp Val Cys Pro Leu Thr Leu Gly Ile Glu Thr Val Gly Gly Val
420 425 430
Met Thr Lys Leu Ile Pro Arg Asn Thr Val Val Pro Thr Lys Lys Ser
435 440 445
Gln Ile Phe Ser Thr Ala Ser Asp Asn Gln Pro Thr Val Thr Ile Lys
450 455 460
Val Tyr Glu Gly Glu Arg Pro Leu Thr Lys Asp Asn His Leu Leu Gly
465 470 475 480
Thr Phe Asp Leu Thr Gly Ile Pro Pro Ala Pro Arg Gly Val Pro Gln
485 490 495
Ile Glu Val Thr Phe Glu Ile Asp Val Asn Gly Ile Leu Arg Val Thr
500 505 510
Ala Glu Asp Lys Gly Thr Gly Asn Lys Asn Lys Ile Thr Ile Thr Asn
515 520 525
Asp Gln Asn Arg Leu Thr Pro Glu Glu Ile Glu Arg Met Val Asn Asp
530 535 540
Ala Glu Lys Phe Ala Glu Glu Asp Lys Lys Leu Lys Glu Arg Ile Asp
545 550 555 560
Thr Arg Asn Glu Leu Glu Ser Tyr Ala Tyr Ser Leu Lys Asn Gln Ile
565 570 575
Gly Asp Lys Glu Lys Leu Gly Gly Lys Leu Ser Ser Glu Asp Lys Glu
580 585 590
Thr Met Glu Lys Ala Val Glu Glu Lys Ile Glu Trp Leu Glu Ser His
595 600 605
Gln Asp Ala Asp Ile Glu Asp Phe Lys Ala Lys Lys Lys Glu Leu Glu
610 615 620
Glu Ile Val Gln Pro Ile Ile Ser Lys Leu Tyr Gly Ser Ala Gly Pro
625 630 635 640
Pro Pro Thr Gly Glu Glu Asp Thr Ala Glu Lys Asp Glu Leu
645 650
<210> 156
<211> 926
<212> DNA
<213> 智人
<400> 156
ccagagaggg gcaggcttgt cccctgacag gttgaagcaa gtagacgccc aggagccccg 60
ggagggggct gcagtttcct tccttccttc tcggcagcgc tccgcgcccc catcgcccct 120
cctgcgctag cggaggtgat cgccgcggcg atgccggagg agggttcggg ctgctcggtg 180
cggcgcaggc cctatgggtg cgtcctgcgg gctgctttgg tcccattggt cgcgggcttg 240
gtgatctgcc tcgtggtgtg catccagcgc ttcgcacagg ctcagcagca gctgccgctc 300
gagtcacttg ggtgggacgt agctgagctg cagctgaatc acacaggacc tcagcaggac 360
cccaggctat actggcaggg gggcccagca ctgggccgct ccttcctgca tggaccagag 420
ctggacaagg ggcagctacg tatccatcgt gatggcatct acatggtaca catccaggtg 480
acgctggcca tctgctcctc cacgacggcc tccaggcacc accccaccac cctggccgtg 540
ggaatctgct ctcccgcctc ccgtagcatc agcctgctgc gtctcagctt ccaccaaggt 600
tgtaccattg tctcccagcg cctgacgccc ctggcccgag gggacacact ctgcaccaac 660
ctcactggga cacttttgcc ttcccgaaac actgatgaga ccttctttgg agtgcagtgg 720
gtgcgcccct gaccactgct gctgattagg gttttttaaa ttttatttta ttttatttaa 780
gttcaagaga aaaagtgtac acacaggggc cacccggggt tggggtggga gtgtggtggg 840
gggtagtttg tggcaggaca agagaaggca ttgagctttt tctttcattt tcctattaaa 900
aaatacaaaa atcaaaacaa aaaaaa 926
<210> 157
<211> 193
<212> PRT
<213> 智人
<400> 157
Met Pro Glu Glu Gly Ser Gly Cys Ser Val Arg Arg Arg Pro Tyr Gly
1 5 10 15
Cys Val Leu Arg Ala Ala Leu Val Pro Leu Val Ala Gly Leu Val Ile
20 25 30
Cys Leu Val Val Cys Ile Gln Arg Phe Ala Gln Ala Gln Gln Gln Leu
35 40 45
Pro Leu Glu Ser Leu Gly Trp Asp Val Ala Glu Leu Gln Leu Asn His
50 55 60
Thr Gly Pro Gln Gln Asp Pro Arg Leu Tyr Trp Gln Gly Gly Pro Ala
65 70 75 80
Leu Gly Arg Ser Phe Leu His Gly Pro Glu Leu Asp Lys Gly Gln Leu
85 90 95
Arg Ile His Arg Asp Gly Ile Tyr Met Val His Ile Gln Val Thr Leu
100 105 110
Ala Ile Cys Ser Ser Thr Thr Ala Ser Arg His His Pro Thr Thr Leu
115 120 125
Ala Val Gly Ile Cys Ser Pro Ala Ser Arg Ser Ile Ser Leu Leu Arg
130 135 140
Leu Ser Phe His Gln Gly Cys Thr Ile Val Ser Gln Arg Leu Thr Pro
145 150 155 160
Leu Ala Arg Gly Asp Thr Leu Cys Thr Asn Leu Thr Gly Thr Leu Leu
165 170 175
Pro Ser Arg Asn Thr Asp Glu Thr Phe Phe Gly Val Gln Trp Val Arg
180 185 190
Pro
<210> 158
<211> 293
<212> PRT
<213> 人工序列
<220>
<223> 合成构建体
<400> 158
Met Gly Ile Leu Pro Phe Leu Leu Ile Pro Met Glu Ser Asn Trp Thr
1 5 10 15
Val His Val Phe Ser Arg Thr Leu Cys His Met Leu Leu Trp Thr Ala
20 25 30
Val Leu Asn Leu Ala Ala Gly Thr His Asp Leu Pro Lys Ala Val Val
35 40 45
Lys Leu Glu Pro Pro Trp Ile Gln Val Leu Lys Glu Asp Thr Val Thr
50 55 60
Leu Thr Cys Glu Gly Thr His Asn Pro Gly Asn Ser Ser Thr Gln Trp
65 70 75 80
Phe His Asn Gly Arg Ser Ile Arg Ser Gln Val Gln Ala Ser Tyr Thr
85 90 95
Phe Lys Ala Thr Val Asn Asp Ser Gly Glu Tyr Arg Cys Gln Met Glu
100 105 110
Gln Thr Arg Leu Ser Asp Pro Val Asp Leu Gly Val Ile Ser Asp Trp
115 120 125
Leu Leu Leu Gln Thr Pro Gln Leu Val Phe Leu Glu Gly Glu Thr Ile
130 135 140
Thr Leu Arg Cys His Ser Trp Arg Asn Lys Leu Leu Asn Arg Ile Ser
145 150 155 160
Phe Phe His Asn Glu Lys Ser Val Arg Tyr His His Tyr Ser Ser Asn
165 170 175
Phe Ser Ile Pro Lys Ala Asn His Ser His Ser Gly Asp Tyr Tyr Cys
180 185 190
Lys Gly Ser Leu Gly Arg Thr Gln His Gln Ser Lys Thr Val Thr Ile
195 200 205
Thr Val Gln Gly Pro Lys Ser Ser Arg Ser Leu Pro Val Leu Thr Ile
210 215 220
Val Ala Ala Val Thr Gly Ile Ala Val Ala Ala Ile Val Ile Ile Leu
225 230 235 240
Val Ser Leu Val Tyr Leu Lys Lys Lys Gln Val Pro Asp Asn Pro Pro
245 250 255
Asp Leu Glu Glu Ala Ala Lys Thr Glu Ala Glu Asn Thr Ile Thr Tyr
260 265 270
Ser Leu Leu Lys His Pro Glu Ala Leu Asp Glu Glu Thr Glu His Asp
275 280 285
Tyr Gln Asn His Ile
290
<210> 159
<211> 2880
<212> DNA
<213> 智人
<400> 159
tgctgctctc cgcccgcgtc cggctcgtgg ccccctactt cgggcaccat ggacacctcc 60
cggctcggtg tgctcctgtc cttgcctgtg ctgctgcagc tggcgaccgg gggcagctct 120
cccaggtctg gtgtgttgct gaggggctgc cccacacact gtcattgcga gcccgacggc 180
aggatgttgc tcagggtgga ctgctccgac ctggggctct cggagctgcc ttccaacctc 240
agcgtcttca cctcctacct agacctcagt atgaacaaca tcagtcagct gctcccgaat 300
cccctgccca gtctccgctt cctggaggag ttacgtcttg cgggaaacgc tctgacatac 360
attcccaagg gagcattcac tggcctttac agtcttaaag ttcttatgct gcagaataat 420
cagctaagac acgtacccac agaagctctg cagaatttgc gaagccttca atccctgcgt 480
ctggatgcta accacatcag ctatgtgccc ccaagctgtt tcagtggcct gcattccctg 540
aggcacctgt ggctggatga caatgcgtta acagaaatcc ccgtccaggc ttttagaagt 600
ttatcggcat tgcaagccat gaccttggcc ctgaacaaaa tacaccacat accagactat 660
gcctttggaa acctctccag cttggtagtt ctacatctcc ataacaatag aatccactcc 720
ctgggaaaga aatgctttga tgggctccac agcctagaga ctttagattt aaattacaat 780
aaccttgatg aattccccac tgcaattagg acactctcca accttaaaga actaggattt 840
catagcaaca atatcaggtc gatacctgag aaagcatttg taggcaaccc ttctcttatt 900
acaatacatt tctatgacaa tcccatccaa tttgttggga gatctgcttt tcaacattta 960
cctgaactaa gaacactgac tctgaatggt gcctcacaaa taactgaatt tcctgattta 1020
actggaactg caaacctgga gagtctgact ttaactggag cacagatctc atctcttcct 1080
caaaccgtct gcaatcagtt acctaatctc caagtgctag atctgtctta caacctatta 1140
gaagatttac ccagtttttc agtctgccaa aagcttcaga aaattgacct aagacataat 1200
gaaatctacg aaattaaagt tgacactttc cagcagttgc ttagcctccg atcgctgaat 1260
ttggcttgga acaaaattgc tattattcac cccaatgcat tttccacttt gccatcccta 1320
ataaagctgg acctatcgtc caacctcctg tcgtcttttc ctataactgg gttacatggt 1380
ttaactcact taaaattaac aggaaatcat gccttacaga gcttgatatc atctgaaaac 1440
tttccagaac tcaaggttat agaaatgcct tatgcttacc agtgctgtgc atttggagtg 1500
tgtgagaatg cctataagat ttctaatcaa tggaataaag gtgacaacag cagtatggac 1560
gaccttcata agaaagatgc tggaatgttt caggctcaag atgaacgtga ccttgaagat 1620
ttcctgcttg actttgagga agacctgaaa gcccttcatt cagtgcagtg ttcaccttcc 1680
ccaggcccct tcaaaccctg tgaacacctg cttgatggct ggctgatcag aattggagtg 1740
tggaccatag cagttctggc acttacttgt aatgctttgg tgacttcaac agttttcaga 1800
tcccctctgt acatttcccc cattaaactg ttaattgggg tcatcgcagc agtgaacatg 1860
ctcacgggag tctccagtgc cgtgctggct ggtgtggatg cgttcacttt tggcagcttt 1920
gcacgacatg gtgcctggtg ggagaatggg gttggttgcc atgtcattgg ttttttgtcc 1980
atttttgctt cagaatcatc tgttttcctg cttactctgg cagccctgga gcgtgggttc 2040
tctgtgaaat attctgcaaa atttgaaacg aaagctccat tttctagcct gaaagtaatc 2100
attttgctct gtgccctgct ggccttgacc atggccgcag ttcccctgct gggtggcagc 2160
aagtatggcg cctcccctct ctgcctgcct ttgccttttg gggagcccag caccatgggc 2220
tacatggtcg ctctcatctt gctcaattcc ctttgcttcc tcatgatgac cattgcctac 2280
accaagctct actgcaattt ggacaaggga gacctggaga atatttggga ctgctctatg 2340
gtaaaacaca ttgccctgtt gctcttcacc aactgcatcc taaactgccc tgtggctttc 2400
ttgtccttct cctctttaat aaaccttaca tttatcagtc ctgaagtaat taagtttatc 2460
cttctggtgg tagtcccact tcctgcatgt ctcaatcccc ttctctacat cttgttcaat 2520
cctcacttta aggaggatct ggtgagcctg agaaagcaaa cctacgtctg gacaagatca 2580
aaacacccaa gcttgatgtc aattaactct gatgatgtcg aaaaacagtc ctgtgactca 2640
actcaagcct tggtaacctt taccagctcc agcatcactt atgacctgcc tcccagttcc 2700
gtgccatcac cagcttatcc agtgactgag agctgccatc tttcctctgt ggcatttgtc 2760
ccatgtctct aattaatatg tgaaggaaaa tgttttcaaa ggttgagaac ctgaaaatgt 2820
gagattgagt atatcagagc agtaattaat aagaagagct gaggtgaaac tcggtttaaa 2880
<210> 160
<211> 907
<212> PRT
<213> 智人
<400> 160
Met Asp Thr Ser Arg Leu Gly Val Leu Leu Ser Leu Pro Val Leu Leu
1 5 10 15
Gln Leu Ala Thr Gly Gly Ser Ser Pro Arg Ser Gly Val Leu Leu Arg
20 25 30
Gly Cys Pro Thr His Cys His Cys Glu Pro Asp Gly Arg Met Leu Leu
35 40 45
Arg Val Asp Cys Ser Asp Leu Gly Leu Ser Glu Leu Pro Ser Asn Leu
50 55 60
Ser Val Phe Thr Ser Tyr Leu Asp Leu Ser Met Asn Asn Ile Ser Gln
65 70 75 80
Leu Leu Pro Asn Pro Leu Pro Ser Leu Arg Phe Leu Glu Glu Leu Arg
85 90 95
Leu Ala Gly Asn Ala Leu Thr Tyr Ile Pro Lys Gly Ala Phe Thr Gly
100 105 110
Leu Tyr Ser Leu Lys Val Leu Met Leu Gln Asn Asn Gln Leu Arg His
115 120 125
Val Pro Thr Glu Ala Leu Gln Asn Leu Arg Ser Leu Gln Ser Leu Arg
130 135 140
Leu Asp Ala Asn His Ile Ser Tyr Val Pro Pro Ser Cys Phe Ser Gly
145 150 155 160
Leu His Ser Leu Arg His Leu Trp Leu Asp Asp Asn Ala Leu Thr Glu
165 170 175
Ile Pro Val Gln Ala Phe Arg Ser Leu Ser Ala Leu Gln Ala Met Thr
180 185 190
Leu Ala Leu Asn Lys Ile His His Ile Pro Asp Tyr Ala Phe Gly Asn
195 200 205
Leu Ser Ser Leu Val Val Leu His Leu His Asn Asn Arg Ile His Ser
210 215 220
Leu Gly Lys Lys Cys Phe Asp Gly Leu His Ser Leu Glu Thr Leu Asp
225 230 235 240
Leu Asn Tyr Asn Asn Leu Asp Glu Phe Pro Thr Ala Ile Arg Thr Leu
245 250 255
Ser Asn Leu Lys Glu Leu Gly Phe His Ser Asn Asn Ile Arg Ser Ile
260 265 270
Pro Glu Lys Ala Phe Val Gly Asn Pro Ser Leu Ile Thr Ile His Phe
275 280 285
Tyr Asp Asn Pro Ile Gln Phe Val Gly Arg Ser Ala Phe Gln His Leu
290 295 300
Pro Glu Leu Arg Thr Leu Thr Leu Asn Gly Ala Ser Gln Ile Thr Glu
305 310 315 320
Phe Pro Asp Leu Thr Gly Thr Ala Asn Leu Glu Ser Leu Thr Leu Thr
325 330 335
Gly Ala Gln Ile Ser Ser Leu Pro Gln Thr Val Cys Asn Gln Leu Pro
340 345 350
Asn Leu Gln Val Leu Asp Leu Ser Tyr Asn Leu Leu Glu Asp Leu Pro
355 360 365
Ser Phe Ser Val Cys Gln Lys Leu Gln Lys Ile Asp Leu Arg His Asn
370 375 380
Glu Ile Tyr Glu Ile Lys Val Asp Thr Phe Gln Gln Leu Leu Ser Leu
385 390 395 400
Arg Ser Leu Asn Leu Ala Trp Asn Lys Ile Ala Ile Ile His Pro Asn
405 410 415
Ala Phe Ser Thr Leu Pro Ser Leu Ile Lys Leu Asp Leu Ser Ser Asn
420 425 430
Leu Leu Ser Ser Phe Pro Ile Thr Gly Leu His Gly Leu Thr His Leu
435 440 445
Lys Leu Thr Gly Asn His Ala Leu Gln Ser Leu Ile Ser Ser Glu Asn
450 455 460
Phe Pro Glu Leu Lys Val Ile Glu Met Pro Tyr Ala Tyr Gln Cys Cys
465 470 475 480
Ala Phe Gly Val Cys Glu Asn Ala Tyr Lys Ile Ser Asn Gln Trp Asn
485 490 495
Lys Gly Asp Asn Ser Ser Met Asp Asp Leu His Lys Lys Asp Ala Gly
500 505 510
Met Phe Gln Ala Gln Asp Glu Arg Asp Leu Glu Asp Phe Leu Leu Asp
515 520 525
Phe Glu Glu Asp Leu Lys Ala Leu His Ser Val Gln Cys Ser Pro Ser
530 535 540
Pro Gly Pro Phe Lys Pro Cys Glu His Leu Leu Asp Gly Trp Leu Ile
545 550 555 560
Arg Ile Gly Val Trp Thr Ile Ala Val Leu Ala Leu Thr Cys Asn Ala
565 570 575
Leu Val Thr Ser Thr Val Phe Arg Ser Pro Leu Tyr Ile Ser Pro Ile
580 585 590
Lys Leu Leu Ile Gly Val Ile Ala Ala Val Asn Met Leu Thr Gly Val
595 600 605
Ser Ser Ala Val Leu Ala Gly Val Asp Ala Phe Thr Phe Gly Ser Phe
610 615 620
Ala Arg His Gly Ala Trp Trp Glu Asn Gly Val Gly Cys His Val Ile
625 630 635 640
Gly Phe Leu Ser Ile Phe Ala Ser Glu Ser Ser Val Phe Leu Leu Thr
645 650 655
Leu Ala Ala Leu Glu Arg Gly Phe Ser Val Lys Tyr Ser Ala Lys Phe
660 665 670
Glu Thr Lys Ala Pro Phe Ser Ser Leu Lys Val Ile Ile Leu Leu Cys
675 680 685
Ala Leu Leu Ala Leu Thr Met Ala Ala Val Pro Leu Leu Gly Gly Ser
690 695 700
Lys Tyr Gly Ala Ser Pro Leu Cys Leu Pro Leu Pro Phe Gly Glu Pro
705 710 715 720
Ser Thr Met Gly Tyr Met Val Ala Leu Ile Leu Leu Asn Ser Leu Cys
725 730 735
Phe Leu Met Met Thr Ile Ala Tyr Thr Lys Leu Tyr Cys Asn Leu Asp
740 745 750
Lys Gly Asp Leu Glu Asn Ile Trp Asp Cys Ser Met Val Lys His Ile
755 760 765
Ala Leu Leu Leu Phe Thr Asn Cys Ile Leu Asn Cys Pro Val Ala Phe
770 775 780
Leu Ser Phe Ser Ser Leu Ile Asn Leu Thr Phe Ile Ser Pro Glu Val
785 790 795 800
Ile Lys Phe Ile Leu Leu Val Val Val Pro Leu Pro Ala Cys Leu Asn
805 810 815
Pro Leu Leu Tyr Ile Leu Phe Asn Pro His Phe Lys Glu Asp Leu Val
820 825 830
Ser Leu Arg Lys Gln Thr Tyr Val Trp Thr Arg Ser Lys His Pro Ser
835 840 845
Leu Met Ser Ile Asn Ser Asp Asp Val Glu Lys Gln Ser Cys Asp Ser
850 855 860
Thr Gln Ala Leu Val Thr Phe Thr Ser Ser Ser Ile Thr Tyr Asp Leu
865 870 875 880
Pro Pro Ser Ser Val Pro Ser Pro Ala Tyr Pro Val Thr Glu Ser Cys
885 890 895
His Leu Ser Ser Val Ala Phe Val Pro Cys Leu
900 905
<210> 161
<211> 3977
<212> DNA
<213> 智人
<400> 161
ggatggtgcc ttgagtgaat gacccccttg gagaacattc ttccgcatcc ctcgcctcaa 60
gccagcctca gacagaaaac tgaagattca gcagatccag tgcttcctgc tcctcttctg 120
cccaggaaca cgcttgcctt ccccaaggct tccagaagct ctgaggcagg aggcaccaag 180
ttctacctca tgtttggagg atcttgctag ctatggccct cgtactcggc tccctgttgc 240
tgctggggct gtgcgggaac tccttttcag gagggcagcc ttcatccaca gatgctccta 300
aggcttggaa ttatgaattg cctgcaacaa attatgagac ccaagactcc cataaagctg 360
gacccattgg cattctcttt gaactagtgc atatctttct ctatgtggta cagccgcgtg 420
atttcccaga agatactttg agaaaattct tacagaaggc atatgaatcc aaaattgatt 480
atgacaagcc agaaactgta atcttaggtc taaagattgt ctactatgaa gcagggatta 540
ttctatgctg tgtcctgggg ctgctgttta ttattctgat gcctctggtg gggtatttct 600
tttgtatgtg tcgttgctgt aacaaatgtg gtggagaaat gcaccagcga cagaaggaaa 660
atgggccctt cctgaggaaa tgctttgcaa tctccctgtt ggtgatttgt ataataataa 720
gcattggcat cttctatggt tttgtggcaa atcaccaggt aagaacccgg atcaaaagga 780
gtcggaaact ggcagatagc aatttcaagg acttgcgaac tctcttgaat gaaactccag 840
agcaaatcaa atatatattg gcccagtaca acactaccaa ggacaaggcg ttcacagatc 900
tgaacagtat caattcagtg ctaggaggcg gaattcttga ccgactgaga cccaacatca 960
tccctgttct tgatgagatt aagtccatgg caacagcgat caaggagacc aaagaggcgt 1020
tggagaacat gaacagcacc ttgaagagct tgcaccaaca aagtacacag cttagcagca 1080
gtctgaccag cgtgaaaact agcctgcggt catctctcaa tgaccctctg tgcttggtgc 1140
atccatcaag tgaaacctgc aacagcatca gattgtctct aagccagctg aatagcaacc 1200
ctgaactgag gcagcttcca cccgtggatg cagaacttga caacgttaat aacgttctta 1260
ggacagattt ggatggcctg gtccaacagg gctatcaatc ccttaatgat atacctgaca 1320
gagtacaacg ccaaaccacg actgtcgtag caggtatcaa aagggtcttg aattccattg 1380
gttcagatat cgacaatgta actcagcgtc ttcctattca ggatatactc tcagcattct 1440
ctgtttatgt taataacact gaaagttaca tccacagaaa tttacctaca ttggaagagt 1500
atgattcata ctggtggctg ggtggcctgg tcatctgctc tctgctgacc ctcatcgtga 1560
ttttttacta cctgggctta ctgtgtggcg tgtgcggcta tgacaggcat gccaccccga 1620
ccacccgagg ctgtgtctcc aacaccggag gcgtcttcct catggttgga gttggattaa 1680
gtttcctctt ttgctggata ttgatgatca ttgtggttct tacctttgtc tttggtgcaa 1740
atgtggaaaa actgatctgt gaaccttaca cgagcaagga attattccgg gttttggata 1800
caccctactt actaaatgaa gactgggaat actatctctc tgggaagcta tttaataaat 1860
caaaaatgaa gctcactttt gaacaagttt acagtgactg caaaaaaaat agaggcactt 1920
acggcactct tcacctgcag aacagcttca atatcagtga acatctcaac attaatgagc 1980
atactggaag cataagcagt gaattggaaa gtctgaaggt aaatcttaat atctttctgt 2040
tgggtgcagc aggaagaaaa aaccttcagg attttgctgc ttgtggaata gacagaatga 2100
attatgacag ctacttggct cagactggta aatcccccgc aggagtgaat cttttatcat 2160
ttgcatatga tctagaagca aaagcaaaca gtttgccccc aggaaatttg aggaactccc 2220
tgaaaagaga tgcacaaact attaaaacaa ttcaccagca acgagtcctt cctatagaac 2280
aatcactgag cactctatac caaagcgtca agatacttca acgcacaggg aatggattgt 2340
tggagagagt aactaggatt ctagcttctc tggattttgc tcagaacttc atcacaaaca 2400
atacttcctc tgttattatt gaggaaacta agaagtatgg gagaacaata ataggatatt 2460
ttgaacatta tctgcagtgg atcgagttct ctatcagtga gaaagtggca tcgtgcaaac 2520
ctgtggccac cgctctagat actgctgttg atgtctttct gtgtagctac attatcgacc 2580
ccttgaattt gttttggttt ggcataggaa aagctactgt atttttactt ccggctctaa 2640
tttttgcggt aaaactggct aagtactatc gtcgaatgga ttcggaggac gtgtacgatg 2700
atgttgaaac tatacccatg aaaaatatgg aaaatggtaa taatggttat cataaagatc 2760
atgtatatgg tattcacaat cctgttatga caagcccatc acaacattga tagctgatgt 2820
tgaaactgct tgagcatcag gatactcaaa gtggaaagga tcacagattt ttggtagttt 2880
ctgggtctac aaggactttc caaatccagg agcaacgcca gtggcaacgt agtgactcag 2940
gcgggcacca aggcaacggc accattggtc tctgggtagt gctttaagaa tgaacacaat 3000
cacgttatag tccatggtcc atcactattc aaggatgact ccctcccttc ctgtctattt 3060
ttgtttttta cttttttaca ctgagtttct atttagacac tacaacatat ggggtgtttg 3120
ttcccattgg atgcatttct atcaaaactc tatcaaatgt gatggctaga ttctaacata 3180
ttgccatgtg tggagtgtgc tgaacacaca ccagtttaca ggaaagatgc attttgtgta 3240
cagtaaacgg tgtatatacc ttttgttacc acagagtttt ttaaacaaat gagtattata 3300
ggactttctt ctaaatgagc taaataagtc accattgact tcttggtgct gttgaaaata 3360
atccattttc actaaaagtg tgtgaaacct acagcatatt cttcacgcag agattttcat 3420
ctattatact ttatcaaaga ttggccatgt tccacttgga aatggcatgc aaaagcaatc 3480
atagagaaac ctgcgtaact ccatctgaca aattcaaaag agagagagag atcttgagag 3540
agaaatgctg ttcgttcaaa agtggagttg ttttaacaga tgccaattac ggtgtacagt 3600
ttaacagagt tttctgttgc attaggataa acattaattg gagtgcagct aacatgagta 3660
tcatcagact agtatcaagt gttctaaaat gaaatatgag aagatcctgt cacaattctt 3720
agatctggtg tccagcatgg atgaaacctt tgagtttggt ccctaaattt gcatgaaagc 3780
acaaggtaaa tattcatttg cttcaggagt ttcatgttgg atctgtcatt atcaaaagtg 3840
atcagcaatg aagaactggt cggacaaaat ttaacgttga tgtaatgaaa ttccagatgt 3900
aggcattccc cccaggtctt ttcatgtgca gattgcagtt ctgattcatt tgaataaaaa 3960
ggaacttgga aaacatg 3977
<210> 162
<211> 865
<212> PRT
<213> 智人
<400> 162
Met Ala Leu Val Leu Gly Ser Leu Leu Leu Leu Gly Leu Cys Gly Asn
1 5 10 15
Ser Phe Ser Gly Gly Gln Pro Ser Ser Thr Asp Ala Pro Lys Ala Trp
20 25 30
Asn Tyr Glu Leu Pro Ala Thr Asn Tyr Glu Thr Gln Asp Ser His Lys
35 40 45
Ala Gly Pro Ile Gly Ile Leu Phe Glu Leu Val His Ile Phe Leu Tyr
50 55 60
Val Val Gln Pro Arg Asp Phe Pro Glu Asp Thr Leu Arg Lys Phe Leu
65 70 75 80
Gln Lys Ala Tyr Glu Ser Lys Ile Asp Tyr Asp Lys Pro Glu Thr Val
85 90 95
Ile Leu Gly Leu Lys Ile Val Tyr Tyr Glu Ala Gly Ile Ile Leu Cys
100 105 110
Cys Val Leu Gly Leu Leu Phe Ile Ile Leu Met Pro Leu Val Gly Tyr
115 120 125
Phe Phe Cys Met Cys Arg Cys Cys Asn Lys Cys Gly Gly Glu Met His
130 135 140
Gln Arg Gln Lys Glu Asn Gly Pro Phe Leu Arg Lys Cys Phe Ala Ile
145 150 155 160
Ser Leu Leu Val Ile Cys Ile Ile Ile Ser Ile Gly Ile Phe Tyr Gly
165 170 175
Phe Val Ala Asn His Gln Val Arg Thr Arg Ile Lys Arg Ser Arg Lys
180 185 190
Leu Ala Asp Ser Asn Phe Lys Asp Leu Arg Thr Leu Leu Asn Glu Thr
195 200 205
Pro Glu Gln Ile Lys Tyr Ile Leu Ala Gln Tyr Asn Thr Thr Lys Asp
210 215 220
Lys Ala Phe Thr Asp Leu Asn Ser Ile Asn Ser Val Leu Gly Gly Gly
225 230 235 240
Ile Leu Asp Arg Leu Arg Pro Asn Ile Ile Pro Val Leu Asp Glu Ile
245 250 255
Lys Ser Met Ala Thr Ala Ile Lys Glu Thr Lys Glu Ala Leu Glu Asn
260 265 270
Met Asn Ser Thr Leu Lys Ser Leu His Gln Gln Ser Thr Gln Leu Ser
275 280 285
Ser Ser Leu Thr Ser Val Lys Thr Ser Leu Arg Ser Ser Leu Asn Asp
290 295 300
Pro Leu Cys Leu Val His Pro Ser Ser Glu Thr Cys Asn Ser Ile Arg
305 310 315 320
Leu Ser Leu Ser Gln Leu Asn Ser Asn Pro Glu Leu Arg Gln Leu Pro
325 330 335
Pro Val Asp Ala Glu Leu Asp Asn Val Asn Asn Val Leu Arg Thr Asp
340 345 350
Leu Asp Gly Leu Val Gln Gln Gly Tyr Gln Ser Leu Asn Asp Ile Pro
355 360 365
Asp Arg Val Gln Arg Gln Thr Thr Thr Val Val Ala Gly Ile Lys Arg
370 375 380
Val Leu Asn Ser Ile Gly Ser Asp Ile Asp Asn Val Thr Gln Arg Leu
385 390 395 400
Pro Ile Gln Asp Ile Leu Ser Ala Phe Ser Val Tyr Val Asn Asn Thr
405 410 415
Glu Ser Tyr Ile His Arg Asn Leu Pro Thr Leu Glu Glu Tyr Asp Ser
420 425 430
Tyr Trp Trp Leu Gly Gly Leu Val Ile Cys Ser Leu Leu Thr Leu Ile
435 440 445
Val Ile Phe Tyr Tyr Leu Gly Leu Leu Cys Gly Val Cys Gly Tyr Asp
450 455 460
Arg His Ala Thr Pro Thr Thr Arg Gly Cys Val Ser Asn Thr Gly Gly
465 470 475 480
Val Phe Leu Met Val Gly Val Gly Leu Ser Phe Leu Phe Cys Trp Ile
485 490 495
Leu Met Ile Ile Val Val Leu Thr Phe Val Phe Gly Ala Asn Val Glu
500 505 510
Lys Leu Ile Cys Glu Pro Tyr Thr Ser Lys Glu Leu Phe Arg Val Leu
515 520 525
Asp Thr Pro Tyr Leu Leu Asn Glu Asp Trp Glu Tyr Tyr Leu Ser Gly
530 535 540
Lys Leu Phe Asn Lys Ser Lys Met Lys Leu Thr Phe Glu Gln Val Tyr
545 550 555 560
Ser Asp Cys Lys Lys Asn Arg Gly Thr Tyr Gly Thr Leu His Leu Gln
565 570 575
Asn Ser Phe Asn Ile Ser Glu His Leu Asn Ile Asn Glu His Thr Gly
580 585 590
Ser Ile Ser Ser Glu Leu Glu Ser Leu Lys Val Asn Leu Asn Ile Phe
595 600 605
Leu Leu Gly Ala Ala Gly Arg Lys Asn Leu Gln Asp Phe Ala Ala Cys
610 615 620
Gly Ile Asp Arg Met Asn Tyr Asp Ser Tyr Leu Ala Gln Thr Gly Lys
625 630 635 640
Ser Pro Ala Gly Val Asn Leu Leu Ser Phe Ala Tyr Asp Leu Glu Ala
645 650 655
Lys Ala Asn Ser Leu Pro Pro Gly Asn Leu Arg Asn Ser Leu Lys Arg
660 665 670
Asp Ala Gln Thr Ile Lys Thr Ile His Gln Gln Arg Val Leu Pro Ile
675 680 685
Glu Gln Ser Leu Ser Thr Leu Tyr Gln Ser Val Lys Ile Leu Gln Arg
690 695 700
Thr Gly Asn Gly Leu Leu Glu Arg Val Thr Arg Ile Leu Ala Ser Leu
705 710 715 720
Asp Phe Ala Gln Asn Phe Ile Thr Asn Asn Thr Ser Ser Val Ile Ile
725 730 735
Glu Glu Thr Lys Lys Tyr Gly Arg Thr Ile Ile Gly Tyr Phe Glu His
740 745 750
Tyr Leu Gln Trp Ile Glu Phe Ser Ile Ser Glu Lys Val Ala Ser Cys
755 760 765
Lys Pro Val Ala Thr Ala Leu Asp Thr Ala Val Asp Val Phe Leu Cys
770 775 780
Ser Tyr Ile Ile Asp Pro Leu Asn Leu Phe Trp Phe Gly Ile Gly Lys
785 790 795 800
Ala Thr Val Phe Leu Leu Pro Ala Leu Ile Phe Ala Val Lys Leu Ala
805 810 815
Lys Tyr Tyr Arg Arg Met Asp Ser Glu Asp Val Tyr Asp Asp Val Glu
820 825 830
Thr Ile Pro Met Lys Asn Met Glu Asn Gly Asn Asn Gly Tyr His Lys
835 840 845
Asp His Val Tyr Gly Ile His Asn Pro Val Met Thr Ser Pro Ser Gln
850 855 860
His
865
<210> 163
<211> 3858
<212> DNA
<213> 智人
<400> 163
ctactttatt ctgataaaac aggtctatgc agctaccagg acaatggaat ctacgttgac 60
tttagcaacg gaacaacctg ttaagaagaa cactcttaag aaatataaaa tagcttgcat 120
tgttcttctt gctttgctgg tgatcatgtc acttggatta ggcctggggc ttggactcag 180
gaaactggaa aagcaaggca gctgcaggaa gaagtgcttt gatgcatcat ttagaggact 240
ggagaactgc cggtgtgatg tggcatgtaa agaccgaggt gattgctgct gggattttga 300
agacacctgt gtggaatcaa ctcgaatatg gatgtgcaat aaatttcgtt gtggagagac 360
cagattagag gccagccttt gctcttgttc agatgactgt ttgcagaaga aagattgctg 420
tgctgactat aagagtgttt gccaaggaga aacctcatgg ctggaagaaa actgtgacac 480
agcccagcag tctcagtgcc cagaagggtt tgacctgcca ccagttatct tgttttctat 540
ggatggattt agagctgaat atttatacac atgggatact ttaatgccaa atatcaataa 600
actgaaaaca tgtggaattc attcaaaata catgagagct atgtatccta ccaaaacctt 660
cccaaatcat tacaccattg tcacgggctt gtatccagag tcacatggca tcattgacaa 720
taatatgtat gatgtaaatc tcaacaagaa tttttcactt tcttcaaagg aacaaaataa 780
tccagcctgg tggcatgggc aaccaatgtg gctgacagca atgtatcaag gtttaaaagc 840
cgctacctac ttttggcccg gatcagaagt ggctataaat ggctcctttc cttccatata 900
catgccttac aacggaagtg tcccatttga agagaggatt tctacactgt taaaatggct 960
ggacctgccc aaagctgaaa gacccaggtt ttataccatg tattttgaag aacctgattc 1020
ctctggacat gcaggtggac cagtcagtgc cagagtaatt aaagccttac aggtagtaga 1080
tcatgctttt gggatgttga tggaaggcct gaagcagcgg aatttgcaca actgtgtcaa 1140
tatcatcctt ctggctgacc atggaatgga ccagacttat tgtaacaaga tggaatacat 1200
gactgattat tttcccagaa taaacttctt ctacatgtac gaagggcctg ccccccgcat 1260
ccgagctcat aatatacctc atgacttttt tagttttaat tctgaggaaa ttgttagaaa 1320
cctcagttgc cgaaaacctg atcagcattt caagccctat ttgactcctg atttgccaaa 1380
gcgactgcac tatgccaaga acgtcagaat cgacaaagtt catctctttg tggatcaaca 1440
gtggctggct gttaggagta aatcaaatac aaattgtgga ggaggcaacc atggttataa 1500
caatgagttt aggagcatgg aggctatctt tctggcacat ggacccagtt ttaaagagaa 1560
gactgaagtt gaaccatttg aaaatattga agtctataac ctaatgtgtg atcttctacg 1620
cattcaacca gcaccaaaca atggaaccca tggtagttta aaccatcttc tgaaggtgcc 1680
tttttatgag ccatcccatg cagaggaggt gtcaaagttt tctgtttgtg gctttgctaa 1740
tccattgccc acagagtctc ttgactgttt ctgccctcac ctacaaaata gtactcagct 1800
ggaacaagtg aatcagatgc taaatctcac ccaagaagaa ataacagcaa cagtgaaagt 1860
aaatttgcca tttgggaggc ctagggtact gcagaagaac gtggaccact gtctccttta 1920
ccacagggaa tatgtcagtg gatttggaaa agctatgagg atgcccatgt ggagttcata 1980
cacagtcccc cagttgggag acacatcgcc tctgcctccc actgtcccag actgtctgcg 2040
ggctgatgtc agggttcctc cttctgagag ccaaaaatgt tccttctatt tagcagacaa 2100
gaatatcacc cacggcttcc tctatcctcc tgccagcaat agaacatcag atagccaata 2160
tgatgcttta attactagca atttggtacc tatgtatgaa gaattcagaa aaatgtggga 2220
ctacttccac agtgttcttc ttataaaaca tgccacagaa agaaatggag taaatgtggt 2280
tagtggacca atatttgatt ataattatga tggccatttt gatgctccag atgaaattac 2340
caaacattta gccaacactg atgttcccat cccaacacac tactttgtgg tgctgaccag 2400
ttgtaaaaac aagagccaca caccggaaaa ctgccctggg tggctggatg tcctaccctt 2460
tatcatccct caccgaccta ccaacgtgga gagctgtcct gaaggtaaac cagaagctct 2520
ttgggttgaa gaaagattta cagctcacat tgcccgggtc cgtgatgtag aacttctcac 2580
tgggcttgac ttctatcagg ataaagtgca gcctgtctct gaaattttgc aactaaagac 2640
atatttacca acatttgaaa ccactattta acttaataat gtctacttaa tatataattt 2700
actgtataaa gtaattttgg caaaatataa gtgatttttt ctggagaatt gtaaaataaa 2760
gttttctatt tttccttaaa aaaaaaaccg gaattccggg cttgggaggc tgaggcagga 2820
gactcgcttg aacccgggag gcagaggttg cagtgagcca agattgcgcc attgcactcc 2880
agagcctggg tgacagagca agactacatc tcaaaaaata aataaataaa ataaaagtaa 2940
caataaaaat aaaaagaaca gcagagagaa tgagcaagga gaaatgtcac aaactattgc 3000
aaaatactgt tacactgggt tggctctcca agaagatact ggaatctctt cagccatttg 3060
cttttcagaa gtagaaacca gcaaaccacc tctaagcgga gaacatacga ttctttatta 3120
agtagctctg gggaaggaaa gaataaaagt tgatagctcc ctgattggga aaaaatgcac 3180
aattaataaa gaatgaagat gaaagaaagc atgcttatgt tgtaacacaa aaaaaattca 3240
caaacgttgg tggaaggaaa acagtataga aaacattact ttaactaaaa gctggaaaaa 3300
ttttcagttg ggatgcgact gacaaaaaga acgggatttc caggcataaa gttggcgtga 3360
gctacagagg gcaccatgtg gctcagtgga agacccttca agattcaaag ttccatttga 3420
cagagcaaag gcacttcgca aggagaaggg tttaaattat gggtccaaaa gccaagtggt 3480
aaagcgagca atttgcagca taactgcttc tcctagacag ggctgagtgg gcaaaatacg 3540
acagtacaca cagtgactat tagccactgc cagaaacagg ctgaacagcc ctgggagaca 3600
agggaaggca ggtggtggga gttgttcatg gagagaaagg agagttttag aaccagcaca 3660
tccactggag atgctgggcc accagacccc tcccagtcaa taaagtctgg tgcctcattt 3720
gatctcagcc tcatcatgac cctggagaga ccctgatacc atctgccagt ccccgacagc 3780
ttaggcactc cttgccatca acctgacccc ccgagtggtt ctccaggctc cctgccccac 3840
ccattcaggc cggaattc 3858
<210> 164
<211> 875
<212> PRT
<213> 智人
<400> 164
Met Glu Ser Thr Leu Thr Leu Ala Thr Glu Gln Pro Val Lys Lys Asn
1 5 10 15
Thr Leu Lys Lys Tyr Lys Ile Ala Cys Ile Val Leu Leu Ala Leu Leu
20 25 30
Val Ile Met Ser Leu Gly Leu Gly Leu Gly Leu Gly Leu Arg Lys Leu
35 40 45
Glu Lys Gln Gly Ser Cys Arg Lys Lys Cys Phe Asp Ala Ser Phe Arg
50 55 60
Gly Leu Glu Asn Cys Arg Cys Asp Val Ala Cys Lys Asp Arg Gly Asp
65 70 75 80
Cys Cys Trp Asp Phe Glu Asp Thr Cys Val Glu Ser Thr Arg Ile Trp
85 90 95
Met Cys Asn Lys Phe Arg Cys Gly Glu Thr Arg Leu Glu Ala Ser Leu
100 105 110
Cys Ser Cys Ser Asp Asp Cys Leu Gln Lys Lys Asp Cys Cys Ala Asp
115 120 125
Tyr Lys Ser Val Cys Gln Gly Glu Thr Ser Trp Leu Glu Glu Asn Cys
130 135 140
Asp Thr Ala Gln Gln Ser Gln Cys Pro Glu Gly Phe Asp Leu Pro Pro
145 150 155 160
Val Ile Leu Phe Ser Met Asp Gly Phe Arg Ala Glu Tyr Leu Tyr Thr
165 170 175
Trp Asp Thr Leu Met Pro Asn Ile Asn Lys Leu Lys Thr Cys Gly Ile
180 185 190
His Ser Lys Tyr Met Arg Ala Met Tyr Pro Thr Lys Thr Phe Pro Asn
195 200 205
His Tyr Thr Ile Val Thr Gly Leu Tyr Pro Glu Ser His Gly Ile Ile
210 215 220
Asp Asn Asn Met Tyr Asp Val Asn Leu Asn Lys Asn Phe Ser Leu Ser
225 230 235 240
Ser Lys Glu Gln Asn Asn Pro Ala Trp Trp His Gly Gln Pro Met Trp
245 250 255
Leu Thr Ala Met Tyr Gln Gly Leu Lys Ala Ala Thr Tyr Phe Trp Pro
260 265 270
Gly Ser Glu Val Ala Ile Asn Gly Ser Phe Pro Ser Ile Tyr Met Pro
275 280 285
Tyr Asn Gly Ser Val Pro Phe Glu Glu Arg Ile Ser Thr Leu Leu Lys
290 295 300
Trp Leu Asp Leu Pro Lys Ala Glu Arg Pro Arg Phe Tyr Thr Met Tyr
305 310 315 320
Phe Glu Glu Pro Asp Ser Ser Gly His Ala Gly Gly Pro Val Ser Ala
325 330 335
Arg Val Ile Lys Ala Leu Gln Val Val Asp His Ala Phe Gly Met Leu
340 345 350
Met Glu Gly Leu Lys Gln Arg Asn Leu His Asn Cys Val Asn Ile Ile
355 360 365
Leu Leu Ala Asp His Gly Met Asp Gln Thr Tyr Cys Asn Lys Met Glu
370 375 380
Tyr Met Thr Asp Tyr Phe Pro Arg Ile Asn Phe Phe Tyr Met Tyr Glu
385 390 395 400
Gly Pro Ala Pro Arg Ile Arg Ala His Asn Ile Pro His Asp Phe Phe
405 410 415
Ser Phe Asn Ser Glu Glu Ile Val Arg Asn Leu Ser Cys Arg Lys Pro
420 425 430
Asp Gln His Phe Lys Pro Tyr Leu Thr Pro Asp Leu Pro Lys Arg Leu
435 440 445
His Tyr Ala Lys Asn Val Arg Ile Asp Lys Val His Leu Phe Val Asp
450 455 460
Gln Gln Trp Leu Ala Val Arg Ser Lys Ser Asn Thr Asn Cys Gly Gly
465 470 475 480
Gly Asn His Gly Tyr Asn Asn Glu Phe Arg Ser Met Glu Ala Ile Phe
485 490 495
Leu Ala His Gly Pro Ser Phe Lys Glu Lys Thr Glu Val Glu Pro Phe
500 505 510
Glu Asn Ile Glu Val Tyr Asn Leu Met Cys Asp Leu Leu Arg Ile Gln
515 520 525
Pro Ala Pro Asn Asn Gly Thr His Gly Ser Leu Asn His Leu Leu Lys
530 535 540
Val Pro Phe Tyr Glu Pro Ser His Ala Glu Glu Val Ser Lys Phe Ser
545 550 555 560
Val Cys Gly Phe Ala Asn Pro Leu Pro Thr Glu Ser Leu Asp Cys Phe
565 570 575
Cys Pro His Leu Gln Asn Ser Thr Gln Leu Glu Gln Val Asn Gln Met
580 585 590
Leu Asn Leu Thr Gln Glu Glu Ile Thr Ala Thr Val Lys Val Asn Leu
595 600 605
Pro Phe Gly Arg Pro Arg Val Leu Gln Lys Asn Val Asp His Cys Leu
610 615 620
Leu Tyr His Arg Glu Tyr Val Ser Gly Phe Gly Lys Ala Met Arg Met
625 630 635 640
Pro Met Trp Ser Ser Tyr Thr Val Pro Gln Leu Gly Asp Thr Ser Pro
645 650 655
Leu Pro Pro Thr Val Pro Asp Cys Leu Arg Ala Asp Val Arg Val Pro
660 665 670
Pro Ser Glu Ser Gln Lys Cys Ser Phe Tyr Leu Ala Asp Lys Asn Ile
675 680 685
Thr His Gly Phe Leu Tyr Pro Pro Ala Ser Asn Arg Thr Ser Asp Ser
690 695 700
Gln Tyr Asp Ala Leu Ile Thr Ser Asn Leu Val Pro Met Tyr Glu Glu
705 710 715 720
Phe Arg Lys Met Trp Asp Tyr Phe His Ser Val Leu Leu Ile Lys His
725 730 735
Ala Thr Glu Arg Asn Gly Val Asn Val Val Ser Gly Pro Ile Phe Asp
740 745 750
Tyr Asn Tyr Asp Gly His Phe Asp Ala Pro Asp Glu Ile Thr Lys His
755 760 765
Leu Ala Asn Thr Asp Val Pro Ile Pro Thr His Tyr Phe Val Val Leu
770 775 780
Thr Ser Cys Lys Asn Lys Ser His Thr Pro Glu Asn Cys Pro Gly Trp
785 790 795 800
Leu Asp Val Leu Pro Phe Ile Ile Pro His Arg Pro Thr Asn Val Glu
805 810 815
Ser Cys Pro Glu Gly Lys Pro Glu Ala Leu Trp Val Glu Glu Arg Phe
820 825 830
Thr Ala His Ile Ala Arg Val Arg Asp Val Glu Leu Leu Thr Gly Leu
835 840 845
Asp Phe Tyr Gln Asp Lys Val Gln Pro Val Ser Glu Ile Leu Gln Leu
850 855 860
Lys Thr Tyr Leu Pro Thr Phe Glu Thr Thr Ile
865 870 875
<210> 165
<211> 585
<212> DNA
<213> 智人
<400> 165
caacgcagag ttgggagcaa ctccagagcc tccttcaaga tgctgctggt cctgctctca 60
gtggtccttc tggctctgag ctcagctcag agcacagata atgatgtgaa ctatgaagac 120
tttactttca ccataccaga tgtagaggac tcaagtcaga gaccagatca gggaccccag 180
agacctcctc ctgaaggact cctacctaga ccccctggtg atagtggtaa ccaagatgat 240
ggtcctcagc agagaccacc aaaaccagga ggccatcacc gccatcctcc cccacctcct 300
tttcaaaatc agcaacgacc accccgacga ggacaccgtc aactctctct accccgattt 360
ccttctgtca gcctgcagga agcatcatca ttcttccaga gggacagacc agcaagacat 420
ccccaggagc aaccactctg gtaatctaga attcagtggc agaaaataaa taagaagata 480
acttccttca gaaagccatg acattgaaat aatgtggtca taactctttc ttcagtatac 540
caataaaata ttaatagcat gcaaaaaaaa aaaaaaaaaa aaaaa 585
<210> 166
<211> 134
<212> PRT
<213> 智人
<400> 166
Met Leu Leu Val Leu Leu Ser Val Val Leu Leu Ala Leu Ser Ser Ala
1 5 10 15
Gln Ser Thr Asp Asn Asp Val Asn Tyr Glu Asp Phe Thr Phe Thr Ile
20 25 30
Pro Asp Val Glu Asp Ser Ser Gln Arg Pro Asp Gln Gly Pro Gln Arg
35 40 45
Pro Pro Pro Glu Gly Leu Leu Pro Arg Pro Pro Gly Asp Ser Gly Asn
50 55 60
Gln Asp Asp Gly Pro Gln Gln Arg Pro Pro Lys Pro Gly Gly His His
65 70 75 80
Arg His Pro Pro Pro Pro Pro Phe Gln Asn Gln Gln Arg Pro Pro Arg
85 90 95
Arg Gly His Arg Gln Leu Ser Leu Pro Arg Phe Pro Ser Val Ser Leu
100 105 110
Gln Glu Ala Ser Ser Phe Phe Gln Arg Asp Arg Pro Ala Arg His Pro
115 120 125
Gln Glu Gln Pro Leu Trp
130
<210> 167
<211> 3850
<212> DNA
<213> 智人
<400> 167
gaccagagag aagcgtgggg aagagtgggc tgagggactc cactagaggc tgtccatctg 60
gattccctgc ctccctagga gcccaacaga gcaaagcaag tgggcacaag gagtatggtt 120
ctaacgtgat tggggtcatg aagacgttgc tgttggactt ggctttgtgg tcactgctct 180
tccagcccgg gtggctgtcc tttagttccc aggtgagtca gaactgccac aatggcagct 240
atgaaatcag cgtcctgatg atgggcaact cagcctttgc agagcccctg aaaaacttgg 300
aagatgcggt gaatgagggg ctggaaatag tgagaggacg tctgcaaaat gctggcctaa 360
atgtgactgt gaacgctact ttcatgtatt cggatggtct gattcataac tcaggcgact 420
gccggagtag cacctgtgaa ggcctcgacc tactcaggaa aatttcaaat gcacaacgga 480
tgggctgtgt cctcataggg ccctcatgta catactccac cttccagatg taccttgaca 540
cagaattgag ctaccccatg atctcagctg gaagttttgg attgtcatgt gactataaag 600
aaaccttaac caggctgatg tctccagcta gaaagttgat gtacttcttg gttaactttt 660
ggaaaaccaa cgatctgccc ttcaaaactt attcctggag cacttcgtat gtttacaaga 720
atggtacaga aactgaggac tgtttctggt accttaatgc tctggaggct agcgtttcct 780
atttctccca cgaactcggc tttaaggtgg tgttaagaca agataaggag tttcaggata 840
tcttaatgga ccacaacagg aaaagcaatg tgattattat gtgtggtggt ccagagttcc 900
tctacaagct gaagggtgac cgagcagtgg ctgaagacat tgtcattatt ctagtggatc 960
ttttcaatga ccagtacttt gaggacaatg tcacagcccc tgactatatg aaaaatgtcc 1020
ttgttctgac gctgtctcct gggaattccc ttctaaatag ctctttctcc aggaatctat 1080
caccaacaaa acgagacttt gctcttgcct atttgaatgg aatcctgctc tttggacata 1140
tgctgaagat atttcttgaa aatggagaaa atattaccac ccccaaattt gctcatgctt 1200
tcaggaatct cacttttgaa gggtatgacg gtccagtgac cttggatgac tggggggatg 1260
ttgacagtac catggtgctt ctgtatacct ctgtggacac caagaaatac aaggttcttt 1320
tgacctatga tacccacgta aataagacct atcctgtgga tatgagcccc acattcactt 1380
ggaagaactc taaacttcct aatgatatta caggccgggg ccctcagatc ctgatgattg 1440
cagtcttcac cctcactgga gctgtggtgc tgctcctgct cgtcgctctc ctgatgctca 1500
gaaaatatag aaaagattat gaacttcgtc agaaaaaatg gtcccacatt cctcctgaaa 1560
atatctttcc tctggagacc aatgagacca atcatgttag cctcaagatc gatgatgaca 1620
aaagacgaga tacaatccag agactacgac agtgcaaata cgacaaaaag cgagtgattc 1680
tcaaagatct caagcacaat gatggtaatt tcactgaaaa acagaagata gaattgaaca 1740
agttgcttca gattgactat tacaacctga ccaagttcta cggcacagtg aaacttgata 1800
ccatgatctt cggggtgata gaatactgtg agagaggatc cctccgggaa gttttaaatg 1860
acacaatttc ctaccctgat ggcacattca tggattggga gtttaagatc tctgtcttgt 1920
atgacattgc taagggaatg tcatatctgc actccagtaa gacagaagtc catggtcgtc 1980
tgaaatctac caactgcgta gtggacagta gaatggtggt gaagatcact gattttggct 2040
gcaattccat tttacctcca aaaaaggacc tgtggacagc tccagagcac ctccgccaag 2100
ccaacatctc tcagaaagga gatgtgtaca gctatgggat catcgcacag gagatcatcc 2160
tgcggaaaga aaccttctac actttgagct gtcgggaccg gaatgagaag attttcagag 2220
tggaaaattc caatggaatg aaacccttcc gcccagattt attcttggaa acagcagagg 2280
aaaaagagct agaagtgtac ctacttgtaa aaaactgttg ggaggaagat ccagaaaaga 2340
gaccagattt caaaaaaatt gagactacac ttgccaagat atttggactt tttcatgacc 2400
aaaaaaatga aagctatatg gataccttga tccgacgtct acagctatat tctcgaaacc 2460
tggaacatct ggtagaggaa aggacacagc tgtacaaggc agagagggac agggctgaca 2520
gacttaactt tatgttgctt ccaaggctag tggtaaagtc tctgaaggag aaaggctttg 2580
tggagccgga actatatgag gaagttacaa tctacttcag tgacattgta ggtttcacta 2640
ctatctgcaa atacagcacc cccatggaag tggtggacat gcttaatgac atctataaga 2700
gttttgacca cattgttgat catcatgatg tctacaaggt ggaaaccatc ggtgatgcgt 2760
acatggtggc tagtggtttg cctaagagaa atggcaatcg gcatgcaata gacattgcca 2820
agatggcctt ggaaatcctc agcttcatgg ggacctttga gctggagcat cttcctggcc 2880
tcccaatatg gattcgcatt ggagttcact ctggtccctg tgctgctgga gttgtgggaa 2940
tcaagatgcc tcgttattgt ctatttggag atacggtcaa cacagcctct aggatggaat 3000
ccactggcct ccctttgaga attcacgtga gtggctccac catagccatc ctgaagagaa 3060
ctgagtgcca gttcctttat gaagtgagag gagaaacata cttaaaggga agaggaaatg 3120
agactaccta ctggctgact gggatgaagg accagaaatt caacctgcca acccctccta 3180
ctgtggagaa tcaacagcgt ttgcaagcag aattttcaga catgattgcc aactctttac 3240
agaaaagaca ggcagcaggg ataagaagcc aaaaacccag acgggtagcc agctataaaa 3300
aaggcactct ggaatacttg cagctgaata ccacagacaa ggagagcacc tatttttaaa 3360
cctaaatgag gtataaggac tcacacaaat taaaatacag ctgcactgag gcagcgacct 3420
caagtgtcct gaaagcttac attttcctga gacctcaatg aagcagaaat gtacttaggc 3480
ttggctgccc tgtctggaac atggactttc ttgcatgaat cagatgtgtg ttctcagtga 3540
aataactacc ttccactctg gaaccttatt ccagcagttg ttccagggag cttctacctg 3600
gaaaagaaaa gaaatgaata gactatctag aacttgagaa gattttattc ttatttcatt 3660
tattttttgt ttgtttattt ttatcgtttt tgtttactgg ctttccttct gtattcataa 3720
gattttttaa attgtcataa ttatatttta aatacccatc ttcattaaag tatatttaac 3780
tcataatttt tgcagaaaat atgctatata ttaggcaaga ataaaagcta aaggtttccc 3840
aaaaaaaaaa 3850
<210> 168
<211> 1073
<212> PRT
<213> 智人
<400> 168
Met Lys Thr Leu Leu Leu Asp Leu Ala Leu Trp Ser Leu Leu Phe Gln
1 5 10 15
Pro Gly Trp Leu Ser Phe Ser Ser Gln Val Ser Gln Asn Cys His Asn
20 25 30
Gly Ser Tyr Glu Ile Ser Val Leu Met Met Gly Asn Ser Ala Phe Ala
35 40 45
Glu Pro Leu Lys Asn Leu Glu Asp Ala Val Asn Glu Gly Leu Glu Ile
50 55 60
Val Arg Gly Arg Leu Gln Asn Ala Gly Leu Asn Val Thr Val Asn Ala
65 70 75 80
Thr Phe Met Tyr Ser Asp Gly Leu Ile His Asn Ser Gly Asp Cys Arg
85 90 95
Ser Ser Thr Cys Glu Gly Leu Asp Leu Leu Arg Lys Ile Ser Asn Ala
100 105 110
Gln Arg Met Gly Cys Val Leu Ile Gly Pro Ser Cys Thr Tyr Ser Thr
115 120 125
Phe Gln Met Tyr Leu Asp Thr Glu Leu Ser Tyr Pro Met Ile Ser Ala
130 135 140
Gly Ser Phe Gly Leu Ser Cys Asp Tyr Lys Glu Thr Leu Thr Arg Leu
145 150 155 160
Met Ser Pro Ala Arg Lys Leu Met Tyr Phe Leu Val Asn Phe Trp Lys
165 170 175
Thr Asn Asp Leu Pro Phe Lys Thr Tyr Ser Trp Ser Thr Ser Tyr Val
180 185 190
Tyr Lys Asn Gly Thr Glu Thr Glu Asp Cys Phe Trp Tyr Leu Asn Ala
195 200 205
Leu Glu Ala Ser Val Ser Tyr Phe Ser His Glu Leu Gly Phe Lys Val
210 215 220
Val Leu Arg Gln Asp Lys Glu Phe Gln Asp Ile Leu Met Asp His Asn
225 230 235 240
Arg Lys Ser Asn Val Ile Ile Met Cys Gly Gly Pro Glu Phe Leu Tyr
245 250 255
Lys Leu Lys Gly Asp Arg Ala Val Ala Glu Asp Ile Val Ile Ile Leu
260 265 270
Val Asp Leu Phe Asn Asp Gln Tyr Phe Glu Asp Asn Val Thr Ala Pro
275 280 285
Asp Tyr Met Lys Asn Val Leu Val Leu Thr Leu Ser Pro Gly Asn Ser
290 295 300
Leu Leu Asn Ser Ser Phe Ser Arg Asn Leu Ser Pro Thr Lys Arg Asp
305 310 315 320
Phe Ala Leu Ala Tyr Leu Asn Gly Ile Leu Leu Phe Gly His Met Leu
325 330 335
Lys Ile Phe Leu Glu Asn Gly Glu Asn Ile Thr Thr Pro Lys Phe Ala
340 345 350
His Ala Phe Arg Asn Leu Thr Phe Glu Gly Tyr Asp Gly Pro Val Thr
355 360 365
Leu Asp Asp Trp Gly Asp Val Asp Ser Thr Met Val Leu Leu Tyr Thr
370 375 380
Ser Val Asp Thr Lys Lys Tyr Lys Val Leu Leu Thr Tyr Asp Thr His
385 390 395 400
Val Asn Lys Thr Tyr Pro Val Asp Met Ser Pro Thr Phe Thr Trp Lys
405 410 415
Asn Ser Lys Leu Pro Asn Asp Ile Thr Gly Arg Gly Pro Gln Ile Leu
420 425 430
Met Ile Ala Val Phe Thr Leu Thr Gly Ala Val Val Leu Leu Leu Leu
435 440 445
Val Ala Leu Leu Met Leu Arg Lys Tyr Arg Lys Asp Tyr Glu Leu Arg
450 455 460
Gln Lys Lys Trp Ser His Ile Pro Pro Glu Asn Ile Phe Pro Leu Glu
465 470 475 480
Thr Asn Glu Thr Asn His Val Ser Leu Lys Ile Asp Asp Asp Lys Arg
485 490 495
Arg Asp Thr Ile Gln Arg Leu Arg Gln Cys Lys Tyr Asp Lys Lys Arg
500 505 510
Val Ile Leu Lys Asp Leu Lys His Asn Asp Gly Asn Phe Thr Glu Lys
515 520 525
Gln Lys Ile Glu Leu Asn Lys Leu Leu Gln Ile Asp Tyr Tyr Asn Leu
530 535 540
Thr Lys Phe Tyr Gly Thr Val Lys Leu Asp Thr Met Ile Phe Gly Val
545 550 555 560
Ile Glu Tyr Cys Glu Arg Gly Ser Leu Arg Glu Val Leu Asn Asp Thr
565 570 575
Ile Ser Tyr Pro Asp Gly Thr Phe Met Asp Trp Glu Phe Lys Ile Ser
580 585 590
Val Leu Tyr Asp Ile Ala Lys Gly Met Ser Tyr Leu His Ser Ser Lys
595 600 605
Thr Glu Val His Gly Arg Leu Lys Ser Thr Asn Cys Val Val Asp Ser
610 615 620
Arg Met Val Val Lys Ile Thr Asp Phe Gly Cys Asn Ser Ile Leu Pro
625 630 635 640
Pro Lys Lys Asp Leu Trp Thr Ala Pro Glu His Leu Arg Gln Ala Asn
645 650 655
Ile Ser Gln Lys Gly Asp Val Tyr Ser Tyr Gly Ile Ile Ala Gln Glu
660 665 670
Ile Ile Leu Arg Lys Glu Thr Phe Tyr Thr Leu Ser Cys Arg Asp Arg
675 680 685
Asn Glu Lys Ile Phe Arg Val Glu Asn Ser Asn Gly Met Lys Pro Phe
690 695 700
Arg Pro Asp Leu Phe Leu Glu Thr Ala Glu Glu Lys Glu Leu Glu Val
705 710 715 720
Tyr Leu Leu Val Lys Asn Cys Trp Glu Glu Asp Pro Glu Lys Arg Pro
725 730 735
Asp Phe Lys Lys Ile Glu Thr Thr Leu Ala Lys Ile Phe Gly Leu Phe
740 745 750
His Asp Gln Lys Asn Glu Ser Tyr Met Asp Thr Leu Ile Arg Arg Leu
755 760 765
Gln Leu Tyr Ser Arg Asn Leu Glu His Leu Val Glu Glu Arg Thr Gln
770 775 780
Leu Tyr Lys Ala Glu Arg Asp Arg Ala Asp Arg Leu Asn Phe Met Leu
785 790 795 800
Leu Pro Arg Leu Val Val Lys Ser Leu Lys Glu Lys Gly Phe Val Glu
805 810 815
Pro Glu Leu Tyr Glu Glu Val Thr Ile Tyr Phe Ser Asp Ile Val Gly
820 825 830
Phe Thr Thr Ile Cys Lys Tyr Ser Thr Pro Met Glu Val Val Asp Met
835 840 845
Leu Asn Asp Ile Tyr Lys Ser Phe Asp His Ile Val Asp His His Asp
850 855 860
Val Tyr Lys Val Glu Thr Ile Gly Asp Ala Tyr Met Val Ala Ser Gly
865 870 875 880
Leu Pro Lys Arg Asn Gly Asn Arg His Ala Ile Asp Ile Ala Lys Met
885 890 895
Ala Leu Glu Ile Leu Ser Phe Met Gly Thr Phe Glu Leu Glu His Leu
900 905 910
Pro Gly Leu Pro Ile Trp Ile Arg Ile Gly Val His Ser Gly Pro Cys
915 920 925
Ala Ala Gly Val Val Gly Ile Lys Met Pro Arg Tyr Cys Leu Phe Gly
930 935 940
Asp Thr Val Asn Thr Ala Ser Arg Met Glu Ser Thr Gly Leu Pro Leu
945 950 955 960
Arg Ile His Val Ser Gly Ser Thr Ile Ala Ile Leu Lys Arg Thr Glu
965 970 975
Cys Gln Phe Leu Tyr Glu Val Arg Gly Glu Thr Tyr Leu Lys Gly Arg
980 985 990
Gly Asn Glu Thr Thr Tyr Trp Leu Thr Gly Met Lys Asp Gln Lys Phe
995 1000 1005
Asn Leu Pro Thr Pro Pro Thr Val Glu Asn Gln Gln Arg Leu Gln
1010 1015 1020
Ala Glu Phe Ser Asp Met Ile Ala Asn Ser Leu Gln Lys Arg Gln
1025 1030 1035
Ala Ala Gly Ile Arg Ser Gln Lys Pro Arg Arg Val Ala Ser Tyr
1040 1045 1050
Lys Lys Gly Thr Leu Glu Tyr Leu Gln Leu Asn Thr Thr Asp Lys
1055 1060 1065
Glu Ser Thr Tyr Phe
1070
<210> 169
<211> 2744
<212> DNA
<213> 智人
<400> 169
ctcgtgccga attcggcacg agaccgcgtg ttcgcgcctg gtagagattt ctcgaagaca 60
ccagtgggcc cgtgtggaac caaacctgcg cgcgtggccg ggccgtggga caacgaggcc 120
gcggagacga aggcgcaatg gcgaggaagt tatctgtaat cttgatcctg acctttgccc 180
tctctgtcac aaatcccctt catgaactaa aagcagctgc tttcccccag accactgaga 240
aaattagtcc gaattgggaa tctggcatta atgttgactt ggcaatttcc acacggcaat 300
atcatctaca acagcttttc taccgctatg gagaaaataa ttctttgtca gttgaagggt 360
tcagaaaatt acttcaaaat ataggcatag ataagattaa aagaatccat atacaccatg 420
accacgacca tcactcagac cacgagcatc actcagacca tgagcgtcac tcagaccatg 480
agcatcactc agaccacgag catcactctg accataatca tgctgcttct ggtaaaaata 540
agcgaaaagc tctttgccca gaccatgact cagatagttc aggtaaagat cctagaaaca 600
gccaggggaa aggagctcac cgaccagaac atgccagtgg tagaaggaat gtcaaggaca 660
gtgttagtgc tagtgaagtg acctcaactg tgtacaacac tgtctctgaa ggaactcact 720
ttctagagac aatagagact ccaagacctg gaaaactctt ccccaaagat gtaagcagct 780
ccactccacc cagtgtcaca tcaaagagcc gggtgagccg gctggctggt aggaaaacaa 840
atgaatctgt gagtgagccc cgaaaaggct ttatgtattc cagaaacaca aatgaaaatc 900
ctcaggagtg tttcaatgca tcaaagctac tgacatctca tggcatgggc atccaggttc 960
cgctgaatgc aacagagttc aactatctct gtccagccat catcaaccaa attgatgcta 1020
gatcttgtct gattcataca agtgaaaaga aggctgaaat ccctccaaag acctattcat 1080
tacaaatagc ctgggttggt ggttttatag ccatttccat catcagtttc ctgtctctgc 1140
tgggggttat cttagtgcct ctcatgaatc gggtgttttt caaatttctc ctgagtttcc 1200
ttgtggcact ggccgttggg actttgagtg gtgatgcttt tttacacctt cttccacatt 1260
ctcatgcaag tcaccaccat agtcatagcc atgaagaacc agcaatggaa atgaaaagag 1320
gaccactttt cagtcatctg tcttctcaaa acatagaaga aagtgcctat tttgattcca 1380
cgtggaaggg tctaacagct ctaggaggcc tgtatttcat gtttcttgtt gaacatgtcc 1440
tcacattgat caaacaattt aaagataaga agaaaaagaa tcagaagaaa cctgaaaatg 1500
atgatgatgt ggagattaag aagcagttgt ccaagtatga atctcaactt tcaacaaatg 1560
aggagaaagt agatacagat gatcgaactg aaggctattt acgagcagac tcacaagagc 1620
cctcccactt tgattctcag cagcctgcag tcttggaaga agaagaggtc atgatagctc 1680
atgctcatcc acaggaagtc tacaatgaat atgtacccag agggtgcaag aataaatgcc 1740
attcacattt ccacgataca ctcggccagt cagacgatct cattcaccac catcatgact 1800
accatcatat tctccatcat caccaccacc aaaaccacca tcctcacagt cacagccagc 1860
gctactctcg ggaggagctg aaagatgccg gcgtcgccac tttggcctgg atggtgataa 1920
tgggtgatgg cctgcacaat ttcagcgatg gcctagcaat tggtgctgct tttactgaag 1980
gcttatcaag tggtttaagt acttctgttg ctgtgttctg tcatgagttg cctcatgaat 2040
taggtgactt tgctgttcta ctaaaggctg gcatgaccgt taagcaggct gtcctttata 2100
atgcattgtc agccatgctg gcgtatcttg gaatggcaac aggaattttc attggtcatt 2160
atgctgaaaa tgtttctatg tggatatttg cacttactgc tggcttattc atgtatgttg 2220
ctctggttga tatggtacct gaaatgctgc acaatgatgc tagtgaccat ggatgtagcc 2280
gctgggggta tttcttttta cagaatgctg ggatgctttt gggttttgga attatgttac 2340
ttatttccat atttgaacat aaaatcgtgt ttcgtataaa tttctagtta aggtttaaat 2400
gctagagtag cttaaaaagt tgtcatagtt tcagtaggtc atagggagat gagtttgtat 2460
gctgtactat gcagcgttta aagttagtgg gttttgtgat ttttgtattg aatattgctg 2520
tctgttacaa agtcagttaa aggtacgttt taatatttaa gttattctat cttggagata 2580
aaatctgtat gtgcaattca ccggtattac cagtttatta tgtaaacaag agatttggca 2640
tgacatgttc tgtatgtttc agggaaaaat gtctttaatg ctttttcaag aactaacaca 2700
gttattccta tactggattt taggtctctg aagaactgct ggtg 2744
<210> 170
<211> 749
<212> PRT
<213> 智人
<400> 170
Met Ala Arg Lys Leu Ser Val Ile Leu Ile Leu Thr Phe Ala Leu Ser
1 5 10 15
Val Thr Asn Pro Leu His Glu Leu Lys Ala Ala Ala Phe Pro Gln Thr
20 25 30
Thr Glu Lys Ile Ser Pro Asn Trp Glu Ser Gly Ile Asn Val Asp Leu
35 40 45
Ala Ile Ser Thr Arg Gln Tyr His Leu Gln Gln Leu Phe Tyr Arg Tyr
50 55 60
Gly Glu Asn Asn Ser Leu Ser Val Glu Gly Phe Arg Lys Leu Leu Gln
65 70 75 80
Asn Ile Gly Ile Asp Lys Ile Lys Arg Ile His Ile His His Asp His
85 90 95
Asp His His Ser Asp His Glu His His Ser Asp His Glu Arg His Ser
100 105 110
Asp His Glu His His Ser Asp His Glu His His Ser Asp His Asn His
115 120 125
Ala Ala Ser Gly Lys Asn Lys Arg Lys Ala Leu Cys Pro Asp His Asp
130 135 140
Ser Asp Ser Ser Gly Lys Asp Pro Arg Asn Ser Gln Gly Lys Gly Ala
145 150 155 160
His Arg Pro Glu His Ala Ser Gly Arg Arg Asn Val Lys Asp Ser Val
165 170 175
Ser Ala Ser Glu Val Thr Ser Thr Val Tyr Asn Thr Val Ser Glu Gly
180 185 190
Thr His Phe Leu Glu Thr Ile Glu Thr Pro Arg Pro Gly Lys Leu Phe
195 200 205
Pro Lys Asp Val Ser Ser Ser Thr Pro Pro Ser Val Thr Ser Lys Ser
210 215 220
Arg Val Ser Arg Leu Ala Gly Arg Lys Thr Asn Glu Ser Val Ser Glu
225 230 235 240
Pro Arg Lys Gly Phe Met Tyr Ser Arg Asn Thr Asn Glu Asn Pro Gln
245 250 255
Glu Cys Phe Asn Ala Ser Lys Leu Leu Thr Ser His Gly Met Gly Ile
260 265 270
Gln Val Pro Leu Asn Ala Thr Glu Phe Asn Tyr Leu Cys Pro Ala Ile
275 280 285
Ile Asn Gln Ile Asp Ala Arg Ser Cys Leu Ile His Thr Ser Glu Lys
290 295 300
Lys Ala Glu Ile Pro Pro Lys Thr Tyr Ser Leu Gln Ile Ala Trp Val
305 310 315 320
Gly Gly Phe Ile Ala Ile Ser Ile Ile Ser Phe Leu Ser Leu Leu Gly
325 330 335
Val Ile Leu Val Pro Leu Met Asn Arg Val Phe Phe Lys Phe Leu Leu
340 345 350
Ser Phe Leu Val Ala Leu Ala Val Gly Thr Leu Ser Gly Asp Ala Phe
355 360 365
Leu His Leu Leu Pro His Ser His Ala Ser His His His Ser His Ser
370 375 380
His Glu Glu Pro Ala Met Glu Met Lys Arg Gly Pro Leu Phe Ser His
385 390 395 400
Leu Ser Ser Gln Asn Ile Glu Glu Ser Ala Tyr Phe Asp Ser Thr Trp
405 410 415
Lys Gly Leu Thr Ala Leu Gly Gly Leu Tyr Phe Met Phe Leu Val Glu
420 425 430
His Val Leu Thr Leu Ile Lys Gln Phe Lys Asp Lys Lys Lys Lys Asn
435 440 445
Gln Lys Lys Pro Glu Asn Asp Asp Asp Val Glu Ile Lys Lys Gln Leu
450 455 460
Ser Lys Tyr Glu Ser Gln Leu Ser Thr Asn Glu Glu Lys Val Asp Thr
465 470 475 480
Asp Asp Arg Thr Glu Gly Tyr Leu Arg Ala Asp Ser Gln Glu Pro Ser
485 490 495
His Phe Asp Ser Gln Gln Pro Ala Val Leu Glu Glu Glu Glu Val Met
500 505 510
Ile Ala His Ala His Pro Gln Glu Val Tyr Asn Glu Tyr Val Pro Arg
515 520 525
Gly Cys Lys Asn Lys Cys His Ser His Phe His Asp Thr Leu Gly Gln
530 535 540
Ser Asp Asp Leu Ile His His His His Asp Tyr His His Ile Leu His
545 550 555 560
His His His His Gln Asn His His Pro His Ser His Ser Gln Arg Tyr
565 570 575
Ser Arg Glu Glu Leu Lys Asp Ala Gly Val Ala Thr Leu Ala Trp Met
580 585 590
Val Ile Met Gly Asp Gly Leu His Asn Phe Ser Asp Gly Leu Ala Ile
595 600 605
Gly Ala Ala Phe Thr Glu Gly Leu Ser Ser Gly Leu Ser Thr Ser Val
610 615 620
Ala Val Phe Cys His Glu Leu Pro His Glu Leu Gly Asp Phe Ala Val
625 630 635 640
Leu Leu Lys Ala Gly Met Thr Val Lys Gln Ala Val Leu Tyr Asn Ala
645 650 655
Leu Ser Ala Met Leu Ala Tyr Leu Gly Met Ala Thr Gly Ile Phe Ile
660 665 670
Gly His Tyr Ala Glu Asn Val Ser Met Trp Ile Phe Ala Leu Thr Ala
675 680 685
Gly Leu Phe Met Tyr Val Ala Leu Val Asp Met Val Pro Glu Met Leu
690 695 700
His Asn Asp Ala Ser Asp His Gly Cys Ser Arg Trp Gly Tyr Phe Phe
705 710 715 720
Leu Gln Asn Ala Gly Met Leu Leu Gly Phe Gly Ile Met Leu Leu Ile
725 730 735
Ser Ile Phe Glu His Lys Ile Val Phe Arg Ile Asn Phe
740 745
<210> 171
<211> 5551
<212> DNA
<213> 智人
<220>
<221> misc_feature
<222> (1769)..(1769)
<223> n 是 a, c, g, 或 t
<400> 171
ttttcatgga aatatcttca aatttcatta gtggtcctat tatagaagac acacagcagg 60
caagtgtcag agctagaaat gaccctaaat ttctgggtcc ctattcagca gccataagca 120
attgccagct gcaggaaggt ggaggcagaa gaagtaaaaa gggaacaaga ggtgaggttt 180
acgttctgcc taagtggaac agagtagaac tttgcacaaa ctccttaatc ttgcctagag 240
gcagaatgtg ggatgtacta tatgggaggc aggcaaatct gggcctggtt cccacctcca 300
tcactcattt gaacaagtta tgtaatttat ttttgcgtca attttcttgt ctgtaaaact 360
ggagtgagag gacctaactt gagtaccgaa gtgaagagtc aatgagataa tatctgtgat 420
cagcacagtg cttggcaagt ggaagacact cagtcaagct aacttaaagc cccagttttt 480
tcatttgtac aacagaacaa caacaataat agcaacaagc aataatacta acaatattta 540
tcttgcctct gagttgtctt gaaaatcaga tgagtatata tattatgttt ctaaattcta 600
aagtgtatct cactctccta acaaagcagt tggtcttcag atcaggacca gagttaatac 660
catgccagag cattattaca ggttttttcc aatgagtaat taattttatt tttaaactta 720
tcatacagta aaattgactc ttttggttta caaagttcta tgagttatag cacgttgtat 780
aggtttatgt aaccaccacc acaatccgga tacagaacat gactacagct ttagagattc 840
tttttcaaag tgttttacat tcatgttgca aaacactttt aaaaataaga aaagagatgt 900
agataaacgt gctgaagatg caaattaatg aaaccagctt ttgtgcacaa atctctgcag 960
tttccagtgt tttcagcttt tgtctgagcc atttttgctg tcggctttgc agactatctc 1020
tttctataaa agtaatcagg cgtccgttgg agagttgggt ctttcttcac attaaaggca 1080
catgtcccag gcacaagcct caggatcctt atttgatctt ttttaatacg accaggtctt 1140
aaatttccga accatgtttt ccctaattgc cattttctct ctaaaatgaa gaagcctgga 1200
agaaccttct tatcccttcg gggaaattat ttaattggga caaaggggat gtggagttac 1260
agagagcaac gatagggctt tcaggaaccc aaaacctgtc tggtataata acagatcctt 1320
ccggtaacca accattcaat ttgtccctct cccccaaccc catcctctcc atcaccatct 1380
gccagatggc cccctgggac caatcgactt tgaaaccaaa cttctctccc aggtcagctc 1440
ctgcaatccc cactcctcgt cccttggagc tttctccaca gaagtgactt gacacaccgc 1500
gctacatctg gggagggggg cgggtccgct ttccgggaga ctcaagtctt tgctatttgc 1560
cttttgtttc tgggatgcct ttggctgcct ttctcttcca tctgctttta attttggtaa 1620
gagaaaagct aatgaatcag gaatgaatct cctctccact aatctttwtg ttttctttct 1680
ctctctctct ctcttagagt tggcgcgctc tcttcacgag ttccatcact acccaagaag 1740
tgagggggcg ggggggacag gcgggaagnc attaattaca cctaaatctg aagtttgcgg 1800
cctcaagtac cactttgatg gggagagctt ctgaagcttc catacaacct gccgtgcctt 1860
tgctgctgcc gcagtagatt ttcagctcat ttctttctgc tgctgcttcg cgctgctctg 1920
tcatgcctaa atctatctcg tcactgccaa aggtgcctga atcagggtga attccacgag 1980
attcaccagc ggttttgctc cagtccaagg taaatacagt atgcaaaatg aggaaccacc 2040
caaggatgtc ggggaggggg gagagaaagg gattccctca ccttttccgg cacattcctc 2100
gttaatttcc accaggagga ggcgcgcagc ccagctcccc tagtctctct tcttaaatcc 2160
ccctacctac ggctgccgag gttggccgcg cgtcgggagt ctccccgaca gtcctggccc 2220
tcccccgccc cccgggttgg tttttcccag ctgcggaggt tgggaggttg ggccaggggc 2280
tggggtgcga agagagtcgg cgcccgcaac gcggagccgg gaagtcgtcg ctactctggt 2340
ggaactcaga gttggttctg gaggcggcgg acgcggaggt gagcagtggg agcccgcggc 2400
ctgggagaga cccgggaggc gtcgttgggg cccctcccca tcctcgggtg gagagtaggg 2460
ttggttcggg ttgccactgt accccgagtc ccactgctgc tttgttccca gactctcccc 2520
tcctctttgg agatgacttg aacccctttg agatccgagg ggctggcggg gcgcgggctg 2580
gggggcgaca gttgtagctc cacttcctgc taatgcgaga aggtacaaaa agagatcaaa 2640
aagtcctgta acctgaaact tccctcgcat cctaaccggg accgggtgga gaggtctggg 2700
cgggggcgga gacacccgga ggccgacctt ccgccgaggc ggtgcaggaa ggaacgggag 2760
agggagaagt ttggtaggga aggaattggg gattaagcgg taagttagac gcgggagacg 2820
atctcctcgg gaaaggcgga gggcgggagg ccggtcgggt ttatttagtg tgggccaggg 2880
aagaaggaag actttgcggt ctgggtgtcg gatgcgcgtc cccctccgga gtaaaagtga 2940
ccggaggggt tggggaggcg aggccggggc gggcttttgg aaggaggtct ctgggacaga 3000
ctggagacga ctagactcga aaaggccggt ttttgcactc cggaagccgc ggcagccacc 3060
gctgttcacg cctctctcct gcttgtccca ggtccctcag aggatccagc gaggggtgcc 3120
aacaagaggc gaagaggtgg caccagggcg gcggcaggaa gaggagcggg agcaggagcg 3180
cggagcggag cgtcccgacc cgccgtgcgt actttctgga gggaaggggc gggggaatcg 3240
gcccctgagg gaagcgcccg gtggcgaggg ggttagccaa gttccggctg cggcgccact 3300
ccctcggttc cacgagagga aagttttttt tttccagacg cttccgccgg ctcgcgccct 3360
ccgggcccag cctcccgagc cttcggagcg ggcgccgtcc cagcccagct ccggggaaac 3420
gcgagccgcg atgcctgggg ggtgctcccg gggccccgcc gccggggacg ggcgtctgcg 3480
gctggcgcga ctagcgctgg tactcctggg ctgggtctcc tcgtcttctc ccacctcctc 3540
ggcatcctcc ttctcctcct cggcgccgtt cctggcttcc gccgtgtccg cccagccccc 3600
gctgccggac cagtgccccg cgctgtgcga gtgctccgag gcagcgcgca cagtcaagtg 3660
cgttaaccgc aatctgaccg aggtgcccac ggacctgccc gcctacgtgc gcaacctctt 3720
ccttaccggc aaccagctgg ccgtgctccc tgccggcgcc ttcgcccgcc ggccgccgct 3780
ggcggagctg gccgcgctca acctcagcgg cagccgcctg gacgaggtgc gcgcgggcgc 3840
cttcgagcat ctgcccagcc tgcgccagct cgacctcagc cacaacccac tggccgacct 3900
cagtcccttc gctttctcgg gcagcaatgc cagcgtctcg gcccccagtc cccttgtgga 3960
actgatcctg aaccacatcg tgccccctga agatgagcgg cagaaccgga gcttcgaggg 4020
catggtggtg gcggccctgc tggcgggccg tgcactgcag gggctccgcc gcttggagct 4080
ggccagcaac cacttccttt acctgccgcg ggatgtgctg gcccaactgc ccagcctcag 4140
gcacctggac ttaagtaata attcgctggt gagcctgacc tacgtgtcct tccgcaacct 4200
gacacatcta gaaagcctcc acctggagga caatgccctc aaggtccttc acaatggcac 4260
cctggctgag ttgcaaggtc taccccacat tagggttttc ctggacaaca atccctgggt 4320
ctgcgactgc cacatggcag acatggtgac ctggctcaag gaaacagagg tagtgcaggg 4380
caaagaccgg ctcacctgtg catatccgga aaaaatgagg aatcgggtcc tcttggaact 4440
caacagtgct gacctggact gtgacccgat tcttccccca tccctgcaaa cctcttatgt 4500
cttcctgggt attgttttag ccctgatagg cgctattttc ctcctggttt tgtatttgaa 4560
ccgcaagggg ataaaaaagt ggatgcataa catcagagat gcctgcaggg atcacatgga 4620
agggtatcat tacagatatg aaatcaatgc ggaccccaga ttaacaaacc tcagttctaa 4680
ctcggatgtc tgagaaatat tagaggacag accaaggaca actctgcatg agatgtagac 4740
ttaagcttta tccctactag gcttgctcca ctttcatcct ccactataga tacaacggac 4800
tttgactaaa agcagtgaag gggatttgct tccttgttat gtaaagtttc tcggtgtgtt 4860
ctgttaatgt aagacgatga acagttgtgt atagtgtttt accctcttct ttttcttgga 4920
actcctcaac acgtatggag ggatttttca ggtttcagca tgaacatggg cttcttgctg 4980
tctgtctctc tctcagtaca gttcaaggtg tagcaagtgt acccacacag atagcattca 5040
acaaaagctg cctcaacttt ttcgagaaaa atactttatt cataaatatc agttttattc 5100
tcatgtacct aagttgtgga gaaaataatt gcatcctata aactgcctgc agacgttagc 5160
aggctcttca aaataactcc atggtgcaca ggagcacctg catccaagag catgcttaca 5220
ttttactgtt ctgcatatta caaaaaataa cttgcaactt cataacttct ttgacaaagt 5280
aaattacttt tttgattgca gtttatatga aaatgtactg attttttttt aataaactgc 5340
atcgagatcc aaccgactga attgttaaaa aaaaaaaaaa ataaacattg ttaaaacaat 5400
tacagtgtgt gcaagtttgc tttgaaaaaa ggatgaaggg caggagtatt caatggtctt 5460
ggttccgatg ataattatta cttaacatag ctgagaatca aactgtagca atgtctaata 5520
taaatagact tgggagatcg tttcgaaatg g 5551
<210> 172
<211> 420
<212> PRT
<213> 智人
<400> 172
Met Pro Gly Gly Cys Ser Arg Gly Pro Ala Ala Gly Asp Gly Arg Leu
1 5 10 15
Arg Leu Ala Arg Leu Ala Leu Val Leu Leu Gly Trp Val Ser Ser Ser
20 25 30
Ser Pro Thr Ser Ser Ala Ser Ser Phe Ser Ser Ser Ala Pro Phe Leu
35 40 45
Ala Ser Ala Val Ser Ala Gln Pro Pro Leu Pro Asp Gln Cys Pro Ala
50 55 60
Leu Cys Glu Cys Ser Glu Ala Ala Arg Thr Val Lys Cys Val Asn Arg
65 70 75 80
Asn Leu Thr Glu Val Pro Thr Asp Leu Pro Ala Tyr Val Arg Asn Leu
85 90 95
Phe Leu Thr Gly Asn Gln Leu Ala Val Leu Pro Ala Gly Ala Phe Ala
100 105 110
Arg Arg Pro Pro Leu Ala Glu Leu Ala Ala Leu Asn Leu Ser Gly Ser
115 120 125
Arg Leu Asp Glu Val Arg Ala Gly Ala Phe Glu His Leu Pro Ser Leu
130 135 140
Arg Gln Leu Asp Leu Ser His Asn Pro Leu Ala Asp Leu Ser Pro Phe
145 150 155 160
Ala Phe Ser Gly Ser Asn Ala Ser Val Ser Ala Pro Ser Pro Leu Val
165 170 175
Glu Leu Ile Leu Asn His Ile Val Pro Pro Glu Asp Glu Arg Gln Asn
180 185 190
Arg Ser Phe Glu Gly Met Val Val Ala Ala Leu Leu Ala Gly Arg Ala
195 200 205
Leu Gln Gly Leu Arg Arg Leu Glu Leu Ala Ser Asn His Phe Leu Tyr
210 215 220
Leu Pro Arg Asp Val Leu Ala Gln Leu Pro Ser Leu Arg His Leu Asp
225 230 235 240
Leu Ser Asn Asn Ser Leu Val Ser Leu Thr Tyr Val Ser Phe Arg Asn
245 250 255
Leu Thr His Leu Glu Ser Leu His Leu Glu Asp Asn Ala Leu Lys Val
260 265 270
Leu His Asn Gly Thr Leu Ala Glu Leu Gln Gly Leu Pro His Ile Arg
275 280 285
Val Phe Leu Asp Asn Asn Pro Trp Val Cys Asp Cys His Met Ala Asp
290 295 300
Met Val Thr Trp Leu Lys Glu Thr Glu Val Val Gln Gly Lys Asp Arg
305 310 315 320
Leu Thr Cys Ala Tyr Pro Glu Lys Met Arg Asn Arg Val Leu Leu Glu
325 330 335
Leu Asn Ser Ala Asp Leu Asp Cys Asp Pro Ile Leu Pro Pro Ser Leu
340 345 350
Gln Thr Ser Tyr Val Phe Leu Gly Ile Val Leu Ala Leu Ile Gly Ala
355 360 365
Ile Phe Leu Leu Val Leu Tyr Leu Asn Arg Lys Gly Ile Lys Lys Trp
370 375 380
Met His Asn Ile Arg Asp Ala Cys Arg Asp His Met Glu Gly Tyr His
385 390 395 400
Tyr Arg Tyr Glu Ile Asn Ala Asp Pro Arg Leu Thr Asn Leu Ser Ser
405 410 415
Asn Ser Asp Val
420
<210> 173
<211> 5977
<212> DNA
<213> 智人
<400> 173
tgtctctctt acctccttga tgttcggcac tatttgtggc cggcgtggtg gaaggacaca 60
gtgaggttct cacccccgcc ccccgctcct cgctcccatc ccagttccat caaaacgaac 120
ccgggccagc gcaaggatct ccgagttgcg agtgtgctga ggctgggact gtcactcatt 180
ctccgatcag cgcgtgaacg cagctcggct gccgctggca ggaaacaatt ctgcaaaaat 240
aatcatactc agcctggcaa ttgtctgccc ctaggtctgt cgctcagccg ccgtccacac 300
tcgctgcagg ggggggggca cagaatttac cgcggcaaga acatccctcc cagccagcag 360
attacaatgc tgcaaactaa ggatctcatc tggactttgt ttttcctggg aactgcagtt 420
tctctgcagg tggatattgt tcccagccag ggggagatca gcgttggaga gtccaaattc 480
ttcttatgcc aagtggcagg agatgccaaa gataaagaca tctcctggtt ctcccccaat 540
ggagaaaagc tcaccccaaa ccagcagcgg atctcagtgg tgtggaatga tgattcctcc 600
tccaccctca ccatctataa cgccaacatc gacgacgccg gcatttacaa gtgtgtggtt 660
acaggcgagg atggcagtga gtcagaggcc accgtcaacg tgaagatctt tcagaagctc 720
atgttcaaga atgcgccaac cccacaggag ttccgggagg gggaagatgc cgtgattgtg 780
tgtgatgtgg tcagctccct cccaccaacc atcatctgga aacacaaagg ccgagatgtc 840
atcctgaaaa aagatgtccg attcatagtc ctgtccaaca actacctgca gatccggggc 900
atcaagaaaa cagatgaggg cacttatcgc tgtgagggca gaatcctggc acggggggag 960
atcaacttca aggacattca ggtcattgtg aatgtgccac ctaccatcca ggccaggcag 1020
aatattgtga atgccaccgc caacctcggc cagtccgtca ccctggtgtg cgatgccgaa 1080
ggcttcccag agcccaccat gagctggaca aaggatgggg aacagataga gcaagaggaa 1140
gacgatgaga agtacatctt cagcgacgat agttcccagc tgaccatcaa aaaggtggat 1200
aagaacgacg aggctgagta catctgcatt gctgagaaca aggctggcga gcaggatgcg 1260
accatccacc tcaaagtctt tgcaaaaccc aaaatcacat atgtagagaa ccagactgcc 1320
atggaattag aggagcaggt cactcttacc tgtgaagcct ccggagaccc cattccctcc 1380
atcacctgga ggacttctac ccggaacatc agcagcgaag aaaagactct ggatgggcac 1440
atggtggtgc gtagccatgc ccgtgtgtcg tcgctgaccc tgaagagcat ccagtacact 1500
gatgccggag agtacatctg caccgccagc aacaccatcg gccaggactc ccagtccatg 1560
taccttgaag tgcaatatgc cccaaagcta cagggccctg tggctgtgta cacttgggag 1620
gggaaccagg tgaacatcac ctgcgaggta tttgcctatc ccagtgccac gatctcatgg 1680
tttcgggatg gccagctgct gccaagctcc aattacagca atatcaagat ctacaacacc 1740
ccctctgcca gctatctgga ggtgacccca gactctgaga atgattttgg gaactacaac 1800
tgtactgcag tgaaccgcat tgggcaggag tccttggaat tcatccttgt tcaagcagac 1860
accccctctt caccatccat cgaccaggtg gagccatact ccagcacagc ccaggtgcag 1920
tttgatgaac cagaggccac aggtggggtg cccatcctca aatacaaagc tgagtggaga 1980
gcagttggtg aagaagtatg gcattccaag tggtatgatg ccaaggaagc cagcatggag 2040
ggcatcgtca ccatcgtggg cctgaagccc gaaacaacgt acgccgtaag gctggcggcg 2100
ctcaatggca aagggctggg tgagatcagc gcggcctccg agttcaagac gcagccagtc 2160
caaggggaac ccagtgcacc taagctcgaa gggcagatgg gagaggatgg aaactctatt 2220
aaagtgaacc tgatcaagca ggatgacggc ggctccccca tcagacacta tctggtcagg 2280
taccgagcgc tctcctccga gtggaaacca gagatcaggc tcccgtctgg cagtgaccac 2340
gtcatgctga agtccctgga ctggaatgct gagtatgagg tctacgtggt ggctgagaac 2400
cagcaaggaa aatccaaggc ggctcatttt gtgttcagga cctcggccca gcccacagcc 2460
atcccagcca acggcagccc cacctcaggc ctgagcaccg gggccatcgt gggcatcctc 2520
atcgtcatct tcgtcctgct cctggtggtt gtggacatca cctgctactt cctgaacaag 2580
tgtggcctgt tcatgtgcat tgcggtcaac ctgtgtggaa aagccgggcc cggggccaag 2640
ggcaaggaca tggaggaggg caaggccgcc ttctcgaaag atgagtccaa ggagcccatc 2700
gtggaggttc gaacggagga ggagaggacc ccaaaccatg atggagggaa acacacagag 2760
cccaacgaga ccacgccact gacggagccc gagaagggcc ccgtagaagc aaagccagag 2820
tgccaggaga cagaaacgaa gccagcgcca gccgaagtca agacggtccc caatgacgcc 2880
acacagacaa aggagaacga gagcaaagca tgatgggtga agagaaccga gcaaagatca 2940
aaataaaaag tgacacagca gcttcaccag agcatttcca acaccacaga cacacacacg 3000
cacgcacaca cacaaacaca catgcacaca cacacatctc atttctctag tgtcttttgc 3060
ctttaaaaaa aactaaacag ataaaacatg ggaatctcct ttttgtaggt ttatagaaag 3120
ggtccctttg ttgcacactc acttgtaaga aaatgagaca aaaaggttaa acccacagcc 3180
aaactaggac actccgttcc ctgaaaccgt taaaaaatca aacaaaagga ccccaaatta 3240
agaatctagg aagctcagaa acgaaatcta ggttcaggaa gaccacactt ggtgttaccc 3300
gattggcaca gaccagtttc agagaaatac tttcaggcac taagactaat cgaatgaaca 3360
aagtccacag tttattttta tactttcagt caagtttgaa ctctgtaaaa cctcataaat 3420
aagttataat ttctgttcac tttgtatttg ttcagtatgc aaagtgtgtc accctttcta 3480
gctgaattca attcccacgt agactcttat tttataggac gaatgccaaa ttgcagcttc 3540
tgggggtaga tctcaatttg cagtattcag acttcttttt ctttctttta cattcttttt 3600
tctttctttc tttctgccaa ctttgttttc cagtgtttac aaggtgacaa atgtttgact 3660
ttggttgtgt ttaaatgtcc gtgtaaaata gctgcctttt attttttaag gtaacaaata 3720
ccacctagag gtaggtagga tcatcccacg cttgctttag cacaggacaa ctttacaaaa 3780
catgattgtt tacagctgct cttcccctct tttctgatct gcagtttttg cctgggtccc 3840
actcaggtga aaatccatct cattctggaa tggttttgct tttgaatttt tggttatttt 3900
tgtgtttctt tgggggttag accactttct gattagccgc cacctgcctg catctgtgaa 3960
aagggatctg ctcccaggcg ttctcaccct tcttttgaag gactccttag gctttgttga 4020
atgaagcaga gaagattgta tagttggggc tggtcttggt gaacacacat tattacccca 4080
cacatcccct ttgtgtagaa agccaaataa aatctataca taccatttcc ttttgagccc 4140
agaatctaga tttgagcgga agagcatgtg tgcttcaggg aattagtgtc tttttttgga 4200
aatctgttga agtaaagtaa catcggcctt ctgttcactt aggcagcatt tatagaaaca 4260
aaagaagaaa gaaacaacct actgtctgga gtcataacac aactttcctg gattggaaac 4320
caagtggggg aaaaaataca gaaactttaa gggggatggg agggggggga gaagggaaaa 4380
gccagccctt tgtatagaaa ttttgctttt ttttccctca ttctacttta gaactgcaag 4440
cttgtgcact gtggatgcgt gaatatttta gtgtgaaacg tgtttttgtc atagtattga 4500
aataaaactt caacatagtt tggttgtgga aggtatagca gatagttcag aaaaaatatt 4560
caggaaacaa aaatcactca aacggaatcg aagcctttta acaaagaaaa tgaaatacag 4620
atgatgatga tgatgatgaa gatgatgcta agtaaacaga aatcagtact ccgcatgcgc 4680
tcctctccta aggtacaaag cagcaagagg ttagggtggc aaggctgcct ctgggtccat 4740
tctgtgggcc actctcccca acgttctgac acttctgcag tctgatcagt ggcgatgcta 4800
gattataatt tcaaactgtg aagaataatg gtcttgtcat ttgctcaatg tggggttatg 4860
ttgcattttc tcagctcctg gggatggaaa tggaggatcc cagaacacac agccctggcc 4920
cctttgattc tagggcctgc acagatctct ggttcaaatg cacaggccct cagaatagag 4980
gaacatgaag agagatctta gagcacacag tagaatgtga gagcctgggt gtctgagacc 5040
gggagggccc agcagtgagg ggcaggctct tctggtcacc aggctgttca gtggactcag 5100
ttcttcatct tgtaatgtcg atggctttgc cacaccaggc caagcccatg ccataccttg 5160
tcaagactgt caaagtggtt gtggttaggt caaactggtt ttggttctga tggttaggaa 5220
gaaacaggtc agccctcaga tcacctggcc cgggacagct gaccccctag aaccctggct 5280
ctgccattag ctaggaccta agactctgcc cacattttgg tctgttctct cccattacac 5340
ataggtttgt ctcagcatgc aagagttttt cctttaaaaa aaaaaaaaaa aaaaaaaaaa 5400
aagcaatgct ttctctaaaa tcaaagaggg agtcatttta ttccaagatg ttttatcttt 5460
tatgttaaga gatcaaagct tataattttc ttttttaatt tttgaaggag ggatcaactc 5520
cagtttccaa tgtctatgtg tctatgtgtg tatgtgccat acatatgtat tcacatgaag 5580
accggcatgg ccaagttctg ctggaggagc actcaagtgt gacgagcagg gccactggac 5640
cctgcagggc tgtggtgtat atagtgcagc tttggaggtg gaactctatt ttcacacttt 5700
tctatggagc cttccgagtc ccaggttttc acttgaggct gtctgtctgg atggcggttt 5760
tcagacctcc attaacatcc ctacccagca ttctgtactt cgggggcctt ctctcttgtt 5820
ataaaacttt ttaccaagtg aaacatcgat accacctttg tttccattct cactggtgta 5880
aatactgagt actaactgag aattttgact ttgcattctg tcggaatact tgtgttcaat 5940
aaaaattgaa agaaaaaagc taaaaaaaaa aaaaaaa 5977
<210> 174
<211> 848
<212> PRT
<213> 智人
<400> 174
Met Leu Gln Thr Lys Asp Leu Ile Trp Thr Leu Phe Phe Leu Gly Thr
1 5 10 15
Ala Val Ser Leu Gln Val Asp Ile Val Pro Ser Gln Gly Glu Ile Ser
20 25 30
Val Gly Glu Ser Lys Phe Phe Leu Cys Gln Val Ala Gly Asp Ala Lys
35 40 45
Asp Lys Asp Ile Ser Trp Phe Ser Pro Asn Gly Glu Lys Leu Thr Pro
50 55 60
Asn Gln Gln Arg Ile Ser Val Val Trp Asn Asp Asp Ser Ser Ser Thr
65 70 75 80
Leu Thr Ile Tyr Asn Ala Asn Ile Asp Asp Ala Gly Ile Tyr Lys Cys
85 90 95
Val Val Thr Gly Glu Asp Gly Ser Glu Ser Glu Ala Thr Val Asn Val
100 105 110
Lys Ile Phe Gln Lys Leu Met Phe Lys Asn Ala Pro Thr Pro Gln Glu
115 120 125
Phe Arg Glu Gly Glu Asp Ala Val Ile Val Cys Asp Val Val Ser Ser
130 135 140
Leu Pro Pro Thr Ile Ile Trp Lys His Lys Gly Arg Asp Val Ile Leu
145 150 155 160
Lys Lys Asp Val Arg Phe Ile Val Leu Ser Asn Asn Tyr Leu Gln Ile
165 170 175
Arg Gly Ile Lys Lys Thr Asp Glu Gly Thr Tyr Arg Cys Glu Gly Arg
180 185 190
Ile Leu Ala Arg Gly Glu Ile Asn Phe Lys Asp Ile Gln Val Ile Val
195 200 205
Asn Val Pro Pro Thr Ile Gln Ala Arg Gln Asn Ile Val Asn Ala Thr
210 215 220
Ala Asn Leu Gly Gln Ser Val Thr Leu Val Cys Asp Ala Glu Gly Phe
225 230 235 240
Pro Glu Pro Thr Met Ser Trp Thr Lys Asp Gly Glu Gln Ile Glu Gln
245 250 255
Glu Glu Asp Asp Glu Lys Tyr Ile Phe Ser Asp Asp Ser Ser Gln Leu
260 265 270
Thr Ile Lys Lys Val Asp Lys Asn Asp Glu Ala Glu Tyr Ile Cys Ile
275 280 285
Ala Glu Asn Lys Ala Gly Glu Gln Asp Ala Thr Ile His Leu Lys Val
290 295 300
Phe Ala Lys Pro Lys Ile Thr Tyr Val Glu Asn Gln Thr Ala Met Glu
305 310 315 320
Leu Glu Glu Gln Val Thr Leu Thr Cys Glu Ala Ser Gly Asp Pro Ile
325 330 335
Pro Ser Ile Thr Trp Arg Thr Ser Thr Arg Asn Ile Ser Ser Glu Glu
340 345 350
Lys Thr Leu Asp Gly His Met Val Val Arg Ser His Ala Arg Val Ser
355 360 365
Ser Leu Thr Leu Lys Ser Ile Gln Tyr Thr Asp Ala Gly Glu Tyr Ile
370 375 380
Cys Thr Ala Ser Asn Thr Ile Gly Gln Asp Ser Gln Ser Met Tyr Leu
385 390 395 400
Glu Val Gln Tyr Ala Pro Lys Leu Gln Gly Pro Val Ala Val Tyr Thr
405 410 415
Trp Glu Gly Asn Gln Val Asn Ile Thr Cys Glu Val Phe Ala Tyr Pro
420 425 430
Ser Ala Thr Ile Ser Trp Phe Arg Asp Gly Gln Leu Leu Pro Ser Ser
435 440 445
Asn Tyr Ser Asn Ile Lys Ile Tyr Asn Thr Pro Ser Ala Ser Tyr Leu
450 455 460
Glu Val Thr Pro Asp Ser Glu Asn Asp Phe Gly Asn Tyr Asn Cys Thr
465 470 475 480
Ala Val Asn Arg Ile Gly Gln Glu Ser Leu Glu Phe Ile Leu Val Gln
485 490 495
Ala Asp Thr Pro Ser Ser Pro Ser Ile Asp Gln Val Glu Pro Tyr Ser
500 505 510
Ser Thr Ala Gln Val Gln Phe Asp Glu Pro Glu Ala Thr Gly Gly Val
515 520 525
Pro Ile Leu Lys Tyr Lys Ala Glu Trp Arg Ala Val Gly Glu Glu Val
530 535 540
Trp His Ser Lys Trp Tyr Asp Ala Lys Glu Ala Ser Met Glu Gly Ile
545 550 555 560
Val Thr Ile Val Gly Leu Lys Pro Glu Thr Thr Tyr Ala Val Arg Leu
565 570 575
Ala Ala Leu Asn Gly Lys Gly Leu Gly Glu Ile Ser Ala Ala Ser Glu
580 585 590
Phe Lys Thr Gln Pro Val Gln Gly Glu Pro Ser Ala Pro Lys Leu Glu
595 600 605
Gly Gln Met Gly Glu Asp Gly Asn Ser Ile Lys Val Asn Leu Ile Lys
610 615 620
Gln Asp Asp Gly Gly Ser Pro Ile Arg His Tyr Leu Val Arg Tyr Arg
625 630 635 640
Ala Leu Ser Ser Glu Trp Lys Pro Glu Ile Arg Leu Pro Ser Gly Ser
645 650 655
Asp His Val Met Leu Lys Ser Leu Asp Trp Asn Ala Glu Tyr Glu Val
660 665 670
Tyr Val Val Ala Glu Asn Gln Gln Gly Lys Ser Lys Ala Ala His Phe
675 680 685
Val Phe Arg Thr Ser Ala Gln Pro Thr Ala Ile Pro Ala Asn Gly Ser
690 695 700
Pro Thr Ser Gly Leu Ser Thr Gly Ala Ile Val Gly Ile Leu Ile Val
705 710 715 720
Ile Phe Val Leu Leu Leu Val Val Val Asp Ile Thr Cys Tyr Phe Leu
725 730 735
Asn Lys Cys Gly Leu Phe Met Cys Ile Ala Val Asn Leu Cys Gly Lys
740 745 750
Ala Gly Pro Gly Ala Lys Gly Lys Asp Met Glu Glu Gly Lys Ala Ala
755 760 765
Phe Ser Lys Asp Glu Ser Lys Glu Pro Ile Val Glu Val Arg Thr Glu
770 775 780
Glu Glu Arg Thr Pro Asn His Asp Gly Gly Lys His Thr Glu Pro Asn
785 790 795 800
Glu Thr Thr Pro Leu Thr Glu Pro Glu Lys Gly Pro Val Glu Ala Lys
805 810 815
Pro Glu Cys Gln Glu Thr Glu Thr Lys Pro Ala Pro Ala Glu Val Lys
820 825 830
Thr Val Pro Asn Asp Ala Thr Gln Thr Lys Glu Asn Glu Ser Lys Ala
835 840 845
<210> 175
<211> 1108
<212> DNA
<213> 智人
<400> 175
ctggaggcct ggctggtgct cacatacaat aattaactgc tgagtggcct tcgcccaatc 60
ccaggctcca ctcctgggct ccattcccac tccctgcctg tctcctaggc cactaaacca 120
cagctgtccc ctggaataag gcaaggggga gtgtagagca gagcagaagc ctgagccaga 180
cggagagcca cctcctctcc cagggacaga catggctcag cggatgacaa cacagctgct 240
gctccttcta gtgtgggtgg ctgtagtagg ggaggctcag acaaggattg catgggccag 300
gactgagctt ctcaatgtct gcatgaacgc caagcaccac aaggaaaagc caggccccga 360
ggacaagttg catgagcagt gtcgaccctg gaggaagaat gcctgctgtt ctaccaacac 420
cagccaggaa gcccataagg atgtttccta cctatataga ttcaactgga accactgtgg 480
agagatggca cctgcctgca aacggcattt catccaggac acctgcctct acgagtgctc 540
ccccaacttg gggccctgga tccagcaggt ggatcagagc tggcgcaaag agcgggtact 600
gaacgtgccc ctgtgcaaag aggactgtga gcaatggtgg gaagattgtc gcacctccta 660
cacctgcaag agcaactggc acaagggctg gaactggact tcagggttta acaagtgcgc 720
agtgggagct gcctgccaac ctttccattt ctacttcccc acacccactg ttctgtgcaa 780
tgaaatctgg actcactcct acaaggtcag caactacagc cgagggagtg gccgctgcat 840
ccagatgtgg ttcgacccag cccagggcaa ccccaatgag gaggtggcga ggttctatgc 900
tgcagccatg agtggggctg ggccctgggc agcctggcct ttcctgctta gcctggccct 960
aatgctgctg tggctgctca gctgacctcc ttttaccttc tgatacctgg aaatccctgc 1020
cctgttcagc cccacagctc ccaactattt ggttcctgct ccatggtcgg gcctctgaca 1080
gccactttga ataaaccaga caccgcac 1108
<210> 176
<211> 257
<212> PRT
<213> 智人
<400> 176
Met Ala Gln Arg Met Thr Thr Gln Leu Leu Leu Leu Leu Val Trp Val
1 5 10 15
Ala Val Val Gly Glu Ala Gln Thr Arg Ile Ala Trp Ala Arg Thr Glu
20 25 30
Leu Leu Asn Val Cys Met Asn Ala Lys His His Lys Glu Lys Pro Gly
35 40 45
Pro Glu Asp Lys Leu His Glu Gln Cys Arg Pro Trp Arg Lys Asn Ala
50 55 60
Cys Cys Ser Thr Asn Thr Ser Gln Glu Ala His Lys Asp Val Ser Tyr
65 70 75 80
Leu Tyr Arg Phe Asn Trp Asn His Cys Gly Glu Met Ala Pro Ala Cys
85 90 95
Lys Arg His Phe Ile Gln Asp Thr Cys Leu Tyr Glu Cys Ser Pro Asn
100 105 110
Leu Gly Pro Trp Ile Gln Gln Val Asp Gln Ser Trp Arg Lys Glu Arg
115 120 125
Val Leu Asn Val Pro Leu Cys Lys Glu Asp Cys Glu Gln Trp Trp Glu
130 135 140
Asp Cys Arg Thr Ser Tyr Thr Cys Lys Ser Asn Trp His Lys Gly Trp
145 150 155 160
Asn Trp Thr Ser Gly Phe Asn Lys Cys Ala Val Gly Ala Ala Cys Gln
165 170 175
Pro Phe His Phe Tyr Phe Pro Thr Pro Thr Val Leu Cys Asn Glu Ile
180 185 190
Trp Thr His Ser Tyr Lys Val Ser Asn Tyr Ser Arg Gly Ser Gly Arg
195 200 205
Cys Ile Gln Met Trp Phe Asp Pro Ala Gln Gly Asn Pro Asn Glu Glu
210 215 220
Val Ala Arg Phe Tyr Ala Ala Ala Met Ser Gly Ala Gly Pro Trp Ala
225 230 235 240
Ala Trp Pro Phe Leu Leu Ser Leu Ala Leu Met Leu Leu Trp Leu Leu
245 250 255
Ser
<210> 177
<211> 2669
<212> DNA
<213> 智人
<400> 177
cagatgccag aagaacactg ttgctcttgg tggacgggcc cagaggaatt cagagttaaa 60
ccttgagtgc ctgcgtccgt gagaattcag catggaatgt ctctactatt tcctgggatt 120
tctgctcctg gctgcaagat tgccacttga tgccgccaaa cgatttcatg atgtgctggg 180
caatgaaaga ccttctgctt acatgaggga gcacaatcaa ttaaatggct ggtcttctga 240
tgaaaatgac tggaatgaaa aactctaccc agtgtggaag cggggagaca tgaggtggaa 300
aaactcctgg aagggaggcc gtgtgcaggc ggtcctgacc agtgactcac cagccctcgt 360
gggctcaaat ataacatttg cggtgaacct gatattccct agatgccaaa aggaagatgc 420
caatggcaac atagtctatg agaagaactg cagaaatgag gctggtttat ctgctgatcc 480
atatgtttac aactggacag catggtcaga ggacagtgac ggggaaaatg gcaccggcca 540
aagccatcat aacgtcttcc ctgatgggaa accttttcct caccaccccg gatggagaag 600
atggaatttc atctacgtct tccacacact tggtcagtat ttccagaaat tgggacgatg 660
ttcagtgaga gtttctgtga acacagccaa tgtgacactt gggcctcaac tcatggaagt 720
gactgtctac agaagacatg gacgggcata tgttcccatc gcacaagtga aagatgtgta 780
cgtggtaaca gatcagattc ctgtgtttgt gactatgttc cagaagaacg atcgaaattc 840
atccgacgaa accttcctca aagatctccc cattatgttt gatgtcctga ttcatgatcc 900
tagccacttc ctcaattatt ctaccattaa ctacaagtgg agcttcgggg ataatactgg 960
cctgtttgtt tccaccaatc atactgtgaa tcacacgtat gtgctcaatg gaaccttcag 1020
ccttaacctc actgtgaaag ctgcagcacc aggaccttgt ccgccaccgc caccaccacc 1080
cagaccttca aaacccaccc cttctttagg acctgctggt gacaaccccc tggagctgag 1140
taggattcct gatgaaaact gccagattaa cagatatggc cactttcaag ccaccatcac 1200
aattgtagag ggaatcttag aggttaacat catccagatg acagacgtcc tgatgccggt 1260
gccatggcct gaaagctccc taatagactt tgtcgtgacc tgccaaggga gcattcccac 1320
ggaggtctgt accatcattt ctgaccccac ctgcgagatc acccagaaca cagtctgcag 1380
ccctgtggat gtggatgaga tgtgtctgct gactgtgaga cgaaccttca atgggtctgg 1440
gacgtactgt gtgaacctca ccctggggga tgacacaagc ctggctctca cgagcaccct 1500
gatttctgtt cctgacagag acccagcctc gcctttaagg atggcaaaca gtgccctgat 1560
ctccgttggc tgcttggcca tatttgtcac tgtgatctcc ctcttggtgt acaaaaaaca 1620
caaggaatac aacccaatag aaaatagtcc tgggaatgtg gtcagaagca aaggcctgag 1680
tgtctttctc aaccgtgcaa aagccgtgtt cttcccggga aaccaggaaa aggatccgct 1740
actcaaaaac caagaattta aaggagtttc ttaaatttcg accttgtttc tgaagctcac 1800
ttttcagtgc cattgatgtg agatgtgctg gagtggctat taaccttttt ttcctaaaga 1860
ttattgttaa atagatattg tggtttgggg aagttgaatt ttttataggt taaatgtcat 1920
tttagagatg gggagaggga ttatactgca ggcagcttca gccatgttgt gaaactgata 1980
aaagcaactt agcaaggctt cttttcatta ttttttatgt ttcacttata aagtcttagg 2040
taactagtag gatagaaaca ctgtgtcccg agagtaagga gagaagctac tattgattag 2100
agcctaaccc aggttaactg caagaagagg cgggatactt tcagctttcc atgtaactgt 2160
atgcataaag ccaatgtagt ccagtttcta agatcatgtt ccaagctaac tgaatcccac 2220
ttcaatacac actcatgaac tcctgatgga acaataacag gcccaagcct gtggtatgat 2280
gtgcacactt gctagactca gaaaaaatac tactctcata aatgggtggg agtattttgg 2340
tgacaaccta ctttgcttgg ctgagtgaag gaatgatatt catatattca tttattccat 2400
ggacatttag ttagtgcttt ttatatacca ggcatgatgc tgagtgacac tcttgtgtat 2460
atttccaaat ttttgtatag tcgctgcaca tatttgaaat catatattaa gactttccaa 2520
agatgaggtc cctggttttt catggcaact tgatcagtaa ggatttcacc tctgtttgta 2580
actaaaacca tctactatat gttagacatg acattctttt tctctccttc ctgaaaaata 2640
aagtgtggga agagacaaaa aaaaaaaaa 2669
<210> 178
<211> 560
<212> PRT
<213> 智人
<400> 178
Met Glu Cys Leu Tyr Tyr Phe Leu Gly Phe Leu Leu Leu Ala Ala Arg
1 5 10 15
Leu Pro Leu Asp Ala Ala Lys Arg Phe His Asp Val Leu Gly Asn Glu
20 25 30
Arg Pro Ser Ala Tyr Met Arg Glu His Asn Gln Leu Asn Gly Trp Ser
35 40 45
Ser Asp Glu Asn Asp Trp Asn Glu Lys Leu Tyr Pro Val Trp Lys Arg
50 55 60
Gly Asp Met Arg Trp Lys Asn Ser Trp Lys Gly Gly Arg Val Gln Ala
65 70 75 80
Val Leu Thr Ser Asp Ser Pro Ala Leu Val Gly Ser Asn Ile Thr Phe
85 90 95
Ala Val Asn Leu Ile Phe Pro Arg Cys Gln Lys Glu Asp Ala Asn Gly
100 105 110
Asn Ile Val Tyr Glu Lys Asn Cys Arg Asn Glu Ala Gly Leu Ser Ala
115 120 125
Asp Pro Tyr Val Tyr Asn Trp Thr Ala Trp Ser Glu Asp Ser Asp Gly
130 135 140
Glu Asn Gly Thr Gly Gln Ser His His Asn Val Phe Pro Asp Gly Lys
145 150 155 160
Pro Phe Pro His His Pro Gly Trp Arg Arg Trp Asn Phe Ile Tyr Val
165 170 175
Phe His Thr Leu Gly Gln Tyr Phe Gln Lys Leu Gly Arg Cys Ser Val
180 185 190
Arg Val Ser Val Asn Thr Ala Asn Val Thr Leu Gly Pro Gln Leu Met
195 200 205
Glu Val Thr Val Tyr Arg Arg His Gly Arg Ala Tyr Val Pro Ile Ala
210 215 220
Gln Val Lys Asp Val Tyr Val Val Thr Asp Gln Ile Pro Val Phe Val
225 230 235 240
Thr Met Phe Gln Lys Asn Asp Arg Asn Ser Ser Asp Glu Thr Phe Leu
245 250 255
Lys Asp Leu Pro Ile Met Phe Asp Val Leu Ile His Asp Pro Ser His
260 265 270
Phe Leu Asn Tyr Ser Thr Ile Asn Tyr Lys Trp Ser Phe Gly Asp Asn
275 280 285
Thr Gly Leu Phe Val Ser Thr Asn His Thr Val Asn His Thr Tyr Val
290 295 300
Leu Asn Gly Thr Phe Ser Leu Asn Leu Thr Val Lys Ala Ala Ala Pro
305 310 315 320
Gly Pro Cys Pro Pro Pro Pro Pro Pro Pro Arg Pro Ser Lys Pro Thr
325 330 335
Pro Ser Leu Gly Pro Ala Gly Asp Asn Pro Leu Glu Leu Ser Arg Ile
340 345 350
Pro Asp Glu Asn Cys Gln Ile Asn Arg Tyr Gly His Phe Gln Ala Thr
355 360 365
Ile Thr Ile Val Glu Gly Ile Leu Glu Val Asn Ile Ile Gln Met Thr
370 375 380
Asp Val Leu Met Pro Val Pro Trp Pro Glu Ser Ser Leu Ile Asp Phe
385 390 395 400
Val Val Thr Cys Gln Gly Ser Ile Pro Thr Glu Val Cys Thr Ile Ile
405 410 415
Ser Asp Pro Thr Cys Glu Ile Thr Gln Asn Thr Val Cys Ser Pro Val
420 425 430
Asp Val Asp Glu Met Cys Leu Leu Thr Val Arg Arg Thr Phe Asn Gly
435 440 445
Ser Gly Thr Tyr Cys Val Asn Leu Thr Leu Gly Asp Asp Thr Ser Leu
450 455 460
Ala Leu Thr Ser Thr Leu Ile Ser Val Pro Asp Arg Asp Pro Ala Ser
465 470 475 480
Pro Leu Arg Met Ala Asn Ser Ala Leu Ile Ser Val Gly Cys Leu Ala
485 490 495
Ile Phe Val Thr Val Ile Ser Leu Leu Val Tyr Lys Lys His Lys Glu
500 505 510
Tyr Asn Pro Ile Glu Asn Ser Pro Gly Asn Val Val Arg Ser Lys Gly
515 520 525
Leu Ser Val Phe Leu Asn Arg Ala Lys Ala Val Phe Phe Pro Gly Asn
530 535 540
Gln Glu Lys Asp Pro Leu Leu Lys Asn Gln Glu Phe Lys Gly Val Ser
545 550 555 560
<210> 179
<211> 1440
<212> DNA
<213> 智人
<400> 179
gttacccagc attgtgagtg acagagcctg gatctgaacg ctgatcccat aatgcatcct 60
caagtggtca tcttaagcct catcctacat ctggcagatt ctgtagctgg ttctgtaaag 120
gttggtggag aggcaggtcc atctgtcaca ctaccctgcc actacagtgg agctgtcaca 180
tcaatgtgct ggaatagagg ctcatgttct ctattcacat gccaaaatgg cattgtctgg 240
accaatggaa cccacgtcac ctatcggaag gacacacgct ataagctatt gggggacctt 300
tcaagaaggg atgtctcttt gaccatagaa aatacagctg tgtctgacag tggcgtatat 360
tgttgccgtg ttgagcaccg tgggtggttc aatgacatga aaatcaccgt atcattggag 420
attgtgccac ccaaggtcac gactactcca attgtcacaa ctgttccaac cgtcacgact 480
gttcgaacga gcaccactgt tccaacgaca acgactgttc caacgacaac tgttccaaca 540
acaatgagca ttccaacgac aacgactgtt ccgacgacaa tgactgtttc aacgacaacg 600
agcgttccaa cgacaacgag cattccaaca acaacaagtg ttccagtgac aacaacggtc 660
tctacctttg ttcctccaat gcctttgccc aggcagaacc atgaaccagt agccacttca 720
ccatcttcac ctcagccagc agaaacccac cctacgacac tgcagggagc aataaggaga 780
gaacccacca gctcaccatt gtactcttac acaacagatg ggaatgacac cgtgacagag 840
tcttcagatg gcctttggaa taacaatcaa actcaactgt tcctagaaca tagtctactg 900
acggccaata ccactaaagg aatctatgct ggagtctgta tttctgtctt ggtgcttctt 960
gctcttttgg gtgtcatcat tgccaaaaag tatttcttca aaaaggaggt tcaacaacta 1020
agtgtttcat ttagcagcct tcaaattaaa gctttgcaaa atgcagttga aaaggaagtc 1080
caagcagaag acaatatcta cattgagaat agtctttatg ccacggacta agacccagtg 1140
gtgctctttg agagtttacg cccatgactg cagaagactg aacaggtatc agcacatcag 1200
atgtctttta gactccaaga caatttttct gtttcagttt catctggcat tccaacatgt 1260
cagtgatact gggtagagta actctcccac tccaaactgt gtatagtcaa cctcatcatt 1320
aatgtagtcc taatttgttt tgctaaaact ggctcaatcc ttctgatcat tgcagagttt 1380
tctctcaaac atgaacactt tagaattgta tgttctcttt agaccccata aatcctgtat 1440
<210> 180
<211> 359
<212> PRT
<213> 智人
<400> 180
Met His Pro Gln Val Val Ile Leu Ser Leu Ile Leu His Leu Ala Asp
1 5 10 15
Ser Val Ala Gly Ser Val Lys Val Gly Gly Glu Ala Gly Pro Ser Val
20 25 30
Thr Leu Pro Cys His Tyr Ser Gly Ala Val Thr Ser Met Cys Trp Asn
35 40 45
Arg Gly Ser Cys Ser Leu Phe Thr Cys Gln Asn Gly Ile Val Trp Thr
50 55 60
Asn Gly Thr His Val Thr Tyr Arg Lys Asp Thr Arg Tyr Lys Leu Leu
65 70 75 80
Gly Asp Leu Ser Arg Arg Asp Val Ser Leu Thr Ile Glu Asn Thr Ala
85 90 95
Val Ser Asp Ser Gly Val Tyr Cys Cys Arg Val Glu His Arg Gly Trp
100 105 110
Phe Asn Asp Met Lys Ile Thr Val Ser Leu Glu Ile Val Pro Pro Lys
115 120 125
Val Thr Thr Thr Pro Ile Val Thr Thr Val Pro Thr Val Thr Thr Val
130 135 140
Arg Thr Ser Thr Thr Val Pro Thr Thr Thr Thr Val Pro Thr Thr Thr
145 150 155 160
Val Pro Thr Thr Met Ser Ile Pro Thr Thr Thr Thr Val Pro Thr Thr
165 170 175
Met Thr Val Ser Thr Thr Thr Ser Val Pro Thr Thr Thr Ser Ile Pro
180 185 190
Thr Thr Thr Ser Val Pro Val Thr Thr Thr Val Ser Thr Phe Val Pro
195 200 205
Pro Met Pro Leu Pro Arg Gln Asn His Glu Pro Val Ala Thr Ser Pro
210 215 220
Ser Ser Pro Gln Pro Ala Glu Thr His Pro Thr Thr Leu Gln Gly Ala
225 230 235 240
Ile Arg Arg Glu Pro Thr Ser Ser Pro Leu Tyr Ser Tyr Thr Thr Asp
245 250 255
Gly Asn Asp Thr Val Thr Glu Ser Ser Asp Gly Leu Trp Asn Asn Asn
260 265 270
Gln Thr Gln Leu Phe Leu Glu His Ser Leu Leu Thr Ala Asn Thr Thr
275 280 285
Lys Gly Ile Tyr Ala Gly Val Cys Ile Ser Val Leu Val Leu Leu Ala
290 295 300
Leu Leu Gly Val Ile Ile Ala Lys Lys Tyr Phe Phe Lys Lys Glu Val
305 310 315 320
Gln Gln Leu Ser Val Ser Phe Ser Ser Leu Gln Ile Lys Ala Leu Gln
325 330 335
Asn Ala Val Glu Lys Glu Val Gln Ala Glu Asp Asn Ile Tyr Ile Glu
340 345 350
Asn Ser Leu Tyr Ala Thr Asp
355
<210> 181
<211> 2671
<212> DNA
<213> 智人
<400> 181
ggtgagtcac caaggaaggc agcggcagct ccactcagcc actgcaattg tgtccggaat 60
ttattccttc tggtgggttc atggtcttgc tgacttcaag aatgaagccg ctgacctcgc 120
gcataattag catcatcatt attctggctg gagcaattgc actcatcatt ggctttggta 180
tttcagggag acactccatc acagtcacta ctgtcgcctc agctgggaac attggggagg 240
atggaatcct gagctgcact tttgaacctg acatcaaact ttctgatatc gtgatacaat 300
ggctgaagga aggtgtttta ggcttggtcc atgagttcaa agaaggcaaa gatgagctgt 360
cggagcagga tgaaatgttc agaggccgga cagcagtgtt tgctgatcaa gtgatagttg 420
gcaatgcctc tttgcggctg aaaaacgtgc aactcacaga tgctggcacc tacaaatgtt 480
atatcatcac ttctaaaggc aaggggaatg ctaaccttga gtataaaact ggagccttca 540
gcatgccgga agtgaatgtg gactataatg ccagctcaga gaccttgcgg tgtgaggctc 600
cccgatggtt cccccggccc acagtggtct gggcatccca agttgaccag ggagccaact 660
tctcggaagt ctccaatacc agctttgagc tgaactctga gaatgtgacc atgaaggtag 720
tgtctgtgct ctacaatgtt acgatcaaca acacatactc ctgtatggtt gaaaatgaca 780
ttgccaaagc aacaggggat atcaaagtga cagaatcgga gatcaaaagg cggagtcacc 840
tacagctgct aaactcaaag gcttctctgt gtgtctcttc tttctttgcc atcagctggg 900
cacttctgcc tctcagccct tacctgatgc taaaataatg tgccttggcc acaaaaaagc 960
atgcaaagtc attgttacaa cagggatcta cagaactatt tcaccaccag atatgaccta 1020
gttttatatt tctgggagga aatgaattca tatctagaag tctggagtga gcaaacaaga 1080
gcaagaaaca aaaagaagcc aaaagcagaa ggctccaata tgaacaagat aaatctatct 1140
tcaaagacat attagaagtt gggaaaataa ttcatgtgaa ctagacaagt gtgttaagag 1200
tgataagtaa aatgcacgtg gagacaagtg catccccaga tctcagggac ctccccctgc 1260
ctgtcacctg gggagtgaga ggacaggata gtgcatgttc tttgtctctg aatttttagt 1320
tatatgtgcc gtaatgttgc tctgaggaag cccctggaaa gtctatccca acatatccac 1380
atcttatatt ccacaaatta agctgtagta tgtaccctaa gacgctgcta attgactgcc 1440
acttcgcaac tcaggggcgg ctgcatttta gtaatgggtc aaatgattca ctttttatga 1500
tgcttccaaa ggtgccttgg cttctcttcc caactgacaa atgccaaagt tgagaaaaat 1560
gatcataatt ttagcataaa cagagcagtc ggcgacaccg attttataaa taaactgagc 1620
accttctttt taaacaaaca aatgcgggtt tatttctcag atgatgttca tccgtgaatg 1680
gtccagggaa ggacctttca ccttgtctat atggcattat gtcatcacaa gctctgaggc 1740
ttctcctttc catcctgcgt ggacagctaa gacctcagtt ttcaatagca tctagagcag 1800
tgggactcag ctggggtgat ttcgcccccc atctccgggg gaatgtctga agacaatttt 1860
ggttacctca atgagggagt ggaggaggat acagtgccac taccaactag tggatagagg 1920
ccagggatgc tgctcaacct cctaccatgt acaggacgtc tccccattac aactacccaa 1980
tccgaagtgt caactgtgtc agggctaagg aaccctggtt ttgagtagaa aagggcctgg 2040
aaagagggga gccaacaaat ctgtctgctt cctcacatta gtcattggca aataagcatt 2100
ctgtctcttt ggctgctgcc tcagcacaga gagccagaac tctatcgggc accaggataa 2160
catctctcag tgaacagagt tgacaaggcc tatgggaaat gcctgatggg attatcttca 2220
gcttgttgag cttctaagtt tctttccctt cattctaccc tgcaagccaa gttctgtaag 2280
agaaatgcct gagttctagc tcaggttttc ttactctgaa tttagatctc cagaccctgc 2340
ctggccacaa ttcaaattaa ggcaacaaac atataccttc catgaagcac acacagactt 2400
ttgaaagcaa ggacaatgac tgcttgaatt gaggccttga ggaatgaagc tttgaaggaa 2460
aagaatactt tgtttccagc ccccttccca cactcttcat gtgttaacca ctgccttcct 2520
ggaccttgga gccacggtga ctgtattaca tgttgttata gaaaactgat tttagagttc 2580
tgatcgttca agagaatgat taaatataca tttcctacaa aaaaaaaaaa aaaaaaaaaa 2640
aaaaaaaaaa aataaaaaaa aaaaaaaaaa a 2671
<210> 182
<211> 1164
<212> DNA
<213> 智人
<400> 182
agtggggcca ggggagtcat ctttttccgt tgcgggtacc cacctgccgc tggccagcat 60
tgccccagag gcatccttgg gtggctgctt gtataaactg caaggtccac aaagtgttga 120
acaagttgtt gaatgttttg gccaacactg cacctgctgt cttccctaca gatttgcagg 180
ctgggaaggc tagacttcct cagcccgagg gctgcccttc caaactctat cggctgatgc 240
agcgctgctg ggccctcagc cccaaggacc ggccctcctt cagtgagatt gccagcgccc 300
tgggagacag caccgtggac agcaagccgt gaggagggag cccgctcagg atggcctggg 360
caggggagga catctctaga gggaagctca cagcatgatg ggcaagatcc ctgtcctcct 420
gggccctgag gcccctgccc tagtgcaaca ggcattgctg aggtctgagc agggcctggc 480
ctttcctcct cttcctcacc ctcatccttt gggaggctga cttggaccca aactgggcga 540
ctagggcttt gagctgggca gttttccctg ccacctcttc ctctatcagg gacagtgtgg 600
gtgccacagg taaccccaat ttctggcctt caacttctcc ccttgaccgg gtccaactct 660
gccactcatc tgccaacttt gcctggggag ggctaggctt gggatgagct gggtttgtgg 720
ggagttcctt aatattctca agttctgggc acacagggtt aatgagtctc ttggcccact 780
ggtcccactt gggggtctag accaggatta tagaggacac agcaagtgag tcctccccac 840
tctgggcttg tgcacactga cccagaccca cgtcttcccc acccttctct cctttcctca 900
tcctaagtgc ctggcagatg aaggagtttt caggagcttt tgacactata taaaccgccc 960
tttttgtatg caccacgggc ggcttttata tgtaattgca gcgtggggtg ggtgggcatg 1020
ggaggtaggg gtgggccctg gagatgagga gggtgggcca tccttacccc acacttttat 1080
tgttgtcgtt ttttgtttgt ttttgttttt ttgtttttgt ttttgttttt acactcgctg 1140
ctctcaataa ataagccttt ttta 1164
<210> 183
<211> 1070
<212> PRT
<213> 智人
<400> 183
Met Gly Ala Ala Arg Gly Ser Pro Ala Arg Pro Arg Arg Leu Pro Leu
1 5 10 15
Leu Ser Val Leu Leu Leu Pro Leu Leu Gly Gly Thr Gln Thr Ala Ile
20 25 30
Val Phe Ile Lys Gln Pro Ser Ser Gln Asp Ala Leu Gln Gly Arg Arg
35 40 45
Ala Leu Leu Arg Cys Glu Val Glu Ala Pro Gly Pro Val His Val Tyr
50 55 60
Trp Leu Leu Asp Gly Ala Pro Val Gln Asp Thr Glu Arg Arg Phe Ala
65 70 75 80
Gln Gly Ser Ser Leu Ser Phe Ala Ala Val Asp Arg Leu Gln Asp Ser
85 90 95
Gly Thr Phe Gln Cys Val Ala Arg Asp Asp Val Thr Gly Glu Glu Ala
100 105 110
Arg Ser Ala Asn Ala Ser Phe Asn Ile Lys Trp Ile Glu Ala Gly Pro
115 120 125
Val Val Leu Lys His Pro Ala Ser Glu Ala Glu Ile Gln Pro Gln Thr
130 135 140
Gln Val Thr Leu Arg Cys His Ile Asp Gly His Pro Arg Pro Thr Tyr
145 150 155 160
Gln Trp Phe Arg Asp Gly Thr Pro Leu Ser Asp Gly Gln Ser Asn His
165 170 175
Thr Val Ser Ser Lys Glu Arg Asn Leu Thr Leu Arg Pro Ala Gly Pro
180 185 190
Glu His Ser Gly Leu Tyr Ser Cys Cys Ala His Ser Ala Phe Gly Gln
195 200 205
Ala Cys Ser Ser Gln Asn Phe Thr Leu Ser Ile Ala Asp Glu Ser Phe
210 215 220
Ala Arg Val Val Leu Ala Pro Gln Asp Val Val Val Ala Arg Tyr Glu
225 230 235 240
Glu Ala Met Phe His Cys Gln Phe Ser Ala Gln Pro Pro Pro Ser Leu
245 250 255
Gln Trp Leu Phe Glu Asp Glu Thr Pro Ile Thr Asn Arg Ser Arg Pro
260 265 270
Pro His Leu Arg Arg Ala Thr Val Phe Ala Asn Gly Ser Leu Leu Leu
275 280 285
Thr Gln Val Arg Pro Arg Asn Ala Gly Ile Tyr Arg Cys Ile Gly Gln
290 295 300
Gly Gln Arg Gly Pro Pro Ile Ile Leu Glu Ala Thr Leu His Leu Ala
305 310 315 320
Glu Ile Glu Asp Met Pro Leu Phe Glu Pro Arg Val Phe Thr Ala Gly
325 330 335
Ser Glu Glu Arg Val Thr Cys Leu Pro Pro Lys Gly Leu Pro Glu Pro
340 345 350
Ser Val Trp Trp Glu His Ala Gly Val Arg Leu Pro Thr His Gly Arg
355 360 365
Val Tyr Gln Lys Gly His Glu Leu Val Leu Ala Asn Ile Ala Glu Ser
370 375 380
Asp Ala Gly Val Tyr Thr Cys His Ala Ala Asn Leu Ala Gly Gln Arg
385 390 395 400
Arg Gln Asp Val Asn Ile Thr Val Ala Thr Val Pro Ser Trp Leu Lys
405 410 415
Lys Pro Gln Asp Ser Gln Leu Glu Glu Gly Lys Pro Gly Tyr Leu Asp
420 425 430
Cys Leu Thr Gln Ala Thr Pro Lys Pro Thr Val Val Trp Tyr Arg Asn
435 440 445
Gln Met Leu Ile Ser Glu Asp Ser Arg Phe Glu Val Phe Lys Asn Gly
450 455 460
Thr Leu Arg Ile Asn Ser Val Glu Val Tyr Asp Gly Thr Trp Tyr Arg
465 470 475 480
Cys Met Ser Ser Thr Pro Ala Gly Ser Ile Glu Ala Gln Ala Arg Val
485 490 495
Gln Val Leu Glu Lys Leu Lys Phe Thr Pro Pro Pro Gln Pro Gln Gln
500 505 510
Cys Met Glu Phe Asp Lys Glu Ala Thr Val Pro Cys Ser Ala Thr Gly
515 520 525
Arg Glu Lys Pro Thr Ile Lys Trp Glu Arg Ala Asp Gly Ser Ser Leu
530 535 540
Pro Glu Trp Val Thr Asp Asn Ala Gly Thr Leu His Phe Ala Arg Val
545 550 555 560
Thr Arg Asp Asp Ala Gly Asn Tyr Thr Cys Ile Ala Ser Asn Gly Pro
565 570 575
Gln Gly Gln Ile Arg Ala His Val Gln Leu Thr Val Ala Val Phe Ile
580 585 590
Thr Phe Lys Val Glu Pro Glu Arg Thr Thr Val Tyr Gln Gly His Thr
595 600 605
Ala Leu Leu Gln Cys Glu Ala Gln Gly Asp Pro Lys Pro Leu Ile Gln
610 615 620
Trp Lys Gly Lys Asp Arg Ile Leu Asp Pro Thr Lys Leu Gly Pro Arg
625 630 635 640
Met His Ile Phe Gln Asn Gly Ser Leu Val Ile His Asp Val Ala Pro
645 650 655
Glu Asp Ser Gly Arg Tyr Thr Cys Ile Ala Gly Asn Ser Cys Asn Ile
660 665 670
Lys His Thr Glu Ala Pro Leu Tyr Val Val Asp Lys Pro Val Pro Glu
675 680 685
Glu Ser Glu Gly Pro Gly Ser Pro Pro Pro Tyr Lys Met Ile Gln Thr
690 695 700
Ile Gly Leu Ser Val Gly Ala Ala Val Ala Tyr Ile Ile Ala Val Leu
705 710 715 720
Gly Leu Met Phe Tyr Cys Lys Lys Arg Cys Lys Ala Lys Arg Leu Gln
725 730 735
Lys Gln Pro Glu Gly Glu Glu Pro Glu Met Glu Cys Leu Asn Gly Gly
740 745 750
Pro Leu Gln Asn Gly Gln Pro Ser Ala Glu Ile Gln Glu Glu Val Ala
755 760 765
Leu Thr Ser Leu Gly Ser Gly Pro Ala Ala Thr Asn Lys Arg His Ser
770 775 780
Thr Ser Asp Lys Met His Phe Pro Arg Ser Ser Leu Gln Pro Ile Thr
785 790 795 800
Thr Leu Gly Lys Ser Glu Phe Gly Glu Val Phe Leu Ala Lys Ala Gln
805 810 815
Gly Leu Glu Glu Gly Val Ala Glu Thr Leu Val Leu Val Lys Ser Leu
820 825 830
Gln Ser Lys Asp Glu Gln Gln Gln Leu Asp Phe Arg Arg Glu Leu Glu
835 840 845
Met Phe Gly Lys Leu Asn His Ala Asn Val Val Arg Leu Leu Gly Leu
850 855 860
Cys Arg Glu Ala Glu Pro His Tyr Met Val Leu Glu Tyr Val Asp Leu
865 870 875 880
Gly Asp Leu Lys Gln Phe Leu Arg Ile Ser Lys Ser Lys Asp Glu Lys
885 890 895
Leu Lys Ser Gln Pro Leu Ser Thr Lys Gln Lys Val Ala Leu Cys Thr
900 905 910
Gln Val Ala Leu Gly Met Glu His Leu Ser Asn Asn Arg Phe Val His
915 920 925
Lys Asp Leu Ala Ala Arg Asn Cys Leu Val Ser Ala Gln Arg Gln Val
930 935 940
Lys Val Ser Ala Leu Gly Leu Ser Lys Asp Val Tyr Asn Ser Glu Tyr
945 950 955 960
Tyr His Phe Arg Gln Ala Trp Val Pro Leu Arg Trp Met Ser Pro Glu
965 970 975
Ala Ile Leu Glu Gly Asp Phe Ser Thr Lys Ser Asp Val Trp Ala Phe
980 985 990
Gly Val Leu Met Trp Glu Val Phe Thr His Gly Glu Met Pro His Gly
995 1000 1005
Gly Gln Ala Asp Asp Glu Val Leu Ala Asp Leu Gln Ala Gly Lys
1010 1015 1020
Ala Arg Leu Pro Gln Pro Glu Gly Cys Pro Ser Lys Leu Tyr Arg
1025 1030 1035
Leu Met Gln Arg Cys Trp Ala Leu Ser Pro Lys Asp Arg Pro Ser
1040 1045 1050
Phe Ser Glu Ile Ala Ser Ala Leu Gly Asp Ser Thr Val Asp Ser
1055 1060 1065
Lys Pro
1070
<210> 184
<211> 1229
<212> DNA
<213> 智人
<400> 184
ttcctttctc tctcagctct ccgtctctct ttctctctca gcctctttct ttctccctgt 60
ctcccccact gtcagcacct cttctgtgtg gtgagtggac cgcttacccc actaggtgaa 120
gatgtcagcc caggagagct gcctcagcct catcaagtcc tcggcagcct gatcttctgc 180
ttcggcatct ggatcctcat tgacaagacc agcttcgtgt cctttgtggg cttggccttc 240
gtgcctctgc agatctggtc caaagtcctg gccatctcag gaatcttcac catgggcatc 300
gccctcctgg gttgtgtggg ggccctcaag gagctccgct gcctcctggg cctgtatttt 360
gggatgctgc tgctcctgtt tgccacacag atcaccctgg gaatcctcat ctccactcag 420
cgggcccagc tggagcgaag cttgcgggac gtcgtagaga aaaccatcca aaagtacggc 480
accaaccccg aggagaccgc ggccgaggag agctgggact atgtgcagtt ccagctgcgc 540
tgctgcggct ggcactaccc gcaggactgg ttccaagtcc tcatcctgag aggtaacggg 600
tcggaggcgc accgcgtgcc ctgctcctgc tacaacttgt cggcgaccaa cgactccaca 660
atcctagata aggtgatctt gccccagctc agcaggcttg gacacctggc gcggtccaga 720
cacagtgcag acatctgcgc tgtccctgca gagagccaca tctaccgcga gggctgcgcg 780
cagggcctcc agaagtggct gcacaacaac cttatttcca tagtgggcat ttgcctgggc 840
gtcggcctac tcgagctcgg gttcatgacg ctctcgatat tcctgtgcag aaacctggac 900
cacgtctaca accggctcgc tcgataccgt taggccccgc cctccccaaa gtcccgcccc 960
gcccccgtca cgtgcgctgg gcacttccct gctgcctgta aatatttgtt taatccccag 1020
ttcgcctgga gccctccgcc ttcacattcc cctggggacc cacgtggctg cgtgcccctg 1080
ctgctgtcac ctctcccacg ggacctgggg ctttcgtcca cagcttcctg tccccatctg 1140
tcggcctacc accacccaca agattatttt tcacccaaac ctcaaataaa tcccctgcgt 1200
ttttggtaaa aaaaaaaaaa aaaaaaaaa 1229
<210> 185
<211> 213
<212> PRT
<213> 智人
<400> 185
Met Gly Ile Ala Leu Leu Gly Cys Val Gly Ala Leu Lys Glu Leu Arg
1 5 10 15
Cys Leu Leu Gly Leu Tyr Phe Gly Met Leu Leu Leu Leu Phe Ala Thr
20 25 30
Gln Ile Thr Leu Gly Ile Leu Ile Ser Thr Gln Arg Ala Gln Leu Glu
35 40 45
Arg Ser Leu Arg Asp Val Val Glu Lys Thr Ile Gln Lys Tyr Gly Thr
50 55 60
Asn Pro Glu Glu Thr Ala Ala Glu Glu Ser Trp Asp Tyr Val Gln Phe
65 70 75 80
Gln Leu Arg Cys Cys Gly Trp His Tyr Pro Gln Asp Trp Phe Gln Val
85 90 95
Leu Ile Leu Arg Gly Asn Gly Ser Glu Ala His Arg Val Pro Cys Ser
100 105 110
Cys Tyr Asn Leu Ser Ala Thr Asn Asp Ser Thr Ile Leu Asp Lys Val
115 120 125
Ile Leu Pro Gln Leu Ser Arg Leu Gly His Leu Ala Arg Ser Arg His
130 135 140
Ser Ala Asp Ile Cys Ala Val Pro Ala Glu Ser His Ile Tyr Arg Glu
145 150 155 160
Gly Cys Ala Gln Gly Leu Gln Lys Trp Leu His Asn Asn Leu Ile Ser
165 170 175
Ile Val Gly Ile Cys Leu Gly Val Gly Leu Leu Glu Leu Gly Phe Met
180 185 190
Thr Leu Ser Ile Phe Leu Cys Arg Asn Leu Asp His Val Tyr Asn Arg
195 200 205
Leu Ala Arg Tyr Arg
210
<210> 186
<211> 3148
<212> DNA
<213> 智人
<400> 186
gtgggaaggg cctgtgggtt tattataagg cggagctcgg cgggagaggt gcgggccgaa 60
tccgagccga gcggagagga atccggcagt agagagcgga ctccagccgg cggaccctgc 120
agccctcgcc tgggacagcg gcgcgctggg caggcgccca agagagcatc gagcagcgga 180
acccgcgaag ccggcccgca gccgcgaccc gcgcagcctg ccgctctccc gccgccggtc 240
cgggcagcat gaggcgcgcg gcgctctggc tctggctgtg cgcgctggcg ctgagcctgc 300
agccggccct gccgcaaatt gtggctacta atttgccccc tgaagatcaa gatggctctg 360
gggatgactc tgacaacttc tccggctcag gtgcaggtgc tttgcaagat atcaccttgt 420
cacagcagac cccctccact tggaaggaca cgcagctcct gacggctatt cccacgtctc 480
cagaacccac cggcctggag gctacagctg cctccacctc caccctgccg gctggagagg 540
ggcccaagga gggagaggct gtagtcctgc cagaagtgga gcctggcctc accgcccggg 600
agcaggaggc caccccccga cccagggaga ccacacagct cccgaccact catcaggcct 660
caacgaccac agccaccacg gcccaggagc ccgccacctc ccacccccac agggacatgc 720
agcctggcca ccatgagacc tcaacccctg caggacccag ccaagctgac cttcacactc 780
cccacacaga ggatggaggt ccttctgcca ccgagagggc tgctgaggat ggagcctcca 840
gtcagctccc agcagcagag ggctctgggg agcaggactt cacctttgaa acctcggggg 900
agaatacggc tgtagtggcc gtggagcctg accgccggaa ccagtcccca gtggatcagg 960
gggccacggg ggcctcacag ggcctcctgg acaggaaaga ggtgctggga ggggtcattg 1020
ccggaggcct cgtggggctc atctttgctg tgtgcctggt gggtttcatg ctgtaccgca 1080
tgaagaagaa ggacgaaggc agctactcct tggaggagcc gaaacaagcc aacggcgggg 1140
cctaccagaa gcccaccaaa caggaggaat tctatgcctg acgcgggagc catgcgcccc 1200
ctccgccctg ccactcacta ggcccccact tgcctcttcc ttgaagaact gcaggccctg 1260
gcctcccctg ccaccaggcc acctccccag cattccagcc cctctggtcg ctcctgccca 1320
cggagtcgtg gggtgtgctg ggagctccac tctgcttctc tgacttctgc ctggagactt 1380
agggcaccag gggtttctcg cataggacct ttccaccaca gccagcacct ggcatcgcac 1440
cattctgact cggtttctcc aaactgaagc agcctctccc caggtccagc tctggagggg 1500
agggggatcc gactgctttg gacctaaatg gcctcatgtg gctggaagat cctgcgggtg 1560
gggcttgggg ctcacacacc tgtagcactt actggtagga ccaagcatct tgggggggtg 1620
gccgctgagt ggcaggggac aggagtccac tttgtttcgt ggggaggtct aatctagata 1680
tcgacttgtt tttgcacatg tttcctctag ttctttgttc atagcccagt agaccttgtt 1740
acttctgagg taagttaagt aagttgattc ggtatccccc catcttgctt ccctaatcta 1800
tggtcgggag acagcatcag ggttaagaag actttttttt ttttttttaa actaggagaa 1860
ccaaatctgg aagccaaaat gtaggcttag tttgtgtgtt gtctcttgag tttgtcgctc 1920
atgtgtgcaa cagggtatgg actatctgtc tggtggcccc gtttctggtg gtctgttggc 1980
aggctggcca gtccaggctg ccgtggggcc gccgcctctt tcaagcagtc gtgcctgtgt 2040
ccatgcgctc agggccatgc tgaggcctgg gccgctgcca cgttggagaa gcccgtgtga 2100
gaagtgaatg ctgggactca gccttcagac agagaggact gtagggaggg cggcaggggc 2160
ctggagatcc tcctgcagac cacgcccgtc ctgcctgtgg cgccgtctcc aggggctgct 2220
tcctcctgga aattgacgag gggtgtcttg ggcagagctg gctctgagcg cctccatcca 2280
aggccaggtt ctccgttagc tcctgtggcc ccaccctggg ccctgggctg gaatcaggaa 2340
tattttccaa agagtgatag tcttttgctt ttggcaaaac tctacttaat ccaatgggtt 2400
tttccctgta cagtagattt tccaaatgta ataaacttta atataaagta gtcctgtgaa 2460
tgccactgcc ttcgcttctt gcctctgtgc tgtgtgtgac gtgaccggac ttttctgcaa 2520
acaccaacat gttgggaaac ttggctcgaa tctctgtgcc ttcgtctttc ccatggggag 2580
ggattctggt tccagggtcc ctctgtgtat ttgctttttt gttttggctg aaattctcct 2640
ggaggtcggt aggttcagcc aaggttttat aaggctgatg tcaatttctg tgttgccaag 2700
ctccaagccc catcttctaa atggcaaagg aaggtggatg gccccagcac agcttgacct 2760
gaggctgtgg tcacagcgga ggtgtggagc cgaggcctac cccgcagaca ccttggacat 2820
cctcctccca cccggctgca gaggccagag gcccccagcc cagggctcct gcacttactt 2880
gcttatttga caacgtttca gcgactccgt tggccactcc gagaggtggg ccagtctgtg 2940
gatcagagat gcaccaccaa gccaagggaa cctgtgtccg gtattcgata ctgcgacttt 3000
ctgcctggag tgtatgactg cacatgactc gggggtgggg aaaggggtcg gctgaccatg 3060
ctcatctgct ggtccgtggg acggtgccca agccagaggc tgggttcatt tgtgtaacga 3120
caataaacgg tacttgtcat ttcgggca 3148
<210> 187
<211> 310
<212> PRT
<213> 智人
<400> 187
Met Arg Arg Ala Ala Leu Trp Leu Trp Leu Cys Ala Leu Ala Leu Ser
1 5 10 15
Leu Gln Pro Ala Leu Pro Gln Ile Val Ala Thr Asn Leu Pro Pro Glu
20 25 30
Asp Gln Asp Gly Ser Gly Asp Asp Ser Asp Asn Phe Ser Gly Ser Gly
35 40 45
Ala Gly Ala Leu Gln Asp Ile Thr Leu Ser Gln Gln Thr Pro Ser Thr
50 55 60
Trp Lys Asp Thr Gln Leu Leu Thr Ala Ile Pro Thr Ser Pro Glu Pro
65 70 75 80
Thr Gly Leu Glu Ala Thr Ala Ala Ser Thr Ser Thr Leu Pro Ala Gly
85 90 95
Glu Gly Pro Lys Glu Gly Glu Ala Val Val Leu Pro Glu Val Glu Pro
100 105 110
Gly Leu Thr Ala Arg Glu Gln Glu Ala Thr Pro Arg Pro Arg Glu Thr
115 120 125
Thr Gln Leu Pro Thr Thr His Gln Ala Ser Thr Thr Thr Ala Thr Thr
130 135 140
Ala Gln Glu Pro Ala Thr Ser His Pro His Arg Asp Met Gln Pro Gly
145 150 155 160
His His Glu Thr Ser Thr Pro Ala Gly Pro Ser Gln Ala Asp Leu His
165 170 175
Thr Pro His Thr Glu Asp Gly Gly Pro Ser Ala Thr Glu Arg Ala Ala
180 185 190
Glu Asp Gly Ala Ser Ser Gln Leu Pro Ala Ala Glu Gly Ser Gly Glu
195 200 205
Gln Asp Phe Thr Phe Glu Thr Ser Gly Glu Asn Thr Ala Val Val Ala
210 215 220
Val Glu Pro Asp Arg Arg Asn Gln Ser Pro Val Asp Gln Gly Ala Thr
225 230 235 240
Gly Ala Ser Gln Gly Leu Leu Asp Arg Lys Glu Val Leu Gly Gly Val
245 250 255
Ile Ala Gly Gly Leu Val Gly Leu Ile Phe Ala Val Cys Leu Val Gly
260 265 270
Phe Met Leu Tyr Arg Met Lys Lys Lys Asp Glu Gly Ser Tyr Ser Leu
275 280 285
Glu Glu Pro Lys Gln Ala Asn Gly Gly Ala Tyr Gln Lys Pro Thr Lys
290 295 300
Gln Glu Glu Phe Tyr Ala
305 310
<210> 188
<211> 1506
<212> DNA
<213> 智人
<400> 188
ctgcctgggg agcccccccg ccccacatcc tgccccgcaa aaggcagctt caccaaagtg 60
gggtatttcc agcctttgta gctttcactt ccacatctac caagtgggcg gagtggcctt 120
ctgtggacga atcagattcc tctccagcac cgactttaag aggcgagccg gggggtcagg 180
gtcccagatg cacaggagga gaagcaggag ctgtcgggaa gatcagaagc cagtcatgga 240
tgaccagcgc gaccttatct ccaacaatga gcaactgccc atgctgggcc ggcgccctgg 300
ggccccggag agcaagtgca gccgcggagc cctgtacaca ggcttttcca tcctggtgac 360
tctgctcctc gctggccagg ccaccaccgc ctacttcctg taccagcagc agggccggct 420
ggacaaactg acagtcacct cccagaacct gcagctggag aacctgcgca tgaagcttcc 480
caagcctccc aagcctgtga gcaagatgcg catggccacc ccgctgctga tgcaggcgct 540
gcccatggga gccctgcccc aggggcccat gcagaatgcc accaagtatg gcaacatgac 600
agaggaccat gtgatgcacc tgctccagaa tgctgacccc ctgaaggtgt acccgccact 660
gaaggggagc ttcccggaga acctgagaca ccttaagaac accatggaga ccatagactg 720
gaaggtcttt gagagctgga tgcaccattg gctcctgttt gaaatgagca ggcactcctt 780
ggagcaaaag cccactgacg ctccaccgaa agagtcactg gaactggagg acccgtcttc 840
tgggctgggt gtgaccaagc aggatctggg cccagtcccc atgtgagagc agcagaggcg 900
gtcttcaaca tcctgccagc cccacacagc tacagctttc ttgctccctt cagcccccag 960
cccctccccc atctcccacc ctgtacctca tcccatgaga ccctggtgcc tggctctttc 1020
gtcacccttg gacaagacaa accaagtcgg aacagcagat aacaatgcag caaggccctg 1080
ctgcccaatc tccatctgtc aacaggggcg tgaggtccca ggaagtggcc aaaagctaga 1140
cagatccccg ttcctgacat cacagcagcc tccaacacaa ggctccaaga cctaggctca 1200
tggacgagat gggaaggcac agggagaagg gataacccta cacccagacc ccaggctgga 1260
catgctgact gtcctctccc ctccagcctt tggccttggc ttttctagcc tatttacctg 1320
caggctgagc cactctcttc cctttcccca gcatcactcc ccaaggaaga gccaatgttt 1380
tccacccata atcctttctg ccgaccccta gttccctctg ctcagccaag cttgttatca 1440
gctttcaggg ccatggttca cattagaata aaaggtagta attagaacaa aaaaaaaaaa 1500
aaaaaa 1506
<210> 189
<211> 232
<212> PRT
<213> 智人
<400> 189
Met His Arg Arg Arg Ser Arg Ser Cys Arg Glu Asp Gln Lys Pro Val
1 5 10 15
Met Asp Asp Gln Arg Asp Leu Ile Ser Asn Asn Glu Gln Leu Pro Met
20 25 30
Leu Gly Arg Arg Pro Gly Ala Pro Glu Ser Lys Cys Ser Arg Gly Ala
35 40 45
Leu Tyr Thr Gly Phe Ser Ile Leu Val Thr Leu Leu Leu Ala Gly Gln
50 55 60
Ala Thr Thr Ala Tyr Phe Leu Tyr Gln Gln Gln Gly Arg Leu Asp Lys
65 70 75 80
Leu Thr Val Thr Ser Gln Asn Leu Gln Leu Glu Asn Leu Arg Met Lys
85 90 95
Leu Pro Lys Pro Pro Lys Pro Val Ser Lys Met Arg Met Ala Thr Pro
100 105 110
Leu Leu Met Gln Ala Leu Pro Met Gly Ala Leu Pro Gln Gly Pro Met
115 120 125
Gln Asn Ala Thr Lys Tyr Gly Asn Met Thr Glu Asp His Val Met His
130 135 140
Leu Leu Gln Asn Ala Asp Pro Leu Lys Val Tyr Pro Pro Leu Lys Gly
145 150 155 160
Ser Phe Pro Glu Asn Leu Arg His Leu Lys Asn Thr Met Glu Thr Ile
165 170 175
Asp Trp Lys Val Phe Glu Ser Trp Met His His Trp Leu Leu Phe Glu
180 185 190
Met Ser Arg His Ser Leu Glu Gln Lys Pro Thr Asp Ala Pro Pro Lys
195 200 205
Glu Ser Leu Glu Leu Glu Asp Pro Ser Ser Gly Leu Gly Val Thr Lys
210 215 220
Gln Asp Leu Gly Pro Val Pro Met
225 230
<210> 190
<211> 5616
<212> DNA
<213> 智人
<400> 190
ccccggcgca gcgcggccgc agcagcctcc gccccccgca cggtgtgagc gcccgacgcg 60
gccgaggcgg ccggagtccc gagctagccc cggcggccgc cgccgcccag accggacgac 120
aggccacctc gtcggcgtcc gcccgagtcc ccgcctcgcc gccaacgcca caaccaccgc 180
gcacggcccc ctgactccgt ccagtattga tcgggagagc cggagcgagc tcttcgggga 240
gcagcgatgc gaccctccgg gacggccggg gcagcgctcc tggcgctgct ggctgcgctc 300
tgcccggcga gtcgggctct ggaggaaaag aaagtttgcc aaggcacgag taacaagctc 360
acgcagttgg gcacttttga agatcatttt ctcagcctcc agaggatgtt caataactgt 420
gaggtggtcc ttgggaattt ggaaattacc tatgtgcaga ggaattatga tctttccttc 480
ttaaagacca tccaggaggt ggctggttat gtcctcattg ccctcaacac agtggagcga 540
attcctttgg aaaacctgca gatcatcaga ggaaatatgt actacgaaaa ttcctatgcc 600
ttagcagtct tatctaacta tgatgcaaat aaaaccggac tgaaggagct gcccatgaga 660
aatttacagg aaatcctgca tggcgccgtg cggttcagca acaaccctgc cctgtgcaac 720
gtggagagca tccagtggcg ggacatagtc agcagtgact ttctcagcaa catgtcgatg 780
gacttccaga accacctggg cagctgccaa aagtgtgatc caagctgtcc caatgggagc 840
tgctggggtg caggagagga gaactgccag aaactgacca aaatcatctg tgcccagcag 900
tgctccgggc gctgccgtgg caagtccccc agtgactgct gccacaacca gtgtgctgca 960
ggctgcacag gcccccggga gagcgactgc ctggtctgcc gcaaattccg agacgaagcc 1020
acgtgcaagg acacctgccc cccactcatg ctctacaacc ccaccacgta ccagatggat 1080
gtgaaccccg agggcaaata cagctttggt gccacctgcg tgaagaagtg tccccgtaat 1140
tatgtggtga cagatcacgg ctcgtgcgtc cgagcctgtg gggccgacag ctatgagatg 1200
gaggaagacg gcgtccgcaa gtgtaagaag tgcgaagggc cttgccgcaa agtgtgtaac 1260
ggaataggta ttggtgaatt taaagactca ctctccataa atgctacgaa tattaaacac 1320
ttcaaaaact gcacctccat cagtggcgat ctccacatcc tgccggtggc atttaggggt 1380
gactccttca cacatactcc tcctctggat ccacaggaac tggatattct gaaaaccgta 1440
aaggaaatca cagggttttt gctgattcag gcttggcctg aaaacaggac ggacctccat 1500
gcctttgaga acctagaaat catacgcggc aggaccaagc aacatggtca gttttctctt 1560
gcagtcgtca gcctgaacat aacatccttg ggattacgct ccctcaagga gataagtgat 1620
ggagatgtga taatttcagg aaacaaaaat ttgtgctatg caaatacaat aaactggaaa 1680
aaactgtttg ggacctccgg tcagaaaacc aaaattataa gcaacagagg tgaaaacagc 1740
tgcaaggcca caggccaggt ctgccatgcc ttgtgctccc ccgagggctg ctggggcccg 1800
gagcccaggg actgcgtctc ttgccggaat gtcagccgag gcagggaatg cgtggacaag 1860
tgcaaccttc tggagggtga gccaagggag tttgtggaga actctgagtg catacagtgc 1920
cacccagagt gcctgcctca ggccatgaac atcacctgca caggacgggg accagacaac 1980
tgtatccagt gtgcccacta cattgacggc ccccactgcg tcaagacctg cccggcagga 2040
gtcatgggag aaaacaacac cctggtctgg aagtacgcag acgccggcca tgtgtgccac 2100
ctgtgccatc caaactgcac ctacggatgc actgggccag gtcttgaagg ctgtccaacg 2160
aatgggccta agatcccgtc catcgccact gggatggtgg gggccctcct cttgctgctg 2220
gtggtggccc tggggatcgg cctcttcatg cgaaggcgcc acatcgttcg gaagcgcacg 2280
ctgcggaggc tgctgcagga gagggagctt gtggagcctc ttacacccag tggagaagct 2340
cccaaccaag ctctcttgag gatcttgaag gaaactgaat tcaaaaagat caaagtgctg 2400
ggctccggtg cgttcggcac ggtgtataag ggactctgga tcccagaagg tgagaaagtt 2460
aaaattcccg tcgctatcaa ggaattaaga gaagcaacat ctccgaaagc caacaaggaa 2520
atcctcgatg aagcctacgt gatggccagc gtggacaacc cccacgtgtg ccgcctgctg 2580
ggcatctgcc tcacctccac cgtgcagctc atcacgcagc tcatgccctt cggctgcctc 2640
ctggactatg tccgggaaca caaagacaat attggctccc agtacctgct caactggtgt 2700
gtgcagatcg caaagggcat gaactacttg gaggaccgtc gcttggtgca ccgcgacctg 2760
gcagccagga acgtactggt gaaaacaccg cagcatgtca agatcacaga ttttgggctg 2820
gccaaactgc tgggtgcgga agagaaagaa taccatgcag aaggaggcaa agtgcctatc 2880
aagtggatgg cattggaatc aattttacac agaatctata cccaccagag tgatgtctgg 2940
agctacgggg tgaccgtttg ggagttgatg acctttggat ccaagccata tgacggaatc 3000
cctgccagcg agatctcctc catcctggag aaaggagaac gcctccctca gccacccata 3060
tgtaccatcg atgtctacat gatcatggtc aagtgctgga tgatagacgc agatagtcgc 3120
ccaaagttcc gtgagttgat catcgaattc tccaaaatgg cccgagaccc ccagcgctac 3180
cttgtcattc agggggatga aagaatgcat ttgccaagtc ctacagactc caacttctac 3240
cgtgccctga tggatgaaga agacatggac gacgtggtgg atgccgacga gtacctcatc 3300
ccacagcagg gcttcttcag cagcccctcc acgtcacgga ctcccctcct gagctctctg 3360
agtgcaacca gcaacaattc caccgtggct tgcattgata gaaatgggct gcaaagctgt 3420
cccatcaagg aagacagctt cttgcagcga tacagctcag accccacagg cgccttgact 3480
gaggacagca tagacgacac cttcctccca gtgcctgaat acataaacca gtccgttccc 3540
aaaaggcccg ctggctctgt gcagaatcct gtctatcaca atcagcctct gaaccccgcg 3600
cccagcagag acccacacta ccaggacccc cacagcactg cagtgggcaa ccccgagtat 3660
ctcaacactg tccagcccac ctgtgtcaac agcacattcg acagccctgc ccactgggcc 3720
cagaaaggca gccaccaaat tagcctggac aaccctgact accagcagga cttctttccc 3780
aaggaagcca agccaaatgg catctttaag ggctccacag ctgaaaatgc agaataccta 3840
agggtcgcgc cacaaagcag tgaatttatt ggagcatgac cacggaggat agtatgagcc 3900
ctaaaaatcc agactctttc gatacccagg accaagccac agcaggtcct ccatcccaac 3960
agccatgccc gcattagctc ttagacccac agactggttt tgcaacgttt acaccgacta 4020
gccaggaagt acttccacct cgggcacatt ttgggaagtt gcattccttt gtcttcaaac 4080
tgtgaagcat ttacagaaac gcatccagca agaatattgt ccctttgagc agaaatttat 4140
ctttcaaaga ggtatatttg aaaaaaaaaa aaagtatatg tgaggatttt tattgattgg 4200
ggatcttgga gtttttcatt gtcgctattg atttttactt caatgggctc ttccaacaag 4260
gaagaagctt gctggtagca cttgctaccc tgagttcatc caggcccaac tgtgagcaag 4320
gagcacaagc cacaagtctt ccagaggatg cttgattcca gtggttctgc ttcaaggctt 4380
ccactgcaaa acactaaaga tccaagaagg ccttcatggc cccagcaggc cggatcggta 4440
ctgtatcaag tcatggcagg tacagtagga taagccactc tgtcccttcc tgggcaaaga 4500
agaaacggag gggatggaat tcttccttag acttactttt gtaaaaatgt ccccacggta 4560
cttactcccc actgatggac cagtggtttc cagtcatgag cgttagactg acttgtttgt 4620
cttccattcc attgttttga aactcagtat gctgcccctg tcttgctgtc atgaaatcag 4680
caagagagga tgacacatca aataataact cggattccag cccacattgg attcatcagc 4740
atttggacca atagcccaca gctgagaatg tggaatacct aaggatagca ccgcttttgt 4800
tctcgcaaaa acgtatctcc taatttgagg ctcagatgaa atgcatcagg tcctttgggg 4860
catagatcag aagactacaa aaatgaagct gctctgaaat ctcctttagc catcacccca 4920
accccccaaa attagtttgt gttacttatg gaagatagtt ttctcctttt acttcacttc 4980
aaaagctttt tactcaaaga gtatatgttc cctccaggtc agctgccccc aaaccccctc 5040
cttacgcttt gtcacacaaa aagtgtctct gccttgagtc atctattcaa gcacttacag 5100
ctctggccac aacagggcat tttacaggtg cgaatgacag tagcattatg agtagtgtgg 5160
aattcaggta gtaaatatga aactagggtt tgaaattgat aatgctttca caacatttgc 5220
agatgtttta gaaggaaaaa agttccttcc taaaataatt tctctacaat tggaagattg 5280
gaagattcag ctagttagga gcccaccttt tttcctaatc tgtgtgtgcc ctgtaacctg 5340
actggttaac agcagtcctt tgtaaacagt gttttaaact ctcctagtca atatccaccc 5400
catccaattt atcaaggaag aaatggttca gaaaatattt tcagcctaca gttatgttca 5460
gtcacacaca catacaaaat gttccttttg cttttaaagt aatttttgac tcccagatca 5520
gtcagagccc ctacagcatt gttaagaaag tatttgattt ttgtctcaat gaaaataaaa 5580
ctatattcat ttccactcta aaaaaaaaaa aaaaaa 5616
<210> 191
<211> 1210
<212> PRT
<213> 智人
<400> 191
Met Arg Pro Ser Gly Thr Ala Gly Ala Ala Leu Leu Ala Leu Leu Ala
1 5 10 15
Ala Leu Cys Pro Ala Ser Arg Ala Leu Glu Glu Lys Lys Val Cys Gln
20 25 30
Gly Thr Ser Asn Lys Leu Thr Gln Leu Gly Thr Phe Glu Asp His Phe
35 40 45
Leu Ser Leu Gln Arg Met Phe Asn Asn Cys Glu Val Val Leu Gly Asn
50 55 60
Leu Glu Ile Thr Tyr Val Gln Arg Asn Tyr Asp Leu Ser Phe Leu Lys
65 70 75 80
Thr Ile Gln Glu Val Ala Gly Tyr Val Leu Ile Ala Leu Asn Thr Val
85 90 95
Glu Arg Ile Pro Leu Glu Asn Leu Gln Ile Ile Arg Gly Asn Met Tyr
100 105 110
Tyr Glu Asn Ser Tyr Ala Leu Ala Val Leu Ser Asn Tyr Asp Ala Asn
115 120 125
Lys Thr Gly Leu Lys Glu Leu Pro Met Arg Asn Leu Gln Glu Ile Leu
130 135 140
His Gly Ala Val Arg Phe Ser Asn Asn Pro Ala Leu Cys Asn Val Glu
145 150 155 160
Ser Ile Gln Trp Arg Asp Ile Val Ser Ser Asp Phe Leu Ser Asn Met
165 170 175
Ser Met Asp Phe Gln Asn His Leu Gly Ser Cys Gln Lys Cys Asp Pro
180 185 190
Ser Cys Pro Asn Gly Ser Cys Trp Gly Ala Gly Glu Glu Asn Cys Gln
195 200 205
Lys Leu Thr Lys Ile Ile Cys Ala Gln Gln Cys Ser Gly Arg Cys Arg
210 215 220
Gly Lys Ser Pro Ser Asp Cys Cys His Asn Gln Cys Ala Ala Gly Cys
225 230 235 240
Thr Gly Pro Arg Glu Ser Asp Cys Leu Val Cys Arg Lys Phe Arg Asp
245 250 255
Glu Ala Thr Cys Lys Asp Thr Cys Pro Pro Leu Met Leu Tyr Asn Pro
260 265 270
Thr Thr Tyr Gln Met Asp Val Asn Pro Glu Gly Lys Tyr Ser Phe Gly
275 280 285
Ala Thr Cys Val Lys Lys Cys Pro Arg Asn Tyr Val Val Thr Asp His
290 295 300
Gly Ser Cys Val Arg Ala Cys Gly Ala Asp Ser Tyr Glu Met Glu Glu
305 310 315 320
Asp Gly Val Arg Lys Cys Lys Lys Cys Glu Gly Pro Cys Arg Lys Val
325 330 335
Cys Asn Gly Ile Gly Ile Gly Glu Phe Lys Asp Ser Leu Ser Ile Asn
340 345 350
Ala Thr Asn Ile Lys His Phe Lys Asn Cys Thr Ser Ile Ser Gly Asp
355 360 365
Leu His Ile Leu Pro Val Ala Phe Arg Gly Asp Ser Phe Thr His Thr
370 375 380
Pro Pro Leu Asp Pro Gln Glu Leu Asp Ile Leu Lys Thr Val Lys Glu
385 390 395 400
Ile Thr Gly Phe Leu Leu Ile Gln Ala Trp Pro Glu Asn Arg Thr Asp
405 410 415
Leu His Ala Phe Glu Asn Leu Glu Ile Ile Arg Gly Arg Thr Lys Gln
420 425 430
His Gly Gln Phe Ser Leu Ala Val Val Ser Leu Asn Ile Thr Ser Leu
435 440 445
Gly Leu Arg Ser Leu Lys Glu Ile Ser Asp Gly Asp Val Ile Ile Ser
450 455 460
Gly Asn Lys Asn Leu Cys Tyr Ala Asn Thr Ile Asn Trp Lys Lys Leu
465 470 475 480
Phe Gly Thr Ser Gly Gln Lys Thr Lys Ile Ile Ser Asn Arg Gly Glu
485 490 495
Asn Ser Cys Lys Ala Thr Gly Gln Val Cys His Ala Leu Cys Ser Pro
500 505 510
Glu Gly Cys Trp Gly Pro Glu Pro Arg Asp Cys Val Ser Cys Arg Asn
515 520 525
Val Ser Arg Gly Arg Glu Cys Val Asp Lys Cys Asn Leu Leu Glu Gly
530 535 540
Glu Pro Arg Glu Phe Val Glu Asn Ser Glu Cys Ile Gln Cys His Pro
545 550 555 560
Glu Cys Leu Pro Gln Ala Met Asn Ile Thr Cys Thr Gly Arg Gly Pro
565 570 575
Asp Asn Cys Ile Gln Cys Ala His Tyr Ile Asp Gly Pro His Cys Val
580 585 590
Lys Thr Cys Pro Ala Gly Val Met Gly Glu Asn Asn Thr Leu Val Trp
595 600 605
Lys Tyr Ala Asp Ala Gly His Val Cys His Leu Cys His Pro Asn Cys
610 615 620
Thr Tyr Gly Cys Thr Gly Pro Gly Leu Glu Gly Cys Pro Thr Asn Gly
625 630 635 640
Pro Lys Ile Pro Ser Ile Ala Thr Gly Met Val Gly Ala Leu Leu Leu
645 650 655
Leu Leu Val Val Ala Leu Gly Ile Gly Leu Phe Met Arg Arg Arg His
660 665 670
Ile Val Arg Lys Arg Thr Leu Arg Arg Leu Leu Gln Glu Arg Glu Leu
675 680 685
Val Glu Pro Leu Thr Pro Ser Gly Glu Ala Pro Asn Gln Ala Leu Leu
690 695 700
Arg Ile Leu Lys Glu Thr Glu Phe Lys Lys Ile Lys Val Leu Gly Ser
705 710 715 720
Gly Ala Phe Gly Thr Val Tyr Lys Gly Leu Trp Ile Pro Glu Gly Glu
725 730 735
Lys Val Lys Ile Pro Val Ala Ile Lys Glu Leu Arg Glu Ala Thr Ser
740 745 750
Pro Lys Ala Asn Lys Glu Ile Leu Asp Glu Ala Tyr Val Met Ala Ser
755 760 765
Val Asp Asn Pro His Val Cys Arg Leu Leu Gly Ile Cys Leu Thr Ser
770 775 780
Thr Val Gln Leu Ile Thr Gln Leu Met Pro Phe Gly Cys Leu Leu Asp
785 790 795 800
Tyr Val Arg Glu His Lys Asp Asn Ile Gly Ser Gln Tyr Leu Leu Asn
805 810 815
Trp Cys Val Gln Ile Ala Lys Gly Met Asn Tyr Leu Glu Asp Arg Arg
820 825 830
Leu Val His Arg Asp Leu Ala Ala Arg Asn Val Leu Val Lys Thr Pro
835 840 845
Gln His Val Lys Ile Thr Asp Phe Gly Leu Ala Lys Leu Leu Gly Ala
850 855 860
Glu Glu Lys Glu Tyr His Ala Glu Gly Gly Lys Val Pro Ile Lys Trp
865 870 875 880
Met Ala Leu Glu Ser Ile Leu His Arg Ile Tyr Thr His Gln Ser Asp
885 890 895
Val Trp Ser Tyr Gly Val Thr Val Trp Glu Leu Met Thr Phe Gly Ser
900 905 910
Lys Pro Tyr Asp Gly Ile Pro Ala Ser Glu Ile Ser Ser Ile Leu Glu
915 920 925
Lys Gly Glu Arg Leu Pro Gln Pro Pro Ile Cys Thr Ile Asp Val Tyr
930 935 940
Met Ile Met Val Lys Cys Trp Met Ile Asp Ala Asp Ser Arg Pro Lys
945 950 955 960
Phe Arg Glu Leu Ile Ile Glu Phe Ser Lys Met Ala Arg Asp Pro Gln
965 970 975
Arg Tyr Leu Val Ile Gln Gly Asp Glu Arg Met His Leu Pro Ser Pro
980 985 990
Thr Asp Ser Asn Phe Tyr Arg Ala Leu Met Asp Glu Glu Asp Met Asp
995 1000 1005
Asp Val Val Asp Ala Asp Glu Tyr Leu Ile Pro Gln Gln Gly Phe
1010 1015 1020
Phe Ser Ser Pro Ser Thr Ser Arg Thr Pro Leu Leu Ser Ser Leu
1025 1030 1035
Ser Ala Thr Ser Asn Asn Ser Thr Val Ala Cys Ile Asp Arg Asn
1040 1045 1050
Gly Leu Gln Ser Cys Pro Ile Lys Glu Asp Ser Phe Leu Gln Arg
1055 1060 1065
Tyr Ser Ser Asp Pro Thr Gly Ala Leu Thr Glu Asp Ser Ile Asp
1070 1075 1080
Asp Thr Phe Leu Pro Val Pro Glu Tyr Ile Asn Gln Ser Val Pro
1085 1090 1095
Lys Arg Pro Ala Gly Ser Val Gln Asn Pro Val Tyr His Asn Gln
1100 1105 1110
Pro Leu Asn Pro Ala Pro Ser Arg Asp Pro His Tyr Gln Asp Pro
1115 1120 1125
His Ser Thr Ala Val Gly Asn Pro Glu Tyr Leu Asn Thr Val Gln
1130 1135 1140
Pro Thr Cys Val Asn Ser Thr Phe Asp Ser Pro Ala His Trp Ala
1145 1150 1155
Gln Lys Gly Ser His Gln Ile Ser Leu Asp Asn Pro Asp Tyr Gln
1160 1165 1170
Gln Asp Phe Phe Pro Lys Glu Ala Lys Pro Asn Gly Ile Phe Lys
1175 1180 1185
Gly Ser Thr Ala Glu Asn Ala Glu Tyr Leu Arg Val Ala Pro Gln
1190 1195 1200
Ser Ser Glu Phe Ile Gly Ala
1205 1210
<210> 192
<211> 4975
<212> DNA
<213> 智人
<400> 192
ctctcacaca cacacacccc tcccctgcca tccctccccg gactccggct ccggctccga 60
ttgcaatttg caacctccgc tgccgtcgcc gcagcagcca ccaattcgcc agcggttcag 120
gtggctcttg cctcgatgtc ctagcctagg ggcccccggg ccggacttgg ctgggctccc 180
ttcaccctct gcggagtcat gagggcgaac gacgctctgc aggtgctggg cttgcttttc 240
agcctggccc ggggctccga ggtgggcaac tctcaggcag tgtgtcctgg gactctgaat 300
ggcctgagtg tgaccggcga tgctgagaac caataccaga cactgtacaa gctctacgag 360
aggtgtgagg tggtgatggg gaaccttgag attgtgctca cgggacacaa tgccgacctc 420
tccttcctgc agtggattcg agaagtgaca ggctatgtcc tcgtggccat gaatgaattc 480
tctactctac cattgcccaa cctccgcgtg gtgcgaggga cccaggtcta cgatgggaag 540
tttgccatct tcgtcatgtt gaactataac accaactcca gccacgctct gcgccagctc 600
cgcttgactc agctcaccga gattctgtca gggggtgttt atattgagaa gaacgataag 660
ctttgtcaca tggacacaat tgactggagg gacatcgtga gggaccgaga tgctgagata 720
gtggtgaagg acaatggcag aagctgtccc ccctgtcatg aggtttgcaa ggggcgatgc 780
tggggtcctg gatcagaaga ctgccagaca ttgaccaaga ccatctgtgc tcctcagtgt 840
aatggtcact gctttgggcc caaccccaac cagtgctgcc atgatgagtg tgccgggggc 900
tgctcaggcc ctcaggacac agactgcttt gcctgccggc acttcaatga cagtggagcc 960
tgtgtacctc gctgtccaca gcctcttgtc tacaacaagc taactttcca gctggaaccc 1020
aatccccaca ccaagtatca gtatggagga gtttgtgtag ccagctgtcc ccataacttt 1080
gtggtggatc aaacatcctg tgtcagggcc tgtcctcctg acaagatgga agtagataaa 1140
aatgggctca agatgtgtga gccttgtggg ggactatgtc ccaaagcctg tgagggaaca 1200
ggctctggga gccgcttcca gactgtggac tcgagcaaca ttgatggatt tgtgaactgc 1260
accaagatcc tgggcaacct ggactttctg atcaccggcc tcaatggaga cccctggcac 1320
aagatccctg ccctggaccc agagaagctc aatgtcttcc ggacagtacg ggagatcaca 1380
ggttacctga acatccagtc ctggccgccc cacatgcaca acttcagtgt tttttccaat 1440
ttgacaacca ttggaggcag aagcctctac aaccggggct tctcattgtt gatcatgaag 1500
aacttgaatg tcacatctct gggcttccga tccctgaagg aaattagtgc tgggcgtatc 1560
tatataagtg ccaataggca gctctgctac caccactctt tgaactggac caaggtgctt 1620
cgggggccta cggaagagcg actagacatc aagcataatc ggccgcgcag agactgcgtg 1680
gcagagggca aagtgtgtga cccactgtgc tcctctgggg gatgctgggg cccaggccct 1740
ggtcagtgct tgtcctgtcg aaattatagc cgaggaggtg tctgtgtgac ccactgcaac 1800
tttctgaatg gggagcctcg agaatttgcc catgaggccg aatgcttctc ctgccacccg 1860
gaatgccaac ccatgggggg cactgccaca tgcaatggct cgggctctga tacttgtgct 1920
caatgtgccc attttcgaga tgggccccac tgtgtgagca gctgccccca tggagtccta 1980
ggtgccaagg gcccaatcta caagtaccca gatgttcaga atgaatgtcg gccctgccat 2040
gagaactgca cccaggggtg taaaggacca gagcttcaag actgtttagg acaaacactg 2100
gtgctgatcg gcaaaaccca tctgacaatg gctttgacag tgatagcagg attggtagtg 2160
attttcatga tgctgggcgg cacttttctc tactggcgtg ggcgccggat tcagaataaa 2220
agggctatga ggcgatactt ggaacggggt gagagcatag agcctctgga ccccagtgag 2280
aaggctaaca aagtcttggc cagaatcttc aaagagacag agctaaggaa gcttaaagtg 2340
cttggctcgg gtgtctttgg aactgtgcac aaaggagtgt ggatccctga gggtgaatca 2400
atcaagattc cagtctgcat taaagtcatt gaggacaaga gtggacggca gagttttcaa 2460
gctgtgacag atcatatgct ggccattggc agcctggacc atgcccacat tgtaaggctg 2520
ctgggactat gcccagggtc atctctgcag cttgtcactc aatatttgcc tctgggttct 2580
ctgctggatc atgtgagaca acaccggggg gcactggggc cacagctgct gctcaactgg 2640
ggagtacaaa ttgccaaggg aatgtactac cttgaggaac atggtatggt gcatagaaac 2700
ctggctgccc gaaacgtgct actcaagtca cccagtcagg ttcaggtggc agattttggt 2760
gtggctgacc tgctgcctcc tgatgataag cagctgctat acagtgaggc caagactcca 2820
attaagtgga tggcccttga gagtatccac tttgggaaat acacacacca gagtgatgtc 2880
tggagctatg gtgtgacagt ttgggagttg atgaccttcg gggcagagcc ctatgcaggg 2940
ctacgattgg ctgaagtacc agacctgcta gagaaggggg agcggttggc acagccccag 3000
atctgcacaa ttgatgtcta catggtgatg gtcaagtgtt ggatgattga tgagaacatt 3060
cgcccaacct ttaaagaact agccaatgag ttcaccagga tggcccgaga cccaccacgg 3120
tatctggtca taaagagaga gagtgggcct ggaatagccc ctgggccaga gccccatggt 3180
ctgacaaaca agaagctaga ggaagtagag ctggagccag aactagacct agacctagac 3240
ttggaagcag aggaggacaa cctggcaacc accacactgg gctccgccct cagcctacca 3300
gttggaacac ttaatcggcc acgtgggagc cagagccttt taagtccatc atctggatac 3360
atgcccatga accagggtaa tcttgggggg tcttgccagg agtctgcagt ttctgggagc 3420
agtgaacggt gcccccgtcc agtctctcta cacccaatgc cacggggatg cctggcatca 3480
gagtcatcag aggggcatgt aacaggctct gaggctgagc tccaggagaa agtgtcaatg 3540
tgtagaagcc ggagcaggag ccggagccca cggccacgcg gagatagcgc ctaccattcc 3600
cagcgccaca gtctgctgac tcctgttacc ccactctccc cacccgggtt agaggaagag 3660
gatgtcaacg gttatgtcat gccagataca cacctcaaag gtactccctc ctcccgggaa 3720
ggcacccttt cttcagtggg tctcagttct gtcctgggta ctgaagaaga agatgaagat 3780
gaggagtatg aatacatgaa ccggaggaga aggcacagtc cacctcatcc ccctaggcca 3840
agttcccttg aggagctggg ttatgagtac atggatgtgg ggtcagacct cagtgcctct 3900
ctgggcagca cacagagttg cccactccac cctgtaccca tcatgcccac tgcaggcaca 3960
actccagatg aagactatga atatatgaat cggcaacgag atggaggtgg tcctgggggt 4020
gattatgcag ccatgggggc ctgcccagca tctgagcaag ggtatgaaga gatgagagct 4080
tttcaggggc ctggacatca ggccccccat gtccattatg cccgcctaaa aactctacgt 4140
agcttagagg ctacagactc tgcctttgat aaccctgatt actggcatag caggcttttc 4200
cccaaggcta atgcccagag aacgtaactc ctgctccctg tggcactcag ggagcattta 4260
atggcagcta gtgcctttag agggtaccgt cttctcccta ttccctctct ctcccaggtc 4320
ccagcccctt ttccccagtc ccagacaatt ccattcaatc tttggaggct tttaaacatt 4380
ttgacacaaa attcttatgg tatgtagcca gctgtgcact ttcttctctt tcccaacccc 4440
aggaaaggtt ttccttattt tgtgtgcttt cccagtccca ttcctcagct tcttcacagg 4500
cactcctgga gatatgaagg attactctcc atatcccttc ctctcaggct cttgactact 4560
tggaactagg ctcttatgtg tgcctttgtt tcccatcaga ctgtcaagaa gaggaaaggg 4620
aggaaaccta gcagaggaaa gtgtaatttt ggtttatgac tcttaacccc ctagaaagac 4680
agaagcttaa aatctgtgaa gaaagaggtt aggagtagat attgattact atcataattc 4740
agcacttaac tatgagccag gcatcatact aaacttcacc tacattatct cacttagtcc 4800
tttatcatcc ttaaaacaat tctgtgacat acatattatc tcattttaca caaagggaag 4860
tcgggcatgg tggctcatgc ctgtaatctc agcactttgg gaggctgagg cagaaggatt 4920
acctgaggca aggagtttga gaccagctta gccaacatag taagaccccc atctc 4975
<210> 193
<211> 1342
<212> PRT
<213> 智人
<400> 193
Met Arg Ala Asn Asp Ala Leu Gln Val Leu Gly Leu Leu Phe Ser Leu
1 5 10 15
Ala Arg Gly Ser Glu Val Gly Asn Ser Gln Ala Val Cys Pro Gly Thr
20 25 30
Leu Asn Gly Leu Ser Val Thr Gly Asp Ala Glu Asn Gln Tyr Gln Thr
35 40 45
Leu Tyr Lys Leu Tyr Glu Arg Cys Glu Val Val Met Gly Asn Leu Glu
50 55 60
Ile Val Leu Thr Gly His Asn Ala Asp Leu Ser Phe Leu Gln Trp Ile
65 70 75 80
Arg Glu Val Thr Gly Tyr Val Leu Val Ala Met Asn Glu Phe Ser Thr
85 90 95
Leu Pro Leu Pro Asn Leu Arg Val Val Arg Gly Thr Gln Val Tyr Asp
100 105 110
Gly Lys Phe Ala Ile Phe Val Met Leu Asn Tyr Asn Thr Asn Ser Ser
115 120 125
His Ala Leu Arg Gln Leu Arg Leu Thr Gln Leu Thr Glu Ile Leu Ser
130 135 140
Gly Gly Val Tyr Ile Glu Lys Asn Asp Lys Leu Cys His Met Asp Thr
145 150 155 160
Ile Asp Trp Arg Asp Ile Val Arg Asp Arg Asp Ala Glu Ile Val Val
165 170 175
Lys Asp Asn Gly Arg Ser Cys Pro Pro Cys His Glu Val Cys Lys Gly
180 185 190
Arg Cys Trp Gly Pro Gly Ser Glu Asp Cys Gln Thr Leu Thr Lys Thr
195 200 205
Ile Cys Ala Pro Gln Cys Asn Gly His Cys Phe Gly Pro Asn Pro Asn
210 215 220
Gln Cys Cys His Asp Glu Cys Ala Gly Gly Cys Ser Gly Pro Gln Asp
225 230 235 240
Thr Asp Cys Phe Ala Cys Arg His Phe Asn Asp Ser Gly Ala Cys Val
245 250 255
Pro Arg Cys Pro Gln Pro Leu Val Tyr Asn Lys Leu Thr Phe Gln Leu
260 265 270
Glu Pro Asn Pro His Thr Lys Tyr Gln Tyr Gly Gly Val Cys Val Ala
275 280 285
Ser Cys Pro His Asn Phe Val Val Asp Gln Thr Ser Cys Val Arg Ala
290 295 300
Cys Pro Pro Asp Lys Met Glu Val Asp Lys Asn Gly Leu Lys Met Cys
305 310 315 320
Glu Pro Cys Gly Gly Leu Cys Pro Lys Ala Cys Glu Gly Thr Gly Ser
325 330 335
Gly Ser Arg Phe Gln Thr Val Asp Ser Ser Asn Ile Asp Gly Phe Val
340 345 350
Asn Cys Thr Lys Ile Leu Gly Asn Leu Asp Phe Leu Ile Thr Gly Leu
355 360 365
Asn Gly Asp Pro Trp His Lys Ile Pro Ala Leu Asp Pro Glu Lys Leu
370 375 380
Asn Val Phe Arg Thr Val Arg Glu Ile Thr Gly Tyr Leu Asn Ile Gln
385 390 395 400
Ser Trp Pro Pro His Met His Asn Phe Ser Val Phe Ser Asn Leu Thr
405 410 415
Thr Ile Gly Gly Arg Ser Leu Tyr Asn Arg Gly Phe Ser Leu Leu Ile
420 425 430
Met Lys Asn Leu Asn Val Thr Ser Leu Gly Phe Arg Ser Leu Lys Glu
435 440 445
Ile Ser Ala Gly Arg Ile Tyr Ile Ser Ala Asn Arg Gln Leu Cys Tyr
450 455 460
His His Ser Leu Asn Trp Thr Lys Val Leu Arg Gly Pro Thr Glu Glu
465 470 475 480
Arg Leu Asp Ile Lys His Asn Arg Pro Arg Arg Asp Cys Val Ala Glu
485 490 495
Gly Lys Val Cys Asp Pro Leu Cys Ser Ser Gly Gly Cys Trp Gly Pro
500 505 510
Gly Pro Gly Gln Cys Leu Ser Cys Arg Asn Tyr Ser Arg Gly Gly Val
515 520 525
Cys Val Thr His Cys Asn Phe Leu Asn Gly Glu Pro Arg Glu Phe Ala
530 535 540
His Glu Ala Glu Cys Phe Ser Cys His Pro Glu Cys Gln Pro Met Gly
545 550 555 560
Gly Thr Ala Thr Cys Asn Gly Ser Gly Ser Asp Thr Cys Ala Gln Cys
565 570 575
Ala His Phe Arg Asp Gly Pro His Cys Val Ser Ser Cys Pro His Gly
580 585 590
Val Leu Gly Ala Lys Gly Pro Ile Tyr Lys Tyr Pro Asp Val Gln Asn
595 600 605
Glu Cys Arg Pro Cys His Glu Asn Cys Thr Gln Gly Cys Lys Gly Pro
610 615 620
Glu Leu Gln Asp Cys Leu Gly Gln Thr Leu Val Leu Ile Gly Lys Thr
625 630 635 640
His Leu Thr Met Ala Leu Thr Val Ile Ala Gly Leu Val Val Ile Phe
645 650 655
Met Met Leu Gly Gly Thr Phe Leu Tyr Trp Arg Gly Arg Arg Ile Gln
660 665 670
Asn Lys Arg Ala Met Arg Arg Tyr Leu Glu Arg Gly Glu Ser Ile Glu
675 680 685
Pro Leu Asp Pro Ser Glu Lys Ala Asn Lys Val Leu Ala Arg Ile Phe
690 695 700
Lys Glu Thr Glu Leu Arg Lys Leu Lys Val Leu Gly Ser Gly Val Phe
705 710 715 720
Gly Thr Val His Lys Gly Val Trp Ile Pro Glu Gly Glu Ser Ile Lys
725 730 735
Ile Pro Val Cys Ile Lys Val Ile Glu Asp Lys Ser Gly Arg Gln Ser
740 745 750
Phe Gln Ala Val Thr Asp His Met Leu Ala Ile Gly Ser Leu Asp His
755 760 765
Ala His Ile Val Arg Leu Leu Gly Leu Cys Pro Gly Ser Ser Leu Gln
770 775 780
Leu Val Thr Gln Tyr Leu Pro Leu Gly Ser Leu Leu Asp His Val Arg
785 790 795 800
Gln His Arg Gly Ala Leu Gly Pro Gln Leu Leu Leu Asn Trp Gly Val
805 810 815
Gln Ile Ala Lys Gly Met Tyr Tyr Leu Glu Glu His Gly Met Val His
820 825 830
Arg Asn Leu Ala Ala Arg Asn Val Leu Leu Lys Ser Pro Ser Gln Val
835 840 845
Gln Val Ala Asp Phe Gly Val Ala Asp Leu Leu Pro Pro Asp Asp Lys
850 855 860
Gln Leu Leu Tyr Ser Glu Ala Lys Thr Pro Ile Lys Trp Met Ala Leu
865 870 875 880
Glu Ser Ile His Phe Gly Lys Tyr Thr His Gln Ser Asp Val Trp Ser
885 890 895
Tyr Gly Val Thr Val Trp Glu Leu Met Thr Phe Gly Ala Glu Pro Tyr
900 905 910
Ala Gly Leu Arg Leu Ala Glu Val Pro Asp Leu Leu Glu Lys Gly Glu
915 920 925
Arg Leu Ala Gln Pro Gln Ile Cys Thr Ile Asp Val Tyr Met Val Met
930 935 940
Val Lys Cys Trp Met Ile Asp Glu Asn Ile Arg Pro Thr Phe Lys Glu
945 950 955 960
Leu Ala Asn Glu Phe Thr Arg Met Ala Arg Asp Pro Pro Arg Tyr Leu
965 970 975
Val Ile Lys Arg Glu Ser Gly Pro Gly Ile Ala Pro Gly Pro Glu Pro
980 985 990
His Gly Leu Thr Asn Lys Lys Leu Glu Glu Val Glu Leu Glu Pro Glu
995 1000 1005
Leu Asp Leu Asp Leu Asp Leu Glu Ala Glu Glu Asp Asn Leu Ala
1010 1015 1020
Thr Thr Thr Leu Gly Ser Ala Leu Ser Leu Pro Val Gly Thr Leu
1025 1030 1035
Asn Arg Pro Arg Gly Ser Gln Ser Leu Leu Ser Pro Ser Ser Gly
1040 1045 1050
Tyr Met Pro Met Asn Gln Gly Asn Leu Gly Gly Ser Cys Gln Glu
1055 1060 1065
Ser Ala Val Ser Gly Ser Ser Glu Arg Cys Pro Arg Pro Val Ser
1070 1075 1080
Leu His Pro Met Pro Arg Gly Cys Leu Ala Ser Glu Ser Ser Glu
1085 1090 1095
Gly His Val Thr Gly Ser Glu Ala Glu Leu Gln Glu Lys Val Ser
1100 1105 1110
Met Cys Arg Ser Arg Ser Arg Ser Arg Ser Pro Arg Pro Arg Gly
1115 1120 1125
Asp Ser Ala Tyr His Ser Gln Arg His Ser Leu Leu Thr Pro Val
1130 1135 1140
Thr Pro Leu Ser Pro Pro Gly Leu Glu Glu Glu Asp Val Asn Gly
1145 1150 1155
Tyr Val Met Pro Asp Thr His Leu Lys Gly Thr Pro Ser Ser Arg
1160 1165 1170
Glu Gly Thr Leu Ser Ser Val Gly Leu Ser Ser Val Leu Gly Thr
1175 1180 1185
Glu Glu Glu Asp Glu Asp Glu Glu Tyr Glu Tyr Met Asn Arg Arg
1190 1195 1200
Arg Arg His Ser Pro Pro His Pro Pro Arg Pro Ser Ser Leu Glu
1205 1210 1215
Glu Leu Gly Tyr Glu Tyr Met Asp Val Gly Ser Asp Leu Ser Ala
1220 1225 1230
Ser Leu Gly Ser Thr Gln Ser Cys Pro Leu His Pro Val Pro Ile
1235 1240 1245
Met Pro Thr Ala Gly Thr Thr Pro Asp Glu Asp Tyr Glu Tyr Met
1250 1255 1260
Asn Arg Gln Arg Asp Gly Gly Gly Pro Gly Gly Asp Tyr Ala Ala
1265 1270 1275
Met Gly Ala Cys Pro Ala Ser Glu Gln Gly Tyr Glu Glu Met Arg
1280 1285 1290
Ala Phe Gln Gly Pro Gly His Gln Ala Pro His Val His Tyr Ala
1295 1300 1305
Arg Leu Lys Thr Leu Arg Ser Leu Glu Ala Thr Asp Ser Ala Phe
1310 1315 1320
Asp Asn Pro Asp Tyr Trp His Ser Arg Leu Phe Pro Lys Ala Asn
1325 1330 1335
Ala Gln Arg Thr
1340
<210> 194
<211> 4541
<212> DNA
<213> 智人
<400> 194
ggatcctcta gggtcccagc tcgcctcgat ggagctcctc ccgccgctgc ctcagtcctt 60
cctgttgctg ctgctgttgc ctgccaagcc cgcggcgggc gaggactggc agtgcccgcg 120
caccccctac gcggcctctc gcgactttga cgtgaagtac gtggtgccca gcttctccgc 180
cggaggcctg gtacaggcca tggtgaccta cgagggcgac agaaatgaga gtgctgtgtt 240
tgtagccata cgcaatcgcc tgcatgtgct tgggcctgac ctgaagtctg tccagagcct 300
ggccacgggc cctgctggag accctggctg ccagacgtgt gcagcctgtg gcccaggacc 360
ccacggccct cccggtgaca cagacacaaa ggtgctggtg ctggatcccg cgctgcctgc 420
gctggtcagt tgtggctcca gcctgcaggg ccgctgcttc ctgcatgacc tagagcccca 480
agggacagcc gtgcatctgg cagcgccagc ctgcctcttc tcagcccacc ataaccggcc 540
cgatgactgc cccgactgtg tggccagccc attgggcacc cgtgtaactg tggttgagca 600
aggccaggcc tcctatttct acgtggcatc ctcactggac gcagccgtgg ctggcagctt 660
cagcccacgc tcagtgtcta tcaggcgtct caaggctgac gcctcgggat tcgcaccggg 720
ctttgtggcg ttgtcagtgc tgcccaagca tcttgtctcc tacagtattg aatacgtgca 780
cagcttccac acgggagcct tcgtatactt cctgactgta cagccggcca gcgtgacaga 840
tgatcctagt gccctgcaca cacgcctggc acggcttagc gccactgagc cagagttggg 900
tgactatcgg gagctggtcc tcgactgcag atttgctcca aaacgcaggc gccggggggc 960
cccagaaggc ggacagccct accctgtgct gcaggtggcc cactccgctc cagtgggtgc 1020
ccaacttgcc actgagctga gcatcgccga gggccaggaa gtactatttg gggtctttgt 1080
gactggcaag gatggtggtc ctggcgtggg ccccaactct gtcgtctgtg ccttccccat 1140
tgacctgctg gacacactaa ttgatgaggg tgtggagcgc tgttgtgaat ccccagtcca 1200
tccaggcctc cggcgaggcc tcgacttctt ccagtcgccc agtttttgcc ccaacccgcc 1260
tggcctggaa gccctcagcc ccaacaccag ctgccgccac ttccctctgc tggtcagtag 1320
cagcttctca cgtgtggacc tattcaatgg gctgttggga ccagtacagg tcactgcatt 1380
gtatgtgaca cgccttgaca acgtcacagt ggcacacatg ggcacaatgg atgggcgtat 1440
cctgcaggtg gagctggtca ggtcactaaa ctacttgctg tatgtgtcca acttctcact 1500
gggtgacagt gggcagcccg tgcagcggga tgtcagtcgt cttggggacc acctactctt 1560
tgcctctggg gaccaggttt tccaggtacc tatccgaggc cctggctgcc gccacttcct 1620
gacctgtggg cgttgcctaa gggcatggca tttcatgggc tgtggctggt gtgggaacat 1680
gtgcggccag cagaaggagt gtcctggctc ctggcaacag gaccactgcc cacctaagct 1740
tactgagttc cacccccaca gtggacctct aaggggcagt acaaggctga ccctgtgtgg 1800
ctccaacttc taccttcacc cttctggtct ggtgcctgag ggaacccatc aggtcactgt 1860
gggccaaagt ccctgccggc cactgcccaa ggacagctca aaactcagac cagtgccccg 1920
gaaagacttt gtagaggagt ttgagtgtga actggagccc ttgggcaccc aggcagtggg 1980
gcctaccaac gtcagcctca ccgtgactaa catgccaccg ggcaagcact tccgggtaga 2040
cggcacctcc gtgctgagag gcttctcttt catggagcca gtgctgatag cagtgcaacc 2100
cctctttggc ccacgggcag gaggcacctg tctcactctt gaaggccaga gtctgtctgt 2160
aggcaccagc cgggctgtgc tggtcaatgg gactgagtgt ctgctagcac gggtcagtga 2220
ggggcagctt ttatgtgcca caccccctgg ggccacggtg gccagtgtcc cccttagcct 2280
gcaggtgggg ggtgcccagg tacctggttc ctggaccttc cagtacagag aagaccctgt 2340
cgtgctaagc atcagcccca actgtggcta catcaactcc cacatcacca tctgtggcca 2400
gcatctaact tcagcatggc acttagtgct gtcattccat gacgggctta gggcagtgga 2460
aagcaggtgt gagaggcagc ttccagagca gcagctgtgc cgccttcctg aatatgtggt 2520
ccgagacccc cagggatggg tggcagggaa tctgagtgcc cgaggggatg gagctgctgg 2580
ctttacactg cctggctttc gcttcctacc cccaccccat ccacccagtg ccaacctagt 2640
tccactgaag cctgaggagc atgccattaa gtttgagtat attgggctgg gcgctgtggc 2700
tgactgtgtg ggtatcaacg tgaccgtggg tggtgagagc tgccagcacg agttccgggg 2760
ggacatggtt gtctgccccc tgcccccatc cctgcagctt ggccaggatg gtgccccatt 2820
gcaggtctgc gtagatggtg aatgtcatat cctgggtaga gtggtgcggc cagggccaga 2880
tggggtccca cagagcacgc tccttggtat cctgctgcct ttgctgctgc ttgtggctgc 2940
actggcgact gcactggtct tcagctactg gtggcggagg aagcagctag ttcttcctcc 3000
caacctgaat gacctggcat ccctggacca gactgctgga gccacacccc tgcctattct 3060
gtactcgggc tctgactaca gaagtggcct tgcactccct gccattgatg gtctggattc 3120
caccacttgt gtccatggag catccttctc cgatagtgaa gatgaatcct gtgtgccact 3180
gctgcggaaa gagtccatcc agctaaggga cctggactct gcgctcttgg ctgaggtcaa 3240
ggatgtgctg attccccatg agcgggtggt cacccacagt gaccgagtca ttggcaaagg 3300
ccactttgga gttgtctacc acggagaata catagaccag gcccagaatc gaatccaatg 3360
tgccatcaag tcactaagtc gcatcacaga gatgcagcag gtggaggcct tcctgcgaga 3420
ggggctgctc atgcgtggcc tgaaccaccc gaatgtgctg gctctcattg gtatcatgtt 3480
gccacctgag ggcctgcccc atgtgctgct gccctatatg tgccacggtg acctgctcca 3540
gttcatccgc tcacctcagc ggaaccccac cgtgaaggac ctcatcagct ttggcctgca 3600
ggtagcccgc ggcatggagt acctggcaga gcagaagttt gtgcacaggg acctggctgc 3660
gcggaactgc atgctggacg agtcattcac agtcaaggtg gctgactttg gtttggcccg 3720
cgacatcctg gacagggagt actatagtgt tcaacagcat cgccacgctc gcctacctgt 3780
gaagtggatg gcgctggaga gcctgcagac ctatagattt accaccaagt ctgatgtgtg 3840
gtcatttggt gtgctgctgt gggaactgct gacacggggt gccccaccat accgccacat 3900
tgaccctttt gaccttaccc acttcctggc ccagggtcgg cgcctgcccc agcctgagta 3960
ttgccctgat tctctgtacc aagtgatgca gcaatgctgg gaggcagacc cagcagtgcg 4020
acccaccttc agagtactag tgggggaggt ggagcagata gtgtctgcac tgcttgggga 4080
ccattatgtg cagctgccag caacctacat gaacttgggc cccagcacct cgcatgagat 4140
gaatgtgcgt ccagaacagc cgcagttctc acccatgcca gggaatgtac gccggccccg 4200
gccactctca gagcctcctc ggcccacttg acttagttct tgggctggac ctgcttagct 4260
gccttgagct aaccccaagg ctgcctctgg gccatgccag gccagagcag tggccctcca 4320
ccttgttcct gccctttaac tttcagaggc aataggtaaa tgggcccatt aggtccctca 4380
ctccacagag tgagccagtg agggcagtcc tgcaacatgt atttatggag tgcctgctgt 4440
ggaccctgtc ttctgggcac agtggactca gcagtgacca caccaacact gacccttgaa 4500
ccaataaagg aacaaatgac tattaaagca caaaaaaaaa a 4541
<210> 195
<211> 661
<212> PRT
<213> 智人
<400> 195
Met Glu Leu Leu Pro Pro Leu Pro Gln Ser Phe Leu Leu Leu Leu Leu
1 5 10 15
Leu Pro Ala Lys Pro Ala Ala Gly Glu Asp Trp Gln Cys Pro Arg Thr
20 25 30
Pro Tyr Ala Ala Ser Arg Asp Phe Asp Val Lys Tyr Val Val Pro Ser
35 40 45
Phe Ser Ala Gly Gly Leu Val Gln Ala Met Val Thr Tyr Glu Gly Asp
50 55 60
Arg Asn Glu Ser Ala Val Phe Val Ala Ile Arg Asn Arg Leu His Val
65 70 75 80
Leu Gly Pro Asp Leu Lys Ser Val Gln Ser Leu Ala Thr Gly Pro Ala
85 90 95
Gly Asp Pro Gly Cys Gln Thr Cys Ala Ala Cys Gly Pro Gly Pro His
100 105 110
Gly Pro Pro Gly Asp Thr Asp Thr Lys Val Leu Val Leu Asp Pro Ala
115 120 125
Leu Pro Ala Leu Val Ser Cys Gly Ser Ser Leu Gln Gly Arg Cys Phe
130 135 140
Leu His Asp Leu Glu Pro Gln Gly Thr Ala Val His Leu Ala Ala Pro
145 150 155 160
Ala Cys Leu Phe Ser Ala His His Asn Arg Pro Asp Asp Cys Pro Asp
165 170 175
Cys Val Ala Ser Pro Leu Gly Thr Arg Val Thr Val Val Glu Gln Gly
180 185 190
Gln Ala Ser Tyr Phe Tyr Val Ala Ser Ser Leu Asp Ala Ala Val Ala
195 200 205
Gly Ser Phe Ser Pro Arg Ser Val Ser Ile Arg Arg Leu Lys Ala Asp
210 215 220
Ala Ser Gly Phe Ala Pro Gly Phe Val Ala Leu Ser Val Leu Pro Lys
225 230 235 240
His Leu Val Ser Tyr Ser Ile Glu Tyr Val His Ser Phe His Thr Gly
245 250 255
Ala Phe Val Tyr Phe Leu Thr Val Gln Pro Ala Ser Val Thr Asp Asp
260 265 270
Pro Ser Ala Leu His Thr Arg Leu Ala Arg Leu Ser Ala Thr Glu Pro
275 280 285
Glu Leu Gly Asp Tyr Arg Glu Leu Val Leu Asp Cys Arg Phe Ala Pro
290 295 300
Lys Arg Arg Arg Arg Gly Ala Pro Glu Gly Gly Gln Pro Tyr Pro Val
305 310 315 320
Leu Gln Val Ala His Ser Ala Pro Val Gly Ala Gln Leu Ala Thr Glu
325 330 335
Leu Ser Ile Ala Glu Gly Gln Glu Val Leu Phe Gly Val Phe Val Thr
340 345 350
Gly Lys Asp Gly Gly Pro Gly Val Gly Pro Asn Ser Val Val Cys Ala
355 360 365
Phe Pro Ile Asp Leu Leu Asp Thr Leu Ile Asp Glu Gly Val Glu Arg
370 375 380
Cys Cys Glu Ser Pro Val His Pro Gly Leu Arg Arg Gly Leu Asp Phe
385 390 395 400
Phe Gln Ser Pro Ser Phe Cys Pro Asn Pro Pro Gly Leu Glu Ala Leu
405 410 415
Ser Pro Asn Thr Ser Cys Arg His Phe Pro Leu Leu Val Ser Ser Ser
420 425 430
Phe Ser Arg Val Asp Leu Phe Asn Gly Leu Leu Gly Pro Val Gln Val
435 440 445
Thr Ala Leu Tyr Val Thr Arg Leu Asp Asn Val Thr Val Ala His Met
450 455 460
Gly Thr Met Asp Gly Arg Ile Leu Gln Val Glu Leu Val Arg Ser Leu
465 470 475 480
Asn Tyr Leu Leu Tyr Val Ser Asn Phe Ser Leu Gly Asp Ser Gly Gln
485 490 495
Pro Val Gln Arg Asp Val Ser Arg Leu Gly Asp His Leu Leu Phe Ala
500 505 510
Ser Gly Asp Gln Val Phe Gln Val Pro Ile Arg Gly Pro Gly Cys Arg
515 520 525
His Phe Leu Thr Cys Gly Arg Cys Leu Arg Ala Trp His Phe Met Gly
530 535 540
Cys Gly Trp Cys Gly Asn Met Cys Gly Gln Gln Lys Glu Cys Pro Gly
545 550 555 560
Ser Trp Gln Gln Asp His Cys Pro Pro Lys Leu Thr Glu Phe His Pro
565 570 575
His Ser Gly Pro Leu Arg Gly Ser Thr Arg Leu Thr Leu Cys Gly Ser
580 585 590
Asn Phe Tyr Leu His Pro Ser Gly Leu Val Pro Glu Gly Thr His Gln
595 600 605
Val Thr Val Gly Gln Ser Pro Cys Arg Pro Leu Pro Lys Asp Ser Ser
610 615 620
Lys Leu Arg Pro Val Pro Arg Lys Asp Phe Val Glu Glu Phe Glu Cys
625 630 635 640
Glu Leu Glu Pro Leu Gly Thr Gln Ala Val Gly Pro Thr Asn Val Ser
645 650 655
Leu Thr Val Thr Asn
660
<210> 196
<211> 3935
<212> DNA
<213> 智人
<400> 196
aggggcagaa gttgcgcgca ggccggcggg cgggagcgga caccgaggcc ggcgtgcagg 60
cgtgcgggtg tgcgggagcc gggctcgggg ggatcggacc gagagcgaga agcgcggcat 120
ggagctccag gcagcccgcg cctgcttcgc cctgctgtgg ggctgtgcgc tggccgcggc 180
cgcggcggcg cagggcaagg aagtggtact gctggacttt gctgcagctg gaggggagct 240
cggctggctc acacacccgt atggcaaagg gtgggacctg atgcagaaca tcatgaatga 300
catgccgatc tacatgtact ccgtgtgcaa cgtgatgtct ggcgaccagg acaactggct 360
ccgcaccaac tgggtgtacc gaggagaggc tgagcgtatc ttcattgagc tcaagtttac 420
tgtacgtgac tgcaacagct tccctggtgg cgccagctcc tgcaaggaga ctttcaacct 480
ctactatgcc gagtcggacc tggactacgg caccaacttc cagaagcgcc tgttcaccaa 540
gattgacacc attgcgcccg atgagatcac cgtcagcagc gacttcgagg cacgccacgt 600
gaagctgaac gtggaggagc gctccgtggg gccgctcacc cgcaaaggct tctacctggc 660
cttccaggat atcggtgcct gtgtggcgct gctctccgtc cgtgtctact acaagaagtg 720
ccccgagctg ctgcagggcc tggcccactt ccctgagacc atcgccggct ctgatgcacc 780
ttccctggcc actgtggccg gcacctgtgt ggaccatgcc gtggtgccac cggggggtga 840
agagccccgt atgcactgtg cagtggatgg cgagtggctg gtgcccattg ggcagtgcct 900
gtgccaggca ggctacgaga aggtggagga tgcctgccag gcctgctcgc ctggattttt 960
taagtttgag gcatctgaga gcccctgctt ggagtgccct gagcacacgc tgccatcccc 1020
tgagggtgcc acctcctgcg agtgtgagga aggcttcttc cgggcacctc aggacccagc 1080
gtcgatgcct tgcacacgac ccccctccgc cccacactac ctcacagccg tgggcatggg 1140
tgccaaggtg gagctgcgct ggacgccccc tcaggacagc gggggccgcg aggacattgt 1200
ctacagcgtc acctgcgaac agtgctggcc cgagtctggg gaatgcgggc cgtgtgaggc 1260
cagtgtgcgc tactcggagc ctcctcacgg actgacccgc accagtgtga cagtgagcga 1320
cctggagccc cacatgaact acaccttcac cgtggaggcc cgcaatggcg tctcaggcct 1380
ggtaaccagc cgcagcttcc gtactgccag tgtcagcatc aaccagacag agccccccaa 1440
ggtgaggctg gagggccgca gcaccacctc gcttagcgtc tcctggagca tccccccgcc 1500
gcagcagagc cgagtgtgga agtacgaggt cacttaccgc aagaagggag actccaacag 1560
ctacaatgtg cgccgcaccg agggtttctc cgtgaccctg gacgacctgg ccccagacac 1620
cacctacctg gtccaggtgc aggcactgac gcaggagggc cagggggccg gcagcaaggt 1680
gcacgaattc cagacgctgt ccccggaggg atctggcaac ttggcggtga ttggcggcgt 1740
ggctgtcggt gtggtcctgc ttctggtgct ggcaggagtt ggcttcttta tccaccgcag 1800
gaggaagaac cagcgtgccc gccagtcccc ggaggacgtt tacttctcca agtcagaaca 1860
actgaagccc ctgaagacat acgtggaccc ccacacatat gaggacccca accaggctgt 1920
gttgaagttc actaccgaga tccatccatc ctgtgtcact cggcagaagg tgatcggagc 1980
aggagagttt ggggaggtgt acaagggcat gctgaagaca tcctcgggga agaaggaggt 2040
gccggtggcc atcaagacgc tgaaagccgg ctacacagag aagcagcgag tggacttcct 2100
cggcgaggcc ggcatcatgg gccagttcag ccaccacaac atcatccgcc tagagggcgt 2160
catctccaaa tacaagccca tgatgatcat cactgagtac atggagaatg gggccctgga 2220
caagttcctt cgggagaagg atggcgagtt cagcgtgctg cagctggtgg gcatgctgcg 2280
gggcatcgca gctggcatga agtacctggc caacatgaac tatgtgcacc gtgacctggc 2340
tgcccgcaac atcctcgtca acagcaacct ggtctgcaag gtgtctgact ttggcctgtc 2400
ccgcgtgctg gaggacgacc ccgaggccac ctacaccacc agtggcggca agatccccat 2460
ccgctggacc gccccggagg ccatttccta ccggaagttc acctctgcca gcgacgtgtg 2520
gagctttggc attgtcatgt gggaggtgat gacctatggc gagcggccct actgggagtt 2580
gtccaaccac gaggtgatga aagccatcaa tgatggcttc cggctcccca cacccatgga 2640
ctgcccctcc gccatctacc agctcatgat gcagtgctgg cagcaggagc gtgcccgccg 2700
ccccaagttc gctgacatcg tcagcatcct ggacaagctc attcgtgccc ctgactccct 2760
caagaccctg gctgactttg acccccgcgt gtctatccgg ctccccagca cgagcggctc 2820
ggagggggtg cccttccgca cggtgtccga gtggctggag tccatcaaga tgcagcagta 2880
tacggagcac ttcatggcgg ccggctacac tgccatcgag aaggtggtgc agatgaccaa 2940
cgacgacatc aagaggattg gggtgcggct gcccggccac cagaagcgca tcgcctacag 3000
cctgctggga ctcaaggacc aggtgaacac tgtggggatc cccatctgag cctcgacagg 3060
gcctggagcc ccatcggcca agaatacttg aagaaacaga gtggcctccc tgctgtgcca 3120
tgctgggcca ctggggactt tatttatttc tagttctttc ctccccctgc aacttccgct 3180
gaggggtctc ggatgacacc ctggcctgaa ctgaggagat gaccagggat gctgggctgg 3240
gccctctttc cctgcgagac gcacacagct gagcacttag caggcaccgc cacgtcccag 3300
catccctgga gcaggagccc cgccacagcc ttcggacaga catataggat attcccaagc 3360
cgaccttccc tccgccttct cccacatgag gccatctcag gagatggagg gcttggccca 3420
gcgccaagta aacagggtac ctcaagcccc atttcctcac actaagaggg cagactgtga 3480
acttgactgg gtgagaccca aagcggtccc tgtccctcta gtgccttctt tagaccctcg 3540
ggccccatcc tcatccctga ctggccaaac ccttgctttc ctgggccttt gcaagatgct 3600
tggttgtgtt gaggttttta aatatatatt ttgtactttg tggagaaaat gtgtgtgtgt 3660
ggcagggggc cccgccaggg ctggggacag agggtgtcaa acattcgtga gctggggact 3720
cagggaccgg tgctgcagga gtgtcctgcc catgccccag tcggccccat ctctcatcct 3780
tttggataag tttctattct gtcagtgtta aagattttgt tttgttggac atttttttcg 3840
aatcttaatt tattattttt tttatattta ttgttagaaa atgacttatt tctgctctgg 3900
aataaagttg cagatgattc aaaaaaaaaa aaaaa 3935
<210> 197
<211> 976
<212> PRT
<213> 智人
<400> 197
Met Glu Leu Gln Ala Ala Arg Ala Cys Phe Ala Leu Leu Trp Gly Cys
1 5 10 15
Ala Leu Ala Ala Ala Ala Ala Ala Gln Gly Lys Glu Val Val Leu Leu
20 25 30
Asp Phe Ala Ala Ala Gly Gly Glu Leu Gly Trp Leu Thr His Pro Tyr
35 40 45
Gly Lys Gly Trp Asp Leu Met Gln Asn Ile Met Asn Asp Met Pro Ile
50 55 60
Tyr Met Tyr Ser Val Cys Asn Val Met Ser Gly Asp Gln Asp Asn Trp
65 70 75 80
Leu Arg Thr Asn Trp Val Tyr Arg Gly Glu Ala Glu Arg Ile Phe Ile
85 90 95
Glu Leu Lys Phe Thr Val Arg Asp Cys Asn Ser Phe Pro Gly Gly Ala
100 105 110
Ser Ser Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Ala Glu Ser Asp Leu
115 120 125
Asp Tyr Gly Thr Asn Phe Gln Lys Arg Leu Phe Thr Lys Ile Asp Thr
130 135 140
Ile Ala Pro Asp Glu Ile Thr Val Ser Ser Asp Phe Glu Ala Arg His
145 150 155 160
Val Lys Leu Asn Val Glu Glu Arg Ser Val Gly Pro Leu Thr Arg Lys
165 170 175
Gly Phe Tyr Leu Ala Phe Gln Asp Ile Gly Ala Cys Val Ala Leu Leu
180 185 190
Ser Val Arg Val Tyr Tyr Lys Lys Cys Pro Glu Leu Leu Gln Gly Leu
195 200 205
Ala His Phe Pro Glu Thr Ile Ala Gly Ser Asp Ala Pro Ser Leu Ala
210 215 220
Thr Val Ala Gly Thr Cys Val Asp His Ala Val Val Pro Pro Gly Gly
225 230 235 240
Glu Glu Pro Arg Met His Cys Ala Val Asp Gly Glu Trp Leu Val Pro
245 250 255
Ile Gly Gln Cys Leu Cys Gln Ala Gly Tyr Glu Lys Val Glu Asp Ala
260 265 270
Cys Gln Ala Cys Ser Pro Gly Phe Phe Lys Phe Glu Ala Ser Glu Ser
275 280 285
Pro Cys Leu Glu Cys Pro Glu His Thr Leu Pro Ser Pro Glu Gly Ala
290 295 300
Thr Ser Cys Glu Cys Glu Glu Gly Phe Phe Arg Ala Pro Gln Asp Pro
305 310 315 320
Ala Ser Met Pro Cys Thr Arg Pro Pro Ser Ala Pro His Tyr Leu Thr
325 330 335
Ala Val Gly Met Gly Ala Lys Val Glu Leu Arg Trp Thr Pro Pro Gln
340 345 350
Asp Ser Gly Gly Arg Glu Asp Ile Val Tyr Ser Val Thr Cys Glu Gln
355 360 365
Cys Trp Pro Glu Ser Gly Glu Cys Gly Pro Cys Glu Ala Ser Val Arg
370 375 380
Tyr Ser Glu Pro Pro His Gly Leu Thr Arg Thr Ser Val Thr Val Ser
385 390 395 400
Asp Leu Glu Pro His Met Asn Tyr Thr Phe Thr Val Glu Ala Arg Asn
405 410 415
Gly Val Ser Gly Leu Val Thr Ser Arg Ser Phe Arg Thr Ala Ser Val
420 425 430
Ser Ile Asn Gln Thr Glu Pro Pro Lys Val Arg Leu Glu Gly Arg Ser
435 440 445
Thr Thr Ser Leu Ser Val Ser Trp Ser Ile Pro Pro Pro Gln Gln Ser
450 455 460
Arg Val Trp Lys Tyr Glu Val Thr Tyr Arg Lys Lys Gly Asp Ser Asn
465 470 475 480
Ser Tyr Asn Val Arg Arg Thr Glu Gly Phe Ser Val Thr Leu Asp Asp
485 490 495
Leu Ala Pro Asp Thr Thr Tyr Leu Val Gln Val Gln Ala Leu Thr Gln
500 505 510
Glu Gly Gln Gly Ala Gly Ser Lys Val His Glu Phe Gln Thr Leu Ser
515 520 525
Pro Glu Gly Ser Gly Asn Leu Ala Val Ile Gly Gly Val Ala Val Gly
530 535 540
Val Val Leu Leu Leu Val Leu Ala Gly Val Gly Phe Phe Ile His Arg
545 550 555 560
Arg Arg Lys Asn Gln Arg Ala Arg Gln Ser Pro Glu Asp Val Tyr Phe
565 570 575
Ser Lys Ser Glu Gln Leu Lys Pro Leu Lys Thr Tyr Val Asp Pro His
580 585 590
Thr Tyr Glu Asp Pro Asn Gln Ala Val Leu Lys Phe Thr Thr Glu Ile
595 600 605
His Pro Ser Cys Val Thr Arg Gln Lys Val Ile Gly Ala Gly Glu Phe
610 615 620
Gly Glu Val Tyr Lys Gly Met Leu Lys Thr Ser Ser Gly Lys Lys Glu
625 630 635 640
Val Pro Val Ala Ile Lys Thr Leu Lys Ala Gly Tyr Thr Glu Lys Gln
645 650 655
Arg Val Asp Phe Leu Gly Glu Ala Gly Ile Met Gly Gln Phe Ser His
660 665 670
His Asn Ile Ile Arg Leu Glu Gly Val Ile Ser Lys Tyr Lys Pro Met
675 680 685
Met Ile Ile Thr Glu Tyr Met Glu Asn Gly Ala Leu Asp Lys Phe Leu
690 695 700
Arg Glu Lys Asp Gly Glu Phe Ser Val Leu Gln Leu Val Gly Met Leu
705 710 715 720
Arg Gly Ile Ala Ala Gly Met Lys Tyr Leu Ala Asn Met Asn Tyr Val
725 730 735
His Arg Asp Leu Ala Ala Arg Asn Ile Leu Val Asn Ser Asn Leu Val
740 745 750
Cys Lys Val Ser Asp Phe Gly Leu Ser Arg Val Leu Glu Asp Asp Pro
755 760 765
Glu Ala Thr Tyr Thr Thr Ser Gly Gly Lys Ile Pro Ile Arg Trp Thr
770 775 780
Ala Pro Glu Ala Ile Ser Tyr Arg Lys Phe Thr Ser Ala Ser Asp Val
785 790 795 800
Trp Ser Phe Gly Ile Val Met Trp Glu Val Met Thr Tyr Gly Glu Arg
805 810 815
Pro Tyr Trp Glu Leu Ser Asn His Glu Val Met Lys Ala Ile Asn Asp
820 825 830
Gly Phe Arg Leu Pro Thr Pro Met Asp Cys Pro Ser Ala Ile Tyr Gln
835 840 845
Leu Met Met Gln Cys Trp Gln Gln Glu Arg Ala Arg Arg Pro Lys Phe
850 855 860
Ala Asp Ile Val Ser Ile Leu Asp Lys Leu Ile Arg Ala Pro Asp Ser
865 870 875 880
Leu Lys Thr Leu Ala Asp Phe Asp Pro Arg Val Ser Ile Arg Leu Pro
885 890 895
Ser Thr Ser Gly Ser Glu Gly Val Pro Phe Arg Thr Val Ser Glu Trp
900 905 910
Leu Glu Ser Ile Lys Met Gln Gln Tyr Thr Glu His Phe Met Ala Ala
915 920 925
Gly Tyr Thr Ala Ile Glu Lys Val Val Gln Met Thr Asn Asp Asp Ile
930 935 940
Lys Arg Ile Gly Val Arg Leu Pro Gly His Gln Lys Arg Ile Ala Tyr
945 950 955 960
Ser Leu Leu Gly Leu Lys Asp Gln Val Asn Thr Val Gly Ile Pro Ile
965 970 975
<210> 198
<211> 1146
<212> DNA
<213> 智人
<400> 198
cctcaatgac actcatggag gaaatgctga gagaagcatt cagatgcatg acacaaggta 60
agactgccaa aaatcttgtt cttgctctcc tcattttgtt atttgtttta tttttaggag 120
ttttgagagc aaaatgacaa cacccagaaa ttcagtaaat gggactttcc cggcagagcc 180
aatgaaaggc cctattgcta tgcaatctgg tccaaaacca ctcttcagga ggatgtcttc 240
actggtgggc cccacgcaaa gcttcttcat gagggaatct aagactttgg gggctgtcca 300
gattatgaat gggctcttcc acattgccct ggggggtctt ctgatgatcc cagcagggat 360
ctatgcaccc atctgtgtga ctgtgtggta ccctctctgg ggaggcatta tgtatattat 420
ttccggatca ctcttggcag caacggagaa aaactctagg aagtgtttgg tcaaaggaaa 480
aatgataatg aattcattga gcctctttgc tgccatttct ggaatgattc tttcaatcat 540
ggacatactt aatattaaaa tttcccattt tttaaaaatg gagagtctga attttattag 600
agctcacaca ccatatatta acatatacaa ctgtgaacca gctaatccct ctgagaaaaa 660
ctccccatct acccaatact gttacagcat acaatctctg ttcttgggca ttttgtcagt 720
gatgctgatc tttgccttct tccaggaact tgtaatagct ggcatcgttg agaatgaatg 780
gaaaagaacg tgctccagac ccaaatctaa catagttctc ctgtcagcag aagaaaaaaa 840
agaacagact attgaaataa aagaagaagt ggttgggcta actgaaacat cttcccaacc 900
aaagaatgaa gaagacattg aaattattcc aatccaagaa gaggaagaag aagaaacaga 960
gacgaacttt ccagaacctc cccaagatca ggaatcctca ccaatagaaa atgacagctc 1020
tccttaagtg atttcttctg ttttctgttt ccttttttaa acattagtgt tcatagcttc 1080
caagagacat gctgactttc atttcttgag gtactctgca catacgcacc acatctctat 1140
ctggcc 1146
<210> 199
<211> 297
<212> PRT
<213> 智人
<400> 199
Met Thr Thr Pro Arg Asn Ser Val Asn Gly Thr Phe Pro Ala Glu Pro
1 5 10 15
Met Lys Gly Pro Ile Ala Met Gln Ser Gly Pro Lys Pro Leu Phe Arg
20 25 30
Arg Met Ser Ser Leu Val Gly Pro Thr Gln Ser Phe Phe Met Arg Glu
35 40 45
Ser Lys Thr Leu Gly Ala Val Gln Ile Met Asn Gly Leu Phe His Ile
50 55 60
Ala Leu Gly Gly Leu Leu Met Ile Pro Ala Gly Ile Tyr Ala Pro Ile
65 70 75 80
Cys Val Thr Val Trp Tyr Pro Leu Trp Gly Gly Ile Met Tyr Ile Ile
85 90 95
Ser Gly Ser Leu Leu Ala Ala Thr Glu Lys Asn Ser Arg Lys Cys Leu
100 105 110
Val Lys Gly Lys Met Ile Met Asn Ser Leu Ser Leu Phe Ala Ala Ile
115 120 125
Ser Gly Met Ile Leu Ser Ile Met Asp Ile Leu Asn Ile Lys Ile Ser
130 135 140
His Phe Leu Lys Met Glu Ser Leu Asn Phe Ile Arg Ala His Thr Pro
145 150 155 160
Tyr Ile Asn Ile Tyr Asn Cys Glu Pro Ala Asn Pro Ser Glu Lys Asn
165 170 175
Ser Pro Ser Thr Gln Tyr Cys Tyr Ser Ile Gln Ser Leu Phe Leu Gly
180 185 190
Ile Leu Ser Val Met Leu Ile Phe Ala Phe Phe Gln Glu Leu Val Ile
195 200 205
Ala Gly Ile Val Glu Asn Glu Trp Lys Arg Thr Cys Ser Arg Pro Lys
210 215 220
Ser Asn Ile Val Leu Leu Ser Ala Glu Glu Lys Lys Glu Gln Thr Ile
225 230 235 240
Glu Ile Lys Glu Glu Val Val Gly Leu Thr Glu Thr Ser Ser Gln Pro
245 250 255
Lys Asn Glu Glu Asp Ile Glu Ile Ile Pro Ile Gln Glu Glu Glu Glu
260 265 270
Glu Glu Thr Glu Thr Asn Phe Pro Glu Pro Pro Gln Asp Gln Glu Ser
275 280 285
Ser Pro Ile Glu Asn Asp Ser Ser Pro
290 295
<210> 200
<211> 8605
<212> DNA
<213> 智人
<400> 200
aattcgccaa ctgaaaaagt gggaaaggat gtctggaggc gaggcgtccc attacagagg 60
aaggagctcg ctatataagc cagccaaagt tggctgcacc ggccacagcc tgcctactgt 120
cacccgcctc tcccgcgcgc agatacacgc ccccgcctcc gtgggcacaa aggcagcgct 180
gctggggaac tcgggggaac gcgcacgtgg gaaccgccgc agctccacac tccaggtact 240
tcttccaagg acctaggtct ctcgcccatc ggaaagaaaa taattctttc aagaagatca 300
gggacaactg atttgaagtc tactctgtgc ttctaaatcc ccaattctgc tgaaagtgag 360
ataccctaga gccctagagc cccagcagca cccagccaaa cccacctcca ccatgggggc 420
catgactcag ctgttggcag gtgtctttct tgctttcctt gccctcgcta ccgaaggtgg 480
ggtcctcaag aaagtcatcc ggcacaagcg acagagtggg gtgaacgcca ccctgccaga 540
agagaaccag ccagtggtgt ttaaccacgt ttacaacatc aagctgccag tgggatccca 600
gtgttcggtg gatctggagt cagccagtgg ggagaaagac ctggcaccgc cttcagagcc 660
cagcgaaagc tttcaggagc acacagtgga tggggaaaac cagattgtct tcacacatcg 720
catcaacatc ccccgccggg cctgtggctg tgccgcagcc cctgatgtta aggagctgct 780
gagcagactg gaggagctgg agaacctggt gtcttccctg agggagcaat gtactgcagg 840
agcaggctgc tgtctccagc ctgccacagg ccgcttggac accaggccct tctgtagcgg 900
tcggggcaac ttcagcactg aaggatgtgg ctgtgtctgc gaacctggct ggaaaggccc 960
caactgctct gagcccgaat gtccaggcaa ctgtcacctt cgaggccggt gcattgatgg 1020
gcagtgcatc tgtgacgacg gcttcacggg cgaggactgc agccagctgg cttgccccag 1080
cgactgcaat gaccagggca agtgcgtaaa tggagtctgc atctgtttcg aaggctacgc 1140
cggggctgac tgcagccgtg aaatctgccc agtgccctgc agtgaggagc acggcacatg 1200
tgtagatggc ttgtgtgtgt gccacgatgg ctttgcaggc gatgactgca acaagcctct 1260
gtgtctcaac aattgctaca accgtggacg atgcgtggag aatgagtgcg tgtgtgatga 1320
gggtttcacg ggcgaagact gcagtgagct catctgcccc aatgactgct tcgaccgggg 1380
ccgctgcatc aatggcacct gctactgcga agaaggcttc acaggtgaag actgcgggaa 1440
acccacctgc ccacatgcct gccacaccca gggccggtgt gaggaggggc agtgtgtatg 1500
tgatgagggc tttgccggtg tggactgcag cgagaagagg tgtcctgctg actgtcacaa 1560
tcgtggccgc tgtgtagacg ggcggtgtga gtgtgatgat ggtttcactg gagctgactg 1620
tggggagctc aagtgtccca atggctgcag tggccatggc cgctgtgtca atgggcagtg 1680
tgtgtgtgat gagggctata ctggggagga ctgcagccag ctacggtgcc ccaatgactg 1740
tcacagtcgg ggccgctgtg tcgagggcaa atgtgtatgt gagcaaggct tcaagggcta 1800
tgactgcagt gacatgagct gccctaatga ctgtcaccag cacggccgct gtgtgaatgg 1860
catgtgtgtt tgtgatgacg gctacacagg ggaagactgc cgggatcgcc aatgccccag 1920
ggactgcagc aacaggggcc tctgtgtgga cggacagtgc gtctgtgagg acggcttcac 1980
cggccctgac tgtgcagaac tctcctgtcc aaatgactgc catggccagg gtcgctgtgt 2040
gaatgggcag tgcgtgtgcc atgaaggatt tatgggcaaa gactgcaagg agcaaagatg 2100
tcccagtgac tgtcatggcc agggccgctg cgtggacggc cagtgcatct gccacgaggg 2160
cttcacaggc ctggactgtg gccagcactc ctgccccagt gactgcaaca acttaggaca 2220
atgcgtctcg ggccgctgca tctgcaacga gggctacagc ggagaagact gctcagaggt 2280
gtctcctccc aaagacctcg ttgtgacaga agtgacggaa gagacggtca acctggcctg 2340
ggacaatgag atgcgggtca cagagtacct tgtcgtgtac acgcccaccc acgagggtgg 2400
tctggaaatg cagttccgtg tgcctgggga ccagacgtcc accatcatcc aggagctgga 2460
gcctggtgtg gagtacttta tccgtgtatt tgccatcctg gagaacaaga agagcattcc 2520
tgtcagcgcc agggtggcca cgtacttacc tgcacctgaa ggcctgaaat tcaagtccat 2580
caaggagaca tctgtggaag tggagtggga tcctctagac attgcttttg aaacctggga 2640
gatcatcttc cggaatatga ataaagaaga tgagggagag atcaccaaaa gcctgaggag 2700
gccagagacc tcttaccggc aaactggtct agctcctggg caagagtatg agatatctct 2760
gcacatagtg aaaaacaata cccggggccc tggcctgaag agggtgacca ccacacgctt 2820
ggatgccccc agccagatcg aggtgaaaga tgtcacagac accactgcct tgatcacctg 2880
gttcaagccc ctggctgaga tcgatggcat tgagctgacc tacggcatca aagacgtgcc 2940
aggagaccgt accaccatcg atctcacaga ggacgagaac cagtactcca tcgggaacct 3000
gaagcctgac actgagtacg aggtgtccct catctcccgc agaggtgaca tgtcaagcaa 3060
cccagccaaa gagaccttca caacaggcct cgatgctccc aggaatcttc gacgtgtttc 3120
ccagacagat aacagcatca ccctggaatg gaggaatggc aaggcagcta ttgacagtta 3180
cagaattaag tatgccccca tctctggagg ggaccacgct gaggttgatg ttccaaagag 3240
ccaacaagcc acaaccaaaa ccacactcac aggtctgagg ccgggaactg aatatgggat 3300
tggagtttct gctgtgaagg aagacaagga gagcaatcca gcgaccatca acgcagccac 3360
agagttggac acgcccaagg accttcaggt ttctgaaact gcagagacca gcctgaccct 3420
gctctggaag acaccgttgg ccaaatttga ccgctaccgc ctcaattaca gtctccccac 3480
aggccagtgg gtgggagtgc agcttccaag aaacaccact tcctatgtcc tgagaggcct 3540
ggaaccagga caggagtaca atgtcctcct gacagccgag aaaggcagac acaagagcaa 3600
gcccgcacgt gtgaaggcat ccactgaaca agcccctgag ctggaaaacc tcaccgtgac 3660
tgaggttggc tgggatggcc tcagactcaa ctggaccgca gctgaccagg cctatgagca 3720
ctttatcatt caggtgcagg aggccaacaa ggtggaggca gctcggaacc tcaccgtgcc 3780
tggcagcctt cgggctgtgg acataccggg cctcaaggct gctacgcctt atacagtctc 3840
catctatggg gtgatccagg gctatagaac accagtgctc tctgctgagg cctccacagg 3900
ggaaactccc aatttgggag aggtcgtggt ggccgaggtg ggctgggatg ccctcaaact 3960
caactggact gctccagaag gggcctatga gtactttttc attcaggtgc aggaggctga 4020
cacagtagag gcagcccaga acctcaccgt cccaggagga ctgaggtcca cagacctgcc 4080
tgggctcaaa gcagccactc attataccat caccatccgc ggggtcactc aggacttcag 4140
cacaacccct ctctctgttg aagtcttgac agaggaggtt ccagatatgg gaaacctcac 4200
agtgaccgag gttagctggg atgctctcag actgaactgg accacgccag atggaaccta 4260
tgaccagttt actattcagg tccaggaggc tgaccaggtg gaagaggctc acaatctcac 4320
ggttcctggc agcctgcgtt ccatggaaat cccaggcctc agggctggca ctccttacac 4380
agtcaccctg cacggcgagg tcaggggcca cagcactcga ccccttgctg tagaggtcgt 4440
cacagaggat ctcccacagc tgggagattt agccgtgtct gaggttggct gggatggcct 4500
cagactcaac tggaccgcag ctgacaatgc ctatgagcac tttgtcattc aggtgcagga 4560
ggtcaacaaa gtggaggcag cccagaacct cacgttgcct ggcagcctca gggctgtgga 4620
catcccgggc ctcgaggctg ccacgcctta tagagtctcc atctatgggg tgatccgggg 4680
ctatagaaca ccagtactct ctgctgaggc ctccacagcc aaagaacctg aaattggaaa 4740
cttaaatgtt tctgacataa ctcccgagag cttcaatctc tcctggatgg ctaccgatgg 4800
gatcttcgag acctttacca ttgaaattat tgattccaat aggttgctgg agactgtgga 4860
atataatatc tctggtgctg aacgaactgc ccatatctca gggctacccc ctagtactga 4920
ttttattgtc tacctctctg gacttgctcc cagcatccgg accaaaacca tcagtgccac 4980
agccacgaca gaggccctgc cccttctgga aaacctaacc atttccgaca ttaatcccta 5040
cgggttcaca gtttcctgga tggcatcgga gaatgccttt gacagctttc tagtaacggt 5100
ggtggattct gggaagctgc tggaccccca ggaattcaca ctttcaggaa cccagaggaa 5160
gctggagctt agaggcctca taactggcat tggctatgag gttatggtct ctggcttcac 5220
ccaagggcat caaaccaagc ccttgagggc tgagattgtt acagaagccg aaccggaagt 5280
tgacaacctt ctggtttcag atgccacccc agacggtttc cgtctgtcct ggacagctga 5340
tgaaggggtc ttcgacaatt ttgttctcaa aatcagagat accaaaaagc agtctgagcc 5400
actggaaata accctacttg cccccgaacg taccagggac ataacaggtc tcagagaggc 5460
tactgaatac gaaattgaac tctatggaat aagcaaagga aggcgatccc agacagtcag 5520
tgctatagca acaacagcca tgggctcccc aaaggaagtc attttctcag acatcactga 5580
aaattcggct actgtcagct ggagggcacc cacagcccaa gtggagagct tccggattac 5640
ctatgtgccc attacaggag gtacaccctc catggtaact gtggacggaa ccaagactca 5700
gaccaggctg gtgaaactca tacctggcgt ggagtacctt gtcagcatca tcgccatgaa 5760
gggctttgag gaaagtgaac ctgtctcagg gtcattcacc acagctctgg atggcccatc 5820
tggcctggtg acagccaaca tcactgactc agaagccttg gccaggtggc agccagccat 5880
tgccactgtg gacagttatg tcatctccta cacaggcgag aaagtgccag aaattacacg 5940
cacggtgtcc gggaacacag tggagtatgc tctgaccgac ctcgagcctg ccacggaata 6000
cacactgaga atctttgcag agaaagggcc ccagaagagc tcaaccatca ctgccaagtt 6060
cacaacagac ctcgattctc caagagactt gactgctact gaggttcagt cggaaactgc 6120
cctccttacc tggcgacccc cccgggcatc agtcaccggt tacctgctgg tctatgaatc 6180
agtggatggc acagtcaagg aagtcattgt gggtccagat accacctcct acagcctggc 6240
agacctgagc ccatccaccc actacacagc caagatccag gcactcaatg ggcccctgag 6300
gagcaatatg atccagacca tcttcaccac aattggactc ctgtacccct tccccaagga 6360
ctgctcccaa gcaatgctga atggagacac gacctctggc ctctacacca tttatctgaa 6420
tggtgataag gctgaggcgc tggaagtctt ctgtgacatg acctctgatg ggggtggatg 6480
gattgtgttc ctgagacgca aaaacggacg cgagaacttc taccaaaact ggaaggcata 6540
tgctgctgga tttggggacc gcagagaaga attctggctt gggctggaca acctgaacaa 6600
aatcacagcc caggggcagt acgagctccg ggtggacctg cgggaccatg gggagacagc 6660
ctttgctgtc tatgacaagt tcagcgtggg agatgccaag actcgctaca agctgaaggt 6720
ggaggggtac agtgggacag caggtgactc catggcctac cacaatggca gatccttctc 6780
cacctttgac aaggacacag attcagccat caccaactgt gctctgtcct acaaaggggc 6840
tttctggtac aggaactgtc accgtgtcaa cctgatgggg agatatgggg acaataacca 6900
cagtcagggc gttaactggt tccactggaa gggccacgaa cactcaatcc agtttgctga 6960
gatgaagctg agaccaagca acttcagaaa tcttgaaggc aggcgcaaac gggcataaat 7020
tccagggacc actgggtgag agaggaataa ggcccagagc gaggaaagga ttttaccaaa 7080
gcatcaatac aaccagccca accatcggtc cacacctggg catttggtga gagtcaaagc 7140
tgaccatgga tccctggggc caacggcaac agcatgggcc tcacctcctc tgtgatttct 7200
ttctttgcac caaagacatc agtctccaac atgtttctgt tttgttgttt gattcagcaa 7260
aaatctccca gtgacaacat cgcaatagtt ttttacttct cttaggtggc tctgggaatg 7320
ggagaggggt aggatgtaca ggggtagttt gttttagaac cagccgtatt ttacatgaag 7380
ctgtataatt aattgtcatt atttttgtta gcaaagatta aatgtgtcat tggaagccat 7440
cccttttttt acatttcata caacagaaac cagaaaagca atactgtttc cattttaagg 7500
atatgattaa tattattaat ataataatga tgatgatgat gatgaaaact aaggattttt 7560
caagagatct ttctttccaa aacatttctg gacagtacct gattgtattt tttttttaaa 7620
taaaagcaca agtacttttg agtttgttat tttgctttga attgttgagt ctgaatttca 7680
ccaaagccaa tcatttgaac aaagcgggga atgttgggat aggaaaggta agtagggata 7740
gtggtcaagt gggaggggtg gaaaggagac taaagactgg gagagaggga agcacttttt 7800
ttaaataaag ttgaacacac ttgggaaaag cttacaggcc aggcctgtaa tcccaacact 7860
ttgggaggcc aaggtgggag gatagcttaa ccccaggagt ttgagaccag cctgagcaac 7920
atagtgagaa cttgtctcta cagaaaaaaa aaaaaaaaaa aatttaatta ggcaagcgtg 7980
gtagtgcgca cctgtcgtcc cagctactca ggaggctgag gtaggaaaat cactggagcc 8040
caggagttag aggttacagt gagctatgat cacactactg cactccagcc tgggcaacag 8100
agggagaccc tgtctctaaa taaaaaaaga aaagaaaaaa aaagcttaca acttgagatt 8160
cagcatcttg ctcagtattt ccaagactaa tagattatgg tttaaaagat gcttttatac 8220
tcattttcta atgcaactcc tagaaactct atgatatagt tgaggtaagt attgttacca 8280
cacatgggct aagatcccca gaggcagact gcctgagttc aattcttggc tccaccattc 8340
ccaagttccc taacctctct atgcctcagt ttcctcttct gtaaagtagg gacactcata 8400
cttctcattt cagaacattt ttgtgaagaa taaattatgt tatccatttg aggcccttag 8460
aatggtaccc ggtgtatatt aagtgctagt acatgttagc tatcatcatt atcactttat 8520
atgagatgga ctggggttca tagaaaccca atgacttgat tgtggctact actcaataaa 8580
taatagaatt tggatttaaa aaaaa 8605
<210> 201
<211> 2201
<212> PRT
<213> 智人
<400> 201
Met Gly Ala Met Thr Gln Leu Leu Ala Gly Val Phe Leu Ala Phe Leu
1 5 10 15
Ala Leu Ala Thr Glu Gly Gly Val Leu Lys Lys Val Ile Arg His Lys
20 25 30
Arg Gln Ser Gly Val Asn Ala Thr Leu Pro Glu Glu Asn Gln Pro Val
35 40 45
Val Phe Asn His Val Tyr Asn Ile Lys Leu Pro Val Gly Ser Gln Cys
50 55 60
Ser Val Asp Leu Glu Ser Ala Ser Gly Glu Lys Asp Leu Ala Pro Pro
65 70 75 80
Ser Glu Pro Ser Glu Ser Phe Gln Glu His Thr Val Asp Gly Glu Asn
85 90 95
Gln Ile Val Phe Thr His Arg Ile Asn Ile Pro Arg Arg Ala Cys Gly
100 105 110
Cys Ala Ala Ala Pro Asp Val Lys Glu Leu Leu Ser Arg Leu Glu Glu
115 120 125
Leu Glu Asn Leu Val Ser Ser Leu Arg Glu Gln Cys Thr Ala Gly Ala
130 135 140
Gly Cys Cys Leu Gln Pro Ala Thr Gly Arg Leu Asp Thr Arg Pro Phe
145 150 155 160
Cys Ser Gly Arg Gly Asn Phe Ser Thr Glu Gly Cys Gly Cys Val Cys
165 170 175
Glu Pro Gly Trp Lys Gly Pro Asn Cys Ser Glu Pro Glu Cys Pro Gly
180 185 190
Asn Cys His Leu Arg Gly Arg Cys Ile Asp Gly Gln Cys Ile Cys Asp
195 200 205
Asp Gly Phe Thr Gly Glu Asp Cys Ser Gln Leu Ala Cys Pro Ser Asp
210 215 220
Cys Asn Asp Gln Gly Lys Cys Val Asn Gly Val Cys Ile Cys Phe Glu
225 230 235 240
Gly Tyr Ala Gly Ala Asp Cys Ser Arg Glu Ile Cys Pro Val Pro Cys
245 250 255
Ser Glu Glu His Gly Thr Cys Val Asp Gly Leu Cys Val Cys His Asp
260 265 270
Gly Phe Ala Gly Asp Asp Cys Asn Lys Pro Leu Cys Leu Asn Asn Cys
275 280 285
Tyr Asn Arg Gly Arg Cys Val Glu Asn Glu Cys Val Cys Asp Glu Gly
290 295 300
Phe Thr Gly Glu Asp Cys Ser Glu Leu Ile Cys Pro Asn Asp Cys Phe
305 310 315 320
Asp Arg Gly Arg Cys Ile Asn Gly Thr Cys Tyr Cys Glu Glu Gly Phe
325 330 335
Thr Gly Glu Asp Cys Gly Lys Pro Thr Cys Pro His Ala Cys His Thr
340 345 350
Gln Gly Arg Cys Glu Glu Gly Gln Cys Val Cys Asp Glu Gly Phe Ala
355 360 365
Gly Val Asp Cys Ser Glu Lys Arg Cys Pro Ala Asp Cys His Asn Arg
370 375 380
Gly Arg Cys Val Asp Gly Arg Cys Glu Cys Asp Asp Gly Phe Thr Gly
385 390 395 400
Ala Asp Cys Gly Glu Leu Lys Cys Pro Asn Gly Cys Ser Gly His Gly
405 410 415
Arg Cys Val Asn Gly Gln Cys Val Cys Asp Glu Gly Tyr Thr Gly Glu
420 425 430
Asp Cys Ser Gln Leu Arg Cys Pro Asn Asp Cys His Ser Arg Gly Arg
435 440 445
Cys Val Glu Gly Lys Cys Val Cys Glu Gln Gly Phe Lys Gly Tyr Asp
450 455 460
Cys Ser Asp Met Ser Cys Pro Asn Asp Cys His Gln His Gly Arg Cys
465 470 475 480
Val Asn Gly Met Cys Val Cys Asp Asp Gly Tyr Thr Gly Glu Asp Cys
485 490 495
Arg Asp Arg Gln Cys Pro Arg Asp Cys Ser Asn Arg Gly Leu Cys Val
500 505 510
Asp Gly Gln Cys Val Cys Glu Asp Gly Phe Thr Gly Pro Asp Cys Ala
515 520 525
Glu Leu Ser Cys Pro Asn Asp Cys His Gly Gln Gly Arg Cys Val Asn
530 535 540
Gly Gln Cys Val Cys His Glu Gly Phe Met Gly Lys Asp Cys Lys Glu
545 550 555 560
Gln Arg Cys Pro Ser Asp Cys His Gly Gln Gly Arg Cys Val Asp Gly
565 570 575
Gln Cys Ile Cys His Glu Gly Phe Thr Gly Leu Asp Cys Gly Gln His
580 585 590
Ser Cys Pro Ser Asp Cys Asn Asn Leu Gly Gln Cys Val Ser Gly Arg
595 600 605
Cys Ile Cys Asn Glu Gly Tyr Ser Gly Glu Asp Cys Ser Glu Val Ser
610 615 620
Pro Pro Lys Asp Leu Val Val Thr Glu Val Thr Glu Glu Thr Val Asn
625 630 635 640
Leu Ala Trp Asp Asn Glu Met Arg Val Thr Glu Tyr Leu Val Val Tyr
645 650 655
Thr Pro Thr His Glu Gly Gly Leu Glu Met Gln Phe Arg Val Pro Gly
660 665 670
Asp Gln Thr Ser Thr Ile Ile Gln Glu Leu Glu Pro Gly Val Glu Tyr
675 680 685
Phe Ile Arg Val Phe Ala Ile Leu Glu Asn Lys Lys Ser Ile Pro Val
690 695 700
Ser Ala Arg Val Ala Thr Tyr Leu Pro Ala Pro Glu Gly Leu Lys Phe
705 710 715 720
Lys Ser Ile Lys Glu Thr Ser Val Glu Val Glu Trp Asp Pro Leu Asp
725 730 735
Ile Ala Phe Glu Thr Trp Glu Ile Ile Phe Arg Asn Met Asn Lys Glu
740 745 750
Asp Glu Gly Glu Ile Thr Lys Ser Leu Arg Arg Pro Glu Thr Ser Tyr
755 760 765
Arg Gln Thr Gly Leu Ala Pro Gly Gln Glu Tyr Glu Ile Ser Leu His
770 775 780
Ile Val Lys Asn Asn Thr Arg Gly Pro Gly Leu Lys Arg Val Thr Thr
785 790 795 800
Thr Arg Leu Asp Ala Pro Ser Gln Ile Glu Val Lys Asp Val Thr Asp
805 810 815
Thr Thr Ala Leu Ile Thr Trp Phe Lys Pro Leu Ala Glu Ile Asp Gly
820 825 830
Ile Glu Leu Thr Tyr Gly Ile Lys Asp Val Pro Gly Asp Arg Thr Thr
835 840 845
Ile Asp Leu Thr Glu Asp Glu Asn Gln Tyr Ser Ile Gly Asn Leu Lys
850 855 860
Pro Asp Thr Glu Tyr Glu Val Ser Leu Ile Ser Arg Arg Gly Asp Met
865 870 875 880
Ser Ser Asn Pro Ala Lys Glu Thr Phe Thr Thr Gly Leu Asp Ala Pro
885 890 895
Arg Asn Leu Arg Arg Val Ser Gln Thr Asp Asn Ser Ile Thr Leu Glu
900 905 910
Trp Arg Asn Gly Lys Ala Ala Ile Asp Ser Tyr Arg Ile Lys Tyr Ala
915 920 925
Pro Ile Ser Gly Gly Asp His Ala Glu Val Asp Val Pro Lys Ser Gln
930 935 940
Gln Ala Thr Thr Lys Thr Thr Leu Thr Gly Leu Arg Pro Gly Thr Glu
945 950 955 960
Tyr Gly Ile Gly Val Ser Ala Val Lys Glu Asp Lys Glu Ser Asn Pro
965 970 975
Ala Thr Ile Asn Ala Ala Thr Glu Leu Asp Thr Pro Lys Asp Leu Gln
980 985 990
Val Ser Glu Thr Ala Glu Thr Ser Leu Thr Leu Leu Trp Lys Thr Pro
995 1000 1005
Leu Ala Lys Phe Asp Arg Tyr Arg Leu Asn Tyr Ser Leu Pro Thr
1010 1015 1020
Gly Gln Trp Val Gly Val Gln Leu Pro Arg Asn Thr Thr Ser Tyr
1025 1030 1035
Val Leu Arg Gly Leu Glu Pro Gly Gln Glu Tyr Asn Val Leu Leu
1040 1045 1050
Thr Ala Glu Lys Gly Arg His Lys Ser Lys Pro Ala Arg Val Lys
1055 1060 1065
Ala Ser Thr Glu Gln Ala Pro Glu Leu Glu Asn Leu Thr Val Thr
1070 1075 1080
Glu Val Gly Trp Asp Gly Leu Arg Leu Asn Trp Thr Ala Ala Asp
1085 1090 1095
Gln Ala Tyr Glu His Phe Ile Ile Gln Val Gln Glu Ala Asn Lys
1100 1105 1110
Val Glu Ala Ala Arg Asn Leu Thr Val Pro Gly Ser Leu Arg Ala
1115 1120 1125
Val Asp Ile Pro Gly Leu Lys Ala Ala Thr Pro Tyr Thr Val Ser
1130 1135 1140
Ile Tyr Gly Val Ile Gln Gly Tyr Arg Thr Pro Val Leu Ser Ala
1145 1150 1155
Glu Ala Ser Thr Gly Glu Thr Pro Asn Leu Gly Glu Val Val Val
1160 1165 1170
Ala Glu Val Gly Trp Asp Ala Leu Lys Leu Asn Trp Thr Ala Pro
1175 1180 1185
Glu Gly Ala Tyr Glu Tyr Phe Phe Ile Gln Val Gln Glu Ala Asp
1190 1195 1200
Thr Val Glu Ala Ala Gln Asn Leu Thr Val Pro Gly Gly Leu Arg
1205 1210 1215
Ser Thr Asp Leu Pro Gly Leu Lys Ala Ala Thr His Tyr Thr Ile
1220 1225 1230
Thr Ile Arg Gly Val Thr Gln Asp Phe Ser Thr Thr Pro Leu Ser
1235 1240 1245
Val Glu Val Leu Thr Glu Glu Val Pro Asp Met Gly Asn Leu Thr
1250 1255 1260
Val Thr Glu Val Ser Trp Asp Ala Leu Arg Leu Asn Trp Thr Thr
1265 1270 1275
Pro Asp Gly Thr Tyr Asp Gln Phe Thr Ile Gln Val Gln Glu Ala
1280 1285 1290
Asp Gln Val Glu Glu Ala His Asn Leu Thr Val Pro Gly Ser Leu
1295 1300 1305
Arg Ser Met Glu Ile Pro Gly Leu Arg Ala Gly Thr Pro Tyr Thr
1310 1315 1320
Val Thr Leu His Gly Glu Val Arg Gly His Ser Thr Arg Pro Leu
1325 1330 1335
Ala Val Glu Val Val Thr Glu Asp Leu Pro Gln Leu Gly Asp Leu
1340 1345 1350
Ala Val Ser Glu Val Gly Trp Asp Gly Leu Arg Leu Asn Trp Thr
1355 1360 1365
Ala Ala Asp Asn Ala Tyr Glu His Phe Val Ile Gln Val Gln Glu
1370 1375 1380
Val Asn Lys Val Glu Ala Ala Gln Asn Leu Thr Leu Pro Gly Ser
1385 1390 1395
Leu Arg Ala Val Asp Ile Pro Gly Leu Glu Ala Ala Thr Pro Tyr
1400 1405 1410
Arg Val Ser Ile Tyr Gly Val Ile Arg Gly Tyr Arg Thr Pro Val
1415 1420 1425
Leu Ser Ala Glu Ala Ser Thr Ala Lys Glu Pro Glu Ile Gly Asn
1430 1435 1440
Leu Asn Val Ser Asp Ile Thr Pro Glu Ser Phe Asn Leu Ser Trp
1445 1450 1455
Met Ala Thr Asp Gly Ile Phe Glu Thr Phe Thr Ile Glu Ile Ile
1460 1465 1470
Asp Ser Asn Arg Leu Leu Glu Thr Val Glu Tyr Asn Ile Ser Gly
1475 1480 1485
Ala Glu Arg Thr Ala His Ile Ser Gly Leu Pro Pro Ser Thr Asp
1490 1495 1500
Phe Ile Val Tyr Leu Ser Gly Leu Ala Pro Ser Ile Arg Thr Lys
1505 1510 1515
Thr Ile Ser Ala Thr Ala Thr Thr Glu Ala Leu Pro Leu Leu Glu
1520 1525 1530
Asn Leu Thr Ile Ser Asp Ile Asn Pro Tyr Gly Phe Thr Val Ser
1535 1540 1545
Trp Met Ala Ser Glu Asn Ala Phe Asp Ser Phe Leu Val Thr Val
1550 1555 1560
Val Asp Ser Gly Lys Leu Leu Asp Pro Gln Glu Phe Thr Leu Ser
1565 1570 1575
Gly Thr Gln Arg Lys Leu Glu Leu Arg Gly Leu Ile Thr Gly Ile
1580 1585 1590
Gly Tyr Glu Val Met Val Ser Gly Phe Thr Gln Gly His Gln Thr
1595 1600 1605
Lys Pro Leu Arg Ala Glu Ile Val Thr Glu Ala Glu Pro Glu Val
1610 1615 1620
Asp Asn Leu Leu Val Ser Asp Ala Thr Pro Asp Gly Phe Arg Leu
1625 1630 1635
Ser Trp Thr Ala Asp Glu Gly Val Phe Asp Asn Phe Val Leu Lys
1640 1645 1650
Ile Arg Asp Thr Lys Lys Gln Ser Glu Pro Leu Glu Ile Thr Leu
1655 1660 1665
Leu Ala Pro Glu Arg Thr Arg Asp Ile Thr Gly Leu Arg Glu Ala
1670 1675 1680
Thr Glu Tyr Glu Ile Glu Leu Tyr Gly Ile Ser Lys Gly Arg Arg
1685 1690 1695
Ser Gln Thr Val Ser Ala Ile Ala Thr Thr Ala Met Gly Ser Pro
1700 1705 1710
Lys Glu Val Ile Phe Ser Asp Ile Thr Glu Asn Ser Ala Thr Val
1715 1720 1725
Ser Trp Arg Ala Pro Thr Ala Gln Val Glu Ser Phe Arg Ile Thr
1730 1735 1740
Tyr Val Pro Ile Thr Gly Gly Thr Pro Ser Met Val Thr Val Asp
1745 1750 1755
Gly Thr Lys Thr Gln Thr Arg Leu Val Lys Leu Ile Pro Gly Val
1760 1765 1770
Glu Tyr Leu Val Ser Ile Ile Ala Met Lys Gly Phe Glu Glu Ser
1775 1780 1785
Glu Pro Val Ser Gly Ser Phe Thr Thr Ala Leu Asp Gly Pro Ser
1790 1795 1800
Gly Leu Val Thr Ala Asn Ile Thr Asp Ser Glu Ala Leu Ala Arg
1805 1810 1815
Trp Gln Pro Ala Ile Ala Thr Val Asp Ser Tyr Val Ile Ser Tyr
1820 1825 1830
Thr Gly Glu Lys Val Pro Glu Ile Thr Arg Thr Val Ser Gly Asn
1835 1840 1845
Thr Val Glu Tyr Ala Leu Thr Asp Leu Glu Pro Ala Thr Glu Tyr
1850 1855 1860
Thr Leu Arg Ile Phe Ala Glu Lys Gly Pro Gln Lys Ser Ser Thr
1865 1870 1875
Ile Thr Ala Lys Phe Thr Thr Asp Leu Asp Ser Pro Arg Asp Leu
1880 1885 1890
Thr Ala Thr Glu Val Gln Ser Glu Thr Ala Leu Leu Thr Trp Arg
1895 1900 1905
Pro Pro Arg Ala Ser Val Thr Gly Tyr Leu Leu Val Tyr Glu Ser
1910 1915 1920
Val Asp Gly Thr Val Lys Glu Val Ile Val Gly Pro Asp Thr Thr
1925 1930 1935
Ser Tyr Ser Leu Ala Asp Leu Ser Pro Ser Thr His Tyr Thr Ala
1940 1945 1950
Lys Ile Gln Ala Leu Asn Gly Pro Leu Arg Ser Asn Met Ile Gln
1955 1960 1965
Thr Ile Phe Thr Thr Ile Gly Leu Leu Tyr Pro Phe Pro Lys Asp
1970 1975 1980
Cys Ser Gln Ala Met Leu Asn Gly Asp Thr Thr Ser Gly Leu Tyr
1985 1990 1995
Thr Ile Tyr Leu Asn Gly Asp Lys Ala Glu Ala Leu Glu Val Phe
2000 2005 2010
Cys Asp Met Thr Ser Asp Gly Gly Gly Trp Ile Val Phe Leu Arg
2015 2020 2025
Arg Lys Asn Gly Arg Glu Asn Phe Tyr Gln Asn Trp Lys Ala Tyr
2030 2035 2040
Ala Ala Gly Phe Gly Asp Arg Arg Glu Glu Phe Trp Leu Gly Leu
2045 2050 2055
Asp Asn Leu Asn Lys Ile Thr Ala Gln Gly Gln Tyr Glu Leu Arg
2060 2065 2070
Val Asp Leu Arg Asp His Gly Glu Thr Ala Phe Ala Val Tyr Asp
2075 2080 2085
Lys Phe Ser Val Gly Asp Ala Lys Thr Arg Tyr Lys Leu Lys Val
2090 2095 2100
Glu Gly Tyr Ser Gly Thr Ala Gly Asp Ser Met Ala Tyr His Asn
2105 2110 2115
Gly Arg Ser Phe Ser Thr Phe Asp Lys Asp Thr Asp Ser Ala Ile
2120 2125 2130
Thr Asn Cys Ala Leu Ser Tyr Lys Gly Ala Phe Trp Tyr Arg Asn
2135 2140 2145
Cys His Arg Val Asn Leu Met Gly Arg Tyr Gly Asp Asn Asn His
2150 2155 2160
Ser Gln Gly Val Asn Trp Phe His Trp Lys Gly His Glu His Ser
2165 2170 2175
Ile Gln Phe Ala Glu Met Lys Leu Arg Pro Ser Asn Phe Arg Asn
2180 2185 2190
Leu Glu Gly Arg Arg Lys Arg Ala
2195 2200
<210> 202
<211> 2814
<212> DNA
<213> 智人
<400> 202
aagaacgccc ccaaaatctg tttctaattt tacagaaatc ttttgaaact tggcacggta 60
ttcaaaagtc cgtggaaaga aaaaaacctt gtcctggctt cagcttccaa ctacaaagac 120
agacttggtc cttttcaacg gttttcacag atccagtgac ccacgctctg aagacagaat 180
tagctaactt tcaaaaacat ctggaaaaat gaagacttgg gtaaaaatcg tatttggagt 240
tgccacctct gctgtgcttg ccttattggt gatgtgcatt gtcttacgcc cttcaagagt 300
tcataactct gaagaaaata caatgagagc actcacactg aaggatattt taaatggaac 360
attttcttat aaaacatttt ttccaaactg gatttcagga caagaatatc ttcatcaatc 420
tgcagataac aatatagtac tttataatat tgaaacagga caatcatata ccattttgag 480
taatagaacc atgaaaagtg tgaatgcttc aaattacggc ttatcacctg atcggcaatt 540
tgtatatcta gaaagtgatt attcaaagct ttggagatac tcttacacag caacatatta 600
catctatgac cttagcaatg gagaatttgt aagaggaaat gagcttcctc gtccaattca 660
gtatttatgc tggtcgcctg ttgggagtaa attagcatat gtctatcaaa acaatatcta 720
tttgaaacaa agaccaggag atccaccttt tcaaataaca tttaatggaa gagaaaataa 780
aatatttaat ggaatcccag actgggttta tgaagaggaa atgcttccta caaaatatgc 840
tctctggtgg tctcctaatg gaaaattttt ggcatatgcg gaatttaatg ataaggatat 900
accagttatt gcctattcct attatggcga tgaacaatat cctagaacaa taaatattcc 960
atacccaaag gctggagcta agaatcccgt tgttcggata tttattatcg ataccactta 1020
ccctgcgtat gtaggtcccc aggaagtgcc tgttccagca atgatagcct caagtgatta 1080
ttatttcagt tggctcacgt gggttactga tgaacgagta tgtttgcagt ggctaaaaag 1140
agtccagaat gtttcggtcc tgtctatatg tgacttcagg gaagactggc agacatggga 1200
ttgtccaaag acccaggagc atatagaaga aagcagaact ggatgggctg gtggattctt 1260
tgtttcaaga ccagttttca gctatgatgc catttcgtac tacaaaatat ttagtgacaa 1320
ggatggctac aaacatattc actatatcaa agacactgtg gaaaatgcta ttcaaattac 1380
aagtggcaag tgggaggcca taaatatatt cagagtaaca caggattcac tgttttattc 1440
tagcaatgaa tttgaagaat accctggaag aagaaacatc tacagaatta gcattggaag 1500
ctatcctcca agcaagaagt gtgttacttg ccatctaagg aaagaaaggt gccaatatta 1560
cacagcaagt ttcagcgact acgccaagta ctatgcactt gtctgctacg gcccaggcat 1620
ccccatttcc acccttcatg atggacgcac tgatcaagaa attaaaatcc tggaagaaaa 1680
caaggaattg gaaaatgctt tgaaaaatat ccagctgcct aaagaggaaa ttaagaaact 1740
tgaagtagat gaaattactt tatggtacaa gatgattctt cctcctcaat ttgacagatc 1800
aaagaagtat cccttgctaa ttcaagtgta tggtggtccc tgcagtcaga gtgtaaggtc 1860
tgtatttgct gttaattgga tatcttatct tgcaagtaag gaagggatgg tcattgcctt 1920
ggtggatggt cgaggaacag ctttccaagg tgacaaactc ctctatgcag tgtatcgaaa 1980
gctgggtgtt tatgaagttg aagaccagat tacagctgtc agaaaattca tagaaatggg 2040
tttcattgat gaaaaaagaa tagccatatg gggctggtcc tatggaggat acgtttcatc 2100
actggccctt gcatctggaa ctggtctttt caaatgtggt atagcagtgg ctccagtctc 2160
cagctgggaa tattacgcgt ctgtctacac agagagattc atgggtctcc caacaaagga 2220
tgataatctt gagcactata agaattcaac tgtgatggca agagcagaat atttcagaaa 2280
tgtagactat cttctcatcc acggaacagc agatgataat gtgcactttc aaaactcagc 2340
acagattgct aaagctctgg ttaatgcaca agtggatttc caggcaatgt ggtactctga 2400
ccagaaccac ggcttatccg gcctgtccac gaaccactta tacacccaca tgacccactt 2460
cctaaagcag tgtttctctt tgtcagacta aaaacgatgc agatgcaagc ctgtatcaga 2520
atctgaaaac cttatataaa cccctcagac agtttgctta ttttattttt tatgttgtaa 2580
aatgctagta taaacaaaca aattaatgtt gttctaaagg ctgttaaaaa aaagatgagg 2640
actcagaagt tcaagctaaa tattgtttac attttctggt actctgtgaa agaagagaaa 2700
agggagtcat gcattttgct ttggacacag tgttttatca cctgttcatt tgaagaaaaa 2760
taataaagtc agaagttcaa aaaaaaaaaa aaaaaaaaaa aaagcggccg ctcg 2814
<210> 203
<211> 760
<212> PRT
<213> 智人
<400> 203
Met Lys Thr Trp Val Lys Ile Val Phe Gly Val Ala Thr Ser Ala Val
1 5 10 15
Leu Ala Leu Leu Val Met Cys Ile Val Leu Arg Pro Ser Arg Val His
20 25 30
Asn Ser Glu Glu Asn Thr Met Arg Ala Leu Thr Leu Lys Asp Ile Leu
35 40 45
Asn Gly Thr Phe Ser Tyr Lys Thr Phe Phe Pro Asn Trp Ile Ser Gly
50 55 60
Gln Glu Tyr Leu His Gln Ser Ala Asp Asn Asn Ile Val Leu Tyr Asn
65 70 75 80
Ile Glu Thr Gly Gln Ser Tyr Thr Ile Leu Ser Asn Arg Thr Met Lys
85 90 95
Ser Val Asn Ala Ser Asn Tyr Gly Leu Ser Pro Asp Arg Gln Phe Val
100 105 110
Tyr Leu Glu Ser Asp Tyr Ser Lys Leu Trp Arg Tyr Ser Tyr Thr Ala
115 120 125
Thr Tyr Tyr Ile Tyr Asp Leu Ser Asn Gly Glu Phe Val Arg Gly Asn
130 135 140
Glu Leu Pro Arg Pro Ile Gln Tyr Leu Cys Trp Ser Pro Val Gly Ser
145 150 155 160
Lys Leu Ala Tyr Val Tyr Gln Asn Asn Ile Tyr Leu Lys Gln Arg Pro
165 170 175
Gly Asp Pro Pro Phe Gln Ile Thr Phe Asn Gly Arg Glu Asn Lys Ile
180 185 190
Phe Asn Gly Ile Pro Asp Trp Val Tyr Glu Glu Glu Met Leu Pro Thr
195 200 205
Lys Tyr Ala Leu Trp Trp Ser Pro Asn Gly Lys Phe Leu Ala Tyr Ala
210 215 220
Glu Phe Asn Asp Lys Asp Ile Pro Val Ile Ala Tyr Ser Tyr Tyr Gly
225 230 235 240
Asp Glu Gln Tyr Pro Arg Thr Ile Asn Ile Pro Tyr Pro Lys Ala Gly
245 250 255
Ala Lys Asn Pro Val Val Arg Ile Phe Ile Ile Asp Thr Thr Tyr Pro
260 265 270
Ala Tyr Val Gly Pro Gln Glu Val Pro Val Pro Ala Met Ile Ala Ser
275 280 285
Ser Asp Tyr Tyr Phe Ser Trp Leu Thr Trp Val Thr Asp Glu Arg Val
290 295 300
Cys Leu Gln Trp Leu Lys Arg Val Gln Asn Val Ser Val Leu Ser Ile
305 310 315 320
Cys Asp Phe Arg Glu Asp Trp Gln Thr Trp Asp Cys Pro Lys Thr Gln
325 330 335
Glu His Ile Glu Glu Ser Arg Thr Gly Trp Ala Gly Gly Phe Phe Val
340 345 350
Ser Arg Pro Val Phe Ser Tyr Asp Ala Ile Ser Tyr Tyr Lys Ile Phe
355 360 365
Ser Asp Lys Asp Gly Tyr Lys His Ile His Tyr Ile Lys Asp Thr Val
370 375 380
Glu Asn Ala Ile Gln Ile Thr Ser Gly Lys Trp Glu Ala Ile Asn Ile
385 390 395 400
Phe Arg Val Thr Gln Asp Ser Leu Phe Tyr Ser Ser Asn Glu Phe Glu
405 410 415
Glu Tyr Pro Gly Arg Arg Asn Ile Tyr Arg Ile Ser Ile Gly Ser Tyr
420 425 430
Pro Pro Ser Lys Lys Cys Val Thr Cys His Leu Arg Lys Glu Arg Cys
435 440 445
Gln Tyr Tyr Thr Ala Ser Phe Ser Asp Tyr Ala Lys Tyr Tyr Ala Leu
450 455 460
Val Cys Tyr Gly Pro Gly Ile Pro Ile Ser Thr Leu His Asp Gly Arg
465 470 475 480
Thr Asp Gln Glu Ile Lys Ile Leu Glu Glu Asn Lys Glu Leu Glu Asn
485 490 495
Ala Leu Lys Asn Ile Gln Leu Pro Lys Glu Glu Ile Lys Lys Leu Glu
500 505 510
Val Asp Glu Ile Thr Leu Trp Tyr Lys Met Ile Leu Pro Pro Gln Phe
515 520 525
Asp Arg Ser Lys Lys Tyr Pro Leu Leu Ile Gln Val Tyr Gly Gly Pro
530 535 540
Cys Ser Gln Ser Val Arg Ser Val Phe Ala Val Asn Trp Ile Ser Tyr
545 550 555 560
Leu Ala Ser Lys Glu Gly Met Val Ile Ala Leu Val Asp Gly Arg Gly
565 570 575
Thr Ala Phe Gln Gly Asp Lys Leu Leu Tyr Ala Val Tyr Arg Lys Leu
580 585 590
Gly Val Tyr Glu Val Glu Asp Gln Ile Thr Ala Val Arg Lys Phe Ile
595 600 605
Glu Met Gly Phe Ile Asp Glu Lys Arg Ile Ala Ile Trp Gly Trp Ser
610 615 620
Tyr Gly Gly Tyr Val Ser Ser Leu Ala Leu Ala Ser Gly Thr Gly Leu
625 630 635 640
Phe Lys Cys Gly Ile Ala Val Ala Pro Val Ser Ser Trp Glu Tyr Tyr
645 650 655
Ala Ser Val Tyr Thr Glu Arg Phe Met Gly Leu Pro Thr Lys Asp Asp
660 665 670
Asn Leu Glu His Tyr Lys Asn Ser Thr Val Met Ala Arg Ala Glu Tyr
675 680 685
Phe Arg Asn Val Asp Tyr Leu Leu Ile His Gly Thr Ala Asp Asp Asn
690 695 700
Val His Phe Gln Asn Ser Ala Gln Ile Ala Lys Ala Leu Val Asn Ala
705 710 715 720
Gln Val Asp Phe Gln Ala Met Trp Tyr Ser Asp Gln Asn His Gly Leu
725 730 735
Ser Gly Leu Ser Thr Asn His Leu Tyr Thr His Met Thr His Phe Leu
740 745 750
Lys Gln Cys Phe Ser Leu Ser Asp
755 760
<210> 204
<211> 1815
<212> DNA
<213> 智人
<400> 204
gcagagctct gtgctccctg cagtcaggac tctgggaccg cagggggctc ccggaccctg 60
actctgcagc cgaaccggca cggtttcgtg gggacccagg cttgcaaagt gacggtcatt 120
ttctctttct ttctccctct tgagtccttc tgagatgatg gctctgggcg cagcgggagc 180
tacccgggtc tttgtcgcga tggtagcggc ggctctcggc ggccaccctc tgctgggagt 240
gagcgccacc ttgaactcgg ttctcaattc caacgctatc aagaacctgc ccccaccgct 300
gggcggcgct gcggggcacc caggctctgc agtcagcgcc gcgccgggaa tcctgtaccc 360
gggcgggaat aagtaccaga ccattgacaa ctaccagccg tacccgtgcg cagaggacga 420
ggagtgcggc actgatgagt actgcgctag tcccacccgc ggaggggacg caggcgtgca 480
aatctgtctc gcctgcagga agcgccgaaa acgctgcatg cgtcacgcta tgtgctgccc 540
cgggaattac tgcaaaaatg gaatatgtgt gtcttctgat caaaatcatt tccgaggaga 600
aattgaggaa accatcactg aaagctttgg taatgatcat agcaccttgg atgggtattc 660
cagaagaacc accttgtctt caaaaatgta tcacaccaaa ggacaagaag gttctgtttg 720
tctccggtca tcagactgtg cctcaggatt gtgttgtgct agacacttct ggtccaagat 780
ctgtaaacct gtcctgaaag aaggtcaagt gtgtaccaag cataggagaa aaggctctca 840
tggactagaa atattccagc gttgttactg tggagaaggt ctgtcttgcc ggatacagaa 900
agatcaccat caagccagta attcttctag gcttcacact tgtcagagac actaaaccag 960
ctatccaaat gcagtgaact ccttttatat aatagatgct atgaaaacct tttatgacct 1020
tcatcaactc aatcctaagg atatacaagt tctgtggttt cagttaagca ttccaataac 1080
accttccaaa aacctggagt gtaagagctt tgtttcttta tggaactccc ctgtgattgc 1140
agtaaattac tgtattgtaa attctcagtg tggcacttac ctgtaaatgc aatgaaactt 1200
ttaattattt ttctaaaggt gctgcactgc ctatttttcc tcttgttatg taaatttttg 1260
tacacattga ttgttatctt gactgacaaa tattctatat tgaactgaag taaatcattt 1320
cagcttatag ttcttaaaag cataaccctt taccccattt aattctagag tctagaacgc 1380
aaggatctct tggaatgaca aatgataggt acctaaaatg taacatgaaa atactagctt 1440
attttctgaa atgtactatc ttaatgctta aattatattt ccctttaggc tgtgatagtt 1500
tttgaaataa aatttaacat ttaatatcat gaaatgttat aagtagacat acattttggg 1560
attgtgatct tagaggtttg tgtgtgtgta cgtatgtgtg tgttctacaa gaacggaagt 1620
gtgatatgtt taaagatgat cagagaaaag acagtgtcta aatataagac aatattgatc 1680
agctctagaa taactttaaa gaaagacgtg ttctgcattg ataaactcaa atgatcatgg 1740
cagaatgaga gtgaatctta cattactact ttcaaaaata gtttccaata aattaataat 1800
acctaaaaaa aaaaa 1815
<210> 205
<211> 266
<212> PRT
<213> 智人
<400> 205
Met Met Ala Leu Gly Ala Ala Gly Ala Thr Arg Val Phe Val Ala Met
1 5 10 15
Val Ala Ala Ala Leu Gly Gly His Pro Leu Leu Gly Val Ser Ala Thr
20 25 30
Leu Asn Ser Val Leu Asn Ser Asn Ala Ile Lys Asn Leu Pro Pro Pro
35 40 45
Leu Gly Gly Ala Ala Gly His Pro Gly Ser Ala Val Ser Ala Ala Pro
50 55 60
Gly Ile Leu Tyr Pro Gly Gly Asn Lys Tyr Gln Thr Ile Asp Asn Tyr
65 70 75 80
Gln Pro Tyr Pro Cys Ala Glu Asp Glu Glu Cys Gly Thr Asp Glu Tyr
85 90 95
Cys Ala Ser Pro Thr Arg Gly Gly Asp Ala Gly Val Gln Ile Cys Leu
100 105 110
Ala Cys Arg Lys Arg Arg Lys Arg Cys Met Arg His Ala Met Cys Cys
115 120 125
Pro Gly Asn Tyr Cys Lys Asn Gly Ile Cys Val Ser Ser Asp Gln Asn
130 135 140
His Phe Arg Gly Glu Ile Glu Glu Thr Ile Thr Glu Ser Phe Gly Asn
145 150 155 160
Asp His Ser Thr Leu Asp Gly Tyr Ser Arg Arg Thr Thr Leu Ser Ser
165 170 175
Lys Met Tyr His Thr Lys Gly Gln Glu Gly Ser Val Cys Leu Arg Ser
180 185 190
Ser Asp Cys Ala Ser Gly Leu Cys Cys Ala Arg His Phe Trp Ser Lys
195 200 205
Ile Cys Lys Pro Val Leu Lys Glu Gly Gln Val Cys Thr Lys His Arg
210 215 220
Arg Lys Gly Ser His Gly Leu Glu Ile Phe Gln Arg Cys Tyr Cys Gly
225 230 235 240
Glu Gly Leu Ser Cys Arg Ile Gln Lys Asp His His Gln Ala Ser Asn
245 250 255
Ser Ser Arg Leu His Thr Cys Gln Arg His
260 265
<210> 206
<211> 523
<212> DNA
<213> 智人
<400> 206
ctcctggttc aaaagcagct aaaccaaaag aagcctccag acagccctga gatcacctaa 60
aaagctgcta ccaagacagc cacgaagatc ctaccaaaat gaagcgcttc ctcttcctcc 120
tactcaccat cagcctcctg gttatggtac agatacaaac tggactctca ggacaaaacg 180
acaccagcca aaccagcagc ccctcagcat ccagcaacat aagcggaggc attttccttt 240
tcttcgtggc caatgccata atccacctct tctgcttcag ttgaggtgac acgtctcagc 300
cttagccctg tgccccctga aacagctgcc accatcactc gcaagagaat cccctccatc 360
tttgggaggg gttgatgcca gacatcacca ggttgtagaa gttgacaggc agtgccatgg 420
gggcaacagc caaaataggg gggtaatgat gtaggggcca agcagtgccc agctgggggt 480
caataaagtt acccttgtac ttgcaaaaaa aaaaaaaaaa aaa 523
<210> 207
<211> 61
<212> PRT
<213> 智人
<400> 207
Met Lys Arg Phe Leu Phe Leu Leu Leu Thr Ile Ser Leu Leu Val Met
1 5 10 15
Val Gln Ile Gln Thr Gly Leu Ser Gly Gln Asn Asp Thr Ser Gln Thr
20 25 30
Ser Ser Pro Ser Ala Ser Ser Asn Ile Ser Gly Gly Ile Phe Leu Phe
35 40 45
Phe Val Ala Asn Ala Ile Ile His Leu Phe Cys Phe Ser
50 55 60
<210> 208
<211> 2672
<212> DNA
<213> 智人
<400> 208
cttccagaga gcaatatggc tggttcccca acatgcctca ccctcatcta tatcctttgg 60
cagctcacag ggtcagcagc ctctggaccc gtgaaagagc tggtcggttc cgttggtggg 120
gccgtgactt tccccctgaa gtccaaagta aagcaagttg actctattgt ctggaccttc 180
aacacaaccc ctcttgtcac catacagcca gaagggggca ctatcatagt gacccaaaat 240
cgtaataggg agagagtaga cttcccagat ggaggctact ccctgaagct cagcaaactg 300
aagaagaatg actcagggat ctactatgtg gggatataca gctcatcact ccagcagccc 360
tccacccagg agtacgtgct gcatgtctac gagcacctgt caaagcctaa agtcaccatg 420
ggtctgcaga gcaataagaa tggcacctgt gtgaccaatc tgacatgctg catggaacat 480
ggggaagagg atgtgattta tacctggaag gccctggggc aagcagccaa tgagtcccat 540
aatgggtcca tcctccccat ctcctggaga tggggagaaa gtgatatgac cttcatctgc 600
gttgccagga accctgtcag cagaaacttc tcaagcccca tccttgccag gaagctctgt 660
gaaggtgctg ctgatgaccc agattcctcc atggtcctcc tgtgtctcct gttggtgccc 720
ctcctgctca gtctctttgt actggggcta tttctttggt ttctgaagag agagagacaa 780
gaagagtaca ttgaagagaa gaagagagtg gacatttgtc gggaaactcc taacatatgc 840
ccccattctg gagagaacac agagtacgac acaatccctc acactaatag aacaatccta 900
aaggaagatc cagcaaatac ggtttactcc actgtggaaa taccgaaaaa gatggaaaat 960
ccccactcac tgctcacgat gccagacaca ccaaggctat ttgcctatga gaatgttatc 1020
tagacagcag tgcactcccc taagtctctg ctcaaaaaaa aaacaattct cggcccaaag 1080
aaaacaatca gaagaattca ctgatttgac tagaaacatc aaggaagaat gaagaacgtt 1140
gacttttttc caggataaat tatctctgat gcttctttag atttaagagt tcataattcc 1200
atccactgct gagaaatctc ctcaaaccca gaaggtttaa tcacttcatc ccaaaaatgg 1260
gattgtgaat gtcagcaaac cataaaaaaa gtgcttagaa gtattcctat agaaatgtaa 1320
atgcaaggtc acacatatta atgacagcct gttgtattaa tgatggctcc aggtcagtgt 1380
ctggagtttc attccatccc agggcttgga tgtaaggatt ataccaagag tcttgctacc 1440
aggagggcaa gaagaccaaa acagacagac aagtccagca gaagcagatg cacctgacaa 1500
aaatggatgt attaattggc tctataaact atgtgcccag cactatgctg agcttacact 1560
aattggtcag acgtgctgtc tgccctcatg aaattggctc caaatgaatg aactactttc 1620
atgagcagtt gtagcaggcc tgaccacaga ttcccagagg gccaggtgtg gatccacagg 1680
acttgaaggt caaagttcac aaagatgaag aatcagggta gctgaccatg tttggcagat 1740
actataatgg agacacagaa gtgtgcatgg cccaaggaca aggacctcca gccaggcttc 1800
atttatgcac ttgtgctgca aaagaaaagt ctaggtttta aggctgtgcc agaacccatc 1860
ccaataaaga gaccgagtct gaagtcacat tgtaaatcta gtgtaggaga cttggagtca 1920
ggcagtgaga ctggtggggc acggggggca gtgggtactt gtaaaccttt aaagatggtt 1980
aattcattca atagatattt attaagaacc tatgcggccc ggcatggtgg ctcacacctg 2040
taatcccagc actttgggag gccaaggtgg gtgggtcatc tgaggtcagg agttcaagac 2100
cagcctggcc aacatggtga aaccccatct ctactaaaga tacaaaaatt tgctgagcgt 2160
ggtggtgtgc acctgtaatc ccagctactc gagaggccaa ggcatgagaa tcgcttgaac 2220
ctgggaggtg gaggttgcag tgagctgaga tggcaccact gcactccggc ctaggcaacg 2280
agagcaaaac tccaatacaa acaaacaaac aaacacctgt gctaggtcag tctggcacgt 2340
aagatgaaca tccctaccaa cacagagctc accatctctt atacttaagt gaaaaacatg 2400
gggaagggga aaggggaatg gctgcttttg atatgttccc tgacacatat cttgaatgga 2460
gacctcccta ccaagtgatg aaagtgttga aaaacttaat aacaaatgct tgttgggcaa 2520
gaatgggatt gaggattatc ttctctcaga aaggcattgt gaaggaattg agccagatct 2580
ctctccctac tgcaaaaccc tattgtagta aaaaagtctt ctttactatc ttaataaaac 2640
agatattgtg agattcaaaa aaaaaaaaaa aa 2672
<210> 209
<211> 335
<212> PRT
<213> 智人
<400> 209
Met Ala Gly Ser Pro Thr Cys Leu Thr Leu Ile Tyr Ile Leu Trp Gln
1 5 10 15
Leu Thr Gly Ser Ala Ala Ser Gly Pro Val Lys Glu Leu Val Gly Ser
20 25 30
Val Gly Gly Ala Val Thr Phe Pro Leu Lys Ser Lys Val Lys Gln Val
35 40 45
Asp Ser Ile Val Trp Thr Phe Asn Thr Thr Pro Leu Val Thr Ile Gln
50 55 60
Pro Glu Gly Gly Thr Ile Ile Val Thr Gln Asn Arg Asn Arg Glu Arg
65 70 75 80
Val Asp Phe Pro Asp Gly Gly Tyr Ser Leu Lys Leu Ser Lys Leu Lys
85 90 95
Lys Asn Asp Ser Gly Ile Tyr Tyr Val Gly Ile Tyr Ser Ser Ser Leu
100 105 110
Gln Gln Pro Ser Thr Gln Glu Tyr Val Leu His Val Tyr Glu His Leu
115 120 125
Ser Lys Pro Lys Val Thr Met Gly Leu Gln Ser Asn Lys Asn Gly Thr
130 135 140
Cys Val Thr Asn Leu Thr Cys Cys Met Glu His Gly Glu Glu Asp Val
145 150 155 160
Ile Tyr Thr Trp Lys Ala Leu Gly Gln Ala Ala Asn Glu Ser His Asn
165 170 175
Gly Ser Ile Leu Pro Ile Ser Trp Arg Trp Gly Glu Ser Asp Met Thr
180 185 190
Phe Ile Cys Val Ala Arg Asn Pro Val Ser Arg Asn Phe Ser Ser Pro
195 200 205
Ile Leu Ala Arg Lys Leu Cys Glu Gly Ala Ala Asp Asp Pro Asp Ser
210 215 220
Ser Met Val Leu Leu Cys Leu Leu Leu Val Pro Leu Leu Leu Ser Leu
225 230 235 240
Phe Val Leu Gly Leu Phe Leu Trp Phe Leu Lys Arg Glu Arg Gln Glu
245 250 255
Glu Tyr Ile Glu Glu Lys Lys Arg Val Asp Ile Cys Arg Glu Thr Pro
260 265 270
Asn Ile Cys Pro His Ser Gly Glu Asn Thr Glu Tyr Asp Thr Ile Pro
275 280 285
His Thr Asn Arg Thr Ile Leu Lys Glu Asp Pro Ala Asn Thr Val Tyr
290 295 300
Ser Thr Val Glu Ile Pro Lys Lys Met Glu Asn Pro His Ser Leu Leu
305 310 315 320
Thr Met Pro Asp Thr Pro Arg Leu Phe Ala Tyr Glu Asn Val Ile
325 330 335
<210> 210
<211> 3292
<212> DNA
<213> 智人
<400> 210
ccgcggtggt cccaatctct tccttcctcc atgaggtgtc tggggtcggg gccccagttc 60
ttcctggagt ccatctggag tctctcctac tttctagaga atccattggg tcttgtttac 120
aacgtggatg gggacagact gtgcagcgtg gagggaaggg gagggaggga ggtttgggaa 180
gagcctcctg tggggtcctg tcactgcccc agatgcccca acaccctgtg atacctgcac 240
ccctgccaca tctgtccctc actccaaact cagctcaagg ggatggtggc agggaagaag 300
gtgagcttgg aacccaggca gccctggggt caccacctcc agctggggtt cctcctctgt 360
aaagtggagg tataacggta cccacctcct ggggtggctg tgaggattca gagctgataa 420
ggtgaacgcc tagggcgggc cctggtgcag agagagcgct cagctcctag gctggattaa 480
ctgtccctgg ggcacagatc tcggtctggg gcctgtggaa acctcagagc cacccctgaa 540
cccccaccga gccacctttg cctcgcagtg ccatggcctc gtctccgagt tacaggaaaa 600
ggcagagaat gcccttctca ggtggccctc tgggagagac actctccttg actccaaagc 660
cacgcttggc tgcaaactgg gccagcacca caaggctggg caagcaaaca tcctaatccc 720
accaaaacca caccgacctc caccctgtga cactctgcaa caaacacacg gcctactctt 780
gtcacccggg ccggccaata agcacggaag aggcaaggcc tcagaccctg gacagacatc 840
ctccctccag aggcacccag ggcctcagcc ttctcctccc tccctggcct caatttctcc 900
acctgtgacc cagggcaggt ggatccaggg agaagaacct tctggctcca tctcaccgtg 960
ggtcctgcca gcacacacaa agatttggcc tctcaaagcc tagctctgcc agcgtccttc 1020
tgctcaagaa ctctccatga ctcccagtgg ccctaaggac aaagtcctgg catttgaggc 1080
cctcccaatg cagggccaga ctctgcctct ccagcttcct gtccccacca cacccctgct 1140
gtctcacggt ggtccgactg tttcctgctt ctgttgcttt gcttagtctg gcaccctgct 1200
ggcatgcttt ctcacccttc ttccccaatc ccaactcacc cagtctttca aagggcaggc 1260
taaataccag gcctccaggt ggcccaggat tcttctctga gctttcatgg gcctggccct 1320
gggtgctacc tgtgagtagt cccacggtgg gtacatagta ggtgcgctta ctgtttgcaa 1380
gaatgaacat gggacagttt ggggactgtc acccagctca gggagcactg atggggaagc 1440
atctcctgta tgtcccaggg ctcagtgctg tagtgtcctg accctcagaa atctcataat 1500
ggcttggtca ggaaggcatc gtgccccact ttgcaaacag ggggtgctga gaattgaggg 1560
gccttgtcca aggtctcatg gctaggagca agcagaatcg gatttgaacc cagggccacg 1620
tgacttcaga agtgccatta aagtccccat aatttggagc tgtcttcttt ttttttttct 1680
tttctttttt tttgagaccg agcctcactc tgtcacctag gccaggagtg cagtggtctg 1740
atctcagctc actgcaacct ccgcctccta ggttcaagtg attctctagc ctcagcctcc 1800
caagtagctg ggactacagg cgcacgtcat catgcccagc taacttttgt atttttagta 1860
gagatgggtt ttcaccatgt tggtcaggct ggtctcgaac tcctgacctc aagtgatccg 1920
tctgcctcgg cctctcaaag tgctgggatt ataaggcttg agccactaca ctcggcctgg 1980
agctgtgttt tgtcggtgaa ggattttcca cccatgaagg ggtcagacgt gaagtgtgtg 2040
gccctgggca gctcctctga gcccagagac gccagcccta gccgccttgc tgtgccactt 2100
tgggacttcc ctccctagcc tgagcttcag ttttcctgcc tgttaggcag ccccatgtca 2160
actgcactta gtaggccggg tttgatgccc gacaagacgt gaagtggtgg aggtgggcag 2220
gatcccagcg ctaccatctt cttgaaccag tgatctcaac acatcggatt tctgtttcct 2280
catctgcaaa atgggatcag tgagctcagg tgggtcacaa attctacagg aactacttta 2340
gccaagcccg gccccctgaa agttcccctc ggtgggctgt tagggtgatt gttttcatct 2400
gtggggctcc tgatgcgtcc cacccaccag ccttggagag ggtgggatgg gagggtgggg 2460
tgcttgggga gacaagccta gagcctgggc ctcccacccc actgcctccc cccatcccag 2520
ggccccccac ccagtgacaa agcccgtggc acttcctcta cccggttggc aggcggcctg 2580
gcccagcccc ttctctaagg aagcgcattt cctgcctccc tgggccggcc gggctggatg 2640
agccgggagc tccctgctgc cggtcatacc acagccttca tctgcgccct ggggccagga 2700
ctgctgctgt cactgccatc cattggagcc cagcaccccc tccccgccca tccttcggac 2760
agcaactcca gcccagcccc gcgtccctgt gtccacttct cctgaccctc ggccgccacc 2820
ccagaaggct ggagcaggga cgccgtcgct ccggccgcct gctcccctcg ggtccccgtg 2880
cgagcccacg ccggccccgg tgcccgcccg cagccctgcc actggacaca ggataaggcc 2940
cagcgcacag gcccccacgt ggacagcatg gaccgcggca cgctccctct ggctgttgcc 3000
ctgctgctgg ccagctgcag cctcagcccc acaagtaggt gtccagggac ccagggtggg 3060
gagactcggc ctccggtgca cggaccaggc cccaagtatt cccggcctcc ttcctgtatc 3120
ctgagctcac gcccagcaga gccatccttg gggctctgga gggtcaccaa ccctcccagt 3180
ttgctggaac taaatggtta tgcaggactt tcagtgttga aagaaagcct cgggcaaact 3240
gggctgactc tttcacttta accctggtct ctggcgtctg ctcacccagc tg 3292
<210> 211
<211> 22
<212> PRT
<213> 智人
<400> 211
Met Asp Arg Gly Thr Leu Pro Leu Ala Val Ala Leu Leu Leu Ala Ser
1 5 10 15
Cys Ser Leu Ser Pro Thr
20
<210> 212
<211> 1399
<212> DNA
<213> 智人
<400> 212
agtgtgaaat cttcagagaa gaatttctct ttagttcttt gcaagaaggt agagataaag 60
acactttttc aaaaatggca atggtatcag aattcctcaa gcaggcctgg tttattgaaa 120
atgaagagca ggaatatgtt caaactgtga agtcatccaa aggtggtccc ggatcagcgg 180
tgagccccta tcctaccttc aatccatcct cggatgtcgc tgccttgcat aaggccataa 240
tggttaaagg tgtggatgaa gcaaccatca ttgacattct aactaagcga aacaatgcac 300
agcgtcaaca gatcaaagca gcatatctcc aggaaacagg aaagcccctg gatgaaacac 360
ttaagaaagc ccttacaggt caccttgagg aggttgtttt agctctgcta aaaactccag 420
cgcaatttga tgctgatgaa cttcgtgctg ccatgaaggg ccttggaact gatgaagata 480
ctctaattga gattttggca tcaagaacta acaaagaaat cagagacatt aacagggtct 540
acagagagga actgaagaga gatctggcca aagacataac ctcagacaca tctggagatt 600
ttcggaacgc tttgctttct cttgctaagg gtgaccgatc tgaggacttt ggtgtgaatg 660
aagacttggc tgattcagat gccagggcct tgtatgaagc aggagaaagg agaaagggga 720
cagacgtaaa cgtgttcaat accatcctta ccaccagaag ctatccacaa cttcgcagag 780
tgtttcagaa atacaccaag tacagtaagc atgacatgaa caaagttctg gacctggagt 840
tgaaaggtga cattgagaaa tgcctcacag ctatcgtgaa gtgcgccaca agcaaaccag 900
ctttctttgc agagaagctt catcaagcca tgaaaggtgt tggaactcgc cataaggcat 960
tgatcaggat tatggtttcc cgttctgaaa ttgacatgaa tgatatcaaa gcattctatc 1020
agaagatgta tggtatctcc ctttgccaag ccatcctgga tgaaaccaaa ggagattatg 1080
agaaaatcct ggtggctctt tgtggaggaa actaaacatt cccttgatgg tctcaagcta 1140
tgatcagaag actttaatta tatattttca tcctataagc ttaaatagga aagtttcttc 1200
aacaggatta cagtgtagct acctacatgc tgaaaaatat agcctttaaa tcatttttat 1260
attataactc tgtataatag agataagtcc attttttaaa aatgttttcc ccaaaccata 1320
aaaccctata caagttgttc tagtaacaat acatgagaaa gatgtctatg tagctgaaaa 1380
taaaatgacg tcacaagac 1399
<210> 213
<211> 346
<212> PRT
<213> 智人
<400> 213
Met Ala Met Val Ser Glu Phe Leu Lys Gln Ala Trp Phe Ile Glu Asn
1 5 10 15
Glu Glu Gln Glu Tyr Val Gln Thr Val Lys Ser Ser Lys Gly Gly Pro
20 25 30
Gly Ser Ala Val Ser Pro Tyr Pro Thr Phe Asn Pro Ser Ser Asp Val
35 40 45
Ala Ala Leu His Lys Ala Ile Met Val Lys Gly Val Asp Glu Ala Thr
50 55 60
Ile Ile Asp Ile Leu Thr Lys Arg Asn Asn Ala Gln Arg Gln Gln Ile
65 70 75 80
Lys Ala Ala Tyr Leu Gln Glu Thr Gly Lys Pro Leu Asp Glu Thr Leu
85 90 95
Lys Lys Ala Leu Thr Gly His Leu Glu Glu Val Val Leu Ala Leu Leu
100 105 110
Lys Thr Pro Ala Gln Phe Asp Ala Asp Glu Leu Arg Ala Ala Met Lys
115 120 125
Gly Leu Gly Thr Asp Glu Asp Thr Leu Ile Glu Ile Leu Ala Ser Arg
130 135 140
Thr Asn Lys Glu Ile Arg Asp Ile Asn Arg Val Tyr Arg Glu Glu Leu
145 150 155 160
Lys Arg Asp Leu Ala Lys Asp Ile Thr Ser Asp Thr Ser Gly Asp Phe
165 170 175
Arg Asn Ala Leu Leu Ser Leu Ala Lys Gly Asp Arg Ser Glu Asp Phe
180 185 190
Gly Val Asn Glu Asp Leu Ala Asp Ser Asp Ala Arg Ala Leu Tyr Glu
195 200 205
Ala Gly Glu Arg Arg Lys Gly Thr Asp Val Asn Val Phe Asn Thr Ile
210 215 220
Leu Thr Thr Arg Ser Tyr Pro Gln Leu Arg Arg Val Phe Gln Lys Tyr
225 230 235 240
Thr Lys Tyr Ser Lys His Asp Met Asn Lys Val Leu Asp Leu Glu Leu
245 250 255
Lys Gly Asp Ile Glu Lys Cys Leu Thr Ala Ile Val Lys Cys Ala Thr
260 265 270
Ser Lys Pro Ala Phe Phe Ala Glu Lys Leu His Gln Ala Met Lys Gly
275 280 285
Val Gly Thr Arg His Lys Ala Leu Ile Arg Ile Met Val Ser Arg Ser
290 295 300
Glu Ile Asp Met Asn Asp Ile Lys Ala Phe Tyr Gln Lys Met Tyr Gly
305 310 315 320
Ile Ser Leu Cys Gln Ala Ile Leu Asp Glu Thr Lys Gly Asp Tyr Glu
325 330 335
Lys Ile Leu Val Ala Leu Cys Gly Gly Asn
340 345
<210> 214
<211> 3080
<212> DNA
<213> 智人
<400> 214
cactggcttc aggagctgaa taccctccca ggcacacaca ggtgggacac aaataagggt 60
tttggaacca ctattttctc atcacgacag caacttaaaa tgcctgggaa gatggtcgtg 120
atccttggag cctcaaatat actttggata atgtttgcag cttctcaagc ttttaaaatc 180
gagaccaccc cagaatctag atatcttgct cagattggtg actccgtctc attgacttgc 240
agcaccacag gctgtgagtc cccatttttc tcttggagaa cccagataga tagtccactg 300
aatgggaagg tgacgaatga ggggaccaca tctacgctga caatgaatcc tgttagtttt 360
gggaacgaac actcttacct gtgcacagca acttgtgaat ctaggaaatt ggaaaaagga 420
atccaggtgg agatctactc ttttcctaag gatccagaga ttcatttgag tggccctctg 480
gaggctggga agccgatcac agtcaagtgt tcagttgctg atgtataccc atttgacagg 540
ctggagatag acttactgaa aggagatcat ctcatgaaga gtcaggaatt tctggaggat 600
gcagacagga agtccctgga aaccaagagt ttggaagtaa cctttactcc tgtcattgag 660
gatattggaa aagttcttgt ttgccgagct aaattacaca ttgatgaaat ggattctgtg 720
cccacagtaa ggcaggctgt aaaagaattg caagtctaca tatcacccaa gaatacagtt 780
atttctgtga atccatccac aaagctgcaa gaaggtggct ctgtgaccat gacctgttcc 840
agcgagggtc taccagctcc agagattttc tggagtaaga aattagataa tgggaatcta 900
cagcaccttt ctggaaatgc aactctcacc ttaattgcta tgaggatgga agattctgga 960
atttatgtgt gtgaaggagt taatttgatt gggaaaaaca gaaaagaggt ggaattaatt 1020
gttcaagaga aaccatttac tgttgagatc tcccctggac cccggattgc tgctcagatt 1080
ggagactcag tcatgttgac atgtagtgtc atgggctgtg aatccccatc tttctcctgg 1140
agaacccaga tagacagccc tctgagcggg aaggtgagga gtgaggggac caattccacg 1200
ctgaccctga gccctgtgag ttttgagaac gaacactctt atctgtgcac agtgacttgt 1260
ggacataaga aactggaaaa gggaatccag gtggagctct actcattccc tagagatcca 1320
gaaatcgaga tgagtggtgg cctcgtgaat gggagctctg tcactgtaag ctgcaaggtt 1380
cctagcgtgt acccccttga ccggctggag attgaattac ttaaggggga gactattctg 1440
gagaatatag agtttttgga ggatacggat atgaaatctc tagagaacaa aagtttggaa 1500
atgaccttca tccctaccat tgaagatact ggaaaagctc ttgtttgtca ggctaagtta 1560
catattgatg acatggaatt cgaacccaaa caaaggcaga gtacgcaaac actttatgtc 1620
aatgttgccc ccagagatac aaccgtcttg gtcagccctt cctccatcct ggaggaaggc 1680
agttctgtga atatgacatg cttgagccag ggctttcctg ctccgaaaat cctgtggagc 1740
aggcagctcc ctaacgggga gctacagcct ctttctgaga atgcaactct caccttaatt 1800
tctacaaaaa tggaagattc tggggtttat ttatgtgaag gaattaacca ggctggaaga 1860
agcagaaagg aagtggaatt aattatccaa gttactccaa aagacataaa acttacagct 1920
tttccttctg agagtgtcaa agaaggagac actgtcatca tctcttgtac atgtggaaat 1980
gttccagaaa catggataat cctgaagaaa aaagcggaga caggagacac agtactaaaa 2040
tctatagatg gcgcctatac catccgaaag gcccagttga aggatgcggg agtatatgaa 2100
tgtgaatcta aaaacaaagt tggctcacaa ttaagaagtt taacacttga tgttcaagga 2160
agagaaaaca acaaagacta tttttctcct gagcttctcg tgctctattt tgcatcctcc 2220
ttaataatac ctgccattgg aatgataatt tactttgcaa gaaaagccaa catgaagggg 2280
tcatatagtc ttgtagaagc acagaaatca aaagtgtagc taatgcttga tatgttcaac 2340
tggagacact atttatctgt gcaaatcctt gatactgctc atcattcctt gagaaaaaca 2400
atgagctgag aggcagactt ccctgaatgt attgaacttg gaaagaaatg cccatctatg 2460
tcccttgctg tgagcaagaa gtcaaagtaa aacttgctgc ctgaagaaca gtaactgcca 2520
tcaagatgag agaactggag gagttccttg atctgtatat acaataacat aatttgtaca 2580
tatgtaaaat aaaattatgc catagcaaga ttgcttaaaa tagcaacact ctatatttag 2640
attgttaaaa taactagtgt tgcttggact attataattt aatgcatgtt aggaaaattt 2700
cacattaata tttgctgaca gctgaccttt gtcatctttc ttctatttta ttccctttca 2760
caaaatttta ttcctatata gtttattgac aataatttca ggttttgtaa agatgccggg 2820
ttttatattt ttatagacaa ataataagca aagggagcac tgggttgact ttcaggtact 2880
aaatacctca acctatggta taatggttga ctgggtttct ctgtatagta ctggcatggt 2940
acggagatgt ttcacgaagt ttgttcatca gactcctgtg caactttccc aatgtggcct 3000
aaaaatgcaa cttcttttta ttttcttttg taaatgttta ggtttttttg tatagtaaag 3060
tgataatttc tggaattaaa 3080
<210> 215
<211> 739
<212> PRT
<213> 智人
<400> 215
Met Pro Gly Lys Met Val Val Ile Leu Gly Ala Ser Asn Ile Leu Trp
1 5 10 15
Ile Met Phe Ala Ala Ser Gln Ala Phe Lys Ile Glu Thr Thr Pro Glu
20 25 30
Ser Arg Tyr Leu Ala Gln Ile Gly Asp Ser Val Ser Leu Thr Cys Ser
35 40 45
Thr Thr Gly Cys Glu Ser Pro Phe Phe Ser Trp Arg Thr Gln Ile Asp
50 55 60
Ser Pro Leu Asn Gly Lys Val Thr Asn Glu Gly Thr Thr Ser Thr Leu
65 70 75 80
Thr Met Asn Pro Val Ser Phe Gly Asn Glu His Ser Tyr Leu Cys Thr
85 90 95
Ala Thr Cys Glu Ser Arg Lys Leu Glu Lys Gly Ile Gln Val Glu Ile
100 105 110
Tyr Ser Phe Pro Lys Asp Pro Glu Ile His Leu Ser Gly Pro Leu Glu
115 120 125
Ala Gly Lys Pro Ile Thr Val Lys Cys Ser Val Ala Asp Val Tyr Pro
130 135 140
Phe Asp Arg Leu Glu Ile Asp Leu Leu Lys Gly Asp His Leu Met Lys
145 150 155 160
Ser Gln Glu Phe Leu Glu Asp Ala Asp Arg Lys Ser Leu Glu Thr Lys
165 170 175
Ser Leu Glu Val Thr Phe Thr Pro Val Ile Glu Asp Ile Gly Lys Val
180 185 190
Leu Val Cys Arg Ala Lys Leu His Ile Asp Glu Met Asp Ser Val Pro
195 200 205
Thr Val Arg Gln Ala Val Lys Glu Leu Gln Val Tyr Ile Ser Pro Lys
210 215 220
Asn Thr Val Ile Ser Val Asn Pro Ser Thr Lys Leu Gln Glu Gly Gly
225 230 235 240
Ser Val Thr Met Thr Cys Ser Ser Glu Gly Leu Pro Ala Pro Glu Ile
245 250 255
Phe Trp Ser Lys Lys Leu Asp Asn Gly Asn Leu Gln His Leu Ser Gly
260 265 270
Asn Ala Thr Leu Thr Leu Ile Ala Met Arg Met Glu Asp Ser Gly Ile
275 280 285
Tyr Val Cys Glu Gly Val Asn Leu Ile Gly Lys Asn Arg Lys Glu Val
290 295 300
Glu Leu Ile Val Gln Glu Lys Pro Phe Thr Val Glu Ile Ser Pro Gly
305 310 315 320
Pro Arg Ile Ala Ala Gln Ile Gly Asp Ser Val Met Leu Thr Cys Ser
325 330 335
Val Met Gly Cys Glu Ser Pro Ser Phe Ser Trp Arg Thr Gln Ile Asp
340 345 350
Ser Pro Leu Ser Gly Lys Val Arg Ser Glu Gly Thr Asn Ser Thr Leu
355 360 365
Thr Leu Ser Pro Val Ser Phe Glu Asn Glu His Ser Tyr Leu Cys Thr
370 375 380
Val Thr Cys Gly His Lys Lys Leu Glu Lys Gly Ile Gln Val Glu Leu
385 390 395 400
Tyr Ser Phe Pro Arg Asp Pro Glu Ile Glu Met Ser Gly Gly Leu Val
405 410 415
Asn Gly Ser Ser Val Thr Val Ser Cys Lys Val Pro Ser Val Tyr Pro
420 425 430
Leu Asp Arg Leu Glu Ile Glu Leu Leu Lys Gly Glu Thr Ile Leu Glu
435 440 445
Asn Ile Glu Phe Leu Glu Asp Thr Asp Met Lys Ser Leu Glu Asn Lys
450 455 460
Ser Leu Glu Met Thr Phe Ile Pro Thr Ile Glu Asp Thr Gly Lys Ala
465 470 475 480
Leu Val Cys Gln Ala Lys Leu His Ile Asp Asp Met Glu Phe Glu Pro
485 490 495
Lys Gln Arg Gln Ser Thr Gln Thr Leu Tyr Val Asn Val Ala Pro Arg
500 505 510
Asp Thr Thr Val Leu Val Ser Pro Ser Ser Ile Leu Glu Glu Gly Ser
515 520 525
Ser Val Asn Met Thr Cys Leu Ser Gln Gly Phe Pro Ala Pro Lys Ile
530 535 540
Leu Trp Ser Arg Gln Leu Pro Asn Gly Glu Leu Gln Pro Leu Ser Glu
545 550 555 560
Asn Ala Thr Leu Thr Leu Ile Ser Thr Lys Met Glu Asp Ser Gly Val
565 570 575
Tyr Leu Cys Glu Gly Ile Asn Gln Ala Gly Arg Ser Arg Lys Glu Val
580 585 590
Glu Leu Ile Ile Gln Val Thr Pro Lys Asp Ile Lys Leu Thr Ala Phe
595 600 605
Pro Ser Glu Ser Val Lys Glu Gly Asp Thr Val Ile Ile Ser Cys Thr
610 615 620
Cys Gly Asn Val Pro Glu Thr Trp Ile Ile Leu Lys Lys Lys Ala Glu
625 630 635 640
Thr Gly Asp Thr Val Leu Lys Ser Ile Asp Gly Ala Tyr Thr Ile Arg
645 650 655
Lys Ala Gln Leu Lys Asp Ala Gly Val Tyr Glu Cys Glu Ser Lys Asn
660 665 670
Lys Val Gly Ser Gln Leu Arg Ser Leu Thr Leu Asp Val Gln Gly Arg
675 680 685
Glu Asn Asn Lys Asp Tyr Phe Ser Pro Glu Leu Leu Val Leu Tyr Phe
690 695 700
Ala Ser Ser Leu Ile Ile Pro Ala Ile Gly Met Ile Ile Tyr Phe Ala
705 710 715 720
Arg Lys Ala Asn Met Lys Gly Ser Tyr Ser Leu Val Glu Ala Gln Lys
725 730 735
Ser Lys Val
<210> 216
<211> 5
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 重链 CDR1
<400> 216
Asp Asn Tyr Met His
1 5
<210> 217
<211> 17
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 重链 CDR2
<400> 217
Trp Ile Asp Pro Glu Asn Gly Asp Thr Glu Tyr Ala Pro Lys Phe Arg
1 5 10 15
Gly
<210> 218
<211> 11
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 重链 CDR3
<400> 218
Leu Ile Tyr Ala Gly Tyr Leu Ala Met Asp Tyr
1 5 10
<210> 219
<211> 10
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 轻链 CDR1
<400> 219
Ser Ala Ser Ser Ser Val Thr Tyr Met His
1 5 10
<210> 220
<211> 7
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 轻链 CDR2
<400> 220
Ser Thr Ser Asn Leu Ala Ser
1 5
<210> 221
<211> 9
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 轻链 CDR3
<400> 221
Gln Gln Arg Ser Thr Tyr Pro Leu Thr
1 5
<210> 222
<211> 11
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 轻链 CDR1
<400> 222
Lys Ala Ser Gln Asp Val Gly Thr Ser Val Ala
1 5 10
<210> 223
<211> 7
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 轻链 CDR2
<400> 223
Trp Thr Ser Thr Arg His Thr
1 5
<210> 224
<211> 8
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 轻链 CDR3
<400> 224
Gln Gln Tyr Ser Leu Tyr Arg Ser
1 5
<210> 225
<211> 5
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 重链 CDR1
<400> 225
Thr Tyr Trp Met Ser
1 5
<210> 226
<211> 17
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 重链 CDR2
<400> 226
Glu Ile His Pro Asp Ser Ser Thr Ile Asn Tyr Ala Pro Ser Leu Lys
1 5 10 15
Asp
<210> 227
<211> 10
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 重链 CDR3
<400> 227
Leu Tyr Phe Gly Phe Pro Trp Phe Ala Tyr
1 5 10
<210> 228
<211> 15
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 轻链 CDR1
<400> 228
Lys Ala Ser Gln Ser Val Asp Tyr Asp Gly Asp Ser Tyr Leu Asn
1 5 10 15
<210> 229
<211> 7
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 轻链 CDR2
<400> 229
Asp Ala Ser Asn Leu Val Ser
1 5
<210> 230
<211> 9
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 轻链 CDR3
<400> 230
Gln Gln Ser Thr Glu Asp Pro Trp Thr
1 5
<210> 231
<211> 5
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 重链 CDR1
<400> 231
Ser Tyr Trp Met Asn
1 5
<210> 232
<211> 17
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 重链 CDR2
<400> 232
Gln Ile Trp Pro Gly Asp Gly Asp Thr Asn Tyr Asn Gly Lys Phe Lys
1 5 10 15
Gly
<210> 233
<211> 15
<212> PRT
<213> 人工序列
<220>
<223> 合成序列: 重链 CDR3
<400> 233
Arg Glu Thr Thr Thr Val Gly Arg Tyr Tyr Tyr Ala Met Asp Tyr
1 5 10 15
Claims (20)
1.一种以下式III的化合物:
或其药用盐,
其中:
R22选自:
(a)式IVa:
其中,A是苯基基团,并且以下两者之一:
(i)Q1是单键,并且Q2选自单键和-Z-(CH2)n-,其中Z选自单键、O、S和NH,并且n是1至3;或
(ii)Q1是-CH=CH-,并且Q2是单键;
(b)式IVb:
其中RC1、RC2和RC3独立地选自H和未取代的C1-2烷基;
(c)式IVc:
其中,Q选自OH、SH和NRN,并且RN选自H、甲基和乙基;
X选自包括以下的组:OH、SH、CO2H、COH、N=C=O、NHNH2、CONHNH2、和NHRN,其中,RN选自包括H和C1-4烷基的组;
L4选自单键和以下组:
(a):
其中,n是0至3;
(b)其中,n为以上定义;
(c)其中,n为以上定义;和
(d)其中,n为以上定义,E是O、S或NR,D是N、CH或CR,并且F是N、CH或CR;
L3是:其中,X使得L3是氨基酸残基、二肽残基或三肽残基;
Prot选自Fmoc(芴甲氧羰基)、Teoc(2-(三甲基硅烷基)乙氧基羰基)和Boc(叔丁氧基羰基);
R6和R9独立地选自H、R、OH、OR、SH、SR、NH2、NHR、NRR′、硝基、Me3Sn和卤素;
R7选自H、R、OH、OR、SH、SR、NH2、NHR、NHRR’、硝基、Me3Sn和卤素;
R6’、R7’、R9’分别选自与R6、R7和R9相同的基团;
当C2’和C3’之间存在双键时,R12选自由以下组成的组:
(ia)苯基、萘基、薁基或C5-10杂芳基,可选地被选自包括以下的组中的一个或多个取代基取代:卤素、硝基、氰基、C1-7烷氧基、C3-20杂环氧基、C5-20芳氧基、羧基、-C(=O)OC1-7烷基、-C(=O)OC3-20杂环基、-C(=O)OC5-20芳基、C1-7烷基、C3-7杂环基和双-氧基-C1-3亚烷基;
(ib)C1-5饱和脂肪族烷基;
(ic)C3-6饱和环烷基;
(id)其中,R21、R22和R23中的每个独立地选自H、C1-3饱和烷基、C2-3烯基、C2-3炔基和环丙基,其中,R12基团中碳原子的总数不大于5;
(ie)其中,R25a和R25b中的一个是H,并且另一个选自:苯基,所述苯基可选地被选自卤素、甲基、甲氧基的基团取代;吡啶基;和苯硫基;和
(if)其中,R24选自:H;C1-3饱和烷基;C2-3烯基;C2-3炔基;环丙基;苯基,所述苯基可选地被选自卤素、甲基、甲氧基的基团取代;吡啶基;和苯硫基;
当C2’和C3’之间存在单键时,
R12是H或其中,R26a和R26b独立地选自H、F、C1-4饱和烷基、C2-3烯基,所述烷基和烯基基团可选地被选自C1-4烷基酰氨基和C1-4烷基酯的基团取代;或者,当R26a和R26b中的一个是H时,另一个选自腈和C1-4烷基酯;
R″是C3-12亚烷基基团,其链被一个或多个以下项中断:选自苯或吡啶的芳环;
Y和Y′选自O、S或NH;
以下两者之一:
(A)R20是H或Me且R21a和R21b都是H或一起形成=O,并且以下各项之一:
(i)R10是H,R11a是H且R11b是OH或ORA,其中,RA是C1-4烷基;或
(ii)R10和R11b形成它们连接的氮和碳原子之间的氮-碳双键且R11a是H;或
(iii)R10是H,R11a是H且R11b是SOzM,其中,z是2或3且M是单价药用阳离子;或者
(B)R10是H或Me且R11a和R11b都是H或一起形成=O,并且以下各项之一:
(i)R20是H,R21a是H且R21b是OH或ORA,其中,RA是C1-4烷基;或
(ii)R20和R21b形成它们连接的氮和碳原子之间的氮-碳双键且R11a是H;或
(iii)R20是H,R21a是H且R21b是SOzM,其中,z是2或3且M是单价药用阳离子,
其中所述C3-20杂环氧基和所述-C(=O)OC3-20杂环基独立地含有选自N、O和S的1至10个环杂原子;
所述C3-7杂环基含有选自N、O和S的1至4个环杂原子。
2.一种具有式V的结合物:
L-(LU-D)p (V),
其中,L是抗体或抗体片段,
LU是接头单元,
p是1至20;并且
D是药物单元,所述药物单元是根据式I的PBD二聚物,或其药用盐,
其中,R2是式IIa、式IIb或式IIc:
(a)
其中,A是苯基基团,并且以下两者之一:
(i)Q1是单键,并且Q2选自单键和-Z-(CH2)n-,其中Z选自单键、O、S和NH,并且n是1至3;或
(ii)Q1是-CH=CH-,并且Q2是单键;
(b)
其中:
RC1、RC2和RC3独立地选自H和未取代的C1-2烷基;
(c)
其中,Q选自OH、SH和NRN,并且RN选自H、甲基和乙基;
X选自包括以下的组:OH、SH、CO2H、COH、N=C=O、NHNH2、CONHNH2、和NHRN,其中,RN选自包括H和C1-4烷基的组;
并且以下两者之一:
当C2’和C3’之间存在双键时,R12选自由以下组成的组:
(ia)苯基、萘基、薁基或C5-10杂芳基,可选地被选自包括以下的组中的一个或多个取代基取代:卤素、硝基、氰基、C1-7烷氧基、C3-20杂环氧基、C5-20芳氧基、羧基、-C(=O)OC1-7烷基、-C(=O)OC3-20杂环基、-C(=O)OC5-20芳基、C1-7烷基、C3-7杂环基和双-氧基-C1-3亚烷基;
(ib)C1-5饱和脂肪族烷基;
(ic)C3-6饱和环烷基;
(id)其中,R21、R22和R23中的每个独立地选自H、C1-3饱和烷基、C2-3烯基、C2-3炔基和环丙基,其中,R12基团中碳原子的总数不大于5;
(ie)其中,R25a和R25b中的一个是H,并且另一个选自:苯基,所述苯基可选地被选自卤素、甲基、甲氧基的基团取代;吡啶基;和苯硫基;和
(if)其中,R24选自:H;C1-3饱和烷基;C2-3烯基;C2-3炔基;环丙基;苯基,所述苯基可选地被选自卤素、甲基、甲氧基的基团取代;吡啶基;和苯硫基;
当C2’和C3’之间存在单键时,
R12是H或其中,R26a和R26b独立地选自H、F、C1-4饱和烷基、C2-3烯基,所述烷基和烯基基团可选地被选自C1-4烷基酰氨基和C1-4烷基酯的基团取代;或者,当R26a和R26b中的一个是H时,另一个选自腈和C1-4烷基酯;
R6和R9独立地选自H、R、OH、OR、SH、SR、NH2、NHR、NRR’、硝基、Me3Sn和卤素;
其中,R和R’独立地选自可选取代的C1-12烷基、C3-20杂环基和C5-20芳基基团,所述C3-20杂环氧基具有选自N、O和S的1至10个环杂原子;
R7选自H、R、OH、OR、SH、SR、NH2、NHR、NHRR’、硝基、Me3Sn和卤素;
R″是C3-12亚烷基基团,其链被一个或多个以下项中断:选自苯或吡啶的芳环;
Y和Y′选自O、S或NH;
R6’、R7’、R9’选自分别与R6、R7和R9相同的基团;
以下两者之一:
(A)R20是H或Me且R21a和R21b都是H或一起形成=O,并且以下各项之一:
(i)R10是H,R11a是H且R11b是OH或ORA,其中,RA是C1-4烷基;或
(ii)R10和R11b形成它们连接的氮和碳原子之间的氮-碳双键且R11a是H;或
(iii)R10是H,R11a是H且R11b是SOzM,其中,z是2或3且M是单价药用阳离子;或者
(B)R10是H或Me且R11a和R11b都是H或一起形成=O,并且以下各项之一:
(i)R20是H,R21a是H且R21b是OH或ORA,其中,RA是C1-4烷基;或
(ii)R20和R21b形成它们连接的氮和碳原子之间的氮-碳双键且R11a是H;或
(iii)R20是H,R21a是H且R21b是SOzM,其中,z是2或3且M是单价药用阳离子,
其中,LU通过R2的X取代基连接至D,
其中所述接头单元(LU)具有式(Va):
-A1-L3-L4- (Va),
其中:
-A1-是延伸子单元,
p是1-20;
L4选自单键和以下组:
(a):
其中,n是0至3;
(b)其中,n为以上定义;
(c)其中,n为以上定义;和
(d)其中,n为以上定义,E是O、S或NR,D是N、CH或CR,并且F是N、CH或CR;
L3是:其中,X使得L3是氨基酸残基、二肽残基或三肽残基;
其中A1选自:
其中,星号表示附接至L3的点,波浪线表示附接至所述配体单元的点,并且n是0至6;
其中,星号表示附接至L3的点,波浪线表示附接至所述配体单元的点,并且n是0至6;
其中,星号表示附接至L3的点,波浪线表示附接至所述配体单元的点,n是0或1,并且m是0至30;或
其中,星号表示附接至L3的点,波浪线表示附接至所述配体单元的点,n是0或1,并且m是0至30,
其中所述C3-20杂环氧基和所述-C(=O)OC3-20杂环基独立地含有选自N、O和S的1至10个环杂原子;所述C3-7杂环基含有选自N、O和S的1至4个环杂原子。
3.根据权利要求2所述的结合物,其中,所述接头单元(LU)具有式(Va),L3是二肽,L4是单键,且A1选自:
其中,星号表示附接至L3的点,波浪线表示附接至所述配体单元的点,并且n是0至6;
其中,星号表示附接至L3的点,波浪线表示附接至所述配体单元的点,并且n是0至6;
其中,星号表示附接至L3的点,波浪线表示附接至所述配体单元的点,n是0或1,并且m是0至30;或
其中,星号表示附接至L3的点,波浪线表示附接至所述配体单元的点,n是0或1,并且m是0至30。
4.一种以下式VI的药物接头:
LU-D (VI),
或其药用盐,其中,LU是接头单元并且其中D是药物单元,所述药物单元是根据式I的PBD二聚物,
R2是式IIa、式IIb或式IIc:
(a)
其中,A是苯基基团,并且以下两者之一:
(i)Q1是单键,并且Q2选自单键和-Z-(CH2)n-,其中Z选自单键、O、S和NH,并且n是1至3;或
(ii)Q1是-CH=CH-,并且Q2是单键;
(b)
其中:
RC1、RC2和RC3独立地选自H和未取代的C1-2烷基;
(c)
其中,Q选自OH、SH和NRN,并且RN选自H、甲基和乙基;
X选自包括以下的组:*-O-q、*-S-q、*-CO2-q、*-CO-q、*-NH(C=O)-q、*-NHNH-q、*-CONHNH-q、 其中,RN选自包括H和C1-4烷基的组,且星号表示附接至药物单元的剩余部分的点以及波浪线或q表示附接至接头单元的点;
并且以下两者之一:
当C2’和C3’之间存在双键时,R12选自由以下组成的组:
(ia)苯基、萘基、薁基或C5-10杂芳基,可选地被选自包括以下的组中的一个或多个取代基取代:卤素、硝基、氰基、C1-7烷氧基、C3-20杂环氧基、C5-20芳氧基、羧基、-C(=O)OC1-7烷基、-C(=O)OC3-20杂环基、-C(=O)OC5-20芳基、C1-7烷基、C3-7杂环基和双-氧基-C1-3亚烷基;
(ib)C1-5饱和脂肪族烷基;
(ic)C3-6饱和环烷基;
(id)其中,R21、R22和R23中的每个独立地选自H、C1-3饱和烷基、C2-3烯基、C2-3炔基和环丙基,其中,R12基团中碳原子的总数不大于5;
(ie)其中,R25a和R25b中的一个是H,并且另一个选自:苯基,所述苯基可选地被选自卤素、甲基、甲氧基的基团取代;吡啶基;和苯硫基;和
(if)其中,R24选自:H;C1-3饱和烷基;C2-3烯基;C2-3炔基;环丙基;苯基,所述苯基可选地被选自卤素、甲基、甲氧基的基团取代;吡啶基;和苯硫基;
当C2’和C3’之间存在单键时,
R12是H或其中,R26a和R26b独立地选自H、F、C1-4饱和烷基、C2-3烯基,所述烷基和烯基基团可选地被选自C1-4烷基酰氨基和C1-4烷基酯的基团取代;或者,当R26a和R26b中的一个是H时,另一个选自腈和C1-4烷基酯;
R6和R9独立地选自H、R、OH、OR、SH、SR、NH2、NHR、NRR’、硝基、Me3Sn和卤素;其中,R和R’独立地选自可选取代的C1-12烷基、C3-20杂环基和C5-20芳基基团,所述C3-20杂环氧基具有选自N、O和S的1至10个环杂原子;
R7选自H、R、OH、OR、SH、SR、NH2、NHR、NHRR’、硝基、Me3Sn和卤素;
R″是C3-12亚烷基基团,其链被一个或多个以下项中断:选自苯或吡啶的芳环;
Y和Y′选自O、S或NH;
R6’、R7’、R9’选自分别与R6、R7和R9相同的基团;
以下两者之一:
(A)R20是H或Me且R21a和R21b都是H或一起形成=O,并且以下各项之一:
(i)R10是H,R11a是H且R11b是OH或ORA,其中,RA是C1-4烷基;或
(ii)R10和R11b形成它们连接的氮和碳原子之间的氮-碳双键且R11a是H;或
(iii)R10是H,R11a是H且R11b是SOzM,其中,z是2或3且M是单价药用阳离子;或者
(B)R10是H或Me且R11a和R11b都是H或一起形成=O,并且以下各项之一:
(i)R20是H,R21a是H且R21b是OH或ORA,其中,RA是C1-4烷基;或
(ii)R20和R21b形成它们连接的氮和碳原子之间的氮-碳双键且R11a是H;或
(iii)R20是H,R21a是H且R21b是SOzM,其中,z是2或3且M是单价药用阳离子,
其中所述C3-20杂环氧基和所述-C(=O)OC3-20杂环基独立地含有选自N、O和S的1至10个环杂原子;所述C3-7杂环基含有选自N、O和S的1至4个环杂原子
其中所述接头单元(LU)具有式(Va):
G1-L3-L4- (Va),
其中:
G1-是延伸子单元,
p是1-20;
L4选自单键和以下组:
(a):
其中,n是0至3;
(b)其中,n为以上定义;
(c)其中,n为以上定义;和
(d)其中,n为以上定义,E是O、S或NR,D是N、CH或CR,并且F是N、CH或CR;
L3是:其中,X使得L3是氨基酸残基、二肽残基或三肽残基;
其中,G1选自
其中,星号表示附接至L3的点,波浪线表示附接至所述配体单元的点,并且n是0至6;
其中,星号表示附接至L3的点,波浪线表示附接至所述配体单元的点,并且n是0至6;
其中,星号表示附接至L3的点,波浪线表示附接至所述配体单元的点,n是0或1,并且m是0至30;或
其中,星号表示附接至L3的点,波浪线表示附接至所述配体单元的点,n是0或1,并且m是0至30。
5.根据权利要求2所述的结合物,其中,R7是C1-4烷氧基基团。
6.根据权利要求2所述的结合物,其中,Y是O,R″是C3-7亚烷基,其链被一个或多个以下项中断:选自苯或吡啶的芳环。
7.根据权利要求2所述的结合物,其中,R6和R9是H。
8.根据权利要求2所述的结合物,其中,C2’和C3’之间存在双键,且R12是:
(a)C5-7芳基基团,可选地被选自包括以下的组中的一个或多个取代基取代:甲氧基、乙氧基、氟、氯、氰基、双-氧基-亚甲基、甲基-哌嗪基、吗啉基和甲基-苯硫基;或
(b)甲基、乙基或丙基;或
(c)环丙基;或
(d)下式的基团
其中,R12基团中碳原子的总数不大于4;或
(e)基团或
(f)下式的基团
其中,R24选自H和甲基。
9.根据权利要求2所述的结合物,其中,C2’和C3’之间存在单键,R12是并且
(a)R26a和R26b都是H;或
(b)R26a和R26b都是甲基;或
(c)R26a和R26b中的一个是H,并且另一个选自C1-4饱和烷基、C2-3烯基,所述烷基和烯基基团是可选取代的。
10.根据权利要求2所述的结合物,其中,R2是式IIa,A是苯基,Q1是单键,且Q2是单键。
11.根据权利要求2所述的结合物,其中,R2是式IIb,且RC1、RC2和RC3都是H。
12.根据权利要求10所述的结合物,其中,X是NH2。
13.根据权利要求2所述的结合物,其中,R2是式IIc,并且Q是NRN,其中RN是H或甲基。
14.根据权利要求2所述的结合物,其中,R6’、R7’、R9’和Y’分别与R6、R7、R9和Y相同。
15.根据权利要求2所述的结合物,其中,R20是H或Me,并且R21a和R21b都是H。
16.根据权利要求2所述的结合物,其中,R20是H或Me,并且R21a和
R21b一起形成=O。
17.根据权利要求15所述的结合物,其中,R10和R11b形成它们连接的氮和碳原子之间的氮-碳双键,并且R11a是H。
18.根据权利要求2所述的结合物,其中,其中,R10是H或Me,并且R11a和R11b都是H。
19.根据权利要求2所述的结合物,其中,R10是H或Me,并且R11a和R11b一起形成=O。
20.根据权利要求18所述的结合物,其中,R20和R21b形成它们连接的氮和碳原子之间的氮-碳双键,并且R11a是H。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201261740605P | 2012-12-21 | 2012-12-21 | |
US61/740,605 | 2012-12-21 | ||
CN201380073646.6A CN105246894A (zh) | 2012-12-21 | 2013-12-20 | 用于治疗增殖性和自身免疫疾病的非对称吡咯并苯并二氮杂卓二聚物 |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201380073646.6A Division CN105246894A (zh) | 2012-12-21 | 2013-12-20 | 用于治疗增殖性和自身免疫疾病的非对称吡咯并苯并二氮杂卓二聚物 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110627797A true CN110627797A (zh) | 2019-12-31 |
Family
ID=49958418
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201380073646.6A Pending CN105246894A (zh) | 2012-12-21 | 2013-12-20 | 用于治疗增殖性和自身免疫疾病的非对称吡咯并苯并二氮杂卓二聚物 |
CN201910920109.XA Pending CN110627797A (zh) | 2012-12-21 | 2013-12-20 | 用于治疗增殖性和自身免疫疾病的非对称吡咯并苯并二氮杂卓二聚物 |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201380073646.6A Pending CN105246894A (zh) | 2012-12-21 | 2013-12-20 | 用于治疗增殖性和自身免疫疾病的非对称吡咯并苯并二氮杂卓二聚物 |
Country Status (9)
Country | Link |
---|---|
US (1) | US9567340B2 (zh) |
EP (1) | EP2935273A1 (zh) |
JP (1) | JP6527466B2 (zh) |
CN (2) | CN105246894A (zh) |
AU (1) | AU2013366490B9 (zh) |
CA (1) | CA2894959C (zh) |
EA (1) | EA032986B1 (zh) |
HK (1) | HK1216176A1 (zh) |
WO (1) | WO2014096365A1 (zh) |
Families Citing this family (55)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9193763B2 (en) | 2007-08-17 | 2015-11-24 | Purdue Research Foundation | PSMA binding ligand-linker conjugates and methods for using |
GB0819095D0 (en) | 2008-10-17 | 2008-11-26 | Spirogen Ltd | Pyrrolobenzodiazepines |
US9951324B2 (en) | 2010-02-25 | 2018-04-24 | Purdue Research Foundation | PSMA binding ligand-linker conjugates and methods for using |
US9242013B2 (en) | 2010-04-15 | 2016-01-26 | Seattle Genetics Inc. | Targeted pyrrolobenzodiazapine conjugates |
EP2558475A1 (en) | 2010-04-15 | 2013-02-20 | Spirogen Sàrl | Pyrrolobenzodiazepines used to treat proliferative diseases |
KR101860174B1 (ko) | 2011-09-20 | 2018-05-21 | 메디뮨 리미티드 | 표적 접합체 내의 내포를 위한 비대칭 이량체 pbd 화합물로서 피롤로벤조디아제핀 |
CN110183470A (zh) | 2011-10-14 | 2019-08-30 | 麦迪穆有限责任公司 | 吡咯并苯并二氮杂卓 |
BR112014009055B1 (pt) | 2011-10-14 | 2021-12-14 | Seattle Genetics, Inc. | Compostos pirrolobenzodiazepinas, conjugados alvos, ligante de fármaco e uso dos ditos conjugados para tratar uma doença proliferativa |
US9399073B2 (en) | 2011-10-14 | 2016-07-26 | Seattle Genetics, Inc. | Pyrrolobenzodiazepines |
US9526798B2 (en) | 2011-10-14 | 2016-12-27 | Seattle Genetics, Inc. | Pyrrolobenzodiazepines and targeted conjugates |
CA2885315C (en) | 2012-10-12 | 2020-06-23 | Spirogen Sarl | Pyrrolobenzodiazepines and conjugates thereof |
WO2014057072A1 (en) | 2012-10-12 | 2014-04-17 | Spirogen Sàrl | Synthesis and intermediates of pyrrolobenzodiazepine derivatives for conjugation |
KR101819404B1 (ko) | 2012-10-12 | 2018-02-28 | 메디뮨 리미티드 | 피롤로벤조디아제핀 및 그의 컨주게이트 |
US9636413B2 (en) | 2012-11-15 | 2017-05-02 | Endocyte, Inc. | Conjugates for treating diseases caused by PSMA expressing cells |
CN110452242A (zh) | 2012-12-21 | 2019-11-15 | 麦迪穆有限责任公司 | 吡咯并苯并二氮杂卓及其结合物 |
CA2894959C (en) | 2012-12-21 | 2022-01-11 | Spirogen Sarl | Unsymmetrical pyrrolobenzodiazepines-dimers for use in the treatment of proliferative and autoimmune diseases |
BR112015023070B1 (pt) | 2013-03-13 | 2022-06-07 | Genentech, Inc. | Conjugados e compostos de pirrolobenzodiazepinas, composição farmacêutica que compreende os mesmo, bem como seus usos para o tratamento de uma doença proliferativa |
KR102066318B1 (ko) | 2013-03-13 | 2020-01-14 | 메디뮨 리미티드 | 피롤로벤조디아제핀 및 그의 컨쥬게이트 |
GB201317982D0 (en) | 2013-10-11 | 2013-11-27 | Spirogen Sarl | Pyrrolobenzodiazepines and conjugates thereof |
GB201317981D0 (en) | 2013-10-11 | 2013-11-27 | Spirogen Sarl | Pyrrolobenzodiazepines and conjugates thereof |
MX2016005013A (es) | 2013-10-18 | 2017-02-28 | Deutsches Krebsforsch | Inhibidores marcados de antigeno prostatico especifico de membrana (psma), su uso como agentes formadores de imagenes y agentes farmaceuticos para el tratamiento de cancer de prostata. |
CN106687141A (zh) | 2014-09-10 | 2017-05-17 | 麦迪穆有限责任公司 | 吡咯并苯并二氮杂卓及其缀合物 |
PT3191135T (pt) | 2014-09-12 | 2020-11-12 | Genentech Inc | Anticorpos anti-her2 e imunoconjugados |
GB201416112D0 (en) | 2014-09-12 | 2014-10-29 | Medimmune Ltd | Pyrrolobenzodiazepines and conjugates thereof |
JP6730261B2 (ja) * | 2014-09-17 | 2020-07-29 | ジェネンテック, インコーポレイテッド | 抗her2抗体を含む免疫複合体 |
CN107148285B (zh) | 2014-11-25 | 2022-01-04 | Adc治疗股份有限公司 | 吡咯并苯并二氮杂䓬-抗体缀合物 |
US10188759B2 (en) | 2015-01-07 | 2019-01-29 | Endocyte, Inc. | Conjugates for imaging |
CA2973354A1 (en) | 2015-01-14 | 2016-07-21 | Bristol-Myers Squibb Company | Benzodiazepine dimers, conjugates thereof, and methods of making and using |
CN107231804B (zh) | 2015-01-14 | 2019-11-26 | 百时美施贵宝公司 | 亚杂芳基桥连苯并二氮杂*二聚体、其缀合物及制备和使用方法 |
CA2979527A1 (en) | 2015-03-13 | 2016-09-22 | Endocyte, Inc. | Conjugates of pyrrolobenzodiazepine (pbd) prodrugs for treating disease |
EP3313854A1 (en) | 2015-06-23 | 2018-05-02 | Bristol-Myers Squibb Company | Macrocyclic benzodiazepine dimers, conjugates thereof, preparation and uses |
MA43345A (fr) | 2015-10-02 | 2018-08-08 | Hoffmann La Roche | Conjugués anticorps-médicaments de pyrrolobenzodiazépine et méthodes d'utilisation |
GB201601431D0 (en) * | 2016-01-26 | 2016-03-09 | Medimmune Ltd | Pyrrolobenzodiazepines |
GB201602359D0 (en) | 2016-02-10 | 2016-03-23 | Medimmune Ltd | Pyrrolobenzodiazepine Conjugates |
GB201602356D0 (en) | 2016-02-10 | 2016-03-23 | Medimmune Ltd | Pyrrolobenzodiazepine Conjugates |
WO2017172930A1 (en) * | 2016-03-29 | 2017-10-05 | Endocyte, Inc. | Pbd conjugates for treating diseases |
GB201607478D0 (en) | 2016-04-29 | 2016-06-15 | Medimmune Ltd | Pyrrolobenzodiazepine Conjugates |
EP3458069B1 (en) | 2016-05-18 | 2023-07-05 | Mersana Therapeutics, Inc. | Pyrrolobenzodiazepines and conjugates thereof |
US10526294B2 (en) | 2016-06-24 | 2020-01-07 | Mersana Therapeutics, Inc. | Pyrrolobenzodiazepines and conjugates thereof |
GB201617466D0 (en) | 2016-10-14 | 2016-11-30 | Medimmune Ltd | Pyrrolobenzodiazepine conjugates |
GB201702031D0 (en) | 2017-02-08 | 2017-03-22 | Medlmmune Ltd | Pyrrolobenzodiazepine-antibody conjugates |
PL3544636T3 (pl) | 2017-02-08 | 2021-12-06 | Adc Therapeutics Sa | Koniugaty pirolobenzodiazepina-przeciwciało |
RS63502B1 (sr) | 2017-04-18 | 2022-09-30 | Medimmune Ltd | Konjugati pirolobenzodiazepina |
US11318211B2 (en) * | 2017-06-14 | 2022-05-03 | Adc Therapeutics Sa | Dosage regimes for the administration of an anti-CD19 ADC |
NZ761175A (en) | 2017-08-18 | 2024-07-26 | Medimmune Ltd | Pyrrolobenzodiazepine conjugates |
GB201714115D0 (en) * | 2017-09-04 | 2017-10-18 | Femtogenix Ltd | Cytotoxic agents |
KR20200061376A (ko) | 2017-09-29 | 2020-06-02 | 다이이찌 산쿄 가부시키가이샤 | 항체-피롤로벤조디아제핀 유도체 콘쥬게이트 |
WO2019104289A1 (en) | 2017-11-27 | 2019-05-31 | Mersana Therapeutics, Inc. | Pyrrolobenzodiazepine antibody conjugates |
TW201929908A (zh) | 2017-12-21 | 2019-08-01 | 美商梅爾莎納醫療公司 | 吡咯并苯并二氮呯抗體共軛物 |
GB201803342D0 (en) | 2018-03-01 | 2018-04-18 | Medimmune Ltd | Methods |
SG11202009308WA (en) | 2018-03-28 | 2020-10-29 | Mitsubishi Tanabe Pharma Corp | DRUG CONJUGATES OF cMET MONOCLONAL BINDING AGENTS, AND USES THEREOF |
GB201806022D0 (en) | 2018-04-12 | 2018-05-30 | Medimmune Ltd | Pyrrolobenzodiazepines and conjugates thereof |
CA3112977A1 (en) | 2018-10-19 | 2020-04-23 | Medimmune Limited | Pyrrolobenzodiazepine conjugates |
US20240123081A1 (en) | 2019-10-25 | 2024-04-18 | Medimmune, Llc | Branched moiety for use in conjugates |
WO2023051814A1 (zh) * | 2021-09-30 | 2023-04-06 | 江苏恒瑞医药股份有限公司 | 吡咯并苯并二氮杂卓类衍生物及其偶联物、其制备方法及其应用 |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2011130613A1 (en) * | 2010-04-15 | 2011-10-20 | Seattle Genetics, Inc. | Targeted pyrrolobenzodiazapine conjugates |
WO2012112708A1 (en) * | 2011-02-15 | 2012-08-23 | Immunogen, Inc. | Cytotoxic benzodiazepine derivatives and methods of preparation |
Family Cites Families (327)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS58180487U (ja) | 1982-05-28 | 1983-12-02 | 松下電工株式会社 | 光線式報知器の組立体 |
US4816567A (en) | 1983-04-08 | 1989-03-28 | Genentech, Inc. | Recombinant immunoglobin preparations |
US5583024A (en) | 1985-12-02 | 1996-12-10 | The Regents Of The University Of California | Recombinant expression of Coleoptera luciferase |
JPH053790Y2 (zh) | 1988-08-03 | 1993-01-29 | ||
WO1991002536A1 (en) | 1989-08-23 | 1991-03-07 | Scripps Clinic And Research Foundation | Compositions and methods for detection and treatment of epstein-barr virus infection and immune disorders |
US5256643A (en) | 1990-05-29 | 1993-10-26 | The Government Of The United States | Human cripto protein |
WO1992007574A1 (en) | 1990-10-25 | 1992-05-14 | Tanox Biosystems, Inc. | Glycoproteins associated with membrane-bound immunoglobulins as antibody targets on b cells |
US5543503A (en) | 1991-03-29 | 1996-08-06 | Genentech Inc. | Antibodies to human IL-8 type A receptor |
US5440021A (en) | 1991-03-29 | 1995-08-08 | Chuntharapai; Anan | Antibodies to human IL-8 type B receptor |
CA2105998C (en) | 1991-03-29 | 2003-05-13 | James Lee | Human pf4a receptors and their use |
JP3050424B2 (ja) | 1991-07-12 | 2000-06-12 | 塩野義製薬株式会社 | ヒトエンドセリンリセプター |
US5264557A (en) | 1991-08-23 | 1993-11-23 | The United States Of America As Represented By The Department Of Health And Human Services | Polypeptide of a human cripto-related gene, CR-3 |
US5362852A (en) | 1991-09-27 | 1994-11-08 | Pfizer Inc. | Modified peptide derivatives conjugated at 2-hydroxyethylamine moieties |
US6011146A (en) | 1991-11-15 | 2000-01-04 | Institut Pasteur | Altered major histocompatibility complex (MHC) determinant and methods of using the determinant |
US6153408A (en) | 1991-11-15 | 2000-11-28 | Institut Pasteur And Institut National De La Sante Et De La Recherche Medicale | Altered major histocompatibility complex (MHC) determinant and methods of using the determinant |
IL107366A (en) | 1992-10-23 | 2003-03-12 | Chugai Pharmaceutical Co Ltd | Genes coding for megakaryocyte potentiator |
US5644033A (en) | 1992-12-22 | 1997-07-01 | Health Research, Inc. | Monoclonal antibodies that define a unique antigen of human B cell antigen receptor complex and methods of using same for diagnosis and treatment |
US5801005A (en) | 1993-03-17 | 1998-09-01 | University Of Washington | Immune reactivity to HER-2/neu protein for diagnosis of malignancies in which the HER-2/neu oncogene is associated |
US5869445A (en) | 1993-03-17 | 1999-02-09 | University Of Washington | Methods for eliciting or enhancing reactivity to HER-2/neu protein |
US6214345B1 (en) | 1993-05-14 | 2001-04-10 | Bristol-Myers Squibb Co. | Lysosomal enzyme-cleavable antitumor drug conjugates |
US5773223A (en) | 1993-09-02 | 1998-06-30 | Chiron Corporation | Endothelin B1, (ETB1) receptor polypeptide and its encoding nucleic acid methods, and uses thereof |
EP0647450A1 (en) | 1993-09-09 | 1995-04-12 | BEHRINGWERKE Aktiengesellschaft | Improved prodrugs for enzyme mediated activation |
US5750370A (en) | 1995-06-06 | 1998-05-12 | Human Genome Sciences, Inc. | Nucleic acid encoding human endothlein-bombesin receptor and method of producing the receptor |
JPH08336393A (ja) | 1995-04-13 | 1996-12-24 | Mitsubishi Chem Corp | 光学活性なγ−置換−β−ヒドロキシ酪酸エステルの製造法 |
US5707829A (en) | 1995-08-11 | 1998-01-13 | Genetics Institute, Inc. | DNA sequences and secreted proteins encoded thereby |
US20020193567A1 (en) | 1995-08-11 | 2002-12-19 | Genetics Institute, Inc. | Secreted proteins and polynucleotides encoding them |
JP3646191B2 (ja) | 1996-03-19 | 2005-05-11 | 大塚製薬株式会社 | ヒト遺伝子 |
US6218519B1 (en) | 1996-04-12 | 2001-04-17 | Pro-Neuron, Inc. | Compounds and methods for the selective treatment of cancer and bacterial infections |
AU722499B2 (en) | 1996-05-17 | 2000-08-03 | Schering Corporation | Human B-cell antigens, related reagents |
WO1998013059A1 (en) | 1996-09-27 | 1998-04-02 | Bristol-Myers Squibb Company | Hydrolyzable prodrugs for delivery of anticancer drugs to metastatic cells |
US6759509B1 (en) | 1996-11-05 | 2004-07-06 | Bristol-Myers Squibb Company | Branched peptide linkers |
US5945511A (en) | 1997-02-20 | 1999-08-31 | Zymogenetics, Inc. | Class II cytokine receptor |
US7033827B2 (en) | 1997-02-25 | 2006-04-25 | Corixa Corporation | Prostate-specific polynucleotide compositions |
US20030185830A1 (en) | 1997-02-25 | 2003-10-02 | Corixa Corporation | Compositions and methods for the therapy and diagnosis of prostate cancer |
US6261791B1 (en) | 1997-03-10 | 2001-07-17 | The Regents Of The University Of California | Method for diagnosing cancer using specific PSCA antibodies |
US6541212B2 (en) | 1997-03-10 | 2003-04-01 | The Regents Of The University Of California | Methods for detecting prostate stem cell antigen protein |
JP4404381B2 (ja) | 1997-03-10 | 2010-01-27 | ザ リージェンツ オブ ザ ユニバーシティ オブ カリフォルニア | Psca:前立腺幹細胞抗原 |
US6555339B1 (en) | 1997-04-14 | 2003-04-29 | Arena Pharmaceuticals, Inc. | Non-endogenous, constitutively activated human protein-coupled receptors |
US6319688B1 (en) | 1997-04-28 | 2001-11-20 | Smithkline Beecham Corporation | Polynucleotide encoding human sodium dependent phosphate transporter (IPT-1) |
WO1998051824A1 (en) | 1997-05-15 | 1998-11-19 | Abbott Laboratories | Reagents and methods useful for detecting disease of the urinary tract |
WO1998051805A1 (en) | 1997-05-15 | 1998-11-19 | Abbott Laboratories | Reagents and methods useful for detecting diseases of the prostate |
US6602677B1 (en) | 1997-09-19 | 2003-08-05 | Promega Corporation | Thermostable luciferases and methods of production |
US20030060612A1 (en) | 1997-10-28 | 2003-03-27 | Genentech, Inc. | Compositions and methods for the diagnosis and treatment of tumor |
US20020034749A1 (en) | 1997-11-18 | 2002-03-21 | Billing-Medel Patricia A. | Reagents and methods useful for detecting diseases of the breast |
US6110695A (en) | 1997-12-02 | 2000-08-29 | The Regents Of The University Of California | Modulating the interaction of the chemokine, B Lymphocyte Hemoattractant, and its Receptor, BLR1 |
JP4603157B2 (ja) | 1998-03-13 | 2010-12-22 | ザ バーナム インスティチュート | 種々の選択された器官または組織にホーミングする分子 |
CA2685270C (en) | 1998-05-13 | 2014-07-29 | Pharmexa Inc. | Expression vectors for stimulating an immune response and methods of using the same |
US20030064397A1 (en) | 1998-05-22 | 2003-04-03 | Incyte Genomics, Inc. | Transmembrane protein differentially expressed in prostate and lung tumors |
US20020187472A1 (en) | 2001-03-09 | 2002-12-12 | Preeti Lal | Steap-related protein |
US6835807B1 (en) | 1998-05-22 | 2004-12-28 | Daiichi Pharmaceuticals Co., Ltd. | Drug complex and drug delivery system |
WO2000012130A1 (en) | 1998-08-27 | 2000-03-09 | Smithkline Beecham Corporation | Rp105 agonists and antagonists |
GB9818731D0 (en) | 1998-08-27 | 1998-10-21 | Univ Portsmouth | Compounds |
GB9818732D0 (en) | 1998-08-27 | 1998-10-21 | Univ Portsmouth | Collection of compounds |
AU757510C (en) | 1998-08-27 | 2003-09-11 | Medimmune Limited | Pyrrolobenzodiazepines |
GB9818730D0 (en) | 1998-08-27 | 1998-10-21 | Univ Portsmouth | Collections of compounds |
JP4689781B2 (ja) | 1998-09-03 | 2011-05-25 | 独立行政法人科学技術振興機構 | アミノ酸輸送蛋白及びその遺伝子 |
WO2000020579A1 (en) | 1998-10-02 | 2000-04-13 | Mcmaster University | Spliced form of erbb-2/neu oncogene |
US6468546B1 (en) | 1998-12-17 | 2002-10-22 | Corixa Corporation | Compositions and methods for therapy and diagnosis of ovarian cancer |
US6962980B2 (en) | 1999-09-24 | 2005-11-08 | Corixa Corporation | Compositions and methods for the therapy and diagnosis of ovarian cancer |
US20020119158A1 (en) | 1998-12-17 | 2002-08-29 | Corixa Corporation | Compositions and methods for the therapy and diagnosis of ovarian cancer |
US20030091580A1 (en) | 2001-06-18 | 2003-05-15 | Mitcham Jennifer L. | Compositions and methods for the therapy and diagnosis of ovarian cancer |
US6858710B2 (en) | 1998-12-17 | 2005-02-22 | Corixa Corporation | Compositions and methods for the therapy and diagnosis of ovarian cancer |
US20030187196A1 (en) | 1998-12-30 | 2003-10-02 | Genentech, Inc. | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
WO2000040614A2 (en) | 1998-12-30 | 2000-07-13 | Beth Israel Deaconess Medical Center, Inc. | Characterization of the soc/crac calcium channel protein family |
CN1201004C (zh) | 1999-01-29 | 2005-05-11 | 考丽克萨有限公司 | HER-2/neu融合蛋白 |
GB9905124D0 (en) | 1999-03-05 | 1999-04-28 | Smithkline Beecham Biolog | Novel compounds |
AU3395900A (en) | 1999-03-12 | 2000-10-04 | Human Genome Sciences, Inc. | Human lung cancer associated gene sequences and polypeptides |
US7312303B2 (en) | 1999-05-11 | 2007-12-25 | Genentech, Inc. | Anti-PRO4980 antibodies |
US6268488B1 (en) | 1999-05-25 | 2001-07-31 | Barbas, Iii Carlos F. | Prodrug activation using catalytic antibodies |
AU4952600A (en) | 1999-06-03 | 2000-12-28 | Takeda Chemical Industries Ltd. | Screening method with the use of cd100 |
BRPI0012196B8 (pt) | 1999-06-25 | 2021-05-25 | Genentech Inc | artigo industrializado |
US20030119113A1 (en) | 1999-07-20 | 2003-06-26 | Genentech, Inc. | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
US7297770B2 (en) | 1999-08-10 | 2007-11-20 | Genentech, Inc. | PRO6496 polypeptides |
US7294696B2 (en) | 1999-08-17 | 2007-11-13 | Genentech Inc. | PRO7168 polypeptides |
US6909006B1 (en) | 1999-08-27 | 2005-06-21 | Spirogen Limited | Cyclopropylindole derivatives |
CA2380355A1 (en) | 1999-09-01 | 2001-03-08 | Genentech, Inc. | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
US20030206918A1 (en) | 1999-09-10 | 2003-11-06 | Corixa Corporation | Compositions and methods for the therapy and diagnosis of ovarian cancer |
US20030129192A1 (en) | 1999-09-10 | 2003-07-10 | Corixa Corporation | Compositions and methods for the therapy and diagnosis of ovarian cancer |
US20030232056A1 (en) | 1999-09-10 | 2003-12-18 | Corixa Corporation | Compositions and methods for the therapy and diagnosis of ovarian cancer |
DK1226177T3 (da) | 1999-10-29 | 2008-10-06 | Genentech Inc | Antistofsammensætninger mod anti-prostata stamcelle-antigen (PSCA) og anvendelser deraf |
CA2393126C (en) | 1999-11-29 | 2016-05-24 | The Trustees Of Columbia University In The City Of New York | Isolation of five novel genes coding for new fc receptors-type melanoma involved in the pathogenesis of lymphoma/melanoma |
AU1807401A (en) | 1999-11-30 | 2001-06-12 | Corixa Corporation | Compositions and methods for therapy and diagnosis of breast cancer |
EP1568373A3 (en) | 1999-12-10 | 2005-12-21 | Epimmune Inc. | Inducing cellular immune responses to HER2/neu using peptide and nucleic acid compositions |
NZ502058A (en) | 1999-12-23 | 2003-11-28 | Ovita Ltd | Isolated mutated nucleic acid molecule for regulation of ovulation rate |
US6610286B2 (en) | 1999-12-23 | 2003-08-26 | Zymogenetics, Inc. | Method for treating inflammation using soluble receptors to interleukin-20 |
DE60030769T2 (de) | 1999-12-23 | 2007-10-25 | Zymogenetics, Inc., Seattle | Verfahren zur behandlung von entzündungen |
JP4741139B2 (ja) | 1999-12-23 | 2011-08-03 | ザイモジェネティクス, インコーポレイテッド | 可溶性インターロイキン−20レセプター |
US20040001827A1 (en) | 2002-06-28 | 2004-01-01 | Dennis Mark S. | Serum albumin binding peptides for tumor targeting |
PT1240337E (pt) | 1999-12-24 | 2007-01-31 | Genentech Inc | Métodos e composições para prolongar as meias-vidas de eliminação de compostos bioactivos |
US7294695B2 (en) | 2000-01-20 | 2007-11-13 | Genentech, Inc. | PRO10268 polypeptides |
US20030224379A1 (en) | 2000-01-21 | 2003-12-04 | Tang Y. Tom | Novel nucleic acids and polypeptides |
WO2001053463A2 (en) | 2000-01-21 | 2001-07-26 | Corixa Corporation | COMPOUNDS AND METHODS FOR PREVENTION AND TREATMENT OF HER-2/neu ASSOCIATED MALIGNANCIES |
AU2001243142A1 (en) | 2000-02-03 | 2001-08-14 | Hyseq, Inc. | Novel nucleic acids and polypeptides |
US20030219806A1 (en) | 2000-02-22 | 2003-11-27 | Millennium Pharmaceuticals, Inc. | Novel 18607, 15603, 69318, 12303, 48000, 52920, 5433, 38554, 57301, 58324, 55063, 52991, 59914, 59921 and 33751 molecules and uses therefor |
US20020142377A1 (en) | 2000-02-22 | 2002-10-03 | Glucksmann Maria Alexandra | 18607, a novel human calcium channel |
US20040052793A1 (en) | 2001-02-22 | 2004-03-18 | Carter Paul J. | Caspase activivated prodrugs therapy |
US20040002068A1 (en) | 2000-03-01 | 2004-01-01 | Corixa Corporation | Compositions and methods for the detection, diagnosis and therapy of hematological malignancies |
US20040005561A1 (en) | 2000-03-01 | 2004-01-08 | Corixa Corporation | Compositions and methods for the detection, diagnosis and therapy of hematological malignancies |
EP1261743A2 (en) | 2000-03-07 | 2002-12-04 | Hyseq, Inc. | Novel nucleic acids and polypeptides |
AU2001249411B2 (en) | 2000-03-24 | 2007-02-15 | Fahri Saatcioglu | Novel prostate-specific or testis-specific nucleic acid molecules, polypeptides,and diagnostic and therapeutic methods |
WO2004043361A2 (en) | 2002-11-08 | 2004-05-27 | Genentech, Inc. | Compositions and methods for the treatment of natural killer cell related diseases |
AU2001250412A1 (en) | 2000-03-31 | 2001-10-08 | Ipf Pharmaceuticals Gmbh | Diagnostic and medicament for analysing the cell surface proteome of tumour and inflammatory cells and for treating tumorous and inflammatory diseases, preferably using specific chemokine receptor analysis and the chemokine receptor-ligand interaction |
WO2001075177A2 (en) | 2000-04-03 | 2001-10-11 | The Government Of The United States Of America, As Represented By The Secretary, Department Of Health And Human Services | Tumor markers in ovarian cancer |
JP2003532057A (ja) | 2000-04-07 | 2003-10-28 | アリーナ・フアーマシユーチカルズ・インコーポレーテツド | 非内因性の、構成的に活性化される既知gタンパク質−共役受容体 |
US20030119115A1 (en) | 2000-05-17 | 2003-06-26 | Genentech, Inc. | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
WO2001088133A2 (en) | 2000-05-18 | 2001-11-22 | Lexicon Genetics Incorporated | Human semaphorin homologs and polynucleotides encoding the same |
WO2001090304A2 (en) | 2000-05-19 | 2001-11-29 | Human Genome Sciences, Inc. | Nucleic acids, proteins, and antibodies |
WO2001094641A2 (en) | 2000-06-09 | 2001-12-13 | Idec Pharmaceuticals Corporation | Gene targets and ligands that bind thereto for treatment and diagnosis of ovarian carcinomas |
EP1297130A2 (en) | 2000-06-16 | 2003-04-02 | Incyte Genomics, Inc. | G-protein coupled receptors |
MXPA02012749A (es) | 2000-06-30 | 2003-10-06 | Amgen Inc | Moleculas semejantes a b7 y sus usos. |
WO2002002587A1 (en) | 2000-06-30 | 2002-01-10 | Human Genome Sciences, Inc. | B7-like polynucleotides, polypeptides, and antibodies |
WO2002006339A2 (en) | 2000-07-03 | 2002-01-24 | Curagen Corporation | Proteins and nucleic acids encoding same |
US20040044179A1 (en) | 2000-07-25 | 2004-03-04 | Genentech, Inc. | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
AU2001283507A1 (en) | 2000-07-27 | 2002-02-13 | Mayo Foundation For Medical Education And Research | B7-h3 and b7-h4, novel immunoregulatory molecules |
ATE396266T1 (de) | 2000-07-28 | 2008-06-15 | Ulrich Wissenbach | Trp8 krebsmarker |
US7229623B1 (en) | 2000-08-03 | 2007-06-12 | Corixa Corporation | Her-2/neu fusion proteins |
WO2002014503A2 (en) | 2000-08-14 | 2002-02-21 | Corixa Corporation | Compositions and methods for the therapy and diagnosis of her-2/neu-associated malignancies |
WO2002013847A2 (en) | 2000-08-14 | 2002-02-21 | Corixa Corporation | Methods for diagnosis and therapy of hematological and virus-associated malignancies |
EP1445318A2 (en) | 2000-08-24 | 2004-08-11 | Genetech, Inc. | Compositions and methods for the diagnosis and treatment of tumor |
GB0020953D0 (en) | 2000-08-24 | 2000-10-11 | Smithkline Beecham Biolog | Vaccine |
EP1346040A2 (en) | 2000-09-11 | 2003-09-24 | Nuvelo, Inc. | Novel nucleic acids and polypeptides |
EP1803733B1 (en) | 2000-09-15 | 2010-03-10 | ZymoGenetics, Inc. | Polypeptides comprising the extracellular domains of IL-20RA and/or IL-20RB |
US6613567B1 (en) | 2000-09-15 | 2003-09-02 | Isis Pharmaceuticals, Inc. | Antisense inhibition of Her-2 expression |
US7582293B2 (en) | 2000-09-15 | 2009-09-01 | Genentech, Inc. | Anti-PRO6308 antibodies |
UA83458C2 (uk) | 2000-09-18 | 2008-07-25 | Байоджен Айдек Ма Інк. | Виділений поліпептид baff-r (рецептор фактора активації в-клітин сімейства tnf) |
AU2001292724B2 (en) | 2000-09-18 | 2005-05-26 | Biogen Idec Ma Inc. | CRIPTO mutant and uses thereof |
ES2254492T3 (es) | 2000-09-19 | 2006-06-16 | Moses Lee | Composiciones y procedimientos de uso de analogos aquirales de cc-1065 y de duocarmicinas. |
WO2002030268A2 (en) | 2000-10-13 | 2002-04-18 | Eos Biotechnology, Inc. | Methods of diagnosis of prostate cancer, compositions and methods of screening for modulators of prostate cancer |
CA2428242A1 (en) | 2000-11-07 | 2002-05-16 | Zymogenetics, Inc. | Human tumor necrosis factor receptor |
US20020150573A1 (en) | 2000-11-10 | 2002-10-17 | The Rockefeller University | Anti-Igalpha-Igbeta antibody for lymphoma therapy |
US20040018194A1 (en) | 2000-11-28 | 2004-01-29 | Francisco Joseph A. | Recombinant anti-CD30 antibodies and uses thereof |
WO2002061087A2 (en) | 2000-12-19 | 2002-08-08 | Lifespan Biosciences, Inc. | Antigenic peptides, such as for g protein-coupled receptors (gpcrs), antibodies thereto, and systems for identifying such antigenic peptides |
WO2002054940A2 (en) | 2001-01-12 | 2002-07-18 | University Of Medicine & Dentistry Of New Jersey | Bone morphogenetic protein-2 in the treatment and diagnosis of cancer |
US20030119119A1 (en) | 2001-01-16 | 2003-06-26 | Genentech, Inc. | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
US20030119126A1 (en) | 2001-01-16 | 2003-06-26 | Genentech, Inc. | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
US7754208B2 (en) | 2001-01-17 | 2010-07-13 | Trubion Pharmaceuticals, Inc. | Binding domain-immunoglobulin fusion proteins |
WO2002059377A2 (en) | 2001-01-24 | 2002-08-01 | Protein Design Labs | Methods of diagnosis of breast cancer, compositions and methods of screening for modulators of breast cancer |
US20030073144A1 (en) | 2001-01-30 | 2003-04-17 | Corixa Corporation | Compositions and methods for the therapy and diagnosis of pancreatic cancer |
WO2002064775A1 (en) | 2001-02-12 | 2002-08-22 | Bionomics Limited | Identification of genes involved in the tumourigenic process |
WO2002071928A2 (en) | 2001-03-14 | 2002-09-19 | Millennium Pharmaceuticals, Inc. | Nucleic acid molecules and proteins for the identification, assessment, prevention, and therapy of ovarian cancer |
EP1243276A1 (en) | 2001-03-23 | 2002-09-25 | Franciscus Marinus Hendrikus De Groot | Elongated and multiple spacers containing activatible prodrugs |
WO2002078524A2 (en) | 2001-03-28 | 2002-10-10 | Zycos Inc. | Translational profiling |
US6362331B1 (en) | 2001-03-30 | 2002-03-26 | Council Of Scientific And Industrial Research | Process for the preparation of antitumor agents |
WO2003008537A2 (en) | 2001-04-06 | 2003-01-30 | Mannkind Corporation | Epitope sequences |
US6820011B2 (en) | 2001-04-11 | 2004-11-16 | The Regents Of The University Of Colorado | Three-dimensional structure of complement receptor type 2 and uses thereof |
AU2002254615A1 (en) | 2001-04-17 | 2002-10-28 | The Board Of Trustees Of The University Of Arkansas | Repeat sequences of the ca125 gene and their use for diagnostic and therapeutic interventions |
EP1463928A2 (en) | 2001-04-18 | 2004-10-06 | Protein Design Labs | Methods of diagnosis of lung cancer, compositions and methods of screening for modulators of lung cancer |
EE200300528A (et) | 2001-04-26 | 2004-08-16 | Biogen, Inc. | Cripto valku blokeerivad antikehad ja nende kasutamine |
WO2003083041A2 (en) | 2002-03-22 | 2003-10-09 | Biogen, Inc. | Cripto-specific antibodies |
US6884869B2 (en) | 2001-04-30 | 2005-04-26 | Seattle Genetics, Inc. | Pentapeptide compounds and uses related thereto |
WO2002089747A2 (en) | 2001-05-09 | 2002-11-14 | Corixa Corporation | Compositions and methods for the therapy and diagnosis of prostate cancer |
WO2002092836A2 (en) | 2001-05-11 | 2002-11-21 | Sloan-Kettering Institute For Cancer Research | Nucleic acid sequence encoding ovarian antigen, ca125, and uses thereof |
ATE542545T1 (de) | 2001-05-24 | 2012-02-15 | Zymogenetics Inc | Taci-immunoglobulin-fusionsproteine |
US7157558B2 (en) | 2001-06-01 | 2007-01-02 | Genentech, Inc. | Polypeptide encoded by a polynucleotide overexpresses in tumors |
AU2002314901A1 (en) | 2001-06-04 | 2002-12-16 | Eos Biotechnology, Inc. | Methods of diagnosis and treatment of androgen-dependent prostate cancer, prostate cancer undergoing androgen-withdrawal, and androgen-independent prostate cancer |
WO2003000842A2 (en) | 2001-06-04 | 2003-01-03 | Curagen Corporation | Novel proteins and nucleic acids encoding same |
EP1402053A4 (en) | 2001-06-05 | 2005-05-11 | Exelixis Inc | CHDS AS MODULATORS OF THE MECHANISM OF ACTION OF P53 AND USES THEREOF |
JP2005501528A (ja) | 2001-06-05 | 2005-01-20 | エクセリクシス・インコーポレイテッド | p53経路のモディファイヤーとしてのGFATsおよび使用方法 |
US7235358B2 (en) | 2001-06-08 | 2007-06-26 | Expression Diagnostics, Inc. | Methods and compositions for diagnosing and monitoring transplant rejection |
US7125663B2 (en) | 2001-06-13 | 2006-10-24 | Millenium Pharmaceuticals, Inc. | Genes, compositions, kits and methods for identification, assessment, prevention, and therapy of cervical cancer |
AU2002347428A1 (en) | 2001-06-18 | 2003-01-02 | Eos Biotechnology Inc. | Methods of diagnosis of ovarian cancer, compositions and methods of screening for modulators of ovarian cancer |
US7189507B2 (en) | 2001-06-18 | 2007-03-13 | Pdl Biopharma, Inc. | Methods of diagnosis of ovarian cancer, compositions and methods of screening for modulators of ovarian cancer |
US7705120B2 (en) | 2001-06-21 | 2010-04-27 | Millennium Pharmaceuticals, Inc. | Compositions, kits, and methods for identification, assessment, prevention, and therapy of breast cancer |
WO2003002717A2 (en) | 2001-06-28 | 2003-01-09 | Schering Corporation | Biological activity of ak155 |
WO2003004529A2 (en) | 2001-07-02 | 2003-01-16 | Licentia Ltd. | Ephrin-tie receptor materials and methods |
US20040076955A1 (en) | 2001-07-03 | 2004-04-22 | Eos Biotechnology, Inc. | Methods of diagnosis of bladder cancer, compositions and methods of screening for modulators of bladder cancer |
WO2003003984A2 (en) | 2001-07-05 | 2003-01-16 | Curagen Corporation | Novel proteins and nucleic acids encoding same |
WO2003055439A2 (en) | 2001-07-18 | 2003-07-10 | The Regents Of The University Of California | Her2/neu target antigen and use of same to stimulate an immune response |
US20030108963A1 (en) | 2001-07-25 | 2003-06-12 | Millennium Pharmaceuticals, Inc. | Novel genes, compositions, kit, and methods for identification, assessment, prevention and therapy of prostate cancer |
HUP0500992A3 (en) | 2001-08-03 | 2007-11-28 | Genentech Inc | Tacis and br3 polypeptides and uses thereof |
EP1478772A2 (en) | 2001-08-14 | 2004-11-24 | The General Hospital Corporation | Nucleic acid and amino acid sequences involved in pain |
US20030092013A1 (en) | 2001-08-16 | 2003-05-15 | Vitivity, Inc. | Diagnosis and treatment of vascular disease |
AU2002313559A1 (en) | 2001-08-23 | 2003-03-10 | Oxford Biomedica (Uk) Limited | Genes |
US6902930B2 (en) | 2001-08-29 | 2005-06-07 | Vanderbilt University | Human Mob-5 (IL-24) receptors and uses thereof |
US20030124579A1 (en) | 2001-09-05 | 2003-07-03 | Eos Biotechnology, Inc. | Methods of diagnosis of ovarian cancer, compositions and methods of screening for modulators of ovarian cancer |
WO2003022995A2 (en) | 2001-09-06 | 2003-03-20 | Agensys, Inc. | Nucleic acid and corresponding protein entitled steap-1 useful in treatment and detection of cancer |
CA2459219A1 (en) | 2001-09-17 | 2003-03-27 | Protein Design Labs, Inc. | Methods of diagnosis of cancer compositions and methods of screening for modulators of cancer |
IL160933A0 (en) | 2001-09-18 | 2004-08-31 | Proteologics Inc | Methods and compositions for treating ?cap associated diseases |
NZ531674A (en) | 2001-09-18 | 2009-03-31 | Genentech Inc | Compositions and methods for the diagnosis and treatment of tumor |
WO2003025148A2 (en) | 2001-09-19 | 2003-03-27 | Nuvelo, Inc. | Novel nucleic acids and polypeptides |
WO2003026577A2 (en) | 2001-09-24 | 2003-04-03 | Seattle Genetics, Inc. | P-amidobenzylethers in drug delivery agents |
US7091186B2 (en) | 2001-09-24 | 2006-08-15 | Seattle Genetics, Inc. | p-Amidobenzylethers in drug delivery agents |
US20030077644A1 (en) | 2001-09-28 | 2003-04-24 | Bing Yang | Diagnosis and treatment of diseases caused by mutations in CD72 |
WO2003029421A2 (en) | 2001-10-03 | 2003-04-10 | Origene Technologies, Inc. | Regulated breast cancer genes |
US20050130117A1 (en) | 2001-10-03 | 2005-06-16 | Davis Cong L. | Modulators of lymphocyte activation and migration |
US20050123925A1 (en) | 2002-11-15 | 2005-06-09 | Genentech, Inc. | Compositions and methods for the diagnosis and treatment of tumor |
JP2005522986A (ja) | 2001-10-19 | 2005-08-04 | ジェネンテック・インコーポレーテッド | 炎症性腸疾患の診断と治療のための組成物と方法 |
EP1525213B1 (en) | 2001-10-24 | 2012-06-27 | National Jewish Health | Three-dimensional structures of tall-1 and its cognate receptors and modified proteins and methods related thereto |
KR101072867B1 (ko) | 2001-10-31 | 2011-10-17 | 유니버시티 오브 노스 텍사스 헬스 사이언스 센터 | 뼈 형태발생 단백질(bmp), bmp 수용체 및 bmp 결합 단백질 및 녹내장 진단 및 치료에서 그의 용도 |
WO2003042661A2 (en) | 2001-11-13 | 2003-05-22 | Protein Design Labs, Inc. | Methods of diagnosis of cancer, compositions and methods of screening for modulators of cancer |
US20030232350A1 (en) | 2001-11-13 | 2003-12-18 | Eos Biotechnology, Inc. | Methods of diagnosis of cancer, compositions and methods of screening for modulators of cancer |
CA2467242A1 (en) | 2001-11-20 | 2003-05-30 | Seattle Genetics, Inc. | Treatment of immunological disorders using anti-cd30 antibodies |
US7344843B2 (en) | 2001-11-29 | 2008-03-18 | Serono Genetics Institute S.A. | Agonists and antagonists of prolixin for the treatment of metabolic disorders |
AU2002349784A1 (en) | 2001-12-03 | 2003-06-17 | Asahi Kasei Pharma Corporation | Nf-kappab activating genes |
WO2003054152A2 (en) | 2001-12-10 | 2003-07-03 | Nuvelo, Inc. | Novel nucleic acids and polypeptides |
US20030134790A1 (en) | 2002-01-11 | 2003-07-17 | University Of Medicine And Dentistry Of New Jersey | Bone Morphogenetic Protein-2 And Bone Morphogenetic Protein-4 In The Treatment And Diagnosis Of Cancer |
US7452675B2 (en) | 2002-01-25 | 2008-11-18 | The Queen's Medical Center | Methods of screening for TRPM4b modulators |
WO2003072036A2 (en) | 2002-02-21 | 2003-09-04 | Duke University | Treatment methods using anti-cd22 antibodies |
US20040258678A1 (en) | 2002-02-22 | 2004-12-23 | Genentech, Inc. | Compositions and methods for the treatment of immune related diseases |
US20030219795A1 (en) | 2002-03-01 | 2003-11-27 | Marcia Belvin | SCDs as modifiers of the p53 pathway and methods of use |
EP2258712A3 (en) | 2002-03-15 | 2011-05-04 | Multicell Immunotherapeutics, Inc. | Compositions and Methods to Initiate or Enhance Antibody and Major-histocompatibility Class I or Class II-restricted T Cell Responses by Using Immunomodulatory, Non-coding RNA Motifs |
CA2486490A1 (en) | 2002-03-19 | 2003-12-31 | Curagen Corporation | Therapeutic polypeptides, nucleic acids encoding same, and methods of use |
EP1579186A4 (en) | 2002-03-21 | 2008-10-08 | Sunesis Pharmaceuticals Inc | IDENTIFICATION OF KINASE INHIBITORS |
US7193069B2 (en) | 2002-03-22 | 2007-03-20 | Research Association For Biotechnology | Full-length cDNA |
CA2480404A1 (en) | 2002-03-25 | 2003-10-30 | Uab Research Foundation | Fc receptor homolog, reagents, and uses thereof |
WO2003083074A2 (en) | 2002-03-28 | 2003-10-09 | Idec Pharmaceuticals Corporation | Novel gene targets and ligands that bind thereto for treatment and diagnosis of colon carcinomas |
US20030194704A1 (en) | 2002-04-03 | 2003-10-16 | Penn Sharron Gaynor | Human genome-derived single exon nucleic acid probes useful for gene expression analysis two |
JP2005534287A (ja) | 2002-04-05 | 2005-11-17 | アジェンシス, インコーポレイテッド | 癌の処置および検出において有用な98p4b6との名称の核酸および対応するタンパク質 |
AU2003223520A1 (en) | 2002-04-12 | 2003-10-27 | Mitokor | Targets for therapeutic intervention identified in the mitochondrial proteome |
AU2003230874A1 (en) | 2002-04-16 | 2003-11-03 | Genentech, Inc. | Compositions and methods for the diagnosis and treatment of tumor |
AU2003239158A1 (en) | 2002-04-17 | 2003-11-03 | Baylor College Of Medicine | Aib1 as a prognostic marker and predictor of resistance to encocrine therapy |
WO2003093444A2 (en) | 2002-05-03 | 2003-11-13 | Incyte Corporation | Transporters and ion channels |
EP1572925A4 (en) | 2002-05-15 | 2007-08-15 | Avalon Pharmaceuticals | CANCER-RELATED GENE AS A TARGET FOR CHEMOTHERAPY |
AU2003232453A1 (en) | 2002-05-30 | 2003-12-19 | David K. Bol | Human solute carrier family 7 member 11 (hslc7a11) |
AU2003239969A1 (en) | 2002-06-04 | 2003-12-19 | Avalon Pharmaceuticals, Inc. | Cancer-linked gene as target for chemotherapy |
WO2003101283A2 (en) | 2002-06-04 | 2003-12-11 | Incyte Corporation | Diagnostics markers for lung cancer |
AU2003242633A1 (en) | 2002-06-06 | 2003-12-22 | Molecular Engines Laboratories | Dudulin genes, non-human animal model: uses in human hematological disease |
KR101031025B1 (ko) | 2002-06-06 | 2011-04-25 | 온코세라피 사이언스 가부시키가이샤 | 사람 결장암에 관계하는 유전자 및 폴리펩티드 |
EP1576111A4 (en) | 2002-06-07 | 2006-10-18 | Avalon Pharmaceuticals | CANCER-ASSOCIATED GENE AS A TARGET FOR CHEMOTHERAPY |
AU2003245441A1 (en) | 2002-06-12 | 2003-12-31 | Avalon Pharmaceuticals, Inc. | Cancer-linked gene as target for chemotherapy |
WO2003106659A2 (en) | 2002-06-18 | 2003-12-24 | Archemix Corp. | Aptamer-toxin molecules and methods for using same |
US20040249130A1 (en) | 2002-06-18 | 2004-12-09 | Martin Stanton | Aptamer-toxin molecules and methods for using same |
US7304149B2 (en) | 2002-06-20 | 2007-12-04 | Washington University In St. Louis | BTLA nucleic acids |
WO2004001004A2 (en) | 2002-06-21 | 2003-12-31 | Johns Hopkins University School Of Medicine | Membrane associated tumor endothelium markers |
WO2004009622A2 (en) | 2002-07-19 | 2004-01-29 | Cellzome Ag | Protein complexes of cellular networks underlying the development of cancer and other diseases |
US20050043516A1 (en) | 2002-07-25 | 2005-02-24 | Genentech, Inc. | TACI antibodies and uses thereof |
US20050180972A1 (en) | 2002-07-31 | 2005-08-18 | Wahl Alan F. | Anti-CD20 antibody-drug conjugates for the treatment of cancer and immune disorders |
AU2003251471A1 (en) | 2002-08-06 | 2004-02-25 | Bayer Healthcare Ag | Diagnostics and therapeutics for diseases associated with human cxc chemokine receptor 5(cxcr5) |
JP2004121218A (ja) | 2002-08-06 | 2004-04-22 | Jenokkusu Soyaku Kenkyusho:Kk | 気管支喘息または慢性閉塞性肺疾患の検査方法 |
KR20050048615A (ko) | 2002-08-19 | 2005-05-24 | 제넨테크, 인크. | 종양의 진단 및 치료를 위한 조성물 및 방법 |
JP2006515742A (ja) | 2002-08-27 | 2006-06-08 | ブリストル−マイヤーズ スクイブ カンパニー | 乳癌細胞でのプロテインチロシンキナーゼおよび/またはプロテインチロシンキナーゼ経路と相互作用および/またはモデュレートする化合物の活性を予測するためのポリヌクレオチドの同定 |
WO2004020595A2 (en) | 2002-08-29 | 2004-03-11 | Five Prime Therapeutics, Inc. | Novel human polypeptides encoded by polynucleotides |
WO2004019993A1 (en) | 2002-08-30 | 2004-03-11 | Ramot At Tel Aviv University Ltd. | Self-immolative dendrimers releasing many active moieties upon a single activating event |
AU2002951346A0 (en) | 2002-09-05 | 2002-09-26 | Garvan Institute Of Medical Research | Diagnosis of ovarian cancer |
CA2496888A1 (en) | 2002-09-06 | 2004-03-18 | Mannkind Corporation | Epitope sequences |
AU2003300776A1 (en) | 2002-09-09 | 2004-05-25 | Omeros Corporation | G protein coupled receptors and uses thereof |
JP2004113151A (ja) | 2002-09-27 | 2004-04-15 | Sankyo Co Ltd | 癌遺伝子及びその用途 |
CA2500978A1 (en) | 2002-10-03 | 2004-04-15 | Mcgill University | Antibodies and cyclic peptides which bind cea (carcinoembryonic antigen) and their use as cancer therapeutics |
US20060183120A1 (en) | 2002-10-04 | 2006-08-17 | Teh Bin T | Molecular sub-classification of kidney tumors and the discovery of new diagnostic markers |
US20040146907A1 (en) | 2002-11-13 | 2004-07-29 | Genentech, Inc. | Methods and compositions for detecting dysplasia |
JP2006507322A (ja) | 2002-11-14 | 2006-03-02 | シンタルガ・ビーブイ | 多重自己脱離放出スペーサーとして構築されたプロドラッグ |
GB0226593D0 (en) | 2002-11-14 | 2002-12-24 | Consultants Ltd | Compounds |
ES2392511T3 (es) | 2002-11-15 | 2012-12-11 | Musc Foundation For Research Development | Moduladores de complemento dianas sobre el receptor 2 de complemento |
CA2505919A1 (en) | 2002-11-15 | 2004-06-03 | The Board Of Trustees Of The University Of Arkansas | Ca125 gene and its use for diagnostic and therapeutic interventions |
AU2003297300A1 (en) | 2002-11-20 | 2004-06-15 | Biogen Idec Inc. | Novel gene targets and ligands that bind thereto for treatment and diagnosis of carcinomas |
AU2003294462C1 (en) | 2002-11-21 | 2011-06-30 | University Of Utah Research Foundation | Purinergic modulation of smell |
US20040253606A1 (en) | 2002-11-26 | 2004-12-16 | Protein Design Labs, Inc. | Methods of detecting soft tissue sarcoma, compositions and methods of screening for soft tissue sarcoma modulators |
EP1581641A4 (en) | 2002-12-06 | 2006-11-15 | Diadexus Inc | COMPOSITIONS, SPREADING VARIATIONS AND METHODS RELATED TO OVARAL SPECIFIC GENES AND PROTEINS |
US20040157278A1 (en) | 2002-12-13 | 2004-08-12 | Bayer Corporation | Detection methods using TIMP 1 |
ES2388280T3 (es) | 2002-12-20 | 2012-10-11 | Abbott Biotherapeutics Corp. | Anticuerpos que reaccionan frente a GPR64 y utilización de los mismos |
WO2004058309A1 (en) | 2002-12-23 | 2004-07-15 | Human Genome Sciences, Inc. | Neutrokine-alpha conjugate, neutrokine-alpha complex, and uses thereof |
WO2004063709A2 (en) | 2003-01-08 | 2004-07-29 | Bristol-Myers Squibb Company | Biomarkers and methods for determining sensitivity to epidermal growth factor receptor modulators |
US20050181375A1 (en) | 2003-01-10 | 2005-08-18 | Natasha Aziz | Novel methods of diagnosis of metastatic cancer, compositions and methods of screening for modulators of metastatic cancer |
US20050227301A1 (en) | 2003-01-10 | 2005-10-13 | Polgen | Cell cycle progression proteins |
WO2004065577A2 (en) | 2003-01-14 | 2004-08-05 | Bristol-Myers Squibb Company | Polynucleotides and polypeptides associated with the nf-kb pathway |
US7258971B2 (en) | 2003-01-15 | 2007-08-21 | Bayer Healthcare Ag | Methods and compositions for treating urological disorders using carboxypeptidase Z identified as 8263 |
CA2516128A1 (en) | 2003-02-14 | 2004-09-02 | Sagres Discovery, Inc. | Therapeutic targets in cancer |
US20030224411A1 (en) | 2003-03-13 | 2003-12-04 | Stanton Lawrence W. | Genes that are up- or down-regulated during differentiation of human embryonic stem cells |
ATE421967T1 (de) | 2003-03-31 | 2009-02-15 | Council Scient Ind Res | Nichtvernetzende pyrroloä2,1-cüä1, 4übenzodiazepine als potentielle antitumor- agentien und ihre herstellung |
GB0321295D0 (en) | 2003-09-11 | 2003-10-15 | Spirogen Ltd | Synthesis of protected pyrrolobenzodiazepines |
GB0416511D0 (en) | 2003-10-22 | 2004-08-25 | Spirogen Ltd | Pyrrolobenzodiazepines |
JP5064037B2 (ja) | 2004-02-23 | 2012-10-31 | ジェネンテック, インコーポレイテッド | 複素環式自壊的リンカーおよび結合体 |
GB0404574D0 (en) | 2004-03-01 | 2004-04-07 | Spirogen Ltd | Amino acids |
GB0404577D0 (en) | 2004-03-01 | 2004-04-07 | Spirogen Ltd | Pyrrolobenzodiazepines |
GB0404578D0 (en) | 2004-03-01 | 2004-04-07 | Spirogen Ltd | Pyrrolobenzodiazepines |
CA2558195C (en) | 2004-03-01 | 2012-11-06 | Spirogen Limited | 11-hydroxy-5h-pyrrolo[2,1-c][1,4]benzodiazepin-5-one derivatives as key intermediates for the preparation of c2 substituted pyrrolobenzodiazepines |
EP1723152B1 (en) | 2004-03-09 | 2015-02-11 | Spirogen Sàrl | Pyrrolobenzodiazepines |
FR2869231B1 (fr) | 2004-04-27 | 2008-03-14 | Sod Conseils Rech Applic | Composition therapeutique contenant au moins un derive de la pyrrolobenzodiazepine et la fludarabine |
GB0410725D0 (en) | 2004-05-13 | 2004-06-16 | Spirogen Ltd | Pyrrolobenzodiazepine therapeutic agents |
CA2580141C (en) | 2004-09-23 | 2013-12-10 | Genentech, Inc. | Cysteine engineered antibodies and conjugates |
CA2604805C (en) | 2005-04-21 | 2014-05-27 | Spirogen Limited | Pyrrolobenzodiazepines |
GB0508084D0 (en) * | 2005-04-21 | 2005-06-01 | Spirogen Ltd | Pyrrolobenzodiazepines |
WO2007039752A1 (en) | 2005-10-05 | 2007-04-12 | Spirogen Limited | Alkyl 4- [4- (5-oxo-2, 3, 5, 11a-tetrahyd0-5h-pyrr0l0 [2, 1-c] [1, 4] benzodiazepine-8-yloxy) -butyrylamino]-1h-pyrrole-2-carboxylate derivatives and related compounds for the treatment of a proliferative disease |
US20070154906A1 (en) | 2005-10-05 | 2007-07-05 | Spirogen Ltd. | Methods to identify therapeutic candidates |
EP1934174B1 (en) | 2005-10-07 | 2011-04-06 | Exelixis, Inc. | Azetidines as mek inhibitors for the treatment of proliferative diseases |
ATE527262T1 (de) | 2006-01-25 | 2011-10-15 | Sanofi Sa | Neue tomaymycin derivate enhaltende zytotoxische mittel |
WO2008050140A2 (en) | 2006-10-27 | 2008-05-02 | Spirogen Limited | Compounds for treatment of parasitic infection |
PE20090245A1 (es) | 2007-05-08 | 2009-03-17 | Genentech Inc | Anticuerpos anti-muc16 disenados con cisteina y conjugados de anticuerpos y farmacos |
KR101622412B1 (ko) | 2007-10-19 | 2016-05-18 | 제넨테크, 인크. | 시스테인 조작된 항-tenb2 항체 및 항체 약물 접합체 |
GB0722088D0 (en) | 2007-11-09 | 2007-12-19 | Spirogen Ltd | Pyrrolobenzodiazepines |
GB0722087D0 (en) | 2007-11-09 | 2007-12-19 | Spirogen Ltd | Polyamides |
GB0813432D0 (en) | 2008-07-22 | 2008-08-27 | Spirogen Ltd | Pyrrolobenzodiazepines |
GB0819097D0 (en) | 2008-10-17 | 2008-11-26 | Spirogen Ltd | Pyrrolobenzodiazepines |
GB0819095D0 (en) * | 2008-10-17 | 2008-11-26 | Spirogen Ltd | Pyrrolobenzodiazepines |
RU2545080C2 (ru) | 2009-02-05 | 2015-03-27 | Иммьюноджен, Инк. | Новые производные бензодиазепина |
EP2558475A1 (en) | 2010-04-15 | 2013-02-20 | Spirogen Sàrl | Pyrrolobenzodiazepines used to treat proliferative diseases |
GB201006340D0 (en) | 2010-04-15 | 2010-06-02 | Spirogen Ltd | Synthesis method and intermediates |
PE20130342A1 (es) | 2010-04-15 | 2013-04-20 | Spirogen Sarl | Pirrolobenzodiacepinas y conjugados de las mismas |
KR101860174B1 (ko) * | 2011-09-20 | 2018-05-21 | 메디뮨 리미티드 | 표적 접합체 내의 내포를 위한 비대칭 이량체 pbd 화합물로서 피롤로벤조디아제핀 |
EP2750713B1 (en) | 2011-10-14 | 2015-09-16 | Spirogen Sàrl | Pyrrolobenzodiazepines and conjugates thereof |
CN104011018B (zh) | 2011-10-14 | 2016-12-14 | 麦迪穆有限责任公司 | 可用于制备吡咯并苯并二氮杂卓的合成方法和中间体 |
BR112014009055B1 (pt) | 2011-10-14 | 2021-12-14 | Seattle Genetics, Inc. | Compostos pirrolobenzodiazepinas, conjugados alvos, ligante de fármaco e uso dos ditos conjugados para tratar uma doença proliferativa |
US9526798B2 (en) | 2011-10-14 | 2016-12-27 | Seattle Genetics, Inc. | Pyrrolobenzodiazepines and targeted conjugates |
CN110183470A (zh) | 2011-10-14 | 2019-08-30 | 麦迪穆有限责任公司 | 吡咯并苯并二氮杂卓 |
US9399073B2 (en) | 2011-10-14 | 2016-07-26 | Seattle Genetics, Inc. | Pyrrolobenzodiazepines |
WO2013164593A1 (en) | 2012-04-30 | 2013-11-07 | Spirogen Sàrl | Pyrrolobenzodiazepines |
CN104540827B (zh) | 2012-04-30 | 2018-03-23 | Ucl商业有限公司 | 吡咯并苯并二氮杂卓 |
WO2013177481A1 (en) | 2012-05-25 | 2013-11-28 | Immunogen, Inc. | Benzodiazepines and conjugates thereof |
JP2015523380A (ja) | 2012-07-09 | 2015-08-13 | ジェネンテック, インコーポレイテッド | 抗cd79b抗体を含む免疫複合体 |
SG11201500087VA (en) | 2012-07-09 | 2015-02-27 | Genentech Inc | Immunoconjugates comprising anti-cd22 antibodies |
BR112015002193A2 (pt) | 2012-08-02 | 2017-07-04 | Genentech Inc | anticorpos anti-etbr e imunoconjugados |
ES2660029T3 (es) | 2012-10-12 | 2018-03-20 | Medimmune Limited | Conjugados de anticuerpo-pirrolobenzodiazepinas |
EP2906250B1 (en) | 2012-10-12 | 2018-05-30 | ADC Therapeutics SA | Pyrrolobenzodiazepine-anti-psma antibody conjugates |
CN105102068B (zh) | 2012-10-12 | 2018-06-01 | Adc疗法责任有限公司 | 吡咯并苯并二氮杂卓-抗体结合物 |
WO2014057118A1 (en) | 2012-10-12 | 2014-04-17 | Adc Therapeutics Sarl | Pyrrolobenzodiazepine-anti-cd22 antibody conjugates |
WO2014057072A1 (en) | 2012-10-12 | 2014-04-17 | Spirogen Sàrl | Synthesis and intermediates of pyrrolobenzodiazepine derivatives for conjugation |
PL2906253T3 (pl) | 2012-10-12 | 2019-02-28 | Adc Therapeutics Sa | Koniugaty pirolobenzodiazepina-przeciwciało anty-psma |
CA2887896A1 (en) | 2012-10-12 | 2014-04-17 | Adc Therapeutics Sarl | Pyrrolobenzodiazepine-anti-her2 antibody conjugates |
MX364329B (es) | 2012-10-12 | 2019-04-23 | Medimmune Ltd | Conjugados del anticuerpo pirrolobenzodiazepina. |
AU2013328628B2 (en) | 2012-10-12 | 2016-12-15 | Adc Therapeutics Sa | Pyrrolobenzodiazepine-anti-CD22 antibody conjugates |
KR101819404B1 (ko) | 2012-10-12 | 2018-02-28 | 메디뮨 리미티드 | 피롤로벤조디아제핀 및 그의 컨주게이트 |
CA2885315C (en) | 2012-10-12 | 2020-06-23 | Spirogen Sarl | Pyrrolobenzodiazepines and conjugates thereof |
CN110452242A (zh) | 2012-12-21 | 2019-11-15 | 麦迪穆有限责任公司 | 吡咯并苯并二氮杂卓及其结合物 |
CA2894959C (en) | 2012-12-21 | 2022-01-11 | Spirogen Sarl | Unsymmetrical pyrrolobenzodiazepines-dimers for use in the treatment of proliferative and autoimmune diseases |
HUE043851T2 (hu) | 2013-02-22 | 2019-09-30 | Abbvie Stemcentrx Llc | Anti-DLL3-antitest-PBD konjugátumok és alkalmazásuk |
JP6444902B2 (ja) | 2013-03-13 | 2018-12-26 | メドイミューン・リミテッドMedImmune Limited | ピロロベンゾジアゼピン及びその結合体 |
KR102066318B1 (ko) | 2013-03-13 | 2020-01-14 | 메디뮨 리미티드 | 피롤로벤조디아제핀 및 그의 컨쥬게이트 |
BR112015023070B1 (pt) | 2013-03-13 | 2022-06-07 | Genentech, Inc. | Conjugados e compostos de pirrolobenzodiazepinas, composição farmacêutica que compreende os mesmo, bem como seus usos para o tratamento de uma doença proliferativa |
WO2014174111A1 (en) | 2013-04-26 | 2014-10-30 | Pierre Fabre Medicament | Axl antibody-drug conjugate and its use for the treatment of cancer |
US9950078B2 (en) | 2013-10-11 | 2018-04-24 | Medimmune Limited | Pyrrolobenzodiazepine-antibody conjugates |
GB201317982D0 (en) | 2013-10-11 | 2013-11-27 | Spirogen Sarl | Pyrrolobenzodiazepines and conjugates thereof |
GB201317981D0 (en) | 2013-10-11 | 2013-11-27 | Spirogen Sarl | Pyrrolobenzodiazepines and conjugates thereof |
WO2015052535A1 (en) | 2013-10-11 | 2015-04-16 | Spirogen Sàrl | Pyrrolobenzodiazepine-antibody conjugates |
EP3054984A1 (en) | 2013-10-11 | 2016-08-17 | Medimmune Limited | Pyrrolobenzodiazepine-antibody conjugates |
US10010624B2 (en) | 2013-10-11 | 2018-07-03 | Medimmune Limited | Pyrrolobenzodiazepine-antibody conjugates |
CA2931798C (en) | 2013-12-16 | 2023-06-27 | Genentech, Inc. | Peptidomimetic compounds and antibody-drug conjugates thereof |
GB201406767D0 (en) | 2014-04-15 | 2014-05-28 | Cancer Rec Tech Ltd | Humanized anti-Tn-MUC1 antibodies anf their conjugates |
-
2013
- 2013-12-20 CA CA2894959A patent/CA2894959C/en active Active
- 2013-12-20 JP JP2015548633A patent/JP6527466B2/ja active Active
- 2013-12-20 AU AU2013366490A patent/AU2013366490B9/en not_active Ceased
- 2013-12-20 CN CN201380073646.6A patent/CN105246894A/zh active Pending
- 2013-12-20 EP EP13821469.7A patent/EP2935273A1/en not_active Withdrawn
- 2013-12-20 CN CN201910920109.XA patent/CN110627797A/zh active Pending
- 2013-12-20 EA EA201591000A patent/EA032986B1/ru not_active IP Right Cessation
- 2013-12-20 WO PCT/EP2013/077695 patent/WO2014096365A1/en active Application Filing
- 2013-12-20 US US14/654,070 patent/US9567340B2/en active Active
-
2016
- 2016-04-13 HK HK16104190.2A patent/HK1216176A1/zh unknown
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2011130613A1 (en) * | 2010-04-15 | 2011-10-20 | Seattle Genetics, Inc. | Targeted pyrrolobenzodiazapine conjugates |
WO2012112708A1 (en) * | 2011-02-15 | 2012-08-23 | Immunogen, Inc. | Cytotoxic benzodiazepine derivatives and methods of preparation |
Non-Patent Citations (2)
Title |
---|
AHMED KAMAL ET AL.: "Design, synthesis, and evaluation of mixed imine-amine pyrrolobenzodiazepine dimers with efficient DNA binding affinity and potent cytotoxicity", 《BIOORGANIC & MEDICINAL CHEMISTRY》 * |
AHMED KAMAL ET AL.: "Design, Synthesis, and Evaluation of New Noncross-Linking Pyrrolobenzodiazepine Dimers with Efficient DNA Binding Ability and Potent Antitumor Activity", 《J. MED. CHEM.》 * |
Also Published As
Publication number | Publication date |
---|---|
HK1216176A1 (zh) | 2016-10-21 |
AU2013366490B9 (en) | 2018-02-01 |
US9567340B2 (en) | 2017-02-14 |
AU2013366490B2 (en) | 2018-01-18 |
JP6527466B2 (ja) | 2019-06-05 |
US20150315196A1 (en) | 2015-11-05 |
EP2935273A1 (en) | 2015-10-28 |
WO2014096365A1 (en) | 2014-06-26 |
CA2894959A1 (en) | 2014-06-26 |
EA201591000A1 (ru) | 2015-11-30 |
AU2013366490A1 (en) | 2015-07-09 |
EA032986B1 (ru) | 2019-08-30 |
CN105246894A (zh) | 2016-01-13 |
JP2016505586A (ja) | 2016-02-25 |
CA2894959C (en) | 2022-01-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR102066319B1 (ko) | 피롤로벤조디아제핀 및 그의 컨쥬게이트 | |
KR102052244B1 (ko) | 피롤로벤조디아제핀 및 이의 컨주게이트 | |
CN110627797A (zh) | 用于治疗增殖性和自身免疫疾病的非对称吡咯并苯并二氮杂卓二聚物 | |
EA031585B1 (ru) | Пирролобензодиазепины и их конъюгаты | |
KR100607612B1 (ko) | 종양의 진단 및 치료를 위한 방법 및 이를 위한 조성물 | |
CA2473746C (en) | Compositions and methods for the detection, diagnosis and therapy of hematological malignancies | |
CN113490850A (zh) | 用于治疗乳腺癌的诊断和治疗方法 | |
KR102239125B1 (ko) | 인간 이디오타입을 갖는 설치류 항체를 인코딩하는 폴리뉴클레오티드 및 이를 포함하는 동물 | |
KR20050048615A (ko) | 종양의 진단 및 치료를 위한 조성물 및 방법 | |
KR20070100235A (ko) | 종양의 진단 및 치료를 위한 조성물 및 방법 | |
US20210189342A1 (en) | Compositions and methods for modulating monocyte and macrophage inflammatory phenotypes and immunotherapy uses thereof | |
CA3040194A1 (en) | Compositions and methods for predicting response and resistance to ctla4 blockade in melanoma using a gene expression signature | |
KR20060015296A (ko) | 신경교 기원의 종양의 진단 및 치료를 위한 조성물 및 방법 | |
KR20180011276A (ko) | 잘못 접혀진 단백질의 분해를 위한 조성물 및 방법 | |
KR20140057331A (ko) | 유방암의 치료 및 진단을 위한 방법 및 조성물 | |
KR20040073537A (ko) | 종양의 진단 및 치료 방법 및 이를 위한 조성물 | |
US20200300857A1 (en) | Modulating biomarkers to increase tumor immunity and improve the efficacy of cancer immunotherapy | |
WO2019014663A1 (en) | MODULATION OF BIOMARKERS TO INCREASE ANTI-TUMOR IMMUNITY AND IMPROVE THE EFFECTIVENESS OF CANCER IMMUNOTHERAPY | |
US20020119541A1 (en) | Tumor suppressor CAR-1 | |
KR20190104400A (ko) | 다중 중쇄 면역글로불린 유전자좌를 갖는 트랜스제닉 설치류 기원의 인간 항체 | |
US12050219B2 (en) | Modulating biomarkers such as SPP to increase tumor immunity and improve the efficacy of cancer immunotherapy | |
KR102001153B1 (ko) | 유방암 예후 예측용 조성물 및 방법 | |
US20220265798A1 (en) | Cancer vaccine compositions and methods for using same to prevent and/or treat cancer | |
CN115698719A (zh) | 用于癌症免疫疗法的方法和组合物 | |
KR20220008244A (ko) | 전립선암 환자의 예후 진단 및 치료 전략 결정용 연령 특이적 마커 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |