IE83896B1 - Heregulins (HRGs), binding proteins of P185erb2 - Google Patents
Heregulins (HRGs), binding proteins of P185erb2Info
- Publication number
- IE83896B1 IE83896B1 IE1992/1654A IE921654A IE83896B1 IE 83896 B1 IE83896 B1 IE 83896B1 IE 1992/1654 A IE1992/1654 A IE 1992/1654A IE 921654 A IE921654 A IE 921654A IE 83896 B1 IE83896 B1 IE 83896B1
- Authority
- IE
- Ireland
- Prior art keywords
- hrg
- ser
- heregulin
- glu
- lys
- Prior art date
Links
- 102000024070 binding proteins Human genes 0.000 title description 2
- 108091007650 binding proteins Proteins 0.000 title description 2
- 230000027455 binding Effects 0.000 claims description 110
- 229920001184 polypeptide Polymers 0.000 claims description 101
- 102000004965 antibodies Human genes 0.000 claims description 99
- 108090001123 antibodies Proteins 0.000 claims description 99
- 150000007523 nucleic acids Chemical class 0.000 claims description 99
- 108020004707 nucleic acids Proteins 0.000 claims description 82
- 230000014509 gene expression Effects 0.000 claims description 74
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 56
- 239000000203 mixture Substances 0.000 claims description 47
- 230000000694 effects Effects 0.000 claims description 43
- 241000282414 Homo sapiens Species 0.000 claims description 42
- 125000000539 amino acid group Chemical group 0.000 claims description 40
- 102000004190 Enzymes Human genes 0.000 claims description 33
- 108090000790 Enzymes Proteins 0.000 claims description 33
- 239000003102 growth factor Substances 0.000 claims description 17
- 230000002163 immunogen Effects 0.000 claims description 10
- 230000001086 cytosolic Effects 0.000 claims description 7
- 230000000240 adjuvant Effects 0.000 claims description 6
- 239000002671 adjuvant Substances 0.000 claims description 6
- 230000000890 antigenic Effects 0.000 claims description 6
- 239000003937 drug carrier Substances 0.000 claims 1
- MTCFGRXMJLQNBG-REOHCLBHSA-N L-serine Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 373
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 248
- 210000004027 cells Anatomy 0.000 description 220
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 188
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 158
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 description 151
- 235000001014 amino acid Nutrition 0.000 description 135
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 133
- 229920003013 deoxyribonucleic acid Polymers 0.000 description 133
- 125000000174 L-prolyl group Chemical group [H]N1C([H])([H])C([H])([H])C([H])([H])[C@@]1([H])C(*)=O 0.000 description 112
- 150000001413 amino acids Chemical class 0.000 description 106
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 95
- XUJNEKJLAYXESH-REOHCLBHSA-N L-cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 90
- 102000004169 proteins and genes Human genes 0.000 description 82
- 108090000623 proteins and genes Proteins 0.000 description 82
- 235000018102 proteins Nutrition 0.000 description 81
- 125000001500 prolyl group Chemical group [H]N1C([H])(C(=O)[*])C([H])([H])C([H])([H])C1([H])[H] 0.000 description 78
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 65
- 238000004166 bioassay Methods 0.000 description 59
- 239000003446 ligand Substances 0.000 description 57
- 229920002676 Complementary DNA Polymers 0.000 description 53
- 229920001850 Nucleic acid sequence Polymers 0.000 description 53
- 238000000034 method Methods 0.000 description 53
- 108020004635 Complementary DNA Proteins 0.000 description 47
- 238000010804 cDNA synthesis Methods 0.000 description 47
- 239000002299 complementary DNA Substances 0.000 description 47
- 102100010813 EGF Human genes 0.000 description 46
- 101700033006 EGF Proteins 0.000 description 46
- 229940116977 Epidermal Growth Factor Drugs 0.000 description 46
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 45
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 39
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 34
- 239000000727 fraction Substances 0.000 description 31
- 229920000272 Oligonucleotide Polymers 0.000 description 30
- 239000000523 sample Substances 0.000 description 30
- 239000002609 media Substances 0.000 description 28
- 102000005962 receptors Human genes 0.000 description 27
- 108020003175 receptors Proteins 0.000 description 27
- 238000006467 substitution reaction Methods 0.000 description 27
- 239000012528 membrane Substances 0.000 description 26
- 210000004379 Membranes Anatomy 0.000 description 25
- 235000018417 cysteine Nutrition 0.000 description 25
- -1 deoxyribonucleotide triphosphates Chemical class 0.000 description 24
- 230000035772 mutation Effects 0.000 description 24
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 22
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 22
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 22
- 239000012491 analyte Substances 0.000 description 22
- 238000006206 glycosylation reaction Methods 0.000 description 22
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 21
- 230000003899 glycosylation Effects 0.000 description 21
- 230000012010 growth Effects 0.000 description 21
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 21
- 238000000746 purification Methods 0.000 description 21
- 230000035897 transcription Effects 0.000 description 21
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 20
- 125000000151 cysteine group Chemical class N[C@@H](CS)C(=O)* 0.000 description 20
- 125000003729 nucleotide group Chemical group 0.000 description 20
- FAPWRFPIFSIZLT-UHFFFAOYSA-M sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 20
- 229940088598 Enzyme Drugs 0.000 description 19
- 230000002708 enhancing Effects 0.000 description 19
- 239000002773 nucleotide Substances 0.000 description 19
- 241000894007 species Species 0.000 description 19
- 210000001519 tissues Anatomy 0.000 description 19
- 201000011510 cancer Diseases 0.000 description 18
- 238000004113 cell culture Methods 0.000 description 18
- 230000000295 complement Effects 0.000 description 18
- 230000000875 corresponding Effects 0.000 description 18
- 238000003780 insertion Methods 0.000 description 18
- 238000003752 polymerase chain reaction Methods 0.000 description 18
- 239000011780 sodium chloride Substances 0.000 description 18
- 235000002639 sodium chloride Nutrition 0.000 description 18
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 17
- 239000003623 enhancer Substances 0.000 description 17
- 229920000160 (ribonucleotides)n+m Polymers 0.000 description 16
- 101700025368 ERBB2 Proteins 0.000 description 16
- 102100016662 ERBB2 Human genes 0.000 description 16
- IKAIKUBBJHFNBZ-LURJTMIESA-N Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CN IKAIKUBBJHFNBZ-LURJTMIESA-N 0.000 description 16
- 238000006243 chemical reaction Methods 0.000 description 16
- 125000000510 L-tryptophano group Chemical group [H]C1=C([H])C([H])=C2N([H])C([H])=C(C([H])([H])[C@@]([H])(C(O[H])=O)N([H])[*])C2=C1[H] 0.000 description 15
- 238000004458 analytical method Methods 0.000 description 15
- 238000003776 cleavage reaction Methods 0.000 description 15
- 230000004927 fusion Effects 0.000 description 15
- WEVYAHXRMPXWCK-UHFFFAOYSA-N acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 14
- 238000010367 cloning Methods 0.000 description 14
- 239000000499 gel Substances 0.000 description 14
- 108010054155 lysyllysine Proteins 0.000 description 14
- 108091007521 restriction endonucleases Proteins 0.000 description 14
- 229940110715 ENZYMES FOR TREATMENT OF WOUNDS AND ULCERS Drugs 0.000 description 13
- 101700029736 HBEGF Proteins 0.000 description 13
- 102100016400 HBEGF Human genes 0.000 description 13
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 13
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 13
- 230000003321 amplification Effects 0.000 description 13
- 230000010261 cell growth Effects 0.000 description 13
- 239000003795 chemical substances by application Substances 0.000 description 13
- 108010015792 glycyllysine Proteins 0.000 description 13
- 229940020899 hematological Enzymes Drugs 0.000 description 13
- 229920002106 messenger RNA Polymers 0.000 description 13
- 238000003199 nucleic acid amplification method Methods 0.000 description 13
- DTQVDTLACAAQTR-UHFFFAOYSA-N trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 13
- 108020004999 Messenger RNA Proteins 0.000 description 12
- JKHXYJKMNSSFFL-IUCAKERBSA-N Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN JKHXYJKMNSSFFL-IUCAKERBSA-N 0.000 description 12
- 230000001580 bacterial Effects 0.000 description 12
- 230000029087 digestion Effects 0.000 description 12
- 238000004128 high performance liquid chromatography Methods 0.000 description 12
- 108010073969 valyllysine Proteins 0.000 description 12
- 108020004705 Codon Proteins 0.000 description 11
- 241000588724 Escherichia coli Species 0.000 description 11
- NVGBPTNZLWRQSY-UWVGGRQHSA-N Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN NVGBPTNZLWRQSY-UWVGGRQHSA-N 0.000 description 11
- UIIMBOGNXHQVGW-UHFFFAOYSA-M buffer Substances [Na+].OC([O-])=O UIIMBOGNXHQVGW-UHFFFAOYSA-M 0.000 description 11
- 102000004419 dihydrofolate reductase family Human genes 0.000 description 11
- 108020001096 dihydrofolate reductase family Proteins 0.000 description 11
- 108010045030 monoclonal antibodies Proteins 0.000 description 11
- 238000002703 mutagenesis Methods 0.000 description 11
- 231100000350 mutagenesis Toxicity 0.000 description 11
- 229910052700 potassium Inorganic materials 0.000 description 11
- 239000000126 substance Substances 0.000 description 11
- 108010033760 Amphiregulin Proteins 0.000 description 10
- 102000007299 Amphiregulin Human genes 0.000 description 10
- 102000001301 EGF receptors Human genes 0.000 description 10
- 108060006698 EGF receptors Proteins 0.000 description 10
- BBBXWRGITSUJPB-YUMQZZPRSA-N Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O BBBXWRGITSUJPB-YUMQZZPRSA-N 0.000 description 10
- IEFJWDNGDZAYNZ-BYPYZUCNSA-N Gly-Glu Chemical compound NCC(=O)N[C@H](C(O)=O)CCC(O)=O IEFJWDNGDZAYNZ-BYPYZUCNSA-N 0.000 description 10
- 239000004472 Lysine Substances 0.000 description 10
- XZKQVQKUZMAADP-IMJSIDKUSA-N Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(O)=O XZKQVQKUZMAADP-IMJSIDKUSA-N 0.000 description 10
- 239000001963 growth media Substances 0.000 description 10
- 210000004962 mammalian cells Anatomy 0.000 description 10
- 238000004519 manufacturing process Methods 0.000 description 10
- 230000004048 modification Effects 0.000 description 10
- 238000006011 modification reaction Methods 0.000 description 10
- 102000005614 monoclonal antibodies Human genes 0.000 description 10
- 238000006366 phosphorylation reaction Methods 0.000 description 10
- 230000000865 phosphorylative Effects 0.000 description 10
- 239000000047 product Substances 0.000 description 10
- 230000002797 proteolythic Effects 0.000 description 10
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 9
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 9
- 241000894006 Bacteria Species 0.000 description 9
- PHTXVQQRWJXYPP-UHFFFAOYSA-N Ethyltrifluoromethylaminoindane Chemical compound C1=C(C(F)(F)F)C=C2CC(NCC)CC2=C1 PHTXVQQRWJXYPP-UHFFFAOYSA-N 0.000 description 9
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 9
- 229920002684 Sepharose Polymers 0.000 description 9
- 241000700605 Viruses Species 0.000 description 9
- 238000007792 addition Methods 0.000 description 9
- 230000003042 antagnostic Effects 0.000 description 9
- 239000005557 antagonist Substances 0.000 description 9
- 108010062796 arginyllysine Proteins 0.000 description 9
- 150000001720 carbohydrates Chemical class 0.000 description 9
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 9
- 238000009396 hybridization Methods 0.000 description 9
- 230000002401 inhibitory effect Effects 0.000 description 9
- 108010009298 lysylglutamic acid Proteins 0.000 description 9
- 238000002360 preparation method Methods 0.000 description 9
- 230000001225 therapeutic Effects 0.000 description 9
- 238000001890 transfection Methods 0.000 description 9
- 230000001131 transforming Effects 0.000 description 9
- 239000004475 Arginine Substances 0.000 description 8
- DWBZEJHQQIURML-IMJSIDKUSA-N Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O DWBZEJHQQIURML-IMJSIDKUSA-N 0.000 description 8
- 206010006187 Breast cancer Diseases 0.000 description 8
- 101700060347 CUL7 Proteins 0.000 description 8
- 241000196324 Embryophyta Species 0.000 description 8
- OTXBNHIUIHNGAO-UWVGGRQHSA-N Leu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN OTXBNHIUIHNGAO-UWVGGRQHSA-N 0.000 description 8
- 241000283973 Oryctolagus cuniculus Species 0.000 description 8
- 229920000805 Polyaspartic acid Polymers 0.000 description 8
- 239000000427 antigen Substances 0.000 description 8
- 108091007172 antigens Proteins 0.000 description 8
- 102000038129 antigens Human genes 0.000 description 8
- 229960000070 antineoplastic Monoclonal antibodies Drugs 0.000 description 8
- 108010077245 asparaginyl-proline Proteins 0.000 description 8
- 230000015572 biosynthetic process Effects 0.000 description 8
- 230000002860 competitive Effects 0.000 description 8
- 238000000338 in vitro Methods 0.000 description 8
- 108010034529 leucyl-lysine Proteins 0.000 description 8
- 239000003550 marker Substances 0.000 description 8
- 239000000463 material Substances 0.000 description 8
- 230000000051 modifying Effects 0.000 description 8
- 229960000060 monoclonal antibodies Drugs 0.000 description 8
- 239000002953 phosphate buffered saline Substances 0.000 description 8
- 230000004044 response Effects 0.000 description 8
- 150000003839 salts Chemical class 0.000 description 8
- 238000003786 synthesis reaction Methods 0.000 description 8
- 230000001052 transient Effects 0.000 description 8
- QXRNAOYBCYVZCD-BQBZGAKWSA-N (2S)-6-amino-2-[[(2S)-2-aminopropanoyl]amino]hexanoic acid Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN QXRNAOYBCYVZCD-BQBZGAKWSA-N 0.000 description 7
- MPZWMIIOPAPAKE-UHFFFAOYSA-N 4-amino-5-[[1-carboxy-4-(diaminomethylideneamino)butyl]amino]-5-oxopentanoic acid Chemical compound OC(=O)CCC(N)C(=O)NC(C(O)=O)CCCN=C(N)N MPZWMIIOPAPAKE-UHFFFAOYSA-N 0.000 description 7
- 229920001405 Coding region Polymers 0.000 description 7
- 102000027760 ERBB2 Human genes 0.000 description 7
- 108010066668 ErbB-2 Receptor Proteins 0.000 description 7
- 229960002897 Heparin Drugs 0.000 description 7
- ZFGMDIBRIDKWMY-PASTXAENSA-N Heparin Chemical compound CC(O)=N[C@@H]1[C@@H](O)[C@H](O)[C@@H](COS(O)(=O)=O)O[C@@H]1O[C@@H]1[C@@H](C(O)=O)O[C@@H](O[C@H]2[C@@H]([C@@H](OS(O)(=O)=O)[C@@H](O[C@@H]3[C@@H](OC(O)[C@H](OS(O)(=O)=O)[C@H]3O)C(O)=O)O[C@@H]2O)CS(O)(=O)=O)[C@H](O)[C@H]1O ZFGMDIBRIDKWMY-PASTXAENSA-N 0.000 description 7
- 241000699666 Mus <mouse, genus> Species 0.000 description 7
- 102000035443 Peptidases Human genes 0.000 description 7
- 108091005771 Peptidases Proteins 0.000 description 7
- 239000002202 Polyethylene glycol Substances 0.000 description 7
- 241000700159 Rattus Species 0.000 description 7
- NFDYGNFETJVMSE-BQBZGAKWSA-N Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CO NFDYGNFETJVMSE-BQBZGAKWSA-N 0.000 description 7
- STTYIMSDIYISRG-WDSKDSINSA-N Val-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(O)=O STTYIMSDIYISRG-WDSKDSINSA-N 0.000 description 7
- 230000003213 activating Effects 0.000 description 7
- 108010060035 arginylproline Proteins 0.000 description 7
- 230000035578 autophosphorylation Effects 0.000 description 7
- 239000003153 chemical reaction reagent Substances 0.000 description 7
- 230000001809 detectable Effects 0.000 description 7
- 229920000669 heparin Polymers 0.000 description 7
- 230000001965 increased Effects 0.000 description 7
- 101710007041 let-363 Proteins 0.000 description 7
- 239000010452 phosphate Substances 0.000 description 7
- 235000021317 phosphate Nutrition 0.000 description 7
- 229920001223 polyethylene glycol Polymers 0.000 description 7
- 108010026333 seryl-proline Proteins 0.000 description 7
- 239000000243 solution Substances 0.000 description 7
- 230000000638 stimulation Effects 0.000 description 7
- LQJAALCCPOTJGB-YUMQZZPRSA-N (2S)-1-[(2S)-2-amino-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carboxylic acid Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O LQJAALCCPOTJGB-YUMQZZPRSA-N 0.000 description 6
- PCDQPRRSZKQHHS-XVFCMESISA-N ({[({[(2R,3S,4R,5R)-5-(4-amino-2-oxo-1,2-dihydropyrimidin-1-yl)-3,4-dihydroxyoxolan-2-yl]methoxy}(hydroxy)phosphoryl)oxy](hydroxy)phosphoryl}oxy)phosphonic acid Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 PCDQPRRSZKQHHS-XVFCMESISA-N 0.000 description 6
- JQFZHHSQMKZLRU-IUCAKERBSA-N Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N JQFZHHSQMKZLRU-IUCAKERBSA-N 0.000 description 6
- KLKHFFMNGWULBN-VKHMYHEASA-N Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)NCC(O)=O KLKHFFMNGWULBN-VKHMYHEASA-N 0.000 description 6
- 241000282693 Cercopithecidae Species 0.000 description 6
- YBAFDPFAUTYYRW-YUMQZZPRSA-N Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O YBAFDPFAUTYYRW-YUMQZZPRSA-N 0.000 description 6
- CLSDNFWKGFJIBZ-UHFFFAOYSA-N Glutaminyl-Lysine Chemical compound NCCCCC(C(O)=O)NC(=O)C(N)CCC(N)=O CLSDNFWKGFJIBZ-UHFFFAOYSA-N 0.000 description 6
- 210000003734 Kidney Anatomy 0.000 description 6
- 241000880493 Leptailurus serval Species 0.000 description 6
- XGDCYUQSFDQISZ-BQBZGAKWSA-N Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O XGDCYUQSFDQISZ-BQBZGAKWSA-N 0.000 description 6
- 241001465754 Metazoa Species 0.000 description 6
- 239000004365 Protease Substances 0.000 description 6
- 101710037934 QRSL1 Proteins 0.000 description 6
- 210000002966 Serum Anatomy 0.000 description 6
- 239000004473 Threonine Substances 0.000 description 6
- NBIIXXVUZAFLBC-UHFFFAOYSA-K [O-]P([O-])([O-])=O Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 6
- 239000002253 acid Substances 0.000 description 6
- 239000003636 conditioned culture media Substances 0.000 description 6
- 239000012091 fetal bovine serum Substances 0.000 description 6
- 230000003053 immunization Effects 0.000 description 6
- 238000002955 isolation Methods 0.000 description 6
- 210000003292 kidney cell Anatomy 0.000 description 6
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 6
- 238000001556 precipitation Methods 0.000 description 6
- 108010031719 prolyl-serine Proteins 0.000 description 6
- 235000019833 protease Nutrition 0.000 description 6
- 125000003616 serine group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 6
- 238000010186 staining Methods 0.000 description 6
- 235000000346 sugar Nutrition 0.000 description 6
- 230000002194 synthesizing Effects 0.000 description 6
- 239000000700 tracer Substances 0.000 description 6
- CCUAQNUWXLYFRA-IMJSIDKUSA-N Ala-Asn Chemical compound C[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC(N)=O CCUAQNUWXLYFRA-IMJSIDKUSA-N 0.000 description 5
- GADKFYNESXNRLC-WDSKDSINSA-N Asn-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O GADKFYNESXNRLC-WDSKDSINSA-N 0.000 description 5
- 108091003117 Bovine Serum Albumin Proteins 0.000 description 5
- 229940098773 Bovine Serum Albumin Drugs 0.000 description 5
- OWOFCNWTMWOOJJ-WDSKDSINSA-N Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OWOFCNWTMWOOJJ-WDSKDSINSA-N 0.000 description 5
- JLXVRFDTDUGQEE-YFKPBYRVSA-N Gly-Arg Chemical compound NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N JLXVRFDTDUGQEE-YFKPBYRVSA-N 0.000 description 5
- 102000003886 Glycoproteins Human genes 0.000 description 5
- 108090000288 Glycoproteins Proteins 0.000 description 5
- 210000004408 Hybridomas Anatomy 0.000 description 5
- CNPNWGHRMBQHBZ-ZKWXMUAHSA-N Ile-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O CNPNWGHRMBQHBZ-ZKWXMUAHSA-N 0.000 description 5
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 5
- VYZAGTDAHUIRQA-WHFBIAKZSA-N L-alanyl-L-glutamic acid Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O VYZAGTDAHUIRQA-WHFBIAKZSA-N 0.000 description 5
- UGTZHPSKYRIGRJ-YUMQZZPRSA-N Lys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UGTZHPSKYRIGRJ-YUMQZZPRSA-N 0.000 description 5
- ATIPDCIQTUXABX-UWVGGRQHSA-N Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ATIPDCIQTUXABX-UWVGGRQHSA-N 0.000 description 5
- AIXUQKMMBQJZCU-IUCAKERBSA-N Lys-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O AIXUQKMMBQJZCU-IUCAKERBSA-N 0.000 description 5
- ZOKVLMBYDSIDKG-CSMHCCOUSA-N Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ZOKVLMBYDSIDKG-CSMHCCOUSA-N 0.000 description 5
- 210000001672 Ovary Anatomy 0.000 description 5
- FELJDCNGZFDUNR-WDSKDSINSA-N Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FELJDCNGZFDUNR-WDSKDSINSA-N 0.000 description 5
- RWCOTTLHDJWHRS-YUMQZZPRSA-N Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RWCOTTLHDJWHRS-YUMQZZPRSA-N 0.000 description 5
- AFWBWPCXSWUCLB-WDSKDSINSA-N Pro-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 AFWBWPCXSWUCLB-WDSKDSINSA-N 0.000 description 5
- WOUIMBGNEUWXQG-VKHMYHEASA-N Ser-Gly Chemical compound OC[C@H](N)C(=O)NCC(O)=O WOUIMBGNEUWXQG-VKHMYHEASA-N 0.000 description 5
- RZEQTVHJZCIUBT-UHFFFAOYSA-N Serinyl-Arginine Chemical compound OCC(N)C(=O)NC(C(O)=O)CCCNC(N)=N RZEQTVHJZCIUBT-UHFFFAOYSA-N 0.000 description 5
- SBMNPABNWKXNBJ-UHFFFAOYSA-N Serinyl-Lysine Chemical compound NCCCCC(C(O)=O)NC(=O)C(N)CO SBMNPABNWKXNBJ-UHFFFAOYSA-N 0.000 description 5
- DSGIVWSDDRDJIO-ZXXMMSQZSA-N Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DSGIVWSDDRDJIO-ZXXMMSQZSA-N 0.000 description 5
- YKRQRPFODDJQTC-UHFFFAOYSA-N Threoninyl-Lysine Chemical compound CC(O)C(N)C(=O)NC(C(O)=O)CCCCN YKRQRPFODDJQTC-UHFFFAOYSA-N 0.000 description 5
- 238000002835 absorbance Methods 0.000 description 5
- 235000004279 alanine Nutrition 0.000 description 5
- 108010093581 aspartyl-proline Proteins 0.000 description 5
- 230000001413 cellular Effects 0.000 description 5
- 239000003814 drug Substances 0.000 description 5
- 229940079593 drugs Drugs 0.000 description 5
- 238000010828 elution Methods 0.000 description 5
- 230000002255 enzymatic Effects 0.000 description 5
- BDAGIHXWWSANSR-UHFFFAOYSA-N formic acid Substances OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 5
- 238000003018 immunoassay Methods 0.000 description 5
- 230000001939 inductive effect Effects 0.000 description 5
- 230000000977 initiatory Effects 0.000 description 5
- 108010027338 isoleucylcysteine Proteins 0.000 description 5
- 108010057821 leucylproline Proteins 0.000 description 5
- 229920000642 polymer Polymers 0.000 description 5
- 230000017854 proteolysis Effects 0.000 description 5
- 230000028327 secretion Effects 0.000 description 5
- 108010048818 seryl-histidine Proteins 0.000 description 5
- 230000004936 stimulating Effects 0.000 description 5
- 239000006228 supernatant Substances 0.000 description 5
- IOUPEELXVYPCPG-UHFFFAOYSA-N val-gly Chemical compound CC(C)C(N)C(=O)NCC(O)=O IOUPEELXVYPCPG-UHFFFAOYSA-N 0.000 description 5
- 230000003612 virological Effects 0.000 description 5
- CCQOOWAONKGYKQ-BYPYZUCNSA-N (2S)-2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]propanoate Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 4
- 101710027066 ALB Proteins 0.000 description 4
- 210000000628 Antibody-Producing Cells Anatomy 0.000 description 4
- OSASDIVHOSJVII-UHFFFAOYSA-N Arginyl-Cysteine Chemical compound SCC(C(O)=O)NC(=O)C(N)CCCNC(N)=N OSASDIVHOSJVII-UHFFFAOYSA-N 0.000 description 4
- SONUFGRSSMFHFN-IMJSIDKUSA-N Asn-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O SONUFGRSSMFHFN-IMJSIDKUSA-N 0.000 description 4
- 229960001230 Asparagine Drugs 0.000 description 4
- NPDLYUOYAGBHFB-UHFFFAOYSA-N Asparaginyl-Arginine Chemical compound NC(=O)CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N NPDLYUOYAGBHFB-UHFFFAOYSA-N 0.000 description 4
- 241000699802 Cricetulus griseus Species 0.000 description 4
- ATDGTVJJHBUTRL-UHFFFAOYSA-N Cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 4
- HAYVTMHUNMMXCV-UHFFFAOYSA-N Cysteinyl-Alanine Chemical compound OC(=O)C(C)NC(=O)C(N)CS HAYVTMHUNMMXCV-UHFFFAOYSA-N 0.000 description 4
- OELDIVRKHTYFNG-UHFFFAOYSA-N Cysteinyl-Valine Chemical compound CC(C)C(C(O)=O)NC(=O)C(N)CS OELDIVRKHTYFNG-UHFFFAOYSA-N 0.000 description 4
- 241000701022 Cytomegalovirus Species 0.000 description 4
- WQZGKKKJIJFFOK-GASJEMHNSA-N D-Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 4
- QIVBCDIJIAJPQS-SECBINFHSA-N D-tryptophane Chemical compound C1=CC=C2C(C[C@@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-SECBINFHSA-N 0.000 description 4
- 102000004594 DNA Polymerase I Human genes 0.000 description 4
- 108010017826 DNA Polymerase I Proteins 0.000 description 4
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 4
- FYYSIASRLDJUNP-WHFBIAKZSA-N Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O FYYSIASRLDJUNP-WHFBIAKZSA-N 0.000 description 4
- 229920002683 Glycosaminoglycan Polymers 0.000 description 4
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 4
- DRCKHKZYDLJYFQ-UHFFFAOYSA-N Isoleucyl-Threonine Chemical compound CCC(C)C(N)C(=O)NC(C(C)O)C(O)=O DRCKHKZYDLJYFQ-UHFFFAOYSA-N 0.000 description 4
- NFNVDJGXRFEYTK-YUMQZZPRSA-N Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O NFNVDJGXRFEYTK-YUMQZZPRSA-N 0.000 description 4
- LESXFEZIFXFIQR-LURJTMIESA-N Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(O)=O LESXFEZIFXFIQR-LURJTMIESA-N 0.000 description 4
- 210000004185 Liver Anatomy 0.000 description 4
- NPBGTPKLVJEOBE-IUCAKERBSA-N Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N NPBGTPKLVJEOBE-IUCAKERBSA-N 0.000 description 4
- MYTOTTSMVMWVJN-STQMWFEESA-N Lys-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MYTOTTSMVMWVJN-STQMWFEESA-N 0.000 description 4
- 241000124008 Mammalia Species 0.000 description 4
- DZMGFGQBRYWJOR-YUMQZZPRSA-N Met-Pro Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O DZMGFGQBRYWJOR-YUMQZZPRSA-N 0.000 description 4
- 125000000729 N-terminal amino-acid group Chemical group 0.000 description 4
- ROHDXJUFQVRDAV-UWVGGRQHSA-N Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ROHDXJUFQVRDAV-UWVGGRQHSA-N 0.000 description 4
- 102000004022 Protein-Tyrosine Kinases Human genes 0.000 description 4
- 108090000412 Protein-Tyrosine Kinases Proteins 0.000 description 4
- LTFSLKWFMWZEBD-IMJSIDKUSA-N Ser-Asn Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O LTFSLKWFMWZEBD-IMJSIDKUSA-N 0.000 description 4
- UJTZHGHXJKIAOS-WHFBIAKZSA-N Ser-Gln Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O UJTZHGHXJKIAOS-WHFBIAKZSA-N 0.000 description 4
- ILVGMCVCQBJPSH-WDSKDSINSA-N Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO ILVGMCVCQBJPSH-WDSKDSINSA-N 0.000 description 4
- ONWMQORSVZYVNH-UHFFFAOYSA-N Tyrosyl-Asparagine Chemical compound NC(=O)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 ONWMQORSVZYVNH-UHFFFAOYSA-N 0.000 description 4
- WITCOKQIPFWQQD-FSPLSTOPSA-N Val-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O WITCOKQIPFWQQD-FSPLSTOPSA-N 0.000 description 4
- XXDVDTMEVBYRPK-XPUUQOCRSA-N Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O XXDVDTMEVBYRPK-XPUUQOCRSA-N 0.000 description 4
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 4
- 238000001261 affinity purification Methods 0.000 description 4
- 108010070944 alanylhistidine Proteins 0.000 description 4
- 230000004075 alteration Effects 0.000 description 4
- 125000000511 arginine group Chemical group N[C@@H](CCCNC(N)=N)C(=O)* 0.000 description 4
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 4
- 235000009582 asparagine Nutrition 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 230000003115 biocidal Effects 0.000 description 4
- 238000004587 chromatography analysis Methods 0.000 description 4
- 238000007796 conventional method Methods 0.000 description 4
- 238000004132 cross linking Methods 0.000 description 4
- 238000010192 crystallographic characterization Methods 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- ZHNUHDYFZUAESO-UHFFFAOYSA-N formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 4
- 239000008103 glucose Substances 0.000 description 4
- 108010087823 glycyltyrosine Proteins 0.000 description 4
- 108010018006 histidylserine Proteins 0.000 description 4
- 230000002209 hydrophobic Effects 0.000 description 4
- 239000007924 injection Substances 0.000 description 4
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 4
- 108010078274 isoleucylvaline Proteins 0.000 description 4
- 238000011068 load Methods 0.000 description 4
- 108010044655 lysylproline Proteins 0.000 description 4
- 108010017391 lysylvaline Proteins 0.000 description 4
- 239000011159 matrix material Substances 0.000 description 4
- 239000003094 microcapsule Substances 0.000 description 4
- 229960004927 neomycin Drugs 0.000 description 4
- IJGRMHOSHXDMSA-UHFFFAOYSA-N nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- OZAIFHULBGXAKX-UHFFFAOYSA-N precursor Substances N#CC(C)(C)N=NC(C)(C)C#N OZAIFHULBGXAKX-UHFFFAOYSA-N 0.000 description 4
- 102000004196 processed proteins & peptides Human genes 0.000 description 4
- 108090000765 processed proteins & peptides Proteins 0.000 description 4
- 230000001105 regulatory Effects 0.000 description 4
- 238000004007 reversed phase HPLC Methods 0.000 description 4
- 239000011734 sodium Substances 0.000 description 4
- 108010005652 splenotritin Proteins 0.000 description 4
- 230000002459 sustained Effects 0.000 description 4
- 125000001493 tyrosinyl group Chemical group [H]OC1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 4
- 238000011144 upstream manufacturing Methods 0.000 description 4
- 239000004474 valine Substances 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- WQZGKKKJIJFFOK-VFUOTHLCSA-N β-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 4
- JSZMKEYEVLDPDO-ACZMJKKPSA-N (2R)-2-[[(2S,3S)-2-amino-3-methylpentanoyl]amino]-3-sulfanylpropanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CS)C(O)=O JSZMKEYEVLDPDO-ACZMJKKPSA-N 0.000 description 3
- OCYROESYHWUPBP-VGMNWLOBSA-N (2S,3R)-3-methyl-2-[[(2S)-pyrrolidin-1-ium-2-carbonyl]amino]pentanoate Chemical compound CC[C@@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 OCYROESYHWUPBP-VGMNWLOBSA-N 0.000 description 3
- RLMISHABBKUNFO-WHFBIAKZSA-N 2-[[(2S)-2-[[(2S)-2-azaniumylpropanoyl]amino]propanoyl]amino]acetate Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 3
- XUUXCWCKKCZEAW-YFKPBYRVSA-N 2-[[(2S)-2-amino-5-(diaminomethylideneamino)pentanoyl]amino]acetic acid Chemical compound OC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N XUUXCWCKKCZEAW-YFKPBYRVSA-N 0.000 description 3
- PABVKUJVLNMOJP-UHFFFAOYSA-N 4-amino-5-[(1-carboxy-2-sulfanylethyl)amino]-5-oxopentanoic acid Chemical compound OC(=O)CCC(N)C(=O)NC(CS)C(O)=O PABVKUJVLNMOJP-UHFFFAOYSA-N 0.000 description 3
- TUTIHHSZKFBMHM-UHFFFAOYSA-N 4-amino-5-[(3-amino-1-carboxy-3-oxopropyl)amino]-5-oxopentanoic acid Chemical compound OC(=O)CCC(N)C(=O)NC(CC(N)=O)C(O)=O TUTIHHSZKFBMHM-UHFFFAOYSA-N 0.000 description 3
- 101700012170 ADH5 Proteins 0.000 description 3
- 102100001249 ALB Human genes 0.000 description 3
- ZKHQWZAMYRWXGA-KQYNXXCUSA-N Adenosine triphosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O ZKHQWZAMYRWXGA-KQYNXXCUSA-N 0.000 description 3
- CXISPYVYMQWFLE-VKHMYHEASA-N Ala-Gly Chemical compound C[C@H]([NH3+])C(=O)NCC([O-])=O CXISPYVYMQWFLE-VKHMYHEASA-N 0.000 description 3
- XZWXFWBHYRFLEF-FSPLSTOPSA-N Ala-His Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 XZWXFWBHYRFLEF-FSPLSTOPSA-N 0.000 description 3
- 229940064005 Antibiotic throat preparations Drugs 0.000 description 3
- 229940083879 Antibiotics FOR TREATMENT OF HEMORRHOIDS AND ANAL FISSURES FOR TOPICAL USE Drugs 0.000 description 3
- 229940042052 Antibiotics for systemic use Drugs 0.000 description 3
- 229940042786 Antitubercular Antibiotics Drugs 0.000 description 3
- KWBQPGIYEZKDEG-FSPLSTOPSA-N Asn-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(N)=O KWBQPGIYEZKDEG-FSPLSTOPSA-N 0.000 description 3
- QJMCHPGWFZZRID-UHFFFAOYSA-N Asparaginyl-Lysine Chemical compound NCCCCC(C(O)=O)NC(=O)C(N)CC(N)=O QJMCHPGWFZZRID-UHFFFAOYSA-N 0.000 description 3
- UKGGPJNBONZZCM-WDSKDSINSA-N Aspartyl-L-proline Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O UKGGPJNBONZZCM-WDSKDSINSA-N 0.000 description 3
- 241000193830 Bacillus <bacterium> Species 0.000 description 3
- 210000004556 Brain Anatomy 0.000 description 3
- YHDXIZKDOIWPBW-UHFFFAOYSA-N Cysteinyl-Glutamine Chemical compound SCC(N)C(=O)NC(C(O)=O)CCC(N)=O YHDXIZKDOIWPBW-UHFFFAOYSA-N 0.000 description 3
- NXTYATMDWQYLGJ-UHFFFAOYSA-N Cysteinyl-Leucine Chemical compound CC(C)CC(C(O)=O)NC(=O)C(N)CS NXTYATMDWQYLGJ-UHFFFAOYSA-N 0.000 description 3
- XZFYRXDAULDNFX-UHFFFAOYSA-N Cysteinyl-Phenylalanine Chemical compound SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 3
- IQFYYKKMVGJFEH-XLPZGREQSA-N DEOXYTHYMIDINE Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 3
- 101700011961 DPOM Proteins 0.000 description 3
- 241000283073 Equus caballus Species 0.000 description 3
- 241000233866 Fungi Species 0.000 description 3
- UQHGAYSULGRWRG-WHFBIAKZSA-N Glu-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(O)=O UQHGAYSULGRWRG-WHFBIAKZSA-N 0.000 description 3
- SSHIXEILTLPAQT-UHFFFAOYSA-N Glutaminyl-Aspartate Chemical compound NC(=O)CCC(N)C(=O)NC(CC(O)=O)C(O)=O SSHIXEILTLPAQT-UHFFFAOYSA-N 0.000 description 3
- VHLZDSUANXBJHW-UHFFFAOYSA-N Glutaminyl-Phenylalanine Chemical compound NC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 VHLZDSUANXBJHW-UHFFFAOYSA-N 0.000 description 3
- 239000004471 Glycine Substances 0.000 description 3
- 229940093922 Gynecological Antibiotics Drugs 0.000 description 3
- 102000026357 HRG family Human genes 0.000 description 3
- 108091013523 HRG family Proteins 0.000 description 3
- 230000036499 Half live Effects 0.000 description 3
- 229920001681 Heteroduplex Polymers 0.000 description 3
- 241000238631 Hexapoda Species 0.000 description 3
- 210000004754 Hybrid Cells Anatomy 0.000 description 3
- 102000018358 Immunoglobulins Human genes 0.000 description 3
- 108060003951 Immunoglobulins Proteins 0.000 description 3
- QOOWRKBDDXQRHC-BQBZGAKWSA-N L-lysyl-L-alanine Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN QOOWRKBDDXQRHC-BQBZGAKWSA-N 0.000 description 3
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 3
- RNKSNIBMTUYWSH-YFKPBYRVSA-N L-prolylglycine Chemical compound [O-]C(=O)CNC(=O)[C@@H]1CCC[NH2+]1 RNKSNIBMTUYWSH-YFKPBYRVSA-N 0.000 description 3
- HIZYETOZLYFUFF-BQBZGAKWSA-N Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(O)=O HIZYETOZLYFUFF-BQBZGAKWSA-N 0.000 description 3
- VTJUNIYRYIAIHF-IUCAKERBSA-N Leu-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O VTJUNIYRYIAIHF-IUCAKERBSA-N 0.000 description 3
- LRKCBIUDWAXNEG-CSMHCCOUSA-N Leu-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRKCBIUDWAXNEG-CSMHCCOUSA-N 0.000 description 3
- OAPNERBWQWUPTI-YUMQZZPRSA-N Lys-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O OAPNERBWQWUPTI-YUMQZZPRSA-N 0.000 description 3
- YSZNURNVYFUEHC-BQBZGAKWSA-N Lys-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(O)=O YSZNURNVYFUEHC-BQBZGAKWSA-N 0.000 description 3
- 101710029649 MDV043 Proteins 0.000 description 3
- IMTUWVJPCQPJEE-IUCAKERBSA-N Met-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN IMTUWVJPCQPJEE-IUCAKERBSA-N 0.000 description 3
- WEDDFMCSUNNZJR-WDSKDSINSA-N Met-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(O)=O WEDDFMCSUNNZJR-WDSKDSINSA-N 0.000 description 3
- 241000699670 Mus sp. Species 0.000 description 3
- 108010079364 N-glycylalanine Proteins 0.000 description 3
- 206010028980 Neoplasm Diseases 0.000 description 3
- 208000007538 Neurilemmoma Diseases 0.000 description 3
- 238000000636 Northern blotting Methods 0.000 description 3
- 101700061424 POLB Proteins 0.000 description 3
- 239000002033 PVDF binder Substances 0.000 description 3
- 101700054624 RF1 Proteins 0.000 description 3
- 108010033725 Recombinant Proteins Proteins 0.000 description 3
- 102000007312 Recombinant Proteins Human genes 0.000 description 3
- 210000003079 Salivary Glands Anatomy 0.000 description 3
- 206010039667 Schwannoma Diseases 0.000 description 3
- LAFKUZYWNCHOHT-WHFBIAKZSA-N Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O LAFKUZYWNCHOHT-WHFBIAKZSA-N 0.000 description 3
- YZMPDHTZJJCGEI-BQBZGAKWSA-N Ser-His Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 YZMPDHTZJJCGEI-BQBZGAKWSA-N 0.000 description 3
- 210000000952 Spleen Anatomy 0.000 description 3
- 229940014598 TAC Drugs 0.000 description 3
- 101700038204 TGFA Proteins 0.000 description 3
- BWUHENPAEMNGQJ-ZDLURKLDSA-N Thr-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O BWUHENPAEMNGQJ-ZDLURKLDSA-N 0.000 description 3
- 229940024982 Topical Antifungal Antibiotics Drugs 0.000 description 3
- GSEJCLTVZPLZKY-UHFFFAOYSA-N Tris Chemical compound OCCN(CCO)CCO GSEJCLTVZPLZKY-UHFFFAOYSA-N 0.000 description 3
- 239000007983 Tris buffer Substances 0.000 description 3
- HRXKRNGNAMMEHJ-UHFFFAOYSA-K Trisodium citrate Chemical compound [Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O HRXKRNGNAMMEHJ-UHFFFAOYSA-K 0.000 description 3
- 101710042194 Trpgamma Proteins 0.000 description 3
- 108090000631 Trypsin Proteins 0.000 description 3
- 102000004142 Trypsin Human genes 0.000 description 3
- QJKMCQRFHJRIPU-XDTLVQLUSA-N Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QJKMCQRFHJRIPU-XDTLVQLUSA-N 0.000 description 3
- CGWAPUBOXJWXMS-HOTGVXAUSA-N Tyr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 CGWAPUBOXJWXMS-HOTGVXAUSA-N 0.000 description 3
- GVRKWABULJAONN-UHFFFAOYSA-N Valyl-Threonine Chemical compound CC(C)C(N)C(=O)NC(C(C)O)C(O)=O GVRKWABULJAONN-UHFFFAOYSA-N 0.000 description 3
- 239000000556 agonist Substances 0.000 description 3
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 3
- 229940050528 albumin Drugs 0.000 description 3
- 239000003242 anti bacterial agent Substances 0.000 description 3
- 108010008355 arginyl-glutamine Proteins 0.000 description 3
- 125000000613 asparagine group Chemical group N[C@@H](CC(N)=O)C(=O)* 0.000 description 3
- 201000008275 breast carcinoma Diseases 0.000 description 3
- 150000001718 carbodiimides Chemical class 0.000 description 3
- 235000014633 carbohydrates Nutrition 0.000 description 3
- 201000009030 carcinoma Diseases 0.000 description 3
- 239000000969 carrier Substances 0.000 description 3
- 230000021615 conjugation Effects 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 230000001276 controlling effect Effects 0.000 description 3
- 229920001577 copolymer Polymers 0.000 description 3
- 230000001808 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 108010060199 cysteinylproline Proteins 0.000 description 3
- 230000001419 dependent Effects 0.000 description 3
- 238000001212 derivatisation Methods 0.000 description 3
- 150000002019 disulfides Chemical class 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 238000001976 enzyme digestion Methods 0.000 description 3
- 150000002148 esters Chemical class 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 235000019253 formic acid Nutrition 0.000 description 3
- 238000009472 formulation Methods 0.000 description 3
- 125000000291 glutamic acid group Chemical group N[C@@H](CCC(O)=O)C(=O)* 0.000 description 3
- 125000000404 glutamine group Chemical group N[C@@H](CCC(N)=O)C(=O)* 0.000 description 3
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 3
- SXRSQZLOMIGNAQ-UHFFFAOYSA-N glutaraldehyde Chemical compound O=CCCCC=O SXRSQZLOMIGNAQ-UHFFFAOYSA-N 0.000 description 3
- 108010050848 glycylleucine Proteins 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 108010040030 histidinoalanine Proteins 0.000 description 3
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 3
- 229910052739 hydrogen Inorganic materials 0.000 description 3
- 230000001900 immune effect Effects 0.000 description 3
- 238000002649 immunization Methods 0.000 description 3
- 230000002055 immunohistochemical Effects 0.000 description 3
- 238000002347 injection Methods 0.000 description 3
- 229940079866 intestinal antibiotics Drugs 0.000 description 3
- 230000003834 intracellular Effects 0.000 description 3
- 238000005304 joining Methods 0.000 description 3
- 238000002372 labelling Methods 0.000 description 3
- 239000002502 liposome Substances 0.000 description 3
- 101700072735 lys-1 Proteins 0.000 description 3
- 230000001404 mediated Effects 0.000 description 3
- 229960000485 methotrexate Drugs 0.000 description 3
- 229910052757 nitrogen Inorganic materials 0.000 description 3
- 229920001894 non-coding RNA Polymers 0.000 description 3
- 235000015097 nutrients Nutrition 0.000 description 3
- 229940005935 ophthalmologic Antibiotics Drugs 0.000 description 3
- 230000011340 peptidyl-tyrosine autophosphorylation Effects 0.000 description 3
- 108010051242 phenylalanylserine Proteins 0.000 description 3
- 230000001402 polyadenylating Effects 0.000 description 3
- 108091008117 polyclonal antibodies Proteins 0.000 description 3
- 229920002981 polyvinylidene fluoride Polymers 0.000 description 3
- 230000001323 posttranslational Effects 0.000 description 3
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 3
- 108010077112 prolyl-proline Proteins 0.000 description 3
- 108010070643 prolylglutamic acid Proteins 0.000 description 3
- BDERNNFJNOPAEC-UHFFFAOYSA-N propanol Chemical compound CCCO BDERNNFJNOPAEC-UHFFFAOYSA-N 0.000 description 3
- 239000001509 sodium citrate Substances 0.000 description 3
- 150000008163 sugars Chemical class 0.000 description 3
- 230000001629 suppression Effects 0.000 description 3
- 238000002560 therapeutic procedure Methods 0.000 description 3
- 125000000341 threoninyl group Chemical group [H]OC([H])(C([H])([H])[H])C([H])(N([H])[H])C(*)=O 0.000 description 3
- 108010061238 threonyl-glycine Proteins 0.000 description 3
- 230000005030 transcription termination Effects 0.000 description 3
- 239000011778 trisodium citrate Substances 0.000 description 3
- 229960001322 trypsin Drugs 0.000 description 3
- 239000012588 trypsin Substances 0.000 description 3
- 210000004881 tumor cells Anatomy 0.000 description 3
- 108010051110 tyrosyl-lysine Proteins 0.000 description 3
- 241000701161 unidentified adenovirus Species 0.000 description 3
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N (2S)-2-[[(2S)-2-[[(2S)-2-aminopropanoyl]amino]propanoyl]amino]-3-hydroxypropanoic acid Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- HVYWMOMLDIMFJA-DPAQBDIFSA-N (3β)-Cholest-5-en-3-ol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 2
- KTGFOCFYOZQVRJ-UHFFFAOYSA-N 2-[(2-amino-3-methylpentanoyl)amino]pentanedioic acid Chemical compound CCC(C)C(N)C(=O)NC(C(O)=O)CCC(O)=O KTGFOCFYOZQVRJ-UHFFFAOYSA-N 0.000 description 2
- 108020005345 3' Untranslated Regions Proteins 0.000 description 2
- KYHPMCZXXRPOKS-ZJWYQBPBSA-N 3,7-dihydropurin-6-one;2-hydrazinyl-1H-pteridin-4-one;1-[(2R,4S,5R)-4-hydroxy-5-(hydroxymethyl)oxolan-2-yl]-5-methylpyrimidine-2,4-dione Chemical compound O=C1NC=NC2=C1NC=N2.C1=CN=C2C(=O)NC(NN)=NC2=N1.O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 KYHPMCZXXRPOKS-ZJWYQBPBSA-N 0.000 description 2
- OSJPPGNTCRNQQC-UWTATZPHSA-N 3-Phosphoglyceric acid Chemical compound OC(=O)[C@H](O)COP(O)(O)=O OSJPPGNTCRNQQC-UWTATZPHSA-N 0.000 description 2
- 102000013563 Acid Phosphatase Human genes 0.000 description 2
- 108010051457 Acid Phosphatase Proteins 0.000 description 2
- ZSOICJZJSRWNHX-ACZMJKKPSA-N Ala-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)[C@H](C)[NH3+] ZSOICJZJSRWNHX-ACZMJKKPSA-N 0.000 description 2
- RDIKFPRVLJLMER-BQBZGAKWSA-N Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)N RDIKFPRVLJLMER-BQBZGAKWSA-N 0.000 description 2
- WPWUFUBLGADILS-WDSKDSINSA-N Ala-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WPWUFUBLGADILS-WDSKDSINSA-N 0.000 description 2
- IPWKGIFRRBGCJO-IMJSIDKUSA-N Ala-Ser Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](CO)C([O-])=O IPWKGIFRRBGCJO-IMJSIDKUSA-N 0.000 description 2
- AVKUERGKIZMTKX-NJBDSQKTSA-N Ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 2
- PMGDADKJMCOXHX-BQBZGAKWSA-N Arg-Gln Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PMGDADKJMCOXHX-BQBZGAKWSA-N 0.000 description 2
- XTWSWDJMIKUJDQ-RYUDHWBXSA-N Arg-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XTWSWDJMIKUJDQ-RYUDHWBXSA-N 0.000 description 2
- BNODVYXZAAXSHW-UHFFFAOYSA-N Arginyl-Histidine Chemical compound NC(=N)NCCCC(N)C(=O)NC(C(O)=O)CC1=CN=CN1 BNODVYXZAAXSHW-UHFFFAOYSA-N 0.000 description 2
- XNSKSTRGQIPTSE-UHFFFAOYSA-N Arginyl-Threonine Chemical compound CC(O)C(C(O)=O)NC(=O)C(N)CCCNC(N)=N XNSKSTRGQIPTSE-UHFFFAOYSA-N 0.000 description 2
- FYRVDDJMNISIKJ-UWVGGRQHSA-N Asn-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FYRVDDJMNISIKJ-UWVGGRQHSA-N 0.000 description 2
- FRYULLIZUDQONW-IMJSIDKUSA-N Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O FRYULLIZUDQONW-IMJSIDKUSA-N 0.000 description 2
- CKAJHWFHHFSCDT-WHFBIAKZSA-N Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O CKAJHWFHHFSCDT-WHFBIAKZSA-N 0.000 description 2
- YZQCXOFQZKCETR-UWVGGRQHSA-N Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YZQCXOFQZKCETR-UWVGGRQHSA-N 0.000 description 2
- SJUXYGVRSGTPMC-UHFFFAOYSA-N Asparaginyl-Alanine Chemical compound OC(=O)C(C)NC(=O)C(N)CC(N)=O SJUXYGVRSGTPMC-UHFFFAOYSA-N 0.000 description 2
- 241000271566 Aves Species 0.000 description 2
- 108090001008 Avidin Proteins 0.000 description 2
- 102100015648 BACE2 Human genes 0.000 description 2
- 101700051947 BACE2 Proteins 0.000 description 2
- 241000283690 Bos taurus Species 0.000 description 2
- 241000701822 Bovine papillomavirus Species 0.000 description 2
- 210000000481 Breast Anatomy 0.000 description 2
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 2
- 101700060761 COS7 Proteins 0.000 description 2
- 241000282465 Canis Species 0.000 description 2
- 208000005623 Carcinogenesis Diseases 0.000 description 2
- 210000000170 Cell Membrane Anatomy 0.000 description 2
- 210000000349 Chromosomes Anatomy 0.000 description 2
- 229920002759 Circular DNA Polymers 0.000 description 2
- 108020004638 Circular DNA Proteins 0.000 description 2
- NKLPQNGYXWVELD-UHFFFAOYSA-M Coomassie Brilliant Blue Chemical compound [Na+].C1=CC(OCC)=CC=C1NC1=CC=C(C(=C2C=CC(C=C2)=[N+](CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=2C=CC(=CC=2)N(CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=C1 NKLPQNGYXWVELD-UHFFFAOYSA-M 0.000 description 2
- 108091006941 Cullin Ring E3 Ligases Proteins 0.000 description 2
- RGTVXXNMOGHRAY-UHFFFAOYSA-N Cysteinyl-Arginine Chemical compound SCC(N)C(=O)NC(C(O)=O)CCCNC(N)=N RGTVXXNMOGHRAY-UHFFFAOYSA-N 0.000 description 2
- LVNMAAGSAUGNIC-UHFFFAOYSA-N Cysteinyl-Histidine Chemical compound SCC(N)C(=O)NC(C(O)=O)CC1=CN=CN1 LVNMAAGSAUGNIC-UHFFFAOYSA-N 0.000 description 2
- GZCGUPFRVQAUEE-KCDKBNATSA-N D-(+)-Galactose Natural products OC[C@@H](O)[C@H](O)[C@H](O)[C@@H](O)C=O GZCGUPFRVQAUEE-KCDKBNATSA-N 0.000 description 2
- 230000004544 DNA amplification Effects 0.000 description 2
- 239000003298 DNA probe Substances 0.000 description 2
- 108010016626 Dipeptides Proteins 0.000 description 2
- 241000255925 Diptera Species 0.000 description 2
- 102000005731 EC 5.3.1.9 Human genes 0.000 description 2
- 108010070600 EC 5.3.1.9 Proteins 0.000 description 2
- 102100010782 EGFR Human genes 0.000 description 2
- 108010042407 Endonucleases Proteins 0.000 description 2
- 241000206602 Eukaryota Species 0.000 description 2
- 230000035693 Fab Effects 0.000 description 2
- 101710042240 GLUL Proteins 0.000 description 2
- 241000287828 Gallus gallus Species 0.000 description 2
- LOJYQMFIIJVETK-WDSKDSINSA-N Gln-Gln Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LOJYQMFIIJVETK-WDSKDSINSA-N 0.000 description 2
- JZDHUJAFXGNDSB-WHFBIAKZSA-N Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O JZDHUJAFXGNDSB-WHFBIAKZSA-N 0.000 description 2
- KOSRFJWDECSPRO-WDSKDSINSA-N Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O KOSRFJWDECSPRO-WDSKDSINSA-N 0.000 description 2
- SXGAGTVDWKQYCX-BQBZGAKWSA-N Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SXGAGTVDWKQYCX-BQBZGAKWSA-N 0.000 description 2
- JSIQVRIXMINMTA-ZDLURKLDSA-N Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O JSIQVRIXMINMTA-ZDLURKLDSA-N 0.000 description 2
- KGVHCTWYMPWEGN-FSPLSTOPSA-N Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CN KGVHCTWYMPWEGN-FSPLSTOPSA-N 0.000 description 2
- BCCRXDTUTZHDEU-VKHMYHEASA-N Gly-Ser Chemical compound NCC(=O)N[C@@H](CO)C(O)=O BCCRXDTUTZHDEU-VKHMYHEASA-N 0.000 description 2
- XBGGUPMXALFZOT-VIFPVBQESA-N Gly-Tyr Chemical compound NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-VIFPVBQESA-N 0.000 description 2
- 102000006602 Glyceraldehyde-3-Phosphate Dehydrogenases Human genes 0.000 description 2
- 108020004445 Glyceraldehyde-3-Phosphate Dehydrogenases Proteins 0.000 description 2
- 240000007842 Glycine max Species 0.000 description 2
- 235000010469 Glycine max Nutrition 0.000 description 2
- 101700075868 HER1 Proteins 0.000 description 2
- MMFKFJORZBJVNF-UWVGGRQHSA-N His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 MMFKFJORZBJVNF-UWVGGRQHSA-N 0.000 description 2
- LNCFUHAPNTYMJB-IUCAKERBSA-N His-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CN=CN1 LNCFUHAPNTYMJB-IUCAKERBSA-N 0.000 description 2
- KRBMQYPTDYSENE-BQBZGAKWSA-N His-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 KRBMQYPTDYSENE-BQBZGAKWSA-N 0.000 description 2
- HTOOKGDPMXSJSY-STQMWFEESA-N His-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 HTOOKGDPMXSJSY-STQMWFEESA-N 0.000 description 2
- IDXZDKMBEXLFMB-UHFFFAOYSA-N Histidinyl-Isoleucine Chemical compound CCC(C)C(C(O)=O)NC(=O)C(N)CC1=CN=CN1 IDXZDKMBEXLFMB-UHFFFAOYSA-N 0.000 description 2
- WRPDZHJNLYNFFT-UHFFFAOYSA-N Histidinyl-Threonine Chemical compound CC(O)C(C(O)=O)NC(=O)C(N)CC1=CN=CN1 WRPDZHJNLYNFFT-UHFFFAOYSA-N 0.000 description 2
- 241000701044 Human gammaherpesvirus 4 Species 0.000 description 2
- JWBXCSQZLLIOCI-GUBZILKMSA-N Ile-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(C)C JWBXCSQZLLIOCI-GUBZILKMSA-N 0.000 description 2
- BCXBIONYYJCSDF-CIUDSAMLSA-N Ile-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(O)=O BCXBIONYYJCSDF-CIUDSAMLSA-N 0.000 description 2
- 102000004877 Insulin Human genes 0.000 description 2
- 108090001061 Insulin Proteins 0.000 description 2
- 229920002459 Intron Polymers 0.000 description 2
- UWBDLNOCIDGPQE-UHFFFAOYSA-N Isoleucyl-Lysine Chemical compound CCC(C)C(N)C(=O)NC(C(O)=O)CCCCN UWBDLNOCIDGPQE-UHFFFAOYSA-N 0.000 description 2
- 244000285963 Kluyveromyces fragilis Species 0.000 description 2
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 2
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 2
- 102100005410 LINE-1 retrotransposable element ORF2 protein Human genes 0.000 description 2
- 108060004448 LPP Proteins 0.000 description 2
- SENJXOPIZNYLHU-IUCAKERBSA-N Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-IUCAKERBSA-N 0.000 description 2
- XWOBNBRUDDUEEY-UWVGGRQHSA-N Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XWOBNBRUDDUEEY-UWVGGRQHSA-N 0.000 description 2
- 108090000364 Ligases Proteins 0.000 description 2
- 102000003960 Ligases Human genes 0.000 description 2
- 210000004072 Lung Anatomy 0.000 description 2
- HGNRJCINZYHNOU-LURJTMIESA-N Lys-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(O)=O HGNRJCINZYHNOU-LURJTMIESA-N 0.000 description 2
- GUBGYTABKSRVRQ-YOLKTULGSA-N Maltose Natural products O([C@@H]1[C@H](O)[C@@H](O)[C@H](O)O[C@H]1CO)[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 GUBGYTABKSRVRQ-YOLKTULGSA-N 0.000 description 2
- 210000002027 Muscle, Skeletal Anatomy 0.000 description 2
- 229960000951 Mycophenolic Acid Drugs 0.000 description 2
- NQTADLQHYWFPDB-UHFFFAOYSA-N N-hydroxy-Succinimide Chemical class ON1C(=O)CCC1=O NQTADLQHYWFPDB-UHFFFAOYSA-N 0.000 description 2
- 101700080605 NUC1 Proteins 0.000 description 2
- 206010029260 Neuroblastoma Diseases 0.000 description 2
- 230000004989 O-glycosylation Effects 0.000 description 2
- 108020005203 Oxidases Proteins 0.000 description 2
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N PMSF Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 2
- 210000000496 Pancreas Anatomy 0.000 description 2
- 210000001322 Periplasm Anatomy 0.000 description 2
- OZILORBBPKKGRI-RYUDHWBXSA-N Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 OZILORBBPKKGRI-RYUDHWBXSA-N 0.000 description 2
- PYOHODCEOHCZBM-RYUDHWBXSA-N Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 PYOHODCEOHCZBM-RYUDHWBXSA-N 0.000 description 2
- NYQBYASWHVRESG-MIMYLULJSA-N Phe-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 NYQBYASWHVRESG-MIMYLULJSA-N 0.000 description 2
- KNPVDQMEHSCAGX-UHFFFAOYSA-N Phenylalanyl-Cysteine Chemical compound SCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 KNPVDQMEHSCAGX-UHFFFAOYSA-N 0.000 description 2
- 210000002826 Placenta Anatomy 0.000 description 2
- 210000002381 Plasma Anatomy 0.000 description 2
- 206010035226 Plasma cell myeloma Diseases 0.000 description 2
- 102000010780 Platelet-Derived Growth Factor Human genes 0.000 description 2
- 108010038512 Platelet-Derived Growth Factor Proteins 0.000 description 2
- 108010004434 Primatone RL Proteins 0.000 description 2
- SHAQGFGGJSLLHE-BQBZGAKWSA-N Pro-Gln Chemical compound NC(=O)CC[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 SHAQGFGGJSLLHE-BQBZGAKWSA-N 0.000 description 2
- ZKQOUHVVXABNDG-IUCAKERBSA-N Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 ZKQOUHVVXABNDG-IUCAKERBSA-N 0.000 description 2
- 108010078762 Protein Precursors Proteins 0.000 description 2
- 102000014961 Protein Precursors Human genes 0.000 description 2
- 241000589516 Pseudomonas Species 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- 108020005091 Replication Origin Proteins 0.000 description 2
- 241000283984 Rodentia Species 0.000 description 2
- 101710042981 SHMT1 Proteins 0.000 description 2
- 101710039870 SLC46A1 Proteins 0.000 description 2
- PBUXMVYWOSKHMF-WDSKDSINSA-N Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CO PBUXMVYWOSKHMF-WDSKDSINSA-N 0.000 description 2
- PPQRSMGDOHLTBE-UWVGGRQHSA-N Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PPQRSMGDOHLTBE-UWVGGRQHSA-N 0.000 description 2
- WBAXJMCUFIXCNI-WDSKDSINSA-N Ser-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WBAXJMCUFIXCNI-WDSKDSINSA-N 0.000 description 2
- FFOKMZOAVHEWET-UHFFFAOYSA-N Serinyl-Cysteine Chemical compound OCC(N)C(=O)NC(CS)C(O)=O FFOKMZOAVHEWET-UHFFFAOYSA-N 0.000 description 2
- BXLYSRPHVMCOPS-UHFFFAOYSA-N Serinyl-Isoleucine Chemical compound CCC(C)C(C(O)=O)NC(=O)C(N)CO BXLYSRPHVMCOPS-UHFFFAOYSA-N 0.000 description 2
- LDEBVRIURYMKQS-UHFFFAOYSA-N Serinyl-Threonine Chemical compound CC(O)C(C(O)=O)NC(=O)C(N)CO LDEBVRIURYMKQS-UHFFFAOYSA-N 0.000 description 2
- 108020004682 Single-Stranded DNA Proteins 0.000 description 2
- 238000002105 Southern blotting Methods 0.000 description 2
- 229920000978 Start codon Polymers 0.000 description 2
- 210000002784 Stomach Anatomy 0.000 description 2
- 210000001550 Testis Anatomy 0.000 description 2
- 229960002180 Tetracycline Drugs 0.000 description 2
- OFVLGDICTFRJMM-WESIUVDSSA-N Tetracycline Chemical compound C1=CC=C2[C@](O)(C)[C@H]3C[C@H]4[C@H](N(C)C)C(O)=C(C(N)=O)C(=O)[C@@]4(O)C(O)=C3C(=O)C2=C1O OFVLGDICTFRJMM-WESIUVDSSA-N 0.000 description 2
- 239000004098 Tetracycline Substances 0.000 description 2
- NYTOUQBROMCLBJ-UHFFFAOYSA-N Tetranitromethane Chemical compound [O-][N+](=O)C([N+]([O-])=O)([N+]([O-])=O)[N+]([O-])=O NYTOUQBROMCLBJ-UHFFFAOYSA-N 0.000 description 2
- BECPPKYKPSRKCP-ZDLURKLDSA-N Thr-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O BECPPKYKPSRKCP-ZDLURKLDSA-N 0.000 description 2
- IQHUITKNHOKGFC-MIMYLULJSA-N Thr-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IQHUITKNHOKGFC-MIMYLULJSA-N 0.000 description 2
- GXDLGHLJTHMDII-WISUUJSJSA-N Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(O)=O GXDLGHLJTHMDII-WISUUJSJSA-N 0.000 description 2
- WCRFXRIWBFRZBR-GGVZMXCHSA-N Thr-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WCRFXRIWBFRZBR-GGVZMXCHSA-N 0.000 description 2
- 229920000401 Three prime untranslated region Polymers 0.000 description 2
- 102000006601 Thymidine Kinase Human genes 0.000 description 2
- 108020004440 Thymidine Kinase Proteins 0.000 description 2
- 231100000765 Toxin Toxicity 0.000 description 2
- 102000006747 Transforming growth factor alpha Human genes 0.000 description 2
- ZHSGGJXRNHWHRS-VIDYELAYSA-N Tunicamycin Chemical compound O([C@H]1[C@@H]([C@H]([C@@H](O)[C@@H](CC(O)[C@@H]2[C@H]([C@@H](O)[C@@H](O2)N2C(NC(=O)C=C2)=O)O)O1)O)NC(=O)/C=C/CC(C)C)[C@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1NC(C)=O ZHSGGJXRNHWHRS-VIDYELAYSA-N 0.000 description 2
- UBAQSAUDKMIEQZ-QWRGUYRKSA-N Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UBAQSAUDKMIEQZ-QWRGUYRKSA-N 0.000 description 2
- PDSLRCZINIDLMU-QWRGUYRKSA-N Tyr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PDSLRCZINIDLMU-QWRGUYRKSA-N 0.000 description 2
- AOLHUMAVONBBEZ-STQMWFEESA-N Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AOLHUMAVONBBEZ-STQMWFEESA-N 0.000 description 2
- 210000004291 Uterus Anatomy 0.000 description 2
- 240000004652 Vaccinium ovalifolium Species 0.000 description 2
- UPJONISHZRADBH-XPUUQOCRSA-N Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UPJONISHZRADBH-XPUUQOCRSA-N 0.000 description 2
- PNVLWFYAPWAQMU-CIUDSAMLSA-N Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)C(C)C PNVLWFYAPWAQMU-CIUDSAMLSA-N 0.000 description 2
- GJNDXQBALKCYSZ-RYUDHWBXSA-N Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 GJNDXQBALKCYSZ-RYUDHWBXSA-N 0.000 description 2
- SRBFZHDQGSBBOR-SQOUGZDYSA-N Xylose Natural products O[C@@H]1CO[C@@H](O)[C@@H](O)[C@@H]1O SRBFZHDQGSBBOR-SQOUGZDYSA-N 0.000 description 2
- 230000002159 abnormal effect Effects 0.000 description 2
- 230000002378 acidificating Effects 0.000 description 2
- 230000004913 activation Effects 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- 150000001412 amines Chemical class 0.000 description 2
- 229960000723 ampicillin Drugs 0.000 description 2
- 101700005582 apaH Proteins 0.000 description 2
- 101710036446 aph Proteins 0.000 description 2
- 101710036445 aphA-6 Proteins 0.000 description 2
- 101710036449 aphA-7 Proteins 0.000 description 2
- 101710036447 aphA1 Proteins 0.000 description 2
- 101710036452 aphA4 Proteins 0.000 description 2
- 125000003118 aryl group Chemical group 0.000 description 2
- 101700065210 asp1 Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 239000007640 basal medium Substances 0.000 description 2
- 230000001588 bifunctional Effects 0.000 description 2
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- 229960002685 biotin Drugs 0.000 description 2
- 235000020958 biotin Nutrition 0.000 description 2
- 239000011616 biotin Substances 0.000 description 2
- 230000024881 catalytic activity Effects 0.000 description 2
- 239000006143 cell culture media Substances 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 230000002759 chromosomal Effects 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 239000000356 contaminant Substances 0.000 description 2
- 239000003431 cross linking reagent Substances 0.000 description 2
- 239000012228 culture supernatant Substances 0.000 description 2
- VOLSCWDWGMWXGO-UHFFFAOYSA-N cyclobuten-1-yl acetate Chemical compound CC(=O)OC1=CCC1 VOLSCWDWGMWXGO-UHFFFAOYSA-N 0.000 description 2
- 230000002950 deficient Effects 0.000 description 2
- 239000005547 deoxyribonucleotide Substances 0.000 description 2
- 238000006209 dephosphorylation reaction Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 238000003745 diagnosis Methods 0.000 description 2
- 230000004069 differentiation Effects 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N edta Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 239000005038 ethylene vinyl acetate Substances 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 230000001605 fetal Effects 0.000 description 2
- 210000002950 fibroblast Anatomy 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 101700034105 fmrf Proteins 0.000 description 2
- 238000005755 formation reaction Methods 0.000 description 2
- 238000004108 freeze drying Methods 0.000 description 2
- 239000007789 gas Substances 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 230000002068 genetic Effects 0.000 description 2
- 108010049041 glutamylalanine Proteins 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- VPZXBVLAVMBEQI-VKHMYHEASA-N gly ala Chemical compound OC(=O)[C@H](C)NC(=O)CN VPZXBVLAVMBEQI-VKHMYHEASA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine zwitterion Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 2
- 239000011544 gradient gel Substances 0.000 description 2
- 238000003306 harvesting Methods 0.000 description 2
- 125000003372 histidine group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 238000002744 homologous recombination Methods 0.000 description 2
- 101500012147 human Neuregulin-1 Proteins 0.000 description 2
- 239000000017 hydrogel Substances 0.000 description 2
- MHAJPDPJQMAIIY-UHFFFAOYSA-N hydrogen peroxide Chemical compound OO MHAJPDPJQMAIIY-UHFFFAOYSA-N 0.000 description 2
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 2
- 150000002463 imido esters Chemical class 0.000 description 2
- 238000007901 in situ hybridization Methods 0.000 description 2
- 238000001802 infusion Methods 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 239000000543 intermediate Substances 0.000 description 2
- 238000007912 intraperitoneal administration Methods 0.000 description 2
- 238000001990 intravenous administration Methods 0.000 description 2
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 108010091871 leucylmethionine Proteins 0.000 description 2
- RGLRXNKKBLIBQS-XNHQSDQCSA-N leuprolide acetate Chemical compound CC(O)=O.CCNC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC=1N=CNC=1)NC(=O)[C@H]1NC(=O)CC1)CC1=CC=C(O)C=C1 RGLRXNKKBLIBQS-XNHQSDQCSA-N 0.000 description 2
- 101710030587 ligN Proteins 0.000 description 2
- 101700077585 ligd Proteins 0.000 description 2
- 239000006166 lysate Substances 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 150000002772 monosaccharides Chemical class 0.000 description 2
- HPNSFSBZBAHARI-RUDMXATFSA-N mycophenolic acid Chemical compound OC1=C(C\C=C(/C)CCC(O)=O)C(OC)=C(C)C2=C1C(=O)OC2 HPNSFSBZBAHARI-RUDMXATFSA-N 0.000 description 2
- 201000000050 myeloid neoplasm Diseases 0.000 description 2
- 101710036448 neo Proteins 0.000 description 2
- 230000001264 neutralization Effects 0.000 description 2
- 101700006494 nucA Proteins 0.000 description 2
- 230000002018 overexpression Effects 0.000 description 2
- 238000005191 phase separation Methods 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- 108010018625 phenylalanylarginine Proteins 0.000 description 2
- 238000003566 phosphorylation assay Methods 0.000 description 2
- 229920001200 poly(ethylene-vinyl acetate) Polymers 0.000 description 2
- 229920002401 polyacrylamide Polymers 0.000 description 2
- 230000003389 potentiating Effects 0.000 description 2
- 230000002265 prevention Effects 0.000 description 2
- QLROSWPKSBORFJ-BQBZGAKWSA-N pro glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 QLROSWPKSBORFJ-BQBZGAKWSA-N 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 230000001737 promoting Effects 0.000 description 2
- 229960005335 propanol Drugs 0.000 description 2
- RADKZDMFGJYCBB-UHFFFAOYSA-N pyridoxal Chemical compound CC1=NC=C(CO)C(C=O)=C1O RADKZDMFGJYCBB-UHFFFAOYSA-N 0.000 description 2
- 230000002285 radioactive Effects 0.000 description 2
- 238000003127 radioimmunoassay Methods 0.000 description 2
- 238000003259 recombinant expression Methods 0.000 description 2
- 230000002829 reduced Effects 0.000 description 2
- 230000003362 replicative Effects 0.000 description 2
- 229920005989 resin Polymers 0.000 description 2
- 239000011347 resin Substances 0.000 description 2
- 101710036430 rph Proteins 0.000 description 2
- 230000003248 secreting Effects 0.000 description 2
- 230000035945 sensitivity Effects 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 230000019491 signal transduction Effects 0.000 description 2
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 2
- CDBYLPFSWZWCQE-UHFFFAOYSA-L sodium carbonate Chemical compound [Na+].[Na+].[O-]C([O-])=O CDBYLPFSWZWCQE-UHFFFAOYSA-L 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 239000007858 starting material Substances 0.000 description 2
- 101710011036 stiI Proteins 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 235000019364 tetracycline Nutrition 0.000 description 2
- 125000003396 thiol group Chemical group [H]S* 0.000 description 2
- 239000003053 toxin Substances 0.000 description 2
- 108020003112 toxins Proteins 0.000 description 2
- 239000001226 triphosphate Substances 0.000 description 2
- 235000011178 triphosphate Nutrition 0.000 description 2
- ORHBXUUXSCNDEV-UHFFFAOYSA-N umbelliferone Chemical compound C1=CC(=O)OC2=CC(O)=CC=C21 ORHBXUUXSCNDEV-UHFFFAOYSA-N 0.000 description 2
- 241001515965 unidentified phage Species 0.000 description 2
- 241001430294 unidentified retrovirus Species 0.000 description 2
- 239000003981 vehicle Substances 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- WQZGKKKJIJFFOK-PHYPRBDBSA-N α-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 description 2
- UKKNTTCNGZLJEX-UHFFFAOYSA-N γ-glutamyl-Serine Chemical compound NC(=O)CCC(N)C(=O)NC(CO)C(O)=O UKKNTTCNGZLJEX-UHFFFAOYSA-N 0.000 description 2
- HMFAQQIORZDPJG-UHFFFAOYSA-L (2-chloroacetyl) phosphate Chemical compound [O-]P([O-])(=O)OC(=O)CCl HMFAQQIORZDPJG-UHFFFAOYSA-L 0.000 description 1
- OABOXRPGTFRBFZ-IMJSIDKUSA-N (2R)-2-[[(2R)-2-amino-3-sulfanylpropanoyl]amino]-3-sulfanylpropanoic acid Chemical compound SC[C@H](N)C(=O)N[C@@H](CS)C(O)=O OABOXRPGTFRBFZ-IMJSIDKUSA-N 0.000 description 1
- BRZYSWJRSDMWLG-DJWUNRQOSA-N (2R,3R,4R,5R)-2-[(1S,2S,3R,4S,6R)-4,6-diamino-3-[(2S,3R,4R,5S,6R)-3-amino-4,5-dihydroxy-6-[(1R)-1-hydroxyethyl]oxan-2-yl]oxy-2-hydroxycyclohexyl]oxy-5-methyl-4-(methylamino)oxane-3,5-diol Chemical compound O1C[C@@](O)(C)[C@H](NC)[C@@H](O)[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H]([C@@H](C)O)O2)N)[C@@H](N)C[C@H]1N BRZYSWJRSDMWLG-DJWUNRQOSA-N 0.000 description 1
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N (2S)-1-[(2S)-1-[(2S)-2-amino-3-phenylpropanoyl]pyrrolidine-2-carbonyl]pyrrolidine-2-carboxylic acid Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N (2S)-1-[2-[[(2S)-2-azaniumyl-4-methylpentanoyl]amino]acetyl]pyrrolidine-2-carboxylate Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- FAQVCWVVIYYWRR-WHFBIAKZSA-N (2S)-2-[[(2S)-2,5-diamino-5-oxopentanoyl]amino]propanoic acid Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O FAQVCWVVIYYWRR-WHFBIAKZSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N (2S)-2-[[(2S)-2-[[(2S)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]amino]-3-hydroxypropanoic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- XMQUEQJCYRFIQS-YFKPBYRVSA-N (2S)-2-amino-5-ethoxy-5-oxopentanoic acid Chemical compound CCOC(=O)CC[C@H](N)C(O)=O XMQUEQJCYRFIQS-YFKPBYRVSA-N 0.000 description 1
- KYBXNPIASYUWLN-WUCPZUCCSA-N (2S)-5-hydroxypyrrolidine-2-carboxylic acid Chemical compound OC1CC[C@@H](C(O)=O)N1 KYBXNPIASYUWLN-WUCPZUCCSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N (2S,3S)-2-[[(2S)-2-[[(2S)-2-amino-4-carboxybutanoyl]amino]-4-carboxybutanoyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N (3S)-3-[[(2S)-2-aminopropanoyl]amino]-4-[[(1S)-1-carboxy-2-hydroxyethyl]amino]-4-oxobutanoic acid Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- NBCQBGBECVUZMK-UHFFFAOYSA-N (4-carboxyphenyl)mercury;hydrate Chemical compound O.OC(=O)C1=CC=C([Hg])C=C1 NBCQBGBECVUZMK-UHFFFAOYSA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N (4S)-4-amino-5-[[(2S,3S)-1-[[(1S)-1-carboxy-2-methylpropyl]amino]-3-methyl-1-oxopentan-2-yl]amino]-5-oxopentanoic acid Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- CHADEQDQBURGHL-UHFFFAOYSA-N (6'-acetyloxy-3-oxospiro[2-benzofuran-1,9'-xanthene]-3'-yl) acetate Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(OC(C)=O)C=C1OC1=CC(OC(=O)C)=CC=C21 CHADEQDQBURGHL-UHFFFAOYSA-N 0.000 description 1
- 229940044613 1-Propanol Drugs 0.000 description 1
- 125000003287 1H-imidazol-4-ylmethyl group Chemical group [H]N1C([H])=NC(C([H])([H])[*])=C1[H] 0.000 description 1
- KGLPWQKSKUVKMJ-UHFFFAOYSA-N 2,3-dihydrophthalazine-1,4-dione Chemical class C1=CC=C2C(=O)NNC(=O)C2=C1 KGLPWQKSKUVKMJ-UHFFFAOYSA-N 0.000 description 1
- NHJVRSWLHSJWIN-UHFFFAOYSA-N 2,4,6-Trinitrobenzenesulfonic acid Chemical compound OS(=O)(=O)C1=C([N+]([O-])=O)C=C([N+]([O-])=O)C=C1[N+]([O-])=O NHJVRSWLHSJWIN-UHFFFAOYSA-N 0.000 description 1
- 150000003923 2,5-pyrrolediones Chemical class 0.000 description 1
- BHANCCMWYDZQOR-UHFFFAOYSA-N 2-(methyldisulfanyl)pyridine Chemical compound CSSC1=CC=CC=N1 BHANCCMWYDZQOR-UHFFFAOYSA-N 0.000 description 1
- 125000000979 2-amino-2-oxoethyl group Chemical group [H]C([*])([H])C(=O)N([H])[H] 0.000 description 1
- FKJSFKCZZIXQIP-UHFFFAOYSA-N 2-bromo-1-(4-bromophenyl)ethanone Chemical compound BrCC(=O)C1=CC=C(Br)C=C1 FKJSFKCZZIXQIP-UHFFFAOYSA-N 0.000 description 1
- 101700002664 26KD Proteins 0.000 description 1
- ALRHLSYJTWAHJZ-UHFFFAOYSA-N 3-Hydroxypropionic acid Chemical compound OCCC(O)=O ALRHLSYJTWAHJZ-UHFFFAOYSA-N 0.000 description 1
- KMEVEXPZPXFKIZ-UHFFFAOYSA-L 3-[[2-carboxylato-2-(2,5-dioxopyrrolidin-1-yl)ethyl]disulfanyl]-2-(2,5-dioxopyrrolidin-1-yl)propanoate Chemical compound O=C1CCC(=O)N1C(C(=O)[O-])CSSCC(C([O-])=O)N1C(=O)CCC1=O KMEVEXPZPXFKIZ-UHFFFAOYSA-L 0.000 description 1
- 125000003974 3-carbamimidamidopropyl group Chemical group C(N)(=N)NCCC* 0.000 description 1
- QHSXWDVVFHXHHB-UHFFFAOYSA-N 3-nitro-2-[(3-nitropyridin-2-yl)disulfanyl]pyridine Chemical compound [O-][N+](=O)C1=CC=CN=C1SSC1=NC=CC=C1[N+]([O-])=O QHSXWDVVFHXHHB-UHFFFAOYSA-N 0.000 description 1
- HJBUBXIDMQBSQW-UHFFFAOYSA-N 4-(4-diazoniophenyl)benzenediazonium Chemical compound C1=CC([N+]#N)=CC=C1C1=CC=C([N+]#N)C=C1 HJBUBXIDMQBSQW-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N 4-amino-5-[(1-carboxy-2-phenylethyl)amino]-5-oxopentanoic acid Chemical compound OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- HKTRDWYCAUTRRL-UHFFFAOYSA-N 4-amino-5-[[1-carboxy-2-(1H-imidazol-5-yl)ethyl]amino]-5-oxopentanoic acid Chemical compound OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CN=CN1 HKTRDWYCAUTRRL-UHFFFAOYSA-N 0.000 description 1
- 125000004042 4-aminobutyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])N([H])[H] 0.000 description 1
- NLPWSMKACWGINL-UHFFFAOYSA-N 4-azido-2-hydroxybenzoic acid Chemical compound OC(=O)C1=CC=C(N=[N+]=[N-])C=C1O NLPWSMKACWGINL-UHFFFAOYSA-N 0.000 description 1
- OIRDTQYFTABQOQ-GAWUUDPSSA-N 9-β-D-XYLOFURANOSYL-ADENINE Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@H](O)[C@H]1O OIRDTQYFTABQOQ-GAWUUDPSSA-N 0.000 description 1
- 101710006356 ACTI Proteins 0.000 description 1
- 101700080270 ADH2 Proteins 0.000 description 1
- 101700033574 ADH6 Proteins 0.000 description 1
- 101700013392 ASI1 Proteins 0.000 description 1
- 101700078948 ASP4 Proteins 0.000 description 1
- YRKCREAYFQTBPV-UHFFFAOYSA-N Acetylacetone Chemical compound CC(=O)CC(C)=O YRKCREAYFQTBPV-UHFFFAOYSA-N 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- OIRDTQYFTABQOQ-SXVXDFOESA-N Adenosine Natural products Nc1ncnc2c1ncn2[C@@H]3O[C@@H](CO)[C@H](O)[C@@H]3O OIRDTQYFTABQOQ-SXVXDFOESA-N 0.000 description 1
- 241000256118 Aedes aegypti Species 0.000 description 1
- 241000256173 Aedes albopictus Species 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 1
- OMNVYXHOSHNURL-WPRPVWTQSA-N Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OMNVYXHOSHNURL-WPRPVWTQSA-N 0.000 description 1
- BUQICHWNXBIBOG-LMVFSUKVSA-N Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)N BUQICHWNXBIBOG-LMVFSUKVSA-N 0.000 description 1
- ALZVPLKYDKJKQU-XVKPBYJWSA-N Ala-Tyr Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ALZVPLKYDKJKQU-XVKPBYJWSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N Alanyl-Arginine Chemical compound CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N Ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 108020004491 Antisense DNA Proteins 0.000 description 1
- OMLWNBVRVJYMBQ-YUMQZZPRSA-N Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OMLWNBVRVJYMBQ-YUMQZZPRSA-N 0.000 description 1
- QYLJIYOGHRGUIH-CIUDSAMLSA-N Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCNC(N)=N QYLJIYOGHRGUIH-CIUDSAMLSA-N 0.000 description 1
- PQBHGSGQZSOLIR-RYUDHWBXSA-N Arg-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PQBHGSGQZSOLIR-RYUDHWBXSA-N 0.000 description 1
- JSLGXODUIAFWCF-UHFFFAOYSA-N Arginyl-Asparagine Chemical compound NC(N)=NCCCC(N)C(=O)NC(CC(N)=O)C(O)=O JSLGXODUIAFWCF-UHFFFAOYSA-N 0.000 description 1
- RJUHZPRQRQLCFL-IMJSIDKUSA-N Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O RJUHZPRQRQLCFL-IMJSIDKUSA-N 0.000 description 1
- HZYFHQOWCFUSOV-IMJSIDKUSA-N Asn-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O HZYFHQOWCFUSOV-IMJSIDKUSA-N 0.000 description 1
- QCWJKJLNCFEVPQ-WHFBIAKZSA-N Asn-Gln Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O QCWJKJLNCFEVPQ-WHFBIAKZSA-N 0.000 description 1
- IIFDPDVJAHQFSR-WHFBIAKZSA-N Asn-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O IIFDPDVJAHQFSR-WHFBIAKZSA-N 0.000 description 1
- OAMLVOVXNKILLQ-BQBZGAKWSA-N Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O OAMLVOVXNKILLQ-BQBZGAKWSA-N 0.000 description 1
- HXWUJJADFMXNKA-UHFFFAOYSA-N Asparaginyl-Leucine Chemical compound CC(C)CC(C(O)=O)NC(=O)C(N)CC(N)=O HXWUJJADFMXNKA-UHFFFAOYSA-N 0.000 description 1
- 229960005261 Aspartic Acid Drugs 0.000 description 1
- 241000228212 Aspergillus Species 0.000 description 1
- 241000351920 Aspergillus nidulans Species 0.000 description 1
- 241000228245 Aspergillus niger Species 0.000 description 1
- 206010063836 Atrioventricular septal defect Diseases 0.000 description 1
- 241000972773 Aulopiformes Species 0.000 description 1
- 241001203868 Autographa californica Species 0.000 description 1
- 241000713842 Avian sarcoma virus Species 0.000 description 1
- 102200067278 BIRC5 K23R Human genes 0.000 description 1
- 101710042656 BQ2027_MB1231C Proteins 0.000 description 1
- 241000304886 Bacilli Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 108090000363 Bacterial Luciferases Proteins 0.000 description 1
- 210000002469 Basement Membrane Anatomy 0.000 description 1
- 241000219430 Betula pendula Species 0.000 description 1
- 210000004369 Blood Anatomy 0.000 description 1
- 210000001124 Body Fluids Anatomy 0.000 description 1
- 241000255789 Bombyx mori Species 0.000 description 1
- 241000409811 Bombyx mori nucleopolyhedrovirus Species 0.000 description 1
- 241000701922 Bovine parvovirus Species 0.000 description 1
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 1
- 101700052995 CASK Proteins 0.000 description 1
- 108060001685 CNTF Proteins 0.000 description 1
- 102100006400 CSF2 Human genes 0.000 description 1
- 101700046715 CSTI Proteins 0.000 description 1
- 101710022848 CYPADH Proteins 0.000 description 1
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 102000000844 Cell Surface Receptors Human genes 0.000 description 1
- 108010001857 Cell Surface Receptors Proteins 0.000 description 1
- 210000002421 Cell Wall Anatomy 0.000 description 1
- VDQQXEISLMTGAB-UHFFFAOYSA-N Chloramine-T Chemical compound [Na+].CC1=CC=C(S(=O)(=O)[N-]Cl)C=C1 VDQQXEISLMTGAB-UHFFFAOYSA-N 0.000 description 1
- VXIVSQZSERGHQP-UHFFFAOYSA-N Chloroacetamide Chemical compound NC(=O)CCl VXIVSQZSERGHQP-UHFFFAOYSA-N 0.000 description 1
- FOCAUTSVDIKZOP-UHFFFAOYSA-N Chloroacetic acid Chemical compound OC(=O)CCl FOCAUTSVDIKZOP-UHFFFAOYSA-N 0.000 description 1
- 241000282552 Chlorocebus aethiops Species 0.000 description 1
- 229940107161 Cholesterol Drugs 0.000 description 1
- 229920000062 Coding strand Polymers 0.000 description 1
- 210000001072 Colon Anatomy 0.000 description 1
- 229920000742 Cotton Polymers 0.000 description 1
- 241000699800 Cricetinae Species 0.000 description 1
- ZSRSLWKGWFFVCM-WDSKDSINSA-N Cys-Pro Chemical compound SC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O ZSRSLWKGWFFVCM-WDSKDSINSA-N 0.000 description 1
- AYKQJQVWUYEZNU-UHFFFAOYSA-N Cysteinyl-Asparagine Chemical compound SCC(N)C(=O)NC(C(O)=O)CC(N)=O AYKQJQVWUYEZNU-UHFFFAOYSA-N 0.000 description 1
- VBIIZCXWOZDIHS-UHFFFAOYSA-N Cysteinyl-Isoleucine Chemical compound CCC(C)C(C(O)=O)NC(=O)C(N)CS VBIIZCXWOZDIHS-UHFFFAOYSA-N 0.000 description 1
- WXOFKRKAHJQKLT-UHFFFAOYSA-N Cysteinyl-Lysine Chemical compound NCCCCC(C(O)=O)NC(=O)C(N)CS WXOFKRKAHJQKLT-UHFFFAOYSA-N 0.000 description 1
- OOULJWDSSVOMHX-UHFFFAOYSA-N Cysteinyl-Methionine Chemical compound CSCCC(C(O)=O)NC(=O)C(N)CS OOULJWDSSVOMHX-UHFFFAOYSA-N 0.000 description 1
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 1
- IGXWBGJHJZYPQS-SSDOTTSWSA-N D-Luciferin Chemical compound OC(=O)[C@H]1CSC(C=2SC3=CC=C(O)C=C3N=2)=N1 IGXWBGJHJZYPQS-SSDOTTSWSA-N 0.000 description 1
- FBPFZTCFMRRESA-KAZBKCHUSA-N D-Mannitol Natural products OC[C@@H](O)[C@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KAZBKCHUSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-UWTATZPHSA-N D-alanine Chemical compound C[C@@H](N)C(O)=O QNAYBMKLOCPYGJ-UWTATZPHSA-N 0.000 description 1
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 1
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N D-sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 101700020566 DEFA4 Proteins 0.000 description 1
- 108020003215 DNA Probes Proteins 0.000 description 1
- 238000007399 DNA isolation Methods 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 230000007023 DNA restriction-modification system Effects 0.000 description 1
- RGWHQCVHVJXOKC-SHYZEUOFSA-N Deoxycytidine triphosphate Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](CO[P@](O)(=O)O[P@](O)(=O)OP(O)(O)=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-N 0.000 description 1
- HAAZLUGHYHWQIW-KVQBGUIXSA-N Deoxyguanosine triphosphate Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HAAZLUGHYHWQIW-KVQBGUIXSA-N 0.000 description 1
- 229960000633 Dextran Sulfate Drugs 0.000 description 1
- 239000004375 Dextrin Substances 0.000 description 1
- 229920001353 Dextrin Polymers 0.000 description 1
- FFYPMLJYZAEMQB-UHFFFAOYSA-N Diethylpyrocarbonate Chemical compound CCOC(=O)OC(=O)OCC FFYPMLJYZAEMQB-UHFFFAOYSA-N 0.000 description 1
- 241000255601 Drosophila melanogaster Species 0.000 description 1
- 206010059866 Drug resistance Diseases 0.000 description 1
- 108010031111 EBV-encoded nuclear antigen 1 Proteins 0.000 description 1
- 108010015776 EC 1.1.3.4 Proteins 0.000 description 1
- 108010015133 EC 1.1.3.9 Proteins 0.000 description 1
- 108010093894 EC 1.17.3.2 Proteins 0.000 description 1
- 102000013009 EC 2.7.1.40 Human genes 0.000 description 1
- 108020005115 EC 2.7.1.40 Proteins 0.000 description 1
- 102000008422 EC 2.7.1.78 Human genes 0.000 description 1
- 108010021757 EC 2.7.1.78 Proteins 0.000 description 1
- 108010011939 EC 4.1.1.1 Proteins 0.000 description 1
- 102000012288 EC 4.2.1.11 Human genes 0.000 description 1
- 108010022181 EC 4.2.1.11 Proteins 0.000 description 1
- 108010041525 EC 5.1.1.1 Proteins 0.000 description 1
- 102000018386 EGF Family of Proteins Human genes 0.000 description 1
- 108010066486 EGF Family of Proteins Proteins 0.000 description 1
- 102000012545 EGF-like domain Human genes 0.000 description 1
- 108050002150 EGF-like domain Proteins 0.000 description 1
- 101700039191 EGFR Proteins 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- 210000001161 Embryo, Mammalian Anatomy 0.000 description 1
- 241001646716 Escherichia coli K-12 Species 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 240000003550 Eusideroxylon zwageri Species 0.000 description 1
- 231100000776 Exotoxin Toxicity 0.000 description 1
- 101710003421 FGF Proteins 0.000 description 1
- 241000282324 Felis Species 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 102000008857 Ferritin Human genes 0.000 description 1
- 108050000784 Ferritin Proteins 0.000 description 1
- 238000008416 Ferritin Methods 0.000 description 1
- 229920001917 Ficoll Polymers 0.000 description 1
- 108090000331 Firefly luciferases Proteins 0.000 description 1
- MKXKFYHWDHIYRV-UHFFFAOYSA-N Flutamide Chemical compound CC(C)C(=O)NC1=CC=C([N+]([O-])=O)C(C(F)(F)F)=C1 MKXKFYHWDHIYRV-UHFFFAOYSA-N 0.000 description 1
- 101710030621 GALNT6 Proteins 0.000 description 1
- 101710030615 GALNT9 Proteins 0.000 description 1
- 108010072062 GEKG peptide Proteins 0.000 description 1
- 229940116332 GLUCOSE OXIDASE Drugs 0.000 description 1
- 108010010803 Gelatin Proteins 0.000 description 1
- 108010089239 Gelonium multiflorum GEL protein Proteins 0.000 description 1
- LSPKYLAFTPBWIL-BYPYZUCNSA-N Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(O)=O LSPKYLAFTPBWIL-BYPYZUCNSA-N 0.000 description 1
- YBTCBQBIJKGSJP-BQBZGAKWSA-N Glu-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O YBTCBQBIJKGSJP-BQBZGAKWSA-N 0.000 description 1
- YSWHPLCDIMUKFE-QWRGUYRKSA-N Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YSWHPLCDIMUKFE-QWRGUYRKSA-N 0.000 description 1
- 108010073178 Glucan 1,4-alpha-Glucosidase Proteins 0.000 description 1
- 239000004366 Glucose oxidase Substances 0.000 description 1
- 102000002794 Glucosephosphate Dehydrogenase Human genes 0.000 description 1
- 108010018962 Glucosephosphate Dehydrogenase Proteins 0.000 description 1
- 229960002989 Glutamic Acid Drugs 0.000 description 1
- MRVYVEQPNDSWLH-UHFFFAOYSA-N Glutaminyl-Valine Chemical compound CC(C)C(C(O)=O)NC(=O)C(N)CCC(N)=O MRVYVEQPNDSWLH-UHFFFAOYSA-N 0.000 description 1
- FUESBOMYALLFNI-VKHMYHEASA-N Gly-Asn Chemical compound NCC(=O)N[C@H](C(O)=O)CC(N)=O FUESBOMYALLFNI-VKHMYHEASA-N 0.000 description 1
- SCCPDJAQCXWPTF-VKHMYHEASA-N Gly-Asp Chemical compound NCC(=O)N[C@H](C(O)=O)CC(O)=O SCCPDJAQCXWPTF-VKHMYHEASA-N 0.000 description 1
- DKEXFJVMVGETOO-LURJTMIESA-N Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CN DKEXFJVMVGETOO-LURJTMIESA-N 0.000 description 1
- PFMUCCYYAAFKTH-YFKPBYRVSA-N Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CN PFMUCCYYAAFKTH-YFKPBYRVSA-N 0.000 description 1
- OLIFSFOFKGKIRH-WUJLRWPWSA-N Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CN OLIFSFOFKGKIRH-WUJLRWPWSA-N 0.000 description 1
- AEMRFAOFKBGASW-UHFFFAOYSA-N Glycolic acid Natural products OCC(O)=O AEMRFAOFKBGASW-UHFFFAOYSA-N 0.000 description 1
- 108010031186 Glycoside Hydrolases Proteins 0.000 description 1
- 102000005744 Glycoside Hydrolases Human genes 0.000 description 1
- 240000006962 Gossypium hirsutum Species 0.000 description 1
- 108010017213 Granulocyte-Macrophage Colony-Stimulating Factor Proteins 0.000 description 1
- 102000009465 Growth Factor Receptors Human genes 0.000 description 1
- 108010009202 Growth Factor Receptors Proteins 0.000 description 1
- JKMHFZQWWAIEOD-UHFFFAOYSA-N HEPES Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 1
- 239000007995 HEPES buffer Substances 0.000 description 1
- 108060003792 HRG Proteins 0.000 description 1
- 101700071120 HSTN Proteins 0.000 description 1
- 229940031574 HYDROXYMETHYL CELLULOSE Drugs 0.000 description 1
- 210000002216 Heart Anatomy 0.000 description 1
- 241000700721 Hepatitis B virus Species 0.000 description 1
- 206010073071 Hepatocellular carcinoma Diseases 0.000 description 1
- 208000009889 Herpes Simplex Diseases 0.000 description 1
- 108020002022 Hexokinases Proteins 0.000 description 1
- 102000005548 Hexokinases Human genes 0.000 description 1
- FRJIAZKQGSCKPQ-FSPLSTOPSA-N His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CN=CN1 FRJIAZKQGSCKPQ-FSPLSTOPSA-N 0.000 description 1
- MDCTVRUPVLZSPG-BQBZGAKWSA-N His-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 MDCTVRUPVLZSPG-BQBZGAKWSA-N 0.000 description 1
- LYCVKHSJGDMDLM-LURJTMIESA-N His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 LYCVKHSJGDMDLM-LURJTMIESA-N 0.000 description 1
- CZVQSYNVUHAILZ-UWVGGRQHSA-N His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 CZVQSYNVUHAILZ-UWVGGRQHSA-N 0.000 description 1
- FBTYOQIYBULKEH-ZFWWWQNUSA-N His-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CNC=N1 FBTYOQIYBULKEH-ZFWWWQNUSA-N 0.000 description 1
- VLDVBZICYBVQHB-IUCAKERBSA-N His-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])CC1=CN=CN1 VLDVBZICYBVQHB-IUCAKERBSA-N 0.000 description 1
- WSDOHRLQDGAOGU-UHFFFAOYSA-N Histidinyl-Asparagine Chemical compound NC(=O)CC(C(O)=O)NC(=O)C(N)CC1=CN=CN1 WSDOHRLQDGAOGU-UHFFFAOYSA-N 0.000 description 1
- 229940088597 Hormone Drugs 0.000 description 1
- 241000701109 Human adenovirus 2 Species 0.000 description 1
- 241000701024 Human betaherpesvirus 5 Species 0.000 description 1
- 229960002591 Hydroxyproline Drugs 0.000 description 1
- 229940072221 IMMUNOGLOBULINS Drugs 0.000 description 1
- 101710006353 IP3R Proteins 0.000 description 1
- 101700035656 ISOTI Proteins 0.000 description 1
- 101700035039 ITI Proteins 0.000 description 1
- 101700052013 ITR2 Proteins 0.000 description 1
- 101700068039 ITRP Proteins 0.000 description 1
- UCGDDTHMMVWVMV-FSPLSTOPSA-N Ile-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(O)=O UCGDDTHMMVWVMV-FSPLSTOPSA-N 0.000 description 1
- QNBYCZTZNOVDMI-HGNGGELXSA-N Ile-His Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QNBYCZTZNOVDMI-HGNGGELXSA-N 0.000 description 1
- BCVIOZZGJNOEQS-XKNYDFJKSA-N Ile-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)[C@@H](C)CC BCVIOZZGJNOEQS-XKNYDFJKSA-N 0.000 description 1
- TUYOFUHICRWDGA-CIUDSAMLSA-N Ile-Met Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CCSC TUYOFUHICRWDGA-CIUDSAMLSA-N 0.000 description 1
- WMDZARSFSMZOQO-DRZSPHRISA-N Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WMDZARSFSMZOQO-DRZSPHRISA-N 0.000 description 1
- TWVKGYNQQAUNRN-ACZMJKKPSA-N Ile-Ser Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H](CO)C([O-])=O TWVKGYNQQAUNRN-ACZMJKKPSA-N 0.000 description 1
- 210000000987 Immune System Anatomy 0.000 description 1
- 102000009786 Immunoglobulin Constant Regions Human genes 0.000 description 1
- 108010009817 Immunoglobulin Constant Regions Proteins 0.000 description 1
- 102000001706 Immunoglobulin Fab Fragments Human genes 0.000 description 1
- 108010054477 Immunoglobulin Fab Fragments Proteins 0.000 description 1
- 102000008070 Interferon-gamma Human genes 0.000 description 1
- 108010074328 Interferon-gamma Proteins 0.000 description 1
- 102000014150 Interferons Human genes 0.000 description 1
- 108010050904 Interferons Proteins 0.000 description 1
- WKXVAXOSIPTXEC-UHFFFAOYSA-N Isoleucyl-Aspartate Chemical compound CCC(C)C(N)C(=O)NC(C(O)=O)CC(O)=O WKXVAXOSIPTXEC-UHFFFAOYSA-N 0.000 description 1
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N Kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 241000235649 Kluyveromyces Species 0.000 description 1
- 241001138401 Kluyveromyces lactis Species 0.000 description 1
- LRQKBLKVPFOOQJ-YFKPBYRVSA-N L-2-aminohexanoic acid zwitterion Chemical compound CCCC[C@H]([NH3+])C([O-])=O LRQKBLKVPFOOQJ-YFKPBYRVSA-N 0.000 description 1
- HFKJBCPRWWGPEY-BQBZGAKWSA-N L-arginyl-L-glutamic acid Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HFKJBCPRWWGPEY-BQBZGAKWSA-N 0.000 description 1
- 125000001214 L-asparto group Chemical group [H]OC(=O)[C@@]([H])(N([H])[*])C([H])([H])C(O[H])=O 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- JAQGKXUEKGKTKX-HOTGVXAUSA-N L-tyrosyl-L-tyrosine Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 JAQGKXUEKGKTKX-HOTGVXAUSA-N 0.000 description 1
- 229940057428 LACTOPEROXIDASE Drugs 0.000 description 1
- 101710019425 LARS2 Proteins 0.000 description 1
- 101710010796 LYZ Proteins 0.000 description 1
- 241000481961 Lachancea thermotolerans Species 0.000 description 1
- 108010023244 Lactoperoxidase Proteins 0.000 description 1
- 102000028880 Lactoperoxidase Human genes 0.000 description 1
- GUBGYTABKSRVRQ-UUNJERMWSA-N Lactose Natural products O([C@@H]1[C@H](O)[C@H](O)[C@H](O)O[C@@H]1CO)[C@H]1[C@@H](O)[C@@H](O)[C@H](O)[C@H](CO)O1 GUBGYTABKSRVRQ-UUNJERMWSA-N 0.000 description 1
- MLTRLIITQPXHBJ-BQBZGAKWSA-N Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O MLTRLIITQPXHBJ-BQBZGAKWSA-N 0.000 description 1
- LCPYQJIKPJDLLB-UWVGGRQHSA-N Leu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(C)C LCPYQJIKPJDLLB-UWVGGRQHSA-N 0.000 description 1
- NTISAKGPIGTIJJ-IUCAKERBSA-N Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(C)C NTISAKGPIGTIJJ-IUCAKERBSA-N 0.000 description 1
- BQVUABVGYYSDCJ-ZFWWWQNUSA-N Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-ZFWWWQNUSA-N 0.000 description 1
- 229940089022 Leuprolide Acetate Drugs 0.000 description 1
- 229960004338 Leuprorelin Drugs 0.000 description 1
- 108060001084 Luciferase family Proteins 0.000 description 1
- 229940087857 Lupron Drugs 0.000 description 1
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 1
- 210000002751 Lymph Anatomy 0.000 description 1
- 210000001165 Lymph Nodes Anatomy 0.000 description 1
- 210000004698 Lymphocytes Anatomy 0.000 description 1
- YQAIUOWPSUOINN-IUCAKERBSA-N Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN YQAIUOWPSUOINN-IUCAKERBSA-N 0.000 description 1
- JPNRPAJITHRXRH-UHFFFAOYSA-N Lysyl-Asparagine Chemical compound NCCCCC(N)C(=O)NC(C(O)=O)CC(N)=O JPNRPAJITHRXRH-UHFFFAOYSA-N 0.000 description 1
- 102100008175 MGAM Human genes 0.000 description 1
- 101700036939 MTI Proteins 0.000 description 1
- 239000004907 Macro-emulsion Substances 0.000 description 1
- 210000002540 Macrophages Anatomy 0.000 description 1
- 210000002264 Mammary Glands, Animal Anatomy 0.000 description 1
- 210000004293 Mammary Glands, Human Anatomy 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- ADHNYKZHPOEULM-BQBZGAKWSA-N Met-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O ADHNYKZHPOEULM-BQBZGAKWSA-N 0.000 description 1
- KAKJTZWHIUWTTD-VQVTYTSYSA-N Met-Thr Chemical compound CSCC[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)O)C([O-])=O KAKJTZWHIUWTTD-VQVTYTSYSA-N 0.000 description 1
- PESQCPHRXOFIPX-RYUDHWBXSA-N Met-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-RYUDHWBXSA-N 0.000 description 1
- 208000008466 Metabolic Disease Diseases 0.000 description 1
- 230000036740 Metabolism Effects 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- 102000003792 Metallothionein Human genes 0.000 description 1
- NDYNTQWSJLPEMK-UHFFFAOYSA-N Methionyl-Cysteine Chemical compound CSCCC(N)C(=O)NC(CS)C(O)=O NDYNTQWSJLPEMK-UHFFFAOYSA-N 0.000 description 1
- 101710039943 Mup37 Proteins 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 210000004165 Myocardium Anatomy 0.000 description 1
- 240000001307 Myosotis scorpioides Species 0.000 description 1
- OVRNDRQMDRJTHS-CBQIKETKSA-N N-ACETYL-D-GALACTOSAMINE Chemical compound CC(=O)N[C@H]1[C@@H](O)O[C@H](CO)[C@H](O)[C@@H]1O OVRNDRQMDRJTHS-CBQIKETKSA-N 0.000 description 1
- FUSGACRLAFQQRL-UHFFFAOYSA-N N-Ethyl-N-nitrosourea Chemical compound CCN(N=O)C(N)=O FUSGACRLAFQQRL-UHFFFAOYSA-N 0.000 description 1
- XNPOFXIBHOVFFH-UHFFFAOYSA-N N-cyclohexyl-N'-(2-(4-morpholinyl)ethyl)carbodiimide Chemical compound C1CCCCC1N=C=NCCN1CCOCC1 XNPOFXIBHOVFFH-UHFFFAOYSA-N 0.000 description 1
- 102400000874 N-terminal form Human genes 0.000 description 1
- 101800001340 N-terminal form Proteins 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- 101700082129 NAPSA Proteins 0.000 description 1
- 102100015302 NAPSA Human genes 0.000 description 1
- 102100015656 NRG1 Human genes 0.000 description 1
- 229910020936 NaC Inorganic materials 0.000 description 1
- 241000221960 Neurospora Species 0.000 description 1
- 241000221961 Neurospora crassa Species 0.000 description 1
- 240000008962 Nicotiana tabacum Species 0.000 description 1
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 1
- FEMOMIGRRWSMCU-UHFFFAOYSA-N Ninhydrin Chemical compound C1=CC=C2C(=O)C(O)(O)C(=O)C2=C1 FEMOMIGRRWSMCU-UHFFFAOYSA-N 0.000 description 1
- 208000002154 Non-Small-Cell Lung Carcinoma Diseases 0.000 description 1
- 108009000071 Non-small cell lung cancer Proteins 0.000 description 1
- 210000004940 Nucleus Anatomy 0.000 description 1
- DCWXELXMIBXGTH-UHFFFAOYSA-N O(4)-phosphotyrosine Chemical compound OC(=O)C(N)CC1=CC=C(OP(O)(O)=O)C=C1 DCWXELXMIBXGTH-UHFFFAOYSA-N 0.000 description 1
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 1
- 102000016979 Other receptors Human genes 0.000 description 1
- 206010033128 Ovarian cancer Diseases 0.000 description 1
- 238000010222 PCR analysis Methods 0.000 description 1
- 101700067249 POP2 Proteins 0.000 description 1
- 101710042135 PPi-PFK Proteins 0.000 description 1
- 108010087702 Penicillinase Proteins 0.000 description 1
- 229950009506 Penicillinase Drugs 0.000 description 1
- 229940072417 Peroxidase Drugs 0.000 description 1
- 108090000437 Peroxidases Proteins 0.000 description 1
- 102000003992 Peroxidases Human genes 0.000 description 1
- 240000007377 Petunia x hybrida Species 0.000 description 1
- HWMGTNOVUDIKRE-UWVGGRQHSA-N Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 HWMGTNOVUDIKRE-UWVGGRQHSA-N 0.000 description 1
- JWBLQDDHSDGEGR-DRZSPHRISA-N Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWBLQDDHSDGEGR-DRZSPHRISA-N 0.000 description 1
- WEQJQNWXCSUVMA-RYUDHWBXSA-N Phe-Pro Chemical compound C([C@H]([NH3+])C(=O)N1[C@@H](CCC1)C([O-])=O)C1=CC=CC=C1 WEQJQNWXCSUVMA-RYUDHWBXSA-N 0.000 description 1
- FSXRLASFHBWESK-HOTGVXAUSA-N Phe-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 FSXRLASFHBWESK-HOTGVXAUSA-N 0.000 description 1
- 229960005190 Phenylalanine Drugs 0.000 description 1
- KLAONOISLHWJEE-UHFFFAOYSA-N Phenylalanyl-Glutamine Chemical compound NC(=O)CCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 KLAONOISLHWJEE-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N Phenylalanyl-Lysine Chemical compound NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- AQSJGOWTSHOLKH-UHFFFAOYSA-N Phosphite Chemical compound [O-]P([O-])[O-] AQSJGOWTSHOLKH-UHFFFAOYSA-N 0.000 description 1
- 102000011755 Phosphoglycerate kinases Human genes 0.000 description 1
- 108020004454 Phosphoglycerate kinases Proteins 0.000 description 1
- LLKYUHGUYSLMPA-UHFFFAOYSA-N Phosphoramidite Chemical compound NP([O-])[O-] LLKYUHGUYSLMPA-UHFFFAOYSA-N 0.000 description 1
- 102000030951 Phosphotransferases Human genes 0.000 description 1
- 108091000081 Phosphotransferases Proteins 0.000 description 1
- 241000276498 Pollachius virens Species 0.000 description 1
- 241001505332 Polyomavirus sp. Species 0.000 description 1
- 229920001451 Polypropylene glycol Polymers 0.000 description 1
- 229920001213 Polysorbate 20 Polymers 0.000 description 1
- HMNSRTLZAJHSIK-YUMQZZPRSA-N Pro-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 HMNSRTLZAJHSIK-YUMQZZPRSA-N 0.000 description 1
- JQOHKCDMINQZRV-WDSKDSINSA-N Pro-Asn Chemical compound NC(=O)C[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 JQOHKCDMINQZRV-WDSKDSINSA-N 0.000 description 1
- RVQDZELMXZRSSI-IUCAKERBSA-N Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 RVQDZELMXZRSSI-IUCAKERBSA-N 0.000 description 1
- IWIANZLCJVYEFX-RYUDHWBXSA-N Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 IWIANZLCJVYEFX-RYUDHWBXSA-N 0.000 description 1
- HXNYBZQLBWIADP-UHFFFAOYSA-N Prolyl-Cysteine Chemical compound OC(=O)C(CS)NC(=O)C1CCCN1 HXNYBZQLBWIADP-UHFFFAOYSA-N 0.000 description 1
- 210000002307 Prostate Anatomy 0.000 description 1
- 229940024999 Proteolytic enzymes for treatment of wounds and ulcers Drugs 0.000 description 1
- 229960003581 Pyridoxal Drugs 0.000 description 1
- 229960001327 Pyridoxal Phosphate Drugs 0.000 description 1
- 239000007759 RPMI Media 1640 Substances 0.000 description 1
- 241000700157 Rattus norvegicus Species 0.000 description 1
- 108091005674 Receptor kinase Proteins 0.000 description 1
- PYWVYCXTNDRMGF-UHFFFAOYSA-N Rhodamine B Chemical compound [Cl-].C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=CC=C1C(O)=O PYWVYCXTNDRMGF-UHFFFAOYSA-N 0.000 description 1
- 108010039491 Ricin Proteins 0.000 description 1
- XMVJITFPVVRMHC-UHFFFAOYSA-N Roxarsone Chemical group OC1=CC=C([As](O)(O)=O)C=C1[N+]([O-])=O XMVJITFPVVRMHC-UHFFFAOYSA-N 0.000 description 1
- 101700044242 SER2 Proteins 0.000 description 1
- 102200033639 SF1 S20T Human genes 0.000 description 1
- 235000003534 Saccharomyces carlsbergensis Nutrition 0.000 description 1
- 229940081969 Saccharomyces cerevisiae Drugs 0.000 description 1
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 1
- 206010039491 Sarcoma Diseases 0.000 description 1
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 1
- 229920005654 Sephadex Polymers 0.000 description 1
- 239000012507 Sephadex™ Substances 0.000 description 1
- 241001522306 Serinus serinus Species 0.000 description 1
- 241000607720 Serratia Species 0.000 description 1
- 210000000717 Sertoli cell Anatomy 0.000 description 1
- 241000700584 Simplexvirus Species 0.000 description 1
- 239000004280 Sodium formate Substances 0.000 description 1
- 240000003768 Solanum lycopersicum Species 0.000 description 1
- 240000001016 Solanum tuberosum Species 0.000 description 1
- 235000002595 Solanum tuberosum Nutrition 0.000 description 1
- 241000256248 Spodoptera Species 0.000 description 1
- 241000256251 Spodoptera frugiperda Species 0.000 description 1
- RINCXYDBBGOEEQ-UHFFFAOYSA-N Succinic anhydride Chemical compound O=C1CCC(=O)O1 RINCXYDBBGOEEQ-UHFFFAOYSA-N 0.000 description 1
- NHUHCSRWZMLRLA-UHFFFAOYSA-N Sulfizole Chemical compound CC1=NOC(NS(=O)(=O)C=2C=CC(N)=CC=2)=C1C NHUHCSRWZMLRLA-UHFFFAOYSA-N 0.000 description 1
- 210000001744 T-Lymphocytes Anatomy 0.000 description 1
- 102100014223 TGFA Human genes 0.000 description 1
- 101700062451 TI Proteins 0.000 description 1
- 241000255588 Tephritidae Species 0.000 description 1
- FQENQNTWSFEDLI-UHFFFAOYSA-J Tetrasodium pyrophosphate Chemical compound [Na+].[Na+].[Na+].[Na+].[O-]P([O-])(=O)OP([O-])([O-])=O FQENQNTWSFEDLI-UHFFFAOYSA-J 0.000 description 1
- VPZKQTYZIVOJDV-LMVFSUKVSA-N Thr-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(O)=O VPZKQTYZIVOJDV-LMVFSUKVSA-N 0.000 description 1
- WXVIGTAUZBUDPZ-DTLFHODZSA-N Thr-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 WXVIGTAUZBUDPZ-DTLFHODZSA-N 0.000 description 1
- QOLYAJSZHIJCTO-VQVTYTSYSA-N Thr-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O QOLYAJSZHIJCTO-VQVTYTSYSA-N 0.000 description 1
- CUTPSEKWUPZFLV-UHFFFAOYSA-N Threoninyl-Cysteine Chemical compound CC(O)C(N)C(=O)NC(CS)C(O)=O CUTPSEKWUPZFLV-UHFFFAOYSA-N 0.000 description 1
- LUMXICQAOKVQOB-UHFFFAOYSA-N Threoninyl-Isoleucine Chemical compound CCC(C)C(C(O)=O)NC(=O)C(N)C(C)O LUMXICQAOKVQOB-UHFFFAOYSA-N 0.000 description 1
- CKHWEVXPLJBEOZ-UHFFFAOYSA-N Threoninyl-Valine Chemical compound CC(C)C(C(O)=O)NC(=O)C(N)C(C)O CKHWEVXPLJBEOZ-UHFFFAOYSA-N 0.000 description 1
- 229940104230 Thymidine Drugs 0.000 description 1
- 108010034949 Thyroglobulin Proteins 0.000 description 1
- 102000009843 Thyroglobulin Human genes 0.000 description 1
- 229960002175 Thyroglobulin Drugs 0.000 description 1
- 102000002142 Trans-Activators Human genes 0.000 description 1
- 108010040939 Trans-Activators Proteins 0.000 description 1
- 229920002969 Transcription factor Polymers 0.000 description 1
- 102000004338 Transferrin Human genes 0.000 description 1
- 108090000901 Transferrin Proteins 0.000 description 1
- 102000009618 Transforming Growth Factors Human genes 0.000 description 1
- 108010009583 Transforming Growth Factors Proteins 0.000 description 1
- 102000005924 Triose-phosphate isomerases Human genes 0.000 description 1
- 108020003073 Triose-phosphate isomerases Proteins 0.000 description 1
- NQIHMZLGCZNZBN-PXNSSMCTSA-N Trp-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC=3C4=CC=CC=C4NC=3)N)C(O)=O)=CNC2=C1 NQIHMZLGCZNZBN-PXNSSMCTSA-N 0.000 description 1
- GLNADSQYFUSGOU-GPTZEZBUSA-J Trypan blue Chemical compound [Na+].[Na+].[Na+].[Na+].C1=C(S([O-])(=O)=O)C=C2C=C(S([O-])(=O)=O)C(/N=N/C3=CC=C(C=C3C)C=3C=C(C(=CC=3)\N=N\C=3C(=CC4=CC(=CC(N)=C4C=3O)S([O-])(=O)=O)S([O-])(=O)=O)C)=C(O)C2=C1N GLNADSQYFUSGOU-GPTZEZBUSA-J 0.000 description 1
- KBUBZAMBIVEFEI-UHFFFAOYSA-N Tryptophyl-Histidine Chemical compound C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CN=CN1 KBUBZAMBIVEFEI-UHFFFAOYSA-N 0.000 description 1
- 206010064390 Tumour invasion Diseases 0.000 description 1
- ZQOOYCZQENFIMC-STQMWFEESA-N Tyr-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CC=C(O)C=C1 ZQOOYCZQENFIMC-STQMWFEESA-N 0.000 description 1
- QZOSVNLXLSNHQK-UHFFFAOYSA-N Tyrosyl-Aspartate Chemical compound OC(=O)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 QZOSVNLXLSNHQK-UHFFFAOYSA-N 0.000 description 1
- ZSXJENBJGRHKIG-UHFFFAOYSA-N Tyrosyl-Serine Chemical compound OCC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 ZSXJENBJGRHKIG-UHFFFAOYSA-N 0.000 description 1
- MFEVVAXTBZELLL-UHFFFAOYSA-N Tyrosyl-Threonine Chemical compound CC(O)C(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 MFEVVAXTBZELLL-UHFFFAOYSA-N 0.000 description 1
- 230000036462 Unbound Effects 0.000 description 1
- 108010092464 Urate Oxidase Proteins 0.000 description 1
- 229940029983 VITAMINS Drugs 0.000 description 1
- BNQVUHQWZGTIBX-IUCAKERBSA-N Val-His Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC1=CN=CN1 BNQVUHQWZGTIBX-IUCAKERBSA-N 0.000 description 1
- YSGSDAIMSCVPHG-YUMQZZPRSA-N Val-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)C(C)C YSGSDAIMSCVPHG-YUMQZZPRSA-N 0.000 description 1
- GIAZPLMMQOERPN-YUMQZZPRSA-N Val-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O GIAZPLMMQOERPN-YUMQZZPRSA-N 0.000 description 1
- WPSXZFTVLIAPCN-UHFFFAOYSA-N Valyl-Cysteine Chemical compound CC(C)C(N)C(=O)NC(CS)C(O)=O WPSXZFTVLIAPCN-UHFFFAOYSA-N 0.000 description 1
- 241000711975 Vesicular stomatitis virus Species 0.000 description 1
- 108020005202 Viral DNA Proteins 0.000 description 1
- 229940021016 Vitamin IV solution additives Drugs 0.000 description 1
- 102100002634 XDH Human genes 0.000 description 1
- 229960003487 Xylose Drugs 0.000 description 1
- 101700035783 YM10 Proteins 0.000 description 1
- 241000235013 Yarrowia Species 0.000 description 1
- 240000008042 Zea mays Species 0.000 description 1
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 1
- 230000001594 aberrant Effects 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 239000003929 acidic solution Substances 0.000 description 1
- HRPVXLWXLXDGHG-UHFFFAOYSA-N acrylamide Chemical compound NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 229960005305 adenosine Drugs 0.000 description 1
- 230000001058 adult Effects 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 230000004931 aggregating Effects 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 150000001298 alcohols Chemical class 0.000 description 1
- 125000000217 alkyl group Chemical group 0.000 description 1
- 229940037003 alum Drugs 0.000 description 1
- 238000007112 amidation reaction Methods 0.000 description 1
- 125000003368 amide group Chemical group 0.000 description 1
- 125000003277 amino group Chemical compound 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- 238000004873 anchoring Methods 0.000 description 1
- 230000000454 anti-cipatory Effects 0.000 description 1
- 230000001028 anti-proliferant Effects 0.000 description 1
- 230000000692 anti-sense Effects 0.000 description 1
- 239000003816 antisense DNA Substances 0.000 description 1
- 239000012223 aqueous fraction Substances 0.000 description 1
- 101700021576 arg-13 Proteins 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 239000012131 assay buffer Substances 0.000 description 1
- 244000052616 bacterial pathogens Species 0.000 description 1
- 108010051210 beta-Fructofuranosidase Proteins 0.000 description 1
- 108020004256 beta-Lactamases Proteins 0.000 description 1
- 102000006635 beta-Lactamases Human genes 0.000 description 1
- 229920001222 biopolymer Polymers 0.000 description 1
- 238000005047 biotechnology Methods 0.000 description 1
- 238000007413 biotinylation Methods 0.000 description 1
- 230000000903 blocking Effects 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 239000010839 body fluid Substances 0.000 description 1
- 238000006664 bond formation reaction Methods 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 108091008123 c-sis Proteins 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 230000036952 cancer formation Effects 0.000 description 1
- 230000000711 cancerogenic Effects 0.000 description 1
- 150000001244 carboxylic acid anhydrides Chemical class 0.000 description 1
- 125000002057 carboxymethyl group Chemical group [H]OC(=O)C([H])([H])[*] 0.000 description 1
- 231100000357 carcinogen Toxicity 0.000 description 1
- 231100000504 carcinogenesis Toxicity 0.000 description 1
- 239000003183 carcinogenic agent Substances 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000003197 catalytic Effects 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 239000003729 cation exchange resin Substances 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 230000030833 cell death Effects 0.000 description 1
- 230000024245 cell differentiation Effects 0.000 description 1
- 230000032823 cell division Effects 0.000 description 1
- 238000002701 cell growth assay Methods 0.000 description 1
- 230000004663 cell proliferation Effects 0.000 description 1
- 230000003833 cell viability Effects 0.000 description 1
- 229920002083 cellular DNA Polymers 0.000 description 1
- 229920002092 cellular RNA Polymers 0.000 description 1
- 230000003196 chaotropic Effects 0.000 description 1
- 239000002738 chelating agent Substances 0.000 description 1
- 230000003399 chemotactic Effects 0.000 description 1
- 108091006028 chimera Proteins 0.000 description 1
- 229940106681 chloroacetic acid Drugs 0.000 description 1
- SJOVRJIZOJKXLQ-UHFFFAOYSA-N chloroboranuide Chemical compound Cl[BH3-] SJOVRJIZOJKXLQ-UHFFFAOYSA-N 0.000 description 1
- 235000012000 cholesterol Nutrition 0.000 description 1
- 238000011098 chromatofocusing Methods 0.000 description 1
- 238000007374 clinical diagnostic method Methods 0.000 description 1
- 238000005354 coacervation Methods 0.000 description 1
- 201000010897 colon adenocarcinoma Diseases 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 235000005822 corn Nutrition 0.000 description 1
- 235000005824 corn Nutrition 0.000 description 1
- 239000007822 coupling agent Substances 0.000 description 1
- 238000003235 crystal violet staining Methods 0.000 description 1
- OILAIQUEIWYQPH-UHFFFAOYSA-N cyclohexane-1,2-dione Chemical compound O=C1CCCCC1=O OILAIQUEIWYQPH-UHFFFAOYSA-N 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- SUYVUBYJARFZHO-RRKCRQDMSA-J dATP(4-) Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-J 0.000 description 1
- 125000001295 dansyl group Chemical group [H]C1=C([H])C(N(C([H])([H])[H])C([H])([H])[H])=C2C([H])=C([H])C([H])=C(C2=C1[H])S(*)(=O)=O 0.000 description 1
- 230000003247 decreasing Effects 0.000 description 1
- 230000004059 degradation Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000006297 dehydration reaction Methods 0.000 description 1
- 239000008367 deionised water Substances 0.000 description 1
- 239000003405 delayed action preparation Substances 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 235000019425 dextrin Nutrition 0.000 description 1
- QSJXEFYPDANLFS-UHFFFAOYSA-N diacetyl Chemical compound CC(=O)C(C)=O QSJXEFYPDANLFS-UHFFFAOYSA-N 0.000 description 1
- 238000002059 diagnostic imaging Methods 0.000 description 1
- LCGLNKUTAGEVQW-UHFFFAOYSA-N dimethyl ether Chemical compound COC LCGLNKUTAGEVQW-UHFFFAOYSA-N 0.000 description 1
- 150000002016 disaccharides Chemical class 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 231100000673 dose–response relationship Toxicity 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 230000029578 entry into host Effects 0.000 description 1
- 239000006167 equilibration buffer Substances 0.000 description 1
- YNQLUTRBYVCPMQ-UHFFFAOYSA-N ethylbenzene Chemical compound CCC1=CC=CC=C1 YNQLUTRBYVCPMQ-UHFFFAOYSA-N 0.000 description 1
- 238000001704 evaporation Methods 0.000 description 1
- 239000002095 exotoxin Substances 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- ANUSOIHIIPAHJV-UHFFFAOYSA-N fenticlor Chemical compound OC1=CC=C(Cl)C=C1SC1=CC(Cl)=CC=C1O ANUSOIHIIPAHJV-UHFFFAOYSA-N 0.000 description 1
- 239000000706 filtrate Substances 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 1
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 1
- 238000005194 fractionation Methods 0.000 description 1
- 231100000221 frame shift mutation induction Toxicity 0.000 description 1
- 238000007710 freezing Methods 0.000 description 1
- 201000006585 gastric adenocarcinoma Diseases 0.000 description 1
- 239000008273 gelatin Substances 0.000 description 1
- 229920000159 gelatin Polymers 0.000 description 1
- 235000019322 gelatine Nutrition 0.000 description 1
- 235000011852 gelatine desserts Nutrition 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 102000034448 gene-regulatory proteins Human genes 0.000 description 1
- 108091006088 gene-regulatory proteins Proteins 0.000 description 1
- 102000018146 globin family Human genes 0.000 description 1
- 108060003196 globin family Proteins 0.000 description 1
- 235000019420 glucose oxidase Nutrition 0.000 description 1
- 108010040856 glutamyl-cysteinyl-alanine Proteins 0.000 description 1
- 101700066025 gly-2 Proteins 0.000 description 1
- STKYPAFSDFAEPH-LURJTMIESA-N gly-val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CN STKYPAFSDFAEPH-LURJTMIESA-N 0.000 description 1
- 230000002414 glycolytic Effects 0.000 description 1
- 150000002338 glycosides Chemical class 0.000 description 1
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- HHLFWLYXYJOTON-UHFFFAOYSA-M glyoxylate Chemical compound [O-]C(=O)C=O HHLFWLYXYJOTON-UHFFFAOYSA-M 0.000 description 1
- ZRALSGWEFCBTJO-UHFFFAOYSA-N guanidine group Chemical group NC(=N)N ZRALSGWEFCBTJO-UHFFFAOYSA-N 0.000 description 1
- 108010038082 heparin proteoglycan Proteins 0.000 description 1
- 125000000623 heterocyclic group Chemical group 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 230000003054 hormonal Effects 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 101500002601 human Epidermal growth factor Proteins 0.000 description 1
- 102000008000 human cationic antimicrobial protein CAP 37 Human genes 0.000 description 1
- 108010089633 human cationic antimicrobial protein CAP 37 Proteins 0.000 description 1
- 229940116978 human epidermal growth factor Drugs 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 229920001600 hydrophobic polymer Polymers 0.000 description 1
- 238000002169 hydrotherapy Methods 0.000 description 1
- 125000002349 hydroxyamino group Chemical group [H]ON([H])[*] 0.000 description 1
- 230000000640 hydroxylating Effects 0.000 description 1
- 238000005805 hydroxylation reaction Methods 0.000 description 1
- 229920003063 hydroxymethyl cellulose Polymers 0.000 description 1
- 230000014726 immortalization of host cell Effects 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 238000001114 immunoprecipitation Methods 0.000 description 1
- 238000005462 in vivo assay Methods 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 150000002484 inorganic compounds Chemical class 0.000 description 1
- 229910010272 inorganic material Inorganic materials 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 229940079322 interferon Drugs 0.000 description 1
- 230000000968 intestinal Effects 0.000 description 1
- 238000001361 intraarterial administration Methods 0.000 description 1
- 230000004068 intracellular signaling Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 239000007928 intraperitoneal injection Substances 0.000 description 1
- 239000001573 invertase Substances 0.000 description 1
- 235000011073 invertase Nutrition 0.000 description 1
- 238000009114 investigational therapy Methods 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- XEEYBQQBJWHFJM-UHFFFAOYSA-N iron Chemical compound [Fe] XEEYBQQBJWHFJM-UHFFFAOYSA-N 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 108010045069 keyhole-limpet hemocyanin Proteins 0.000 description 1
- 230000002147 killing Effects 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 230000003902 lesions Effects 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010091798 leucylleucine Proteins 0.000 description 1
- 108010012058 leucyltyrosine Proteins 0.000 description 1
- 230000004301 light adaptation Effects 0.000 description 1
- 230000000670 limiting Effects 0.000 description 1
- 101700042928 lin-2 Proteins 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- FYYHWMGAXLPEAU-UHFFFAOYSA-N magnesium Chemical compound [Mg] FYYHWMGAXLPEAU-UHFFFAOYSA-N 0.000 description 1
- 239000011777 magnesium Substances 0.000 description 1
- 229910052749 magnesium Inorganic materials 0.000 description 1
- 230000036210 malignancy Effects 0.000 description 1
- 230000003211 malignant Effects 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 230000035786 metabolism Effects 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- NEGQCMNHXHSFGU-UHFFFAOYSA-N methyl pyridine-2-carboximidate Chemical compound COC(=N)C1=CC=CC=N1 NEGQCMNHXHSFGU-UHFFFAOYSA-N 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 239000004530 micro-emulsion Substances 0.000 description 1
- 108010029942 microperoxidase Proteins 0.000 description 1
- 239000004005 microsphere Substances 0.000 description 1
- 230000005012 migration Effects 0.000 description 1
- 230000002297 mitogenic Effects 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 239000010813 municipal solid waste Substances 0.000 description 1
- 239000002088 nanocapsule Substances 0.000 description 1
- 230000001613 neoplastic Effects 0.000 description 1
- 239000002736 nonionic surfactant Substances 0.000 description 1
- 108010058731 nopaline synthase Proteins 0.000 description 1
- 239000002777 nucleoside Substances 0.000 description 1
- 125000003835 nucleoside group Chemical group 0.000 description 1
- 239000002751 oligonucleotide probe Substances 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 230000001590 oxidative Effects 0.000 description 1
- 229910052760 oxygen Inorganic materials 0.000 description 1
- 239000001301 oxygen Substances 0.000 description 1
- MYMOFIZGZYHOMD-UHFFFAOYSA-N oxygen Chemical compound O=O MYMOFIZGZYHOMD-UHFFFAOYSA-N 0.000 description 1
- RZVAJINKPMORJF-UHFFFAOYSA-N p-acetaminophenol Chemical compound CC(=O)NC1=CC=C(O)C=C1 RZVAJINKPMORJF-UHFFFAOYSA-N 0.000 description 1
- 230000036961 partial Effects 0.000 description 1
- 239000011236 particulate material Substances 0.000 description 1
- 108010091617 pentalysine Proteins 0.000 description 1
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 1
- 101710036149 pfk-1.2 Proteins 0.000 description 1
- 101700087004 pfkA1 Proteins 0.000 description 1
- 101700035696 pfkB Proteins 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 150000003013 phosphoric acid derivatives Chemical class 0.000 description 1
- 229910052698 phosphorus Inorganic materials 0.000 description 1
- 230000001766 physiological effect Effects 0.000 description 1
- 230000003169 placental Effects 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- 229920001983 poloxamer Polymers 0.000 description 1
- 229920003250 poly(2-hydroxyethyl methacrylate) Polymers 0.000 description 1
- 229920000747 poly(lactic acid) polymer Polymers 0.000 description 1
- 108010054442 polyalanine Proteins 0.000 description 1
- 229920000728 polyester Polymers 0.000 description 1
- 238000006116 polymerization reaction Methods 0.000 description 1
- 230000000379 polymerizing Effects 0.000 description 1
- 229920000023 polynucleotide Polymers 0.000 description 1
- 239000002157 polynucleotide Substances 0.000 description 1
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 1
- 229920000136 polysorbate Polymers 0.000 description 1
- 229920003288 polysulfone Polymers 0.000 description 1
- 229920002451 polyvinyl alcohol Polymers 0.000 description 1
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 1
- 229920000036 polyvinylpyrrolidone Polymers 0.000 description 1
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 1
- 230000034918 positive regulation of cell growth Effects 0.000 description 1
- 230000017363 positive regulation of growth Effects 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- WCUXLLCKKVVCTQ-UHFFFAOYSA-M potassium chloride Chemical compound [Cl-].[K+] WCUXLLCKKVVCTQ-UHFFFAOYSA-M 0.000 description 1
- 238000004393 prognosis Methods 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 230000000644 propagated Effects 0.000 description 1
- 238000000159 protein binding assay Methods 0.000 description 1
- 230000029983 protein stabilization Effects 0.000 description 1
- 235000008164 pyridoxal Nutrition 0.000 description 1
- 239000011674 pyridoxal Substances 0.000 description 1
- NGVDGCNFYWLIFO-UHFFFAOYSA-N pyridoxal 5'-phosphate Chemical compound CC1=NC=C(COP(O)(O)=O)C(C=O)=C1O NGVDGCNFYWLIFO-UHFFFAOYSA-N 0.000 description 1
- 235000007682 pyridoxal 5'-phosphate Nutrition 0.000 description 1
- 239000011589 pyridoxal 5'-phosphate Substances 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 229910052761 rare earth metal Inorganic materials 0.000 description 1
- 150000002910 rare earth metals Chemical class 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 102000027656 receptor tyrosine kinases Human genes 0.000 description 1
- 108091007921 receptor tyrosine kinases Proteins 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 102220177562 rs768249578 Human genes 0.000 description 1
- 235000019515 salmon Nutrition 0.000 description 1
- 239000004017 serum-free culture media Substances 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 239000000377 silicon dioxide Substances 0.000 description 1
- BQCADISMDOOEFD-UHFFFAOYSA-N silver Chemical compound [Ag] BQCADISMDOOEFD-UHFFFAOYSA-N 0.000 description 1
- 239000004332 silver Substances 0.000 description 1
- 229910052709 silver Inorganic materials 0.000 description 1
- 238000004513 sizing Methods 0.000 description 1
- 210000000813 small intestine Anatomy 0.000 description 1
- KEAYESYHFKHZAL-UHFFFAOYSA-N sodium Chemical compound [Na] KEAYESYHFKHZAL-UHFFFAOYSA-N 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 239000001187 sodium carbonate Substances 0.000 description 1
- 229910000029 sodium carbonate Inorganic materials 0.000 description 1
- IHQKEDIOMGYHEB-UHFFFAOYSA-M sodium dimethylarsinate Chemical compound [Na+].C[As](C)([O-])=O IHQKEDIOMGYHEB-UHFFFAOYSA-M 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000001488 sodium phosphate Substances 0.000 description 1
- 229910000162 sodium phosphate Inorganic materials 0.000 description 1
- 239000012064 sodium phosphate buffer Substances 0.000 description 1
- 229940048086 sodium pyrophosphate Drugs 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 239000000600 sorbitol Substances 0.000 description 1
- 230000003595 spectral Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 230000000087 stabilizing Effects 0.000 description 1
- 238000011146 sterile filtration Methods 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 239000007929 subcutaneous injection Substances 0.000 description 1
- 238000010254 subcutaneous injection Methods 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 229940014800 succinic anhydride Drugs 0.000 description 1
- 229910052717 sulfur Inorganic materials 0.000 description 1
- 230000004083 survival Effects 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 238000004114 suspension culture Methods 0.000 description 1
- 238000010189 synthetic method Methods 0.000 description 1
- 235000019818 tetrasodium diphosphate Nutrition 0.000 description 1
- 239000001577 tetrasodium phosphonato phosphate Substances 0.000 description 1
- 230000002588 toxic Effects 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 239000011573 trace mineral Substances 0.000 description 1
- 235000013619 trace mineral Nutrition 0.000 description 1
- PMMYEEVYMWASQN-DMTCNVIQSA-N trans-L-hydroxyproline Chemical compound O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 description 1
- 230000002103 transcriptional Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 239000012581 transferrin Substances 0.000 description 1
- 239000003656 tris buffered saline Substances 0.000 description 1
- RYFMWSXOAZQYPI-UHFFFAOYSA-K trisodium phosphate Chemical compound [Na+].[Na+].[Na+].[O-]P([O-])([O-])=O RYFMWSXOAZQYPI-UHFFFAOYSA-K 0.000 description 1
- 239000002753 trypsin inhibitor Substances 0.000 description 1
- 108010029384 tryptophyl-histidine Proteins 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 108010045269 tryptophyltryptophan Proteins 0.000 description 1
- 108010087967 type I signal peptidase Proteins 0.000 description 1
- 238000004450 types of analysis Methods 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 229930003231 vitamins Natural products 0.000 description 1
- SITLTJHOQZFJGG-XPUUQOCRSA-N α-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SITLTJHOQZFJGG-XPUUQOCRSA-N 0.000 description 1
- OJUGVDODNPJEEC-UHFFFAOYSA-N α-oxobenzeneacetaldehyde Chemical compound O=CC(=O)C1=CC=CC=C1 OJUGVDODNPJEEC-UHFFFAOYSA-N 0.000 description 1
- DXJZITDUDUPINW-UHFFFAOYSA-N γ-glutamyl-Asparagine Chemical compound NC(=O)CCC(N)C(=O)NC(CC(N)=O)C(O)=O DXJZITDUDUPINW-UHFFFAOYSA-N 0.000 description 1
- SIGGQAHUPUBWNF-UHFFFAOYSA-N γ-glutamyl-Methionine Chemical compound CSCCC(C(O)=O)NC(=O)C(N)CCC(N)=O SIGGQAHUPUBWNF-UHFFFAOYSA-N 0.000 description 1
Description
PATENTS ACT, 1992
92/1654
HEREGULINS (HRGS), BINDING PROTEINS OF P185erb2
GENENTECH, INC.
BACKGROUND OF THE INVENTION
FEE (1 the lnvgntgu
This invention relates to polypeptide ligands that bind to receptors implicated in
cellular growth. In particular, it relates to polypeptide ligands that bind to the p185HER2
receptor.
Description of Bgkground and Related Art
Cellular protooncogenes encode proteins that are thought to regulate normal cellular
proliferation and differentiation. Alterations in their structure or amplification of their
expression lead to abnormal cellular growth and have been associated with carcinogenesis
(Bishop JM, Science 235:305-311 (1987)); (Hhims JS, Cancer Detection and Prevention 11:139-
149 (19881); (Nowell PC, Cancer Res. 46:2203-2207 [1986]); (Nicolson GL, Cancer Res.
47:1473-1487 (1987)). Protooncogenes were first identified by either of two approaches.
First. molecular characterization of the genomes of transforming retroviruses showed that
the genes responsible for the transfonning ability of the virus in many cases were altered
versions of genes found in the genomes of normal cells. The normal version is the
protooncogene, which is altered by mutation to give rise to the oncogene. An example of such
a gene pair is represented by the EGF receptor and the v-erb-B gene product. The virally
encoded v-erb-B gene product has suffered truncation and other alterations that render it
constitutively active and endow it with the ability to induce cellular transformation (Yarden et
al., Ann. Rev. Biochem. 57:443-478, 1988).
The second method for detecting cellular transforming genes that behave in a
dominant fashion involves transfection of cellular DNA from tumor cells of various species
into nontransformed target cells of a heterologous species. Most often this was done by
transfection of human, avian, or rat DNAs into the murine NIH 3T3 cell line (Bishop JM,
- 30 Science 235:305-311 (1987)); (Flhims .18, Cancer Detection and Prevention 11:139-149 (1988));
(Nowell PC, Cancer. Res. 46:2203-2207 (1986)); (Nicolson GL, Cancer. Res. 47:1473-1487
(1987)); (Yarden etal., Ann. Flev. Biochem. 57:443-478 [1988]). Following several cycles of
genomic DNA isolation and retransfection, the human or other species DNA was molecularly
cloned from the murine background and subsequently characterized. In some cases, the same
genes were isolated following transfection and cloning as those identified by the direct
characterization of transforming viruses. in other cases, novel oncogenes were identified. An
example of a novel oncogene identified by this transfection assay is the neu oncogene. it was
discovered by Weinberg and colleagues in a transfection experiment in which the initial DNA
was derived from a carcinogen-induced rat neuroblastoma (Padhy et al., Cell28:865—871
[1982]); (Schechter at al., Nature 312:513-516 [1984]). Characterization of the rat neu
oncogene revealed that it had the structure of a growth factor receptor tyrosine kinase, had
homology to the EGF receptor, and differed from its normal counterpart, the neu
protooncogene, by an activating mutation in its transmembrane domain (Bargmann et al., Cell
452649-657 [1986]). The human counterpart to neu is the HER2 protooncogene, also designated
c-erb- B2 (Coussens etal., Science 230:1137-1139 [1985]), W089/06692).
The association of the HER2 protooncogene with cancer was established by yet a
third approach, that is, its association with human breast cancer. The HER2 protooncogene
was first discovered in cDNA libraries by virtue of its homology with the EGF receptor, with
which it shares structural similarities throughout (Yarden et al., Ann. Rev. Biochem. 57:443-
478 [1988]). When radioactive probes derived from the cDNA sequence encoding p185HER2
were used to screen DNA samples from breast cancer patients, amplification of the HER2
protooncogene was observed in about 30% of the patient samples (Slamon et al., Science
235:177-182 [1987]). Further studies have confinned this original observation and extended it
to suggest an important correlation between HER2 protooncogene amplification and/or
overexpression and worsened prognosis in ovarian cancer and non-small cell lung cancer
(Slamon etaI., Science 244:707-712 [1989]); (Wright et al., Cancer Res 49:2087-2090, 1989);
(Paik at al., J. Clin. Oncology 8:103-112 [1990]); (Berchuck et al., Cancer Res. 5024087-4091,
1990); (Kern etal., Cancer Res. 50:5184-5191, 1990).
The association of HER2 amplification/overexpression with aggressive malignancy,
as described above, implies that it may have an important role in progression of human
cancer, however, many tumor-related cell surface antigens have been described in the past,
few of which appear to have a direct role in the genesis or progression of disease (Schlom et
al. Cancer Res. 50:820-827, 1990); (Sza|a et at, Prac. Natl. Acad. Sci. 9823542-3546).
Among the protooncogenes are those that encode cellular growth factors which act
through endoplasmic kinase phosphorylation of cytoplasmic protein. The HER1 gene (or erb-
B1) encodes the epidermal growth factor (EGF) receptor. The B-chain of platelet-derived
growth factor is encoded by the c-sis gene. The granulocyte-macrophage colony stimulating
factor is encoded by the c-tms gene. The neu protooncogene has been identified in
ethylnitrosourea-induced rat neuroblastomas. The HER2 gene encodes the 1,255 amino acid
tyrosine kinase receptor-like glycoprotein p185HER9 that has homology to the human epidermal
growth factor receptor.
The known receptor tyrosine kinases all have the same general structural motif: an
extracellular domain that binds ligand. and an intracellular tyrosine kinase domain that is
necessary for signal transduction and transformation. These two domains are connected by
a single stretch of approximately 20 mostly hydrophobic amino acids, called the
transmembrane spanning sequence. This transmembrane spanning sequence is thought to
play a role in transferring the signal generated by ligand binding from the outside of the cell to
the inside. Consistent with this general structure. the human p185H5R? glycoprotein. which is
located on the cell surface, may be divided into three principal portions: an extracellular
domain, or ECD (also known as XCD); a transmembrane spanning sequence; and a
cytoplasmic, intracellular tyrosine kinase domain. While it is presumed that the extracellular
domain is a ligand receptor, the p185HER9 ligand has not yet been positively identified.
No specific ligand binding to p185HER9 has been identified, although Lupu et al.,
(Science 249:1552-1555, 1989) describe an inhibitory 30 kDa glycoprotein secreted from human
breast cancer cells which is alleged to be a putative ligand for p185H5R3. Lupu et al., Science,
249:1552-1555 (1990); Proceedings of the American Assoc. for Cancer Research, Vol 32, Abs
297, March 1991) reported the purification of a 30 kD factor from MDA-MB-231 cells and a 75
kD factor from SK-BR-3 cells that stimulates p185HER2. The 75 kD factor reponedly induced
phosphorylation of p1B5HER9 and modulated cell proliferation and colony formation of SK-BR-3
cells overexpressing the p185HER? receptor. The 30 kD factor competes with muMab 4D5 for
binding to p185HF-R9, its growth effect on SK-BR-3 cells was dependent on 30 kD
concentration (stimulatory at low concentrations and inhibitory at higher concentrations).
Furthermore. it stimulated the growth of MDA-MB-468 cells (EGF-Ft positive, p185HER-’-
negative), it stimulated phosphosylation of the EGF receptor and it could be obtained from SK-
BR-3 cells. In the rat neu system, Yarden etal, (Biochemistry, 30:3543-3550, 1991) describe
a 35 kDa glycoprotein candidate ligand for the neu encoded receptor secreted by ras
transfonned fibroblasts. Dobashi stat, Proc. Natl. Acad. Sci. USA, 88:8582-8586 (1991);
Biochem. Biophys. Res. Commun.; 179:1536-1542 (1991) described a neu protein-specific
activating factor (NAF) which is secreted by human T-cell line ATL-2 and which has a
molecular weight in the range of 8-24 kD. A 25 kD ligand from activated macrophages was
also described ('l'arakhovsky, et al., J. Cancer Res, 2188-2196 (1991 ).
Methods for the in vivo assay of tumors using HER2 specific monoclonal antibodies
and methods of treating tumor cells using HER2 specific monoclonal antibodies are described in
W089/06692.
There is a current and continuing need in the art to identity the actual ligand or ligands
that activate p185HER?, and to identify their biological role(s), including their roles in cell-
growth and differentiation, cell-transformation and the creation of malignant neoplasms.
Accordingly, it is an object of this invention to identify and purify one or more novel
p185HER9 ligand polypeptide(s) that bind and stimulate p185HER?.
It is another object to provide nucleic acid encoding novel p185HER? binding ligand
polypeptides and to use this nucleic acid to produce a p185HER3 binding ligand polypeptide in
recombinant cell culture for therapeutic or diagnostic use. and for the production of therapeutic
antagonists for use in certain metabolic disorders including, but not necessarily restricted to
the killing, inhibition and/or diagnostic imaging of tumors and tumorigenlc cells.
it is a further object to provide derivatives and modified forms of novel glycoprotein
ligands, including amino acid sequence variants, fusion polypeptides combining a p185HEF’-9
binding ligand and a heterologous protein and covalent derivatives of a p185HER3 binding ligand,
it is an additional object to prepare immunogens for raising antibodies against
pt85”ER2 binding ligands. as well as to obtain antibodies capable of binding to such ligands, and
antibodies which bind a p185HER? binding ligand and prevent the ligand from activating
p185HER?. It is a further object to prepare immunogens comprising a p185HE“2 binding ligand
fused with an immunogenic heterologous polypeptide.
These and other objects of the invention will be apparent to the ordinary artisan upon
consideration of the specification as a whole.
ligand families which bind to p185HER?. These ligands are denominated the heregulin (HRG)
polypeptides, and include HRG-o., HRG-B1, HRG-B2, HRG-B3 and other HRG polypeptides
which cross-react with antibodies directed against these family members and/or which are
substantially homologous as defined ma. A preferred HRG is the ligand disclosed in Fig. 4
and its fragments, further designated HRG-or. Other preferred HRGS are the ligands and
their fragments disclosed in Figure 8, and designated HRG-B1, HFlG—B2 disclosed in Figure
12, and HRG-[33 disclosed in Figure 13.
In another aspect, the invention provides a composition comprising HRG which is
isolated from its source environment, in particular HRG that is free of contaminating human
polypeptides. HFlG is purified by absorption to heparin sepharose, cation (e.g. polyaspartic
acid) exchange resins, and reversed phase HPLC.
HRG or HRG fragments (which also may be synthesized by in vifro methods) are
fused (by recombinant expression or an in vitro peptidyl bond) to an immunogenic polypeptide
and this fusion polypeptide, in turn, is used to raise antibodies against an HRG epitope. Anti-
HRG antibodies are recovered from the serum oi immunized animals. Alternatively.
monoclonal antibodies are prepared from cells in vitro ortrom in vivo immunized animals in
conventional fashion. Preferred antibodies identified by routine screening will bind to HRG, but
will not substantially cross-react with any other known ligands such as EGF, and will prevent
HFlG from activating p185HER2. In addition, anti-Hi-‘iG antibodies are selected that are
capable of binding specifically to individual family members of the HRG family, e.g. HRG-(1,
HRG-B1, HRG-[32, HRG-B3, and thereby may act as specific antagonists thereof.
HRG also is derivatized in vltro to prepare immobilized ‘HRG and labeled HRG,
particularly for purposes of diagnosis of HRG or its antibodies, or for affinity purification of
HRG antibodies. Immobilized anti-HRG antibodies are useful in the diagnosis (in vitro or in
vivo) or purification of HRG. In one preferred embodiment, a mixture of HRG and other
peptides is passed over a column to which the anti-HRG antibodies are bound.
Substitutional, deletional, or insertional variants of HRG are prepared by in vitro or
recombinant methods and screened. for example, for immuno-crossreactivity with the native
forms of HRG and tor HRG antagonist or agonist activity.
In another preferred embodiment, HFIG is used for stimulating the activity of
p185“ER2 in nonnal cells. In another preferred embodiment, a variant of HRG is used as an
antagonist to inhibit stimulation of p185HER?.
HRG, its derivatives, or its antibodies are formulated into physiologically acceptable
vehicles, especially for therapeutic use. Such vehicles include sustained-release forrnulatlons
of HRG or HRG variants. A composition is also provided comprising HRG and a
pharrnaceutically acceptable carrier, and an isolated polypeptide comprising HRG fused to a
heterologous polypeptide.
In still otheraspects, the invention provides an isolated nucleic acid encoding an HRG,
which nucleic acid may be labeled or unlabeled with a detectable moiety, and a nucleic acid
sequence that is complementary, or hybridizes under stringent conditions to, a nucleic acid
sequence encoding an HRG.
The nucleic acid sequence is also useful in hybridization assays for HRG nucleic acid
and in a method of determining the presence of an HFIG, comprising hybridizing the DNA (or
RNA) encoding (or complementary to) an HRG to a test sample nucleic acid and determining
the presence of an HRG. The invention also provides a method of amplifying a nucleic acid
test sample comprising priming a nucleic acid polymerase (chain) reaction with nucleic acid
(DNA or RNA) encoding (or complementary to) a HRG.
In still further aspects, the nucleic acid is DNA and further comprises a replicable
vector comprising the nucleic acid encoding an HRG operably linked to control sequences
recognized by a host transformed by the vector; host cells transformed with the vector; and a
method of using a nucleic acid encoding an HRG to effect the production of HRG, comprising
expressing HRG nucleic acid in a culture of the transformed host cells and recovering an HRG
from the host cell culture.
In further embodiments, the invention provides a method for producing HRG comprising
inserting into the DNA of a cell containing the nucleic acid encoding an HRG a transcription
modulatory element in sufficient proximity and orientation to an HRG nucleic acid to influence
(suppress or stimulate) transcription thereof, with an optional further step comprising culturing
the cell containing the transcription modulatory element and an HRG nucleic acid.
In still further embodiments, the invention provides a cell comprising the nucleic acid encoding
an HRG and an exogenous transcription modulatory element in sufficient proximity and orientation to
an HRG nucleic acid to influence transcription thereof; and a host cell containing the nucleic acid
encoding an HRG operably linked to exogenous control sequences recognized by the host cell.
_ BRIEF DESCRIPTION OF THE DRAWINGS
Purification of Heregulin on PolyAspartic Acid column.
PolyAspartic acid column chromography of heregulin-or was conducted and the elution
profile of proteins measured at A214. The 0.6 M NaCI pool from the heparin Sepharose
purification step was diluted to 0.2 M NaCl with water and loaded onto the polyaspartic acid
column equilibrated in 17 mM Na phosphate, pH 6.8 with 30% ethanol. A linear NaCl gradient
Figure 1
from 0.3 to 0.6 M was initiated at 0 time and was complete at 30 minutes. Fractions were
tested in HRG tyrosine autophosphorylation assay. The fractions corresponding to peak C
were pooled for further purification on C4 reversed phase HPLC.
Figurez C4 Reversed Phase Purification of Heregulin-2.
Panel A: Pool C from the polyaspartic acid column was applied to a C4 HPLC
column (SynChropak RP-4) equilibrated in 0.1% TFA and the proteins eluted with a
linear acetonitrile gradient at 0.25%/minute. The absorbance trace for the run
numbered C4-17 is shown. One milliliter fractions were collected for assay.
Panel B: Ten microliter aliquots of the fractions were tested in HRG tyrosine
autophosphorylation assay. Levels of phosphotyrosine in the p185HER3 protein were
quantitated by a specific antiphosphotyrosine antibody and displayed in arbitrary
units on the abscissa.
Panel C: Ten microliter fractions were taken and subjected to SDS gel
electrophoresis on 4-20% acrylamide gradient gels according to the procedure of
Laemmli (Nature, 227:680-685, 1970). The molecular weights of the standard proteins
are indicated to the left of the lane containing the standards. The major peak of
tyrosine phosphorylation activity found in fraction 17 was associated with a
prominent 45,000 Da band (HRG-oc).
Figure 3. SDS Polyacrylamide Gel Showing Purification of Heregulin-a.
Molecular weight markers are shown in Lane 1. Aliquots from the MDA-MB-231
conditioned media (Lane 2), the 0.6M NaCl pool from the heparin Sepharose column (Lane 3),
Pool 0 from the polyaspartic acid column (Lane 4) and Fraction 17 from the HPLC column
(C4-17) (Lane 5) were electrophoresed on a 4-20% gradient gel and silver stained. Lanes 6
and 7 contained buffer only and shows the presence of gel artifacts in the 50-65 KDa
molecular weight region.
Figures 4a-4d depict the deduced amino acid sequence of the cDNA contained in lgt1oher16
(SEQ ID NO:12 and SEQ ID NO:13). The nucleotides are numbered at the top left of each line
and the amino acids written in three letter code are numbered at the bottom left of each line.
The nucleotide sequence corresponding to the probe is nucleotides 681-720. The probable
transmembrane domain is amino acids 287-309. The six cysteines of the EGF motif are 226,
234, 240, 254, 256 and 265. The five potential three-amino acid N-linked glycosylation sites
are 164-166, 170-172, 208-210, 437-439 and 609-611. The serine-threonine potential 0-
glycosylation sites are 209-221. Serine-glycine dipeptide potential glycosaminoglycan addition
sites are amino acids 42-43, 64-65 and 151-152. The initiating methionine(MET) is at position
#45 of figure 4 although the processed N-terrninal residue is S46.
Figure 5 Northern blot analysis of MDA-MB-231 and SKBR3 RNAs Labeled from left to
right are the following: 1) MDA-MB-231 polyA minus-RNA, (RNA remaining after polyA-
containing RNA is removed); 2) MDA-MB-231 polyA plus-mRNA (RNA which contains polyA);
3) SKBR3 polyA minus-RNA; and, 4) SKBR3 polyA plus-mFtNA. The probe used for this
analysis was a radioactively (3?P) labelled internal xho1 DNA restriction endonuclease
fragment from the cDNA portion of 7Lgt10her16.
Figure 6 Sequence Comparisons in the EGF Family of Proteins. ‘
Sequences of several EGF-like proteins (SEQ ID NOS: 14, 15, 16, 17, 18, and 19)
around the cysteine domain are aligned with the sequence of HRG-ct. The location in figure 6
of the cysteines and the invariant glycine and arginine residues at positions 238 and 264
clearly show that HRG-or is a member of the EGF family. The region in figure 6 of highest
amino acid identity of the family members relative to HRG-or (30-40%) is found between Cys
234 and Cys 265. The strongest identity (40%) is with the heparin-binding EGF (HB-EGF)
species. HFtG-oi has a unique 3 amino acid insert between Cys 240 and Cys 254. Potential
transmembrane domains are boxed (287-309). Bars indicate the carboxy-terminal sites for
EGF and TGF-alpha where proteolytic cleavage detaches the mature growth factors from
their transmembrane associated profonns. HB-EGF is heparin binding-epidermal growth
factor; EGF is epidennal growth factor; TGF-alpha is transforming growth factor alpha; and
schwannoma is the schwannoma-derived growth factor. The residue numbers in Fig. 6 reflect
the Hg. 4 convention.
Figure? Stimulation of Cell Growth by HRG-oi.
Three different cell lines were tested for growth responses to 1 nM HRG-oi. Cell
protein was quantitated by crystal violet staining and the responses normalized to control,
untreated cells. -
Figures 8a-8d (SEQ ID NO:7) depict the entire potential coding DNA nucleotide sequence of the
heregulin-B1 and the deduced amino acid sequence of the cDNA contained in Aher 11.1dbl
(SEQ ID NO:9). The nucleotides are numbered at the top left of each line and the amino acids
written in three letter code are numbered at the bottom left of each line. The probable
transmembrane amino acid domain is amino acids 278-300. The six cysteines of the EGF
motif are 212, 220, 226, 240, 242 and 251. The five potential three-amino acid N-linked
glycosylation sites are 150-152, 156-158, 196-198, 428-430 and 600-612. The serine-threonine
potential O-glycosylation sites are 195-207. Serine-glycine dipeptide potential
glycosaminoglycan addition sites are amino acids 28-29, 50-51 and 137-138. The initiating
methionine (MET) is at position #31. HRG-B1 is processed to the N-terminal residue 832.
Figure 9 depicts a comparison of the amino acid sequences of heregulin-or and -B1. A dash (-)
indicates no amino acid at that position. (SEQ ID N028 and SEQ ID NO:9). This Fig. uses the
numbering convention of Figs. 4 and 6.
Figure 10 shows the stimulation of HER2 autophosphorylation using recombinant HRG-or as
measured by HER2 tyrosine phosphorylation.
Figure 11 depicts the nucleotide and inputed amino acid sequence of >.l5'her13 (SEQ iD NO:22);
the amino acid residue numbering convention is unique to this figure.
Figure 12a-12e depict the nucleotide sequence of ?Lher76, encoding heregulin-[32 (SEQ ID
NO:23). This figure commences amino acid residue numbering with the expressed N-terminal
MET; the N-terrninus is 32.
Figures 13a-13c depict the nucleotide sequence of 7.her78, encoding heregulin-[33 (SEQ ID
NO:24). This figure uses the amino acid numbering convention of Fig. 12; S2 is the processed
N-terminus.
Figures 14a-14d depict the nucleotide sequence of 7ther84, encoding a heregulin-B2-like
polypeptide (SEQ ID NO:25). This figure uses the amino acid numbering convention of Fig. 12;
S2 is the processed N-terrninus.
Figure 15a-15c depict the amino acid homologies between the known heregulins ((1, B1, B2, B2—like
and B3 in descending order) and illustrates the amino acid insertions, deletions or substitutions that
distinguish the different fonns (SEQ ID NOS:26-30). This figure uses the amino acid numbering
convention of Figs. 12-14.
i the description, examples, and claims.
Heregulln ("HRG") is defined herein to be any isolated polypeptide sequence which
possesses a biological activity of a polypeptide disclosed in Figs. 4, 8, 12, 13, or 15,
and fragments, alleles or animal analogues thereof. HRG excludes any
polypeptide heretofore identified, including any known polypeptide which is otherwise
anticipatory under 35 U.S.C. 102, as well as polypeptides obvious over such known
polypeptides under 35 U.S.C. 103, including in particular EFG, TFG-or, amphiregulin (Plowman
et al. Mol. Cell. Biol. 10:1969 (1990), HB-EGF (Higashimaya et al., Science 251:936 [1991]),
schwannoma factor or polypeptides obvious thereover.
‘Biological activity‘ for the purposes herein means an in vivo_eitector or antigenic
function that is directly or indirectly periorrned by an HRG polypeptide (whether in its native
or denatured conformation), or by any subsequence thereof. Effector functions include
receptor binding or activation, induction of differentiation, mitogenic or growth promoting
activity, immune modulation, DNA regulatory functions and the like, whether presently known
or inherent. Antigenic functions include possession of an epitope or antigenic site that is
capable of cross-reacting with antibodies raised against a naturally occurring or denatured
HRG polypeptide or fragment thereof.
Biologically active HRG includes polypeptides having both an effector and antigenic
function, or only one of such functions. HRG includes antagonist polypeptides to HRG,
provided that such antagonists include an epitope of a native HRG. A principal known
effector function of HRG is its ability to bind to p185HER3 and activate the receptor tyrosine
knase.
HRG includes the translated amino acid sequence of full length human HRGs
(proHRG) set forth herein in the Figures; deglycosylated or unglycosylated derivatives; amino
acid sequence variants; and covalent derivatives of HRG, provided that they possess
biological actvity. While the native profonn of HRG is probably a membrane-bound
polypeptide, soluble forms, such as those forms lacking a functional transmembrane domain
(proHRG or its fragments), are also included within this definition.
Fragments of intact HRG are included within the definition of HRG. Two principal
domains are included within the fragments. These are the growth factor domain ("GFD'),
homologous to the EGF family and located at about residues S216-A227 to N268-R286 (Fig. 9,
HRG-cc; the GFD domains for other HFiGs (Fig. 15) are the homologous sequences.).
Preferably, the GFDs for HRG-Of, [31, B2132-like and (33 are, respectively, G175-K241, G175-
K246, G175-K238, G175-K238 and G175-E241 (Fig. 15).
Another fragment of interest is the N-terminal domain ("NTD“). The NTD extends
from the N-terminus of processed HRG (S2) to the residue adjacent to an N-terminal residue
of the GFD, i.e., about T172-C182 (Fig. 15) and preferably T174. An additional group of
fragments are NTD-GFD domains, equivalent to the extracellular domains of HRG-or and B1-
(32. Another fragment is the C-tenninal peptide ("CTP") located about 20 residues N—tenninal
to the first residue of the transmembrane domain, either alone or in combination with the C-
terrninal remainder of the HRG.
In preferred embodiments, antigenically active HRG is a polypeptide that binds with an
affinity of at least about 107 Vmole to an antibody raised against a naturally occurring H-RG
sequence. Ordinarily the polypeptide binds with an affinity of at least about 103 Vmole. Most
preferably, the antigenically active HRG is a polypeptide that binds to an antibody raised
against one of HFlGs in its native conformation. HRG in its native conformation generally is
HRG as found in nature which has not been denatured by chaotropic agents, heat or other
treatment that substantially modifies the three dimensional structure of HRG as determined,
for example, by migration on nonreducing, nondenaturing sizing gels. Antibody used in this
determination is rabbit polyclonal antibody raised by formulating native HRG from a non-
rabbit species in Freund's complete adjuvant, subcutaneously injecting the formulation into
rabbits, and boosting the immune response by intraperitoneal injection of the formulation until
the titer of anti-H RG antibody plateaus. .
Ordinarily, biologically active HRG will have an amino acid sequence having at least
75% amino acid sequence identity with an HRG sequence, more preferably at least 80%, even
more preferably at least 90%, and most preferably at least 95%. Identity or homology with
respect to an HRG sequence is defined herein as the percentage of amino acid residues in the
candidate sequence that are identical with HRG residues in Figs. 15, after aligning the
sequences and introducing gaps, if necessary, to achieve the maximum percent homology, and
not considering any conservative substitutions to be identical residues. None of N-terminal,
C-terminal or internal extensions, deletions, or insertions into HRG sequence shall be construed
as affecting homology.
Thus, the biologically active HRG polypeptides that are the subject of this invention
include each expressed or processed HRG sequence; fragments thereof having a consecutive
sequence of at least 15, 20, 25, 30 or 40 amino acid residues; amino acid sequence
variants of HRG wherein an amino acid residue has been inserted N- or C-terminal to, or
within, HRG sequence or its fragment as defined above; amino acid sequence variants of HRG
sequence or its fragment as defined above wherein a residue has been substituted by another
residue. HRG polypeptides include those containing predetermined mutations by, e.g., site-
directed or PCR mutagenesis. HRG includes HFIG from such as species as rabbit, rat,
porcine, non-human primate, equine, murine, and ovine HRG and alleles or other naturally
occurring variants of the foregoing; derivatives of HRG or its fragments as defined above
wherein HRG or its fragments have been covalently modified by substitution, chemical,
enzymatic, or other appropriate means with a moiety other than a naturally occurring amino
acid (for example a detectable moiety such as an enzyme or radioisotope); glycosylation
variants of HRG (insertion of a glycosylation site or deletion of any glycosylation site by
deletion, insertion or substitution of an appropriate residue); and soluble forms of HRG, such
as HRG-GFD or those that lack a functional transmembrane domain.
in another embodiment, one or more of residues 13-18 independently are varied to
produce a sequence incapable of nuclear targeting. For example G13 is mutated to any other
naturally occurring residue including P, L, l, V, A, M, F, K, D or S; any one or more of K14-K16
are mutated to any other naturally occurring residue including R,H,D,E,N or 0; E17 to any
other naturally occurring residue including D, R, K, H, N or Q; and R18 to any other naturally
occurring residue including K, H, D, E, N or Q. All or any one of residues 13-18 are deleted as
well, or extraneous residues are inserted adjacent to these residues; for example residues
inserted adjacent to residue 13-18 which are the same as the above- suggested substitutions
for the residues themselves.
In another embodiment, enzymes or a nuclear regulatory protein such as a
transcriptional regulatory factor is fused to HRG-NTD, HRG-NTD-GFD, or HRG-GFD. The
enzyme or factor is fused to the N- or C- terminus, or inserted between the NTD and GFD
domains, or is substituted for the region of NTD between the first about 23 residues and the
GFD.
In accordance with this invention, HRG nucleic acid is RNA or DNA containing greater
than ten bases that encodes a biologically or antigenically active HRG, is complementary to
nucleic acid sequence encoding such HRG, or hybridizes to nucleic acid sequence encoding such
HRG and remains stably bound to it under stringent conditions.
Preferably, HRG nucleic acid encodes a polypeptide sharing at least 75% sequence
identity, more preferably at least 80%, still more preferably at least 85%, even more preferably
at 90%, and most preferably 95%, with an HRG sequence as shown in Figures 4, 8, 12, 13 or
. Preferably, the HRG nucleic add that hybridizes contains at least 20, more preferably at
least about 40, and most preferably at least about 90 bases. Such hybridizing or
complementary nucleic acid, however, excludes nucleic acid encoding EGF, TGF-Cl,
amphiregulin, HB-EGF, schwannoma factor or fragments or variants thereof which would
have been obvious as of the filing date hereof.
isolated HRG nucleic acid includes a nucleic acid thatis free from at least one
contaminant nucleic acid with which it is ordinarily associated in the natural source of HRG
nucleic acid. Isolated HRG nucleic acid thus is present in other than in the form or setting in
which it is found in nature. However, isolated HRG encoding nucleic acid includes HRG nucleic
acid in ordinarily HRG-expressing cells where the nucleic acid is in a chromosomal location
different from that of natural cells or is otherwise flanked by a different DNA sequence than
that found in nature. Nucleic acid encoding HRG may be used in specific hybridization assays,
particularly those portions of HRG encoding sequence that do not hybridize with other known
DNA sequences, for example those encoding the EGF-like molecules of figure 6.
'Siringent conditions“ are those that (1) employ low ionic strength and high
temperature for washing, for example, 0.015 M NACI/0.0015 M sodium citrate/0/1%
NaDodSO4 at 50° C; (2) employ during hybridization a denaturing agent such as forrnamlde,
for example, 50% (voVvol) formamide with 0.1% bovine serum albumin, 0.1% Ficoll, 0.1 %
polyvinylpyrrolidone, 50 mM sodium phosphate buffer at pH 6.5 with 750 mM NaCl, 75 mM
sodium citrate at 42° C; or (3) employ 50% formamide, 5 x SSC (0.75 M NaCl, 0.075 M
sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x
Denhardt‘s solution, sonicated salmon sperm DNA (50 g/ml), 0.1% SDS, and 10% dextran
sulfate at 42°C, with washes at 42°C in 0.2 x SSC and 0.1% SDS.
Particular HRG-or nucleic acids are nucleic acids or oligonucleotides consisting of or
comprising a nucleotide sequence selected from Figs. 4a-4d and containing greaterthan 17
bases (when excluding nucleic acid sequences of human small polydisperse circular DNA
(HUMPC125), chicken c-mos proto-oncogene homolog (CHKMOS), basement membrane
heparin sulfate proteoglycan (HUMBMHS P) and human llpocortin 2 pseudogene (complete cds-
like region, HUMLlP2B), ordinarily greater than 20 bases, preferably greater than 25 bases,
together with the complementary sequences thereof.
Particular HRG-B1, -B2 or -(33 nucleic acids are nucleic acids or oligonucleotides
consisting of or comprising a nucleotide sequence selected from Figs. 8a-8d, 12a-12e or 13a-13c
and containing greater than 20 bases, but does not include the polyA sequence found at the 3'
end of each gene as noted in the Figures, together with the complements to such sequences.
Preferably the sequence contains contains greater than 25 bases. HRG-B sequences also
may exclude the human small polydisperse circular DNA sequence (HUM P-C1 25).
In other embodiments, the HRG nucleotide sequence contains a 15 or more bass HRG
sequence and is selected from within the sequence encoding the HRG domain extending from
the N-terminus of the GFD to the N-terminus of the transmembrane sequence (or the
complement of that nucleic acid sequence). For example, with respect to HRG-or, the
nucleotide sequence is selected from within the sequence 678-869 (Fig. 4b) and contains a
sequence of 15 or more bases from this section of the HRG nucleic acid.
In other embodiments, the HRG nucleic acid sequence is greater than 14 bases and is
selected from a nucleotide sequence unique to each subtype, for instance a nucleic acid
sequence encoding an amino acid sequence that is unique to each of the HRG subtypes (orthe
complement of that nucleic acid sequence). These sequences are useful in diagnostic assays
for expression of the various subtypes, as well as specific amplification of the subtype DNA.
For example, the HFlG-on sequence of interest would be selected from the sequence encoding
the unique N-terminus or GFD~transmembrane joining sequence, e.g. about bp771-860.
Similarly, a unique HRG-[31 sequence is that which encodes the last 15 C-terrninal amino acid
residues; this sequence is not found in
HRG-oz.
in general, the length of the HRG-or or [3 sequence beyond greater than the above-
indicated number of bases is immaterial since all of such nucleic acids are useful as probes or
amplification primers. The selected HRG sequence may contain additional HRG sequence,
either the normal flanking sequence or other regions of the HRG nucleic acid, as well as other
nucleic acid sequences. For purposes of hybridization, only the HRG sequence is material.
The term "control sequences" refers to DNA sequences necessary for the expression
of an operably linked coding sequence in a particular host organism. The control sequences
that are suitable for prokaryotes, for example, include a promoter, optionally an operator
sequence, a ribosome binding site, and possibly, other as yet poorly understood sequences.
Eukaryotic cells are known to utilize promoters, polyadenylation signals, and enhancers.
Nucleic acid is "operably linked" when it is placed into a functional relationship with
another nucleic acid sequence. For example, DNA for a presequence or secretory leader is
operably linked to DNA for a polypeptide if it is expressed as a preprotein that participates in
the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding
sequence if it affects the transcription of the sequence; or a ribosome binding site is operably
linked to a coding sequence if it is positioned so as to facilitate translation. Generally,
"operably linked" means that the DNA sequences being linked are contiguous and, in the case
of a secretory leader, contiguous and in reading phase. However enhancers do not have to be
contiguous. Linking is accomplished by ligation at convenient restriction sites. if such sites do
not exist, then synthetic oligonucleotide adaptors or linkers are used in accord with
conventional practice.
An "exogenous" element is defined herein to mean nucleic acid sequence that is foreign
to the cell, or homologous to the cell but in a position within the host cell nucleic acid in which
the element is ordinarily not found.
As used herein, the expressions "cell", "cell line", and "cell culture“ are used
interchangeably, and all such designations include progeny. Thus, the words "transformants"
and “transformed cells" include the primary subject cell and cultures derived therefrom without
regard for the number of transfers. It is also understood that all progeny may not be precisely
identical in DNA content, due to deliberate or inadvertent mutations. Mutant progeny that
have the same function or biological activity as screened for in the_ originally transformed cell
are included. It will be clear from the context where distinct designations are intended.
"Plasmids" are designated by a lower case "p" preceded and/or followed by capital
letters and/or numbers. The starting plasmids herein are commercially available, are publicly
available on an. unrestricted basis, or can be constructed from such available plasmids in
accord with published procedures. in addition, other equivalent plasmids are known in the an
and will be apparent to the ordinary artisan.
"Restriction Enzyme Digestion“ of DNA refers to catalytic cleavage of the DNA with
an enzyme that acts only at certain locations in the DNA. Such enzymes are called
restriction endonucleases. and the sites for which each is specific is called a restriction site.
The various restriction enzymes used herein are commercially available and their reaction
conditions, cofactors, and other requirements as established by the enzyme suppliers are used.
Restriction enzymes commonly are designated by abbreviations composed of a capital letter
followed by other letters representing the microorganism from which each restriction enzyme
originally was obtained, and then a number designating the particular enzyme. in general,
about 1 pg of plasmid or DNA fragment is used with about 1-2 units of enzyme in about 20 pl
of buffer solution. Appropriate buffers and substrate amounts for particular restriction
enzymes are specified by the manufacturer. Incubation of about 1 hour at 37°C is ordinarily
used, but may vary in accordance with the supplier's instructions. After incubation, protein or
polypeptide is removed by extraction with phenol and chlorofonn, and the digested nucleic acid
is recovered from the aqueous fraction by precipitation with ethanol. Digestion with a
restriction enzyme may be followed with bacterial alkaline phosphatase hydrolysis of the
tenninal 5' phosphates to prevent the two restriction cleaved ends of a DNA fragment from
'circularizing" or fonning a closed loop that would impede insertion of another DNA fragment
at the restriction site. Unless otherwise stated, digestion of plasmids is not followed by 5'
terminal dephosphorylation. Procedures and reagents for dephosphorylation are conventional
as described in sections 1.56-1.61 of Sambrook etal., (Molecular Cloning: A Laboratory Manual
New York: Cold Spring Harbor Laboratory Press, 1989).
‘Ligation’ refers to the process of fonning phosphodiester bonds between two nucleic
acid fragments. To ligate the DNA fragments together, the ends of the DNA fragments must
be compatible with each other. In some cases, the ends will be directly compatible after
endonuclease digestion. However, it may be necessary to first convert the staggered ends
commonly produced after endonuclease digestion to blunt ends to make them compatible for
ligation. To blunt the ends, the DNA is treated in a suitable buffer for at least 15 minutes at
°C with about 10 units of the Klenow fragment of DNA polymerase I or T4 DNA
polymerase in the presence of the four deoxyribonucleotide triphosphates. The DNA is then
purified by phenol-chlorofonn extraction and ethanol precipitation. The DNA fragments that
are to be ligated together are put in solution in about equimolar amounts. The solution will also
contain ATP, ligase buffer, and a ligase such as T4 DNA ligase at about 10 units per 0.5 pg
of DNA. It the DNA is to be ligated into a vector, the vector is first linearized by digestion
with the appropriate restriction endonuc|ease(s). The linearized fragment is then treated with
bacterial alkaline phosphatase, or calf intestinal phosphatase to prevent self-ligation during
the ligation step.
The technique of "polymerase chain reaction," or "PCR," as used herein generally
refers to a procedure wherein minute amounts of a specific piece of nucleic acid, RNA and/or
DNA, are amplified as described in U.S. Pat. No. 4,683,195, issued 28 July 1987. Generally,
sequence lntonnation from the ends of the region of interest or.beyond needs to be available,
such that oligonucleotide primers can be designed; these primers will be identical or similar in
sequence to opposite strands of the template to be amplified. The 5‘ terminal nucleotides of
the two primers may coincide with the ends of the amplified material. PCR can be used to
amplify specific RNA sequences, specific DNA sequences from total genomic DNA, and cDNA
transcribed from total cellular RNA, bacteriophage or plasmid sequences, etc. See generally
Mullis et al., Cold Spring Harbor Symp. Quant. Biol. 51: 263 (1987); Erlich, ed., PCR
Technology, (Stockton Press, NY, 1989). As used herein, PCR is considered to be one, but
not the only, example of a nucleic acid polymerase reaction method for amplifying a nucleic
acid test sample, comprising the use of a known nucleic acid (DNA or RNA) as a primer, and
utilizes a nucleic acid polymerase to amplify or generate a specific piece of nucleic acid or to
amplify or generate a specific piece of nucleic acid which is complementary to a particular
nucleic acid.
The "HRG tyrosine autophosphorylation assay" to detect the presence of HRG
ligands was used to monitor the purification of a ligand for the p185HE92 receptor. This assay
is based on the assumption that a specific ligand for the p185HER3 receptor will stimulate
autophosphorylation of the receptor, in analogy with EGF and its stimulation of EGF receptor
autophosphorylation. MDA-MB-453 cells or MCF7 cells which contain high levels of p185HER2
receptors but negligible levels of human EGF receptors, were obtained from the American
Type Culture Collection, Rockville, Md. (ATCC No HTB-131) and maintained in tissue culture
with 10% fetal calf serum in DMEM/Hams F12 (1:1) media. For assay, the cells were
trypsinized and plated at about 150,000 cells/well in 24 well dishes (Costar). After incubation
with serum containing media overnight, the cells were placed in serum free media for 2-18
hours before assay. Test samples of 100 uL aliquots were added to each well. The cells
were incubated for 5-30 minutes (typically 30 min) at 37°C and the media removed. The
cells in each well were treated with 100 uL SDS gel denaturing buffer (Seprosol, Enpotech,
Inc.) and the plates heated at 100°C for 5 minutes to dissolve the cells and denature the
proteins. Aliquots from each well were electrophoresed on 5-20% gradient SDS gels (Novex,
Encinitas, CA) according to the manufacturers directions. After the dye front reached the
bottom of the gel, the electrophoresis was terminated and a sheet of PVDF membrane
(ProBlott, ABI) was placed on the gel and the proteins transferred from the gel to the
membrane in a blotting chamber (BioRad) at 200 mAmps for 30-60 min. After blotting, the
membranes were incubated with Tris buffered saline containing 0.1% Tween 20 detergent
buffer with 5% BSA for 2-18 hrs to block nonspecific binding, and then treated with a mouse
anti-phosphotyrosine antibody (Upstate Biological lnc., N.Y.). Subsequently, the membrane
blots were treated with goat anti-mouse antibody conjugated to alkaline phosphatase. The
gels were developed using the ProtoBlot System from Promega. After drying the membranes,
the density of the bands corresponding to p185HER2 in each sample lane was quantitated with
a Hewlett Packard ScanJet Plus Scanner attached to a Macintosh computer. The number
of receptors per cell in the MDA-MB-453 or MCF-7cells is such that under these experimental
conditions the p185HER2 receptor protein is the major protein which is labeled.
‘Protein microsequencing“ was accomplished based upon the following procedures.
Proteins from the final HPLC step were either sequenced directly by automated Edman
degradation with a model 470A Applied Biosystems gas phase sequencer equipped with a
120A PTH amino acid analyzer or sequenced after digestion with various chemicals or
enzymes. PTH amino acids were integrated using the ChromPerfect data system (Justice
Innovations, Palo Alto, CA). Sequence interpretation was performed on a VAX 11/785 Digital
Equipment Corporation computer as described (Henzel etal, J. Chromatography 404:41-52
(1987)). In some cases, aliquots of the HPLC fractions were electrophoresed on 5-20% SDS
polyacrylamide gels, electrotransferred to a PVDF membrane (ProBlott, ABI, Foster City,
CA) and stained with Coomassie Brilliant Blue (Malsudaira, P., J. Biol. Chem. 262:10035-
10038, 1987). The specific protein was excised from the blot for N terminal sequencing. To
detennine intemal protein sequences, HPLC fractions were dried under vacuum (SpeedVac),
resuspended in appropriate buffers, and digested with cyanogen bromide, the lysine-specific
enzyme Lys-C (Wako Chemicals. Richmond, VA) or Asp-N (Boehringer Mannheim,
Indianapolis, Ind.). After digestion, the resultant peptides were sequenced as a mixture or
were resolved by HPLC on a C4 column developed with a propanol gradient in 0.1% TFA
before sequencing as described above.
II. USE AND PREEARATION OF HRG POLYPEPTIDES
. T F P P E l V RIA T
The system to be employed in preparing HRG polypeptides will depend upon the
particular HRG sequence selected. If the sequence is sufficiently small HRG is prepared by in_
yitm polypeptide synthetic methods. Most commonly, however, HRG is prepared in
recombinant cell culture using the host-vector systems described below.
in general, mammalian host cells will be employed, and such hosts may or may not
contain post-translational systems for processing HRG prosequences in the nonnal fashion. It
the host cells contain such systems then it will be possible to recover natural subdomain
fragments such as HRG-GFD OR HRG-NTD-GFD from the cultures. If not, then the proper
processing can be accomplished by transforming the hosts with the required enzyme(s) or by
cleaving the precursor However, it is not necessary to transform cells with DNA
encoding the complete prosequence for a selected HRG when it is desired to only produce
fragments of HRG sequences such as an HRG-GFD. For example, to prepare HRG-GFD a
start codon is ligated to the 5‘ end of DNA encoding an HRG-GFD, this DNA is used to
transform host cells and the product expressed directly as the Met N-terminal form (it
desired, the extraneous Met may be removed 1n_\mm or by endogenous N-terminal
demefhionylases). Alternatively. HRG-GFD is expressed as a fusion with a signal sequence
recognized by the host cell, which will process and secrete the mature HRG-GFD as is further
described below. Amino acid sequence variants of native HRG-G FD sequences are produced
in the same way.
HRG-NTD is produced in the same fashion as the full length molecule but from
expression of DNA encoding only HRG-NTD, with the stop codon after one of S172-C182 (Fig.
).
In addition, HRG variants are expressed from DNA encoding protein in which both the
GFD and NTD domains are in their proper orientation but which contain an amino acid
insertion, deletion or substitution at the NTD-GFD joining site (for example located within the
sequence S172-C182. In another embodiment a stop codon is positioned at the 3' end of the
NTD-GFD-encoding sequence (after any residue T/0222-T245 of Fig. 15). The result is a
soluble form of HFlG-ct or -B1 or -(32 which lacks its transmembrane sequence (this sequence
also may be an internal signal sequence but will be referred to as a transmembrane sequence).
In further variations of this embodiment, an internal signal sequence of another polypeptide is
substituted in place of the native HRG transmembrane domain, or a cytoplasmic domain of
another cell membrane polypeptide, e.g. receptorkinase, is substituted for the HRG-or or HRG
[31-B2 cytoplasmic peptide.
In a still further embodiment, the NTD, GFD and transmembrane domains of HRG and
other EGF family members are substituted for one another, e.g. the NTD equivalent region of
EGF is substituted for the NTD of HRG, or the GFD of HRG is substituted for EGF in the
processed, soluble proform of EGF. Alternatively, an HRG or EGF family member
transmembrane domain is fused onto the C-terminal E236 of HRG-fig.
In a further variant, the HRG sequence spanning K241 to the C-tenninus is fused at
its N-terminus to the C-tenninus of a non-HRG polypeptide.
Another embodiment comprises the functional or structural deletion of the proteolytic
processing site in CTP, the GFD-transmembrane spanning domain. For example, the putative
C-tenninal lysine (K241) of processed HRG-on or 131-132 is deleted, substituted with another
residue, a residue other than K or R inserted between K241 and R242, or other disabling
mutation is made in the prosequence.
in another embodiment, the domain of any EGF family member extending from (a) its
cysteine corresponding to (b) C221 to the C-terminal residue of the family member is
substituted for the analogous domain of HRG-(X or -(31 or-B2 (or fused to the C-terminus of
HRG-B3). Such variants will be processed tree of host cells in the same fashion as the family
member rather than as the parental HRG. In more refined embodiments other specific
cleavage sites (e.g. protease sites) are substituted into the CTP or GFD-transmembrane
spanning domain (about residues T/Q222-T245, Fig. 15). For example, amphiregulin sequence
E84-K99 or TGFot sequence E44-K58 is substituted for HRG-Cl residues E223-K241.
in a further embodiment, a variant (termed HFtG-NTDXGFD) is prepared wherein (1)
the lysine residue found in the NTD-GFD joining sequence VKC (residues 180-182, Figure 15) is
deleted or (preferably) substituted by another residue other than R such as H, A, T or S and
(2) a stop codon is introduced in the sequence RCT or RCQ (residues 220-222, Figure 15) in
place of C, or T (for HRG-ot) or Q (for HRG-beta).
A preferred HRG-or ligand with binding affinity to p185”ER3 comprises amino acids
226-265 of figure 4. This HRG-or ligand funher may comprise up to an additional 1-20 amino
acids preceding amino acid 226 from figure 4 and 1-20 amino acids following amino acid 265
from figure 4. A preferred HRG-[3 ligand with binding affinity to p185”ER3 comprises amino
acids 226-265 of figure 8. This HRG-B ligand may comprise up to an additional 1-20 amino
acids preceding amino acid 226 from figure 8 and 1-20 amino acids following amino acid 265
from figure 8.
GFD sequences include those in which one or more residues corresponding to another
member of the EGF family are deleted or substituted or have a residue inserted adjacent
thereto. For example, F216 of HRG is substituted by Y, L202 with E, F189 with Y, or $203-
P205 is deleted.
HRG also includes NTD-GFD having its C-terminus at one of the first about 1 to 3
extracellular domain residues (QKR, residues 240-243, HRE-oz, Figure 15) or first about 1-2
transmembrane region residues. In addition, in some HRG-GFD variants the codons are
modified at the GFD-transmember proproteolysis site by substitution, insertion or deletion.
The GFD proteolysis site is the domain that contains the GFD C-tenninal residue and about 5
residues N- and 5 residues C-terminal from this residue. At this time neither the natural C-
terminal residue for HRG-or or HRG-[3 has been identified, although it is known that Met-227
tenninal and Val-229 terminal HRG-on-GFD are biologically active. The native C-terminus for
HRG-or-GFD is probably Met-227, Lys-228, Val-229, Gln-230, Asn-231 or Gin-232, and for
HRG B1-B2-GFD is probably Met-226, Ala-227, Ser-228, Phe-229, Trp-230, Lys 231or (for
HRG-B1) K240 or (for HRG-B2) K246. The native C-temtinus is determined readily by C-
tenninal sequencing, although it is not critical that HRG-GFD have the native terminus so long
as the GFD sequence possesses the desired activity. in some embodiments of HRG-GFD
variants, the amino acid change(s) in the CTP are screened for their ability to resist
proteolysis ir1_\/‘rtlggg and inhibit the protease responsible for generation of HRG-GFD.
If it is desired to prepare the full length HRG polypeptides and the 5' or 3' ends of the
given HRG are not described herein, it may be necessary to prepare nucleic acids in which the
missing domains are supplied by homologous regions from more complete HRG nucleic acids.
Altematively, the missing domains can be obtained by probing libraries using the DNAs
disclosed in the Figures or fragments thereof.
A. lglation of DNA Emging Hfiulin
The DNA encoding HRG may be obtained from any cDNA library prepared from
tissue believed to possess HFiG mRNA and to express it at a detectable level. HRG DNA
also is obtained from a genomic library.
Libraries are screened with probes or analytical tools designed to identify the gene of
interest or the protein encoded by it. For cDNA expression libraries, suitable probes include
monoclonal or polyclonal antibodies that recognize and specifically bind to HRG;
oligonucleotides of about 20-80 bases in length that encode known or suspected portions of
HRG cDNA from the same or different species; and/or complementary or homologous cDNAs
or fragments thereof that encode the same or a hydridizing gene. Appropriate probes for
screening genomic DNA libraries include, but are not limited to, oligonucleotides; cDNAs or
fragments thereof that encode the same or hybridizing DNA; and/or homologous genomic
DNAs or fragments thereof. Screening the cDNA or genomic library with the selected probe
may be conducted using standard procedures as described in chapters 10-12 of Sambrook er
al., supra.
An alternative means to isolate the gene encoding HRG is to use polymerase chain
reaction (PCR) methodology as described in section 14 of Sambrook et aI., supra. This
method requires the use of oligonucleotide probes that will hybridize to HRG. Strategies for
selection of oligonucleotides are described below.
Another alternative method for obtaining the gene of interest is to chemically
synthesize it using one of the methods described in Engels et al. (Agnew. Chem. Int. Ed. Engl.,
28: 716-7311,1989). These methods include triester, phosphite, phosphoramidite and H-
Phosphonate methods, PCR and other autoprimer methods, and oligonucleotide syntheses on
solid supports. These methods may be used if the entire nucleic acid sequence of the gene is
known, or the sequence of the nucleic acid complementary to the coding strand is available, or
altematively, if the target amino acid sequence is known, one may infer potential nucleic acid
sequences using known‘ and preferred coding residues for each amino acid residue.
A preferred method of practicing this invention is to use carefully selected
oligonucleotide sequences to screen cDNA libraries from various tissues, preferably human
breast, colon, salivary gland, placental, fetal, brain, and carcinoma cell lines. Other biological
sources of DNA encoding an heregulin-like ligand include other mammals and birds. Among the
preferred mammals are members of the following orders: bovine, ovine, equine, murine, and
rodentia.
The oligonucleotide sequences selected as probes should be of sufficient length and
sufficiently unambiguous that false positives are minimized. The actual nucleotide
sequence(s) is usually based on conserved or highly homologous nucleotide sequences or
regions of HRG-or. The oligonucleotides may be degenerate at one or more positions. The use
of degenerate oligonucleotides may be of particular importance where a library is screened
from a species in which preferential codon usage in that species is not known. The
oligonucleotide must be labeled such that itcan be detected upon hybridization to DNA in the
library being screened. The preferred method of labeling is to use 39?-labeled ATP with
polynucleotide kinase, as is well known in the art, to radiolabel the oligonucleotide. However,
other methods may be used to label the oligonucleotide, including, but not limited to,
biotinylation or enzyme labeling.
Of particular interest is HRG nucleic acid that encodes the full-length propolypeptide.
In some preferred embodiments, the nucleic acid sequence includes the native HRG signal
transmembrane sequence. Nucleic acid having all the protein coding sequence is obtained by
screening selected cDNA or genomic libraries, and, if necessary, using conventional primer
extension procedures as described in section 7.79 of Sambrook et a/., supra, to detect
precursors and processing intermediates of mRNA that may not have been reverse-
transcribed into cDNA.
HRG encoding DNA is used to isolate DNA encoding the analogous ligand from other
animal species via hybridization employing the methods discussed above. The preferred
animals are mammals, particularly bovine, ovine, equine, feline, canine and rodentia, and more
specifically rats, mice and rabbits.
B. Amino Acid mueme Variants of Hergulin
Amino acid sequence variants of HRG are prepared by introducing appropriate
nucleotide changes into HRG DNA, or by in vitro synthesis of the desired HRG polypeptide.
Such variants include, for example, deletions from, or insertions or substitutions of, residues
within the amino acid sequence shown for human HRG sequences. Any combination of
deletion, insertion, and substitution can be made to arrive at the final construct, provided that
the final construct possesses the desired characteristics. The amino acid changes also may
alter post-translational processes of HRG-cc, such as changing the number or position of
glycosylation sites, altering the membrane anchoring characteristics, altering the intra-cellular
location of HRG by inserting, deleting, or otherwise affecting the transmembrane sequence of
native HRG, or modifying its susceptibility to proteolytic cleavage.
in designing amino acid sequence variants of HRG, the location of the mutation site
and the nature of the mutation will depend on HRG characteristic(s) to be modified. The sites
for mutation can be modified individually or in series, e.g., by (1) substituting first with
conservative amino acid choices and then with more radical selections depending upon the
results achieved, (2) deleting the target residue, or (3) inserting residues of other ligands
adjacent to the located site.
A useful method for identification of HRG residues or regions for mutagenesis is called
‘alanine scanning mutagenesis“ as described by Cunningham and Wells (Science, 244: 1081-
1085, 1989). Here, a residue or group of target residues are identified (e.g., charged residues
such as arg, asp, his, lys, and glu) and replaced by a neutral or negatively charged amino acid
(most preferably alanine or polyalanine) to affect the interaction of the amino acids with the
surrounding aqueous environment in or outside the cell. Those domains demonstrating
functional sensitivity to the substitutions then are refined by introducing further or other
variants at or for the sites of substitution. Thus, while the site for introducing an amino acid
sequence variation is predetermined, the nature of the mutation per se need not be
predetermined. For example, to optimize the perfomtance of a mutation at a given site, ala
scanning or random mutagenesis may be conducted at the target codon or region and the
expressed HRG variants are screened for the optimal combination of desired activity.
There are two principal variables in the construction of amino acid sequence variants:
the location of the mutation site and the nature of the mutation. These are variants from
HRG sequence, and may represent naturally occurring alleles (which will not require
manipulation of HRG DNA) or predetermined mutant forms made by mutating the DNA, either
to arrive at an allele or a variant not found in nature. in general, the location and nature of
the mutation chosen will depend upon HRG characteristic to be modified. Obviously, such
variations that, for example, convert HRG into a known receptor ligand, are not included
within the scope of this invention, nor are any other HRG variants or polypeptide sequences
that are not novel and unobvious over the prior art.
Amino acid sequence deletions generally range from about 1 to 30 residues, more
preferably about 1 to 10 residues, and typically about 1 to 5 contiguous residues. Deletions
may be introduced into regions of low homology with other EGF family precursors to modify
the activity of HRG. Deletions from HRG in areas of substantial homology with other EGF
family sequences will be more likely to modify the biological activity of HRG more significantly.
The number of consecutive deletions will be selected so as to preserve the tertiary structure
of HRG in the affected domain, e.g., cysteine crosslinking, beta-pleated sheet or alpha helix.
Amino acid sequence insertions include amino- and/or carboxyl-terminal fusions
ranging in length from one residue to polypeptides containing a hundred or more residues, as
well as intrasequence insertions of single or multiple amino acid residues. intrasequence
insertions (i.e., insertions within HRG sequence) may range generally from about 1 to 10
residues, more preferably 1 to 5, and most preferably 1 to 3. Examples of terminal insertions
include HRG with an N-terrninal methionyl residue (an artifact of the direct expression of HRG
in bacterial recombinant cell culture), and fusion of a heterologous N-terminal signal sequence
to the N-terrninus of HRG to facilitate the secretion of mature HRG from recombinant host
cells. Such signal sequences generally will be obtained from, and thus be homologous to, the
intended host cell species. Suitable sequences include STll or lpp for E. coli, alpha factor for
yeast, and viral signals such as herpes gD for mammalian cells.
Other insertional variants of HRG include the fusion to the N- or C-terminus of HRG
to an immunogenic polypeptide, e.g., bacterial polypeptides such as beta-lactamase or an
enzyme encoded by the E. colilrp locus, or yeast protein, bovine serum albumin, and
chemotactic polypeptides. C-terminal fusions of HRG-NTD-GFD with proteins having a long
half-life such as immunoglobulin constant regions (or other immunoglobulin regions), albumin, or
ferritin, as described in WO 89/02922, published 6 April 1989 are included.
Another group of variants are amino acid substitution variants. These variants have
at least one amino acid residue in the HRG molecule removed and a different residue inserted
in its place. The sites of greatest interest for substitutional mutagenesis include sites
identified as the active site(s) of HRG, and sites where the amino acids found in HRG ligands
from various species are substantially different in terms of side-chain bulk, charge. and/or
hydrophobicity.
The amino terminus of the cytoplasmic region ot HRG may be fused to the carboxy
terminus of heterologous transmembrane domains and receptors, to form a fusion polypeptide
useful for intracellular signaling of a ligand binding to the heterologous receptor.
Other sites of interest are those in which particular residues of HHG-like ligands
obtained from various species are identical. These positions may be important for the
biological activity of HRG. These sites, especially those tailing within a sequence of at least
three other identically conserved sites, are substituted in a relatively conservative manner.
Such conservative substitutions are shown in Table 1 under the heading of "preierred
substitutions". if such substitutions result in a change in biological activity, then more
substantial changes, denominated exemplary substitutions in Table 1, or as further described
below in reference to amino acid classes, are introduced and the products screened.
TABLE 1
Original Exemplary Preferred
Residue Substitutions Substitutions
Ala (A) val; leu;ile val
Arg (R) lys;gln;asn lys
Asn (N) gln;his; lys; arg gln
Asp (D) glu glu
Cys (C) ser ser
Gin (Q) asn asn
Glu (E) asp asp
Gly (G) pro pro
His (H) asn; gin; lys; arg arg
lie (I) leu; val; met; ala; phe;
norieucine leu
Leu (L) norieucine; ile;val;
met; ala; phe ile
Lys (K) arg;gln;asn arg
Met (M) leu; phe; ile leu
Phe (F) leu; val; ile; ala leu
Pro (Pl ell’ elv
Ser (8) thr thr
Thr (T) ser ser
Trp (W) tyr tyr
Tyr (Y) trp; phe; thr; ser phe
Val (V) ile; leu; met; phe;
ala; norieucine leu
Substantial modifications in function or immunological identity of HRG are
accomplished by selecting substitutions that differ significantly in their effect on maintaining
(a) the structure of the polypeptide backbone in the area of the substitution, for example, as
a sheet or helical conformation, (b) the charge or hydrophobicity of the molecule at the target
site, or (c) the bulk of the side chain. Naturally occurring residues are divided into groups
based on common side chain properties:
1 hydrophobic: norleucine, met, ala, val, leu, ile;
) neutral hydrophilic: cys, ser, thr;
) acidic: asp, glu;
) basic: asn, gln, his, lys, arg;
) residues that influence chain orientation: gly, pro; and
6) aromatic: trp, tyr, phe.
Non-‘conservative substitutions will entail exchanging a member of one of these
classes for another. Such substituted residues may be introduced into regions of HRG that
are homologous with other receptor ligands, or, more preferably, into the non-homologous
regions of the molecule.
In one embodiment of the invention, it is desirable to inactivate one or more protease
cleavage sites that are present in the molecule. These sites are identified by inspection of the
encoded amino acid sequence. Where potential protease cleavage sites are identified, e.g. at
K241 R242, they are rendered inactive to proteolytic cleavage by substituting the targeted
residue with another residue, preferably a basic residue such as glutamine or a hydrophylic
residue such as serine; by deleting the residue; or by inserting a prolyl residue immediately
after the residue.
in another embodiment, any methionyl residue other than the starting methionyl
residue, or any residue located within about three residues N- or C-terrninal to each such
methionyl residue, is substituted by another residue (preferably in accord with Table 1) or
deleted. We have found that oxidation of the 2 GFD M residues in the courses of E. coli
expression appears to severely reduce GFD activity. Thus, these M residues are mutated in
accord with Table 1. Alternatively, about 1-3 residues are inserted adjacent to such sites.
Any cysteine residues not involved in maintaining the proper conformation of HRG
also may be substituted, generally with serine, to improve the oxidative stability of the
molecule and prevent aberrant crosslinking.
Sites particularly suited for substitutions, deletions or insertions, or use as fragments.
include, numbered from the N-terminus of HRG-oi of Figure 4:
) potential glycosaminoglycan addition sites at the serine-glycine dipeptides at 42-43,
64-65, 151-152;
) potential asparagine-linked glycosylation at positions 164, 170, 208 and 437. sites
(NDS) 164-166, (NIT) 170-172, (NTS) 208-210, and NTS (609-611);
) potential O-glycosylation in a cluster of serine and threonine at 209-218;
) cysteines at 226, 234, 240, 254, 256 and 265;
) transmembrane domain at 287-309;
) loop 1 delineated by cysteines 226 and 240;
) loop 2 delineated by cysteines 234 and 254;
) loop 3 delineated by cysteines 256 and 265; and
) potential protease processing sites at 2-3, 8-9, 23-24, 33-34, 36-37, 45-46, 48-49, 62-
63, 66-67, 86-87, 110-111, 123-124, 134-135, 142-143, 272-273, 278-279 and 285-286;
Analogous regions in HRG-[31 may be determined by reference to figure 9 which aligns
analogous amino acids in HRG-oi and HRG-B1. The analogous HRG-[31 amino acids may be
mutated or modified as discussed above for HRG-oi. Analogous regions in HRG-[32 may be
detennined by reference to figure 15 which aligns analogous amino acids in HRG-(1, HRG-B1
and HRG-B2. The analogous HRG-[32 amino acids may be mutated or modified as discussed
above for HRG-cc or HRG-B1. Analogous regions in HRG-[33 may be determined by
reference to figure 15 which aligns analogous amino acids in HRG-(X, HRG-B1 and HRG-B2.
The analogous HRG-[33 amino acids may be mutated or modified as discussed above for
HRG-ct, HRG-[31, or HRG-B2.
DNA encoding amino acid sequence variants of HRG is prepared by a variety of
methods known in the art. These methods include, but are not limited to, isolation from a
natural source (in the case of naturally occurring amino acid sequence variants) or
preparation by oligonucleotide-mediated (or site-directed) mutagenesis, PCR mutagenesis, and
cassette mutagenesis of an earlier prepared variant or a non-variant version of HRG. These
techniques may utilize HRG nucleic acid (DNA or RNA), or nucleic acid complementary to
HRG nucleic acid.
Oligonucleotide-mediated mutagenesis is a preferred method for preparing substitution,
deletion, and insertion variants of HRG DNA. This technique is well known in the art as
described by Adelman etaI., DNA, 2: 183 (1983).
Generally, oligonucleotides of at least 25 nucleotides in length are used. An optimal
oligonucleotide will have 12 to 15 nucleotides that are completely complementary to the
template on either side of the nucleotide(s) coding for the mutation. This ensures that the
oligonucleotide will hybridize properly to the single-stranded DNA template molecule. The
oligonucleotides are readily synthesized using techniques known -in the art such as that
described by Crea etal. (Proc. Natl. Acad. Sci. USA, 75: 5765,1978).
Single-stranded DNA template may also be generated by denaturing double-stranded
plasmid (or other) DNA using standard techniques.
For alteration of the native DNA sequence (to generate amino acid sequence
variants, for example), the oligonucleotide is hybridized to the single-stranded template under
suitable hybridization conditions. A DNA polymerizing enzyme, usually the Klenow fragment
of DNA polymerase I, is then added to synthesize the complementary strand of the template
using the oligonucleotide as a primer for synthesis. A heteroduplex molecule is thus formed
such that one strand of DNA encodes the mutated form of HRG, and the other strand (the
original template) encodes the native, unaltered sequence of HRG. This heteroduplex molecule
is then transfonned into a suitable host cell, usually a prokaryote such as E. coli JM101. After
the cells are grown, they are plated onto agarose plates and screened using the oligonucleotide
primer radiolabeled with 32P-phosphate to identify the bacterial colonies that contain the
mutated DNA. The mutated region is then removed and placed in an appropriate vector for
protein production, generally an expression vector of the type typically employed for
transformation of an appropriate host.
The method described immediately above may be modified such that a homoduplex
molecule is created wherein both strands of the plasmid contain the mutation(s). The
modifications are as follows: the single-stranded oligonucleotide is annealed to the single-
stranded template as described above. A mixture of three deoxyrlbonucleotides,
deoxyriboadenosine (dATP), deoxyriboguanosine (dGTP), and deoxyribothymidine (d'l'l'P), is
combined with a modified thio-deoxyribocytosine called dCTP-(as)
(Amersham Corporation). This mixture is added to the template-oligonucleotide complex.
Upon addition of DNA polymerase to this mixture, a strand of DNA identical to the template
except for the mutated bases is generated. In addition, this new strand of DNA will contain
dCTP-(as) instead of dCTP, which serves to protect it from restriction endonuclease
digestion. After the template -strand of the double-stranded heteroduplex is nicked with an
appropriate restriction enzyme, the template strand can be digested with Exglll nuclease or
another appropriate nuclease past the region that contains the site(s) to be mutagenized. The
reaction is then stopped to leave a molecule that is only partially single-stranded. A complete
double-stranded DNA homoduplex is then fonned using DNA polymerase in the presence of all
four deoxyribonucleotide triphosphates, ATP, and DNA ligase. This homoduplex molecule can
then be transformed into a suitable host cell such as E. coIi.lM101, as described above.
Explanary substitutions common to any HRG include S2T or D; E3D or K; R4 K or E;
KSR or E; E6D or K; G7P or Y; Fl8K or D; G9P or Y; K10R or E; G11P or Y; K12Fl or E; G19P
or Y; S20T or F; G21 P or Y; K22 or E; K23R or E; Q38D; S107N; G108P; N120K; D121 K; S122
T; N126S; l126L; T1278; A163V; N164K; T165-T174; any residue to l, L, V, M, F, D, E, R or
K; G175V or P; T1768 or V; S177K or T; H178K or S; L179F or I; V180L or S; K181R or E; A
183N or V; E1B4K or D; K185R or E; E186D or Y; K187R or D; T1888 or Q; F189Y or S; V191L
or D; N192Q or H; G193P or A; G194P or A; E195D or K; F197Y or I; M198V or Y; V199L or T;
K200V or R; D201E or K; L202E or K; S203A or T; N204A; N204Q; P205A; P205G; S206T or
R; Fi207K or A;,Y208P or F; L209l or D; K211i or D; F216Y or I; T217 H or S; G218A or P;
A/D219K or R; R220K or A; A235/240/232V or F; E236/241/233D or K; E237/242/234D or
K; L238/243/235l or T; Y239/244/236F or T; Q240/245/237N or K; K241/246/238H or R;
R242/247/238H or K; V243/248/239L or T; L244/249/240l or S; T245/250/241$ or I;
I246/251/242V or T and T247/252/243$ or I. Specifically with respect to HRG-or, T2228, K
or V; E223D, R or Q; N224Q, K or F; V225A, R or D; P226G, l K or F; M227V, T, R or Y;
K228R, H or D; V229L, K or D; Q230N, R or Y; N231Q, K or Y; Q232N, R or Y; E2330, K or
T and K 234R, H or D (adjacent K/R mutations are paired in alternative embodiments to
create new proteolysis sites). Specifically with respect to HRG-[3 (any member), Q222N, R
or Y; N223Q, K or Y; Y224F, T or R; V225A, K or D; M226V, T or R; A227V, K, Y or D;
S228T, Y or R; F229Y, l or K and Y230F, T or R are suitable variants. Specifically with
respect to HRG-B1, K231R or D, H232R or D; L233l, K, F or Y; G234P, R, A or S; l235l, K, F
or Y; E236D, R or A; F2371, Y, K or A; M238V, T, R or A and E239D, R or A are suitable
variants. Specifically with respect to HRG-[$1 and HRG-[32, K231R or D are suitable
variants. Altematively, each of these residues may be deleted or the indicated substituents
inserted adjacent thereto. in addition, about from 1-10 variants are combined to produce
combinations. These changes are made in the proHRG, NTD, GFD, NTD-GFD or other
fragments or fusions. Q213-G215, A219 and the about 11-21 residues C-terminal to C221
differ among the various HRG classes. Residues at these are interchanged among HRG
classes or EGF family members, are deleted, or a residue inserted adjacent thereto.
DNA encoding HRG-or mutants with more than one amino acid to be substituted may
be generated in one of several ways. If the amino acids are located close together in the
polypeptide chain, they may be mutated simultaneously using one oligonucleotide that codes
for all of the desired amino acid substitutions. If, however, the amino acids are located some
distance from each other (separated by more than about ten amino acids), it is more difficult
to generate a single oligonucleotide that encodes all of the desired changes. Instead, one of
two alternative methods may be employed.
PCR mutagenesis is also suitable for making amino acid variants of HRG-or. While
the following discussion refers to DNA, it is understood that the technique also finds
application with RNA. The PCR technique generally refers to the following procedure (see
Erlich, supra, the chapter by R. Higuchi, p. 61-70). When small amounts of template DNA are
used as starting material in a PCR, primers that differ slightly in sequence from the
corresponding region in a template DNA can be used to generate relatively large quantities of
a specific DNA fragment that differs from the template sequence only at the positions where
the primers differ from the template. For introduction of a mutation into a plasmid DNA, one
of the primers is designed to overlap the position of the mutation and to contain the mutation;
the sequence of the other primer must be identical to a stretch of sequence of the opposite
strand of the plasmid, but this sequence can be located anywhere along the plasmid DNA. It is
preferred, however, that the sequence of the second primer is located within 200 nucleotides
from that of the first, such that in the end the entire amplified region of DNA bounded by the
primers can be easily sequenced. PCR amplification using a primer pair like the one just
described results in a population of DNA fragments that differ at the position of the mutation
specified by the primer, and possibly at other positions, as template copying is somewhat
error-prone.
If the ratio of template to product material is extremely low, the vast majority of
product DNA fragments incorporate the desired mutation(s). This product material is used to
replace the corresponding region in the plasmid that served as PCR template using standard
DNA technology. Mutations at separate positions can be introduced simultaneously by either
using a mutant second primer, or perfonning a second PCR with different mutant primers and
ligating the two resulting PCR fragments simultaneously to the vector fragment in a three (or
more)-part ligation.
Another method for preparing variants, cassette mutagenesis, is based on the
technique described by Wells et al. (Gene, 34: 1315,1985). The starting material is the plasmid
(or other vector) comprising HRG DNA to be mutated. The codonfs) in HRG DNA to be
mutated are identified. There must be a unique restriction endonuclease site on each side of
the identified mutation sitels). if no such restriction sites exist, they may be generated using
the above-described oligonucleotide-mediated mutagenesis method to introduce them at
appropriate locations in HRG DNA. After the restriction sites have been introduced into the
plasmid, the plasmid is cut at these sites to linearize it. A double-stranded oligonucleotide
encoding the sequence of the DNA between the restriction sites but containing the desired
mutation(s) is synthesized usingstandard procedures. Thetwo strands are synthesized
separately and then hybridized together using standard techniques. This double-stranded
oligonucleotide is referred to as the cassette. This cassette is designed to have 3' and 5‘ ends
that are compatible with the ends of the linearized plasmid, such that it can be directly ligated
to the plasmid. This plasmid now contains the mutated HRG DNA sequence.
C. lnsenign gt‘ DNA into a Qloning or Expression miic_le
The cDNA or genomic DNA encoding native or variant HRG is insened into a
replicable vector for further cloning (amplification of the DNA) or for expression. Many
vectors are available, and selection of the appropriate vector will depend on 1) whether it is to
be used for DNA amplification or for DNA expression, 2) the size of the DNA to be inserted
into the vector, and 3) the host cell to be transformed with the vector. Each vector contains
various components depending on its function (amplification of DNA or expression of DNA)
and the host cell for which it is compatible. The vector components generally include, but are
not limited to, one or more of the following: a signal sequence, an origin of replication, one or
more marker genes, an enhancer element, a promoter, and a transcription termination
sequence. _
in general, the signal sequence may be a component of the vector, or it may be a pan
of HRG DNA that is inserted into the vector. The native HRG DNA is believed to encode a
signal sequence at the amino terminus (5’ end of the DNA encoding HRG) of the polypeptide
that is cleaved during post-translational processing of the polypeptide to form the mature
HRG polypeptide ligand that binds to p185HER2 receptor. although a conventional signal
structure is not apparent. Native proHRG is. secreted from the cell but may remain lodged in
the membrane because it contains a transmembrane domain and a cytoplasmic region in the
carboxyl tenninal region of the polypeptide. Thus, in a secreted, soluble version of HRG the
carboxyl temtinal domain of the molecule, including the transmembrane domain, is ordinarily
deleted. This truncated variant HRG polypeptide may be secreted from the cell. provided that
the DNA encoding the truncated variant encodes a signal sequence recognized by the host.
HRG of this invention may be expressed not only directly, but also as a fusion with a
heterologous polypeptide, preferably a signal sequence or other polypeptide having a specific
cleavage site at the N-and/or C-terrninis of the mature protein or polypeptide. In general, the
signal sequence may be a component of the vector, or it may be a part of HRG DNA that is
inserted into the vector. included within the scope of this invention are HRG with the native
signal sequence deleted and replaced with a heterologous signal sequence. The heterologous
signal sequence selected should be one that is recognized and processed, i.e., cleaved by a
signal peptidase, by the host cell. For prokaryotic host cells that do not recognize and
process the native HRG signal sequence, the signal sequence is substituted by a prokaryotic
signal sequence selected, for example, from the group of the alkaline phosphatase,
penicillinase, lpp, or heat-stable enterotoxin ll leaders. For yeast secretion the native HFlG
signal sequence may be substituted by the yeast invertase, alpha factor, or acid phosphatase
leaders. In mammalian cell expression the native signal sequence is satisfactory, although
other mammalian signal sequences may be suitable.
(ill m
Both expression and cloning vectors generally contain a nucleic acid sequence that
enables the vectorto replicate in one or more selected host cells. Generally, in cloning vectors
this sequence is one that enables the vector to replicate independently of the host
chromosomal DNA, and includes origins of replication or autonomously replicating sequences.
Such sequences are well known fora variety of bacteria, yeast, and viruses. The origin of
replication from the plasmid pBR322 is suitable for most Gram-negative bacteria, the 2p
plasmid origin is suitable for yeast, and various viral origins (SV40, polyoma, adenovirus,
VSV or BPV) are useful for cloning vectors in mammalian cells. Generally, the origin of
replication component is not needed for mammalian expression vectors (the SV4O origin may
typically be used only because it contains the early promoter).
Most expression vectors are "shuttle" vectors, i.e., they are capable of replication in
at least one class of organisms but can be transfected into another organism for expression.
For example, a vector is cloned in E. coli and then the same vector is transfected into yeast
or mammalian cells for expression even though it is not capable of replicating independently of
the host cell chromosome.
DNA may also be amplified by insertion into the host genome. This is readily
accomplished using Bacillus species as hosts, for example, by including in the vector a DNA
sequence that is complementary to a sequence found in Bacillus genomic DNA. Transtection
of Bacillus with this vector results in homologous recombination with the genome and insertion
of HRG DNA. However, the recovery of genomic DNA encoding HRG is more complex than
that of an exogenously replicated vector because restriction enzyme digestion is required to
excise HRG DNA. DNA can be amplified by PCR and directly transfected into the host cells
without any replication component.
(iii) Selgion Gem Qmmgent
Expression and cloning vectors should contain a selection gene, also termed a
selectable marker. This gene encodes a protein necessary for the survival or growth of
transformed host cells grown in a selective culture medium. Host cells not transformed with
the vector containing the selection gene will not survive in the culture medium. Typical
selection genes encode proteins that (a) confer resistance to antibiotics or other toxins, e.g.,
ampicillin, neomycin, methotrexate, or tetracycline, (b) complement auxotrophic deficiencies,
or (c) supply critical nutrients not available from complex media, e.g., the gene encoding D-
alanine racemase for Bacilii.
One example of a selection scheme utilizes a drug to arrest growth of a host cell.
Those cells that are successfully transformed with a heterologous gene express a protein
conferring drug resistance and thus survive the selection regimen. Examples of such dominant
selection use the drugs neomycin (Southern et al., J. Molec. App/. Genet. 1: 327,1982),
mycophenolic acid (Mulligan etal, Science 209: 1422,1980) or hygromycin (Sugden et al., Mol.
Cell. Biol. 5: 410-413.1985). The three examples given above employ bacterial genes under
eukaryotic control to convey resistance to the appropriate drug G418 or neomycin (geneticin),
xgpt (mycophenolic acid), or hygromycin, respectively.
Another example of suitable selectable markers for mammalian cells are those that
enable the identification of cells competent to take up HRG nucleic acid, such as dihydrofolate
reductase (DHFR) or thymidine kinase. The mammalian cell transformants are placed under
selection pressure which only the transformants are uniquely adapted to survive by virtue of
having taken up the marker. Selection pressure is imposed by cutturing the transformants
under conditions in which the concentration of selection agent in the medium is successively
changed, thereby leading to amplification of both the selection gene and the DNA that encodes
HRG. Amplification is the process by which genes in greater demand for the production of a
protein critical for growth are reiterated in tandem within the chromosomes of successive
generations of recombinant cells. increased quantities of HFlG'are synthesized from the
amplified DNA.
For example, cells transformed with the DHFR selection gene are first identified by
culturing all of the transformants in a culture medium that contains methotrexate (Mix), a
competitive antagonist of DHFR. An appropriate host cell when wild-type DHFR is employed
is the Chinese hamster ovary (CHO) cell line deficient in DHFR activity, prepared and
propagated as described by Urlaub and Chasin, Proc. Natl. Acad. Sci. USA, 77: 4216, 1980.
The transformed cells are then exposed to increased levels of methotrexate. This leads to
the synthesis of multiple copies of the DHFR gene, and, concomitantly, multiple copies of other
DNA comprising the expression vectors, such as the DNA encoding HRG. This amplification
technique can be used with any otherwise suitable host, e.g., ATCC No. CCL61 CHO-K1,
notwithstanding the presence of endogenous DHFR if, for example, a mutant DHFR gene that
is highly resistant to Mtx is employed (EP 117,060). Altematively, host cells (particularly
wild-type hosts that contain endogenous DHFR) transformed or co-transformed with DNA
sequences encoding HRG, wild-type DHFR protein, and another selectable marker such as
aminoglycoside 3‘ phosphotransferase (APH) can be selected by cell growth in medium
containing a selection agent for the selectable marker such as an aminoglycosidic antibiotic,
e.g., kanamycin, neomycin, or G418 (see U.S. Pat. No. 4,965,199).
A suitable selection gene for use in yeast is the trp1 gene present in the yeast plasmid
YRp7 (Stinchcomb etal., Nature, 282: 39, 1979; Kingsman et aI., Gene, 7: 141, 1979; or
Tschemper etaI., Gene, 10: 157, 1980). The trp1 gene provides a selection marker for a
mutant strain of yeast lacking the ability to grow in tryptophan, for example, ATCC No.
44076 or PEP4-1 (Jones, Genetics, 85: 12, 1977). The presence of the trp1 lesion in the yeast
host cell genome then provides an effective environment for detecting transformation by
growth in the absence of tryptophan. Similarly, Leu2-deficient yeast strains (ATCC 20,622
or38,626) are complemented by known plasmids bearing the Leu2 gene.
(iv) 1 0
Expression and cloning vectors usually contain a promoter that is recognized by the
host organism and is operably linked to HRG nucleic acid. Promoters are untranslated
sequences located upstream (5') to the start codon of a structural gene (generally within
about 100 to 1000 bp) that control the transcription and translation of a particular nucleic acid
sequence, such as HRG to which they are operably linked. Such promoters typically fall into
two classes, inducible and constitutive. Inducible promoters are promoters that initiate
increased levels of transcription from DNA under their control in response to some change in
culture conditions, e.g., the presence or absence of a nutrient or a change in temperature. At
this time a large number of promoters recognized by a variety of potential host cells are well
known. These promoters are operably linked to DNA encoding HRG by removing the promoter
from the source DNA by restriction enzyme digestion and inserting the isolated promoter
sequence into the vector. Both the native HRG promoter sequence and many heterologous
promoters may be used to direct amplification and/or expression of HRG DNA. However,
heterologous promoters are preterred, as they generally permit greater transcription and
higher yields of expressed HRG as compared to the native HRG promoter.
Promoters suitable for use with prokaryotic hosts include the B-lactamase and
lactose promoter systems (Chang et al., Nature, 275: 615, 1978; and Goeddel et al., Nature
281: 544, 1979), alkaline phosphatase, a tryptophan (trp) promoter system (Goeddel, Nucleic
Acids Res, 8: 4057, 1980 and EP 36,776) and hybrid promoters such as the tac promoter
(deBoer et al., Proc. Natl. Acad. Sci. USA 80: 21-25, 1983). However, other known bacterial
promoters are suitable. Their nucleotide sequences have been published, thereby enabling a
skilled worker operably to ligate them to DNA encoding HRG (Siebenlist eta/., Ce!/20: 269,
1980) using linkers or adaptors to supply any required restriction sites. Promoters for use in
bacterial systems also generally will contain a Shine-Dalgarno (S.D.) sequence operably linked
to the DNA encoding HRG.
Suitable promoting sequences tor use with yeast hosts include the promoters for 3-
phosphoglycerate kinase (Hitzeman et al., J. Biol. Chem., 255: 2073, 1980) or other glycolytic
enzymes (Hess et al., J. Adv. Enzyme Flag 7: 149, 1968; and Holland, Biochemistg/17: 4900,
1978), such as enolase, glyceraldehydephosphate dehydrogenase, hexokinase, pyruvate
decarboxylase, phosphofructokinase, glucosephosphate isomerase, 3-phosphoglycerate
mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose isomerase, and
gluookinase.
Other yeast promoters, which are inducible promoters having the additional
advantage ol transcription controlled by growth conditions, are the promoter regions for
alcohol dehydrogenase 2, isocytochrome C, acid phosphatase, degradatlve enzymes
associated with nitrogen metabolism, metallothionein, glyceraldehydephosphate
dehydrogenase, and enzymes responsible tor maltose and galactose utilization. Suitable
vectors and promoters for use in yeast expression are further described in Hitzeman et al.,
EP 73,657A. Yeast enhancers also are advantageously used with yeast promoters.
Promoter sequences are known for eukaryotes. Virtually all eukaryotic genes have
an AT-rich region located approximately 25 to 30 bases upstream from the site where
transcription is initiated. Another sequence lound 70 to 80 bases upstream from the start of
transcription 01 many genes is a CXCAAT (SEQ ID NO:1) region where X may be any
nucleotide. At the 3‘ end ot most eukaryotic genes is an AATAAA sequence (SEQ ID NO:2)
that may be the signal for addition 01 the poly A tail to the 3‘ end ol the coding sequence. All
of these sequences are suitably inserted into mammalian expression vectors.
HRG gene transcription from vectors in mammalian host cells is controlled by
promoters obtained from the genomes of viruses such as polyoma virus, lowlpox virus (UK
2,211,504, published 5 July 1989), adenovirus (such as Adenovirus 2), bovine papilloma virus,
avian sarcoma virus. cytomegalovirus. a retrovirus. hepatitis-B virus and most preterably
Simian Virus 40 (SV40), from heterologous mammalian promoters, e.g., the actin promoter or
an immunoglobulin promoter. from heat-shock promoters, and from the promoter normally
associated with HRG sequence, provided such promoters are compatible with the host cell
systems.
The early and late promoters of the SV40 virus are conveniently obtained as an
SV40 restriction lragment that also contains the SV40 viral origin ol replication (Fiers et al.,
Nature. 273:113 (1978); Mulligan and Berg, Science, 209: 1422-1427 (1980); Pavlakis etal,
Proc. Natl. Acad. Sci. USA, 78: 7398-7402 (1981)). The immediate early promoter of the
human cytomegalovirus is conveniently obtained as a Llmdlll E restriction lragment
(Greenaway et al., Gene. 18: 355-360 (1982)). A system for expressing DNA in mammalian
hosts using the bovine papilloma virus as a vector is disclosed in U.S. Pat. No. 4,419,446. A
modification of this system is described in U.S. Pat. No. 4,601,978. See also Gray et al.,
Nature, 295: 503-508 (1982) on expressing cDNA encoding immune interferon in monkey cells;
Reyes at al., Nature, 297: 598-601 (1982) on expression of human [3-interferon cDNA in mouse
cells underthe control of a thymidine kinase promoter from herpes simplex virus; Canaani and
Berg, Proc. Natl. Acad. Sci. USA, 79: 5166-5170 (1982) on expression of the human interferon
[31 gene in cultured mouse and rabbit cells; and Gonnan et al., Proc. Natl. Acad. Sci. USA, 79:
6777-6781 (1982) on expression of bacterial CAT sequences in CV-1 monkey kidney cells,
chicken embryo fibroblasts, Chinese hamster ovary cells, HeLa cells, and mouse NIH-3T3
cells using the Flous sarcoma virus long terminal repeat as a promoter.
(v) Enhancer Element gommnent
Transcription of a DNA encoding HFiG of this invention by higher eui: fiyotes is often
increased by inserting an enhancer sequence into the vector. Enhancers are cis-acting
elements of DNA, usually about from 10-300 bp, that act on a promoter to increase its
transcription. Enhancers are relatively orientation and position independent having been found
‘ (Laimins etal., Proc. Natl. Acad. Sci. USA, 78: 993, 1981) and 3' (Lusky et al., Mol. Cell
Bio., 3: 1108, 1983) to the transcription unit, within an intron (Banerji et al., Cell, 33: 729,
1983) as well as within the coding sequence itself (Osborne et aI., Mol. Cell Bio., 4: 1293,
1984). Many enhancer sequences are now known from mammalian genes (globin, elasiase.
albumin, or-fetoprotein and insulin). Typically, however, one will use an enhancer from a
eukaryotic cell virus. Examples include the SV40 enhancer on the late side of the replication
origin (bp 100-270), the cytomegalovirus early promoter enhancer, the pofyoma enhancer on
the late side of the replication origin, and adenovirus enhancers (see also Yaniv, Nature, 297:
17-18 (1982)) on enhancing elements for activation of eukaryotic promoters. The enhancer
may be spliced into the vector at a position 5' or 3' to HRG DNA, but is preferably located at
a site 5' from the promoter.
(vi) Transcription Termination Qompgnent
Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal,
human, or nucleated cells from other multicellular organisms) will also contain sequences
necessary for the tennination ot transcription and for stabilizing the mRNA. Such sequences
are commonly available from the 5' and, occasionally 3' untranslated regions of eukaryotic or
viral DNAs or cDNAs. These regions contain nucleotide segments transcribed as
polyadenylated fragments in the untranslated portion of the mFiNA encoding HRG. The 3'
untranslated regions also include transcription termination sites.
Construction of suitable vectors containing one or more of the above listed
components the desired coding and control sequences employs standard ligation techniques.
isolated plasmids or DNA tragments are cleaved, tailored, and religated in the form desired to
generate the plasmids required.
.
For analysis to confirm correct sequences in plasmids constructed, the ligation
mixtures are used to transform E. coli K12 strain 294 (ATCC 31,446) and successful
transformants selected by ampicillin or tetracycline resistance where appropriate. Plasmids
from the transformants are prepared, analyzed by restriction endonuclease digestion, and/or
sequenced by the method of Messing et al., Nucleic Acids Res. 9: 309 (1981) or by the method
of Maxam et aI., Methods in Enzymology 65: 499 (1980).
Particularly useful in the practice of this invention are expression vectors that
provide for the transient expression in mammalian cells of DNA encoding HRG. In general,
transient expression involves the use of an expression vector that is able to replicate
efficiently in a host cell, such that the host cell accumulates many copies of the expression
vector and, in turn, synthesizes high levels of a desired polypeptide encoded by the expression
vector. Transient expression systems, comprising a suitable expression vector and a host
cell, allow for the convenient positive identification of polypeptides encoded by cloned DNAs,
as well as for the rapid screening of such polypeptides for desired biological or physiological
properties. Thus, transient expression systems are particularly useful in the invention for
purposes of identifying analogs and variants of HRG that have HRG-like activity. Such a
transient expression system is described in EP 309,237 published 29 March 1989. Other
methods, vectors, and host cells suitable for adaptation to the synthesis of HRG in
recombinant vertebrate cell culture are described in Gething er al., Nature 293: 620-625, 1981;
Mantei etal., Nature, 281: 40-46, 1979; Levinson etal., EP 117,060 and EP 117,058. A
particularly useful expression plasmid for mammalian cell culture expression of HRG is pRK5
(EP pub. no. 307,247).
D. selegtion and Transformation of Host Cells
Suitable host cells for cloning or expressing the vectors herein are the prokaryote,
yeast, or higher eukaryote cells described above. Suitable prokaryotes include eubacteria,
such as Gram-negative or Gram-positive organisms, for example, E. coil, Bacilli such as B.
subtilis, Pseudomonas species such as P. aeruginosa, Salmonella typhimurium, or Serratia
marcescans. One preferred E. coli cloning host is E. call 294 (ATCC 31,446), although other
strains such as E. coliB, E. coli X1776 (ATCC 31,537), and E. co/iW3110 (ATCC 27,325) are
suitable. These examples are illustrative rather than limiting. Preferably the host cell should
secrete minimal amounts of proteolytic enzymes. Alternatively, in vitro methods of cloning,
e.g., PCFt or other nucleic acid polymerase reactions, are suitable.
In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are
suitable hosts for HRG-encoding vectors. Saccharomyces cerevisiae, or common baker's
yeast, is the most commonly used among lower eukaryotic host microorganisms. However, a
number of other genera, species, and strains are commonly available and useful herein, such
as Schizosaccharomyces pombe (Beach and Nurse, Nature, 290: 140 (1981); EP 139,383,
published May 2, 1985), Kluyveromyces hosts (U.S.S.N. 4,943,529) such as, e.g., K. lactis
(Louvencourt etal., J. Bacteriol, 737 (1983); K. fragilis, K. bulgaricus, K. thermotolerans, and
K. marxianus, yarrowia (EP 402,226); Pichla pastoris (EP 183,070), Sreekrishna et al., J.
Basic Microbiol, 28: 265-278 (1988); Candida, Trichoderrna reesia (E P 244,234); Neurospora
crassa (Case etaI., Proc. Natl. Acad. Sci. USA, 76: 5259-5263 (1979), and filamentous fungi
such as, e.g, Neurospora, Penici/Iium, To/ypoc/adium (WO 91/00357, published 10 January
1991), and Aspergillus hosts such as A. nidulans (Ballance et aI., Biochem. Biophys. Res.
Commun., 112: 284-289 (1983); Tilbum etal., Gene, 26: 205-221 (1983); Yelton etaI., Proc.
Natl. Acad. Sci. USA, 81: 1470-1474 (1984) and A. niger (Kelly and Hynes, EMBO J., 4: 475-
479 (1985)).
Suitable host cells for the expression of glycosylated HFlG polypeptide are derived
from multicellular organisms. Such host cells are capable of complex processing and
glycosylation activities. In principle, any higher eukaryotic cell culture is workable, whether
lrorn vertebrate or invertebrate culture. Examples 01 invertebrate cells include plant and
insect cells. Numerous baculoviral strains and variants and corresponding pennissive insect
host cells from hosts such as Spodoptera frugiperda (caterpillar), Aedes aegypti (mosquito),
Aedes albopictus (mosquito), Drosophila melanogaster (fruitfly), and Bombyx mori host cells
have been identified (see, e.g., Luckow at al., Bio/Technology, 6: 47-55 (1988); Miller et al., in
Genetic Engineenng, Setlow, ..l.K. etal., eds., Vol.8 (Plenum Publishing, 1986), pp. 277-279; and
Maeda et al., Nature, 315: 592-594 (1985)). A variety of such viral strains are publicly
available, e.g., the L-1 variant, of Autographa californica NPV and the Bm-5 strain of Bombyx
mori NPV, and such viruses may be used as the virus herein according to the present
invention, particularly for transfection of Spodoptera frugrperda cells. Plant cell cultures of
cotton, corn, potato, soybean, petunia, tomato, and tobacco can be utilized as hosts.
Typically, plant cells are transfected by incubation with certain strains of the bacterium
Agrabacterium tumefaciens, which has been previously manipulated to contain HRG DNA.
During incubation of the plant cell culture with A. tumefaciens, the DNA encoding HRG is
transferred to the plant cell host such that it is transtected, and will, under appropriate
conditions, express HRG DNA. In addition, regulatory and signal sequences compatible with’
plant cells are available, such as the nopaline synthase promoter and polyadenylation signal
sequences (Depicker etal., J. Mol. Appl. Gen., 1:561 (1982)). In addition, DNA segments
isolated from the upstream region of the T-DNA 780 gene are capable of activating or
increasing transcription levels of plant-expressible genes in recombinant DNA-containing plant
tissue (see EP 321,196, published 21 June 1989).
However, interest has been greatest in vertebrate cells, and propagation of
vertebrate cells in culture (tissue culture) has become a routine procedure in recent years
(Tissue Culture, Academic Press, Kruse and Patterson, editors (1973)). Examples of useful
mammalian host cell lines are monkey kidney CV1 line transfonned by SV40 (COS-7, ATCC
CRL 1651); human embryonic kidney line (293 or 293 cells subcloned for growth in suspension
culture, Graham etal., J. Gen Virol, 36: 59, 1977); baby hamster kidney cells (BHK, ATCC
CCL10); Chinese hamster ovary cells/-DHFR (CHO, Urlaub and Chasin, Proc. Natl. Acad.
Sci. USA, 77:4216 [1980]); mouse sertoli cells (TM4, Mather, Biol. Reprod., 232243-251 (19801);
monkey kidney cells (CV1 ATCC CCL 70); African green monkey kidney cells (VERO-76,
ATCC CRL-1587); human cervical carcinoma cells (HELA, ATCC CCL 2); canine kidney
cells (MDCK, ATCC CCL 34); buffalo rat liver cells (BRL 3A, ATCC CRL 1442); human
lung cells (W138, ATCC CCL 75); human liver cells (Hep G2, HB 8065); mouse mammary
tumor (MMT 060562, ATCC CCL51); TRI cells (Mather et aI., Annals N. Y. Acad. Sci.,
383:44-68 [1982]); MRC 5 cells; FS4 cells; and a human hepatoma cell line (Hep G2).
Preferred host cells are human embryonic kidney 293 and Chinese hamster ovary cells.
Host cells are translected and preferably transformed with the above-described
expression or cloning vectors of this invention and cultured in conventional nutrient media
modified as appropriate for inducing promoters, selecting transtormants, or amplifying the
genes encoding the desired sequences.
Transfection refers to the taking up of an expression vector by a host cell whether or
not any coding sequences are in tact expressed. Numerous methods of transfection are
known to the ordinarily skilled artisan, for example, CaPO4 and electroporation. Successful
transfection is generally recognized when any indication of the operation of this vector occurs
within the host cell.
E. Culturlng the Host Qells
Prokaryotic cells used to produce HRG polypeptide of this invention are cultured in
suitable media as described generally in Sambrook et a/., supra.
The mammalian host cells usedto produce HRG of this invention may be cultured in a
variety of media. Commercially available media such as Ham's F10 (Sigma), Minimal
Essential Medium ([MEM), Sigma), RPMI-1640 (Sigma), and Dulbecco's Modified Eagle's
Medium ([DMEM], Sigma) are suitable for culturing the host cells. in addition, any of the media
described in Ham and Wallace, Mefh. Enz., 58: 44 (1979), Barnes and Sato, Anal. Biochem.,
102:255 (1980), U.S. Pat. Nos. 4,767,704; 4,657,866; 4,927,762; or 4,560,655; WO 90/03430;
WO 87/00195 and U.S. Pat. Re. 30,985, may be used as culture media for the host cells. Any
of these media may be supplemented as necessary with hormones and/or other growth
factors (such as insulin, transferrin, or epidennal growth factor), salts (such as sodium
chloride, calcium, magnesium, and phosphate), buffers (such as HEPES), nucleosides (such as
adenosine and thymidine), antibiotics (such as GentamycinTM drug), trace elements (defined
as inorganic compounds usually present at final concentrations in the micromolar range), and
glucose or an equivalent energy source. Any other necessary supplements may also be
included at appropriate concentrations that would be known to those skilled in the art. The
culture conditions, such as temperature, pH, and the like, are those previously used with the
host cell selected for expression, and will be apparent to the ordinarily skilled artisan.
The host cells referred to in this disclosure encompass cells in in vitro culture as well
as cells that are within a host animal.
It is further envisioned that HRG of this invention may be produced by homologous
recombination, or with recombinant production methods utilizing control elements introduced
into cells already containing DNA encoding HRG currently in use in the field. For example, a
powerful promoter/enhancer element, a suppressor, or an exogenous transcription modulatory
element is inserted in the genome of the intended host cell in proximity and orientation
sufficient to influence the transcription of DNA encoding the desired HRG. The control
element does not encode HRG of this invention, but the DNA is present in the host cell genome.
One next screens for cells making HRG of this invention, or increased or decreased levels of
expression, as desired.
F. mjggtigg gene Amgltfigatignfixpgssion
Gene amplification and/or expression may be measured in a sample directly, for
example, by conventional Southern blotting, Northern blotting to quanfitate the transcription
of mRNA (Thomas, Proc. Natl. Acad. Sci. USA, 775201-5205 [1980]), dot blotting (DNA
analysis), or in situ hybridization, using an appropriately labeled probe based on the sequences
provided herein. Various labels may be employed, most commonly radioisotopes, particularly
32P. However, other techniques may also be employed, such as using biotin-modified
nucleotides for introduction into a polynucleotide. The biotin then serves as the site for binding
to avidin or antibodies which may be labeled with a wide variety of labels, such as
radionuclides, fluorescers, enzymes, or the like. Alternatively, antibodies may be employed
that can recognize specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA
hybrid duplexes or DNA-protein duplexes. The antibodies in turn may be labeled and the
assay may be carried out where the duplex is bound to a surface, so that upon the fonnation
of duplex on the surface, the presence of antibody bound to the duplex can be detected.
Gene expression, alternatively, may be measured by immunological methods, such as
immunohistochemical staining of tissue sections and assay of cell culture or body fluids, to
A quantitate directly the expression of gene product. With immunohistochemical staining
techniques, a cell sample is prepared, typically by dehydration and fixation, followed by
reaction with labeled antibodies specific for the gene product coupled where the labels are
usually visually detectable such as enzymatic labels, fluorescent labels, luminescent labels,
and the like. A particularly sensitive staining technique suitable for use in the present
invention is described by Hsu etaI., Am. J. Clin. Pafh., 75: 734-738 (1980).
Antibodies useful for immunohistochemical staining and/or assay of sample fluids may
be either monoclonal or polyclonal, and may be prepared in any mammal. Conveniently, the
antibodies may be prepared against a native HRG polypeptide or against a synthetic peptide
based on the DNA sequences provided herein as described further in Section 4 below.
G. Purification of The Heregulin Pomgptide
HRG is recovered from a cellular membrane fraction. Alternatively, a proteolyticalLy
cleaved or a truncated expressed soluble HRG fragment or subdomain are recovered from the
culture medium as soluble polypeptides.
When HHG is expressed in a recombinant cell other than one of human origin, HRG is
completely free of proteins or polypeptides of human origin. However, it is desirable to purify
HRG from recombinant cell proteins or polypeptides to obtain preparations that are
substantially homogeneous as to HRG. As a first step, the culture medium or lysate is
centrifuged to remove particulate cell debris. The membrane and soluble protein fractions are
then separated. HRG is then purified from both the soluble protein fraction (requiring the
presence of a protease) and from the membrane fraction of the culture lysate, depending on
whether HRG is membrane bound. The following procedures are exemplary of suitable
purification procedures: fractionation on immunoaffinity or ion-exchange columns; ethanol
precipitation; reversed phase HPLC; chromatography on silica, heparin sepharose or on a
cation exchange resin such as DEAE; chromatofocusing; SDS-PAGE; ammonium sulfate
precipitation; and gel filtration using, for example, Sephadex G-75.
HRG variants in which residues have been deleted. inserted or substituted are
recovered in the same fashion as the native HRG, taking account of any substantial changes
in properties occasioned by the variation. For example, preparation of a HRG fusion with
another protein or polypeptide, e.g., a bacterial or viral antigen, facilitates purification; an
immunoaffinity column containing antibody to the antigen can be used to adsorb the fusion.
immunoaffinity columns such as a rabbit polyclonal anti-HRG column can be employed to
absorb HRG variant by binding it to at least one remaining immune epitope. A protease
inhibitor such as phenylmethylsulfonylfluoride (PMSF) also may be useful to inhibit proteolytic
degradation during purification, and antibiotics may be included to prevent the growth of
adventitious contaminants. One skilled in the art will appreciate that purification methods
suitable for native HRG may require modification to account for changes in the character of
HRG variants upon expression in recombinant cell culture.
H Cgvalent Mgglifigatigns of HRQ
Covalent modifications of HRG polypeptides are included within the scope of this
invention. Both native HRG and amino acid sequence variants of HRG optionally are
covalently modified. One type of covalent modification included within the scope of this
invention is a HHG polypeptide fragment. HRG fragments, such as HRG-GDF, having up to
about 40 amino acid residues are conveniently prepared by chemical synthesis, or by
enzymatic or chemical cleavage of the lull-length HRG polypeptide or HRG variant
polypeptide. Other types of covalent modifications of HRG or fragments thereof are
introduced into the molecule by reacting targeted amino acid residues of HRG or fragments
thereof with an organic derivatizing agent that is capable of reacting with selected side chains
or the N- or C-terrninal residues.
Cysteinyl residues most commonly are reacted with on-haloacetates (and
corresponding amines), such as chloroacetic acid or chloroacetamide, to give carboxymethyl
or carboxyamidomethyl derivatives. Cysteinyl residues also are derivatized by reaction with
bromotrilluoroacetone, ct-bromo-B-(5-imidozoyl)propicnic acid, chloroacetyl phosphate, N-
alkylmaleimides, 3-nitropyridyl disulfide, methyl 2-pyridyl disulfide. p-chloromercuribenzoate,
2-chloromercurinitrophenol, or chloro-7~nitrobenzo-2oxa-1,3—dia2ole.
Histidyl residues are derivatized by reaction with diethylpyrocarbonate at pH 5.5-7.0
because this agent is relatively specific for the histidyl side chain. Para-bromophenacyl
bromide also is useful; the reaction is preferably perfonned in 0.1M sodium cacodylate at pH
6.0.
Lysinyl and amino terminal residues are reacted with succinic or other carboxylic acid
anhydrides. Derivatization with these agents has the effect of reversing the charge of the
lysinyl residues. Other suitable reagents for derivatizing or-amino-containing residues include
imidoesters such as methyl picolinimidate; pyridoxal phosphate; pyridoxal; chloroborohydride;
trinitrobenzenesulfonic acid; O-mefhylisourea; 2,4-pentanedione; and transaminase-catalyzed
reaction with glyoxylate.
Arginyl residues are modified by reaction with one or several conventional reagents,
among them phenylglyoxal, 2,3-butanedione, 1,2-cyclohexanedione, and ninhydrin.
Derivatization of arginine residues requires that the reaction be performed in alkaline
conditions because of the high pKa of the guanidine functional group. Furthermore, these
reagents may react with the groups of lysine as well as the arginine epsilon-amino group.
The specific modification of tyrosyl residues may be made, with particular interest in
introducing spectral labels into tyrosyl residues by reaction with aromatic diazonium
compounds or tetranitromethane. Most commonly. N-acetylimidizole and tetranitromethane
are used to form O-acetyl tyrosyl species and 3-nitro derivatives, respectively. Tyrosyl
residues are lodlnated using 125i or 131i to prepare labeled proteins for use in
radioimmunoassay, the chloramine T method described above being suitable.
Carboxyl side groups (aspartyl or glutamyl) are selectively modified by reaction with
carbodiimides (R'-N=C=N-R‘), where R and R‘ are differentalkyl groups, such as 1-cyclohexyl-
3-(2-morpholinylethyl) carbodiimide or t-ethyl(4-azonia-4,4-dimethylpentyl) carbodiimide.
Furthemiore, aspartyl and glutamyl residues are converted to asparaginyl and glutaminyl
residues by reaction with ammonium ions.
Derivatization with biiunctional agents is useful for crosslinking HRG to a water-
insoluble support matrix or surface for use in the method for purifying anti-HRG antibodies,
and vice versa. Commonly used crosslinking agents include, e.g., 1,1-bis(diazoacetyI)
phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, for example, esters with 4-
azidosalicylic acid, homobifunctional imidoesters, including disuccinimidyl esters such as 3,3‘-
dithiobis(succinimidylpropionate), and bitunctional maleimides such as bis-N-maIeimido-1,8-
octane. Derivatizing agents such as methyl[(p-azidophenyl)dithio]propioimidate yield
photoactivatable intermediates that are capable of forming crosslinks in the presence of light.
Alternatively, reactive water-insoluble matrices such as cyanogen bromide-activated
carbohydrates and the reactive substrates described in U.S. Pat. Nos. 3,969,287; 3,691,016;
4,195,128; 4,247,642; 4,229,537; and 4,330,440 are employed for protein immobilization.
Glutaminyl and asparaginyl residues are frequently deamidated to the corresponding
glutamyl and aspartyl residues, respectively. Alternatively, these residues are deamidated
under mildly acidic conditions. Either form of these residues falls within the scope of this
invention.
Other modifications include hydroxylation of proline and lysine, phosphorylation of
hydroxyl groups of seryl or threonyl residues, methylation of the or-amino groups of lysine,
arginine, and histidine side chains (T.E. Creighton, Pr i 2 ur and Molec r
Properties, W.H. Freeman & Co., San Francisco, pp. 79-86 [1983]), acetylation of the N-
terminal amine, and amidation of any C-terrninal carboxyl group.
HRG optionally is fused with a polypeptide heterologous to HRG. The heterologous
polypeptide optionally is an anchor sequence such as that found in the decay accelerating
system (DAF); a toxin such as ricin, pseudomonas exotoxin, gelonin, or other polypeptide that
will result in target cell death. These heterologous polypeptides are covalently coupled to HRG
through side chains or through the terminal residues. Similarly, HRG is conjugated to other
molecules toxic or inhibitory to a target mammalian cell, e.g. such as tricothecenes, or
antisense DNA that blocks expression of target genes.
HRG also is covalently modified by altering its native glycosylation pattern. One or
more carbohydrate substitutents are modified by adding, removing or varying the
monosaccharide components at a given site, or by modifying residues in HRG such that
glycosylation sites are added or deleted.
Glycosylation of polypeptides is typically either N-linked or O-linked. N-linked refers
to the attachment of the carbohydrate moiety to the side chain of an asparagine residue.
The tri-peptide sequences asparagine-X-serine and asparagine-X-threonine, where X is any
amino acid except proline, are the recognition sequences for enzymatic attachment of the
carbohydrate moiety to the asparagine side chain. Thus, the presence of either of these tri-
peptide sequences in a polypeptide creates a potential glycosylation site. O-linked
glycosylation refers to the attachment of one of the sugars N-acetyigaiactosamine,
galactose, or xylose, to a hydroxyamino acid, most commonly serine or threonine, although 5-
hydroxyproline or 5-hydroxylyslne may also be used.
Glycosylation sites are added to HRG by altering its amino acid sequence to contain
one or more of the above-described tri-peptide sequences (for N-linked glycosylation sites).
The alteration may also be made by the addition of, or substitution by, one or more serine or
threonine residues to HRG (for O-linked glycosylation sites). For ease, HRG is preferably
altered through changes at the DNA level, particularly by mutating the DNA encoding HRG at
preselected bases such that codons are generated that will translate into the desired amino
acids.
Chemical or enzymatic coupling of glycosides to HRG increases the number of
carbohydrate substituents. These procedures are advantageous in that they do not require
production of the polypeptide in a host cell that is capable of N- and 0- linked glycosylation.
Depending on the coupling mode used. the sugar(s) may be attached to (a) arginine and
histidine, (b) free carboxyl groups, (c) free sulfhydryl groups such as those of cysteine, (d)
free hydroxyl groups such as those of serine, threonine, or hydroxyproline, (e) aromatic
residues such as those of phenylalanine, tyrosine, or tryptophan, or (1) the amide group of
glutamine. These methods are described in WC 87/05330, published 11 September 1987, and ‘n
Aplin and Wriston (QRQ Qri_t, Rev, Bigghgm, pp. 259-306 [1981]).
Carbohydrate moieties present on an HRG also are removed chemically or
enzymatically. Chemical deglycosylaticn requires exposure of the polypeptide to the
compound trifluoromethanesulionic acid, or an equivalent compound. This treatment results in
the cleavage of most or all sugars except the linking sugar (N-acetylgluccsamine or N-
acetylgalactosamine), while leaving the polypeptide intact. Chemical deglycosylaticn is
described by Hakimuddin etal. (Arch. Biochem. Biophys., 259152 [1987]) and by Edge et al.
(Anal. Biochem.,118:131 (1981)). Carbohydrate moieties are removed from HRG by a variety
of endo- and exo- glycosidases as described by Thotakura et al. (Meth. EnzymoI.,138:350
[1987]).
Glycosylation added during expression in cells also is suppressed by tunicamycin as
described by Duskin etal. (J. Biol. Chem., 257:3105 (1982)). Tunicamycin blocks the formation
of protein-N-glycoside linkages.
HRG also is modified by linking HRG to various nonproteinaceous polymers, e.g.,
polyethylene glycol, polypropylene glycol or polyoxyalkylenes, in the manner set forth in U.S.
Pat. Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337.
One preferred way to increase the in viva circulating half life of non-membrane bound
HRG is to conjugate it to a polymer that confers extended half-life, such as polyethylene
glycol (PEG). (Maxiield, at al, Polymer 16,505-509 [1975]; Bailey, F. E., etai, in Nonionic
Surfactants [Schick, M. J., ed.] pp.794-821 [i967]; Abuchowski, A. et al., J. Biol. Chem.
252:3582-3586 [1977]; Abuchowski, A. etal., Cancer Biochem. Biophys. 7:175-186 [1984]; Katre,
N.V. etaI., Proc. Natl. Acad. Sci., 84:1487-1491 [1987]; Goodson, R. etal. Bio Technology,
8:343-346:[1990]). Conjugation to PEG also has been reported to have reduced
immunogenicity and toxicity (Abuohowski, A. et al., J. Biol. Chem., 252:3578-3581 [1977]).
HRG also is entrapped in microcapsules prepared, for example, by coacervation
techniques or by interiacial polymerization (for example, hydroxymethylcellulose or gelatin-
microcapsules and poly-[methylmethacylate] microcapsules, respectively), in colloidal drug
delivery systems (tor example, liposomes, albumin microspheres, microemulsions, nano-
panicles and nanocapsules), or in macroemulsions. Such techniques are disclosed in
16th edition, Osol, A., Ed., (1980).
HRG is also useiul in generating antibodies, as standards in assays for HFtG (e.g., by
labeling HRG tor use as a standard in a radioimmunoassay, enzyme-linked immunoassay, or
radioreceptor assay), in affinity purification techniques, and in competitive-type receptor
binding assays when labeled with radioiodine, enzymes, iluorophores, spin labels, and the like.
Those skilled in the art will be capable of screening variants in order to select the
optimal variant tor the purpose intended. For example, a change in the immunological
character oi HRG, such as a change in affinity for a given antigen or for the HER2 receptor,
is measured by a competitive-type immunoassay using a standard or control such as a native
HRG (in particular native HRG-GFD). Other potential modifications of protein or polypeptide
properties such as redox or thennal stability, hydrophobicity, susceptibility to proteolytic
degradation, stability in recombinant cell culture or in plasma, or the tendency to aggregate
with carriers or into multimers are assayed by methods well known in the art.
. Therggutic use of Hergulin Ligands
While the role of the p185HER9 and its ligands is unknown in normal cell growth and
diiierentiation, it is an object of the present invention to develop therapeutic uses for the
p‘l85HER2 ligands of the present invention in promoting normal growth and development and in
inhibiting abnormal growth, specifically in malignant or neoplastic tissues.
... Therapeutic Commsitions and Administratigg gf HRG
lysine; monosaccharides, disaccharides, and other carbohydrates including glucose, mannose,
or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or sorbitol; salt-
forming counterions such as sodium; and/or nonionlc suriactants such as Tween, Pluronics or
polyethylene glycol (PEG).
HRG or HRG antibody to be used for in vivo administration must be sterile. This is
readily accomplished by filtration through sterile filtration membranes, prior to or following
lyophilization and reconstitution. HRG or antibody to an HRG ordinarily will be stored in
lyophilized form or in solution.
Therapeutic HRG, or HRG specific antibody compositions generally are placed into a
container having a sterile access port, for example, an intravenous solution bag or vial having
a stopper pierceable by a hypodermic injection needle.
HRG, its antibody or HRG variant when used as an antagonist may be optionally
combined with or administered in concert with other agents known for use in the treatment of
malignacies. When HRG is used as an agonist to stimulate the HER2 receptor, for example in
tissue cultures, it may be combined with or administered in concert with other compositions
that stimulate growth such as PDGF, FGF, EGF, growth honnone or other protein growth
factors.
The route ol HRG or HRG antibody administration is in accord with known methods,
e.g., injection or infusion by intravenous, intraperitoneal, intracerebral, intramuscular,
intraocular, intraarterial. or intralesional routes, or by sustained release systems as noted
below. HRG is administered continuously by infusion or by bolus injection. HRG antibody is
administered in the same fashion, or by administration into the blood stream or lymph.
Suitable examples of sustained—release preparations include semipermeable matrices
of solid hydrophobic polymers containing the protein, which matrices are in the fonn of shaped
articles, e.g. films, or microcapsules. Examples of sustained-release matrices include
polyesters, hydrogels [e.g., poly(2-hydroxyethyl-methacrylate) as described by Langer et al.,
J. Biomed. Mater. Res, 15:167-277 (1981) and Langer, Chem. Tech., 12:98-105 (1982) or
poly(vinylalcohol)], polylactides (U.S. Pat. No. 3,773,919, EP 58,481), copolymers of L-
glutamic acid and gamma ethyl-L-glutamate (Sidman et aI., Biopolymers, 22:547-556 [1983]),
non-degradable ethylene-vinyl acetate (Langer et al., supra), degradable lactic acid-glycolic
acid copolymers such as the Lupron DepotTM (injectable micropheres composed of lactic acid-
glycolic acid copolymer and leuprolide acetate), and poly-D-(-)hydroxybutyric acid (EP
133,988). While polymers such as ethylene-vinyl acetate and lactic acid-glycolic acid enable
release of molecules for over 100 days, certain hydrogels release proteins for shorter time
periods. When encapsulated proteins remain in the body for a long time, they may denature or
aggregate as a result of exposure to moisture at 37°C, resulting in a loss of biological activity
and possible changes in immunogenicity. Rational strategies can be devised for protein
stabilization depending on the mechanism involved. For example, if the aggregation mechanism
is discovered to be intermolecular S-S bond formation through -thio-disulfide interchange.
.stabilization may be achieved by modifying sulfhydryl residues, lyophilizing from acidic
solutions, controlling moisture content, using appropriate additives, and developing specific
polymer matrix compositions.
Sustained-release HRG or antibody compositions also include Iiposomally entrapped
HRG or antibody. Liposomes containing HRG or antibody are prepared by methods known per
se: DE 3,218,121; Epstein eta/., Proc. Natl. Acad. Sci. USA, 82:3688-3692 (1985); Hwang et
al., Proc. Natl. Acad. Sci. USA, 77:4030-4034 (1980); EP 52,322; EP 36,676; EP 88,046; EP
143,949; EP 142,641; Japanese patent application 83-118008; U.S. Pat. No. 4,485,045 and
4,544,545; and EP 102,324. Ordinarily the Iiposomes are of the small (about 200-800
Angstroms) unilamelar type in which the lipid content is greater than about 30 mol. %
cholesterol, the selected proportion being adjusted for the optimal HRG therapy. Liposomes
with enhanced circulation time are disclosed in U.S. Pat. No. 5,013,556.
Another use of the present invention comprises incorporating HRG polypeptide or
antibody into formed articles. Such articles can be used in modulating cellular growth and
development. In addition, cell growth and division and tumor invasion may be modulated with
these articles.
An effective amount of HRG or antibody to be employed therapeutically will depend,
for example, upon the therapeutic objectives, the route of administration, and the condition of
the patient. Accordingly, it will be necessary for the therapist to titer the dosage and modify
the route of administration as required to obtain the optimal therapeutic effect. A typical daily
dosage might range from about 1 pg/kg to up to 100 mg/kg or more, depending on the factors
mentioned above. Typically, the clinician will administer HRG or antibody until a dosage is
reached that achieves the desired effect. The progress of this therapy is easily monitored by
conventional assays.
. Herggulin Antibody Preparation and Theragutic Use
The antibodies of this invention are obtained by routine screening. Polyclonal
antibodies to HRG generally are raised in animals by multiple subcutaneous (so) or
intraperitoneal (ip) injections of HRG and an adjuvant. it may be useful to conjugate HRG or
an HRG fragment containing the target amino acid sequence to a protein that is immunogenic
in the species to be immunized, e.g., keyhole limpet hemocyanin, serum albumin, bovine
thyroglobulin, or soybean trypsin inhibitor using a bifunctional or derivatizing agent, for
example, maleimidobenzoyl sullosuccinimide ester (conjugation through cysteine residues), N-
hydroxysuccinimide (through lysine residues), glutaraldehyde, succinic anhydride, SOClg, or
R1N = C = NR, where R and R1 are different alkyl groups.
The route and schedule of immunizing an animal or removing and culturing antibody-
producing cells are generally in keeping with established and conventional techniques for
antibody stimulation and production. While mice are frequently immunized, it is contemplated
that any mammalian subject including human subjects or antibody-producing cells obtained
therefrom can be immunized to generate antibody producing cells.
Subjects are typically immunized against HRG or its immunogenic conjugates or
derivatives by combining 1 mg or 1 ug of HRG immunogen (for rabbits or mice, respectively)
with 3 volumes of Freund's complete adjuvant and injecting the solution intraderrnally at
multiple sites. One month later the subjects are boosted with 1/5 to 1/10 the original amount
of immunogen in Freund's complete adjuvant (or other suitable adjuvant) by subcutaneous
injection at multiple sites. 7 to 14 days later animals are bled and the serum is assayed for
anti-HFIG antibody titer. Subjects are boosted until the titer plateaus. Preferably, the subject
is boosted with a conjugate of the same HRG, but conjugated to a different protein and/or
through a different cross-linking agent. Conjugates also can be made in recombinant cell
culture as protein fusions. Also, aggregating agents such as alum are used to enhance the
'mrnune response.
After immunization, monoclonal antibodies are prepared by recovering immune
lymphoid cells--typically spleen cells or lymphocytes from lymph node tissue-from immunized
animals and immortalizing the cells in conventional fashion, e.g., by fusion with myeloma cells
or by Epstein-Barr (EB)-virus transformation and screening for clones expressing the desired
antibody. The hybridoma technique described originally by Kohler and Milstein, Eur. J; Immunol.
6:511 (1976) has been widely applied to produce hybrid cell lines that secrete high levels of
monoclonal antibodies against many specific antigens.
It is possible to fuse cells of one species with another. However, it is preferable that
the source of the immunized antibody producing cells and the myeloma be from the same
species.
Hybridoma cell lines producing antiHRG are identified by screening the culture
supematants for antibody which binds to HRG. This is routinely accomplished by conventional
immunoassays using soluble HRG preparations or by FACS using cell-bound HFtG and labelled
candidate antibody.
The hybrid cell lines can be maintained in culture in vitro in cell culture media. The cell
lines of this invention can be selected and/or maintained in a composition comprising the
continuous cell line in hypoxanthine-aminopterin thymidine (HAT) medium. In fact, once the
hybridoma cell line is established, it can be maintained on a variety of nutritionally adequate
media. Moreover, the hybrid cell lines can be stored and preserved in any number of
conventional ways, including freezing and storage under liquid nitrogen. Frozen cell lines can be
revived and cultured indefinitely with resumed synthesis and secretion of monoclonal antibody.
The secreted antibody is recovered from tissue culture supernatant by conventional methods
such as precipitation, ion exchange chromatography, affinity chromatography, or the like.
The antibodies described herein are also recovered from hybridoma cell cultures by
conventional methods for purification of lgG or lgM as the case may be that heretofore have
been used to purify these immunoglobulins from pooled plasma, e.g., ethanol or polyethylene
glycol precipitation procedures. The purified antibodies are sterile filtered. and optionally are
conjugated to a detectable marker such as an enzyme or spin label for use in diagnostic
assays of HRG in test samples.
While mouse monoclonal antibodies routinely are used, the invention is not so limited; in
fact, human antibodies may be used and may prove to be preferable. Such antibodies can be
obtained by using human hybridomas (Cote et al., Monoclonal Antibodies and Cancer
Therapy, Alan R. Liss, p. 77 (1985)). Chimeric antibodies, Cabilly et al., (Morrison et al., Proc.
Natl. Acad. Sci, 8136851 (1984); Neuberger et al., Nature 3122604 (1984); Takeda etal.,
Nature 314:452 (1985)) containing a murine anti-HRG variable region and a human constant
region of appropriate biological activity (such as ability to activate human complement and
mediate ADCC) are within the scope of this invention, as are humanized anti-HRG
antibodiesproduced by conventional CRD-grafting methods.
Techniques for creating recombinant DNA versions of the
antigen-binding regions of antibody molecules (known as Fab or variable regions fragments)
which bypass the generation of monoclonal antibodies are encompassed within the practice of
this invention. One extracts antibody-specific messenger RNA molecules from immune
system cells taken from an immunized subject, transcribes these into complementary DNA
(cDNA), and clones the cDNA into a bacterial expression system and selects for the desired
binding characteristic. The Scripps/Stratagene method uses a bacteriophage lambda vector
system containing a leader sequence that causes the expressed Fab protein to migrate to the
periplasmic space (between the bacterial cell membrane and the cell wall) or to be secreted.
One can rapidly generate and screen great numbers of functional Fab fragments to identify
those which bind HRG with the desired characteristics.
Antibodies specific to HRG-oi, HRG-(31, HRG-(32 and HFlG-B3 may be produced and
used in the manner described above. HRG—a, HRG-Bl, HRG-B2 and HRG-B3 specific
antibodies of this invention preferably do not cross-react with other members of the EGF
family (Fig. 6) or with each other.
Antibodies capable of specifically binding to the HRG-NTD. HRG-GFD or HRG-CTP
are of particular interest. Also of interest are antibodies capable of specifically binding to the
proteolytic processing sites between the GFD and transmembrane domains. These antibodies
are identified by methods that are conventional per se. For example, a bank of candidate
antibodies capable of binding to HRG-ECD or proHRG are obtained by the above methods
using immunization with full proHRG. These can then be subdivided by their ability to bind to
the various HRG domains using conventional mapping techniques. Less preferably, antibodies
specific for a predetermined domain are initially raised by immunizing the subject with a
polypeptide comprising substantially only the domain in question, e.g. HRG-GFD free of NT D or
CTP polypeptides. These antibodies will not require mapping unless binding to a particular
epitope is desired.
Antibodies that are capable of binding to proteolytic processing sites are of particular
interest. They are produced either by immunizing with an HRG fragment that includes the
CTP processing site. with intact HRG, or with HRG-NTD-GFD and then screening for the
ability to block or inhibit proteolytic processing of HRG into the NTD-GFD fragment by
recombinant host cells or isolated cell lines that are otherwise capable of processing HRG to
the fragment. These antibodies are useful for suppressing the release of NTD-GFD and
therefore are promising for use in preventing the release of NTD-GFD and stimulation of the
HER-2 receptor. They also are useful in controlling cell growth and replication. Anti-GFD
antibodies are useful for the same reasons, but may not be as efficient biologically as
antibodies directed against a processing site.
Antibodies are selected that are capable of binding only to one of the members of the
HRG family, e.g. HRG-alpha or any one of the HRG-beta isofonns. Since each of the HRG
family members has a distinct GFD-transmembrane domain cleavage site, antibodies directed
specifically against these unique sequences will enable the highly specific inhibition of each of
the GFDs or processing sites, and thereby refine the desired biological response. For example,
breast carcinoma cells which are HER-2 dependent may in fact be activated only by a single
GFD isotype or, if not, the activating GFD may originate only from a particular processing
sequence, either on the HER-2 bearing cell itself or on a GFD-generating cell. The identification
of the target activating GFD or processing site is a straight-forward matter of analyzing
HER-2 dependent carcinomas, e.g., by analyzing the tissues for the presence of a particular
GFD family member associated with the receptor, or by analyzing the tissues for expression
of an HHG family member (which then would serve as the therapeutic target). These
selective antibodies are produced in the same fashion as described above, either by
immunization with the target sequence or domain, or by selecting from a bank of antibodies
having broader specificity.
As described above, the antibodies should have high specificity and affinity for the
target sequence. For example, the antibodies directed against GFD sequences should have
greater affinity for the GFD than GFD has for the HER-2 receptor. Such antibodies are
selected by routine screening methods.
. N n-T uti f in nd it Anti ies
The nucleic acid encoding HRG may be used as a diagnostic for tissue specific typing.
For example, such procedures as in situ hybridization, and Nonhem and Southern blotting, and
PCR analysis may be used to detennine whether DNA and/or RNA encoding HRG are
present in the cell type(s) being evaluated. In particular. the nucleic acid may be useful as a
specific probe for certain types of tumor cells such as, for example, mammary gland, gastric
and colon adenocarcinomas, salivary gland and other tissues containing the p185HER2.
Isolated HRG may be used in quantitative diagnostic assays as a standard or control
against which samples containing unknown quantities of HRG may be compared.
isolated HRG may be used as a growth factor for invitro cell culture, and invivo to
promote the growth of cells containing p185H5R?- or other analogous receptors.
HRG antibodies are useful in diagnostic assays for HRG expression in specific cells or
tissues. The antibodies are labeled in the same fashion as HRG described above and/or are
immobilized on an insoluble matrix.
HRG antibodies also are useful for the affinity purification of HRG from recombinant
cell culture or natural sources. HRG antibodies that do not detectably cross-react with other
HRG can be ‘used to purify HRG free from other known ligands or contaminating protein.
Suitable diagnostic assays for HRG and its antibodies are well known per se. Such
assays include competitive and sandwich assays, and steric inhibition assays. Competitive
and sandwich methods employ a phase-separation step as an integral part of the method
while steric inhibition assays are conducted in a single reaction mixture. Fundamentally, the
same procedures are used for the assay of HRG and for substances that bind HRG, although
certain methods will be favored depending upon the molecular weight of the substance being
assayed. Therefore, the substance to be tested is referred to herein as an analyte,
irrespective of its status otherwise as an antigen or antibody, and proteins that bind to the
analyte are denominated binding partners, whether they be antibodies, cell surface receptors,
or antigens.
Analytical methods for HRG or its antibodies all use one or more of the following
reagents: labeled analyte analogue, immobilized analyte analogue, labeled binding partner,
immobilized binding partner and steric conjugates. The labeled reagents also are known as
"tracers."
The label used (and this is also useful to label HRG encoding nucleic acid for use as a
probe) is any detectable functionality that does not interfere with the binding of analyte and
its binding partner. Numerous labels are known for use in immunoassay, examples including
moieties that may be detected directly, such as tluorochrome, chemiluminescent, and
radioactive labels, as well as moieties, such as enzymes, that must be reacted or derivatized
to be detected. Examples of such labels include the radioisotopes 33P, 14C, 135i, 3H, and
131i, fluorophores such as rare earth chelates or fluorescein and its derivatives, rhodamine
and its derivatives, dansyl, umbelliferone, luciferases, e.g., firefly luciferase and bacterial
luciferase (U.S. Pat. No. 4,737,456), luciferin, 2,3-dihydrophthalazinediones, horseradish
peroxidase (HRP), alkaline phosphatase, B-galactosidase, glucoamylase, lysozyme,
saccharide oxidases, e.g., glucose oxidase, galactose oxidase,'and glucosephosphate
dehydrogenase, heterocyclic oxidases such as uricase and xanthine oxidase, coupled with an
enzyme that employs hydrogen peroxide to oxidize a dye precursor such as HRP,
lactoperoxidase, or microperoxidase, biotin/avidin, spin labels, bacteriophage labels, stable
free radicals, and the like.
Conventional methods are available to bind these labels covalently to proteins or
polypeptides. For instance, coupling agents such as dialdehydes, carbodiimides, dimaleimides,
bis-imidates, bis-diazotized benzidine, and the like may be used to tag the antibodies with the
above-described fluorescent, chemiluminescent, and enzyme labels. See, for example, U.S.
Pat. Nos. 3,940,475 (lluorimetry) and 3,645,090 (enzymes); Hunter etal., Nature, 144:945
(1962); David etaI., Biochemistry, 13:1014-1021 (1974); Pain et al., J. lmmunol. Methods,
40:219-230 (1981); and Nygren, J. Histochem. and Cyfochem., 30:407-412 (1982). Preferred
labels herein are enzymes such as horseradish peroxidase and alkaline phosphatase. The
conjugation of such label, including the enzymes, to the antibody is a standard manipulative
procedure for one of ordinary skill in immunoassay techniques. See, for example, O'Su|livan et
aI., ‘Methods for the Preparation of Enzyme-antibody Conjugates for Use in Enzyme
immunoassay,‘ in Methods in Enzymglggy, ed. J.J. Langone and H. Van Vunakis, Vol. 73
(Academic Press, New York, New York, 1981), pp. 147-166. Such bonding methods are
suitable for use with HFlG or its antibodies, all of which are proteinaceous.
immobilization of reagents is required for certain assay methods. Immobilization
entails separating the binding partner from any analyte that remains free in solution. This
conventionally is accomplished by either insolubilizing the binding partner or analyte analogue
before the assay procedure, as by adsorption to a water-insoluble matrix or surface (Bennich
et al., U.S. Pat. No. 3,720,760), by covalent coupling (for example, using glutaraldehyde cross-
linking), or by insolubilizing the partner or analogue afterward, e.g., by immunoprecipitation.
Other assay methods, known as competitive or sandwich assays, are well
established and widely used in the commercial diagnostics industry.
Competitive assays rely on the ability of a tracer analogue to compete with the test
sample analyte for a limited number of binding sites on a common binding partner. The binding
partner generally is insolubilized before or after the competition and then the tracer and
analyte bound to the binding partner are separated from the unbound tracer and analyte.
This separation is accomplished by decanting (where the binding partner was preinsolubilized)
or by centrifuging (where the binding partner was precipitated after the competitive reaction).
The amount of test sample analyte is inversely proportional to the amount of bound tracer as
measured by the amount of marker substance. Dose-response curves with known amounts of
analyte are prepared and compared with the test results to quantitatively determine the
amount of analyte present in the test sample. These assays are called ELISA systems when
enzymes are used as the detectable markers.
Another species of competitive assay, called a “homogeneous” assay, does not
require a phase separation. Here, a conjugate of an enzyme with the analyte is prepared and
used such that when anti-analyte binds to the analyte the presence of the anti-analyte
modifies the enzyme activity. in this case, HRG or its immunologically active fragments are
conjugated with a bifunctional organic bridge to an enzyme such as peroxidase. Conjugates
are selected for use with anti-HRG so that binding of the anti-HRG antibody inhibits or
potentiates the enzyme activity of the label. This method per se is widely practiced under the
name of EMlT.
Steric conjugates are used in steric hindrance methods for homogeneous assay.
These conjugates are synthesized by covalently linking a low-molecular-weight hapten to a
small analyte so that antibody to hapten substantially is unable to bind the conjugate at the
same time as anti-analyle. Under this assay procedure the analyte present in the test
sample will bind anti-analyte, thereby allowing anti-hapten to bind the conjugate, resulting in a
change in the character of the conjugate hapten, e.g., a change in fluorescence when the
hapten is a fluorophore.
Sandwich assays particularly are useful for the determination of HRG or HRG
antibodies. In sequential sandwich assays an immobilized binding partner is used to adsorb
test sample analyte, the test sample is removed as by washing. the bound analyle is used to
adsorb labeled binding partner. and bound material is then separated from residual tracer.
The amount of bound tracer is directly proportional to test sample analyte. in "simultaneous"
sandwich assays the test sample is not separated before adding the labeled binding partner.
A sequential sandwich assay using an anti-HRG monoclonal antibody as one antibody and a
polyclonal anti-HRG antibody as the other is useful in testing samples for HRG activity.
The foregoing are merely exemplary diagnostic assays for HFlG and antibodies.
Other methods now or hereafter developed for the determination of these analytes are
included within the scope hereof. including the bioassays described above.
HRG polypeptides may be used for affinity purification of receptors such as the
p185“ER2 and other similar receptors that have a binding affinity for HRG, and more
specifically HRG-oi, HRG-B1, HFlG-B2 and HRG-B3. HFlG-oz, HRG-B1, HRG-[32 and HRG-
[33 may be used to fonn fusion polypeptides wherein HRG portion is useful for affinity binding
to nucleic acids and to heparin.
HRG polypeptides may be used as ligands for competitive screening of potential
agonists or antagonists for binding to p185H5R?. HRG variants are useful as standards or
controls in assays for HRG provided that they are recognized by the analytical system
employed, e.g. an anti-HRG antibody. Antibody capable of binding to denatured HRG or a
fragment thereof. is employed in assays in which HRG is denatured prior to assay, and in this
assay the denatured HFlG or fragment is used as a standard or control. Preferably, HRG-<1,
HRG-[31, HRG-[32 and HRG-B3 are detectably labelled and a competition assay for bound
p185HER2 is conducted using standard assay procedures.
The methods and procedures described herein with HRG-or may be applied similarly to
HRG-B1, HRG-[32 and HRG-B3 and to other novel HRG ligands and to their variants. The
following examples are offered by way of illustration and not by way of limitation.
EXAMPLES
Examplet
Pre rati n fBre tC ncer ell ernat nt
Heregulin-oi was isolated from the supernatant of the human breast carcinoma MDA-
MB-23i. HRG was released into and isolated from the cell culture medium.
. 9.eLQulMe
MDA-MB-231, human breast carcinoma cells, obtainable from the American Type
Culture Collection (ATCC HTB 26), were initially scaled-up from 25 cm? tissue culture flasks
to 890 cm? plastic roller bottles (Corning, Corning. N Y) by serial passaging and the seed train
was maintained at the roller bottle scale. To passage the cells and maintain the seed train,
flasks and roller bottles were tirst rinsed with phosphate buffered saline (PBS) and then
incubated with trypsin/EDTA (Sigma, St. Louis, Mo) for 1-3 minutes at 37°C. The detached
cells were then pipetted several times in trash culture medium containing fetal bovine serum
(FBS), (Gibco, Grand island, NY) to break up cell clumps and to inactivate the trypsin. The
cells were finally split at a ratio of 1:10 into fresh medium, transferred into new flasks or
bottles, incubated at 37°C, and allowed to grow until nearly confluent. The growth medium in
which the cells were maintained was a combined DME/Ham's-F-12 medium formulation
moditied with respect to the concentrations of some amino acids, vitamins, sugars, and salts,
and supplemented with 5% FBS. The same basal medium is used for the serum-free ligand
production and is supplemented with 0.5% Primatone RL (Sheffield, Norwich, NY).
I: n
Large scale MDA-MB-231 cell growth was obtained by using Percell Biolytica
microcarriers (Hyclone Laboratories, Logan, UT) made of weighted cross-linked gelatin. The
microcarriers were first hydrated, autoclaved, and rinsed according to the manufacturers
recommendations. Cells from 10 roller bottles were trypsinized and added into an inoculation
spinner vessel which contained three liters of growth medium and 10-20 g of hydrated
microcarriers. The cells were stirred gently for about one hour and transferred into a ten-liter
instrumented fennenter containing seven liters of growth medium. The culture was agitated
at 65-75 rpm to maintain the microcarriers in suspension. The fennenter was controlled at
37°C and the pH was maintained at 7.0-7.2 by the addition of sodium carbonate and CO2, Air
and oxygen gases were sparged to maintain the culture at about 40% of air saturation. The
cell population was monitored microscopically with a fluorescent vital stain (fluorescein
diacetate) and compared to trypan blue staining to assess the relative cell viability and the
degree of microcarrier invasion by the cells. Changes in cell-microcarrier aggregate size were
monitored by microscopic photography.
Once the microcarriers appeared 90-100% confluent, the culture was washed with
serum-tree medium to remove the serum. This was accomplished by stopping the agitation
and other controls to allow the carriers to settle to the bottom of the vessel. Approximately
nine liters of the culture supernatant were pumped out of the vessel and replaced with an
equal volume of serum-tree medium (the same basal medium described as above supplemented
either with or without Primatone RL). The microcarriers were briefly resuspended and the
process was repeated until a 1000 fold removal of FBS was achieved. The cells were then
incubated in the serum-tree medium for 3-5 days. The glucose concentration in the culture
was monitored daily and supplemented with additions of glucose as needed to maintain the
concentration in the fermenter at or above 1 g/L. At the time of harvest, the microcarriers
were settled as described above and the supernatant was aseptically removed and stored at
2-8°C for purification. Fresh serum-free medium was replaced into the fermenter, the
microcarriers were resuspended, and the culture was incubated and harvested as before.
This procedure could be repeated four times.
Examplez
Purification Qt growth Fggtgr Activity
Conditioned media (10-20 liters) from MDA-MB-231 cells was clarified by
centrifugation at 10,000 rpm in a Sorvall Centrifuge, filtered through a 0.22 micron filter and
then concentrated 10-50 (approx. 25) fold with a Minitan Tangential Flow Unit (Millipore
Corp.) with a 10 kDa cutoff polysulfone membrane at room temperature. Alternatively,
media was concentrated with a 2.5L Amicon Stirred Cell at 4°C with a YM3 membrane.
After concentration, the media was again centrifuged at 10,000 rpm and the supernatant
frozen in 35-50 ml aliquots at -80°C.
Heparin Sepharose was purchased from Pharmacia (Piscataway, NJ) and was
prepared according to the directions of the manufacturer. l-"we milliliters of the resin was
packed into a column and was extensively washed (100 column volumes) and equilibrated with
phosphate buffered saline (PBS). The concentrated conditioned media was thawed, filtered
through a 0.22 micron filter to remove particulate material and loaded onto the heparin-
Sepharose column at a flow rate of 1 mil min. The normal load consisted of 30-50 mls of 40-
told concentrated media. After loading, the column was washed with PBS until the
absorbance at 280 nm retumed to baseline before elution of protein was begun. The column
was eluted at 1 mVmin with successive salt steps of 0.3 M, 0.6 M, 0.9 M and (optionally) 2.0 M
NaCl prepared in PBS. Each step was continued until the absorbance returned to baseline,
usually 6-10 column volumes. Fractions of 1 milliliter volume were collected. All of the
fractions corresponding to each wash or salt step were pooled and stored for subsequent
assay in the MDA-MB-453 cell assay.
The majority of the tyrosine phosphorylation stimulatory activity was found in the
0.6M NaCl pool which was used for the next step of purification. Active fractions from the
heparin-Sepharose chromatography were thawed, diluted three told with deionized (MilliQ)
water to reduce the salt concentration and loaded onto a polyaspartic acid column (PolyCAT
A, 4.6 x 100 mm, PolyLC, Columbia, MD.) equilibrated in 17 mM Na phosphate, pH 6.8. All
buffers for this purification step contained 30% ethanol to improve the resolution of protein on
this column. After loading, the column was washed with equilibration buffer and was eluted
with a linear salt gradient from 0.3 M to 0.6 M NaCl in 17 mM Na phosphate, pH 6.8, buffer.
The column was loaded and developed at 1 ml/min and 1 ml fractions were collected during the
gradient elution. Fractions were stored at 4°C. Multiple heparin-Sepharose and PolyCat
columns were processed in order to obtain sufficient material for the next purification step. A
typical absorbance profile from a PolyCat A column is shown in Figure 1. Aliquots of 10-25
pL were taken from each fraction for assay and SDS gel analysis.
Tyrosine phosphorylation stimulatory activity was found throughout the eluted
fractions of the PolyCAT A column with a majority of the activity found in the fractions
corresponding to peak C of the chromatogram (salt concentration of approximately 0.45M
NaCl). These fractions were pooled and adjusted to 0.1% trifluoracetic acid (TFA) by
addition of 0.1 volume of 1% TFA. Two volumes of deionized water were added to dilute the
ethanol and salt from the previous step and the sample was subjected to further purification
on high pressure liquid chromatography (HPLC) utilizing a C4 reversed phase column
(SynChropak RP-4, 4.6 x100 mm) equilibrated in a buffer consisting of 0.1% TFA in water
with 15% acetonitrile. The HPLC procedure was carried out at room temperature with a
flow rate of 1 ml/min. After loading of the sample, the column was re-equilibrated in 0.1%
TFA/15% acetonitrile. A gradient of acetonitrile was established such that over a 10 minute
period of time the acetonitrile concentration increased from 15 to 25% (1%/min).
Subsequently, the column was developed with a gradient from 25 to 40% acetonitrile over 60
min time (0.25%/min). Fractions of 1 ml were collected, capped to prevent evaporation, and
stored at 4°C. Aliquots of 10 to 50 pL were taken, reduced to dryness under vacuum
(speedvac), and reconstituted with assay buffer (PBS with 0.1 % bovine serum albumin) for
the tyrosine phosphorylation assay. Additionally, aliquots of 10 to 50 pL were taken and dried
as above for analysis by SDS gel electrophoresis. A typical HPLC profile is shown in Figure
2.
A major peak of activity was found infraction 17 (Figure 2B). By SDS gel analysis,
fraction 17 was found to contain a single major protein species which comigrated with the
45,000 dalton molecular weight standard (Figs. 2C, 3). In other preparations, the presence of
the 45,000 dalton protein comigrated with the stimulation of tyrosine phosphorylation activity
in the MDA-MB-453 cell assay. The chromatographic properties of the 45,000 dalton protein
were atypical; in contrast to many other proteins in the preparation, the 45,000 dalton protein
did not elute from the reversed phase column within 2 or 3 fractions. Instead, it was eluted
over 5-10 fractions. This is possibly due to extensive post-translational modifications.
Fractions containing the 45.000 dalton protein were dried under vacuum for amino acid
sequencing. Samples were redissolved in 70% formic acid and loaded into an Applied
Biosystems, Inc. Model 470A vapor phase sequencer for N—terrninal sequencing. No
discemable N-terrninal sequence was obtained, suggesting that the N-terminal residue was
blocked. Similar results were obtained when the protein was first run on an SDS gel.
transblotted to ProBlott membrane and the 45,000 dalton band excised after localization by
rapid staining with Coomassie Brilliant Blue. .
lntemal amino acid sequence was obtained by subjecting fractions containing the
45,000 dalton protein to partial digestion using either cyanogen bromide, to cleave at
methionine residues, Lysine-C to cleave at the C-terminal side of lysine residues, or Asp-N to
cleave at the N-terminal side of aspartic acid residues. Samples after digestion were
sequenced directly or the peptides were first resolved by HPLC chromatography on a
Synchrom C4 column (4000A, 2 x 100 mm) equilibrated in 0.1% TFA and eluted with a_ 1-
propanol gradient in 0.1% TFA. Peaks from the chromatographic run were dried under
vacuum before sequencing.
Upon sequencing of the peptide in the peak designated number 15 (lysine C-15),
several amino acids were found on each cycle of the run. Alter careful analysis, it was clear
that the fraction contained the same basic peptide with several diflerent N-tennini, giving rise
to the multiple amino acids in each cycle. After deconvolution,the following sequence was
detennined (SEQ ID No.3):
[A]AEKEKTF[C]VNGGEXFMVKDLXNP
1 5 10 15 _ 20
(Residues in brackets were uncertain while an X represents a cycle in which it was not
possible to identity the amino acid.) _ A
The initial yield was 8.5 pmoles. This sequence comprising 24 amino acids did not
correspond to any previously known protein. Residue 1 was later found from the cDNA
sequence to be Cys and residue 9 was found to be correct. The unknown amino acids at
positions 15 and 22 were found to be Cys and Cys, respectively.
Sequencing on samples after cyanogen bromide and Asp-N digestions, but without
separation by HPLC. were pertormed to corroborate the cDNA sequence. The sequences
obtained are given in Table l and coniirrn the sequence for the 45,000 protein deduced from the
cDNA sequence. The N-terminal of the protein appears to be blocked with an unknown
blocking group. On one occasion, direct sequencing of the 45,000 dalton band from a PVDF
blot revealed this sequence with a very small initial yield (0.2 pmole)(SEQ ID NO:4):
XEXKE(G)(R)GK(G)K(G)KKKEXGXG(K)
(Residues which could not be determined are represented by "X', while tentative residues are
in parentheses). This corresponds to a sequence starting at the serine at position 46 near the
present N~terminal of HRG cDNA sequence; this suggests that the N terminus of the 45,000
protein is at or before this point in the sequence.
Example 3
Clonin and uencin ofHuman Her ulin
The cDNA cloning of the p185HER«°- ligand was accomplished as follows. A portion of
the lysine C-15 peptide amino acid sequence was decoded in order to design a probe for
cDNA's encoding the 45kD HRG-on ligand. The following 39 residue long eight fold degenerate
deoxyoligonucleotide corresponding to the amino acid sequence(SEQ lD NO:5) NH2-
...AEKEKTFXVNGGE was chemically synthesized (SEQ ID NO:6):
' GCTGAGAAGGAGAAGACCTTCTGT/CGTGAAT/CGGA/CGGCGAG 5'.
The unknown amino acid residue designated by X in the amino acid sequence was assigned as
cysteine for design of the probe. This probe was radioactively phosphorylated and employed
to screen by low stringency hybridization an oligo dT primed cDNA library constructed from
human MDA-MB-231 cell mRNA in Agt10 (Huyng at al., 1984, In DNA Cloning, Vol 1: A
Practical Approach (D. Glover, ed) pp.49-78. IRL Press, Oxford). Two positive clones
designated kgt10her16 and 7.gt10her13 were identified. DNA sequence analysis revealed that
these two clones were identical.
The 2010 basepair cDNA nucleotide sequence of 7Lgt10her16 (Fig. 4) contains a single
long open reading frame of 669 amino acids beginning with alanine at nucleotide positions 3-5
and ending with glutamine at nucleotide positions 2007-2009. No stop codon was found in the
translated sequence; however, later analysis of heregulin [3-type clones indicates that
methionine encoded at nucleotide positions 135-137 was the initiating methionine. Nucleotide
sequence homology with the probe is found between and including bases 681-719. Homology
between those amino acids encoded by the probe and those flanking the probe with the amino
acid sequence determined for the lysine C-15 fragment verily that the isolated clone encodes
at least the lysine C-15 fragment of the 45kD protein.
Hydropathy analysis shows the existence of a strongly hydrophobic amino acid region
including residues 287-309 (Fig. 4) indicating that this protein contains a transmembrane or
internal signal sequence domain and thus is anchored to the membrane of the cell.
The 669 amino acid sequence encoded by the 2010bp cDNA sequence contains
potential sites for asparagine-linked glycosylation (Winzler,Fi. in Hormonal Proteins and
Peptides, (Li, C.H. ed) pp 1-15 Academic Press, New York (1973)) at positions asparagine
164, 170, 208, 437 and 609. A potential D-glycosylation site (Marshall,R.D. (1974) Biochem.
Soc. Symp. 40:17-26) is presented in the region including a cluster of serine and threonine
residues at amino acid positions 209-218. Three sites of potential glycosaminoglycan addition
(Goldstein, L.A., et al. (1989) Cell 56:1063-1072) are positioned at the serine-glycine dipeptides
occurring at amino acids 42-43, 64-65 and 151-152. Glycosylation probably accounts for the
discrepancy between the calculated NW of about 26KD for the NTD-GFD (extracellular)
region of HRG and the observed NW of about 45 KD for purified HRG.
This amino acid sequence shares a number of features with the epidennal growth
factor (EGF) family of transmembrane bound growth factors (Carpenter,G., and Cohen,S.
(1979) Ann. Rev. Biochem.4B: 193-216; Massenque, J.(1990) J. Biol. Chem. 265: 21393-21396)
including 1) the existence of a protorm of each growth factor from which the mature form is
proteolytically released (Gray,A., Dull, T.J., and Ullrich, A. (1983) Nature 303, 722-725; Bell,
G.l. etal., (1986) Nuc. Acid Res., 14: 8427-8446; Derynck, R. etal. (1984) Cell: 287-297); 2)
the conservation of six cysteine residues characteristically positioned over a span of
approximately 40 amino acids (the EGF-like structural motif) (Savage,R.C., et al. (1973) J.
Biol. Chem. 248: 7669-7672); HRG-oz cysteines 226, 234, 240, 254, 256 and 265 ); and, 3) the
existence of a transmembrane domain occurring proximally on the carboxy-terminal side of
the EGF homologous region (Fig. 4 and 6).
Alignment of the amino acid sequences in the region of the EGF motif and flanking
transmembrane domain of several human EGF related proteins (Fig. 6) shows that between
the first and sixth cysteine of the EGF motif HRG is most similar (50%) to the heparin binding
EGF-like growth factor (HB-EGF) (Higashiyama, S. etal. (1991) Science 251: 936-939). In
this same region HRG is ~35% homologous to amphiregulin (AR) (Plowman, G.D.et al., (1990)
Mol. Cell. Biol. 10: 1969-1981), ~32°/o homologous to transforming growth factor or (T GF or)
(8), 27% homologous with EGF (Bell, G.I. etal., (1986) Nuc. Acid Fles., 14: 8427-8446); and
39% homologous to the schwanoma-derived growth factor (Kimura. H., et al., Nature,
348:257-260, 1990). Disulfide linkages between cysteine residues in the EGF motif have been
determined for EGF (Savage, R.C. etal. (1973) J. Biol. Chem. 248: 7669-7672). These
disulfides define the secondary structure of this region and demarcate three loops. By
numbering the cysteines beginning with 1 on the amino-terminal end, loop 1 is delineated by
cysteines 1 and 3; loop 2 by cysteines 2 and 4; and loop 3 by cysteines 5 and 6. Although the
exact disulfide configuration in the region for the other members of the family has not been
determined, the strict conservation of the six cysteines, as well as several other residues i.e.,
glycine 238 and 262 and arginine at position 264, indicate that they too most likely have the
same arrangement. HRG-(X. and EGF both have 13 amino acids in loop 1. HB-EGF,
amphregulin (AR) and TGF or have 12 amino acids in loop 1. Each member has 10 residues in
loop 2 except HRG-or which has 13. V All five members have 8 residues in the third loop.
EGF, AR, HB-EGF and TGF-or are all newly synthesized as membrane anchored
proteins by virtue of their transmembrane domains. The proproteins are subsequently
processed to yield mature active molecules. In the case of TGF-ct there is evidence that the
membrane associated protorms of the molecules are also biologically active (Brachmann,
R.,et al. (1989) Cell 56: 691-700), a trait that may also be the case for HRG-or. EGF is
synthesized as a 1168 amino acid transmembrane bound proEGF that is cleaved on the amino-
terminal end between arginine 970 and asparagine 971 and at the carboxy-terminal end
between arginine 1023 and histidine 1024 (Carpenter,G., and Cohen,S. (1979) Ann. Rev.
Biochem.48: 193-216) to yield the 53 amino acid mature EGF molecule containing the three
loop, 3 disulfide bond signature structure. The 252 amino acid proAR is cleaved between
aspartic acid 100 and serine 101 and between lysine 184 and serine 185 to yield an 84 amino
acid form of mature AR and a 78 amino acid form is generated by NH2-terminal cleavage
between glutamine 106 and valine 107 (Plowman, G.D. et al., (1990) Mol. Cell. Biol. 10: 1969-
1981). HB-EGF is processed from its 208 amino acid primary translation product to its
proposed 84 amino acid tonn by cleavage between arginine 73 and valine 74 and a second site
approximately 84 amino acids away in the carboxy-temtinal direction (Higashiyama, S., et
al., and Klagsburn, M. (1991) Science 251: 936-939). The 160 amino acid profonn of TGF or is
processed to a mature 50 amino acid protein by cleavages between alanine 39 and valine on one side and downstream cleavage between alanine 89 and valine 90 (Derynck et al.,
(1984) Cell: 38: 287-297). For each of the above described molecules COOH-terminal
processing occurs in the area bounded by the sixth cysteine of the EGF motif and the
beginning of the transmembrane domain. -
The residues between the first and sixth cysteines oi HRGs are most similar (45%)
to heparin—binding EGF-like growth factor (HB-EGF). In this same region they are 35%
identical to amphiregulin (AR), 32% identical to TGF-or, and 27% identical with EGF. Outside
of the EGF motif there is little similarity between HRGS and other members of the EGF
family. EGF, AR, HB-EGF and TGF-on are all derived from membrane anchored proproteins
which are processed on both sides of the EGF structural unit, yielding 50-84 amino acid mature
proteins (16-19). Like other EGF family members, the HFtGs appear to be derived from a
membrane-bound proform but require only a single cleavage, C-terminal to the cysteine
cluster, to produce mature protein.
HRG may exert its biological function by binding to its receptor and triggering the
transduction of a growth modulating signal. This it may accomplish as a soluble molecule or
perhaps as its membrane anchored form such as is sometimes the case with TGF on
(Brachmann, R., et al., (1989) Cell 56: 691-700). "Conversely, or in addition to stimulating
signal transduction, HRG may be internalized by a target cell where it may then interact with
the controlling regions of other regulatory genes and thus directly deliver its message to the
nucleus of the cell. The possibility that HRG mediates some of its effects by a mechanism
such as this is suggested by the fact that a potential nuclear location signal (Roberts,
Biochem-Biophys Acta (1989) 1008: 263-280) exists in the region around the three lysine
residues at positions 58-60 (Fig. 4).
The isolation of full-length cDNA of HRG-or is accomplished by employing the DNA
sequence of Fig 4 to select additional cDNA sequences from the cDNA library constructed
from human MDA-MB-231. Full-length cDNA clones encoding HRG-oc are obtained by
identifying cDNAs encoding HRG-on longer in both the 3' and 5' directions and then splicing
together a composite of the different cDNAs. Additional cDNA libraries are constructed as
required for this purpose. Following are three types of cDNA libraries that may be
constructed: 1) Oligo-dT primed where predominately stretches of polyadenosine residues are
primed, 2) random primed using short synthetic deoxyoligonucleotides non-specific for any
particular region of the mFtNA, and 3) specifically primed using short synthetic
deoxyoligonucleotides specific for a desired region of the mRNA. Methods for the isolation of
such cDNA libraries were previously described.
Example 4
Detection of HRG-on mRNA Expression bv Northern Analyses
Northern blot analysis of MDA-MB-231 and SK-BR-3 cell mFlNA under high stringency
conditions shows at least five hybridizing bands in MDA-MB-231 mRNA where a 6.4Kb band
predominates: other weaker bands are at 9.4, 6.9.2.8 and 1.8Kb (Fig. 5). No hybridizing band
W
is seen in SK-BR-3 mRNA (this cell line overepresses p185HEF*-°-). The existence of these
multiple messages in MDA-MB-231 cells indicates either alternative splicing of the gene,
various processing of the genes‘ primary transcript or the existance of a transcript of another
homologous message. One of these messages may encode a soluble non-transmembrane
bound form of HRG-ot. Such messages (Fig. 5) may be used to produce cDNA encoding soluble
non-transmembrane bound forms of HRG-on.
Examples
ell rwth im Itin Her ui
Several different breast cancer cell lines expressing the EGF receptor or the
p185HER9 receptor were tested for their sensitivity to growth inhibition or stimulation by ligand
preparations. The cell lines tested were: SK-BR-3 (ATCC HTB 30), a cell line which
overexpresses p185H5R2; MDA-MB-468 (ATCC HTB 132), a line which overexpresses the
EGF receptor; and MCF-7 cells (ATCC HTB 22) which have a moderate level of p185HER2
expression. These cells were maintained in culture and passaged according to established cell
culture techniques. The cells were grown in a 1:1 mixture of DMEM and F-12 media with 10%
fetal bovine serum. For the assay, the stock cultures were treated with trypsin to detach the
cells from the culture dish, and dispensed at a level of about 20000 cells/well in a ninety-six
well microtiter plate. During the course of the growth assay they were maintained in media
with 1% fetal bovine serum, The test samples were sterilized by filtration through 0.22 micron
filters and they were added to quadruplicate wells and the cells incubated for 3-5 days at
37°C. At the end of the growth period. the media was aspirated from each well and the cells
treated with crystal violet (Lewis, G. et al., Cancer Research, 347:5382-5385 [1987]). The
amount of crystal violet absorbance which is proportional to the number of cells in each well
was measured on a Flow Plate Reader. Values from replicate wells for each test sample
were averaged. Untreated wells on each dish served as controls. Results were expressed as
percent of growth relative to the control cells.
The purified HFtG-ot ligand was tested tor activity in the cell growth assay and the
results are presented in Figure 7. At a concentration of approximately 1 nM ligand, both of
the cell lines expressing the p185HER«°- receptor (SK-BR-3 and MCF-7) showed stimulation of
growth relative to the controls while the cell type (MDA-MB-468) expressing only the EGF
receptor did not show an appreciable response. These results were consistent to those
obtained from the autophosphorylation experiments with the various cell lines. These results
established that HRG-or ligand is specific for the p185HER2 receptor and does not show
appreciable interaction with the EGF receptor at these concentrations.
HRG does not compete with antibodies directed against the extra-cellular domain of
p185HER2, but anti-p185HER? Mabs 204 and 7F3 (which are antiproliferative in their own
right) do antagonize HRG.
Example 6
Clonin nd uencin of ul'n- 1
The isolation of HRG-B1 cDNA was accomplished by employing a hybridizing
fragment of the DNA sequence encoding HRG-on to select additional cDNA sequences from
the cDNA library constructed from human MDA-MB-231 cells. Clone lher11.1dbl (heregulin-
]31) was identified in a Aglio oligo~dT primed cDNA library derived from MDA MB231 polyA+
mFiNA. Radioactively labelled synthetic DNA probes corresponding to the 5' and 3' ends of
7l.her16 (l-IRG-or) were employed in a hybridization reaction under high stringency conditions to
isolate the ?d1er11.1dbl clone. The DNA nucleotide sequence of the Aher11.1dbl clone is shown
in figure 8 (SEQ ID NO:9) HRG-B1 amino acid sequence is homologous to HRG-or from its
amino-tenninal end at position Asp 15 of HRG-on through the 3'end of HRG-on except at the
positions described below. In addition, HRG-[51 encoding DNA extends 189 base pairs longer
than ?.her16 in the 3‘ direction and supplies a stop codon after Val 675. At nucleotide position
247 of 7Lher11.1dbl there is a G substituted for A thereby resulting in the substitution of
Gln(Q) in place of Arg(Fi) in HRG-B1 as shown in the second line of Figure 9 (SEQ ID NO:8
and SEQ ID N029).
in the area of the EGF motif there are additional differences between HRG-or and
HRG-(31. These differences are illustrated below in an expanded view of the homology
between HRG-on and HRGi-Bl in the region of the EGF motif or the GFD (growth factor
domain). The specific sequence shown corresponds to HRG-on amino acids 221-286 shown in
figure 9. Asterisks indicate identical residues in the comparison below (SEQ ID NO:10 and
SEQ ID NO:11).
KTFCVNGGEC
#*******
I11
N
HEREGULIN-or s H L v K c A
HEREGULIN-[31 * * * * * * *
-9
'0 N
to
LCKCQPGF
9-A-d:*PNE1:
HEREGULIN-on F M V K D L s N P s
* 1- 1- * * -1: 1: g «n -2:
PHKVQNQEK--
MASFYKHLGIE
HEREGULIN-U. '1' G A R c T E N v
HEREGULIN-B1* * D * * Q N Y *
HEREGULIN-(1 - - — A E E L Y Q X R (-Transmembrane)
HEREGULIN-Bl F M E * * * * * * * * (—'l‘ransmembrane)
Example 7
Exgessign of Hflgulins in E. Coli
HRG-or and HRG-B1 have been expressed in E. coli using the DNA sequences of
Figures 4 and 8 encoding heregulin under the control of the alkaline phosphatase promoter and
the STII leader sequence. in the initial characterization of heregulin activity, the precise
natural amino and carboxy termini of the heregulin molecule were not precisely defined.
However. after comparsion of heregulin to EGF and TGF-or sequences. we expected that
shortened forms of heregulin starting around Ser 221 and ending around Glu 277 of figure 4
may have biological activity. Analogous regions of all heregulins may be identified and
expressed. One shortened form was constructed to have an N-tenninal Asp residue followed
by the residues 221 to 277 of HRG-or. Due to an accidental frame shift mutation following Glu
277, HRG-or sequence was extended by 13 amino acids on the carboxy terminal end. Thus,
the carboxy-terminal end was Glu 277 of HRG-or followed by the thirteen amino acid sequence
RPNARLPPGVFYC (SEQ ID NO:20).
Expression of this construct was induced by growth of the cells in phosphate depleted
medium for about 20 hours. Recombinant protein was purified by harvesting cell paste and
resuspending in 10 mM Tris (pH8), homogenizing, incubating at 4°C. for 40 minutes and
followed by centrifuging at 15 K rpm (Sorva|l). The supernatant was concentrated on a 30K
ultrafiltration membrane (Amicon) and the filtrate was applied to a MonoO column
equilibrtated in 10 mM Tris pH8. The flow-through fractions from the MonoO column were
adjusted to 0.05% TFA (trifluoroacetic acid) and subjected to C4 reversed phase HPLC.
Elution was with a gradient of 10-25% acetonitrile in 0.1% TFA/H20. The solvent was
removed by lyophilization and purified protein was resuspended in 0.1% bovine serum albumin
in phosphate buffered saline. Figure 10 depicts HER2 receptor autophophorylation data with
MCF-7 cells in response to the purified E. coli-derived protein. This material demonstrated full
biological activity with an EC5o of 0.8 nM. The purified material was also tested in the cell
growth assays (Example 5) and was found to be a potent stimulator of cell growth.
The recombinant expression vector for synthesis of HRG-B1 was constructed in a
manner similar to HFlG~ot. The expression vector contained DNA encoding HRG-Bi amino
acids from Ser2o7 through LeLl273 (Figure 4). This DNA encoding HRG-(31 was recombinantly
spliced into the expression vector downstream from the alkaline phosphatase promoter and
STII leader sequence. An additional serine residue was spliced on the carboxy terminus as a
result of the recombinant construction process. The expression vector encoding HRG-(31 was
used to transform E. cofi and expressed in phosphate depleted medium. Induced E. coil were
pelleted, resuspended in 10mM Tris (pH7.5) and sonicated. Cell debris was pelleted by
centrifugation and the supernatant was filtered through a sterile filter before assay. The
expression of HRG-(31 was confirmed by the detection of protein having the ability to
stimulate autophosphorylation of the HER2 receptor in MCF-7 cells.
A similar expression vector was constructed as described for HHG-(31 (above) with a
C terminal tyrosine residue instead of the serine residue. This vector was transformed into E.
coil and expressed as before. Purification of this recombinant protein was achieved as
described for recombinant HRG-oi. Mass spectrometric analysis revealed that the purified
protein consisted of forms which were shorter than expected. Amino acid sequencing showed
that the protein had the desired N-tenninal residue (Ser) but it was found by mass
spectrometry to be truncated at the C terminus The majority (>80%) of the protein
consisted of a lorrn 51 amino acids long with a C terminal methionine (MET 271) (SEQ ID
NO:9). A small amount of a shorter form (49 residues) truncated at VAL 269 was also
detected. However. both the shortened forms showed lull biological activity in the HER2
receptor autophosphorylation assay.
Example 8
I LATION F HEREG LIN 2 VAR AN
Heregulin-B2 and -B3 variants were isolated in order to obtain cDNA clones that
extend further in the 5' direction. A specifically primed'cDNA library was constructed in
7.gt10 by employing the chemically synthesized antisense primer
3‘ CCTTCCCGTTCTTCTTCCTCGCTCC (SEQ ID NO:21). This primer is located
between nucleotides 167-190 in the sequence of 7Lher16 (figure 4). The isolation of clone
>.5'her13 (not to be confused with 7.her13) was achieved by hybridizing a synthetic DNA
probe corresponding to the 5' end of Aher16 under high stringency conditions with the
specifically primed cDNA library. The nucleotide sequence olA,5'her13 is shown in figure 11
(SEQ ID NO:22). The 496 base pair nucleotide sequence of l5'her13 is homologous to the
sequence of kher16 between nucleotides 309-496 of 7l5'her13 and 3-190 of }.her16. }.5‘her1 3
extends by 102 amino acids the open reading frame of 7J1er16.
The isolation of variant heregulin-B forms was accomplished by probing a newly
prepared oligodT primed }.gt10 MDA-MB-231 mRNA-derived cDNA library with synthetic
probes corresponding to the 5' end of A5'her13 and the cysteine rich EGF-like region of 7Lher16.
Three variants of heregulin-B were identified, isolated and sequenced. The amino acid
homologies between all heregulins is shown in figure 15 (SEQ ID NOS:26-30).
HRG polypeptides 7ther76 (heregulin-B2) (SEQ ID N023), >.her78 (heregulin-[33)
(SEQ ID NO:24) and lher84 (heregulin [32-like) (SEQ ID NO:25) are considered variants of
}..her11.1db| (heregulin-B1) because although the deduced amino acid sequence is identical
between cysteine 1 and cysteine B of the EGF-like motif their sequences diverge before the
predicted transmembrane domain which probably begins with amino acid 248 in Aher11.1dbl.
The nucleotide sequences and deduced amino acid sequences of Aher76, ).her78 and kher84
are shown in figures 12, 13 and 14.
The variants each contain a TGA stop codon 148 bases 5' of the first methionine
codon in their sequences. Therefore the ATG codon at nucleotide position 135-137 of ?ther16
and the corresponding ATG in the other heregulin clones may be defined as the initiating
methionine (amino acid 1). Clones )ther11.1dbl, >cher76. 7cher84 and )cher7l-3 all encode
glutamine at amino acid 38 (Figure 15) whereas clone her16 encodes arginine (Figure 4,
position 82).
The deduced amino acid sequence of }J1er76 (heregulin-B1) reveals a full-length clone
encoding 637 amino acids. it shares an identical deduced amino acid sequence as ichert 1 .1dbl
except that residues corresponding to amino acids 232-239 of 7l.hert1.1dbl have been deleted.
The deduced amino acid sequence of ?ther84 shows that it posesses the same amino acid
sequence as ?ther76 from the initiating methionine (amino acid 1, Figure 15) through the EGF-
like area and transmembrane domain. However, 7ther84 comes to an early stop codon at
arginine 421 (7ther84 numbering). Thereafter the 3' untranslated sequence diverges. The
deduced amino acid sequence of Aher78 (heregulin-B3) is homologous with heregulins-B‘ and
-B? through amino acid 230 where the sequence diverges for eleven amino acids then
terminates. Thus heregulin-B3 has no transmembrane region. The 3' untranslated sequence is
In order to express heregulin-B forms in mammalian cells, tull-length cDNA nucleotide
sequences from ?ther76 (heregulin-B2) or }.her84 were subcloned into the mammalian
expression vector pRK5.1. This vector is a derivative of pRK5 that contains a
cytomegalovirus promoter followed by a 5' intron, a cloning polylinker and an SV40 early
polyadenylation signal. COS7, monkey or human kidney 293 cells were transiected and
conditioned medium was assayed in the MCF-7 cell p185/her2 autophosphorylation assay. A
positive response confirmed the expression of the cDNA's from 7.her76 (heregulin-B2) and
}.her84 (heregulin-B3).
Supematants from a large scale transient expression experiment were concentrated
on a YM10 membrane (Amicon) and applied to a heparin Sepharose column as described in
Example 1. Activity (tyrosine phosphorylation assay) was detected in the 0.6M NaC| elution
pool and was further puriied on a polyaspartic acid column, as previously described By SDS
gel analysis and activity assays, the active fractions of this column were highly purified and
contained a single band of protein with an apparent molecular weight of 45,000 daltons. Thus,
the expressed protein has chromatographic and structural properties which are very similar to
those of the native form of heregulin originally isolated from the MDA 231 cells. Small scale
transient expression experiments with constructs made from 7.her84 cDNA also revealed
comparable levels of activity in the cell supematants irom this variant form. The expression
of the transmembrane-minus variant, heregulin-B3, is currently under investigation. -
Example 10 '
proHRG-or and proHFiG-B1 cDNAs were spliced into Epstein Barr virus derived
expression vectors containing a cytomegalovirus promoter. rHRGs were purified (essentially
as described in Example 2) from the serum tree conditioned medium oi stably transiected
CEN4 cells [human kidney 293 cells (ATCC No. 1573) expressing the Epstein Barr virus
EBNA-1 transactivator. In other experiments full length proHFtG-or, -[31 and -[32 transient
expression constructs provided p185HER3 phosphorylation activity in the conditioned medium of
transtected COS7 monkey kidney cells. However. similar constructs of lull length proHFlG-B3
tailed to yield activity suggesting that the hydrophobic domain missing in proHFlG-B3 but
present in the other proHRGs is necessary for secretion of mature protein. Truncated
versions of proHRG-or (63 amino acids, serin 177 to tyrosine 239) and proHFlG-[31 (68 amino
acids, serine 177 to tyrosine 241) each encoding the GFD structural unit and immediate
flanking regions were also expressed in E. coli. homologous truncated versions of HRG-B3 are
expected to be expressed as active molecules. These truncated proteins were purified from
the periplasmic space and culture broth of E. coli. transformed with expression vectors
designed to secrete recombinant proteins (C.N. Change, M. Fley, B. Bochenr, H. Heyneker, G.
Gray, Gene, 55:18.9 [1987]). These proteins also stimulated tyrosine phosphorylation of
p185“ER2 but not p107HER1 , indicating that the biological activity of HRG resides in the EGF-
like domain of the protein and that carbohydrate moieties are not essential for activity in this
assay. The NTD does not inhibit or suppress this activity.
Example 11
Various human tissues were examined for the presence of HRG mFlNA. Transcripts
were found in breast, ovary, testis, prostate, heart, skeletal muscle, lung, liver, kidney,
salivary gland, small intestine, and spleen but not in stomach, pancreas, uterus or placenta.
While most of these tissues display the same three classes of transcripts as the MDA-MB-231
cells (6.6 kb, 2.5 kb and 1.8 kb), only the 6.6 kb message was observed for in heart and
skeletal muscle. In brain a single transcript of 2.2 kb is observed and in testis the 6.6 kb
transcript appears along with others of 2.2 kb, 1.9 kb and 1.5 kb, The tissue specific
expression pattern observed for HRG differs from that of p185HER2; for example, adult liver,
spleen, and brain contain HRG but not p185“ER2 transcripts whereas stomach, pancreas,
uterus and placenta contain p185“ER2 transcripts but lack HRG mRNA
SEQUENCE LISTING
(1) GENERAL INFORMATION:
(ii)
(vi)
APPLICANT: Genentech, Inc.
TITLE OF INVENTION: Structure, Production and Use of Heregulin
NUMBER OF SEQUENCES: 30
CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Genentech, Inc.
(B) STREET: 460 Point San Bruno Blvd
(C) CITY: South San Francisco
(D) STATE: California
(E) COUNTRY: USA
(F) ZIP: 94080
COMPUTER READABLE FORM:
(A) MBDIUM TYPE: 5.25 inch, 360 Kb floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: patin (Genentech)
CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE: 21-May-1992
(C) CLASSIFICATION:
(vii) PRIOR APPLICATION DATA:
(Vii)
(vii)
(vii)
(Vii)
(viii)
(ix)
(A) APPLICATION NUMBER:
(B) FILING DATE: 11-May-1992
PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: 07/847743
(B) FILING DATE: 06-Mar-1992
PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: 07/705256
(B) FILING DATE: 24-May-1991
PRIOR APPLICATION DATA:
(A) APPLICATION NUMER: 07/765212
(B) FILING DATE: 25-Sep-1991
PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: 07/790801
(B) FILING DATE: 08-Nov-1991
ATTORNEY/AGENT INFORMATION:
(A) NAME: Hensley, Max D.
(B) REGISTRATION NUMBER: 27,043
(C) REFERENCE/DOCKET NUMBER: 712P4
TELECOMUNICATION INFORMATION:
(A) TELEPHONE: 415/266-1994
(B) TELEFAX: 415/952-9881
(C) TELEX: 910/371-71(2) INFORMATION FOR sag ID NO:1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6 bases
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l:
CNCAAT 6
(2) INFORMATION FOR SEQ ID NO:2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6 bases
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:
AATAAA 6
(2) INFORMATION FOR SEQ ID NO:3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:
Ala Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Xaa
1 5 10 15
Phe Met Val Lys Asp Leu Xaa Asn Pro
24
(2) INFORMATION FOR SEQ ID NO:4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:
Xaa Glu Xaa Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys Lys Lys
‘ 5 10 15
Glu Xaa Gly Xaa Gly Lys
(2)
INFORMATION FOR SEQ ID NO:5:
(i) SEQUENCE CHARACTERISTICS:
13 amino acids
(xi) SEQUENCE DESCRIPTION:
Ala Glu Lys Glu Lys Thr Phe Xaa
(A) LENGTH:
(B) TYPE: amino acid
TOPOLOGY: linear
(2) INFORMATION FOR SEQ ID NO:6:
(i) SEQUENCE CHARACTERISTICS:
(xi)
(xi)
(A) LENGTH: 42 bases
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY:
linear
SEQ ID NO:5:
SEQUENCE DESCRIPTION: SEQ ID NO:6:
Val Asn Gly Gly Glu
GCTGAGAAGG AGAAGACCTT CTGTCGTGAA TCGGACGGCG AG 42
INFORMATION FOR SEQ ID NO:7:
SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2199 bases
(B) TYPE: nucleic acid
(C) STRANDEDNESS:
(D) TOPOLOGY:
single
linear
SEQUENCE DESCRIPTION: SEQ ID NO:7:
CCA
Pro
TCC
Ser
GGC
Gly
GAC
Asp
1
AAC
Asn
GTC
Val
Lys
40
AAG
Lys
CCT
Pro
AAA
Lys
TCG
Ser
GGG
Gly
CCG
Pro
CCC
Pro
CTT
Leu
CCT
Pro
GGC
Gly
AAG
Lys
GAG
Glu
55
CAA
Gln
TTC
Phe
GCG
Ala
GAG
Glu
GGC
Gly
TCC
Ser
TTG
Leu
CCA
Pro
CCG
Pro
ATG
Met
AAG
Lys
GCG
Ala
Lys
70
AAC
Asn
AGA
Are
TCC
Ser
AAG
Lys
45
GCG
Ala
GAG
Glu
CCG
Pro
GCC
Ala
GAG
Glu
AAG
Lys
GGC
Gly
ATG
Met
ATC
Ile
GTC
Val
CGC
Arg
GAG
Glu
AGC
Ser
CGA
Arg
CGC
Arg
Lys
CGA
Arg
CAG
Gln
AGC
Ser
GCC
Ala
GTA
Val
GAA
Glu
GGC
Gly
AGC
Ser
CAG
Gln
75
CTT
Leu
GAG
Glu
GGC
Gly
TCC
ser
50
GAA
Glu
GGA 38
CGC 77
Arg
AGA 116
Arg
GGC
Gly
GCC
Ala
TCG
Ser
GCT
Ala
TCT
Ser
GGG
Gly
ATG
Met
TCT
Ser
ACT
Thr
195
GTG
Val
AAC
Asn
ACT
Thr
TAC
Tyr
260
GGT
Gly
80
GCA
Ala
TAC
Tyr
GAA
Glu
GAA
Glu
AAT
Asn
105
Ile Gln
Lys Ala
ATC
Ile
145
GTG
Val
ACC
Thr
ATC
Ile
GCC
Ala
CCA
Pro
170
CCC
Pro
ATT
Ile
TCA
Ser
TCT
Ser
GTA
Val
210
CTT
Leu
GGA
Gly
AAT
Asn
TCG
Ser
CCC
Pro
235
GAT
Asp
GGT
Gly
CAT
His
AAG
Lys
CTG TA: CAG AAG AGA GTG CTG ACC ATA ACC sec ATC TGC 857
TCC
Ser
TCC
Ser
TTG
Leu
Lys
120
TCA
Ser
AGC
Ser
ATC
Ile
TCA
Ser
AGA
Arg
185
TCT
Ser
GGG
Gly
AGA
Arg
CGC
Arg
250
CTT
Leu
TCT
Ser
95
AAT
Asn
AAG
Lys
CTG
Leu
GTG
Val
160
ACT
Thr
ATA
Ile
ACA
Thr
TGT
Cys
GAG
Glu
225
TAC
Tyr
TGC
Cys
GGG
Gly
CTA
Leu
CTC
Leu
CGA
Arg
CCA
Pro
GCT
Ala
135
GAA
Glu
GAA
Glu
TCA
Ser
TCT
Ser
200
TCC
Cys
CAA
Gln
ATT
Ile
265
GTC
Val
AGA
Arg
AAA
Lys
110
GAT
Asp
GGA
Gly
TCA
Ser
GGA
Gly
175
GTA
Val
ACA
Thr
GAG
Glu
TTC
Phe
TGC
Cys
240
GAA
Glu
CTT
Leu
85
TTC
Phe
AAC
Asn
TCT
Ser
AAT
Asn
150
AAC
Asn
GCA
Ala
TCC
Ser
TCC
Ser
AAG
Lys
215
ATG
Met
AAG
Lys
TAC
Tyr
TTT
Phe
CGG
Arg
AAG
Lys
TCA
Ser
125
GGA
Gly
GAC
Asp
GAG
Glu
TAT
Tyr
ACA
Thr
190
ACC
Thr
GAG
Glu
GTG
Val
TGC
Cys
GTA
Val
255
ATG
Met
TGT
Cys
TGG
Trp
100
CCA
Pro
GAG
Glu
AGT
Ser
ATC
Ile
165
GTG
Val
GAA
Glu
ACT
Thr
Lys
230
ATG
Met
GAG
Glu
GAA
Glu
TTC
Phe
CAA
Gln
CTT
Leu
TAT
Tyr
140
GCC
Ala
ATC
Ile
TCT
Ser
GGA
Gly
GGG
Gly
205
ACT
Thr
GAC
Asp
AAT
Asn
GCC
Ala
GCG
Ala
270
ACC
Thr
AAG
Lys
AAT
Asn
115
CGC
Arg
ATG
Met
TCT
Ser
ACT
Thr
TCA
Ser
180
GCA
Ala
ACA
Thr
TTC
Phe
CTT
Leu
GAG
Glu
245
GAG
Glu
AGT
Ser
90
AAT
Asn
ATC
Ile
ATT
Ile
TGC
Cys
GCC
Ala
155
GGT
Gly
GAG
Glu
AAT
Asn
AGC
Ser
TGT
Cys
220
TCA
Ser
TTC
Phe
GAG
Glu
Tyr Gln Lys Arg Val Leu Thr Ile Thr Gly Ile Cys
ATC
Ile
TAC
Tyr
CGT
Arg
ATG
Met
325
CCC
Pro
GAG
Glu
CAT
His
Trp
390
CAC
His
GGC
Gly
CAT
His
CAT
His
455
CGT
Arg
GCC
Ala
TGC
Cys
300
CTT
Leu
AAC
Asn
GAG
Glu
GTC
Val
ACA
Thr
365
CAC
His
AGC
Ser
TCT
Ser
AGC
Ser
ACA
Thr
430
GCC
Ala
AGT
Ser
ATG
Met
TCG
Ser
CTC
Leu
CGG
Arg
ATT
Ile
AAT
Asn
340
ATC
Ile
TCC
Ser
TCC
Ser
AAC
Asn
GTA
Val
405
AGC
Ser
GGA
Gly
AGA
Arg
GAA
Glu
CA
Ser
470
an
»L.
CTT
Leu
ACC
Thr
CAG
Gln
315
GCC
Ala
GTC
Val
TCC
Ser
TTT
Phe
ACT
Thr
380
GGA
Gly
ATC
Ile
CCA
Pro
GGC
Gly
GAA
Glu
445
AGG
Arg
CCT
Pro
CCT
Pro
GTG
Val
290
AAG
Lys
AGC
Ser
AAT
Asn
CAG
Gln
AGT
Ser
355
TCC
Ser
ACT
Thr
CAC
His
GTG
Val
ACT
Thr
420
CCT
Pro
ACC
Thr
TAT
TYI
GTA
Val
TCG
Ser
485
GTC
Val
CTT
Leu
GGG
Gly
330
CTG
Leu
GAG
Glu
ACC
Thr
GTC
Val
ACT
Thr
395
ATG
Met
GGG
Gly
CGT
Arg
CCT
Pro
GTG
Val
460
GAT
Asp
GAA
Glu
GGC
Gly
CAG
Gln
305
CGG
Arg
CCT
Pro
GTG
Val
CAT
His
AGT
Ser
370
ACC
Thr
GAA
Glu
TCA
Ser
GGC
Gly
GAA
Glu
435
GAT
Asp
TCA
Ser
TTC
Phe
ATG
Met
ATC
Ile
CGG
Arg
TCT
Ser
CAC
His
AAT
Asn
345
ATT
Ile
CAC
His
CAG
Gln
AGC
Ser
TCC
Ser
410
CCA
Pro
TGT
Cys
TCC
Ser
GCC
Ala
CAC
His
475
TCT
S61’
ATG
Met
GAA
Glu
320
CAT
His
CAA
Gln
GTT
Val
TAT
Tyr
ACT
Thr
385
ATC
Ile
GTA
Val
AGA
Arg
AAC
Asn
Tyr
450
ATG
Met
ACG
Thr
CCA
Pro
TGT
Cys
295
AAG
Lys
CGA
Arg
CCT
Pro
TAC
Tyr
GAG
Glu
360
ACT
Thr
CCT
Pro
CTT
Leu
GAA
Glu
GGA
Gly
425
AGC
Ser
CGA
Arg
ACC
Thr
CCA
Pro
CCC
Pro
490
GTG
Val
CTG
Leu
AAC
ASH
AAC
Asn
335
GTA
Val
AGA
Arg
TCC
Ser
AGC
Ser
TCC
Ser
400
AAC
Asn
CGT
Arg
TTC
Phe
GAC
Asp
ACC
Thr
465
GTG
Val
GTG
Val
CAT
His
310
AAT
Asn
CCA
Pro
TCT
Ser
GAA
Glu
ACA
Thr
375
CAC
His
GAA
Glu
AGT
Ser
CTT
Leu
CTC
Leu
440
CCG
Pro
TCC
Ser
TCC
Ser
GCC
Ala
GAC
Asp
ATG
Met
CCC
Pro
Lys
350
GCA
Ala
GCC
Ala
AGC
Ser
AGC
Ser
AGG
Arg
415
AAT
Asn
AGG
Arg
CCT
Pro
GCT
Ala
CCC
Pro
480
AGC
Ser
ATG
Met
ATG
Met
AGG
Arg
520
AGC
Ser
GAG
Glu
CCT
Pro
ACC
Thr
585
GAC
Asp
GAA
Glu
CTG
Leu
ACA
Thr
GCA
Ala
650
CTG
Leu
ACG
Thr
495
GAA
Glu
CTG
Leu
AGC
Ser
CTC
Leu
Tyr
560
GTT
Val
AAG
Lys
AGC
Ser
ACA
Thr
GGC
Gly
625
GGC
Gly
TCT
Ser
GTG
Val
GAA
Glu
CGG
Arg
TCC
Ser
535
CCT
Pro
GAA
Glu
AAG
Lys
CCC
Pro
AAC
Asn
600
GAA
Glu
ATA
Ile
GCC
Ala
CGC
Arg
AGT
Ser
665
TCC
Ser
GAG
Glu
S10
GAG
Glu
TTC
Phe
GCT
Ala
ACG
Thr
Lys
575
AAT
Asn
ACA
Thr
GAT
Asp
CAG
Gln
TTC
Phe
640
ATG
Met
AGA
Arg
AAG
Lys
CAC
His
AGC
Ser
550
ACC
Thr
CTC
Leu
GGC
Gly
AGC
Ser
GAA
Glu
615
CGC
Arg
TCG
Ser
ATT
Ile
CCT
Pro
CCT
Pro
AAG
Lys
525
CAC
His
CCC
Pro
CAA
Gln
GCC
Ala
CAC
His
590
TCC
Ser
AGA
Arg
CCC
Pro
CTG
Leu
ACA
Thr
655
GCT
Ala
A AACCTAAATA AACACATAGA
TCC
Ser
500
CTA
Leu
TTT
Phe
AAC
Asn
TTG
Leu
GAG
Glu
565
AAT
Asn
ATT
Ile
CAG
Gln
GTA
Val
CTG
Leu
630
GCT
Ala
CAG
Gln
AAC
Asn
ATG
Met
CTT
Leu
GAC
Asp
CCC
Pro
540
AGG
Arg
TAC
Tyr
AGC
Ser
GCT
Ala
AGC
Ser
605
GGT
Gly
GCA
Ala
GAC
Asp
GAA
Glu
CAA
Gln
670
GCG
Ala
CTC
Leu
515
CAT
His
GCG
Ala
ATA
Ile
GAG
Glu
CGG
Arg
580
AAC
Asn
AGT
Ser
GAA
Glu
GCC
Ala
AGC
Ser
645
GAA
Glu
GAC
Asp
GTC
Val
GTG
Val
CAC
His
CAT
His
GTG
Val
S55
CCA
Pro
CGG
Arg
AGA
Arg
AAC
Asn
GAT
Asp
620
AGG
Arg
ATC
Ile
CCT
Pro
AGC
Ser
ACA
Thr
CCT
Pro
530
GAC
Asp
GAG
Glu
GCC
Ala
GCC
Ala
Leu
595
TCA
Ser
ACG
Thr
CTT
Leu
ACT
Thr
CAG
Gln
660
ATT
Ile
CCC
Pro
505
CCA
Pro
CAG
Gln
ACT
Ser
GAT
Asp
CAA
Gln
570
GAA
Glu
GAG
Glu
CCT
Pro
GAG
Glu
635
GCC
Ala
GCT
Ala
TTC
Phe
CCA
Pro
CAG
Gln
AAC
Asn
545
GAG
Glu
GAG
Glu
AGA
Arg
GTG
Val
AGT
Ser
610
TTC
Phe
GCA
Ala
CCA
Pro
AGG
Arg
GTA
Val
675
TTCACCTGTA AAACTTTATT 2070
TA 2029
TTATATAATA AAGTATTCCA CCTTAAATTA AACAATTTAT TTTATTTTAG 2120
CAGTTCTGCA
‘TAGAAAAC AGGAAAAAAA CTTTTATAAA TTAAATATAT 21
(2)
GTATGTAAAA ATGAAAAAAA AAAAAAAAA 2199
INFORMATION FOR SEQ ID NO:8:
(i) SEQUENCE CHARACTERISTICS:
(B) TYPE: amino acid
(D) TOPOLOGY: linear
Pro Gln Arg Gly
Asn
Val
Glu
50
Gly
65
Ser
95
Leu
110
Asn
125
Glu
140
Met
155
Asn
170
Ala
185
Ile
200
Thr
215
Glu
230
Asp
245
SEQUENCE DESCRIPTION: SEQ
LENGTH: 669 amino acids
ID NO:8:
Ser Leu Ser
Leu
Ser
40
Gly
55
Ser
70
Glu
85
Arg
100
Phe
115
Ile
130
Lys
145
Ser
160
Glu
175
Ala
190
Glu
205
Thr
220
Val
235
Ser
250
-U-'9
Asp
Pro
Met
45
Lys
60
Gln
75
Glu
90
Ser
105
Glu
120
Lys
135
Asp
150
Asp
165
Ile
180
Glu
195
Ser
210
Lys
225
Cys
240
Lys
245
Asn "
Val S
Phe
260
Asn
275
Ile
290
Val
305
Thr
Gln
Thr
Val
Arg
Ile
Gln
His
Tyr
Ser
ser
Gly
TI‘P
Cys
265
Glu
280
Ile
295
Lys
310
Ser
325
Pro
340
Gln
355
Arg
370
Ala
385
Ser
400
Ile
415
Gly
430
Cys
445
Arg
460
Pro
475
Lys
490
Val
505
Arg
520
Phe
535
Ala
550
Glu
565
Leu "
Pro
270
Lys
285
Val
300
Arg
315
Arg
330
Pro
345
Asn
360
Ser
375
Thr
390
Glu
405
Val
420
Arg
435
Arg
45:-
Ser
465
Pro
480
Glu
495
Met
Thr
Tyr
575
Arg
590
Glu
605
Glu
620
Ile
635
Arg
650
Gln
665
Ala
(2) INFORMATION FOR SEQ ID NO:9:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 732 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY:
linear
(xi) SEQUENCE DESCRIPTION:
Lys Leu Phe Pro Asn Pro
Pro
Glu
Arg
Pro
Ala
Tyr
Asn
Gly
Gly
Ala
Ala
Lys
Ser
50
Leu
65
Gly
80
Ser
95
Lys
110
Ser
125
Tyr
140
Ala
155
SEQ ID NO:9:
Gln
669
Glu
580
Lys
595
Thr
610
Arg
625
Ala
640
Arg
655
Ala
Arg
Lys
40
Glu
55
Lys
70
Leu
85
Trp
100
Asn
115
Asn
130
Ile
145
Val
160
Lys
Gly
Gln
Glu
Leu
Pro
Ser
Gly
Ala
Lys
Glu
Asn
Ile
Ser
Leu
Asn
Ser ”
Leu
585
Ile
600
Ser
615
Thr
630
Ala
645
Gly
660
Ser
Glu
Lys
45
Ser
60
Gln
75
Ser
90
Asn
105
Lys
120
Ala
135
Asn
150
Ile
1
Glu
_ Met
Arg
185
Ser
200
Lys
215
Lys
230
Glu
245
Tyr
260
Gln
275
Val
290
Gln
305
Va 1
-
Asn
335
Lys
350
Thr
3 65
Thr
380
Thr
39 5
Ser
410
Gly
425
Leu
440
His
455
Ser
470
Ser
485
Thr
19 0
Gly
205
Cys
220
Pro
23 5
Arg
250
Ile
265
Thr
280
Cys
295
His
310
Met
325
Asn
340
Ser
355
Ser
370
Thr
3 85
Ser
400
Arg
415
Thr
43 O
Glu
445
Val
460
A1-'9
Thr
195
Val
210
Glu
225
Cys
240
Val
255
Ala
270
Cys
285
Cys
300
Gln
315
Gly
330
Asn
345
Glu
360
Thr
375
Trp
390
Val
405
Thr
42 O
Glu
435
Tyr
450
Thr
465
Pro
4
73
Val Ser Met Pro Ser Met Ala Val Ser Pro Phe Met Glu Glu Glu
500 505 510
Arg Pro Leu Leu Leu Val Thr Pro Pro Arg Leu Arg Glu Lys Lys
515 520 525
Phe Asp His His Pro Gln Gln Phe Ser Ser Phe His His Asn Pro
530 535 540
Ala His Asp Ser Asn Ser Leu Pro Ala Ser Pro Leu Arg Ile Val
545 550 555
Glu Asp Glu Glu Tyr Glu Thr Thr Gln Glu Tyr Glu Pro Ala Gln
560 565 570
Glu Pro Val Lys Lys Leu Ala Asn Ser Arg Arg Ala Lys Arg Thr
575 580 585
Lys Pro Asn Gly His Ile Ala Asn Arg Leu Glu Val Asp Ser Asn
590 595 600
Thr Ser Ser Gln Ser Ser Asn Ser Glu Ser Glu Thr Glu Asp Glu
605 610 615
Arg Val Gly Glu Asp Thr Pro Phe Leu Gly Ile Gln Asn Pro Leu
620 625 630
Ala Ala Ser Leu Glu Ala Thr Pro Ala Phe Arg Leu Ala Asp Ser
635 640 645
Arg Thr Asn Pro Ala Gly Arg Phe Ser Thr Gln Glu Glu Ile Gln
650 655 660
Ala Arg Leu Ser Ser Val Ile Ala Asn Gln Asp Pro Ile Ala Val
665 670 675
Xaa Asn Leu Asn Lys His Ile Asp Ser Pro Val Lys Leu Tyr Phe
680 685 690
Ile Xaa Xaa Ser Ile Pro Pro Xaa Ile Lys Gln Phe Ile Leu Phe
695 700 705
Xaa Gln Phe Cys Lys Xaa Lys Thr Gly Lys Lys Leu Leu Xaa Ile
710 715 720
Lys Tyr Met Tyr Val Lys Met Lys Lys Lys Lys Lys
725 730 732
(2) INFORMATION FOR SEQ ID NO:10:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 66 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:
Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val
1 5 10 15
Asn Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser
2; 25
Arg Tyr Leu Cys Lys
Thr Glu Asn Val Pro
50
Glu Leu Tyr Gln Lys Arg
65
(2) INFORMATION FOR SEQ ID NO:ll:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 71 amino a
(B) TYPE: amino acid
(D) TOPOLOGY: linear
cids
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1l:
Ser His Leu Val Lys
Asn Gly Gly Glu Cys
Arg Tyr Leu Cys Lys
Gln Asn Tyr Val Met
Phe Met Glu Ala Glu
Cys Ala
Phe
Cys Pro
Ala Ser
50
Glu Leu
Met Val Lys
Glu Lys
Asp
Phe
40
Asn Glu
Phe Tyr Lys
Tyr Gln Lys
(2) INFORMATION FOR SEQ ID NO:l2:
(i) SEQUENCE CHARACTERISTICS:
(xi)
(A) LENGTH: 2010 bases
(B) TYPE: nucleic acid
(C) STRANDEDNESS: sing
(D) TOPOLOGY: linear
SEQUENCE DESCRIPTION:
GGGCGCGAGC GCCTCAGCGC
CTTTTCCCAA ACCCGATCCG
AGCCGTCCGC GTAGAGCGCT
AAGGCAGAGG CAAAGGGAAG
AAGCCGGAGT cccccccssc
GAAAGAGATG AAAAGCCAGG
GGTGTGAAAC CAGTTCTGAA
SEQ ID NO:1
GGCCGCTCGC
AGCCCTTGGA
CCGTCTCCGG
GGCAAGAAGA
CAGCCAGAGC
AATCGGCTGC
TACTCCTCTC
Glu Lys
Arg
71
TCTCCCCCTC
CCAAACTCGC
CGAGATGTCC
AGGAGCGAGG
CCAGCCTTGC
AGGTTCCAAA
TCAGATTCAA
Cys Gln Pro Gly Phe Thr Gly Ala Arg Cys
Met Lys Val Gln Asn Gln Glu Lys Ala Glu
Thr Phe Cys Val
Ser Asn Pro Ser
Gly Asp Arg Cys
Leu Gly Ile Glu
GAGGGACAAA
CTGCGCCGAG
GAGCGCAAAG
CTCCGGCAAG
CTCCCCGATT
CTAGTCCTTC
GTGGTTCAAG
AATGGGAATG
AAAAAAGCCA
ATTCTGGAGA
GCCTCTGCCA
GCCAGCCTCA
TATCAGTATC
ACCACTGGGA
CTGTGTGAAT
CGAGATACTT
GAGAATGTGC
CCAGAAGAGA
TCGGCATCAT
AAGCTGCATG
GATGAACATT
TCCAGCTGGT
ATTGTTGAGA
CACAGCCCAT
GCAACGGACA
ATGTCATCCG
AGGACGTCTT
GGCATGCCAG
AATTGAATCG
GGGAAGTCAG
GTATATGTGC
ATATCACCAT
ACTGAAGGAG
CACAGAAGGA
CAAGCCATCT
GGAGGGGAGT
GTGCAAGTGC
CCATGAAAGT
GTGOTGACCA
GTGTGTGGTG
ACCGTCTTCG
GCCAATGGGC
GAATCAATAC
GAGAAGCAGA
CACTCCACTA
CACTGAAAGC
TAGAAAACAG
AATGGCACAG
AGAAACCCCT
AAAAAACAAA
AACTTCGCAT
AAAGTGATCA
CGTGGAATCA
CATATGTGTC
GCAAATACTT
TGTAAAATGT
GCTTCATGGT
CAACCTGGAT
CCAAAACCAA
TAACCGGCAT
GCCTACTCCA
GCAGAGCCT
CTCACCATCC
GTATCTAAAA
GACATCCTTT
CTGTCACCCA
ATCCTTTCCG
TAGGCACAGC
GAGGCCCTCG
GATTCCTACC
CCACAAAATA
TAACAAAGCA
GCAAATTAGG
AACGAGATCA
TTCAGAGTCT
CTTCATCTAC
GCGGAGAAGG
GAAAGACCTT
TCACTGGAGC
GAAAAGGCGG
CTGCATCGCC
AAACCAAGAA
CGGTCTGAAC
TAACCCACCC
ACGTCATCTC
TCCACCAGTC
GACTCCTAGC
AAAGCCACTC
AGCCCAACTG
TGAATGTAAC
GAGACTCTCC
TCAAGATACA
TCACTGGCTG
AAATGACAGT
TCACTGGTAT
CCCATTAGAA
ATCTACATCC
AGAAAACTTT
TCAAACCCCT
AAGATGTACT
AGGAGCTGTA
CTCCTTGTGG
ACAGCGGAAA
GAAACAATAT
CCCGAGAATG
CAGTGAGCAT
ACTATACTTC
CACAGCTGGA
TGTAATCGTG
GGGGCCCAAG
AGCTTCCTCA
TCATAGTGAA
AGGTATGTGT
CCACACGCCA
TGTCCAGCAT
GAAGAAGAGA
GAAGTTTGAC
CGCATGACAG
GAGGAGTATG
GAAACTCGCC
TTGCTAACAG
TCAGAGAGTG
GGGCATACAG
GCCTGGCTGA
GAAATCCAGG
CAGCCATGAC
AGCTCCCCCA
GACGGTGTCC
GACCTCTACT
CATCACCCTC
TAACAGCCTC
AAACGACCCA
AATAGCCGGC
ATTGGAAGTG
AAACAGAAGA
AACCCCCTGG
CAGCAGGACT
CACCCCGGCT
AATCGCCCCC
ATGCCTTCCA
TCTCGTGACA
AGCAGTTCAG
CCTGCTAGCC
AGAGTACGAG
GGGCCAAAAG
GACAGCAACA
TGAAAGAGTA
CAGCCAGTCT
AACCCAGCAG
(2) INFORMATION FOR SEQ ID NO:13:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 669 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ
CGTATGTCAC
TTCGGAAATG
TGGCGGTCAG
CCACCAAGGC
CTCCTTCCAC
CCTTGAGGAT
CCAGCCCAAG
AACCAAGCCC
CAAGCTCCCA
GGTGAAGATA
TGAGGCAACA
GCCGCTTCTC
ID NO:l3:
Ser Leu Ser Pro Ser Arg
Arg Ala Pro
Leu Phe Pro
Pro Arg Ala
Glu Arg Lys
Arg Gly Ser
Arg Gly Arg
Pro Ile Arg
Arg Val Glu
Gly Arg Gly
Lys Lys Pro
Leu
Gly Pro Asn Ser
Val Ser Gly Glu
Lys Gly Lys Lys
Ala Ala Gly Ser
CTGTAGATTT
TCTCCACCCG
CCCCTTCATG
TGCGGGAGAA
CACAACCCCG
AGTGGAGGAT
AGCCTGTTAA
AATGGCCACA
GAGCAGTAAC
CGCCTTTCCT
CCTGCCTTCC
GACACAGGAA
Asp
Pro
Met
45
Lys
60
Gln
75
Ser
Ly s
Pro
80
Ser
95
Leu
110
Asn
125
Glu
140
Met
155
Asn
170
Ala
185
Ile
200
Thr
215
Glu
230
Asp
245
Phe
260
Asn
275
Ile
290
Val
305
Asp
320
Asn
335
Val
350
Glu
365
His
380
Glu
85
Arg
100
Phe
115
Ile
130
Lys
145
Ser
160
Glu
175
Ala
190
Glu
205
Thr
22 0
Val
235
Ser
250
Cys
265
Glu
280
Ile
295
Lys
310
Ser
325
Pro
3 40
Gln
355
Arg
370
Ala
385
Ly s
Ly s
Glu
90
Ser
105
Glu
120
Lys
135
Asp
150
Asp
165
.65
Glu
195
Ser
210
Lys
225
Cys
240
Lys
255
Pro
270
Lys
285
Val
300
Arg
315
Arg
330
Pro
345
Asn
360
Ser
375
Thr
390
(2) INFORMATION FOR SEQ ID NO:l4:
Leu Ser
Ser Arg
Gly Thr
Arg Glu
Tyr Val
His
Pro Pro
Ser Pro
Pro Arg
Ser Ser
Ala Ser
Gln Glu
Ser Arg
Arg Leu
Glu Ser
Leu Gly
Ala Phe
Ser Thr
Glu
410
His
425
Gly
440
Thr
455
Ser
470
Thr
485
Val
500
Phe
S15
Leu
530
Pro
560
Tyr
575
Arg
590
Glu
605
Glu
620
Arg
650
Gln
FF
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH:
(B) TYPE: amino acid
(D) TOPOLOGI:
linear
Gln
669
amino acids
Ile
415
Gly
430
Cys
445
Arg
460
Pro
475
Lys
490
Val
505
Arg
520
Phe
535
Ala
550
Glu
565
Glu
S80
Lys
595
Thr
610
Arg
625
Ala
640
Arg
655
val
420
Arg
435
Arg
450
Ser
465
Gln
S40
Ser
555
Glu
570
Leu
585
Ile
600
Ser
615
Thr
630
Ala
645
Gly
6
79
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l4:
His Leu Val Lys Cys Ala Glu Lys Glu Lys
1 5 10
Asn Gly Gly Glu Cys Phe Met Val Lys Asp Leu
25
Arg Tyr Leu Cys Lys Cys Gln Pro Gly Phe Thr
40
Thr Glu Asn Val Pro Met Lys Val Gln Asn Gln
50 55
Glu Leu Tyr Gln Lys Arg Val Leu Thr Ile Thr
65 70
Ala Leu Leu Val Val Gly Ile Met Cys Val Val
80 85
Thr Lys Lys Gln Arg
95
(2) INFORMATION FOR SEQ ID NO:l5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 91 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:
Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp
1 5 10
His Asp Gly Val Cys Met Tyr Ile Glu Ala Leu
25
Cys Asn Cys Val Val Gly Tyr Ile Gly Glu Arg
40
Asp Leu Lys Trp Trp Glu Leu Arg His Ala Gly
50 55
Gln Lys Val Ile Val Val Ala Val Cys Val Val
65 70
Leu Leu Leu Leu Ser Leu Trp Gly Ala His Tyr
80 85
Lys
91
INFORMATION FOR SEQ ID NO:l6:
(i) SEQUENCE CHARACTERISTICS:
(A)
(B)
(D)
TYPE: amino acid
TOPOLOGY: linear
(Xi) SEQUENCE DESCRIPTION:
LENGTH: 82 amino acids
SEQ ID NO:16:
Val
Ser
Cys
45
Glu
60
Ile
75
Lys
90
Leu
Ala
Arg
45
Gln
60
Met
75
Gln
90
Asn Asp Cys Pro Asp Ser His Thr Gln Phe Cys Phe His Gly Thr
Cys Arg Phe Leu Val Gln Glu Asp Lys Pro Ala
25
Ser Gly Tyr Val Gly Ala Arg Cys Glu His Ala
40
Val Val Ala Ala Ser Gln Lys Lys Gln Ala Ile
50 55
Val Val Ser Ile Val Ala Leu Ala Val Leu Ile
65 70
Leu Ile His Cys Cys Gln Val
80 82
(2) INFORMATION FOR SEQ ID NO:l7:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 87 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l7:
Lys Lys Lys Asn Pro Cys Asn Ala Glu Phe Gln
1 5 10
His Gly Glu Cys Lys Tyr Ile Glu His Leu Glu
25
Lys Cys Gln Gln Glu Tyr Phe Gly Glu Arg Cys
40
Met Lys Thr His Ser Met Ile Asp Ser Ser Leu
50 55
Leu Ala Ala Ile Ala Ala Phe Met Ser Ala Val
65 70
Val Ala Val Ile Thr Val Gln Leu Arg Arg Gln
80 85
(2) INFORMATION FOR SEQ ID NO:l8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 87 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l8:
Lys
1 _ 5
Gly Glu Cys Arg
His
His
Lys Thr Gln Lys
55
Cys His Gln Asp Tyr Phe Gly Glu Arg _
‘ys Asp Asp Ser As
Lys Lys Asn Pro Cys Ala Ala Lys Phe Gln
Tyr Ile Glu Asn Leu Glu
Cvs
40
(11 'u
Tyr
87
His
Ala
45
Val
60
Val
75
Ile
Cys
Ser
45
Ala
75
Ile
Cys
Thr
45
Leu Ala Ala Ile Ile Val Phe Val Ser Ala Val Ser Val Ala Ala
65 70 75
Ile Gly Ile Ile Thr Ala Val Leu Leu Arg Lys Arg
80 85 87
(2) INFORMATION FOR SEQ ID NO:l9:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 86 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l9:
Lys Lys Arg Asp Pro Cys Leu Arg Lys Tyr Lys Asp Phe Cys Ile
1 ' 5 10 15
His Gly Glu Cys Lys Tyr Val Lys Glu Leu Arg Ala Pro Ser Cys
25 30
Ile Cys His Pro Gly Tyr His Gly Glu Arg Cys His Gly Leu Ser
40 45
Leu Pro Val Glu Asn Arg Leu Tyr Thr Tyr Asp His Thr Thr Ile
50 55 60
Leu Ala Val Val Ala Val Val Leu Ser Ser Val Cys Leu Leu Val
65 70 75
Ile Val Gly Leu Leu Met Phe Arg Tyr His Arg
80 85 86
(2) INFORMATION FOR SEQ ID NO:20:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:
Arg Pro Asn Ala Arg Leu Pro Pro Gly Val Phe Tyr Cys
1 5 10 13
(2) INFORMATION FOR SEQ ID NO:2l:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 bases
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2l:
CCTCGCTCCT TCTTCTTGCC CTTCC
82
(2) INFORMATION FOR SEQ ID NO:22:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 496 bases
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(xi)
SEQUENCE DESCRIPTION: SEQ ID NO:22:
AA AGA
Arg
1
GGG
Gly
TCC
Ser
CCG
Pro
CTG
Leu
CCC
Pro
4 0
TCT
Ser
CGC
Arg
GGA
Gly
CCG
Pro
65
GTT
Val
CCC
Pro
CTC
Leu
GCG
Ala
GCG
Ala
5
CGA
Arg
Asp Lys
TCG
Ser
AAC
Asn
130
TCC
Ser
GTC .
Val
Lys Gly
GCC
Ala
CTC
Leu
GAC
Asp
CGA
Arg
GAG
Glu
CGA
Arg
CCA
Pro
8 O
CC'I'
Pro
CCT
Pro
CTT
Leu
CCT
Pro
GGC
Gly
145
AAG
Lys
GGC GAG GAG TTC
GCG
Ala
GAT
Asp
TCG
Ser
CCG
Pro
5
CAG
Gln
GGT
Gly
GCA
Ala
CAG
Gln
TTC
Phe
l2 0
GCG
Ala
GAG
Glu
GGC
G1‘;
CGG
Arg
GGG
G 1y
3 O
GGT
G ly
CCA
Pro
GAG
Glu
GGC
Gly
GGC
Gly
9 5
CGC
Arg
CCA
Pro
CCG
Pro
ATG
Met
AAG
Lys
1 6 O
Glu
AGG
Arg
AGC
Ser
TGC
Cys
GCG
Ala
CAG
Gln
7 0
cat;
Arg
AAC
Asn
GGC
Gly
AAC
Asn
AGA
Arg
13 5
AAG
Ly s
CCA
Pro
GTG
Val
GAG
Glu
45
GTG
Val
CCC
Pro
ACC
Thr
GGG
Gly
CGC
Arg
110
GCC
Ala
GAG
Glu
AAG
Lys
CCC
Pro
GGA
Gly
2 0
AGC
Ser
GGC
Gly
GGA
Gly
CGA
Arg
GCC
Ala
85
AGA
Arg
TCG
Ser
ATC
Ile
GTC
Val
CGC
Arg
0
GAG
Glu
GAA
Glu
GCT
Ala
AGG
Arg
GCC
Ala
CCC
Pro
GAG
Glu
CGC
Arg
CGC
Arg
CTC
Leu
CGA
Arg
125
CGC
Arg
cc».
Arg
164
ACT
Thr
GAG
Glu
ACG
Thr
3 5
GGG
Gly
ATC
Ile
CCA
Pro
CGC
Arg
CCC
Pro
0
TCC
Ser
GCC
Ala
GTA
Val
GAA
Glu
TGT
Cys
CGG
Arg
GTG
Val
CAG
Gln
GAC
Asp
GGG
Gly
75
GTC
Val
CGC
Arg
CCC
Pro
CTT
Leu
GAG
Glu
140
GGC
Gly
GG 496
TI‘?
ATA
I la
AGG
Arg
0
GAC
Asp
CGA
Arg
CGC
Arg
GCA
Ala
TCG
Ser
115
GGA
Gly
ccc
Arg
AGA
Arg
CGG
Arg
ACC
Thr
CCA
Pro
TTC
Phe
GCG
Ala
GCC
Ala
9 0
GCG
Ala
AGG
Arg
CCA
Pro
TCC
Ser
GGC
Gly
1
77
(2)
(i)
(xi)
INFORMATION FOR SEQ ID NO:23:
SEQUENCE CHARACTERISTICS:
(A)
(B)
(C)
(D) TOPOLOG
Y: linear
LENGTH: 2490 bases
TYPE: nucleic acid
STRANDEDNESS: single
SEQUENCE DESCRIPTION: SEQ ID NO:23:
GTGGCTGCGG
GTTGGAACTC
CCGGACGATG
TTGCGAGGGC
CCATCGACGA
GCGCCCGTTC
TGCAGGCAAC
GCTCGCTCTC
CTTGGACCAA
CTCCGGCGAG
GGG
Gly
AAG
Lys
GGC
Gly
CCG
Pro
GAG
Glu
TCC
Ser
CCC
Pro
CAA
Gln
TTG
Leu
GGT
Gly
50
.Ser Lys
TAC TCC
Ser Ser
GGCAATTGAA
CGGGCTCGCG
GGAGCGTGAG
GCCGGGCAGA
CTTCCCGGGG
CAGGTGGCCG
GGGAGACGCC
CCCATCGAGG
ACTCGCCTGC
ATG
Met
TCC
Ser
AAG
Lys
AAG
Lys
GCG
Ala
GCG
Ala
GAG
Lys Glu
CTA
Leu
GTC
Val
CTC
Leu
AGA
Arg
GAG
Glu
AAG
Lys
GGC
Gly
ATG
Met
CTT
Leu
55
TTC
Phe
AAAGAGCCGG
CGGAGGCCAG
CAGGACGGTG
GGCCAGGACG
CGACAGGAGC
GACCGCCCGC
CCCGCGCAGC
GACAAACTTT
GCCGAGAGCC
Arg Lys
Glu Arg
AGC
Ser
CAG
Gln
Lys Ser
TGT
Cys
CGG
Arg
AAG
Lvs
TGG
Trp
70
CGC AAA GAA GGC
Glu
GAG CGA GGC
Gly
AGC
Ser
CAG
Gln
GAA
Glu
TTC
Phe
CGAGGAGTTC
GAGCTGAGCG
ATAACCTCTC
CGAGCCGCCA
AGCCCCGAGA
CGCGTCCGCG
GCGAGCGCCT
TCCCAAACCC
GTCCGCGTAG
Gly Arg
TCC
Ser
GGC
Gly
CCA
Pro
GCC
Ala
GAA
Glu
TCG
Ser
45
ACC
Thr
ACT
Ser
60
AAG
Lys
AAT
Asn
CCCGAAACTT
GCGGCGGCTG
CCCGATCGGG
GCGGCGGGAC
GCCAGGGCGA
CCGCGCTCCC
CAGCGCGGCC
GATCCGAGCC
AGCGCTCCGT
GGC AAA 490
Gly Lys
AAG S29
AAG
Lys
TTG S68
Leu
CCT
Pro
GCT 607
GCA
Ala
TCT
Ser
GAA 646
Glu
GGG
Gly
AAT 685
Asn
75
GAA
Glu
CAA
Gln
GCA
Ala
ATC
Ile
115
ACC
Thr
GCC
Ala
ATT
Ile
TCA
Ser
GTA
Val
180
GGA
Gly
TCG
Ser
GAT
Asp
GCG
Ala
GGC
Gly
245
GTG
Val
CTG
Leu
TTG
Leu
Lys
90
TCA
Ser
AGC
Ser
ATC
Ile
TCA
Ser
AGA
Arg
155
TCT
Ser
GGG
Gly
AGA
Arg
ccc
Arg
220
GAG
Glu
CAT
His
AAT
Asn
AAG
Lys
CTG
Leu
GTG
Val
130
ACT
Thr
ATA
Ile
ACA
Thr
TGT
Cys
GAG
Glu
195
TAC
Tyr
‘rec
Cys
GAG
Glu
TGC
Cys
GCC
Ala
260
GAC
Asp
CGA
Arg
CCA
Pro
GCT
Ala
105
TTA
Leu
GAA
Glu
TCA
Ser
TCT
Ser
170
GCG
Ala
TGC
Cys
TTG
Leu
CAA
Gln
CTG
Leu
235
ATC
Ile
TAC
Tyr
CGT
Arg
Lys
80
GGG
Gly
GAT
Asp
GGA
Gly
TCA
Ser
GGA
Gly
14S
GTA
Val
ACA
Thr
GAG
Glu
TTC
Phe
TCC
Cys
210
AAC
Asn
TAC
Tyr
GCC
Ala
TGC
Cys
CTT
Leu
275
AAC
Asn
AAG
Lys
TCT
Ser
AAT
Asn
120
AAC
Asn
GCA
Ala
TCC
Ser
TCC
Ser
AAG
Lys
185
ATG
Met
AAG
Lys
TAC
Tyr
CAG
Gln
CTC
Leu
250
CGG
Arg
TCA
Ser
95
GGA
Gly
GAC
Asp
GAG
Glu
TAT
Tyr
ACA
Thr
160
ACC
Thr
GAG
Glu
GTG
Val
TGC
Cys
GTA
Val
225
CTT
Leu
ACC
Thr
CAG
Gln
CCA
Pro
GAA
Glu
GAG
Glu
AGT
Ser
ATC
Ile
135
GTG
Val
GAA
Glu
ACT
Thr
Lys
200
CCA
Pro
ATG
Met
AGA
Arg
GTG
Val
AAG
Lys
265
AGC
Ser
CAA
Gln
CTT
Leu
Tyr
110
GCC
Ala
ATC
Ile
TCT
Ser
GGA
Gly
GGG
Gly
175
ACT
Thr
GAC
Asp
AAT
Asn
GCC
Ala
GTG
Val
240
GTC
Val
CTT
Leu
AAT ATC AAG ATA 724
Asn
85
CGC
Arg
ATG
Met
TCT
Ser
ACT
Thr
TCA
Ser
150
GCA
Ala
ACA
Thr
TTC
Phe
CTT
Leu
GAG
Glu
215
AGC
Ser
CTG
Leu
GGC
Gly
CAG
Gln
CGG
Arg
280
ATT
Ile
TGC
Cys
GCC
Ala
125
GGT
Gly
GAG
Glu
AAT
Asn
AGC
Ser
Cys
190
TCA
Ser
TTC
Phe
ACC
Thr
ATC
Ile
255
CGG
Arg
TCT
Ser
AAC
Asn
100
AAT
Asn
ATG
Met
TCT
Ser
ACT
Thr
165
GTG
Val
AAC
Asn
ACT
Thr
TAC
Tyr
230
ATA
Ile
ATG
Met
GAA
Glu
GTG
Val
ATC
Ile
CCA
Pro
140
CCC
Pro
TCT
Ser
CTT
Leu
AAT
Asn
CCC
Pro
205
GGT
Gly
AAG
Lys
ACC
Thr
TGT
Cys
AAG
Lys
270
CGA
Arg
GTA
Val
310
AGA
Arg
TCC
Ser
AGC
Ser
TCC
Ser
Asn
375
CGT
Arg
TTC
Phe
GAC
Asp
ACC
Thr
AGC
Ser
440
GTG
Val
AGC
Ser
ACA
Thr
AAT
Asn
285
CCA
Pro
TCT
Ser
GAA
Glu
ACA
Thr
CAC
His
350
GAA
Glu
AGT
Ser
CTT
Leu
CTC
Leu
TCT
Ser
415
CCG
Pro
TCC
Ser
TCC
Ser
CCC
Pro
CCA
Pro
480
ATG
Met
CCC
Pro
GCA
Ala
325
GCC
Ala
AGC
Ser
AGC
Ser
AGG
Arg
AAT
Asn
390
AGG
Arg
CCT
Pro
GCT
Ala
CCC
Pro
AGC
Ser
455
TTC
Phe
CCA ‘
Pro ‘
ATG
Met
CCC
Pro
300
AAC
Asn
GAG
Glu
CAT
His
TGG
Trp
CAC
His
365
CAC
GGC
Gly
CAT
His
CAT
His
CGT
Arg
430
ATG
Met
ATG
Met
AAC
Asn
GAG
Glu
GTC
Val
ACA
Thr
CAC
His
340
AGC
Ser
TCT
Ser
AGC
Ser
.Thr
GCC
Ala
405
AGT
Ser
ATG
Met
TCG
Ser
ACG
Thr
GAA
Glu
470
CTG
Leu
ATT
Ile
AAT
Asn
ATC
Ile
315
TCC
Ser
TCC
Ser
AAC
Asn
GTA
Val
AGC
Ser
380
GGA
Gly
AGA
Arg
GAA
Glu
TCA
Ser
CCC
Pro
445
GTG
Val
GAA
Glu
CGG
Arg
GCC
Ala
290
GTC
Val
TCC
Ser
TTT
Phe
ACT
Thr
GGA
Gly
355
ATC
Ile
CCA
Pro
GGC
Gly
GAA
Glu
AGG
Arg
420
CCT
Pro
CCT
Pro
TCC
Ser
GAG
Glu
GAG
Glu
485
AAT
Asn
CAG
Gln
AGT
Ser
TCC
Ser
330
ACT
Thr
CAC
His
GTG
Val
ACT
Thr
CCT
Pro
395
ACC
Thr
TAT
Tyr
GTA
Val
TCG
Ser
AAG
Lys
460
AGA
Arg
AAG
Lys
GGG
Gly
CTG
Leu
305
GAG
Glu
ACC
Thr
GTC
Val
ACT
Thr
ATG
Met
370
GGG
Gly
CGT
Arg
CCT
Pro
GTG
Val
GAT
Asp
435
GAA
Glu
CCT
Pro
CCT
Pro
AAG
Lys
CCT
Pro
GTG
Val
CAT
His
AGT
Ser
ACC
Thr
345
GAA
Glu
TCA
Ser
GGC
Gly
GAA
Glu
GAT
Asp
410
TCA
Ser
TTC
Phe
ATG
Met
TCC
Ser
CTA
Leu
475
TTT
Phe
CAC
His
AAT
Asn
ATT
Ile
320
CAC
His
CAG
Gln
AGC
Ser
TCC
Ser
CCA
Pro
385
TGT
Cys
TCC
Ser
GCC
Ala
CAC
His
TCT
Ser
450
ATG
Met
CTT
Leu
GAC
Asp
CAT
His
295
CAA
Gln
GTT
Val
TAT
Tyr
ACT
Thr
ATC
Ile
360
GTA
Val
AGA
Arg
AAC
Asn
TAC
Tyr
ATG
Met
425
ACG
Thr
CCA
Pro
GCG
Ala
CTC
Leu
CAT
His
490
CCT
Pro
TAC
Tyr
GAG
Glu
ACT
Thr
335
CCT
Pro
CTT
Leu
GAA
Glu
GGA
Gly
AGC
Ser
400
CGA
Arg
ACC
Thr
CCA
Pro
CCC
Pro
GTC
Val
465
GTG
Val
CAC
His
CCT
Pro
GAC
Asp
505
GAG
Glu
GCC
Ala
GCC
Ala
TCA
Ser
570
ACG
Thr
CTT
Leu
ACT
Thr
CAG
Gln
ATT
Ile
635
CAG
Gln
AGT
Ser
GAT
Asp
CAA
Gln
Lys
545
GAA
Glu
GAG
Glu
CCT
Pro
GAG
Glu
AAC
Asn
610
GCC
Ala
GCT
Ala
CAG
Gln
AAC
Asn
GAG
Glu
S20
GAG
Glu
AGA
Arg
GTG
Val
AGT
Ser
TTC
Phe
585
GCA
Ala
CCA
Pro
AGG
Arg
GTA
Val
637
TTC
Phe
495
AGC
Ser
GAG
Glu
CCT
Pro
ACC
Thr
GAC
Asp
560
GAA
Glu
CTG
Leu
ACA
Thr
GCA
Ala
CTG
Leu
625
AGC
Ser
CTC
Leu
TAT
Tyr
GTT
Val
535
AAG
Lys
AGC
Ser
ACA
Thr
GGC
Gly
CCT
Pro
600
GGC
Gly
TCT
Ser
TCC
Ser
CCT
Pro
510
GAA
Glu
AAG
Lys
CCC
Pro
AAC
ASH
GAA
Glu
S75
ATA
Ile
GCC
Ala
CGC
Arg
AGT
Ser
TTC
Phe
GCT
Ala
ACG
Thr
AAT
Asn
550
ACA
Thr
GAT
Asp
CAG
Gln
TTC
Phe
TTC
Phe
615
GTA
Val
CAC
His
AGC
Ser
ACC
Thr
525
CTC
Leu
GGC
Gly
AGC
Ser
GAA
Glu
AAC
Asn
590
CGC
Arg
TCG
Ser
ATT
Ile
CAC
His
500
CCC
Pro
CAA
Gln
GCC
Ala
CAC
His
TCC
Ser
565
AGA
Arg
CCC
Pro
CTG
Leu
ACA
Thr
GCT
Ala
630
AAC
Asn
GAG
Glu
AAT
Asn
540
ATT
Ile
CAG
Gln
GTA
Val
CTG
Leu
GCT
Ala
605
CAG
Gln
AAC
Asn
CCC
Pro
AGG
Arg
515
TAC
Tyr
AGC
Ser
GCT
Ala
AGC
ser
GGT
Gly
580
GCA
Ala
GAC
Asp
GAA
Glu
CAA
Gln
TAAAACCTA AATAAACACA TAGATTCACC
TATTTTATAT AATAAAGTAT TCCACCTTAA ATTAAACAAT
TTAGCAGTTC TGCAAATAAA AAAAAAAAAA 2490
(2) INFORMATION FOR SEQ ID NO:24:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1715 bases
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
GCG
Ala
ATA
Ile
GAG
Glu
CGG
Arg
AAC
Asn
555
AGT
Ser
GAA
Glu
GCC
Ala
AGC
Ser
GAA
Glu
620
GAC
Asp
TGTAAAACTT 2410
TTATTTTATT 2460
CAT
His
GTG
Val
CCA
Pro
530
CGG
Arg
AGA
Arg
AAC
Asn
GAT
Asp
AGT
Ser
595
AGG
Arg
ATC
Ile
CCT
Pro
(xi)
SEQUENCE DESCRIPTION: SEQ ID NO:24:
GCGCCTGCCT CCAACCTGCG GGCGGGAGGT GGGTGGCTGC
AAAAAGAGCC
CGCGGAGGCC
AGCAGGACGG
GAGGCCAGGA
GGCGACAGGA
CGGACCGCCC
CCCCCGCGCA
GGGACAAACT
GCGCCGAGAG
TCC
Ser
AAG
Lys
GCG
Ala
GAG
Glu
GTC
Val
AGA
Arg
Lys
80
GAG
Glu
AAG
Lys
GGC
Gly
ATG
Met
CTT
Leu
55
»Phe
AAC
Asn
CGC
Arg
GAG
Glu
AGC
Ser
Lars
CGG
Arg
AAG
Lys
Ly s
CGA
Arg
CAG
Gln
AGC
Ser
TGT
Cys
TGG
Trp
70
CCA
Pro
GGCGAGGAGT
AGGAGCTGAG
TGATAACCTC
CGCGAGCCGC
GCAGCCCCGA
GCCGCGTCCG
GCGCGAGCGC
TTTCCCAAAC
CCGTCCGCGT
GAA
Glu
GGC
Gly
AGC
Ser
CAG
Gln
45
GAA
Glu
TTC
Phe
CAA
Gln
GGC
Gly
TCC
Ser
CCA
Pro
GAA
Glu
ACC
Thr
AAG
Lys
AAT
Asn
85
AGA
Arg
GGC
Gly
GCC
Ala
TCG
Ser
AGT
Ser
60
ATC
Ile
TCCCCGAAAC
CGGCGGCGGC
TCCCCGATCG
CAGCGGCGGG
GAGCCAGGGC
CGCCGCGCTC
CTCAGCGCGG
CCGATCCGAG
AGAGCGCTCC
GGC
Gly
AAG
Lys
Leu
GCT
Ala
TCT
Ser
GGG
Gly
AAG
Lys
TTGTTGGAAC
TGCCGGACGA
GGTTGCGAGG
ACCCATCGAC
GAGCGCCCGT
CCTGCAGGCA
CCGCTCGCTC
CCCTTGGACC
GTCTCCGGCG
Lys
AAG
Lys
CCT
Pro
GCA
Ala
GAA
Glu
AAT
Asn
75
ATA
Ile
GGG
Gly
CCG
Pro
CCC
Pro
GGT
Gly
50
TAC
Tyr
GAA
Glu
CAA
Gln
AAG
Lys
GAG
Glu
CAA
Gln
TCC
Ser
TCC
Ser
TTG
Leu
Lys
90
GGGGCAATTG 50
TCCGGGCTCG 100
TGGGAGCGTG 150
GCGCCGGGCA 200
GACTTCCCGG 250
TccAc<3-:1" cc
ACGGGAGACG
TCCCCATCGA
AAACTCGCCT
GGC
Gly
TCC
Ser
TTG
Leu
TCT
Ser
65
ATG
Met
1
AAG
Lys
GCG
Ala
Lys
40
CTC
Leu
CGA
Arg
CCA
Pro
GGG
Gly
GAT
Asp
GGA
Gly
TCA
Ser
GGA
Gly
145
GTA
Val
ACA
Thr
GAG
Glu
TTC
Phe
TGC
Cys
210
AAG
Lys
TCT
Ser
AAT
Asn
120
GCA
Ala
TCC
Ser
TCC
Ser
AAG
Lys
185
ATG
Met
AAG
Lys
TAC
Tyr
TCA
Ser
95
GGA
Gly
GAC
Asp
GAG
Glu
TAT
Tyr
ACA
Thr
160
ACC
Thr
GAG
Glu
GTG
Val
TGC
Cys
GTA
Val
225
GAA
Glu
GAG
Glu
AGT
Ser
ATC
Ile
135
GTG
Val
GAA
Glu
ACT
Thr
Lys
200
CCA
Pro
ATG
Met
CTT
Leu
TAT
Tyr
110
GCC
Ala
ATC
Ile
TCT
Ser
GGA
Gly
GGG
Gly
175
ACT
Thr
GAC
Asp
AAT
ASH
GCC
Ala
CGC
Arg
ATG
Met
TCT
Ser
ACT
Thr
TCA
Ser
150
GCA
Ala
ACA
Thr
TTC
Phe
CTT
Leu
GAG
Glu
215
AGC
Ser
ATT
Ile
TGC
Cys
GCC
Ala
125
GGT
Gly
GAG
Glu
AAT
Asn
AGC
Ser
TGT
Cys
190
TTT
Phe
TTC
Phe
AAC
Asn
100
AAT
Asn
ATG
Met
TCT
Ser
ACT
Thr
165
CAT
His
GTG
Val
AAC
Asn
ACT
Thr
Tyr
230
GTG
Val
ATC
Ile
CCA
Pro
140
CCC
Pro
TCT
Ser
CTT
Leu
AAT
Asn
CCC
Pro
205
GGT
Gly
AGT
Ser
GCA
Ala
ATC
Ile
115
ACC
Thr
GCC
Ala
ATT
Ile
TCA
Ser
GTA
Val
180
GGA
Gly
TCG
Ser
GAT
Asp
ACG
Thr
TCA
Ser
AGC
Ser
ATC
Ile
TCA
Ser
AGA
Arg
155
TCT
Ser
GGG
Gly
AGA
Arg
ccc
Arg
22 0
TCC
Ser
CTG
Leu
GTG
Val
130
ACT
Thr
ATA
Ile
ACA
Thr
GAG
Glu
195
TAC
Tyr
TGC
Cys
ACT
Thr
GCT
Ala
105
TTA
Leu
GAA
Glu
GAA
Glu
TCA
Se:
TCT
Ser
170
GCG
Ala
TGC
Cys
CAA
Gln
CCC
Pro
235
TAGGA GCATGCTCAG TTGGTGCTGC
Glu
241
Pro
240
Phe Leu Ser Leu
TTTCTTGTTG CTGCATCTCC CCTCAGATTC CACCTAGAGC TAGATGTGTC 129C
TTACCAGATC TAATATTGAC TGCCTCTGCC TGTCGCATGA GAACATTAAC 1340
AAAAGCAATT GTATTACTTC CTCTGTTCGC GACTAGTTGG CTCTGAGATA 139C
CTAATAGGTG TGTGAGGCTC CGGATGTTTC TGGAATTGAT ATTGAATGAT 144$
GTGATACAAA TTGATAGTCA ATATCAAGCA GTGAAATATG ATAATAAAGG 149C
CATTTCAAAG TCTCACTTTT ATTGATAAAA TAAAAATCAT TCTACTGAAC 15
89
AGTCCATCTT CTTTATACAA TGACCACATC CTGAAAAGGG TGTTGCTAAG 1590
CTGTAACCGA TATGCACTTG AAATGATGGT AAGTTAATTT TGATTCAGAA 1640
TGTGTTATTT GTCACAAATA AACATAATAA AAGGAGTTCA GATGTTTTTC 1690
TTCATTAACC AAAAAAAAAA AAAAA 1715
(2) INFORMATION FOR SEQ ID NO:25:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2431 bases
B) TYPE: nucleic acid
) STRANDEDNESS: N.A.
TOPOLOGY: linear
(
(C
(D)
SEQUENCE DESCRIPTION:
GAGGCGCCTG
TTGAAAAAGA
TCGCGCGGAG
GTGAGCAGGA
GCAGAGGCCA
CGGGGCGACA
GGCCGGACCG
ACGCCCCCGC
CGAGGGACAA
CCTGCGCCGA
CCTCCAACCT
GCCGGCGAGG
GCCAGGAGCT
CGGTGATAAC
GGACGCGAGC
GGAGCAGCCC
CCCGCCGCGT
GCAGCGCGAG
ACTTTTCCCA
GAGCCGTCCG
G TCC GAG CGC AAA GAA
Ser Glu Arg Lys Glu
SEQ ID NO:25:
GCGGGCGGGA
AGTTCCCCGA
GAGCGGCGGC
CTCTCCCCGA
CGCCAGCGGC
CGAGAGCCAG
CCGCGCCGCG
CGCCTCAGCG
AACCCGATCC
CGTAGAGCGC
GGTCGGTGGC
AACTTGTTGG
GGCTGCCGGA
TCGGGTTGCG
GGGACCCATC
GGCGAGCGCC
CTCCCTGCAG
CGGCCGCTCG
GAGCCCTTGG
TCCGTCTCCG
TGCGGGGCAA
AACTCCGGGC
CGATGGGAGC
AGGGCGCCGG
GACGACTTCC
CGTTCCAGGT
GCAACGGGAG
CTCTCCCCAT
ACCAAACTCG
GCGAG AT
Met
1
GGC AGA GGC AAA GGG AAG GGC AAG 537
Gly Arg Gly Lys Gly Lys Gly Lys
AAG AAG GAG CGA GGC TCC GGC AAG AAG CCG GAG TCC GCG S76
Lys Lys Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala
GCG
Ala
GAG
Glu
GTC
Val
AGA
1-‘-I9
Lys
80
GGG
Gly
GAT
Asp
TCA
Ser
GGA
Gly
145
GTA
Val
ACA
Thr
GAG
Glu
TTC
Phe
TGC
Cys
210
AAC
Asn
GGC
Gly
ATG
Met
CTT
Leu
55
TTC
Phe
AAC
Asn
AAG
Lys
TCT
Ser
AAT
Asn
120
AAC
Asn
GCA
Ala
TCC
Ser
TCC
Ser
AAG
Lys
185
ATG
Met
AAG
Lys
TAC
."r
AGC
Ser
CGG
Arg
AAG
Lys
TCA
Ser
95
GGA
Gly
GAC
Asp
GAG
Glu
TAT
Tyr
ACA
Thr
160
ACC
Thr
GAG
Glu
GTG
Val
TGC
Cvs
GTA
Val
225
CAG
Gln
AGC
Ser
TGT
Cys
TIP
70
GAA
Glu
GAG
Glu
AGT
Ser
ATC
Ile
135
GTG
Val
GAA
Glu
ACT
Thr
AGC
Ser
CAG
Gln
45
GAA
Glu
TTC
Phe
CAA
Gln
CTT
Leu
Tyr
110
GCC
Ala
ATC
Ile
TCT
Ser
GGA
Gly
GGG
Gly
175
ACT
Thr
GAC
Asp
AAT
Asn
GCC
Ala
CCA
Pro
GAA
Glu
ACC
Thr
AAG
Lys
AAT
Asn
85
CGC
Arg
ATG
Met
TCT
Ser
ACT
Thr
TCA
Ser
150
GCA
Ala
ACA
Thr
TTC
Phe
CTT
Leu
GAG
Glu
215
AGC
Ser
GCC
Ala
TCG
Ser
AGT
Ser
60
AAT
Asn
ATC
Ile
ATT
Ile
TGC
Cys
GCC
Ala
125
GGT
Gly
GAG
Glu
AAT
Asn
AGC
Ser
TGT
Cys
190
TTT
Phe
TTC
Phe
TTG
Leu
GCT
Ala
TCT
Ser
GGG
Gly
AAG
Lys
AAC
Asn
100
AAT
Asn
ATG
Met
TCT
Ser
ACT
Thr
165
CAT
His
GTC
Val
AAC
Asn
ACT
Thr
Tyr
230
CCT
Pro
GCA
Ala
GAA
Glu
AAT
Asn
75
ATA
Ile
GTG
Val
ATC
Ile
CCA
Pro
140
CCC
Pro
TCT
Ser
CTT
Leu
AAT
Asn
CCC
Pro
205
GGT
Gly
AAG
Lys
CCC
Pro
GGT
Gly
50
TAC
Tyr
GAA
Glu
CAA
Gln
GCA
Ala
ATC
Ile
115
ACC
Thr
GCC
Ala
ATT
Ile
TCA
Ser
GTA
Val
180
GGA
Gly
TCG
Ser
GAT
Asp
GCG
Ala
CAA
Gln
TCC
Ser
TCC
Ser
Lys
90
TCA
Ser
AGC
Ser
ATC
Ile
TCA
Ser
AGA
Arg
155
TCT
Ser
GGG
Gly
AGA
Arg
CGC
Arg
220
GAG
Glu
TTG
Leu
TCT
Ser
65
AAT
Asn
AAG
Lys
CTG
Leu
GTG
Val
130
ACT
Thr
ATA
Ile
ACA
Thr
TGT
Cys
GAG
Glu
195
TAC
Tyr
TGC
Cys
GAG
Glu
Lys
40
CTA
Leu
CTC
Leu
CGA
Arg
CCA
Pro
GCT
Ala
105
TTA
Leu
GAA
Glu
GAA
Glu
TCA
Ser
TCT
Ser
170
GCG
Ala
TGC
Cys
CAA
Gln
CTG
Leu
235
TAC
Tyr
GCC
Ala
TGC
Cys
CTT
Leu
275
GAG
Glu
GTC
Val
ACA
Thr
CAC
His
340
AGC
TCT
Ser
AGC
Ser
ACA
Thr
GCC
Ala
CAG
Gln
CTC
Leu
250
CGG
Arg
ATT
Ile
AAT
Asn
ATC
Ile
315
TCC
Ser
TCC
Ser
AAC
Asn
GTA
Val
AGC
Ser
380
GGA
Gly
AGA
Arg
AGT
Ser
TCAGTCAGAG AATCCCTGTG AGCACCTGCG GTCTCACCTC AGGAAATCTA 1840
GAA
Glu
AAG
Lys
CTT
Leu
ACC
Thr
CAG
Gln
GCC
Ala
290
GTC
Val
TCC
Ser
TTT
Phe
ACT
Thr
GGA
Gly
355
ATC
Ile
CCA
Pro
GGC
Gly
GAA
Glu
AGG
Arg
420
AGA
Arg
GTG
Val
AAG
Lys
265
AGC
Ser
AAT
Asn
CAG
Gln
AGT
Ser
ACT
Thr
CAC
His
GTG
Val
ACT
Thr
CCT
Pro
395
ACC
Thr
GTG
Val
240
GTC
Val
CTT
Leu
GGG
Gly
CTG
Leu
305
GAG
Glu
ACC
Thr
GTC
Val
ACT
Thr
ATG
Met
370
GGG
Gly
CGT
Arg
CCT
Pro
CTG
Leu
GGC
Gly
CAG
Gln
CGG
Arg
280
CCT
Pro
GTG
Val
CAT
His
AGT
Ser
ACC
Thr
345
GAA
Glu
TCA
Ser
GGC
Gly
GAA
Glu
GAT
ASP
4
ACC
Thr
ATC
Ile
255
CGG
Arg
TCT
Ser
CAC
His
AAT
Asn
ATT
Ile
320
CAC
His
CAG
Gln
AGC
Ser
TCC
Ser
CCA
Pro
385
TGT
Cys
TCC
Ser
ATA
Ile
ATG
Met
GAA
Glu
CAT
His
295
CAA
Gln
GTT
Val
TAT
Tyr
ACT
Thr
ATC
Ile
360
GTA
Val
AGA
Arg
AAC
Asn
TAC
Tyr
ACC
Thr
TGT
Cys
AAG
Lys
270
CGA
Arg
CCT
Pro
TAC
Tyr
GAG
Glu
ACT
Thr
335
CCT
Pro
CTT
Leu
GAA
Glu
GGA
Gly
AGC
Ser
400
CGA
Arg
GGC
Gly
245
GTG
Val
CTG
Leu
AAC
Asn
AAC
Asn
GTA
Val
310
AGA
Arg
TCC
Ser
AGC
Ser
TCC
Ser
AAC
Asn
375
CGT
Arg
TTC
Phe
GAC
Asp
TAAAA CCGAAGGCAA AGCTACTGCA
ATC
Ile
GTG
Val
CAT
His
AAT
Asn
285
CCA
Pro
TCT
Ser
GAA
Glu
ACA
Thr
CAC
His
350
GAA
Glu
AGT
Ser
CTT
Leu
CTC
Leu
TCT
Ser
415
TGC
Cys
GCC
Ala
260
GAC
Asp
ATG
Met
CCC
Pro
GCA
Ala
325
GCC
Ala
AGC
Ser
AGC
Ser
AGG
Arg
AAT
Asn
390
AGG
Arg
CCT
Pro
ATC
Ile
TAC
Tyr
CGT
Arg
ATG
Met
CCC
Pro
300
AAC
Asn
GAG
Glu
CAT
His
TGG
Trp
CAC
His
365
CAC
GGC
Gly
CAT
His
CAT
His
GAGGAGAAAC 17
CTCTAATCAG
TTGATGAAGT
CTCGTCGTCC
CTTTGATGCG
TCAGACCCAC
GCATCAATGC
AAACTCTGAT
TGTCTTACCT
TGAATGTCAT
TATGAAATTC
TGGCAGTCTT
AAAAAAAAAA
AATAAGGGGC
CATCTCTTTG
CAGTGACTGA
GAAGGTGCAG
TCGGGGTCTC
TTGATAAGGA
TCGGTGGTCG
TCCAGCCTCA
GGGGGGCAAC
CAAGAAGGGA
CACGGGTGGT
AAAAAAAAAA
GGCAGTTACC
TTTGACGGAA
CAGGCAACAG
CACATGGAGT
AGTGTCCTCA
CCCTTCTATA
AGCTGGCCTC
GTTAAGTCAA
TGCTTGCCCT
TGAATAAATA
TTTCAAAGCA
AAAAAAAAAA
(2) INFORMATION FOR SEQ ID NO:26:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 625 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION:
Glu Arg Lys
Arg Gly Ser
Pro Ala Leu
Ala Ala Gly
50
Tyr Ser Ser
65
Asn Arg Lys
80
TGTTCTAGGA
CTTATTTCTT
ACTCTTAAAG
TTCCAGCTCT
GTTGTAACAT
ATTCCAATTG
GTGTTCTTAT
ATCAAGGGCT
CCACCCTATA
AATCTCTTGG
GAAAAAAAAA
SEQ ID NO:26:
Arg Gly Lys
Lys Pro Glu
Arg Leu Lys
40
Leu Val Leu
55
Phe Lys Trp
70
Pro Gln Asn
85
_ Cys
GTGCTCCTAG
CTGAGCTTCT
AGCTGGGATG
GGCCATGGGC
TAGAGAGATG
CCAGTTATCC
CTGCTAACCC
ATGTCATTGC
GTATCTATTT
ATGCTGCGTC
AAAAAAAAAA
A 2431
Gly Lys Lys
Ala Gly Ser
Lys Ser Gln
Glu Thr Ser
Asn Gly Asn
Ile Gln Lys
Lys
Ser
95
Tyr
110
Ala
125
Pro
140
Arg
155
Ser
170
Lys
185
Lys
200
Gly
215
Gln
230
Thr
245
Cys
260
His
275
Met
290
Asn
305
Ser
320
Ser
335
Thr
350
Ser
365
Arg
380
Thr
395
Arg Ile
Lys Val
Thr Ile
Thr Glu
Val Ser
Thr Thr
Thr Phe
Ser Asn
Gly Ala
Glu Lys
Gly Ile
Ala Tyr
Leu
Ala Asn
Leu Val
Ile Val
Thr Ser
His Ser
His Ser
Ser Pro
Pro Arg
Asn
100
Ile
115
Val
130
Gly
145
Thr
160
Gly
175
Cys
190
Pro
205
Arg
220
Ala
235
Cys
250
Cys
265
Gln
280
Gly
295
Asn
310
Glu
325
Thr
340
Trp
355
Val
370
Thr
385
Glu
400
Ser
Leu
Asn
Val
Ala
His
Gly
Tyr
Glu
Leu
Leu
Lys
Arg
His
Val
Ala
His
Gly
Met
Pro
.Tyr
Ala
105
Asn
120
Ile
135
Ser
150
Thr
165
Val
180
Glu
195
Cys
210
Val
225
Gln
240
Val
255
Gln
270
Glu
285
Asn
300
Lys
315
Thr
330
Thr
345
Thr
360
Ser
375
Gly
390
Leu
4
Arg
(2) INFORMATION FOR SEQ ID NO:27:
A1-‘Q
Glu
410
Val
425
His
440
Pro
455
Pro
470
Arg
485
Ser
500
Ser
515
Glu
530
Arg
545
Leu
560
Ser
575
Gly
590
Phe
605
Thr
620
Asp Ser Tyr
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 645 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(Xi) SEQUENCE DESCRIPTION:
Met Ser Glu Arg Lys
Lys Glu Arg Gly Ser
Gln Ser Pro Ala Leu
Thr
430
Pro
445
Thr
460
Glu
475
Lys
490
Pro
505
Val
520
Gln
535
Thr
550
Asn
565
Glu
S80
Leu
595
Ser
610
Gln
625
A‘-'9‘
SEQ ID NO:27:
His
420
Ser
435
Ser
450
Ser
465
Leu
480
Pro
495
Asn
510
Tyr
525
Lys
540
His
555
Ser
570
Asp
585
Glu
600
Ala
615
Arg Gly Lys Gly Lys Gly Lys Lys
Lys Pro Glu Ser Ala Ala Gly Ser
Gln Leu Lys Glu Met Lys Ser Gln
Met:
'Iyr
Ala
Tyr
Asn
Gly
Gly
Ala
Gly
Pro
Ser
Ala
Met
Pro
Ser
Leu
Leu
Lys
A-‘C9
His
Val
Ala
Gly
50
Ser
65
Lys
80
Ser
95
Tyr
110
Ala
125
Pro
140
Arg
155
Ser
170
Lys
185
Lys
200
Glu
215
Tyr
230
Gln
245
Val
260
Gln
2'75
Glu
290
Asn
305
Lys
320
Thr
335
Thr
3 50
Ly s
Leu
5
Trp
7 O
Asn
85
Asn
100
Ile
115
Va}.
130
Gly
145
Thr
160
Gly
1'75
Cys
190
Pro
205
Arg
220
Ile
235
Thr
0
Cys
265
His
280
Met
295
Asn
310
Ser
325
Ser
340
Thr
355
Ser
60
Asn
75
Lys
9 0
Ala
105
Asn
120
Ile
135
Ser
150
Thr
165
Val
180
Glu
195
Cys
210
Val
225
Ala
240
Cys
255
Cys
270
Gln
285
Gly
300
Asn
3 15
Glu
330
Thr
345
Trp
3
Ser
Gly
Met
Pro
Ser
Ser
Arg
Pro
Met
Leu
His
Asp
Glu
Val
Asn
Ser
Gly
Ser
Asn
Ser Se
Thr
365
Ser
380
Gly
395
Leu
410
His
425
Ser
440
Ser
455
Ser
470
Leu
485
Pro
500
Asn
515
Tyr
530
Lys
545
His
560
Ser
575
Asp
590
Glu
605
Ala
620
%
Ile
Phe '
(2 ) INFORMATION FOP. SEQ ID NO:28:
(1) SEQUENCE CHARACTERISTICS :
(A)
{B}
{D} TOPOLOGE: linear
LENGTH: 637 amino acids
TYPE: amino acid
Ser
370
Arg
385
Thr
400
Glu
415
Val
430
His
445
Pro
460
Pro
475
Arg
490
Ser
505
Ser
520
Glu
535
Arg
550
Leu
565
Ser
580
Gly
595
Phe
610
Thr
625
Gln
640
Val
375
Thr
390
Glu
405
Tyr
420
Thr
435
Pro
450
Thr
465
Glu
480
Lys
495
Pro
510
Val
525
Gln
540
Thr
555
Asn
570
Glu
S85
Leu
600
Ser
615
Gln
630
Val
6
(xi) SEQUENCE DESCRIPTION:
Arg
Pro
Ala
Tyr
Asn
Gly
Gly
Ala
Gly
Pro
Ser
Ala
Met
Pro
Ser
Ile
Val
Asp
Ser
Leu
Gly
50
Ser
Lys
80
Ser
95
Tyr
110
Ala
125
Pro
140
Arg
155
Ser
170
Lys
185
Lys
200
Glu
215
Tyr
230
Gly
245
Ala
260
_ Leu
Ala
290
Met Ser Glu Arg Lys Glu Gly
ID NO:28:
Gly Lys Gly
Glu
Lys
40
Leu
55
Trp
70
Asn
85
Asn
100
Ile
115
Val
130
Gly
145
Thr
160
Gly
175
Cys
190
Pro
205
Arg
220
Leu
235
Leu
250
Lys
265
Arg
280
His
295
Lys
Ala
Met
Cys
Lys
Lys
Ala
Lys
Ser
Tyr
Gly
Ser
Asn
Arg
Gln
Gln
Val
Gln
Glu
Lys
Ser
Gln
45
Ser
60
Asn
75
Lys
90
Ala
105
Asn
120
Ile
135
Ser
150
Thr
165
Val
180
Glu
195
Cys
210
Val
225
Val
240
Ile
255
Lys
270
Asn
285
Pro
3
Glu
A?-‘9
Va 1
AI-'9
AIS!
_ Leu
Leu
3 05
Ile
320
Thr
335
His
350
His
365
Ser
3 80
Pro
395
Asp
410
Met
425
Ser
440
Ser
455
Glu
470
Glu
485
His
500
A359
515
Pro
53 0
Lys
545
Asp
560
Glu
575
Asn
590
Ala
605
Va 1
'Iyr
ASI1
Val
310
Ala
325
His
340
Gly
355
Met
370
Pro
3 85
Ser
400
Ser
415
A339
430
Pro
445
Lys
460
Leu
475
His
490
Asp
505
Glu
520
Val
53 5
Asn
550
Ser
565
Gly
S80
Ser
595
Asn
610
Lys
Thr
Thr
Thr
Ser
Gly
Leu
His
Ser
Ser
Ser
Leu
Pro
Asn
Tyr
Lys
His
Ser
Asp
Glu
Va 1
Va 1
Ile
315
Ser
330
Thr
3 4 S
Ile
360
Asn
375
Asn
390
Ala
405
Arg
42 0
Asp
435
Ser
450
Val
465
Pro
480
Phe
495
Pro
510
Thr
525
Asn
540
Asn
555
Ser
570
Phe
585
Pro
600
Phe
6
Ser Thr Gln Glu Glu Ile Gln Ala Arg Leu Ser Ser Val Ile Ala
Asn Gln Asp Pro Ile Ala Val
(2) INFORMATION FOR SEQ ID NO:29:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 420 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY:
linear
(xi) SEQUENCE DESCRIPTION:
Ser Glu Arg Lys Glu Gly
S
Arg
Pro
Ala
Tyr
Asn
Gly
Gly
Ala
Gly
Pro
Ser
Ala
Me?
Pro
Ser
Leu
Gly
50
Ser
65
Lys
80
Ser
95
Tyr
110
Ala
125
Pro
140
Arg
155
Ser
170
Lys
185
Lys
200
Glu
215
Tyr
230
AIS!
ID NO:29:
Gly Lys Gly
Glu
Lys
40
Leu
55
Trp
70
Asn
85
Asn
100
Ile
115
Val
130
Gly
145
Thr
160
Gly
175
Cys
190
Pro
205
Arg
220
Leu
235
Lys
Ser
Gln
Ser
60
Asn
75
Lys
90
Ala
105
Asn
120
Ile
135
Ser
150
Thr
165
Val
180
Glu
195
Cys
210
Val
225
Val
2
Leu
(2) INFORMATION FOR SEQ ID NO:30:
Gly Ile Cys
Ala
260
Tyr Cys
Leu Gln
Ala
290
Asn Gly
Leu Val Asn
Glu
Ile Val
Thr Thr
335
His Ser Trp
His Ser Val
Thr
Ser Pro
Pro Glu
395
Asp Ser Tyr
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 241 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION:
Lys Glu Gly
Ser Gly Lys
Leu Pro Pro
G ' Ser Lys
L" I"
K...) '1.
Leu Arg
er
pp
03
Asn Lys
Leu
250
Lys
265
Arg
280
His
295
Val
310
Ala
325
His
340
Gly
355
Met
370
Pro
385
Ser
400
Ser
415
ID NO:30:
Gly Lys Gly
Glu
Lys
40
Leu
55
Trp
70
Asn
85
Ile
255
Lys
270
Asn
285
Pro
300
Ile
315
Ser
330
Thr
345
Ile
360
Asn
375
Asn
390
Ala
405
Arg
420
Lys
Ser
Gln
45
Ser
60
\o I.:
'“.I ll)
Ly s
Ly s
Glu
241
SE1‘
Ser
95
Tyr
110
Ala
125
Pro
140
Arg
155
Ser
170
Lys
185
Lys
200
Glu
215
Tyr
230
Arg
Ly s
Asn
100
Ile
115
Val
130
Gly
145
Thr
160
Gly
175
Cys
190
Pro
205
Arg
22 0
Pro
235
A1-*9
Ala
105
Asn
120
Ile
135
Ser
150
Thr
165
Val
180
Glu
195
Cys
210
Val
225
Pro
240
Claims (35)
1. A composition comprising an isolated polypeptide comprising a sequence of at least 15 amino acid residues having an in vivo effector or antigenic activity of a heregulin (HRG) of Figures 4, 8, 12, 13 or 15 and at least 75% amino acid sequence identity therewith.
2. The composition of claim 1 wherein the isolated polypeptide heregulin has antigenic activity.
3. The composition of claim 1 wherein the isolated polypeptide has a biological activity of heregulin.
4. The composition of claim 3 wherein the heregulin is HRG—GFD (growth factor domain).
5. The composition of claim 1 wherein the heregulin is heregulin -oi, —l31, -62, or —[33 as shown in Figure 15 or alleles or animal analogues thereof.
6. The composition of claim 3 wherein the heregulin is human heregulin-o-growth factor domain (GFD).
7. The composition of claim 3 wherein the heregulin is human heregulin-[3-GFD, heregulin-[32-GFD or heregulin-[53-GFD.
8. The composition of claim 1 further comprising pharmaceutically acceptable carrier.
9. The composition of claim 8 wherein the heregulin is a heregulin GFD (growth factor domain).
10.
11. an immunogenic, non-heregulin polypeptide. The composition of claim 10 wherein the heregulin GFD comprises The composition of claim 9 further comprising an immune adjuvant. Ix) U]
12. The composition of claim 1 wherein the heregulin consists of N- terminal domain (NTD) and growth factor domain (GFD).
13. The composition of claim 1 wherein the heregulin is NTD-GFD— transmembrane polypeptide.
14. The composition of claim 1 wherein the heregulin is HRG—GFD_
15. The composition of claim 1 wherein the heregulin comprises a cytoplasmic domain.
16. The composition of claim 1 wherein the heregulin is NTD—GFD and it has an amino acid sequence which is at least 85% homologous with the native heregulin-o, -[31, -B3 NTD—GFD sequence as shown in Figure 15 or alleles or animal analogues thereof.
17. The composition of claim 1 wherein the heregulin polypeptide comprises an enzyme.
18. The composition of claim 16 wherein the heregulin is HRG-d.
19. The composition of claim 18 wherein the heregulin-o has an amino acid substituted, deleted or inserted adjacent to any one of residues 1-23, 107-108, 121-123, 128-130 and 163-247 as shown in Figure 15.
20. The composition of claim 16 wherein the heregulin is HRG-B1.
21. The composition of claim 20 wherein the heregulin [31 has an amino acid substituted, deleted or inserted adjacent to residues 1-23, 107-108, 121-123, 128-130 and 163-252 as shown in Figure 15.
22. The composition of claim 16 wherein the heregulin is HRG—[32.
23. The composition of claim 22 wherein the heregulin [32 has an amino acid substituted, deleted or inserted adjacent to any one of residues 1-23, 107-108, 121-123, 128-130 and 163-244 as shown in Figure 15.
24. The composition of claim 16 wherein the heregulin is HRG-B3.
25. The composition of claim 24 wherein the heregulin 63 has an amino acid substituted, deleted or inserted adjacent to any one of residues 1-23, 107-108, 121-123, 128-130 and 163-241 as shown in Figure 15.
26. An isolated antibody that is capable of specifically binding a heregulin polypeptide according to claim 1.
27. The isolated antibody of claim 26 that is capable of binding specifically to a heregulin—d, heregulin-B1, heregulin—l32, or heregulin—B3.
28. Nucleic acid encoding a polypeptide of claim 1.
29. The nucleic acid of claim 28 which encodes heregulin-d, heregulin- B1, heregulin—B2, or heregulin-B3 polypeptide.
30. The nucleic acid of claim 28 that encodes a heregu|in—GFD.
31. An expression vector comprising the nucleic acid of claim 28.
32. The expression vector of claim 31 wherein the nucleic acid encodes a heregulin-GFD.
33. A host cell transformed with a vector of claim 31.
34. A method of comprising culturing the host cell of claim 33 to express the heregulin and recovering the heregulin from the host cell.
35. The method of claim 34 wherein the heregulin is heregu|in—d, heregu|in—[31, heregulin-B2, or heregulin—[33 as shown is
Applications Claiming Priority (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
USUNITEDSTATESOFAMERICA24/05/19917 | |||
US70525691A | 1991-05-24 | 1991-05-24 | |
US76521291A | 1991-09-25 | 1991-09-25 | |
US79080191A | 1991-11-08 | 1991-11-08 | |
US07/847,743 US5367060A (en) | 1991-05-24 | 1992-03-06 | Structure, production and use of heregulin |
US88091792A | 1992-05-11 | 1992-05-11 |
Publications (2)
Publication Number | Publication Date |
---|---|
IE83896B1 true IE83896B1 (en) | |
IE921654A1 IE921654A1 (en) | 1992-12-02 |
Family
ID=27542108
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
IE20010175A IE20010175A1 (en) | 1991-05-24 | 1992-07-01 | HEREGULINS (HRGs), BINDING PROTEINS OF P185erb2 |
IE165492A IE921654A1 (en) | 1991-05-24 | 1992-07-01 | Structure, production and use of heregulin |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
IE20010175A IE20010175A1 (en) | 1991-05-24 | 1992-07-01 | HEREGULINS (HRGs), BINDING PROTEINS OF P185erb2 |
Country Status (12)
Country | Link |
---|---|
US (2) | US5641869A (en) |
EP (2) | EP0586607B1 (en) |
JP (1) | JP3595552B2 (en) |
AU (1) | AU659863B2 (en) |
CA (2) | CA2331239A1 (en) |
DE (1) | DE69233137T2 (en) |
DK (1) | DK0586607T3 (en) |
ES (1) | ES2206448T3 (en) |
IE (2) | IE20010175A1 (en) |
IL (4) | IL101943A0 (en) |
NZ (1) | NZ242857A (en) |
WO (1) | WO1992020798A1 (en) |
Families Citing this family (119)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050106666A1 (en) * | 1991-04-10 | 2005-05-19 | Cambridge Neuroscience | Glial mitogenic factors, their preparation and use |
US5530109A (en) * | 1991-04-10 | 1996-06-25 | Ludwig Institute For Cancer Research | DNA encoding glial mitogenic factors |
US5716930A (en) * | 1991-04-10 | 1998-02-10 | Ludwig Institute For Cancer Research | Glial growth factors |
IL101943A0 (en) * | 1991-05-24 | 1992-12-30 | Genentech Inc | Structure,production and use of heregulin |
US5834229A (en) * | 1991-05-24 | 1998-11-10 | Genentech, Inc. | Nucleic acids vectors and host cells encoding and expressing heregulin 2-α |
KR950701379A (en) * | 1992-04-29 | 1995-03-23 | 예다 리서치 앤드 디벨럽먼트 캄파니 리미티드 | RECOMBINANT STIMULATING FACTOR OF THE neu RECEPTOR |
AU693767B2 (en) * | 1992-08-10 | 1998-07-09 | Cambridge Neuroscience, Inc. | Inhibitors of cell proliferation, their preparation and use |
GB9217316D0 (en) * | 1992-08-14 | 1992-09-30 | Ludwig Inst Cancer Res | Schwann cell mitogenic factor,its preparation and use |
CA2103323A1 (en) * | 1992-11-24 | 1994-05-25 | Gregory D. Plowman | Her4 human receptor tyrosine kinase |
AU7042994A (en) * | 1993-05-21 | 1994-12-20 | Amgen, Inc. | Recombinant (neu) differentiation factors |
CA2141417A1 (en) * | 1993-07-06 | 1995-01-19 | Urs Eppenberger | Mammamodulin, a hormone-independent mammary tumor cells specific protein |
US5518885A (en) * | 1994-04-19 | 1996-05-21 | The United States Of America As Represented By The Department Of Health & Human Services | ERBB2 promoter binding protein in neoplastic disease |
US5770567A (en) * | 1994-11-14 | 1998-06-23 | Genentech, Inc. | Sensory and motor neuron derived factor (SMDF) |
AUPM959894A0 (en) * | 1994-11-22 | 1994-12-15 | Crc For Biopharmaceutical Research Pty Ltd | Fusion proteins with cytotoxic activity against cells overexpressing erbb2-4 |
US6685940B2 (en) | 1995-07-27 | 2004-02-03 | Genentech, Inc. | Protein formulation |
US6267958B1 (en) | 1995-07-27 | 2001-07-31 | Genentech, Inc. | Protein formulation |
US5912326A (en) * | 1995-09-08 | 1999-06-15 | President And Fellows Of Harvard College | Cerebellum-derived growth factors |
US5968511A (en) | 1996-03-27 | 1999-10-19 | Genentech, Inc. | ErbB3 antibodies |
DE69732711T2 (en) | 1996-07-12 | 2006-03-16 | Genentech, Inc., South San Francisco | GAMMA-heregulin |
US7371376B1 (en) * | 1996-10-18 | 2008-05-13 | Genentech, Inc. | Anti-ErbB2 antibodies |
US6387638B1 (en) * | 1997-02-10 | 2002-05-14 | Genentech, Inc. | Heregulin variants |
US6136558A (en) * | 1997-02-10 | 2000-10-24 | Genentech, Inc. | Heregulin variants |
AU7267398A (en) * | 1997-04-30 | 1998-11-24 | Hans Klingemann | Natural killer cell lines and methods of use |
ZA9811162B (en) * | 1997-12-12 | 2000-06-07 | Genentech Inc | Treatment with anti-ERBB2 antibodies. |
EP1947119A3 (en) | 1997-12-12 | 2012-12-19 | Genentech, Inc. | Treatment of cancer with anti-erb2 antibodies in combination with a chemotherapeutic agent |
US8101349B2 (en) | 1997-12-23 | 2012-01-24 | Novartis Vaccines And Diagnostics, Inc. | Gene products differentially expressed in cancerous cells and their methods of use II |
US20010023241A1 (en) | 1998-02-04 | 2001-09-20 | Sliwkowski Mark X. | Use of heregulin as a growth factor |
EP1941905A1 (en) | 1998-03-27 | 2008-07-09 | Genentech, Inc. | APO-2 Ligand-anti-her-2 antibody synergism |
US6080845A (en) * | 1998-08-05 | 2000-06-27 | Amgen Inc. | Monoclonal antibody against utricular epithelium |
US6017886A (en) * | 1998-08-05 | 2000-01-25 | Amgen Inc. | Use of NDF peptide as growth factor for sensory epithelium |
WO2000018916A2 (en) * | 1998-09-28 | 2000-04-06 | Chiron Corporation | Human genes and gene expression products |
WO2000027426A1 (en) * | 1998-11-07 | 2000-05-18 | Genentech, Inc. | Method for enhancing proliferation of inner ear hair cells using ligands for her2 and/or her3 receptors |
AUPP785098A0 (en) | 1998-12-21 | 1999-01-21 | Victor Chang Cardiac Research Institute, The | Treatment of heart disease |
US7396810B1 (en) | 2000-08-14 | 2008-07-08 | Oregon Health Sciences University | Compositions and methods for treating cancer by modulating HER-2 and EGF receptors |
US7393823B1 (en) | 1999-01-20 | 2008-07-01 | Oregon Health And Science University | HER-2 binding antagonists |
US7625859B1 (en) | 2000-02-16 | 2009-12-01 | Oregon Health & Science University | HER-2 binding antagonists |
WO2000053813A1 (en) * | 1999-03-11 | 2000-09-14 | Massachusetts Institute Of Technology | Pangenomic libraries |
US6635249B1 (en) | 1999-04-23 | 2003-10-21 | Cenes Pharmaceuticals, Inc. | Methods for treating congestive heart failure |
DE60033658T2 (en) * | 1999-06-25 | 2007-11-22 | Genentech, Inc., South San Francisco | TREATMENT OF PROSTATE CANCER WITH ANTI-ERBB2 ANTIBODIES |
US20040013667A1 (en) * | 1999-06-25 | 2004-01-22 | Genentech, Inc. | Treatment with anti-ErbB2 antibodies |
US20030086924A1 (en) * | 1999-06-25 | 2003-05-08 | Genentech, Inc. | Treatment with anti-ErbB2 antibodies |
US7041292B1 (en) | 1999-06-25 | 2006-05-09 | Genentech, Inc. | Treating prostate cancer with anti-ErbB2 antibodies |
BRPI0012196B8 (en) | 1999-06-25 | 2021-05-25 | Genentech Inc | industrialized article |
US6949245B1 (en) | 1999-06-25 | 2005-09-27 | Genentech, Inc. | Humanized anti-ErbB2 antibodies and treatment with anti-ErbB2 antibodies |
AU784045B2 (en) | 1999-06-25 | 2006-01-19 | Genentech Inc. | Humanized anti-ErbB2 antibodies and treatment with anti-ErbB2 antibodies |
US20020034772A1 (en) * | 1999-06-29 | 2002-03-21 | Orlow Seth J. | Methods and compositions that affect melanogenesis |
US6924359B1 (en) * | 1999-07-01 | 2005-08-02 | Yale University | Neovascular-targeted immunoconjugates |
EP1246597B1 (en) | 1999-08-03 | 2015-01-14 | The Ohio State University | Polypeptides and polynucleotides for enhancing immune reactivity to her-2 protein |
KR20110008112A (en) | 1999-08-27 | 2011-01-25 | 제넨테크, 인크. | Dosages for treatment with anti-erbb2 antibodies |
AU775373B2 (en) | 1999-10-01 | 2004-07-29 | Immunogen, Inc. | Compositions and methods for treating cancer using immunoconjugates and chemotherapeutic agents |
US7538197B2 (en) | 2000-02-11 | 2009-05-26 | Proteosys Ag | Use of neuregulin-β as an indicator and/or target |
US20040052793A1 (en) * | 2001-02-22 | 2004-03-18 | Carter Paul J. | Caspase activivated prodrugs therapy |
AU783679B2 (en) * | 2000-02-24 | 2005-11-24 | Genentech Inc. | Caspase activated prodrugs therapy |
US7097840B2 (en) | 2000-03-16 | 2006-08-29 | Genentech, Inc. | Methods of treatment using anti-ErbB antibody-maytansinoid conjugates |
EP1272647B1 (en) * | 2000-04-11 | 2014-11-12 | Genentech, Inc. | Multivalent antibodies and uses therefor |
EP1280923A2 (en) * | 2000-04-28 | 2003-02-05 | Millennium Pharmaceuticals, Inc. | 14094, a human trypsin family member and uses thereof |
CA2407556C (en) | 2000-05-19 | 2011-06-21 | Genentech, Inc. | Gene detection assay for improving the likelihood of an effective response to an erbb antagonist cancer therapy |
US20020119148A1 (en) * | 2000-09-01 | 2002-08-29 | Gerritsen Mary E. | ErbB4 antagonists |
US20030017491A1 (en) * | 2000-09-14 | 2003-01-23 | Zuo-Rong Shi | Chromogenic in situ hybridization methods, kits, and compositions |
WO2003014156A2 (en) | 2001-08-06 | 2003-02-20 | Proteosys Ag | NEUREGULIN-β ISOFORMS ASSOCIATED WITH NEURONAL PROCESSES |
DE10210427A1 (en) | 2002-03-09 | 2003-10-09 | Hans Konrad Mueller-Hermelink | Human monoclonal antibody |
AU2003224916B2 (en) | 2002-04-10 | 2009-01-08 | Genentech, Inc. | Anti-HER2 antibody variants |
AU2002304965A1 (en) | 2002-05-24 | 2003-12-12 | Zensun (Shanghai) Sci-Tech.Ltd | Neuregulin based methods and compositions for treating viral myocarditis and dilated cardiomyopathy |
EP1545608A4 (en) | 2002-06-28 | 2006-09-13 | Centocor Inc | Mammalian ch1 deleted mimetibodies, compositions, methods and uses |
EP1521776A2 (en) | 2002-07-04 | 2005-04-13 | Oncomab Gmbh | Neoplasm specific antibodies and uses thereof |
ES2392525T3 (en) | 2002-07-15 | 2012-12-11 | F. Hoffmann-La Roche Ag | Cancer treatment with the antibody directed against ErbB2 rhuMAb 2C4 |
CA2506320A1 (en) * | 2002-11-21 | 2004-06-10 | Genentech, Inc. | Therapy of non-malignant diseases or disorders with anti-erbb2 antibodies |
US8088387B2 (en) | 2003-10-10 | 2012-01-03 | Immunogen Inc. | Method of targeting specific cell populations using cell-binding agent maytansinoid conjugates linked via a non-cleavable linker, said conjugates, and methods of making said conjugates |
BR122018071808B8 (en) | 2003-11-06 | 2020-06-30 | Seattle Genetics Inc | conjugate |
WO2005094159A2 (en) | 2003-11-12 | 2005-10-13 | Oncomab Gmbh | Methods of identifying neoplasm-specific antibodies and uses thereof |
US20060034840A1 (en) * | 2004-04-08 | 2006-02-16 | Agus David B | ErbB antagonists for pain therapy |
BRPI0510883B8 (en) | 2004-06-01 | 2021-05-25 | Genentech Inc | drug-antibody conjugate compound, pharmaceutical composition, method of manufacturing a drug-antibody conjugate compound, and uses of a formulation, a drug-antibody conjugate and a chemotherapeutic agent, and a combination |
GT200500155A (en) * | 2004-06-16 | 2006-05-15 | PLATINUM-RESISTANT CANCER THERAPY | |
KR20120068807A (en) * | 2004-07-22 | 2012-06-27 | 제넨테크, 인크. | Her2 antibody composition |
JO3000B1 (en) * | 2004-10-20 | 2016-09-05 | Genentech Inc | Antibody Formulations. |
ZA200704796B (en) * | 2004-12-07 | 2008-11-26 | Genentech Inc | Selecting patients for therapy with a HER inhibitor |
BRPI0518104B8 (en) * | 2005-01-21 | 2021-05-25 | Genentech Inc | industrialized article and use of her2 antibody |
ZA200706247B (en) * | 2005-02-09 | 2008-11-26 | Genentech Inc | Inhibiting HER2 shedding with matrix metalloprotease antagonists |
UA95902C2 (en) * | 2005-02-23 | 2011-09-26 | Дженентек, Инк. | Extending time to disease progression or survival in cancer patients |
WO2006096861A2 (en) * | 2005-03-08 | 2006-09-14 | Genentech, Inc. | METHODS FOR IDENTIFYING TUMORS RESPONSIVE TO TREATMENT WITH HER DIMERIZATION INHIBITORS (HDIs) |
JP2006316040A (en) | 2005-05-13 | 2006-11-24 | Genentech Inc | Herceptin(r) adjuvant treatment |
AU2006261342B2 (en) | 2005-06-15 | 2012-02-02 | The Ohio State University Research Foundation | Her-2 peptides |
US20070213264A1 (en) | 2005-12-02 | 2007-09-13 | Mingdong Zhou | Neuregulin variants and methods of screening and using thereof |
RU2457854C2 (en) | 2005-12-30 | 2012-08-10 | Цзэньсунь (Шанхай) Сайенс Энд Текнолоджи Лимитед | Prolonged release of neuregulin for improvement in cardiac function |
EP2511301B1 (en) | 2006-08-04 | 2017-12-06 | MedImmune Limited | Human antibodies to erbb 2 |
CA2677108A1 (en) * | 2007-03-02 | 2008-09-12 | Genentech, Inc. | Predicting response to a her inhibitor |
WO2008118970A2 (en) | 2007-03-27 | 2008-10-02 | Sea Lane Biotechnologies, Llc | Constructs and libraries comprising antibody surrogate light chain sequences |
EP2171090B1 (en) | 2007-06-08 | 2013-04-03 | Genentech, Inc. | Gene expression markers of tumor resistance to her2 inhibitor treatment |
CA2705328C (en) | 2007-11-16 | 2018-01-02 | Proteosys Ag | Active soluble post-translationally modified neuregulin isoforms |
TWI472339B (en) | 2008-01-30 | 2015-02-11 | Genentech Inc | Composition comprising antibody that binds to domain ii of her2 and acidic variants thereof |
US8663643B2 (en) | 2008-03-18 | 2014-03-04 | Genentech, Inc. | Combinations of an anti-HER2 antibody-drug conjugate and chemotherapeutic agents, and methods of use |
BRPI0812682A2 (en) | 2008-06-16 | 2010-06-22 | Genentech Inc | metastatic breast cancer treatment |
US20100234283A1 (en) | 2009-02-04 | 2010-09-16 | The Ohio State University Research Foundation | Immunogenic epitopes, peptidomimetics, and anti-peptide antibodies, and methods of their use |
RU2504553C2 (en) | 2009-03-20 | 2014-01-20 | Дженентек, Инк. | Antibodies to her |
CN102482345A (en) | 2009-05-13 | 2012-05-30 | 航道生物技术有限责任公司 | Neutralizing molecules to influenza viruses |
WO2010136569A1 (en) | 2009-05-29 | 2010-12-02 | F. Hoffmann-La Roche Ag | Modulators for her2 signaling in her2 expressing patients with gastric cancer |
SI2536748T1 (en) | 2010-02-18 | 2014-12-31 | Genentech, Inc. | Neuregulin antagonists and use thereof in treating cancer |
WO2011146568A1 (en) | 2010-05-19 | 2011-11-24 | Genentech, Inc. | Predicting response to a her inhibitor |
JP5766296B2 (en) | 2010-12-23 | 2015-08-19 | エフ.ホフマン−ラ ロシュ アーゲーF. Hoffmann−La Roche Aktiengesellschaft | Polypeptide-polynucleotide complexes and their use in targeted delivery of effector components |
CA2842860A1 (en) | 2011-07-28 | 2013-01-31 | Sea Lane Biotechnologies, Llc | Sur-binding proteins |
EP3974832A1 (en) | 2011-10-06 | 2022-03-30 | Aveo Pharmaceuticals, Inc. | Predicting tumor response to anti-erbb3 antibodies |
WO2013053076A1 (en) | 2011-10-10 | 2013-04-18 | Zensun (Shanghai)Science & Technology Limited | Compositions and methods for treating heart failure |
KR20140098834A (en) | 2011-11-30 | 2014-08-08 | 제넨테크, 인크. | Erbb3 mutations in cancer |
US9376715B2 (en) | 2011-12-09 | 2016-06-28 | Roche Molecular Systems, Inc | Methods for detecting mutations in the catalytic subunit of the phosphoinositol-3 kinase (PIK3CA) gene |
EP2793940B1 (en) | 2011-12-22 | 2018-11-14 | i2 Pharmaceuticals, Inc. | Surrogate binding proteins |
KR102291355B1 (en) | 2012-11-30 | 2021-08-19 | 에프. 호프만-라 로슈 아게 | Identification of patients in need of pd-l1 inhibitor cotherapy |
US10239951B2 (en) | 2013-05-08 | 2019-03-26 | Zymeworks Inc. | Bispecific HER2 and HER3 antigen binding constructs |
WO2015067986A1 (en) | 2013-11-07 | 2015-05-14 | INSERM (Institut National de la Santé et de la Recherche Médicale) | Neuregulin allosteric anti-her3 antibody |
WO2015073721A1 (en) | 2013-11-13 | 2015-05-21 | Zymeworks Inc. | Monovalent antigen binding constructs targeting egfr and/or her2 and uses thereof |
RU2737882C2 (en) | 2013-11-27 | 2020-12-04 | Займворкс Инк. | Bispecific antigen-binding constructs against her2 |
EP3091033A1 (en) | 2015-05-06 | 2016-11-09 | Gamamabs Pharma | Anti-human-her3 antibodies and uses thereof |
AU2016271124C1 (en) | 2015-05-30 | 2020-05-14 | Molecular Templates, Inc. | De-immunized, Shiga toxin a subunit scaffolds and cell-targeting molecules comprising the same |
CN106729743B (en) | 2015-11-23 | 2021-09-21 | 四川科伦博泰生物医药股份有限公司 | anti-ErbB 2 antibody-drug conjugate, and composition, preparation method and application thereof |
SI3570884T1 (en) | 2017-01-17 | 2021-02-26 | Genentech, Inc. | Subcutaneous her2 antibody formulations |
KR102417583B1 (en) | 2017-03-02 | 2022-07-07 | 제넨테크, 인크. | Adjuvant treatment of her2-positive breast cancer |
CN107589261B (en) * | 2017-09-08 | 2019-02-22 | 上海宝藤生物医药科技股份有限公司 | It is a kind of for detecting the kit of breast cancer |
CN107589262B (en) * | 2017-09-08 | 2019-02-22 | 枫木年轮生物科技(广州)有限公司 | For detecting the kit of breast cancer |
ES2980163T3 (en) | 2018-04-11 | 2024-09-30 | Salubris Biotherapeutics Inc | Human neuregulin-1 (nrg-1) recombinant fusion protein compositions and methods of using the same |
CA3097178A1 (en) | 2018-04-17 | 2019-10-24 | Molecular Templates, Inc. | Her2-targeting molecules comprising de-immunized, shiga toxin a subunit scaffolds |
Family Cites Families (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4882275A (en) * | 1984-02-29 | 1989-11-21 | The Children's Medical Center Corporation | Method of purifying endothelial cell growth factors using immobilized heparin |
US4683195A (en) * | 1986-01-30 | 1987-07-28 | Cetus Corporation | Process for amplifying, detecting, and/or-cloning nucleic acid sequences |
US4968603A (en) * | 1986-12-31 | 1990-11-06 | The Regents Of The University Of California | Determination of status in neoplastic disease |
US5464751A (en) * | 1990-04-06 | 1995-11-07 | Trustees Of The University Of Pennsylvania | Ligand for the neu gene product |
US5578482A (en) * | 1990-05-25 | 1996-11-26 | Georgetown University | Ligand growth factors that bind to the erbB-2 receptor protein and induce cellular responses |
EP0597845A1 (en) * | 1990-05-25 | 1994-05-25 | Georgetown University | GROWTH FACTOR WHICH INHIBITS THE GROWTH OF CELLS OVEREXPRESSING THE HUMAN ONCOGENE erbB-2 |
FR2664086A1 (en) * | 1990-07-02 | 1992-01-03 | Centre Nat Rech Scient | IMPROVED PROCESS FOR THE OPTIMIZATION OF MAGNETIC PROPERTIES OF POWDER MAGNETIC MATERIALS AND PRODUCTS THUS OBTAINED. |
EP0574414B1 (en) * | 1991-01-14 | 1999-07-14 | Georgetown University | LIGAND GROWTH FACTORS THAT BIND TO THE erbB-2 RECEPTOR PROTEIN AND INDUCE CELLULAR RESPONSES |
FI921133A (en) * | 1991-03-17 | 1992-09-18 | Yeda Res & Dev | FACTOR, SOM STIMULER NEU-RECEPTORN |
US5169837A (en) * | 1991-03-28 | 1992-12-08 | Allelix Biopharmaceuticals Inc. | Isolated osteogenic factor |
GB9107566D0 (en) * | 1991-04-10 | 1991-05-29 | Ludwig Inst Cancer Res | Glial mitogenic factors,their preparation and use |
IL101943A0 (en) * | 1991-05-24 | 1992-12-30 | Genentech Inc | Structure,production and use of heregulin |
US5367060A (en) * | 1991-05-24 | 1994-11-22 | Genentech, Inc. | Structure, production and use of heregulin |
US5237056A (en) * | 1991-05-29 | 1993-08-17 | President And Fellows Of Harvard College | DNA encoding a protein which copurifies with acetylcholine receptor inducing activity and uses therefor |
NZ252486A (en) * | 1992-04-29 | 1997-08-22 | Univ Georgetown | Blocking peptides capable of binding to ligands of the epidermal growth factor receptor (erbb-2) and their use in detecting such ligands |
KR950701379A (en) * | 1992-04-29 | 1995-03-23 | 예다 리서치 앤드 디벨럽먼트 캄파니 리미티드 | RECOMBINANT STIMULATING FACTOR OF THE neu RECEPTOR |
GB9217316D0 (en) * | 1992-08-14 | 1992-09-30 | Ludwig Inst Cancer Res | Schwann cell mitogenic factor,its preparation and use |
-
1992
- 1992-05-20 IL IL101943A patent/IL101943A0/en not_active IP Right Cessation
- 1992-05-21 CA CA002331239A patent/CA2331239A1/en not_active Abandoned
- 1992-05-21 JP JP50030293A patent/JP3595552B2/en not_active Expired - Lifetime
- 1992-05-21 ES ES92915976T patent/ES2206448T3/en not_active Expired - Lifetime
- 1992-05-21 DK DK92915976T patent/DK0586607T3/en active
- 1992-05-21 AU AU23474/92A patent/AU659863B2/en not_active Expired
- 1992-05-21 EP EP92915976A patent/EP0586607B1/en not_active Expired - Lifetime
- 1992-05-21 EP EP01100940A patent/EP1114863A3/en not_active Withdrawn
- 1992-05-21 CA CA002108473A patent/CA2108473C/en not_active Expired - Lifetime
- 1992-05-21 WO PCT/US1992/004295 patent/WO1992020798A1/en active IP Right Grant
- 1992-05-21 DE DE69233137T patent/DE69233137T2/en not_active Expired - Lifetime
- 1992-05-22 NZ NZ242857A patent/NZ242857A/en not_active IP Right Cessation
- 1992-07-01 IE IE20010175A patent/IE20010175A1/en not_active IP Right Cessation
- 1992-07-01 IE IE165492A patent/IE921654A1/en not_active IP Right Cessation
-
1995
- 1995-05-31 US US08/456,201 patent/US5641869A/en not_active Expired - Lifetime
- 1995-05-31 US US08/456,241 patent/US5840525A/en not_active Expired - Lifetime
-
1998
- 1998-03-29 IL IL12387398A patent/IL123873A0/en unknown
- 1998-03-29 IL IL12387298A patent/IL123872A0/en not_active IP Right Cessation
-
2001
- 2001-08-14 IL IL14490401A patent/IL144904A0/en unknown
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP0586607B1 (en) | HEREGULINS (HRGs), BINDING PROTEINS OF P185?erb2 | |
US5367060A (en) | Structure, production and use of heregulin | |
US6399746B1 (en) | Structure, production and use of heregulin 2 ligands | |
IE83896B1 (en) | Heregulins (HRGs), binding proteins of P185erb2 | |
US5286654A (en) | Detection and purification of activin polypeptide | |
CA2175893C (en) | Protein tyrosine kinases named rse | |
US5763213A (en) | Sensory and motor neuron derived factor (SMDF) | |
AU705107B2 (en) | HTK ligand | |
JPH09510612A (en) | Human trk receptor and neurotrophic factor inhibitor | |
US5635601A (en) | Beta-8 integrin subunit antibodies | |
Vandlen et al. | Method of using HRG2-α to stimulate P185 HeR2 | |
Vandlen et al. | Polynucleotides encoding polypeptides that bind to the P185 HER2 receptor |