AU713198B2 - Cloning, expression and uses of a novel secreted protein, F-spondin - Google Patents
Cloning, expression and uses of a novel secreted protein, F-spondin Download PDFInfo
- Publication number
- AU713198B2 AU713198B2 AU12698/97A AU1269897A AU713198B2 AU 713198 B2 AU713198 B2 AU 713198B2 AU 12698/97 A AU12698/97 A AU 12698/97A AU 1269897 A AU1269897 A AU 1269897A AU 713198 B2 AU713198 B2 AU 713198B2
- Authority
- AU
- Australia
- Prior art keywords
- spondin
- pro
- lys
- ala
- lou
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
- 101710092167 Spondin-1 Proteins 0.000 title description 138
- 102100036428 Spondin-1 Human genes 0.000 title description 135
- 108090000623 proteins and genes Proteins 0.000 title description 86
- 102000004169 proteins and genes Human genes 0.000 title description 77
- 238000010367 cloning Methods 0.000 title description 5
- 210000004027 cell Anatomy 0.000 claims description 126
- 210000002569 neuron Anatomy 0.000 claims description 66
- 150000007523 nucleic acids Chemical class 0.000 claims description 37
- 238000009396 hybridization Methods 0.000 claims description 36
- 108020004707 nucleic acids Proteins 0.000 claims description 36
- 102000039446 nucleic acids Human genes 0.000 claims description 36
- 238000000034 method Methods 0.000 claims description 34
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 31
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 28
- 229920001184 polypeptide Polymers 0.000 claims description 26
- 101000642262 Homo sapiens Spondin-1 Proteins 0.000 claims description 21
- 230000012010 growth Effects 0.000 claims description 17
- 239000011159 matrix material Substances 0.000 claims description 15
- 239000002773 nucleotide Substances 0.000 claims description 14
- 125000003729 nucleotide group Chemical group 0.000 claims description 14
- 241000282414 Homo sapiens Species 0.000 claims description 10
- 230000000694 effects Effects 0.000 claims description 10
- 230000005709 nerve cell growth Effects 0.000 claims description 10
- 230000004936 stimulating effect Effects 0.000 claims description 8
- 239000003937 drug carrier Substances 0.000 claims description 7
- 108020004711 Nucleic Acid Probes Proteins 0.000 claims description 4
- 230000000295 complement effect Effects 0.000 claims description 4
- 239000002853 nucleic acid probe Substances 0.000 claims description 4
- 239000008194 pharmaceutical composition Substances 0.000 claims description 4
- 230000001172 regenerating effect Effects 0.000 claims description 4
- 102000050159 human SPON1 Human genes 0.000 claims 20
- 239000003814 drug Substances 0.000 claims 4
- 238000002360 preparation method Methods 0.000 claims 4
- 230000008929 regeneration Effects 0.000 claims 2
- 238000011069 regeneration method Methods 0.000 claims 2
- 230000000638 stimulation Effects 0.000 claims 2
- 235000018102 proteins Nutrition 0.000 description 75
- 210000000278 spinal cord Anatomy 0.000 description 60
- 239000002299 complementary DNA Substances 0.000 description 47
- 108020004999 messenger RNA Proteins 0.000 description 41
- 210000003050 axon Anatomy 0.000 description 31
- 235000001014 amino acid Nutrition 0.000 description 29
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 28
- 108060008245 Thrombospondin Proteins 0.000 description 28
- 102000002938 Thrombospondin Human genes 0.000 description 28
- 150000001413 amino acids Chemical class 0.000 description 28
- 210000002241 neurite Anatomy 0.000 description 26
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 23
- 108020004414 DNA Proteins 0.000 description 20
- 241000700159 Rattus Species 0.000 description 18
- 239000000523 sample Substances 0.000 description 18
- 210000003594 spinal ganglia Anatomy 0.000 description 17
- 238000000338 in vitro Methods 0.000 description 14
- 125000003275 alpha amino acid group Chemical group 0.000 description 13
- 239000002609 medium Substances 0.000 description 13
- 239000000758 substrate Substances 0.000 description 13
- HTTJABKRGRZYRN-UHFFFAOYSA-N Heparin Chemical compound OC1C(NC(=O)C)C(O)OC(COS(O)(=O)=O)C1OC1C(OS(O)(=O)=O)C(O)C(OC2C(C(OS(O)(=O)=O)C(OC3C(C(O)C(O)C(O3)C(O)=O)OS(O)(=O)=O)C(CO)O2)NS(O)(=O)=O)C(C(O)=O)O1 HTTJABKRGRZYRN-UHFFFAOYSA-N 0.000 description 12
- 101000642260 Rattus norvegicus Spondin-1 Proteins 0.000 description 12
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 12
- 230000006870 function Effects 0.000 description 12
- 229960002897 heparin Drugs 0.000 description 11
- 210000000276 neural tube Anatomy 0.000 description 11
- 230000014511 neuron projection development Effects 0.000 description 11
- 239000003795 chemical substances by application Substances 0.000 description 10
- 229920000669 heparin Polymers 0.000 description 10
- 210000000578 peripheral nerve Anatomy 0.000 description 10
- HXNYBZQLBWIADP-WDSKDSINSA-N Pro-Cys Chemical compound OC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 HXNYBZQLBWIADP-WDSKDSINSA-N 0.000 description 9
- 108010005233 alanylglutamic acid Proteins 0.000 description 9
- 108010047857 aspartylglycine Proteins 0.000 description 9
- 239000012634 fragment Substances 0.000 description 9
- 210000000653 nervous system Anatomy 0.000 description 9
- 239000013612 plasmid Substances 0.000 description 9
- 239000013598 vector Substances 0.000 description 9
- 238000004458 analytical method Methods 0.000 description 8
- 238000002474 experimental method Methods 0.000 description 8
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 8
- 108010054155 lysyllysine Proteins 0.000 description 8
- 238000001000 micrograph Methods 0.000 description 8
- 210000003061 neural cell Anatomy 0.000 description 8
- 230000001537 neural effect Effects 0.000 description 8
- 210000001519 tissue Anatomy 0.000 description 8
- 230000002861 ventricular Effects 0.000 description 8
- QOOWRKBDDXQRHC-BQBZGAKWSA-N L-lysyl-L-alanine Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN QOOWRKBDDXQRHC-BQBZGAKWSA-N 0.000 description 7
- 108010085895 Laminin Proteins 0.000 description 7
- 102000011830 Neural cell adhesion Human genes 0.000 description 7
- 108050002172 Neural cell adhesion Proteins 0.000 description 7
- 230000001070 adhesive effect Effects 0.000 description 7
- 108010087924 alanylproline Proteins 0.000 description 7
- 108010060199 cysteinylproline Proteins 0.000 description 7
- 238000007901 in situ hybridization Methods 0.000 description 7
- 230000004807 localization Effects 0.000 description 7
- 108010064235 lysylglycine Proteins 0.000 description 7
- 238000012216 screening Methods 0.000 description 7
- 108010026333 seryl-proline Proteins 0.000 description 7
- JHFNSBBHKSZXKB-VKHMYHEASA-N Asp-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(O)=O JHFNSBBHKSZXKB-VKHMYHEASA-N 0.000 description 6
- 108020004635 Complementary DNA Proteins 0.000 description 6
- 102000003886 Glycoproteins Human genes 0.000 description 6
- 108090000288 Glycoproteins Proteins 0.000 description 6
- 229920002683 Glycosaminoglycan Polymers 0.000 description 6
- QBGPXOGXCVKULO-BQBZGAKWSA-N Lys-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(O)=O QBGPXOGXCVKULO-BQBZGAKWSA-N 0.000 description 6
- UEKYKRQIAQHOOZ-KBPBESRZSA-N Pro-Trp Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)[O-])C(=O)[C@@H]1CCC[NH2+]1 UEKYKRQIAQHOOZ-KBPBESRZSA-N 0.000 description 6
- 210000004556 brain Anatomy 0.000 description 6
- 230000021164 cell adhesion Effects 0.000 description 6
- 230000004069 differentiation Effects 0.000 description 6
- 210000002257 embryonic structure Anatomy 0.000 description 6
- 102000005962 receptors Human genes 0.000 description 6
- 108020003175 receptors Proteins 0.000 description 6
- 108020004705 Codon Proteins 0.000 description 5
- 102000053602 DNA Human genes 0.000 description 5
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 5
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 5
- YSZNURNVYFUEHC-BQBZGAKWSA-N Lys-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(O)=O YSZNURNVYFUEHC-BQBZGAKWSA-N 0.000 description 5
- 108010005642 Properdin Proteins 0.000 description 5
- 102100038567 Properdin Human genes 0.000 description 5
- 210000003169 central nervous system Anatomy 0.000 description 5
- 108010016616 cysteinylglycine Proteins 0.000 description 5
- 238000011161 development Methods 0.000 description 5
- 230000018109 developmental process Effects 0.000 description 5
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 5
- 108010089804 glycyl-threonine Proteins 0.000 description 5
- 108010087823 glycyltyrosine Proteins 0.000 description 5
- 230000005764 inhibitory process Effects 0.000 description 5
- 229930182817 methionine Natural products 0.000 description 5
- 210000003458 notochord Anatomy 0.000 description 5
- 239000002953 phosphate buffered saline Substances 0.000 description 5
- 108010070643 prolylglutamic acid Proteins 0.000 description 5
- 108010029020 prolylglycine Proteins 0.000 description 5
- 108010053725 prolylvaline Proteins 0.000 description 5
- 210000001202 rhombencephalon Anatomy 0.000 description 5
- 230000001953 sensory effect Effects 0.000 description 5
- ROWCTNFEMKOIFQ-YUMQZZPRSA-N Arg-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCNC(N)=N ROWCTNFEMKOIFQ-YUMQZZPRSA-N 0.000 description 4
- FKBFDTRILNZGAI-IMJSIDKUSA-N Asp-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(O)=O FKBFDTRILNZGAI-IMJSIDKUSA-N 0.000 description 4
- 108091026890 Coding region Proteins 0.000 description 4
- YFAFBAPQHGULQT-HJPIBITLSA-N Cys-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N YFAFBAPQHGULQT-HJPIBITLSA-N 0.000 description 4
- XCDDSPYIMNXECQ-NAKRPEOUSA-N Cys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CS XCDDSPYIMNXECQ-NAKRPEOUSA-N 0.000 description 4
- 241000287828 Gallus gallus Species 0.000 description 4
- OLIFSFOFKGKIRH-WUJLRWPWSA-N Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CN OLIFSFOFKGKIRH-WUJLRWPWSA-N 0.000 description 4
- 108010069196 Neural Cell Adhesion Molecules Proteins 0.000 description 4
- 239000000020 Nitrocellulose Substances 0.000 description 4
- 108700026244 Open Reading Frames Proteins 0.000 description 4
- VPBQDHMASPJHGY-JYJNAYRXSA-N Pro-Trp-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CO)C(=O)O VPBQDHMASPJHGY-JYJNAYRXSA-N 0.000 description 4
- AWJGUZSYVIVZGP-YUMQZZPRSA-N Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 AWJGUZSYVIVZGP-YUMQZZPRSA-N 0.000 description 4
- 108010076504 Protein Sorting Signals Proteins 0.000 description 4
- CKHWEVXPLJBEOZ-VQVTYTSYSA-N Thr-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])[C@@H](C)O CKHWEVXPLJBEOZ-VQVTYTSYSA-N 0.000 description 4
- UIRPULWLRODAEQ-QEJZJMRPSA-N Trp-Ser-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 UIRPULWLRODAEQ-QEJZJMRPSA-N 0.000 description 4
- 101100043251 Xenopus laevis spon1 gene Proteins 0.000 description 4
- 239000000853 adhesive Substances 0.000 description 4
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 4
- 108010068380 arginylarginine Proteins 0.000 description 4
- 108010060035 arginylproline Proteins 0.000 description 4
- 108010093581 aspartyl-proline Proteins 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- 230000004071 biological effect Effects 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 4
- 239000002975 chemoattractant Substances 0.000 description 4
- 210000003837 chick embryo Anatomy 0.000 description 4
- 238000009826 distribution Methods 0.000 description 4
- 239000013604 expression vector Substances 0.000 description 4
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 4
- 210000000020 growth cone Anatomy 0.000 description 4
- 238000010166 immunofluorescence Methods 0.000 description 4
- 229920001220 nitrocellulos Polymers 0.000 description 4
- 108010073969 valyllysine Proteins 0.000 description 4
- SQDAZGGFXASXDW-UHFFFAOYSA-N 5-bromo-2-(trifluoromethoxy)pyridine Chemical compound FC(F)(F)OC1=CC=C(Br)C=N1 SQDAZGGFXASXDW-UHFFFAOYSA-N 0.000 description 3
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 3
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 3
- QKHWNPQNOHEFST-VZFHVOOUSA-N Ala-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N)O QKHWNPQNOHEFST-VZFHVOOUSA-N 0.000 description 3
- IJYZHIOOBGIINM-WDSKDSINSA-N Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N IJYZHIOOBGIINM-WDSKDSINSA-N 0.000 description 3
- LDGUZSIPGSPBJP-XVYDVKMFSA-N Asp-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N LDGUZSIPGSPBJP-XVYDVKMFSA-N 0.000 description 3
- YZQCXOFQZKCETR-UWVGGRQHSA-N Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YZQCXOFQZKCETR-UWVGGRQHSA-N 0.000 description 3
- 229920001287 Chondroitin sulfate Polymers 0.000 description 3
- BUXAPSQPMALTOY-WHFBIAKZSA-N Cys-Glu Chemical compound SC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O BUXAPSQPMALTOY-WHFBIAKZSA-N 0.000 description 3
- DZSICRGTVPDCRN-YUMQZZPRSA-N Cys-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N DZSICRGTVPDCRN-YUMQZZPRSA-N 0.000 description 3
- VXLXATVURDNDCG-CIUDSAMLSA-N Cys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N VXLXATVURDNDCG-CIUDSAMLSA-N 0.000 description 3
- 102000010834 Extracellular Matrix Proteins Human genes 0.000 description 3
- 108010037362 Extracellular Matrix Proteins Proteins 0.000 description 3
- 108010067306 Fibronectins Proteins 0.000 description 3
- 102000016359 Fibronectins Human genes 0.000 description 3
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 description 3
- 102000020897 Formins Human genes 0.000 description 3
- 108091022623 Formins Proteins 0.000 description 3
- 101000642253 Gallus gallus Spondin-1 Proteins 0.000 description 3
- FYYSIASRLDJUNP-WHFBIAKZSA-N Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O FYYSIASRLDJUNP-WHFBIAKZSA-N 0.000 description 3
- BBBXWRGITSUJPB-YUMQZZPRSA-N Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O BBBXWRGITSUJPB-YUMQZZPRSA-N 0.000 description 3
- JSIQVRIXMINMTA-ZDLURKLDSA-N Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O JSIQVRIXMINMTA-ZDLURKLDSA-N 0.000 description 3
- LLEUXCDZPQOJMY-AAEUAGOBSA-N Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 LLEUXCDZPQOJMY-AAEUAGOBSA-N 0.000 description 3
- SUDUYJOBLHQAMI-WHFBIAKZSA-N Gly-Asp-Cys Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O SUDUYJOBLHQAMI-WHFBIAKZSA-N 0.000 description 3
- XBGGUPMXALFZOT-VIFPVBQESA-N Gly-Tyr Chemical compound NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-VIFPVBQESA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- HJUPAYWVVVRYFQ-PYJNHQTQSA-N His-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CN=CN1)N HJUPAYWVVVRYFQ-PYJNHQTQSA-N 0.000 description 3
- RCFDOSNHHZGBOY-ACZMJKKPSA-N Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(O)=O RCFDOSNHHZGBOY-ACZMJKKPSA-N 0.000 description 3
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 3
- ZUKPVRWZDMRIEO-VKHMYHEASA-N L-cysteinylglycine Chemical compound SC[C@H]([NH3+])C(=O)NCC([O-])=O ZUKPVRWZDMRIEO-VKHMYHEASA-N 0.000 description 3
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 3
- NPBGTPKLVJEOBE-IUCAKERBSA-N Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N NPBGTPKLVJEOBE-IUCAKERBSA-N 0.000 description 3
- XFANQCRHTMOEAP-WDSOQIARSA-N Lys-Pro-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XFANQCRHTMOEAP-WDSOQIARSA-N 0.000 description 3
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 3
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 3
- 102000001068 Neural Cell Adhesion Molecules Human genes 0.000 description 3
- 229930040373 Paraformaldehyde Natural products 0.000 description 3
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 3
- WBAXJMCUFIXCNI-WDSKDSINSA-N Ser-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WBAXJMCUFIXCNI-WDSKDSINSA-N 0.000 description 3
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 3
- VULNJDORNLBPNG-SWRJLBSHSA-N Thr-Glu-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VULNJDORNLBPNG-SWRJLBSHSA-N 0.000 description 3
- QOLYAJSZHIJCTO-VQVTYTSYSA-N Thr-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O QOLYAJSZHIJCTO-VQVTYTSYSA-N 0.000 description 3
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 3
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 3
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 3
- HPYDSVWYXXKHRD-VIFPVBQESA-N Tyr-Gly Chemical compound [O-]C(=O)CNC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 HPYDSVWYXXKHRD-VIFPVBQESA-N 0.000 description 3
- KGSDLCMCDFETHU-YESZJQIVSA-N Tyr-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O KGSDLCMCDFETHU-YESZJQIVSA-N 0.000 description 3
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 3
- VDPRBUOZLIFUIM-GUBZILKMSA-N Val-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N VDPRBUOZLIFUIM-GUBZILKMSA-N 0.000 description 3
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 3
- GIAZPLMMQOERPN-YUMQZZPRSA-N Val-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O GIAZPLMMQOERPN-YUMQZZPRSA-N 0.000 description 3
- HPOSMQWRPMRMFO-GUBZILKMSA-N Val-Pro-Cys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HPOSMQWRPMRMFO-GUBZILKMSA-N 0.000 description 3
- GVRKWABULJAONN-VQVTYTSYSA-N Val-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVRKWABULJAONN-VQVTYTSYSA-N 0.000 description 3
- 239000002253 acid Substances 0.000 description 3
- 230000001464 adherent effect Effects 0.000 description 3
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- 210000004102 animal cell Anatomy 0.000 description 3
- 230000000890 antigenic effect Effects 0.000 description 3
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 3
- 108010028263 bacteriophage T3 RNA polymerase Proteins 0.000 description 3
- 210000002469 basement membrane Anatomy 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 229940059329 chondroitin sulfate Drugs 0.000 description 3
- 239000003636 conditioned culture medium Substances 0.000 description 3
- 230000013020 embryo development Effects 0.000 description 3
- 210000002744 extracellular matrix Anatomy 0.000 description 3
- 108010037850 glycylvaline Proteins 0.000 description 3
- 108010085325 histidylproline Proteins 0.000 description 3
- 238000001727 in vivo Methods 0.000 description 3
- 108010044426 integrins Proteins 0.000 description 3
- 102000006495 integrins Human genes 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- 108010078274 isoleucylvaline Proteins 0.000 description 3
- 210000003734 kidney Anatomy 0.000 description 3
- 238000002372 labelling Methods 0.000 description 3
- 108010003700 lysyl aspartic acid Proteins 0.000 description 3
- 108010017391 lysylvaline Proteins 0.000 description 3
- 210000004962 mammalian cell Anatomy 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 210000001259 mesencephalon Anatomy 0.000 description 3
- 108010056582 methionylglutamic acid Proteins 0.000 description 3
- 108010085203 methionylmethionine Proteins 0.000 description 3
- 230000003988 neural development Effects 0.000 description 3
- 229920002866 paraformaldehyde Polymers 0.000 description 3
- 230000002093 peripheral effect Effects 0.000 description 3
- 239000008363 phosphate buffer Substances 0.000 description 3
- 239000000047 product Substances 0.000 description 3
- 108010031719 prolyl-serine Proteins 0.000 description 3
- 230000002797 proteolythic effect Effects 0.000 description 3
- 210000004116 schwann cell Anatomy 0.000 description 3
- 230000028327 secretion Effects 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- 230000011664 signaling Effects 0.000 description 3
- 230000007839 spinal cord development Effects 0.000 description 3
- 108010061238 threonyl-glycine Proteins 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 2
- JQDFGZKKXBEANU-IMJSIDKUSA-N Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(O)=O JQDFGZKKXBEANU-IMJSIDKUSA-N 0.000 description 2
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 2
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 2
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 2
- IPWKGIFRRBGCJO-IMJSIDKUSA-N Ala-Ser Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](CO)C([O-])=O IPWKGIFRRBGCJO-IMJSIDKUSA-N 0.000 description 2
- 108020005544 Antisense RNA Proteins 0.000 description 2
- HULHGJZIZXCPLD-FXQIFTODSA-N Arg-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HULHGJZIZXCPLD-FXQIFTODSA-N 0.000 description 2
- OMLWNBVRVJYMBQ-YUMQZZPRSA-N Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OMLWNBVRVJYMBQ-YUMQZZPRSA-N 0.000 description 2
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 2
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 2
- XUUXCWCKKCZEAW-YFKPBYRVSA-N Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N XUUXCWCKKCZEAW-YFKPBYRVSA-N 0.000 description 2
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 2
- BNODVYXZAAXSHW-IUCAKERBSA-N Arg-His Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 BNODVYXZAAXSHW-IUCAKERBSA-N 0.000 description 2
- QYLJIYOGHRGUIH-CIUDSAMLSA-N Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCNC(N)=N QYLJIYOGHRGUIH-CIUDSAMLSA-N 0.000 description 2
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 2
- LQJAALCCPOTJGB-YUMQZZPRSA-N Arg-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O LQJAALCCPOTJGB-YUMQZZPRSA-N 0.000 description 2
- FRYULLIZUDQONW-IMJSIDKUSA-N Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O FRYULLIZUDQONW-IMJSIDKUSA-N 0.000 description 2
- CKAJHWFHHFSCDT-WHFBIAKZSA-N Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O CKAJHWFHHFSCDT-WHFBIAKZSA-N 0.000 description 2
- UKGGPJNBONZZCM-WDSKDSINSA-N Asp-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O UKGGPJNBONZZCM-WDSKDSINSA-N 0.000 description 2
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 2
- NTQDELBZOMWXRS-IWGUZYHVSA-N Asp-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O NTQDELBZOMWXRS-IWGUZYHVSA-N 0.000 description 2
- 102000049320 CD36 Human genes 0.000 description 2
- 108010045374 CD36 Antigens Proteins 0.000 description 2
- 101100512078 Caenorhabditis elegans lys-1 gene Proteins 0.000 description 2
- 241000283707 Capra Species 0.000 description 2
- 108010005939 Ciliary Neurotrophic Factor Proteins 0.000 description 2
- 102100031614 Ciliary neurotrophic factor Human genes 0.000 description 2
- 108010069112 Complement System Proteins Proteins 0.000 description 2
- 102000000989 Complement System Proteins Human genes 0.000 description 2
- SZQCDCKIGWQAQN-FXQIFTODSA-N Cys-Arg-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O SZQCDCKIGWQAQN-FXQIFTODSA-N 0.000 description 2
- MUZAUPFGPMMZSS-GUBZILKMSA-N Cys-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N MUZAUPFGPMMZSS-GUBZILKMSA-N 0.000 description 2
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 2
- MTNUYDIILCWPEP-GUBZILKMSA-N Cys-Met-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CS MTNUYDIILCWPEP-GUBZILKMSA-N 0.000 description 2
- NDNZRWUDUMTITL-FXQIFTODSA-N Cys-Ser-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NDNZRWUDUMTITL-FXQIFTODSA-N 0.000 description 2
- IRKLTAKLAFUTLA-KATARQTJSA-N Cys-Thr-Lys Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CCCCN)C(O)=O IRKLTAKLAFUTLA-KATARQTJSA-N 0.000 description 2
- NGOIQDYZMIKCOK-NAKRPEOUSA-N Cys-Val-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NGOIQDYZMIKCOK-NAKRPEOUSA-N 0.000 description 2
- 239000003298 DNA probe Substances 0.000 description 2
- 238000001712 DNA sequencing Methods 0.000 description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 2
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 2
- 229920002307 Dextran Polymers 0.000 description 2
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 2
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 2
- PABVKUJVLNMOJP-WHFBIAKZSA-N Glu-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(O)=O PABVKUJVLNMOJP-WHFBIAKZSA-N 0.000 description 2
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 2
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 2
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 2
- SNFUTDLOCQQRQD-ZKWXMUAHSA-N Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SNFUTDLOCQQRQD-ZKWXMUAHSA-N 0.000 description 2
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 2
- OHWJUIXZHVIXJJ-GUBZILKMSA-N Glu-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N OHWJUIXZHVIXJJ-GUBZILKMSA-N 0.000 description 2
- JNGJGFMFXREJNF-KBPBESRZSA-N Gly-Glu-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JNGJGFMFXREJNF-KBPBESRZSA-N 0.000 description 2
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 2
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 2
- KGVHCTWYMPWEGN-FSPLSTOPSA-N Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CN KGVHCTWYMPWEGN-FSPLSTOPSA-N 0.000 description 2
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 2
- ZWRDOVYMQAAISL-UWVGGRQHSA-N Gly-Met-Lys Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCCN ZWRDOVYMQAAISL-UWVGGRQHSA-N 0.000 description 2
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 2
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 2
- MREVELMMFOLESM-HOCLYGCPSA-N Gly-Trp-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O MREVELMMFOLESM-HOCLYGCPSA-N 0.000 description 2
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 2
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 2
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- 229920002971 Heparan sulfate Polymers 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- LNCFUHAPNTYMJB-IUCAKERBSA-N His-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CN=CN1 LNCFUHAPNTYMJB-IUCAKERBSA-N 0.000 description 2
- HBGKOLSGLYMWSW-DCAQKATOSA-N His-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CS)C(=O)O HBGKOLSGLYMWSW-DCAQKATOSA-N 0.000 description 2
- FBTYOQIYBULKEH-ZFWWWQNUSA-N His-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CNC=N1 FBTYOQIYBULKEH-ZFWWWQNUSA-N 0.000 description 2
- HYXQKVOADYPQEA-CIUDSAMLSA-N Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HYXQKVOADYPQEA-CIUDSAMLSA-N 0.000 description 2
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 2
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 2
- UWLHDGMRWXHFFY-HPCHECBXSA-N Ile-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N1CCC[C@@H]1C(=O)O)N UWLHDGMRWXHFFY-HPCHECBXSA-N 0.000 description 2
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 2
- BCXBIONYYJCSDF-CIUDSAMLSA-N Ile-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(O)=O BCXBIONYYJCSDF-CIUDSAMLSA-N 0.000 description 2
- 108010058683 Immobilized Proteins Proteins 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- 102000015696 Interleukins Human genes 0.000 description 2
- 108010063738 Interleukins Proteins 0.000 description 2
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 2
- QLROSWPKSBORFJ-BQBZGAKWSA-N L-Prolyl-L-glutamic acid Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 QLROSWPKSBORFJ-BQBZGAKWSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 2
- 102100032352 Leukemia inhibitory factor Human genes 0.000 description 2
- 108090000581 Leukemia inhibitory factor Proteins 0.000 description 2
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 2
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 2
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 2
- PXHCFKXNSBJSTQ-KKUMJFAQSA-N Lys-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)O PXHCFKXNSBJSTQ-KKUMJFAQSA-N 0.000 description 2
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 2
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 2
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 2
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 2
- HGNRJCINZYHNOU-LURJTMIESA-N Lys-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(O)=O HGNRJCINZYHNOU-LURJTMIESA-N 0.000 description 2
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 2
- NVGBPTNZLWRQSY-UWVGGRQHSA-N Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN NVGBPTNZLWRQSY-UWVGGRQHSA-N 0.000 description 2
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 2
- ZOKVLMBYDSIDKG-CSMHCCOUSA-N Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ZOKVLMBYDSIDKG-CSMHCCOUSA-N 0.000 description 2
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 2
- UASDAHIAHBRZQV-YUMQZZPRSA-N Met-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N UASDAHIAHBRZQV-YUMQZZPRSA-N 0.000 description 2
- IZLCDZDNZFEDHB-DCAQKATOSA-N Met-Cys-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N IZLCDZDNZFEDHB-DCAQKATOSA-N 0.000 description 2
- ADHNYKZHPOEULM-BQBZGAKWSA-N Met-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O ADHNYKZHPOEULM-BQBZGAKWSA-N 0.000 description 2
- OGAZPKJHHZPYFK-GARJFASQSA-N Met-Glu-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGAZPKJHHZPYFK-GARJFASQSA-N 0.000 description 2
- RRIHXWPHQSXHAQ-XUXIUFHCSA-N Met-Ile-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O RRIHXWPHQSXHAQ-XUXIUFHCSA-N 0.000 description 2
- OBPCXINRFKHSRY-SDDRHHMPSA-N Met-Met-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N OBPCXINRFKHSRY-SDDRHHMPSA-N 0.000 description 2
- NDJSSFWDYDUQID-YTWAJWBKSA-N Met-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N)O NDJSSFWDYDUQID-YTWAJWBKSA-N 0.000 description 2
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 2
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 2
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 2
- 230000004988 N-glycosylation Effects 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- 108091028043 Nucleic acid sequence Proteins 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- WEQJQNWXCSUVMA-RYUDHWBXSA-N Phe-Pro Chemical compound C([C@H]([NH3+])C(=O)N1[C@@H](CCC1)C([O-])=O)C1=CC=CC=C1 WEQJQNWXCSUVMA-RYUDHWBXSA-N 0.000 description 2
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 2
- ONPFOYPPPOHMNH-UVBJJODRSA-N Pro-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@@H]3CCCN3 ONPFOYPPPOHMNH-UVBJJODRSA-N 0.000 description 2
- GLEOIKLQBZNKJZ-WDSKDSINSA-N Pro-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 GLEOIKLQBZNKJZ-WDSKDSINSA-N 0.000 description 2
- ZYBUKTMPPFQSHL-JYJNAYRXSA-N Pro-Asp-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ZYBUKTMPPFQSHL-JYJNAYRXSA-N 0.000 description 2
- TUYWCHPXKQTISF-LPEHRKFASA-N Pro-Cys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N2CCC[C@@H]2C(=O)O TUYWCHPXKQTISF-LPEHRKFASA-N 0.000 description 2
- QCARZLHECSFOGG-CIUDSAMLSA-N Pro-Glu-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O QCARZLHECSFOGG-CIUDSAMLSA-N 0.000 description 2
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 2
- QNZLIVROMORQFH-BQBZGAKWSA-N Pro-Gly-Cys Chemical compound C1C[C@H](NC1)C(=O)NCC(=O)N[C@@H](CS)C(=O)O QNZLIVROMORQFH-BQBZGAKWSA-N 0.000 description 2
- RVQDZELMXZRSSI-IUCAKERBSA-N Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 RVQDZELMXZRSSI-IUCAKERBSA-N 0.000 description 2
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 2
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 2
- BNUKRHFCHHLIGR-JYJNAYRXSA-N Pro-Trp-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC(=O)O)C(=O)O BNUKRHFCHHLIGR-JYJNAYRXSA-N 0.000 description 2
- VBZXFFYOBDLLFE-HSHDSVGOSA-N Pro-Trp-Thr Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H]([C@H](O)C)C(O)=O)C(=O)[C@@H]1CCCN1 VBZXFFYOBDLLFE-HSHDSVGOSA-N 0.000 description 2
- 102000001708 Protein Isoforms Human genes 0.000 description 2
- 108010029485 Protein Isoforms Proteins 0.000 description 2
- 102000016611 Proteoglycans Human genes 0.000 description 2
- 108010067787 Proteoglycans Proteins 0.000 description 2
- SSJMZMUVNKEENT-IMJSIDKUSA-N Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CO SSJMZMUVNKEENT-IMJSIDKUSA-N 0.000 description 2
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 2
- BRIZMMZEYSAKJX-QEJZJMRPSA-N Ser-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N BRIZMMZEYSAKJX-QEJZJMRPSA-N 0.000 description 2
- XQAPEISNMXNKGE-FXQIFTODSA-N Ser-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CS)C(=O)O XQAPEISNMXNKGE-FXQIFTODSA-N 0.000 description 2
- SOACHCFYJMCMHC-BWBBJGPYSA-N Ser-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)O SOACHCFYJMCMHC-BWBBJGPYSA-N 0.000 description 2
- BQCADISMDOOEFD-UHFFFAOYSA-N Silver Chemical compound [Ag] BQCADISMDOOEFD-UHFFFAOYSA-N 0.000 description 2
- CDBYLPFSWZWCQE-UHFFFAOYSA-L Sodium Carbonate Chemical compound [Na+].[Na+].[O-]C([O-])=O CDBYLPFSWZWCQE-UHFFFAOYSA-L 0.000 description 2
- 101710137500 T7 RNA polymerase Proteins 0.000 description 2
- 102000007000 Tenascin Human genes 0.000 description 2
- 108010008125 Tenascin Proteins 0.000 description 2
- NFMPFBCXABPALN-OWLDWWDNSA-N Thr-Ala-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O NFMPFBCXABPALN-OWLDWWDNSA-N 0.000 description 2
- HYLXOQURIOCKIH-VQVTYTSYSA-N Thr-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N HYLXOQURIOCKIH-VQVTYTSYSA-N 0.000 description 2
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 description 2
- ZLNWJMRLHLGKFX-SVSWQMSJSA-N Thr-Cys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZLNWJMRLHLGKFX-SVSWQMSJSA-N 0.000 description 2
- APIDTRXFGYOLLH-VQVTYTSYSA-N Thr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)[C@@H](C)O APIDTRXFGYOLLH-VQVTYTSYSA-N 0.000 description 2
- WRUWXBBEFUTJOU-XGEHTFHBSA-N Thr-Met-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N)O WRUWXBBEFUTJOU-XGEHTFHBSA-N 0.000 description 2
- GXDLGHLJTHMDII-WISUUJSJSA-N Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(O)=O GXDLGHLJTHMDII-WISUUJSJSA-N 0.000 description 2
- NBIIPOKZPUGATB-BWBBJGPYSA-N Thr-Ser-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O NBIIPOKZPUGATB-BWBBJGPYSA-N 0.000 description 2
- 101710104031 Thrombospondin-related anonymous protein Proteins 0.000 description 2
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical group O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 2
- 101710150448 Transcriptional regulator Myc Proteins 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- HXNVJPQADLRHGR-JBACZVJFSA-N Trp-Glu-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N HXNVJPQADLRHGR-JBACZVJFSA-N 0.000 description 2
- OJKVFAWXPGCJMF-BPUTZDHNSA-N Trp-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)N[C@@H](CO)C(=O)O OJKVFAWXPGCJMF-BPUTZDHNSA-N 0.000 description 2
- MYVYPSWUSKCCHG-JQWIXIFHSA-N Trp-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 MYVYPSWUSKCCHG-JQWIXIFHSA-N 0.000 description 2
- GEGYPBOPIGNZIF-CWRNSKLLSA-N Trp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O GEGYPBOPIGNZIF-CWRNSKLLSA-N 0.000 description 2
- YCQXZDHDSUHUSG-FJHTZYQYSA-N Trp-Thr-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 YCQXZDHDSUHUSG-FJHTZYQYSA-N 0.000 description 2
- NQIHMZLGCZNZBN-PXNSSMCTSA-N Trp-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC=3C4=CC=CC=C4NC=3)N)C(O)=O)=CNC2=C1 NQIHMZLGCZNZBN-PXNSSMCTSA-N 0.000 description 2
- 102000004142 Trypsin Human genes 0.000 description 2
- 108090000631 Trypsin Proteins 0.000 description 2
- RGYCVIZZTUBSSG-JYJNAYRXSA-N Tyr-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O RGYCVIZZTUBSSG-JYJNAYRXSA-N 0.000 description 2
- LVFZXRQQQDTBQH-IRIUXVKKSA-N Tyr-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LVFZXRQQQDTBQH-IRIUXVKKSA-N 0.000 description 2
- GOPQNCQSXBJAII-ULQDDVLXSA-N Tyr-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GOPQNCQSXBJAII-ULQDDVLXSA-N 0.000 description 2
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 2
- YTUABZMPYKCWCQ-XQQFMLRXSA-N Val-His-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N YTUABZMPYKCWCQ-XQQFMLRXSA-N 0.000 description 2
- PNVLWFYAPWAQMU-CIUDSAMLSA-N Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)C(C)C PNVLWFYAPWAQMU-CIUDSAMLSA-N 0.000 description 2
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 2
- IOUPEELXVYPCPG-UHFFFAOYSA-N Valylglycine Chemical compound CC(C)C(N)C(=O)NCC(O)=O IOUPEELXVYPCPG-UHFFFAOYSA-N 0.000 description 2
- 241000269370 Xenopus <genus> Species 0.000 description 2
- 150000007513 acids Chemical class 0.000 description 2
- 210000004960 anterior grey column Anatomy 0.000 description 2
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 2
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 230000003376 axonal effect Effects 0.000 description 2
- 229960002685 biotin Drugs 0.000 description 2
- 239000011616 biotin Substances 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 244000309466 calf Species 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- 230000004956 cell adhesive effect Effects 0.000 description 2
- 239000006285 cell suspension Substances 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 230000030570 cellular localization Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 230000000663 chemotropic effect Effects 0.000 description 2
- 230000030643 chemotropism Effects 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 230000024203 complement activation Effects 0.000 description 2
- 102000033815 complement binding proteins Human genes 0.000 description 2
- 108091009760 complement binding proteins Proteins 0.000 description 2
- 239000003184 complementary RNA Substances 0.000 description 2
- 230000001143 conditioned effect Effects 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 2
- 235000018417 cysteine Nutrition 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 229960002086 dextran Drugs 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 2
- 108010054812 diprotin A Proteins 0.000 description 2
- 239000000839 emulsion Substances 0.000 description 2
- 210000002919 epithelial cell Anatomy 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- KZNQNBZMBZJQJO-YFKPBYRVSA-N glyclproline Chemical compound NCC(=O)N1CCC[C@H]1C(O)=O KZNQNBZMBZJQJO-YFKPBYRVSA-N 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- 108010081551 glycylphenylalanine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- STKYPAFSDFAEPH-LURJTMIESA-N glycylvaline Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CN STKYPAFSDFAEPH-LURJTMIESA-N 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- 125000001165 hydrophobic group Chemical group 0.000 description 2
- 238000001114 immunoprecipitation Methods 0.000 description 2
- 238000011065 in-situ storage Methods 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 229940047122 interleukins Drugs 0.000 description 2
- 210000001153 interneuron Anatomy 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- 108010009114 laminin beta2 Proteins 0.000 description 2
- 210000004185 liver Anatomy 0.000 description 2
- 210000004072 lung Anatomy 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 210000001161 mammalian embryo Anatomy 0.000 description 2
- 201000001441 melanoma Diseases 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 230000036961 partial effect Effects 0.000 description 2
- 210000001428 peripheral nervous system Anatomy 0.000 description 2
- 230000037452 priming Effects 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 210000001044 sensory neuron Anatomy 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 239000004332 silver Substances 0.000 description 2
- 229910052709 silver Inorganic materials 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 210000000952 spleen Anatomy 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 230000014621 translational initiation Effects 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- 239000012588 trypsin Substances 0.000 description 2
- 108010045269 tryptophyltryptophan Proteins 0.000 description 2
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 1
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- CETWSOHVEGTIBR-FORAGAHYSA-N (2s)-2,6-diamino-n-[(2s)-1-phenylpropan-2-yl]hexanamide;methanesulfonic acid Chemical compound CS(O)(=O)=O.CS(O)(=O)=O.NCCCC[C@H](N)C(=O)N[C@@H](C)CC1=CC=CC=C1 CETWSOHVEGTIBR-FORAGAHYSA-N 0.000 description 1
- WDVIDPRACNGFPP-QWRGUYRKSA-N (2s)-2-[[(2s)-6-amino-2-[[2-[(2-aminoacetyl)amino]acetyl]amino]hexanoyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound NCC(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WDVIDPRACNGFPP-QWRGUYRKSA-N 0.000 description 1
- RVLOMLVNNBWRSR-KNIFDHDWSA-N (2s)-2-aminopropanoic acid;(2s)-2,6-diaminohexanoic acid Chemical compound C[C@H](N)C(O)=O.NCCCC[C@H](N)C(O)=O RVLOMLVNNBWRSR-KNIFDHDWSA-N 0.000 description 1
- CBCKQZAAMUWICA-UHFFFAOYSA-N 1,4-phenylenediamine Chemical compound NC1=CC=C(N)C=C1 CBCKQZAAMUWICA-UHFFFAOYSA-N 0.000 description 1
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 1
- VUFNLQXQSDUXKB-DOFZRALJSA-N 2-[4-[4-[bis(2-chloroethyl)amino]phenyl]butanoyloxy]ethyl (5z,8z,11z,14z)-icosa-5,8,11,14-tetraenoate Chemical compound CCCCC\C=C/C\C=C/C\C=C/C\C=C/CCCC(=O)OCCOC(=O)CCCC1=CC=C(N(CCCl)CCCl)C=C1 VUFNLQXQSDUXKB-DOFZRALJSA-N 0.000 description 1
- KDZIGQIDPXKMBA-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-3-methylbutanoyl)amino]acetyl]amino]-3-hydroxypropanoyl]amino]pentanedioic acid Chemical compound CC(C)C(N)C(=O)NCC(=O)NC(CO)C(=O)NC(C(O)=O)CCC(O)=O KDZIGQIDPXKMBA-UHFFFAOYSA-N 0.000 description 1
- BFSVOASYOCHEOV-UHFFFAOYSA-N 2-diethylaminoethanol Chemical compound CCN(CC)CCO BFSVOASYOCHEOV-UHFFFAOYSA-N 0.000 description 1
- PLIKAWJENQZMHA-UHFFFAOYSA-N 4-aminophenol Chemical compound NC1=CC=C(O)C=C1 PLIKAWJENQZMHA-UHFFFAOYSA-N 0.000 description 1
- 101150106774 9 gene Proteins 0.000 description 1
- 241001290610 Abildgaardia Species 0.000 description 1
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 1
- ZFXQNADNEBRERM-BJDJZHNGSA-N Ala-Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 ZFXQNADNEBRERM-BJDJZHNGSA-N 0.000 description 1
- HGRBNYQIMKTUNT-XVYDVKMFSA-N Ala-Asn-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HGRBNYQIMKTUNT-XVYDVKMFSA-N 0.000 description 1
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 1
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 1
- OILNWMNBLIHXQK-ZLUOBGJFSA-N Ala-Cys-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O OILNWMNBLIHXQK-ZLUOBGJFSA-N 0.000 description 1
- HJCMDXDYPOUFDY-WHFBIAKZSA-N Ala-Gln Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O HJCMDXDYPOUFDY-WHFBIAKZSA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- WPWUFUBLGADILS-WDSKDSINSA-N Ala-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WPWUFUBLGADILS-WDSKDSINSA-N 0.000 description 1
- DYJJJCHDHLEFDW-FXQIFTODSA-N Ala-Pro-Cys Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N DYJJJCHDHLEFDW-FXQIFTODSA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- SFPRJVVDZNLUTG-OWLDWWDNSA-N Ala-Trp-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFPRJVVDZNLUTG-OWLDWWDNSA-N 0.000 description 1
- OSASDIVHOSJVII-WDSKDSINSA-N Arg-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCNC(N)=N OSASDIVHOSJVII-WDSKDSINSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- GIMTZGADWZTZGV-DCAQKATOSA-N Arg-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GIMTZGADWZTZGV-DCAQKATOSA-N 0.000 description 1
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 1
- OMKZPCPZEFMBIT-SRVKXCTJSA-N Arg-Met-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OMKZPCPZEFMBIT-SRVKXCTJSA-N 0.000 description 1
- ZEBDYGZVMMKZNB-SRVKXCTJSA-N Arg-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N ZEBDYGZVMMKZNB-SRVKXCTJSA-N 0.000 description 1
- XNSKSTRGQIPTSE-ACZMJKKPSA-N Arg-Thr Chemical compound C[C@@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O XNSKSTRGQIPTSE-ACZMJKKPSA-N 0.000 description 1
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 1
- DDBMKOCQWNFDBH-RHYQMDGZSA-N Arg-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O DDBMKOCQWNFDBH-RHYQMDGZSA-N 0.000 description 1
- FOWOZYAWODIRFZ-JYJNAYRXSA-N Arg-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCN=C(N)N)N FOWOZYAWODIRFZ-JYJNAYRXSA-N 0.000 description 1
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 1
- 208000002109 Argyria Diseases 0.000 description 1
- HZYFHQOWCFUSOV-IMJSIDKUSA-N Asn-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O HZYFHQOWCFUSOV-IMJSIDKUSA-N 0.000 description 1
- NKTLGLBAGUJEGA-BIIVOSGPSA-N Asn-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N)C(=O)O NKTLGLBAGUJEGA-BIIVOSGPSA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- QGABLMITFKUQDF-DCAQKATOSA-N Asn-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QGABLMITFKUQDF-DCAQKATOSA-N 0.000 description 1
- KWBQPGIYEZKDEG-FSPLSTOPSA-N Asn-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(N)=O KWBQPGIYEZKDEG-FSPLSTOPSA-N 0.000 description 1
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 1
- DVUFTQLHHHJEMK-IMJSIDKUSA-N Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O DVUFTQLHHHJEMK-IMJSIDKUSA-N 0.000 description 1
- DBWYWXNMZZYIRY-LPEHRKFASA-N Asp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O DBWYWXNMZZYIRY-LPEHRKFASA-N 0.000 description 1
- NYLBGYLHBDFRHL-VEVYYDQMSA-N Asp-Arg-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NYLBGYLHBDFRHL-VEVYYDQMSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 1
- ZRAOLTNMSCSCLN-ZLUOBGJFSA-N Asp-Cys-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)O ZRAOLTNMSCSCLN-ZLUOBGJFSA-N 0.000 description 1
- NURJSGZGBVJFAD-ZLUOBGJFSA-N Asp-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O NURJSGZGBVJFAD-ZLUOBGJFSA-N 0.000 description 1
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 1
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 1
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- HSPSXROIMXIJQW-BQBZGAKWSA-N Asp-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 HSPSXROIMXIJQW-BQBZGAKWSA-N 0.000 description 1
- UBPMOJLRVMGTOQ-GARJFASQSA-N Asp-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)C(=O)O UBPMOJLRVMGTOQ-GARJFASQSA-N 0.000 description 1
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 1
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 1
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 1
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 1
- NZWDWXSWUQCNMG-GARJFASQSA-N Asp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)C(=O)O NZWDWXSWUQCNMG-GARJFASQSA-N 0.000 description 1
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 1
- RNAQPBOOJRDICC-BPUTZDHNSA-N Asp-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N RNAQPBOOJRDICC-BPUTZDHNSA-N 0.000 description 1
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- DWBZEJHQQIURML-IMJSIDKUSA-N Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O DWBZEJHQQIURML-IMJSIDKUSA-N 0.000 description 1
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 1
- KACWACLNYLSVCA-VHWLVUOQSA-N Asp-Trp-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KACWACLNYLSVCA-VHWLVUOQSA-N 0.000 description 1
- NALWOULWGHTVDA-UWVGGRQHSA-N Asp-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NALWOULWGHTVDA-UWVGGRQHSA-N 0.000 description 1
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 1
- 240000005220 Bischofia javanica Species 0.000 description 1
- 235000010893 Bischofia javanica Nutrition 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 241000208199 Buxus sempervirens Species 0.000 description 1
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 1
- 101100505161 Caenorhabditis elegans mel-32 gene Proteins 0.000 description 1
- 101710132601 Capsid protein Proteins 0.000 description 1
- 108010067225 Cell Adhesion Molecules Proteins 0.000 description 1
- 102000000844 Cell Surface Receptors Human genes 0.000 description 1
- 108010001857 Cell Surface Receptors Proteins 0.000 description 1
- 241000669072 Chrysomphalus dictyospermi Species 0.000 description 1
- 101710117490 Circumsporozoite protein Proteins 0.000 description 1
- 108091033380 Coding strand Proteins 0.000 description 1
- 108010035532 Collagen Proteins 0.000 description 1
- 102000008186 Collagen Human genes 0.000 description 1
- 240000000244 Crotalaria pallida Species 0.000 description 1
- MIKUYHXYGGJMLM-GIMIYPNGSA-N Crotonoside Natural products C1=NC2=C(N)NC(=O)N=C2N1[C@H]1O[C@@H](CO)[C@H](O)[C@@H]1O MIKUYHXYGGJMLM-GIMIYPNGSA-N 0.000 description 1
- HAYVTMHUNMMXCV-IMJSIDKUSA-N Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CS HAYVTMHUNMMXCV-IMJSIDKUSA-N 0.000 description 1
- UKVGHFORADMBEN-GUBZILKMSA-N Cys-Arg-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UKVGHFORADMBEN-GUBZILKMSA-N 0.000 description 1
- XGIAHEUULGOZHH-GUBZILKMSA-N Cys-Arg-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N XGIAHEUULGOZHH-GUBZILKMSA-N 0.000 description 1
- WVJHEDOLHPZLRV-CIUDSAMLSA-N Cys-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N WVJHEDOLHPZLRV-CIUDSAMLSA-N 0.000 description 1
- YMBAVNPKBWHDAW-CIUDSAMLSA-N Cys-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N YMBAVNPKBWHDAW-CIUDSAMLSA-N 0.000 description 1
- HYKFOHGZGLOCAY-ZLUOBGJFSA-N Cys-Cys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O HYKFOHGZGLOCAY-ZLUOBGJFSA-N 0.000 description 1
- JDHMXPSXWMPYQZ-AAEUAGOBSA-N Cys-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N JDHMXPSXWMPYQZ-AAEUAGOBSA-N 0.000 description 1
- UQHYQYXOLIYNSR-CUJWVEQBSA-N Cys-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N)O UQHYQYXOLIYNSR-CUJWVEQBSA-N 0.000 description 1
- VBIIZCXWOZDIHS-ACZMJKKPSA-N Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CS VBIIZCXWOZDIHS-ACZMJKKPSA-N 0.000 description 1
- OHLLDUNVMPPUMD-DCAQKATOSA-N Cys-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N OHLLDUNVMPPUMD-DCAQKATOSA-N 0.000 description 1
- OZHXXYOHPLLLMI-CIUDSAMLSA-N Cys-Lys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OZHXXYOHPLLLMI-CIUDSAMLSA-N 0.000 description 1
- BNCKELUXXUYRNY-GUBZILKMSA-N Cys-Lys-Glu Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BNCKELUXXUYRNY-GUBZILKMSA-N 0.000 description 1
- KGIHMGPYGXBYJJ-SRVKXCTJSA-N Cys-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CS KGIHMGPYGXBYJJ-SRVKXCTJSA-N 0.000 description 1
- MXZYQNJCBVJHSR-KATARQTJSA-N Cys-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)O MXZYQNJCBVJHSR-KATARQTJSA-N 0.000 description 1
- JUUMIGUJJRFQQR-KKUMJFAQSA-N Cys-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)O JUUMIGUJJRFQQR-KKUMJFAQSA-N 0.000 description 1
- ZXCAQANTQWBICD-DCAQKATOSA-N Cys-Lys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N ZXCAQANTQWBICD-DCAQKATOSA-N 0.000 description 1
- OOULJWDSSVOMHX-WDSKDSINSA-N Cys-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CS OOULJWDSSVOMHX-WDSKDSINSA-N 0.000 description 1
- KVGPYKUIHZJWGA-BQBZGAKWSA-N Cys-Met-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O KVGPYKUIHZJWGA-BQBZGAKWSA-N 0.000 description 1
- HSAWNMMTZCLTPY-DCAQKATOSA-N Cys-Met-Leu Chemical compound SC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HSAWNMMTZCLTPY-DCAQKATOSA-N 0.000 description 1
- MBRWOKXNHTUJMB-CIUDSAMLSA-N Cys-Pro-Glu Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O MBRWOKXNHTUJMB-CIUDSAMLSA-N 0.000 description 1
- HMWBPUDETPKSSS-DCAQKATOSA-N Cys-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CCCCN)C(=O)O HMWBPUDETPKSSS-DCAQKATOSA-N 0.000 description 1
- XBELMDARIGXDKY-GUBZILKMSA-N Cys-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CS)N XBELMDARIGXDKY-GUBZILKMSA-N 0.000 description 1
- CMYVIUWVYHOLRD-ZLUOBGJFSA-N Cys-Ser-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CMYVIUWVYHOLRD-ZLUOBGJFSA-N 0.000 description 1
- BCFXQBXXDSEHRS-FXQIFTODSA-N Cys-Ser-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BCFXQBXXDSEHRS-FXQIFTODSA-N 0.000 description 1
- RJPKQCFHEPPTGL-ZLUOBGJFSA-N Cys-Ser-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RJPKQCFHEPPTGL-ZLUOBGJFSA-N 0.000 description 1
- ABLQPNMKLMFDQU-BIIVOSGPSA-N Cys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CS)N)C(=O)O ABLQPNMKLMFDQU-BIIVOSGPSA-N 0.000 description 1
- DQGIAOGALAQBGK-BWBBJGPYSA-N Cys-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O DQGIAOGALAQBGK-BWBBJGPYSA-N 0.000 description 1
- NYHBQMYGNKIUIF-UHFFFAOYSA-N D-guanosine Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(CO)C(O)C1O NYHBQMYGNKIUIF-UHFFFAOYSA-N 0.000 description 1
- 108020003215 DNA Probes Proteins 0.000 description 1
- 241000252212 Danio rerio Species 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- 101100453960 Drosophila melanogaster klar gene Proteins 0.000 description 1
- 108700030338 Drosophila wg Proteins 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 102100031780 Endonuclease Human genes 0.000 description 1
- 101710126496 Envelope glycoprotein I Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 208000001308 Fasciculation Diseases 0.000 description 1
- 102000002090 Fibronectin type III Human genes 0.000 description 1
- 108050009401 Fibronectin type III Proteins 0.000 description 1
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 1
- OUBUHIODTNUUTC-WDCWCFNPSA-N Gln-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OUBUHIODTNUUTC-WDCWCFNPSA-N 0.000 description 1
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 1
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 1
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 1
- SRZLHYPAOXBBSB-HJGDQZAQSA-N Glu-Arg-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SRZLHYPAOXBBSB-HJGDQZAQSA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- KLJMRPIBBLTDGE-ACZMJKKPSA-N Glu-Cys-Asn Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O KLJMRPIBBLTDGE-ACZMJKKPSA-N 0.000 description 1
- ZZIFPJZQHRJERU-WDSKDSINSA-N Glu-Cys-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZZIFPJZQHRJERU-WDSKDSINSA-N 0.000 description 1
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 1
- LVCHEMOPBORRLB-DCAQKATOSA-N Glu-Gln-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O LVCHEMOPBORRLB-DCAQKATOSA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 1
- LSPKYLAFTPBWIL-BYPYZUCNSA-N Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(O)=O LSPKYLAFTPBWIL-BYPYZUCNSA-N 0.000 description 1
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 1
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 1
- JGHNIWVNCAOVRO-DCAQKATOSA-N Glu-His-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGHNIWVNCAOVRO-DCAQKATOSA-N 0.000 description 1
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- YBTCBQBIJKGSJP-BQBZGAKWSA-N Glu-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O YBTCBQBIJKGSJP-BQBZGAKWSA-N 0.000 description 1
- ZGXGVBYEJGVJMV-HJGDQZAQSA-N Glu-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O ZGXGVBYEJGVJMV-HJGDQZAQSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- JDAYMLXPUJRSDJ-XIRDDKMYSA-N Glu-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 JDAYMLXPUJRSDJ-XIRDDKMYSA-N 0.000 description 1
- JVZLZVJTIXVIHK-SXNHZJKMSA-N Glu-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N JVZLZVJTIXVIHK-SXNHZJKMSA-N 0.000 description 1
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 1
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- SCCPDJAQCXWPTF-VKHMYHEASA-N Gly-Asp Chemical compound NCC(=O)N[C@H](C(O)=O)CC(O)=O SCCPDJAQCXWPTF-VKHMYHEASA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- LEGMTEAZGRRIMY-ZKWXMUAHSA-N Gly-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN LEGMTEAZGRRIMY-ZKWXMUAHSA-N 0.000 description 1
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 1
- PEZZSFLFXXFUQD-XPUUQOCRSA-N Gly-Cys-Val Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O PEZZSFLFXXFUQD-XPUUQOCRSA-N 0.000 description 1
- IEFJWDNGDZAYNZ-BYPYZUCNSA-N Gly-Glu Chemical compound NCC(=O)N[C@H](C(O)=O)CCC(O)=O IEFJWDNGDZAYNZ-BYPYZUCNSA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 1
- YIWFXZNIBQBFHR-LURJTMIESA-N Gly-His Chemical compound [NH3+]CC(=O)N[C@H](C([O-])=O)CC1=CN=CN1 YIWFXZNIBQBFHR-LURJTMIESA-N 0.000 description 1
- UUWOBINZFGTFMS-UWVGGRQHSA-N Gly-His-Met Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(O)=O UUWOBINZFGTFMS-UWVGGRQHSA-N 0.000 description 1
- IKAIKUBBJHFNBZ-LURJTMIESA-N Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CN IKAIKUBBJHFNBZ-LURJTMIESA-N 0.000 description 1
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- PFMUCCYYAAFKTH-YFKPBYRVSA-N Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CN PFMUCCYYAAFKTH-YFKPBYRVSA-N 0.000 description 1
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 1
- JBCLFWXMTIKCCB-VIFPVBQESA-N Gly-Phe Chemical compound NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-VIFPVBQESA-N 0.000 description 1
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 1
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 1
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 1
- BCCRXDTUTZHDEU-VKHMYHEASA-N Gly-Ser Chemical compound NCC(=O)N[C@@H](CO)C(O)=O BCCRXDTUTZHDEU-VKHMYHEASA-N 0.000 description 1
- MKIAPEZXQDILRR-YUMQZZPRSA-N Gly-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN MKIAPEZXQDILRR-YUMQZZPRSA-N 0.000 description 1
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- 239000007995 HEPES buffer Substances 0.000 description 1
- YXBRCTXAEYSCHS-XVYDVKMFSA-N His-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N YXBRCTXAEYSCHS-XVYDVKMFSA-N 0.000 description 1
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 1
- PGXZHYYGOPKYKM-IHRRRGAJSA-N His-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CCCCN)C(=O)O PGXZHYYGOPKYKM-IHRRRGAJSA-N 0.000 description 1
- XVZJRZQIHJMUBG-TUBUOCAGSA-N His-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CN=CN1)N XVZJRZQIHJMUBG-TUBUOCAGSA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101001030211 Homo sapiens Myc proto-oncogene protein Proteins 0.000 description 1
- 238000004566 IR spectroscopy Methods 0.000 description 1
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 1
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 1
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 1
- WKXVAXOSIPTXEC-HAFWLYHUSA-N Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O WKXVAXOSIPTXEC-HAFWLYHUSA-N 0.000 description 1
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 1
- XLDYDEDTGMHUCZ-GHCJXIJMSA-N Ile-Asp-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N XLDYDEDTGMHUCZ-GHCJXIJMSA-N 0.000 description 1
- BCVIOZZGJNOEQS-XKNYDFJKSA-N Ile-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)[C@@H](C)CC BCVIOZZGJNOEQS-XKNYDFJKSA-N 0.000 description 1
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 1
- KBAPKNDWAGVGTH-IGISWZIWSA-N Ile-Ile-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KBAPKNDWAGVGTH-IGISWZIWSA-N 0.000 description 1
- UWBDLNOCIDGPQE-GUBZILKMSA-N Ile-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN UWBDLNOCIDGPQE-GUBZILKMSA-N 0.000 description 1
- VISRCHQHQCLODA-NAKRPEOUSA-N Ile-Pro-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N VISRCHQHQCLODA-NAKRPEOUSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- DRCKHKZYDLJYFQ-YWIQKCBGSA-N Ile-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRCKHKZYDLJYFQ-YWIQKCBGSA-N 0.000 description 1
- KWHFUMYCSPJCFQ-NGTWOADLSA-N Ile-Thr-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N KWHFUMYCSPJCFQ-NGTWOADLSA-N 0.000 description 1
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 102000010789 Interleukin-2 Receptors Human genes 0.000 description 1
- 108010038453 Interleukin-2 Receptors Proteins 0.000 description 1
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 description 1
- HFKJBCPRWWGPEY-BQBZGAKWSA-N L-arginyl-L-glutamic acid Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HFKJBCPRWWGPEY-BQBZGAKWSA-N 0.000 description 1
- RNKSNIBMTUYWSH-YFKPBYRVSA-N L-prolylglycine Chemical compound [O-]C(=O)CNC(=O)[C@@H]1CCC[NH2+]1 RNKSNIBMTUYWSH-YFKPBYRVSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- KWURTLAFFDOTEQ-GUBZILKMSA-N Leu-Cys-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KWURTLAFFDOTEQ-GUBZILKMSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- URHJPNHRQMQGOZ-RHYQMDGZSA-N Leu-Thr-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O URHJPNHRQMQGOZ-RHYQMDGZSA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 1
- BRSGXFITDXFMFF-IHRRRGAJSA-N Lys-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N BRSGXFITDXFMFF-IHRRRGAJSA-N 0.000 description 1
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 1
- CIOWSLJGLSUOME-BQBZGAKWSA-N Lys-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O CIOWSLJGLSUOME-BQBZGAKWSA-N 0.000 description 1
- NDSNUWJPZKTFAR-DCAQKATOSA-N Lys-Cys-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCCN NDSNUWJPZKTFAR-DCAQKATOSA-N 0.000 description 1
- OPTCSTACHGNULU-DCAQKATOSA-N Lys-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCCN OPTCSTACHGNULU-DCAQKATOSA-N 0.000 description 1
- UGTZHPSKYRIGRJ-YUMQZZPRSA-N Lys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UGTZHPSKYRIGRJ-YUMQZZPRSA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 1
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 1
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 1
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 1
- FMIIKPHLJKUXGE-GUBZILKMSA-N Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN FMIIKPHLJKUXGE-GUBZILKMSA-N 0.000 description 1
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 1
- AHFOKDZWPPGJAZ-SRVKXCTJSA-N Lys-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N AHFOKDZWPPGJAZ-SRVKXCTJSA-N 0.000 description 1
- JYVCOTWSRGFABJ-DCAQKATOSA-N Lys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N JYVCOTWSRGFABJ-DCAQKATOSA-N 0.000 description 1
- AIXUQKMMBQJZCU-IUCAKERBSA-N Lys-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O AIXUQKMMBQJZCU-IUCAKERBSA-N 0.000 description 1
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 1
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 1
- IEVXCWPVBYCJRZ-IXOXFDKPSA-N Lys-Thr-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IEVXCWPVBYCJRZ-IXOXFDKPSA-N 0.000 description 1
- YQAIUOWPSUOINN-IUCAKERBSA-N Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN YQAIUOWPSUOINN-IUCAKERBSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- 239000007993 MOPS buffer Substances 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 102000012750 Membrane Glycoproteins Human genes 0.000 description 1
- 108010090054 Membrane Glycoproteins Proteins 0.000 description 1
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 1
- ZAJNRWKGHWGPDQ-SDDRHHMPSA-N Met-Arg-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N ZAJNRWKGHWGPDQ-SDDRHHMPSA-N 0.000 description 1
- MTBVQFFQMXHCPC-CIUDSAMLSA-N Met-Glu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MTBVQFFQMXHCPC-CIUDSAMLSA-N 0.000 description 1
- QXOHLNCNYLGICT-YFKPBYRVSA-N Met-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(O)=O QXOHLNCNYLGICT-YFKPBYRVSA-N 0.000 description 1
- MVBZBRKNZVJEKK-DTWKUNHWSA-N Met-Gly-Pro Chemical compound CSCC[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N MVBZBRKNZVJEKK-DTWKUNHWSA-N 0.000 description 1
- IMTUWVJPCQPJEE-IUCAKERBSA-N Met-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN IMTUWVJPCQPJEE-IUCAKERBSA-N 0.000 description 1
- BQHLZUMZOXUWNU-DCAQKATOSA-N Met-Pro-Glu Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BQHLZUMZOXUWNU-DCAQKATOSA-N 0.000 description 1
- WEDDFMCSUNNZJR-WDSKDSINSA-N Met-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(O)=O WEDDFMCSUNNZJR-WDSKDSINSA-N 0.000 description 1
- FIZZULTXMVEIAA-IHRRRGAJSA-N Met-Ser-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FIZZULTXMVEIAA-IHRRRGAJSA-N 0.000 description 1
- XYVRXLDSCKEYES-JSGCOSHPSA-N Met-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 XYVRXLDSCKEYES-JSGCOSHPSA-N 0.000 description 1
- FSTWDRPCQQUJIT-NHCYSSNCSA-N Met-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N FSTWDRPCQQUJIT-NHCYSSNCSA-N 0.000 description 1
- LPNWWHBFXPNHJG-AVGNSLFASA-N Met-Val-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN LPNWWHBFXPNHJG-AVGNSLFASA-N 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 206010028293 Muscle contractions involuntary Diseases 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108010025020 Nerve Growth Factor Proteins 0.000 description 1
- 102100023616 Neural cell adhesion molecule L1-like protein Human genes 0.000 description 1
- 108091092724 Noncoding DNA Proteins 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- 108700020796 Oncogene Proteins 0.000 description 1
- 208000030852 Parasitic disease Diseases 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 1
- 241000223810 Plasmodium vivax Species 0.000 description 1
- 102100031574 Platelet glycoprotein 4 Human genes 0.000 description 1
- 101710202087 Platelet glycoprotein 4 Proteins 0.000 description 1
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- LNLNHXIQPGKRJQ-SRVKXCTJSA-N Pro-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 LNLNHXIQPGKRJQ-SRVKXCTJSA-N 0.000 description 1
- QBFONMUYNSNKIX-AVGNSLFASA-N Pro-Arg-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QBFONMUYNSNKIX-AVGNSLFASA-N 0.000 description 1
- MTHRMUXESFIAMS-DCAQKATOSA-N Pro-Asn-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O MTHRMUXESFIAMS-DCAQKATOSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 1
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 1
- CDGABSWLRMECHC-IHRRRGAJSA-N Pro-Lys-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CDGABSWLRMECHC-IHRRRGAJSA-N 0.000 description 1
- HOTVCUAVDQHUDB-UFYCRDLUSA-N Pro-Phe-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 HOTVCUAVDQHUDB-UFYCRDLUSA-N 0.000 description 1
- RWCOTTLHDJWHRS-YUMQZZPRSA-N Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RWCOTTLHDJWHRS-YUMQZZPRSA-N 0.000 description 1
- AFWBWPCXSWUCLB-WDSKDSINSA-N Pro-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 AFWBWPCXSWUCLB-WDSKDSINSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- GVUVRRPYYDHHGK-VQVTYTSYSA-N Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 GVUVRRPYYDHHGK-VQVTYTSYSA-N 0.000 description 1
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 1
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 1
- VGFFUEVZKRNRHT-ULQDDVLXSA-N Pro-Trp-Glu Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCC(=O)O)C(=O)O VGFFUEVZKRNRHT-ULQDDVLXSA-N 0.000 description 1
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 1
- 108091034057 RNA (poly(A)) Proteins 0.000 description 1
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 1
- 101100043250 Rattus norvegicus Spon1 gene Proteins 0.000 description 1
- 108091006629 SLC13A2 Proteins 0.000 description 1
- 239000012506 Sephacryl® Substances 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- BKOKTRCZXRIQPX-ZLUOBGJFSA-N Ser-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N BKOKTRCZXRIQPX-ZLUOBGJFSA-N 0.000 description 1
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 1
- RZEQTVHJZCIUBT-WDSKDSINSA-N Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RZEQTVHJZCIUBT-WDSKDSINSA-N 0.000 description 1
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 1
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 1
- KMWFXJCGRXBQAC-CIUDSAMLSA-N Ser-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N KMWFXJCGRXBQAC-CIUDSAMLSA-N 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- GYXVUTAOICLGKJ-ACZMJKKPSA-N Ser-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N GYXVUTAOICLGKJ-ACZMJKKPSA-N 0.000 description 1
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- YZMPDHTZJJCGEI-BQBZGAKWSA-N Ser-His Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 YZMPDHTZJJCGEI-BQBZGAKWSA-N 0.000 description 1
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 1
- PBUXMVYWOSKHMF-WDSKDSINSA-N Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CO PBUXMVYWOSKHMF-WDSKDSINSA-N 0.000 description 1
- NIOYDASGXWLHEZ-CIUDSAMLSA-N Ser-Met-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O NIOYDASGXWLHEZ-CIUDSAMLSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- BVLGVLWFIZFEAH-BPUTZDHNSA-N Ser-Pro-Trp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O BVLGVLWFIZFEAH-BPUTZDHNSA-N 0.000 description 1
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 1
- XZKQVQKUZMAADP-IMJSIDKUSA-N Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(O)=O XZKQVQKUZMAADP-IMJSIDKUSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 1
- LDEBVRIURYMKQS-WISUUJSJSA-N Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO LDEBVRIURYMKQS-WISUUJSJSA-N 0.000 description 1
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 1
- ZVBCMFDJIMUELU-BZSNNMDCSA-N Ser-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N ZVBCMFDJIMUELU-BZSNNMDCSA-N 0.000 description 1
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 241000191967 Staphylococcus aureus Species 0.000 description 1
- 108010090804 Streptavidin Proteins 0.000 description 1
- NHUHCSRWZMLRLA-UHFFFAOYSA-N Sulfisoxazole Chemical compound CC1=NOC(NS(=O)(=O)C=2C=CC(N)=CC=2)=C1C NHUHCSRWZMLRLA-UHFFFAOYSA-N 0.000 description 1
- 102000019361 Syndecan Human genes 0.000 description 1
- 108050006774 Syndecan Proteins 0.000 description 1
- YRNBANYVJJBGDI-VZFHVOOUSA-N Thr-Ala-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N)O YRNBANYVJJBGDI-VZFHVOOUSA-N 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 1
- VXMHQKHDKCATDV-VEVYYDQMSA-N Thr-Asp-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VXMHQKHDKCATDV-VEVYYDQMSA-N 0.000 description 1
- CUTPSEKWUPZFLV-WISUUJSJSA-N Thr-Cys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(O)=O CUTPSEKWUPZFLV-WISUUJSJSA-N 0.000 description 1
- DHPPWTOLRWYIDS-XKBZYTNZSA-N Thr-Cys-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O DHPPWTOLRWYIDS-XKBZYTNZSA-N 0.000 description 1
- NRBUKAHTWRCUEQ-XGEHTFHBSA-N Thr-Cys-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(O)=O NRBUKAHTWRCUEQ-XGEHTFHBSA-N 0.000 description 1
- BWUHENPAEMNGQJ-ZDLURKLDSA-N Thr-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O BWUHENPAEMNGQJ-ZDLURKLDSA-N 0.000 description 1
- XXNLGZRRSKPSGF-HTUGSXCWSA-N Thr-Gln-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O XXNLGZRRSKPSGF-HTUGSXCWSA-N 0.000 description 1
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 1
- KBLYJPQSNGTDIU-LOKLDPHHSA-N Thr-Glu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O KBLYJPQSNGTDIU-LOKLDPHHSA-N 0.000 description 1
- BIYXEUAFGLTAEM-WUJLRWPWSA-N Thr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(O)=O BIYXEUAFGLTAEM-WUJLRWPWSA-N 0.000 description 1
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 1
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 1
- YUOCMLNTUZAGNF-KLHWPWHYSA-N Thr-His-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N)O YUOCMLNTUZAGNF-KLHWPWHYSA-N 0.000 description 1
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 1
- YKRQRPFODDJQTC-CSMHCCOUSA-N Thr-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN YKRQRPFODDJQTC-CSMHCCOUSA-N 0.000 description 1
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 1
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 1
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 1
- IJKNKFJZOJCKRR-GBALPHGKSA-N Thr-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 IJKNKFJZOJCKRR-GBALPHGKSA-N 0.000 description 1
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 1
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- 229920004890 Triton X-100 Polymers 0.000 description 1
- 239000013504 Triton X-100 Substances 0.000 description 1
- OHGNSVACHBZKSS-KWQFWETISA-N Trp-Ala Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)N[C@@H](C)C([O-])=O)=CNC2=C1 OHGNSVACHBZKSS-KWQFWETISA-N 0.000 description 1
- LCPVBXOHXMBLFW-JSGCOSHPSA-N Trp-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)=CNC2=C1 LCPVBXOHXMBLFW-JSGCOSHPSA-N 0.000 description 1
- XZSJDSBPEJBEFZ-QRTARXTBSA-N Trp-Asn-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O XZSJDSBPEJBEFZ-QRTARXTBSA-N 0.000 description 1
- PEEAINPHPNDNGE-JQWIXIFHSA-N Trp-Asp Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 PEEAINPHPNDNGE-JQWIXIFHSA-N 0.000 description 1
- PXQPYPMSLBQHJJ-WFBYXXMGSA-N Trp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N PXQPYPMSLBQHJJ-WFBYXXMGSA-N 0.000 description 1
- VEYXZZGMIBKXCN-UBHSHLNASA-N Trp-Asp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VEYXZZGMIBKXCN-UBHSHLNASA-N 0.000 description 1
- PWIQCLSQVQBOQV-AAEUAGOBSA-N Trp-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 PWIQCLSQVQBOQV-AAEUAGOBSA-N 0.000 description 1
- PITVQFJBUFDJDD-XEGUGMAKSA-N Trp-Ile Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)=CNC2=C1 PITVQFJBUFDJDD-XEGUGMAKSA-N 0.000 description 1
- YYXIWHBHTARPOG-HJXMPXNTSA-N Trp-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N YYXIWHBHTARPOG-HJXMPXNTSA-N 0.000 description 1
- YDTKYBHPRULROG-LTHWPDAASA-N Trp-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N YDTKYBHPRULROG-LTHWPDAASA-N 0.000 description 1
- DZHDVYLBNKMLMB-ZFWWWQNUSA-N Trp-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 DZHDVYLBNKMLMB-ZFWWWQNUSA-N 0.000 description 1
- HJXOFWKCWLHYIJ-SZMVWBNQSA-N Trp-Lys-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HJXOFWKCWLHYIJ-SZMVWBNQSA-N 0.000 description 1
- BVZABQIRMYTKCF-JSGCOSHPSA-N Trp-Met Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCSC)C(O)=O)=CNC2=C1 BVZABQIRMYTKCF-JSGCOSHPSA-N 0.000 description 1
- QUIXRGCMQOXUSV-SZMVWBNQSA-N Trp-Pro-Pro Chemical compound O=C([C@@H]1CCCN1C(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)N1CCC[C@H]1C(O)=O QUIXRGCMQOXUSV-SZMVWBNQSA-N 0.000 description 1
- SUEGAFMNTXXNLR-WFBYXXMGSA-N Trp-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O SUEGAFMNTXXNLR-WFBYXXMGSA-N 0.000 description 1
- RNDWCRUOGGQDKN-UBHSHLNASA-N Trp-Ser-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RNDWCRUOGGQDKN-UBHSHLNASA-N 0.000 description 1
- LWFWZRANSFAJDR-JSGCOSHPSA-N Trp-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 LWFWZRANSFAJDR-JSGCOSHPSA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- JXNRXNCCROJZFB-RYUDHWBXSA-N Tyr-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JXNRXNCCROJZFB-RYUDHWBXSA-N 0.000 description 1
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 1
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 description 1
- KYPMKDGKAYQCHO-RYUDHWBXSA-N Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KYPMKDGKAYQCHO-RYUDHWBXSA-N 0.000 description 1
- AVFGBGGRZOKSFS-KJEVXHAQSA-N Tyr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O AVFGBGGRZOKSFS-KJEVXHAQSA-N 0.000 description 1
- VNYDHJARLHNEGA-RYUDHWBXSA-N Tyr-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 VNYDHJARLHNEGA-RYUDHWBXSA-N 0.000 description 1
- AUZADXNWQMBZOO-JYJNAYRXSA-N Tyr-Pro-Arg Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 AUZADXNWQMBZOO-JYJNAYRXSA-N 0.000 description 1
- ZSXJENBJGRHKIG-UWVGGRQHSA-N Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZSXJENBJGRHKIG-UWVGGRQHSA-N 0.000 description 1
- BMPPMAOOKQJYIP-WMZOPIPTSA-N Tyr-Trp Chemical compound C([C@H]([NH3+])C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C([O-])=O)C1=CC=C(O)C=C1 BMPPMAOOKQJYIP-WMZOPIPTSA-N 0.000 description 1
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 1
- OBTCMSPFOITUIJ-FSPLSTOPSA-N Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O OBTCMSPFOITUIJ-FSPLSTOPSA-N 0.000 description 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 1
- WPSXZFTVLIAPCN-WDSKDSINSA-N Val-Cys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(O)=O WPSXZFTVLIAPCN-WDSKDSINSA-N 0.000 description 1
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 1
- DLYOEFGPYTZVSP-AEJSXWLSSA-N Val-Cys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N DLYOEFGPYTZVSP-AEJSXWLSSA-N 0.000 description 1
- UPJONISHZRADBH-XPUUQOCRSA-N Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UPJONISHZRADBH-XPUUQOCRSA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- BNQVUHQWZGTIBX-IUCAKERBSA-N Val-His Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC1=CN=CN1 BNQVUHQWZGTIBX-IUCAKERBSA-N 0.000 description 1
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- RFKJNTRMXGCKFE-FHWLQOOXSA-N Val-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC(C)C)C(O)=O)=CNC2=C1 RFKJNTRMXGCKFE-FHWLQOOXSA-N 0.000 description 1
- JKHXYJKMNSSFFL-IUCAKERBSA-N Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN JKHXYJKMNSSFFL-IUCAKERBSA-N 0.000 description 1
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 1
- JQTYTBPCSOAZHI-FXQIFTODSA-N Val-Ser-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N JQTYTBPCSOAZHI-FXQIFTODSA-N 0.000 description 1
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 235000018936 Vitellaria paradoxa Nutrition 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- 229960005305 adenosine Drugs 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- VREFGVBLTWBCJP-UHFFFAOYSA-N alprazolam Chemical compound C12=CC(Cl)=CC=C2N2C(C)=NN=C2CN=C1C1=CC=CC=C1 VREFGVBLTWBCJP-UHFFFAOYSA-N 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 125000000637 arginyl group Chemical group N[C@@H](CCCNC(N)=N)C(=O)* 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 210000001130 astrocyte Anatomy 0.000 description 1
- 230000004009 axon guidance Effects 0.000 description 1
- 108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 229940098773 bovine serum albumin Drugs 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 210000000845 cartilage Anatomy 0.000 description 1
- 238000000423 cell based assay Methods 0.000 description 1
- 210000005056 cell body Anatomy 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000005859 cell recognition Effects 0.000 description 1
- 108091092328 cellular RNA Proteins 0.000 description 1
- 230000008614 cellular interaction Effects 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 230000031902 chemoattractant activity Effects 0.000 description 1
- 229920001436 collagen Polymers 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000006854 communication Effects 0.000 description 1
- 230000009073 conformational modification Effects 0.000 description 1
- 230000009133 cooperative interaction Effects 0.000 description 1
- 239000004148 curcumin Substances 0.000 description 1
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 1
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 1
- RGWHQCVHVJXOKC-SHYZEUOFSA-J dCTP(4-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-J 0.000 description 1
- HAAZLUGHYHWQIW-KVQBGUIXSA-N dGTP Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HAAZLUGHYHWQIW-KVQBGUIXSA-N 0.000 description 1
- NHVNXKFIZYSCEB-XLPZGREQSA-N dTTP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C1 NHVNXKFIZYSCEB-XLPZGREQSA-N 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 229960000633 dextran sulfate Drugs 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 231100000673 dose–response relationship Toxicity 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- DEFVIWRASFVYLL-UHFFFAOYSA-N ethylene glycol bis(2-aminoethyl)tetraacetic acid Chemical compound OC(=O)CN(CC(O)=O)CCOCCOCCN(CC(O)=O)CC(O)=O DEFVIWRASFVYLL-UHFFFAOYSA-N 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical group O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 1
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 1
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 1
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 1
- 229940029575 guanosine Drugs 0.000 description 1
- 230000003394 haemopoietic effect Effects 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 108010038082 heparin proteoglycan Proteins 0.000 description 1
- 102000053563 human MYC Human genes 0.000 description 1
- 210000004408 hybridoma Anatomy 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 238000003365 immunocytochemistry Methods 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 239000012133 immunoprecipitate Substances 0.000 description 1
- 238000000099 in vitro assay Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 108010076718 lysyl-glutamyl-tryptophan Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 210000004379 membrane Anatomy 0.000 description 1
- 210000003716 mesoderm Anatomy 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 108010034507 methionyltryptophan Proteins 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 239000003147 molecular marker Substances 0.000 description 1
- KRTSDMXIXPKRQR-AATRIKPKSA-N monocrotophos Chemical compound CNC(=O)\C=C(/C)OP(=O)(OC)OC KRTSDMXIXPKRQR-AATRIKPKSA-N 0.000 description 1
- 210000002161 motor neuron Anatomy 0.000 description 1
- 108700024542 myc Genes Proteins 0.000 description 1
- DIOQZVSQGTUSAI-UHFFFAOYSA-N n-butylhexane Natural products CCCCCCCCCC DIOQZVSQGTUSAI-UHFFFAOYSA-N 0.000 description 1
- 210000005036 nerve Anatomy 0.000 description 1
- 230000008271 nervous system development Effects 0.000 description 1
- 210000001020 neural plate Anatomy 0.000 description 1
- 238000003522 neurite outgrowth assay Methods 0.000 description 1
- 230000007472 neurodevelopment Effects 0.000 description 1
- 210000000715 neuromuscular junction Anatomy 0.000 description 1
- 229920001778 nylon Polymers 0.000 description 1
- 210000004248 oligodendroglia Anatomy 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 238000000059 patterning Methods 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 230000007030 peptide scission Effects 0.000 description 1
- 210000002856 peripheral neuron Anatomy 0.000 description 1
- 238000002205 phenol-chloroform extraction Methods 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 150000004492 retinoid derivatives Chemical class 0.000 description 1
- 239000012723 sample buffer Substances 0.000 description 1
- 210000003497 sciatic nerve Anatomy 0.000 description 1
- 210000002265 sensory receptor cell Anatomy 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 229910000029 sodium carbonate Inorganic materials 0.000 description 1
- 210000000273 spinal nerve root Anatomy 0.000 description 1
- 210000003046 sporozoite Anatomy 0.000 description 1
- 108010053226 substrate adhesion molecules Proteins 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 229940104230 thymidine Drugs 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 125000000430 tryptophan group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C2=C([H])C([H])=C([H])C([H])=C12 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 108010071635 tyrosyl-prolyl-arginine Proteins 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 238000009827 uniform distribution Methods 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 239000003981 vehicle Substances 0.000 description 1
- 210000003501 vero cell Anatomy 0.000 description 1
- 239000000080 wetting agent Substances 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/465—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from birds
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/475—Growth factors; Growth regulators
- C07K14/48—Nerve growth factor [NGF]
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
Landscapes
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Gastroenterology & Hepatology (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Zoology (AREA)
- Genetics & Genomics (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Toxicology (AREA)
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Description
b b* 0 a 0* a a. a a. a.
a
AUSTRALIA
PATENTS ACT 1990 DIVISIONAL APPLICATION NAME OF APPLICANT(S): The Trustees of Columbia University in the City of New York ADDRESS FOR SERVICE: DAVIES COLLISON CAVE Patent Attorneys 1 Little Collins Street Melbourne, 3000.
INVENTION TITLE: Cloning, expression and uses of a novel secreted protein, F-spondin The following statement is a full description of this invention, including the best method of performing it known to us: Q:\oPER\EJH\39455-DL44 13t2/97 P:\OPER\MRO\1883748.SPE 28/9/99 -1- CLONING, EXPRESSION AND USES OF A NOVEL SECRETED PROTEIN,
F-SPONDIN
Background of the invention Throughout this application various references are referred to within parenthesis. Disclosures of these publications in their entireties are hereby incorporated by reference into this application to more fully describe the state of the art to which this invention pertains. Full bibliographic citation for these references may be found at the end of this application, preceding the sequence listing and the claims.
10 Throughout this specification and the claims which follow, unless the context requires .otherwise, the word "comprise" and variations such as "comprises" and "comprising" will be understood to imply the inclusion of a stated integer or step or group of integers or steps but not the exclusion of any other integer or step or group of integers or steps.
As used herein, the term "derived from" shall be taken to indicate that a particular integer or group of integers has originated from the species specified, but has not necessarily been obtained directly from the specified source.
The early development of the vertebrate nervous system is controlled by local cell interactions 20 that determine the identity of specific neural cell types and the pathways of growing axons.
S* One of the first cell types to differentiate within the embryonic nervous system is the floor plate, a small group of epithelial cells located at the ventral midline of the neural tube (Schoenwolf and Smith, 1990). The differentiation of the floor plate is induced by local, possibly contact-dependent signals from the notochord (Figure 1) (van Straaten et al., 1988; Placzek et al., 1990c; Hatta et al., 1991). Signals that derive from the floor plate have been implicated in the control of cell identity in the neural tube and in the guidance of axons (Figure 1) (Jessell and Dodd, 1991).
Evidence that the floor plate is a source of polarizing signals that control cell identity and pattern in the neural tube has come from experiments in chick embryos in which floor plate cells grafted next to the neural tube of host embryos give rise to additional ectopic motor Ii~.
neurons and to other ventral neuronal types defined by cell specific antigenic markers (Yamada et al., 1991; Placzek et al., 1991). Inversely, preventing floor plate differentiation by removing the notochord leads to the formation of a spinal cord that is devoid of motor neurons and other ventral neurons. These grafting experiments suggest that the floor plate has a central role in establishing the identity and pattern of neuronal cell types present in the ventral spinal cord. The floor plate also has limb polarizing activity when grated into the chick wing bud, possibly through the release of morphogenically active retinoids (Wagner et al., 1990).
After the identity of spinal cord neurons has been established, the floor plate appears to provide both long-range and local guidance cues that promote the growth of axons to and across the ventral midline of the spinal cord. First, the floor plate secretes a diffusible chemoattractant which can orient the growth of axons of commissural neuron in vitro (Figure 1) (Tessier- Lavigne et al., 1988; Placzek et al., 1990a; Tessier- 20 Lavigne and Placzek, 1991) and may account for the homing of these axons to the floor plate in vitro (Weber, 1938; Placzek et al., 1990b; Bovolenta and Dodd, 1991; Yaginuma and Oppenheim, 1991). Second, the floor plate may contribute to the change in trajectory of commissural axons from the transverse to the longitudinal plane that occurs immediately after crossing the ventral midline (Figure 1) (Holley and Silver, 1987; Dodd et al., 1988; Bovolenta and Dodd, 1990). In support of this proposal, Sgenetic mutations in mice and zebrafish that result in the absence of the floor plate during embryonic development lead to errors in the pathfinding of commissural axons at the midline of the spinal cord (Bovolenta and Dodd, 1991; Bernhardt and Kuwada, 1990).
Third, the floor plate may promote the fasciculation of commissural axons that occurs after they cross the midline of the spinal cord (Holley and Silver, 1987) by regulating the expression of glycoproteins of the immunoglobulin superfamily (Dodd et al., 1988; Schachner et al., 1990; Purley et al., 1990). The specialized role of the floor plate in vertebrate neural development has parallels in invertebrate organisms in that cells at the midline of the embryonic drosophila and C. elegans central nervous systems have been implicated in neural patterning and axon guidance (Klambt et al., 1991; Nambu et al., 1991; Hedgecock and Hall, 1990).
To identify molecules that may mediate the diverse functions of the floor plate during early neural development, subtractive hybridization techniques have been used to isolate cDNA clones expressed selectively by the floor plate. Th? characterization of cDNA clones encoding a novel secreted protein, F-spondin, that is expressed at high levels by the rate floor plate during embryonic development is described here. The predicted amino acid sequence of F-spondin reveals that the protein contains domains similar to those present in the thrombospondin and other proteins implicated in cell adhesion and neurite outgrowth. In vitro assays show that F-spondin promotes neural cell adhesion and neurite outgrowth suggesting that the secretion of this protein by the floor plate contributes to the growth and guidance of axons in the developing CNS.
4ii-;r-~rr Summarv of the invention This invention provides isolated vertebrate nucleic acid molecule encoding F-spondin. The isolated nucleic acid may be cDNA or RNA. The isolated vertebrate nucleic acid may be derived from human, rat, chicken or Xenopus.
This invention also provides a nucleic acid probe comprising a nucleic acid molecule of at least nucleotides capable of specifically hybridizing with a 1 sequence included within the sequence of a nucleic acid molecule encoding a F-spondin. The nucleic acid probe may be DNA or RNA.
This invention provides the method to obtain F-spondin 1 nucleic acid molecule. In an embodiment, a rat F-spondin gene is isolated by substractive hybridization. In another embodiment, a chicken F-spondin gene is isolated by screening a chicken cDNA library using a rat F-spondin probe. In a further embodiment, a Xenopus F-spondin is 20 also isolated.
This invention further provides a host vector system for the production of a polypeptide having the biological activity of F-spondin. The isolated vertebrate F-spondin nucleic acid molecule is linked to a promoter of RNA transcription and then to a plasmid. The suitable host is a bacterial cell, insect cell, or animal cell, depending on the type of promoter and plasmid used. This invention also provides a method of producing a polypeptide having the biological activity of F-spondin, which comprises growing the selected host vector system under suitable conditions permitting production of the polypeptide and recovering the polypeptide so produced.
This invention further provides purified vertebrate F- 1 spondin. Such purified F-spondin will be useful for adhesion and outgrowth of axon. This invention provides a method of attaching nerve cells to a matrix comprising contacting the matrix with nerve cell and purified Fspondin at a concentration effective to effect attachment of the cells to the matrix. This invention further provides a method of stimulating growth of a nerve cell comprising contacting the nerve cell with purified Fspondin at a concentration effective to stimulate growth of the nerve cell. This invention provides a method of regenerating nerve cells in a subject comprising administering to the subject purified F-spondin at a concentration effective to regenerate nerve cells in the subject. Finally, this invention provides a pharmaceutical composition for stimulating nerve cell growth comprising a pharmaceutically acceptable carrier and purified F-spondin at a concentration effective to stimulate nerve cell growth.
*3 a. 6 Brief Description of FiGures Figure 1.
Figure 2.
Figure 3.
Diagram showing the induction and proposed functions of the floor plate during early spinal cord development. For details see text.
Schematic diagram of the subtractive hybridization protocol used to identify floor plate specific cDNA clones.For details see text.
Expression of F-spondin mRNA. Total cellular RNA or poly RNA was isolated from different tissues and separated on 1% agaroseformaldehyde gels and blotted to nylon membranes. The blot was analyzed with cDNA probes derived from the F-spondin 3' noncoding region labelled by random priming.
A. Preferential expression of F-spondin mRNA in E13 (embryonic day 13) floor plate compared with E13 dorsal spinal cord at adult spleen. Two transcripts of 4.5 and 4.7 kb are detected in floor plate RNA.
B. NCAM, Neural Cell Adhesion Molecule, mRNA is expressed at approximately equivalent levels in E13 floor plate and dorsal spinal cord and PO (postnatal; day 0) brain.
C. F-spondin mRNA is detected in blots of total RNA adult kidney and brain but not in adult liver or sciatic nerve.
Figure 4. Restriction map of the F-spondin cDNA. The arrow indicates the direction of translation.
The restriction sites are indicated above the cDNA.
Figure 5. cDNA and predicted amino acid sequence of Fspondin.
A. Nucleotide and amino acid sequence of rat F-spondin determined from cDNA clones.
The numbering of amino acids starts at the first methionine. Underline NHZ terminal residues indicates the putative signal sequence. Potential sites of N-linked glycosylation are indicated by double lines.
B. Analysis of the hydrophobicity of the predicted F-spondin amino acid sequence.
The plot was generated using the parameters given in Kyte and Doolittle (1982). The NH 2 terminus of the protein is to the left. Negative values indicate o hydrophobic residues.
Figure 6. Alignment of the carboxy terminal domain Fspondin and homology to thrombospondin type one repeats in other proteins.
A. Schematic representation of the domain structure of F-spondin. The black box represents the signal sequence. The hatched box represents the thrombospondin type 1 repeats (TSRs).
B. Alignment of the six repeats motifs in F- Sspondin which occupy residues 440-807 of the protein. The position of the first and last amino acids of each repeat is shown on the left. Numbers over each repeat refer to the position of residues.
T
8 Positions in which there are four or more identical residues are enclosed in boxes.
C. Comparison of the conserved F-spondin motif with the conserved TSRs found in thrombospondin I, thrombospondin
II,
region II of the plasmodial circumsporooite (cs) proteins thrombospondin-related anonymous protein (TRAP), properdin and in the N-and Cterminal regions of the complement proteins C6, C7, C8a, C8b and C9. The number at the right of the figure indicates the number of TSR domains that contain VTCG sequence as a proportion of the total number of TSR domains.
Figure 7. Localization of F-spondin mRNA in the developing spinal cord.
A. Autoradiographic localization of F-spondin mRNA in the hindbrain of a day 10 rat embryo by in situ hybridization using an 20 antisense RNA probe. Intense hybridization is detected at the ventral .1 midline of the neural tube and possible also in the axial mesoderm underlying the neural tube.
B. Localization of whole mount in situ Fspondin mRNA by Ell (embryonic day 11) rat embryos hybridization histochemistry using digoxigenin-labelled antisense probe.
.p Hybridization is detected in the floor 30 plate of the midbrain, hindbrain and spinal cord (arrow heads).
C. Bright field micrograph showing localization of F-spondin mRNA in E12 (embryonic day 12) rat spinal cord. The floor plate is intensely labelled.
D. Dark field micrograph of a similar section showing a low level of hybridization is in the ventral horn in addition to intense labelling in the floor plate.
Hybridization is also detected in the ventral root.
E. Dark field micrograph showing the floor plate and the ventral ventricular zone of E13 spinal cord express high levels of Fspondin mRNA.
F. Bright field micrograph of E16 (embryonic day 16) spinal cord showing that F-spondin mRNA levels are still high in the floor plate and the ventral ventricular zone.
G. Dark field micrograph showing that by E16, significant hybridization is also detected in ventral and intermediate regions of the spinal cord.
H. Dark field micrographs showing a uniform distribution of F-spondin mRNA.
Scale bar: A-100 Am; B-350 ym; C-80 mm; E-100 Am; F-170 Am; G-170 Am; H-120am.
F-Spondin myc is secreted by cos cells and associated with the cell surface.
A. Position of insertion of an oligonucleotide encoding for a 10 amino acid region of the c-myc oncogene ligated into unique NcoI site or Spel sites within the F-spondin cDNA.
B. Immunoprecipitation of conditioned media obtained by exposure of 40h to cos cells transfected with pFP5myS, pFP5myN and to 9.
9 $9 9*e* 9* .9 9 9.
$9 9* 9 9 9* 9 Figure 8.
Figure 9.
mock transfected cells. Both constructs generated a single protein band at 116 kDa.
C. Phase contrast micrograph showing a small group of transfected cos cells.
D. Immunofluorescence micrograph showing the localization of F-spondinmyc on the cell surface. Immunoreactivity is detectable at much higher levels at cell-cell rather than at cell-substrate contacts.
Scale bar in C, D 204m F-spondinmye promotes the extension of neurites from DRG neurons in vitro. F-spondinye protein obtained from transfected cos cells supernatants was affinity purified and analyzed by SDS-PAGE(8-25%) and silver staining. (A) Two stained bands are observed, which may reflect differences in the glycosylation of Fspondin. Neural cells isolated from E14 rat dorsal root ganglia were plated on F-spondin or on cos cell-conditioned media or BSA (not shown) substrates for 14h and then fixed and labelled with MAb 3A10 and visualized by indirect immunofluorescence. The length of the longest neurite of each 3A10-positive neurons was measured (or recorded as 0 mm if no neurite was seen). The percentage of neurons (ordinate) with neurites longer than a given length in gm (abscissa) is plotted. Similar results were obtained in 5 experiments. Only non-fasciculated neurites were included in the plots shown in D. Scale bar in B and C 100 Am.
4 04 S. 4 54
S.
S. 4 *4SS Figure 10. F-spondin promotes the adhesion of dorsal spinal cord cells. A single cell suspension of E13 dorsal spinal cord cells (106 cells/35mm disk) was plated on, Fspondin m ye on BSA and on Fspondinmyc substrate in the presence of heparin (1 mg/ml)(D), for lh. Cells were then washed in PBS, fixed and counted.
E. Box plot showing dose-dependent adhesion of E13 dorsal spinal cord cells to different amounts of Fspondinmyc substrate. Each box represents cell counts from different fields. Similar results were obtained in 3 separate experiments.
F. Box plot showing inhibition of the adhesion of E13 dorsal spinal cord cells to F-spondinyc in the presence of different concentrations of heparin and chondroitin sulfate.
The inhibition at all concentrations of chondroitin sulfate and heparin is significant (p<0.001; Ttest).
Scale bar in A, C, D 200 gm, B 50 Mm Box plot: The box enclosed 50% of the population with the median marked as a bold line and the mean as a dot. The range of the data is indicated by the extent of the lines. Each plot represents determinations form one of three similar experiments.
Detailed Description of the Invention This invention provides isolated vertebrate nucleic acid molecules which encode F-spondin. As used herein, the term F-spondin encompasses any amino acid sequence, polypeptide or protein having the biological activities provided by the F-spondin.
In one embodiment of this invention, the isolated nucleic acid molecules described hereinabove are DNA. In other embodiments of this invention, the isolated nucleic acid molecules described hereinabove are cDNA, or RNA. In the preferred embodiment of this invention, the isolated nucleic molecules are cDNAs as shown in sequence ID numbers 11 and 13) 1 This invention also encompasses DNAs and cDNAs which encode amino acid sequences which differ from those of Fspondin, but which should not produce phenotypic changes.
Alternatively, this invention also encompasses DNAs and cDNAs which hybridize to the DNA and cDNA of the subject 20 invention. Hybridization methods are well known to those of skill in the art.
The DNA molecules of the subject invention also include DNA molecules coding for polypeptide analogs, fragments 2 or derivatives of antigenic polypeptides which differ *from naturally-occurring forms in terms of the identity or location of one or more amino acid residues (deletion analogs containing less than all of the residues specified for the protein, substitution analogs wherein one or more residues specified are replaced by other residues and addition analogs wherein one or more amino acid residues is added to a terminal or medial p6rtion of the polypeptides) and which share some or all properties of naturally-occurring forms. These sequences include: the incorporation of codons "preferred" for expression by selected non-mammalian host; the provision of sites for cleavage by restriction endonuclease enzymes; and the provision of additional initial, terminal or intermediate DNA sequences that facilitate construction of readily expressed vectors.
The DNA molecule described and claimed herein are useful for the information which they provide concerning the amino acid sequence of the polypeptide and as products for the large scale synthesis of the polypeptide by a variety of recombinant techniques. The molecule is useful for generating new cloning and expression vectors, transformed and transfected procaryotic and eucaryotic host cells, and new and useful methods for cultured growth of such host cells capable of expression of the polypeptide and related products.
Moreover, the isolated nucleic acid molecules are useful for the development of probes to study the neurodevelopment.
20 F-spondin may be produced by a variety of vertebrates. In *e an embodiment, a rat F-spondin nucleic acid is isolated.
A restriction map of the cDNA of rat F-spondin is shown in Figure 4. The Xhol-Dral fragment of rat F-spondin was excised from the F-spondin cDNA. The Xhol site was blunt-ended with T4 DNA polymerase, and Bgl2 linkers (12 .A mers) was ligated. The fragment was subcloned into BamH1 site of pBluescript SK (Strategene). The 5' of the gene is located near the T3 promoter. The resulting plasmid, encoding the rat F-spondin was deposited on March 19, 1992 with the American Type Culture Collection (ATCC), 12301 Parklawn Drive, Rockville, Maryland 20852,
_I
U.S.A. under the provisions of the Budapest Treaty for the International Recognition of the Deposit of Microorganism for the Purposes of Patent Procedure.
Plasmid, pFP5/KS was accorded ATCC accession number 75215.
In another embodiment, a chicken F-spondin cDNA was isolated (Seq. ID No. 11). The translation initiates at nucleotide position 136. In a further embodiment, a partial Xenopus F-spondin was isolated (Seq. ID No. 13).
1 Throughout this application, references to specific nucleotides are to nucleotides present on the coding strand of the nucleic acid. The following standard abbreviations are used throughout the specification to indicate specific nucleotides: C=cytosine A-adenosine T=thymidine G=guanosine For the purpose of illustration only, applicants used a substractive hybridization techniques to isolate and characterized F-spondin cDNA clones in rats. Similar 20 substractive hybridization techniques are applicable to isolate and characterize the F-spondin genes in different vertebrates.
Alternatively, the F-spondin genes may be isolated using the probe generated from the rat F-spondin gene. The chicken and Xenopus homologous F-spondin genes have recently been cloned by applicants. These genes are Sextremely conserved and share 90% homology at the amino acid level and about 70% homology at the nucleic acid level. The chicken gene was isolated by low stringency screening of embryonic spinal cord cDNA library whereas the Xenopus F-spondin gene was isolated by low stringency screening of the whole embryo cDNA library, both using probes from the coding region of rat F-spondin.
For the human F-spondin gene, it is conceivable that the degree of homology between rat and human would be even greater since both rat and humans are mammals. Human embryonic brain cDNA library, available from Clontech, and human genomic library may be used for such screening.
Duplicated filters of human libraries may be screened with radiolabelled probe derived from the rat F-spondin.
The probe may be encompassing the coding region, since the homology of F-spondin across species is through the whole coding region. The filters containing the human libraries will be hybridized with the probes at low stringency (Sambrook et al. 1989) and positive clone will be further analyzed by DNA sequencing techniques which are well known to an ordinary skilled artisan.
This invention provides a nucleic probe comprising a nucleic acid molecule of at least 15 nucleotides capable of specifically hybridizing with a sequence included within the sequence of a nucleic acid molecule encoding a F-spondin, for example, with a coding sequencing included within the sequence shown in Figure 5 and Sequence ID number 9. As used herein, the phrase "specifically hybridizing" means the ability of a nucleic acid molecule to recognize a nucleic acid sequence complementary to its own and to form double-helical segments through hydrogen bonding between complementary Sbase pairs. Nucleic acid probe technology is well known to those skilled in the art who will readily appreciate Sthat such probes may vary greatly in length and may be labeled with a detectable label, such as a radioisotope or fluorescent dye, to facilitate detection of the probe.
DNA probe molecules may be produced by insertion of a DNA molecule which encodes F-spondin into suitable vectors, such as plasmids or bacteriophages, followed by transforming into suitable bacterial host cells, replication in the transformed bacterial host cells and harvesting of the DNA probes, using methods well known in the art. Alternatively, probes may be generated chemically from DNA synthesizers.
The probes are useful for 'in situ' hybridization to locate tissues which express this gene, or for other hybridization assays for the presence of this gene or its mRNA in various biological tissues.
Vectors which comprise the isolated nucleic acid molecule described hereinabove also are provided. Suitable vectors comprise, but are not limited to, a plasmid or a virus. These vectors may be transformed into a suitable host cell to form a host cell vector system for the production of a polypeptide having the biological activity of F-spondin.
20 This invention further provides an isolated DNA or cDNA molecule described hereinabove wherein the host cell is selected from the group consisting of bacterial cells (such as E.coli), yeast cells, fungal cells, insect cells and animal cells. Suitable animal cells include, but are Snot limited to Vero cells, HeLa cells, Cos cells, CV1 cells and various primary mammalian cells.
This invention provides a method to identify and purify expressed F-spondin proteins. A myc-epitope was first S introduced into the F-spondin protein. This F-spondin carrying myc-spondin may linked to an expression vector.
Such vector may be used to transfect cell and the distribution of F-spondin in the cell can be detected by 17 reacting myc antibodies known to be reactive to the introduced myc-epitope with the transfected cells which is expressing the F-spondin carrying myc-epitope. Taking advantage of this myc-epitope, F-spondin may be purified by an antibody affinity column which binds with this mycepitope.
In one embodiment, myc-epitope is introduced in the Ncol site of the rat F-spondin. After that the smal (125), Dra (2731) fragment of the rat F-spondin was isolated.
Bgl2 linkers were added, and the fragment was subcloned into BamH 1 site of pcDNA neo (InVitrogene). The of the gene is located near the T7 RNA promoter. The resulting plasmid, pcFP5.myn, was deposited on March 19, 1992 with the American Type Culture Collection (ATCC), 12301 Parklawn Drive, Rockville, Maryland 20852, U.S.A.
under the provisions of the Budapest Treaty for the International Recognition of the Deposit of Microorganism for the Purposes of Patent Procedure. Plasmid, pcFPS.myn was accorded ATCC designation number 75216.
S. 20 The above uses of the myc-epitope for identification and purification of F-spondin should not be considered limiting only to the myc-epitope. Other epitopes with *specific antibodies against them which are well known to *"an ordinary skilled in the art could be similarly used.
Also provided by this invention are F-spondin complete protein sequences (seq. ID Nos. 10, 12). In an Sembodiment a complete rat F-spondin protein sequence is disclosed (Seq. ID No. 10). In aother embodiment a complete chicken F-spondin protein sequence is provided (Seq. ID No. 12). In a further embodiment a partial Xenopus F-spondin protein sequence is also provided (Seq.
ID No. 14).
Further provided by this invention is a purified, Fspondin polypeptide. As used herein, the term "purified F-spondin" shall mean isolated naturally-occurring Fspondin or protein (purified from nature or manufactured such that the primary, secondary and tertiary conformation, and posttranslational modifications are identical to naturally-occurring material) as well as non-naturally occurring polypeptides having a primary structural conformation continuous sequence of amino acid residues). Such polypeptides include derivatives and analogs.
Such F-spondin will be useful for adhesion and outgrowth of axon. Therefore, this invention also provides a method of attaching nerve cells to a matrix comprising contacting the matrix with nerve cell and purified F- 1 spondin at a concentration effective to effect attachment of the cells to the matrix.
Methods to determine such a concentration are well-known in the art. The effect concentration of F-spondin may be determined by using different concentrations of purified 20 F-spondin to the matrix and the nerve cell. The concentration in which attachment of the matrix and the nerve cell is observed is the effective concentration.
This invention further provides a method of stimulating growth of a nerve cell comprising contacting the nerve -cell with purified F-spondin at a concentration effective to stimulate growth of the nerve cell.
SThis invention also provides a method of regenerating nerve cells in a subject comprising administering to the subject purified F-spondin at a concentration effective to regenerate nerve cells in the subject.
Finally, this invention provides a pharmaceutical composition for stimulating nerve cell growth comprising a pharmaceutically acceptable carrier and purified Fspondin at a concentration effective to stimulate nerve cell growth.
For the purposes of this invention "pharmaceutically acceptable carriers" means any of the standard pharmaceutical vehicles. Examples of suitable vehicles are well known in the art and may include, but not limited to, any of the standard pharmaceutical vehicles such as a phosphate buffered saline solutions, phosphate buffered saline containing Polysorb 80, water, emulsions such as oil/water emulsion, and various type of wetting agents.
This invention will be better understood from the Experimental Details which follow. However, one skilled in the art will readily appreciate that the specific methods and results discussed are merely illustrative of the invention as described more fully in the claims which follow thereafter.
S
Experimental Details Experimental Procedures Library Construction and Screening Directional cDNA libraries were constructed in Lambda ZAP* II (Stratagene®) from embryonic day 13 floor plate and dorsal spinal cord poly (A)'-selected RNA. The ends of the cDNA inserts were located downstream of the T3 RNA polymerase promotor, and the 3' ends downstream of the T7 RNA polymerase promotor. DNA was prepared from the library using the plate lysate method (Sambrook et al., 1989). The DNA was linearised with XhoI and RNA was transcribed with T3 RNA polymerase (Stratagene®). RNA from the dorsal spinal cord library was transcribed in the presence of UTP-biotin (Clontec) diluted 1:10 with UTP. First-strand cDNA was transcribed 1 from the T3 floor plate RNA using an oligo dT XhoI linker (Stratagene*).
Solution hybridization of first strand floor plate cDNA and the dorsal T3 biotinylated RNA was performed as 20 described by Sive and St. John (1988). Approximately 1 Ag of cDNA was hybridized with a 30-fold molar excess of RNA. The nucleic acids were dissolved in 10 ul of hybridization buffer containing 50 mM HEPES (pH 7.6), 0.2% SDS, 2 mM EDTA, 500 mM NaC1, and incubated at 68*C.
Under these conditions, CoT values greater than 100 were obtained. The hybridization mixture was diluted to 60 Ml with hybridization buffer without SDS, and 10 Mg streptavidin was added. The cDNA/biotin RNA hybrids were Sremoved by phenol-chloroform extraction. The remaining single strand cDNA was isolated and hybridized with a 300 fold excess of biotinylated RNA as described above.
About 10% of the starting cDNA was recovered in the first hybridization and about 15-20% from the second T-i-ri Xi--C hybridization.
The subtracted cDNAs were subjected to 20 cycles of a PCR reaction using oligo dT XhoI linker primer and SK primers (Stratagene®). The products of the PCR reaction were cut with EcoRI and XhoI, the primers and the flanking sequences were removed with sephacryl S-300 spin columns (Pharmacia®). The inserts were cloned into Lambda ZAP II arms.
Duplicate filters of the subtracted floor plate library 1 were screened with radiolabelled first strand cDNA derived from floor plate and dorsal spinal cord. 100 ng of mRNA was incubated in 20 Al of 50 mM Tris pH 8.3, mM MgCl 2 150 mM KC1, 1.0 mM dGTP, 1.0 mM dTTP, 100 MCi[32p]dATP (3000 Ci/mmol), 100 pCi[3 2 P]dCTP (3000 Ci/mmol), 100 mg/ml oligo dT, 10 mM DTT, 10 U of RNasin (Promega), 20 U of MulV reverse transcriptase (BRL), for min in 37*C. 4x10 3 recombinant phage were plated and screened. Hybridization and washes were performed at high stringency (Sambrook et al., 1989). The floor plate 20 cDNA probe hybridized selectively with 24 phages. Cross hybridization analysis revealed that these corresponded to three difference cDNAs designated FP2, FP5 and FP24.
The pattern of expression in the spinal cord was determined by in situ hybridization. FP2 and FP5 are 25 expressed selectively in the floor plate while FP24 is expressed in the floor plate, roofplate and in the ventricular zone of the spinal cord. The degree of enrichment as determined by screening the floor plate r enriched library and floor plate library with FP2, and FP35, which is expressed selectively in the floor plate (McKanna Cohen, 1989) is about RNA Transfer Analysis Total RNA was prepared from various tissues using the RNA Azol method (Biotex Laboratories) and then enriched for poly containing transcripts by passage over an oligo (dT) cellulose matrix. RNA transfer was performed as described by Thomas (1980). Probes were labelled by random priming (Feinberg and Vogelstein, 1984) and hybridized under standard conditions.
DNA Seauencin and Analysis cDNA inserts were excised directly as Bluescript plasmids (Stratagenee). The nucleotide sequence of the inserts were determined by the dideoxy chain-termination method (Sanger et al., 1977) using both double-stranded and single-stranded DNA as template for T7 DNA polymerase (Sequenase, United States Biochemicals). The nucleotide sequence of the entire coding region was determined by sequencing both strands. Sequences were assembled on an Apple Maclintosh computer using MacVector (IBI) program.
In Situ Hybridization In situ hybridization was preformed as described 20 previously (Wilkinson et al., 1987) using a T3 or T7 RNA polymerase-derived [3 5 S]UTP-labelled single stranded antisense RNA probe which encompasses a region of the 3' untranslated region of F-spondin (nt 3359-4029), or the TSRs (nt 1545-2626). Exposure times range from four to 25 fourteen days. Sense probes were used as controls.
For whole mount in situ hybridization, Ell rat embryos were fixed in 0.1 M MOPS, 2 mM EGTA, 1 mM Mg S04, 3.7% formaldehyde for 2 hours. In Asit hybridization was preformed essentially as described by Harland (1991), with a few modifications: anti-digoxygenin antibody (Boehringer Mannheim), was preabsorbed to E14 rat acetone powder (Harlow and Lane, 1988) before addition to the hybridization mixture. The chromoqenic reaction was carried out for l-2h.
DNA Constructs The myc epitope was introduced as follows: Two partially complementary ol igonuc leot ides with the sequence: 5'1- CTAGCGAGCAGAAGCTGATCTCCGAGGAGGACCTCA-3' (Seq. ID No. 1) and 5'-CTAGTGAGGTCCTCCTCGGAGATCAGCTTCTGCTCG-3' (Seq. ID No. 2) were annealed to obtain a double-stranded DNA fragment coding for the c-myc proto-oncogene epitope EQKLISEEDL (Seq. ID No. 3) flanked by a SpeI site. the fragment was cloned into a unique SpeI site (nt 1365) in F-spondin. The same epitope was also introduced into a NcoI site (nt 1575) using the oligonucleotides: CATGGGAGCAGAAGCTGATCTCCGAGGAGGACCTCG-3' (Seq. ID No. 4) and 5' -CATGCGAGGTCCTCCTCGGAGATCAGC~rCTGCTCC-3' (Seq. ID No. The tagged F-spondin DNA was subcloned into the expression vector pMT21 (provided by Genetics Institute), or pcDNA-I (InVitrogen).
cos Cells Transfection a. *aa*a.
a a a. a. a a.
20 Cos cells were transfected follows: 80% confluent transfected with 5 jsg DNA, DEAE Dextran (Pharmacia@), After 6h cells were washed serum, 0.1mM choloroquine incubation in DNEN 10% isolation of F-spondin the (BRL), and the cells were by the DEAE-Dextran method as overnight cultures were per 100 zmdish, in 250 g/ml 100 mM Tris pH 7.3, in DMEM.
and incubated in DMD( 10% calf (Sigma) for 2.5h, followed by calf seru.m overnight. For medium was changed to OPTI-ND( incubated f or 48h.
Met&boli Labeling of Cos Cells and Immunooeciitation' Transfected cos cells were- preincubated in =ethioninefree DMEM (BRL@-GIBCO). After 1h at 370C, 25OjsCi/ul[ 3 5
S]
methionine (NEN) was added, and the cells were incubated for an additional 3h. The medium was collected and incubated with anti-myc antibody (MAb 9E10) for hl. The immune complex was precipitated with fixed Staphylococcus aureus (BRL*) for lh. Pellets were washed three times with PBS, before resuspension in lx sample buffer. 35
S-
labelled immunoprecipitated proteins were visualized after electrophoresis on 10% SDS-polyacrylamide gels.
Immunocvtochemistrv F-spondin tagged with the c-myc epitope was detected with MAb 9E10 (Evan et al., 1985). Fluoresceinated isotypespecific second antibody (BoehringerO Mannheim; goat antimouse IgG) was used at a dilution of 1:100. For Immunofluorescence labelling (Dodd and Jessell, 1985), cultures were washed once at 22*C with L15 and then incubated with primary antibody for 30 min at 220C.
1 Cultures were then washed twice in L15-1% normal goat serum (NGS) and incubated with secondary FITC conjugated isotype-specific antibody diluted in L15-1% NGS for min at 229C. Cultures were washed twice and fixed in 4% paraformaldehyde in 0.2 M phosphate buffer (PB) for min, rinsed in 0.12M PB and coverslipped in 0.05% paraphenylenediamine (Sigma) in 0.2 M sodium carbonate (pH glycerol Cultures were viewed on a Zeiss Axioplan microscope under epifluorescence optics.
*SSS
25 Cell Culture Spinal cords were dissected from embryonic day 13 rats and placed into L15 medium at 4*C. The dorsal region of the spinal cord were dissected and incubated Swith 0.05 trypsin (Gibco) for 20 min in a Ca 2 +/Mg2+-free modified essential medium (S-MEM) (Gibco) supplemented with 8 mg ml 1 glucose. The tissue was then washed with S-MEN and triturated to give a single cell suspension.
Spinal cord cells were plated in 35 mm tissue culture dishes on appropriate substrates and grown in Ham's E12 medium (Gibco) supplemented with N3 additive (F12-N3) (Romijin et al., 1982) at a density of 106 cells/dish in a 5% C02 humidified incubator at 370C. Dorsal root ganglia were dissected from E14 rats and treated as described above. Cells were incubated with 0.1 trypsin, and plated with F12-N3 supplemented with 100 ng NGF at a density of 4xl0 4 /dish.
Neurite Outgrowth Assays 5x10 10 cos cells were transfected with pFP5myN and 1 conditioned medium was collected. F-spondin m yc, was affinity purified on a monoclonal anti-myc (9E10) affinity column. Affinity purified F-spondinmYc pl/ml) was absorbed onto nitrocellulose (Lemmon et al., 1989). For controls, parental cos cell conditioned 1 medium was purified on the same column and used as a substrate on nitrocellulose. The nitrocellulose was then blocked with bovine serum albumin (10 mg/ml) which provided a further control for background neurite outgrowth. E14 dorsal root ganglion (DRG) neurons were 20 plated on immobilized protein substrates at a density of 20 2-10x104 cells/35 mm tissue culture dish (Nunc, 35 mm diameter) and grown for 14h. Cultures were then fixed in 4% paraformaldehyde, permeabilized with 0.1% Triton X-100 and stained using MAb 3A10 (Furley et al., 1990; available from Developmental Studies Hybridoma Bank), which recognizes a neuronal filament-associated protein .and serves as a Larker for fine neurites. Neuronal cell bodies and neurites were visualized by indirect immunofluorescence on a Zeiss Axioplan microscope.
Neurite lengths were measured as the distance from the 30 Sedge of the soma (sharply defined by 3A10 fluorescence) to the tip of its longest neurite. Neurite lengths were only measured if the entire length to the neurite could be unambiguously identified. About 25 neurites were measurable within each protein-coated area (3-4 mm 2 Adhesion Assay Dissociated E13 dorsal spinal cord cells were plated on immobilized protein substrate at a density of 106 cells/35 mm tissue culture dish (Nunc, 35 mm diameter).
After one hour the cultures were washed twice with PBS and fixed in 4% paraformaldehyde. Cells were counted on a Zeiss Axioplan microscope at 400x magnification. Ten independent counts were taken from each experiment. The floor plate is a transient neural cell group implicated in the control of cell pattern and axonal growth in the developing vertebrate nervous system.
ExDerimental Results Identification and Sequence of a Floor Plate-Enriched cDNA Clone Cellular assays have revealed that the floor plate has several specializing signalling functions during the embryonic development of the spinal cord. Floor plate- 20 derived signals are likely to be encoded by proteins whose mRNAs are restricted to or are highly enriched in the floor plate. In order to identify such molecules subtractive hybridization techniques have been used to isolate cDNA clones that are expressed by the floor plate 25 but not by the dorsal spinal cord in embryonic day 13 rat embryos (see Figure 2 and Experimental Procedures).
One cDNA clone identified in this screen, designated contained a 0.5 kb insert which hybridized to two major transcripts of 4.5 and 4.7 kb in poly -selected RNA 30 derived from E13 rat floor plate (Figure 3A). Very faint hybridization to the same two transcripts was detected in RNA derived from E13 dorsal spinal cord (Figure 3A) and post-natal day 0 brain (Figure 3C), whereas no 27 hybridization was detected to RNA derived from adult liver and spleen (Figures 3A, The specificity of expression of FP5 transcripts within E13 rat spinal cord was confirmed by in situ hybridization histochemistry which showed that FP5 mRNA is expressed at very high levels in the floor plate but is undetectable in the dorsal region of E13 rat spinal cord (see below). These studies indicate that FP5 transcripts are highly enriched in the floor plate.
Screening of an E13 rat floor plate cDNA library with the 0.5 kb cDNA insert from the FP5 clone identified several additional cDNA clones of which clone FP5-9 contained a 4 kb insert. The FP5-9 cDNA contains a single long open reading frame that starts with a methionine codon at nucleotide 226 associated with a conventional translation initiation sequence (Kozak, 1984) and ends with a TGA stop codon at nucleotide 2646 (Fig. 5A). No in-frame methionine codons were found upstream of the putative translation initiation site and sequences 5' of the initiation site contain stop codons in all three reading frames. Sequencing of several other independently 20 isolated FP5 cDNA subclones spanning the entire coding region did not reveal any differences in the nucleotide sequence of the open reading frame.
2. Translation of the open reading frame FP5-9 predicts a 25 protein 807 amino acids with a molecular mass of 90,766 daltons, and N-terminal hydrophobic leader sequence (Figure 5A; Seq. ID No. 9) with a consensus signal peptide cleavage site (von Heijne, 1985). No other long stretches of hydrophobic residues were observed (Figure 5B) suggesting that the protein does not possess a transmembrane spanning domain. The amino terminal domain of FP5-9 contains a region of clustered basic residues (residues 138-142) which could represent a site for proteolytic processing by mammalian subtilisin-like cleavage enzymes (Steiner, 1991). In addition, the predicted protein contains three N-linked glycosylation sites (Figure 5A). Collectively, these features suggest that the FP5-9 cDNA encodes a secreted protein.
The Protein Encoded by the FP5-9 cDNA has Structural Features of Cell and Substrate Adhesion Molecules Analysis of the predicted amino acid sequence of the 9 encoded protein reveals that it is separable into two major domains (Figure 6A). The NH 2 -terminal domain of 440 residues contains 10 cysteine residues and exhibits no sequence homology to other proteins in the Genbank database. The COOH terminal of the protein extends from residues 441-807 and contains six repeats of a domain 59 amino acids in length which can be aligned on the basis of conserved cysteine, tryptophan and arginine residues (Figures 6B, C).
Similar domains are present in a small number of proteins 20 (Patthy, 1988; Smith et al., 1991). In particular, the adhesive glycoprotein encoded by the thrombospondin I and II genes (Lawler and Hynes, 1986; Bornstein et al., 1991) each possess 3 of these domains which have been designated thrombospondin type 1 repeats (TSRs) (Lawler and Hynes, 1986) (Figure 6C). Two TSRs are found in protein C6-C9 of the alternative complement cascade, one at the NH 2 -terminal and one at the COOH-terminal of each protein (Haefliger et al., 1989; Smith et al., 1991).
Il Moreover, the complement-binding protein properdin 30 contains 6 TSRs which comprise 80% of the protein (Goundis and Reid, 1988). In addition to these vertebrate proteins, the central core of the TSR is similar to region II of malarial circumsporozoite (CS) 29 and other plasmodial proteins (Figure 6C) (Rich et al., 1990; Robson et al., 1988) which appear to mediate the binding of malarial sporozoites to host cells in the early stages of parasitic infection (Dame et al., 1984).
Finally, two TSRs are present in the C.elegans gene Uncwhich appears to regulate axonal pathfinding in a subset of neurons (Hedgecock et al., 1990; Culotti et al., 1991). The organization of cysteine and tryptophan residues in the TSRs of the FP5-9 encoded protein is not similar to that of the NH 2 -terminal TSRs of the C6-C9 complement proteins (Figure 6B). However, the core region of the TSRs in FP5-9 (residues 14-19) is most similar to that of thrombospondin, properdin and the malarial CS proteins (figure 6B). We have named the FPS- 9 gene F-spondin to reflect its high level of expression in the floor plate (see below) and the presence of the TSRs.
The TSRs in thrombospondin promote the adhesion of a variety of different cell types (Prater et al., 1991).
Similarly, the TSR core region of the plasmodium vivax CS '20 protein promotes the attachment of human hematopoietic 20 cell lines in vitro (Rich et al., 1990). The amino acid sequence VTCG which is contained within this common motif appears to be critical to the cell adhesive properties of the CS proteins. A VTCG sequence (Seq. ID No. 6) is also present in the two TSRs of thrombospondin that promote cell adhesion (Prater et al., 1991). Strikingly, there is a VTCG in the fourth TSR of F-spondin and the second and third TSRs of F-spondin contain sequences (VSCG, Seq.
SID No. 7; ATCG, Seq. ID No. 8) that vary by a single conservative substitution (Figure 6B). These 30 observations raise the possibi :-ty that the TSRs in Fspondin mediate cell adhesion. A search of the Genbank database for other proteins implicated in cell adhesion and recognition that contain a VTCG sequence identified V-CAM1 (Hession et al., 1991) and the VLA4 integrin a subunit (Takada et al., 1989).
Analysis of the predicted amino acid sequence of Fspondin reveals several other structural features that may contribute to the functional properties of the protein. The charged region that is interposed between the fifth and sixth TSRs contains the sequence LRE that has been shown to function as a neuronal cell attachment site in the extracellular matrix glycoprotein S-laminin (Hunter et al., 1989a, The first, third, fifth and sixth TSR's of F-spondin contain clusters of basic residues that have been implicated in the binding of proteins to heparin and other sulfated glycosaminoglycans (Cardin and Weintraub, 1989). The first, fourth and fifth TSRs of F-spondin also contain a WSXWS sequence (Figure 6B) which is present in the variant fibronectin type III repeats found in the receptors for several growth and differentiation factors, including ciliary neurotrophic factor (CNTF), leukemia inhibitory factor 20 (LIF) and the interleukins (ILs) 2-7 (Bazan, 1990; Davis, et al., 1991; Patthy, 1990). The function of the WSXWS motif is unclear although mutation at this site in the IL2 receptor blocks transmembrane signalling (Miyazaki et al., 1991).
Expression Pattern of F-Spondin mRNA Northern blot analyses of E13 embryos indicate that Fspondin is expressed at much higher levels in the floor plate than in the dorsal spinal cord. More detailed information on the distribution of F-spondin was provided by localizing its mRNA in developing rat embryos by in situ hybridization. F-spondin mRNA was first detected at E10.5 in cells located at the ventral midline of the 31 neural tube at the level of the prospective midbrain, hindbrain and spinal cord (Figure 7A). At this stage, cells at the ventral midline of the neural tube have acquired floor plate-derived chemoattractant activity (Placzek, et al., 1990c) although no antigenic markers of floor plate differentiation can be detected. The expression of F-spondin mRNA therefore provides an early molecular marker of floor plate differentiation.
The expression of F-spondin mRNA is maintained at high levels in E11-E12 floor plate (Figure 7B) whereas other regions of the spinal cord and hindbrain exhibit undetectable levels of hybridization at this stage. By E12-E13 low levels of mRNA are detected in the ventral horn although there is still no detectable mRNA in the dorsal horn (Figures 7C, In addition, the ventral ventricular zone immediately above the floor plate begins to express high levels of F-spondin mRNA (Figure 7) whereas hybridization to cells in the ventricular zone in the dorsal half of the spinal cord is not detectable (Figure 7E). Thus, expression of F-spondin mRNA reveals Sa molecular difference between ventricular zone cells in the dorsal and ventral spinal cord. Recent studies have suggested that the ventral ventricular zone is the site of origin of oligodendrocyte and astrocyte precursors that subsequently migrate laterally and dorsally to populate the remainder of the spinal cord (Miller, 1991).
F-spondin mRNA levels remain high in the floor plate and ventral ventricular zone at E16 and by this stage significant hybridization is also detected in cells in EP the ventral and intermediate regions of the spinal cord (Figures 7F, By PO, the levels of F-spondin mRNA in the floor plate have decreased and there is an increase in hybridization to other cells in the spinal cord, resulting in an uniform expression of F-spondin mRNA (Figure 7H). F-spondin mRNA is also preferentially expressed in the floor plate of the Ell-E16 hindbrain and midbrain and becomes more widely expressed in the brain at later embryonic stages (not shown).
In addition to the expression of F-spondin in the embryonic CNS, from E11-E12 onwards hybridization is also detected in association with sensory and motor nerve branches that project into the periphery (Figure 7D).
The association with peripheral nerve branches suggests that F-spondin mRNA is expressed in Schwann cells. The expression of F-spondin mRNA in association with peripheral nerves persists till E16, but appears to decrease at later stages, and by PO, little or no hybridization is detected in peripheral nerve (Figure 3C). These results provide evidence that over the period of initial outgrowth of central and peripheral axons, Fspondin mRNA is expressed predominantly by the floor plate with lower levels of expression in cells of the peripheral nerves, probably Schwann cells.
V.
20 F-spondin mRNA is also expressed outside the nervous system. In particular, mesodermal cells underlying the ventral midline of the spinal cord express low levels of F-spondin mRNA from Ell (Figure 7D). In addition, embryonic and PO kidney (Figure 3C), lung and condensing 25 cartilage (not shown) expresses F-spondin mRNA.
Expression of mRNA in the CNS, lung and kidney persists post-natally and in the adult (not shown).
*V Secretion and Cell Surface Association of F-Soondin 30 To determine the cellular localization of the F-spondin protein when expressed in mammalian cells, two epitopetagged derivatives, F-spondinmYc were generated, each of which contain a 10 amino acid insert derived from the
I
human c-myc proto-oncogene that can be detected by MAb 9E10 (Evan et al., 1985) (Figu:. 8A). The cDNAs encoding F-spondin my c were cloned into a mammalian expression vector and transfected into cos cells. To examine whether F-spondin m yc is present in medium conditioned by transfected cells, cos cells were labelled with 3 5
S-
methionine for 3-4h and the released proteins were immunprecipitated with MAb 9E10. Immunoprecipitates from cos cells transfected with two different F-spondinmyc constructs revealed a single major band of -116 kDa that was absent from mock-transfected cells (Figure 8B).
Immunoprecipitation of proteins extracted from the cos cells indicated that the amount of F-spondin recovered from the medium was similar to that associated with the cells (not shown). Thus cos cells release a significant fraction of synthesized F-spondin m YC. Other myc epitopetagged proteins, for example the drosophila wingless protein, are synthesized by cos cells but are not detected in the medium Basler, Personal communication) suggesting that the presence of Fspondinmyc in the medium does not result from leakage from 20 damaged cells. Thus, under these in vitro conditions Fspondinmye is secreted from cells. The apparent molecular weight of F-spondin determined by SDS-PAGE (-116kDa) is significantly greater than that predicted from the amino acid sequence (-90kDa). This difference in molecular weight may derive, in part, from glycoslyation of the core protein.
The cellular localization of F-spondinmyc in transfected cos cells was also determined by immunocytochemistry.
30 High levels of immunoreactivity were associated with the cell surface (Figures 8C, D) with both F-spondin m ye constructs (Figure 8A). No immunoreactivity was detected on the surface of untransfected cos cells (not shown).
The absence of a membrane spanning region and the presence of multiple heparin attachment sites in Fspondin suggests that the cell surface association of Fspondin m y c involves the binding of the secreted protein to the cell surface or extracellular matrix. In support of this, F-spondinmye present in the medium removed from transfected cos cells was found to bind to the surface of untransfected cos cells in vitro (not shown).
F-SDondin Promotes Neural Cell Adhesion and Neurite Outgrowth in vitro The structural features of F-spondin combined with its secretion and association with the cell surface raise the possibility that F-spondin can promote the adhesion of neural cells and the outgrowth of axons. Since F-spondin is expressed at highest levels in the floor plate, the effect of F-spondin on the adhesion and outgrowth of dorsal spinal cord cells to include the population of commissural neurons that project to and across the floor plate was examined. In addition, the expression of Fspondin mRNA in peripheral nerve suggested that the dorsal root ganglion (DRG) neurons might adhere to and extend neurites on F-spondin.
*54* The F-spondin m yc protein was purified on a MAb 9E10 affinity column from medium exposed to transfected cos cells (Figure 9A) and immobilized onto a nitrocellulose substrate (Lemmon et al., 1989). The ability of FspondinmYe to promote the outgrowth of E14 DRG neurons was compared with that of MAb 9E10 affinity-purified proteins secreted from untransfected cos cells and BSA. Outgrowth i 3 of DRG neurons on EHS laminin was used as a positive control. Over 80% of DRG neurons extended neurites on Fspondin (Figures 9B, D) and the length of DRG neurites that extended on F-spondin was similar to that on laminin (not shown) and significantly greater than that on parental cos cell proteins and on BSA (Figures 9C, D).
Similar results were obtained with both versions of Fspondinmyc (not shown). In addition, the number of DRG neurons that adhered to a substrate of F-spondinye after 18h was about 3 fold greater than that to BSA and parental cos cell proteins, and similar to that on laminin (not shown). These observations provide evidence that F-spondin can promote the adhesion of DRG neurons and the extension of neurites in vitro. The expression of F-spondin by peripheral nerve cells in vivo occurs before many sensory neurons have extended peripheral projections and could therefore contribute to the growth of developing sensory axons in the peripheral nervous system.
The ability of F-spondin my to promote the adhesion and outgrowth of dorsal spinal cord cells was also examined.
We found that dorsal spinal cord cells adhered well to Fspondin myc Within 60 min (Figures 10A, E) the number of cells adhering to F-spondin was 10-15 fold greater than 20 that to MAb 9E10 affinity-purified proteins secreted from untransfected cos cells or to BSA (Figures 10C, The majority of the adherent cells are neurons as determined by detection of the polysialic acid side chain of NCAM with MAb 5A5 (not shown; see Dodd et al., 1988; Karagogeos et al., 1991). Moreover, many adherent spinal cord neurons extended short neurites during this time period (Figure 10B). To examine further whether Fspondin promotes the outgrowth of spinal cord neurites the neurite length of adherent spinal cord neurons after 18h in vitro was determined. The length of spinal cord neurites on F-spondin my had increased by 18 hours; however neurites outgrowth on purified cos cell proteins and on BSA has also increased significantly and was not detectably different from that on F-spondinmyc. Thus it remains unclear whether F-spondin promotes extensive neurite outgrowth as well as the adhesion of spinal cord neurons.
The adhesion of a variety of cell lines to TSRs or to peptide derived from these repeats has been shown to be inhibited by glycosaminoglycans and other sulfated glycoconjugates (Roberts, 1988; Bernfield and Sanderson, 1990; Prater et al., 1991). Moreover, heparin sulfate proteoglycans have been suggested to function as cell surface receptors for thrombospondin (Holt et al., 1984; Sun et al., 1989; Bernfield and Sanderson, 1990). It is possible therefore that the interactions of neural cells with F-spondin may be inhibitable by addition of soluble glycosaminoglycans. It was found that adhesion of dorsal spinal cord neurons to F-spondin was markedly inhibited by heparin, dextran sulfate (not shown) and to a lesser extent by chondroitin sulfate (Figures 10D, To control for non-specific inhibition of the interactions of spinal cord cells with all adhesive substrates, the 20 spinal cord neurons adhere well to fibronectin was determined and it was found that their adhesion is not significantly affected by concentrations of heparin that block adhesion to F-spondin (not shown). Heparin also reduced to near background levels the adhesion of DRG neurons to F-spondin (not shown). It was not possible to determine whether the outgrowth of neurites from DRG neurons is also blocked by addition of glycosaminoglycans because heparin caused the detachment of virtually all neurons from the F-spondin substrate, even when added to DRG neurons that had been permitted to settle on Fspondin for 2-3h.
Experimental discussion Floor plate cells are located at the ventral midline of the developing nervous system and have been implicated in the control of neural cell identity and in the guidance of developing axons (Jessell and Dodd, 1991). In order to identify genes that might contribute to the functions of the floor plate, subtractive hybridization techniques have been used to isolate cDNA clones encoding a novel protein, F-spondin. F-spondin mRNA expressed at high levels by the developing floor plate and at low or undetectable levels in other regions of the embryonic spinal cord over the period that axons first extend. The predicted structure of F-spondin together with its biochemical properties indicates that it is a secreted glycoprotein with homo'ogy to other proteins that mediate cell adhesion and neurite outgrowth. F-spondin promotes the adhesion and outgrowth of axons from embryonic neurons in vitro, suggesting that it may contribute to the growth and guidance of commissural axons at the ventral midline of-the spinal cord and of sensory axons in the periphery.
20 Localization of F-SDondin Several lines of evidence suggest that the F-spondin protein may be associated with the extracellular matrix.
First, F-spondin has several clusters of basic residues that function as glycosaminoglycan binding domains in other secreted proteins. Second, F-spondin is associated with the surface of cos cell transfectants. Third, the complement binding protein properdin which consists almost entirely of 6 TSRs has been shown to bind sulfated glycoconjugates (Holt et al., 1990).
The restricted distribution of F-spondin mRNA in -ne embryonic nervous system contrasts with that of other secreted glycoproteins which promote neural cell adhesion and neurite outgrowth. For example, the expression of Fspondin mRNA is more restricted than that of thrombospondin I (O'Shea and Dixit, 1988; O'Shea et al., 1990) and of tenascin/cytotactin (Wehrle and Chiqet, 1990) which appears to be widely expressed in the embryonic central nervous system. Similarly, laminin and fibronectin are expressed in many regions of the developing peripheral nervous system (Sanes et al., 1990). One glycoprotein which has a restricted distribution during nervous system development is Slaminin, an isoform of the laminin B chain (Hunter et al., 1989a).
The TSRs of F-Snondin may be Responsible for Neural Cell Adhesion and Axon Extension The domains of F-spondin that mediate neural cell adhesion and neurite extension have not been mapped although several indirect lines of evidence suggest that the TSRs may be involved. First, proteolytic fragments of thrombospondin which contain the TSRs promote the adhesion of melanoma cells and antibodies directed 20 against the TSRs domain block cell adhesion (Prater et al., 1991). Second, both native thrombospondin and a 140 kDa proteolytic fragment which includes the TSR domains promote the outgrowth of neurites from central and peripheral neurons in vitro (Osterhout and Higgins, 1990; Osterhout et al., 1992; Neugebauer et al., 1991; O'Shea et al., 1991). In addition, antibodies directed against the TSR domains block neurite outgrowth on thrombospondin (Osterhout and Higgins, 1990; Osterhout at al., 1992).
Third, the plasmodial CS proteins, which contain the core 30 domain of the TSRs also promote the adhesion of a wide variety of mammalian cells (Rich et al., 1990).
The adhesive properties of the CS proteins have been mapped to the VTCG sequence (Rich et al., 1990). In addition, the two peptides derived from the TSRs in thrombospondin that are potent attachment factors for melanoma cells also contain the VTCG sequence whereas the peptide derived from the third TSR which does not contain this sequence is not adhesive (Prater et al., 1991).
Thus, the presence of a VTCG in the fourth TSR of Fspondin suggests that this domain could be involved in the adhesive properties of F-spondin. Nevertheless, other domains within F-spondin may be involved in neural cell adhesion or neurite outgrowth. For example, the region interposed between the fifth and sixth TSP-1 repeats of F-spondin contains an LRE sequence that mediates the neuronal attachment properties of S-laminin (Hunter et al., 1989b).
The ability of neural cells to adhere to and extend neurites on F-spondin suggests that there are neural receptors for this protein. The inhibition by heparin of the adhesion of dorsal spinal cord cells and DRG neurons to F-spondin suggests that proteoglycans may constitute 20 neuronal F-spondin receptors or may regulate receptor function.
The conservation of TSRs in F-spondin and thrombospondin also raises the possibility that receptors for the TSR domains of thrombospondin may interact with the related domains of F-spondin. There is evidence that the TSRs of thrombospondin can interact with 3 distinct classes of cellular receptors (Frazier, 1991). First, thrombospondin and a VTCG-containing peptide from the TSR core region can bind to an 88 kDa membrane glycoprotein, GPIV, or CD36, which is present on many cell types (Asch et al., 1990, 1991). Second, thrombospondin can bind to sulfated glycoconjugates including the heparin sulfate proteoglycan syndecan (Roberts, 1988; Sun et al., 1989; Holt et al., 1989; Bernfied and Sanderson, 1990). In addition, the adhesion of cells to VTCG-containing peptides derived from the TSR domains of thrombospondin and plasmodial CS proteins can be inhibited by heparin and other glycosaminoglycans (Holt et al., 1990; Prater et al., 1991; Rich et al., 1991). Third, antibodies against integrins block neurite outgrowth on thrombospondin (Neugebauer et al., 1991). Since antibodies to the TSR domains of thrombospondin block the outgrowth of neurites on thrombospondin (Osterhout and Higgins, 1990; Osterhout et al., 1992) it is possible that sequences within the TSRs interact with neuronal integrins.
Possible Functions of F-Soondin in Neural DeveloDment The most prominent expression of F-spondin in the embryonic nervous system is in the floor plate, an epithelial cell group that has been implicated in several aspects of spinal cord development. Midline neural plate cells that give rise to the floor plate undergo marked 20 cell shape changes during the closure of the neural tube.
Thus, one possible function of F-spondin could be to mediate adhesive interactions between floor plate cells that maintain the integrity of the floor plate during the formation of the embryonic spinal cord. The expression of F-spondin mRNA in floor plate cells is highest at the time that the floor plate has been suggested to have roles in the chemotropic (Tessier-Lavigne et al., 1988; Placzek et al., 1990a) and contact (Dodd et al., 1988) guidance of commissural axons. It is found that recombinant F-spondinmYc secreted from cos cells does not mimic the ability of the floor plate derived chemoattractant to promote the outgrowth of commissural axons from dorsal spinal cord explants (Klar, Placzek, Tessier-Lavigne, Dodd and Jessell, unpublished observations). This suggests that F-spondin may not be involved in the long-range guidance of commissural axons to the floor plate, at least through chemotropism.
F-spondin could be involved in the contact-dependent guidance of commissural axons once they reach the ventral midline of the spinal cord under the influence of chemotropic guidance cues. The growth cones of commissural neurons cross the midline by growing between the basal surface of floor plate cells and the underlying basal lamina (Kuwada et al., 1990; Yaginuma et al., 1991). F-spondin secreted by the floor plate may accumulate at high levels in association with the basal surface of floor plate cells or with the underlying basal lamina thus generating a difference in adhesive properties of the floor plate and the lateral neuroepithelium. The growth cones of commissural neurons may adhere preferentially to F-spondin, prompting them to change trajectory at the boundary of the floor plate and lateral neuroepithelium. It is also possible that F- 20 spondin has a more active signalling role which induces changes in the properties of commissural growth cones that permits them to respond to other midline guidance cues. Several proteins are expressed selectively on the Ssurface of floor plate cells at this stage of spinal cord development (Dodd and Jessell, 1988; Chuang and Lagenaur, 1990) and could provide cues that contribute to the guidance of commissural axons at the midline.
F-spondin mRNA is also expressed by cells in the S peripheral nerve, presumably Schwann cells, from Ell to E16 over the period that motor and sensory axons project to their peripheral targets. Non-neuronal cells in peripheral nerve are known to secrete a variety of 42 extracellular matrix glycoprotein, including laminin and fibronectin that can promote the growth of developing axons. Antibody inhibition studies have provided evidence for the existence of additional molecules that mediate neuronal outgrowth on peripheral nerve substrates (Tuttle et al., 1989). The ability of recombinant Fspondin to promote the outgrowth of embryonic sensory neurons in vitro suggests that the protein may be released by non-neuronal cells in the peripheral nerve and could contribute to the initial outgrowth of sensory axons in vivo.
Taken together, the present studies identify F-spondin as a novel secreted protein with potential roles in neural cell adhesion and neurite outgrowth in vivo. The development of antibodies that recognize native F-spondin will be important in determining the localization of the protein within the nervous system and in assessing its function in more detail.
*20 e 43 1. Asch, Heiner, E. and Nachman, R.L. (1990) An amino acid sequence motif in thrombospondin is responsible for CD36 binding. Blood 2&:445a, Suppi.
I1.
2. Asch, Tepler, Silbiger, S. and Nachman, R.L. (1991) Cellular attachment to thrombospondin: Cooperative interactions between receptor systems.
3. Biol. Chem. 2&:1740-1745.
3. Bazan, J.F. (1990) Structural design and molecular evolution of cytokine receptor superfamily. Proc.
Natl. Acad. Sci. 87Z:6934-6938.
4. Bernfield, M. and Sanderson, R.D. (1990) Syndean, a developmentally regulated cell surface proteoglycan that binds extracellular matrix and growth factors.
Philos. Trans. R. Soc. Lond. 22:171-186.
5. Bernhardt, R.R. and Kuwada, J.Y. (1990) Floor plate ablations induces axonal pathfindings errors by spinal coumissural cells in the zebrafish embryo.
Soc. Neurosci. Abst. 1&~:139.2.
6. Bornstein, O'Rourke, Wikstrom, Wolff, Katz, Li, P. and Dixit, V.M. (1991) A second, expressed thrombospondin gene (Thbs2) exists in the mouse genome. J. Biol. Chem af&:12821-12824.
V 7. Bovolenta, P. and Dodd, J. (199) Guidance of Commissural growth cones at the floor plate in the embryonic rat spinal cord. Development ILQ2:435-447.
Bovolenta, P. and Dodd, J. (1991) Perturbation of neuronal differentiation and axon guidance in the spinal cord of mouse embryos lacking a floor plate: Analysis of Danforth's short-tail mutation.
Development =m:625-639.
9. Cardin, A.D. and Weintraub, H.J.R. (1989) Molecular modeling of protein-glycosaminoglycan interactions.
Arteriosclerosis 9-:21-32.
44 Chuang, W. and Lagenaur, C.F. (1990) Central nervous system antigen P84 can serve as a substrate for neurite outgrowth. Dev. Biol. .21Z:219-232.
11. Culotti, Spence, Zhou, Scott, I., Leugn-Hagesteijn, Stern, B. and Hedgecock, E.
(1991) The unc-5 axon guidance gene of C.elegans has features of a cell adhesion receptor. J. Cell Biol.
122a.
12. Dame, Williams, McCutchan, Weber, Writz, Hockmeyer, Maloy, W.L., Haynes, Schneider, Roberts, Sanders, Reddy, Diggs, C.L. and Miller, L.H.
(1984) Structure of the gene encoding the immunodominant surface antigen on the sporozoite of the human malaria parasite plasmodium falciparum.
Science :593-599.
13. Davis, Aldrich, Valenzuela, Wong, Furth, Squinto, S.P. and Yancopoulos, G.D. (1991) The receptor for ciliary neurotrophic factor. Science 2,U:59-63.
14. Dodd, J. and Jessell, T.M (1985) Lactoseries carbohydrates specify subsets of dorsal .root ganglion neurons and projecting to the superficial dorsal horn of rat spinal cord. J. Neurosci. A:3278- 3294.
*15. Dodd, J. and Jessell, T.M (1988) Axon guidance and the patterning of neuronal projections in vertebrates. Science ZJZ:692-699.
16. Dodd, Morton, Karagogoes, Yamamoto, M.
and Jessell, T.M. (1988) Spatial regulation of axonal glycoprotein expression on subsets of embryonic spinal neurons. Neuron 1:105-116.
17. Evan, Lewis, Rausay, Bishop, J.M.
(1985) Isolation of monoclonal antibodies specific for human c-myc proto-oncogene product. Mol. Cell.
'i Biol. 5:3610-3616.
18. Feinberg, A.P. and Volgelsten, B. (1983) A technique for radiolabelling DNA restriction endonuclease fragments to high specific activity. Anal. Biochem.
J132:6-13.
19. Frazier, W.A. (1991). Thrombospondins. Current Opinions in Cell Biology 2:792-799.
Furley, Morton, Manalo, Karagogeos, Dodd, J. and Jessell, T.M. (1990) The axonal glycoprotein TAG-1 is an immunoglobulin superfamily member with neurite outgrowth-promoting activity.
Cell 61:157-170.
21. Goundis, D. and Reid, K.B.M. (1988) Properdin, the terminal complement components, thrombospondin and the circumsporozoite protein of malaria parasites contain similar sequence motifs. Nature 315:62-65.
22. Haefliger, Tschopp, Vial, N. and Jennet, D.E. (1989) Complete primary structure and functional characterization of the sixth component of the human complement system. J. Biol. Chem.
264:18041-18051.
.o 23. Harland, R.M. (1991) In situ hybridization: an 20 improved whole mount method for Xenopus embryos.
Methods in Cell Biology 36:675-685.
24. Harlow, E. and Lane, D. (1988) Antibodies: A Laboratory Manual. Cold Spring Harbor Laboratory.
25. Hatta, Kimmel, Ho, R.K. and Walker,
C.
(1991) The cyclops mutation blocks specification of the floor plate of the zebrafish central nervous system. Nature a35:339-341.
26. Hedgecock, Culotti, J.G. and Hall, D.H. (1990) "3 The unc-5, unc-6 and unc-40 genes guide circumferential migrations of pioneer axons and mesodermal cells on the epidermis in C. Elegans.
Neuron 2:61-85.
27. Hedgecock, E.M. and Hall, D.H. (1990) Homologies in the neurogenesis of nematodes, arthropods and chordates. Seminar in Neurosci. 2,:159-172.
28. Hessian, Tizard, Vassallo, Schiffer, Goff, May, Chi-Rosso, Luhowskyj, Lobb, R. and Osborn, L. (1991) Cloning of an alternate form of vascular cell adhesion molecule-i (VCAM1). J. Biol. Chem. jjU:6682-6685.
29. Holley, J. and Silver, J. (1987) Growth pattern of pioneering chick spinal cord axons. Devl. Biol.
112.:375-388.
30 Holdt, Kaesberg, Ershier, Esko, J.D. and Masher, D.F. (1989) Chinese hamster ovary cell adhesion to human platelet thrombospandin is dependent on cell surface heparan sulfate proteoglycan. 12:994-1001.
31. Holt, Pangburn, M.K. and Ginsburg, V. (1990) Praperdin binds to sulfatide (Gal(3-S0 4 )B1-1Cer] and has a sequence homology with other proteins that bind sulfated glycoconjugates. J. Biol. Chem.
Mj: 2852-2855.
32. Hunter, Shah, Merlie, J.P. and Sanes, J.R.
(1989b) A laminin-like adhesive protein concentrated in the synaptic cleft of the neuromuscular junction.
Nature 2U:229-233.
33. Hunter, Porter, Bulock, Adams, Merlie, J.P. and Sanes, J.R. (1989) Primary sequence of a motor neuron-selective adhesive site in the synaptic basal lamina protein S-laminin. Cell 34. Jessell, T.M. and Dodd, J. (1991) Floor platederived signals and the control of neural cell pattern in vertebrates. Harvey Lecture Series (In Pres).
Karagogeos, Morton, Casano, Dodd, J.
47 and Jessell, T.M. (1991) Developmental expression of the axonal glycoprotein TAG-1: Differential regulation by central and peripheral neurons in vitro. Development 112:61-67.
36. Klambt, Jacobs, J.R. and Goodman, C.S. (1991) The midline of the drosophila central nervous system: a model for the genetic analysis of cell fate, cell migration and growth cone guidance. Cell §A:801-815.
37. Kozak, M. (1984) Compilation and analysis of sequences upstream from the translational start site in eukaryotic mRNA. Nucl. Acids Res. 12:857-872.
38. Kuwada, Bernhardt, R.R. and Nguyen, N. (1990) Development of spinal neurons and tracts in the zebrafish embryo. J. Comp. Neurol. 1=2:617-628.
39. Kyte, J. and Doolittle, R.F. (1982) A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 157:105-132.
40. Lawler, J. and Hynes, R.O. (1986) The structure of human thrombospondin, an adhesive glycoprotein with multiple calcium-binding sites and homologies with 20 several different proteins. J. Cell. Biol.
1 3:51635-1648.
41. Lemmon, Farr, K.L. and Lagenaur, C. (1989) Llmediated axon outgrowth occurs via a homophilic binidng mechanism. Neuron 2:1597-1603.
42. McKanna, J.A. and Cohen, S. (1989) The EGF receptor kinase substrate p35 in the floor plate of the embryonic rat CNS. Science 242:1477-1479.
43. Miller, R.H. (1991) Ventral origin of immunoreactive glial precursor cells in the 30 developing rat spinal cord. Soc. Neuro. Sci. Abst.
12:235.
44. Miyazaki, Maruyama, Yamada, Hatakeyama, M. and Taniguchi, T. (1991) The integrity of the 48 conserved IWS motif'I common to IL-2 and other cytokine receptors is essential for ligand binding and signal transduction. The EMfBO Journal 1,Q:3191- 3197.
Nambu, Lewis, Wharton, K.A. and Crews, S.T. (1991) The drosophila single-minded gene encodes a helix-loop-helix protein that acts as a master regulator of CNS midline development. cell _UZ: 1157-1167.
46. Neugebauer, Emmett, Venstrom, K.A. and Reichardt, L.F. (1991) Vitronectin and thrombospondin promote retinal neurite outgrowth: Developmental regulation and role of integrins.
Neuron 1:345-358.
47. O'Shea, K.S. and Dixit, V.M. (1988) Unique distribution of the extracellular matrix component thrombospondin in the developing mouse embryo. J.
Cell Biol. ISQ2:2737-2748.
48. O'Shea, K.S. and Rheiriheimer, J.S.T. and Dixit, V.14.
(1990) Deposition and role of thrombospondin in the histogenesis of the cerebellar cortex. J. Cell Biol.
I.Q:1275-1283.
49. O'Shea, Liu, L-H.J. and Dixit, V.14. (1991) Thrombospondin and a 140 kd fragment promote adhesion and neurite outgrowth from embryonic centra and peripheral neurons and from PC12 cells.
.:25 Neuron 2_:231-237.
Osterhout, D.J. and Higgins, D. (1990) Thrombospondin promotes axonal growth in sympathetic neurons. Soc. Neurosci. Abet. 12i:312.
51. Osterhout, Frazier, W.A. and Higgin, D. (1992) Thrombospondin promotes process outgrowth in neurons from the peripheral and central nervous systems.
Dae Biol. In Press.
52. Patthy, L. (1990) Homology of a domain of the growth 49 hormon/prolactin receptor family with type III modules of fibronectin. Cell 1:13-14.
53. Patthy, L. (1988) Detecting distant homologies of Ixosaic proteins. Analysis of the sequences of thrombomodulin, thrombospondin complement components C9, CS alpha and CS beta, vitronectin and plasma cell membrane glycoprotein PC-i. J. Hal. Biol.
2.U: 689-696.
54. Perkcins, Nealis, Haris, Chapman, Goundis, D. and Reid, K.B.N. (1989) Secondary structure in properdin of the complement cascades and related proteins: A study by fourier transforms infrared spectroscopy. Biochemistry 7176-7182.
Placzek, Tessier-Lavigne, Jessell, T.M. and Dodd, J. (1990a) Orientation of Commissural axons in vitro in response to a floor plate derived chemoattractant. Development 19-30.
56. Placzek, Tessier-Lavigne, Yamada, T., Jessell, and Dodd, J. (1990b) The guidance of developing axons by diffusible chemoattractants.
Cold Spring Harbor Symp U:279-988.
57. Placzek, Tessier-Lavigne, Yamada, T., Jessell, T.M. and Dodd, J. (1990c) Mesodermal control of neural cell identity: floor plate induction by the notochord. Science 2,,:985-988.
so**58. Placzek, Yamada, Tessier-Lavigne, N., 00. Jessell, Dodd, J. (1991) Control of dorso 4, ventral pattern in vertebrate neural development: 0 0000induction and polarizing properties of the floor plate. Development (In Press).
59. Prater, Plotkin, Jays, D. and Frazier, W.A. (1991) The properdin-like type I repeats of a human thrombospondin contain a call attachment site.
Cell Biol. .IU:1031-L040.
Rich, George, Law, J.L. and Martin, W.J.
(1990) Cell-adhesive motif in region II of malarial circumsporozoite protein. science W.i:1574-1577.
61. Rich, Hinton, D.R. and Blanks, J.B. (1991) Attachment of developing mouse retinal and lens cells to a sequence common to thrombospondin and malarial proteins. J. Cell Biol. 1.lU:441a.
62. Roberts, D.D. (1988) Interactions of thrombospondin with sulf ated glycolipids and proteoglycans of human melanoma cells. Cancer Research jj:6785-6793.
63. Robson, Hall, Jennings, M.W., Harris, Marsh, Newbold, Tate, Weatherall, D.J. (1988) A highly conserved amino-acid sequence in thrombospondin, properdin and in proteins from sporozoites and blood stages of a human malaria parasite. Nature =.j:79-82.
64. Romijin, Gabets, Mud, M.T. and Walter, P.S. (1982) Nerve outgrowth, synaptogenesis and bioelectric activating in rate cerebral cortex tissue cultured in serum-free, chemically defined medium. Devi. Brain. Res. 2,:583-589.
Sambrook, Fritsch, E.F. and Maniatis, T. (1989) Molecular Cloning. Cold Spring Harbor Laboratory Press.
*66. Sanes, Engvall, Butkowski, R. and Hunter, D.D. 91990) Molecular heterogeneity of basal laminas: Isoforms of laminin and collagen IV at the 25 neuromuscular junction and elsewhere. J. Cell Biol.
1: 1685-1699.
67. Sanger, Nicklen, S. and Coulson, A.R. (1988) DNA sequencing with chain-terminating inhibitors. Proc.
Natl. Acad. Sci. 74:5463.
68. Schachner, Antonicek, Fahrig, T. et al.
(1990) Families of neural cell adhesion molecules.
In Morphoregulatory Molecules (eds. G.M. Edelman, B.A. Cunningham and J.P. Thiery) John Wiley and Sons, New York, pp. 443-468.
69. Sive, H.L. and John, T.S. (1988) A simple subtractive hybridization technique employing photoactivatable blotin and phenol extraction.
Nuci. Acids. Res. 1&:10937.
7C. Smith, Nolan, Reid, K.B.N. and Perkins, S.J. (1991) Neutron and X-ray scattering studies of the human complement protein properdin provide an analysis of the thrombospondin repeat. Biochemistry 8000-80008.
71. Steiner, D.F. (1991) Prohormone convertases revealed at last. Current Biology .2:375-377.
72. Sun, Mosher, D.F. and Rapraeger, A. (1989) Heparan sulfate-mediated binding of epithelial cell surface proteoglycan to thrombospondin. J. Biol.
Chem 2&A:2885-2889.
73 Takada, Elices, Crouse, Healer, N.E.
(1989) The primary structure of the a subunit of VLA-4: homology to other integrins and a possible cell-cell adhesion function. EMBO J. 1:1361-1368.
.74. Tessier -Lavigne, N. and Paczek, M. (1991) Target :attraction: are developing axons guided by 7.chemotropism? Trends in Neuroscience 11:303-310.
7.Tessier-Lavigne, Placzek, Lumaden, A.G.S., Dodd, J. and Jessell, T.N. (1988) Chemotropic guidance of developing axons in the mammalian central nervous system. Nature M:775-778.
76. Thomas, P. (1983) Hybridization of denatured RNA transferred or dotted onto nitrocellulose paper.
Moth. Enzymol. =&:255-266.
77. Tuttle, Sandrock, A.W. and Matthew, W.D. (1989) Analysis of complex matrices functional in neuronal process extension using monoclonal antibodies in vitro and in vivo. Dev. Neurosci. 11:289-299.
78. van Straaten, Hekkinq, Wiertz- Hoessels, Thors, F. and Drukker, J. (1988) Effect of the notochord on the differentiation of a f loor plate area in the neural tube of the chick embryo. Anat. Embryol. 177:317-324.
79. von Heijine, G. (1985) Signal sequences: the limits of variation. J. Mol. Biol. .11:99-105.
80. Wagner, Thaller, Jessell, T.M. and Eichele, G. (1990) Polarizing activity and retinoid synthesis by the f loor plate of the neural tube. Nature 2Al: 819-822.
81. Weber, A. (1938) Croissance de fibres nerveuses commissurales lors de lesion do la moelle epiniere chez de jeunes embryons de poulet. Biomorphosis .1:30-35.
82. Wehrle, B. and Chiqet M. (1990) Tenascin is accumulated along developing peripheral nerves and allow. neurite outgrowth in vitro. Development =J.:401-415.
83. Wilkinson, Bailes, Champion, J.E. and McMahon, A.P. (1987) A molecular analysis of mouse development from 8 to 10 dy otciu eet changes ony in emrync loi exprmein.
Development 22: 493-500.
:84. Yaginum-A, Homma, Kunzi4, L. and Oppenheim, (1991) Pathfinding by growth cones of coumissural interneurons in the chick embryo spinal cord: a light and electric micropacopic study. J.
Coup. Neurol. IQA:78-102.
Yaginuma, H. and Oppenheim, R.W. (1991) An experimental analysis of in vitro guidance cues used by axons of spinal interneurons in the chick embryo: evidence for chemotropism and related guidance mechanisms. J. Neuroscience 11:2598-2613.
86. Yamada, Placzek, Tanaka, Dodd, J. and Jessell, T.M. (1991) Control of cell pattern in the 53 developing nervous system: Polarizing activity of the floor plate and notochord. Cell §A:635-647.
SEQUENCE LISTING GENERAL INFORMATION: APPLICANT: Jessell, Thomas M Klar, Avihu (ii) TITLE OF INVENTION: CLONING, EXPRESSION AND USES OF A NOVEL SECRETED PROTEIN, F-SPONDIN (iii) NUMBER OF SEQUENCES: (iv) CORRESPONDENCE ADDRESS: ADDRESSEE: Cooper Dunham STREET: 30 Rockefeller Plaza CITY: New York STATE: New York COUNTRY: USA ZIP: 10112 COMPUTER READABLE FORM: MEDIUM TYPE: Floppy disk COMPUTER: IBM PC compatible OPERATING SYSTEM: PC-DOS/MS-DOS SOFTWARE: PatentIn Release Version #1.25 (vi) CURRENT APPLICATION DATA: APPLICATION NUMBER: US FILING DATE:
CLASSIFICATION:
(viii) ATTORNEY/AGENT INFORMATION: NAMtE White, John P REGISTRATION NUMBER: 28,678 REFERENCE/DOCKET NUMBER: 40028
S*
20 (ix) TELECOMMUNICATION INFORMATION: TELEPHONE: (212) 977-9550 TELEFAX: (212) 664-0525 S* TELEX: 422523 COOP UI INFORMATION FOR SEQ ID NO:1: S(1) SEQUENCE CHARACTERISTICS: 25 LENGTH: 36 base pairs TYPEI nucleic acid STRANDEDNESS: singl* TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: CTAGCGAGCA GAAGCTGATC TCCGAGGAGG ACCTCA 36 INFORMATION FOR SEQ ID NO:2: SEQUENCE CHARACTERISTICS: LENGTH: 36 base pairs TYPE: nucleic acid STRANDZDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: eDNA
S*
S
S.
S
S**S
a S S. S *5 p5 *5e5 5 (xi) SEQUENCE DESCRIPTION: SEQ ID 11:2: CTAGTGAGGT CCTCCTCGGA GATCAGCTTC TGCTCG INFORMATION FOR SEQ ID 11:3: SEQUENCE CHARACTERISTICS: LENGTH: 10 amino acids TYPE: amino acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID 11:3s Glu Gln Lys Lou Ile Ser Glu Glu Asp Lou 1 5 INFORMATION FOR SEQ ID 110:4: SEQUENCE CHARACTERISTICS:- LENGTH: 36 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear 20 (1i) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID 110:4: CATGGGAGCA GAAGCTGATC TCCGAGGAGG ACCTCG 25 INFORMATION FOR SEQ ID 11:5: SEQUENCE CHARACTERISTICS: LENGTH: 36 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID N110: CATGCGAGGT CCTCCTCGGA GATCAGCTTC TGCTCC INFORMATION FOR SEQ ID NO:6: SEQUENCE CHARACTERISTICS: LENGTH: 4 amino acids TYPE: amino acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: Val Thr Cyn Gly 1 INFORMATION FOR SEQ ID NO:7: Wi SEQUENCE CHARACTERISTICS: LENGTH: 4 amino acids TYPE: amino acid STRANDEDNESS: single TOPOLOGY: linear (1i) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: Val Ser Cys Gly 1 INFORMATION FOR SEQ ID NO:8: SEQUENCE CHARACTERISTICS: LENGTH: 4 amino acids 20 TYPE: amino acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: Ala Thr Cys Gly INFORMATION FOR SEQ ID NO:9: SEQUENCE CHARACTERISTICS: LENGTH: 4029 base pairs TYPE: nucleic acid STRANDRDNESS: single *1 TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA 57 (Ix) FEATURE: NAME/KEY: CDS LOCATION: 226..2647 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: CCCTCCCTCT TCGCGCTCCT T0GCCAC0GC CCGCCCCTCA GCTCCGCTGC ToGr4CTCCGC TCAGAGCAGC GCAGCTCCGC AGCCAAAGCG AGOGGGCTC GGGCTCCCCA CCGCCAGTGC 120 CACCCGGOCT CCTCCAGCTT TCGCCTCTGC AGCTCCCGTC ACTTGOAGTA AAAGTGTCCT 180 GACAG=GTC TGCAACATCA GCAGAAAGTT GGGAOOTCCT CGAGA ATG AGO CTA 234 Me0t Arg Lou
I
TCT CCC OCO CCC CTG AGO CTT AGC CGO GOT CCG GCG CTG CTOG CC CTG 282 Ser Pro Ala Pro Lou Ar; Lou Ser Ar; Gly Pro Ala Lou Lou Ala Lou 10 GCG CTG CCC CTG GCC OCA GCG CTC OCT TTC TCG GAT GAG ACC CTG GAC 330 Ala Lou Pro Lou Ala Ala Ala Lou Ala Phe Ser Asp Glu Thr Lou Asp 25 30 AAA GTG 0CC AAO TCG GAG 000 TAC TGC AGC CGC ATC TTG CCC 0CC CAG 378 Lys Val Ala Lys Sor Glu Gly Tyr Cys 8cr Ar; Ile Lou Ar; Ala Gin 45 000 ACA COG COT GAG GGA TAO ACA GAG TTC AGO CTC COO GTG OAA 000 426 Gly Thr Arg Arg Glu Gly Tyr Thr Glu Phe So: Lou Ar; Val Glu Oly 60 GAO COT GAO TTC TAT AAG CR GGA AGO AGC TAO CGA GTO ACA CTC TCG 474 *Asp Pro Asp Ph* Tyr Lys Pro Oly Ser Tyr Ar; Val Tb: Lou Ser *70 75 OCT 0CC CCT CCC TCC TAC TTC AGA GOC TTC ACG TTA ATT OCT CTC AAA 522 *Ala Ala Pro Pro Ser Tyr Phe Ar; Oly Phe Tb: Lou Ile Ala Lou Lys 90 GAO AAC COO GAA 000 GAT AAG OAK GAA GAO CRC CG 000 ACC TTC CR0 570 Olu Ann Ar; Olu Gly Asp Lys Glu Glu Asp His Ala Cly Thr Ph. Gin 100 105 110 115 ATO ATA OAT OAK GAA OAK ACC CAG TTT ATO AGT AAC TOT CCT OTO CR 618 Ile Ile Asp Glu Glu Glu Tb: Gin Ph. Not Ser Ann Cys Pro Val Ala 120 125 130 OTC ACT OAK AGO ACC COT COO AGO AGO ACA COG ATC CAG OTO TTT TOO 666 *.:Val Tb: Olu $or Thr Pro Ar; Ar; Ar; Tb: Arg Ile Gin Val Ph& Try, 135 140 145 ATA 0GM CM CCC-ACK 000 ACA 000 TOT GTO ATT CTO KAG 0CC AGC ATT 714 Ile Ala Pro Pro Th: Oly Tb: Gly Cys Val Ile Lou Lys Ala 8cr Ile 150 155 160 OTA CA AA CGO ATT ATC TAT TTT OAK GAC GAG 000 TCC CTC hOCC hAG 762 Val Gin Lys Ar; Ile Ile Tyr Pb. Gin Asp Glu Gly So: Lou Tb: Lye 165 170 175 hAG CTO TOT OAK CAG OAT CC ACA OTT GAT GGA OTC AG GAO AGA CCC 810 LLili;-~i~ 58 Lys Lou Cys Glu Gin Amp Pro Thr Lou Asp Gly Val Thr Asp Arg Pro 180 185 190 195 ATC TTA GAC TOC TC GCC TOC OGA ACT 0CC AAG TAC AGA CTC AGO TTT 858 Ile Lou Ap Cys Cym Ala Cys Gly Thr Ala Lys Tyr Ar; Lou Thr Ph.
200 205 210 TAT GGG AAC TOG TOO GAG AAG ACT CAT CCA AAG GAT TAC CCT COT COG 906 Tyr Gly Ann Trp Ser Glu Lys Thr His Pro Lys Amp Tyr Pro Arg Arg 215 220 225 OCT hAT CAC TOG TT GCC ATC ATT GOC GGA TCC CAC TCC AAG AAC TAC 954 Ala Mn Hi Trp So: Ala Ile Ile Oly Gly Scr Him So: Lys Ann Tyr 230 235 240 OTO CTO TOG GAG TAC OGA 000 TAT 0CC AGT OAA 000 OTC AAG CAA GTT 1002 Val Lou Trp Glu Tyr Oly Gly Tyr Ala Sor Glu Gly Val Lys Gin Val 245 250 255 OCT OAA CTT GGC TCA CCA GTA AM ATO GAG OAA OAK ATT COA CM CAG 1050 Ala Oiu Lou Gly Pro Val Lys Mot Giu Glu Oiu Ile Arg Gin Gln 260 265 270 275 AOT OAT OAA OTC CTC ACT OTC ATC MA 0CC AM 0CC CAG TOG CCA TCC 1098 Sor Asp Glu Val Lou Thr Val Ile Lys Ala Lys Ala Gin Trp Pro Ser 280 285 290 TOG CAG CCT OTC MT GTO AGA 0CA 0CA CCC TCA 0CC OAA TTC TCA OTG 1146 Trp Gin Pro Val Mn Val Arg Ala Ala Pro 8or Ala Glu Ph. Sor Val 295 300 305 GAC AGO ACK COC CAC TTG ATO TOO TTC CTA ACC ATO ATG GC CCC AGT 1194 Amp Ar; Thr Arg Him Lou Not So: Ph* Lou Thr Not Not Oly Pro So: 310 315 320 CCT GAC TOG MC oTG 000 CTA TOT CA GAG OAT CTO TOO ACC AA GAG 1242 Pro Amp Trp Ann Val Oly Lou Ser Ala Olu Asp Lou Cym Tb: Lys Olu 325 330 335 TT WGC TOO GTC CAG AM OTO OTO CAG GAC CTA ATT CCC TOG OAT OCT 1290 Cy: Gly Trp Val Gin Lys Val Val Gin Amp Lou le Pro Trp Asp Ala 340 345 350 355 GOC ACO GAO AGC 000 OW' AC TAC GAG TCA CCA MC MG CCC ACA ATT 1338 Oly Thr Amp So: Oly Val Thr Tyr Olu Sor Pro Mn Lys Pro Thr Ile 360 365 370 OCT CAG GM AM ATC C0 CCC CTO ACT AGT CTO GAO CAT CCT CAG AOT 1386 Pro Gin Olu Lye le Ar; Pro Lou Thr So: Lou Amp His Pro Gin So: 375 380 385 COT TC TAT GAC CG GM GOT 000 TCC ATC ACA CM TO 0CC AGA GTC 1434 Pro Ph Tmyr Asp Pro Clu Gly Gly So Ile Thr Gin Val Ala A Val 390 395 400 GT ATC GAG AGRAKT 0CC COO MO GOA GM OA TOO MC ATT OTA COT 1482 Val Ile Giu Arg Ile Ala Ar; Lys Oly Glu Oln Cym Mn le Val Pro 405 410 415 GAC MT OTO CAT OAT ATT 0TA 0= GAC CT OCT CCA GM GAG AR OAT 1530 Amp Mn Val Amp Amp Ii Val Ala Amp Lou Ala Pro Oiu Giu Lys Ap 420 425 430 435 if I liii Thii~ n~Ti}1i1JiTi1Fffi n I GAh GAT GAC Glu Asp Asp TCQ o=c TGC Sur coc Arg
CAG
Gin
TCC
Sur 500
TOO
Sur ccc Pro
TGC
Cys
TOG
Trp hAG Lys 580
AAO
Lys
ACC
Thr
OTG
Val
CTG
Lou Ala
ATG
Hot
GAC
Asp 485
AC
Thr
TOT
cys
GAA
o iu Aco Th:
GOT
Gly 565
CG
Ar;
GOO
Ala
ATC
Ile
AC
Thbr
OCA
Ala 645 Cys
CTG
Lou 470
TTC
Phs
TOT
Cy.
GGC
Oly
GAC
Asp
GTC
Val 550
GAG
Glu
CAC
His
GAG
Glu
CO
Pro
TOT
cys 630
GAG
Glu 59 A=C C= GAA A=C TOC ATC TAC T=C AAC TOOGC C TOG Thr Pro Giu Tb: Cys Ile Tyr Sur Amin Try, 8ur Pro Trp 440 445 450 ACC TCT TCC ACT TOT OAA AAG GOT AAG AGO ATO COO CAA 8ur 8ur 8ur Tb: Cys Glu Lys Gly Lys Arg Met Arg Gin 455 460 465 AMG GCA CMG CTG GAC CTC AGT OTC CCC TOT CCT GAC ACC Lys Ala Gin Lou Asp Lou Sur Val Pro Cys Pro Asp Th: 475 480 CAG CCC TGC ATO 0CC =C 0CC TOC MGC GAT GMA GAT C Gin Pro Cy. Net Gly Pro Gly Cys Sur Asp Glu Asp Gly 490 495 ACC ATOv TCO CAG TOG ATC ACC TOO TCA CCC TGC MGT GTC Tb: Met Sur Glu Trp, Ile Tb: Trp Sur Pro Cys Sur Val 505 510 515 ATOv GOT ATO& AGO TCC CG GAG AGO TAC GTG AMG CMG TTC Met Oly Hot Ar; Sur Ar; Olu Ar; Tyr Val Lys Gin Ph.
520 525 530 0CC TCG OTC TOC ATG CTG CC ACO GMA GAG ACA GAG AAG Gly Sc: Val Cys Not Lou Pro Tb: Glu 02.u Tb: Giu Lys 535 540 545 MAC GAG GAG TOC TCT CCT AMC MGC TOC CTG OTG ACT GMG Asn Glu Glu Cym 8cr Pro Sur 8cr Cy. Lou Val Tb: Giu 555 560 TOG GAT GAC TOC MGC GCC ACC TOT OGA ATO GCC ATO AMG Trp Asp Asp Cys 8cr Ala Tb: Cys Gly Not Gly Met Lys 570 575 OC ATGOGTC AMG ATO ACC CCC GOG GAC GCC TCC ATO TOC Ar; Met Val Lys Met Sur Pro Ala Asp Oly Sur Met Cye 585 590 595 ACT TO CM OO GAG MAA TOC ATO ATG CCT GAG TGC CAT Tb: 8cr Gin Ala Giu Lys Cys Met Met Pro Glu Cys His 600 605 610 TOC ?YG CTG TCT CCT TOO TCC GMA TOO AGC GAC TOT MGC Cys Lou Lou 8cr Pro Trp Sur Olu Trp Sur Asp Cys 8ur 615 620 625 0OO, AMG CCC ATO COG A0O COC CMG COO ATG CTC AMG TCT Gly Lys Oly Met Ar; Tb: Arg Gin Arg Met Lou Lys Sur 635 640 CTO COG, CAC TOT MAT GAG GAT CTG GAG CMG GC GAG AMG Lou Gly Asp Cys Asn Giu Asp Lou Glu Gin Ala Giu Lys 650 655 1578 1626 1674 1722 1770 1818 1866 1914 1962 2010 2058 2106 2154 2202 2250 2298 C.
C.
S C C. C C. C C
.C
TOT ATO Cys 660 CMo Gin not ToG Trp, =T CCA 04 TOC CC KTT GAC T= GM =CTC T GAG TOO T= Lou Pro Giu Cys Pro Ile Asp Cym Giu Lou Sur Glu Trp, 8cr 665 670 675 TCT GMA TOT MAC AMQ TCC TOT 00O MAA GOT CRC ATO ATT CGA Sur Giu Cys Ann Lye 8cr Cys Gly Lys Oly His Met Ile Ar; 680 685 690 ACCC= ACM ATC MAA MT GAA C= CAG WA WOT GCA =CC Tc ccA 2346 Thr Arg Thr Ile Gin Met Glu Pro Gin Ph* Gly Gly Ala Pro Cys Pro 695 700 705 GAG ACT GTG MAA COC AAG AAG TGC COT GCC CGG AAA TGC CT? cc AGC 2394 Glu Thr Val Gin Arg Lys Lys Cys Arg Ala Arg Lys Cys Lou Arg Ser 710 715 720 CCA TCO ATC CAG AAG CTG OC TOG AGO GAG GCC COA GAG AGC AGO AGO 2442 Pro 8cr Ile Gin Lys Lou Arg Try Arg Glu Ala Arq Glu 8cr Arg Arg 725 730 735 AG? GAG CAG CTO AGO GAA GAG TCA OAT GGA GAG CAG TTC CCA GGC TOT 2490 8cr Glu Gin Lou Arg Giu Giu 8cr Amp Gly Giu Gin Ph. Pro Gly Cys 740 745 750 755 OG ATGO CC COG TOG ACA GCC TOG TCA GMG TOC ACC AAA CTG TGC OGA 2538 Arg Met Arg Pro Trp Thr Ala Trp 8cr Giu Cym Thr Lys Lou Cys Gly 760 765 770 GOT 000 ATC MAA GAR CGC TAO ATO ACT 070 AAG AAG AGO TTC AAA AGC 2586 Gly Gly Ile Gin Giu Arg Tyr Met Thr Val Lys Lys Arg Ph. Lys 8cr 775 780 785 TOO CAG TTT ACC AGC TGC AAA CAC AAG AAG GAG ATC AGA OCO TOC AAC 2634 Gin Ph. Thr 8cr Cym Lys Asp Lys Lys Glu Ile Arq Ala Cys Amn 790 795 800 OTO CAC CCT TOT T AGTAQGOGG? CAACTCCOCA GCTGCATT CCAGATTCTA 2687 Val Hisi Pro Cys 805 GTCACCAATG GTTGGGTOOT GTATTTGCTT OTTTAAGATG ATTTAAATTG TGTCCACATG 2747 TTTTCATTT-v- TACOGOTOTO GTTTGCCCAA TAGTCTTATO GAGGOOOGAGG GACATCTTGT 2807 CTGAATACTT CTTGGTGAGT ACAGGCCAAG COGGOCATCT TGTCCCCAGG CGCCATCTTC 2867 *CTGCACTGAG TTGAGTAGTG TTGGOTCACC TTGGTACTAA ACTOAAT~GT OTCOCTCTGG 2927 AGCATOC? GGTCAAGCAG GGTGGAGACT TOCTCACAGAAGCAACCAGGA 2987 TGCAGCATGC OAACC CCTTAATT GCAAAGGACAGTCOT TCTCOTTT 3047 OGOCTGTCA CTCTACAGA AACCTGTTTG TTTAT? GCAACTCC 3107 *AGGCATCTTO GTAAOTCTCC AGGGTCATOG CTTCTTCGG GCCCTGAAGG AGAAGCCCTG 3167 *AGGTGAGOTO GCATTTA CAAACCTCCC AATACTGCTT TACTGOCATC ACAAOGTCAO 3227 CAGGTGAATQA TGCAfCATTTCATTOT GAGCTOAT TTTGATTOT 3287 TGGTGCCATA AATGTCCTAG GATQCTOGAC GGAOCATCA U £A CAaATCCTTC 3347 GAGOCRA TOTAGACACT AAGCTGGGCR CTOOTTC A AGCCAACTTA AAATCTTCCT 3407 ACACATATCC AGACCTTTTAGTO CAAACTTCCT TAGAATAAAG CATTTTAGCT 3467 CTGAGAACTA CTTGATAAGT CTOCCAGQAA GOCOCCAAGT CRATTCTTCA ACAAAAATAC 3527 61 TATCTTCCCT ACTTAATTTT TTTTAAGTCA TGATATTTTA TAGTTAGAGG AGAGAGAGAC 351 AATCTATTCC CATGACTAAG ACACAAACCT ACAAGAAAGG GTTACTCAGT CAAGCCTGTG 36, CCTGACTTCT GGACCPAGGCC CCTGATTTT-17C ATGGATAGTC CAAAGGAAcX CCAGGGGTTC 37( CCACTGACTC CAAGCCATCA GCAGCACCCA AACCCAGGAG CAACAAATAT TCAGAGAAAG 37f AGGATGTTTA TCTCAGCTAT GAGCTCATTG GCAGGTTGTA CTCATGCATC TGTTAAAAGC 38 ACCACCACAT CCTTTTGCAA GTCTGTTTAT TACCGCTTCA TCCAAATACA TTTTGTGGTC 388 AAGATCGACA CAGTGCTATG AATACAGTAC TTTAAGGTCT GCATTAAACA CATCAGAATA 394 TTTCCTGCCA CATCTATGTA CAACCCCTGA ATATGTATTT TTCCTTAACA CAAGAGAGCC 400 TGTTCAATTA AAAAAA AA 402 INFORMATION FOR SEQ ID SEQUENCE CHARACTERISTICSi LENGTH: 807 amino acids TYPE: amino acid TOPOLOGY; linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:l0: Het Arg Leu Ser Pro Ala Pro Lou Arg Lou 5cr Arg Gly Pro Ala Lou 10 is Leu Ala Lau Ala Lou Pro Lou Ala Ala Ala Lou Ala Ph. Ser Amp Glu 25 Thr Lou Asp Lys Val Ala Lys Sec Glu Gly Tyr Cys Ser Arg Ile Lou to135 40 4S .20 Arg Ala Gin Gly Thr Arg Arg Glu Gly Tyr Thr Glu Ph. Ser Lou Arg :50 55 Val Glu Gly Asp Pro Asp Ph. Tyr Lys Pro Gly Ser Ser Tyr Arg Val 70 75 to to.Thr Lou Ser Ala Ala Pro Pro Sor Tyr Ph. Arg Gly Ph. Thr Lou Ile 90 too 25 Ala Lou Lys Glu Ann Arq Glu Gly Asp Lys Glu Glu Asp His Ala Gly 100 105 110 Thr Ph. Gin Ile Ile Asp Glu Glu Glu Thr Gln Phe Not Soc Ann Cys 115..ii 120 125 Pro Val I Val Thr Glu Sor Thr Pro Arg Arq Arg Thr 4krq Ila Gin 130 135 140 Val Ph. Trp Ile Ala Pro Pro Thr Gly Thr Gly Cys Val Ile Lou Lys Al Sr IeVlGin Lys Arg Ile I1e Tyr Ph* Gin Asp Glu, Gly Sor 165 170 175 62 Lou Thr Lye Lye Leu Cys Glu Gin Asp Pro Thr Leu Asp Gly Val Thr 180 185 190 Asp Arg Pro 119 Leu Asp Cye Cys Ala Cye Gly Thr Ala Lye Tyr Arg 195 200 205 Leu Thr Phe Tyr Gly Ann Trp Ser Glu Lye Thr His Pro Lye Asp Tyr 210 215 220 Pro Arg Arg Ala Asn His Trp Ser Ala Ile Ile Gly Gly Ser His Ser 225 230 235 240 Lys Asn Tyr Val Leu Trp Glu Tyr Gly Gly Tyr Ala Ser Glu Gly Val 245 250 255 Lye Gln Val Ala Glu Leu Gly Ser Pro Val Lye Met Glu Giu Glu Ile 260 265 270 Arg Gin Gin Ser Asp Glu Val Leu Thr Val Ile Lye Ala Lye Ala Gln 275 280 285 Trp Pro Ser Trp Gin Pro Val Aen Val Arg Ala Ala Pro Ser Ala Glu 290 295 300 Phe Ser Val Asp Arg Thr Arg His Leu Met Ser Phe Leu Thr Met Met 305 310 315 320 Gly Pro Ser Pro Asp Trp Aen Val Gly Leu Ser Ala Glu Asp Lou Cys 325 330 335 Thr Lye Glu Cym Gly Trp Val Gin Lye Val Val Gin Asp Leu 11e Pro 340 345 350 Trp Asp Ala Gly Thr Asp Ser Gly Val Thr Tyr Glu Ser Pro Ann Lye 355 360 365 Pro Thr Ile Pro Gin Glu Lye Ile Arg Pro Leu Thr Ser Lou Asp His 370 375 380 Pro Gin Ser Pro Phe Tyr Asp Pro Glu Gly Gly Ser Ile Thr Gin Val 385 390 395 400 Ala Arg Val Val Ile Glu Arg Ile Ala Arg Lye Gly Glu Gin Cys Aen 405 410 415 Ile Val Pro Asp Asn Val Asp Asp Ile Val Ala Asp Leu Ala Pro Glu 420 425 430 25 Glu Lys Asp Glu Asp Asp Thr Pro Glu Thr Cys Ile Tyr Ser Aen Trp 435 440 445 Ser Pro Trp Ser Ala Cys Ser Ser Ser Thr Cys Glu Lys Gly Lys Arg .450 455 460 Met Arg Gin Arg Met Lei Gin Lou Asp Lou Ser Val Pro Cys 465 470 475 480 30 Pro Asp Thr Gin Asp Phe Gin Pro Cyn Met Gly Pro Gly cys Ser Asp 485 490 495 Glu Asp Gly Ser Thr Cys Thr Met Ser Glu Trp Ile Thr Trp Ser Pro 500 505 510 Cys Lys Thr 545 Val Gly Ser Glu Asp 625 Lou Ala Glu mot Pro 705 20 Lou sor Pro Lou Phe 785 Ala scr Gin 530 Glu Thr mot met Cy.
610 Cys Lys Glu Trp Ile 690 cys Arg Arg Gly Cys 770 Lys Cys Val 515 Phe Lys Glu Lys Cys 595 His 5cr 5cr Lys Ser 675 Arg Pro 5cr Arg Cys 755 Gly Ser Aen ser Pro CYl Trp Lys 580 Lys Thr Val Lou Cys 66C Gin Thr Glu Pro 5cr 740 Arg Gly 8cr Val Cys Gly Met Glu Asp Gly 535 Thr Val Asn 550 Gly Glu Trp 565 A~rg His Arg Ala Glu Thr Il Pro Cys 615 Thr Cys Gly 630 Ala Giu Lou 645 Met Lou Pro Trp Ser Glu Arg Thr 11e 695 Thr Val Gin 710 5cr Ile Gin 725 Glu Gin Lou Met Ar; Pro Gly Ile Gin 775 Gin Pho Thr 790 His Pro Cys 805 Gly 520 5cr Glu Asp Mot Ser 600 Lou Lys Gly Glu Cya 680 Gin Arg Lys Arg Trp 760 Giu met Val Glu Asp Val 585 Gin Lou Gly Asp Cys 665 Asn met Lys Lou Glu 745 Thr Arg 63 A~r Cys Cy.
Cys 570 Lys Ala Ser Met Cys 650 Pro Lys C u Lys Ar; 730 Glu Ala Tyr Ser mot Ser 555 5cr met Glu Pro Arg 635 hAn Ser Pro Cys 715 Trp Saw Trp met Ap 795 Ar; Lou 540 Pro Ala Ser Lye Trp 620 Thr Glu ASp Cys Gin 700 Arg Arg ap Ser Thr 780 Glu 525 Pro Ser Thr Pro Cys 605 Sec Arg Asp Cye Gly 685 Ph* Ala Glu Gly Glu 765 Val Akr Thr Ser Cys Al a 590 met Glu Lou Glu 670 Lys Gly Ar; Ala Glu 750 Cys Lys Tyr Glu Cys Gly 575 Asp met Trp Ar; Glu 655 Lou Gly Gly Lys Arg 735 Gln Thr Lys Val Glu Lou 560 mot Gly Pro Sec mot 640 Gin 8cr His Ala Cys 720 Glu Ph@ Lys Ar; Ar; 800 9 99 9 9999
I.
9. 9 9* 9 99 9* 9* *99 9 r 5cr Cys Lys Lys Lys Glu Ili INORMATION FOR SEQ ID NO:11: sEQUENCE CHARACTERISTICS: LEOTH: 3226 base pairs TYPE: nucleic acid 64 STRANDEDNESSI Single (D),TOPOLOGY: linear (ii) MOLECULE TYPE: CDNA (ix) FEATURE: NAME/KEY: CDS LOCATION: 136..2S43 (xi) SEQUENCE DEsCRZPTION: SEQ ID NO:11: GTGTCCCTC TCTC ccCTCTC :CCCGCCTGCC CCCTCCCGCc ~&sI~j~CCCCTCC GCCGCAGC TccCOCGGC CGC=G= TGCCCGAGCT GTGCGGOGCGC CGAGG ATG GCA OW COO CTG COO CCC CTG GCC CTG COG CTG Not Ala Ala Ar; Lou Arg Pro Lou Ala Lou Ar; Lou 1 5 @0 00 S
S.
OS
*5 5
S.
5 0*O@ 0* S S S
S.
*5S55* S S 0*
SSOS
S
5* Sb 0 S. S Ge 45 .5.5 0 .9 055055
S
CTO
Lou
CTO
Lou
GC
Ala
GAG
Glu
CTI
Lou
CTG
Lou
TX?
Ph.
GTC
Val 125
TTC
Phe
MGT
A=C
OG
Ala
GAG
Glu
CAA
Gin
GOC
Oly
TCT
S.:
AMG
Lys
CMG
Gin 110 Ala Too Trp
ATT
11e
AA
COC
Ar; is
A
Lys
GOC
Oly
CAT
Amp
OCT
Ala
GFAA
ciu 95
ATC
11e
OTT
Val
ACA
Thr anO Val
A
ACC
Tb: 0CC Ala
ACC
Tb: cco Pro
G=C
Ala so
OA
Oly
ATA
110
ACT
Thr
OCT
Ala Cho Gin 160
ATC
Phe 0CC Ala
AGO
Arg
GAA
Olu 65
ACT
Thr
AAA
Lys
GAT
Asp Gma clu
CCT
pro 145
MAG
Lys
TOT
Pro
AAA
Lys
AGO
Ar; so
TTC
Phe
CCT
pro
GAA
Giu
OAA
Glu
MGC
Bar 130
CCT
Pro
COC
Ar;
A
Lou
TCC
Ser 35
GAA
Giu
TAC
Tyv Ala
GOT
Oly
GAA
Olu 115 Ach Tb:
ACT
Thr
AT?
CAR
Val 20 Glu Gly
AMG
Lys
TAC
Tyr
GAT
Asp 100
GAG
Giu
CCT
Pro
GOT
Gly
AT?
GAT
TTC CCC TTG OTO OO AGG GOC TTC TOC GAC GAG ACC Ala Ar; Oly Phe 8cr Asp Giu Thr GGC TAC TGC MCC COO ATC CTG CGA Gly Tyr Cya 8cr Ar; Ile Lou Ar; TAC AAT CIM TTT AGC CTG AGO GTG Tyr Amn Giu Ph* 8cr Lou Ar; Val 55 CC? 000 MAC MGT TAC CGC OTG ACO Pro Gly Awn 8cr Tyr Arg Val Thr 70 TTT COA GOA TTC ACA TTG ATT OCT Ph@ Ar; Gly Phe Tb: Lou Ile Ala e5 MAA GMG GMA GAC CAT GCM GGA ACT Lys Glu Glu Asp His Ala Gly Tb: 105 ACO CMG TTC ATG MGC MAT TGT CCC Tb: Gin Ph* Not 8cr Asn Cys Pro 120 AGA AGO AGO ACA CGC ATC CAG GTC Ar; Ar; Ar; Tb: Ar; Ile Gin Val 135 140 ACG GOC TOT GTC ATT CTO MAA GCC Tb: oly Cys Val Ile Lou Lys Ala 150 155 TAT TrT CMG QAC GAG GOT TCT CTC Tyr Phs Gin Asp Oiu Oly 8cr Lou 165 170 TcA 0CC TCT GMA GOT GTO ACT GAC 120 171 219 267 315 363 411 459 507 603 651 699 Thr
AAA
Lys
ACO
Thr 205 cGO Arg
AAC
Asn
CAG
Gln
CAA
Gin
CCT
Pro 285
TCT
Ser
CCC
Pro
AAG
Lys
GAT
Asp Pro 190 Phe CGc Arg
TAC
Tyr
OTT
Val
CAA
Gin 270 0CC Ala
GTT
Val
AGT
Ser
GAC
Asp
GCC
Ala 350 Ar; 110 Cys Glu 175 ACA TTA GAT TGC Thr Lou Asp CYl TAT GGA AAT TG Tyr Gly Ann Trp 210 ACC AAC CAT TOG Tb: Ann His Trp 225 ATC CT? TOG GAG Ila Lou Trp Glu 240 GCA GAG CTG OGA Ala Glu Lou Gly 255 AG? GAT GAG .uT.
Ser Asp Glu Val TOG CAG CC? CTO Try Gin Pro Lou 290 Gin
TOT
Cys 195
TOG
Ser
TCT
Ser
TAT
Tyr
TCC
So:
TTA
LOU
275
AAT
Asn Asp Sor Ala Sor Oiu Oly Val Tb: Asp 180 185 0cc TOT OA ACT GCC AAA TAC AGO CTA Ala Cy8 Gly Tb: Ala Lys Tyr Ar; Lou 200 GAA AAM ACA CAT CCC MAA GAC TTT CC? Glu Lys Tb: His Pro Lys Asp Ph* Pro 215 220 GCO, ATC ATT GOT AGC TCT CAC TCA AG, Ala Ile Ile Gly So: Ser His Sor Lys 230 235 GOA 000 TAT OCT AGT GMA GOT GTC MAG Gly Gly Tyr Ala So: Glu Gly Val Lys 245 250 CCA OTC MAG ATG GAA GAM GM AT? CGA Pro Val Lys Met Glu Giu Giu Ile Ar; 260 265 ACP~ GTC ATC MAG GCA AAA GMA CAG TG Tb: Val Ile Lys Ala Lys Ala Gin Trp 280 OTO AGA OCT OCT CCC TCT OCT GAO TTT Val Arg Ala Ala Pro So: Ala Giu Phe 295 300 t OAT cC ASP A~r; CCC GAC Pro Asp 320 TOT GGC Cye Oly 335 GO= ACA Gly Tb: CA COG CA CT ATO T= MT CT A=C M A M CGG0 His Ar; His Lou Not Sor Ph* Lou Tb: got Lou Oly 305 310 315 TOO MAT GTO GOC CTG TCT OCT GAG GAC CTC TGC ACC Trp Asn Val Gly Lou Ala Glu Asp Lou Cys Tb: 325 330 TOG OTT CM MAA GTC OTO CMG GAT TA ATC CCC TG Trp Val Gin Lys Val Val Gin Asp Lou I1e Pro Trp 340 345 GAC ACT GGC OTC ACC TAT GAG TCA CCC MAC AMA CCT Asp So: Gly Val Tb: Tyr 01u Pro Asn Lys Pro 35S 360 747 795 843 891 939 987 1035 1083 1131 1179 1227 1275 1323 1371 1419 A CT? CCT CM GMAOM AT? A CC CT? AC AMC TA OAT CA CC Tb: Val Pro Gin Giu Lys Ile Arg Pro Lou Tb: So: Lou Asp His Pro 365 370 375 380 CMG MT CCA TTT TAT CAT CCA GMA GGA OGA TCT ATC MhG CT? OTA C Gin Sor Pro Phe Tyr Asp Pro 01u Gly Gly Ile Lys Lou Val Ala 385 390 395 MGA OTC OTO CT? GM MGA AT? OCA CGC MAG 000G GAG CMG TOC MC TTC Ar; Val Val Lou Giu Arg Ile Ala Arg Lys Gly Giu Gin Cys Amn Ph* 400 405 410 OTA CC? GAT MAC ATA GA? OAT AT? OTW WA GAC CTA GMA CCA GAA GAA Val Pro Asp Ann Ile Asp Asp Ile Val Ala Asp Lou Ala Pro Glu.Giu 415 420 425 66 AM GMA GAA GAT GAT ACC C= GAG A=C TGC ATA TAT MC AAC TGG TCC 1467 Lys Olu, Glu hop Asp Thr Pro Glu Th: Cys Ile Tyr 8cr Ann Trp Ser 430 435 440 CCC TOO TCA GCC TGC AGC TCC TCT ACC TOT GAG AAG GGC AAG AGO ATG 1515 Pro Trp Ser Ala Cys Scr Ser Ser Thr Cys Glu Lys Gly Lys Arq Met 445 450 455 460 AGO CAG AGA ATG CTT AMA OCT CAG CTO GAC CTC AGT OTO CCC TOT CCT 1563 Arg Gin Ar; Met Lou Lys Ala Gin Lou Asp Lou Ser Val Pro Cys Pro 465 470 475 GAT ACC CAA GAT TTT CAG CCA TOC ATO GOT CCA GOC TGC ACT GAT GAA 1611 Asp Thr Gin Asp Phe Gin Pro Cyrn Net Gly Pro Gly Cys 8cr Asp Glu 480 485 490 GAT GOT TCA ACT TGC ATG ATG TCT GAC TGG ATT ACA TOG TCC CCC TOT 1659 Asp Gly 8cr Thr Cys Met Not 8cr Asp Trp Ile Thr Trp, Ser Pro Cys 495 500 505 AG? OTT TCC TOT GGA ATO OGA ACG CGA TCT AGA GAG AGA TAT GTA hAG 1707 Ser Val Ser Cys Oly Hot Gly Tkir Arg 8cr Arg Glu Ar; Tyr Val Lys 510 515 CAA TTC CCC GAA OAT GOC TCT ATO TGC AAA OTO CC? ACT GAA GMh ACT 1755 Gin Pb. Pro Giu hop Oly 8cr Met Cys Lys Val Pro Thr Glu Glu Thr 525 530 535 540 GAG AAA TOT AT? OTA hT GAG GAA TGC TCC CC? ACC AGC TGC CT? GC 1803 Glu Lys Cys Ile Val Ann Giu Glu Cyu 8cr Pro 8cr 8cr Cys Lou Val 545 550 555 .ACC GAA TOO GGA GAG TOG OAT GAM TOC AG? OCT AGC TOT GGC ACA OGA 1851 Thr Glu Trp, Gly Giu Trp Asp Oiu Cys 8cr Ala 8cr Cys Oly Thr Gly .*560 565 570 ATO AAA AGGOCGA CAC A ATO ATC AMG ATO ACT CC? OCT GAT GOA TCT 1899 *.:Met Lys Ax; Arg His Arq Met Ile Lys Met Thr Pro Ala Asp Giy 8cr *575 580 585 *ATO TOC AMG OCA OAA AC? ACA GAG GCA GAG AAA TOC ATG ATG CCC GAA 1947 Met Cys Lys Ala Glu Tbr Tb: Olu, Ala Glu Lys Cyu Met Met Pro 01u 590 595 600 *TOC CAT ACT AT? CCC TGC CT'? CTA TCC CCA TOG TC? OAA TOO MGC GAC 1995 Cy. His Thr Ile Pro Cys Lou Lou 8cr Pro Trp 8cr Glu Try, 5cr Asp 605 610 615 620 :TOC MGC GTO WA TOT 000 hG OGA ATO CGA ACC GG CRA MG ATO CTO 2043 *Cys 8cr Val Thr Cyn Oly Lys Oly Mist Ar; Tb: Ar; Gin Ar; Not Lou 625 630 635 AM TCT GCA OCT GMG GOA GAC TOC GAG OAk CTO GAG CAA OCA 2091 Lys 8cr Ala Ala Oiu Lou Oly Asp Cys Aen Oiu Oiu Lou Glu Gin Ala 645 650 GAG AMA TOC ATG CTA CC? GMA TOC CCC ATT GAC TOT GKO CTA AG GAO 2139 Glu Lye Cys Met Lou Pro Oiu Cys Pro Ile Asp Cys Oiu Lou Thr Giu 655 660 665 TOG TC= CMG TOG TCC GAG TGC Ah? ACC TCC TOT 000 AMG GOC CAC ATO 2187 ?rp 5cr Gin Trp 8cr Glu Cys Amn Tb: 8cr Cys Oly Lye Oly Big Met 670 ATC AGA Ile Ar; C. a 675 680 ACA AOA ATG ATC AAM AA GMA CCA CAG GOA GGA ACA OCA Thr Arig Hot Ile Lys Ile Giu Pro Gin Phst Gly Gly Thr Ala 690 695 700
TOC
Cy.
AGA
Ar;
AGA
Ar; CCA GAA ACT OTC CAA COT ACT AMA TOT COA OTA AGO AAA TGC CTG Pro Oiu Thr Val Gin Ar; Th: Lys Cy. Arg Val Ar; Lys Cys Lou 705 710 715 GGC CCA GOT ATO GMA MO AGO COT TOOG MO GAG 0CC OG GAG AMA Gly Pro Gly Hot Gilu Lys Arg Ar; Trp, Lys Glu Ala Ar; Glu Lys 720 .725 730 AGA AG? GMA CAA GCA AMA AMA MT AT? OAT MAT GAG CMA TAT CCA Arg Ser Giu Gin Ala Lys Lys Asn Ile Amp Asn Giu Gin Tyr Pro 735 740 745 OTT TGT Val Cyrn 750
AGO
Ar; CTG AM CCA TOO ACT OCT TOG ACA GAA TOT TCT ACA CT Lou LyS Pro Trp Thr Ala Trp, Thr O.lu Cys Ser Thr Lou 755 760
TOT
Cys 765
AAA
Lys
TOT
GGA GOT OGA ATT CAG GAG =O TAC ATO ATO OTA MOG MO AGO T= Gly Oly Gly Ile Gin Giu Ar; Tyr Het Miet Val Lys Lys Ar; Ser 770 775 780 AGC ACT CAG TTT ACT AGC TGC AMA GAC AMA MO GAG CTA AGA GCA Ser Tb: Gin Phis Th: Ser Cys Lys Asp Lys Lys Giu Lou Ar; Ala 785 790 795 MAC OTT? CAT CCT TOT TA GGAMAACACA AGGCTTCCAA
GTGATOCACT
2235 2283 2331 2379 2427 2475 2523 2573 2633 2693, 2753 2813 2873 2933 2993 3053 3113 3173 3226 Cys Aen Val
S.
.5
S
S
S
S.
S
55..
S
5* *5 S S His Pro Cys S00 CTGAGCTATA AGGAAMGTCA ACCTTOOTTT GGTTTTTAMA
ACAMACMMA
TATATTAGTT :?ATTTTTG CAGTGTGGTT TGCTTTTAGT CTI 5
GCTGGTO
ATTTTATAMA TATTTCCTCC GATTAATCTA GGTAMACTTT GATGCTCCAG CTGCATAAMA ATAOTAGGTC ATTOTGAGTC ATTTMACTGA AGTACAGACA CATGGAATAG CCATATAGMA ATACTACTTG TAMOGACATO OGATOCATGC MACTAATTTG AAGTGACATO TTTCATATOT GAGGATTT
CTCTCTTGAT
MATCCMAGCC AGTGOCATG TGATTATACA ACTATGCCMA
GGAGAAATTT
GGTTCAATAA TATTAAAGGT GCATGTTTAT CTTTTACM ATOTA GAAATAATTA CCCTACATAC TTTTTCAC ATGGATGCTG
GGTTATOC
TTGTTTCTCA AATAGCAACT TACTTAAATA ATCTOGCAG CTCAATAGTG
OTATAMOGTG
CAAGAMATAT
CTAGCCCTTA
TATCTGTGGA
ATATTAACAT
TTGATTTAAM
CAGTAATGCT
AGCOATAGTT
MMATCATCT
ATGTCAGCCC
ATAACACACT CAAC CAAAGACATGT GGCATCACA GTAcCTGCA
CTG
IKIoRNATIoII FoR SQ iD 110:12: SZQUZNcg COARACTER!STICS? LENGTH: 802 aino acdsm Typigs amino acid 68 TOPOLOGY: linear (Li) moLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: Met Ala Ala Ar; Lou Ar; Pro Lou Ala Lou Ar; Lou Lou Ala Ar; Thr 1 5 10 Ph. Pro Lou Val Ala Arg Gly Phe 5cr Amp Glu Thr Lou Glu Lys Ala 25 Ala Lys 5cr Giu Gly Tyr Cym Ser Ar; Ile Lou Ar; Ala Gin Gly Thr 4045 Ar; Arg Giu Gly Tyr Aen Giu Ph* 5cr Lou Ar; Val Glu Gly Amp Pro 55 Glu Ph. Tyr Lys Pro Gly Asn 5cr Tyr Arg Val Thr Lou 5cr Ala Ala 70 75 Thr Pro Ala Tyr Ph. Ar; Gly Ph* Thr Lou Ile Ala Lou Lys Glu Gly 90 Lys Glu Gly Asp Lys Glu Giu Asp His Ala Gly Thr Ph., Gin Ile Ile 100 105 110 Asp Giu Giu Glu Thr Gin Phe Not 5cr Asn Cym Pro Val Ala Val Thr 115 120 125 Glu 5cr Thr Pro Ar; Ar; Ar; Thr Ar; Ile Gin Val Ph* Trp Thr Ala 130 135 140 Pro Pro Thr Gly Thr Gly Cym Val Ile Lou L~ye Ala 8cr Ile Val Gin ~*145 150 155 160 Lys Ar; Ile Ii. Tyr Ph., Gin Asp Giu Gly Bar Lou Thr Lye Ar; Ile ''t165 170 175 Cym Glu Gin Asp Bar Ala 5cr Giu Gly Val Thr Asp Lys Pro Thr Lou 180 185 190 ft*otAmp Cys Cys Ala Cym Gly Thr Ala Lye Tyr Ar; Lou Thr Ph* Tyr Gly ft195 200 205 ASTI Trp 8cr Giu Lys Thr Him Pro Lys Asp Ph. Pro Ar; Ar; Thzr Aun 210 215 220 His Trp Bar Ala Ile Ile Gly 8cr Bcr His Bar Lys Asn Tyr Ile Lou 't..225 230 235 240 Trp Glu Tyr Gly Gly Tyc Ala Ser Giu Gly Val Lys Gin Val Ala Glu ~~)245 250 255 Lou Gly Bcr Pro Val Lys Met Giu Giu Giu Ile Ar; Gin Gin Bar Asp ft260 265 270 Giu Val Lou Thr Val Ile Lye Ala Lye Ala Gin Tcp, Pro Ala Trp, Gin 275 280 285 Pro Lou Asn Val Ar; Ala Ala Pro 5cr Ala Giu Phe 8cr Val Asp Ar; 290 295 300 His 305 Trp Trp Ap Tyr 385 Glu Il Asp cys Lou 465 Pri.
cys 20 Gly Ap Val 545 25 Glu His Glu Pro Arg His Lou Not Ser Phs, Lou Thr 310 Ann Val Ser Lys 370 Asp Art;
ASP
Thr Ser 450 Lys Gin met Hot Gly 530 Ann Trp Art; Thr Cyu 610 Val Gin Gly 355 Ile Pro Ile Asp Pro 435 Ser Ala Pro met Giy 515 Ser Glu Asp Net, Thr 595 Lou Gly Lys Val Art Giu Ala Ile 420 Giu Ser Gin Cym Ser 500 Thr met Giu Giu Ile Giu Lou Lou 325 Val1 Thr Pro Gly Art; 405 Val Thr Thr Lou met 485
ASP
Arg cyn Cym Cym 565 Lys Ala Ser ser Val1 Tyr Lou Giy 390 Lys Ala Gym cys Ap 470 Gly Try Sec Lys Ser 550 Ser met Glu Pro Ala Glu Asp Gin Asp Lou 345 Giu Ser Pro 360 Thr Ser Lou 375 Ser Ile Lys Gly Giu Gin Asp Lou Ala 425 Tyr Ser 440 Giu Lys Gly 455 Lou Ser Val Pro Gly Cys Ile Thr Trp 505 Art; Giu Arg 520 Val Pro Thr 535 Pro Ser Ser Ala Ser Cym Thr Pro Ala 585 Lys Cym Hot 600 Trp Ser Glu 615 69 Hot Lou 330 Ile Ann Asp Lou Gym 410 Pro Ann Lys Pro Ser 490 Ser Tyr Glu Gym Gly 570 Ap Met Trp Lou 315 Gym Pro Lys His Val 395 Ann Glu Trp Arg Gys 475 Asp Pro Val G lu
LOU
555 Thr Gly Pro Ser Gly Thr Trp Pro Pro 380 Ala Phe, Glu Sec met 460 Pro Glu Gym Lys Thr 540 Val Gly Sec Glu Asp 620 Pro Lys Asp Thr 365 Gin Ar; Val1 Lys Pro 445 Arg Ap Asp Ser Gin 525 Glu Thr Met met Gym 605 Gym Ser Asp Ala 350 Val Ser Val Pro Glu 430 TrP Gin Thr Gly Val 510 Ph* Lys Clu Lys Gym 590 His Bar Pro Gym, 335 Gly Pro Pro Val Asp 415 Giu Ser Arg Gin Ser 495 Ser Pro Gym TrP Arg 575 Lys Thr Val Asp 320 Gly Thr Gin Ph* Lou 400 Ann Asp Ala Hot Asp 480 Thr Gym Glu I le Gly 560 Art; Ala 11e Thr 9* 0 *0
M
Gym 625 Gly Lys Gly Met Art; Thr Art; Gin Art; Not Lou Lys 8cr Ala Ala 630 635 640 Giu Lou Gly Asp Cyu Ann Glu Giu Lou Giu Gin Ala Glu Lys Cys Met 645 650 655 Lou Pro Glu Cys Pro Ile Alp Cys Glu Lou Thr Glu Trp 8cr Gin Trp 660 665 670 Ser Glu Cys Ann Thr Sr Cys Gly Lys Gly His Met Ile Arg Thr Ar; 675 680 685 Hot Ile Lys Ile Glu Pro Gin Phe Gly Gly Thr Ala Cys Pro Glu Thr 690 695 700 val Gin Arg Thr Lys Cys Arg Val Ar; Lys Cys Lou Arg Gly Pro Gly 705 710 715 720 Met Giu Lys Arg Ar; Trp Lys Glu Ala Ar; Giu Lys Arg Arg Sr Glu 725 730 735 Gin Ala Lys Lys Aen Ile Asp Aen Glu Gin Tyr Pro Val Cya Ar; Lou 740 745 750 Lys Pro Trp Thr Ala Trp Thr Giu Cys Sr Thr Lou Cyn Gly Gly Gly 755 760 765 Ile Gin Glu Ar; Tyr Met Mt Val Lys Lys Arg Sr Lys Sr Thr Gln 770 775 780 Ph@ Thr 8cr Cys Lys Asp Lys Lys Giu Lou Arg Ala Cys Ann Val His 785 790 795 800 Pro Cys INrORMATION FOR SEQ ID NO:13: SEQUENCE CHARACTRISTICS: LENGTH 1816 base pairs .i TYPZ: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPZ: cA (ii) FETURE: NANE/KEY: CDS LOCATION: 2..1705 .1 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: T TCA OGT GAA TAT OTT CTT TG AG? ATG AGA CAA CC AGT GAT OT 46 8cr Oly Glu Tyr Val Lou Trp Ser Mt Arg Gin Ala Sr Asp Gly 1 5 10 GTC ARA CAA GTA OCT GAG TTG GGT TCT CCA GTC AAA ATO GAA GA GAA 94 Val Lys Gin Val Ala Olu Lou Gly 8cr Pro Val Lys Mt Olu Giu Glu 20 25 ATT CGA CAG hAG GGA GAT GAA GTT CTA ACA GTA ATC AAA CC AAA GCT 142 Ile Arg Gin Lys Gly Asp Giu Val Lou Thr Val Ile Lys Ala Lys Ala 40 71 CAGTG CG CC TGG cAG CCC CTC AK? GTG AGO GCC GCC CCT TCA GC? 190 Gin Try Pro Ala Trp Gin Pro Lou Ann Val Arg Ala Ala Pro Sex Ala so 55 GAG TTC TC?, cTO QAC AGA AGC COT CAC CTG ATO TCA TTT CTG GCC ATO 238 Giu Ph. Ser Val Asp Arg Sex Arg His Lou Hot Ser Ph. LOU Ala Hot 70 ATG GOT CCT AGC CCA GAC TOG AAT GTA GGA CTC ACC TCC GAG GAT CTC 286 Hot Gly Pro Sex Pro Asp Trp Asn Val Gly Lou Thr Ser Glu Asp Lou so 85 90 TOT ACC AAA GAG TGT GOC TOG GTT CAG AAG GTG GTC CAG GA? TTG ATT 334 Cys Thr Lys Olu Cys Gly Trp Val Gin Lys Val Val Gin Asp Lou Ile 100 105 110 CCA TGG GAT GCK GOC AC? GAC AG? GGG GTA ACC TAC GAG TCT CCA AAC 382 Pro Trp Asp Ala Gly Thr Asp Sex Gly Val Thr Tyr Glu Ser Pro Ann 115 120 125 AAG CCC ACC AT? CCC CAG GAT AAA ATC CGA CCT CTG ACA AG? CG GAT 430 Lys Pro Thr Ile Pro Gin Asp Lys Ile Ar; Pro Lou Tb: Ser Lou Asp 130 135 140 CAC CCA CAA AGC CCT TCT ATG ACC AGA GOT GGG CCA ATC ATA CC? ATA 478 His Pro Gin Ser Pro Ser Net Thr Ar; Gly Gly Pro Ile Ile Pro Ile 145 150 155 GCT CGA GTT GTO AT? OAK AGO AT? 0CC AGO AG 00K OAK CAG TGC AK? 526 Ala Arg Val Vai Ile Giu Ar; Ile Ala Ar; Lys Giy Oiu Gin Cys Ann 160 165 170 175 *ATT ATA CCC GAC AAC OTG GAT GAC ATA GTA OCK OAT CTO GTA ACG GAA 574 *Ile Ile Pro Asp Annl Val Asp Asp le Val Ala Asp Lou Val Tb: Olu 180 185 190 a..GAG AAA GAC GAA GAT GA? ACC CCG GAG ACC TGC ATA TAT TCC AAC TOG 622 :Giu Lys Asp Giu Asp Asp Tb: Pro Giu Tb: Cys Ile Tyr Ser Ann Trp 195 200 205 TCC CCC TOO TCG GCC TGC AGC TCO GCC ACC TGC GAC AKO GOC AKO COG 670 Pro Trp Ser Ala Cys Ser Ser Ala Tb: Cys Asp Lys Oly Lys Ar; 210 215 220 ATG AGA CAG CGC ATO TTA AAG OCT CAG TTA GA? CTC AG? OTT CCC TOC 718 mot Ar; Gin Arg Het Lou Lye Ala Gin Lou Asp Lou 8cr Val Pro Cys 225 230 235 .CCA GAC ACT CAG GAC TTT OAK CCC TGC ATG GGG CCC GGC TGC AGC OAT 766 Pro Asp Tb: Gin Asp Phe Giu Pro Cys Not Gly Pro Gly Cys Ser Asp 240 245 250 255 GAC GAK 0CC TCT ACC TGC ATG ATO TCA GA, TOG ATC ACC TOG TCO CCG 814 4,Asp Giu Ala Ser Tb: Cy. Not Met Ser Oiu Trp Ile Tb: Trp Ser Pro 260 265 270 TGC AGC GCC TCC TGC 000 ATG 00K ATT GAG OTC AGO GAG AGA TAC OTC 862 Cys Ser Ala Ser Cys Oly Met Oly Ile Oiu Val Ar; Giu Ar; Tyr Val 275 280 285 AKO CAG TTC CCA OA GAC GOT TCC TTN TOT AA GTC CCA AG OAK OAK 910 Lys Gin Phe Pro O.u, Asp Oly Ser Lou Cys Lys Val Pro Tb: Glu, Giu 290 295 300 ACT GAG AAA TGC AT? GTC ART GAG GAG TOT GAG CCA AGC AGC TOT ATA Thr Giu Lys Cy Ile Val Kan Giu Giu Cys Giu Pro Ser 8cr Cy. Ile 305 310 315 958 GTC ACO GMA Val Thr 320 GOT ATO Gly Met TCT ATO, Ser Met GMA TOT Giu Cys GAT TOC Ap Cys 385 TTG AG Lou Lys 400 AAA CAA Lys Gin .:TTG AM.
**Lou Thr *GGT CAC Gly His :6..,GGA GCC *Gly Aia ~:'.465 MAA TOT :Lys Cys *6 480 CGA GAG Arg Giu CAG TAC Gin Tyr ACC A Thr Lys Glu
AAG
Lys
TOC
Cy.
CAT
His 370
AGC
8cr Tcc 8cr
GTG
Val
GAG
Giu
ATO
Hot 450
GT=
Val
MA
Gin
MGO
Lys
CCT
Pro
TTC
Ph* 530 TOO GM GAG TOO GAG GAG TC MOC OC ACA TGC CO AMO Trp Aia Giu Try, Giu Giu CY8 8cr Ala Th~r Cys Ar; Met 325 330 335 MAG COO MAC AGO ATO ATA MAG ATO ACT CCA GCG GAT GGA Lye Ar; His Ar; Hot Ile Lys Hot Thr Pro Ala Asp Giy 340 345 350 MAA GC= GAC ACM ACA GAO OTT GAG AMA TGC ATO ATO CCC Lys Ala Asp Thr Tb: Giu Vai Giu Lye Cy. Met Hot Pro 355 360 1365 ACC ATC COO TOC OTO TTG TCC CCT TOO TCT GMA TOO AGT Thr Ile Pro Cys Val Lou So: Pro Try, 8cr Giu Try, 8cr 375 380 OTT ACC TOT GOC AAA GOC ACC AGA ACC AGA CAG AGA ATC Val Tbr Cy. Gly Lye Gly Thr Ar; Tb: Ar; Gln Ar; Hot 390 395 CCO TCT GMA CTT OGA. OAT TOC AAT GAG QAA CTO GMA CTO Pro 8cr Giu Lou Giy Asp Cys Ann Giu Giu. Lou Giu Lou 405 410 415 GMA MG TOC ATO CTT CCT GMA TGC CCT ATA AGC TOT GMA Giu Lye Cye Hot Lou Pro Giu Cys Pro Ile 8cr Cys Giu 420 425 430 TOO TCT TAC TOO TCT GAG TOT MC MAA TOC TCG 0CC MAG Trp 8cr Tyr Try, 8cr Oiu Cys Asn Lys Cye 8cr Giy Lye 435 440 445 ATT COT ACC COA ATO ATC ACA ATO GMR CCA CAG TTT GGA Ile Ar; Th: Ar; Hot Ile Tb: Hot Giu Pro Gin Ph* Gly 455 460 TOT COG GMA ACC OTO CMR CO MA MAA TOO OA TTA COT Cys Pro Glu Tb: Val Gin Ar; Lys Lye Cys Ar; Lou Ar; 470 475 AAA hOT TCC 000 hAT GAG CGA AGO CAT TTA MhG OAT 0CC Lye 8cr 8cr Gly hen Giu Ar; Ar; His Lou Lys Asp Ala 485 490 495 AGA AGO hOT GM MAA ATA MAG GMR OAT TCA OAT OGA CAM Ar; Arg Ser Giu Lye Ile Lye Glu Asp 8cr Asp Gly Giu 500 505 510 0TA TOT MAA ATO A CCA TOG ACT GMA TOG ACC GAR TOT Val Cye Lye Hot Lys Pro Trp Tb: Ala Try, Tb: Giu Cym 515 520 525 TOC GOT 000 000 ATA CM GAG COG TTC ATO ACT GTO hAG Cys Gly Gly Gly Ile Gin Oiu Arg Ph. Net Thr Vai Lye 535 540 1006 1054 1102 1150 1198 1246 1294 1342 1390 1438 1486 1534 1582 1630 73 AAO AGA TTC AAA hOT TCT CAG TTC ACC AGC TOC AAG GAC AAG AAG GAG 1678 Lys Arg Ph. Lys Ser Gin Ph* Thr Sor Cys Lys Asp Lys Lys Giu 550 555 ATC COG OCT TGC AAT GTC CAT CCA TOT TAACCTGCCT GAlAAGAGGG 1725 Ile Arg Ala Cy. hen Val Bin Pro Cys 560 565 ATTGACACTA CAATCGCAAC AGAAGTCAAT CTTTATTAGA TATTTTTTAT CATAGAlTAT 1785 ATACATGTGC CAI'T CATOTACTIT T 1816 INFORMATION FOR SEQ ID NOt14t SEQUENCE CHARACTERISTICS: LENGTH: 568 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:
C.
C
C
Ce..
C
C
C.
CC..
C C C. CC a a C
C
CCC.
CC.*C
C
Ser Lys Arg Trp, Ph* 65 Gly Thr Try Pro Pro 145 Arg Ile Gin Gin Pro 50 Sor Pro Lys Ap Th: 130 Gin Val Val Lys Ala Val 5cr G iu Ala 115 110 8cr Val Ala Gly Trp Ap Pro Cyu 100 Gly Pro Pro Ile Oly Olu Tyr Val Olu Asp Gin Arg Asp Gly Thr Gin 5cr Glu 165 Val Lou Trp Jar Met Arg 10 Gin Ala Ser Asp Lou Glu Pro So: 70 Trp Trp Ap Ap Not 150 Arg Asp Gly Val Lou 55 Arg an Val
S.:
Lys 135 Tb: I10 Ap Se: Pro Val 25 Lou Tb: Val 40 an Val Arg His Lou Net Val Gly Lou 90 Gin Lys Val 105 Gly Val Tb: 120 Ile Arg Pro Arq Gly Gly Ala Arg Lys 170 11-w Y1t- Ala 185 Lys I10 Ala 8cr 75 Tb: Val Tyr
LOU
Pro 155 Gly Ap Net Lys Ala Phe 5cr Gin Giu Tb: 140 Ile Giu
LOU
Glu Ala Pro Lou Giu ap Ser 125 Sc: Ile Gin Val Clu Lys 8cr Ala Ap Lou 110 Pro Lou Pro Cys Tb: 190 Giu Ala Ala met Lou Ile an Ap I10 hen 175 Glu Ile Gin Oiu met s0 Cys Pro Lys His Ala 160 110 Glu Oly Val Pro Ap An 180 Lys Ap Glu 195 Ap Ap Tb: Pro Glu Tb: Cys Ile Tyr gar 200 An Trp Scr 74 Pro Trp $or Ala Cym 8cr 8cr Ala Thr Cys Asp Lys Gly Lys Ar; met 210 215 220 Ar; Gin Ar; Met Lou Lys Ala Gin Lou Asp Lou 8cr Val Pro Cys Pro 225 230 235 240 Asp Thr Gin Asp Ph. Giu Pro Cys Mot Gly Pro Gly Cys 8cr Asp Asp 245 250 255 Giu Ala Ser Thr Cys Met Met Ser Glu Trp Ile Thr Trp 5cr Pro Cys 260 265 270 Ser Ala 8cr Cy. Gly Mot Gly Ile Giu Val Ar; Giu Arg Tyr Val Lys 275 280 285 Gin Phe Pro Giu Asp Gly 8cr Lou Cym Lys Val Pro Thr Giu Giu Thr 290 295 300 Giu Lys Cys le Val Ann Giu Giu Cym Giu Pro 8cr Sec Cys Ile Vai 305 310 315 320 Thr Giu Trp Ala Giu Trp Giu Giu Cys 8cr Ala Thr Cys Ar; Met Gly 325 330 335 Mot Lys Lys Ar; His Ar; Met Ile Lys Met Thr Pro Ala Asp Gly 8cr 340 345 350 Met Cy. Lys Ala Asp Thr Thr Giu Val Glu Lys Cyu Met Met Pro Glu 355 360 365 Cys His Thr Ile Pro Cy. Val Lou 8cr Pro Trp 8cr Glu Trp, 8cr Asp 370 375 380 Cys 8cr Val Thr Cys Giy Lys Gly Thr Ar; Tkxr Ar; Gin Ar; Met Lou *385 390 395 400 Lys Ser Pro 8cr Glu Lou Gly Asp Cys Ann Giu Glu Lou Glu Lou Lys Gin Val Giu Lys Cy. Met Lou Pro Giu Cys Pro Ile 8cr Cys Giu Lou 420 425 430 Thr Giu Trp 8cr Tyr Trp 8cr Glu Cys Ann Lys Cys $or Gly Lye Gly *435 440 445 :His Met Ile Ar; Thr Ar; Met le Thr Met Giu Pro Gin Phc Giy Gly 450 455 460 Aia Val Cys Pro Giu Thr Val Gin Ar; Lys Lys Cys Ar; Lou Arq Lye 465 470 475 480 Cys Gin Lye 8cr 8cr Gly Aen Giu Ar; Ar; His Lou Lys Asp Ala Ar; 485 490 495 Glu Lys Ar; Ar; 8cr Giu Lys Ile Lye Gltv Asp 8cr Asp Gly Giu Gin 500 505 510 3' Tyr Pro Val Cys Lye Not Lye Pro Trp Thr Ala Trp Thr Glu Cys Th: 515 520 525 Lys Ph* Cys Gly Giy Giy Ile Gin Qiu Ar; Ph* Met Tb: Val Lye Lye 530 535 540 Ar; Phe Lye ser Ser Gin Phes Thr Ser Cys Lys Asp Lys Lys Gu Ile 545 550 555 560 Arg Ala Cys Ann Val His Pro Cys 565 INFORMATION FOR SEQ ID SEQUENCE CHARACTERISTICS: LENGTH: 59 ami.no acids TYPE: amino acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID 110:15: Giu Thr Cys Ile Tyr Ser Ann Trp Scr Pro Trp Ser Ala Cys Scr Scr Thr Cys Giu Lys Gly Lys Arg Mot Ar; Gin Arg Met Lou Lys Ala 25 Gin Lou Amp Lou Ser Vai Pro Cys Pro Asp Thr Gin Asp Ph* Gin Pro 40 Cys Met Gly Pro Gly Cys Ser Asp Giu Asp Gly a. INFORMATION FOR SEQ ID NO:16: SEQUENCE CHARACTERISTICS: LENGTH: 56 amino acids TYPE: amino acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: Ser Thr Cys Thr Met Ser Giu Trp Ile Thr Trp Ser Pro Cys Ser Val.
.25 15 10 Ser eye Gly Met Gly Not Ar; Ser Ar; Glu krg Tyr VJal Lys Gin Phe 25 Pro Asp Gly Ser Vai Cys Met Leu Pro Thr Glu Giu Thr Giu Lys Cys 40 Thr Val Ann Giu Giu Cys Ser Pro 50 INFORMATION FOR SEQ ID NO:17: SEQUENCE CHARACTERISTICS: LENGTH: 56 amino acids 76 TYPE: amino acid (C STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID 140:17: Ser Sor Cys Leu Val Thr Glu Trp Gly Glu Trp Amp Asp Cys Ser Ala 1 5 10 Thr Cys Gly Het Gly Met Lys Lys Arg His Arg Met Val Lys Met Ser 25 Pro Ala Asp Gly Ser Met Cys Lys Ala Glu Thr Ser Gln Ala Glu Lys 40 Cys Met Met Pro Glu Cys His Thr so0S INFORMATION FOR SEQ ID 190:18: SEQUENCE CHARACTERISTICS: LENGTH: 53 amino acids TYPE: amino acid STRANDEDNESSZ single TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID 190:18: Ile Pro Cys Leu Leu Ser Pro Trp Glu Trp Ser Asp Cys Ser Val Thr 5 10 Cys Gly Lys Gly Met Arq Thr Ar; Gin Arg Met Lou Lys Ser Lou Ala 25 Glu Leu Gly Asp Cys Asn Glu Asp Lou Glu Gln Ala Glu Lys Cys Met 40 Leu Pro Glu Cys Pro .5.25 INFORMATIONI FOR SEQ ID N90:19: SEQUENCE CHARACTERISTICS: LENGTH: 56 amino acids TYPE: amino acid STRANDEDNESS: single TOPOLOGY: linear (1i) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID 190:19: 77 Ile Asp Cys Glu Lou Ser Glu Trp, Ser Gin Trp, 5cr Glu Cys Asn Lys 1 5 10 Sor Cym Gly Lys Gly His met Ile Arg Thr Arg Thr Ile Gin Met Giu 25 Pro Gin Ph& Gly Giy Ala Pro Cys Pro Glu Thr Val Gin Ar; Lys Lys 40 Cys Arg Ala Arg Lys Cy. Lou Arg so INFORMATION FOR SEQ ID N0:20: SEQUENCE CHARACTERISTICS: LENGTH: 55 ami.no acids TYPE: amio aci.d STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID Pro Gin Cys Ar; Met Arg Pro Trp Thr Ala Trp Ser Glu Cys Thr Lys 1 5 10 Lou Cys Gly Gly Gly Ile Gin Giu Ar; Tyr Met Thr Val Lys Lys Arg 20 25 Ph* Lys Ser Gin Ph. Thr Se: Cys Lys Asp Lys Lys Giu Ile Ar; 40 :.Ala Cys Aen Val His Pro Cys
Claims (16)
1. An isolated nucleic acid molecule encoding human F-spondin wherein said nucleic acid molecule comprises a nucleotide sequence that is complementary to a sequence that is capable of hybridizing under low stringency conditions to SEQ ID NO: 9 and/or SEQ ID NO: 11 and/or SEQ ID NO: 13.
2. An isolated nucleic acid molecule that encodes a human F spondin polypeptide when isolated by a process comprising contacting nucleic acid derived from a human source with a hybridisation-effective amount of a nucleic acid probe comprising at least 15 contiguous nucleotides of any one of SEQ ID NOs: 9, 11, or 13 for a time and under conditions sufficient for hybridisation to occur and then isolating the hybridised human nucleic acid.
An isolated or recombinant human F-spondin polypeptide.
4. An isolated or recombinant human F-spondin polypeptide encoded by the nucleic acid molecule of claim 1 or 2.
5. A method of attaching nerve cells to a matrix comprising contacting the matrix with nerve cell and human F-spondin at a concentration effective to effect attachment of the cells o to the matrix.
6. A method of attaching nerve cells to a matrix comprising contacting the matrix with nerve cell and human F-spondin at a concentration effective to effect attachment of the cells to the matrix, wherein said human F spondin polypeptide is encoded by the isolated nucleic acid molecule of claim 1 or 2.
7. A method of stimulating growth of a nerve cell comprising contacting the nerve cell with human F-spondin at a concentration effective to stimulate growth of the nerve cell.
8. A method of stimulating growth of a nerve cell comprising contacting the nerve cell v *I -v j P:\OPER\MRO\1883748.SPE 28/9/99 -79- with human F-spondin at a concentration effective to stimulate growth of the nerve cell, wherein said human F spondin polypeptide is encoded by the isolated nucleic acid molecule of claim 1 or 2.
9. A method of regenerating nerve cells in a subject comprising administering to the subject human F-spondin at a concentration effective to regenerate nerve cells in the subject.
A method of regenerating nerve cells in a subject comprising administering to the subject human F-spondin at a concentration effective to regenerate nerve cells in the subject, wherein said human F spondin polypeptide is encoded by the isolated nucleic acid molecule of claim 1 or 2.
11. A pharmaceutical composition for stimulating nerve cell growth comprising a pharmaceutically acceptable carrier and human F-spondin at a concentration effective to stimulate nerve cell growth. i:
12. A pharmaceutical composition for stimulating nerve cell growth comprising a pharmaceutically acceptable carrier and human F-spondin at a concentration effective to stimulate nerve cell growth, wherein said human F spondin polypeptide is encoded by the isolated nucleic acid molecule of claim 1 or 2.
13. Use of the human F spondin polypeptide according to claim 3 in the preparation of a medicament for the regeneration of a nerve cell in a subject.
14. Use of the human F spondin polypeptide according to claim 4 in the preparation of a medicament for the regeneration of a nerve cell in a subject. Use of the human F spondin polypeptide according to claim 3 in the preparation of a medicament for the stimulation of nerve cell growth in a subject.
P;\OPER\MRO\1883748.SPE 28/9/99 80
16. Use of the human F spondin polypeptide according to claim 4 in the preparation of a medicament for the stimulation of nerve cell growth in a subject. DATED this TWENTY EIGHTH day of SEPTEMBER 1999 The Trustees of Columbia University in the City of New York by their Patent Attorneys DAVIES COLLISON CAVE 0 0 0 00* S.0.0 0 00* .Z*
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US07/862,021 US5279966A (en) | 1992-04-02 | 1992-04-02 | Cloning, expression and uses of a novel secreted protein, F-spondin |
US862021 | 1992-04-02 | ||
AU39455/93A AU677185B2 (en) | 1992-04-02 | 1993-04-02 | Cloning, expression and uses of a novel secreted protein, F-spondin |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU39455/93A Division AU677185B2 (en) | 1992-04-02 | 1993-04-02 | Cloning, expression and uses of a novel secreted protein, F-spondin |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU19479/00A Division AU1947900A (en) | 1992-04-02 | 2000-02-25 | Cloning, expression and uses of a novel secreted protein F-spondin |
Publications (2)
Publication Number | Publication Date |
---|---|
AU1269897A AU1269897A (en) | 1997-05-15 |
AU713198B2 true AU713198B2 (en) | 1999-11-25 |
Family
ID=25624774
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU12698/97A Ceased AU713198B2 (en) | 1992-04-02 | 1997-02-13 | Cloning, expression and uses of a novel secreted protein, F-spondin |
Country Status (1)
Country | Link |
---|---|
AU (1) | AU713198B2 (en) |
-
1997
- 1997-02-13 AU AU12698/97A patent/AU713198B2/en not_active Ceased
Also Published As
Publication number | Publication date |
---|---|
AU1269897A (en) | 1997-05-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU677185B2 (en) | Cloning, expression and uses of a novel secreted protein, F-spondin | |
Klar et al. | F-spondin: a gene expressed at high levels in the floor plate encodes a secreted protein that promotes neural cell adhesion and neurite extension | |
US6664098B1 (en) | Differentiation inhibitory agent | |
US6337387B1 (en) | Differentiation-suppressive polypeptide | |
AU729880C (en) | Recombinant vascular endothelial cell growth factor D (VEGF-D) | |
US6638741B2 (en) | Differentiation-suppressive polypeptide serrate-2 | |
JPH11512709A (en) | Truncated glial cell line-derived neurotrophic factor | |
NO320839B1 (en) | Isolated polypeptide, process for its preparation, isolated nucleic acid molecule, isolated antibody, antisense molecule, preparation, use of a polypeptide and nucleic acid molecule, host cell and vector respectively. | |
AU690912B2 (en) | Cloning, expression and uses of (dorsalin-1) | |
AU713198B2 (en) | Cloning, expression and uses of a novel secreted protein, F-spondin | |
JP4139508B2 (en) | Human delta 3 | |
JP4332581B2 (en) | Human delta-3 | |
AU1947900A (en) | Cloning, expression and uses of a novel secreted protein F-spondin | |
US7252973B1 (en) | Protein and processes for producing the same | |
JP4171762B2 (en) | Novel differentiation inhibitor | |
US20020128440A1 (en) | Endoderm, cardiac and neural inducing factors - oligonucleotides for expressing human frazzled (frzb-1) protein | |
AU2003200912B2 (en) | Novel protein and processes for producing the same | |
JP2006212033A (en) | Differentiation-suppressive polypeptide |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
MK14 | Patent ceased section 143(a) (annual fees not paid) or expired |