CA2304169A1 - Laminins and uses thereof - Google Patents
Laminins and uses thereof Download PDFInfo
- Publication number
- CA2304169A1 CA2304169A1 CA002304169A CA2304169A CA2304169A1 CA 2304169 A1 CA2304169 A1 CA 2304169A1 CA 002304169 A CA002304169 A CA 002304169A CA 2304169 A CA2304169 A CA 2304169A CA 2304169 A1 CA2304169 A1 CA 2304169A1
- Authority
- CA
- Canada
- Prior art keywords
- leu
- gly
- cys
- ala
- ser
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 241001367053 Autographa gamma Species 0.000 claims 1
- 108090000765 processed proteins & peptides Proteins 0.000 abstract description 247
- 102000004196 processed proteins & peptides Human genes 0.000 abstract description 219
- 229920001184 polypeptide Polymers 0.000 abstract description 184
- 102000040430 polynucleotide Human genes 0.000 abstract 2
- 108091033319 polynucleotide Proteins 0.000 abstract 2
- 239000002157 polynucleotide Substances 0.000 abstract 2
- 108010085895 Laminin Proteins 0.000 description 136
- 102000007547 Laminin Human genes 0.000 description 136
- 108090000623 proteins and genes Proteins 0.000 description 121
- 210000004027 cell Anatomy 0.000 description 119
- 210000001519 tissue Anatomy 0.000 description 116
- 238000000034 method Methods 0.000 description 106
- 150000007523 nucleic acids Chemical class 0.000 description 79
- 108020004707 nucleic acids Proteins 0.000 description 74
- 102000039446 nucleic acids Human genes 0.000 description 74
- 108020004414 DNA Proteins 0.000 description 56
- 239000002773 nucleotide Substances 0.000 description 50
- 125000003729 nucleotide group Chemical group 0.000 description 50
- 102000004169 proteins and genes Human genes 0.000 description 49
- 235000018102 proteins Nutrition 0.000 description 48
- 239000012634 fragment Substances 0.000 description 44
- 125000003275 alpha amino acid group Chemical group 0.000 description 37
- 230000014509 gene expression Effects 0.000 description 37
- -1 RNA or DNA Chemical class 0.000 description 34
- 239000002299 complementary DNA Substances 0.000 description 30
- 150000001875 compounds Chemical class 0.000 description 30
- 235000001014 amino acid Nutrition 0.000 description 29
- 241001465754 Metazoa Species 0.000 description 28
- 108091034117 Oligonucleotide Proteins 0.000 description 27
- 229940024606 amino acid Drugs 0.000 description 27
- 150000001413 amino acids Chemical class 0.000 description 27
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 27
- 208000035475 disorder Diseases 0.000 description 25
- 238000001727 in vivo Methods 0.000 description 22
- 230000035772 mutation Effects 0.000 description 22
- 108091028043 Nucleic acid sequence Proteins 0.000 description 21
- 239000013612 plasmid Substances 0.000 description 21
- 239000013638 trimer Substances 0.000 description 21
- 238000002360 preparation method Methods 0.000 description 20
- 230000004071 biological effect Effects 0.000 description 19
- 239000013598 vector Substances 0.000 description 19
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 18
- 108020001507 fusion proteins Proteins 0.000 description 18
- 102000037865 fusion proteins Human genes 0.000 description 18
- 238000000338 in vitro Methods 0.000 description 18
- 108700019146 Transgenes Proteins 0.000 description 17
- 125000000539 amino acid group Chemical group 0.000 description 17
- 239000000523 sample Substances 0.000 description 17
- 238000006467 substitution reaction Methods 0.000 description 17
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 15
- 101150051936 Y-3 gene Proteins 0.000 description 15
- 230000009261 transgenic effect Effects 0.000 description 15
- 239000005557 antagonist Substances 0.000 description 14
- 230000027455 binding Effects 0.000 description 14
- 230000003993 interaction Effects 0.000 description 14
- 210000005036 nerve Anatomy 0.000 description 14
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 13
- 230000000694 effects Effects 0.000 description 13
- 239000003446 ligand Substances 0.000 description 13
- 208000027418 Wounds and injury Diseases 0.000 description 12
- 210000002469 basement membrane Anatomy 0.000 description 12
- 238000001476 gene delivery Methods 0.000 description 12
- 239000000203 mixture Substances 0.000 description 12
- FFEARJCKVFRZRR-SCSAIBSYSA-N D-methionine Chemical compound CSCC[C@@H](N)C(O)=O FFEARJCKVFRZRR-SCSAIBSYSA-N 0.000 description 11
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 11
- 206010052428 Wound Diseases 0.000 description 11
- 238000002703 mutagenesis Methods 0.000 description 11
- 231100000350 mutagenesis Toxicity 0.000 description 11
- 210000003491 skin Anatomy 0.000 description 11
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 10
- 241000700605 Viruses Species 0.000 description 10
- 239000000556 agonist Substances 0.000 description 10
- 238000012217 deletion Methods 0.000 description 10
- 230000037430 deletion Effects 0.000 description 10
- 108010050848 glycylleucine Proteins 0.000 description 10
- 230000002163 immunogen Effects 0.000 description 10
- 108020004999 messenger RNA Proteins 0.000 description 10
- 210000002826 placenta Anatomy 0.000 description 10
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 9
- 238000003780 insertion Methods 0.000 description 9
- 230000037431 insertion Effects 0.000 description 9
- 238000012216 screening Methods 0.000 description 9
- 241001430294 unidentified retrovirus Species 0.000 description 9
- 101150002210 34 gene Proteins 0.000 description 8
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 8
- 238000003556 assay Methods 0.000 description 8
- 230000001580 bacterial effect Effects 0.000 description 8
- 239000000499 gel Substances 0.000 description 8
- 230000002068 genetic effect Effects 0.000 description 8
- 238000002347 injection Methods 0.000 description 8
- 239000007924 injection Substances 0.000 description 8
- 230000003902 lesion Effects 0.000 description 8
- 230000004048 modification Effects 0.000 description 8
- 238000012986 modification Methods 0.000 description 8
- 210000002569 neuron Anatomy 0.000 description 8
- 239000002245 particle Substances 0.000 description 8
- 238000011069 regeneration method Methods 0.000 description 8
- 230000002103 transcriptional effect Effects 0.000 description 8
- 239000013603 viral vector Substances 0.000 description 8
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 7
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 7
- 241000699666 Mus <mouse, genus> Species 0.000 description 7
- 241000283973 Oryctolagus cuniculus Species 0.000 description 7
- 230000000692 anti-sense effect Effects 0.000 description 7
- 108010093581 aspartyl-proline Proteins 0.000 description 7
- 230000015572 biosynthetic process Effects 0.000 description 7
- 239000000872 buffer Substances 0.000 description 7
- 210000004671 cell-free system Anatomy 0.000 description 7
- 210000000349 chromosome Anatomy 0.000 description 7
- 108010060199 cysteinylproline Proteins 0.000 description 7
- 239000013604 expression vector Substances 0.000 description 7
- 230000004927 fusion Effects 0.000 description 7
- 238000004519 manufacturing process Methods 0.000 description 7
- 239000012528 membrane Substances 0.000 description 7
- 230000001105 regulatory effect Effects 0.000 description 7
- 210000004927 skin cell Anatomy 0.000 description 7
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 7
- 238000003786 synthesis reaction Methods 0.000 description 7
- 108010061238 threonyl-glycine Proteins 0.000 description 7
- 241000701161 unidentified adenovirus Species 0.000 description 7
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 6
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 6
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 6
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 6
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 6
- 230000000735 allogeneic effect Effects 0.000 description 6
- 210000002808 connective tissue Anatomy 0.000 description 6
- 210000004748 cultured cell Anatomy 0.000 description 6
- 108010049041 glutamylalanine Proteins 0.000 description 6
- 230000012010 growth Effects 0.000 description 6
- 210000004379 membrane Anatomy 0.000 description 6
- 108010008217 nidogen Proteins 0.000 description 6
- 210000002729 polyribosome Anatomy 0.000 description 6
- 230000001737 promoting effect Effects 0.000 description 6
- 108020003175 receptors Proteins 0.000 description 6
- 230000008929 regeneration Effects 0.000 description 6
- 230000002207 retinal effect Effects 0.000 description 6
- 238000012163 sequencing technique Methods 0.000 description 6
- 230000009885 systemic effect Effects 0.000 description 6
- 230000000699 topical effect Effects 0.000 description 6
- 230000003612 virological effect Effects 0.000 description 6
- AGPKZVBTJJNPAG-RFZPGFLSSA-N D-Isoleucine Chemical compound CC[C@@H](C)[C@@H](N)C(O)=O AGPKZVBTJJNPAG-RFZPGFLSSA-N 0.000 description 5
- WHUUTDBJXJRKMK-GSVOUGTGSA-N D-glutamic acid Chemical compound OC(=O)[C@H](N)CCC(O)=O WHUUTDBJXJRKMK-GSVOUGTGSA-N 0.000 description 5
- ROHFNLRQFUQHCH-RXMQYKEDSA-N D-leucine Chemical compound CC(C)C[C@@H](N)C(O)=O ROHFNLRQFUQHCH-RXMQYKEDSA-N 0.000 description 5
- KZSNJWFQEVHDMF-SCSAIBSYSA-N D-valine Chemical compound CC(C)[C@@H](N)C(O)=O KZSNJWFQEVHDMF-SCSAIBSYSA-N 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 5
- 108090000790 Enzymes Proteins 0.000 description 5
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 5
- VUUOMYFPWDYETE-WDSKDSINSA-N Gly-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN VUUOMYFPWDYETE-WDSKDSINSA-N 0.000 description 5
- 241000282412 Homo Species 0.000 description 5
- ODKSFYDXXFIFQN-BYPYZUCNSA-N L-arginine Chemical compound OC(=O)[C@@H](N)CCCN=C(N)N ODKSFYDXXFIFQN-BYPYZUCNSA-N 0.000 description 5
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 5
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 5
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 5
- 108010001267 Protein Subunits Proteins 0.000 description 5
- 102000002067 Protein Subunits Human genes 0.000 description 5
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 5
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 5
- 108010044940 alanylglutamine Proteins 0.000 description 5
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 5
- 238000013459 approach Methods 0.000 description 5
- 210000004252 chorionic villi Anatomy 0.000 description 5
- 210000001339 epidermal cell Anatomy 0.000 description 5
- 238000001415 gene therapy Methods 0.000 description 5
- 108010078144 glutaminyl-glycine Proteins 0.000 description 5
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 5
- 210000005260 human cell Anatomy 0.000 description 5
- 208000015181 infectious disease Diseases 0.000 description 5
- 102000005962 receptors Human genes 0.000 description 5
- 239000000758 substrate Substances 0.000 description 5
- 238000013518 transcription Methods 0.000 description 5
- 230000035897 transcription Effects 0.000 description 5
- 238000012546 transfer Methods 0.000 description 5
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 4
- 108700028369 Alleles Proteins 0.000 description 4
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 4
- ICAYWNTWHRRAQP-FXQIFTODSA-N Asp-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N ICAYWNTWHRRAQP-FXQIFTODSA-N 0.000 description 4
- 101710132601 Capsid protein Proteins 0.000 description 4
- 101710094648 Coat protein Proteins 0.000 description 4
- WTNLLMQAFPOCTJ-GARJFASQSA-N Cys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CS)N)C(=O)O WTNLLMQAFPOCTJ-GARJFASQSA-N 0.000 description 4
- BCWIFCLVCRAIQK-ZLUOBGJFSA-N Cys-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O BCWIFCLVCRAIQK-ZLUOBGJFSA-N 0.000 description 4
- DCXYFEDJOCDNAF-UWTATZPHSA-N D-Asparagine Chemical compound OC(=O)[C@H](N)CC(N)=O DCXYFEDJOCDNAF-UWTATZPHSA-N 0.000 description 4
- CKLJMWTZIZZHCS-UHFFFAOYSA-N D-OH-Asp Natural products OC(=O)C(N)CC(O)=O CKLJMWTZIZZHCS-UHFFFAOYSA-N 0.000 description 4
- CKLJMWTZIZZHCS-UWTATZPHSA-N D-aspartic acid Chemical compound OC(=O)[C@H](N)CC(O)=O CKLJMWTZIZZHCS-UWTATZPHSA-N 0.000 description 4
- ZDXPYRJPNDTMRX-GSVOUGTGSA-N D-glutamine Chemical compound OC(=O)[C@H](N)CCC(N)=O ZDXPYRJPNDTMRX-GSVOUGTGSA-N 0.000 description 4
- AYFVYJQAPQTCCC-STHAYSLISA-N D-threonine Chemical compound C[C@H](O)[C@@H](N)C(O)=O AYFVYJQAPQTCCC-STHAYSLISA-N 0.000 description 4
- 238000001712 DNA sequencing Methods 0.000 description 4
- 241000702421 Dependoparvovirus Species 0.000 description 4
- 241000588724 Escherichia coli Species 0.000 description 4
- 241000724791 Filamentous phage Species 0.000 description 4
- IYAUFWMUCGBFMQ-CIUDSAMLSA-N Glu-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N IYAUFWMUCGBFMQ-CIUDSAMLSA-N 0.000 description 4
- PYUCNHJQQVSPGN-BQBZGAKWSA-N Gly-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)CN=C(N)N PYUCNHJQQVSPGN-BQBZGAKWSA-N 0.000 description 4
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 4
- 102100021181 Golgi phosphoprotein 3 Human genes 0.000 description 4
- DFHVLUKTTVTCKY-PBCZWWQYSA-N His-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N)O DFHVLUKTTVTCKY-PBCZWWQYSA-N 0.000 description 4
- 108010065920 Insulin Lispro Proteins 0.000 description 4
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 4
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 4
- 101710125418 Major capsid protein Proteins 0.000 description 4
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 4
- 239000000020 Nitrocellulose Substances 0.000 description 4
- 238000000636 Northern blotting Methods 0.000 description 4
- 101710141454 Nucleoprotein Proteins 0.000 description 4
- 101710083689 Probable capsid protein Proteins 0.000 description 4
- 108020004511 Recombinant DNA Proteins 0.000 description 4
- 241000283984 Rodentia Species 0.000 description 4
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 4
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 4
- 201000006793 Walker-Warburg syndrome Diseases 0.000 description 4
- 230000002159 abnormal effect Effects 0.000 description 4
- 238000012867 alanine scanning Methods 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 108010068380 arginylarginine Proteins 0.000 description 4
- 108010060035 arginylproline Proteins 0.000 description 4
- 108010077245 asparaginyl-proline Proteins 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 238000007877 drug screening Methods 0.000 description 4
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 4
- 108010081551 glycylphenylalanine Proteins 0.000 description 4
- 230000035876 healing Effects 0.000 description 4
- 238000009396 hybridization Methods 0.000 description 4
- 238000013507 mapping Methods 0.000 description 4
- 239000011159 matrix material Substances 0.000 description 4
- 235000013336 milk Nutrition 0.000 description 4
- 239000008267 milk Substances 0.000 description 4
- 210000004080 milk Anatomy 0.000 description 4
- 229920001220 nitrocellulos Polymers 0.000 description 4
- 230000002483 superagonistic effect Effects 0.000 description 4
- 238000013519 translation Methods 0.000 description 4
- 238000003160 two-hybrid assay Methods 0.000 description 4
- 238000010396 two-hybrid screening Methods 0.000 description 4
- 239000003981 vehicle Substances 0.000 description 4
- 230000029663 wound healing Effects 0.000 description 4
- CBCCCLMNOBLBSC-XVYDVKMFSA-N Ala-His-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CBCCCLMNOBLBSC-XVYDVKMFSA-N 0.000 description 3
- IGULQRCJLQQPSM-DCAQKATOSA-N Arg-Cys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IGULQRCJLQQPSM-DCAQKATOSA-N 0.000 description 3
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 3
- 241000283707 Capra Species 0.000 description 3
- OCEHKDFAWQIBHH-FXQIFTODSA-N Cys-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N OCEHKDFAWQIBHH-FXQIFTODSA-N 0.000 description 3
- QLCPDGRAEJSYQM-LPEHRKFASA-N Cys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N)C(=O)O QLCPDGRAEJSYQM-LPEHRKFASA-N 0.000 description 3
- RAGIABZNLPZBGS-FXQIFTODSA-N Cys-Pro-Cys Chemical compound N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(O)=O RAGIABZNLPZBGS-FXQIFTODSA-N 0.000 description 3
- ABLQPNMKLMFDQU-BIIVOSGPSA-N Cys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CS)N)C(=O)O ABLQPNMKLMFDQU-BIIVOSGPSA-N 0.000 description 3
- MQQLYEHXSBJTRK-FXQIFTODSA-N Cys-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N MQQLYEHXSBJTRK-FXQIFTODSA-N 0.000 description 3
- XUJNEKJLAYXESH-UWTATZPHSA-N D-Cysteine Chemical compound SC[C@@H](N)C(O)=O XUJNEKJLAYXESH-UWTATZPHSA-N 0.000 description 3
- 102000053602 DNA Human genes 0.000 description 3
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 3
- 101100029932 Drosophila melanogaster lama gene Proteins 0.000 description 3
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 description 3
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 3
- DQPOBSRQNWOBNA-GUBZILKMSA-N Gln-His-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O DQPOBSRQNWOBNA-GUBZILKMSA-N 0.000 description 3
- WBYHRQBKJGEBQJ-CIUDSAMLSA-N Gln-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CS)C(=O)O WBYHRQBKJGEBQJ-CIUDSAMLSA-N 0.000 description 3
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 3
- CYHWWHKRCKHYGQ-GUBZILKMSA-N His-Cys-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N CYHWWHKRCKHYGQ-GUBZILKMSA-N 0.000 description 3
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 3
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 3
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 3
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 3
- 102000018697 Membrane Proteins Human genes 0.000 description 3
- 108010052285 Membrane Proteins Proteins 0.000 description 3
- 101100127662 Mus musculus Lama1 gene Proteins 0.000 description 3
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 3
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 3
- SXJGROGVINAYSH-AVGNSLFASA-N Phe-Gln-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SXJGROGVINAYSH-AVGNSLFASA-N 0.000 description 3
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 3
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 3
- CZCCVJUUWBMISW-FXQIFTODSA-N Pro-Ser-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O CZCCVJUUWBMISW-FXQIFTODSA-N 0.000 description 3
- 108010076504 Protein Sorting Signals Proteins 0.000 description 3
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 3
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 3
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 3
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 3
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 3
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 3
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 3
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 3
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 3
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 3
- BWVHQINTNLVWGZ-ZKWXMUAHSA-N Val-Cys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BWVHQINTNLVWGZ-ZKWXMUAHSA-N 0.000 description 3
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 3
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 3
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 3
- 235000004279 alanine Nutrition 0.000 description 3
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 3
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 3
- 230000004075 alteration Effects 0.000 description 3
- 230000003042 antagnostic effect Effects 0.000 description 3
- 230000000890 antigenic effect Effects 0.000 description 3
- 108010013835 arginine glutamate Proteins 0.000 description 3
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 3
- 108010047857 aspartylglycine Proteins 0.000 description 3
- 108010092854 aspartyllysine Proteins 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 3
- 238000005119 centrifugation Methods 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 238000002742 combinatorial mutagenesis Methods 0.000 description 3
- 230000003247 decreasing effect Effects 0.000 description 3
- 230000002950 deficient Effects 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 230000029087 digestion Effects 0.000 description 3
- 239000003085 diluting agent Substances 0.000 description 3
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 3
- 230000002500 effect on skin Effects 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- 230000001605 fetal effect Effects 0.000 description 3
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 3
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 3
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 3
- 108010010147 glycylglutamine Proteins 0.000 description 3
- 108010037850 glycylvaline Proteins 0.000 description 3
- 230000028993 immune response Effects 0.000 description 3
- 238000007901 in situ hybridization Methods 0.000 description 3
- 210000002510 keratinocyte Anatomy 0.000 description 3
- 108010034529 leucyl-lysine Proteins 0.000 description 3
- 108010057821 leucylproline Proteins 0.000 description 3
- 210000004185 liver Anatomy 0.000 description 3
- 210000004072 lung Anatomy 0.000 description 3
- 108010003700 lysyl aspartic acid Proteins 0.000 description 3
- 108010017391 lysylvaline Proteins 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 230000037230 mobility Effects 0.000 description 3
- 239000002853 nucleic acid probe Substances 0.000 description 3
- 238000004806 packaging method and process Methods 0.000 description 3
- 238000002823 phage display Methods 0.000 description 3
- 239000000825 pharmaceutical preparation Substances 0.000 description 3
- 108010024607 phenylalanylalanine Proteins 0.000 description 3
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 3
- 229920002401 polyacrylamide Polymers 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 3
- 108010031719 prolyl-serine Proteins 0.000 description 3
- 238000002708 random mutagenesis Methods 0.000 description 3
- 230000001177 retroviral effect Effects 0.000 description 3
- 210000000952 spleen Anatomy 0.000 description 3
- 238000010561 standard procedure Methods 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 230000001225 therapeutic effect Effects 0.000 description 3
- 108700026220 vif Genes Proteins 0.000 description 3
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 2
- 108020005065 3' Flanking Region Proteins 0.000 description 2
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- 108020005029 5' Flanking Region Proteins 0.000 description 2
- 239000013607 AAV vector Substances 0.000 description 2
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 2
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 2
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 2
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 2
- WJRXVTCKASUIFF-FXQIFTODSA-N Ala-Cys-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WJRXVTCKASUIFF-FXQIFTODSA-N 0.000 description 2
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 2
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 2
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 2
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 2
- UCDOXFBTMLKASE-HERUPUMHSA-N Ala-Ser-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N UCDOXFBTMLKASE-HERUPUMHSA-N 0.000 description 2
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 2
- DQNLFLGFZAUIOW-FXQIFTODSA-N Arg-Cys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DQNLFLGFZAUIOW-FXQIFTODSA-N 0.000 description 2
- HJAICMSAKODKRF-GUBZILKMSA-N Arg-Cys-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O HJAICMSAKODKRF-GUBZILKMSA-N 0.000 description 2
- JVMKBJNSRZWDBO-FXQIFTODSA-N Arg-Cys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O JVMKBJNSRZWDBO-FXQIFTODSA-N 0.000 description 2
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 2
- PPPXVIBMLFWNSK-BQBZGAKWSA-N Arg-Gly-Cys Chemical compound C(C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N PPPXVIBMLFWNSK-BQBZGAKWSA-N 0.000 description 2
- OKKMBOSPBDASEP-CYDGBPFRSA-N Arg-Ile-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O OKKMBOSPBDASEP-CYDGBPFRSA-N 0.000 description 2
- YKZJPIPFKGYHKY-DCAQKATOSA-N Arg-Leu-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKZJPIPFKGYHKY-DCAQKATOSA-N 0.000 description 2
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 2
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 2
- FOQFHANLUJDQEE-GUBZILKMSA-N Arg-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CS)C(=O)O FOQFHANLUJDQEE-GUBZILKMSA-N 0.000 description 2
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 2
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 2
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 2
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 2
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 2
- ZRAOLTNMSCSCLN-ZLUOBGJFSA-N Asp-Cys-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)O ZRAOLTNMSCSCLN-ZLUOBGJFSA-N 0.000 description 2
- MJKBOVWWADWLHV-ZLUOBGJFSA-N Asp-Cys-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)C(=O)O MJKBOVWWADWLHV-ZLUOBGJFSA-N 0.000 description 2
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 2
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 2
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 2
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 2
- KRQFMDNIUOVRIF-KKUMJFAQSA-N Asp-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N KRQFMDNIUOVRIF-KKUMJFAQSA-N 0.000 description 2
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 2
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 2
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 2
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 108020004635 Complementary DNA Proteins 0.000 description 2
- 108010062580 Concanavalin A Proteins 0.000 description 2
- KIQKJXYVGSYDFS-ZLUOBGJFSA-N Cys-Asn-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KIQKJXYVGSYDFS-ZLUOBGJFSA-N 0.000 description 2
- SBMGKDLRJLYZCU-BIIVOSGPSA-N Cys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N)C(=O)O SBMGKDLRJLYZCU-BIIVOSGPSA-N 0.000 description 2
- TULNGKSILXCZQT-IMJSIDKUSA-N Cys-Asp Chemical compound SC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O TULNGKSILXCZQT-IMJSIDKUSA-N 0.000 description 2
- OLIYIKRCOZBFCW-ZLUOBGJFSA-N Cys-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)C(=O)O OLIYIKRCOZBFCW-ZLUOBGJFSA-N 0.000 description 2
- BIVLWXQGXJLGKG-BIIVOSGPSA-N Cys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)C(=O)O BIVLWXQGXJLGKG-BIIVOSGPSA-N 0.000 description 2
- PORWNQWEEIOIRH-XHNCKOQMSA-N Cys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N)C(=O)O PORWNQWEEIOIRH-XHNCKOQMSA-N 0.000 description 2
- FIADUEYFRSCCIK-CIUDSAMLSA-N Cys-Glu-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIADUEYFRSCCIK-CIUDSAMLSA-N 0.000 description 2
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 2
- ODDOYXKAHLKKQY-MMWGEVLESA-N Cys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N ODDOYXKAHLKKQY-MMWGEVLESA-N 0.000 description 2
- DYBIDOHFRRUMLW-CIUDSAMLSA-N Cys-Leu-Cys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O DYBIDOHFRRUMLW-CIUDSAMLSA-N 0.000 description 2
- HKALUUKHYNEDRS-GUBZILKMSA-N Cys-Leu-Gln Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HKALUUKHYNEDRS-GUBZILKMSA-N 0.000 description 2
- XMVZMBGFIOQONW-GARJFASQSA-N Cys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)C(=O)O XMVZMBGFIOQONW-GARJFASQSA-N 0.000 description 2
- BCFXQBXXDSEHRS-FXQIFTODSA-N Cys-Ser-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BCFXQBXXDSEHRS-FXQIFTODSA-N 0.000 description 2
- FTTZLFIEUQHLHH-BWBBJGPYSA-N Cys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O FTTZLFIEUQHLHH-BWBBJGPYSA-N 0.000 description 2
- IRDBEBCCTCNXGZ-AVGNSLFASA-N Cys-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)O IRDBEBCCTCNXGZ-AVGNSLFASA-N 0.000 description 2
- LPBUBIHAVKXUOT-FXQIFTODSA-N Cys-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N LPBUBIHAVKXUOT-FXQIFTODSA-N 0.000 description 2
- AHLPHDHHMVZTML-SCSAIBSYSA-N D-Ornithine Chemical compound NCCC[C@@H](N)C(O)=O AHLPHDHHMVZTML-SCSAIBSYSA-N 0.000 description 2
- ONIBWKKTOPOVIA-SCSAIBSYSA-N D-Proline Chemical compound OC(=O)[C@H]1CCCN1 ONIBWKKTOPOVIA-SCSAIBSYSA-N 0.000 description 2
- MTCFGRXMJLQNBG-UWTATZPHSA-N D-Serine Chemical compound OC[C@@H](N)C(O)=O MTCFGRXMJLQNBG-UWTATZPHSA-N 0.000 description 2
- QNAYBMKLOCPYGJ-UWTATZPHSA-N D-alanine Chemical compound C[C@@H](N)C(O)=O QNAYBMKLOCPYGJ-UWTATZPHSA-N 0.000 description 2
- ODKSFYDXXFIFQN-SCSAIBSYSA-N D-arginine Chemical compound OC(=O)[C@H](N)CCCNC(N)=N ODKSFYDXXFIFQN-SCSAIBSYSA-N 0.000 description 2
- HNDVDQJCIGZPNO-RXMQYKEDSA-N D-histidine Chemical compound OC(=O)[C@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-RXMQYKEDSA-N 0.000 description 2
- KDXKERNSBIXSRK-RXMQYKEDSA-N D-lysine Chemical compound NCCCC[C@@H](N)C(O)=O KDXKERNSBIXSRK-RXMQYKEDSA-N 0.000 description 2
- COLNVLDHVKWLRT-MRVPVSSYSA-N D-phenylalanine Chemical compound OC(=O)[C@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-MRVPVSSYSA-N 0.000 description 2
- 230000004568 DNA-binding Effects 0.000 description 2
- 235000017274 Diospyros sandwicensis Nutrition 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 2
- QYTKAVBFRUGYAU-ACZMJKKPSA-N Gln-Asp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QYTKAVBFRUGYAU-ACZMJKKPSA-N 0.000 description 2
- ALUBSZXSNSPDQV-WDSKDSINSA-N Gln-Cys-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ALUBSZXSNSPDQV-WDSKDSINSA-N 0.000 description 2
- VNCLJDOTEPPBBD-GUBZILKMSA-N Gln-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N VNCLJDOTEPPBBD-GUBZILKMSA-N 0.000 description 2
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 2
- HHQCBFGKQDMWSP-GUBZILKMSA-N Gln-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HHQCBFGKQDMWSP-GUBZILKMSA-N 0.000 description 2
- HHRAEXBUNGTOGZ-IHRRRGAJSA-N Gln-Phe-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O HHRAEXBUNGTOGZ-IHRRRGAJSA-N 0.000 description 2
- UKKNTTCNGZLJEX-WHFBIAKZSA-N Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(O)=O UKKNTTCNGZLJEX-WHFBIAKZSA-N 0.000 description 2
- MFHVAWMMKZBSRQ-ACZMJKKPSA-N Gln-Ser-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N MFHVAWMMKZBSRQ-ACZMJKKPSA-N 0.000 description 2
- QXQDADBVIBLBHN-FHWLQOOXSA-N Gln-Tyr-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QXQDADBVIBLBHN-FHWLQOOXSA-N 0.000 description 2
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 2
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 2
- SBYVDRJAXWSXQL-AVGNSLFASA-N Glu-Asn-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SBYVDRJAXWSXQL-AVGNSLFASA-N 0.000 description 2
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 2
- RQNYYRHRKSVKAB-GUBZILKMSA-N Glu-Cys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O RQNYYRHRKSVKAB-GUBZILKMSA-N 0.000 description 2
- KVBPDJIFRQUQFY-ACZMJKKPSA-N Glu-Cys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O KVBPDJIFRQUQFY-ACZMJKKPSA-N 0.000 description 2
- UENPHLAAKDPZQY-XKBZYTNZSA-N Glu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)O UENPHLAAKDPZQY-XKBZYTNZSA-N 0.000 description 2
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 2
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical group OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 2
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 2
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 2
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 2
- 102000005720 Glutathione transferase Human genes 0.000 description 2
- 108010070675 Glutathione transferase Proteins 0.000 description 2
- GQGAFTPXAPKSCF-WHFBIAKZSA-N Gly-Ala-Cys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O GQGAFTPXAPKSCF-WHFBIAKZSA-N 0.000 description 2
- FKJQNJCQTKUBCD-XPUUQOCRSA-N Gly-Ala-His Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O FKJQNJCQTKUBCD-XPUUQOCRSA-N 0.000 description 2
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 2
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 2
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 2
- DUYYPIRFTLOAJQ-YUMQZZPRSA-N Gly-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN DUYYPIRFTLOAJQ-YUMQZZPRSA-N 0.000 description 2
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 2
- YYQGVXNKAXUTJU-YUMQZZPRSA-N Gly-Cys-His Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O YYQGVXNKAXUTJU-YUMQZZPRSA-N 0.000 description 2
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 2
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 2
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 2
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 2
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 2
- 239000004471 Glycine Substances 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- DZMVESFTHXSSPZ-XVYDVKMFSA-N His-Ala-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DZMVESFTHXSSPZ-XVYDVKMFSA-N 0.000 description 2
- CTCFZNBRZBNKAX-YUMQZZPRSA-N His-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 CTCFZNBRZBNKAX-YUMQZZPRSA-N 0.000 description 2
- HBGKOLSGLYMWSW-DCAQKATOSA-N His-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CS)C(=O)O HBGKOLSGLYMWSW-DCAQKATOSA-N 0.000 description 2
- STGQSBKUYSPPIG-CIUDSAMLSA-N His-Ser-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 STGQSBKUYSPPIG-CIUDSAMLSA-N 0.000 description 2
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 2
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 2
- VISRCHQHQCLODA-NAKRPEOUSA-N Ile-Pro-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N VISRCHQHQCLODA-NAKRPEOUSA-N 0.000 description 2
- WTDRDQBEARUVNC-LURJTMIESA-N L-DOPA Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C(O)=C1 WTDRDQBEARUVNC-LURJTMIESA-N 0.000 description 2
- WTDRDQBEARUVNC-UHFFFAOYSA-N L-Dopa Natural products OC(=O)C(N)CC1=CC=C(O)C(O)=C1 WTDRDQBEARUVNC-UHFFFAOYSA-N 0.000 description 2
- QEFRNWWLZKMPFJ-YGVKFDHGSA-N L-methionine S-oxide Chemical compound CS(=O)CC[C@H](N)C(O)=O QEFRNWWLZKMPFJ-YGVKFDHGSA-N 0.000 description 2
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 2
- 241000282838 Lama Species 0.000 description 2
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 2
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 2
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 description 2
- NHHKSOGJYNQENP-SRVKXCTJSA-N Leu-Cys-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N NHHKSOGJYNQENP-SRVKXCTJSA-N 0.000 description 2
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 2
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 2
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 2
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 2
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 2
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 2
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 2
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 2
- BTNXKBVLWJBTNR-SRVKXCTJSA-N Leu-His-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O BTNXKBVLWJBTNR-SRVKXCTJSA-N 0.000 description 2
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 2
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 2
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 2
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 2
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 2
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 2
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 2
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 2
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 2
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 2
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 2
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 2
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 2
- QKXZCUCBFPEXNK-KKUMJFAQSA-N Lys-Leu-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QKXZCUCBFPEXNK-KKUMJFAQSA-N 0.000 description 2
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 2
- FPQMQEOVSKMVMA-ACRUOGEOSA-N Lys-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCCCN)N)O FPQMQEOVSKMVMA-ACRUOGEOSA-N 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- MVBZBRKNZVJEKK-DTWKUNHWSA-N Met-Gly-Pro Chemical compound CSCC[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N MVBZBRKNZVJEKK-DTWKUNHWSA-N 0.000 description 2
- ZGVYWHODYWRPLK-GUBZILKMSA-N Met-Pro-Cys Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(O)=O ZGVYWHODYWRPLK-GUBZILKMSA-N 0.000 description 2
- 241000699670 Mus sp. Species 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 101100386053 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) cys-3 gene Proteins 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 2
- ILGCZYGFYQLSDZ-KKUMJFAQSA-N Phe-Ser-His Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ILGCZYGFYQLSDZ-KKUMJFAQSA-N 0.000 description 2
- 108010039918 Polylysine Proteins 0.000 description 2
- 241000288906 Primates Species 0.000 description 2
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 2
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 2
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 2
- JARJPEMLQAWNBR-GUBZILKMSA-N Pro-Asp-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JARJPEMLQAWNBR-GUBZILKMSA-N 0.000 description 2
- FKKHDBFNOLCYQM-FXQIFTODSA-N Pro-Cys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O FKKHDBFNOLCYQM-FXQIFTODSA-N 0.000 description 2
- DIZLUAZLNDFDPR-CIUDSAMLSA-N Pro-Cys-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 DIZLUAZLNDFDPR-CIUDSAMLSA-N 0.000 description 2
- OGRYXQOUFHAMPI-DCAQKATOSA-N Pro-Cys-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O OGRYXQOUFHAMPI-DCAQKATOSA-N 0.000 description 2
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 2
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 2
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 2
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 2
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 2
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 2
- 108010029485 Protein Isoforms Proteins 0.000 description 2
- 102000001708 Protein Isoforms Human genes 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- 101800001440 Rimorphin Proteins 0.000 description 2
- JJKSSJVYOVRJMZ-FXQIFTODSA-N Ser-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)CN=C(N)N JJKSSJVYOVRJMZ-FXQIFTODSA-N 0.000 description 2
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 2
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 2
- XSYJDGIDKRNWFX-SRVKXCTJSA-N Ser-Cys-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XSYJDGIDKRNWFX-SRVKXCTJSA-N 0.000 description 2
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 2
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 2
- GYXVUTAOICLGKJ-ACZMJKKPSA-N Ser-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N GYXVUTAOICLGKJ-ACZMJKKPSA-N 0.000 description 2
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 2
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 2
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 2
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 2
- LOKXAXAESFYFAX-CIUDSAMLSA-N Ser-His-Cys Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CS)C(O)=O)CC1=CN=CN1 LOKXAXAESFYFAX-CIUDSAMLSA-N 0.000 description 2
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 2
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 2
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 2
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 2
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 2
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 2
- 108020004682 Single-Stranded DNA Proteins 0.000 description 2
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 2
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 2
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 2
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 2
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 2
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 2
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 2
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 2
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- 208000031737 Tissue Adhesions Diseases 0.000 description 2
- ADBFWLXCCKIXBQ-XIRDDKMYSA-N Trp-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ADBFWLXCCKIXBQ-XIRDDKMYSA-N 0.000 description 2
- WVAKXMOGMWLWHK-VJBMBRPKSA-N Trp-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N WVAKXMOGMWLWHK-VJBMBRPKSA-N 0.000 description 2
- 108090000631 Trypsin Proteins 0.000 description 2
- 102000004142 Trypsin Human genes 0.000 description 2
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 2
- XMNDQSYABVWZRK-BZSNNMDCSA-N Tyr-Asn-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XMNDQSYABVWZRK-BZSNNMDCSA-N 0.000 description 2
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 2
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- 208000014769 Usher Syndromes Diseases 0.000 description 2
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 2
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 2
- KOPBYUSPXBQIHD-NRPADANISA-N Val-Cys-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KOPBYUSPXBQIHD-NRPADANISA-N 0.000 description 2
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 2
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 2
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 2
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 2
- 108700005077 Viral Genes Proteins 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 230000003213 activating effect Effects 0.000 description 2
- 238000007792 addition Methods 0.000 description 2
- 239000011543 agarose gel Substances 0.000 description 2
- 108010005233 alanylglutamic acid Proteins 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 2
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 2
- 235000011130 ammonium sulphate Nutrition 0.000 description 2
- 210000004102 animal cell Anatomy 0.000 description 2
- 230000005875 antibody response Effects 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 235000009582 asparagine Nutrition 0.000 description 2
- 229960001230 asparagine Drugs 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 235000003704 aspartic acid Nutrition 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 230000006470 autoimmune attack Effects 0.000 description 2
- 108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 description 2
- 230000037429 base substitution Effects 0.000 description 2
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 2
- 210000004556 brain Anatomy 0.000 description 2
- 238000012219 cassette mutagenesis Methods 0.000 description 2
- 230000036755 cellular response Effects 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 2
- 230000002759 chromosomal effect Effects 0.000 description 2
- 230000008711 chromosomal rearrangement Effects 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 2
- 108010069495 cysteinyltyrosine Proteins 0.000 description 2
- 210000000805 cytoplasm Anatomy 0.000 description 2
- 108010054813 diprotin B Proteins 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 238000010828 elution Methods 0.000 description 2
- 210000002919 epithelial cell Anatomy 0.000 description 2
- 210000001508 eye Anatomy 0.000 description 2
- 239000007850 fluorescent dye Substances 0.000 description 2
- 235000013922 glutamic acid Nutrition 0.000 description 2
- 239000004220 glutamic acid Substances 0.000 description 2
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 2
- 108010023364 glycyl-histidyl-arginine Proteins 0.000 description 2
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 230000008348 humoral response Effects 0.000 description 2
- 230000008004 immune attack Effects 0.000 description 2
- 230000002458 infectious effect Effects 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 230000005764 inhibitory process Effects 0.000 description 2
- 208000014674 injury Diseases 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 239000002502 liposome Substances 0.000 description 2
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 229920002521 macromolecule Polymers 0.000 description 2
- 210000004962 mammalian cell Anatomy 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 210000004779 membrane envelope Anatomy 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 210000000653 nervous system Anatomy 0.000 description 2
- 238000007857 nested PCR Methods 0.000 description 2
- 230000007935 neutral effect Effects 0.000 description 2
- 238000004091 panning Methods 0.000 description 2
- 239000012071 phase Substances 0.000 description 2
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 2
- 230000003169 placental effect Effects 0.000 description 2
- 229920000656 polylysine Polymers 0.000 description 2
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 238000000734 protein sequencing Methods 0.000 description 2
- 230000002797 proteolythic effect Effects 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 230000003362 replicative effect Effects 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 210000001525 retina Anatomy 0.000 description 2
- 201000008610 retinitis pigmentosa-deafness syndrome Diseases 0.000 description 2
- 238000003757 reverse transcription PCR Methods 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 210000002027 skeletal muscle Anatomy 0.000 description 2
- 208000017520 skin disease Diseases 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 239000007858 starting material Substances 0.000 description 2
- 210000001550 testis Anatomy 0.000 description 2
- 230000025366 tissue development Effects 0.000 description 2
- 230000008467 tissue growth Effects 0.000 description 2
- 230000017423 tissue regeneration Effects 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- 230000005945 translocation Effects 0.000 description 2
- 108010038745 tryptophylglycine Proteins 0.000 description 2
- 208000034373 type A muscular dystrophy-dystroglycanopathy Diseases 0.000 description 2
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 2
- SMWADGDVGCZIGK-AXDSSHIGSA-N (2s)-5-phenylpyrrolidine-2-carboxylic acid Chemical compound N1[C@H](C(=O)O)CCC1C1=CC=CC=C1 SMWADGDVGCZIGK-AXDSSHIGSA-N 0.000 description 1
- JWBYADXJYCNKIE-SYKZBELTSA-N (2s)-5-phenylpyrrolidine-2-carboxylic acid;(2s)-pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1.N1[C@H](C(=O)O)CCC1C1=CC=CC=C1 JWBYADXJYCNKIE-SYKZBELTSA-N 0.000 description 1
- AGTSSZRZBSNTGQ-ITZCFHCWSA-N (2s,3r)-2-[[(2s)-2-[[(2s)-2-[[(2s)-6-amino-2-[[(2s)-2-[[(2s)-5-amino-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[2-[[2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]acetyl]amino]acetyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]-5-(diaminomet Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)NC(=O)CNC(=O)CNC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=CC=C1 AGTSSZRZBSNTGQ-ITZCFHCWSA-N 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- SVUOLADPCWQTTE-UHFFFAOYSA-N 1h-1,2-benzodiazepine Chemical compound N1N=CC=CC2=CC=CC=C12 SVUOLADPCWQTTE-UHFFFAOYSA-N 0.000 description 1
- 101150033839 4 gene Proteins 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- SBGXWWCLHIOABR-UHFFFAOYSA-N Ala Ala Gly Ala Chemical compound CC(N)C(=O)NC(C)C(=O)NCC(=O)NC(C)C(O)=O SBGXWWCLHIOABR-UHFFFAOYSA-N 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 1
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- IMMKUCQIKKXKNP-DCAQKATOSA-N Ala-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCN=C(N)N IMMKUCQIKKXKNP-DCAQKATOSA-N 0.000 description 1
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 1
- YBPLKDWJFYCZSV-ZLUOBGJFSA-N Ala-Asn-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N YBPLKDWJFYCZSV-ZLUOBGJFSA-N 0.000 description 1
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 1
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 1
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 1
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 1
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 1
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 1
- NFDVJAKFMXHJEQ-HERUPUMHSA-N Ala-Asp-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N NFDVJAKFMXHJEQ-HERUPUMHSA-N 0.000 description 1
- DAEFQZCYZKRTLR-ZLUOBGJFSA-N Ala-Cys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O DAEFQZCYZKRTLR-ZLUOBGJFSA-N 0.000 description 1
- HWPXGQCMZITGFN-XVYDVKMFSA-N Ala-Cys-His Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HWPXGQCMZITGFN-XVYDVKMFSA-N 0.000 description 1
- CXZFXHGJJPVUJE-CIUDSAMLSA-N Ala-Cys-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O)N CXZFXHGJJPVUJE-CIUDSAMLSA-N 0.000 description 1
- XAGIMRPOEJSYER-CIUDSAMLSA-N Ala-Cys-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XAGIMRPOEJSYER-CIUDSAMLSA-N 0.000 description 1
- VIGKUFXFTPWYER-BIIVOSGPSA-N Ala-Cys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N VIGKUFXFTPWYER-BIIVOSGPSA-N 0.000 description 1
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 1
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 1
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 1
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- RUQBGIMJQUWXPP-CYDGBPFRSA-N Ala-Leu-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O RUQBGIMJQUWXPP-CYDGBPFRSA-N 0.000 description 1
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 1
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- MLNSNVLOEIYJIU-ZUDIRPEPSA-N Ala-Leu-Thr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLNSNVLOEIYJIU-ZUDIRPEPSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 1
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 1
- BDQNLQSWRAPHGU-DLOVCJGASA-N Ala-Phe-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N BDQNLQSWRAPHGU-DLOVCJGASA-N 0.000 description 1
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 1
- JNLDTVRGXMSYJC-UVBJJODRSA-N Ala-Pro-Trp Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O JNLDTVRGXMSYJC-UVBJJODRSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 1
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- AETQNIIFKCMVHP-UVBJJODRSA-N Ala-Trp-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AETQNIIFKCMVHP-UVBJJODRSA-N 0.000 description 1
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 1
- 108020005544 Antisense RNA Proteins 0.000 description 1
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- WESHVRNMNFMVBE-FXQIFTODSA-N Arg-Asn-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N WESHVRNMNFMVBE-FXQIFTODSA-N 0.000 description 1
- RCAUJZASOAFTAJ-FXQIFTODSA-N Arg-Asp-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N RCAUJZASOAFTAJ-FXQIFTODSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- JTWOBPNAVBESFW-FXQIFTODSA-N Arg-Cys-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N JTWOBPNAVBESFW-FXQIFTODSA-N 0.000 description 1
- QIWYWCYNUMJBTC-CIUDSAMLSA-N Arg-Cys-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QIWYWCYNUMJBTC-CIUDSAMLSA-N 0.000 description 1
- GDVDRMUYICMNFJ-CIUDSAMLSA-N Arg-Cys-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O GDVDRMUYICMNFJ-CIUDSAMLSA-N 0.000 description 1
- OANWAFQRNQEDSY-DCAQKATOSA-N Arg-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N OANWAFQRNQEDSY-DCAQKATOSA-N 0.000 description 1
- QQJSJIBESHAJPM-IHRRRGAJSA-N Arg-Cys-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QQJSJIBESHAJPM-IHRRRGAJSA-N 0.000 description 1
- SNBHMYQRNCJSOJ-CIUDSAMLSA-N Arg-Gln-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SNBHMYQRNCJSOJ-CIUDSAMLSA-N 0.000 description 1
- VDBKFYYIBLXEIF-GUBZILKMSA-N Arg-Gln-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VDBKFYYIBLXEIF-GUBZILKMSA-N 0.000 description 1
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 1
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 1
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 1
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- MMGCRPZQZWTZTA-IHRRRGAJSA-N Arg-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N MMGCRPZQZWTZTA-IHRRRGAJSA-N 0.000 description 1
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 1
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 1
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 1
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 1
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 1
- GIMTZGADWZTZGV-DCAQKATOSA-N Arg-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GIMTZGADWZTZGV-DCAQKATOSA-N 0.000 description 1
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 1
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 1
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 1
- LXMKTIZAGIBQRX-HRCADAONSA-N Arg-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O LXMKTIZAGIBQRX-HRCADAONSA-N 0.000 description 1
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 1
- FVBZXNSRIDVYJS-AVGNSLFASA-N Arg-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N FVBZXNSRIDVYJS-AVGNSLFASA-N 0.000 description 1
- QHVRVUNEAIFTEK-SZMVWBNQSA-N Arg-Pro-Trp Chemical compound N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O QHVRVUNEAIFTEK-SZMVWBNQSA-N 0.000 description 1
- IJYZHIOOBGIINM-WDSKDSINSA-N Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N IJYZHIOOBGIINM-WDSKDSINSA-N 0.000 description 1
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 1
- AMIQZQAAYGYKOP-FXQIFTODSA-N Arg-Ser-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O AMIQZQAAYGYKOP-FXQIFTODSA-N 0.000 description 1
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 1
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 1
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 1
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 1
- LFWOQHSQNCKXRU-UFYCRDLUSA-N Arg-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 LFWOQHSQNCKXRU-UFYCRDLUSA-N 0.000 description 1
- JWCCFNZJIRZUCL-AVGNSLFASA-N Arg-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N JWCCFNZJIRZUCL-AVGNSLFASA-N 0.000 description 1
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 1
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 1
- TWVTVZUGEDBAJF-ACZMJKKPSA-N Asn-Cys-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N TWVTVZUGEDBAJF-ACZMJKKPSA-N 0.000 description 1
- ZPMNECSEJXXNBE-CIUDSAMLSA-N Asn-Cys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ZPMNECSEJXXNBE-CIUDSAMLSA-N 0.000 description 1
- QCWJKJLNCFEVPQ-WHFBIAKZSA-N Asn-Gln Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O QCWJKJLNCFEVPQ-WHFBIAKZSA-N 0.000 description 1
- SJPZTWAYTJPPBI-GUBZILKMSA-N Asn-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SJPZTWAYTJPPBI-GUBZILKMSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- OKZOABJQOMAYEC-NUMRIWBASA-N Asn-Gln-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OKZOABJQOMAYEC-NUMRIWBASA-N 0.000 description 1
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 1
- QYXNFROWLZPWPC-FXQIFTODSA-N Asn-Glu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QYXNFROWLZPWPC-FXQIFTODSA-N 0.000 description 1
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 1
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 1
- GWNMUVANAWDZTI-YUMQZZPRSA-N Asn-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N GWNMUVANAWDZTI-YUMQZZPRSA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- WQLJRNRLHWJIRW-KKUMJFAQSA-N Asn-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)O WQLJRNRLHWJIRW-KKUMJFAQSA-N 0.000 description 1
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 1
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- NUCUBYIUPVYGPP-XIRDDKMYSA-N Asn-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CC(N)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O NUCUBYIUPVYGPP-XIRDDKMYSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- SZNGQSBRHFMZLT-IHRRRGAJSA-N Asn-Pro-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SZNGQSBRHFMZLT-IHRRRGAJSA-N 0.000 description 1
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 1
- PIABYSIYPGLLDQ-XVSYOHENSA-N Asn-Thr-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PIABYSIYPGLLDQ-XVSYOHENSA-N 0.000 description 1
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 1
- ATHZHGQSAIJHQU-XIRDDKMYSA-N Asn-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ATHZHGQSAIJHQU-XIRDDKMYSA-N 0.000 description 1
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 1
- MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 1
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 1
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- AKPLMZMNJGNUKT-ZLUOBGJFSA-N Asp-Asp-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O AKPLMZMNJGNUKT-ZLUOBGJFSA-N 0.000 description 1
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 1
- AMRANMVXQWXNAH-ZLUOBGJFSA-N Asp-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC(O)=O AMRANMVXQWXNAH-ZLUOBGJFSA-N 0.000 description 1
- APYNREQHZOGYHV-ACZMJKKPSA-N Asp-Cys-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N APYNREQHZOGYHV-ACZMJKKPSA-N 0.000 description 1
- QQXOYLWJQUPXJU-WHFBIAKZSA-N Asp-Cys-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O QQXOYLWJQUPXJU-WHFBIAKZSA-N 0.000 description 1
- RSMIHCFQDCVVBR-CIUDSAMLSA-N Asp-Gln-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RSMIHCFQDCVVBR-CIUDSAMLSA-N 0.000 description 1
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 1
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 description 1
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 1
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 1
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 1
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 1
- LDGUZSIPGSPBJP-XVYDVKMFSA-N Asp-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N LDGUZSIPGSPBJP-XVYDVKMFSA-N 0.000 description 1
- RKNIUWSZIAUEPK-PBCZWWQYSA-N Asp-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N)O RKNIUWSZIAUEPK-PBCZWWQYSA-N 0.000 description 1
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- YVHGKXAOSVBGJV-CIUDSAMLSA-N Asp-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N YVHGKXAOSVBGJV-CIUDSAMLSA-N 0.000 description 1
- RXBGWGRSWXOBGK-KKUMJFAQSA-N Asp-Lys-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RXBGWGRSWXOBGK-KKUMJFAQSA-N 0.000 description 1
- LKVKODXGSAFOFY-VEVYYDQMSA-N Asp-Met-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LKVKODXGSAFOFY-VEVYYDQMSA-N 0.000 description 1
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 1
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 1
- BKOIIURTQAJHAT-GUBZILKMSA-N Asp-Pro-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 BKOIIURTQAJHAT-GUBZILKMSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 1
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 1
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 1
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 1
- LEYKQPDPZJIRTA-AQZXSJQPSA-N Asp-Trp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LEYKQPDPZJIRTA-AQZXSJQPSA-N 0.000 description 1
- CPMKYMGGYUFOHS-FSPLSTOPSA-N Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O CPMKYMGGYUFOHS-FSPLSTOPSA-N 0.000 description 1
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 1
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 1
- JGLWFWXGOINXEA-YDHLFZDLSA-N Asp-Val-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JGLWFWXGOINXEA-YDHLFZDLSA-N 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 102100021257 Beta-secretase 1 Human genes 0.000 description 1
- 101710150192 Beta-secretase 1 Proteins 0.000 description 1
- 208000014644 Brain disease Diseases 0.000 description 1
- 108091028026 C-DNA Proteins 0.000 description 1
- 101100505076 Caenorhabditis elegans gly-2 gene Proteins 0.000 description 1
- 101100505161 Caenorhabditis elegans mel-32 gene Proteins 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 235000002937 Clintonia borealis Nutrition 0.000 description 1
- 240000002814 Clintonia borealis Species 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 1
- QFMCHXSGIZPBKG-ZLUOBGJFSA-N Cys-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N QFMCHXSGIZPBKG-ZLUOBGJFSA-N 0.000 description 1
- CVOZXIPULQQFNY-ZLUOBGJFSA-N Cys-Ala-Cys Chemical compound C[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O CVOZXIPULQQFNY-ZLUOBGJFSA-N 0.000 description 1
- SZQCDCKIGWQAQN-FXQIFTODSA-N Cys-Arg-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O SZQCDCKIGWQAQN-FXQIFTODSA-N 0.000 description 1
- UKVGHFORADMBEN-GUBZILKMSA-N Cys-Arg-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UKVGHFORADMBEN-GUBZILKMSA-N 0.000 description 1
- GEEXORWTBTUOHC-FXQIFTODSA-N Cys-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N GEEXORWTBTUOHC-FXQIFTODSA-N 0.000 description 1
- OIMUAKUQOUEPCZ-WHFBIAKZSA-N Cys-Asn-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIMUAKUQOUEPCZ-WHFBIAKZSA-N 0.000 description 1
- CPTUXCUWQIBZIF-ZLUOBGJFSA-N Cys-Asn-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CPTUXCUWQIBZIF-ZLUOBGJFSA-N 0.000 description 1
- VNLYIYOYUNGURO-ZLUOBGJFSA-N Cys-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N VNLYIYOYUNGURO-ZLUOBGJFSA-N 0.000 description 1
- FWYBFUDWUUFLDN-FXQIFTODSA-N Cys-Asp-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N FWYBFUDWUUFLDN-FXQIFTODSA-N 0.000 description 1
- KIHRUISMQZVCNO-ZLUOBGJFSA-N Cys-Asp-Asp Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KIHRUISMQZVCNO-ZLUOBGJFSA-N 0.000 description 1
- FEJCUYOGOBCFOQ-ACZMJKKPSA-N Cys-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N FEJCUYOGOBCFOQ-ACZMJKKPSA-N 0.000 description 1
- YZFCGHIBLBDZDA-ZLUOBGJFSA-N Cys-Asp-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YZFCGHIBLBDZDA-ZLUOBGJFSA-N 0.000 description 1
- GGIHYKLJUIZYGH-ZLUOBGJFSA-N Cys-Cys-Asp Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N)C(=O)O GGIHYKLJUIZYGH-ZLUOBGJFSA-N 0.000 description 1
- ZVNFONSZVUBRAV-CIUDSAMLSA-N Cys-Gln-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N)CN=C(N)N ZVNFONSZVUBRAV-CIUDSAMLSA-N 0.000 description 1
- BCSYBBMFGLHCOA-ACZMJKKPSA-N Cys-Glu-Cys Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BCSYBBMFGLHCOA-ACZMJKKPSA-N 0.000 description 1
- UDPSLLFHOLGXBY-FXQIFTODSA-N Cys-Glu-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDPSLLFHOLGXBY-FXQIFTODSA-N 0.000 description 1
- LBOLGUYQEPZSKM-YUMQZZPRSA-N Cys-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N LBOLGUYQEPZSKM-YUMQZZPRSA-N 0.000 description 1
- XVLMKWWVBNESPX-XVYDVKMFSA-N Cys-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N XVLMKWWVBNESPX-XVYDVKMFSA-N 0.000 description 1
- SBDVXRYCOIEYNV-YUMQZZPRSA-N Cys-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N SBDVXRYCOIEYNV-YUMQZZPRSA-N 0.000 description 1
- RRJOQIBQVZDVCW-SRVKXCTJSA-N Cys-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N RRJOQIBQVZDVCW-SRVKXCTJSA-N 0.000 description 1
- XIZWKXATMJODQW-KKUMJFAQSA-N Cys-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CS)N XIZWKXATMJODQW-KKUMJFAQSA-N 0.000 description 1
- DIUBVGXMXONJCF-KKUMJFAQSA-N Cys-His-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DIUBVGXMXONJCF-KKUMJFAQSA-N 0.000 description 1
- DVIHGGUODLILFN-GHCJXIJMSA-N Cys-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N DVIHGGUODLILFN-GHCJXIJMSA-N 0.000 description 1
- OXFOKRAFNYSREH-BJDJZHNGSA-N Cys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N OXFOKRAFNYSREH-BJDJZHNGSA-N 0.000 description 1
- XLLSMEFANRROJE-GUBZILKMSA-N Cys-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N XLLSMEFANRROJE-GUBZILKMSA-N 0.000 description 1
- HBHMVBGGHDMPBF-GARJFASQSA-N Cys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N HBHMVBGGHDMPBF-GARJFASQSA-N 0.000 description 1
- VDUPGIDTWNQAJD-CIUDSAMLSA-N Cys-Lys-Cys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O VDUPGIDTWNQAJD-CIUDSAMLSA-N 0.000 description 1
- BNCKELUXXUYRNY-GUBZILKMSA-N Cys-Lys-Glu Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BNCKELUXXUYRNY-GUBZILKMSA-N 0.000 description 1
- XZFYRXDAULDNFX-UWVGGRQHSA-N Cys-Phe Chemical compound SC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UWVGGRQHSA-N 0.000 description 1
- WTEACWBAULENKE-SRVKXCTJSA-N Cys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N WTEACWBAULENKE-SRVKXCTJSA-N 0.000 description 1
- LHJDLVVQRJIURS-SRVKXCTJSA-N Cys-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N LHJDLVVQRJIURS-SRVKXCTJSA-N 0.000 description 1
- ZHCCYSDALWJITB-SRVKXCTJSA-N Cys-Phe-Cys Chemical compound N[C@@H](CS)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(O)=O ZHCCYSDALWJITB-SRVKXCTJSA-N 0.000 description 1
- KSMSFCBQBQPFAD-GUBZILKMSA-N Cys-Pro-Pro Chemical compound SC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 KSMSFCBQBQPFAD-GUBZILKMSA-N 0.000 description 1
- YXQDRIRSAHTJKM-IMJSIDKUSA-N Cys-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(O)=O YXQDRIRSAHTJKM-IMJSIDKUSA-N 0.000 description 1
- LKHMGNHQULEPFY-ACZMJKKPSA-N Cys-Ser-Glu Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O LKHMGNHQULEPFY-ACZMJKKPSA-N 0.000 description 1
- NXQCSPVUPLUTJH-WHFBIAKZSA-N Cys-Ser-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O NXQCSPVUPLUTJH-WHFBIAKZSA-N 0.000 description 1
- SRZZZTMJARUVPI-JBDRJPRFSA-N Cys-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N SRZZZTMJARUVPI-JBDRJPRFSA-N 0.000 description 1
- MWVDDZUTWXFYHL-XKBZYTNZSA-N Cys-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)O MWVDDZUTWXFYHL-XKBZYTNZSA-N 0.000 description 1
- KFYPRIGJTICABD-XGEHTFHBSA-N Cys-Thr-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N)O KFYPRIGJTICABD-XGEHTFHBSA-N 0.000 description 1
- BOMGEMDZTNZESV-QWRGUYRKSA-N Cys-Tyr-Gly Chemical compound SC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 BOMGEMDZTNZESV-QWRGUYRKSA-N 0.000 description 1
- OELDIVRKHTYFNG-WDSKDSINSA-N Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CS OELDIVRKHTYFNG-WDSKDSINSA-N 0.000 description 1
- QQAYIVHVRFJICE-AEJSXWLSSA-N Cys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N QQAYIVHVRFJICE-AEJSXWLSSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-UHFFFAOYSA-N D-alpha-Ala Natural products CC([NH3+])C([O-])=O QNAYBMKLOCPYGJ-UHFFFAOYSA-N 0.000 description 1
- 150000008574 D-amino acids Chemical class 0.000 description 1
- QIVBCDIJIAJPQS-SECBINFHSA-N D-tryptophane Chemical compound C1=CC=C2C(C[C@@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-SECBINFHSA-N 0.000 description 1
- OUYCCCASQSFEME-MRVPVSSYSA-N D-tyrosine Chemical compound OC(=O)[C@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-MRVPVSSYSA-N 0.000 description 1
- GUBGYTABKSRVRQ-WFVLMXAXSA-N DEAE-cellulose Chemical compound OC1C(O)C(O)C(CO)O[C@H]1O[C@@H]1C(CO)OC(O)C(O)C1O GUBGYTABKSRVRQ-WFVLMXAXSA-N 0.000 description 1
- 230000006820 DNA synthesis Effects 0.000 description 1
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 1
- 101710096438 DNA-binding protein Proteins 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 206010011878 Deafness Diseases 0.000 description 1
- 108010016626 Dipeptides Proteins 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 108091029865 Exogenous DNA Proteins 0.000 description 1
- 108010000916 Fimbriae Proteins Proteins 0.000 description 1
- 108010028690 Fish Proteins Proteins 0.000 description 1
- 108010040721 Flagellin Proteins 0.000 description 1
- KRHYYFGTRYWZRS-UHFFFAOYSA-M Fluoride anion Chemical compound [F-] KRHYYFGTRYWZRS-UHFFFAOYSA-M 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- FAQVCWVVIYYWRR-WHFBIAKZSA-N Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O FAQVCWVVIYYWRR-WHFBIAKZSA-N 0.000 description 1
- DTCCMDYODDPHBG-ACZMJKKPSA-N Gln-Ala-Cys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O DTCCMDYODDPHBG-ACZMJKKPSA-N 0.000 description 1
- NNQHEEQNPQYPGL-FXQIFTODSA-N Gln-Ala-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NNQHEEQNPQYPGL-FXQIFTODSA-N 0.000 description 1
- UWZLBXOBVKRUFE-HGNGGELXSA-N Gln-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N UWZLBXOBVKRUFE-HGNGGELXSA-N 0.000 description 1
- OYTPNWYZORARHL-XHNCKOQMSA-N Gln-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N OYTPNWYZORARHL-XHNCKOQMSA-N 0.000 description 1
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 1
- LTLXPHKSQQILNF-CIUDSAMLSA-N Gln-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N LTLXPHKSQQILNF-CIUDSAMLSA-N 0.000 description 1
- JFOKLAPFYCTNHW-SRVKXCTJSA-N Gln-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N JFOKLAPFYCTNHW-SRVKXCTJSA-N 0.000 description 1
- RRYLMJWPWBJFPZ-ACZMJKKPSA-N Gln-Asn-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RRYLMJWPWBJFPZ-ACZMJKKPSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- IPHGBVYWRKCGKG-FXQIFTODSA-N Gln-Cys-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O IPHGBVYWRKCGKG-FXQIFTODSA-N 0.000 description 1
- UVAOVENCIONMJP-GUBZILKMSA-N Gln-Cys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O UVAOVENCIONMJP-GUBZILKMSA-N 0.000 description 1
- MFLMFRZBAJSGHK-ACZMJKKPSA-N Gln-Cys-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N MFLMFRZBAJSGHK-ACZMJKKPSA-N 0.000 description 1
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 1
- RRBLZNIIMHSHQF-FXQIFTODSA-N Gln-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N RRBLZNIIMHSHQF-FXQIFTODSA-N 0.000 description 1
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 description 1
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 1
- LWDGZZGWDMHBOF-FXQIFTODSA-N Gln-Glu-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LWDGZZGWDMHBOF-FXQIFTODSA-N 0.000 description 1
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 1
- DRDSQGHKTLSNEA-GLLZPBPUSA-N Gln-Glu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRDSQGHKTLSNEA-GLLZPBPUSA-N 0.000 description 1
- IKFZXRLDMYWNBU-YUMQZZPRSA-N Gln-Gly-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N IKFZXRLDMYWNBU-YUMQZZPRSA-N 0.000 description 1
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 1
- LVSYIKGMLRHKME-IUCAKERBSA-N Gln-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N LVSYIKGMLRHKME-IUCAKERBSA-N 0.000 description 1
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 1
- YXQCLIVLWCKCRS-RYUDHWBXSA-N Gln-Gly-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N)O YXQCLIVLWCKCRS-RYUDHWBXSA-N 0.000 description 1
- JNEITCMDYWKPIW-GUBZILKMSA-N Gln-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JNEITCMDYWKPIW-GUBZILKMSA-N 0.000 description 1
- GLAPJAHOPFSLKL-SRVKXCTJSA-N Gln-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N GLAPJAHOPFSLKL-SRVKXCTJSA-N 0.000 description 1
- GQZDDFRXSDGUNG-YVNDNENWSA-N Gln-Ile-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O GQZDDFRXSDGUNG-YVNDNENWSA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- ZNTDJIMJKNNSLR-RWRJDSDZSA-N Gln-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZNTDJIMJKNNSLR-RWRJDSDZSA-N 0.000 description 1
- ARPVSMCNIDAQBO-YUMQZZPRSA-N Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ARPVSMCNIDAQBO-YUMQZZPRSA-N 0.000 description 1
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 1
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 1
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 1
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 1
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 1
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 1
- DQLVHRFFBQOWFL-JYJNAYRXSA-N Gln-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)O DQLVHRFFBQOWFL-JYJNAYRXSA-N 0.000 description 1
- YGNPTRVNRUKVLA-DCAQKATOSA-N Gln-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N YGNPTRVNRUKVLA-DCAQKATOSA-N 0.000 description 1
- DSRVQBZAMPGEKU-AVGNSLFASA-N Gln-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DSRVQBZAMPGEKU-AVGNSLFASA-N 0.000 description 1
- OZEQPCDLCDRCGY-SOUVJXGZSA-N Gln-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O OZEQPCDLCDRCGY-SOUVJXGZSA-N 0.000 description 1
- PIUPHASDUFSHTF-CIUDSAMLSA-N Gln-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O PIUPHASDUFSHTF-CIUDSAMLSA-N 0.000 description 1
- NPMFDZGLKBNFOO-SRVKXCTJSA-N Gln-Pro-His Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NPMFDZGLKBNFOO-SRVKXCTJSA-N 0.000 description 1
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 1
- UTOQQOMEJDPDMX-ACZMJKKPSA-N Gln-Ser-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O UTOQQOMEJDPDMX-ACZMJKKPSA-N 0.000 description 1
- RWQCWSGOOOEGPB-FXQIFTODSA-N Gln-Ser-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O RWQCWSGOOOEGPB-FXQIFTODSA-N 0.000 description 1
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 1
- DUGYCMAIAKAQPB-GLLZPBPUSA-N Gln-Thr-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DUGYCMAIAKAQPB-GLLZPBPUSA-N 0.000 description 1
- OUBUHIODTNUUTC-WDCWCFNPSA-N Gln-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OUBUHIODTNUUTC-WDCWCFNPSA-N 0.000 description 1
- OEIDWQHTRYEYGG-QEJZJMRPSA-N Gln-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N OEIDWQHTRYEYGG-QEJZJMRPSA-N 0.000 description 1
- UBRQJXFDVZNYJP-AVGNSLFASA-N Gln-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UBRQJXFDVZNYJP-AVGNSLFASA-N 0.000 description 1
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 1
- MPZWMIIOPAPAKE-BQBZGAKWSA-N Glu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N MPZWMIIOPAPAKE-BQBZGAKWSA-N 0.000 description 1
- OJGLIOXAKGFFDW-SRVKXCTJSA-N Glu-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N OJGLIOXAKGFFDW-SRVKXCTJSA-N 0.000 description 1
- KEBACWCLVOXFNC-DCAQKATOSA-N Glu-Arg-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KEBACWCLVOXFNC-DCAQKATOSA-N 0.000 description 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 1
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 1
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 1
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 1
- NADWTMLCUDMDQI-ACZMJKKPSA-N Glu-Asp-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N NADWTMLCUDMDQI-ACZMJKKPSA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- FLQAKQOBSPFGKG-CIUDSAMLSA-N Glu-Cys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLQAKQOBSPFGKG-CIUDSAMLSA-N 0.000 description 1
- OBIHEDRRSMRKLU-ACZMJKKPSA-N Glu-Cys-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OBIHEDRRSMRKLU-ACZMJKKPSA-N 0.000 description 1
- OWVURWCRZZMAOZ-XHNCKOQMSA-N Glu-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)C(=O)O OWVURWCRZZMAOZ-XHNCKOQMSA-N 0.000 description 1
- LVCHEMOPBORRLB-DCAQKATOSA-N Glu-Gln-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O LVCHEMOPBORRLB-DCAQKATOSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 1
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 1
- GGJOGFJIPPGNRK-JSGCOSHPSA-N Glu-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 GGJOGFJIPPGNRK-JSGCOSHPSA-N 0.000 description 1
- COSBSYQVPSODFX-GUBZILKMSA-N Glu-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N COSBSYQVPSODFX-GUBZILKMSA-N 0.000 description 1
- WVTIBGWZUMJBFY-GUBZILKMSA-N Glu-His-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O WVTIBGWZUMJBFY-GUBZILKMSA-N 0.000 description 1
- WDTAKCUOIKHCTB-NKIYYHGXSA-N Glu-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N)O WDTAKCUOIKHCTB-NKIYYHGXSA-N 0.000 description 1
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 1
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- LZMQSTPFYJLVJB-GUBZILKMSA-N Glu-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N LZMQSTPFYJLVJB-GUBZILKMSA-N 0.000 description 1
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 1
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 1
- BBBXWRGITSUJPB-YUMQZZPRSA-N Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O BBBXWRGITSUJPB-YUMQZZPRSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- JHSRJMUJOGLIHK-GUBZILKMSA-N Glu-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N JHSRJMUJOGLIHK-GUBZILKMSA-N 0.000 description 1
- LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 description 1
- UMHRCVCZUPBBQW-GARJFASQSA-N Glu-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UMHRCVCZUPBBQW-GARJFASQSA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 1
- GTFYQOVVVJASOA-ACZMJKKPSA-N Glu-Ser-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N GTFYQOVVVJASOA-ACZMJKKPSA-N 0.000 description 1
- ZAPFAWQHBOHWLL-GUBZILKMSA-N Glu-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N ZAPFAWQHBOHWLL-GUBZILKMSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- MIWJDJAMMKHUAR-ZVZYQTTQSA-N Glu-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N MIWJDJAMMKHUAR-ZVZYQTTQSA-N 0.000 description 1
- QGAJQIGFFIQJJK-IHRRRGAJSA-N Glu-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QGAJQIGFFIQJJK-IHRRRGAJSA-N 0.000 description 1
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 1
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 1
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 1
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- SCCPDJAQCXWPTF-VKHMYHEASA-N Gly-Asp Chemical compound NCC(=O)N[C@H](C(O)=O)CC(O)=O SCCPDJAQCXWPTF-VKHMYHEASA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 1
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- MQVNVZUEPUIAFA-WDSKDSINSA-N Gly-Cys-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN MQVNVZUEPUIAFA-WDSKDSINSA-N 0.000 description 1
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- BIRKKBCSAIHDDF-WDSKDSINSA-N Gly-Glu-Cys Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BIRKKBCSAIHDDF-WDSKDSINSA-N 0.000 description 1
- HFXJIZNEXNIZIJ-BQBZGAKWSA-N Gly-Glu-Gln Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFXJIZNEXNIZIJ-BQBZGAKWSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- IDOGEHIWMJMAHT-BYPYZUCNSA-N Gly-Gly-Cys Chemical compound NCC(=O)NCC(=O)N[C@@H](CS)C(O)=O IDOGEHIWMJMAHT-BYPYZUCNSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 1
- TVDHVLGFJSHPAX-UWVGGRQHSA-N Gly-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 TVDHVLGFJSHPAX-UWVGGRQHSA-N 0.000 description 1
- AYBKPDHHVADEDA-YUMQZZPRSA-N Gly-His-Asn Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O AYBKPDHHVADEDA-YUMQZZPRSA-N 0.000 description 1
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 1
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 1
- YNIMVVJTPWCUJH-KBPBESRZSA-N Gly-His-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YNIMVVJTPWCUJH-KBPBESRZSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 description 1
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 1
- ZKJZBRHRWKLVSJ-ZDLURKLDSA-N Gly-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)O ZKJZBRHRWKLVSJ-ZDLURKLDSA-N 0.000 description 1
- SFOXOSKVTLDEDM-HOTGVXAUSA-N Gly-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)CN)=CNC2=C1 SFOXOSKVTLDEDM-HOTGVXAUSA-N 0.000 description 1
- XBGGUPMXALFZOT-VIFPVBQESA-N Gly-Tyr Chemical compound NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-VIFPVBQESA-N 0.000 description 1
- JYGYNWYVKXENNE-OALUTQOASA-N Gly-Tyr-Trp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JYGYNWYVKXENNE-OALUTQOASA-N 0.000 description 1
- NGBGZCUWFVVJKC-IRXDYDNUSA-N Gly-Tyr-Tyr Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NGBGZCUWFVVJKC-IRXDYDNUSA-N 0.000 description 1
- DKJWUIYLMLUBDX-XPUUQOCRSA-N Gly-Val-Cys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O DKJWUIYLMLUBDX-XPUUQOCRSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- 208000009889 Herpes Simplex Diseases 0.000 description 1
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 1
- WJUYPBBCSSLVJE-CIUDSAMLSA-N His-Asn-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N WJUYPBBCSSLVJE-CIUDSAMLSA-N 0.000 description 1
- QQJMARNOLHSJCQ-DCAQKATOSA-N His-Cys-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N QQJMARNOLHSJCQ-DCAQKATOSA-N 0.000 description 1
- LDTJBEOANMQRJE-CIUDSAMLSA-N His-Cys-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LDTJBEOANMQRJE-CIUDSAMLSA-N 0.000 description 1
- IDQKGZWUPVOGPZ-GUBZILKMSA-N His-Cys-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N IDQKGZWUPVOGPZ-GUBZILKMSA-N 0.000 description 1
- VLPMGIJPAWENQB-SRVKXCTJSA-N His-Cys-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O VLPMGIJPAWENQB-SRVKXCTJSA-N 0.000 description 1
- WYWBYSPRCFADBM-GARJFASQSA-N His-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O WYWBYSPRCFADBM-GARJFASQSA-N 0.000 description 1
- NJZGEXYLSFGPHG-GUBZILKMSA-N His-Gln-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N NJZGEXYLSFGPHG-GUBZILKMSA-N 0.000 description 1
- CZXKZMQKXQZDEX-YUMQZZPRSA-N His-Gly-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N CZXKZMQKXQZDEX-YUMQZZPRSA-N 0.000 description 1
- CHZRWFUGWRTUOD-IUCAKERBSA-N His-Gly-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N CHZRWFUGWRTUOD-IUCAKERBSA-N 0.000 description 1
- UROVZOUMHNXPLZ-AVGNSLFASA-N His-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 UROVZOUMHNXPLZ-AVGNSLFASA-N 0.000 description 1
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 1
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 1
- XJFITURPHAKKAI-SRVKXCTJSA-N His-Pro-Gln Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CN=CN1 XJFITURPHAKKAI-SRVKXCTJSA-N 0.000 description 1
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 1
- PGXZHYYGOPKYKM-IHRRRGAJSA-N His-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CCCCN)C(=O)O PGXZHYYGOPKYKM-IHRRRGAJSA-N 0.000 description 1
- PLCAEMGSYOYIPP-GUBZILKMSA-N His-Ser-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 PLCAEMGSYOYIPP-GUBZILKMSA-N 0.000 description 1
- DGLAHESNTJWGDO-SRVKXCTJSA-N His-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N DGLAHESNTJWGDO-SRVKXCTJSA-N 0.000 description 1
- UOYGZBIPZYKGSH-SRVKXCTJSA-N His-Ser-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N UOYGZBIPZYKGSH-SRVKXCTJSA-N 0.000 description 1
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 1
- ZHMZWSFQRUGLEC-JYJNAYRXSA-N His-Tyr-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZHMZWSFQRUGLEC-JYJNAYRXSA-N 0.000 description 1
- PZUZIHRPOVVHOT-KBPBESRZSA-N His-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CN=CN1 PZUZIHRPOVVHOT-KBPBESRZSA-N 0.000 description 1
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 1
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 1
- DRKZDEFADVYTLU-AVGNSLFASA-N His-Val-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DRKZDEFADVYTLU-AVGNSLFASA-N 0.000 description 1
- 101100181375 Homo sapiens LAMB4 gene Proteins 0.000 description 1
- 101001051308 Homo sapiens Laminin subunit beta-4 Proteins 0.000 description 1
- 241000701806 Human papillomavirus Species 0.000 description 1
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 1
- BOTVMTSMOUSDRW-GMOBBJLQSA-N Ile-Arg-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O BOTVMTSMOUSDRW-GMOBBJLQSA-N 0.000 description 1
- ZZHGKECPZXPXJF-PCBIJLKTSA-N Ile-Asn-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZZHGKECPZXPXJF-PCBIJLKTSA-N 0.000 description 1
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 1
- WKXVAXOSIPTXEC-HAFWLYHUSA-N Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O WKXVAXOSIPTXEC-HAFWLYHUSA-N 0.000 description 1
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 1
- FADXGVVLSPPEQY-GHCJXIJMSA-N Ile-Cys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FADXGVVLSPPEQY-GHCJXIJMSA-N 0.000 description 1
- CTHAJJYOHOBUDY-GHCJXIJMSA-N Ile-Cys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N CTHAJJYOHOBUDY-GHCJXIJMSA-N 0.000 description 1
- CNPNWGHRMBQHBZ-ZKWXMUAHSA-N Ile-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O CNPNWGHRMBQHBZ-ZKWXMUAHSA-N 0.000 description 1
- XLCZWMJPVGRWHJ-KQXIARHKSA-N Ile-Glu-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N XLCZWMJPVGRWHJ-KQXIARHKSA-N 0.000 description 1
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 1
- CMNMPCTVCWWYHY-MXAVVETBSA-N Ile-His-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(C)C)C(=O)O)N CMNMPCTVCWWYHY-MXAVVETBSA-N 0.000 description 1
- VNDQNDYEPSXHLU-JUKXBJQTSA-N Ile-His-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N VNDQNDYEPSXHLU-JUKXBJQTSA-N 0.000 description 1
- KYLIZSDYWQQTFM-PEDHHIEDSA-N Ile-Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N KYLIZSDYWQQTFM-PEDHHIEDSA-N 0.000 description 1
- JWBXCSQZLLIOCI-GUBZILKMSA-N Ile-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(C)C JWBXCSQZLLIOCI-GUBZILKMSA-N 0.000 description 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 1
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 1
- DBXXASNNDTXOLU-MXAVVETBSA-N Ile-Leu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DBXXASNNDTXOLU-MXAVVETBSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 1
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 1
- BATWGBRIZANGPN-ZPFDUUQYSA-N Ile-Pro-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BATWGBRIZANGPN-ZPFDUUQYSA-N 0.000 description 1
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- QQVXERGIFIRCGW-NAKRPEOUSA-N Ile-Ser-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N QQVXERGIFIRCGW-NAKRPEOUSA-N 0.000 description 1
- DRCKHKZYDLJYFQ-YWIQKCBGSA-N Ile-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRCKHKZYDLJYFQ-YWIQKCBGSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 1
- NUEHSWNAFIEBCQ-NAKRPEOUSA-N Ile-Val-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N NUEHSWNAFIEBCQ-NAKRPEOUSA-N 0.000 description 1
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- 150000008575 L-amino acids Chemical class 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- 125000000393 L-methionino group Chemical group [H]OC(=O)[C@@]([H])(N([H])[*])C([H])([H])C(SC([H])([H])[H])([H])[H] 0.000 description 1
- 125000000174 L-prolyl group Chemical group [H]N1C([H])([H])C([H])([H])C([H])([H])[C@@]1([H])C(*)=O 0.000 description 1
- 125000000510 L-tryptophano group Chemical group [H]C1=C([H])C([H])=C2N([H])C([H])=C(C([H])([H])[C@@]([H])(C(O[H])=O)N([H])[*])C2=C1[H] 0.000 description 1
- 102100024623 Laminin subunit beta-4 Human genes 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- SENJXOPIZNYLHU-IUCAKERBSA-N Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-IUCAKERBSA-N 0.000 description 1
- CNNQBZRGQATKNY-DCAQKATOSA-N Leu-Arg-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N CNNQBZRGQATKNY-DCAQKATOSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 1
- UILIPCLTHRPCRB-XUXIUFHCSA-N Leu-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(C)C)N UILIPCLTHRPCRB-XUXIUFHCSA-N 0.000 description 1
- QUAAUWNLWMLERT-IHRRRGAJSA-N Leu-Arg-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O QUAAUWNLWMLERT-IHRRRGAJSA-N 0.000 description 1
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 1
- RFUBXQQFJFGJFV-GUBZILKMSA-N Leu-Asn-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RFUBXQQFJFGJFV-GUBZILKMSA-N 0.000 description 1
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 1
- HUEBCHPSXSQUGN-GARJFASQSA-N Leu-Cys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N HUEBCHPSXSQUGN-GARJFASQSA-N 0.000 description 1
- JYOAXOMPIXKMKK-YUMQZZPRSA-N Leu-Gln Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CCC(N)=O JYOAXOMPIXKMKK-YUMQZZPRSA-N 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- LESXFEZIFXFIQR-LURJTMIESA-N Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(O)=O LESXFEZIFXFIQR-LURJTMIESA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 1
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 1
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 description 1
- YWYQSLOTVIRCFE-SRVKXCTJSA-N Leu-His-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O YWYQSLOTVIRCFE-SRVKXCTJSA-N 0.000 description 1
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 1
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- FOBUGKUBUJOWAD-IHPCNDPISA-N Leu-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FOBUGKUBUJOWAD-IHPCNDPISA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 1
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- MUCIDQMDOYQYBR-IHRRRGAJSA-N Leu-Pro-His Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N MUCIDQMDOYQYBR-IHRRRGAJSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 1
- LRKCBIUDWAXNEG-CSMHCCOUSA-N Leu-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRKCBIUDWAXNEG-CSMHCCOUSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- LINKCQUOMUDLKN-KATARQTJSA-N Leu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N)O LINKCQUOMUDLKN-KATARQTJSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- KVSBQLNBMUPADA-AVGNSLFASA-N Leu-Val-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KVSBQLNBMUPADA-AVGNSLFASA-N 0.000 description 1
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 1
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 1
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 1
- ZTPWXNOOKAXPPE-DCAQKATOSA-N Lys-Arg-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N ZTPWXNOOKAXPPE-DCAQKATOSA-N 0.000 description 1
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 1
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 1
- YKIRNDPUWONXQN-GUBZILKMSA-N Lys-Asn-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKIRNDPUWONXQN-GUBZILKMSA-N 0.000 description 1
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 1
- SQXUUGUCGJSWCK-CIUDSAMLSA-N Lys-Asp-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N SQXUUGUCGJSWCK-CIUDSAMLSA-N 0.000 description 1
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 1
- QBGPXOGXCVKULO-BQBZGAKWSA-N Lys-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(O)=O QBGPXOGXCVKULO-BQBZGAKWSA-N 0.000 description 1
- XTONYTDATVADQH-CIUDSAMLSA-N Lys-Cys-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O XTONYTDATVADQH-CIUDSAMLSA-N 0.000 description 1
- BYEBKXRNDLTGFW-CIUDSAMLSA-N Lys-Cys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O BYEBKXRNDLTGFW-CIUDSAMLSA-N 0.000 description 1
- OAPNERBWQWUPTI-YUMQZZPRSA-N Lys-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O OAPNERBWQWUPTI-YUMQZZPRSA-N 0.000 description 1
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 1
- GGNOBVSOZPHLCE-GUBZILKMSA-N Lys-Gln-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GGNOBVSOZPHLCE-GUBZILKMSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 1
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- UETQMSASAVBGJY-QWRGUYRKSA-N Lys-Gly-His Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 UETQMSASAVBGJY-QWRGUYRKSA-N 0.000 description 1
- YXTKSLRSRXKXNV-IHRRRGAJSA-N Lys-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N YXTKSLRSRXKXNV-IHRRRGAJSA-N 0.000 description 1
- GNLJXWBNLAIPEP-MELADBBJSA-N Lys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCCN)N)C(=O)O GNLJXWBNLAIPEP-MELADBBJSA-N 0.000 description 1
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 1
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 1
- ATIPDCIQTUXABX-UWVGGRQHSA-N Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ATIPDCIQTUXABX-UWVGGRQHSA-N 0.000 description 1
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- NVGBPTNZLWRQSY-UWVGGRQHSA-N Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN NVGBPTNZLWRQSY-UWVGGRQHSA-N 0.000 description 1
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 1
- URBJRJKWSUFCKS-AVGNSLFASA-N Lys-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCCCN)N URBJRJKWSUFCKS-AVGNSLFASA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 1
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 1
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 1
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- SUZVLFWOCKHWET-CQDKDKBSSA-N Lys-Tyr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O SUZVLFWOCKHWET-CQDKDKBSSA-N 0.000 description 1
- HONVOXINDBETTI-KKUMJFAQSA-N Lys-Tyr-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CS)C(O)=O)CC1=CC=C(O)C=C1 HONVOXINDBETTI-KKUMJFAQSA-N 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- DSWOTZCVCBEPOU-IUCAKERBSA-N Met-Arg-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCNC(N)=N DSWOTZCVCBEPOU-IUCAKERBSA-N 0.000 description 1
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 1
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 1
- NCVJJAJVWILAGI-SRVKXCTJSA-N Met-Gln-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N NCVJJAJVWILAGI-SRVKXCTJSA-N 0.000 description 1
- GPVLSVCBKUCEBI-KKUMJFAQSA-N Met-Gln-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GPVLSVCBKUCEBI-KKUMJFAQSA-N 0.000 description 1
- MTBVQFFQMXHCPC-CIUDSAMLSA-N Met-Glu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MTBVQFFQMXHCPC-CIUDSAMLSA-N 0.000 description 1
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 1
- ODFBIJXEWPWSAN-CYDGBPFRSA-N Met-Ile-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O ODFBIJXEWPWSAN-CYDGBPFRSA-N 0.000 description 1
- USBFEVBHEQBWDD-AVGNSLFASA-N Met-Leu-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O USBFEVBHEQBWDD-AVGNSLFASA-N 0.000 description 1
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 description 1
- CGUYGMFQZCYJSG-DCAQKATOSA-N Met-Lys-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CGUYGMFQZCYJSG-DCAQKATOSA-N 0.000 description 1
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 1
- ZSHJENHLFBWELN-UHFFFAOYSA-N Met-Thr-Thr-Tyr Chemical compound CSCCC(N)C(=O)NC(C(C)O)C(=O)NC(C(C)O)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 ZSHJENHLFBWELN-UHFFFAOYSA-N 0.000 description 1
- VEKRTVRZDMUOQN-AVGNSLFASA-N Met-Val-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 VEKRTVRZDMUOQN-AVGNSLFASA-N 0.000 description 1
- 102000008109 Mixed Function Oxygenases Human genes 0.000 description 1
- 108010074633 Mixed Function Oxygenases Proteins 0.000 description 1
- 241000699660 Mus musculus Species 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- GHAZCVNUKKZTLG-UHFFFAOYSA-N N-ethyl-succinimide Natural products CCN1C(=O)CCC1=O GHAZCVNUKKZTLG-UHFFFAOYSA-N 0.000 description 1
- HDFGOPSGAURCEO-UHFFFAOYSA-N N-ethylmaleimide Chemical compound CCN1C(=O)C=CC1=O HDFGOPSGAURCEO-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- 241000588653 Neisseria Species 0.000 description 1
- 108010079246 OMPA outer membrane proteins Proteins 0.000 description 1
- 235000007602 Opuntia linguiformis Nutrition 0.000 description 1
- 101710116435 Outer membrane protein Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 108010033276 Peptide Fragments Proteins 0.000 description 1
- 102000007079 Peptide Fragments Human genes 0.000 description 1
- MIDZLCFIAINOQN-WPRPVWTQSA-N Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 MIDZLCFIAINOQN-WPRPVWTQSA-N 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- BRDYYVQTEJVRQT-HRCADAONSA-N Phe-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BRDYYVQTEJVRQT-HRCADAONSA-N 0.000 description 1
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 1
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 1
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 1
- WFDAEEUZPZSMOG-SRVKXCTJSA-N Phe-Cys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O WFDAEEUZPZSMOG-SRVKXCTJSA-N 0.000 description 1
- KOUUGTKGEQZRHV-KKUMJFAQSA-N Phe-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KOUUGTKGEQZRHV-KKUMJFAQSA-N 0.000 description 1
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 1
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 1
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 1
- MYQCCQSMKNCNKY-KKUMJFAQSA-N Phe-His-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O)N MYQCCQSMKNCNKY-KKUMJFAQSA-N 0.000 description 1
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- METZZBCMDXHFMK-BZSNNMDCSA-N Phe-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N METZZBCMDXHFMK-BZSNNMDCSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 1
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 1
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 1
- KAJLHCWRWDSROH-BZSNNMDCSA-N Phe-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 KAJLHCWRWDSROH-BZSNNMDCSA-N 0.000 description 1
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 1
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 1
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 1
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 1
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- XNMYNGDKJNOKHH-BZSNNMDCSA-N Phe-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XNMYNGDKJNOKHH-BZSNNMDCSA-N 0.000 description 1
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 1
- FSXRLASFHBWESK-HOTGVXAUSA-N Phe-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 FSXRLASFHBWESK-HOTGVXAUSA-N 0.000 description 1
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 1
- GCFNFKNPCMBHNT-IRXDYDNUSA-N Phe-Tyr-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)NCC(=O)O)N GCFNFKNPCMBHNT-IRXDYDNUSA-N 0.000 description 1
- 101710193132 Pre-hexon-linking protein VIII Proteins 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 1
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 1
- WECYCNFPGZLOOU-FXQIFTODSA-N Pro-Asn-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O WECYCNFPGZLOOU-FXQIFTODSA-N 0.000 description 1
- SBYVDRLQAGENMY-DCAQKATOSA-N Pro-Asn-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O SBYVDRLQAGENMY-DCAQKATOSA-N 0.000 description 1
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 description 1
- AHXPYZRZRMQOAU-QXEWZRGKSA-N Pro-Asn-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1)C(O)=O AHXPYZRZRMQOAU-QXEWZRGKSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 1
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 1
- LUGOKRWYNMDGTD-FXQIFTODSA-N Pro-Cys-Asn Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O LUGOKRWYNMDGTD-FXQIFTODSA-N 0.000 description 1
- QXNSKJLSLYCTMT-FXQIFTODSA-N Pro-Cys-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O QXNSKJLSLYCTMT-FXQIFTODSA-N 0.000 description 1
- NOXSEHJOXCWRHK-DCAQKATOSA-N Pro-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 NOXSEHJOXCWRHK-DCAQKATOSA-N 0.000 description 1
- OZAPWFHRPINHND-GUBZILKMSA-N Pro-Cys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O OZAPWFHRPINHND-GUBZILKMSA-N 0.000 description 1
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 1
- YSUZKYSRAFNLRB-ULQDDVLXSA-N Pro-Gln-Trp Chemical compound N([C@@H](CCC(=O)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C(=O)[C@@H]1CCCN1 YSUZKYSRAFNLRB-ULQDDVLXSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 1
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 1
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 1
- QEWBZBLXDKIQPS-STQMWFEESA-N Pro-Gly-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QEWBZBLXDKIQPS-STQMWFEESA-N 0.000 description 1
- LPGSNRSLPHRNBW-AVGNSLFASA-N Pro-His-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 LPGSNRSLPHRNBW-AVGNSLFASA-N 0.000 description 1
- TYMBHHITTMGGPI-NAKRPEOUSA-N Pro-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 TYMBHHITTMGGPI-NAKRPEOUSA-N 0.000 description 1
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 1
- ZKQOUHVVXABNDG-IUCAKERBSA-N Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 ZKQOUHVVXABNDG-IUCAKERBSA-N 0.000 description 1
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 1
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 1
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 1
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- SXMSEHDMNIUTSP-DCAQKATOSA-N Pro-Lys-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SXMSEHDMNIUTSP-DCAQKATOSA-N 0.000 description 1
- KLOQCCRTPHPIFN-DCAQKATOSA-N Pro-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 KLOQCCRTPHPIFN-DCAQKATOSA-N 0.000 description 1
- XZBYTHCRAVAXQQ-DCAQKATOSA-N Pro-Met-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O XZBYTHCRAVAXQQ-DCAQKATOSA-N 0.000 description 1
- GNADVDLLGVSXLS-ULQDDVLXSA-N Pro-Phe-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O GNADVDLLGVSXLS-ULQDDVLXSA-N 0.000 description 1
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 1
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- SEZGGSHLMROBFX-CIUDSAMLSA-N Pro-Ser-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O SEZGGSHLMROBFX-CIUDSAMLSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 1
- AJJDPGVVNPUZCR-RHYQMDGZSA-N Pro-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1)O AJJDPGVVNPUZCR-RHYQMDGZSA-N 0.000 description 1
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 1
- VGFFUEVZKRNRHT-ULQDDVLXSA-N Pro-Trp-Glu Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCC(=O)O)C(=O)O VGFFUEVZKRNRHT-ULQDDVLXSA-N 0.000 description 1
- DLZBBDSPTJBOOD-BPNCWPANSA-N Pro-Tyr-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O DLZBBDSPTJBOOD-BPNCWPANSA-N 0.000 description 1
- CWZUFLWPEFHWEI-IHRRRGAJSA-N Pro-Tyr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O CWZUFLWPEFHWEI-IHRRRGAJSA-N 0.000 description 1
- BVRBCQBUNGAWFP-KKUMJFAQSA-N Pro-Tyr-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O BVRBCQBUNGAWFP-KKUMJFAQSA-N 0.000 description 1
- BXHRXLMCYSZSIY-STECZYCISA-N Pro-Tyr-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O BXHRXLMCYSZSIY-STECZYCISA-N 0.000 description 1
- VEUACYMXJKXALX-IHRRRGAJSA-N Pro-Tyr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VEUACYMXJKXALX-IHRRRGAJSA-N 0.000 description 1
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 238000012181 QIAquick gel extraction kit Methods 0.000 description 1
- 208000017442 Retinal disease Diseases 0.000 description 1
- 108700025701 Retinoblastoma Genes Proteins 0.000 description 1
- 102400000235 Rimorphin Human genes 0.000 description 1
- 239000012506 Sephacryl® Substances 0.000 description 1
- 229920002684 Sepharose Polymers 0.000 description 1
- SSJMZMUVNKEENT-IMJSIDKUSA-N Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CO SSJMZMUVNKEENT-IMJSIDKUSA-N 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 1
- BKOKTRCZXRIQPX-ZLUOBGJFSA-N Ser-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N BKOKTRCZXRIQPX-ZLUOBGJFSA-N 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- QGMLKFGTGXWAHF-IHRRRGAJSA-N Ser-Arg-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGMLKFGTGXWAHF-IHRRRGAJSA-N 0.000 description 1
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 1
- TUYBIWUZWJUZDD-ACZMJKKPSA-N Ser-Cys-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(N)=O TUYBIWUZWJUZDD-ACZMJKKPSA-N 0.000 description 1
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 1
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 1
- YQQKYAZABFEYAF-FXQIFTODSA-N Ser-Glu-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQQKYAZABFEYAF-FXQIFTODSA-N 0.000 description 1
- UICKAKRRRBTILH-GUBZILKMSA-N Ser-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N UICKAKRRRBTILH-GUBZILKMSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- WOUIMBGNEUWXQG-VKHMYHEASA-N Ser-Gly Chemical compound OC[C@H](N)C(=O)NCC(O)=O WOUIMBGNEUWXQG-VKHMYHEASA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 1
- QBUWQRKEHJXTOP-DCAQKATOSA-N Ser-His-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QBUWQRKEHJXTOP-DCAQKATOSA-N 0.000 description 1
- IOVBCLGAJJXOHK-SRVKXCTJSA-N Ser-His-His Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IOVBCLGAJJXOHK-SRVKXCTJSA-N 0.000 description 1
- CAOYHZOWXFFAIR-CIUDSAMLSA-N Ser-His-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CAOYHZOWXFFAIR-CIUDSAMLSA-N 0.000 description 1
- JEHPKECJCALLRW-CUJWVEQBSA-N Ser-His-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEHPKECJCALLRW-CUJWVEQBSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 1
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 1
- QJKPECIAWNNKIT-KKUMJFAQSA-N Ser-Lys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QJKPECIAWNNKIT-KKUMJFAQSA-N 0.000 description 1
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 1
- OVQZAFXWIWNYKA-GUBZILKMSA-N Ser-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CO)N OVQZAFXWIWNYKA-GUBZILKMSA-N 0.000 description 1
- BVLGVLWFIZFEAH-BPUTZDHNSA-N Ser-Pro-Trp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O BVLGVLWFIZFEAH-BPUTZDHNSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- LDEBVRIURYMKQS-WISUUJSJSA-N Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO LDEBVRIURYMKQS-WISUUJSJSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- FZNNGIHSIPKFRE-QEJZJMRPSA-N Ser-Trp-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZNNGIHSIPKFRE-QEJZJMRPSA-N 0.000 description 1
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 1
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 1
- OSFZCEQJLWCIBG-BZSNNMDCSA-N Ser-Tyr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSFZCEQJLWCIBG-BZSNNMDCSA-N 0.000 description 1
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- IAOHCSQDQDWRQU-GUBZILKMSA-N Ser-Val-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IAOHCSQDQDWRQU-GUBZILKMSA-N 0.000 description 1
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 1
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- 241000191940 Staphylococcus Species 0.000 description 1
- 108700005078 Synthetic Genes Proteins 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 1
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 1
- NLJKZUGAIIRWJN-LKXGYXEUSA-N Thr-Asp-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O NLJKZUGAIIRWJN-LKXGYXEUSA-N 0.000 description 1
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 1
- NRUPKQSXTJNQGD-XGEHTFHBSA-N Thr-Cys-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NRUPKQSXTJNQGD-XGEHTFHBSA-N 0.000 description 1
- ODSAPYVQSLDRSR-LKXGYXEUSA-N Thr-Cys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O ODSAPYVQSLDRSR-LKXGYXEUSA-N 0.000 description 1
- KZUJCMPVNXOBAF-LKXGYXEUSA-N Thr-Cys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KZUJCMPVNXOBAF-LKXGYXEUSA-N 0.000 description 1
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 1
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 1
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 1
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 1
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 1
- UBDDORVPVLEECX-FJXKBIBVSA-N Thr-Gly-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UBDDORVPVLEECX-FJXKBIBVSA-N 0.000 description 1
- HEJJDUDEHLPDAW-CUJWVEQBSA-N Thr-His-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N)O HEJJDUDEHLPDAW-CUJWVEQBSA-N 0.000 description 1
- BQBCIBCLXBKYHW-CSMHCCOUSA-N Thr-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])[C@@H](C)O BQBCIBCLXBKYHW-CSMHCCOUSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- CJXURNZYNHCYFD-WDCWCFNPSA-N Thr-Lys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CJXURNZYNHCYFD-WDCWCFNPSA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 1
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 description 1
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 1
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- HUPLKEHTTQBXSC-YJRXYDGGSA-N Thr-Ser-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUPLKEHTTQBXSC-YJRXYDGGSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 1
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- NMCBVGFGWSIGSB-NUTKFTJISA-N Trp-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NMCBVGFGWSIGSB-NUTKFTJISA-N 0.000 description 1
- QNMIVTOQXUSGLN-SZMVWBNQSA-N Trp-Arg-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QNMIVTOQXUSGLN-SZMVWBNQSA-N 0.000 description 1
- RPVDDQYNBOVWLR-HOCLYGCPSA-N Trp-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RPVDDQYNBOVWLR-HOCLYGCPSA-N 0.000 description 1
- RWAYYYOZMHMEGD-XIRDDKMYSA-N Trp-Leu-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 RWAYYYOZMHMEGD-XIRDDKMYSA-N 0.000 description 1
- CFMGQWYCEJDTDG-XIRDDKMYSA-N Trp-Lys-Cys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(O)=O)=CNC2=C1 CFMGQWYCEJDTDG-XIRDDKMYSA-N 0.000 description 1
- NLWCSMOXNKBRLC-WDSOQIARSA-N Trp-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLWCSMOXNKBRLC-WDSOQIARSA-N 0.000 description 1
- VDCGPCSLAJAKBB-XIRDDKMYSA-N Trp-Ser-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N VDCGPCSLAJAKBB-XIRDDKMYSA-N 0.000 description 1
- SEXRBCGSZRCIPE-LYSGOOTNSA-N Trp-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O SEXRBCGSZRCIPE-LYSGOOTNSA-N 0.000 description 1
- NLKUJNGEGZDXGO-XVKPBYJWSA-N Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NLKUJNGEGZDXGO-XVKPBYJWSA-N 0.000 description 1
- QJBWZNTWJSZUOY-UWJYBYFXSA-N Tyr-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QJBWZNTWJSZUOY-UWJYBYFXSA-N 0.000 description 1
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 1
- DXYWRYQRKPIGGU-BPNCWPANSA-N Tyr-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DXYWRYQRKPIGGU-BPNCWPANSA-N 0.000 description 1
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 1
- SGFIXFAHVWJKTD-KJEVXHAQSA-N Tyr-Arg-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SGFIXFAHVWJKTD-KJEVXHAQSA-N 0.000 description 1
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 1
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 1
- IYHNBRUWVBIVJR-IHRRRGAJSA-N Tyr-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IYHNBRUWVBIVJR-IHRRRGAJSA-N 0.000 description 1
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 1
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 1
- IJUTXXAXQODRMW-KBPBESRZSA-N Tyr-Gly-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O IJUTXXAXQODRMW-KBPBESRZSA-N 0.000 description 1
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 1
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 1
- FIRUOPRJKCBLST-KKUMJFAQSA-N Tyr-His-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O FIRUOPRJKCBLST-KKUMJFAQSA-N 0.000 description 1
- SFSZDJHNAICYSD-PMVMPFDFSA-N Tyr-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)[C@H](CC4=CC=C(C=C4)O)N SFSZDJHNAICYSD-PMVMPFDFSA-N 0.000 description 1
- AXWBYOVVDRBOGU-SIUGBPQLSA-N Tyr-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AXWBYOVVDRBOGU-SIUGBPQLSA-N 0.000 description 1
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 1
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 1
- WOAQYWUEUYMVGK-ULQDDVLXSA-N Tyr-Lys-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOAQYWUEUYMVGK-ULQDDVLXSA-N 0.000 description 1
- GZUIDWDVMWZSMI-KKUMJFAQSA-N Tyr-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GZUIDWDVMWZSMI-KKUMJFAQSA-N 0.000 description 1
- CGWAPUBOXJWXMS-HOTGVXAUSA-N Tyr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 CGWAPUBOXJWXMS-HOTGVXAUSA-N 0.000 description 1
- WTTRJMAZPDHPGS-KKXDTOCCSA-N Tyr-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(O)=O WTTRJMAZPDHPGS-KKXDTOCCSA-N 0.000 description 1
- OKDNSNWJEXAMSU-IRXDYDNUSA-N Tyr-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 OKDNSNWJEXAMSU-IRXDYDNUSA-N 0.000 description 1
- NVZVJIUDICCMHZ-BZSNNMDCSA-N Tyr-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O NVZVJIUDICCMHZ-BZSNNMDCSA-N 0.000 description 1
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 1
- VYQQQIRHIFALGE-UWJYBYFXSA-N Tyr-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VYQQQIRHIFALGE-UWJYBYFXSA-N 0.000 description 1
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 1
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 1
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 1
- MWUYSCVVPVITMW-IGNZVWTISA-N Tyr-Tyr-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 MWUYSCVVPVITMW-IGNZVWTISA-N 0.000 description 1
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 1
- ABSXSJZNRAQDDI-KJEVXHAQSA-N Tyr-Val-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ABSXSJZNRAQDDI-KJEVXHAQSA-N 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- 102000005918 Ubiquitin Thiolesterase Human genes 0.000 description 1
- 108010005656 Ubiquitin Thiolesterase Proteins 0.000 description 1
- 101710198378 Uncharacterized 10.8 kDa protein in cox-rep intergenic region Proteins 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- WOCYUGQDXPTQPY-FXQIFTODSA-N Val-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N WOCYUGQDXPTQPY-FXQIFTODSA-N 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 1
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- CWSIBTLMMQLPPZ-FXQIFTODSA-N Val-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N CWSIBTLMMQLPPZ-FXQIFTODSA-N 0.000 description 1
- VXCAZHCVDBQMTP-NRPADANISA-N Val-Cys-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VXCAZHCVDBQMTP-NRPADANISA-N 0.000 description 1
- IRLYZKKNBFPQBW-XGEHTFHBSA-N Val-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N)O IRLYZKKNBFPQBW-XGEHTFHBSA-N 0.000 description 1
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 1
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 1
- UPJONISHZRADBH-XPUUQOCRSA-N Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UPJONISHZRADBH-XPUUQOCRSA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 1
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 1
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 1
- DHINLYMWMXQGMQ-IHRRRGAJSA-N Val-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 DHINLYMWMXQGMQ-IHRRRGAJSA-N 0.000 description 1
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 1
- PYPZMFDMCCWNST-NAKRPEOUSA-N Val-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N PYPZMFDMCCWNST-NAKRPEOUSA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 1
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 1
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 1
- VCIYTVOBLZHFSC-XHSDSOJGSA-N Val-Phe-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N VCIYTVOBLZHFSC-XHSDSOJGSA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 1
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 1
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 1
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- UEXPMFIAZZHEAD-HSHDSVGOSA-N Val-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](C(C)C)N)O UEXPMFIAZZHEAD-HSHDSVGOSA-N 0.000 description 1
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 1
- IWADHXDXSQONEL-GUBZILKMSA-N Val-Val-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O IWADHXDXSQONEL-GUBZILKMSA-N 0.000 description 1
- HMNZFMSWFCAGGW-XPWSMXQVSA-N [3-[hydroxy(2-hydroxyethoxy)phosphoryl]oxy-2-[(e)-octadec-9-enoyl]oxypropyl] (e)-octadec-9-enoate Chemical compound CCCCCCCC\C=C\CCCCCCCC(=O)OCC(COP(O)(=O)OCCO)OC(=O)CCCCCCC\C=C\CCCCCCCC HMNZFMSWFCAGGW-XPWSMXQVSA-N 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 210000000577 adipose tissue Anatomy 0.000 description 1
- 239000002671 adjuvant Substances 0.000 description 1
- 238000003314 affinity selection Methods 0.000 description 1
- 230000001270 agonistic effect Effects 0.000 description 1
- 108010047506 alanyl-glutaminyl-glycyl-valine Proteins 0.000 description 1
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 230000002788 anti-peptide Effects 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 1
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010091818 arginyl-glycyl-aspartyl-valine Proteins 0.000 description 1
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- XYOVOXDWRFGKEX-UHFFFAOYSA-N azepine Chemical compound N1C=CC=CC=C1 XYOVOXDWRFGKEX-UHFFFAOYSA-N 0.000 description 1
- 229940049706 benzodiazepine Drugs 0.000 description 1
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 1
- UCMIRNVEIXFBKS-UHFFFAOYSA-N beta-alanine Chemical compound NCCC(O)=O UCMIRNVEIXFBKS-UHFFFAOYSA-N 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 235000011148 calcium chloride Nutrition 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 230000021523 carboxylation Effects 0.000 description 1
- 238000006473 carboxylation reaction Methods 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 125000002091 cationic group Chemical group 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 210000003169 central nervous system Anatomy 0.000 description 1
- 210000001638 cerebellum Anatomy 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 230000001684 chronic effect Effects 0.000 description 1
- 210000001072 colon Anatomy 0.000 description 1
- 239000003184 complementary RNA Substances 0.000 description 1
- 108091036078 conserved sequence Proteins 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000009260 cross reactivity Effects 0.000 description 1
- 201000010251 cutis laxa Diseases 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 1
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 1
- HAAZLUGHYHWQIW-KVQBGUIXSA-N dGTP Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HAAZLUGHYHWQIW-KVQBGUIXSA-N 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 230000002939 deleterious effect Effects 0.000 description 1
- 238000001212 derivatisation Methods 0.000 description 1
- 210000004207 dermis Anatomy 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000000502 dialysis Methods 0.000 description 1
- 238000004090 dissolution Methods 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 230000002121 endocytic effect Effects 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 210000003238 esophagus Anatomy 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 239000011536 extraction buffer Substances 0.000 description 1
- 230000008175 fetal development Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 210000003495 flagella Anatomy 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 238000012248 genetic selection Methods 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 210000002216 heart Anatomy 0.000 description 1
- 210000003958 hematopoietic stem cell Anatomy 0.000 description 1
- 238000005734 heterodimerization reaction Methods 0.000 description 1
- 238000012203 high throughput assay Methods 0.000 description 1
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 208000003906 hydrocephalus Diseases 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 210000001822 immobilized cell Anatomy 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 238000003119 immunoblot Methods 0.000 description 1
- 230000000984 immunochemical effect Effects 0.000 description 1
- 238000010166 immunofluorescence Methods 0.000 description 1
- 230000016784 immunoglobulin production Effects 0.000 description 1
- 238000011532 immunohistochemical staining Methods 0.000 description 1
- 238000001114 immunoprecipitation Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 238000002743 insertional mutagenesis Methods 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 210000000936 intestine Anatomy 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 230000010189 intracellular transport Effects 0.000 description 1
- 238000010253 intravenous injection Methods 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 150000003951 lactams Chemical group 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010087810 leucyl-seryl-glutamyl-leucine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 230000032575 lytic viral release Effects 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 210000001161 mammalian embryo Anatomy 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- 208000011042 muscle-eye-brain disease Diseases 0.000 description 1
- 201000006938 muscular dystrophy Diseases 0.000 description 1
- 230000036438 mutation frequency Effects 0.000 description 1
- 210000005157 neural retina Anatomy 0.000 description 1
- 210000004498 neuroglial cell Anatomy 0.000 description 1
- 210000000715 neuromuscular junction Anatomy 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 238000011580 nude mouse model Methods 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 210000001428 peripheral nervous system Anatomy 0.000 description 1
- 210000001322 periplasm Anatomy 0.000 description 1
- 108010072637 phenylalanyl-arginyl-phenylalanine Proteins 0.000 description 1
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 108010025488 pinealon Proteins 0.000 description 1
- 108010054442 polyalanine Proteins 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 230000001323 posttranslational effect Effects 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 235000019419 proteases Nutrition 0.000 description 1
- 230000006337 proteolytic cleavage Effects 0.000 description 1
- 210000000449 purkinje cell Anatomy 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 239000012723 sample buffer Substances 0.000 description 1
- 238000007423 screening assay Methods 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 238000010008 shearing Methods 0.000 description 1
- 210000000813 small intestine Anatomy 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 210000002784 stomach Anatomy 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 208000011580 syndromic disease Diseases 0.000 description 1
- 239000012622 synthetic inhibitor Substances 0.000 description 1
- 210000001779 taste bud Anatomy 0.000 description 1
- 125000005931 tert-butyloxycarbonyl group Chemical group [H]C([H])([H])C(OC(*)=O)(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 238000012956 testing procedure Methods 0.000 description 1
- 229940124597 therapeutic agent Drugs 0.000 description 1
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 210000001541 thymus gland Anatomy 0.000 description 1
- 210000002105 tongue Anatomy 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- 230000008733 trauma Effects 0.000 description 1
- 239000012588 trypsin Substances 0.000 description 1
- 108010087967 type I signal peptidase Proteins 0.000 description 1
- 108010079202 tyrosyl-alanyl-cysteine Proteins 0.000 description 1
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 1
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 230000009452 underexpressoin Effects 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 210000003932 urinary bladder Anatomy 0.000 description 1
- 210000004291 uterus Anatomy 0.000 description 1
- 229960005486 vaccine Drugs 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 210000002845 virion Anatomy 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/78—Connective tissue peptides, e.g. collagen, elastin, laminin, fibronectin, vitronectin or cold insoluble globulin [CIG]
Landscapes
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Zoology (AREA)
- Genetics & Genomics (AREA)
- Medicinal Chemistry (AREA)
- Gastroenterology & Hepatology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Toxicology (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
The invention is drawn to a purified laminin 12 polypeptide that includes the .alpha.2 subunit, the .beta.1 subunit and the .gamma.3 subunit. The invention is also drawn to isolated laminin .beta.4 subunit and to isolated laminin .gamma.3 subunit. The invention is also drawn to polynucleotides encoding purified laminin 12 polypeptide as well as to polynucleotides encoding laminin .beta.4 subunit and laminin .gamma.3 subunit.
Description
LANJZNINS AND USES THEREOF
BACKGROUND OF THE INVENTION
The invention relates to the laminin 12, laminin subunit y3, and laminin subunit (31, and methods of making and using these molecules.
SUMMARY OF THE INVENTION
The present invention is based, in part, on the discovery of a novel member of the laminin family, laminin 12. Accordingly, the present invention features a purified or isolated preparation or a recombinant preparation of laminin 12 which includes an a2 subunit, a ail subunit and a y3 subunit.
In a preferred embodiment, the a2 subunit has at least 60% to about 70%, more preferably at least about 80%, even more preferably at least about 90% to about 95%, and most preferably at least about 99% sequence identity with human a2 subunit, e.g., the human a2 subunit of SEQ ID N0:7. The a2 subunit can be identical to a human a2 sequence, e.g., that of SEQ ID N0:7. In another embodiment, the a2 subunit is encoded by a nucleic acid molecule which hybridizes under stringent conditions to a nucleic acid molecule of the nucleic acid sequence shown in SEQ ID N0:8. In addition, the a2 subunit can have substantially the same electrophoretic mobility as human a2 subunit, e.g., it appears as a 205 kDa electrophoretic band on reducing gels. Yet another preferred embodiment of the invention features an a2 subunit which is reactive with an a2-specific antibody, e.g., an antibody which binds to the epitope recognized by mAb SH2. a2 specific antibodies can be made by methods known in the art.
Another preferred embodiment of the invention features a (31 subunit having at least 60% to about 70%, more preferably at least about 80%, even more preferably at least about 90% to about 95%, and most preferably at least about 99% sequence identity with human (31 subunit, e.g., the human ail subunit of SEQ 1D N0:9. Preferably, the X31 subunit has the identical amino acid sequence of human ~i 1 subunit, e.g., that of SEQ ID
N0:9. In another embodiment, the ail subunit is encoded by a nucleic acid molecule which hybridizes under stringent conditions to a nucleic acid molecule of the nucleic acid sequence shown in SEQ ID
NO:10. In addition, the (31 subunit can have substantially the same electrophoretic mobility as human (31 subunit, e.g., it appears as a 185 kDa electrophoretic band on reducing gels. Yet another preferred embodiment of the invention features an [31 subunit which is reactive with an (31-specific antibody, e.g., an antibody which binds to the epitope recognized by mAb 545.
ail-specific antibodies can be made by methods known in the art.
In yet another preferred embodiment, the Y3 subunit of laminin 12 has at least 60% to about 70%, more preferably at least about 80%, even more preferably at least about 90% to about 95%, and most preferably at least about 99% sequence identity with human y3 subunit, e.g., the y3 subunit of SEQ ID N0:3. The y3 subunit can be identical to a naturally occuring human y3 subunit, e.g., that of SEQ ID N0:3. In another embodiment, the y3 subunit is encoded by a nucleic acid molecule which hybridizes under stringent conditions to a nucleic acid molecule of the nucleic acid sequence shown in SEQ ID N0:4. In addition, the y3 subunit can have substantially the same electrophoretic mobility as human y3 subunit, e.g., it appears as a 170 kDa electrophoretic band on reducing gels. Yet another preferred embodiment of the invention features an y3 subunit which is reactive with an y3-specific antibody. y3-specific antibodies can be made by methods known in the art and taught herein.
In a preferred embodiment, the laminin 12 is a trimer which can be found in, or can be isolated from human placental chorionic villi. In another embodiment, the laminin 12 is expressed by a recombinant cell, e.g., a bacterial cell, a cultured cell (e.g., a cultured eukaryotic cell) or a cell of a non-human transgenic animal. Cultured cells can include CHO
cells or SF8 cells. Expression of laminin 12 in a transgenic animal can be general or can be under the control of a tissue specific promoter. Preferably, one or more sequences which encode subunits of the laminin 12 trimer are expressed in a preferred cell-type by a tissue specific promoter, e.g., a milk specific promoter.
The present invention is also based, in part, on the discovery of a novel laminin subunit, y3. Accordingly, the invention features a recombinant or substantially pure or isolated preparation of a y3 polypeptide.
In a preferred embodiment, the y3 polypeptide has the following biological acitivities:
1) it promotes adhesion between tissue elements; 2) provides a site for insertion of nerves into the basement membrane. Im other preferred embodiments: the y3 polypeptide includes an amino acid sequence with at least 60%, 80%, 90%, 95%, 98%, or 99% sequence identity to an amino acid sequence from SEQ ID N0:3; the y3 polypeptide includes an amino acid sequence essentially the same as the amino acid sequence in SEQ ID N0:3; the y3 polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length; the y3 polypeptide includes at least 5, preferably at least 10, more preferably at least 20, most preferably at least 50, 100, or 150 contiguous amino acids from SEQ ID N0:3; the y3 polypeptide is either, an agonist or an antagonist, of a biological activity of a naturally occurring y3 subunit; the y3 polypeptide is a vertebrate, e.g., a mammalian, e.g. a primate, e.g., a human, y3 polypeptide.
In a preferred embodiment, the invention includes a y3 polypeptide encoded by a DNA insert of a plasmid deposited with ATCC as Accession No: 209357. In another embodiment, the y3 polypeptide is a polypeptide encoded by nucleotide sequences of the overlapping DNA inserts of more than one, preferably all seven of the plasmids deposited with ATCC as Accession No:209357.
In preferred embodiments: the y3 polypeptide is encoded by the nucleic acid in SEQ
ID N0:4, or by a nucleic acid having at least about 85%, more preferably at least about 90%
to about 95%, and most preferably at least about 99% sequence identity with the nucleic acid from SEQ ID NO: 4.
BACKGROUND OF THE INVENTION
The invention relates to the laminin 12, laminin subunit y3, and laminin subunit (31, and methods of making and using these molecules.
SUMMARY OF THE INVENTION
The present invention is based, in part, on the discovery of a novel member of the laminin family, laminin 12. Accordingly, the present invention features a purified or isolated preparation or a recombinant preparation of laminin 12 which includes an a2 subunit, a ail subunit and a y3 subunit.
In a preferred embodiment, the a2 subunit has at least 60% to about 70%, more preferably at least about 80%, even more preferably at least about 90% to about 95%, and most preferably at least about 99% sequence identity with human a2 subunit, e.g., the human a2 subunit of SEQ ID N0:7. The a2 subunit can be identical to a human a2 sequence, e.g., that of SEQ ID N0:7. In another embodiment, the a2 subunit is encoded by a nucleic acid molecule which hybridizes under stringent conditions to a nucleic acid molecule of the nucleic acid sequence shown in SEQ ID N0:8. In addition, the a2 subunit can have substantially the same electrophoretic mobility as human a2 subunit, e.g., it appears as a 205 kDa electrophoretic band on reducing gels. Yet another preferred embodiment of the invention features an a2 subunit which is reactive with an a2-specific antibody, e.g., an antibody which binds to the epitope recognized by mAb SH2. a2 specific antibodies can be made by methods known in the art.
Another preferred embodiment of the invention features a (31 subunit having at least 60% to about 70%, more preferably at least about 80%, even more preferably at least about 90% to about 95%, and most preferably at least about 99% sequence identity with human (31 subunit, e.g., the human ail subunit of SEQ 1D N0:9. Preferably, the X31 subunit has the identical amino acid sequence of human ~i 1 subunit, e.g., that of SEQ ID
N0:9. In another embodiment, the ail subunit is encoded by a nucleic acid molecule which hybridizes under stringent conditions to a nucleic acid molecule of the nucleic acid sequence shown in SEQ ID
NO:10. In addition, the (31 subunit can have substantially the same electrophoretic mobility as human (31 subunit, e.g., it appears as a 185 kDa electrophoretic band on reducing gels. Yet another preferred embodiment of the invention features an [31 subunit which is reactive with an (31-specific antibody, e.g., an antibody which binds to the epitope recognized by mAb 545.
ail-specific antibodies can be made by methods known in the art.
In yet another preferred embodiment, the Y3 subunit of laminin 12 has at least 60% to about 70%, more preferably at least about 80%, even more preferably at least about 90% to about 95%, and most preferably at least about 99% sequence identity with human y3 subunit, e.g., the y3 subunit of SEQ ID N0:3. The y3 subunit can be identical to a naturally occuring human y3 subunit, e.g., that of SEQ ID N0:3. In another embodiment, the y3 subunit is encoded by a nucleic acid molecule which hybridizes under stringent conditions to a nucleic acid molecule of the nucleic acid sequence shown in SEQ ID N0:4. In addition, the y3 subunit can have substantially the same electrophoretic mobility as human y3 subunit, e.g., it appears as a 170 kDa electrophoretic band on reducing gels. Yet another preferred embodiment of the invention features an y3 subunit which is reactive with an y3-specific antibody. y3-specific antibodies can be made by methods known in the art and taught herein.
In a preferred embodiment, the laminin 12 is a trimer which can be found in, or can be isolated from human placental chorionic villi. In another embodiment, the laminin 12 is expressed by a recombinant cell, e.g., a bacterial cell, a cultured cell (e.g., a cultured eukaryotic cell) or a cell of a non-human transgenic animal. Cultured cells can include CHO
cells or SF8 cells. Expression of laminin 12 in a transgenic animal can be general or can be under the control of a tissue specific promoter. Preferably, one or more sequences which encode subunits of the laminin 12 trimer are expressed in a preferred cell-type by a tissue specific promoter, e.g., a milk specific promoter.
The present invention is also based, in part, on the discovery of a novel laminin subunit, y3. Accordingly, the invention features a recombinant or substantially pure or isolated preparation of a y3 polypeptide.
In a preferred embodiment, the y3 polypeptide has the following biological acitivities:
1) it promotes adhesion between tissue elements; 2) provides a site for insertion of nerves into the basement membrane. Im other preferred embodiments: the y3 polypeptide includes an amino acid sequence with at least 60%, 80%, 90%, 95%, 98%, or 99% sequence identity to an amino acid sequence from SEQ ID N0:3; the y3 polypeptide includes an amino acid sequence essentially the same as the amino acid sequence in SEQ ID N0:3; the y3 polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length; the y3 polypeptide includes at least 5, preferably at least 10, more preferably at least 20, most preferably at least 50, 100, or 150 contiguous amino acids from SEQ ID N0:3; the y3 polypeptide is either, an agonist or an antagonist, of a biological activity of a naturally occurring y3 subunit; the y3 polypeptide is a vertebrate, e.g., a mammalian, e.g. a primate, e.g., a human, y3 polypeptide.
In a preferred embodiment, the invention includes a y3 polypeptide encoded by a DNA insert of a plasmid deposited with ATCC as Accession No: 209357. In another embodiment, the y3 polypeptide is a polypeptide encoded by nucleotide sequences of the overlapping DNA inserts of more than one, preferably all seven of the plasmids deposited with ATCC as Accession No:209357.
In preferred embodiments: the y3 polypeptide is encoded by the nucleic acid in SEQ
ID N0:4, or by a nucleic acid having at least about 85%, more preferably at least about 90%
to about 95%, and most preferably at least about 99% sequence identity with the nucleic acid from SEQ ID NO: 4.
In preferred embodiments, the y3 polypeptide includes a nidogen-binding domain.
Generally, the nidogen-binding domain is at least 5 residues in length and preferably, has about 70, 80, 90, or 95% sequence identity with the nidogen-binding domain of the protein shown in SEQ ID NO: 3 (amino acid residues 750-755). In another embodiment, the y3 polypeptide includes at least 5, preferably 6 to 7, and most preferably 8 of the cysteins found Z O in native y3 protein. In yet another embodiment of the invention features a y3 polypeptide that does not include or has an inactivated nidogen-binding domain which serves as an antagonist to y3 biological activities. Furthermore, a y3 polypeptide which has antagonist activity can have inactivated or excluded regions which comprise at least one cystein found in native y3 protein.
In a preferred embodiment, the y3 polypeptide differs in amino acid sequence at up to 1, 2, 3, 5, or 10 residues, from a sequence in SEQ ID NO: 3. In other preferred embodiments, the y3 polypeptide differs in amino acid sequence at up to 1, 2, 3, 5, or 10 %
of the residues from a sequence in SEQ ID NO: 3. Preferably, the differences are such that:
the y3 polypeptide exhibits a y3 biological activity, e.g., the y3 polypeptide retains a biological activity of a naturally occurring y3 subunit.
In preferred embodiments the y3 polypeptide includes a y3 subunit sequence described herein as well as other N-terminal andlor C-terminal amino acid sequence.
In preferred embodiments, the y3 polypeptide includes all or a fragment of an amino acid sequence from SEQ ID NO: 3, fused, in reading frame, to additional amino acid residues, preferably to residues encoded by genomic DNA 5' to the genomic DNA which encodes a sequence from SEQ ID NO: 3.
In yet other preferred embodiments, the y3 polypeptide is a recombinant fusion protein having a first y3 portion and a second polypeptide portion, e.g., a second polypeptide portion having an amino acid sequence unrelated to y3. The second polypeptide portion can be, e.g., any of glutathione-S-transferase, a DNA binding domain, or a polymerase activating domain. In preferred embodiment the fusion protein can be used in a two-hybrid assay.
In a preferred embodiment the y3 polypeptide includes amino acid residues 750-of SEQ ID N0:3. In another embodiment, the y3 polypeptide encodes domains IV-VI of the y3 subunit.
In preferred embodiments the y3 polypeptide has antagonistic activity, and is capable of inhibiting adhesion between connective tissues.
In a preferred embodiment, the y3 polypeptide is a fragment of a naturally occurring y 3 which inhibits connective tissue adhesion.
Polypeptides of the invention include those which arise as a result of the existence of multiple genes, alternative transcription events, alternative RNA splicing events, and alternative translational and postranslational events. The y3 polypeptide can be expressed in systems, e.g., cultured cells, which result in substantially the same postranslational modifications present when expressed y3 is expressed in a native cell, or in systems which result in the omission of postranslational modifications present when expressed in a native cell. ~ _ The invention includes an immunogen which includes a y3 polypeptide in an immunogenic preparation, the immunogen being capable of eliciting an immune response specific for the y3 polypeptide, e.g., a humoral response, an antibody response, or a cellular response. In preferred embodiments, the immunogen comprising an antigenic determinant, e.g., a unique determinant, from a protein represented by SEQ ID NO: 3.
The present invention also includes an antibody preparation specifically reactive with an epitope-of the y3 immunogen or generally of a y3 polypeptide, preferably an epitope which consists all or in part of residues from the the amino acid sequence of SEQ ID
N0:3, or an epitope, which when bound to an antibody, results in the modulation of a biological activity.
In preferred embodiments the y3-like polypeptide, as expressed in the cells in which it is normally expressed or in other eukaryotic cells, has a molecular weight of 170 kDa as determined by SDS-PAGE.
In another embodiment, the y3 polypeptide comprises amino acid residues 100-of SEQ ID NO: 3.
In a preferred embodiment, the y3 polypeptide has one or more of the following characteristics:
(i) it has the ability to promote adhesion between connective tissues;
(ii) it has a molecular weight, amino acid composition or other physical characteristic of y3 subunit of SEQ ID N0:3;
(iii) it has an overall sequence similarity of at least 50%, preferably at least 60%, more preferably at least 70, 80, 90, or 95%, with a y3 polypeptide of SEQ
ID N0:3;
(iv) it can be isolated from human placenta chorionic villi;
(v) it has a nidogen-binding domain which is preferably about 70%, 80%, 90% or 95% with amino acid residues 750-755 of SEQ ID N0:3;
(vi) it can colocalize with protein ubiquitin carboxy terminal hydroxylase I;
(vii) it has at least 5, preferably 6 or 7, and most preferably 8 of the cysteins found amino acid sequence of native y3.
Also included in the invention is a composition which includes a y3 polypeptide (or a nucleic acid which encodes it) and one or more additional components, e.g., a carrier, diluent, or solvent. The additional component can be one which renders the composition useful for in vitro and in vivo pharmaceutical or veterinary use.
In another aspect, the invention provides an isolated or substantially pure nucleic acid having or comprising a nucleotide sequence which encodes a y3 polypeptide, e.g., a y3 polypeptide described herein.
A preferred embodiment of the invention features a nucleic acid molecule having a nucleotide sequence at least about 85% sequence identity to a nucleotide sequence of SEQ iD
N0:4. In other preferred embodiments, the y3 polypeptide is encoded by a nucleic acid S
S molecule having a nucleotide sequence with at least about 90% to about 9S%, and more preferably about 98% to about 99% sequence identity to the nucleotide sequence from SEQ
ID N0:4. In another preferred embodiment, the y3 polypeptide is encoded by the nulceic acid molecule of SEQ ID N0:4.
In prefered embodiments, the isolated nucleic acid molecule includes the nucleotide sequence of at least one and preferably all of the DNA inserts of the plasmids deposited with ATCC as Accession No: 209357.
In preferred embodiments, the subject y3 nucleic acid will include a transcriptional regulatory sequence, e.g. at least one of a transcriptional promoter or transcriptional enhancer sequence, operably linked to the y3 gene sequence (also referred to as LAMG3), e.g., to 1 S render the y3 gene sequence suitable for use as an expression vector.
In yet a further preferred embodiment, the nucleic acid which encodes a y3 polypeptide of the invention, hybridizes under stringent conditions to a nucleic acid probe corresponding to at least 12 consecutive nucleotides of SEQ ID N0:4. More preferably, the nucleic acid probe corresponds to at least 20 consecutive nucleotides from SEQ
ID NO: 4.
The invention also provides a probe or primer which includes or comprises a substantially purified oligonucleotide. The oligonucleotide includes a region of nucleotide sequence which hybridizes under stringent conditions to at least 10 consecutive nucleotides of sense or antisense sequence from SEQ ID NO: 4, or naturally occurring mutants thereof. In preferred embodiments, the probe or primer further includes a label group attached thereto.
2S The label group can be, e.g., a radioisotope, a fluorescent compound, an enzyme, and/or an enzyme co-factor. Preferably the oligonucleotide is at least 10 and less than 20, 30, S0, 100, or 150 nucleotides in length.
The invention involves nucleic acids, e.g., RNA or DNA, encoding a y3 polypeptide of the invention. This includes double stranded nucleic acids as well as coding and antisense single strands.
In another aspect, the invention features a cell or purified preparation of cells which include a y3 subunit transgene, or which otherwise misexpress a y3 gene. The cell preparation can consist of human or non human cells, e.g., rodent cells, e.g., mouse or rat cells, rabbit cells, or pig cells. In preferred embodiments, the cell or cells include a y3 3S transgene, e.g., a heterologous form of a r3 gene, e.g., a gene derived from humans (in the case of a non-human cell). The y3 transgene can be misexpressed, e.g., overexpressed or underexpressed. In other preferred embodiments, the cell or cells include a gene which misexpress an endogenous y3 gene, e.g., a gene the expression of which is disrupted, e.g., a knockout. Such cells can serve as a model for studying disorders which are related to mutated or mis-expressed y3 alleles or for use in drug screening.
In another aspect, the invention features a transgenic y3 animal, e.g., a rodent, e.g., a mouse or a rat, a rabbit, a pig, a goat, or a cow. In preferred embodiments, the transgenic animal includes (and preferably express) a heterologous form of a y3 gene, e.g., a gene derived from humans. In a further embodiment, the y3 transgene includes a tissue specific promoter, e.g., a milk-specific promoter. In other preferred embodiments, the animal has an endogenous y3 gene which is misexpressed, e.g., a knockout. Such a transgenic animal can serve as a model for studying disorders which are related to mutated or mis-expressed y3 alleles or for use in drug screening.
The invention is also based, in part, on the discovery of a novel laminin subunit, (34.
Accordingly, the invention features a recombinant or substantially pure preparation of a (34 polypeptide.
In preferred embodiment, the X34 polypeptide has the following biological activities: 1 ) it promotes adhesion between tissue elements; 2) it aids in wound healing. In other preferred embodiments: the (34 polypeptide includes an amino acid sequence with at least 65%, 80%, 90%, 95%, 98%, or 99% sequence identity to an amino acid sequence from SEQ ID
NO:1;
the (34 polypeptide includes an amino acid sequence essentially the same as an amino acid sequence in SEQ ID NO: 1; the (34 polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length; the (34 polypeptide includes at least 5, preferably at least 10, more preferably at least 20, most preferably at least 50, 100, or 150 contiguous amino acids from SEQ ID
NO:1; the (34 polypeptide is either, an agonist or an antagonist, of a biological activity of a naturally occurnng ~i4 subunit; the ~i4 polypeptide is a vertebrate, e.g., a mammalian, e.g. a primate, e.g., a human, ~i4 polypeptide.
In preferred embodiments: the ~i4 polypeptide is encoded by the nucleic acid in SEQ
ID N0:2, or by a nucleic acid having at least about 65% to about 70%, more preferably at least 80%, even more preferably at least about 90% to about 95%, and most preferably about 99% sequence identity with the nucleic acid from SEQ ID NO: 2.
In preferred embodiments, the (34 polypeptide includes domains VI and V found in native [34 subunits. Amino acid residues from about 221-262 and 263-535 of SEQ
m NO: 1 are exemplary of domains VI and V, respectively, of (34. Generally, domain VI
is at least 33 residues in length and has preferably at least about 60%, more preferably about 70% to about 80%, and most preferably about 90% to about 95% sequence identity with the amino acid residues 221-262 of the (34 protein shown in SEQ ID NO: 1. Domain V is at least 272 residues in length and has preferably at least about 60%, more preferably about 70% to about 80%, and most preferably about 90% to about 95% sequence identity with the amino acid residues 263-535 of the (34 protein shown in SEQ ID NO: 1. In another embodiment, the (34 polypeptide has at least 5, preferably 6 or 7, and most preferably 8 cysteins as found in native (34. In yet another embodiment, a (34 polypeptide which has antagonist activity has inactivated or excluded regions which comprise at least one of the cysteins found in native [34 protein.
In a preferred embodiment, the ~i4 polypeptide differs in amino acid sequence at up to 1, 2, 3, 5, or 10 residues, from a sequence in SEQ ID NO: 1. In other preferred embodiments, the (34 polypeptide differs in amino acid sequence at up to 1, 2, 3, 5, or 10 % of the residues from a sequence in SEQ ID NO: 1. Preferably, the differences are such that:
the (34 polypeptide exhibits a (34 biological activity, e.g., the (34 polypeptide retains a biological activity of a naturally occurring ~i4 subunit.
In preferred embodiments the (34 polypeptide includes a X34 sequence described herein as well as other N-terminal and/or C-terminal amino acid sequence.
In preferred embodiments, the (34 polypeptide includes all or a fragment of an amino acid sequence from SEQ ID NO:1, fused, in reading frame, to additional amino acid residues, preferably to residues encoded by genomic DNA 5' to the genomic DNA which encodes a sequence from SEQ ID NO:1.
In yet other preferred embodiments, the X34 polypeptide is a recombinant fusion protein having a first (34 portion and a second polypeptide portion, e.g., a second polypeptide portion having an amino acid sequence unrelated to (34. The second polypeptide portion can be, e.g., any of glutathione-S-transferase, a DNA binding domain, or a polymerase activating domain. In preferred embodiment the fusion protein can be used in a two-hybrid assay.
In preferred embodiments the ~i4 polypeptide has antagonistic activity, and is capable of inhibiting the adhesion of connective tissues.
Preferably, the [34 polypeptide is a fragment of a naturally occurnng (34 which inhibits connective tissue adhesion.
Polypeptides of the invention include those which arise as a result of the existence of multiple genes, alternative transcription events, alternative RNA splicing events, and alternative translational and postranslational events. In one aspect of the invention, the ~i4 polypeptide is a splice variant of the [34 subunit. In another preferred embodiment, the (34 splice variant is encoded by a nucleic acid molecule identical to the nucleotide sequence of SEQ ID N0:6. The polypeptide can be expressed in systems, e.g., cultured cells, which result in substantially the same postranslational modifications present when expressed (34 is expressed in a native cell, or in systems which result in the omission of postranslational modifications present when expressed in a native cell.
The invention includes an immunogen which includes a ~i4 polypeptide in an immunogenic preparation, the immunogen being capable of eliciting an immune response specific for the (34 polypeptide, e.g., a humoral response, an antibody response, or a cellular response. In preferred embodiments, the immunogen comprising an antigenic determinant, e.g., a unique determinant, from a protein represented by SEQ ID NO: 1.
The present invention also includes an antibody preparation specifically reactive with an epitope of the (34 immunogen or generally of a (34 polypeptide, preferably an epitope which consists all or in part of residues from the amino acid sequence of SEQ
ID NO:1, or an epitope, which when bound to an antibody, results in the modulation of a biological activity.
In preferred embodiments the (34-like polypeptide, as expressed in the cells in which it is normally expressed or in other eukaryotic cells, has an estimated molecular weight of 200 kDa as determined by SDS-PAGE.
In a preferred embodiment, the (34 polypeptide has one or more of the following characteristics:
(i) it has the ability to promote adhesion between connective tissues;
(ii) it has a molecular weight, amino acid composition or other physical characteristic of [34 subunit of SEQ m NO:1;
(iii) it has an overall sequence similarity of at least 50%, preferably at least 65%, more preferably at least 70, 80, 90, or 95%, with a [34 polypeptide of SEQ >D NO:1;
(iv) it can be isolated from human placenta chorionic villi;
(v) it can associate with a3 or y2 subunits;
(vi) it has coiled coils in domains I and II.
(vii) it has at least 5, preferably 6 or 7, and most preferably 8 of the cysteins found in native (34 sequence.
Also included in the invention is a composition which includes a (34 polypeptide (or a nucleic acid which encodes it) and one or more additional components, e.g., a carrier, diluent, or solvent. The additional component can be one which renders the composition for in vitro and in vivo pharmaceutical or veterinary use. Such uses can include aiding in wound healing or promotion of the adhesion of dermal and epidermal cells.
In another aspect, the invention provides an isolated or substantially pure nucleic acid having or comprising a nucleotide sequence which encodes a ~i4 polypeptide, e.g., a (34 polypeptide described herein.
A preferred embodiment of the invention features a nucleic acid molecule having a nucleotide sequence at least about 65% sequence identity to a nucleotide sequence of SEQ ID
N0:2. In other preferred embodiments, the ~i4 polypeptide is encoded by a nucleic acid molecule having a nucleotide sequence with at least 70%, preferably 80%, more preferably about 90% to about 95%, and even more preferably about 99% sequence identity to the nucleotide sequence from SEQ m N0:2. In another preferred embodiment, the (i4 polypeptide is encoded by the nulceic acid molecule of SEQ ID N0:2.
In preferred embodiments, the subject ~i4 nucleic acid will include a transcriptional regulatory sequence, e.g. at least one of a transcriptional promoter or transcriptional enhancer sequence, operably linked to the [34 gene sequence (also referred to as LAMB4), e.g., to render the ~i4 gene sequence suitable for use as an expression vector.
In yet a further preferred embodiment, the nucleic acid which encodes a (34 polypeptide of the invention, hybridizes under stringent conditions to a nucleic acid probe corresponding to at least 12 consecutive nucleotides from SEQ ID N0:2, more preferably to at least 20 consecutive nucleotides from SEQ ID N0:2.
In a preferred embodiment, the nucleic acid differs by at least one nucleotide from a nucleotide sequence of SEQ m N0:2, nucleotides 4686-5870.
The invention also provides a probe or primer which includes or comprises a substantially purified oligonucleotide. The oligonucleotide includes a region of nucleotide sequence which hybridizes under stringent conditions to at least 10 consecutive nucleotides of sense or antisense sequence from SEQ ID NO: 2, or naturally occurring mutants thereof. In preferred embodiments, the probe or primer further includes a label group attached thereto.
The label group can be, e.g., a radioisotope, a fluorescent compound, an enzyme, and/or an enzyme co-factor. Preferably the oligonucleotide is at least 10 and less than 20, 30, 50, 100, or 150 nucleotides in length.
The invention involves nucleic acids, e.g., RNA or DNA, encoding a (34 polypeptide of the invention. This includes double stranded nucleic acids as well as coding and antisense single strands.
In another aspect, the invention features a cell or purified preparation of cells which include a p4 transgene, or which otherwise misexpress a ~4 gene. The cell preparation can consist of human or non human cells, e.g., rodent cells, e.g., mouse or rat cells, rabbit cells, or pig cells. In preferred embodiments, the cell or cells include a (34 transgene, e.g., a heterologous form of a (34 gene, e.g., a gene derived from humans (in the case of a non-human cell). The X34 transgene can be rnisexpressed, e.g., overexpressed or underexpressed.
In other preferred embodiments, the cell or cells include a gene which misexpress an endogenous (34 gene, e.g., a gene the expression of which is disrupted, e.g., a knockout. Such cells can serve as a model for studying disorders which are related to mutated or mis-expressed ~i4 alleles or for use in drug screening.
In another aspect, the invention features a transgenic [34 animal, e.g., a rodent, e.g., a mouse or a rat, a rabbit, a pig, a goat, or a cow. In preferred embodiments, the transgenic animal includes (and preferably express) a heterologous form of a (34 gene, e.g., a gene derived from humans. In a fiutller embodiment, the (34 transgene includes a tissue specific promoter, e.g., a milk-specific promoter. In other preferred embodiments, the animal has an endogenous (34 gene which is misexpressed, e.g., a knockout. Such a transgenic animal can serve as a model for studying disorders which are related to mutated or mis-expressed (34 alleles or for use in drug screening.
In another aspect, the invention features, a method of promoting adhesion of a first tissue element to a second tissue element. The method includes contacting one or both of the first tissue element and the second tissue element with an amount of a laminin molecule described herein, e.g., Iaminin 12, or'y3 (or a Iaminin trimer which inlcudes y3), sufficient to promote adhesion. The method can be performed in vivo, or in vitro. In in vivo methods the laminin is administered to the subject. The administration can be directed to the site where adhesion is desired, e.g., by topical appication or by injection, or administered in a systemic fashion.
A tissue element can be a cell or a multi-cellular on acellular structure.
Examples of tissue elements include, skin cells, e.g., epidermal or dermal cells, neuronal cells, e.g., nerve cells, retinal cells, central or pereipheral nervous system components, basement membrane or components of the basement membrane, or any cell or structure which in normal, non-5 traumatized, or non-diseased tissue is adjascent or adhered to a specific tissue element recited herein.
In preferred embodiments the molecule is exogenous (e.g., administered to a subject) or is recombinant.
In preferred embodiments the method is an vivo method. In vivo methods can be 10 autologous, allogeneic, or xenogeneic. In autologous methods, adhesion between two tissue elements from the subject is promoted. In allogeneic methods, adhesion between a recipient tissue element and a donor tissue element from an allogeneic donor is promoted. In xenogeneic methods, adhesion between a recipient tissue element and a donor tissue element from a xenogeneic donor is promoted. Thus, one element can be a donor tissue element which is implanted into a recipient subject.
In preferred embodiments the first tissue is healthy tissue, e.g., skin tissue, and the second tissue is wounded, e.g., burned, diseased, traumatized, cut, and the tissue, or is a wound bed. For example, the first tissue is skin tissue, from the subject or from a donor, and the second tissue is wounded, e.g., burned or abraided tissue.
In preferred embodiments the first tissue and second tissue element are normally adhered but have become detached from one another due to trauma, burn or other physical injury, disease, or age.
In preferred embodiments: the first tissue element is a dermal cell and the second tissue element is an epidermal cell; the first tissue element is a nerve cell or nerve and the second tissue element is a cell or structure which in normal, non-traumatized, or non-diseased tissue is adjascent or adhered to the nerve cell or nerve; the first tissue element is a retinal cell or retina tissue and the second tissue element is a cell or structure which in normal, non-traumatized, or nvn-diseased tissue is adjascent or adhered to the a retinal cell or retina tissue, the first tissue is a nerve and the second tissue is basement membrane.
The administration of laminin can be repeated.
In another aspect, the invention features a method of promoting wound healing in a subject. The method includes administering an amount of a laminin molecule described herein, e.g., laminin 12, y3 (or a laminin trimer which inlcudes y3), sufficient to promote healing to the wound. The administration can be directed to the site where healing is desired, e.g., by topical appication or by injection, or administered in a systemic fashion.
The wound can be in any tissue, but preferably ina tissue in which the laminin normally occurs. Examples skin, central or peripheral nervous tissue, tissues ofthe eye, e.g., the retinal, the basement membrane, or any tissue which in normal, non-traumatized, or non-diseased tissue is adjascent or adhered thereto.
In preferred embodiments the molecule is exogenous (e.g., administered to a subject) or is recombinant.
In preferred embodiments the wound tissue is burned, diseased, traumatized, cut, the subject of immune attack, e.g, autoimmune attack, or abraided.
The administration of laminin can be repeated.
In another aspect, the invention features a method of promoting nerve growth or regeneration in a subject. The method includes administering an amount of a laminin molecule described herein, e.g., laminin 12, or y3 (or a laminin trimer which inlcudes y3), sufficient to promote nerve growth or regeneration. The administration can be directed to the site where nerve growth or regeneration is desired, e.g., by topical appication or by injection, or administered in a systemic fashion.
In preferred embodiments the molecule is exogenous (e.g., administered to a subject) or is recombinant.
In preferred embodiments the nerve growth or regeneration is promoted at a wound site.
The administration of laminin can be repeated.
In another aspect, the invention provides, a method of determining if a subject is at risk for a disorder related to a lesion in or the misexpression of a gene which encodes a laminin described herein, e.g., y3 or laminin 12.
Such disorders include, e.g., a disorder associated with the misexpression of a laminin, e.g., laminin 12, or misexpression of the y3 subunit; a disorder of the central or peripheral nervous system;
a disorder associated with a genetic lesion at chromosome 9, region q31-34;
Fukuyama-type muscular dystrophy; muscle-eye-brain disease; Walker-Warburg Syndrome (hydrocephalus, ageria, and retinal displasia); a retinal disorder, e.g, retinitis pigmentosa-deafness syndrome {which may be a subtype of Walker-Warburg Syndrome); a disorder associated with abnormal levels, e.g., abnormally low levels, of adhesion between tissues; a disorder associated with the basement membrane; a skin disorder, e.g., an epidermal or dermal, disorder; a disorder associated with the testis, spleen, placenta, thymus, ovary, small intestine, lung, or liver.
The method includes one or more of the following:
detecting, in a tissue of the subject, the presence or absence of a mutation which affects the expression of the y3 gene, or other gene which encodes a subunit of laminin 12, e.g., detecting the presence or absence of a mutation in a region which controls the expression of the gene, e.g., a mutation in the S' control region;
detecting, in a tissue of the subject, the presence or absence of a mutation which alters the structure of the y3 gene, or other gene which encodes a subunit of laminin 12;
detecting, in a tissue of the subject, the misexpression of they3 gene, or other gene which encodes a subunit of laminin 12 at the mRNA level, e.g., detecting a non-wild type level of a y3, or an other laminin 12 subunit mRNA ;
detecting, in a tissue of the subject, the misexpression of the y3 gene, or other gene which encodes a subunit of laminin 12, at the protein level, e.g., detecting a non-wild type level of a y3, or an other laminin 12 subunit polypeptide.
In preferred embodiments the method includes: ascertaining the existence of at least one of: a deletion of one or more nucleotides from the y3 gene, or other gene which encodes a subunit of laminin 12; an insertion of one or more nucleotides into the gene, a point mutation, e.g., a substitution of one or more nucleotides of the gene, a gross chromosomal -rearrangement of the gene, e.g., a translocation, inversion, or deletion.
For example, detecting the genetic lesion can include: (i) providing a probe/primer including an oligonucleotide containing a region of nucleotide sequence which hybridizes to a sense or antisense sequence from SEQ ID N4:4, or naturally occurring mutants thereof or 5' or 3' flanking sequences naturally associated with the LAMG3 gene; (ii) exposing the probe/primer to nucleic acid of the tissue; and detecting, by hybridization, e.g., in situ hybridization, of the probe/primer to the nucleic acid, the presence or absence of the genetic lesion.
I S In preferred embodiments detecting the misexpression includes ascertaining the existence of at least one of an alteration in the level of a messenger RNA
transcript of the y3 gene, or other gene which encodes a subunit of laminin I2; the presence of a non-wild type splicing pattern of a messenger RNA transcript of the y3 gene, or other gene which encodes a subunit of laminin 12; or a non-wild type level of Y3, or other subunit of laminin 12.
Methods of the invention can be used prenatally or to determine if a subject's offspring will be at risk for a disorder.
In preferred embodiments the method includes determining the structure of a y3 gene, or other gene which encodes a subunit of laminin 12, an abnormal structure being indicative of risk for the disorder.
In preferred embodiments the method includes contacting a sample form the subject with an antibody to the laminin protein or a nucleic acid which hybridizes specifically with the y3 gene, or other gene which encodes a subunit of laminin 12.
In another aspect, the invention features, a method of promoting adhesion of a first tissue element to a second tissue element. The method includes contacting one or both of the first tissue element and the second tissue element with an amount of a laminin molecule described herein, e.g., (34, sufficient to promote adhesion. The method can be performed in vivo, or in vitro. In in vivo methods the laminin is administered to the subject. The administration can be directed to the site where adhesion is desired, e.g., by topical application or by injection, or administered in a systemic fashion.
A tissue element can be a cell or a multi-cellular on acellular structure.
Examples of tissue elements include, skin cells, e.g., epidermal or dermal cells, neuronal cells, e.g., nerve cells, retinal cells, central or pereipheral nervous system components, basement membrane or components of the basement membrane, or any cell or structure which in normal, non-traumatized, or non-diseased tissue is adjascent or adhered to a specific tissue element recited herein.
In preferred embodiments the molecule is exogenous (e.g., administered to a subject) or is recombinant.
In preferred embodiments the method is an vivo method. In vivo methods can be autologous, allogeneic, or xenogeneic. In autologous methods, adhesion between two tissue elements from the subject is promoted. In allogeneic methods, adhesion between a recipient tissue element and a donor tissue element from an allogeneic donor is promoted. In xenogeneic methods, adhesion between a recipient tissue element and a donor tissue element from a xenogeneic donor is promoted. Thus, one element can be a donor tissue element which is implanted into a recipient subject.
In preferred embodiments the first tissue is healthy tissue, e.g., skin tissue, and the second tissue is wounded, e.g., burned, diseased, traumatized, cut, and the tissue, or is a wound bed. For example, the first tissue is skin tissue, from the subject or from a donor, and the second tissue is wounded, e.g., burned or abraided tissue.
In preferred embodiments: the first tissue element is a dermal cell and the second tissue element is an epidermal cell; the first tissue element is a nerve cell or nerve and the second tissue element is a cell or structure which in normal, non-traumatized, or non-diseased tissue is adjascent or adhered to the nerve cell or nerve; the first tissue is a nerve and the second tissue is basement membrane.
The administration of laminin can be repeated.
In another aspect, the invention features a method of promoting wound healing in a subject. The method includes administering an amount of a laminin molecule described herein, e.g., ~i4, sufficient to promote healing to the wound. The administration can be directed to the site where healing is desired, e.g., by topical appication or by injection, or administered in a systemic fashion.
The wound can be in any tissue, but preferably in a tissue in which the laminin normally occurs in fetal or adult life. Examples examples include skin the basement membrane.
In preferred embodiments the molecule is exogenous (e.g., administered to a subject) or is recombinant.
In preferred embodiments the wound tissue is burned, diseased, traumatized, cut, the subject of immune attack, e.g, autoimmune attack, or abraded.
The administration of laminin can be repeated.
In another aspect, the invention features a method of promoting tissue growth, development, or regeneration in a subject. The method includes administering an amount of a laminin molecule described herein, e.g., (34, sufficient to promote tissue growth, development, or regeneration in a subject. The administration can be directed to the site where nerve growth or regeneration is desired, e.g., by topical appication or by injection, or administered in a systemic fashion.
In preferred embodiments the molecule is exogenous (e.g., administered to a subject) or is recombinant.
In preferred embodiments the nerve growth or regeneration is promoted at a wound site.
The administration of laminin can be repeated.
In another aspect, the invention provides, a method of determining if a subject is at risk for a disorder related to a lesion in or the misexpression of a laminin molecule described herein, e.g., (34.
Such disorders include, e.g., a disorder associated with the misexpression of a laminin, e.g., ~i 4; a disorder associated with a genetic lesion at chromosome region 7q22-q31.2; a developmetnal disorder; a disorder associated with abnormal levels, e.g., abnormally low levels, of adhesion between tissues; a disorder associated with the basement membrane; a skin disorder, e.g., an epidermal or dermal, disorder.
The method includes one or more of the following:
detecting, in a tissue of the subject, the presence or absence of a mutation which affects the expression of the ~i4 gene, e.g, detecting the presence or absence of a mutation in a region which controls the expression of the gene, e.g., a mutation in the 5' control region;
detecting, in a tissue of the subject, the presence or absence of a mutation which alters the structure of the ~i4 gene;
detecting, in a tissue of the subject, the misexpression of the (34 gene, e.g., detecting a non-wild type level of a (34 mRNA ;
detecting, in a tissue of the subject, the misexpression of the (34, at the protein level, e.g., detecting a non-wild type level of a (34 polypeptide.
In preferred embodiments the method includes: ascertaining the existence of at least one of a deletion of one or more nucleotides from the (34; an insertion of one or more nucleotides into the gene, a point mutation, e.g., a substitution of one or more nucleotides of the (34 gene, a gross chromosomal rearrangement of the (34 gene, e.g., a translocation, inversion, or deletion.
For example, detecting the genetic lesion can include: (i) providing a probe/primer including an oligonucleotide containing a region of nucleotide sequence which hybridizes to a sense or antisense sequence from SEQ ID N0:2, or naturally occurring mutants thereof or 5' or 3' flanking sequences naturally associated with the LAMB4 gene; (ii) exposing the probelprimer to nucleic acid of the tissue; and detecting, by hybridization, e.g., in situ hybridization, of the probe/primer to the nucleic acid, the presence or absence of the genetic lesion.
In preferred embodiments: detecting the misexpression includes ascertaining the existence of at least one of an alteration in the level of a messenger RNA
transcript of the (3 4; the presence of a non-wild type splicing pattern of a messenger RNA
transcript of the (34;
or a non-wild type level of (34.
Methods of the invention can be used prenatally or to determine if a subject's offspring will be at risk for a disorder.
5 In preferred embodiments the method includes determining the structure of the a ~i4, an abnormal structure being indicative of risk for the disorder.
In preferred embodiments the method includes contacting a sample form the subject with an antibody to the (34 protein or a nucleic acid which hybridizes specifically with the ~i4.
In another aspect, the invention features, a method of evaluating a compound for the 10 ability to interact with, e.g., bind, a subject laminin polypeptide, e.g., laminin 12, y3, a laminin trimer which inlcudes ~3, ~i4, or a laminin trimer which includes (34.
The method includes: contacting the compound with the subject laminin polypeptide; and evaluating ability of the compound to interact with, e.g., to bind or form a complex with the subject laminin polypeptide. This method can be performed in vitro, e.g., in a cell free system, or in 15 vivo, e.g., in a two-hybrid interaction trap assay. This method can be used to identify naturally occurring molecules which interact with subject laminin polypeptide.
It can also be used to find natural or synthetic inhibitors of subject laminin polypeptide.
In another aspect, the invention features, a method of evaluating a compound, e.g., a polypeptide, e.g., a naturally occurring ligand of or a naturally occuring substrate to which binds a subject laminin polypeptide, e.g., of laminin 12, y3, a laminin trimer which inlcudes y 3, (34, or a laminin trimer which includes (34, for the ability to bind a subject laminin polypeptide. The method includes: contacting the compound with the subject laminin polypeptide; and evaluating the ability of the compound to interact with, e.g., to bind or form a complex with the subject laminin polypeptide, e.g., the ability of the compound to inhibit a subject laminin polypeptide/ligand interaction. This method can be performed in vitro, e.g., in a cell free system, or in vivo, e.g., in a two-hybrid interaction trap assay. This method can be used to identify compounds, e.g., fragments or analogs of a subject laminin polypeptide, which are agonists or antagonists of a subject laminin polypeptide.
In another aspect, the invention features, a method of evaluating a first compound, e.g., a subject laminin polypeptide, e.g., laminin 12, y3, a laminin trimer which inlcudes Y3, (3 4, or a laminin trimer which includes (34, for the ability to bind a second compound, e.g., a second polypeptide, e.g., a naturally occurring ligand of or substrate to which binds a subject laminin polypeptide. The method includes: contacting the first compound with the second compound; and evaluating the ability of the first compound to form a complex with the second compound. This method can be performed in vitro, e.g., in a cell free system, or in vivo, e.g., in a two-hybrid interaction trap assay. This method can be used to identify compounds, e.g., fragments or analogs of a subject laminin polypeptide, which are agonists or antagonists of a subject laminin polypeptide.
In yet another aspect, the invention features a method for evaluating a compound, e.g., for the ability to modulate an interaction, e.g., the ability to inhibit an interaction of a subject laminin polypeptide, e.g., of laminin 12, y3, a laminin trimer which inlcudes y3, (34, or a laminin trimer which includes [34, with a second polypeptide, e.g., a polypeptide, e.g., a natural ligand of the of or a substrate wo which binds a subject laminin polypeptide, or a fragment thereof. The method includes the steps of (i) combining the second polypeptide (or preferably a purified preparation thereof), a subject laminin polypeptide, (or preferably a purified preparation thereof), and a compound, e.g., under conditions wherein in the absence of the compound, the second polypeptide, and the subject laminin polypeptide, are able to interact, e.g., to bind or form a complex; and (ii) detecting the interaction, e.g., detecting the formation (or dissolution) of a complex which includes the second polypeptide, and the subject laminin polypeptide. A change, e.g., a decrease or increase, in the formation of the complex in the presence of a compound (relative to what is seen in the absence of the compound)-is indicative of a modulation, e.g., an inhibition or promotion, of the interaction between the second polypeptide, and the subject laminin polypeptide. In preferred embodiments: the second polypeptide, and the subject laminin polypeptide, are combined in a cell-free system and contacted with the compound; the cell-free system is selected from a group consisting of a cell lysate and a reconstituted protein mixture; the subject laminin polypeptide, and the second polypeptide are simultaneously expressed in a cell, and the cell is contacted with the compound, e.g. in an interaction trap assay (e.g., a two-hybrid assay).
In yet another aspect, the invention features a two-phase method (e.g., a method having an in vitro, e.g., in a cell free system, and an in vivo phase) for evaluating a compound, e.g., for the ability to modulate, e.g., to inhibit or promote, an interaction of a subject laminin polypeptide subject laminin polypeptide, e.g., of laminin 12, y3, a laminin trimer which inlcudes y3, (34, or a laminin trimer which includes [34, with a second compound, e.g., a second polypeptide, e.g., a naturally occurring ligand of or a substrate to which binds a subject laminin polypeptide, or a fragment thereof. The method includes steps (i) and (ii) of the method described immediately above performed in vitro, and further includes: (iii) determining if the compound modulates the interaction in vitro, e.g., in a cell free system, and if so; (iv) administering the compound to a cell or animal;
and (v) evaluating the in vivo effect of the compound on an interaction, e.g., inhibition, of a subject laminin polypeptide, with a second polypeptide.
In another aspect, the invention features, a method of evaluating a compound for the ability to bind a nucleic acid encoding a subject laminin polypeptide, e.g., a laminin 12, y3, a laminin trimer which inlcudes y3, (34, or a laminin trimer which includes (34 polypeptide regulatory sequence. The method includes: contacting the compound with the nucleic acid;
and evaluating ability of the compound to form a complex with the nucleic acid.
In another aspect, the invention features a method of making a y3 or [34 polypeptide, e.g., a peptide having a non-wild type activity, e.g., an antagonist, agonist, or super agonist of a naturally occurring y3 or ~i4 polypeptide, e.g., a naturally occurring Y3 or X34 polypeptide.
The method includes: altering the sequence of a y3 or ~i4 polypeptide, e.g., altering the sequence , e.g., by substitution or deletion of one or more residues of a non-conserved region, a domain or residue disclosed herein, and testing the altered polypeptide for the desired activity.
In another aspect, the invention features a method of making a fragment or analog of a y3 or (34 polypeptide having a biological activity of a naturally occurring y3 or [34 polypeptide. The method includes: altering the sequence, e.g., by substitution or deletion of one or more residues, of a y3 or (34 polypeptide, e.g., altering the sequence of a non-conserved region, or a domain or residue described herein, and testing the altered polypeptide for the desired activity.
In another aspect, the invention features, a human cell, e.g., a hematopoietic stem cell, transformed with nucleic acid which encodes a subject laminin polypeptide, e.g., a laminin 12, y3, a laminin trimer which inlcudes y3, (34, or a laminin trimer which includes ~i4.
In another aspect, the invention includes: a y3, ~i4 nucleic acid, e.g., a y3, /34 nucleic acid inserted into a vector; a cell transformed with a y3, ~i4 nucleic acid; a y3, (34 made by culturing a cell transformed with a y3, (34 nucleic acid; and a method of making a y3, [34 polypeptide including culturing a a cell transformed with a y3, (34 nucleic acid.
The inventors have shown that y3 forms laminin 12'in association with a2 and ail.
However, we are unsure of the chain associations of y3 within other tissues.
It is very likely that y3 can also associate with y3, a3, a4, and a5; with (32, ~i3, [34 and (35. Therefore, our results predict 25 new laminins: laminins 12-37. y3 and [i4 polypetides of the invention can be expressed with, assembled with, or administered with other laminin subunits in any of the methods described herein. E.g., y3 can be assembled with an a and a (3 subunit to form a laminin trimer. [34 can be assembled with an a and a (3 subunit to form a laminin trimer.
In any treatment or therapeutic application which administers y3, a (32 subunit can also be administered.
A "heterologous promoter", as used herein is a promoter which is not naturally associated with a gene or a purified nucleic acid.
A "purified"or "substantially pure" or isolated "preparation" of a polypeptide, as used herein, means a polypeptide that has been separated from other proteins, lipids, and nucleic acids with which it naturally occurs. Preferably, the polypeptide is also separated from substances, e.g., antibodies or gel matrix, e.g., polyacrylamide, which are used to purify it.
Preferably, the polypeptide constitutes at least 10, 20, 50 70, 80 or 95% dry weight of the purified preparation. Preferably, the preparation contains: sufficient polypeptide to allow protein sequencing; at least 1, 10, or 100 ~,g of the polypeptide; at least 1, 10, or 100 mg of the polypeptide.
A "purified preparation of cells", as used herein, refers to, in the case of plant or animal cells, an in vitro preparation of cells and not an entire intact plant or animal. In the case of cultured cells or microbial cells, it consists of a preparation of at least 10% and more preferably 50% of the subject cells.
A "treatment", as used herein, includes any therapeutic treatment, e.g., the administration of a therapeutic agent or substance, e.g., a drug.
An "isolated" or " pure nucleic acid", e.g., a substantially pure DNA, is a nucleic acid which is one or both of not immediately contiguous with either one or both of the sequences, e.g., coding sequences, with which it is immediately contiguous (i.e., one at the 5' end and one at the 3' end) in the naturally-occurring genome of the organism from which the nucleic acid is derived; or which is substantially free of a nucleic acid sequence with which it occurs in the organism from which the nucleic acid is derived. The term includes, for example, a recombinant DNA which is incorporated into a vector, e.g., into an autonomously replicating plasmid or virus, or into the genomic DNA of a prokaryote or eukaryote, or which exists as a separate molecule (e.g., a cDNA or a genomic DNA fragment produced by PCR or restriction endonuclease treatment) independent of other DNA sequences. Substantially pure DNA can also includes a recombinant DNA which is part of a hybrid gene encoding sequence.
"Sequence identity or homology", as used herein, refers to the sequence similarity between two polypeptide molecules or between two nucleic acid molecules. When a position in both of the two compared sequences is occupied by the same base or amino acid monomer subunit, e.g., if a position in each of two DNA molecules is occupied by adenine, then the molecules are homologous or sequence identical at that position. The percent of homology or sequence identity between two sequences is a function of the number of matching or homologous identical positions shared by the two sequences divided by the number of positions compared x 100. For example, if 6 of 10, of the positions in two sequences are the same then the two sequences are 60% homologous or have 60% sequence identity.
By way of example, the DNA sequences ATTGCC and TATGGC share 50% homology or sequence identity. Generally, a comparison is made when two sequences are aligned to give maximum homology.
The tenors "peptides", "proteins", and "polypeptides" are used interchangeably herein.
As used herein, the term "transgene" means a nucleic acid sequence {encoding, e.g., one or more subject laminin polypeptides), which is partly or entirely heterologous, i.e., foreign, to the transgenic animal or cell into which it is introduced, or, is homologous to an endogenous gene of the transgenic animal or cell into which it is introduced, but which is designed to be inserted, or is inserted, into the animal's genome in such a way as to alter the genome of the cell into which it is inserted (e.g., it is inserted at a location which differs from that of the natural gene or its insertion results in a knockout). A transgene can include one or more transcriptional regulatory sequences and any other nucleic acid, such as introns, that may be necessary for optimal expression of the selected nucleic acid, all operably linked to the selected nucleic acid, and may include an enhancer sequence.
As used herein, the term "transgenic cell" refers to a cell containing a transgene.
As used herein, a "transgenic animal" is any animal in which one or more, and preferably essentially all, of the cells of the animal includes a transgene.
The transgene can be introduced into the cell, directly or indirectly by introduction into a precursor of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with a recombinant virus. This molecule may be integrated within a chromosome, or it may be extrachromosomally replicating DNA. ~ -As used herein, the term "tissue-specific promoter" means a DNA sequence that serves as a promoter, i.e., regulates expression of a selected DNA sequence operably linked to the promoter, and which effects expression of the selected DNA
sequence in specific cells of a tissue, such as mammary tissue. The term also covers so-called "leaky"
promoters, which regulate expression of a selected DNA primarily in one tissue, but cause expression in other tissues as well.
"Unrelated to a y3 or (34 amino acid or nucleic acid sequence" means having less than 30% sequence identity, less than 20% sequence identity, or, preferably, less than 10%
homology with a naturally occuring y3 or (34 sequence disclosed herein.
A polypeptide has y3 biological activity if it has one or more of the properties of y3 disclosed herein. A polypeptide has biological activity if it is an antagonist, agonist, or super-agonist of a polypeptide having one of the properties of y3 disclosed herein.
A polypeptide has (34 biological activity if it has one or more of the properties of [34 disclosed herein. A polypeptide has biological activity if it is an antagonist, agonist, or super-agonist of a polypeptide having one of the properties of ~i4 disclosed herein.
"Misexpression", as used herein, refers to a non-wild type pattern of gene expression, at the RNA or protein level. It includes: expression at non-wild type levels, i.e., over or under expression; a pattern of expression that differs from wild type in terms of the time or stage at which the gene is expressed, e.g., increased or decreased expression (as compared with wild type) at a predetermined developmental period or stage; a pattern of expression that differs from wild type in terms of decreased expression (as compared with wild type) in a predetermined cell type or tissue type; a pattern of expression that differs from wild type in terms of the splicing size, amino acid sequence, post-transitional modification, or biological activity of the expressed polypeptide; a pattern of expression that differs from wild type in terms of the effect of an environmental stimulus or extracellular stimulus on expression of the gene, e.g., a pattern of increased or decreased expression (as compared with wild type) in the presence of an increase or decrease in the strength of the stimulus.
Subject, as used herein, can refer to a mammal, e.g., a human, or to an experimental or animal or disease model. The subject can also be a non-human animal, e.g., a horse, cow, goat, or other domestic animal.
As described herein, one aspect of the invention features a substantially pure (or recombinant) nucleic acid which includes a nucleotide sequence encoding a Y3 or ~i4 polypeptide and/or equivalents of such nucleic acids. The term nucleic acid as used herein can include fragments and equivalents. The term equivalent refers to nucleotide sequences encoding functionally equivalent polypeptides. Equivalent nucleotide sequences will include sequences that differ by one or more nucleotide substitutions, additions or deletions, such as 5 allelic variants, and include sequences that differ from the nucleotide sequences disclosed herein by degeneracy of the genetic code.
The practice of the present invention will employ, unless otherwise indicated, conventional techniques of cell biology, cell culture, molecular biology, transgenic biology, microbiology, recombinant DNA, and immunology, which are within the skill of the art.
10 Such techniques are described in the literature. See, for example, Molecular Cloning A
Laboratory Manual, 2nd Ed., ed. by Sambrook, Fritsch and Maniatis (Cold Spring Harbor Laboratory Press: 1989); DNA Cloning, Volumes I and II (D. N. Glover ed., 1985);
Oligonucleotide Synthesis (M. J. Gait ed., 1984); Mullis et al. U.S. Patent No: 4,683,195;
Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. 1984);
Transcription And 15 Translation (B. D. Hames & S. J. Higgins eds. 1984); Culture Of Animal Cells (R. I.
Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells And Enzymes (IRL Press, 1986); B.
Perbal, A Practical Guide To Molecular Cloning (1984); the treatise, Methods In Enzymology {Academic Press, Inc., N.Y.); Gene Transfer Vectors For Mammalian Cells (J. H.
Miller and M. P. Calos eds., 1987, Cold Spring Harbor Laboratory); Methods In Enzymolo~r, Vols. 154 20 and 155 (Wu et al. eds.), Immunochemical Methods In Cell And Molecular Biology (Mayer and Walker, eds., Academic Press, London, 1987); Handbook Of Experimental Immunology, Volumes I-IV (D. M. Weir and C. C. Blackwell, eds., 1986); Manipulating the Mouse Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986).
Other features and advantages of the invention will be apparent from the following detailed description, and from the claims.
DETAILED DESCRIPTION
The drawings are briefly described.
Figure 1 depicts the cDNA sequence for human a2 subunit.
Figure 2 depicts the predicted amino acid sequence for human a2 subunit.
Figure 3 depicts the cDNA sequence for human ø4 subunit.
Figure 4 depicts the predicted amino acid sequence for human ø4 subunit.
Figure 5 depicts an alignment of the amino acid sequence of human ø4 of SEQ ID
NO: 1 and ø4 splice varient of SEQ ID N0:5 and laminin ø1, ø2, and ø3 subunits.
Figure 6 provides a comparision of the similarities of laminin ø4 domains with the domains of other known laminin ø subunits.
WO 99/19348 PC'T/US98/21391 Lanvnin 12 was isolated from human placental chorionic villi. Briefly, human chorionic placental villi were frozen in liquid nitrogen, ground in a Waring blender and washed in 1 M NaCI. The final tissue pellet (200g, wet weight) was suspended in 1 L of extraction buffer (50 mM Tris-HCl 50 mM, pH=7.8; NaCI O.SM, EDTA IOmM, 625 mg/1 of N-ethylmaleimide, 150 mg/1 of phenyhnethylsulphonyl fluoride. The suspension was incubated at 4°C with stirring for 48 h. Unless otherwise noted, all subsequent steps were performed at 4oC. The soluble fraction was collected following centrifugation (30000 x g, 60 min) and precipitated by 300g11 of Ammonium Sulfate. The precipitated proteins were collected by centrifugation (30000 x g, 60 min) and redissolved into chromatography buffer (2M Urea, 25 mM NaCI, 5 mM EDTA, and 50 mM Tris-HCI, pH=7.8). The sample was then dialyzed against the same buffer. Following dialysis, 0.5 volumes of buffer equilibrated DEAE-cellulose (DE-52, Whatman) was added and the mixture shaken overnight.
Material not bound to DEAF-cellulose was collected by filtration on a Buchner fimnel (Whatman filter 4) and precipitated by addition of 300g/1 of ammonium sulfate. The proteins were collected by centrifugation (30000 x g, 60 min), redissolved in the Concanavalin-A
buffer (0.5 M
NaCI, 5 mM CaCl2, 5 mM MgCl2, and Tris-HCl 50 mM, pH=7.8) and dialyzed against the same buffer overnight. The fraction was applied to a 2.5 x 5 cm Concanavalin-A
sepharose column (Pharmacia), and unbound material was removed by extensive washing.
Bound proteins were first eluted with 10 mM a-D- Mannopyrannoside (Sigma, St. Louis, MO) and secondly with 1 M a-D-Glucopyrannoside (Sigma, St. Louis, MO). A third elution with 1M
a-D-Manno-pyrannoside (Sigma, St. Louis, MO) allowed the recovery of the proteins of interest. Each fraction was independently concentrated to 10 ml on a AmiconT""
concentrator (30 kDa membrane) and applied to a 2.5 x 100 cm Sephacryl S-500 column in a 0.5 M NaCI, 50 mM Tris-HCI, pH=7.8 buffer. The fractions of interest were pooled, dialyzed against Mono-Q buffer (0.1 M NaCI, 25 mM Tris-HCI, pH=7.8) and applied to the 1 x 5 cm Mono-Q
column (Pharmacia). Elution was achieved with a 60 ml 0.1-0.5 M NaCI gradient.
The final fraction of interest resulting from the above protocol contains multiple laminins. The laminin 12 was resolved from this mixture by SDS-PAGE (3-5%
polyacrylamide) under non-reducing conditions. Six band were resolved. Only the bands at approximately 560 kDa and at the top of the gel were shown to be reactive with polyclonal anti-laminin antiserum (Sigma, St. Louis, MO).
Laminin 12 was excised, equilibrated and reduced in 10°b 2-me SDS-PAGE
sample buffer, and resolved by 5 % SDS-PAGE. Three bands were resolved, which were approximately 205 kDa, 185 kDa, and 170 kDa. The band at 185 kDa reacted with monoclonal antibody 545, specific to the laminin ~i 1 subunit. Each of the three bands were digested with trypsin and the peptides were resolved by HPLC. The selected resolves were subject to peptide sequencing.
Protein sequencing was done according to Aebersold et al. (1987). The complex laminin 5-laminin 7 was run on a polyacrylamide gel in the presence of 2-mercaptoethanol and blotted onto a nitrocellulose membrane (Biorad). The 190 kDa band of ~i2 and the 165 kDa a3 band were separately excised and digested by protease trypsin. The digested product was separated by HPLC and one fragment was sequenced on an Applied Biosystems sequenator (Applied Biosystems, Foster City, CA). The 205 kDa chain contained a sequence identical to human laminin a2, and was thus identified as human laminin a2 subunit. The 185 kDa produced two peptides identical to human (31, and was thus identified as human laminin ~i 1 subunit. The band at 170 kDa contained three sequences not contained in any known laminin chain. A N-terminal sequence of the 170 kDa chain was also determined. In addition, the N-terminal sequence was not identical to any known laminin sequence.
The cDNA sequences of human yl and r2 were used to probe the National Center for Biomedical Information (NCBI) dBestTM data base by BLAST search and a clone was isolated that was homologous, but not identical to yl and y2. This clone was extended by PCR at the 5' end using Marathon cDNA from human placenta from Clonetech (Palo Alto, CA). The resulting sequence was determined to be 100% identical to all three of the 170 kDa band peptide sequences.
Comparison of the nucleotide sequence of the isolated y3 subunit to yl, demonstrated about 80 % sequence identity.
The human cDNA encoding y3, which is approximately 4710 nucleotides in length, encodes a protein having an estimated molecular weight of approximately 146 kDa (including post-translational modifications) and which is approximately 1570 amino acid residues in length. The human y3 protein contains a nidogen-binding domain, which can be found, for example, from about amino acids 750-755 of SEQ ID N0:3. The y3 amino acid sequence and the nucleotide sequence encoding human laminin y3 is shown in SEQ ID N0:3 and SEQ ID
N0:4, respectively.
By Northern analysis the size of the y3 mRNA is approximately 5 kb, which is consistent with other laminin y subunits. The y3 mRNA transcript is expressed in human tissues including spleen, testis, brain, placenta, lung, and possibly liver.
Chromosomal mapping using the y3 cDNA sequence indicates that the human y3 gene is located on WO 99/19348 PCT/US98lZ1391 chromosome 9q31-34. The location of y3 on chromosome 9 was confirmed by FISH
analysis using a 1.3 kb y3 cDNA probe within the predicted domains I and II, which are~the regions of the least sequence identity among y subunits. Four human genes associated with Walker-Walburg syndrome, Fukuyama muscular dystrophy, retinitis pigmentosa-deafness syndrome and Eye, Muscle, Brain disease have also been mapped to chromosome 9q31-34.
Production of a y3 specific antibody and tissue lo~~alization of ~
The 170 kDa (y3) chain was excised from the reducing SDS-PAGE gel described above and inj ected into a rabbit for antibody production. The resulting serum (rabbit 16) was evaluated by Western analysis and shown to react with the 170 kDa y3 chain, and showed minor crossreactivity with other laminin chains.
Using immunofluorescence, this antiserum shows localization of y3 to the following tissue areas: 1) sites of insertions of nerves into the dermal-epidermal junction basement membrane of human skin; 2) the inner nuclear layers, outer nuclear layers, and outer limiting membranes of human, mouse and rat neural retina; 3) the Purkinje cells, and molecular layers, and (perhaps) the glial cells of the mouse and rat cerebellum; 4) the neuromuscular junctions of skeletal muscle; and, S) the taste buds of the cow tongue.
The y3 was also shown to colocalize with protein ubiquitin carboxy terminal hydrolase I using antibody pGp 9.5. The y3 subunit also appears to colocalize with the a2 subunit in the same tissue sections.
Isolation and Sequencing of cDNA encodLng Q4 The initial 350 by fragment of human laminin (34 cDNA was amplified by touchdown RT-PCR from cultured human keratinocyte total RNA using nested primers made from the published chicken laminin (3 x 503 by cDNA sequence (as described in Ybot-Gonzalez et al. (1995)). Subsequent cDNA clones were isolated by nested PCR
directly from a human placenta cDNA library packaged in lambda-gtl 1 (Clontech, Palo Alto, CA) or by nested PCR directly from human placenta Marathon-Ready cDNA (Clontech, Palo Alto, CA).
The 5' end of the cDNA was cloned using the 5'-RACE technique from human placenta total RNA. The Expanded Long Template PCR System (Boehringer Mannheim Biochemicals, Indianapolis, Il~ was used for all PCR reactions. The PCR products were ligated into the pCR2.1 vector (Invitrogen, San Diego, CA) and recombinant plasmids purified for sequencing using the QIAprepT"" kit (Qiagen). The DNA sequence was determined using either the Sequenase version 2.0 DNA Sequencing Kit (Amersham) and 35S-dATP or the Thermo Sequenase Radiolabeled Terminator Cycle Sequencing kit (Amersham) and 33p_ ddNTPs. At least two independent cDNA subclones were sequenced to rule out Taq polymerase-generated nucleotide substitutions. In some cases, PCR product bands were sequenced directly by cycle sequencing after excision from a TAE-EtBr agarose gel and purification using QIAquick Gel Extraction kit (Qiagen).
WO 99/19348 PCT/US98l21391 The human cDNA encoding a long form (34, which is approximately 5.87 kb, encodes a protein having an estimated molecular weight of approximately 200 kDa and which is approximately 1761 amino acid residues in length. The human (34 protein retains the highest amino acid sequence identity with domains VI and V, which can be found, for example, from about amino acids 221-262 and about 263-535 of SEQ ID NO:1. In addition, a short form, splice variant of (34, which is approximately 3.84 kb and an estimated molecular weight of 120 kDa, was also isolated. The splice variant has 132 nucleotide sequence identical to the long form of [34, with the sequence diverging at nucleotide 3375 and spliced into a unique 3' I 5 untranslated region. The short form cDNA encodes a truncated (34 subunit which contains only the short arm of the (34 subunit and is missing the domains necessary for heterodimerization. The (34 amino acid sequence and the nucleotide sequence encoding human laminin (34 is shown in SEQ ID NO:1 and SEQ ID N0:2, respectively.
Northern analysis was performed using total RNA prepared from JAR cell, cultured human keratinocytes and human placenta using either Trizol (Gibco BRL, Bethesda, MD) or RNeasyT"" (Qiagen) which was denatured, separated on a formaldehyde agarose gel and blotted onto nitrocellulose according to standard protocols (Sambrook, et al., 1989). In addition, A human multiple tissue northern blot (Clontech, Palo Alto, CA) and Human Northern Terntory normal tissue blots and custom fetal skin northern blot (Invitrogen, San Diego, CA) were used. Hybridization and washing were performed using NorthernMAXT""
buffer system (Ambion) by manufacturer's recommended protocols. 32P-dCTP-labelled probes were generated from gel-purified restriction fragments using RediprimeT"' random primer labeling kit (Amersham). 32P-UTP-labelled antisense RNA probes were generated using the RNA transcription kit (Stratagene, La Jolla, CA) from cDNAs subcloned into Bluescript II KS+ (Stratagene, La Jolla, CA).
Northern blotting showed that human laminin ~i4 is expressed in JAR cells, derived from undeveloped chronic villi and in placenta. By RT-PCR, it is also expressed in cultured keratinocytes. Using a northern blot of human fetal skin developmental progression, (34 subunit (long form) demonstrates strong expression at week twelve of fetal development and persists until birth, but expression is barely detectable in adult skin. The ~i4 splice variant, however, is expressed in various tissues including adult heart, brain, lung, liver, skeletal muscle, kidney, spleen, stomach, esophagus, intestine, colon, uterus, bladder, adipose tissue and pancreas. Chromosomal mapping with a (34 cDNA probe indicates that the human (34 subunit is located at locus 7q22-q31.2. The gene encoding (31 is located near, but not on, this position of chromosome 7. Statistical analysis of the mapping data using markers for ~i 1 and (34 suggest that the gene encoding ~i 1 is linked to both ends of the gene encoding ~i4. In addition, neonatal cutis laxa with manifold phenotype has been mapped near, but not in the same position, as the gene encoding (34.
In situ hybridization to wounded human skin grafted into nude mice suggests that laminin ~3 x is expressed in the dermis underneath the migrating epidermal tongues during wound closure.
A GenBankTM search using the human nucleotide sequence encoding [34 as shown in SEQ ID N0:3 revealed an EST, which corresponds to nucleotides 4686-5870 of the human 10 nucleotide sequence encoding (34 depicted in SEQ ID N0:3. Alignment of cDNA
encoding ~i4 with the genes encoding human laminin X31 and laminin (32 shows 61 % and 59% sequence identity, respectively, as shown in Figure 5.
Production of a (~4 specific antibodjr and ti ue local'~a ion of j~
15 Antibodies were raised in rabbits against a 26 kDa bacterial fusion protein which corresponds to the 175 amino acid residues of domain VI (e.g., from about amino acid residues 221-262) of SEQ ID NO:1. Briefly the fusion protein was made by PCR
amplification of nucleotides 302-785 of the cDNA encoding ~i4 using adapter primers and cloned in-frame into the NdeI and SacII sites of pET-15b (Novagen). The fusion protein 20 construct was confirmed by restriction mapping and DNA sequencing.
Expression of the fusion protein was induced and separated from E. coli proteins using reducing SDS-PAGE.
Bands corresponding to the fusion protein were excised from the gel, equilibrated and homogenized using Freud's adjuvant. The same fusion protein was also western blotted on nitrocellulose, dissolved in DMSO and used to immunize mice for monoclonal antibody 25 production.
The polyclonal antisera raised in mice against the fusion protein reacted well with (34, as well as, (31 and (32 polypeptides.
The (34 subunit contains six domains, and a interruption and a signal peptide.
The signal peptide and domain VI can be found, for example, at about amino acid residues 1-262 of SEQ ID NO:1. Domain V can be found, for example, at about amino acid residues 263-535 of SEQ ID NO:1. Domains IV and III can be found, for example, at about amino acid residues 536-767 and 768-1178 of SEQ ID NO:1, respectively. Domain I can be found, for example, at about amino acid residues 1409-1761 of SEQ ID NO:I.
The (34 suburut (long form) is most similar in size and domain structure to laminin (31 with an amino acid sequence identity of 42.5%. (34 retains the highest levels of amino acid identity with the other laminin ~3 subunits in domains VI and V, and the lowest levels in domains I and II, as shown in Figure 6. Using the MulticoilT"" program, it was determined that only domains I and II of (34 have a high probability of forming coiled coil structures.
Domains I and II of ~i4 look most similar to human (33. Both [34 and (i3 are epithelial and the coiled coil structures in domains I and II dictate the a and y subunits with which the ~i subunits are associated. Thus, it is likely that /34 associates with a3 and y2, as does the laminin (33 subunit.
The cDNA encoding the splice variant of ~i4 contains only the short arm of the (34 subunit, and is missing the EGF repeat of domain III, as shown in Figure 5.
Thus, the (34 polypeptide encoded by the [34 c DNA splice variant is missing the coiled coil structures in domains I and II, rendering the short subunit unable to associate into a laminin heterotrimer.
PCR amplification of human genomic DNA suggest that the exon which encodes the alternative short form 3' untranslated region is located downstream from the carboxyl-most common exon, exon 23, and is splices out of the (34 subunit, long form, by exon skipping.
Analogs o~r3 and ~
Analogs can differ from naturally occurring y3 or [34 in amino acid sequence or in ways that do not involve sequence, or both. Non-sequence modifications include in vivo or in vitro chemical derivatization of y3 or ~i4. Non-sequence modifications include changes in acetylation, methylation, phosphorylation, carboxylation, or glycosylation.
Preferred analogs include y3 or (34 (or biologically active fiagments thereof) whose sequences differ from the wild-type sequence by one or more conservative amino acid substitutions or by one or more non-conservative amino acid substitutions, deletions, or insertions which do not abolish the Y3 or (34 biological activity.
Conservative substitutions typically include the substitution of one amino acid for another with similar characteristics, e.g., substitutions within the following groups: valine, glycine; glycine, alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid; asparagine, glutamine;
serine, threonine;
lysine, arginine; and phenylalanine, tyrosine. Other conservative substitutions can be taken from the table below.
CONSERVATIVE AMINO ACID REPLACEMENTS
For Amino Acid Code Replace with any of Alanine A D-Ala, Gly, beta-Ala, L-Cys, D-Cys Arginine R D-Arg, Lys, D-Lys, homo-Arg, D-homo-Arg, Met, Ile, D-Met, D-Ile, Orn, D-Orn Asparagine N D-Asn, Asp, D-Asp, Glu, D-Glu, Gln, D-Gln Aspartic Acid D D-Asp, D-Asn, Asn, Glu, D-Glu, Gln, D-Gln Cysteine C D-Cys, S-Me-Cys, Met, D-Met, Thr, D-Thr Glutamine Q D-Gln, Asn, D-Asn, Glu, D-Glu, Asp, D-Asp Glutamic Acid E D-GIu, D-Asp, Asp, Asn, D-Asn, Gln, D-Gln Glycine G Ala, D-Ala, Pro, D-Pro, (3-Ala, Acp ~i Isoleucine I D-Ile, VaI, D-Val, Leu, D-Leu, Met, D-Met I Leucine L D-Leu, Val, D-Val, Leu, D-Leu, Met, D-Met Lysine K D-Lys, Arg, D-Arg, homo-Arg, D-homo-Arg, Met, D-Met, Ile, D-Ile, Orn, D-Orn Methionine M D-Met, S-Me-Cys, Ile, D-Ile, Leu, D-Leu, Val, D-Val Phenylalanine F D-Phe, Tyr, D-Thr, L-Dopa, His, D-His, Trp, D-Trp, Trans-3,4, or 5-phenylproline, cis-3,4, or 5-phenylproline Proline P D-Pro, L-I-thioazolidine-4-carboxylic acid, D-or L-1-oxazolidine-4-carboxylic acid Serine S D-Ser, Thr, D-Thr, allo-Thr, Met, D-Met, Met(O), D-Met(O), L-Cys, D-Cys Threonine . T D-Thr, Ser, D-Ser, allo-Thr, Met, D-Met, Met(O), D-Met(O), Val, D-Val Tyrosine Y D-Tyr, Phe, D-Phe, L-Dopa, His, D-His Valine V D-Val, Leu, D-Leu, Ile, D-Ile, Met, D-Met Other analogs within the invention are those with modifications which increase peptide stability; such analogs may contain, for example, one or more non-peptide bonds (which replace the peptide bonds) in the peptide sequence. Also included are:
analogs that include residues other than naturally occurring L-amino acids, e.g., D-amino acids or non-naturally occurring or synthetic amino acids, e.g., [i or y amino acids; and cyclic analogs.
Geny Theranv zs The gene constructs of the invention can also be used as a part of a gene therapy protocol to deliver nucleic acids encoding either an agonistic or antagonistic form of a Y3 ar (3 4 polypeptide. The invention features expression vectors for in vivo transfection and expression of a y3 or ~i4 polypeptide in particular cell types so as to reconstitute the function of, or alternatively, antagonize the function of y3 or (i4 polypeptide in a cell in which that polypeptide is misexpressed. Expression constructs of y3 or (34 polypeptides, may be administered in any biologically effective carrier, e.g. any formulation or composition capable of effectively delivering the Y3 or ~i4 gene to cells in vivo.
Approaches include insertion of the subject gene in viral vectors including recombinant retroviruses, adenovirus, adeno-associated virus, and herpes simplex virus-l, or recombinant bacterial or eukaryotic plasmids. Viral vectors transfect cells directly; plasmid DNA can be delivered with the help of, for example, cationic liposomes (lipofectin) or derivatized (e.g. antibody conjugated), polylysine conjugates, gramacidin S, artificial viral envelopes or other such intracellular carriers, as well as direct injection of the gene construct or CaP04 precipitation carried out in vivo.
A preferred approach for in vivo introduction of nucleic acid into a cell is by use of a viral vector containing nucleic acid, e.g. a cDNA, encoding a y3 or (34 polypeptide. Infection of cells with a viral vector has the advantage that a large proportion of the targeted cells can receive the nucleic acid. Additionally, molecules encoded within the viral vector, e.g., by a cDNA contained in the viral vector, are expressed efficiently in cells which have taken up viral vector nucleic acid.
Retrovirus vectors and adeno-associated virus vectors can be used as a recombinant gene delivery system for the transfer of exogenous genes in vivo, particularly into humans.
These vectors provide efficient delivery of genes into cells, and the transferred nucleic acids are stably integrated into the chromosomal DNA of the host. The development of specialized cell lines (termed "packaging cells") which produce only replication-defective retroviruses has increased the utility of retroviruses for gene therapy, and defective retroviruses are characterized for use in gene transfer for gene therapy purposes (for a review see Miller, A.D.
( 1990) Blood 76:271 ). A replication defective retrovirus can be packaged into virions which can be used to infect a target cell through the use of a helper virus by standard techniques.
Protocols for producing recombinant retroviruses and for infecting cells in vitro or in vivo with such viruses can be found in Cu_r_rent Protocols in Molecul r ioloev, Ausubel, F.M. et aI. (eds.) Greene Publishing Associates, (1989), Sections 9.10-9.14 and other standard laboratory manuals. Examples of suitable retroviruses include pLJ, pZIP, pWE
and pEM
which are known to those skilled in the art. Examples of suitable packaging virus lines for preparing both ecotropic and amphotropic retroviral systems include yrCrip, y~Cre, yr2 and yr Am. Retroviruses have been used to introduce a variety of genes into many different cell types, including epithelial cells, in vitro and/or in vivo (see for example Eglitis, et al. (1985) Science 230:1395-1398; Danos and Mulligan (1988) Proc. Natl. Acad Sci. USA
85:6460-6464; Wilson et al. (1988) Proc. Natl. Acad. Sci. USA 85:3014-3018; Armentano et al. (1990) Proc. Natl. Acad. Sci. USA 87:6141-6145; Huber et al. ( 1991 ) Proc. Natl.
Acad Sci. USA
88:8039-8043; Ferry et al. (1991) Proc. Natl. Acad. Sci. USA 88:8377-8381;
Chowdhury et al. (1991) Science 254:1802-1805; van Beusechem et al. (1992) Proc. Natl.
Acad. Sci. USA
89:7640-7644; Kay et al. (1992) Human Gene Therapy 3:641-647; Dai et al.
(1992) Proc.
Natl. Acad Sci. USA 89:10892-10895; Hwu et al. (1993) J. Immunol. 150:4104-4115; U.S.
Patent No. 4,868,116; U.S. Patent No. 4,980,286; PCT Application WO 89/07136;
PCT
Application WO 89/02468; PCT Application WO 89/05345; and PCT Application WO
92/07573).
Another viral gene delivery system useful in the present invention utilizes adenovirus-derived vectors. The genome of an adenovirus can be manipulated such that it encodes and expresses a gene product of interest but is inactivated in terms of its ability to replicate in a normal lytic viral life cycle. See, for example, Berkner et al. (1988) BioTechnigues 6:616;
Rosenfeld et al. (1991) Science 252:431-434; and Rosenfeld et al. (1992) Cell 68:143-155.
Suitable adenoviral vectors derived from the adenovirus strain Ad type 5 d1324 or other strains of adenovirus (e.g., Ad2, Ad3, Ad7 etc.) are known to those skilled in the art.
Recombinant adenoviruses can be advantageous in certain circumstances in that they are not capable of infecting nondividing cells and can be used to infect a wide variety of cell types, including epithelial cells (Rosenfeld et al. (1992) cited supra). Furthermore, the virus particle is relatively stable and amenable to purification and concentration, and as above, can be modified so as to affect the spectrum of infectivity. Additionally, introduced adenoviral DNA (and foreign DNA contained therein) is not integrated into the genome of a host cell but remains episomal, thereby avoiding potential problems that can occur as a result of insertional mutagenesis in situations where introduced DNA becomes integrated into the host genome (e.g., retroviral DNA). Moreover, the carrying capacity of the adenoviral genome for foreign DNA is large (up to 8 kilobases) relative to other gene delivery vectors (Berkner et al. cited supra; Haj-Ahmand and Graham (1986) J. Virol. 57:267).
Yet another viral vector system useful for delivery of the subject gene is the adeno-associated virus (AAV). Adeno-associated virus is a naturally occurring defective virus that requires another virus, such as an adenovirus or a herpes virus, as a helper virus for efficient replication and a productive life cycle. (For a review see Muzyczka et al.
Curr. Topics in Micro. and Immunol. (1992) 158:97-129). It is also one of the few viruses that may integrate its DNA into non-dividing cells, and exhibits a high frequency of stable integration (see for example Flotte et al. (1992) Am. J. Respir. Cell. Mol. Biol. 7:349-356;
Samulski et al. (1989) J. Virol. 63:3822-3828; and McLaughlin et al. (1989) J. Virol. 62:1963-1973).
Vectors containing as little as 300 base pairs of AAV can be packaged and can integrate. Space for exogenous DNA is limited to about 4.5 kb. An AAV vector such as that described in Tratschin et al. (1985) Mol. Cell. Biol. 5:3251-3260 can be used to introduce DNA into cells.
A variety of nucleic acids have been introduced into different cell types using AAV vectors 5 (see for example Hermonat et al. ( 1984) Proc. Natl. Acad Sci. USA 81:6466-6470; Tratschin et al. (1985) Mol. Cell. Biol. 4:2072-2081; Wondisford et al. (1988) Mol.
Endocrinol. 2~32-39; Tratschin et al. (1984) J. Yirol. 51:611-619; and Flotte et al. (1993) J.
Biol. Chem.
268:3781-3790).
In addition to viral transfer methods, such as those illustrated above, non-viral 10 methods can also be employed to cause expression of a y3 or (34 polypeptide in the tissue of an animal. Most nonviral methods of gene transfer rely on normal mechanisms used by mammalian cells for the uptake and intracellular transport of macromolecules.
In preferred embodiments, non-viral gene delivery systems of the present invention rely on endocytic pathways for the uptake of the subject y3 or ~i4 gene by the targeted cell.
Exemplary gene 15 delivery systems of this type include liposomal derived systems, poly-lysine conjugates, and artificial viral envelopes.
In a representative embodiment, a gene encoding a y3 or (34 polypeptide can be entrapped in liposomes bearing positive charges on their surface (e.g., lipofectins) and (optionally) which are tagged with antibodies against cell surface antigens of the target tissue 20 (Mizuno et al. (1992) No Shinkei Geka 20:547-551; PCT publication W091/06309;
Japanese patent application 1047381; and European patent publication EP-A-43075).
In clinical settings, the gene delivery systems for the therapeutic y3 or ~i4 gene can be introduced into a patient by any of a number of methods, each of which is familiar in the art.
For instance, a pharmaceutical preparation of the gene delivery system can be introduced 25 systemically, e.g. by intravenous injection, and specific transduction of the protein in the target cells occurs predominantly from specificity of transfection provided by the gene delivery vehicle, cell-type or tissue-type expression due to the transcriptional regulatory sequences controlling expression of the receptor gene, or a combination thereof. In other embodiments, initial delivery of the recombinant gene is more limited with introduction into 30 the animal being quite localized. For example, the gene delivery vehicle can be introduced by catheter (see U.S. Patent 5,328,470) or by Stereotactic injection (e.g. Chen et al. (1994) PNAS
91: 3054-3057).
The pharmaceutical preparation of the gene therapy construct can consist essentially of the gene delivery system in an acceptable diluent, or can comprise a slow release matrix in which the gene delivery vehicle is imbedded. Alternatively, where the complete gene delivery system can be produced in tact from recombinant cells, e.g.
retroviral vectors, the pharmaceutical preparation can comprise one or more cells which produce the gene delivery system.
Transgenic nimal_s The invention includes transgenic animals which include cells (of that animal) which contain a y3 or (34 transgene and which preferably (though optionally) express (or misexpress) an endogenous or exogenous y3 or ~i4 gene in one or more cells in the animal.
The y3 or (34 transgene can encode the wild-type form of the protein, or can encode homologs thereof, including both agonists and antagonists, as well as antisense constructs: In preferred embodiments, the expression of the transgene is restricted to specific subsets of cells, or tissues utilizing, for example, cis-acting sequences that control expression in the desired pattern. Tissue-specific regulatory sequences and conditional regulatory sequences can be used to control expression of the transgene in certain spatial patterns, e.g., to restrict production to the milk or other secreted product of the animal.
Fragments of a protein can be produced in several ways, e.g., recombinantly, by proteolytic digestion, or by chemical synthesis. Internal or terminal fragments of a polypeptide can be generated by removing one or more nucleotides from one end (for a terminal fragment) or both ends (for an internal fragment) of a nucleic acid which encodes the polypeptide. Expression of the mutagenized DNA produces polypeptide fragments.
Digestion with "end-nibbling" endonucleases can thus generate DNA's which encode an array of fragments. DNA's which encode fragments of a protein can also be generated by random shearing, restriction digestion or a combination of the above-discussed methods.
Fragments can also be chemically synthesized using techniques known in the art such as conventional Merrifield solid phase f Moc or t-Boc chemistry. For example, peptides of the present invention may be arbitrarily divided into fragments of desired length with no overlap of the fragments, or divided into overlapping fragments of a desired length.
Amino acid sequence variants of a protein can be prepared by random mutagenesis of DNA which encodes a protein or a particular domain or region of a protein.
Useful methods include PCR mutagenesis and saturation mutagenesis. A library of random amino acid sequence variants can also be generated by the synthesis of a set of degenerate oligonucleotide sequences. (Methods for screening proteins in a library of variants are elsewhere herein.) PCR Mutagenesis In PCR mutagenesis, reduced Taq polymerise fidelity is used to introduce random mutations into a cloned fragment of DNA (Leung et al., 1989, Technique 1:11-15). This is a very powerful and relatively rapid method of introducing random mutations. The DNA
region to be mutagenized is amplified using the polymerise chain reaction (PCR) under conditions that reduce the fidelity of DNA synthesis by Taq DNA polymerise, e.g., by using a dGTP/dATP ratio of five and adding Mn2+ to the PCR reaction. The pool of amplified DNA fragments are inserted into appropriate cloning vectors to provide random mutant libraries.
Saturation Mutagenesis Saturation mutagenesis allows for the rapid introduction of a large number of single base substitutions into cloned DNA fragments (Mayers et al., 1985, Science 229:242). This technique includes generation of mutations, e.g., by chemical treatment or irradiation of single-stranded DNA in vitro, and synthesis of a complimentary DNA strand. The mutation frequency can be modulated by modulating the severity of the treatment, and essentially all possible base substitutions can be obtained. Because this procedure does not involve a genetic selection for mutant fragments both neutral substitutions, as well as those that alter function, are obtained. The distribution of point mutations is not biased toward conserved sequence elements.
Degenerate Oligonucleotides A library of homologs can also be generated from a set of degenerate oligonucleotide sequences. Chemical synthesis of a degenerate sequences can be carried out in an automatic DNA synthesizer, and the synthetic genes then ligated into an appropriate expression vector.
The synthesis of degenerate oligonucleotides is known in the art (see for example, Narang, SA (1983) Tetrahedron 39:3; Itakura et al. (1981) Recombinant DNA, Proc 3rd Cleveland Sympos. Macromolecules, ed. AG Walton, Amsterdam: Elsevier pp273-289; Itakura et al.
(1984) Annu. Rev. Biochem. 53:323; Itakura et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 11:477. Such techniques have been employed in the directed evolution of other proteins (see, for example, Scott et al. (1990) Science 249:386-390;
Roberts et al.
(1992) PNAS 89:2429-2433; Devlin et al. (1990) Science 249: 404-406; Cwirla et al. (1990) PNAS 87: 6378-6382; as well as U.S. Patents Nos. 5,223,409, 5,198,346, and 5,096,815).
Non-random or directed, mutagenesis techniques can be used to provide specific sequences or mutations in specific regions. These techniques can be used to create variants which include, e.g., deletions, insertions, or substitutions, of residues of the known amino acid sequence of a protein. The sites for mutation can be modified individually or in series, e.g., by (1) substituting first with conserved amino acids and then with more radical choices depending upon results achieved, (2) deleting the target residue, or (3) inserting residues of the same or a different class adjacent to the located site, or combinations of options 1-3.
Alanine Scanning Mutagenesis S Alanine scanning mutagenesis is a useful method for identification of certain residues or regions of the desired protein that are preferred locations or domains for mutagenesis,-Cunningham and Wells (Science 244:1081-1085, 1989}. In alanine scanning, a residue or group of target residues are identified (e.g., charged residues such as Arg, Asp, His, Lys, and Glu) and replaced by a neutral or negatively charged amino acid (most preferably alanine or polyalanine). Replacement of an amino acid can affect the interaction of the amino acids with the surrounding aqueous environment in or outside the cell. Those domains demonstrating functional sensitivity to the substitutions are then refined by introducing further or other variants at or for the sites of substitution. Thus, while the site for introducing an amino acid sequence variation is predetermined, the nature of the mutation per se need not be predetermined. For example, to optimize the performance of a mutation at a given site, alanine scanning or random mutagenesis may be conducted at the target codon or region and the expressed desired protein subunit variants are screened for the optimal combination of desired activity.
Oligonucleotide-Mediated Mutagenesis Oligonucleotide-mediated mutagenesis is a useful method for preparing substitution, deletion, and insertion variants of DNA, see, e.g., Adehnan et al., (DNA
2:183, 1983).
Briefly, the desired DNA is altered by hybridizing an oligonucleotide encoding a mutation to a DNA template, where the template is the single-stranded form of a plasmid or bacteriophage containing the unaltered or native DNA sequence of the desired protein. After hybridization, a DNA polymerase is used to synthesize an entire second complementary strand of the template that will thus incorporate the oligonucleotide primer, and will code for the selected alteration in the desired protein DNA. Generally, oligonucleotides of at least 25 nucleotides in length are used. An optimal oligonucleotide will have 12 to 15 nucleotides that are completely complementary to the template on either side of the nucleotides) coding for the mutation. This ensures that the oligonucleotide will hybridize properly to the single-stranded DNA template molecule. The oligonucleotides are readily synthesized using techniques known in the art such as that described by Crea et al. (Proc. Natl.
Acad. Sci. USA, 75: 5765[1978]).
Cassette Mutagenesis Another method for preparing variants, cassette mutagenesis, is based on the technique described by Wells et al. (Gene, 34:315[1985]). The starting material is a plasmid (or other vector) which includes the protein subunit DNA to be mutated. The codon(s) in the protein subunit DNA to be mutated are identified. There must be a unique restriction endonuclease site on each side of the identified mutation site(s). If no such restriction sites exist, they may be generated using the above-described oligonucleotide-mediated mutagenesis method to introduce them at appropriate locations in the desired protein subunit DNA. After the restriction sites have been introduced into the plasmid, the plasmid is cut at these sites to linearize it. A double-stranded oligonucleotide encoding the sequence of the DNA between the restriction sites but containing the desired mutations) is synthesized using standard procedures. The two strands are synthesized separately and then hybridized together using standard techniques. This double-stranded oligonucleotide is referred to as the cassette.
This cassette is designed to have 3' and 5' ends that are comparable with the ends of the linearized plasmid, such that it can be directly ligated to the plasmid. This plasmid now contains the mutated desired protein subunit DNA sequence.
Combinatorial Mutagenesis Combinatorial mutagenesis can also be used to generate mutants. E.g., the amino acid sequences for a group of homologs or other related proteins are aligned, preferably to promote the highest homology possible. All of the amino acids which appear at a given position of the aligned sequences can be selected to create a degenerate set of combinatorial sequences. The variegated library of variants is generated by combinatorial mutagenesis at the nucleic acid level, and is encoded by a variegated gene library. For example, a mixture of synthetic oligonucleotides can be enzymatically ligated into gene sequences such that the degenerate set of potential sequences are expressible as individual peptides, or alternatively, as a set of larger fusion proteins containing the set of degenerate sequences.
Primalv High-Through-Put Me hod for greening Libraries of Peg it de Fra n . or Various techniques are known in the art for screening generated mutant gene products.
Techniques for screening large gene libraries often include cloning the gene library into replicable expression vectors, transforming appropriate cells with the resulting library of vectors, and expressing the genes under conditions in which detection of a desired activity, e.g., in this case, binding to other laminin subunits, assembly into a trimeric laminin molecules, binding to natural ligands or substrates, facilitates relatively easy isolation of the vector encoding the gene whose product was detected. Each of the techniques described below is amenable to high through-put analysis for screening large numbers of sequences created, e.g., by random mutagenesis techniques.
Two Hybrid Systems Two hybrid assays such as the system described above (as with the other screening methods described herein), can be used to identify fragments or analogs. These may include agonists, superagonists, and antagonists. (The subject protein and a protein it interacts with are used as the bait protein and fish proteins.) .
Display Libraries 5 In one approach to screening assays, the candidate peptides are displayed on the surface of a cell or viral particle, and the ability of particular cells or viral particles to bind an appropriate receptor protein via the displayed product is detected in a "panning assay". For example, the gene library can be cloned into the gene for a surface membrane protein of a bacterial cell, and the resulting fusion protein detected by panning (Ladner et al., WO
10 88/06630; Fuchs et al. (1991) BiolTechnology 9:1370-1371; and Goward et al.
(1992) TIBS
18:136-140). In a similar fashion, a detectably labeled ligand can be used to score for potentially functional peptide homologs. Fluorescently labeled ligands, e.g., receptors, can be used to detect homolog which retain ligand-binding activity. The use of fluorescently labeled ligands, allows cells to be visually inspected and separated under a fluorescence microscope, 15 or, where the morphology of the cell permits, to be separated by a fluorescence-activated cell sorter.
A gene library can be expressed as a fusion protein on the surface of a viral particle.
For instance, in the filamentous phage system, foreign peptide sequences can be expressed on the surface of infectious phage, thereby conferring two significant benefits.
First, since these 20 phage can be applied to affinity matrices at concentrations well over 1013 phage per milliliter, a large number of phage can be screened at one time. Second, since each infectious phage displays a gene product on its surface, if a particular phage is recovered from an affinity matrix in low yield, the phage can be amplified by another round of infection.
The group of almost identical E. coli filamentous phages M13, fd., and fl are most often used in phage 25 display libraries. Either of the phage gIII or gVIII coat proteins can be used to generate fusion proteins without disrupting the ultimate packaging of the viral particle. Foreign epitopes can be expressed at the NH2-terminal end of pIII and phage bearing such epitopes recovered from a large excess of phage lacking this epitope (Ladner et al. PCT
publication WO 90/02909; Garrard et al., PCT publication WO 92/09690; Marks et al. (1992) J. Biol.
30 Chem. 267:16007-16010; Griffiths et al. (1993} EMBO J 12:725-734; Clackson et al. (1991) Nature 352:624-628; and Barbas et al. (1992) PNAS 89:4457-4461).
A common approach uses the maltose receptor of E. coli (the outer membrane protein, Lama) as a peptide fusion partner (Charbit et al. (1986) EMBO 5, 3029-3037).
Oligonucleotides have been inserted into plasmids encoding the Lama gene to produce 35 peptides fused into one of the extracellular loops of the protein. These peptides are available for binding to ligands, e.g., to antibodies, and can elicit an immune response when the cells are administered to animals. Other cell surface proteins, e.g., OmpA (Schorr et al. (1991) Vaccines 91, pp. 387-392), PhoE (Agterberg, et al. {1990) Gene 88, 37-45}, and PAL (Fuchs et al. (1991) BiolTech 9, 1369-1372), as well as large bacterial surface structures have served as vehicles for peptide display. Peptides can be fused to pilin, a protein which polymerizes to form the pilus-a conduit for interbacterial exchange of genetic information (Thiry et al.
(1989) Appl. Environ. Microbiol. 55, 984-993). Because of its role in interacting with other cells, the pilus provides a useful support for the presentation of peptides to the extracellular environment. Another large surface structure used for peptide display is the bacterial motive organ, the flagellum. Fusion of peptides to the subunit protein flagellin offers a dense array of may peptides copies on the host cells (Kuwajima et al. (1988) BiolTech. 6, 1080-1083).
Surface proteins of other bacterial species have also served as peptide fusion partners.
Examples include the Staphylococcus protein A and the outer membrane protease IgA of Neisseria (Hansson et al. (1992) J. Bacteriol. 174, 4239-4245 and Klauser et al. (1990) EMBO J. 9, 1991-1999}.
In the filamentous phage systems and the Lama system described above, the physical link between the peptide and its encoding DNA occurs by the containment of the DNA within a particle (cell or phage) that carries the peptide on its surface. Capturing the peptide captures the particle and the DNA within. An alternative scheme uses the DNA-binding protein LacI
to form a link between peptide and DNA (Cull et al. (1992) PNAS USA 89:1865-1869). This system uses a plasmid containing the LacI gene with an oligonucleotide cloning site at its 3'-end. Under the controlled induction by arabinose, a LacI-peptide fusion protein is produced.
This fusion retains the natural ability of LacI to bind to a short DNA
sequence known as LacO operator (LacO). By installing two copies of LacO on the expression plasmid, the LacI-peptide fusion binds tightly to the plasmid that encoded it. Because the plasmids in each cell contain only a single oligonucleotide sequence and each cell expresses only a single peptide sequence, the peptides become specifically and stably associated with the DNA
sequence that directed its synthesis. The cells of the library are gently lysed and the peptide-DNA complexes are exposed to a matrix of immobilized receptor to recover the complexes containing active peptides. The associated plasmid DNA is then reintroduced into cells for amplification and DNA sequencing to determine the identity of the peptide ligands. As a demonstration of the practical utility of the method, a large random library of dodecapeptides was made and selected on a monoclonal antibody raised against the opioid peptide dynorphin B. A cohort of peptides was recovered, all related by a consensus sequence corresponding to a six-residue portion of dynorphin B. (Cull et al. (1992) Proc. Natl. Aced.
Sci. U.S.A. 89-1869) This scheme, sometimes referred to as peptides-on-plasmids, differs in two important ways from the phage display methods. First, the peptides are attached to the C-terminus of the fusion protein, resulting in the display of the library members as peptides having free carboxy termini. Both of the filamentous phage coat proteins, pIII and pVIII, are anchored to the phage through their C-termini, and the guest peptides are placed into the outward-extending N-terminal domains. In some designs, the phage-displayed peptides are presented right at the amino terminus of the fusion protein. (Cwirla, et al. (1990) Proc. Natl. Aced. Sci.
U.S A. 87, 6378-6382) A second difference is the set of biological biases affecting the population of peptides actually present in the libraries. The LacI fusion molecules are confined to the cytoplasm of the host cells. The phage coat fusions are exposed briefly to the cytoplasm during translation but are rapidly secreted through the inner membrane into the periplasmic compartment, remaining anchored in the membrane by their C-terminal hydrophobic domains, with the N-termini, containing the peptides, protruding into the periplasm while awaiting assembly into phage particles. The peptides in the LacI and phage libraries may differ significantly as a result of their exposure to different proteoLytic activities.
The phage coat proteins require transport across the inner membrane and signal peptidase processing as a prelude to incorporation into phage. Certain peptides exert a deleterious effect on these processes and are underrepresented in the libraries (Gallop et al. (1994) J.
Med. Chem. 37(9):1233-1251). These particular biases are not a factor in the LacI display system.
The number of small peptides available in recombinant random libraries is enormous.
Libraries of 10~-109 independent clones are routinely prepared. Libraries as large as 1011 recombinants have been created, but this size approaches the practical limit for clone libraries. This limitation in Library size occurs at the step of transforming the DNA
containing randomized segments into the host bacterial cells. To circumvent this limitation, an in vitro system based on the display of nascent peptides in polysome complexes has recently been developed. This display library method has the potential of producing libraries 3-6 orders of magnitude larger than the currently available phage/phagemid or plasmid Libraries. Furthermore, the construction of the libraries, expression of the peptides, and screening, is done in an entirely cell-free format.
In one application of this method (Gallop et a1. (1994) J. Med Chem.
37(9):1233-1251 ), a molecular DNA library encoding 1012 decapeptides was constructed and the library expressed in an E. coli S30 in vitro coupled transcription/translation system.
Conditions were chosen to stall the ribosomes on the mRNA, causing the accumulation of a substantial proportion of the RNA in polysomes and yielding complexes containing nascent peptides still linked to their encoding RNA. The polysomes are sufficiently robust to be affinity purified on immobilized receptors in much the same way as the more conventional recombinant peptide display Libraries are screened. RNA from the bound complexes is recovered, converted to cDNA, and amplified by PCR to produce a template for the next round of synthesis and screening. The polysome display method can be coupled to the phage display system. Following several rounds of screening, cDNA from the enriched pool of polysomes was cloned into a phagemid vector. This vector serves as both a peptide expression vector, displaying peptides fused to the coat proteins, and as a DNA sequencing vector for peptide identification. By expressing the polysome-derived peptides on phage, one can either continue the affinity selection procedure in this format or assay the peptides on individual clones for binding activity in a phage ELISA, or for binding specificity in a completion phage ELISA (Barnet, et al. (1992) Anal. Biochem 204,357-364). To identify the sequences of the active peptides one sequences the DNA produced by the phagemid host.
The high through-put assays described above can be followed by secondary screens in order to identify further biological activities which will, e.g., allow one skilled in the art-to differentiate agonists from antagonists. The type of a secondary screen used will depend on the desired activity that needs to be tested. For example, an assay can be developed in which the ability to inhibit an interaction between a protein of interest and its respective ligand can be used to identify antagonists from a group of peptide fragments isolated though one of the primary screens described above.
Therefore, methods for generating fragments and analogs and testing them for activity are known in the art. Once the core sequence of interest is identified, it is routine to perform for one skilled in the art to obtain analogs and fragments.
P~ntide Mimetics The invention also provides for reduction of the protein binding domains of the subject y3 or (34 polypeptides to generate mimetics, e.g. peptide or non-peptide agents. See, for example, "Peptide inhibitors of human papillomavirus protein binding to retinoblastoma gene protein" European patent applications EP-412,762A and EP-B31,080A.
Non-hydrolyzable peptide analogs of critical residues can be generated using benzodiazepine (e.g., see Freidinger et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), azepine (e.g., see Huffinan et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), substituted gama lactam rings (Garvey et al. in Peptides:
Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), keto-methylene pseudopeptides (Ewenson et al. (1986) JMed Chem 29:295; and Ewenson et al. in Peptides:
Structure and Function (Proceedings of the 9th American Peptide Symposium) Pierce Chemical Co. Rockland, IL, 1985), (3-turn dipeptide cores (Nagai et al. (1985) Tetrahedron Lett 26:647; and Sato et al. ( 1986) J Chem Soc Perkin Trans 1:1231 ), and (3-aminoalcohols (Gordon et al. (1985) Biochem Biophys Res Commun126:419; and Dann et al.
(1986) Biochem Biophys Res Commun 134:71).
~~ntibodies The invention also includes antibodies specifically reactive with a subject y3 or (34 polypeptides. Anti-protein/anti-peptide antisera or monoclonal antibodies can be made by standard protocols (See, for example, Antibodies: A Laboratory Manual ed. by Harlow and Lane (Cold Spring Harbor Press: 1988)).
Antibodies which specifically bind y3 or (34 epitopes can also be used in immunohistochemical staining of tissue samples in order to evaluate the abundance and pattern of expression of y3 or [34. Anti y3 or (34 antibodies can be used diagnostically in S immuno-precipitation and immuno-blotting to detect and evaluate y3 or ~i4 levels in tissue or bodily fluid as part of a clinical testing procedure.
Another application of antibodies of the present invention is in the immunological screening of cDNA libraries constructed in expression vectors such as ~,gtl l, ~.gtl8-23, 7v.ZAP, and ~,ORF8. Messenger libraries of this type, having coding sequences inserted in the correct reading frame and orientation, can produce fusion proteins. For instance, ~,gtl l will produce fusion proteins whose amino termini consist of 13-galactosidase amino acid sequences and whose carboxy tenmini consist of a foreign polypeptide. Antigenic epitopes of a subject polypeptide can then be detected with antibodies, as, for example, reacting nitrocellulose filters lifted from infected plates with antibodies of the invention. Phage, scored by this assay, can then be isolated from the infected plate. Thus, the presence of homologs can be detected and cloned from other animals, and alternate isoforms (including splicing variants) can be detected and cloned from human sources.
Other Embodiments Included in the invention are: allelic variations; natural mutants; induced mutants;
proteins encoded by DNA that hybridizes under high or low stringency conditions to a nucleic acid which encodes a polypeptide of SEQ ID NO:1 or SEQ ID N0:3 (for definitions of high and low stringency see Current Protocols in Molecular Biology, John Wiley & Sons, New York, 1989, 6.3.1 - 6.3.6, hereby incorporated by reference); and, polypeptides specifically bound by antisera to Y3 or ~i4.
Nucleic acids and polypeptides of the invention includes those that differ from the sequences discolosed herein by virtue of sequencing errors in the disclosed sequences.
The invention also includes fragments, preferably biologically active fragments, or analogs of y3 or ~i4. A biologically active fragment or analog is one having any in vivo or in vitro activity which is characteristic of they3 or (34 shown in SEQ ID N0:3 and SEQ ID
NO:1, respectively, or of other naturally occurring y3 or (34, e.g., one or more of the biological activities described above. Especially preferred are fragments which exist in vivo, e.g., fragments which arise from post transcriptional processing or which arise from translation of alternatively spliced RNA's. Fragments include those expressed in native or endogenous cells, e.g., as a result of post-translational processing, e.g., as the result of the removal of an amino-terminal signal sequence, as well as those made in expression systems, e.g., in CHO cells. Particularly preferred fragments are fragments, e.g., active fragments, which are generated by proteolytic cleavage or alternative splicing events.
Other embodiments are within the following claims.
What is claimed is:
WO 99/19348 PCT/US98lZ1391 S SEQUENCE LISTING
(1) GENERAL INFORMATION:
ld (i) APPLICANT: Burgeson, Robert, et al.
(ii) TITLE OF INVENTION: DNA Sequences (iii) NUMBER OF SEQUENCES: 6 (iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: LAHIVE & COCKFIELD
(B) STREET: 28 State Street (C) CITY: Boston 20 (D) STATE: Massachusetts (E) COUNTRY: USA
(F) ZIP: 02109 (v) COMPUTER READABLE FORM:
ZS (A) MEDIUM TYPE: Floppy disk (B) COMPUTER: IBM PC compatible (C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 3O (vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: US 000000 (B) FILING DATE: 13-APR-1994 (C) CLASSIFICATION:
3S (vii) PRIOR
APPLICATION
DATA:
(A) APPLICATION NUMBER: US 08/111,111 (B) FILING DATE: 12-DEC-1909 (viii) ATTORNEY/AGENT
INFORMATION:
40 (A) NAME: Attorney, Name Init (B} REGISTRATION NUMBER: 000000 (C) REFERENCE/DOCKET NUMBER:
oe (ix) TELECOMMUNICATION
INFORMATION:
4S (A) TELEPHONE: (617)227-7400 (B) TELEFAX: (617)742-4214 (2) INFORMATION FOR SEQ ID NO:1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1761 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear SS
(ii) MOLECULE TYPE: peptide (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID N0:1:
$ Met Gln Phe Gln Leu Thr Leu Phe Leu His Leu Gly Trp Leu Ser Tyr Ser Lys Ala Gln Asp Asp Cys Asn Arg Gly Ala Cys His Pro Thr Thr Gly Asp Leu Leu Val Gly Arg Asn Thr Gln Leu Met Ala Ser Ser Thr Cys Gly Leu Ser Arg Ala Gln Lys Tyr Cys Ile Leu Ser Tyr Leu Glu 1$ 50 55 60 Gly Glu Gln Lys Cys Ser Ile Cys Asp Ser Arg Phe Pro Tyr Aap Pro Tyr Asp Gln Pro Asn Ser His Thr Ile Glu Asn Val Thr Val Ser Phe G1u Pro Asp Arg Glu Lys Lys Trp Trp Gln Ser Glu Asn Gly Leu Asp 2$
His Val Ser Ile Arg Leu Asp Leu Glu Ala Leu Phe Arg Phe Ser His Leu Ile Leu Thr Phe Lys Thr Phe Arg Pro Ala Ala Met Leu Val Glu Arg Ser Thr Asp Tyr Gly His Asn Trp Lys Val Phe Lys Tyr Phe Ala 3$ Lys Asp Cys Ala Thr Ser Phe Pro Asn Ile Thr Ser Gly Gln Ala Gln Gly Val Gly Asp Ile Val Cys Asp Ser Lys Tyr Ser Asp Ile Glu Pro Ser Thr Gly Gly Glu Val Val Leu Lys Val Leu Asp Pro Ser Phe Glu Ile Glu Asn Pro Tyr Ser Pro Tyr Ile Gln Asp Leu Val Thr Leu Thr 4$ 210 215 220 Asn Leu Arg Ile Asn Phe Thr Lys Leu His Thr Leu Gly Asp Ala Leu $0 Leu Gly Arg Arg Gln Asn Asp Ser Leu Asp Lys Tyr Tyr Tyr Ala Leu Tyr Glu Met Ile Val Arg Gly Ser Cys Phe Cys Asn Gly His Ala Ser $$
Glu Cys Arg Pro Met Gln Lys Met Arg Gly Asp Val Phe Ser Pro Pro Gly Met Val His Gly Gln Cys Val Cys Gln His Asn Thr Asp Gly Pro Asn Cys Glu Arg Cys Lys Asp Phe Phe Gln Asp Ala Pro Trp Arg Pro Ala Ala Asp Leu Gln Asp Asn Ala Cys Arg Ser Cys Ser Cys Asn.Ser His Ser Ser Arg Cys His Phe Aap Met Thr Thr Tyr Leu Ala Ser Gly Gly Leu Ser Gly Gly Val Cys Glu Asp Cys Gln His Asn Thr Glu Gly 1$ Gln His Cys Asp Arg Cys Arg Pro Leu Phe Tyr Arg Asp Pro Leu Lys Thr Ile Ser Asp Pro Tyr Ala Cys Ile Pro Cys Glu Cys Asp Pro Asp Gly Thr Ile Ser Gly Gly Ile Cys Val Ser His Ser Asp Pro Ala Leu Gly Ser Val Ala Gly Gln Cys Leu Cys Lys Glu Asn Val Glu Gly Ala 2$ 420 425 430 Lys Cys Aap Gln Cys Lys Pro Asn His Tyr Gly Leu Ser Ala Thr Asp Pro Leu Gly Cys Gln Pro Cys Asp Cys Asn Pro Leu Gly Ser Leu Pro Phe Leu Thr Cys Aep Val Asp Thr Gly Gln Cys Leu Cys Leu Ser Tyr Val Thr Gly Ala His Cys Glu Glu Cys Thr Val Gly Tyr Trp Gly Leu Gly Asn His Leu His Gly Cys Ser Pro Cys Asp Cys Asp Ile Gly Gly Ala Tyr Ser Asn Val Cys Ser Pro Lys Asn Gly Gln Cys Glu Cys Arg 4$ Pro His Val Thr Gly Arg Ser Cys Ser Glu Pro Ala Pro Gly Tyr Phe $d Phe Ala Pro Leu Asn Phe Tyr Leu Tyr Glu Ala Glu Glu Ala Thr Thr Leu Gln Gly Leu Ala Pro Leu Gly Ser Glu Thr Phe Gly Gln Ser Pro Ala Val His Val Val Leu Gly Glu Pro Val Pro Gly Asn Pro Val Thr $$ 580 585 590 Trp Thr Gly Pro Gly Phe Ala Arg Val Leu Pro Gly Ala Gly Leu Arg 60 Phe Ala Val Asn Asn Ile Pro Phe Pro Val Asp Phe Thr Ile Ala Ile His Tyr Glu Thr Gln Ser Ala Ala Asp Trp Thr Val Gln Ile Val Val $ 625 630 635 640 Asn ProProGlyGly SerGluHis CysIlePro LysThrLeu Gln.Ser w Lys ProGlnSerPhe AlaLeuPro AlaAlaThr ArgIleMet LeuLeu Pro ThrProIleCys LeuGluPro AspValGln TyrSerIle AspVal 1$
Tyr PheSerGlnPro LeuGlnGly GluSerHis AlaHisSer HisVal Leu ValAspSerLeu GlyLeuIle ProGlnIle AsnSerLeu GluAsn Phe CysSerLysGln AspLeuAsp GluTyrGln LeuHisAsn CysVal 2$ Glu IleAlaSerAla MetGlyPro GlnValLeu ProGlyAla CysGlu Arg LeuIleIleSer MetSerAla LysLeuHis AspGlyAla ValAla Cys LysCysHisPro GlnGlySer ValGlySer SerCysSer ArgLeu Gly GlyGlnCysGln CysLysPro LeuValVal GlyArgCys CysAsp 3$ 785 790 795 800 Arg CysSerThrGly SerTyrAsp LeuGlyHis HisGlyCys HisPro Cys HisCysHisPro GlnGlySer LysAspThr Va1CysAsp GlnVal Thr GlyGlnCysPro CysHisGly GluValSer GlyArgArg CysAsp 4$
Arg CysLeuAlaGly TyrPheGly PheProSer CysHisPro CysPro Cys AsnArgPheAla GluLeuCys AspProGlu ThrGlySer CysPhe $0 865 870 875 880 Asn CysGlyGlyPhe ThrThrGly ArgAenCys GluArgCys IleAsp $$ Gly TyrTyrGlyAsn ProSerSer GlyGlnPro CysArgPro CysLeu Cys ProAspAspPro SerSerAsn GlnTyrPhe AlaHisSer CysTyr Gln AsnLeuTrpSer SerAspVal IleCysAsn CysLeuGln GlyTyr $ Thr Gly Thr Gln Cys Gly Glu Cys Ser Thr Gly Phe Tyr Gly Asn Pro Arg Ile Ser Gly Ala Pro Cys Gln Pro Cys Ala Cys Asn Asn Asn Ile Asp Val Thr Asp Pro Glu Ser Cys Ser Arg Val Thr Gly Glu Cys Leu Arg Cys Leu His Asn Thr Gln Gly Ala Asn Cys Gln Leu Cys Lys Pro 1$ 995 1000 1005 Gly His Tyr Gly Ser Ala Leu Asn Gln Thr Cys Arg Arg Cys Ser Cys His Ala Ser Gly Val Ser Pro Met Glu Cys Pro Pro Gly Gly Gly Ala Cys Leu Cys Asp Pro Val Thr Gly Ala Cys Pro Cys Leu Pro Asn Val 2$
Thr Gly Leu Ala Cys Asp Arg Cys Ala Asp Gly Tyr Trp Asn Leu Val Pro Gly Arg Gly Cys Gln Ser Cys Asp Cys Asp Pro Arg Thr Ser Gln Ser Ser His Cys Asp Gln Leu Thr Gly Gln Cys Pro Cys Lys Leu Gly 3$ Tyr Gly Gly Lys Arg Cys Ser Glu Cys Gln Glu Asn Tyr Tyr Gly Asp Pro Pro Gly Arg Cys Ile Pro Cys Asp Cys Asn Arg Ala Gly Thr Gln Lys Pro Ile Cys Asp Pro Asp Thr Gly Met Cys Arg Cys Arg Glu Gly Val Ser Gly Gln Arg Cys Asp Arg Cys Ala Arg Gly His Ser Gln Glu 4$ 1155 1160 1165 Phe Pro Thr Cys Leu Gln Cys His Leu Cys Phe Asp Gln Trp Asp His $0 Thr Ile Ser Ser Leu Ser Lys Ala Val Gln Gly Leu Met Arg Leu Ala Ala Asn Met Glu Asp Lys Arg Glu Thr Leu Pro Val Cys Glu Ala Asp $$
Phe Lys Asp Leu Arg Gly Asn Val Ser Glu Ile Glu Arg Ile Leu Lys His Pro Val Phe Pro Ser Gly Lys Phe Leu Lys Val Lys Asp Tyr His Asp Ser Val Arg Arg Gln Ile Met Gln Leu Asn Glu Gln Leu Lys Ala WO 99/19348 PG"T/US98/21391 Val Tyr Glu Phe Gln Asp Leu Lys Asp Thr Ile Glu Arg Ala Lys Asn Glu Ala Asp Leu Leu Leu Glu Asp Leu Gln Glu Glu Ile Asp Leu Gln Ser Ser Val Leu Asn Ala Ser Ile Ala Asp Ser Ser Glu Asn Ile Lys 1S Lys Tyr Tyr His Ile Ser Ser Ser Ala Glu Lys Lys Ile Asn Glu Thr Ser Ser Thr Ile Asn Thr Ser Ala Asn Thr Arg Asn Asp Leu Leu Thr Ile Leu Asp Thr Leu Thr Ser Lys Gly Asn Leu Ser Leu Glu Arg Leu Lys Gln Ile Lys Ile Pro Asp Ile Gln Ile Leu Asn Glu Lys Val Cys 2$ 1365 1370 1375 Gly Asp Pro Gly Asn Val Pro Cys Val Pro Leu Pro Cys Gly Gly Ala Leu Cys Thr Gly Arg Lys Gly His Arg Lys Cys Arg Gly Pro Gly Cys His Gly Ser Leu Thr Leu Ser Thr Asn Ala Leu Gln Lys Ala Gln Glu Ala Lys Ser Ile Ile Arg Asn Leu Asp Lys Gln Val Arg Gly Leu Lys Asn Gln Ile Glu Ser Ile Ser Glu Gln Ala Glu Val Ser Lys Asn Asn Ala Leu Gln Leu Arg Glu Lys Leu Gly Asn Ile Arg Asn Gln Ser Asp Ser Glu Glu Glu Asn Ile Asn Leu Phe Ile Lys Lys Val Lys Asn Phe Leu Leu Glu Glu Asn Val Pro Pro Glu Asp Ile Glu Lys Val Ala Asn Gly Val Leu Asp Ile His Leu Pro Ile Pro Ser Gln Asn Leu Thr Asp Glu Leu Val Lys Ile Gln Lys His Met Gln Leu Cys Glu Aap Tyr Arg Thr Asp Glu Asn Arg Ser Asn Glu Glu Ala Asp Gly Ala Gln Lys Leu Leu Val Lys Ala Lys Ala Ala Glu Lys Ala Ala Asn Ile Leu Leu Asn Leu Aap Lys Thr Leu Asn Gln Leu Gln Gln Ala Gln Ile Thr Gln Gly Arg Ala AsnSerThr IleThr LeuThr AlaAsnIle ThrLysIle Gln Lys Lys AsnValLeu GlnAla AsnGln ThrArgGlu MetLysSer Glu Glu Leu GluLeuAla LysGln SerGly LeuGluAsp GlyLeuSer Arg Leu Leu GlnThrLys LeuGln HisGln AspHisAla ValAanAla Arg Lys Val GlnAlaGlu SerAla HisGln AlaGlySer LeuGluLys Gln G1u Phe ValGluLeu LysLys TyrAla IleLeuGln ArgLysThr Gln ZS Ser Thr ThrGlyLeu ThrLye ThrLeu GlyLysVal LysGlnLeu Glu Lys Asp AlaAlaGlu LysLeu GlyAsp ThrGluAla LysIleArg Ala Arg Ile ThrAspLeu GluArg IleGln AspLeuAen LeuSerArg Lys Gln Ala LysAlaAsp GlnLeu IleLeu GluAspGln ValValAla Arg Ile Lys AsnGluIle ValGlu GluLys LysTyrAla ArgCysTyr Gln Ser (2) INFORMATION FOR SEQ ID N0:2:
4S (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 5874 base pairs (B} TYPE: nucleic acid (C} STRANDEDNESS: single (D) TOPOLOGY: linear SO
(ii) MOLECULE TYPE: cDNA
SS (xi} SEQUENCE DESCRIPTION: SEQ ID N0:2:
Generally, the nidogen-binding domain is at least 5 residues in length and preferably, has about 70, 80, 90, or 95% sequence identity with the nidogen-binding domain of the protein shown in SEQ ID NO: 3 (amino acid residues 750-755). In another embodiment, the y3 polypeptide includes at least 5, preferably 6 to 7, and most preferably 8 of the cysteins found Z O in native y3 protein. In yet another embodiment of the invention features a y3 polypeptide that does not include or has an inactivated nidogen-binding domain which serves as an antagonist to y3 biological activities. Furthermore, a y3 polypeptide which has antagonist activity can have inactivated or excluded regions which comprise at least one cystein found in native y3 protein.
In a preferred embodiment, the y3 polypeptide differs in amino acid sequence at up to 1, 2, 3, 5, or 10 residues, from a sequence in SEQ ID NO: 3. In other preferred embodiments, the y3 polypeptide differs in amino acid sequence at up to 1, 2, 3, 5, or 10 %
of the residues from a sequence in SEQ ID NO: 3. Preferably, the differences are such that:
the y3 polypeptide exhibits a y3 biological activity, e.g., the y3 polypeptide retains a biological activity of a naturally occurring y3 subunit.
In preferred embodiments the y3 polypeptide includes a y3 subunit sequence described herein as well as other N-terminal andlor C-terminal amino acid sequence.
In preferred embodiments, the y3 polypeptide includes all or a fragment of an amino acid sequence from SEQ ID NO: 3, fused, in reading frame, to additional amino acid residues, preferably to residues encoded by genomic DNA 5' to the genomic DNA which encodes a sequence from SEQ ID NO: 3.
In yet other preferred embodiments, the y3 polypeptide is a recombinant fusion protein having a first y3 portion and a second polypeptide portion, e.g., a second polypeptide portion having an amino acid sequence unrelated to y3. The second polypeptide portion can be, e.g., any of glutathione-S-transferase, a DNA binding domain, or a polymerase activating domain. In preferred embodiment the fusion protein can be used in a two-hybrid assay.
In a preferred embodiment the y3 polypeptide includes amino acid residues 750-of SEQ ID N0:3. In another embodiment, the y3 polypeptide encodes domains IV-VI of the y3 subunit.
In preferred embodiments the y3 polypeptide has antagonistic activity, and is capable of inhibiting adhesion between connective tissues.
In a preferred embodiment, the y3 polypeptide is a fragment of a naturally occurring y 3 which inhibits connective tissue adhesion.
Polypeptides of the invention include those which arise as a result of the existence of multiple genes, alternative transcription events, alternative RNA splicing events, and alternative translational and postranslational events. The y3 polypeptide can be expressed in systems, e.g., cultured cells, which result in substantially the same postranslational modifications present when expressed y3 is expressed in a native cell, or in systems which result in the omission of postranslational modifications present when expressed in a native cell. ~ _ The invention includes an immunogen which includes a y3 polypeptide in an immunogenic preparation, the immunogen being capable of eliciting an immune response specific for the y3 polypeptide, e.g., a humoral response, an antibody response, or a cellular response. In preferred embodiments, the immunogen comprising an antigenic determinant, e.g., a unique determinant, from a protein represented by SEQ ID NO: 3.
The present invention also includes an antibody preparation specifically reactive with an epitope-of the y3 immunogen or generally of a y3 polypeptide, preferably an epitope which consists all or in part of residues from the the amino acid sequence of SEQ ID
N0:3, or an epitope, which when bound to an antibody, results in the modulation of a biological activity.
In preferred embodiments the y3-like polypeptide, as expressed in the cells in which it is normally expressed or in other eukaryotic cells, has a molecular weight of 170 kDa as determined by SDS-PAGE.
In another embodiment, the y3 polypeptide comprises amino acid residues 100-of SEQ ID NO: 3.
In a preferred embodiment, the y3 polypeptide has one or more of the following characteristics:
(i) it has the ability to promote adhesion between connective tissues;
(ii) it has a molecular weight, amino acid composition or other physical characteristic of y3 subunit of SEQ ID N0:3;
(iii) it has an overall sequence similarity of at least 50%, preferably at least 60%, more preferably at least 70, 80, 90, or 95%, with a y3 polypeptide of SEQ
ID N0:3;
(iv) it can be isolated from human placenta chorionic villi;
(v) it has a nidogen-binding domain which is preferably about 70%, 80%, 90% or 95% with amino acid residues 750-755 of SEQ ID N0:3;
(vi) it can colocalize with protein ubiquitin carboxy terminal hydroxylase I;
(vii) it has at least 5, preferably 6 or 7, and most preferably 8 of the cysteins found amino acid sequence of native y3.
Also included in the invention is a composition which includes a y3 polypeptide (or a nucleic acid which encodes it) and one or more additional components, e.g., a carrier, diluent, or solvent. The additional component can be one which renders the composition useful for in vitro and in vivo pharmaceutical or veterinary use.
In another aspect, the invention provides an isolated or substantially pure nucleic acid having or comprising a nucleotide sequence which encodes a y3 polypeptide, e.g., a y3 polypeptide described herein.
A preferred embodiment of the invention features a nucleic acid molecule having a nucleotide sequence at least about 85% sequence identity to a nucleotide sequence of SEQ iD
N0:4. In other preferred embodiments, the y3 polypeptide is encoded by a nucleic acid S
S molecule having a nucleotide sequence with at least about 90% to about 9S%, and more preferably about 98% to about 99% sequence identity to the nucleotide sequence from SEQ
ID N0:4. In another preferred embodiment, the y3 polypeptide is encoded by the nulceic acid molecule of SEQ ID N0:4.
In prefered embodiments, the isolated nucleic acid molecule includes the nucleotide sequence of at least one and preferably all of the DNA inserts of the plasmids deposited with ATCC as Accession No: 209357.
In preferred embodiments, the subject y3 nucleic acid will include a transcriptional regulatory sequence, e.g. at least one of a transcriptional promoter or transcriptional enhancer sequence, operably linked to the y3 gene sequence (also referred to as LAMG3), e.g., to 1 S render the y3 gene sequence suitable for use as an expression vector.
In yet a further preferred embodiment, the nucleic acid which encodes a y3 polypeptide of the invention, hybridizes under stringent conditions to a nucleic acid probe corresponding to at least 12 consecutive nucleotides of SEQ ID N0:4. More preferably, the nucleic acid probe corresponds to at least 20 consecutive nucleotides from SEQ
ID NO: 4.
The invention also provides a probe or primer which includes or comprises a substantially purified oligonucleotide. The oligonucleotide includes a region of nucleotide sequence which hybridizes under stringent conditions to at least 10 consecutive nucleotides of sense or antisense sequence from SEQ ID NO: 4, or naturally occurring mutants thereof. In preferred embodiments, the probe or primer further includes a label group attached thereto.
2S The label group can be, e.g., a radioisotope, a fluorescent compound, an enzyme, and/or an enzyme co-factor. Preferably the oligonucleotide is at least 10 and less than 20, 30, S0, 100, or 150 nucleotides in length.
The invention involves nucleic acids, e.g., RNA or DNA, encoding a y3 polypeptide of the invention. This includes double stranded nucleic acids as well as coding and antisense single strands.
In another aspect, the invention features a cell or purified preparation of cells which include a y3 subunit transgene, or which otherwise misexpress a y3 gene. The cell preparation can consist of human or non human cells, e.g., rodent cells, e.g., mouse or rat cells, rabbit cells, or pig cells. In preferred embodiments, the cell or cells include a y3 3S transgene, e.g., a heterologous form of a r3 gene, e.g., a gene derived from humans (in the case of a non-human cell). The y3 transgene can be misexpressed, e.g., overexpressed or underexpressed. In other preferred embodiments, the cell or cells include a gene which misexpress an endogenous y3 gene, e.g., a gene the expression of which is disrupted, e.g., a knockout. Such cells can serve as a model for studying disorders which are related to mutated or mis-expressed y3 alleles or for use in drug screening.
In another aspect, the invention features a transgenic y3 animal, e.g., a rodent, e.g., a mouse or a rat, a rabbit, a pig, a goat, or a cow. In preferred embodiments, the transgenic animal includes (and preferably express) a heterologous form of a y3 gene, e.g., a gene derived from humans. In a further embodiment, the y3 transgene includes a tissue specific promoter, e.g., a milk-specific promoter. In other preferred embodiments, the animal has an endogenous y3 gene which is misexpressed, e.g., a knockout. Such a transgenic animal can serve as a model for studying disorders which are related to mutated or mis-expressed y3 alleles or for use in drug screening.
The invention is also based, in part, on the discovery of a novel laminin subunit, (34.
Accordingly, the invention features a recombinant or substantially pure preparation of a (34 polypeptide.
In preferred embodiment, the X34 polypeptide has the following biological activities: 1 ) it promotes adhesion between tissue elements; 2) it aids in wound healing. In other preferred embodiments: the (34 polypeptide includes an amino acid sequence with at least 65%, 80%, 90%, 95%, 98%, or 99% sequence identity to an amino acid sequence from SEQ ID
NO:1;
the (34 polypeptide includes an amino acid sequence essentially the same as an amino acid sequence in SEQ ID NO: 1; the (34 polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length; the (34 polypeptide includes at least 5, preferably at least 10, more preferably at least 20, most preferably at least 50, 100, or 150 contiguous amino acids from SEQ ID
NO:1; the (34 polypeptide is either, an agonist or an antagonist, of a biological activity of a naturally occurnng ~i4 subunit; the ~i4 polypeptide is a vertebrate, e.g., a mammalian, e.g. a primate, e.g., a human, ~i4 polypeptide.
In preferred embodiments: the ~i4 polypeptide is encoded by the nucleic acid in SEQ
ID N0:2, or by a nucleic acid having at least about 65% to about 70%, more preferably at least 80%, even more preferably at least about 90% to about 95%, and most preferably about 99% sequence identity with the nucleic acid from SEQ ID NO: 2.
In preferred embodiments, the (34 polypeptide includes domains VI and V found in native [34 subunits. Amino acid residues from about 221-262 and 263-535 of SEQ
m NO: 1 are exemplary of domains VI and V, respectively, of (34. Generally, domain VI
is at least 33 residues in length and has preferably at least about 60%, more preferably about 70% to about 80%, and most preferably about 90% to about 95% sequence identity with the amino acid residues 221-262 of the (34 protein shown in SEQ ID NO: 1. Domain V is at least 272 residues in length and has preferably at least about 60%, more preferably about 70% to about 80%, and most preferably about 90% to about 95% sequence identity with the amino acid residues 263-535 of the (34 protein shown in SEQ ID NO: 1. In another embodiment, the (34 polypeptide has at least 5, preferably 6 or 7, and most preferably 8 cysteins as found in native (34. In yet another embodiment, a (34 polypeptide which has antagonist activity has inactivated or excluded regions which comprise at least one of the cysteins found in native [34 protein.
In a preferred embodiment, the ~i4 polypeptide differs in amino acid sequence at up to 1, 2, 3, 5, or 10 residues, from a sequence in SEQ ID NO: 1. In other preferred embodiments, the (34 polypeptide differs in amino acid sequence at up to 1, 2, 3, 5, or 10 % of the residues from a sequence in SEQ ID NO: 1. Preferably, the differences are such that:
the (34 polypeptide exhibits a (34 biological activity, e.g., the (34 polypeptide retains a biological activity of a naturally occurring ~i4 subunit.
In preferred embodiments the (34 polypeptide includes a X34 sequence described herein as well as other N-terminal and/or C-terminal amino acid sequence.
In preferred embodiments, the (34 polypeptide includes all or a fragment of an amino acid sequence from SEQ ID NO:1, fused, in reading frame, to additional amino acid residues, preferably to residues encoded by genomic DNA 5' to the genomic DNA which encodes a sequence from SEQ ID NO:1.
In yet other preferred embodiments, the X34 polypeptide is a recombinant fusion protein having a first (34 portion and a second polypeptide portion, e.g., a second polypeptide portion having an amino acid sequence unrelated to (34. The second polypeptide portion can be, e.g., any of glutathione-S-transferase, a DNA binding domain, or a polymerase activating domain. In preferred embodiment the fusion protein can be used in a two-hybrid assay.
In preferred embodiments the ~i4 polypeptide has antagonistic activity, and is capable of inhibiting the adhesion of connective tissues.
Preferably, the [34 polypeptide is a fragment of a naturally occurnng (34 which inhibits connective tissue adhesion.
Polypeptides of the invention include those which arise as a result of the existence of multiple genes, alternative transcription events, alternative RNA splicing events, and alternative translational and postranslational events. In one aspect of the invention, the ~i4 polypeptide is a splice variant of the [34 subunit. In another preferred embodiment, the (34 splice variant is encoded by a nucleic acid molecule identical to the nucleotide sequence of SEQ ID N0:6. The polypeptide can be expressed in systems, e.g., cultured cells, which result in substantially the same postranslational modifications present when expressed (34 is expressed in a native cell, or in systems which result in the omission of postranslational modifications present when expressed in a native cell.
The invention includes an immunogen which includes a ~i4 polypeptide in an immunogenic preparation, the immunogen being capable of eliciting an immune response specific for the (34 polypeptide, e.g., a humoral response, an antibody response, or a cellular response. In preferred embodiments, the immunogen comprising an antigenic determinant, e.g., a unique determinant, from a protein represented by SEQ ID NO: 1.
The present invention also includes an antibody preparation specifically reactive with an epitope of the (34 immunogen or generally of a (34 polypeptide, preferably an epitope which consists all or in part of residues from the amino acid sequence of SEQ
ID NO:1, or an epitope, which when bound to an antibody, results in the modulation of a biological activity.
In preferred embodiments the (34-like polypeptide, as expressed in the cells in which it is normally expressed or in other eukaryotic cells, has an estimated molecular weight of 200 kDa as determined by SDS-PAGE.
In a preferred embodiment, the (34 polypeptide has one or more of the following characteristics:
(i) it has the ability to promote adhesion between connective tissues;
(ii) it has a molecular weight, amino acid composition or other physical characteristic of [34 subunit of SEQ m NO:1;
(iii) it has an overall sequence similarity of at least 50%, preferably at least 65%, more preferably at least 70, 80, 90, or 95%, with a [34 polypeptide of SEQ >D NO:1;
(iv) it can be isolated from human placenta chorionic villi;
(v) it can associate with a3 or y2 subunits;
(vi) it has coiled coils in domains I and II.
(vii) it has at least 5, preferably 6 or 7, and most preferably 8 of the cysteins found in native (34 sequence.
Also included in the invention is a composition which includes a (34 polypeptide (or a nucleic acid which encodes it) and one or more additional components, e.g., a carrier, diluent, or solvent. The additional component can be one which renders the composition for in vitro and in vivo pharmaceutical or veterinary use. Such uses can include aiding in wound healing or promotion of the adhesion of dermal and epidermal cells.
In another aspect, the invention provides an isolated or substantially pure nucleic acid having or comprising a nucleotide sequence which encodes a ~i4 polypeptide, e.g., a (34 polypeptide described herein.
A preferred embodiment of the invention features a nucleic acid molecule having a nucleotide sequence at least about 65% sequence identity to a nucleotide sequence of SEQ ID
N0:2. In other preferred embodiments, the ~i4 polypeptide is encoded by a nucleic acid molecule having a nucleotide sequence with at least 70%, preferably 80%, more preferably about 90% to about 95%, and even more preferably about 99% sequence identity to the nucleotide sequence from SEQ m N0:2. In another preferred embodiment, the (i4 polypeptide is encoded by the nulceic acid molecule of SEQ ID N0:2.
In preferred embodiments, the subject ~i4 nucleic acid will include a transcriptional regulatory sequence, e.g. at least one of a transcriptional promoter or transcriptional enhancer sequence, operably linked to the [34 gene sequence (also referred to as LAMB4), e.g., to render the ~i4 gene sequence suitable for use as an expression vector.
In yet a further preferred embodiment, the nucleic acid which encodes a (34 polypeptide of the invention, hybridizes under stringent conditions to a nucleic acid probe corresponding to at least 12 consecutive nucleotides from SEQ ID N0:2, more preferably to at least 20 consecutive nucleotides from SEQ ID N0:2.
In a preferred embodiment, the nucleic acid differs by at least one nucleotide from a nucleotide sequence of SEQ m N0:2, nucleotides 4686-5870.
The invention also provides a probe or primer which includes or comprises a substantially purified oligonucleotide. The oligonucleotide includes a region of nucleotide sequence which hybridizes under stringent conditions to at least 10 consecutive nucleotides of sense or antisense sequence from SEQ ID NO: 2, or naturally occurring mutants thereof. In preferred embodiments, the probe or primer further includes a label group attached thereto.
The label group can be, e.g., a radioisotope, a fluorescent compound, an enzyme, and/or an enzyme co-factor. Preferably the oligonucleotide is at least 10 and less than 20, 30, 50, 100, or 150 nucleotides in length.
The invention involves nucleic acids, e.g., RNA or DNA, encoding a (34 polypeptide of the invention. This includes double stranded nucleic acids as well as coding and antisense single strands.
In another aspect, the invention features a cell or purified preparation of cells which include a p4 transgene, or which otherwise misexpress a ~4 gene. The cell preparation can consist of human or non human cells, e.g., rodent cells, e.g., mouse or rat cells, rabbit cells, or pig cells. In preferred embodiments, the cell or cells include a (34 transgene, e.g., a heterologous form of a (34 gene, e.g., a gene derived from humans (in the case of a non-human cell). The X34 transgene can be rnisexpressed, e.g., overexpressed or underexpressed.
In other preferred embodiments, the cell or cells include a gene which misexpress an endogenous (34 gene, e.g., a gene the expression of which is disrupted, e.g., a knockout. Such cells can serve as a model for studying disorders which are related to mutated or mis-expressed ~i4 alleles or for use in drug screening.
In another aspect, the invention features a transgenic [34 animal, e.g., a rodent, e.g., a mouse or a rat, a rabbit, a pig, a goat, or a cow. In preferred embodiments, the transgenic animal includes (and preferably express) a heterologous form of a (34 gene, e.g., a gene derived from humans. In a fiutller embodiment, the (34 transgene includes a tissue specific promoter, e.g., a milk-specific promoter. In other preferred embodiments, the animal has an endogenous (34 gene which is misexpressed, e.g., a knockout. Such a transgenic animal can serve as a model for studying disorders which are related to mutated or mis-expressed (34 alleles or for use in drug screening.
In another aspect, the invention features, a method of promoting adhesion of a first tissue element to a second tissue element. The method includes contacting one or both of the first tissue element and the second tissue element with an amount of a laminin molecule described herein, e.g., Iaminin 12, or'y3 (or a Iaminin trimer which inlcudes y3), sufficient to promote adhesion. The method can be performed in vivo, or in vitro. In in vivo methods the laminin is administered to the subject. The administration can be directed to the site where adhesion is desired, e.g., by topical appication or by injection, or administered in a systemic fashion.
A tissue element can be a cell or a multi-cellular on acellular structure.
Examples of tissue elements include, skin cells, e.g., epidermal or dermal cells, neuronal cells, e.g., nerve cells, retinal cells, central or pereipheral nervous system components, basement membrane or components of the basement membrane, or any cell or structure which in normal, non-5 traumatized, or non-diseased tissue is adjascent or adhered to a specific tissue element recited herein.
In preferred embodiments the molecule is exogenous (e.g., administered to a subject) or is recombinant.
In preferred embodiments the method is an vivo method. In vivo methods can be 10 autologous, allogeneic, or xenogeneic. In autologous methods, adhesion between two tissue elements from the subject is promoted. In allogeneic methods, adhesion between a recipient tissue element and a donor tissue element from an allogeneic donor is promoted. In xenogeneic methods, adhesion between a recipient tissue element and a donor tissue element from a xenogeneic donor is promoted. Thus, one element can be a donor tissue element which is implanted into a recipient subject.
In preferred embodiments the first tissue is healthy tissue, e.g., skin tissue, and the second tissue is wounded, e.g., burned, diseased, traumatized, cut, and the tissue, or is a wound bed. For example, the first tissue is skin tissue, from the subject or from a donor, and the second tissue is wounded, e.g., burned or abraided tissue.
In preferred embodiments the first tissue and second tissue element are normally adhered but have become detached from one another due to trauma, burn or other physical injury, disease, or age.
In preferred embodiments: the first tissue element is a dermal cell and the second tissue element is an epidermal cell; the first tissue element is a nerve cell or nerve and the second tissue element is a cell or structure which in normal, non-traumatized, or non-diseased tissue is adjascent or adhered to the nerve cell or nerve; the first tissue element is a retinal cell or retina tissue and the second tissue element is a cell or structure which in normal, non-traumatized, or nvn-diseased tissue is adjascent or adhered to the a retinal cell or retina tissue, the first tissue is a nerve and the second tissue is basement membrane.
The administration of laminin can be repeated.
In another aspect, the invention features a method of promoting wound healing in a subject. The method includes administering an amount of a laminin molecule described herein, e.g., laminin 12, y3 (or a laminin trimer which inlcudes y3), sufficient to promote healing to the wound. The administration can be directed to the site where healing is desired, e.g., by topical appication or by injection, or administered in a systemic fashion.
The wound can be in any tissue, but preferably ina tissue in which the laminin normally occurs. Examples skin, central or peripheral nervous tissue, tissues ofthe eye, e.g., the retinal, the basement membrane, or any tissue which in normal, non-traumatized, or non-diseased tissue is adjascent or adhered thereto.
In preferred embodiments the molecule is exogenous (e.g., administered to a subject) or is recombinant.
In preferred embodiments the wound tissue is burned, diseased, traumatized, cut, the subject of immune attack, e.g, autoimmune attack, or abraided.
The administration of laminin can be repeated.
In another aspect, the invention features a method of promoting nerve growth or regeneration in a subject. The method includes administering an amount of a laminin molecule described herein, e.g., laminin 12, or y3 (or a laminin trimer which inlcudes y3), sufficient to promote nerve growth or regeneration. The administration can be directed to the site where nerve growth or regeneration is desired, e.g., by topical appication or by injection, or administered in a systemic fashion.
In preferred embodiments the molecule is exogenous (e.g., administered to a subject) or is recombinant.
In preferred embodiments the nerve growth or regeneration is promoted at a wound site.
The administration of laminin can be repeated.
In another aspect, the invention provides, a method of determining if a subject is at risk for a disorder related to a lesion in or the misexpression of a gene which encodes a laminin described herein, e.g., y3 or laminin 12.
Such disorders include, e.g., a disorder associated with the misexpression of a laminin, e.g., laminin 12, or misexpression of the y3 subunit; a disorder of the central or peripheral nervous system;
a disorder associated with a genetic lesion at chromosome 9, region q31-34;
Fukuyama-type muscular dystrophy; muscle-eye-brain disease; Walker-Warburg Syndrome (hydrocephalus, ageria, and retinal displasia); a retinal disorder, e.g, retinitis pigmentosa-deafness syndrome {which may be a subtype of Walker-Warburg Syndrome); a disorder associated with abnormal levels, e.g., abnormally low levels, of adhesion between tissues; a disorder associated with the basement membrane; a skin disorder, e.g., an epidermal or dermal, disorder; a disorder associated with the testis, spleen, placenta, thymus, ovary, small intestine, lung, or liver.
The method includes one or more of the following:
detecting, in a tissue of the subject, the presence or absence of a mutation which affects the expression of the y3 gene, or other gene which encodes a subunit of laminin 12, e.g., detecting the presence or absence of a mutation in a region which controls the expression of the gene, e.g., a mutation in the S' control region;
detecting, in a tissue of the subject, the presence or absence of a mutation which alters the structure of the y3 gene, or other gene which encodes a subunit of laminin 12;
detecting, in a tissue of the subject, the misexpression of they3 gene, or other gene which encodes a subunit of laminin 12 at the mRNA level, e.g., detecting a non-wild type level of a y3, or an other laminin 12 subunit mRNA ;
detecting, in a tissue of the subject, the misexpression of the y3 gene, or other gene which encodes a subunit of laminin 12, at the protein level, e.g., detecting a non-wild type level of a y3, or an other laminin 12 subunit polypeptide.
In preferred embodiments the method includes: ascertaining the existence of at least one of: a deletion of one or more nucleotides from the y3 gene, or other gene which encodes a subunit of laminin 12; an insertion of one or more nucleotides into the gene, a point mutation, e.g., a substitution of one or more nucleotides of the gene, a gross chromosomal -rearrangement of the gene, e.g., a translocation, inversion, or deletion.
For example, detecting the genetic lesion can include: (i) providing a probe/primer including an oligonucleotide containing a region of nucleotide sequence which hybridizes to a sense or antisense sequence from SEQ ID N4:4, or naturally occurring mutants thereof or 5' or 3' flanking sequences naturally associated with the LAMG3 gene; (ii) exposing the probe/primer to nucleic acid of the tissue; and detecting, by hybridization, e.g., in situ hybridization, of the probe/primer to the nucleic acid, the presence or absence of the genetic lesion.
I S In preferred embodiments detecting the misexpression includes ascertaining the existence of at least one of an alteration in the level of a messenger RNA
transcript of the y3 gene, or other gene which encodes a subunit of laminin I2; the presence of a non-wild type splicing pattern of a messenger RNA transcript of the y3 gene, or other gene which encodes a subunit of laminin 12; or a non-wild type level of Y3, or other subunit of laminin 12.
Methods of the invention can be used prenatally or to determine if a subject's offspring will be at risk for a disorder.
In preferred embodiments the method includes determining the structure of a y3 gene, or other gene which encodes a subunit of laminin 12, an abnormal structure being indicative of risk for the disorder.
In preferred embodiments the method includes contacting a sample form the subject with an antibody to the laminin protein or a nucleic acid which hybridizes specifically with the y3 gene, or other gene which encodes a subunit of laminin 12.
In another aspect, the invention features, a method of promoting adhesion of a first tissue element to a second tissue element. The method includes contacting one or both of the first tissue element and the second tissue element with an amount of a laminin molecule described herein, e.g., (34, sufficient to promote adhesion. The method can be performed in vivo, or in vitro. In in vivo methods the laminin is administered to the subject. The administration can be directed to the site where adhesion is desired, e.g., by topical application or by injection, or administered in a systemic fashion.
A tissue element can be a cell or a multi-cellular on acellular structure.
Examples of tissue elements include, skin cells, e.g., epidermal or dermal cells, neuronal cells, e.g., nerve cells, retinal cells, central or pereipheral nervous system components, basement membrane or components of the basement membrane, or any cell or structure which in normal, non-traumatized, or non-diseased tissue is adjascent or adhered to a specific tissue element recited herein.
In preferred embodiments the molecule is exogenous (e.g., administered to a subject) or is recombinant.
In preferred embodiments the method is an vivo method. In vivo methods can be autologous, allogeneic, or xenogeneic. In autologous methods, adhesion between two tissue elements from the subject is promoted. In allogeneic methods, adhesion between a recipient tissue element and a donor tissue element from an allogeneic donor is promoted. In xenogeneic methods, adhesion between a recipient tissue element and a donor tissue element from a xenogeneic donor is promoted. Thus, one element can be a donor tissue element which is implanted into a recipient subject.
In preferred embodiments the first tissue is healthy tissue, e.g., skin tissue, and the second tissue is wounded, e.g., burned, diseased, traumatized, cut, and the tissue, or is a wound bed. For example, the first tissue is skin tissue, from the subject or from a donor, and the second tissue is wounded, e.g., burned or abraided tissue.
In preferred embodiments: the first tissue element is a dermal cell and the second tissue element is an epidermal cell; the first tissue element is a nerve cell or nerve and the second tissue element is a cell or structure which in normal, non-traumatized, or non-diseased tissue is adjascent or adhered to the nerve cell or nerve; the first tissue is a nerve and the second tissue is basement membrane.
The administration of laminin can be repeated.
In another aspect, the invention features a method of promoting wound healing in a subject. The method includes administering an amount of a laminin molecule described herein, e.g., ~i4, sufficient to promote healing to the wound. The administration can be directed to the site where healing is desired, e.g., by topical appication or by injection, or administered in a systemic fashion.
The wound can be in any tissue, but preferably in a tissue in which the laminin normally occurs in fetal or adult life. Examples examples include skin the basement membrane.
In preferred embodiments the molecule is exogenous (e.g., administered to a subject) or is recombinant.
In preferred embodiments the wound tissue is burned, diseased, traumatized, cut, the subject of immune attack, e.g, autoimmune attack, or abraded.
The administration of laminin can be repeated.
In another aspect, the invention features a method of promoting tissue growth, development, or regeneration in a subject. The method includes administering an amount of a laminin molecule described herein, e.g., (34, sufficient to promote tissue growth, development, or regeneration in a subject. The administration can be directed to the site where nerve growth or regeneration is desired, e.g., by topical appication or by injection, or administered in a systemic fashion.
In preferred embodiments the molecule is exogenous (e.g., administered to a subject) or is recombinant.
In preferred embodiments the nerve growth or regeneration is promoted at a wound site.
The administration of laminin can be repeated.
In another aspect, the invention provides, a method of determining if a subject is at risk for a disorder related to a lesion in or the misexpression of a laminin molecule described herein, e.g., (34.
Such disorders include, e.g., a disorder associated with the misexpression of a laminin, e.g., ~i 4; a disorder associated with a genetic lesion at chromosome region 7q22-q31.2; a developmetnal disorder; a disorder associated with abnormal levels, e.g., abnormally low levels, of adhesion between tissues; a disorder associated with the basement membrane; a skin disorder, e.g., an epidermal or dermal, disorder.
The method includes one or more of the following:
detecting, in a tissue of the subject, the presence or absence of a mutation which affects the expression of the ~i4 gene, e.g, detecting the presence or absence of a mutation in a region which controls the expression of the gene, e.g., a mutation in the 5' control region;
detecting, in a tissue of the subject, the presence or absence of a mutation which alters the structure of the ~i4 gene;
detecting, in a tissue of the subject, the misexpression of the (34 gene, e.g., detecting a non-wild type level of a (34 mRNA ;
detecting, in a tissue of the subject, the misexpression of the (34, at the protein level, e.g., detecting a non-wild type level of a (34 polypeptide.
In preferred embodiments the method includes: ascertaining the existence of at least one of a deletion of one or more nucleotides from the (34; an insertion of one or more nucleotides into the gene, a point mutation, e.g., a substitution of one or more nucleotides of the (34 gene, a gross chromosomal rearrangement of the (34 gene, e.g., a translocation, inversion, or deletion.
For example, detecting the genetic lesion can include: (i) providing a probe/primer including an oligonucleotide containing a region of nucleotide sequence which hybridizes to a sense or antisense sequence from SEQ ID N0:2, or naturally occurring mutants thereof or 5' or 3' flanking sequences naturally associated with the LAMB4 gene; (ii) exposing the probelprimer to nucleic acid of the tissue; and detecting, by hybridization, e.g., in situ hybridization, of the probe/primer to the nucleic acid, the presence or absence of the genetic lesion.
In preferred embodiments: detecting the misexpression includes ascertaining the existence of at least one of an alteration in the level of a messenger RNA
transcript of the (3 4; the presence of a non-wild type splicing pattern of a messenger RNA
transcript of the (34;
or a non-wild type level of (34.
Methods of the invention can be used prenatally or to determine if a subject's offspring will be at risk for a disorder.
5 In preferred embodiments the method includes determining the structure of the a ~i4, an abnormal structure being indicative of risk for the disorder.
In preferred embodiments the method includes contacting a sample form the subject with an antibody to the (34 protein or a nucleic acid which hybridizes specifically with the ~i4.
In another aspect, the invention features, a method of evaluating a compound for the 10 ability to interact with, e.g., bind, a subject laminin polypeptide, e.g., laminin 12, y3, a laminin trimer which inlcudes ~3, ~i4, or a laminin trimer which includes (34.
The method includes: contacting the compound with the subject laminin polypeptide; and evaluating ability of the compound to interact with, e.g., to bind or form a complex with the subject laminin polypeptide. This method can be performed in vitro, e.g., in a cell free system, or in 15 vivo, e.g., in a two-hybrid interaction trap assay. This method can be used to identify naturally occurring molecules which interact with subject laminin polypeptide.
It can also be used to find natural or synthetic inhibitors of subject laminin polypeptide.
In another aspect, the invention features, a method of evaluating a compound, e.g., a polypeptide, e.g., a naturally occurring ligand of or a naturally occuring substrate to which binds a subject laminin polypeptide, e.g., of laminin 12, y3, a laminin trimer which inlcudes y 3, (34, or a laminin trimer which includes (34, for the ability to bind a subject laminin polypeptide. The method includes: contacting the compound with the subject laminin polypeptide; and evaluating the ability of the compound to interact with, e.g., to bind or form a complex with the subject laminin polypeptide, e.g., the ability of the compound to inhibit a subject laminin polypeptide/ligand interaction. This method can be performed in vitro, e.g., in a cell free system, or in vivo, e.g., in a two-hybrid interaction trap assay. This method can be used to identify compounds, e.g., fragments or analogs of a subject laminin polypeptide, which are agonists or antagonists of a subject laminin polypeptide.
In another aspect, the invention features, a method of evaluating a first compound, e.g., a subject laminin polypeptide, e.g., laminin 12, y3, a laminin trimer which inlcudes Y3, (3 4, or a laminin trimer which includes (34, for the ability to bind a second compound, e.g., a second polypeptide, e.g., a naturally occurring ligand of or substrate to which binds a subject laminin polypeptide. The method includes: contacting the first compound with the second compound; and evaluating the ability of the first compound to form a complex with the second compound. This method can be performed in vitro, e.g., in a cell free system, or in vivo, e.g., in a two-hybrid interaction trap assay. This method can be used to identify compounds, e.g., fragments or analogs of a subject laminin polypeptide, which are agonists or antagonists of a subject laminin polypeptide.
In yet another aspect, the invention features a method for evaluating a compound, e.g., for the ability to modulate an interaction, e.g., the ability to inhibit an interaction of a subject laminin polypeptide, e.g., of laminin 12, y3, a laminin trimer which inlcudes y3, (34, or a laminin trimer which includes [34, with a second polypeptide, e.g., a polypeptide, e.g., a natural ligand of the of or a substrate wo which binds a subject laminin polypeptide, or a fragment thereof. The method includes the steps of (i) combining the second polypeptide (or preferably a purified preparation thereof), a subject laminin polypeptide, (or preferably a purified preparation thereof), and a compound, e.g., under conditions wherein in the absence of the compound, the second polypeptide, and the subject laminin polypeptide, are able to interact, e.g., to bind or form a complex; and (ii) detecting the interaction, e.g., detecting the formation (or dissolution) of a complex which includes the second polypeptide, and the subject laminin polypeptide. A change, e.g., a decrease or increase, in the formation of the complex in the presence of a compound (relative to what is seen in the absence of the compound)-is indicative of a modulation, e.g., an inhibition or promotion, of the interaction between the second polypeptide, and the subject laminin polypeptide. In preferred embodiments: the second polypeptide, and the subject laminin polypeptide, are combined in a cell-free system and contacted with the compound; the cell-free system is selected from a group consisting of a cell lysate and a reconstituted protein mixture; the subject laminin polypeptide, and the second polypeptide are simultaneously expressed in a cell, and the cell is contacted with the compound, e.g. in an interaction trap assay (e.g., a two-hybrid assay).
In yet another aspect, the invention features a two-phase method (e.g., a method having an in vitro, e.g., in a cell free system, and an in vivo phase) for evaluating a compound, e.g., for the ability to modulate, e.g., to inhibit or promote, an interaction of a subject laminin polypeptide subject laminin polypeptide, e.g., of laminin 12, y3, a laminin trimer which inlcudes y3, (34, or a laminin trimer which includes [34, with a second compound, e.g., a second polypeptide, e.g., a naturally occurring ligand of or a substrate to which binds a subject laminin polypeptide, or a fragment thereof. The method includes steps (i) and (ii) of the method described immediately above performed in vitro, and further includes: (iii) determining if the compound modulates the interaction in vitro, e.g., in a cell free system, and if so; (iv) administering the compound to a cell or animal;
and (v) evaluating the in vivo effect of the compound on an interaction, e.g., inhibition, of a subject laminin polypeptide, with a second polypeptide.
In another aspect, the invention features, a method of evaluating a compound for the ability to bind a nucleic acid encoding a subject laminin polypeptide, e.g., a laminin 12, y3, a laminin trimer which inlcudes y3, (34, or a laminin trimer which includes (34 polypeptide regulatory sequence. The method includes: contacting the compound with the nucleic acid;
and evaluating ability of the compound to form a complex with the nucleic acid.
In another aspect, the invention features a method of making a y3 or [34 polypeptide, e.g., a peptide having a non-wild type activity, e.g., an antagonist, agonist, or super agonist of a naturally occurring y3 or ~i4 polypeptide, e.g., a naturally occurring Y3 or X34 polypeptide.
The method includes: altering the sequence of a y3 or ~i4 polypeptide, e.g., altering the sequence , e.g., by substitution or deletion of one or more residues of a non-conserved region, a domain or residue disclosed herein, and testing the altered polypeptide for the desired activity.
In another aspect, the invention features a method of making a fragment or analog of a y3 or (34 polypeptide having a biological activity of a naturally occurring y3 or [34 polypeptide. The method includes: altering the sequence, e.g., by substitution or deletion of one or more residues, of a y3 or (34 polypeptide, e.g., altering the sequence of a non-conserved region, or a domain or residue described herein, and testing the altered polypeptide for the desired activity.
In another aspect, the invention features, a human cell, e.g., a hematopoietic stem cell, transformed with nucleic acid which encodes a subject laminin polypeptide, e.g., a laminin 12, y3, a laminin trimer which inlcudes y3, (34, or a laminin trimer which includes ~i4.
In another aspect, the invention includes: a y3, ~i4 nucleic acid, e.g., a y3, /34 nucleic acid inserted into a vector; a cell transformed with a y3, ~i4 nucleic acid; a y3, (34 made by culturing a cell transformed with a y3, (34 nucleic acid; and a method of making a y3, [34 polypeptide including culturing a a cell transformed with a y3, (34 nucleic acid.
The inventors have shown that y3 forms laminin 12'in association with a2 and ail.
However, we are unsure of the chain associations of y3 within other tissues.
It is very likely that y3 can also associate with y3, a3, a4, and a5; with (32, ~i3, [34 and (35. Therefore, our results predict 25 new laminins: laminins 12-37. y3 and [i4 polypetides of the invention can be expressed with, assembled with, or administered with other laminin subunits in any of the methods described herein. E.g., y3 can be assembled with an a and a (3 subunit to form a laminin trimer. [34 can be assembled with an a and a (3 subunit to form a laminin trimer.
In any treatment or therapeutic application which administers y3, a (32 subunit can also be administered.
A "heterologous promoter", as used herein is a promoter which is not naturally associated with a gene or a purified nucleic acid.
A "purified"or "substantially pure" or isolated "preparation" of a polypeptide, as used herein, means a polypeptide that has been separated from other proteins, lipids, and nucleic acids with which it naturally occurs. Preferably, the polypeptide is also separated from substances, e.g., antibodies or gel matrix, e.g., polyacrylamide, which are used to purify it.
Preferably, the polypeptide constitutes at least 10, 20, 50 70, 80 or 95% dry weight of the purified preparation. Preferably, the preparation contains: sufficient polypeptide to allow protein sequencing; at least 1, 10, or 100 ~,g of the polypeptide; at least 1, 10, or 100 mg of the polypeptide.
A "purified preparation of cells", as used herein, refers to, in the case of plant or animal cells, an in vitro preparation of cells and not an entire intact plant or animal. In the case of cultured cells or microbial cells, it consists of a preparation of at least 10% and more preferably 50% of the subject cells.
A "treatment", as used herein, includes any therapeutic treatment, e.g., the administration of a therapeutic agent or substance, e.g., a drug.
An "isolated" or " pure nucleic acid", e.g., a substantially pure DNA, is a nucleic acid which is one or both of not immediately contiguous with either one or both of the sequences, e.g., coding sequences, with which it is immediately contiguous (i.e., one at the 5' end and one at the 3' end) in the naturally-occurring genome of the organism from which the nucleic acid is derived; or which is substantially free of a nucleic acid sequence with which it occurs in the organism from which the nucleic acid is derived. The term includes, for example, a recombinant DNA which is incorporated into a vector, e.g., into an autonomously replicating plasmid or virus, or into the genomic DNA of a prokaryote or eukaryote, or which exists as a separate molecule (e.g., a cDNA or a genomic DNA fragment produced by PCR or restriction endonuclease treatment) independent of other DNA sequences. Substantially pure DNA can also includes a recombinant DNA which is part of a hybrid gene encoding sequence.
"Sequence identity or homology", as used herein, refers to the sequence similarity between two polypeptide molecules or between two nucleic acid molecules. When a position in both of the two compared sequences is occupied by the same base or amino acid monomer subunit, e.g., if a position in each of two DNA molecules is occupied by adenine, then the molecules are homologous or sequence identical at that position. The percent of homology or sequence identity between two sequences is a function of the number of matching or homologous identical positions shared by the two sequences divided by the number of positions compared x 100. For example, if 6 of 10, of the positions in two sequences are the same then the two sequences are 60% homologous or have 60% sequence identity.
By way of example, the DNA sequences ATTGCC and TATGGC share 50% homology or sequence identity. Generally, a comparison is made when two sequences are aligned to give maximum homology.
The tenors "peptides", "proteins", and "polypeptides" are used interchangeably herein.
As used herein, the term "transgene" means a nucleic acid sequence {encoding, e.g., one or more subject laminin polypeptides), which is partly or entirely heterologous, i.e., foreign, to the transgenic animal or cell into which it is introduced, or, is homologous to an endogenous gene of the transgenic animal or cell into which it is introduced, but which is designed to be inserted, or is inserted, into the animal's genome in such a way as to alter the genome of the cell into which it is inserted (e.g., it is inserted at a location which differs from that of the natural gene or its insertion results in a knockout). A transgene can include one or more transcriptional regulatory sequences and any other nucleic acid, such as introns, that may be necessary for optimal expression of the selected nucleic acid, all operably linked to the selected nucleic acid, and may include an enhancer sequence.
As used herein, the term "transgenic cell" refers to a cell containing a transgene.
As used herein, a "transgenic animal" is any animal in which one or more, and preferably essentially all, of the cells of the animal includes a transgene.
The transgene can be introduced into the cell, directly or indirectly by introduction into a precursor of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with a recombinant virus. This molecule may be integrated within a chromosome, or it may be extrachromosomally replicating DNA. ~ -As used herein, the term "tissue-specific promoter" means a DNA sequence that serves as a promoter, i.e., regulates expression of a selected DNA sequence operably linked to the promoter, and which effects expression of the selected DNA
sequence in specific cells of a tissue, such as mammary tissue. The term also covers so-called "leaky"
promoters, which regulate expression of a selected DNA primarily in one tissue, but cause expression in other tissues as well.
"Unrelated to a y3 or (34 amino acid or nucleic acid sequence" means having less than 30% sequence identity, less than 20% sequence identity, or, preferably, less than 10%
homology with a naturally occuring y3 or (34 sequence disclosed herein.
A polypeptide has y3 biological activity if it has one or more of the properties of y3 disclosed herein. A polypeptide has biological activity if it is an antagonist, agonist, or super-agonist of a polypeptide having one of the properties of y3 disclosed herein.
A polypeptide has (34 biological activity if it has one or more of the properties of [34 disclosed herein. A polypeptide has biological activity if it is an antagonist, agonist, or super-agonist of a polypeptide having one of the properties of ~i4 disclosed herein.
"Misexpression", as used herein, refers to a non-wild type pattern of gene expression, at the RNA or protein level. It includes: expression at non-wild type levels, i.e., over or under expression; a pattern of expression that differs from wild type in terms of the time or stage at which the gene is expressed, e.g., increased or decreased expression (as compared with wild type) at a predetermined developmental period or stage; a pattern of expression that differs from wild type in terms of decreased expression (as compared with wild type) in a predetermined cell type or tissue type; a pattern of expression that differs from wild type in terms of the splicing size, amino acid sequence, post-transitional modification, or biological activity of the expressed polypeptide; a pattern of expression that differs from wild type in terms of the effect of an environmental stimulus or extracellular stimulus on expression of the gene, e.g., a pattern of increased or decreased expression (as compared with wild type) in the presence of an increase or decrease in the strength of the stimulus.
Subject, as used herein, can refer to a mammal, e.g., a human, or to an experimental or animal or disease model. The subject can also be a non-human animal, e.g., a horse, cow, goat, or other domestic animal.
As described herein, one aspect of the invention features a substantially pure (or recombinant) nucleic acid which includes a nucleotide sequence encoding a Y3 or ~i4 polypeptide and/or equivalents of such nucleic acids. The term nucleic acid as used herein can include fragments and equivalents. The term equivalent refers to nucleotide sequences encoding functionally equivalent polypeptides. Equivalent nucleotide sequences will include sequences that differ by one or more nucleotide substitutions, additions or deletions, such as 5 allelic variants, and include sequences that differ from the nucleotide sequences disclosed herein by degeneracy of the genetic code.
The practice of the present invention will employ, unless otherwise indicated, conventional techniques of cell biology, cell culture, molecular biology, transgenic biology, microbiology, recombinant DNA, and immunology, which are within the skill of the art.
10 Such techniques are described in the literature. See, for example, Molecular Cloning A
Laboratory Manual, 2nd Ed., ed. by Sambrook, Fritsch and Maniatis (Cold Spring Harbor Laboratory Press: 1989); DNA Cloning, Volumes I and II (D. N. Glover ed., 1985);
Oligonucleotide Synthesis (M. J. Gait ed., 1984); Mullis et al. U.S. Patent No: 4,683,195;
Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. 1984);
Transcription And 15 Translation (B. D. Hames & S. J. Higgins eds. 1984); Culture Of Animal Cells (R. I.
Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells And Enzymes (IRL Press, 1986); B.
Perbal, A Practical Guide To Molecular Cloning (1984); the treatise, Methods In Enzymology {Academic Press, Inc., N.Y.); Gene Transfer Vectors For Mammalian Cells (J. H.
Miller and M. P. Calos eds., 1987, Cold Spring Harbor Laboratory); Methods In Enzymolo~r, Vols. 154 20 and 155 (Wu et al. eds.), Immunochemical Methods In Cell And Molecular Biology (Mayer and Walker, eds., Academic Press, London, 1987); Handbook Of Experimental Immunology, Volumes I-IV (D. M. Weir and C. C. Blackwell, eds., 1986); Manipulating the Mouse Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986).
Other features and advantages of the invention will be apparent from the following detailed description, and from the claims.
DETAILED DESCRIPTION
The drawings are briefly described.
Figure 1 depicts the cDNA sequence for human a2 subunit.
Figure 2 depicts the predicted amino acid sequence for human a2 subunit.
Figure 3 depicts the cDNA sequence for human ø4 subunit.
Figure 4 depicts the predicted amino acid sequence for human ø4 subunit.
Figure 5 depicts an alignment of the amino acid sequence of human ø4 of SEQ ID
NO: 1 and ø4 splice varient of SEQ ID N0:5 and laminin ø1, ø2, and ø3 subunits.
Figure 6 provides a comparision of the similarities of laminin ø4 domains with the domains of other known laminin ø subunits.
WO 99/19348 PC'T/US98/21391 Lanvnin 12 was isolated from human placental chorionic villi. Briefly, human chorionic placental villi were frozen in liquid nitrogen, ground in a Waring blender and washed in 1 M NaCI. The final tissue pellet (200g, wet weight) was suspended in 1 L of extraction buffer (50 mM Tris-HCl 50 mM, pH=7.8; NaCI O.SM, EDTA IOmM, 625 mg/1 of N-ethylmaleimide, 150 mg/1 of phenyhnethylsulphonyl fluoride. The suspension was incubated at 4°C with stirring for 48 h. Unless otherwise noted, all subsequent steps were performed at 4oC. The soluble fraction was collected following centrifugation (30000 x g, 60 min) and precipitated by 300g11 of Ammonium Sulfate. The precipitated proteins were collected by centrifugation (30000 x g, 60 min) and redissolved into chromatography buffer (2M Urea, 25 mM NaCI, 5 mM EDTA, and 50 mM Tris-HCI, pH=7.8). The sample was then dialyzed against the same buffer. Following dialysis, 0.5 volumes of buffer equilibrated DEAE-cellulose (DE-52, Whatman) was added and the mixture shaken overnight.
Material not bound to DEAF-cellulose was collected by filtration on a Buchner fimnel (Whatman filter 4) and precipitated by addition of 300g/1 of ammonium sulfate. The proteins were collected by centrifugation (30000 x g, 60 min), redissolved in the Concanavalin-A
buffer (0.5 M
NaCI, 5 mM CaCl2, 5 mM MgCl2, and Tris-HCl 50 mM, pH=7.8) and dialyzed against the same buffer overnight. The fraction was applied to a 2.5 x 5 cm Concanavalin-A
sepharose column (Pharmacia), and unbound material was removed by extensive washing.
Bound proteins were first eluted with 10 mM a-D- Mannopyrannoside (Sigma, St. Louis, MO) and secondly with 1 M a-D-Glucopyrannoside (Sigma, St. Louis, MO). A third elution with 1M
a-D-Manno-pyrannoside (Sigma, St. Louis, MO) allowed the recovery of the proteins of interest. Each fraction was independently concentrated to 10 ml on a AmiconT""
concentrator (30 kDa membrane) and applied to a 2.5 x 100 cm Sephacryl S-500 column in a 0.5 M NaCI, 50 mM Tris-HCI, pH=7.8 buffer. The fractions of interest were pooled, dialyzed against Mono-Q buffer (0.1 M NaCI, 25 mM Tris-HCI, pH=7.8) and applied to the 1 x 5 cm Mono-Q
column (Pharmacia). Elution was achieved with a 60 ml 0.1-0.5 M NaCI gradient.
The final fraction of interest resulting from the above protocol contains multiple laminins. The laminin 12 was resolved from this mixture by SDS-PAGE (3-5%
polyacrylamide) under non-reducing conditions. Six band were resolved. Only the bands at approximately 560 kDa and at the top of the gel were shown to be reactive with polyclonal anti-laminin antiserum (Sigma, St. Louis, MO).
Laminin 12 was excised, equilibrated and reduced in 10°b 2-me SDS-PAGE
sample buffer, and resolved by 5 % SDS-PAGE. Three bands were resolved, which were approximately 205 kDa, 185 kDa, and 170 kDa. The band at 185 kDa reacted with monoclonal antibody 545, specific to the laminin ~i 1 subunit. Each of the three bands were digested with trypsin and the peptides were resolved by HPLC. The selected resolves were subject to peptide sequencing.
Protein sequencing was done according to Aebersold et al. (1987). The complex laminin 5-laminin 7 was run on a polyacrylamide gel in the presence of 2-mercaptoethanol and blotted onto a nitrocellulose membrane (Biorad). The 190 kDa band of ~i2 and the 165 kDa a3 band were separately excised and digested by protease trypsin. The digested product was separated by HPLC and one fragment was sequenced on an Applied Biosystems sequenator (Applied Biosystems, Foster City, CA). The 205 kDa chain contained a sequence identical to human laminin a2, and was thus identified as human laminin a2 subunit. The 185 kDa produced two peptides identical to human (31, and was thus identified as human laminin ~i 1 subunit. The band at 170 kDa contained three sequences not contained in any known laminin chain. A N-terminal sequence of the 170 kDa chain was also determined. In addition, the N-terminal sequence was not identical to any known laminin sequence.
The cDNA sequences of human yl and r2 were used to probe the National Center for Biomedical Information (NCBI) dBestTM data base by BLAST search and a clone was isolated that was homologous, but not identical to yl and y2. This clone was extended by PCR at the 5' end using Marathon cDNA from human placenta from Clonetech (Palo Alto, CA). The resulting sequence was determined to be 100% identical to all three of the 170 kDa band peptide sequences.
Comparison of the nucleotide sequence of the isolated y3 subunit to yl, demonstrated about 80 % sequence identity.
The human cDNA encoding y3, which is approximately 4710 nucleotides in length, encodes a protein having an estimated molecular weight of approximately 146 kDa (including post-translational modifications) and which is approximately 1570 amino acid residues in length. The human y3 protein contains a nidogen-binding domain, which can be found, for example, from about amino acids 750-755 of SEQ ID N0:3. The y3 amino acid sequence and the nucleotide sequence encoding human laminin y3 is shown in SEQ ID N0:3 and SEQ ID
N0:4, respectively.
By Northern analysis the size of the y3 mRNA is approximately 5 kb, which is consistent with other laminin y subunits. The y3 mRNA transcript is expressed in human tissues including spleen, testis, brain, placenta, lung, and possibly liver.
Chromosomal mapping using the y3 cDNA sequence indicates that the human y3 gene is located on WO 99/19348 PCT/US98lZ1391 chromosome 9q31-34. The location of y3 on chromosome 9 was confirmed by FISH
analysis using a 1.3 kb y3 cDNA probe within the predicted domains I and II, which are~the regions of the least sequence identity among y subunits. Four human genes associated with Walker-Walburg syndrome, Fukuyama muscular dystrophy, retinitis pigmentosa-deafness syndrome and Eye, Muscle, Brain disease have also been mapped to chromosome 9q31-34.
Production of a y3 specific antibody and tissue lo~~alization of ~
The 170 kDa (y3) chain was excised from the reducing SDS-PAGE gel described above and inj ected into a rabbit for antibody production. The resulting serum (rabbit 16) was evaluated by Western analysis and shown to react with the 170 kDa y3 chain, and showed minor crossreactivity with other laminin chains.
Using immunofluorescence, this antiserum shows localization of y3 to the following tissue areas: 1) sites of insertions of nerves into the dermal-epidermal junction basement membrane of human skin; 2) the inner nuclear layers, outer nuclear layers, and outer limiting membranes of human, mouse and rat neural retina; 3) the Purkinje cells, and molecular layers, and (perhaps) the glial cells of the mouse and rat cerebellum; 4) the neuromuscular junctions of skeletal muscle; and, S) the taste buds of the cow tongue.
The y3 was also shown to colocalize with protein ubiquitin carboxy terminal hydrolase I using antibody pGp 9.5. The y3 subunit also appears to colocalize with the a2 subunit in the same tissue sections.
Isolation and Sequencing of cDNA encodLng Q4 The initial 350 by fragment of human laminin (34 cDNA was amplified by touchdown RT-PCR from cultured human keratinocyte total RNA using nested primers made from the published chicken laminin (3 x 503 by cDNA sequence (as described in Ybot-Gonzalez et al. (1995)). Subsequent cDNA clones were isolated by nested PCR
directly from a human placenta cDNA library packaged in lambda-gtl 1 (Clontech, Palo Alto, CA) or by nested PCR directly from human placenta Marathon-Ready cDNA (Clontech, Palo Alto, CA).
The 5' end of the cDNA was cloned using the 5'-RACE technique from human placenta total RNA. The Expanded Long Template PCR System (Boehringer Mannheim Biochemicals, Indianapolis, Il~ was used for all PCR reactions. The PCR products were ligated into the pCR2.1 vector (Invitrogen, San Diego, CA) and recombinant plasmids purified for sequencing using the QIAprepT"" kit (Qiagen). The DNA sequence was determined using either the Sequenase version 2.0 DNA Sequencing Kit (Amersham) and 35S-dATP or the Thermo Sequenase Radiolabeled Terminator Cycle Sequencing kit (Amersham) and 33p_ ddNTPs. At least two independent cDNA subclones were sequenced to rule out Taq polymerase-generated nucleotide substitutions. In some cases, PCR product bands were sequenced directly by cycle sequencing after excision from a TAE-EtBr agarose gel and purification using QIAquick Gel Extraction kit (Qiagen).
WO 99/19348 PCT/US98l21391 The human cDNA encoding a long form (34, which is approximately 5.87 kb, encodes a protein having an estimated molecular weight of approximately 200 kDa and which is approximately 1761 amino acid residues in length. The human (34 protein retains the highest amino acid sequence identity with domains VI and V, which can be found, for example, from about amino acids 221-262 and about 263-535 of SEQ ID NO:1. In addition, a short form, splice variant of (34, which is approximately 3.84 kb and an estimated molecular weight of 120 kDa, was also isolated. The splice variant has 132 nucleotide sequence identical to the long form of [34, with the sequence diverging at nucleotide 3375 and spliced into a unique 3' I 5 untranslated region. The short form cDNA encodes a truncated (34 subunit which contains only the short arm of the (34 subunit and is missing the domains necessary for heterodimerization. The (34 amino acid sequence and the nucleotide sequence encoding human laminin (34 is shown in SEQ ID NO:1 and SEQ ID N0:2, respectively.
Northern analysis was performed using total RNA prepared from JAR cell, cultured human keratinocytes and human placenta using either Trizol (Gibco BRL, Bethesda, MD) or RNeasyT"" (Qiagen) which was denatured, separated on a formaldehyde agarose gel and blotted onto nitrocellulose according to standard protocols (Sambrook, et al., 1989). In addition, A human multiple tissue northern blot (Clontech, Palo Alto, CA) and Human Northern Terntory normal tissue blots and custom fetal skin northern blot (Invitrogen, San Diego, CA) were used. Hybridization and washing were performed using NorthernMAXT""
buffer system (Ambion) by manufacturer's recommended protocols. 32P-dCTP-labelled probes were generated from gel-purified restriction fragments using RediprimeT"' random primer labeling kit (Amersham). 32P-UTP-labelled antisense RNA probes were generated using the RNA transcription kit (Stratagene, La Jolla, CA) from cDNAs subcloned into Bluescript II KS+ (Stratagene, La Jolla, CA).
Northern blotting showed that human laminin ~i4 is expressed in JAR cells, derived from undeveloped chronic villi and in placenta. By RT-PCR, it is also expressed in cultured keratinocytes. Using a northern blot of human fetal skin developmental progression, (34 subunit (long form) demonstrates strong expression at week twelve of fetal development and persists until birth, but expression is barely detectable in adult skin. The ~i4 splice variant, however, is expressed in various tissues including adult heart, brain, lung, liver, skeletal muscle, kidney, spleen, stomach, esophagus, intestine, colon, uterus, bladder, adipose tissue and pancreas. Chromosomal mapping with a (34 cDNA probe indicates that the human (34 subunit is located at locus 7q22-q31.2. The gene encoding (31 is located near, but not on, this position of chromosome 7. Statistical analysis of the mapping data using markers for ~i 1 and (34 suggest that the gene encoding ~i 1 is linked to both ends of the gene encoding ~i4. In addition, neonatal cutis laxa with manifold phenotype has been mapped near, but not in the same position, as the gene encoding (34.
In situ hybridization to wounded human skin grafted into nude mice suggests that laminin ~3 x is expressed in the dermis underneath the migrating epidermal tongues during wound closure.
A GenBankTM search using the human nucleotide sequence encoding [34 as shown in SEQ ID N0:3 revealed an EST, which corresponds to nucleotides 4686-5870 of the human 10 nucleotide sequence encoding (34 depicted in SEQ ID N0:3. Alignment of cDNA
encoding ~i4 with the genes encoding human laminin X31 and laminin (32 shows 61 % and 59% sequence identity, respectively, as shown in Figure 5.
Production of a (~4 specific antibodjr and ti ue local'~a ion of j~
15 Antibodies were raised in rabbits against a 26 kDa bacterial fusion protein which corresponds to the 175 amino acid residues of domain VI (e.g., from about amino acid residues 221-262) of SEQ ID NO:1. Briefly the fusion protein was made by PCR
amplification of nucleotides 302-785 of the cDNA encoding ~i4 using adapter primers and cloned in-frame into the NdeI and SacII sites of pET-15b (Novagen). The fusion protein 20 construct was confirmed by restriction mapping and DNA sequencing.
Expression of the fusion protein was induced and separated from E. coli proteins using reducing SDS-PAGE.
Bands corresponding to the fusion protein were excised from the gel, equilibrated and homogenized using Freud's adjuvant. The same fusion protein was also western blotted on nitrocellulose, dissolved in DMSO and used to immunize mice for monoclonal antibody 25 production.
The polyclonal antisera raised in mice against the fusion protein reacted well with (34, as well as, (31 and (32 polypeptides.
The (34 subunit contains six domains, and a interruption and a signal peptide.
The signal peptide and domain VI can be found, for example, at about amino acid residues 1-262 of SEQ ID NO:1. Domain V can be found, for example, at about amino acid residues 263-535 of SEQ ID NO:1. Domains IV and III can be found, for example, at about amino acid residues 536-767 and 768-1178 of SEQ ID NO:1, respectively. Domain I can be found, for example, at about amino acid residues 1409-1761 of SEQ ID NO:I.
The (34 suburut (long form) is most similar in size and domain structure to laminin (31 with an amino acid sequence identity of 42.5%. (34 retains the highest levels of amino acid identity with the other laminin ~3 subunits in domains VI and V, and the lowest levels in domains I and II, as shown in Figure 6. Using the MulticoilT"" program, it was determined that only domains I and II of (34 have a high probability of forming coiled coil structures.
Domains I and II of ~i4 look most similar to human (33. Both [34 and (i3 are epithelial and the coiled coil structures in domains I and II dictate the a and y subunits with which the ~i subunits are associated. Thus, it is likely that /34 associates with a3 and y2, as does the laminin (33 subunit.
The cDNA encoding the splice variant of ~i4 contains only the short arm of the (34 subunit, and is missing the EGF repeat of domain III, as shown in Figure 5.
Thus, the (34 polypeptide encoded by the [34 c DNA splice variant is missing the coiled coil structures in domains I and II, rendering the short subunit unable to associate into a laminin heterotrimer.
PCR amplification of human genomic DNA suggest that the exon which encodes the alternative short form 3' untranslated region is located downstream from the carboxyl-most common exon, exon 23, and is splices out of the (34 subunit, long form, by exon skipping.
Analogs o~r3 and ~
Analogs can differ from naturally occurring y3 or [34 in amino acid sequence or in ways that do not involve sequence, or both. Non-sequence modifications include in vivo or in vitro chemical derivatization of y3 or ~i4. Non-sequence modifications include changes in acetylation, methylation, phosphorylation, carboxylation, or glycosylation.
Preferred analogs include y3 or (34 (or biologically active fiagments thereof) whose sequences differ from the wild-type sequence by one or more conservative amino acid substitutions or by one or more non-conservative amino acid substitutions, deletions, or insertions which do not abolish the Y3 or (34 biological activity.
Conservative substitutions typically include the substitution of one amino acid for another with similar characteristics, e.g., substitutions within the following groups: valine, glycine; glycine, alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid; asparagine, glutamine;
serine, threonine;
lysine, arginine; and phenylalanine, tyrosine. Other conservative substitutions can be taken from the table below.
CONSERVATIVE AMINO ACID REPLACEMENTS
For Amino Acid Code Replace with any of Alanine A D-Ala, Gly, beta-Ala, L-Cys, D-Cys Arginine R D-Arg, Lys, D-Lys, homo-Arg, D-homo-Arg, Met, Ile, D-Met, D-Ile, Orn, D-Orn Asparagine N D-Asn, Asp, D-Asp, Glu, D-Glu, Gln, D-Gln Aspartic Acid D D-Asp, D-Asn, Asn, Glu, D-Glu, Gln, D-Gln Cysteine C D-Cys, S-Me-Cys, Met, D-Met, Thr, D-Thr Glutamine Q D-Gln, Asn, D-Asn, Glu, D-Glu, Asp, D-Asp Glutamic Acid E D-GIu, D-Asp, Asp, Asn, D-Asn, Gln, D-Gln Glycine G Ala, D-Ala, Pro, D-Pro, (3-Ala, Acp ~i Isoleucine I D-Ile, VaI, D-Val, Leu, D-Leu, Met, D-Met I Leucine L D-Leu, Val, D-Val, Leu, D-Leu, Met, D-Met Lysine K D-Lys, Arg, D-Arg, homo-Arg, D-homo-Arg, Met, D-Met, Ile, D-Ile, Orn, D-Orn Methionine M D-Met, S-Me-Cys, Ile, D-Ile, Leu, D-Leu, Val, D-Val Phenylalanine F D-Phe, Tyr, D-Thr, L-Dopa, His, D-His, Trp, D-Trp, Trans-3,4, or 5-phenylproline, cis-3,4, or 5-phenylproline Proline P D-Pro, L-I-thioazolidine-4-carboxylic acid, D-or L-1-oxazolidine-4-carboxylic acid Serine S D-Ser, Thr, D-Thr, allo-Thr, Met, D-Met, Met(O), D-Met(O), L-Cys, D-Cys Threonine . T D-Thr, Ser, D-Ser, allo-Thr, Met, D-Met, Met(O), D-Met(O), Val, D-Val Tyrosine Y D-Tyr, Phe, D-Phe, L-Dopa, His, D-His Valine V D-Val, Leu, D-Leu, Ile, D-Ile, Met, D-Met Other analogs within the invention are those with modifications which increase peptide stability; such analogs may contain, for example, one or more non-peptide bonds (which replace the peptide bonds) in the peptide sequence. Also included are:
analogs that include residues other than naturally occurring L-amino acids, e.g., D-amino acids or non-naturally occurring or synthetic amino acids, e.g., [i or y amino acids; and cyclic analogs.
Geny Theranv zs The gene constructs of the invention can also be used as a part of a gene therapy protocol to deliver nucleic acids encoding either an agonistic or antagonistic form of a Y3 ar (3 4 polypeptide. The invention features expression vectors for in vivo transfection and expression of a y3 or ~i4 polypeptide in particular cell types so as to reconstitute the function of, or alternatively, antagonize the function of y3 or (i4 polypeptide in a cell in which that polypeptide is misexpressed. Expression constructs of y3 or (34 polypeptides, may be administered in any biologically effective carrier, e.g. any formulation or composition capable of effectively delivering the Y3 or ~i4 gene to cells in vivo.
Approaches include insertion of the subject gene in viral vectors including recombinant retroviruses, adenovirus, adeno-associated virus, and herpes simplex virus-l, or recombinant bacterial or eukaryotic plasmids. Viral vectors transfect cells directly; plasmid DNA can be delivered with the help of, for example, cationic liposomes (lipofectin) or derivatized (e.g. antibody conjugated), polylysine conjugates, gramacidin S, artificial viral envelopes or other such intracellular carriers, as well as direct injection of the gene construct or CaP04 precipitation carried out in vivo.
A preferred approach for in vivo introduction of nucleic acid into a cell is by use of a viral vector containing nucleic acid, e.g. a cDNA, encoding a y3 or (34 polypeptide. Infection of cells with a viral vector has the advantage that a large proportion of the targeted cells can receive the nucleic acid. Additionally, molecules encoded within the viral vector, e.g., by a cDNA contained in the viral vector, are expressed efficiently in cells which have taken up viral vector nucleic acid.
Retrovirus vectors and adeno-associated virus vectors can be used as a recombinant gene delivery system for the transfer of exogenous genes in vivo, particularly into humans.
These vectors provide efficient delivery of genes into cells, and the transferred nucleic acids are stably integrated into the chromosomal DNA of the host. The development of specialized cell lines (termed "packaging cells") which produce only replication-defective retroviruses has increased the utility of retroviruses for gene therapy, and defective retroviruses are characterized for use in gene transfer for gene therapy purposes (for a review see Miller, A.D.
( 1990) Blood 76:271 ). A replication defective retrovirus can be packaged into virions which can be used to infect a target cell through the use of a helper virus by standard techniques.
Protocols for producing recombinant retroviruses and for infecting cells in vitro or in vivo with such viruses can be found in Cu_r_rent Protocols in Molecul r ioloev, Ausubel, F.M. et aI. (eds.) Greene Publishing Associates, (1989), Sections 9.10-9.14 and other standard laboratory manuals. Examples of suitable retroviruses include pLJ, pZIP, pWE
and pEM
which are known to those skilled in the art. Examples of suitable packaging virus lines for preparing both ecotropic and amphotropic retroviral systems include yrCrip, y~Cre, yr2 and yr Am. Retroviruses have been used to introduce a variety of genes into many different cell types, including epithelial cells, in vitro and/or in vivo (see for example Eglitis, et al. (1985) Science 230:1395-1398; Danos and Mulligan (1988) Proc. Natl. Acad Sci. USA
85:6460-6464; Wilson et al. (1988) Proc. Natl. Acad. Sci. USA 85:3014-3018; Armentano et al. (1990) Proc. Natl. Acad. Sci. USA 87:6141-6145; Huber et al. ( 1991 ) Proc. Natl.
Acad Sci. USA
88:8039-8043; Ferry et al. (1991) Proc. Natl. Acad. Sci. USA 88:8377-8381;
Chowdhury et al. (1991) Science 254:1802-1805; van Beusechem et al. (1992) Proc. Natl.
Acad. Sci. USA
89:7640-7644; Kay et al. (1992) Human Gene Therapy 3:641-647; Dai et al.
(1992) Proc.
Natl. Acad Sci. USA 89:10892-10895; Hwu et al. (1993) J. Immunol. 150:4104-4115; U.S.
Patent No. 4,868,116; U.S. Patent No. 4,980,286; PCT Application WO 89/07136;
PCT
Application WO 89/02468; PCT Application WO 89/05345; and PCT Application WO
92/07573).
Another viral gene delivery system useful in the present invention utilizes adenovirus-derived vectors. The genome of an adenovirus can be manipulated such that it encodes and expresses a gene product of interest but is inactivated in terms of its ability to replicate in a normal lytic viral life cycle. See, for example, Berkner et al. (1988) BioTechnigues 6:616;
Rosenfeld et al. (1991) Science 252:431-434; and Rosenfeld et al. (1992) Cell 68:143-155.
Suitable adenoviral vectors derived from the adenovirus strain Ad type 5 d1324 or other strains of adenovirus (e.g., Ad2, Ad3, Ad7 etc.) are known to those skilled in the art.
Recombinant adenoviruses can be advantageous in certain circumstances in that they are not capable of infecting nondividing cells and can be used to infect a wide variety of cell types, including epithelial cells (Rosenfeld et al. (1992) cited supra). Furthermore, the virus particle is relatively stable and amenable to purification and concentration, and as above, can be modified so as to affect the spectrum of infectivity. Additionally, introduced adenoviral DNA (and foreign DNA contained therein) is not integrated into the genome of a host cell but remains episomal, thereby avoiding potential problems that can occur as a result of insertional mutagenesis in situations where introduced DNA becomes integrated into the host genome (e.g., retroviral DNA). Moreover, the carrying capacity of the adenoviral genome for foreign DNA is large (up to 8 kilobases) relative to other gene delivery vectors (Berkner et al. cited supra; Haj-Ahmand and Graham (1986) J. Virol. 57:267).
Yet another viral vector system useful for delivery of the subject gene is the adeno-associated virus (AAV). Adeno-associated virus is a naturally occurring defective virus that requires another virus, such as an adenovirus or a herpes virus, as a helper virus for efficient replication and a productive life cycle. (For a review see Muzyczka et al.
Curr. Topics in Micro. and Immunol. (1992) 158:97-129). It is also one of the few viruses that may integrate its DNA into non-dividing cells, and exhibits a high frequency of stable integration (see for example Flotte et al. (1992) Am. J. Respir. Cell. Mol. Biol. 7:349-356;
Samulski et al. (1989) J. Virol. 63:3822-3828; and McLaughlin et al. (1989) J. Virol. 62:1963-1973).
Vectors containing as little as 300 base pairs of AAV can be packaged and can integrate. Space for exogenous DNA is limited to about 4.5 kb. An AAV vector such as that described in Tratschin et al. (1985) Mol. Cell. Biol. 5:3251-3260 can be used to introduce DNA into cells.
A variety of nucleic acids have been introduced into different cell types using AAV vectors 5 (see for example Hermonat et al. ( 1984) Proc. Natl. Acad Sci. USA 81:6466-6470; Tratschin et al. (1985) Mol. Cell. Biol. 4:2072-2081; Wondisford et al. (1988) Mol.
Endocrinol. 2~32-39; Tratschin et al. (1984) J. Yirol. 51:611-619; and Flotte et al. (1993) J.
Biol. Chem.
268:3781-3790).
In addition to viral transfer methods, such as those illustrated above, non-viral 10 methods can also be employed to cause expression of a y3 or (34 polypeptide in the tissue of an animal. Most nonviral methods of gene transfer rely on normal mechanisms used by mammalian cells for the uptake and intracellular transport of macromolecules.
In preferred embodiments, non-viral gene delivery systems of the present invention rely on endocytic pathways for the uptake of the subject y3 or ~i4 gene by the targeted cell.
Exemplary gene 15 delivery systems of this type include liposomal derived systems, poly-lysine conjugates, and artificial viral envelopes.
In a representative embodiment, a gene encoding a y3 or (34 polypeptide can be entrapped in liposomes bearing positive charges on their surface (e.g., lipofectins) and (optionally) which are tagged with antibodies against cell surface antigens of the target tissue 20 (Mizuno et al. (1992) No Shinkei Geka 20:547-551; PCT publication W091/06309;
Japanese patent application 1047381; and European patent publication EP-A-43075).
In clinical settings, the gene delivery systems for the therapeutic y3 or ~i4 gene can be introduced into a patient by any of a number of methods, each of which is familiar in the art.
For instance, a pharmaceutical preparation of the gene delivery system can be introduced 25 systemically, e.g. by intravenous injection, and specific transduction of the protein in the target cells occurs predominantly from specificity of transfection provided by the gene delivery vehicle, cell-type or tissue-type expression due to the transcriptional regulatory sequences controlling expression of the receptor gene, or a combination thereof. In other embodiments, initial delivery of the recombinant gene is more limited with introduction into 30 the animal being quite localized. For example, the gene delivery vehicle can be introduced by catheter (see U.S. Patent 5,328,470) or by Stereotactic injection (e.g. Chen et al. (1994) PNAS
91: 3054-3057).
The pharmaceutical preparation of the gene therapy construct can consist essentially of the gene delivery system in an acceptable diluent, or can comprise a slow release matrix in which the gene delivery vehicle is imbedded. Alternatively, where the complete gene delivery system can be produced in tact from recombinant cells, e.g.
retroviral vectors, the pharmaceutical preparation can comprise one or more cells which produce the gene delivery system.
Transgenic nimal_s The invention includes transgenic animals which include cells (of that animal) which contain a y3 or (34 transgene and which preferably (though optionally) express (or misexpress) an endogenous or exogenous y3 or ~i4 gene in one or more cells in the animal.
The y3 or (34 transgene can encode the wild-type form of the protein, or can encode homologs thereof, including both agonists and antagonists, as well as antisense constructs: In preferred embodiments, the expression of the transgene is restricted to specific subsets of cells, or tissues utilizing, for example, cis-acting sequences that control expression in the desired pattern. Tissue-specific regulatory sequences and conditional regulatory sequences can be used to control expression of the transgene in certain spatial patterns, e.g., to restrict production to the milk or other secreted product of the animal.
Fragments of a protein can be produced in several ways, e.g., recombinantly, by proteolytic digestion, or by chemical synthesis. Internal or terminal fragments of a polypeptide can be generated by removing one or more nucleotides from one end (for a terminal fragment) or both ends (for an internal fragment) of a nucleic acid which encodes the polypeptide. Expression of the mutagenized DNA produces polypeptide fragments.
Digestion with "end-nibbling" endonucleases can thus generate DNA's which encode an array of fragments. DNA's which encode fragments of a protein can also be generated by random shearing, restriction digestion or a combination of the above-discussed methods.
Fragments can also be chemically synthesized using techniques known in the art such as conventional Merrifield solid phase f Moc or t-Boc chemistry. For example, peptides of the present invention may be arbitrarily divided into fragments of desired length with no overlap of the fragments, or divided into overlapping fragments of a desired length.
Amino acid sequence variants of a protein can be prepared by random mutagenesis of DNA which encodes a protein or a particular domain or region of a protein.
Useful methods include PCR mutagenesis and saturation mutagenesis. A library of random amino acid sequence variants can also be generated by the synthesis of a set of degenerate oligonucleotide sequences. (Methods for screening proteins in a library of variants are elsewhere herein.) PCR Mutagenesis In PCR mutagenesis, reduced Taq polymerise fidelity is used to introduce random mutations into a cloned fragment of DNA (Leung et al., 1989, Technique 1:11-15). This is a very powerful and relatively rapid method of introducing random mutations. The DNA
region to be mutagenized is amplified using the polymerise chain reaction (PCR) under conditions that reduce the fidelity of DNA synthesis by Taq DNA polymerise, e.g., by using a dGTP/dATP ratio of five and adding Mn2+ to the PCR reaction. The pool of amplified DNA fragments are inserted into appropriate cloning vectors to provide random mutant libraries.
Saturation Mutagenesis Saturation mutagenesis allows for the rapid introduction of a large number of single base substitutions into cloned DNA fragments (Mayers et al., 1985, Science 229:242). This technique includes generation of mutations, e.g., by chemical treatment or irradiation of single-stranded DNA in vitro, and synthesis of a complimentary DNA strand. The mutation frequency can be modulated by modulating the severity of the treatment, and essentially all possible base substitutions can be obtained. Because this procedure does not involve a genetic selection for mutant fragments both neutral substitutions, as well as those that alter function, are obtained. The distribution of point mutations is not biased toward conserved sequence elements.
Degenerate Oligonucleotides A library of homologs can also be generated from a set of degenerate oligonucleotide sequences. Chemical synthesis of a degenerate sequences can be carried out in an automatic DNA synthesizer, and the synthetic genes then ligated into an appropriate expression vector.
The synthesis of degenerate oligonucleotides is known in the art (see for example, Narang, SA (1983) Tetrahedron 39:3; Itakura et al. (1981) Recombinant DNA, Proc 3rd Cleveland Sympos. Macromolecules, ed. AG Walton, Amsterdam: Elsevier pp273-289; Itakura et al.
(1984) Annu. Rev. Biochem. 53:323; Itakura et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 11:477. Such techniques have been employed in the directed evolution of other proteins (see, for example, Scott et al. (1990) Science 249:386-390;
Roberts et al.
(1992) PNAS 89:2429-2433; Devlin et al. (1990) Science 249: 404-406; Cwirla et al. (1990) PNAS 87: 6378-6382; as well as U.S. Patents Nos. 5,223,409, 5,198,346, and 5,096,815).
Non-random or directed, mutagenesis techniques can be used to provide specific sequences or mutations in specific regions. These techniques can be used to create variants which include, e.g., deletions, insertions, or substitutions, of residues of the known amino acid sequence of a protein. The sites for mutation can be modified individually or in series, e.g., by (1) substituting first with conserved amino acids and then with more radical choices depending upon results achieved, (2) deleting the target residue, or (3) inserting residues of the same or a different class adjacent to the located site, or combinations of options 1-3.
Alanine Scanning Mutagenesis S Alanine scanning mutagenesis is a useful method for identification of certain residues or regions of the desired protein that are preferred locations or domains for mutagenesis,-Cunningham and Wells (Science 244:1081-1085, 1989}. In alanine scanning, a residue or group of target residues are identified (e.g., charged residues such as Arg, Asp, His, Lys, and Glu) and replaced by a neutral or negatively charged amino acid (most preferably alanine or polyalanine). Replacement of an amino acid can affect the interaction of the amino acids with the surrounding aqueous environment in or outside the cell. Those domains demonstrating functional sensitivity to the substitutions are then refined by introducing further or other variants at or for the sites of substitution. Thus, while the site for introducing an amino acid sequence variation is predetermined, the nature of the mutation per se need not be predetermined. For example, to optimize the performance of a mutation at a given site, alanine scanning or random mutagenesis may be conducted at the target codon or region and the expressed desired protein subunit variants are screened for the optimal combination of desired activity.
Oligonucleotide-Mediated Mutagenesis Oligonucleotide-mediated mutagenesis is a useful method for preparing substitution, deletion, and insertion variants of DNA, see, e.g., Adehnan et al., (DNA
2:183, 1983).
Briefly, the desired DNA is altered by hybridizing an oligonucleotide encoding a mutation to a DNA template, where the template is the single-stranded form of a plasmid or bacteriophage containing the unaltered or native DNA sequence of the desired protein. After hybridization, a DNA polymerase is used to synthesize an entire second complementary strand of the template that will thus incorporate the oligonucleotide primer, and will code for the selected alteration in the desired protein DNA. Generally, oligonucleotides of at least 25 nucleotides in length are used. An optimal oligonucleotide will have 12 to 15 nucleotides that are completely complementary to the template on either side of the nucleotides) coding for the mutation. This ensures that the oligonucleotide will hybridize properly to the single-stranded DNA template molecule. The oligonucleotides are readily synthesized using techniques known in the art such as that described by Crea et al. (Proc. Natl.
Acad. Sci. USA, 75: 5765[1978]).
Cassette Mutagenesis Another method for preparing variants, cassette mutagenesis, is based on the technique described by Wells et al. (Gene, 34:315[1985]). The starting material is a plasmid (or other vector) which includes the protein subunit DNA to be mutated. The codon(s) in the protein subunit DNA to be mutated are identified. There must be a unique restriction endonuclease site on each side of the identified mutation site(s). If no such restriction sites exist, they may be generated using the above-described oligonucleotide-mediated mutagenesis method to introduce them at appropriate locations in the desired protein subunit DNA. After the restriction sites have been introduced into the plasmid, the plasmid is cut at these sites to linearize it. A double-stranded oligonucleotide encoding the sequence of the DNA between the restriction sites but containing the desired mutations) is synthesized using standard procedures. The two strands are synthesized separately and then hybridized together using standard techniques. This double-stranded oligonucleotide is referred to as the cassette.
This cassette is designed to have 3' and 5' ends that are comparable with the ends of the linearized plasmid, such that it can be directly ligated to the plasmid. This plasmid now contains the mutated desired protein subunit DNA sequence.
Combinatorial Mutagenesis Combinatorial mutagenesis can also be used to generate mutants. E.g., the amino acid sequences for a group of homologs or other related proteins are aligned, preferably to promote the highest homology possible. All of the amino acids which appear at a given position of the aligned sequences can be selected to create a degenerate set of combinatorial sequences. The variegated library of variants is generated by combinatorial mutagenesis at the nucleic acid level, and is encoded by a variegated gene library. For example, a mixture of synthetic oligonucleotides can be enzymatically ligated into gene sequences such that the degenerate set of potential sequences are expressible as individual peptides, or alternatively, as a set of larger fusion proteins containing the set of degenerate sequences.
Primalv High-Through-Put Me hod for greening Libraries of Peg it de Fra n . or Various techniques are known in the art for screening generated mutant gene products.
Techniques for screening large gene libraries often include cloning the gene library into replicable expression vectors, transforming appropriate cells with the resulting library of vectors, and expressing the genes under conditions in which detection of a desired activity, e.g., in this case, binding to other laminin subunits, assembly into a trimeric laminin molecules, binding to natural ligands or substrates, facilitates relatively easy isolation of the vector encoding the gene whose product was detected. Each of the techniques described below is amenable to high through-put analysis for screening large numbers of sequences created, e.g., by random mutagenesis techniques.
Two Hybrid Systems Two hybrid assays such as the system described above (as with the other screening methods described herein), can be used to identify fragments or analogs. These may include agonists, superagonists, and antagonists. (The subject protein and a protein it interacts with are used as the bait protein and fish proteins.) .
Display Libraries 5 In one approach to screening assays, the candidate peptides are displayed on the surface of a cell or viral particle, and the ability of particular cells or viral particles to bind an appropriate receptor protein via the displayed product is detected in a "panning assay". For example, the gene library can be cloned into the gene for a surface membrane protein of a bacterial cell, and the resulting fusion protein detected by panning (Ladner et al., WO
10 88/06630; Fuchs et al. (1991) BiolTechnology 9:1370-1371; and Goward et al.
(1992) TIBS
18:136-140). In a similar fashion, a detectably labeled ligand can be used to score for potentially functional peptide homologs. Fluorescently labeled ligands, e.g., receptors, can be used to detect homolog which retain ligand-binding activity. The use of fluorescently labeled ligands, allows cells to be visually inspected and separated under a fluorescence microscope, 15 or, where the morphology of the cell permits, to be separated by a fluorescence-activated cell sorter.
A gene library can be expressed as a fusion protein on the surface of a viral particle.
For instance, in the filamentous phage system, foreign peptide sequences can be expressed on the surface of infectious phage, thereby conferring two significant benefits.
First, since these 20 phage can be applied to affinity matrices at concentrations well over 1013 phage per milliliter, a large number of phage can be screened at one time. Second, since each infectious phage displays a gene product on its surface, if a particular phage is recovered from an affinity matrix in low yield, the phage can be amplified by another round of infection.
The group of almost identical E. coli filamentous phages M13, fd., and fl are most often used in phage 25 display libraries. Either of the phage gIII or gVIII coat proteins can be used to generate fusion proteins without disrupting the ultimate packaging of the viral particle. Foreign epitopes can be expressed at the NH2-terminal end of pIII and phage bearing such epitopes recovered from a large excess of phage lacking this epitope (Ladner et al. PCT
publication WO 90/02909; Garrard et al., PCT publication WO 92/09690; Marks et al. (1992) J. Biol.
30 Chem. 267:16007-16010; Griffiths et al. (1993} EMBO J 12:725-734; Clackson et al. (1991) Nature 352:624-628; and Barbas et al. (1992) PNAS 89:4457-4461).
A common approach uses the maltose receptor of E. coli (the outer membrane protein, Lama) as a peptide fusion partner (Charbit et al. (1986) EMBO 5, 3029-3037).
Oligonucleotides have been inserted into plasmids encoding the Lama gene to produce 35 peptides fused into one of the extracellular loops of the protein. These peptides are available for binding to ligands, e.g., to antibodies, and can elicit an immune response when the cells are administered to animals. Other cell surface proteins, e.g., OmpA (Schorr et al. (1991) Vaccines 91, pp. 387-392), PhoE (Agterberg, et al. {1990) Gene 88, 37-45}, and PAL (Fuchs et al. (1991) BiolTech 9, 1369-1372), as well as large bacterial surface structures have served as vehicles for peptide display. Peptides can be fused to pilin, a protein which polymerizes to form the pilus-a conduit for interbacterial exchange of genetic information (Thiry et al.
(1989) Appl. Environ. Microbiol. 55, 984-993). Because of its role in interacting with other cells, the pilus provides a useful support for the presentation of peptides to the extracellular environment. Another large surface structure used for peptide display is the bacterial motive organ, the flagellum. Fusion of peptides to the subunit protein flagellin offers a dense array of may peptides copies on the host cells (Kuwajima et al. (1988) BiolTech. 6, 1080-1083).
Surface proteins of other bacterial species have also served as peptide fusion partners.
Examples include the Staphylococcus protein A and the outer membrane protease IgA of Neisseria (Hansson et al. (1992) J. Bacteriol. 174, 4239-4245 and Klauser et al. (1990) EMBO J. 9, 1991-1999}.
In the filamentous phage systems and the Lama system described above, the physical link between the peptide and its encoding DNA occurs by the containment of the DNA within a particle (cell or phage) that carries the peptide on its surface. Capturing the peptide captures the particle and the DNA within. An alternative scheme uses the DNA-binding protein LacI
to form a link between peptide and DNA (Cull et al. (1992) PNAS USA 89:1865-1869). This system uses a plasmid containing the LacI gene with an oligonucleotide cloning site at its 3'-end. Under the controlled induction by arabinose, a LacI-peptide fusion protein is produced.
This fusion retains the natural ability of LacI to bind to a short DNA
sequence known as LacO operator (LacO). By installing two copies of LacO on the expression plasmid, the LacI-peptide fusion binds tightly to the plasmid that encoded it. Because the plasmids in each cell contain only a single oligonucleotide sequence and each cell expresses only a single peptide sequence, the peptides become specifically and stably associated with the DNA
sequence that directed its synthesis. The cells of the library are gently lysed and the peptide-DNA complexes are exposed to a matrix of immobilized receptor to recover the complexes containing active peptides. The associated plasmid DNA is then reintroduced into cells for amplification and DNA sequencing to determine the identity of the peptide ligands. As a demonstration of the practical utility of the method, a large random library of dodecapeptides was made and selected on a monoclonal antibody raised against the opioid peptide dynorphin B. A cohort of peptides was recovered, all related by a consensus sequence corresponding to a six-residue portion of dynorphin B. (Cull et al. (1992) Proc. Natl. Aced.
Sci. U.S.A. 89-1869) This scheme, sometimes referred to as peptides-on-plasmids, differs in two important ways from the phage display methods. First, the peptides are attached to the C-terminus of the fusion protein, resulting in the display of the library members as peptides having free carboxy termini. Both of the filamentous phage coat proteins, pIII and pVIII, are anchored to the phage through their C-termini, and the guest peptides are placed into the outward-extending N-terminal domains. In some designs, the phage-displayed peptides are presented right at the amino terminus of the fusion protein. (Cwirla, et al. (1990) Proc. Natl. Aced. Sci.
U.S A. 87, 6378-6382) A second difference is the set of biological biases affecting the population of peptides actually present in the libraries. The LacI fusion molecules are confined to the cytoplasm of the host cells. The phage coat fusions are exposed briefly to the cytoplasm during translation but are rapidly secreted through the inner membrane into the periplasmic compartment, remaining anchored in the membrane by their C-terminal hydrophobic domains, with the N-termini, containing the peptides, protruding into the periplasm while awaiting assembly into phage particles. The peptides in the LacI and phage libraries may differ significantly as a result of their exposure to different proteoLytic activities.
The phage coat proteins require transport across the inner membrane and signal peptidase processing as a prelude to incorporation into phage. Certain peptides exert a deleterious effect on these processes and are underrepresented in the libraries (Gallop et al. (1994) J.
Med. Chem. 37(9):1233-1251). These particular biases are not a factor in the LacI display system.
The number of small peptides available in recombinant random libraries is enormous.
Libraries of 10~-109 independent clones are routinely prepared. Libraries as large as 1011 recombinants have been created, but this size approaches the practical limit for clone libraries. This limitation in Library size occurs at the step of transforming the DNA
containing randomized segments into the host bacterial cells. To circumvent this limitation, an in vitro system based on the display of nascent peptides in polysome complexes has recently been developed. This display library method has the potential of producing libraries 3-6 orders of magnitude larger than the currently available phage/phagemid or plasmid Libraries. Furthermore, the construction of the libraries, expression of the peptides, and screening, is done in an entirely cell-free format.
In one application of this method (Gallop et a1. (1994) J. Med Chem.
37(9):1233-1251 ), a molecular DNA library encoding 1012 decapeptides was constructed and the library expressed in an E. coli S30 in vitro coupled transcription/translation system.
Conditions were chosen to stall the ribosomes on the mRNA, causing the accumulation of a substantial proportion of the RNA in polysomes and yielding complexes containing nascent peptides still linked to their encoding RNA. The polysomes are sufficiently robust to be affinity purified on immobilized receptors in much the same way as the more conventional recombinant peptide display Libraries are screened. RNA from the bound complexes is recovered, converted to cDNA, and amplified by PCR to produce a template for the next round of synthesis and screening. The polysome display method can be coupled to the phage display system. Following several rounds of screening, cDNA from the enriched pool of polysomes was cloned into a phagemid vector. This vector serves as both a peptide expression vector, displaying peptides fused to the coat proteins, and as a DNA sequencing vector for peptide identification. By expressing the polysome-derived peptides on phage, one can either continue the affinity selection procedure in this format or assay the peptides on individual clones for binding activity in a phage ELISA, or for binding specificity in a completion phage ELISA (Barnet, et al. (1992) Anal. Biochem 204,357-364). To identify the sequences of the active peptides one sequences the DNA produced by the phagemid host.
The high through-put assays described above can be followed by secondary screens in order to identify further biological activities which will, e.g., allow one skilled in the art-to differentiate agonists from antagonists. The type of a secondary screen used will depend on the desired activity that needs to be tested. For example, an assay can be developed in which the ability to inhibit an interaction between a protein of interest and its respective ligand can be used to identify antagonists from a group of peptide fragments isolated though one of the primary screens described above.
Therefore, methods for generating fragments and analogs and testing them for activity are known in the art. Once the core sequence of interest is identified, it is routine to perform for one skilled in the art to obtain analogs and fragments.
P~ntide Mimetics The invention also provides for reduction of the protein binding domains of the subject y3 or (34 polypeptides to generate mimetics, e.g. peptide or non-peptide agents. See, for example, "Peptide inhibitors of human papillomavirus protein binding to retinoblastoma gene protein" European patent applications EP-412,762A and EP-B31,080A.
Non-hydrolyzable peptide analogs of critical residues can be generated using benzodiazepine (e.g., see Freidinger et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), azepine (e.g., see Huffinan et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), substituted gama lactam rings (Garvey et al. in Peptides:
Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), keto-methylene pseudopeptides (Ewenson et al. (1986) JMed Chem 29:295; and Ewenson et al. in Peptides:
Structure and Function (Proceedings of the 9th American Peptide Symposium) Pierce Chemical Co. Rockland, IL, 1985), (3-turn dipeptide cores (Nagai et al. (1985) Tetrahedron Lett 26:647; and Sato et al. ( 1986) J Chem Soc Perkin Trans 1:1231 ), and (3-aminoalcohols (Gordon et al. (1985) Biochem Biophys Res Commun126:419; and Dann et al.
(1986) Biochem Biophys Res Commun 134:71).
~~ntibodies The invention also includes antibodies specifically reactive with a subject y3 or (34 polypeptides. Anti-protein/anti-peptide antisera or monoclonal antibodies can be made by standard protocols (See, for example, Antibodies: A Laboratory Manual ed. by Harlow and Lane (Cold Spring Harbor Press: 1988)).
Antibodies which specifically bind y3 or (34 epitopes can also be used in immunohistochemical staining of tissue samples in order to evaluate the abundance and pattern of expression of y3 or [34. Anti y3 or (34 antibodies can be used diagnostically in S immuno-precipitation and immuno-blotting to detect and evaluate y3 or ~i4 levels in tissue or bodily fluid as part of a clinical testing procedure.
Another application of antibodies of the present invention is in the immunological screening of cDNA libraries constructed in expression vectors such as ~,gtl l, ~.gtl8-23, 7v.ZAP, and ~,ORF8. Messenger libraries of this type, having coding sequences inserted in the correct reading frame and orientation, can produce fusion proteins. For instance, ~,gtl l will produce fusion proteins whose amino termini consist of 13-galactosidase amino acid sequences and whose carboxy tenmini consist of a foreign polypeptide. Antigenic epitopes of a subject polypeptide can then be detected with antibodies, as, for example, reacting nitrocellulose filters lifted from infected plates with antibodies of the invention. Phage, scored by this assay, can then be isolated from the infected plate. Thus, the presence of homologs can be detected and cloned from other animals, and alternate isoforms (including splicing variants) can be detected and cloned from human sources.
Other Embodiments Included in the invention are: allelic variations; natural mutants; induced mutants;
proteins encoded by DNA that hybridizes under high or low stringency conditions to a nucleic acid which encodes a polypeptide of SEQ ID NO:1 or SEQ ID N0:3 (for definitions of high and low stringency see Current Protocols in Molecular Biology, John Wiley & Sons, New York, 1989, 6.3.1 - 6.3.6, hereby incorporated by reference); and, polypeptides specifically bound by antisera to Y3 or ~i4.
Nucleic acids and polypeptides of the invention includes those that differ from the sequences discolosed herein by virtue of sequencing errors in the disclosed sequences.
The invention also includes fragments, preferably biologically active fragments, or analogs of y3 or ~i4. A biologically active fragment or analog is one having any in vivo or in vitro activity which is characteristic of they3 or (34 shown in SEQ ID N0:3 and SEQ ID
NO:1, respectively, or of other naturally occurring y3 or (34, e.g., one or more of the biological activities described above. Especially preferred are fragments which exist in vivo, e.g., fragments which arise from post transcriptional processing or which arise from translation of alternatively spliced RNA's. Fragments include those expressed in native or endogenous cells, e.g., as a result of post-translational processing, e.g., as the result of the removal of an amino-terminal signal sequence, as well as those made in expression systems, e.g., in CHO cells. Particularly preferred fragments are fragments, e.g., active fragments, which are generated by proteolytic cleavage or alternative splicing events.
Other embodiments are within the following claims.
What is claimed is:
WO 99/19348 PCT/US98lZ1391 S SEQUENCE LISTING
(1) GENERAL INFORMATION:
ld (i) APPLICANT: Burgeson, Robert, et al.
(ii) TITLE OF INVENTION: DNA Sequences (iii) NUMBER OF SEQUENCES: 6 (iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: LAHIVE & COCKFIELD
(B) STREET: 28 State Street (C) CITY: Boston 20 (D) STATE: Massachusetts (E) COUNTRY: USA
(F) ZIP: 02109 (v) COMPUTER READABLE FORM:
ZS (A) MEDIUM TYPE: Floppy disk (B) COMPUTER: IBM PC compatible (C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 3O (vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: US 000000 (B) FILING DATE: 13-APR-1994 (C) CLASSIFICATION:
3S (vii) PRIOR
APPLICATION
DATA:
(A) APPLICATION NUMBER: US 08/111,111 (B) FILING DATE: 12-DEC-1909 (viii) ATTORNEY/AGENT
INFORMATION:
40 (A) NAME: Attorney, Name Init (B} REGISTRATION NUMBER: 000000 (C) REFERENCE/DOCKET NUMBER:
oe (ix) TELECOMMUNICATION
INFORMATION:
4S (A) TELEPHONE: (617)227-7400 (B) TELEFAX: (617)742-4214 (2) INFORMATION FOR SEQ ID NO:1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1761 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear SS
(ii) MOLECULE TYPE: peptide (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID N0:1:
$ Met Gln Phe Gln Leu Thr Leu Phe Leu His Leu Gly Trp Leu Ser Tyr Ser Lys Ala Gln Asp Asp Cys Asn Arg Gly Ala Cys His Pro Thr Thr Gly Asp Leu Leu Val Gly Arg Asn Thr Gln Leu Met Ala Ser Ser Thr Cys Gly Leu Ser Arg Ala Gln Lys Tyr Cys Ile Leu Ser Tyr Leu Glu 1$ 50 55 60 Gly Glu Gln Lys Cys Ser Ile Cys Asp Ser Arg Phe Pro Tyr Aap Pro Tyr Asp Gln Pro Asn Ser His Thr Ile Glu Asn Val Thr Val Ser Phe G1u Pro Asp Arg Glu Lys Lys Trp Trp Gln Ser Glu Asn Gly Leu Asp 2$
His Val Ser Ile Arg Leu Asp Leu Glu Ala Leu Phe Arg Phe Ser His Leu Ile Leu Thr Phe Lys Thr Phe Arg Pro Ala Ala Met Leu Val Glu Arg Ser Thr Asp Tyr Gly His Asn Trp Lys Val Phe Lys Tyr Phe Ala 3$ Lys Asp Cys Ala Thr Ser Phe Pro Asn Ile Thr Ser Gly Gln Ala Gln Gly Val Gly Asp Ile Val Cys Asp Ser Lys Tyr Ser Asp Ile Glu Pro Ser Thr Gly Gly Glu Val Val Leu Lys Val Leu Asp Pro Ser Phe Glu Ile Glu Asn Pro Tyr Ser Pro Tyr Ile Gln Asp Leu Val Thr Leu Thr 4$ 210 215 220 Asn Leu Arg Ile Asn Phe Thr Lys Leu His Thr Leu Gly Asp Ala Leu $0 Leu Gly Arg Arg Gln Asn Asp Ser Leu Asp Lys Tyr Tyr Tyr Ala Leu Tyr Glu Met Ile Val Arg Gly Ser Cys Phe Cys Asn Gly His Ala Ser $$
Glu Cys Arg Pro Met Gln Lys Met Arg Gly Asp Val Phe Ser Pro Pro Gly Met Val His Gly Gln Cys Val Cys Gln His Asn Thr Asp Gly Pro Asn Cys Glu Arg Cys Lys Asp Phe Phe Gln Asp Ala Pro Trp Arg Pro Ala Ala Asp Leu Gln Asp Asn Ala Cys Arg Ser Cys Ser Cys Asn.Ser His Ser Ser Arg Cys His Phe Aap Met Thr Thr Tyr Leu Ala Ser Gly Gly Leu Ser Gly Gly Val Cys Glu Asp Cys Gln His Asn Thr Glu Gly 1$ Gln His Cys Asp Arg Cys Arg Pro Leu Phe Tyr Arg Asp Pro Leu Lys Thr Ile Ser Asp Pro Tyr Ala Cys Ile Pro Cys Glu Cys Asp Pro Asp Gly Thr Ile Ser Gly Gly Ile Cys Val Ser His Ser Asp Pro Ala Leu Gly Ser Val Ala Gly Gln Cys Leu Cys Lys Glu Asn Val Glu Gly Ala 2$ 420 425 430 Lys Cys Aap Gln Cys Lys Pro Asn His Tyr Gly Leu Ser Ala Thr Asp Pro Leu Gly Cys Gln Pro Cys Asp Cys Asn Pro Leu Gly Ser Leu Pro Phe Leu Thr Cys Aep Val Asp Thr Gly Gln Cys Leu Cys Leu Ser Tyr Val Thr Gly Ala His Cys Glu Glu Cys Thr Val Gly Tyr Trp Gly Leu Gly Asn His Leu His Gly Cys Ser Pro Cys Asp Cys Asp Ile Gly Gly Ala Tyr Ser Asn Val Cys Ser Pro Lys Asn Gly Gln Cys Glu Cys Arg 4$ Pro His Val Thr Gly Arg Ser Cys Ser Glu Pro Ala Pro Gly Tyr Phe $d Phe Ala Pro Leu Asn Phe Tyr Leu Tyr Glu Ala Glu Glu Ala Thr Thr Leu Gln Gly Leu Ala Pro Leu Gly Ser Glu Thr Phe Gly Gln Ser Pro Ala Val His Val Val Leu Gly Glu Pro Val Pro Gly Asn Pro Val Thr $$ 580 585 590 Trp Thr Gly Pro Gly Phe Ala Arg Val Leu Pro Gly Ala Gly Leu Arg 60 Phe Ala Val Asn Asn Ile Pro Phe Pro Val Asp Phe Thr Ile Ala Ile His Tyr Glu Thr Gln Ser Ala Ala Asp Trp Thr Val Gln Ile Val Val $ 625 630 635 640 Asn ProProGlyGly SerGluHis CysIlePro LysThrLeu Gln.Ser w Lys ProGlnSerPhe AlaLeuPro AlaAlaThr ArgIleMet LeuLeu Pro ThrProIleCys LeuGluPro AspValGln TyrSerIle AspVal 1$
Tyr PheSerGlnPro LeuGlnGly GluSerHis AlaHisSer HisVal Leu ValAspSerLeu GlyLeuIle ProGlnIle AsnSerLeu GluAsn Phe CysSerLysGln AspLeuAsp GluTyrGln LeuHisAsn CysVal 2$ Glu IleAlaSerAla MetGlyPro GlnValLeu ProGlyAla CysGlu Arg LeuIleIleSer MetSerAla LysLeuHis AspGlyAla ValAla Cys LysCysHisPro GlnGlySer ValGlySer SerCysSer ArgLeu Gly GlyGlnCysGln CysLysPro LeuValVal GlyArgCys CysAsp 3$ 785 790 795 800 Arg CysSerThrGly SerTyrAsp LeuGlyHis HisGlyCys HisPro Cys HisCysHisPro GlnGlySer LysAspThr Va1CysAsp GlnVal Thr GlyGlnCysPro CysHisGly GluValSer GlyArgArg CysAsp 4$
Arg CysLeuAlaGly TyrPheGly PheProSer CysHisPro CysPro Cys AsnArgPheAla GluLeuCys AspProGlu ThrGlySer CysPhe $0 865 870 875 880 Asn CysGlyGlyPhe ThrThrGly ArgAenCys GluArgCys IleAsp $$ Gly TyrTyrGlyAsn ProSerSer GlyGlnPro CysArgPro CysLeu Cys ProAspAspPro SerSerAsn GlnTyrPhe AlaHisSer CysTyr Gln AsnLeuTrpSer SerAspVal IleCysAsn CysLeuGln GlyTyr $ Thr Gly Thr Gln Cys Gly Glu Cys Ser Thr Gly Phe Tyr Gly Asn Pro Arg Ile Ser Gly Ala Pro Cys Gln Pro Cys Ala Cys Asn Asn Asn Ile Asp Val Thr Asp Pro Glu Ser Cys Ser Arg Val Thr Gly Glu Cys Leu Arg Cys Leu His Asn Thr Gln Gly Ala Asn Cys Gln Leu Cys Lys Pro 1$ 995 1000 1005 Gly His Tyr Gly Ser Ala Leu Asn Gln Thr Cys Arg Arg Cys Ser Cys His Ala Ser Gly Val Ser Pro Met Glu Cys Pro Pro Gly Gly Gly Ala Cys Leu Cys Asp Pro Val Thr Gly Ala Cys Pro Cys Leu Pro Asn Val 2$
Thr Gly Leu Ala Cys Asp Arg Cys Ala Asp Gly Tyr Trp Asn Leu Val Pro Gly Arg Gly Cys Gln Ser Cys Asp Cys Asp Pro Arg Thr Ser Gln Ser Ser His Cys Asp Gln Leu Thr Gly Gln Cys Pro Cys Lys Leu Gly 3$ Tyr Gly Gly Lys Arg Cys Ser Glu Cys Gln Glu Asn Tyr Tyr Gly Asp Pro Pro Gly Arg Cys Ile Pro Cys Asp Cys Asn Arg Ala Gly Thr Gln Lys Pro Ile Cys Asp Pro Asp Thr Gly Met Cys Arg Cys Arg Glu Gly Val Ser Gly Gln Arg Cys Asp Arg Cys Ala Arg Gly His Ser Gln Glu 4$ 1155 1160 1165 Phe Pro Thr Cys Leu Gln Cys His Leu Cys Phe Asp Gln Trp Asp His $0 Thr Ile Ser Ser Leu Ser Lys Ala Val Gln Gly Leu Met Arg Leu Ala Ala Asn Met Glu Asp Lys Arg Glu Thr Leu Pro Val Cys Glu Ala Asp $$
Phe Lys Asp Leu Arg Gly Asn Val Ser Glu Ile Glu Arg Ile Leu Lys His Pro Val Phe Pro Ser Gly Lys Phe Leu Lys Val Lys Asp Tyr His Asp Ser Val Arg Arg Gln Ile Met Gln Leu Asn Glu Gln Leu Lys Ala WO 99/19348 PG"T/US98/21391 Val Tyr Glu Phe Gln Asp Leu Lys Asp Thr Ile Glu Arg Ala Lys Asn Glu Ala Asp Leu Leu Leu Glu Asp Leu Gln Glu Glu Ile Asp Leu Gln Ser Ser Val Leu Asn Ala Ser Ile Ala Asp Ser Ser Glu Asn Ile Lys 1S Lys Tyr Tyr His Ile Ser Ser Ser Ala Glu Lys Lys Ile Asn Glu Thr Ser Ser Thr Ile Asn Thr Ser Ala Asn Thr Arg Asn Asp Leu Leu Thr Ile Leu Asp Thr Leu Thr Ser Lys Gly Asn Leu Ser Leu Glu Arg Leu Lys Gln Ile Lys Ile Pro Asp Ile Gln Ile Leu Asn Glu Lys Val Cys 2$ 1365 1370 1375 Gly Asp Pro Gly Asn Val Pro Cys Val Pro Leu Pro Cys Gly Gly Ala Leu Cys Thr Gly Arg Lys Gly His Arg Lys Cys Arg Gly Pro Gly Cys His Gly Ser Leu Thr Leu Ser Thr Asn Ala Leu Gln Lys Ala Gln Glu Ala Lys Ser Ile Ile Arg Asn Leu Asp Lys Gln Val Arg Gly Leu Lys Asn Gln Ile Glu Ser Ile Ser Glu Gln Ala Glu Val Ser Lys Asn Asn Ala Leu Gln Leu Arg Glu Lys Leu Gly Asn Ile Arg Asn Gln Ser Asp Ser Glu Glu Glu Asn Ile Asn Leu Phe Ile Lys Lys Val Lys Asn Phe Leu Leu Glu Glu Asn Val Pro Pro Glu Asp Ile Glu Lys Val Ala Asn Gly Val Leu Asp Ile His Leu Pro Ile Pro Ser Gln Asn Leu Thr Asp Glu Leu Val Lys Ile Gln Lys His Met Gln Leu Cys Glu Aap Tyr Arg Thr Asp Glu Asn Arg Ser Asn Glu Glu Ala Asp Gly Ala Gln Lys Leu Leu Val Lys Ala Lys Ala Ala Glu Lys Ala Ala Asn Ile Leu Leu Asn Leu Aap Lys Thr Leu Asn Gln Leu Gln Gln Ala Gln Ile Thr Gln Gly Arg Ala AsnSerThr IleThr LeuThr AlaAsnIle ThrLysIle Gln Lys Lys AsnValLeu GlnAla AsnGln ThrArgGlu MetLysSer Glu Glu Leu GluLeuAla LysGln SerGly LeuGluAsp GlyLeuSer Arg Leu Leu GlnThrLys LeuGln HisGln AspHisAla ValAanAla Arg Lys Val GlnAlaGlu SerAla HisGln AlaGlySer LeuGluLys Gln G1u Phe ValGluLeu LysLys TyrAla IleLeuGln ArgLysThr Gln ZS Ser Thr ThrGlyLeu ThrLye ThrLeu GlyLysVal LysGlnLeu Glu Lys Asp AlaAlaGlu LysLeu GlyAsp ThrGluAla LysIleArg Ala Arg Ile ThrAspLeu GluArg IleGln AspLeuAen LeuSerArg Lys Gln Ala LysAlaAsp GlnLeu IleLeu GluAspGln ValValAla Arg Ile Lys AsnGluIle ValGlu GluLys LysTyrAla ArgCysTyr Gln Ser (2) INFORMATION FOR SEQ ID N0:2:
4S (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 5874 base pairs (B} TYPE: nucleic acid (C} STRANDEDNESS: single (D) TOPOLOGY: linear SO
(ii) MOLECULE TYPE: cDNA
SS (xi} SEQUENCE DESCRIPTION: SEQ ID N0:2:
S
SS
E)OCTGGACCTGGATTTGCCAGGGTTCTCCCTGGGGCTGGCTTGAGATTTGCTGTCAACAACA1920 .
$$
SS
E)OCTGGACCTGGATTTGCCAGGGTTCTCCCTGGGGCTGGCTTGAGATTTGCTGTCAACAACA1920 .
$$
(2) INFORMATION FOR SEQ ID N0:3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1524 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (v) FRAGMENT TYPE: internal 3O (xi) SEQUENCE
DESCRIPTION:
SEQ
ID
N0:3:
Ala AlaGlyAla GlyAlaHis CysGlnArg.CysAspAla AlaAspPro 35 Gln ArgHisHis AsnAlaSer TyrLeuThr AspPheHis SerGlnAsp Glu SerThrTrp TrpGlnSer ProSerMet AlaPheGly ValGlnTyr Pro ThrSerVal AsnIleThr LeuXaaArg LeuGlyLys AlaTyrGlu Ile ThrTyrVal ArgLeuLys PheHisThr SerArgPro GluSerPhe Ala IleTyrLys ArgSerArg AlaAspGly ProTrpGlu ProTyrGln S0 Phe TyrSerAla SerCysGln LysThrTyr GlyArgPro GluGlyGln Tyr LeuArgPro GlyGluAsp GluArgVal AlaPheCys ThrSerGlu SS
Phe SerAspIle SerProLeu SerGlyGly AsnValAla PheSerThr Leu GluGlyArg ProSerAla TyrAsnPhe GluGluSer ProGlyLeu Gln GluTrpVal ThrSerThr GluLeuLeu IleSerLeu AspArgLeu Asn Thr Phe Gly Asp Asp Ile Phe Lys Asp Pro Lys Val Leu Gln Ser Tyr Tyr Tyr Ala Val Ser Asp Phe Ser Val Gly Gly Arg Cys Lys Cys Asn Gly His Ala Ser Glu Cys Gly Pro Asp Val Ala Gly Gln Leu Ala 1$ Cys Arg Cys Gln His Asn Thr Thr Gly Thr Asp Cys Glu Arg Cys Leu Pro Phe Phe Gln Asp Arg Pro Trp Ala Arg Gly Thr Ala Glu Ala Ala His Glu Cys Leu Pro Cys Asn Cys Ser Gly Arg Ser Glu Glu Cys Thr Phe Asp Arg Glu Leu Phe Arg Ser Thr Gly His Gly Gly Arg Cys Hia 2$ 275 280 285 His Cys Arg Asp His Thr Ala Gly Pro His Cys Glu Arg Cys Gln Glu Asn Phe Tyr His Trp Asp Pro Arg Met Pro Cys Gln Pro Cys Asp Cys 3$
Gln Ser Ala Gly Ser Leu His Leu Gln Cys Asp Asp Thr Gly Thr Cys Ala Cys Lys Pro Thr Val Thr Gly Trp Lys Cys Asp Arg Cys Leu Pro Gly Phe His Ser Leu Ser Glu Gly Gly Cys Arg Pro Cys Thr Cys Asn Pro Ala Gly Ser Leu Asp Thr Cya Asp Pro Arg Ser Gly Arg Cys Pro 4$ Cys Lys Glu Asn Val Glu Gly Aen Leu Cys Asp Arg Cys Arg Pro Gly Thr Phe Asn Leu Gln Pro His Asn Pro Ala Gly Cys Ser Ser Cys Phe $0 Cys Tyr Gly His Ser Lys Val Cys Ala Ser Thr Ala Gln Phe Gln Val His His Ile Leu Ser Asp Phe His Gln Gly Ala Glu Gly Trp Trp Ala $$ 435 440 445 Arg Ser Val Gly Gly Ser Glu His Ser Pro Gln Trp Ser Pro Asn Gly f)0 Val Leu Leu Ser Pro Glu Asp Glu Glu Glu Leu Thr Ala Pro Gly Lys Phe Leu Gly Asp Gln Arg Phe Ser Tyr Gly Gln Pro Leu Ile Leu Thr $ 485 490 495 Phe Arg Val Pro Pro Gly Asp Ser Pro Leu Pro Val Gln Leu Arg Leu Glu Gly Thr Gly Leu Ala Leu Ser Leu Arg His Ser Ser Leu Ser Gly Pro Gln Asp Ala Arg Ala Ser Gln Gly Gly Arg Ala Gln Val Pro Leu 1$
Gln Glu Thr Ser Glu Aap Val Ala Pro Pro Leu Pro Pro Phe His Phe Gln Arg Leu Leu Ala Asn Leu Thr Ser Leu Arg Leu Arg Val Ser Pro Gly Pro Ser Pro Ala Gly Pro Val Phe Leu Thr Glu Val Arg Leu Thr 2$ Ser Ala Arg Pro Gly Leu Ser Pro Pro Ala Ser Trp Val Glu Ile Cys Ser Cys Pro Thr Gly Tyr Thr Gly Gln Phe Cys Glu Ser Cys Ala Pro Gly Tyr Lys Arg Glu Met Pro Gln Gly Gly Pro Tyr Ala Ser Cys Val Pro Cys Thr Cys Asn Gln His Gly Thr Cys Asp Pro Asn Thr Gly Ile 3$ 645 650 655 Cys Val Cys Ser His His Thr Glu Gly Pro Ser Cys Glu Arg Cys Leu Pro Gly Phe Tyr Gly Asn Pro Phe Ala Gly Gln Ala Asp Asp Cys Gln Pro Cys Pro Cys Pro Gly Gln Ser Ala Cys Thr Thr Ile Pro Glu Ser 4$
Gly Glu Val Val Cys Thr His Cys Pro Pro Gly Gln Arg Gly Arg Arg Cys Glu Val Cys Asp Asp Gly Phe Phe Gly Asp Pro Leu Gly Leu Phe $0 725 730 735 Gly His Pro Gln Pro Cys His Gln Cys Gln Cys Ser Gly Asn Val Asp $$ Pro Asn Ala Val Gly Asn Cys Asp Pro Leu Ser Gly His Cys Leu Arg Cys Leu His Asn Thr Thr Gly Asp His Cys Glu His Cys Gln Glu Gly Phe Tyr Gly Ser Ala Leu Ala Pro Arg Pro Ala Asp Lys Cys Met Pro WO 99/19348 ~ 4 PCT/US98/21391 $ Cys Ser Cys His Pro Gln Gly Ser Val Ser Glu Gln Met Pro Cys Asp Pro Val Thr Gly Gln Cys Ser Cys Leu Pro His Val Thr Ala Arg Asp Cys Ser Arg Cys Tyr Pro Gly Phe Phe Asp Leu Gln Pro Gly Arg Gly Cys Arg Ser Cys Lys Cys His Pro Leu Gly Ser Gln Glu Asp Gln Cys 1$ 850 855 860 His Pro Lys Thr Gly Gln Cys Thr Cys Arg Pro Gly Val Thr Gly Gln Ala Cys Asp Arg Cys Gln Leu Gly Phe Phe Gly Ser Ser Ile Lye Gly 2$
Cys Arg Ala Cys Arg Cys Ser Pro Leu Gly Ala Ala Ser Ala Gln Cys His Tyr Asn Gly Thr Cys Val Cys Arg Pro Gly Phe Glu Gly Tyr Lys Cys Aep Arg Cys His Tyr Asn Phe Phe Leu Thr Ala Asp Gly Thr His Cys Gln Gln Cys Pro Ser Cys Tyr Ala Leu Val Lys Glu Glu Xaa Ala 3$ Lys Leu Lys Ala Arg Leu Thr Leu Thr Glu Gly Trp Leu Gln Gly Ser Asp Cys Gly Ser Pro Trp Gly Pro Leu Asp Ile Leu Leu Gly Glu Ala Pro Arg Xaa Asp Val Tyr Gln Gly His His Leu Leu Pro Gly Ala Arg Glu Ala Phe Leu Glu Gln Met Met Gly Leu Glu Gly Ala Val Lys Ala 4$ solo loss 1020 Ala Arg Glu Gln Leu Gln Arg Leu Asn Lys Gly Ala Arg Cys Ala Gln $0 Ala Gly Ser Gln Lys Thr Cys Thr Gln Leu Ala Asp Leu Glu Ala Val $$
Leu Glu Ser Ser Glu Glu Glu Ile Leu His Ala Ala Ala Ile Leu Ala Ser Leu Glu Ile Pro Gln Glu Gly Pro Ser Gln Pro Thr Lys Trp Ser His Leu Ala Ile Glu Ala Arg Ala Leu Ala Arg Ser His Arg Asp Thr Ala Thr Lys Ile Ala Ala Thr Ala Trp Arg Ala Leu Leu Ala Ser Asn WO 99/19348 ~ 5 PCT/US98/Z1391 Thr Ser Tyr Ala Leu Leu Trp Asn Leu Leu Glu Gly Arg Val Ala L_eu Glu Thr Gln Arg Asp Leu Glu Asp Arg Tyr Gln Glu Val Gln Ala Ala Gln Lys Ala Leu Arg Thr Ala Val Ala Glu Val Leu Pro Glu Ala Xaa IS Lys Arg Val Gly His Arg Ala Ala Ser Trp Arg Arg Tyr Ser Pro Val Pro Gly Leu Ala Gly Phe Pro Gly Ser Ser Ala Ser Xaa Lys Ser Arg Ala Glu Asp Leu Gly Leu Lys Ala Lys Ala Leu Glu Lys Thr Val Ala Ser Trp Gln His Met Ala Thr Glu Ala Ala Arg Thr Leu Gln Thr Ala 2$ 1220 1225 1230 Ala Gln Ala Thr Leu Arg Gln Thr Glu Pro Leu Thr Met Ala Arg Ser Arg Leu Thr Ala Thr Phe Ala Ser Gln Leu His Gln Gly Ala Arg Ala Ala Leu Thr Gln Ala Ser Ser Ser Val Gln Ala Ala Thr Val Thr Val Met Gly Ala Arg Thr Leu Leu Ala Asp Leu Glu Gly Met Lys Leu Gln Phe Pro Arg Pro Lys Asp Gln Ala Ala Leu Gln Arg Lys Ala Asp Ser Val Ser Asp Arg Leu Leu Ala Asp Thr Arg Lys Lys Thr Lys Gln Ala 4S Glu Arg Met Leu Gly Asn Ala Ala Pro Leu Ser Ser Ser Ala Lys Lys Lys Gly Arg Glu Ala Glu Val Leu Ala Lys Asp Ser Ala Lys Leu Ala Lys Ala Leu Leu Arg Glu Arg Lys Gln Ala His Arg Arg Ala Ser Arg Leu Thr Ser Gln Xaa Leu Gln Ala Thr Leu Gln Gln Ala Ser Gln Gln Val Leu Ala Ser Glu Ala Arg Arg Gln Glu Leu Glu Glu Ala Glu Arg Val Gly Ala Gly Leu Ser Glu Met Glu Gln Gln Ile Arg Glu Ser Arg Ile Ser Leu Glu Lys Asp Ile Glu Thr Leu Ser Glu Leu Leu Ala Arg Leu Gly SerLeuAsp Thr GlnAlaPro AlaGlnAlaLeuAsn Glu His Thr Gln TrpAlaLeu Glu LeuArgLeu GlnLeuGlySerPro Gly Arg Ser Leu GlnArgLys Leu LeuLeuGlu GlnGluSerGlnGln Gln Ser Glu Leu GlnIleGln Gly GluSerAsp LeuAlaGluIleArg Ala Phe Asp Lys GlnAsnLeu Glu IleLeuHis SerLeuProGluAsn Cys Ala Ala Ser TrpGln ZS (2) INFORMATION FOR SEQ ID N0:4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4890 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE
DESCRIPTION:
SEQ ID
N0:4:
WO 99/19348 ~ ~ PCT/US98/21391 S
lO CTGTCAGGAGAATTTCTATCACTGGGACCCGCGGATGCCATGCCAGCCCTGTGACTGCCA960 GGGAGCCGAAGGCTGGTG~GCCAGAAGTGTGGGGGGCTCTGAGCACTCCCCACAATGGAG1380 SS
WO 99/19348 ~ 8 PCT/US9$/21391 WO 99/19348 ~ g PCT/US98/21391 1$
TGTGTGTATG ACCCAAATAA p~~74AAAAAAA 4 2$
(2) INFORMATION FOR SEQ ID
N0:5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1105 amino acids 30 (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide 3$ (v) FRAGMENT TYPE: internal (xi) SEQUENCE N0:5:
DESCRIPTION:
SEQ
ID
Met GlnPheGln LeuThrLeu PheLeuHis LeuGlyTrp LeuSerTyr Ser LysAlaGln AspAspCys AsnArgGly AlaCysHis ProThrThr 4$ 20 25 30 Gly AepLeuLeu.ValGlyArg AsnThrGln LeuMetAla SerSerThr $0 Cys GlyLeuSer ArgAlaGln LysTyrCys IleLeuSer TyrLeuGiu Gly GluGlnLys CysSerIle CysAspSer ArgPhePro TyrAspPro $$
Tyr AspGlnPro AsnSerHie ThrIleGlu AsnValThr ValSerPhe Glu ProAspArg GluLysLys TrpTrpGln SerGluAsn GlyLeuAsp His ValSerIle ArgLeuAsp LeuGluAla LeuPheArg PheSerHis Leu IleLeuThr PheLysThr PheArgPro AlaAlaMet LeuValGlu Arg SerThrAsp TyrGlyHis AsnTrpLys ValPheLys TyrPheAla Lys AspCysAla ThrSerPhe ProAsnIle ThrSerGly GlnAlaGln 1$ Gly ValGlyAsp IleValCys AspSerLys TyrSerAep IleGluPro Ser ThrGlyGly GluValVal LeuLysVal LeuAspPro SerPheGlu Ile GluAsnPro TyrSerPro TyrIleGln AspLeuVal ThrLeuThr Asn LeuArgIle AenPheThr LysLeuHis ThrLeuGly AspAlaLeu 2$ 225 230 235 240 Leu GlyArgArg GlnAsnAsp SerLeuAsp LysTyrTyr TyrAlaLeu Tyr GluMetIle ValArgGly SerCysPhe CysAsnGly HisAlaSer Glu CysArgPro MetGlnLys MetArgGly RspValPhe SerProPro 3$
Gly MetValHis GlyGlnCys ValCysGln HisAsnThr AspGlyPro Asn CysGluArg CysLysAsp PhePheGln AspAlaPro TrpArgPro Ala AlaAspLeu GlnAspAsn AlaCysArg SerCysSer CysAsnSer 4$ His SerSerArg CysHisPhe AspMetThr ThrTyrLeu AlaSerGly Gly LeuSerGly GlyValCys GluAspCys GlnHisAsn ThrGluGly $~
Gln HisCysAsp ArgCysArg ProLeuPhe TyrArgAsp ProLeuLys Thr IleSerAsp ProTyrAla CysIlePro CysGluCys AspProAsp $$ 385 390 395 400 Gly ThrIleSer GlyGlyIle CysValSer HisSerAsp ProAlaLeu fib Gly SerValAla GlyGlnCys LeuCysLys GluAsnVal GluGlyAla Lys CysAspGln CysLysPro AsnHisTyr GlyLeuSer AlaThrAsp WO 99/19348 2 ~ PCT/US98121391 $ 435 440 445 Pro Leu Gly Cys Gln Pro Cys Aep Cys Asn Pro Leu Gly Ser Leu~Pro Phe Leu Thr Cys Asp Val Asp Thr G1y Gln Cys Leu Cys Leu Xaa Tyr Val Thr Gly Ala His Cys Glu Glu Cys Thr Val Gly Tyr Trp Gly Leu IS
Gly Asn His Leu His Gly Cys Ser Pro Cys Asp Cys Asp Ile Gly Gly Ala Tyr SerAsnVal CysSer ProLysAsn GlyGlnCysGlu CysArg Pro His ValThrGly ArgSer CysSerGlu ProAlaProGly TyrPhe 2$ Phe Ala ProLeuAsn PheTyr LeuTyrGlu AlaGluGluAla ThrThr Leu Gln GlyLeuAla ProLeu GlySerGlu ThrPheGlyGln SerPro Ala Val HisValVal LeuGly GluProVal ProGlyAsnPro ValThr Trp Thr GlyProGly PheAla ArgValLeu ProGlyAlaGly LeuArg 3$ 595 600 605 Phe Ala Val Asn Asn Ile Pro Phe Pro Val Asp Phe Thr Ile Ala Ile 40 His Tyr Glu Thr Gln Ser Ala Ala Asp Trp Thr Val Gln Ile Val Val Asn Pro Pro Gly Gly Ser Glu His Cys Ile Pro Lys Thr Leu Gln Ser Lys Pro Gln Ser Phe Ala Leu Pro Ala Ala Thr Arg Ile Met Leu Leu Pro Thr Pro Ile Cys Leu Glu Pro Asp Val Gln Tyr Ser Ile Asp Val $0 675 680 685 Tyr Phe Ser Gln Pro Leu Gln Gly Glu Ser His Ala His Ser His Val 690 695 ' 700 Leu Val Asp Ser Leu Gly Leu Ile Pro Gln Ile Asn Ser Leu Glu Asn Phe Cys Ser Lys Gln Asp Leu Asp Glu Tyr Gln Leu His Asn Cys Val Glu Ile Ala Ser Ala Met Gly Pro Gln Val Leu Pro Gly Ala Cys Glu WO 99/19348 22 PCT/US98J~1391 Arg Leu Ile Ile Ser Met Ser Ala Lys Leu His Asp Gly Ala Val Ala Cys Lys Cys His Pro Gln Gly Ser Val Gly Ser Ser Cys Ser Arg Leu Gly Gly Gln Cys Gln Cys Lys Pro Leu Val Val Gly Arg Cys Cys Asp Arg Cys Ser Thr Gly Ser Tyr Asp Leu Gly His Hie Gly Cys His Pro Cys His Cys His Pro Gln Gly Ser Lys Asp Thr Val Cys Asp Gln Val Thr Gly Gln Cys Pro Cys His Gly Glu Val Ser Gly Arg Arg Cys Asp Arg Cys Leu Ala Gly Tyr Phe Gly Phe Pro Ser Cys His Pro Cys Pro Cys Xaa Arg Phe Xaa Ala Xaa Asp Xaa Leu Xaa Xaa Asp Pro Glu Thr Gly Ser Cys Phe Asn Cys Gly Gly Phe Thr Thr Gly Arg Asn Cys Glu Arg Cys Ile Asp Gly Tyr Tyr Gly Asn Pro Ser Ser Gly Gln Pro Cys 3$ Arg Pro Cys Leu Cys Pro Asp Asp Pro Ser Ser Asn Gln Tyr Phe Ala His Ser Cys Tyr Gln Asn Leu Trp Ser Ser Asp Val Ile Cys Asn Cys Leu Gln Gly Tyr Thr Gly Thr Gln Cys Gly Glu Cys Ser Thr Gly Phe Tyr Gly Asn Pro Arg Ile Ser Gly Ala Pro Cys Gln Pro Cys Ala Cys Asn Asn Asn Ile Asp Val Thr Asp Pro Glu Ser Cys Ser Arg Val Thr SO Gly Glu Cys Leu Arg Cys Leu His Asn Thr Gln Gly Ala Asn Cys Gln Leu Cys Lys Pro Gly His Tyr Gly Ser Ala Leu Asn Gln Thr Cys Arg Arg Cys Ser Cys His Ala Ser Gly Val Ser Pro Met Glu Cys Pro Pro Gly Gly Gly Ala Cys Leu Cys Asp Pro Val Thr Gly Ala Cys Pro Cys Leu Pro Asn Val Thr Gly Leu Ala Cys Asp Arg Cys Ala Asp Gly Tyr S
Trp Asn Leu Val Pro Gly Arg Gly Cys Gln Ser Cys Asp Cys Asp Pro Arg Xaa Ser Gln Ser Ser His Cys Asp Gln Ala Arg Tyr Phe Lys Ala Tyr IS (2) INFORMATION FOR SEQ ID N0:6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3754 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE
DESCRIPTION:
SEQ ID
N0:6:
ACATGCCCCGTTTGCT(3CCT GAACCTCTCC ACAAAGACTCCCAGATCCTG AATTGAATTT60 S
AGCGGTGGCC
lO GCAGACCCCTCTTCTACAGGGACCCGCTCAAGACCATCTCAGATCCCTACGCGTGCATTC1260 SS
C)OTCCTGAGACAGGGTCATGCTTCAATTGTGGAGGCTTTACAACTGGCAGAAACTGTGAAAG2760 TCTCAAGTTC AGCTTCGCCTACTTCAGTTTCCCCTC'rGTGACTGAGGAAGTCAGAATTCA 3600 3S CCTTCAAAAC CTAAAAAAAAAAA,AAAAAAAAAAA 3754
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1524 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (v) FRAGMENT TYPE: internal 3O (xi) SEQUENCE
DESCRIPTION:
SEQ
ID
N0:3:
Ala AlaGlyAla GlyAlaHis CysGlnArg.CysAspAla AlaAspPro 35 Gln ArgHisHis AsnAlaSer TyrLeuThr AspPheHis SerGlnAsp Glu SerThrTrp TrpGlnSer ProSerMet AlaPheGly ValGlnTyr Pro ThrSerVal AsnIleThr LeuXaaArg LeuGlyLys AlaTyrGlu Ile ThrTyrVal ArgLeuLys PheHisThr SerArgPro GluSerPhe Ala IleTyrLys ArgSerArg AlaAspGly ProTrpGlu ProTyrGln S0 Phe TyrSerAla SerCysGln LysThrTyr GlyArgPro GluGlyGln Tyr LeuArgPro GlyGluAsp GluArgVal AlaPheCys ThrSerGlu SS
Phe SerAspIle SerProLeu SerGlyGly AsnValAla PheSerThr Leu GluGlyArg ProSerAla TyrAsnPhe GluGluSer ProGlyLeu Gln GluTrpVal ThrSerThr GluLeuLeu IleSerLeu AspArgLeu Asn Thr Phe Gly Asp Asp Ile Phe Lys Asp Pro Lys Val Leu Gln Ser Tyr Tyr Tyr Ala Val Ser Asp Phe Ser Val Gly Gly Arg Cys Lys Cys Asn Gly His Ala Ser Glu Cys Gly Pro Asp Val Ala Gly Gln Leu Ala 1$ Cys Arg Cys Gln His Asn Thr Thr Gly Thr Asp Cys Glu Arg Cys Leu Pro Phe Phe Gln Asp Arg Pro Trp Ala Arg Gly Thr Ala Glu Ala Ala His Glu Cys Leu Pro Cys Asn Cys Ser Gly Arg Ser Glu Glu Cys Thr Phe Asp Arg Glu Leu Phe Arg Ser Thr Gly His Gly Gly Arg Cys Hia 2$ 275 280 285 His Cys Arg Asp His Thr Ala Gly Pro His Cys Glu Arg Cys Gln Glu Asn Phe Tyr His Trp Asp Pro Arg Met Pro Cys Gln Pro Cys Asp Cys 3$
Gln Ser Ala Gly Ser Leu His Leu Gln Cys Asp Asp Thr Gly Thr Cys Ala Cys Lys Pro Thr Val Thr Gly Trp Lys Cys Asp Arg Cys Leu Pro Gly Phe His Ser Leu Ser Glu Gly Gly Cys Arg Pro Cys Thr Cys Asn Pro Ala Gly Ser Leu Asp Thr Cya Asp Pro Arg Ser Gly Arg Cys Pro 4$ Cys Lys Glu Asn Val Glu Gly Aen Leu Cys Asp Arg Cys Arg Pro Gly Thr Phe Asn Leu Gln Pro His Asn Pro Ala Gly Cys Ser Ser Cys Phe $0 Cys Tyr Gly His Ser Lys Val Cys Ala Ser Thr Ala Gln Phe Gln Val His His Ile Leu Ser Asp Phe His Gln Gly Ala Glu Gly Trp Trp Ala $$ 435 440 445 Arg Ser Val Gly Gly Ser Glu His Ser Pro Gln Trp Ser Pro Asn Gly f)0 Val Leu Leu Ser Pro Glu Asp Glu Glu Glu Leu Thr Ala Pro Gly Lys Phe Leu Gly Asp Gln Arg Phe Ser Tyr Gly Gln Pro Leu Ile Leu Thr $ 485 490 495 Phe Arg Val Pro Pro Gly Asp Ser Pro Leu Pro Val Gln Leu Arg Leu Glu Gly Thr Gly Leu Ala Leu Ser Leu Arg His Ser Ser Leu Ser Gly Pro Gln Asp Ala Arg Ala Ser Gln Gly Gly Arg Ala Gln Val Pro Leu 1$
Gln Glu Thr Ser Glu Aap Val Ala Pro Pro Leu Pro Pro Phe His Phe Gln Arg Leu Leu Ala Asn Leu Thr Ser Leu Arg Leu Arg Val Ser Pro Gly Pro Ser Pro Ala Gly Pro Val Phe Leu Thr Glu Val Arg Leu Thr 2$ Ser Ala Arg Pro Gly Leu Ser Pro Pro Ala Ser Trp Val Glu Ile Cys Ser Cys Pro Thr Gly Tyr Thr Gly Gln Phe Cys Glu Ser Cys Ala Pro Gly Tyr Lys Arg Glu Met Pro Gln Gly Gly Pro Tyr Ala Ser Cys Val Pro Cys Thr Cys Asn Gln His Gly Thr Cys Asp Pro Asn Thr Gly Ile 3$ 645 650 655 Cys Val Cys Ser His His Thr Glu Gly Pro Ser Cys Glu Arg Cys Leu Pro Gly Phe Tyr Gly Asn Pro Phe Ala Gly Gln Ala Asp Asp Cys Gln Pro Cys Pro Cys Pro Gly Gln Ser Ala Cys Thr Thr Ile Pro Glu Ser 4$
Gly Glu Val Val Cys Thr His Cys Pro Pro Gly Gln Arg Gly Arg Arg Cys Glu Val Cys Asp Asp Gly Phe Phe Gly Asp Pro Leu Gly Leu Phe $0 725 730 735 Gly His Pro Gln Pro Cys His Gln Cys Gln Cys Ser Gly Asn Val Asp $$ Pro Asn Ala Val Gly Asn Cys Asp Pro Leu Ser Gly His Cys Leu Arg Cys Leu His Asn Thr Thr Gly Asp His Cys Glu His Cys Gln Glu Gly Phe Tyr Gly Ser Ala Leu Ala Pro Arg Pro Ala Asp Lys Cys Met Pro WO 99/19348 ~ 4 PCT/US98/21391 $ Cys Ser Cys His Pro Gln Gly Ser Val Ser Glu Gln Met Pro Cys Asp Pro Val Thr Gly Gln Cys Ser Cys Leu Pro His Val Thr Ala Arg Asp Cys Ser Arg Cys Tyr Pro Gly Phe Phe Asp Leu Gln Pro Gly Arg Gly Cys Arg Ser Cys Lys Cys His Pro Leu Gly Ser Gln Glu Asp Gln Cys 1$ 850 855 860 His Pro Lys Thr Gly Gln Cys Thr Cys Arg Pro Gly Val Thr Gly Gln Ala Cys Asp Arg Cys Gln Leu Gly Phe Phe Gly Ser Ser Ile Lye Gly 2$
Cys Arg Ala Cys Arg Cys Ser Pro Leu Gly Ala Ala Ser Ala Gln Cys His Tyr Asn Gly Thr Cys Val Cys Arg Pro Gly Phe Glu Gly Tyr Lys Cys Aep Arg Cys His Tyr Asn Phe Phe Leu Thr Ala Asp Gly Thr His Cys Gln Gln Cys Pro Ser Cys Tyr Ala Leu Val Lys Glu Glu Xaa Ala 3$ Lys Leu Lys Ala Arg Leu Thr Leu Thr Glu Gly Trp Leu Gln Gly Ser Asp Cys Gly Ser Pro Trp Gly Pro Leu Asp Ile Leu Leu Gly Glu Ala Pro Arg Xaa Asp Val Tyr Gln Gly His His Leu Leu Pro Gly Ala Arg Glu Ala Phe Leu Glu Gln Met Met Gly Leu Glu Gly Ala Val Lys Ala 4$ solo loss 1020 Ala Arg Glu Gln Leu Gln Arg Leu Asn Lys Gly Ala Arg Cys Ala Gln $0 Ala Gly Ser Gln Lys Thr Cys Thr Gln Leu Ala Asp Leu Glu Ala Val $$
Leu Glu Ser Ser Glu Glu Glu Ile Leu His Ala Ala Ala Ile Leu Ala Ser Leu Glu Ile Pro Gln Glu Gly Pro Ser Gln Pro Thr Lys Trp Ser His Leu Ala Ile Glu Ala Arg Ala Leu Ala Arg Ser His Arg Asp Thr Ala Thr Lys Ile Ala Ala Thr Ala Trp Arg Ala Leu Leu Ala Ser Asn WO 99/19348 ~ 5 PCT/US98/Z1391 Thr Ser Tyr Ala Leu Leu Trp Asn Leu Leu Glu Gly Arg Val Ala L_eu Glu Thr Gln Arg Asp Leu Glu Asp Arg Tyr Gln Glu Val Gln Ala Ala Gln Lys Ala Leu Arg Thr Ala Val Ala Glu Val Leu Pro Glu Ala Xaa IS Lys Arg Val Gly His Arg Ala Ala Ser Trp Arg Arg Tyr Ser Pro Val Pro Gly Leu Ala Gly Phe Pro Gly Ser Ser Ala Ser Xaa Lys Ser Arg Ala Glu Asp Leu Gly Leu Lys Ala Lys Ala Leu Glu Lys Thr Val Ala Ser Trp Gln His Met Ala Thr Glu Ala Ala Arg Thr Leu Gln Thr Ala 2$ 1220 1225 1230 Ala Gln Ala Thr Leu Arg Gln Thr Glu Pro Leu Thr Met Ala Arg Ser Arg Leu Thr Ala Thr Phe Ala Ser Gln Leu His Gln Gly Ala Arg Ala Ala Leu Thr Gln Ala Ser Ser Ser Val Gln Ala Ala Thr Val Thr Val Met Gly Ala Arg Thr Leu Leu Ala Asp Leu Glu Gly Met Lys Leu Gln Phe Pro Arg Pro Lys Asp Gln Ala Ala Leu Gln Arg Lys Ala Asp Ser Val Ser Asp Arg Leu Leu Ala Asp Thr Arg Lys Lys Thr Lys Gln Ala 4S Glu Arg Met Leu Gly Asn Ala Ala Pro Leu Ser Ser Ser Ala Lys Lys Lys Gly Arg Glu Ala Glu Val Leu Ala Lys Asp Ser Ala Lys Leu Ala Lys Ala Leu Leu Arg Glu Arg Lys Gln Ala His Arg Arg Ala Ser Arg Leu Thr Ser Gln Xaa Leu Gln Ala Thr Leu Gln Gln Ala Ser Gln Gln Val Leu Ala Ser Glu Ala Arg Arg Gln Glu Leu Glu Glu Ala Glu Arg Val Gly Ala Gly Leu Ser Glu Met Glu Gln Gln Ile Arg Glu Ser Arg Ile Ser Leu Glu Lys Asp Ile Glu Thr Leu Ser Glu Leu Leu Ala Arg Leu Gly SerLeuAsp Thr GlnAlaPro AlaGlnAlaLeuAsn Glu His Thr Gln TrpAlaLeu Glu LeuArgLeu GlnLeuGlySerPro Gly Arg Ser Leu GlnArgLys Leu LeuLeuGlu GlnGluSerGlnGln Gln Ser Glu Leu GlnIleGln Gly GluSerAsp LeuAlaGluIleArg Ala Phe Asp Lys GlnAsnLeu Glu IleLeuHis SerLeuProGluAsn Cys Ala Ala Ser TrpGln ZS (2) INFORMATION FOR SEQ ID N0:4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4890 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE
DESCRIPTION:
SEQ ID
N0:4:
WO 99/19348 ~ ~ PCT/US98/21391 S
lO CTGTCAGGAGAATTTCTATCACTGGGACCCGCGGATGCCATGCCAGCCCTGTGACTGCCA960 GGGAGCCGAAGGCTGGTG~GCCAGAAGTGTGGGGGGCTCTGAGCACTCCCCACAATGGAG1380 SS
WO 99/19348 ~ 8 PCT/US9$/21391 WO 99/19348 ~ g PCT/US98/21391 1$
TGTGTGTATG ACCCAAATAA p~~74AAAAAAA 4 2$
(2) INFORMATION FOR SEQ ID
N0:5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1105 amino acids 30 (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide 3$ (v) FRAGMENT TYPE: internal (xi) SEQUENCE N0:5:
DESCRIPTION:
SEQ
ID
Met GlnPheGln LeuThrLeu PheLeuHis LeuGlyTrp LeuSerTyr Ser LysAlaGln AspAspCys AsnArgGly AlaCysHis ProThrThr 4$ 20 25 30 Gly AepLeuLeu.ValGlyArg AsnThrGln LeuMetAla SerSerThr $0 Cys GlyLeuSer ArgAlaGln LysTyrCys IleLeuSer TyrLeuGiu Gly GluGlnLys CysSerIle CysAspSer ArgPhePro TyrAspPro $$
Tyr AspGlnPro AsnSerHie ThrIleGlu AsnValThr ValSerPhe Glu ProAspArg GluLysLys TrpTrpGln SerGluAsn GlyLeuAsp His ValSerIle ArgLeuAsp LeuGluAla LeuPheArg PheSerHis Leu IleLeuThr PheLysThr PheArgPro AlaAlaMet LeuValGlu Arg SerThrAsp TyrGlyHis AsnTrpLys ValPheLys TyrPheAla Lys AspCysAla ThrSerPhe ProAsnIle ThrSerGly GlnAlaGln 1$ Gly ValGlyAsp IleValCys AspSerLys TyrSerAep IleGluPro Ser ThrGlyGly GluValVal LeuLysVal LeuAspPro SerPheGlu Ile GluAsnPro TyrSerPro TyrIleGln AspLeuVal ThrLeuThr Asn LeuArgIle AenPheThr LysLeuHis ThrLeuGly AspAlaLeu 2$ 225 230 235 240 Leu GlyArgArg GlnAsnAsp SerLeuAsp LysTyrTyr TyrAlaLeu Tyr GluMetIle ValArgGly SerCysPhe CysAsnGly HisAlaSer Glu CysArgPro MetGlnLys MetArgGly RspValPhe SerProPro 3$
Gly MetValHis GlyGlnCys ValCysGln HisAsnThr AspGlyPro Asn CysGluArg CysLysAsp PhePheGln AspAlaPro TrpArgPro Ala AlaAspLeu GlnAspAsn AlaCysArg SerCysSer CysAsnSer 4$ His SerSerArg CysHisPhe AspMetThr ThrTyrLeu AlaSerGly Gly LeuSerGly GlyValCys GluAspCys GlnHisAsn ThrGluGly $~
Gln HisCysAsp ArgCysArg ProLeuPhe TyrArgAsp ProLeuLys Thr IleSerAsp ProTyrAla CysIlePro CysGluCys AspProAsp $$ 385 390 395 400 Gly ThrIleSer GlyGlyIle CysValSer HisSerAsp ProAlaLeu fib Gly SerValAla GlyGlnCys LeuCysLys GluAsnVal GluGlyAla Lys CysAspGln CysLysPro AsnHisTyr GlyLeuSer AlaThrAsp WO 99/19348 2 ~ PCT/US98121391 $ 435 440 445 Pro Leu Gly Cys Gln Pro Cys Aep Cys Asn Pro Leu Gly Ser Leu~Pro Phe Leu Thr Cys Asp Val Asp Thr G1y Gln Cys Leu Cys Leu Xaa Tyr Val Thr Gly Ala His Cys Glu Glu Cys Thr Val Gly Tyr Trp Gly Leu IS
Gly Asn His Leu His Gly Cys Ser Pro Cys Asp Cys Asp Ile Gly Gly Ala Tyr SerAsnVal CysSer ProLysAsn GlyGlnCysGlu CysArg Pro His ValThrGly ArgSer CysSerGlu ProAlaProGly TyrPhe 2$ Phe Ala ProLeuAsn PheTyr LeuTyrGlu AlaGluGluAla ThrThr Leu Gln GlyLeuAla ProLeu GlySerGlu ThrPheGlyGln SerPro Ala Val HisValVal LeuGly GluProVal ProGlyAsnPro ValThr Trp Thr GlyProGly PheAla ArgValLeu ProGlyAlaGly LeuArg 3$ 595 600 605 Phe Ala Val Asn Asn Ile Pro Phe Pro Val Asp Phe Thr Ile Ala Ile 40 His Tyr Glu Thr Gln Ser Ala Ala Asp Trp Thr Val Gln Ile Val Val Asn Pro Pro Gly Gly Ser Glu His Cys Ile Pro Lys Thr Leu Gln Ser Lys Pro Gln Ser Phe Ala Leu Pro Ala Ala Thr Arg Ile Met Leu Leu Pro Thr Pro Ile Cys Leu Glu Pro Asp Val Gln Tyr Ser Ile Asp Val $0 675 680 685 Tyr Phe Ser Gln Pro Leu Gln Gly Glu Ser His Ala His Ser His Val 690 695 ' 700 Leu Val Asp Ser Leu Gly Leu Ile Pro Gln Ile Asn Ser Leu Glu Asn Phe Cys Ser Lys Gln Asp Leu Asp Glu Tyr Gln Leu His Asn Cys Val Glu Ile Ala Ser Ala Met Gly Pro Gln Val Leu Pro Gly Ala Cys Glu WO 99/19348 22 PCT/US98J~1391 Arg Leu Ile Ile Ser Met Ser Ala Lys Leu His Asp Gly Ala Val Ala Cys Lys Cys His Pro Gln Gly Ser Val Gly Ser Ser Cys Ser Arg Leu Gly Gly Gln Cys Gln Cys Lys Pro Leu Val Val Gly Arg Cys Cys Asp Arg Cys Ser Thr Gly Ser Tyr Asp Leu Gly His Hie Gly Cys His Pro Cys His Cys His Pro Gln Gly Ser Lys Asp Thr Val Cys Asp Gln Val Thr Gly Gln Cys Pro Cys His Gly Glu Val Ser Gly Arg Arg Cys Asp Arg Cys Leu Ala Gly Tyr Phe Gly Phe Pro Ser Cys His Pro Cys Pro Cys Xaa Arg Phe Xaa Ala Xaa Asp Xaa Leu Xaa Xaa Asp Pro Glu Thr Gly Ser Cys Phe Asn Cys Gly Gly Phe Thr Thr Gly Arg Asn Cys Glu Arg Cys Ile Asp Gly Tyr Tyr Gly Asn Pro Ser Ser Gly Gln Pro Cys 3$ Arg Pro Cys Leu Cys Pro Asp Asp Pro Ser Ser Asn Gln Tyr Phe Ala His Ser Cys Tyr Gln Asn Leu Trp Ser Ser Asp Val Ile Cys Asn Cys Leu Gln Gly Tyr Thr Gly Thr Gln Cys Gly Glu Cys Ser Thr Gly Phe Tyr Gly Asn Pro Arg Ile Ser Gly Ala Pro Cys Gln Pro Cys Ala Cys Asn Asn Asn Ile Asp Val Thr Asp Pro Glu Ser Cys Ser Arg Val Thr SO Gly Glu Cys Leu Arg Cys Leu His Asn Thr Gln Gly Ala Asn Cys Gln Leu Cys Lys Pro Gly His Tyr Gly Ser Ala Leu Asn Gln Thr Cys Arg Arg Cys Ser Cys His Ala Ser Gly Val Ser Pro Met Glu Cys Pro Pro Gly Gly Gly Ala Cys Leu Cys Asp Pro Val Thr Gly Ala Cys Pro Cys Leu Pro Asn Val Thr Gly Leu Ala Cys Asp Arg Cys Ala Asp Gly Tyr S
Trp Asn Leu Val Pro Gly Arg Gly Cys Gln Ser Cys Asp Cys Asp Pro Arg Xaa Ser Gln Ser Ser His Cys Asp Gln Ala Arg Tyr Phe Lys Ala Tyr IS (2) INFORMATION FOR SEQ ID N0:6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3754 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE
DESCRIPTION:
SEQ ID
N0:6:
ACATGCCCCGTTTGCT(3CCT GAACCTCTCC ACAAAGACTCCCAGATCCTG AATTGAATTT60 S
AGCGGTGGCC
lO GCAGACCCCTCTTCTACAGGGACCCGCTCAAGACCATCTCAGATCCCTACGCGTGCATTC1260 SS
C)OTCCTGAGACAGGGTCATGCTTCAATTGTGGAGGCTTTACAACTGGCAGAAACTGTGAAAG2760 TCTCAAGTTC AGCTTCGCCTACTTCAGTTTCCCCTC'rGTGACTGAGGAAGTCAGAATTCA 3600 3S CCTTCAAAAC CTAAAAAAAAAAA,AAAAAAAAAAA 3754
Claims (3)
1. An isolated laminin 12 which includes an .alpha.2 subunit, a .beta.1 subunit arid a .gamma.3 -subunit.
2. An isolated .gamma.3 subunit.
3. An isolated .beta.4 subunit.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US6160997P | 1997-10-10 | 1997-10-10 | |
US60/061,609 | 1997-10-10 | ||
PCT/US1998/021391 WO1999019348A1 (en) | 1997-10-10 | 1998-10-08 | Laminins and uses thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2304169A1 true CA2304169A1 (en) | 1999-04-22 |
Family
ID=22036904
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002304169A Abandoned CA2304169A1 (en) | 1997-10-10 | 1998-10-09 | Laminins and uses thereof |
Country Status (5)
Country | Link |
---|---|
EP (1) | EP1021460A4 (en) |
JP (1) | JP2001519180A (en) |
AU (1) | AU751632B2 (en) |
CA (1) | CA2304169A1 (en) |
WO (1) | WO1999019348A1 (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6682911B1 (en) * | 1997-10-10 | 2004-01-27 | The General Hospital Corporation | Laminins and uses thereof |
KR101219512B1 (en) * | 2011-03-15 | 2013-01-11 | 서울대학교산학협력단 | Human Laminin α2 Chain LG3 Domain and Active Peptides Promoting Cell Adhesion, Spreading, Migration,and Neurite Outgrowth |
SG10201908826UA (en) * | 2018-10-22 | 2020-05-28 | Euroimmun Medizinische Labordiagnostika Ag | Diagnosis of blistering autoimmune diseases |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5660982A (en) * | 1994-10-04 | 1997-08-26 | Tryggvason; Karl | Laminin chains: diagnostic uses |
-
1998
- 1998-10-08 JP JP2000515919A patent/JP2001519180A/en not_active Withdrawn
- 1998-10-08 WO PCT/US1998/021391 patent/WO1999019348A1/en not_active Application Discontinuation
- 1998-10-08 EP EP98953370A patent/EP1021460A4/en not_active Withdrawn
- 1998-10-09 CA CA002304169A patent/CA2304169A1/en not_active Abandoned
- 1998-10-09 AU AU10765/99A patent/AU751632B2/en not_active Ceased
Also Published As
Publication number | Publication date |
---|---|
JP2001519180A (en) | 2001-10-23 |
EP1021460A1 (en) | 2000-07-26 |
WO1999019348A9 (en) | 1999-07-29 |
AU1076599A (en) | 1999-05-03 |
WO1999019348A8 (en) | 1999-09-02 |
AU751632B2 (en) | 2002-08-22 |
EP1021460A4 (en) | 2005-03-09 |
WO1999019348A1 (en) | 1999-04-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6274717B1 (en) | Splicing variant of human membrane-type matrix metalloproteinease-5 (MT-MMP5-L) | |
US6682911B1 (en) | Laminins and uses thereof | |
AU751632B2 (en) | Laminins and uses thereof | |
US5641748A (en) | Caip-like gene family | |
US5656438A (en) | CAIP-like gene family | |
US5837844A (en) | CAIP-like gene family | |
US6566489B1 (en) | Syndecan-4 binding protein (S4BP) and uses thereof | |
US6656705B1 (en) | Sciellin and uses thereof | |
US6818214B2 (en) | Two novel genes from psoriatic epidermis: psoriastatin type I and psoriastatin type II | |
EP1064364A1 (en) | The cytokine family member 2-21 | |
US6171800B1 (en) | Method of making and binding CAIP polypeptides | |
US6423824B1 (en) | CAIP-like gene family | |
WO2000002577A1 (en) | Therapeutic use of uncoupling protein hhfcw60 | |
US20030013673A1 (en) | Modulating the Rad-nm23 interaction | |
JP2003520577A (en) | EPO primary response gene 1, EPRG1 | |
WO1997049808A1 (en) | The caip-like gene family | |
WO2000034470A1 (en) | Npcbac06: human transitional endoplasmic reticulum atpase gene | |
WO2000061598A2 (en) | Rat kidney specific gene profilin-3 | |
US20020055476A1 (en) | RAMP2a: receptor activity modifying protein-2a | |
WO1999043208A1 (en) | The helios gene | |
WO2000034328A1 (en) | Tpaaoh04: human g protein sara gene | |
JP2002513577A (en) | Endothelin converting enzyme-like protein | |
JP2002503635A (en) | EPRG3Spt, EPO primary response gene | |
JP2002262893A (en) | Sbhunc 50 polypeptide and sbhunc 50 polynucleotide | |
WO2000022116A1 (en) | Human mdg1 gene (cbfawb10) |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request | ||
FZDE | Discontinued |