US6534477B2 - Production and use of modified cystatins - Google Patents
Production and use of modified cystatins
- Publication number
- US6534477B2 US09/775,932 US77593201A
- Authority
- US
- United States
- Prior art keywords
- val
- ser
- gln
- lys
- ala
- Prior art date
- Legal status
- Expired - Fee Related
Links
- 108050004038 cystatin Proteins 0.000 title abstract description 136
- 102000015833 Cystatin Human genes 0.000 title abstract description 125
- 238000004519 manufacturing process Methods 0.000 title description 23
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 claims description 48
- 101000912205 Homo sapiens Cystatin-C Proteins 0.000 claims description 35
- 102000049632 human CST3 Human genes 0.000 claims description 34
- 230000004048 modification Effects 0.000 claims description 17
- 238000012986 modification Methods 0.000 claims description 17
- 230000013595 glycosylation Effects 0.000 abstract description 52
- 238000006206 glycosylation reaction Methods 0.000 abstract description 52
- 238000000034 method Methods 0.000 abstract description 28
- 230000000694 effects Effects 0.000 abstract description 14
- 230000017854 proteolysis Effects 0.000 abstract description 9
- 125000003275 alpha amino acid group Chemical group 0.000 description 87
- 235000001014 amino acid Nutrition 0.000 description 69
- 108090000623 proteins and genes Proteins 0.000 description 69
- 102000004169 proteins and genes Human genes 0.000 description 63
- 235000018102 proteins Nutrition 0.000 description 60
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 43
- 238000006467 substitution reaction Methods 0.000 description 40
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 38
- 125000003729 nucleotide group Chemical group 0.000 description 38
- 210000004027 cell Anatomy 0.000 description 37
- 239000004365 Protease Substances 0.000 description 34
- 239000002773 nucleotide Substances 0.000 description 33
- 102000035195 Peptidases Human genes 0.000 description 32
- 108091005804 Peptidases Proteins 0.000 description 32
- 229940024606 amino acid Drugs 0.000 description 32
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 31
- 150000001413 amino acids Chemical class 0.000 description 29
- 108020004414 DNA Proteins 0.000 description 27
- 241000282414 Homo sapiens Species 0.000 description 27
- 150000007523 nucleic acids Chemical class 0.000 description 26
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Chemical compound CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 24
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 23
- 239000002299 complementary DNA Substances 0.000 description 23
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 21
- 235000013305 food Nutrition 0.000 description 19
- 108010076504 Protein Sorting Signals Proteins 0.000 description 18
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 17
- 108020004707 nucleic acids Proteins 0.000 description 17
- 102000039446 nucleic acids Human genes 0.000 description 17
- 108090000765 processed proteins & peptides Proteins 0.000 description 15
- 235000019419 proteases Nutrition 0.000 description 15
- 101000884770 Homo sapiens Cystatin-M Proteins 0.000 description 14
- 108091034117 Oligonucleotide Proteins 0.000 description 14
- 102000051607 human CST6 Human genes 0.000 description 14
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 14
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 13
- 230000014509 gene expression Effects 0.000 description 13
- 241000283690 Bos taurus Species 0.000 description 12
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers 
Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 12
- 230000002401 inhibitory effect Effects 0.000 description 12
- 239000000758 substrate Substances 0.000 description 12
- 210000001519 tissue Anatomy 0.000 description 12
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 11
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 11
- 108091028043 Nucleic acid sequence Proteins 0.000 description 11
- 230000007170 pathology Effects 0.000 description 11
- 235000019465 surimi Nutrition 0.000 description 11
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 10
- 102100021277 Beta-secretase 2 Human genes 0.000 description 10
- 101710150190 Beta-secretase 2 Proteins 0.000 description 10
- 241000252233 Cyprinus carpio Species 0.000 description 10
- 125000000539 amino acid group Chemical group 0.000 description 10
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 10
- 241000196324 Embryophyta Species 0.000 description 9
- 241000277331 Salmonidae Species 0.000 description 9
- 238000002741 site-directed mutagenesis Methods 0.000 description 9
- BHQQRVARKXWXPP-ACZMJKKPSA-N Asn-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BHQQRVARKXWXPP-ACZMJKKPSA-N 0.000 description 8
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 8
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 8
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 8
- WBINSDOPZHQPPM-AVGNSLFASA-N Ser-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)O WBINSDOPZHQPPM-AVGNSLFASA-N 0.000 description 8
- 108010062796 arginyllysine Proteins 0.000 description 8
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 8
- 108010092114 histidylphenylalanine Proteins 0.000 description 8
- 108010025488 pinealon Proteins 0.000 description 8
- 238000012545 processing Methods 0.000 description 8
- 241000282326 Felis catus Species 0.000 description 7
- 101000916687 Homo sapiens Cystatin-D Proteins 0.000 description 7
- 101000722966 Homo sapiens Cystatin-S Proteins 0.000 description 7
- 101000722958 Homo sapiens Cystatin-SA Proteins 0.000 description 7
- 101000884768 Homo sapiens Cystatin-SN Proteins 0.000 description 7
- 229940042399 direct acting antivirals protease inhibitors Drugs 0.000 description 7
- 102000045166 human CST5 Human genes 0.000 description 7
- 238000003752 polymerase chain reaction Methods 0.000 description 7
- 239000000523 sample Substances 0.000 description 7
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 6
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 6
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 6
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 6
- ZOLXQKZHYOHHMD-DLOVCJGASA-N Cys-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N ZOLXQKZHYOHHMD-DLOVCJGASA-N 0.000 description 6
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 6
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 6
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 6
- BQFGKVYHKCNEMF-DCAQKATOSA-N His-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 BQFGKVYHKCNEMF-DCAQKATOSA-N 0.000 description 6
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 6
- PNUCWVAGVNLUMW-CIUDSAMLSA-N Leu-Cys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O PNUCWVAGVNLUMW-CIUDSAMLSA-N 0.000 description 6
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 6
- XBCWOTOCBXXJDG-BZSNNMDCSA-N Leu-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 XBCWOTOCBXXJDG-BZSNNMDCSA-N 0.000 description 6
- DNWBUCHHMRQWCZ-GUBZILKMSA-N Lys-Ser-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DNWBUCHHMRQWCZ-GUBZILKMSA-N 0.000 description 6
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 6
- 206010028980 Neoplasm Diseases 0.000 description 6
- 241000277329 Oncorhynchus keta Species 0.000 description 6
- 108090000526 Papain Proteins 0.000 description 6
- 241000235648 Pichia Species 0.000 description 6
- JJKSSJVYOVRJMZ-FXQIFTODSA-N Ser-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)CN=C(N)N JJKSSJVYOVRJMZ-FXQIFTODSA-N 0.000 description 6
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 6
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 6
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 6
- 238000010367 cloning Methods 0.000 description 6
- 230000006378 damage Effects 0.000 description 6
- 235000019688 fish Nutrition 0.000 description 6
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 6
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 6
- 102000051463 human CST1 Human genes 0.000 description 6
- 102000051461 human CST2 Human genes 0.000 description 6
- 102000051460 human CST4 Human genes 0.000 description 6
- 108010027338 isoleucylcysteine Proteins 0.000 description 6
- 108010003700 lysyl aspartic acid Proteins 0.000 description 6
- 108010054155 lysyllysine Proteins 0.000 description 6
- 108010056582 methionylglutamic acid Proteins 0.000 description 6
- 230000004481 post-translational protein modification Effects 0.000 description 6
- 230000002441 reversible effect Effects 0.000 description 6
- 239000013598 vector Substances 0.000 description 6
- QCVGEOXPDFCNHA-UHFFFAOYSA-N 5,5-dimethyl-2,4-dioxo-1,3-oxazolidine-3-carboxamide Chemical compound CC1(C)OC(=O)N(C(N)=O)C1=O QCVGEOXPDFCNHA-UHFFFAOYSA-N 0.000 description 5
- 241000251468 Actinopterygii Species 0.000 description 5
- 241000972773 Aulopiformes Species 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 5
- 108090000790 Enzymes Proteins 0.000 description 5
- 241000233866 Fungi Species 0.000 description 5
- 241000235058 Komagataella pastoris Species 0.000 description 5
- 201000010099 disease Diseases 0.000 description 5
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 5
- 229940088598 enzyme Drugs 0.000 description 5
- 239000000203 mixture Substances 0.000 description 5
- 235000019834 papain Nutrition 0.000 description 5
- 229940055729 papain Drugs 0.000 description 5
- 238000002360 preparation method Methods 0.000 description 5
- 235000019515 salmon Nutrition 0.000 description 5
- 238000012163 sequencing technique Methods 0.000 description 5
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 4
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 4
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 4
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 4
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 4
- CUQUEHYSSFETRD-ACZMJKKPSA-N Asn-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N CUQUEHYSSFETRD-ACZMJKKPSA-N 0.000 description 4
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 4
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 4
- SJLDOGLMVPHPLZ-IHRRRGAJSA-N Asp-Met-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SJLDOGLMVPHPLZ-IHRRRGAJSA-N 0.000 description 4
- 241000894006 Bacteria Species 0.000 description 4
- 108010005843 Cysteine Proteases Proteins 0.000 description 4
- 102000005927 Cysteine Proteases Human genes 0.000 description 4
- 102000053602 DNA Human genes 0.000 description 4
- IPHGBVYWRKCGKG-FXQIFTODSA-N Gln-Cys-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O IPHGBVYWRKCGKG-FXQIFTODSA-N 0.000 description 4
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 4
- BJPPYOMRAVLXBY-YUMQZZPRSA-N Gln-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N BJPPYOMRAVLXBY-YUMQZZPRSA-N 0.000 description 4
- RWCBJYUPAUTWJD-NHCYSSNCSA-N Gln-Met-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O RWCBJYUPAUTWJD-NHCYSSNCSA-N 0.000 description 4
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 4
- XIJOPMSILDNVNJ-ZVZYQTTQSA-N Glu-Val-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XIJOPMSILDNVNJ-ZVZYQTTQSA-N 0.000 description 4
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 4
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 4
- ZWRDOVYMQAAISL-UWVGGRQHSA-N Gly-Met-Lys Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCCN ZWRDOVYMQAAISL-UWVGGRQHSA-N 0.000 description 4
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 4
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 4
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 4
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 4
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 4
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 4
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 4
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 4
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 4
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 4
- 108010065395 Neuropep-1 Proteins 0.000 description 4
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 4
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 4
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 4
- AIZVVCMAFRREQS-GUBZILKMSA-N Pro-Cys-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AIZVVCMAFRREQS-GUBZILKMSA-N 0.000 description 4
- DRIJZWBRGMJCDD-DCAQKATOSA-N Pro-Gln-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O DRIJZWBRGMJCDD-DCAQKATOSA-N 0.000 description 4
- OFSZYRZOUMNCCU-BZSNNMDCSA-N Pro-Trp-Met Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCSC)C(O)=O)C(=O)[C@@H]1CCCN1 OFSZYRZOUMNCCU-BZSNNMDCSA-N 0.000 description 4
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 4
- FZEUTKVQGMVGHW-AVGNSLFASA-N Ser-Phe-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZEUTKVQGMVGHW-AVGNSLFASA-N 0.000 description 4
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 4
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 4
- GEGYPBOPIGNZIF-CWRNSKLLSA-N Trp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O GEGYPBOPIGNZIF-CWRNSKLLSA-N 0.000 description 4
- UUZYQOUJTORBQO-ZVZYQTTQSA-N Trp-Val-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 UUZYQOUJTORBQO-ZVZYQTTQSA-N 0.000 description 4
- AZZLDIDWPZLCCW-ZEWNOJEFSA-N Tyr-Ile-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O AZZLDIDWPZLCCW-ZEWNOJEFSA-N 0.000 description 4
- GZUIDWDVMWZSMI-KKUMJFAQSA-N Tyr-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GZUIDWDVMWZSMI-KKUMJFAQSA-N 0.000 description 4
- 108010064997 VPY tripeptide Proteins 0.000 description 4
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 4
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 4
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 4
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 4
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 4
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 4
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 4
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 4
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 4
- 108010036533 arginylvaline Proteins 0.000 description 4
- 108010092854 aspartyllysine Proteins 0.000 description 4
- 238000003556 assay Methods 0.000 description 4
- 201000011510 cancer Diseases 0.000 description 4
- 230000015556 catabolic process Effects 0.000 description 4
- 229940079593 drug Drugs 0.000 description 4
- 239000003814 drug Substances 0.000 description 4
- 239000013604 expression vector Substances 0.000 description 4
- 108010078144 glutaminyl-glycine Proteins 0.000 description 4
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 4
- 230000009545 invasion Effects 0.000 description 4
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 4
- 230000001404 mediated effect Effects 0.000 description 4
- 239000003471 mutagenic agent Substances 0.000 description 4
- 231100000707 mutagenic chemical Toxicity 0.000 description 4
- 230000003505 mutagenic effect Effects 0.000 description 4
- VBEGHXKAFSLLGE-UHFFFAOYSA-N n-phenylnitramide Chemical compound [O-][N+](=O)NC1=CC=CC=C1 VBEGHXKAFSLLGE-UHFFFAOYSA-N 0.000 description 4
- 108010024607 phenylalanylalanine Proteins 0.000 description 4
- 108091033319 polynucleotide Proteins 0.000 description 4
- 102000040430 polynucleotide Human genes 0.000 description 4
- 239000002157 polynucleotide Substances 0.000 description 4
- 102000004196 processed proteins & peptides Human genes 0.000 description 4
- 108010073969 valyllysine Proteins 0.000 description 4
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 3
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 3
- 108700010070 Codon Usage Proteins 0.000 description 3
- 241000711573 Coronaviridae Species 0.000 description 3
- 102000002322 Egg Proteins Human genes 0.000 description 3
- 108010000912 Egg Proteins Proteins 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- 241000238631 Hexapoda Species 0.000 description 3
- 241000282412 Homo Species 0.000 description 3
- 241000222722 Leishmania <genus> Species 0.000 description 3
- 241000033345 Merluccius productus Species 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 3
- 241000277275 Oncorhynchus mykiss Species 0.000 description 3
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 3
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 3
- 238000007792 addition Methods 0.000 description 3
- 235000004279 alanine Nutrition 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- 239000013599 cloning vector Substances 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- 238000001816 cooling Methods 0.000 description 3
- 238000006731 degradation reaction Methods 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 235000014103 egg white Nutrition 0.000 description 3
- 210000000969 egg white Anatomy 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- 239000012634 fragment Substances 0.000 description 3
- 239000002674 ointment Substances 0.000 description 3
- 239000000843 powder Substances 0.000 description 3
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 230000002797 proteolythic effect Effects 0.000 description 3
- 235000000346 sugar Nutrition 0.000 description 3
- 230000000699 topical effect Effects 0.000 description 3
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 2
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 2
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 2
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 2
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 2
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 2
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 2
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 2
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 2
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 2
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 2
- AUFACLFHBAGZEN-ZLUOBGJFSA-N Ala-Ser-Cys Chemical compound N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O AUFACLFHBAGZEN-ZLUOBGJFSA-N 0.000 description 2
- 108010025188 Alcohol oxidase Proteins 0.000 description 2
- YSUVMPICYVWRBX-VEVYYDQMSA-N Arg-Asp-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YSUVMPICYVWRBX-VEVYYDQMSA-N 0.000 description 2
- JTWOBPNAVBESFW-FXQIFTODSA-N Arg-Cys-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N JTWOBPNAVBESFW-FXQIFTODSA-N 0.000 description 2
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 2
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 2
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 2
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 2
- ZEBDYGZVMMKZNB-SRVKXCTJSA-N Arg-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N ZEBDYGZVMMKZNB-SRVKXCTJSA-N 0.000 description 2
- DNBMCNQKNOKOSD-DCAQKATOSA-N Arg-Pro-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O DNBMCNQKNOKOSD-DCAQKATOSA-N 0.000 description 2
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 2
- AMIQZQAAYGYKOP-FXQIFTODSA-N Arg-Ser-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O AMIQZQAAYGYKOP-FXQIFTODSA-N 0.000 description 2
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 2
- LLQIAIUAKGNOSE-NHCYSSNCSA-N Arg-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N LLQIAIUAKGNOSE-NHCYSSNCSA-N 0.000 description 2
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 2
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 2
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 2
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 2
- QISZHYWZHJRDAO-CIUDSAMLSA-N Asn-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N QISZHYWZHJRDAO-CIUDSAMLSA-N 0.000 description 2
- BGINHSZTXRJIPP-FXQIFTODSA-N Asn-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BGINHSZTXRJIPP-FXQIFTODSA-N 0.000 description 2
- NKTLGLBAGUJEGA-BIIVOSGPSA-N Asn-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N)C(=O)O NKTLGLBAGUJEGA-BIIVOSGPSA-N 0.000 description 2
- SQZIAWGBBUSSPJ-ZKWXMUAHSA-N Asn-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N SQZIAWGBBUSSPJ-ZKWXMUAHSA-N 0.000 description 2
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 2
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 2
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 2
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 2
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 2
- GOPFMQJUQDLUFW-LKXGYXEUSA-N Asn-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O GOPFMQJUQDLUFW-LKXGYXEUSA-N 0.000 description 2
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 2
- QUCCLIXMVPIVOB-BZSNNMDCSA-N Asn-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N QUCCLIXMVPIVOB-BZSNNMDCSA-N 0.000 description 2
- UWMIZBCTVWVMFI-FXQIFTODSA-N Asp-Ala-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UWMIZBCTVWVMFI-FXQIFTODSA-N 0.000 description 2
- BLQBMRNMBAYREH-UWJYBYFXSA-N Asp-Ala-Tyr Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O BLQBMRNMBAYREH-UWJYBYFXSA-N 0.000 description 2
- MUWDILPCTSMUHI-ZLUOBGJFSA-N Asp-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)O MUWDILPCTSMUHI-ZLUOBGJFSA-N 0.000 description 2
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 2
- FMWHSNJMHUNLAG-FXQIFTODSA-N Asp-Cys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FMWHSNJMHUNLAG-FXQIFTODSA-N 0.000 description 2
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 description 2
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 2
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 2
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 2
- HJZLUGQGJWXJCJ-CIUDSAMLSA-N Asp-Pro-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJZLUGQGJWXJCJ-CIUDSAMLSA-N 0.000 description 2
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 2
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 2
- 102000004506 Blood Proteins Human genes 0.000 description 2
- 108010017384 Blood Proteins Proteins 0.000 description 2
- 241001465180 Botrytis Species 0.000 description 2
- 102000005600 Cathepsins Human genes 0.000 description 2
- 108010084457 Cathepsins Proteins 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- RRIJEABIXPKSGP-FXQIFTODSA-N Cys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CS RRIJEABIXPKSGP-FXQIFTODSA-N 0.000 description 2
- NLCZGISONIGRQP-DCAQKATOSA-N Cys-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N NLCZGISONIGRQP-DCAQKATOSA-N 0.000 description 2
- MGAWEOHYNIMOQJ-ACZMJKKPSA-N Cys-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MGAWEOHYNIMOQJ-ACZMJKKPSA-N 0.000 description 2
- UYYZZJXUVIZTMH-AVGNSLFASA-N Cys-Glu-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UYYZZJXUVIZTMH-AVGNSLFASA-N 0.000 description 2
- DQUWSUWXPWGTQT-DCAQKATOSA-N Cys-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CS DQUWSUWXPWGTQT-DCAQKATOSA-N 0.000 description 2
- HMWBPUDETPKSSS-DCAQKATOSA-N Cys-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CCCCN)C(=O)O HMWBPUDETPKSSS-DCAQKATOSA-N 0.000 description 2
- HJXSYJVCMUOUNY-SRVKXCTJSA-N Cys-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N HJXSYJVCMUOUNY-SRVKXCTJSA-N 0.000 description 2
- 108010061642 Cystatin C Proteins 0.000 description 2
- 241000255601 Drosophila melanogaster Species 0.000 description 2
- 241000287830 Gallus sp. Species 0.000 description 2
- WUAYFMZULZDSLB-ACZMJKKPSA-N Gln-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O WUAYFMZULZDSLB-ACZMJKKPSA-N 0.000 description 2
- RMOCFPBLHAOTDU-ACZMJKKPSA-N Gln-Asn-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RMOCFPBLHAOTDU-ACZMJKKPSA-N 0.000 description 2
- XFKUFUJECJUQTQ-CIUDSAMLSA-N Gln-Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XFKUFUJECJUQTQ-CIUDSAMLSA-N 0.000 description 2
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 2
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 2
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 2
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 2
- JKGHMESJHRTHIC-SIUGBPQLSA-N Gln-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JKGHMESJHRTHIC-SIUGBPQLSA-N 0.000 description 2
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 2
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 2
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 2
- NPMFDZGLKBNFOO-SRVKXCTJSA-N Gln-Pro-His Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NPMFDZGLKBNFOO-SRVKXCTJSA-N 0.000 description 2
- FGWRYRAVBVOHIB-XIRDDKMYSA-N Gln-Pro-Trp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O FGWRYRAVBVOHIB-XIRDDKMYSA-N 0.000 description 2
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 2
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 2
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 2
- KEBACWCLVOXFNC-DCAQKATOSA-N Glu-Arg-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KEBACWCLVOXFNC-DCAQKATOSA-N 0.000 description 2
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 2
- VFZIDQZAEBORGY-GLLZPBPUSA-N Glu-Gln-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VFZIDQZAEBORGY-GLLZPBPUSA-N 0.000 description 2
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 2
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 2
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 2
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 2
- WVWZIPOJECFDAG-AVGNSLFASA-N Glu-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N WVWZIPOJECFDAG-AVGNSLFASA-N 0.000 description 2
- UCZXXMREFIETQW-AVGNSLFASA-N Glu-Tyr-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O UCZXXMREFIETQW-AVGNSLFASA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 2
- LLXVQPKEQQCISF-YUMQZZPRSA-N Gly-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN LLXVQPKEQQCISF-YUMQZZPRSA-N 0.000 description 2
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 2
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 2
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 2
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 2
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 2
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 2
- OCPPBNKYGYSLOE-IUCAKERBSA-N Gly-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN OCPPBNKYGYSLOE-IUCAKERBSA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- BXDLTKLPPKBVEL-FJXKBIBVSA-N Gly-Thr-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O BXDLTKLPPKBVEL-FJXKBIBVSA-N 0.000 description 2
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 2
- WJUYPBBCSSLVJE-CIUDSAMLSA-N His-Asn-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N WJUYPBBCSSLVJE-CIUDSAMLSA-N 0.000 description 2
- RXVOMIADLXPJGW-GUBZILKMSA-N His-Asp-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RXVOMIADLXPJGW-GUBZILKMSA-N 0.000 description 2
- MLZVJIREOKTDAR-SIGLWIIPSA-N His-Ile-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MLZVJIREOKTDAR-SIGLWIIPSA-N 0.000 description 2
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 2
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 2
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 2
- LGMUPVWZEYYUMU-YVNDNENWSA-N Ile-Glu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LGMUPVWZEYYUMU-YVNDNENWSA-N 0.000 description 2
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 2
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 2
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 2
- CNNQBZRGQATKNY-DCAQKATOSA-N Leu-Arg-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N CNNQBZRGQATKNY-DCAQKATOSA-N 0.000 description 2
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 2
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 2
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 2
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 2
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 2
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 2
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 2
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 2
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 2
- URHJPNHRQMQGOZ-RHYQMDGZSA-N Leu-Thr-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O URHJPNHRQMQGOZ-RHYQMDGZSA-N 0.000 description 2
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 2
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 2
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 2
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 2
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 2
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 2
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 2
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 2
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 2
- IMDJSVBFQKDDEQ-MGHWNKPDSA-N Lys-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N IMDJSVBFQKDDEQ-MGHWNKPDSA-N 0.000 description 2
- LMMBAXJRYSXCOQ-ACRUOGEOSA-N Lys-Tyr-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O LMMBAXJRYSXCOQ-ACRUOGEOSA-N 0.000 description 2
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 2
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- 241000244206 Nematoda Species 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 2
- KAHUBGWSIQNZQQ-KKUMJFAQSA-N Phe-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KAHUBGWSIQNZQQ-KKUMJFAQSA-N 0.000 description 2
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 2
- RJYBHZVWJPUSLB-QEWYBTABSA-N Phe-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N RJYBHZVWJPUSLB-QEWYBTABSA-N 0.000 description 2
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 2
- RVRRHFPCEOVRKQ-KKUMJFAQSA-N Phe-His-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RVRRHFPCEOVRKQ-KKUMJFAQSA-N 0.000 description 2
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 2
- 241000709664 Picornaviridae Species 0.000 description 2
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 2
- PTLOFJZJADCNCD-DCAQKATOSA-N Pro-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 PTLOFJZJADCNCD-DCAQKATOSA-N 0.000 description 2
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 2
- GNADVDLLGVSXLS-ULQDDVLXSA-N Pro-Phe-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O GNADVDLLGVSXLS-ULQDDVLXSA-N 0.000 description 2
- GNFHQWNCSSPOBT-ULQDDVLXSA-N Pro-Trp-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCC(=O)N)C(=O)O GNFHQWNCSSPOBT-ULQDDVLXSA-N 0.000 description 2
- YIPFBJGBRCJJJD-FHWLQOOXSA-N Pro-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 YIPFBJGBRCJJJD-FHWLQOOXSA-N 0.000 description 2
- 229920002684 Sepharose Polymers 0.000 description 2
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 2
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 2
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 2
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 2
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 2
- RFBKULCUBJAQFT-BIIVOSGPSA-N Ser-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CO)N)C(=O)O RFBKULCUBJAQFT-BIIVOSGPSA-N 0.000 description 2
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 2
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 2
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 2
- JLPMFVAIQHCBDC-CIUDSAMLSA-N Ser-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N JLPMFVAIQHCBDC-CIUDSAMLSA-N 0.000 description 2
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 2
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 2
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 2
- 108010022999 Serine Proteases Proteins 0.000 description 2
- 102000012479 Serine Proteases Human genes 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 240000003768 Solanum lycopersicum Species 0.000 description 2
- 241000295644 Staphylococcaceae Species 0.000 description 2
- 241000194017 Streptococcus Species 0.000 description 2
- NLJKZUGAIIRWJN-LKXGYXEUSA-N Thr-Asp-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O NLJKZUGAIIRWJN-LKXGYXEUSA-N 0.000 description 2
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 2
- UTCFSBBXPWKLTG-XKBZYTNZSA-N Thr-Cys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O UTCFSBBXPWKLTG-XKBZYTNZSA-N 0.000 description 2
- KWQBJOUOSNJDRR-XAVMHZPKSA-N Thr-Cys-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N)O KWQBJOUOSNJDRR-XAVMHZPKSA-N 0.000 description 2
- UZJDBCHMIQXLOQ-HEIBUPTGSA-N Thr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O UZJDBCHMIQXLOQ-HEIBUPTGSA-N 0.000 description 2
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 2
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 2
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 2
- WBCCCPZIJIJTSD-TUBUOCAGSA-N Thr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H]([C@@H](C)O)N WBCCCPZIJIJTSD-TUBUOCAGSA-N 0.000 description 2
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 2
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 2
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 2
- TZQWJCGVCIJDMU-HEIBUPTGSA-N Thr-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N)O TZQWJCGVCIJDMU-HEIBUPTGSA-N 0.000 description 2
- DVIIYMVCSUQOJG-QEJZJMRPSA-N Trp-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DVIIYMVCSUQOJG-QEJZJMRPSA-N 0.000 description 2
- WTXQBCCKXIKKHB-JYJNAYRXSA-N Tyr-Arg-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WTXQBCCKXIKKHB-JYJNAYRXSA-N 0.000 description 2
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 2
- BVWADTBVGZHSLW-IHRRRGAJSA-N Tyr-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N BVWADTBVGZHSLW-IHRRRGAJSA-N 0.000 description 2
- QUILOGWWLXMSAT-IHRRRGAJSA-N Tyr-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QUILOGWWLXMSAT-IHRRRGAJSA-N 0.000 description 2
- HVPPEXXUDXAPOM-MGHWNKPDSA-N Tyr-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HVPPEXXUDXAPOM-MGHWNKPDSA-N 0.000 description 2
- BGFCXQXETBDEHP-BZSNNMDCSA-N Tyr-Phe-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O BGFCXQXETBDEHP-BZSNNMDCSA-N 0.000 description 2
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 2
- KLQPIEVIKOQRAW-IZPVPAKOSA-N Tyr-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KLQPIEVIKOQRAW-IZPVPAKOSA-N 0.000 description 2
- TYGHOWWWMTWVKM-HJOGWXRNSA-N Tyr-Tyr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 TYGHOWWWMTWVKM-HJOGWXRNSA-N 0.000 description 2
- AGDDLOQMXUQPDY-BZSNNMDCSA-N Tyr-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O AGDDLOQMXUQPDY-BZSNNMDCSA-N 0.000 description 2
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 2
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 2
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 2
- IQQYYFPCWKWUHW-YDHLFZDLSA-N Val-Asn-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N IQQYYFPCWKWUHW-YDHLFZDLSA-N 0.000 description 2
Classifications
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/81—Protease inhibitors
- C07K14/8107—Endopeptidase (E.C. 3.4.21-99) inhibitors
- C07K14/8139—Cysteine protease (E.C. 3.4.22) inhibitors, e.g. cystatin
Definitions
- The present invention relates to protease inhibitors, specifically to cystatins, that have been modified by glycosylation in order to enhance stability and activity; to methods of making such modified protease inhibitors; and to methods of using such modified protease inhibitors to inhibit proteolysis of a protein substrate.
- Proteases are enzymes that degrade proteins. Proteases are classified by the substrate upon which they act and include serine proteases, cysteine proteases, aspartate proteases and metalloproteinases. Serine and cysteine proteases are widespread and are found in diverse organisms including eukaryotic and prokaryotic animals and plants. Cysteine proteases are generally well-characterized enzymes having a known structure composed of alpha helices and beta pleated sheets (8).
- Proteases mediate many processes that are harmful to man, either by producing pathology or by causing economic loss, for instance by degrading foods.
- Protease-mediated pathology is known to be caused by a wide variety of organisms including bacteria, such as staphylococci and streptococci; fungi; arthropods; nematodes; protozoa such as amoebas, intestinal flagellates and haemoflagellates, such as Leishmania and trypanosomes; and helminths.
- Proteases are known to be important in the pathology of certain viruses (9, 11, 12, 31) including Polio virus, Herpes virus, Corona virus, HIV and Rotavirus. Proteases are also known to play a role in various diseases with no clear etiological agent, such as muscular dystrophy (7) and cancers (1) including breast cancer (2), and amyloid angiopathy, a genetic disease that often leads to fatal cerebral hemorrhages in young adults (10, 13).
- Proteases are also responsible for the spoilage of economically important foodstuffs, necessitating huge annual expenditures on preventative measures.
- Botrytis cinerea causes widespread disease in over thirty species of commercial crop plants. Molds and fungi are the major destroyers of citrus fruit crops. Foods rich in protein, such as meat and fish, are degraded and made inedible by proteases from Pseudomonads and other bacteria. Foods with a high muscle content may be quickly broken down by endogenous proteases released from tissues upon death. These enzymes degrade myosin and destroy the texture of the food.
- Surimi is a form of processed minced fish, commonly made from Pacific Whiting (Merluccius productus), and is the main ingredient of seafood analogs such as “imitation crab meat”.
- Surimi is an important source of relatively cheap, high quality, low fat protein important to the diet of many people in the Far East and of increasing economic significance worldwide. Endogenous proteolytic enzymes released during surimi production cause rapid degradation of muscle tissue and lead to poor quality surimi (3, 4, 5, 6, 11, 17, 18, 19). It is thought that the protease released from the fish tissue is a cathepsin, which is a common cysteine protease (3).
- Protease inhibitors have been investigated for their potential role in preventing disease and degradation of foodstuffs (4, 5, 18).
- Partially refined substances that contain protease inhibitors are commonly used in food processing, for instance, to prevent proteolytic breakdown of fish protein during the production of surimi (4, 17, 18).
- The most commonly used food-grade protease inhibitors are beef plasma protein (BPP), egg white powder and potato powder (4).
- Genetic engineering techniques have been used to introduce protease-inhibitor genes from chickens into cereal and grass plants to control protease-producing plant pathogens.
- A plant protease inhibitor gene from cowpea has been recombinantly introduced into tobacco, tomato, cotton and other plants to inhibit destruction by nematode worms (16).
- Cystatins are cysteine protease inhibitors that are members of Family 2 of the cystatin superfamily, characterized by a single chain of about 115 to 122 amino acids with a molecular weight of about 13000, having two disulfide bonds (7, 8, 10). Cystatins and protease enzymes such as cathepsins form tight (but reversible) enzyme-inhibitor complexes with dissociation constants typically in the nanomolar range (10).
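To give a feel for what a nanomolar dissociation constant implies, the following is a minimal sketch, not part of the patent, that assumes simple 1:1 reversible binding with the inhibitor in large excess over the protease, so that the bound fraction is [I] / ([I] + Kd). The function name `fraction_inhibited` and the example concentrations are illustrative assumptions; for very tight binding at comparable enzyme and inhibitor concentrations this approximation would not hold.

```python
def fraction_inhibited(inhibitor_nM: float, kd_nM: float) -> float:
    """Fraction of protease held in the enzyme-inhibitor complex at equilibrium.

    Assumes 1:1 reversible binding and that free inhibitor is approximately
    equal to total inhibitor (inhibitor in large excess over protease).
    """
    if inhibitor_nM < 0 or kd_nM <= 0:
        raise ValueError("concentration must be >= 0 and Kd must be > 0")
    return inhibitor_nM / (inhibitor_nM + kd_nM)


if __name__ == "__main__":
    kd = 1.0  # hypothetical Kd in nM, chosen only to illustrate the nanomolar range
    for conc in (0.1, 1.0, 10.0, 100.0):
        print(f"[I] = {conc:6.1f} nM -> bound fraction {fraction_inhibited(conc, kd):.3f}")
```

Under these assumptions, half of the protease is complexed when the inhibitor concentration equals Kd, which is why a nanomolar Kd corresponds to effective inhibition at very low inhibitor concentrations.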
- A number of cystatins have been characterized including human cystatins (C, S, SN, SA, D, M and E), mouse cystatin, egg-white cystatin, bovine cystatin, carp cystatin, trout cystatin and salmon cystatin.
- Cystatins protect the body by inhibiting the potentially harmful effects of proteolysis, and may prevent destruction of connective tissue by protease enzymes, for instance, lysosomal proteases, released from dying or damaged cells (16).
- Cystatins have been investigated for potential medical applications, for instance, to inhibit replication and pathology of Picornaviruses (12), Coronaviruses (9) and Herpes Simplex type 1 virus (15, 16, 30). Cystatins may also play a natural role in prevention of bacterial infection by E. coli, Shigella (13), Leishmania, Schistosoma and Entamoeba (10), which appear to use proteases to facilitate tissue invasion.
- Cystatins are likely the primary protease inhibitors in food-grade protease inhibitor preparations such as beef plasma protein and egg white powder. Since cystatins are themselves proteins, they are prone to denaturation and loss of activity when exposed to unfavorable temperatures or pH. Many food production processes, including surimi production, involve elevated temperatures (17). Presently, in order to maintain cystatin activity, more cystatin must be added after cooling. Adding additional cystatin is both labor-intensive and expensive. Also, when cystatins are used for medical treatment, either as a topical or ingested medication, it is preferable for the cystatin-containing composition to be sterile. A common method of sterilization involves treatment with elevated temperatures. A cystatin that could maintain activity despite exposure to elevated temperatures would thus be useful in food processing and in drug formulation.
- The present invention provides modified, glycosylated, heat-stable cystatins and methods of making and using these cystatins.
- The present invention also provides nucleic acid molecules encoding such cystatins.
- The nucleic acid molecules of the invention have been modified so that when such a nucleic acid molecule is expressed in a eukaryotic cell, certain amino-acid residues of the expressed cystatin protein are glycosylated during post-translational modification of the protein.
- The resulting mature protein has attached, at specific amino acid residues, sugar molecule chains of varying length.
- The present invention includes the nucleic acid molecules that encode modified cystatins based on the cystatins from humans (C, S, SN, SA, D, M and E), egg white, cow, carp, trout and salmon.
- Various residues in the cystatin primary amino acid sequence have been identified where the introduction of glycosylated residues increases heat stability of the expressed protein without severely affecting enzymatic activity.
- The sites for glycosylation include amino acid residues at positions 35, 36 and 79.
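As an illustration of how candidate glycosylation sites can be located in a sequence, the sketch below assumes the engineered sites are N-linked sequons of the general form Asn-X-Ser/Thr (X being any residue except Pro). This section does not itself state that motif, so it is an assumption here. The helper names `find_sequons` and `substitute` and the short toy peptide are hypothetical and are not taken from the patent's sequence listing.

```python
import re

# Assumed N-linked sequon: Asn, then any residue except Pro, then Ser or Thr.
# The lookahead lets overlapping sequons be reported.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_sequons(seq: str) -> list[int]:
    """Return 1-based positions of Asn residues that begin an N-X-S/T sequon."""
    return [m.start() + 1 for m in SEQUON.finditer(seq.upper())]


def substitute(seq: str, position: int, residue: str) -> str:
    """Return a copy of seq with the residue at 1-based position replaced."""
    if not 1 <= position <= len(seq):
        raise ValueError("position outside sequence")
    return seq[: position - 1] + residue + seq[position:]


if __name__ == "__main__":
    toy = "MKAGSTLPKASQ"              # hypothetical peptide, not a cystatin
    mutant = substitute(toy, 9, "N")  # Lys -> Asn creates N-A-S
    print(find_sequons(toy), find_sequons(mutant))
```

A scan of this kind only flags sequence motifs. Whether a given site is actually glycosylated, and whether stability and activity are preserved, has to be determined experimentally.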
- The present invention also includes a method of making modified heat-stable cystatins by modifying the nucleic acid molecules that encode cystatins. Such nucleic acid molecules are modified at certain defined sites and expressed in eukaryotic cells.
- The present invention also includes a cell that contains at least one nucleic acid molecule encoding at least one modified, glycosylated, heat-stable cystatin.
- The cells of the invention may be of many types; for instance, they may be cells from a yeast, a mammal, an insect, or a plant.
- The invention also includes methods of inhibiting proteolysis of a protein substrate by contacting the protein substrate with a modified heat-stable cystatin having at least one engineered glycosylation site. Such a method may be applied, for example, to food processing, such as the production of surimi.
- The invention also includes a method of treating a protease-mediated pathology of an organism, such as a mammal, a fish or a plant, by administering to the organism a modified heat-stable cystatin of the invention.
- A modified heat-stable cystatin of the invention contacts the protease that mediates the pathology, thereby inhibiting proteolysis by the protease and thereby treating the pathology.
- SEQ ID NO: 1 shows the cDNA sequence and the amino acid sequence of native human cystatin C.
- SEQ ID NO: 3 shows the cDNA sequence and the amino acid sequence of native human cystatin S.
- SEQ ID NO: 4 shows the amino acid sequence of native human cystatin S.
- SEQ ID NO: 5 shows the cDNA sequence and the amino acid sequence of native human cystatin SN.
- SEQ ID NO: 6 shows the amino acid sequence of native human cystatin SN.
- SEQ ID NO: 7 shows the cDNA sequence and the amino acid sequence of native human cystatin SA.
- SEQ ID NO: 8 shows the amino acid sequence of native human cystatin SA.
- SEQ ID NO: 9 shows the cDNA sequence and the amino acid sequence of native human cystatin D.
- SEQ ID NO: 10 shows the amino acid sequence of native human cystatin D.
- SEQ ID NO: 11 shows the cDNA sequence and the amino acid sequence of native human cystatin M.
- SEQ ID NO: 12 shows the amino acid sequence of native human cystatin M.
- SEQ ID NO: 13 shows the cDNA sequence and the amino acid sequence of native human cystatin E.
- SEQ ID NO: 14 shows the amino acid sequence of native human cystatin E.
- SEQ ID NO: 15 shows the cDNA sequence and the amino acid sequence of native egg white cystatin.
- SEQ ID NO: 16 shows the amino acid sequence of native egg white cystatin.
- SEQ ID NO: 17 shows the cDNA sequence and the amino acid sequence of native carp cystatin.
- SEQ ID NO: 19 shows the cDNA sequence and the amino acid sequence of native salmon cystatin.
- SEQ ID NO: 20 shows the amino acid sequence of native salmon cystatin.
- SEQ ID NO: 21 shows the cDNA sequence and the amino acid sequence of native trout cystatin.
- SEQ ID NO: 24 shows the amino acid sequence of native bovine cystatin.
- SEQ ID NO: 25 shows the first of four oligonucleotides used to create a nucleotide coding for modified human cystatin C.
- SEQ ID NO: 26 shows the second of four oligonucleotides used to create a nucleotide coding for synthetic human cystatin C.
- SEQ ID NO: 27 shows the third of four oligonucleotides used to create a nucleotide coding for synthetic human cystatin C.
- SEQ ID NO: 28 shows the fourth of four oligonucleotides used to create a nucleotide coding for synthetic human cystatin C.
- SEQ ID NO: 29 shows a forward primer used in site-directed mutagenesis to introduce a glycosylation site at residue 35 of a modified human cystatin C.
- SEQ ID NO: 30 is a reverse primer used in site-directed mutagenesis to introduce a glycosylation site at residue 35 of a modified human cystatin C.
- SEQ ID NO: 32 is the reverse primer used in site-directed mutagenesis to introduce a glycosylation site at residue 36 of a modified human cystatin C.
- FIG. 1 shows the native amino acid sequence of the mature human cystatin C peptide without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.
- FIG. 2 shows the native amino acid sequence of the mature human cystatin S peptide without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.
- FIG. 3 shows the native amino acid sequence of the mature human cystatin SN peptide without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.
- FIG. 4 shows the native amino acid sequence of the mature human cystatin SA peptide without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.
- FIG. 5 shows the native amino acid sequence of the mature human cystatin D peptide without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.
- FIG. 8 shows the native amino acid sequence of the mature Egg White cystatin peptide without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.
- FIG. 12 shows the native amino acid sequence of the mature chum salmon cystatin without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.
- Heat stability refers to the ability of a protein to function at high temperatures. Proteins typically lose activity as the temperature is raised above their normal in vivo operating range, and generally become denatured as the temperature increases further. A modified protein may be more heat stable than the unmodified form of the protein, meaning that the modified form of the protein retains greater activity at higher temperatures than the unmodified form of the protein.
- to say that a feature has been introduced means that a non-wild-type feature has been intentionally, artificially added. For instance, a glycosylation site may be said to be introduced into a protein if it has been intentionally, artificially added to the protein. Likewise, a nucleotide may be said to be introduced into a cell or other nucleotide sequence when it has been intentionally, artificially added to that cell or sequence, and a deletion may be introduced into a DNA sequence. Amino acid substitutions, additions and deletions may likewise be introduced into a protein.
- a glycosylation site is a place on a molecule at which sugars may be added.
- the addition of sugars (glycosylation) may occur at introduced sites in modified proteins such as at amino acid residue 37 of the modified human cystatin C.
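As an illustration of what a glycosylation site looks like at the sequence level, the sketch below scans a one-letter amino acid sequence for the consensus N-linked glycosylation motif Asn-X-Ser/Thr, where X is any residue except proline. The motif is general biochemistry background rather than something defined in this text, and the function name `find_sequons` and the toy peptide are hypothetical.

```python
import re

# Consensus N-glycosylation sequon: Asn, any residue except Pro, then Ser or Thr.
# A zero-width lookahead is used so that overlapping sequons are all reported.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_sequons(sequence: str) -> list[int]:
    """Return the 1-based positions of Asn residues that start an N-X-S/T sequon."""
    return [match.start() + 1 for match in SEQUON.finditer(sequence.upper())]


if __name__ == "__main__":
    toy_peptide = "MKANASGNPTQLNGT"  # hypothetical peptide, not a cystatin sequence
    print(find_sequons(toy_peptide))
```

The presence of a sequon only indicates a potential site; whether a given site is actually glycosylated depends on the expression host and the structural context.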
- X the native amino acid that has been removed
- # the position of that amino acid in the protein
- Z the substituted amino acid that has been inserted in place of X.
- Ala (37) Ser denotes that Alanine at position 37 has been removed and replaced with Serine; likewise, for one-letter code, A (31) N denotes that Alanine at position 31 has been removed and replaced with Asparagine.
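The X (#) Z notation above can be handled mechanically. The following is a minimal sketch, assuming the position maps directly onto a 1-based index into the sequence string (which holds only if the string starts at residue 1 of the numbering used in the figures). The names `Substitution`, `parse_substitution` and `apply_substitution` are hypothetical helpers, not part of the disclosure.

```python
import re
from typing import NamedTuple

# Three-letter to one-letter codes for the 20 standard amino acids.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}
ONE_LETTER = set(THREE_TO_ONE.values())


class Substitution(NamedTuple):
    native: str       # X: one-letter code of the residue removed
    position: int     # #: 1-based position in the protein
    replacement: str  # Z: one-letter code of the residue inserted


_PATTERN = re.compile(r"^\s*([A-Za-z]{1,3})\s*\(\s*(\d+)\s*\)\s*([A-Za-z]{1,3})\s*$")


def _to_one_letter(code: str) -> str:
    """Accept a one-letter or three-letter amino acid code; return one letter."""
    if len(code) == 1 and code.upper() in ONE_LETTER:
        return code.upper()
    if code.capitalize() in THREE_TO_ONE:
        return THREE_TO_ONE[code.capitalize()]
    raise ValueError(f"unknown amino acid code: {code!r}")


def parse_substitution(text: str) -> Substitution:
    """Parse strings such as 'Ala (37) Ser' or 'A (31) N'."""
    match = _PATTERN.match(text)
    if match is None:
        raise ValueError(f"not in X (#) Z form: {text!r}")
    native, position, replacement = match.groups()
    return Substitution(_to_one_letter(native), int(position), _to_one_letter(replacement))


def apply_substitution(sequence: str, sub: Substitution) -> str:
    """Replace the native residue at sub.position, checking that it matches X."""
    index = sub.position - 1
    if not 0 <= index < len(sequence):
        raise ValueError(f"position {sub.position} is outside the sequence")
    if sequence[index] != sub.native:
        raise ValueError(
            f"expected {sub.native} at position {sub.position}, found {sequence[index]}"
        )
    return sequence[:index] + sub.replacement + sequence[index + 1:]


if __name__ == "__main__":
    print(parse_substitution("Ala (37) Ser"))
    print(apply_substitution("MKA", parse_substitution("Ala (3) Asn")))  # toy sequence
```

Checking the native residue before substituting reflects the point that the notation identifies a particular residue, not merely an offset into the sequence.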
- the numbering system of cystatin proteins in this text reflects that shown in the accompanying figures. While alternative numbering systems are possible, the nomenclature used herein is intended to identify a particular amino acid residue, rather than any residue that happens to be at a given distance from an arbitrary point on an amino acid sequence.
- An organism refers to any organism of any kingdom, phylum, class, order, family, genus or species.
- a cancer refers to any neoplastic transformation or any tissue that has undergone neoplasia and includes solid, non-solid, benign and malignant transformed tissue of both plants and animals.
- a protease-mediated pathology is any pathology that is caused in part or in whole by the action of a protease. For instance, the proteolytic invasion of human tissue by a cancer cell, the proteolytic destruction of human tissue by a bacterium or a protozoan (e.g., Pseudomonas or Leishmania) and the tissue destruction of a plant by a fungus (e.g., Phytophthora infestans) are all examples of protease-mediated pathologies.
- a polynucleotide (or a gene or genome) is recombinant means that it has been altered by the addition at some site of non-native nucleic acids or nucleotides, i.e., nucleotides that are not normally found in the particular polynucleotide, or that are not normally found at that site.
- a human cystatin gene that has been altered by nucleotide substitution is said to be recombinant.
- a recombinant protein is a protein that is the product of a recombinant polynucleotide and that contains non-native amino acid residues.
- Nucleic acid probes and primers may readily be prepared based on the nucleic acid sequences provided by this invention.
- a probe comprises an isolated nucleic acid attached to a detectable label or reporter molecule.
- Typical labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. Methods for labeling and guidance in the choice of labels appropriate for various purposes are well known in the field of molecular biology.
- Primers are short nucleic acids, preferably DNA oligonucleotides 15 nucleotides or more in length, which are annealed to a complementary target DNA strand by nucleic acid hybridization to form a hybrid between the primer and the target DNA strand, then extended along the target DNA strand by a DNA polymerase enzyme.
- Primer pairs can be used for amplification of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR) or other nucleic-acid amplification methods known in the art.
- Probes and primers as used in the present invention preferably comprise at least 15 nucleotides of the nucleic acid sequences that encode a modified cystatin protein. In order to enhance specificity, longer probes and primers may also be employed, such as probes and primers that comprise 20, 30 or 40 consecutive nucleotides of the disclosed nucleic acid sequences. Methods for preparing and using probes and primers are described in a number of reference works, for example Sambrook et al.
- PCR primer pairs can be derived from a known sequence, for example, by using computer programs intended for that purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge, Mass.).
- a first nucleic acid sequence is operably linked with a second nucleic acid sequence when the first nucleic acid sequence is placed in a functional relationship with the second nucleic acid sequence.
- a promoter is operably linked to a coding sequence if the promoter affects the transcription or expression of the coding sequence.
- operably linked DNA sequences are contiguous and, where necessary to join two protein coding regions, in the same reading frame.
- the term purified peptide does not require absolute purity; rather, it is intended as a relative term.
- a purified cystatin preparation is one in which cystatin is enriched compared to the cystatin in its natural environment, i.e., within a cell.
- a purified preparation of a cystatin is prepared such that the cystatin represents at least 20% of the total protein content of the preparation. Preparations comprising at least 50%, 75% or at least 90% cystatin (expressed as a percentage of total protein) may be desirable for certain applications.
- a series of anti-parallel and complementary nucleotides may be synthesized and annealed together to form synthetic double-stranded nucleotide fragments. These fragments may then be ligated together to produce a contiguous double-stranded nucleotide that codes for a particular cystatin having one or more engineered glycosylation sites.
- the synthetic nucleotides may be amplified in vitro using the Polymerase Chain Reaction (PCR) (27) so that many copies of the nucleotide are available for cloning into a prokaryotic cloning vector (26) or into an expression vector, as explained below.
- Additional glycosylation sites may be introduced using site-directed mutagenesis to add, delete or substitute particular amino acid residues.
- Various standard techniques are known to carry out site-directed mutagenesis (26, chapter 15), and commercial kits are also available, such as the QUICKCHANGE™ site-directed mutagenesis kit (STRATAGENE™, CA). Nucleotides may be modified or synthesized so as to include restriction enzyme sites that can be used for cloning.
- the modified nucleotide may be cloned into a standard prokaryotic cloning vector, for example pBR322, pUC18 or pUC19 (26, chapter 1).
- the sequence of the cloned nucleotide may be checked by sequencing using standard methods (26, chapters 1 and 13).
- the cloned expression vector may then be transformed into a particular cell type and the nucleotide expressed.
- Many different types of cell may be used to express the modified nucleic acid molecules. Examples of such cells include cells of yeasts, fungi, insects, humans and plants, including transformed and non-transformed cells.
- common mammalian cells that could be used for the invention include human HeLa cells, SW-527 human cells (ATCC deposit #7940), WISH cells (ATCC deposit #CCL-25), Daudi cells (ATCC deposit #CCL-213), Madin-Darby bovine kidney cells (ATCC deposit #CCL-22) and Chinese Hamster ovary cells (ATCC deposit #CRL-2092).
- Reptile cells that may be used include those from Russell's Viper (ATCC deposit #CCL-140). Plant cells that could be used include Chlamydomonas cells (ATCC deposit #30485), Arabidopsis cells (ATCC deposit #54069) and tomato plant cells (ATCC deposit #54003). Many of these cell types are commonly used and are available from the ATCC as well as from commercial suppliers such as PHARMACIA™ and INVITROGEN™.
- Expressed protein may be accumulated within a cell or may be secreted from the cell. Such expressed protein may then be collected and purified. This protein may then be characterized for activity and heat stability and may be used to practice the methods of the invention.
- the amino acid sequences of the cystatins are shown in their mature form without a signal peptide.
- the signal peptide is cleaved off during post-translational modification to produce the mature peptide.
- the invention may be equally practiced with cystatin peptides which retain the signal peptide.
- the present invention includes a general method of inhibiting proteolysis of a substrate by providing the modified cystatin and by contacting this cystatin with the substrate.
- a substrate may be any substrate containing protein.
- a modified cystatin may be used in food processing applications, in agricultural applications and in human and non-human medical applications.
- a protein substrate such as minced Pacific Whiting (as used for surimi production) may be treated with a modified cystatin of the invention to inhibit proteolysis caused by the release of endogenous protease from the fish tissue.
- Suggested concentrations for the application of modified cystatin are, for instance, for surimi processing, from about 1 μg/g to 100 μg/g of surimi.
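The suggested surimi concentration range translates into a batch quantity by simple arithmetic. The sketch below shows the conversion; the function name and the 50 kg batch size are hypothetical, and the range is the "about 1 to 100 micrograms per gram" figure stated above.

```python
# Suggested range from the text, in micrograms of cystatin per gram of surimi.
LOW_UG_PER_G = 1.0
HIGH_UG_PER_G = 100.0


def suggested_cystatin_range_mg(batch_kg: float) -> tuple[float, float]:
    """Return (low, high) cystatin mass in milligrams for a surimi batch in kg."""
    if batch_kg <= 0:
        raise ValueError("batch mass must be positive")
    batch_g = batch_kg * 1000.0
    # micrograms -> milligrams: divide by 1000
    return (batch_g * LOW_UG_PER_G / 1000.0, batch_g * HIGH_UG_PER_G / 1000.0)


if __name__ == "__main__":
    low_mg, high_mg = suggested_cystatin_range_mg(50.0)  # hypothetical 50 kg batch
    print(f"{low_mg:.0f} mg to {high_mg:.0f} mg of modified cystatin")
```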
- the present invention also includes methods of using a modified cystatin for the inhibition of proteases for therapeutic purposes.
- the cystatins of the invention may be used to inhibit tissue destruction and invasion by pathogens such as staphylococci and streptococci.
- Streptococcus is the etiological agent of the common skin disease impetigo, and certain particularly rapacious “flesh eating” strains of Staphylococcus and Streptococcus have recently received much media attention, particularly because of their multiple antibiotic resistance.
- the effectiveness of cystatins of the invention in preventing bacterial invasion of host tissues can be measured, for example, by the method of Betts and Finlay (32). This method can be used to determine tissue invasion of green monkey kidney cells (ATCC deposit #CCL-70) by bacteria such as E. coli and Salmonella typhimurium.
- the modified cystatin may be administered topically or systemically.
- the cystatin may be formulated with a carrier or pharmacologically acceptable excipient.
- the cystatin may be mixed with a carrier such as a petroleum-based or lanolin-based oil to form a gel, and administered topically at the site of infection, thereby contacting the modified cystatin with the protease and preventing tissue destruction.
- the cystatins of the present invention may also be administered systemically, either orally, intravenously, sub-cutaneously, transdermally or by other methods.
- the cystatin of the present invention may be formulated into a tablet or solution form and administered orally.
- Formulation of drugs into pills and tablets is well known in the art.
- Systemic administration may be used to treat conditions that cannot be treated topically, such as cancer and systemic viral infections of humans and animals.
- concentrations of modified cystatin in a cream or ointment may be from about 1 ng to 100 mg per gram total weight of ointment, or may be from about 1 μg to 1 mg total weight, or may be from about 10 μg to 100 μg total weight.
- the amount of modified cystatin per kg mass of a patient may be from about 1 ng to 100 mg, or may be from about 1 μg to 1 mg, or may be from about 10 μg to 100 μg.
- the cystatins of the present invention may be used to inhibit proteolysis caused by pathogens of crop plants and fruits.
- the modified cystatins may be applied to the surface of fruit or crops to inhibit proteases produced by plant pathogens such as fungi, for instance, Botrytis.
- the modified cystatin may, for instance, be sprayed or painted onto crops, either in a pure form or diluted in a carrier liquid such as water.
- the cystatins of the invention show superior heat resistance, maintaining a high degree of activity after exposure to elevated temperatures. These heat resistant cystatins may be particularly useful where sterility is required, such as in medical applications where it is generally desirable for medicines to be uncontaminated with biological organisms.
- the cystatins of the present invention may be formulated into topical ointments or pills which may then be packaged, sterilized and administered therapeutically.
- food processing, such as surimi production, involves elevated temperatures.
- the cystatins of the invention are useful in such processes due to their enhanced resistance.
- The same is so for all the other cystatins (FIGS. 2-12).
- the amino acid sequences for native and modified human cystatin SN are shown in FIG. 3 . Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention.
- the cDNA sequence for native human cystatin SN is shown in SEQ ID NO: 5.
- the human cystatin SN protein may be modified to introduce one or more glycosylation sites by the introduction of one or more of the following amino acid substitutions: Ala (31) Asn, Ala (38) Ser, Ala (38) Thr, Lys (37) Asn, Lys (81) Asn, Asp (82) Ser, Asp (82) Thr.
- the amino acid sequences for native and modified human cystatin SA are shown in FIG. 4 . Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention.
- the cDNA sequence for native human cystatin SA is shown in SEQ ID NO: 7.
- the human cystatin SA protein may be modified by the introduction of the following amino acid substitutions: Val (31) Asn, Ala (38) Ser, Ala (38) Thr, Lys (37) Asn, Asp (82) Ser, Asp (82) Thr, Leu (81) Asn.
- the amino acid sequences for native and modified human cystatin D are shown in FIG. 5 . Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention.
- the cDNA sequence for native human cystatin D is shown in SEQ ID NO: 9.
- the human cystatin D protein may be modified by the introduction of the following amino acid substitutions: Ala (31) Asn, Val (38) Ser, Val (38) Thr, Asp (42) Ser, Asp (42) Thr, Asp (83) Ser, Asp (83) Thr, Pro (86) Ser, Pro (86) Thr, Gln (90) Ser, Gln (90) Thr, Tyr (44) Asn.
- the human cystatin M protein may be modified by the introduction of the following amino acid substitutions: Val (35) Asn, Met (40) Asn, Gly (41) Ser, Gly (41) Thr, Ser (42) Asn, Ile (45) Ser, Ile (45) Thr, Arg (78) Asn, Arg (81) Asn, Asp (88) Asn, Leu (89) Asn.
- the amino acid sequences for native and modified human cystatin E are shown in FIG. 7 . Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention.
- the cDNA sequence for native human cystatin E is shown in SEQ ID NO: 13.
- the human cystatin E protein may be modified by the introduction of the following amino acid substitutions: Val (28) Asn, Met (33) Asn, Gly (34) Ser, Gly (34) Thr, Ser (35) Asn, Ile (38) Ser, Ile (38) Thr, Asp (81) Asn, Leu (82) Asn.
- the amino acid sequences for native and modified egg white cystatin are shown in FIG. 8 .
- Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein.
- One or more such amino acid substitutions may be present in the protein of the present invention.
- the cDNA sequence for native egg white cystatin is shown in SEQ ID NO: 15.
- the egg white cystatin protein may be modified by the introduction of the following amino acid substitutions: Ala (35) Ser, Ala (35) Thr, Arg (34) Asn, Lys (39) Ser, Lys (39) Thr, Lys (39) Asn, Tyr (40) Asn, Leu (78) Asn, Lys (91) Asn, Tyr (92) Asn.
- the amino acid sequences for native and modified bovine cystatin are shown in FIG. 9 . Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention.
- the cDNA sequence for native bovine cystatin is shown in SEQ ID NO: 23.
- the bovine cystatin protein may be modified by the introduction of the following amino acid substitutions: Ala (29) Asn, Arg (36) Ser, Arg (36) Thr, Lys (35) Asn, Ala (40) Ser, Ala (40) Thr, Tyr (41) Asn, Leu (79) Asn, Asp (80) Ser, Asp (80) Thr, Pro (88) Ser, Pro (88) Thr.
- the amino acid sequences for native and modified carp cystatin are shown in FIG. 10 .
- Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein.
- One or more such amino acid substitutions may be present in the protein of the present invention.
- the cDNA sequence for native carp cystatin is shown in SEQ ID NO: 17.
- the trout cystatin protein was modified by the introduction of the following amino acid substitutions: Lys (29) Asn, Lys (30) Ser, Lys (30) Thr, Met (34) Thr, Met (34) Ser, Lys (39) Asn, Lys (88) Asn.
- the amino acid sequences for native and modified chum salmon cystatin are shown in FIG. 12 . Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention.
- the cDNA sequence for native chum salmon cystatin is shown in SEQ ID NO: 19.
- Cloning was done using the pUC19 cloning vector and E. coli using standard gene cloning techniques (26).
- the yeast strain Pichia pastoris KM71 was used to express mammalian genes and constructs.
- T4 DNA ligase, restriction enzymes, the 7-DEAZA sequencing kit and blunting kit were all purchased from TAKARA SHUZO™ of Kyoto, Japan.
- the oligonucleotide in vitro mutagenesis system (version 2) was purchased from AMERSHAM™ International.
- CM TOYOPEARL™ 650M resin was purchased from TOSOH™ of Tokyo.
- Concanavalin A-Sepharose and α-methylmannoside were purchased from PHARMACIA™ and from WAKO™ of Tokyo, respectively.
- Sephadex-G50 was purchased from PHARMACIA™.
- M13mp19 was used as a vector for cDNA construction.
- the modified nucleotide sequences were made in two steps.
- a synthetic double-stranded DNA was constructed that codes for human cystatin C, modified to have a glycosylation site at residue number 79.
- This DNA was made by chemically synthesizing four oligonucleotides using an automated oligonucleotide synthesizer (SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27 and SEQ ID NO: 28).
- the oligonucleotides were chosen on the basis of the known nucleotide sequence of the native human cystatin C gene and the known codon usage of P. pastoris (Table 1).
- DNA was sequenced using the Sanger method (26, chapter 13). Polypeptides were sequenced using the Edman degradation method in an automated gas-phase protein sequencer (SHIMADZU™ model PSQ2).
- the complementary pairs of the four oligonucleotides were annealed together and the resulting double-stranded fragments were ligated using T4 ligase.
- the resulting synthetic open reading frame contained an XhoI site at the 5′ end and an Xba site at the 3′ end that were used for cloning.
- the gene product was ligated into pUC19 (26, chapter 1) and sequenced in both directions to check that the sequence was as predicted.
- An N-glycosylation site was introduced at either residue 35 or 36 using the QUICKCHANGE™ site-directed mutagenesis kit (STRATAGENE™, CA) according to the manufacturer's instructions and using the forward and reverse primers shown in SEQ ID NOs: 29-32.
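Mutagenesis kits of this kind commonly use a forward and a reverse primer that are complementary to each other, so one can be derived from the other. The sketch below computes the reverse complement of the 30-nucleotide SEQ ID NO: 31 sequence as it appears in the sequence listing. The function name is hypothetical, and the assumption that the two primers of a pair are exact reverse complements is an assumption of this sketch, not a statement from the text.

```python
# Translation table for complementing DNA bases (lower and upper case).
_COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string, ignoring whitespace."""
    cleaned = "".join(dna.split())
    invalid = set(cleaned) - set("acgtACGT")
    if invalid:
        raise ValueError(f"unexpected characters in DNA sequence: {sorted(invalid)}")
    return cleaned.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    # SEQ ID NO: 31 as printed in the sequence listing (blocks of ten bases).
    seq_id_31 = "ggtgagtaca acaacgcctc taacgacatg"
    print(reverse_complement(seq_id_31))
```

The output can be compared with the reverse primer given in the sequence listing as a consistency check on a primer pair.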
- the Pichia transformants were incubated in yeast minimal medium (YMM).
- the Pichia transformants were grown at 30° C. for one day in 5 mL of YMM, and then subcultured at 30° C. for four days in 500 mL of fresh YMM. 100% methanol was added to the fresh YMM every 24 hours, to a final concentration of 0.5% methanol, to maintain induction of the cystatin gene.
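The volume of 100% methanol needed for each daily addition follows from a simple dilution calculation. The sketch below assumes a volume/volume percentage, assumes no residual methanol remains in the culture from the previous addition, and accounts for the small volume increase caused by the addition itself. The function name is hypothetical.

```python
def methanol_to_add_ml(culture_ml: float, final_percent: float) -> float:
    """Volume of 100% methanol (mL) to reach final_percent (v/v) in the culture.

    Solves v / (culture_ml + v) = final_percent / 100 for v.
    """
    if culture_ml <= 0:
        raise ValueError("culture volume must be positive")
    if not 0 <= final_percent < 100:
        raise ValueError("final_percent must be in the range [0, 100)")
    fraction = final_percent / 100.0
    return fraction * culture_ml / (1.0 - fraction)


if __name__ == "__main__":
    # 500 mL culture brought to 0.5% methanol, as in the procedure above.
    print(f"{methanol_to_add_ml(500.0, 0.5):.2f} mL of 100% methanol")
```

For a 0.5% target the correction for the added volume is small, so the simpler estimate of 0.5% of 500 mL (2.5 mL) is nearly identical.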
- Recombinant modified human cystatin C was secreted in the Pichia culture media.
- the extracellular proteins were collected using an ultrafiltration system with 10,000 MW cut-off (PELLICON™ cassette filter, MILLIPORE™, Bedford, Mass.).
- the crude proteins thus recovered were applied to a column of Q-SEPHAROSE FAST FLOW™ (PHARMACIA™, Uppsala, Sweden) equilibrated with a linear gradient of 0-0.5 M NaCl in 20 mM Tris-HCl buffer (pH 7.5).
- the fraction including the cystatin was determined by the inhibitory activity against papain as described below.
- the fraction was applied to a column of Sephacryl S-100 HR (PHARMACIA™) equilibrated with 0.15 M NaCl-20 mM Tris-HCl buffer (pH 7.5). The fraction which showed the inhibitory activity was collected.
- Cystatin activity was assayed by measuring papain inhibitory activity using Nα-Benzoyl-DL-Arg-p-Nitroanilide (Bz-Arg-NA) (34).
- 0.1 mL of cystatin sample and 0.1 mL of papain solution were pipetted into 0.1 mL of 50 mM Tris-HCl buffer (pH 7.5) containing 100 mM Bz-Arg-NA, 2 mM EDTA, and 5 mM cysteine. The solution was incubated for 25 min at 37° C.
- the reaction was stopped with 0.2 mL of 30% acetic acid, and the nitroaniline liberated by enzymatic activity was quantified by measuring the absorbance of the solution at 410 nm.
- the inhibitory activity was expressed as the amount of enzyme (mg) inhibited by 1 mg of inhibitor (U/mg).
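One way to turn the absorbance readings into the stated unit (mg of enzyme inhibited per mg of inhibitor) is sketched below. It assumes that the absorbance at 410 nm is proportional to the amount of active papain, and that an uninhibited control and an optional reagent blank are run alongside the sample. Neither assumption, nor the function name and example readings, comes from the text.

```python
def inhibitory_activity(
    a410_control: float,
    a410_sample: float,
    papain_mg: float,
    inhibitor_mg: float,
    a410_blank: float = 0.0,
) -> float:
    """Estimate mg of papain inhibited per mg of inhibitor (U/mg).

    a410_control: absorbance with papain but no inhibitor
    a410_sample:  absorbance with papain plus the cystatin sample
    a410_blank:   absorbance of a reagent blank (no enzyme)
    """
    control = a410_control - a410_blank
    sample = a410_sample - a410_blank
    if control <= 0:
        raise ValueError("control absorbance must exceed the blank")
    if papain_mg <= 0 or inhibitor_mg <= 0:
        raise ValueError("enzyme and inhibitor amounts must be positive")
    # Fraction of papain activity lost, clamped to the physical range 0..1.
    fraction_inhibited = min(max(1.0 - sample / control, 0.0), 1.0)
    return fraction_inhibited * papain_mg / inhibitor_mg


if __name__ == "__main__":
    # Hypothetical readings: 75% of the papain activity is lost.
    print(inhibitory_activity(0.80, 0.20, papain_mg=0.010, inhibitor_mg=0.005))
```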
- Heat resistance assays were performed by heating a solution containing a known amount of cystatin to a controlled temperature for a controlled time; cooling the solution; mixing the cooled cystatin solution with a known amount of papain and protein substrate, and measuring turbidity of the mixture.
- the recombinant cystatins were heated to 95° C. at a rate of 1° C./min from 30° C. in 50 mM sodium phosphate buffer (pH 7.5). Protein concentration was 1 mg/mL.
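The duration of a linear heating ramp and the set-point at any time follow directly from the start temperature, end temperature and rate. The sketch below assumes the ramp starts at 30° C. and rises linearly at 1° C./min to 95° C.; the starting temperature is an assumption of this sketch, and the function names are hypothetical.

```python
def ramp_minutes(start_c: float, end_c: float, rate_c_per_min: float) -> float:
    """Time in minutes for a linear ramp from start_c to end_c."""
    if rate_c_per_min <= 0:
        raise ValueError("ramp rate must be positive")
    if end_c < start_c:
        raise ValueError("end temperature must not be below start temperature")
    return (end_c - start_c) / rate_c_per_min


def temperature_at(minute: float, start_c: float, end_c: float, rate_c_per_min: float) -> float:
    """Set-point temperature after a given number of minutes, held at end_c once reached."""
    if minute < 0:
        raise ValueError("time must not be negative")
    return min(start_c + rate_c_per_min * minute, end_c)


if __name__ == "__main__":
    # Assumed ramp: 30 C to 95 C at 1 C/min.
    print(ramp_minutes(30.0, 95.0, 1.0), "minutes to reach the final temperature")
    print(temperature_at(10.0, 30.0, 95.0, 1.0), "C after 10 minutes")
```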
- a glycosylation site at residue 36 of a modified human cystatin C <400> SEQUENCE: 31 ggtgagtaca acaacgcctc taacgacatg 30
- <210> SEQ ID NO 32 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial Sequence: reverse primer used in site-directed mutagenesis to introduce a glycosylation site at residue 36 of a modified human cystatin C <400> SEQUENCE: 32 catgtcgtta gaggcgttgt tactcacc 30
Abstract
Cystatins that have been modified by glycosylation in order to enhance stability and activity are disclosed, as are methods of making such cystatins and methods of using such cystatins to inhibit proteolysis.
Description
This is a continuation of International Application No. PCT/CA99/00717, filed Aug. 5, 1999, which claims the benefit of U.S. Provisional Application No. 60/095,503, filed Aug. 5, 1998, both of which are herein incorporated by reference.
The present invention relates to protease inhibitors, specifically to cystatins, that have been modified by glycosylation in order to enhance stability and activity; to methods of making such modified protease inhibitors and to methods of using such modified protease inhibitors to inhibit proteolysis of a protein substrate.
Proteases are enzymes that degrade proteins. Proteases are classified by the substrate upon which they act and include serine proteases, cysteine proteases, aspartate proteases and metalloproteinases. Serine and cysteine proteases are widespread and are found in diverse organisms including eukaryotic and prokaryotic animals and plants. Cysteine proteases are generally well characterized enzymes having a known primary structure composed of alpha helices and beta pleated sheets (8).
Proteases mediate many processes that are harmful to man, either by producing pathology or by causing economic loss, for instance by degrading foods. Protease-mediated pathology is known to be caused by a wide variety of organisms including bacteria (such as staphylococci and streptococci), fungi, arthropods, nematodes, helminths, and protozoa such as amoebas, intestinal flagellates and haemoflagellates (for example Leishmania and trypanosomes).
Proteases are known to be important in the pathology of certain viruses (9, 11, 12, 31) including Polio virus, Herpes virus, Corona virus, HIV and Rotavirus. Proteases are also known to play a role in various diseases with no clear etiological agent, such as muscular dystrophy (7) and cancers (1) including breast cancer (2), and amyloid angiopathy, a genetic disease that often leads to fatal cerebral hemorrhages in young adults (10, 13).
Proteases are also responsible for the spoilage of economically important foodstuffs, necessitating huge annual expenditures on preventative measures. For instance, the fungus Botrytis cinerea causes widespread disease in over thirty species of commercial crop plants. Molds and fungi are the major destroyers of citrus fruit crops. Foods rich in protein, such as meat and fish, are degraded and made inedible by proteases from Pseudomonads and other bacteria. Foods with a high muscle content may be quickly broken down by endogenous proteases released from tissues upon death. These enzymes degrade myosin, and destroy the texture of the food. An important example of such spoilage occurs during the processing of surimi, which is a form of processed minced fish, commonly made from Pacific Whiting (Merluccius productus), and is the main ingredient of seafood analogs such as “imitation crab meat”. Surimi is an important source of relatively cheap, high quality, low fat protein important to the diet of many people in the Far East and of increasing economic significance worldwide. Endogenous proteolytic enzymes released during surimi production cause rapid degradation of muscle tissue and lead to poor quality surimi (3, 4, 5, 6, 11, 17, 18, 19). It is thought that the protease released from the fish tissue is a cathepsin, which is a common cysteine protease (3).
Because of the role of proteases in these various processes, protease inhibitors have been investigated for their potential role in preventing disease and degradation of foodstuffs (4, 5, 18). Partially refined substances that contain protease inhibitors are commonly used in food processing, for instance, to prevent proteolytic breakdown of fish protein during the production of surimi (4, 17, 18). The most commonly used food-grade protease inhibitors are beef plasma protein (BPP), egg white powder and potato powder (4). Genetic engineering techniques have been used to introduce protease-inhibitor genes from chickens into cereal and grass plants to control protease-producing plant pathogens. Likewise, a plant protease inhibitor gene, from Cowpea, has been recombinantly introduced into tobacco, tomato, cotton and other plants to inhibit destruction by nematode worms (16).
Cystatins are cysteine protease inhibitors that are members of Family 2 of the cystatin superfamily, characterized by a single chain of about 115 to 122 amino acids with a molecular weight of about 13000, having two disulfide bonds (7, 8, 10). Cystatins and protease enzymes such as cathepsins form tight (but reversible) enzyme-inhibitor complexes with dissociation constants typically in the nanomolar range (10).
A number of cystatins have been characterized including human cystatins (C, S, SN, SA, D, M and E), mouse cystatin, egg-white cystatin, bovine cystatin, carp cystatin, trout cystatin and salmon cystatin. In their natural state, cystatins protect the body by inhibiting the potentially harmful effects of proteolysis, and may prevent destruction of connective tissue by protease enzymes, for instance, lysosomal proteases, released from dying or damaged cells (16).
Cystatins have been investigated for potential medical applications, for instance, to inhibit replication and pathology of Picornaviruses (12), Coronaviruses (9) and Herpes Simplex type 1 virus (15, 16, 30). Cystatins may also play a natural role in prevention of bacterial infection by E. coli, Shigella (13), Leishmania, Schistosoma and Entamoeba (10) which appear to use proteases to facilitate tissue invasion.
Cystatins are likely the primary protease inhibitors in food-grade protease inhibitor preparations such as beef plasma protein and egg white powder. Since cystatins are themselves proteins, they are prone to denaturation and loss of activity when exposed to unfavorable temperatures or pH. Many food production processes, including surimi production, involve elevated temperatures (17). Presently, in order to maintain cystatin activity, more cystatin must be added after cooling. Adding additional cystatin is both labor-intensive and expensive. Also, when cystatins are used for medical treatment, either as a topical or ingested medication, it is preferable for the cystatin-containing composition to be sterile. A common method of sterilization involves treatment with elevated temperatures. A cystatin that could maintain activity despite exposure to elevated temperatures would thus be useful in food processing and in drug formulation.
The present invention provides modified, glycosylated, heat-stable cystatins and methods of making and using these cystatins. The present invention also provides nucleic acid molecules encoding such cystatins.
The nucleic acid molecules of the invention have been modified so that when such a nucleic acid molecule is expressed in a eukaryotic cell, certain amino-acid residues of the expressed cystatin protein are glycosylated during post-translational modification of the protein. The resulting mature protein has attached, at specific amino acid residues, sugar molecule chains of varying length. The present invention includes the nucleic acid molecules that encode modified cystatins based on the cystatins from humans (C, S, SN, SA, D, M and E), egg white, cow, carp, trout and salmon.
Various residues in the cystatin primary amino acid sequence have been identified where the introduction of glycosylated residues increases heat stability of the expressed protein without severely affecting enzymatic activity. In human cystatin C, for instance, the sites for glycosylation include amino acid residues at positions 35, 36 and 79.
The present invention also includes a method of making modified heat-stable cystatins by modifying the nucleic acid molecules that encode cystatins. Such nucleic acid molecules are modified at certain defined sites and expressed in a eukaryotic cell. The present invention also includes a cell that contains at least one nucleic acid molecule encoding at least one modified, glycosylated, heat-stable cystatin. The cells of the invention may be of many types, for instance they may be cells from a yeast, a mammal, an insect, or a plant.
The invention also includes methods of inhibiting proteolysis of a protein substrate by contacting the protein substrate with a modified heat-stable cystatin having at least one engineered glycosylation site. Such a method may be applied, for example, to food processing, such as the production of surimi.
The invention also includes a method of treating a protease-mediated pathology of an organism, such as a mammal, a fish or a plant by administering to the organism a modified heat-stable cystatin of the invention. By such administration, the modified heat-stable cystatin contacts the protease that mediates the pathology, thereby inhibiting proteolysis by the protease and thereby treating the pathology.
SEQ ID NO: 1 shows the cDNA sequence and the amino acid sequence of native human cystatin C.
SEQ ID NO: 2 shows the amino acid sequence of native human cystatin C.
SEQ ID NO: 3 shows the cDNA sequence and the amino acid sequence of native human cystatin S.
SEQ ID NO: 4 shows the amino acid sequence of native human cystatin S.
SEQ ID NO: 5 shows the cDNA sequence and the amino acid sequence of native human cystatin SN.
SEQ ID NO: 6 shows the amino acid sequence of native human cystatin SN.
SEQ ID NO: 7 shows the cDNA sequence and the amino acid sequence of native human cystatin SA.
SEQ ID NO: 8 shows the amino acid sequence of native human cystatin SA.
SEQ ID NO: 9 shows the cDNA sequence and the amino acid sequence of native human cystatin D.
SEQ ID NO: 10 shows the amino acid sequence of native human cystatin D.
SEQ ID NO: 11 shows the cDNA sequence and the amino acid sequence of native human cystatin M.
SEQ ID NO: 12 shows the amino acid sequence of native human cystatin M.
SEQ ID NO: 13 shows the cDNA sequence and the amino acid sequence of native human cystatin E.
SEQ ID NO: 14 shows the amino acid sequence of native human cystatin E.
SEQ ID NO: 15 shows the cDNA sequence and the amino acid sequence of native egg white cystatin.
SEQ ID NO: 16 shows the amino acid sequence of native egg white cystatin.
SEQ ID NO: 17 shows the cDNA sequence and the amino acid sequence of native carp cystatin.
SEQ ID NO: 18 shows the amino acid sequence of native carp cystatin.
SEQ ID NO: 19 shows the cDNA sequence and the amino acid sequence of native salmon cystatin.
SEQ ID NO: 20 shows the amino acid sequence of native salmon cystatin.
SEQ ID NO: 21 shows the cDNA sequence and the amino acid sequence of native trout cystatin.
SEQ ID NO: 22 shows the amino acid sequence of native trout cystatin.
SEQ ID NO: 23 shows the cDNA sequence and the amino acid sequence of native bovine cystatin.
SEQ ID NO: 24 shows the amino acid sequence of native bovine cystatin.
SEQ ID NO: 25 shows the first of four oligonucleotides used to create a nucleotide coding for modified human cystatin C.
SEQ ID NO: 26 shows the second of four oligonucleotides used to create a nucleotide coding for synthetic human cystatin C.
SEQ ID NO: 27 shows the third of four oligonucleotides used to create a nucleotide coding for synthetic human cystatin C.
SEQ ID NO: 28 shows the fourth of four oligonucleotides used to create a nucleotide coding for synthetic human cystatin C.
SEQ ID NO: 29 shows a forward primer used in site-directed mutagenesis to introduce a glycosylation site at residue 35 of a modified human cystatin C.
SEQ ID NO: 30 is a reverse primer used in site-directed mutagenesis to introduce a glycosylation site at residue 35 of a modified human cystatin C.
SEQ ID NO: 31 is the forward primer used in site-directed mutagenesis to introduce a glycosylation site at residue 36 of a modified human cystatin C.
SEQ ID NO: 32 is the reverse primer used in site-directed mutagenesis to introduce a glycosylation site at residue 36 of a modified human cystatin C.
FIG. 1 shows the native amino acid sequence of the mature human cystatin C peptide without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.
FIG. 2 shows the native amino acid sequence of the mature human cystatin S peptide without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.
FIG. 3 shows the native amino acid sequence of the mature human cystatin SN peptide without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.
FIG. 4 shows the native amino acid sequence of the mature human cystatin SA peptide without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.
FIG. 5 shows the native amino acid sequence of the mature human cystatin D peptide without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.
FIG. 6 shows the native amino acid sequence of the mature human cystatin M peptide without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.
FIG. 7 shows the native amino acid sequence of the mature human cystatin E peptide without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.
FIG. 8 shows the native amino acid sequence of the mature Egg White cystatin peptide without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.
FIG. 9 shows the native amino acid sequence of the mature bovine cystatin without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.
FIG. 10 shows the native amino acid sequence of the mature carp cystatin without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.
FIG. 11 shows the native amino acid sequence of the mature trout cystatin without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.
FIG. 12 shows the native amino acid sequence of the mature chum salmon cystatin without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.
A. Definitions
A protein is said to be modified when it has been intentionally, artificially altered from its naturally occurring, wild-type form, e.g., when the primary amino acid sequence of a cystatin protein has been intentionally, artificially altered to add a non-native glycosylation site somewhere in the protein. A DNA sequence is said to be modified when it has been intentionally, artificially altered from its naturally occurring, wild-type form, e.g., when the nucleotide coding for a cystatin protein has been mutated by site-directed mutagenesis to create a non-native nucleotide addition, deletion or substitution. The term engineered may be used synonymously with the term modified.
To say that an organism or nucleotide sequence has been genetically engineered means that it has been intentionally, artificially genetically altered from its naturally occurring, wild-type genetic form. For instance, a DNA sequence is said to be genetically engineered when its sequence has been intentionally, artificially genetically altered from its wild-type sequence. Any genetically engineered organism or nucleotide sequence has been modified.
Heat stability refers to the ability of a protein to function at high temperatures. Proteins typically lose activity as the temperature is raised above their normal in vivo operating range, and generally become denatured as the temperature increases further. A modified protein may be more heat stable than the unmodified form of the protein, meaning that the modified form of the protein retains greater activity at higher temperatures than the unmodified form of the protein.
To say a feature has been introduced means that a non-wild-type feature has been intentionally, artificially added. For instance, a glycosylation site may be said to be introduced into a protein if it has been intentionally, artificially added to the protein. A nucleotide may also be said to be introduced into a cell or other nucleotide sequence when it has been intentionally, artificially added into that cell or sequence. Likewise, a deletion may be introduced into a DNA sequence, and amino acid substitutions, additions and deletions may be introduced into a protein.
A glycosylation site is a place on a molecule at which sugars may be added. The addition of sugars (glycosylation) may occur at introduced sites in modified proteins such as at amino acid residue 37 of the modified human cystatin C.
A glycosylation site in a protein is denoted by the general formula X (#) Z, where X=the native amino acid that has been removed, #=the position of that amino acid in the protein, and Z=the substituted amino acid that has been inserted in place of X. For instance, Ala (37) Ser denotes that Alanine at position 37 has been removed and replaced with Serine; likewise, for one-letter code, A (31) N denotes that Alanine at position 31 has been removed and replaced with Asparagine. The numbering system of cystatin proteins in this text reflects that shown in the accompanying figures. While alternative numbering systems are possible, the nomenclature used herein is intended to identify a particular amino acid residue, rather than any residue that happens to be at a given distance from an arbitrary point on an amino acid sequence.
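The X (#) Z notation can be read mechanically. The following is a minimal Python sketch, assuming only the notation described above; the names `Substitution` and `parse_substitution` are hypothetical and are introduced here for illustration.

```python
import re
from typing import NamedTuple

# Three-letter to one-letter codes for the 20 standard amino acids.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}
ONE_LETTER = set(THREE_TO_ONE.values())


class Substitution(NamedTuple):
    native: str    # X: one-letter code of the residue that is removed
    position: int  # #: 1-based position, numbered as in the figures
    new: str       # Z: one-letter code of the residue inserted in place of X


# Matches "Ala (37) Ser" as well as the one-letter form "A (31) N".
_PATTERN = re.compile(
    r"^\s*([A-Za-z]{1,3})\s*\(\s*(\d+)\s*\)\s*([A-Za-z]{1,3})\s*$"
)


def _to_one_letter(code: str) -> str:
    """Convert a one- or three-letter amino acid code to one-letter form."""
    if len(code) == 1 and code.upper() in ONE_LETTER:
        return code.upper()
    key = code.capitalize()
    if key in THREE_TO_ONE:
        return THREE_TO_ONE[key]
    raise ValueError(f"unknown amino acid code: {code!r}")


def parse_substitution(text: str) -> Substitution:
    """Parse a substitution written in the X (#) Z notation."""
    match = _PATTERN.match(text)
    if match is None:
        raise ValueError(f"not in X (#) Z form: {text!r}")
    native, position, new = match.groups()
    return Substitution(_to_one_letter(native), int(position), _to_one_letter(new))
```

For example, `parse_substitution("Ala (37) Ser")` and `parse_substitution("A (31) N")` both return a record holding the native residue, the position and the replacement residue in one-letter code.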
An organism refers to any organism of any kingdom, phylum, class, order, family, genus or species.
A pathology refers to a state that is measurably or detectably at variance with normal, healthy, non-disease-state physiology.
A cancer refers to any neoplastic transformation or any tissue that has undergone neoplasia and includes solid, non-solid, benign and malignant transformed tissue of both plants and animals.
A protease-mediated pathology is any pathology that is caused in part or in whole by the action of a protease. For instance, the proteolytic invasion of human tissue by a cancer cell, the proteolytic destruction of human tissue by a bacterium or a protozoan (e.g., Pseudomonas or Leishmania) and the tissue destruction of a plant by a fungus (e.g., Phytophthora infestans) are all examples of protease-mediated pathologies.
To say that a polynucleotide (or a gene or genome) is recombinant means that it has been altered by the addition at some site of non-native nucleic acids or nucleotides, i.e., nucleotides that are not normally found in the particular polynucleotide, or that are not normally found at that site. For instance, a human cystatin gene that has been altered by nucleotide substitution is said to be recombinant. Likewise, a recombinant protein is a protein that is the product of a recombinant polynucleotide and that contains non-native amino acid residues.
An isolated nucleic acid has been substantially separated or purified away from other nucleic acid sequences in the cell of the organism in which the nucleic acid naturally occurs, i.e., other chromosomal and extrachromosomal DNA and RNA. The term isolated thus encompasses nucleic acids purified by standard nucleic acid purification methods. The term also embraces nucleic acids prepared by recombinant expression in a host cell as well as chemically synthesized nucleic acids.
Nucleic acid probes and primers may readily be prepared based on the nucleic acid sequences provided by this invention. A probe comprises an isolated nucleic acid attached to a detectable label or reporter molecule. Typical labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. Methods for labeling and guidance in the choice of labels appropriate for various purposes are well known in the field of molecular biology. Primers are short nucleic acids, preferably DNA oligonucleotides 15 nucleotides or more in length, which are annealed to a complementary target DNA strand by nucleic acid hybridization to form a hybrid between the primer and the target DNA strand, then extended along the target DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR) or other nucleic-acid amplification methods known in the art. Probes and primers as used in the present invention preferably comprise at least 15 nucleotides of the nucleic acid sequences that encode a modified cystatin protein. In order to enhance specificity, longer probes and primers may also be employed, such as probes and primers that comprise 20, 30 or 40 consecutive nucleotides of the disclosed nucleic acid sequences. Methods for preparing and using probes and primers are described in a number of reference works, for example Sambrook et al. (1989) (26); Ausubel et al., (1987) (25); Innis et al., (1990) (27). PCR primer pairs can be derived from a known sequence, for example, by using computer programs intended for that purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge, Mass.).
A first nucleic acid sequence is operably linked with a second nucleic acid sequence when the first nucleic acid sequence is placed in a functional relationship with the second nucleic acid sequence. For instance, a promoter is operably linked to a coding sequence if the promoter affects the transcription or expression of the coding sequence. Generally, operably linked DNA sequences are contiguous and, where necessary to join two protein coding regions, in the same reading frame.
The term purified peptide does not require absolute purity; rather, it is intended as a relative term. Thus, for example, a purified cystatin preparation is one in which cystatin is enriched compared to the cystatin in its natural environment, i.e., within a cell. A purified preparation of a cystatin is prepared such that the cystatin represents at least 20% of the protein content of the preparation. Preparations comprising at least 50%, 75% or at least 90% cystatin (expressed as a percentage of total protein) may be desirable for certain applications.
B. General Methods
The present invention utilizes standard laboratory practices for the cloning, manipulation and sequencing of nucleic acids, purification and analysis of proteins and other molecular biological and biochemical techniques, unless otherwise stipulated. Such techniques are explained in detail in standard laboratory manuals such as Sambrook et al., (1989)(25) and Ausubel et al., (1987)(26).
(1) Production, cloning and expression of modified cystatins
A modified nucleotide sequence that codes for a cystatin protein having at least one glycosylation site may be produced by making synthetic sequences using a commercial polynucleotide synthesizer. The synthetic nucleotides are designed on the basis of the known nucleotide sequence of the cystatin gene and the known codon usage for the cell type in which the nucleotides are to be expressed. A suitable target site for the introduction of a glycosylation site is identified based on the primary amino acid sequence of the cystatin to be modified. A nucleotide sequence is designed that will code for a peptide that includes such a site. The synthetic nucleotide sequence may then be synthesized as described herein. A series of anti-parallel and complementary nucleotides may be synthesized and annealed together to form synthetic double-stranded nucleotide fragments. These fragments may then be ligated together to produce a contiguous double-stranded nucleotide that codes for a particular cystatin having one or more engineered glycosylation sites. The synthetic nucleotides may be amplified in vitro using the Polymerase Chain Reaction (PCR) (27) so that many copies of the nucleotide are available for cloning into a prokaryotic cloning vector (26) or into an expression vector, as explained below.
Additional glycosylation sites may be introduced using site-directed mutagenesis to add, delete or substitute particular amino acid residues. Various standard techniques are known to carry out site-directed mutagenesis (26, chapter 15), and commercial kits are also available such as the QUICKCHANGE™ site-directed mutagenesis kit (STRATAGENE™, CA). Nucleotides may be modified or synthesized so as to include restriction enzyme sites that can be used for cloning.
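The design of anti-parallel, complementary strands can be illustrated with a short Python sketch. It covers only the simplest case, in which the second strand is the exact reverse complement of the first; overlapping fragments with single-stranded ends for ligation are not modelled. The function names and the nine-nucleotide example (AAC AAG TCC, codons for Asn-Lys-Ser under the standard genetic code) are illustrative assumptions and are not taken from SEQ ID NOS: 25-28.

```python
# Watson-Crick base pairing for DNA, preserving upper or lower case.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the anti-parallel complementary strand, written 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def anneals_fully(top: str, bottom: str) -> bool:
    """True if `bottom` (5' to 3') is the exact reverse complement of `top`."""
    return reverse_complement(top).upper() == bottom.upper()


# Hypothetical top strand: AAC (Asn), AAG (Lys), TCC (Ser).
top_strand = "AACAAGTCC"
bottom_strand = reverse_complement(top_strand)
```

Checking every synthetic strand against its designed partner in this way, before ordering the oligonucleotides, is one inexpensive guard against transcription errors in the design.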
The modified nucleotide may be cloned into a standard prokaryotic cloning vector, for example pBR322, pUC18 or pUC19 (26, chapter 1). The sequence of the cloned nucleotide may be checked by sequencing using standard methods (26, chapters 1 and 13).
Modified nucleotides may be cloned into an expression vector that allows protein production in a particular cell type. Since the proteins of the invention are glycosylated, the cell type in which the proteins are expressed must be able to carry out post-translational modification, including glycosylation. Typically, a eukaryotic cell is used that glycosylates peptides at the Asn-X-Ser/Thr motif. Yeast cells are commonly used for such a purpose. Standard cloning techniques may be used (26, chapter 9). Such expression vector/cell systems are well known and commercially available and include vector/cell combinations that carry out post-translational modifications required for the proper expression of glycosylated eukaryotic proteins. Various yeast strains and yeast-derived vectors are commonly used for this type of expression. For instance, Pichia pastoris expression systems that may be used to practice the present invention may be obtained from INVITROGEN™. Such systems include suitable Pichia pastoris strains, vectors, reagents, transformants, sequencing primers and media. Available strains include a GS115 his 4 deficient strain, a KM71 aox1 deficient strain, a GS115 His+ Mut− strain for extracellular expression and a His+ Mut− strain for intracellular expression (33).
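Candidate Asn-X-Ser/Thr motifs in a peptide can be located with a simple scan. The Python sketch below is illustrative only. The function name `find_sequons` is hypothetical, and the optional exclusion of proline at the X position is an added assumption reflecting the generally accepted rule for N-linked glycosylation; it is not stated in this text and can be switched off.

```python
from typing import List


def find_sequons(seq: str, exclude_proline: bool = True) -> List[int]:
    """Return 1-based positions of Asn residues that begin an Asn-X-Ser/Thr motif.

    `seq` is a peptide in one-letter code. When `exclude_proline` is True,
    motifs whose middle residue X is proline are skipped (an assumption that
    goes beyond the Asn-X-Ser/Thr wording of the text).
    """
    seq = seq.upper()
    positions = []
    for i in range(len(seq) - 2):
        if seq[i] != "N":
            continue
        if seq[i + 2] not in ("S", "T"):
            continue
        if exclude_proline and seq[i + 1] == "P":
            continue
        positions.append(i + 1)  # convert 0-based index to 1-based position
    return positions
```

The presence of a motif indicates only that glycosylation is possible; whether a given site is actually glycosylated in a particular host cell has to be determined experimentally.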
Non-yeast eukaryotic vectors may equally be used for expression of the modified nucleotides. Mammalian vector / host cell systems that contain genetic and cellular control elements capable of carrying out transcription, translation and post-translational modification are well known in the art. Examples of such systems are the well known Baculovirus system, the Ecdysone-inducible mammalian expression system that uses regulatory elements from Drosophila melanogaster to allow control of gene expression, and the Sindbis viral expression system that allows high level expression in a variety of mammalian cell lines, which are available from INVITROGEN™.
The cloned expression vector may then be transformed into a particular cell type and the nucleotide expressed. Many different types of cell may be used to express the modified nucleic acid molecules. Examples of such cells include cells of yeasts, fungi, insects, humans and plants, including transformed and non-transformed cells. For instance, common mammalian cells that could be used for the invention include human HeLa cells, SW-527 human cells (ATCC deposit #7940), WISH cells (ATCC deposit #CCL-25), Daudi cells (ATCC deposit #CCL-213), Madin-Darby bovine kidney cells (ATCC deposit #CCL-22) and Chinese Hamster ovary cells (ATCC deposit #CRL-2092). Common yeast cells include Pichia pastoris (ATCC deposit #201178) and Saccharomyces cerevisiae (ATCC deposit #46024). Insect cells include cells from Drosophila melanogaster (ATCC deposit #CRL-10191), the cotton bollworm (ATCC deposit #CRL-9281) and from Trichoplusia ni egg cell homogenates. Fish cells that may be used include those from rainbow trout (ATCC deposit #CCL-55), salmon (ATCC deposit #CRL-1681) and Zebrafish (ATCC deposit #CRL-2147). Amphibian cells that may be used include those of the Bullfrog, Rana catesbeiana (ATCC deposit #CCL-41). Reptile cells that may be used include those from Russell's Viper (ATCC deposit #CCL-140). Plant cells that could be used include Chlamydomonas cells (ATCC deposit #30485), Arabidopsis cells (ATCC deposit #54069) and tomato plant cells (ATCC deposit #54003). Many of these cell types are commonly used and are available from the ATCC as well as from commercial suppliers such as PHARMACIA™ and INVITROGEN™.
Expressed protein may be accumulated within a cell or may be secreted from the cell. Such expressed protein may then be collected and purified. This protein may then be characterized for activity and heat stability and may be used to practice the methods of the invention.
The amino acid sequences of the cystatins (FIGS. 1-12) are shown in their mature form without a signal peptide. The signal peptide is cleaved off during post-translational modification to produce the mature peptide. The invention may be equally practiced with cystatin peptides which retain the signal peptide.
(2) Measurement of protease activity and heat resistance
Protease inhibition activity of modified cystatins is assayed by measuring the reduction in activity of the protease papain. A substrate is used that releases nitroaniline into the reaction medium. The amount of nitroaniline released is determined by measuring light absorption by the solution. This assay is described in detail in Example 4, below.
Heat resistance of the modified cystatin is determined by heating a solution containing a known amount of modified cystatin, cooling it, and adding it to a mixture containing a known amount of protease and a known amount of protein. Protease activity is measured by determining turbidity of the mixture after a set time and also by using the nitroaniline assay as described below.
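The absorbance readings from such an assay can be reduced to a percent inhibition and, for heat-treated samples, to a retained fraction of inhibitory activity. The Python sketch below shows one generic way to do this arithmetic; it is an assumption for illustration and is not the calculation of Example 4. The function names, the blank correction and the example readings are all hypothetical.

```python
def percent_inhibition(a_uninhibited: float, a_inhibited: float,
                       a_blank: float = 0.0) -> float:
    """Percent reduction in protease activity caused by the inhibitor.

    a_uninhibited: absorbance with protease and substrate, no cystatin
    a_inhibited:   absorbance with protease, substrate and cystatin
    a_blank:       absorbance of substrate alone (background)
    """
    control = a_uninhibited - a_blank
    if control <= 0:
        raise ValueError("uninhibited signal must exceed the blank")
    return (1.0 - (a_inhibited - a_blank) / control) * 100.0


def retained_activity(inhibition_heated: float,
                      inhibition_unheated: float) -> float:
    """Fraction of inhibitory activity that survives heat treatment."""
    if inhibition_unheated <= 0:
        raise ValueError("unheated sample shows no inhibition")
    return inhibition_heated / inhibition_unheated


# Hypothetical readings: unheated cystatin, then the same cystatin after heating.
unheated = percent_inhibition(a_uninhibited=1.0, a_inhibited=0.25)
heated = percent_inhibition(a_uninhibited=1.0, a_inhibited=0.5)
fraction_kept = retained_activity(heated, unheated)
```

Comparing `fraction_kept` for the modified and unmodified cystatin under the same heating conditions gives a single number for the relative heat stability of the two proteins.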
(3) Methods of using modified cystatins
The modified cystatins of the invention may be used to inhibit proteolysis of a protein substrate in generally the same way as non-modified cystatins are used. Such uses include inhibition of proteolysis in food processing (4) and therapeutic treatment of viral diseases such as those caused by Herpes Simplex (11, 30), picornaviruses (20) and coronavirus (9).
The present invention includes a general method of inhibiting proteolysis of a substrate by providing the modified cystatin and by contacting this cystatin with the substrate. Such a substrate may be any substrate containing protein. Such a modified cystatin may be used in food processing applications, in agricultural applications and in human and non-human medical applications. For instance, a protein substrate such as minced Pacific Whiting (as used for surimi production) may be treated with a modified cystatin of the invention to inhibit proteolysis caused by the release of endogenous protease from the fish tissue. Suggested concentrations for the application of modified cystatin are, for instance, for surimi processing, from about 1 μg/g to 100 μg/g of surimi.
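The suggested surimi dose range translates into a total cystatin mass for a batch by simple unit conversion. The following Python sketch shows the arithmetic; the function name and the 50 kg batch size are hypothetical, chosen only for illustration.

```python
def cystatin_mass_mg(batch_kg: float, dose_ug_per_g: float) -> float:
    """Total modified cystatin, in milligrams, for a batch of surimi.

    batch_kg:      mass of surimi in kilograms
    dose_ug_per_g: dose in micrograms of cystatin per gram of surimi
    """
    if batch_kg < 0 or dose_ug_per_g < 0:
        raise ValueError("batch mass and dose must be non-negative")
    batch_g = batch_kg * 1000.0          # kilograms to grams
    total_ug = batch_g * dose_ug_per_g   # micrograms of cystatin
    return total_ug / 1000.0             # micrograms to milligrams


# Hypothetical 50 kg batch at the two ends of the suggested range.
low_mg = cystatin_mass_mg(50.0, 1.0)     # about 1 ug/g
high_mg = cystatin_mass_mg(50.0, 100.0)  # about 100 ug/g
```

Under these assumptions the 50 kg batch would call for roughly 50 mg of modified cystatin at the low end of the range and roughly 5 g at the high end.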
The present invention also includes methods of using a modified cystatin for the inhibition of proteases for therapeutic purposes. For instance, the cystatins of the invention may be used to inhibit tissue destruction and invasion by pathogens such as staphylococci and streptococci. Streptococcus is the etiological agent of the common skin disease impetigo, and certain particularly rapacious “flesh eating” strains of Staphylococcus and Streptococcus have recently received much media attention, particularly because of their multiple antibiotic resistance.
The effectiveness of cystatins of the invention in preventing bacterial invasion of host-tissues can be measured, for example, by the method of Betts and Finlay (32). This method can be used to determine tissue invasion of green monkey kidney cells (ATCC deposit #CCL-70) by bacteria such as E. coli and Salmonella typhimurium.
For medical applications the modified cystatin may be administered topically or systemically. The cystatin may be formulated with a carrier or pharmacologically acceptable excipient. For instance, the cystatin may be mixed with a carrier such as a petroleum-based or lanolin-based oil to form a gel, and administered topically at the site of infection, thereby contacting the modified cystatin with the protease and preventing tissue destruction.
The cystatins of the present invention may also be administered systemically, either orally, intravenously, sub-cutaneously, transdermally or by other methods. For instance, the cystatin of the present invention may be formulated into a tablet or solution form and administered orally. Formulation of drugs into pills and tablets is well known in the art. Systemic administration may be used to treat infections that cannot be treated topically such as cancer and systemic viral infections of humans and animals.
For topical medical treatment, concentrations of modified cystatin in a cream or ointment may be from about 1 ng to 100 mg per gram total weight of ointment, or may be from about 1 μg to 1 mg per gram total weight, or may be from about 10 μg to 100 μg per gram total weight. For ingested medications, the amount of modified cystatin per kg body mass of a patient may be from about 1 ng to 100 mg, or may be from about 1 μg to 1 mg, or may be from about 10 μg to 100 μg.
The cystatins of the present invention may be used to inhibit proteolysis caused by pathogens of crop plants and fruits. For instance, the modified cystatins may be applied to the surface of fruit or crops to inhibit proteases produced by plant pathogens such as fungi, for instance, Botrytis. The modified cystatin may, for instance, be sprayed or painted onto crops, either in a pure form or diluted in a carrier liquid such as water.
The cystatins of the invention show superior heat resistance, maintaining a high degree of activity after exposure to elevated temperatures. These heat resistant cystatins may be particularly useful where sterility is required, such as in medical applications where it is generally desirable for medicines to be uncontaminated with biological organisms. For instance, the cystatins of the present invention may be formulated into topical ointments or pills which may then be packaged, sterilized and administered therapeutically. Also, food processing, such as surimi production, involves elevated temperatures. The cystatins of the invention are useful in such processes due to their enhanced heat resistance.
C. Production of Modified Human Cystatin C
Glycosylation of one or more amino acid residues at specific sites in the human cystatin C peptide increases heat stability of the cystatin without substantially inhibiting functionality. Glycosylation at other, inappropriate sites may drastically decrease or destroy the protein's inhibitory function. The invention identifies three amino acid residues of human cystatin C that may be modified to introduce a glycosylation site; one or more of these sites may be modified to produce a cystatin protein having superior properties, such as enhanced heat stability.
The amino acid sequences for native and modified human cystatin C are shown in FIG. 1. Amino acid substitutions to produce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more of these amino acid substitutions may be present in a modified cystatin protein of the present invention. The cDNA sequence for native human cystatin C is shown in SEQ ID NO: 1.
Human cystatin C protein may be modified to introduce glycosylation sites (Asn-X-Ser/Thr) at amino acid residues 35, 36 or 79 by the introduction of amino acid substitutions at positions 37, 36 or 81, respectively. One or more of the following specific substitutions may be made: Ala (37) Ser, Ala (37) Thr, Lys (36) Asn, Asp (81) Ser, Asp (81) Thr.
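The relationship between the substituted position and the resulting glycosylation site can be checked in a few lines of Python. The sketch below is illustrative only. The eight-residue fragment and its starting position (residue 33) are assumptions chosen to be consistent with the substitutions listed above; FIG. 1 remains the authoritative sequence. The function names are hypothetical, and the exclusion of proline at the X position is an added assumption.

```python
def apply_substitution(seq: str, native: str, position: int, new: str,
                       start: int = 1) -> str:
    """Replace `native` at 1-based `position` with `new`.

    `start` is the position number of the first residue of `seq`, so that a
    fragment can keep the numbering of the full mature peptide.
    """
    index = position - start
    if not 0 <= index < len(seq):
        raise ValueError(f"position {position} is outside the sequence")
    if seq[index] != native:
        raise ValueError(
            f"expected {native} at position {position}, found {seq[index]}"
        )
    return seq[:index] + new + seq[index + 1:]


def has_sequon_at(seq: str, position: int, start: int = 1) -> bool:
    """True if an Asn-X-Ser/Thr motif (X not Pro) begins at `position`."""
    index = position - start
    if index < 0 or index + 2 >= len(seq):
        return False
    return seq[index] == "N" and seq[index + 1] != "P" and seq[index + 2] in "ST"


# Assumed fragment covering residues 33-40 (illustrative, not copied from FIG. 1).
FRAGMENT = "EYNKASND"
FRAGMENT_START = 33

ala37ser = apply_substitution(FRAGMENT, "A", 37, "S", FRAGMENT_START)
lys36asn = apply_substitution(FRAGMENT, "K", 36, "N", FRAGMENT_START)
```

With the assumed fragment, Ala (37) Ser completes a motif beginning at residue 35, and Lys (36) Asn creates a motif beginning at residue 36, matching the correspondence between substitution positions and glycosylation sites described above.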
It should be noted that the peptide of cystatin C in FIG. 1 is shown in its mature form without a signal peptide. The signal peptide is cleaved off during post-translational modification to produce the mature peptide. The invention may be equally practiced with cystatin peptides which retain the signal peptide.
The same applies to all the other cystatins shown in FIGS. 2-12.
D. Production of Modified Human Cystatin S
The amino acid sequences for native and modified human cystatin S are shown in FIG. 2. Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention. The cDNA sequence for native human cystatin S is shown in SEQ ID NO: 3.
The human cystatin S protein may be modified by the introduction of the following amino acid substitutions: Ala (31) Asn, Lys (37) Asn, Ala (38) Ser, Ala (38) Thr, Leu (81) Asn, Asp (82) Thr, and Asp (82) Ser.
E. Production of Modified Human Cystatin SN
The amino acid sequences for native and modified human cystatin SN are shown in FIG. 3. Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention. The cDNA sequence for native human cystatin SN is shown in SEQ ID NO: 5.
The human cystatin SN protein may be modified to introduce one or more glycosylation sites by the introduction of one or more of the following amino acid substitutions: Ala (31) Asn, Ala (38) Ser, Ala (38) Thr, Lys (37) Asn, Lys (81) Asn, Asp (82) Ser, Asp (82) Thr.
F. Production of Modified Human Cystatin SA
The amino acid sequences for native and modified human cystatin SA are shown in FIG. 4. Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention. The cDNA sequence for native human cystatin SA is shown in SEQ ID NO: 7.
The human cystatin SA protein may be modified by the introduction of the following amino acid substitutions: Val (31) Asn, Ala (38) Ser, Ala (38) Thr, Lys (37) Asn, Asp (82) Ser, Asp (82) Thr, Leu (81) Asn.
G. Production of Modified Human Cystatin D
The amino acid sequences for native and modified human cystatin D are shown in FIG. 5. Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention. The cDNA sequence for native human cystatin D is shown in SEQ ID NO: 9.
The human cystatin D protein may be modified by the introduction of the following amino acid substitutions: Ala (31) Asn, Val (38) Ser, Val (38) Thr, Asp (42) Ser, Asp (42) Thr, Asp (83) Ser, Asp (83) Thr, Pro (86) Ser, Pro (86) Thr, Gln (90) Ser, Gln (90) Thr, Tyr (44) Asn.
H. Production of Modified Human Cystatin M
The amino acid sequences for native and modified human cystatin M are shown in FIG. 6. Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention. The cDNA sequence for native human cystatin M is shown in SEQ ID NO: 11.
The human cystatin M protein may be modified by the introduction of the following amino acid substitutions: Val (35) Asn, Met (40) Asn, Gly (41) Ser, Gly (41) Thr, Ser (42) Asn, Ile (45) Ser, Ile (45) Thr, Arg (78) Asn, Arg (81) Asn, Asp (88) Asn, Leu (89) Asn.
I. Production of Modified Human Cystatin E
The amino acid sequences for native and modified human cystatin E are shown in FIG. 7. Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention. The cDNA sequence for native human cystatin E is shown in SEQ ID NO: 13.
The human cystatin E protein may be modified by the introduction of the following amino acid substitutions: Val (28) Asn, Met (33) Asn, Gly (34) Ser, Gly (34) Thr, Ser (35) Asn, Ile (38) Ser, Ile (38) Thr, Asp (81) Asn, Leu (82) Asn.
J. Production of Modified Egg White Cystatin
The amino acid sequences for native and modified egg white cystatin are shown in FIG. 8. Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention. The cDNA sequence for native egg white cystatin is shown in SEQ ID NO: 15.
The egg white cystatin protein may be modified by the introduction of the following amino acid substitutions: Ala (35) Ser, Ala (35) Thr, Arg (34) Asn, Lys (39) Ser, Lys (39) Thr, Lys (39) Asn, Tyr (40) Asn, Leu (78) Asn, Lys (91) Asn, Tyr (92) Asn.
K. Production of Modified Bovine Cystatin
The amino acid sequences for native and modified bovine cystatin are shown in FIG. 9. Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention. The cDNA sequence for native bovine cystatin is shown in SEQ ID NO: 23.
The bovine cystatin protein may be modified by the introduction of the following amino acid substitutions: Ala (29) Asn, Arg (36) Ser, Arg (36) Thr, Lys (35) Asn, Ala (40) Ser, Ala (40) Thr, Tyr (41) Asn, Leu (79) Asn, Asp (80) Ser, Asp (80) Thr, Pro (88) Ser, Pro (88) Thr.
L. Production of Modified Carp Cystatin
The amino acid sequences for native and modified carp cystatin are shown in FIG. 10. Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention. The cDNA sequence for native carp cystatin is shown in SEQ ID NO: 17.
The carp cystatin protein may be modified by the introduction of the following amino acid substitutions: Gln (31) Ser, Gln (31) Thr, Gly (30) Asn, Ala (35) Ser, Ala (35) Thr, Lys (39) Asn, Lys (91) Asn.
M. Production of Modified Trout Cystatin
The amino acid sequences for native and modified trout cystatin are shown in FIG. 11. Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention. The cDNA sequence for native trout cystatin is shown in SEQ ID NO: 21.
The trout cystatin protein was modified by the introduction of the following amino acid substitutions: Lys (29) Asn, Lys (30) Ser, Lys (30) Thr, Met (34) Thr, Met (34) Ser, Lys (39) Asn, Lys (88) Asn.
N. Production of Modified Chum Salmon Cystatin
The amino acid sequences for native and modified chum salmon cystatin are shown in FIG. 12. Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention. The cDNA sequence for native chum salmon cystatin is shown in SEQ ID NO: 19.
The chum salmon cystatin protein may be modified by the introduction of the following amino acid substitutions: Lys (29) Asn, Lys (30) Ser, Lys (30) Thr, Met (34) Ser, Met (34) Thr, Lys (88) Asn.
Cloning was done in E. coli using the pUC19 cloning vector and standard gene cloning techniques (26). The yeast strain Pichia pastoris KM71 was used to express mammalian genes and constructs. T4 DNA ligase, restriction enzymes, the 7-DEAZA sequencing kit and blunting kit were all purchased from TAKARA SHUZO™ of Kyoto, Japan. The oligonucleotide in vitro mutagenesis system (version 2) was purchased from AMERSHAM™ International. CM TOYOPEARL™ 650M resin was purchased from TOSOH™ of Tokyo. Concanavalin A-sepharose and α-methylmannoside were purchased from PHARMACIA™ and from WAKO™ of Tokyo, respectively. Sephadex-G50 was purchased from PHARMACIA™. M13mp19 was used as a vector for cDNA construction.
The modified nucleotide sequences were made in two steps.
First, a synthetic double-stranded DNA was constructed that codes for human cystatin C, modified to have a glycosylation site at residue number 79. This DNA was made by chemically synthesizing four oligonucleotides using an automated oligonucleotide synthesizer (SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27 and SEQ ID NO: 28). The oligonucleotides were chosen on the basis of the known nucleotide sequence of the native human cystatin C gene and the known codon usage of P. pastoris (Table I).
DNA was sequenced using the Sanger method (26, chapter 13). Polypeptides were sequenced using the Edman degradation method in a gas-phase protein automated sequencer (SHIMADZU™ model PSQ2).
| TABLE I |
| Amino acid | Codon usage |
| Glycine | GGT or GGA |
| Glutamic acid | GAG or GAA |
| Aspartic acid | GAC or GAT |
| Valine | GTT or GTC |
| Alanine | GCT or GCC |
| Arginine | AGA or CGT |
| Serine | TCT or TCC |
| Lysine | AAG |
| Asparagine | AAC |
| Methionine | ATG |
| Isoleucine | ATT or ATC |
| Threonine | ACT or ACC |
| Tryptophan | TGG |
| Cysteine | TGT |
| Tyrosine | TAC |
| Leucine | TTG or CTG |
| Phenylalanine | TTC |
| Glutamine | CAA or CAG |
| Histidine | CAC or CAT |
| Proline | CCA or CCT |
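The codon table lends itself to a simple back-translation helper. The sketch below is only an illustration: it always takes the first codon listed for each amino acid, which is an arbitrary choice made here and not a rule stated in the specification, and the name `back_translate` is hypothetical.

```python
# First-listed codon for each amino acid in Table I (P. pastoris codon usage).
# Picking the first entry is an assumption made for this sketch.
PREFERRED_CODON = {
    "G": "GGT", "E": "GAG", "D": "GAC", "V": "GTT", "A": "GCT",
    "R": "AGA", "S": "TCT", "K": "AAG", "N": "AAC", "M": "ATG",
    "I": "ATT", "T": "ACT", "W": "TGG", "C": "TGT", "Y": "TAC",
    "L": "TTG", "F": "TTC", "Q": "CAA", "H": "CAC", "P": "CCA",
}


def back_translate(protein):
    """Convert a one-letter protein sequence into DNA using the table above."""
    try:
        return "".join(PREFERRED_CODON[aa] for aa in protein.upper())
    except KeyError as err:
        raise ValueError(f"no codon listed for residue {err.args[0]!r}") from None


if __name__ == "__main__":
    # First five residues of mature human cystatin C (SEQ ID NO: 2): Ser Ser Pro Gly Lys
    print(back_translate("SSPGK"))
```

For the first five residues of SEQ ID NO: 2 this yields TCT TCT CCA GGT AAG, in contrast to the native codons tcc agt ccc ggc aag shown in SEQ ID NO: 1.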
The complementary pairs of the four oligonucleotides were annealed together and the resulting double-stranded fragments were ligated using T4 DNA ligase. The resulting synthetic open reading frame contained an XhoI site at the 5′ end and an XbaI site at the 3′ end that were used for cloning. The gene product was ligated into pUC19 (26, chapter 1) and sequenced in both directions to check that the sequence was as predicted.
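The annealing and cloning-site design described above can be sanity-checked in a few lines. This is a sketch under stated assumptions: the example fragment is hypothetical (it is not any of SEQ ID NOs: 25-28), the pair is treated as fully complementary with no overhangs, and the helper names are invented for illustration. The recognition sequences CTCGAG (XhoI) and TCTAGA (XbaI) are the standard ones.

```python
# Translation table for complementing DNA bases (upper and lower case).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

XHOI = "CTCGAG"  # XhoI recognition sequence
XBAI = "TCTAGA"  # XbaI recognition sequence


def reverse_complement(dna):
    """Return the reverse complement of a DNA string written 5'->3'."""
    return dna.translate(COMPLEMENT)[::-1]


def anneals(top, bottom):
    """True if `bottom` (5'->3') is the exact reverse complement of `top`."""
    return reverse_complement(top).upper() == bottom.upper()


def has_cloning_sites(orf, window=12):
    """Check for an XhoI site near the 5' end and an XbaI site near the 3' end."""
    orf = orf.upper()
    return XHOI in orf[:window] and XBAI in orf[-window:]


if __name__ == "__main__":
    # Hypothetical fragment: XhoI site, five codons, stop codon, XbaI site.
    top = "CTCGAG" + "TCTTCTCCAGGTAAG" + "TAA" + "TCTAGA"
    bottom = reverse_complement(top)
    print(anneals(top, bottom), has_cloning_sites(top))
```

Both recognition sequences are palindromic, so each site reads the same on either strand; that is why checking the top strand alone is enough in `has_cloning_sites`.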
Second, an N-glycosylation site was introduced at either residue 35 or 36 using the QUICKCHANGE™ site-directed mutagenesis kit (STRATAGENE™, CA) according to the manufacturer's instructions and using the forward and reverse primers shown in SEQ ID NOs: 29-32.
The yeast expression plasmids pYG-100 (20) and pPICZ α-C, containing the Saccharomyces cerevisiae α factor secretion signal and the alcohol oxidase (AOX1) gene promoter, were used to express the proteins of the invention. P. pastoris strain KM71 was transformed using the Pichia EASYCOM™ transformation system (INVITROGEN™, CA). Zeocin-resistant transformants were selected on yeast extract peptone dextrose sorbitol medium (YPDS) agar plates containing zeocin. PCR sequencing was used to confirm insertion of the cystatin C gene in Pichia clones.
The Pichia transformants were cultured in yeast minimal medium (YMM). They were grown at 30° C. for one day in 5 ml of YMM, and then subcultured at 30° C. for four days in 500 ml of fresh YMM. Every 24 hours, 100% methanol was added to the fresh YMM to a final concentration of 0.5% methanol to maintain induction of the cystatin gene.
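The methanol addition is a simple dilution calculation. The sketch below assumes that no methanol remains in the culture at the time of each addition and that volumes are additive; both are simplifications, and the function name is hypothetical.

```python
def methanol_to_add(culture_ml, final_percent, stock_percent=100.0):
    """Volume of methanol stock (ml) that brings a culture to final_percent (v/v).

    Solves stock_percent * v = final_percent * (culture_ml + v) for v,
    assuming the culture contains no methanol before the addition.
    """
    if not 0 <= final_percent < stock_percent:
        raise ValueError("final concentration must be below the stock concentration")
    return culture_ml * final_percent / (stock_percent - final_percent)


if __name__ == "__main__":
    # 500 ml culture induced to 0.5% methanol with 100% methanol
    print(f"{methanol_to_add(500, 0.5):.2f} ml")
```

For the 500 ml culture described above this comes to roughly 2.5 ml of 100% methanol per addition.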
Recombinant modified human cystatin C was secreted into the Pichia culture medium. The extracellular proteins were collected using an ultrafiltration system with a 10,000 MW cut-off (PELLICON™ cassette filter, MILLIPORE™, Bedford, Mass.). The crude proteins thus recovered were applied to a column of Q-SEPHAROSE FAST FLOW™ (PHARMACIA™, Uppsala, Sweden) equilibrated with a linear gradient of 0-0.5 M NaCl in 20 mM Tris-HCl buffer (pH 7.5). The fraction containing the cystatin was identified by its inhibitory activity against papain as described below. This fraction was applied to a column of sephacryl S-100 HR (PHARMACIA™) equilibrated with 0.15 M NaCl-20 mM Tris-HCl buffer (pH 7.5). The fraction which showed the inhibitory activity was collected.
Cystatin activity was assayed by measuring papain inhibitory activity using Nα-Benzoyl-DL-Arg-p-Nitroanilide (Bz-Arg-NA) (34). 0.1 ml of cystatin sample and 0.1 ml of papain solution (0.5 mg/ml) were pipetted into 0.1 ml of 50 mM Tris-HCl buffer (pH 7.5) containing 100 mM Bz-Arg-NA, 2 mM EDTA, and 5 mM cysteine. The solution was incubated for 25 min at 37° C. The reaction was stopped with 0.2 ml of 30% acetic acid, and the nitroaniline liberated by enzymatic activity was quantified by measuring the light absorption of the solution at 410 nm. The inhibitory activity was expressed as the amount of enzyme (mg) inhibited by 1 mg of inhibitor (U/mg).
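The unit definition above (mg of enzyme inhibited per mg of inhibitor) can be turned into a small calculation. The sketch below rests on assumptions that the specification does not state: that blank-corrected absorbance at 410 nm is proportional to residual papain activity, and that an uninhibited control reaction is run alongside each sample. The function and parameter names are hypothetical.

```python
def inhibitory_activity(a410_control, a410_sample, papain_mg, inhibitor_mg):
    """Return mg of papain inhibited per mg of inhibitor (U/mg).

    a410_control: blank-corrected A410 of the reaction without inhibitor
    a410_sample:  blank-corrected A410 of the reaction with inhibitor
    Assumes A410 is proportional to residual papain activity.
    """
    if a410_control <= 0 or inhibitor_mg <= 0:
        raise ValueError("control absorbance and inhibitor mass must be positive")
    fraction_inhibited = 1.0 - a410_sample / a410_control
    # Clamp to [0, 1] so measurement noise cannot give a negative or >100% value.
    fraction_inhibited = min(max(fraction_inhibited, 0.0), 1.0)
    return papain_mg * fraction_inhibited / inhibitor_mg


if __name__ == "__main__":
    # 0.1 ml of 0.5 mg/ml papain = 0.05 mg enzyme; the other values are made up.
    print(inhibitory_activity(a410_control=0.8, a410_sample=0.2,
                              papain_mg=0.05, inhibitor_mg=0.01))
```

The absorbance readings and inhibitor mass in the usage line are invented for illustration only; the 0.05 mg of papain follows from the 0.1 ml of 0.5 mg/ml solution used in the assay.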
Heat resistance assays were performed by heating a solution containing a known amount of cystatin to a controlled temperature for a controlled time, cooling the solution, mixing the cooled cystatin solution with a known amount of papain and protein substrate, and measuring the turbidity of the mixture. The recombinant cystatins were heated to 95° C. at a rate of 1° C./min from 30° C. in 50 mM sodium phosphate buffer (pH 7.5). Protein concentration was 1 mg/ml.
At preset temperatures, each heated sample was transferred into a cuvette and the turbidity measured at 500 nm. The residual papain-inhibiting activity of the heated samples was also measured as described above. This procedure was repeated in triplicate.
The above examples are provided by way of illustration only and are in no way intended to limit the scope of the invention. One of skill in the art will see that the invention may be modified in various ways without departing from the spirit or principle of the invention. We claim all such modifications.
1. Keppler et al., (1993) In Proteases and Cancer Colloquium Queen's Univ., Belfast, 22, 43-49
2. Kuopio et al., (1998) Cancer Research 58 (3), 432-436
3. Seymour et al., (1994) Journal of Agricultural and Food Chemistry 42, 2421
4. Weerasinghe et al., (1996) J. Agric. Food Chem. 44, 2584-2590
5. Izquierdo-Pulido et al., (1994) J. Agric Food Chem 42, 616-622
6. Yamashita et al., (1991) Nippon Suisan Gakkaishi 57 (10), 1917-1922
7. Barrett et al., The Biochemical Journal (1986) Letters 236 (1), 311-312
8. Turk et al., (1991) FEBS 285 (2) 213-219
9. Collins et al., (1998) Oral Microbiol Immunol. 13 (1), 59-61
10. Barrett (1987) TIBS 12, 193-196
11. Grubb et al., (1995) U.S. Pat. No. 5,432,264
12. Korant et al., (1985) Biochem. and Biophys. Res. Comm. 127 (3), 1072-1076
13. Abrahamson et al., (1988) FEBS Letters 236 (1), 14-18
14. Nakamura et al., (1993) J. of Biol. Chem. 268 (17), 12706-12712
15. Saitoh et al., (1998) Arch Biochem Biophys 352 (2), 199-206
16. Atkinson et al., (1996) PCT Patent No. WO 96/116173
17. An et al., (1994) Journal of Food Science 59 (2), 277
18. Morrissey et al., (1993) Journal of Food Science 58 (5), 1050
19. An et al., (1994) Journal of Food Science 59 (5), 1013
20. Turk et al., (1990) U.S. Pat. No. 4,902,509
21. Yamashita et al., (1990) Nippon Suisan Gakkaishi 56 (8) 1271-1277
22. Saeki et al., (1995) Journal of Food Science 60 (5),
23. Yamashita et al., (1991) Comp. Biochem. Physiol. 100A (3) 749-751
24. Nakamura et al., (1993) FEBS 328 (3) 259-262
25. Ausubel et al., (1987). Current Protocols in Molecular Biology, ed. Greene Publishing and Wiley-Interscience: New York (with periodic updates)
26. Sambrook et al., (1989). Molecular Cloning: A Laboratory Manual, 2nd ed., vol. 1-3, ed. Cold Spring Harbor Laboratory Press: Cold Spring Harbor, N.Y.
27. Innis et al., (1990). PCR Protocols: A Guide to Methods and Applications, Academic Press: San Diego
28. Nakamura et al., (1996) FEBS Letters 383 251-254
29. An et al., (1995) J. Agric. Food Chem. 43 327-330
30. Bjorck et al., J. Virol. 64 (2) 941-943
31. Wyatt, R. G. et al., (1980) Science, 207, 189-191
32. Betts et al., (1992) Can. J. Microb. 38 852-857
33. Invitrogen Product Catalogue, 1998. Invitrogen, Carlsbad, Calif.
34. Barrett et al., (1981) Methods. Enzymol. 80: 771-778
| # SEQUENCE LISTING |
| <160> NUMBER OF SEQ ID NOS: 32 |
| <210> SEQ ID NO 1 |
| <211> LENGTH: 363 |
| <212> TYPE: DNA |
| <213> ORGANISM: Homo sapiens |
| <220> FEATURE: |
| <221> NAME/KEY: CDS |
| <222> LOCATION: (1)..(363) |
| <400> SEQUENCE: 1 |
| tcc agt ccc ggc aag ccg ccg cgc ctg gtg gg |
| #a ggc ccc atg gac gcc 48 |
| Ser Ser Pro Gly Lys Pro Pro Arg Leu Val Gl |
| #y Gly Pro Met Asp Ala |
| 1 5 |
| # 10 |
| # 15 |
| agc gtg gag gag gag ggt gtg cgg cgt gca ct |
| #g gac ttt gcc gtc ggc 96 |
| Ser Val Glu Glu Glu Gly Val Arg Arg Ala Le |
| #u Asp Phe Ala Val Gly |
| 20 |
| # 25 |
| # 30 |
| gag tac aac aaa gcc agc aac gac atg tac ca |
| #c agc cgc gcg ctg cag 144 |
| Glu Tyr Asn Lys Ala Ser Asn Asp Met Tyr Hi |
| #s Ser Arg Ala Leu Gln |
| 35 |
| # 40 |
| # 45 |
| gtg gtg cgc gcc cgc aag cag atc gta gct gg |
| #g gtg aac tac ttc ttg 192 |
| Val Val Arg Ala Arg Lys Gln Ile Val Ala Gl |
| #y Val Asn Tyr Phe Leu |
| 50 |
| # 55 |
| # 60 |
| gac gtg gag ctg ggc cga acc acg tgt acc aa |
| #g acc cag ccc aac ttg 240 |
| Asp Val Glu Leu Gly Arg Thr Thr Cys Thr Ly |
| #s Thr Gln Pro Asn Leu |
| 65 |
| # 70 |
| # 75 |
| # 80 |
| gac aac tgc ccc ttc cat gac cag cca cat ct |
| #g aaa agg aaa gca ttc 288 |
| Asp Asn Cys Pro Phe His Asp Gln Pro His Le |
| #u Lys Arg Lys Ala Phe |
| 85 |
| # 90 |
| # 95 |
| tgc tct ttc cag atc tac gct gtg cct tgg ca |
| #g ggc aca atg acc ttg 336 |
| Cys Ser Phe Gln Ile Tyr Ala Val Pro Trp Gl |
| #n Gly Thr Met Thr Leu |
| 100 |
| # 105 |
| # 110 |
| tcg aaa tcc acc tgt cag gac gcc tag |
| # |
| # 363 |
| Ser Lys Ser Thr Cys Gln Asp Ala |
| 115 |
| # 120 |
| <210> SEQ ID NO 2 |
| <211> LENGTH: 120 |
| <212> TYPE: PRT |
| <213> ORGANISM: Homo sapiens |
| <400> SEQUENCE: 2 |
| Ser Ser Pro Gly Lys Pro Pro Arg Leu Val Gl |
| #y Gly Pro Met Asp Ala |
| 1 5 |
| # 10 |
| # 15 |
| Ser Val Glu Glu Glu Gly Val Arg Arg Ala Le |
| #u Asp Phe Ala Val Gly |
| 20 |
| # 25 |
| # 30 |
| Glu Tyr Asn Lys Ala Ser Asn Asp Met Tyr Hi |
| #s Ser Arg Ala Leu Gln |
| 35 |
| # 40 |
| # 45 |
| Val Val Arg Ala Arg Lys Gln Ile Val Ala Gl |
| #y Val Asn Tyr Phe Leu |
| 50 |
| # 55 |
| # 60 |
| Asp Val Glu Leu Gly Arg Thr Thr Cys Thr Ly |
| #s Thr Gln Pro Asn Leu |
| 65 |
| # 70 |
| # 75 |
| # 80 |
| Asp Asn Cys Pro Phe His Asp Gln Pro His Le |
| #u Lys Arg Lys Ala Phe |
| 85 |
| # 90 |
| # 95 |
| Cys Ser Phe Gln Ile Tyr Ala Val Pro Trp Gl |
| #n Gly Thr Met Thr Leu |
| 100 |
| # 105 |
| # 110 |
| Ser Lys Ser Thr Cys Gln Asp Ala |
| 115 |
| # 120 |
| <210> SEQ ID NO 3 |
| <211> LENGTH: 366 |
| <212> TYPE: DNA |
| <213> ORGANISM: Homo sapiens |
| <220> FEATURE: |
| <221> NAME/KEY: CDS |
| <222> LOCATION: (1)..(366) |
| <400> SEQUENCE: 3 |
| tcg agc tcc aag gag gag aat agg ata atc cc |
| #a ggt ggc atc tat gat 48 |
| Ser Ser Ser Lys Glu Glu Asn Arg Ile Ile Pr |
| #o Gly Gly Ile Tyr Asp |
| 1 5 |
| # 10 |
| # 15 |
| gca gac ctc aat gat gag tgg gta cag cgt gc |
| #c ctt cac ttc gcc atc 96 |
| Ala Asp Leu Asn Asp Glu Trp Val Gln Arg Al |
| #a Leu His Phe Ala Ile |
| 20 |
| # 25 |
| # 30 |
| agc gag tac aac aag gcc acc gaa gat gag ta |
| #c tac aga cgc ccg ctg 144 |
| Ser Glu Tyr Asn Lys Ala Thr Glu Asp Glu Ty |
| #r Tyr Arg Arg Pro Leu |
| 35 |
| # 40 |
| # 45 |
| cag gtg ctg cga gcc agg gag cag acc ttt gg |
| #g ggg gtg aat tac ttc 192 |
| Gln Val Leu Arg Ala Arg Glu Gln Thr Phe Gl |
| #y Gly Val Asn Tyr Phe |
| 50 |
| # 55 |
| # 60 |
| ttc gac gta gag gtg ggc cgc acc ata tgt ac |
| #c aag tcc cag ccc aac 240 |
| Phe Asp Val Glu Val Gly Arg Thr Ile Cys Th |
| #r Lys Ser Gln Pro Asn |
| 65 |
| # 70 |
| # 75 |
| # 80 |
| ttg gac acc tgt gcc ttc cat gaa cag cca ga |
| #a ctg cag aag aaa cag 288 |
| Leu Asp Thr Cys Ala Phe His Glu Gln Pro Gl |
| #u Leu Gln Lys Lys Gln |
| 85 |
| # 90 |
| # 95 |
| tta tgc tct ttc gag atc tac gaa gtt ccc tg |
| #g gag gac aga atg tcc 336 |
| Leu Cys Ser Phe Glu Ile Tyr Glu Val Pro Tr |
| #p Glu Asp Arg Met Ser |
| 100 |
| # 105 |
| # 110 |
| ctg gtg aat tcc agg tgt caa gaa gcc tag |
| # |
| # 366 |
| Leu Val Asn Ser Arg Cys Gln Glu Ala |
| 115 |
| # 120 |
| <210> SEQ ID NO 4 |
| <211> LENGTH: 121 |
| <212> TYPE: PRT |
| <213> ORGANISM: Homo sapiens |
| <400> SEQUENCE: 4 |
| Ser Ser Ser Lys Glu Glu Asn Arg Ile Ile Pr |
| #o Gly Gly Ile Tyr Asp |
| 1 5 |
| # 10 |
| # 15 |
| Ala Asp Leu Asn Asp Glu Trp Val Gln Arg Al |
| #a Leu His Phe Ala Ile |
| 20 |
| # 25 |
| # 30 |
| Ser Glu Tyr Asn Lys Ala Thr Glu Asp Glu Ty |
| #r Tyr Arg Arg Pro Leu |
| 35 |
| # 40 |
| # 45 |
| Gln Val Leu Arg Ala Arg Glu Gln Thr Phe Gl |
| #y Gly Val Asn Tyr Phe |
| 50 |
| # 55 |
| # 60 |
| Phe Asp Val Glu Val Gly Arg Thr Ile Cys Th |
| #r Lys Ser Gln Pro Asn |
| 65 |
| # 70 |
| # 75 |
| # 80 |
| Leu Asp Thr Cys Ala Phe His Glu Gln Pro Gl |
| #u Leu Gln Lys Lys Gln |
| 85 |
| # 90 |
| # 95 |
| Leu Cys Ser Phe Glu Ile Tyr Glu Val Pro Tr |
| #p Glu Asp Arg Met Ser |
| 100 |
| # 105 |
| # 110 |
| Leu Val Asn Ser Arg Cys Gln Glu Ala |
| 115 |
| # 120 |
| <210> SEQ ID NO 5 |
| <211> LENGTH: 366 |
| <212> TYPE: DNA |
| <213> ORGANISM: Homo sapiens |
| <220> FEATURE: |
| <221> NAME/KEY: CDS |
| <222> LOCATION: (1)..(366) |
| <400> SEQUENCE: 5 |
| tgg agc ccc aag gag gag gat agg ata atc cc |
| #g ggt ggc atc tat aac 48 |
| Trp Ser Pro Lys Glu Glu Asp Arg Ile Ile Pr |
| #o Gly Gly Ile Tyr Asn |
| 1 5 |
| # 10 |
| # 15 |
| gca gac ctc aat gat gag tgg gta cag cgt gc |
| #c ctt cac ttc gcc atc 96 |
| Ala Asp Leu Asn Asp Glu Trp Val Gln Arg Al |
| #a Leu His Phe Ala Ile |
| 20 |
| # 25 |
| # 30 |
| agc gag tat aac aag gcc acc aaa gat gac ta |
| #c tac aga cgt ccg ctg 144 |
| Ser Glu Tyr Asn Lys Ala Thr Lys Asp Asp Ty |
| #r Tyr Arg Arg Pro Leu |
| 35 |
| # 40 |
| # 45 |
| cgg gta cta aga gcc agg caa cag acc gtt gg |
| #g ggg gtg aat tac ttc 192 |
| Arg Val Leu Arg Ala Arg Gln Gln Thr Val Gl |
| #y Gly Val Asn Tyr Phe |
| 50 |
| # 55 |
| # 60 |
| ttc gac gta gag gtg ggc cga acc ata tgt ac |
| #c aag tcc cag ccc aac 240 |
| Phe Asp Val Glu Val Gly Arg Thr Ile Cys Th |
| #r Lys Ser Gln Pro Asn |
| 65 |
| # 70 |
| # 75 |
| # 80 |
| ttg gac acc tgt gcc ttc cat gaa cag cca ga |
| #a ctg cag aag aaa cag 288 |
| Leu Asp Thr Cys Ala Phe His Glu Gln Pro Gl |
| #u Leu Gln Lys Lys Gln |
| 85 |
| # 90 |
| # 95 |
| ttg tgc tct ttc gag atc tac gaa gtt ccc tg |
| #g gag aac aga agg tcc 336 |
| Leu Cys Ser Phe Glu Ile Tyr Glu Val Pro Tr |
| #p Glu Asn Arg Arg Ser |
| 100 |
| # 105 |
| # 110 |
| ctg gtg aaa tcc agg tgt caa gaa tcc tag |
| # |
| # 366 |
| Leu Val Lys Ser Arg Cys Gln Glu Ser |
| 115 |
| # 120 |
| <210> SEQ ID NO 6 |
| <211> LENGTH: 121 |
| <212> TYPE: PRT |
| <213> ORGANISM: Homo sapiens |
| <400> SEQUENCE: 6 |
| Trp Ser Pro Lys Glu Glu Asp Arg Ile Ile Pr |
| #o Gly Gly Ile Tyr Asn |
| 1 5 |
| # 10 |
| # 15 |
| Ala Asp Leu Asn Asp Glu Trp Val Gln Arg Al |
| #a Leu His Phe Ala Ile |
| 20 |
| # 25 |
| # 30 |
| Ser Glu Tyr Asn Lys Ala Thr Lys Asp Asp Ty |
| #r Tyr Arg Arg Pro Leu |
| 35 |
| # 40 |
| # 45 |
| Arg Val Leu Arg Ala Arg Gln Gln Thr Val Gl |
| #y Gly Val Asn Tyr Phe |
| 50 |
| # 55 |
| # 60 |
| Phe Asp Val Glu Val Gly Arg Thr Ile Cys Th |
| #r Lys Ser Gln Pro Asn |
| 65 |
| # 70 |
| # 75 |
| # 80 |
| Leu Asp Thr Cys Ala Phe His Glu Gln Pro Gl |
| #u Leu Gln Lys Lys Gln |
| 85 |
| # 90 |
| # 95 |
| Leu Cys Ser Phe Glu Ile Tyr Glu Val Pro Tr |
| #p Glu Asn Arg Arg Ser |
| 100 |
| # 105 |
| # 110 |
| Leu Val Lys Ser Arg Cys Gln Glu Ser |
| 115 |
| # 120 |
| <210> SEQ ID NO 7 |
| <211> LENGTH: 366 |
| <212> TYPE: DNA |
| <213> ORGANISM: Homo sapiens |
| <220> FEATURE: |
| <221> NAME/KEY: CDS |
| <222> LOCATION: (1)..(366) |
| <400> SEQUENCE: 7 |
| tgg agc ccc cag gag gag gac agg ata atc ga |
| #g ggt ggc atc tat gat 48 |
| Trp Ser Pro Gln Glu Glu Asp Arg Ile Ile Gl |
| #u Gly Gly Ile Tyr Asp |
| 1 5 |
| # 10 |
| # 15 |
| gca gac ctc aat gat gag cgg gta cag cgt gc |
| #c ctt cac ttt gtc atc 96 |
| Ala Asp Leu Asn Asp Glu Arg Val Gln Arg Al |
| #a Leu His Phe Val Ile |
| 20 |
| # 25 |
| # 30 |
| agc gag tat aac aag gcc act gaa gat gag ta |
| #c tac aga cgc ctg ctg 144 |
| Ser Glu Tyr Asn Lys Ala Thr Glu Asp Glu Ty |
| #r Tyr Arg Arg Leu Leu |
| 35 |
| # 40 |
| # 45 |
| cgg gtg cta cga gcc agg gag cag atc gtg gg |
| #c ggg gtg aat tac ttc 192 |
| Arg Val Leu Arg Ala Arg Glu Gln Ile Val Gl |
| #y Gly Val Asn Tyr Phe |
| 50 |
| # 55 |
| # 60 |
| ttc gac ata gag gtg ggc cga acc ata tgt ac |
| #c aag tcc cag ccc aac 240 |
| Phe Asp Ile Glu Val Gly Arg Thr Ile Cys Th |
| #r Lys Ser Gln Pro Asn |
| 65 |
| # 70 |
| # 75 |
| # 80 |
| ttg gac acc tgt gcc ttc cat gaa cag cca ga |
| #a ctg cag aag aaa cag 288 |
| Leu Asp Thr Cys Ala Phe His Glu Gln Pro Gl |
| #u Leu Gln Lys Lys Gln |
| 85 |
| # 90 |
| # 95 |
| ttg tgc tct ttc cag atc tac gaa gtt ccc tg |
| #g gag gac aga atg tcc 336 |
| Leu Cys Ser Phe Gln Ile Tyr Glu Val Pro Tr |
| #p Glu Asp Arg Met Ser |
| 100 |
| # 105 |
| # 110 |
| ctg gtg aat tcc agg tgt caa gaa gcc tag |
| # |
| # 366 |
| Leu Val Asn Ser Arg Cys Gln Glu Ala |
| 115 |
| # 120 |
| <210> SEQ ID NO 8 |
| <211> LENGTH: 121 |
| <212> TYPE: PRT |
| <213> ORGANISM: Homo sapiens |
| <400> SEQUENCE: 8 |
| Trp Ser Pro Gln Glu Glu Asp Arg Ile Ile Gl |
| #u Gly Gly Ile Tyr Asp |
| 1 5 |
| # 10 |
| # 15 |
| Ala Asp Leu Asn Asp Glu Arg Val Gln Arg Al |
| #a Leu His Phe Val Ile |
| 20 |
| # 25 |
| # 30 |
| Ser Glu Tyr Asn Lys Ala Thr Glu Asp Glu Ty |
| #r Tyr Arg Arg Leu Leu |
| 35 |
| # 40 |
| # 45 |
| Arg Val Leu Arg Ala Arg Glu Gln Ile Val Gl |
| #y Gly Val Asn Tyr Phe |
| 50 |
| # 55 |
| # 60 |
| Phe Asp Ile Glu Val Gly Arg Thr Ile Cys Th |
| #r Lys Ser Gln Pro Asn |
| 65 |
| # 70 |
| # 75 |
| # 80 |
| Leu Asp Thr Cys Ala Phe His Glu Gln Pro Gl |
| #u Leu Gln Lys Lys Gln |
| 85 |
| # 90 |
| # 95 |
| Leu Cys Ser Phe Gln Ile Tyr Glu Val Pro Tr |
| #p Glu Asp Arg Met Ser |
| 100 |
| # 105 |
| # 110 |
| Leu Val Asn Ser Arg Cys Gln Glu Ala |
| 115 |
| # 120 |
| <210> SEQ ID NO 9 |
| <211> LENGTH: 369 |
| <212> TYPE: DNA |
| <213> ORGANISM: Homo sapiens |
| <220> FEATURE: |
| <221> NAME/KEY: CDS |
| <222> LOCATION: (1)..(369) |
| <400> SEQUENCE: 9 |
| ggg agt gcc tcg gcc caa tct agg acc ttg gc |
| #a ggt ggc atc cat gcc 48 |
| Gly Ser Ala Ser Ala Gln Ser Arg Thr Leu Al |
| #a Gly Gly Ile His Ala |
| 1 5 |
| # 10 |
| # 15 |
| aca gac ctc aat gac aag agt gtg cag cgt gc |
| #c ctg gac ttt gcc atc 96 |
| Thr Asp Leu Asn Asp Lys Ser Val Gln Arg Al |
| #a Leu Asp Phe Ala Ile |
| 20 |
| # 25 |
| # 30 |
| agc gag tac aac aag gtc att aat aag gat ga |
| #g tac tac agc cgc cct 144 |
| Ser Glu Tyr Asn Lys Val Ile Asn Lys Asp Gl |
| #u Tyr Tyr Ser Arg Pro |
| 35 |
| # 40 |
| # 45 |
| ctg cag gtg atg gct gcc tac cag cag atc gt |
| #g ggt ggg gtg aac tac 192 |
| Leu Gln Val Met Ala Ala Tyr Gln Gln Ile Va |
| #l Gly Gly Val Asn Tyr |
| 50 |
| # 55 |
| # 60 |
| tac ttc aat gtg aag ttc ggt cga acc aca tg |
| #c acc aag tcc cag ccc 240 |
| Tyr Phe Asn Val Lys Phe Gly Arg Thr Thr Cy |
| #s Thr Lys Ser Gln Pro |
| 65 |
| # 70 |
| # 75 |
| # 80 |
| aac ttg gac aac tgt ccc ttc aat gac cag cc |
| #a aaa ctg aaa gag gaa 288 |
| Asn Leu Asp Asn Cys Pro Phe Asn Asp Gln Pr |
| #o Lys Leu Lys Glu Glu |
| 85 |
| # 90 |
| # 95 |
| gag ttc tgc tct ttc cag atc aat gaa gtt cc |
| #c tgg gag gat aaa att 336 |
| Glu Phe Cys Ser Phe Gln Ile Asn Glu Val Pr |
| #o Trp Glu Asp Lys Ile |
| 100 |
| # 105 |
| # 110 |
| tcc att ctg aac tac aag tgc cgg aaa gtc ta |
| #g |
| # 369 |
| Ser Ile Leu Asn Tyr Lys Cys Arg Lys Val |
| 115 |
| # 120 |
| <210> SEQ ID NO 10 |
| <211> LENGTH: 122 |
| <212> TYPE: PRT |
| <213> ORGANISM: Homo sapiens |
| <400> SEQUENCE: 10 |
| Gly Ser Ala Ser Ala Gln Ser Arg Thr Leu Al |
| #a Gly Gly Ile His Ala |
| 1 5 |
| # 10 |
| # 15 |
| Thr Asp Leu Asn Asp Lys Ser Val Gln Arg Al |
| #a Leu Asp Phe Ala Ile |
| 20 |
| # 25 |
| # 30 |
| Ser Glu Tyr Asn Lys Val Ile Asn Lys Asp Gl |
| #u Tyr Tyr Ser Arg Pro |
| 35 |
| # 40 |
| # 45 |
| Leu Gln Val Met Ala Ala Tyr Gln Gln Ile Va |
| #l Gly Gly Val Asn Tyr |
| 50 |
| # 55 |
| # 60 |
| Tyr Phe Asn Val Lys Phe Gly Arg Thr Thr Cy |
| #s Thr Lys Ser Gln Pro |
| 65 |
| # 70 |
| # 75 |
| # 80 |
| Asn Leu Asp Asn Cys Pro Phe Asn Asp Gln Pr |
| #o Lys Leu Lys Glu Glu |
| 85 |
| # 90 |
| # 95 |
| Glu Phe Cys Ser Phe Gln Ile Asn Glu Val Pr |
| #o Trp Glu Asp Lys Ile |
| 100 |
| # 105 |
| # 110 |
| Ser Ile Leu Asn Tyr Lys Cys Arg Lys Val |
| 115 |
| # 120 |
| <210> SEQ ID NO 11 |
| <211> LENGTH: 387 |
| <212> TYPE: DNA |
| <213> ORGANISM: Homo sapiens |
| <220> FEATURE: |
| <221> NAME/KEY: CDS |
| <222> LOCATION: (1)..(387) |
| <400> SEQUENCE: 11 |
| ctg cca cgc gat gcc cgg gcc cgg ccg cag ga |
| #g cgc atg gtc gga gaa 48 |
| Leu Pro Arg Asp Ala Arg Ala Arg Pro Gln Gl |
| #u Arg Met Val Gly Glu |
| 1 5 |
| # 10 |
| # 15 |
| ctc cgg gac ctg tcg ccc gac gac ccg cag gt |
| #g cag aag gcg gcg cag 96 |
| Leu Arg Asp Leu Ser Pro Asp Asp Pro Gln Va |
| #l Gln Lys Ala Ala Gln |
| 20 |
| # 25 |
| # 30 |
| gcg gcc gtg gcc agc tac aac atg ggc agc aa |
| #c agc atc tac tac ttc 144 |
| Ala Ala Val Ala Ser Tyr Asn Met Gly Ser As |
| #n Ser Ile Tyr Tyr Phe |
| 35 |
| # 40 |
| # 45 |
| cga gac acg cac atc atc aag gcg cag agc ca |
| #g ctg gtg gcc ggc atc 192 |
| Arg Asp Thr His Ile Ile Lys Ala Gln Ser Gl |
| #n Leu Val Ala Gly Ile |
| 50 |
| # 55 |
| # 60 |
| aag tac ttc ctg acg atg gag atg ggg agc ac |
| #a gac tgc cgc aag acc 240 |
| Lys Tyr Phe Leu Thr Met Glu Met Gly Ser Th |
| #r Asp Cys Arg Lys Thr |
| 65 |
| # 70 |
| # 75 |
| # 80 |
| agg gtc act gga gac cac gtc gac ctc acc ac |
| #t tgc ccc ctg gca gca 288 |
| Arg Val Thr Gly Asp His Val Asp Leu Thr Th |
| #r Cys Pro Leu Ala Ala |
| 85 |
| # 90 |
| # 95 |
| ggg gcg cag cag gag aag ctg cgc tgt gac tt |
| #t gag gtc ctt gtg gtt 336 |
| Gly Ala Gln Gln Glu Lys Leu Arg Cys Asp Ph |
| #e Glu Val Leu Val Val |
| 100 |
| # 105 |
| # 110 |
| ccc tgg cag aac tcc tct cag ctc cta aag ca |
| #c aac tgt gtg cag atg 384 |
| Pro Trp Gln Asn Ser Ser Gln Leu Leu Lys Hi |
| #s Asn Cys Val Gln Met |
| 115 |
| # 120 |
| # 125 |
| tga |
| # |
| # |
| # 387 |
| <210> SEQ ID NO 12 |
| <211> LENGTH: 128 |
| <212> TYPE: PRT |
| <213> ORGANISM: Homo sapiens |
| <400> SEQUENCE: 12 |
| Leu Pro Arg Asp Ala Arg Ala Arg Pro Gln Gl |
| #u Arg Met Val Gly Glu |
| 1 5 |
| # 10 |
| # 15 |
| Leu Arg Asp Leu Ser Pro Asp Asp Pro Gln Va |
| #l Gln Lys Ala Ala Gln |
| 20 |
| # 25 |
| # 30 |
| Ala Ala Val Ala Ser Tyr Asn Met Gly Ser As |
| #n Ser Ile Tyr Tyr Phe |
| 35 |
| # 40 |
| # 45 |
| Arg Asp Thr His Ile Ile Lys Ala Gln Ser Gl |
| #n Leu Val Ala Gly Ile |
| 50 |
| # 55 |
| # 60 |
| Lys Tyr Phe Leu Thr Met Glu Met Gly Ser Th |
| #r Asp Cys Arg Lys Thr |
| 65 |
| # 70 |
| # 75 |
| # 80 |
| Arg Val Thr Gly Asp His Val Asp Leu Thr Th |
| #r Cys Pro Leu Ala Ala |
| 85 |
| # 90 |
| # 95 |
| Gly Ala Gln Gln Glu Lys Leu Arg Cys Asp Ph |
| #e Glu Val Leu Val Val |
| 100 |
| # 105 |
| # 110 |
| Pro Trp Gln Asn Ser Ser Gln Leu Leu Lys Hi |
| #s Asn Cys Val Gln Met |
| 115 |
| # 120 |
| # 125 |
| <210> SEQ ID NO 13 |
| <211> LENGTH: 366 |
| <212> TYPE: DNA |
| <213> ORGANISM: Homo sapiens |
| <220> FEATURE: |
| <221> NAME/KEY: CDS |
| <222> LOCATION: (1)..(366) |
| <400> SEQUENCE: 13 |
| cgg ccg cag gag cgc atg gtc gga gaa ctc cg |
| #g gac ctg tcg ccc gac 48 |
| Arg Pro Gln Glu Arg Met Val Gly Glu Leu Ar |
| #g Asp Leu Ser Pro Asp |
| 1 5 |
| # 10 |
| # 15 |
| gac ccg cag gtg cag aag gcg gcg cag gcg gc |
| #c gtg gcc agc tac aac 96 |
| Asp Pro Gln Val Gln Lys Ala Ala Gln Ala Al |
| #a Val Ala Ser Tyr Asn |
| 20 |
| # 25 |
| # 30 |
| atg ggc agc aac agc atc tac tac ttc cga ga |
| #c acg cac atc atc aag 144 |
| Met Gly Ser Asn Ser Ile Tyr Tyr Phe Arg As |
| #p Thr His Ile Ile Lys |
| 35 |
| # 40 |
| # 45 |
| gcg cag agc cag ctg gtg gcc ggc atc aag ta |
| #c ttc ctg acg atg gag 192 |
| Ala Gln Ser Gln Leu Val Ala Gly Ile Lys Ty |
| #r Phe Leu Thr Met Glu |
| 50 |
| # 55 |
| # 60 |
| atg ggg agc aca gac tgc cgc aag acc agg gt |
| #c act gga gac cac gtc 240 |
| Met Gly Ser Thr Asp Cys Arg Lys Thr Arg Va |
| #l Thr Gly Asp His Val |
| 65 |
| # 70 |
| # 75 |
| # 80 |
| gac ctc acc act tgc ccc ctg gca gca ggg gc |
| #g cag cag gag aag ctg 288 |
| Asp Leu Thr Thr Cys Pro Leu Ala Ala Gly Al |
| #a Gln Gln Glu Lys Leu |
| 85 |
| # 90 |
| # 95 |
| cgc tgt gac ttt gag gtc ctt gtg gtt ccc tg |
| #g cag aac tcc tct cag 336 |
| Arg Cys Asp Phe Glu Val Leu Val Val Pro Tr |
| #p Gln Asn Ser Ser Gln |
| 100 |
| # 105 |
| # 110 |
| ctc cta aag cac aac tgt gtg cag atg tga |
| # |
| # 366 |
| Leu Leu Lys His Asn Cys Val Gln Met |
| 115 |
| # 120 |
| <210> SEQ ID NO 14 |
| <211> LENGTH: 121 |
| <212> TYPE: PRT |
| <213> ORGANISM: Homo sapiens |
| <400> SEQUENCE: 14 |
| Arg Pro Gln Glu Arg Met Val Gly Glu Leu Ar |
| #g Asp Leu Ser Pro Asp |
| 1 5 |
| # 10 |
| # 15 |
| Asp Pro Gln Val Gln Lys Ala Ala Gln Ala Al |
| #a Val Ala Ser Tyr Asn |
| 20 |
| # 25 |
| # 30 |
| Met Gly Ser Asn Ser Ile Tyr Tyr Phe Arg As |
| #p Thr His Ile Ile Lys |
| 35 |
| # 40 |
| # 45 |
| Ala Gln Ser Gln Leu Val Ala Gly Ile Lys Ty |
| #r Phe Leu Thr Met Glu |
| 50 |
| # 55 |
| # 60 |
| Met Gly Ser Thr Asp Cys Arg Lys Thr Arg Va |
| #l Thr Gly Asp His Val |
| 65 |
| # 70 |
| # 75 |
| # 80 |
| Asp Leu Thr Thr Cys Pro Leu Ala Ala Gly Al |
| #a Gln Gln Glu Lys Leu |
| 85 |
| # 90 |
| # 95 |
| Arg Cys Asp Phe Glu Val Leu Val Val Pro Tr |
| #p Gln Asn Ser Ser Gln |
| 100 |
| # 105 |
| # 110 |
| Leu Leu Lys His Asn Cys Val Gln Met |
| 115 |
| # 120 |
| <210> SEQ ID NO 15 |
| <211> LENGTH: 351 |
| <212> TYPE: DNA |
| <213> ORGANISM: Gallus sp. |
| <220> FEATURE: |
| <221> NAME/KEY: CDS |
| <222> LOCATION: (1)..(351) |
| <400> SEQUENCE: 15 |
| agc gag gac cgc tcc cgg ctc ctg ggg gct cca gtg cct gta gat gag 48 |
| Ser Glu Asp Arg Ser Arg Leu Leu Gly Ala Pro Val Pro Val Asp Glu |
| 1 5 10 15 |
| aac gac gag ggc ttg caa cgg gcc ctg cag ttc gcg atg gcc gag tac 96 |
| Asn Asp Glu Gly Leu Gln Arg Ala Leu Gln Phe Ala Met Ala Glu Tyr |
| 20 25 30 |
| aac agg gcc agc aac gat aag tac tcc agc cgg gtg gtg cgg gtc atc 144 |
| Asn Arg Ala Ser Asn Asp Lys Tyr Ser Ser Arg Val Val Arg Val Ile |
| 35 40 45 |
| agc gcc aag cgg cag ctc gtg tct gga atc aag tac atc ctg cag gtt 192 |
| Ser Ala Lys Arg Gln Leu Val Ser Gly Ile Lys Tyr Ile Leu Gln Val |
| 50 55 60 |
| gag att ggt cgc aca act tgc ccc aag tca tca ggt gat ctc cag agc 240 |
| Glu Ile Gly Arg Thr Thr Cys Pro Lys Ser Ser Gly Asp Leu Gln Ser |
| 65 70 75 80 |
| tgc gaa ttc cac gat gag cca gag atg gct aag tat acc aca tgc acc 288 |
| Cys Glu Phe His Asp Glu Pro Glu Met Ala Lys Tyr Thr Thr Cys Thr |
| 85 90 95 |
| ttt gta gtg tac agt att cct tgg cta aac caa att aaa ctg ctg gaa 336 |
| Phe Val Val Tyr Ser Ile Pro Trp Leu Asn Gln Ile Lys Leu Leu Glu |
| 100 105 110 |
| agc aag tgc cag taa 351 |
| Ser Lys Cys Gln |
| 115 |
| <210> SEQ ID NO 16 |
| <211> LENGTH: 116 |
| <212> TYPE: PRT |
| <213> ORGANISM: Gallus sp. |
| <400> SEQUENCE: 16 |
| Ser Glu Asp Arg Ser Arg Leu Leu Gly Ala Pro Val Pro Val Asp Glu |
| 1 5 10 15 |
| Asn Asp Glu Gly Leu Gln Arg Ala Leu Gln Phe Ala Met Ala Glu Tyr |
| 20 25 30 |
| Asn Arg Ala Ser Asn Asp Lys Tyr Ser Ser Arg Val Val Arg Val Ile |
| 35 40 45 |
| Ser Ala Lys Arg Gln Leu Val Ser Gly Ile Lys Tyr Ile Leu Gln Val |
| 50 55 60 |
| Glu Ile Gly Arg Thr Thr Cys Pro Lys Ser Ser Gly Asp Leu Gln Ser |
| 65 70 75 80 |
| Cys Glu Phe His Asp Glu Pro Glu Met Ala Lys Tyr Thr Thr Cys Thr |
| 85 90 95 |
| Phe Val Val Tyr Ser Ile Pro Trp Leu Asn Gln Ile Lys Leu Leu Glu |
| 100 105 110 |
| Ser Lys Cys Gln |
| 115 |
| <210> SEQ ID NO 17 |
| <211> LENGTH: 336 |
| <212> TYPE: DNA |
| <213> ORGANISM: Cyprinus carpio |
| <220> FEATURE: |
| <221> NAME/KEY: CDS |
| <222> LOCATION: (1)..(336) |
| <400> SEQUENCE: 17 |
| act ggg att cct gga ggc ctt gta gat gca gac att aac gat aaa gat 48 |
| Thr Gly Ile Pro Gly Gly Leu Val Asp Ala Asp Ile Asn Asp Lys Asp |
| 1 5 10 15 |
| gtt cag aag gcg tta cgc ttc gca gtg gac cat tac aac ggc caa agc 96 |
| Val Gln Lys Ala Leu Arg Phe Ala Val Asp His Tyr Asn Gly Gln Ser |
| 20 25 30 |
| aac gat gcg ttt gtg cgt aaa gtt tcc aaa gta atc aag gtt caa caa 144 |
| Asn Asp Ala Phe Val Arg Lys Val Ser Lys Val Ile Lys Val Gln Gln |
| 35 40 45 |
| caa gtt gcc gct ggc atg aaa tac atc ttc act gtg aag atg gaa gta 192 |
| Gln Val Ala Ala Gly Met Lys Tyr Ile Phe Thr Val Lys Met Glu Val |
| 50 55 60 |
| gcc tcc tgc aaa aag ggt gga gtt aag acc atg tgt gcc gtt ccg aag 240 |
| Ala Ser Cys Lys Lys Gly Gly Val Lys Thr Met Cys Ala Val Pro Lys |
| 65 70 75 80 |
| aat ccc agt att gaa cag gtc att cag tgc aaa ata acg gtc tgg agc 288 |
| Asn Pro Ser Ile Glu Gln Val Ile Gln Cys Lys Ile Thr Val Trp Ser |
| 85 90 95 |
| cag cca tgg tta aac tcc ttg aaa gtc act gaa aac acc tgc atg tag 336 |
| Gln Pro Trp Leu Asn Ser Leu Lys Val Thr Glu Asn Thr Cys Met |
| 100 105 110 |
| <210> SEQ ID NO 18 |
| <211> LENGTH: 111 |
| <212> TYPE: PRT |
| <213> ORGANISM: Cyprinus carpio |
| <400> SEQUENCE: 18 |
| Thr Gly Ile Pro Gly Gly Leu Val Asp Ala Asp Ile Asn Asp Lys Asp |
| 1 5 10 15 |
| Val Gln Lys Ala Leu Arg Phe Ala Val Asp His Tyr Asn Gly Gln Ser |
| 20 25 30 |
| Asn Asp Ala Phe Val Arg Lys Val Ser Lys Val Ile Lys Val Gln Gln |
| 35 40 45 |
| Gln Val Ala Ala Gly Met Lys Tyr Ile Phe Thr Val Lys Met Glu Val |
| 50 55 60 |
| Ala Ser Cys Lys Lys Gly Gly Val Lys Thr Met Cys Ala Val Pro Lys |
| 65 70 75 80 |
| Asn Pro Ser Ile Glu Gln Val Ile Gln Cys Lys Ile Thr Val Trp Ser |
| 85 90 95 |
| Gln Pro Trp Leu Asn Ser Leu Lys Val Thr Glu Asn Thr Cys Met |
| 100 105 110 |
| <210> SEQ ID NO 19 |
| <211> LENGTH: 336 |
| <212> TYPE: DNA |
| <213> ORGANISM: Oncorhynchus keta |
| <220> FEATURE: |
| <221> NAME/KEY: CDS |
| <222> LOCATION: (1)..(336) |
| <400> SEQUENCE: 19 |
| ggt ttg gtc gga ggc ccc atg gac gca aat atg aac gac caa gga acg 48 |
| Gly Leu Val Gly Gly Pro Met Asp Ala Asn Met Asn Asp Gln Gly Thr |
| 1 5 10 15 |
| aga gac gcc ctg cag ttc gcg gtg gtc gaa cac aac aag aaa aca aac 96 |
| Arg Asp Ala Leu Gln Phe Ala Val Val Glu His Asn Lys Lys Thr Asn |
| 20 25 30 |
| gac atg ttt gtc agg cag gtg gcc aag gtt gtc aat gca cag aaa cag 144 |
| Asp Met Phe Val Arg Gln Val Ala Lys Val Val Asn Ala Gln Lys Gln |
| 35 40 45 |
| gtg gta tct ggg atg aag tac atc ttc aca gtg cag atg ggc agg acc 192 |
| Val Val Ser Gly Met Lys Tyr Ile Phe Thr Val Gln Met Gly Arg Thr |
| 50 55 60 |
| cca tgc agg aag gga ggt gtt gag aag atc tgc tcc gtg cac aaa gac 240 |
| Pro Cys Arg Lys Gly Gly Val Glu Lys Ile Cys Ser Val His Lys Asp |
| 65 70 75 80 |
| ccg cag atg gct gtg ccc tac aag tgc acc ttc gag gtg tgg agc cgc 288 |
| Pro Gln Met Ala Val Pro Tyr Lys Cys Thr Phe Glu Val Trp Ser Arg |
| 85 90 95 |
| ccc tgg atg agc gat atc cag atg gtc aag aac cag tgt gaa agt taa 336 |
| Pro Trp Met Ser Asp Ile Gln Met Val Lys Asn Gln Cys Glu Ser |
| 100 105 110 |
| <210> SEQ ID NO 20 |
| <211> LENGTH: 111 |
| <212> TYPE: PRT |
| <213> ORGANISM: Oncorhynchus keta |
| <400> SEQUENCE: 20 |
| Gly Leu Val Gly Gly Pro Met Asp Ala Asn Met Asn Asp Gln Gly Thr |
| 1 5 10 15 |
| Arg Asp Ala Leu Gln Phe Ala Val Val Glu His Asn Lys Lys Thr Asn |
| 20 25 30 |
| Asp Met Phe Val Arg Gln Val Ala Lys Val Val Asn Ala Gln Lys Gln |
| 35 40 45 |
| Val Val Ser Gly Met Lys Tyr Ile Phe Thr Val Gln Met Gly Arg Thr |
| 50 55 60 |
| Pro Cys Arg Lys Gly Gly Val Glu Lys Ile Cys Ser Val His Lys Asp |
| 65 70 75 80 |
| Pro Gln Met Ala Val Pro Tyr Lys Cys Thr Phe Glu Val Trp Ser Arg |
| 85 90 95 |
| Pro Trp Met Ser Asp Ile Gln Met Val Lys Asn Gln Cys Glu Ser |
| 100 105 110 |
| <210> SEQ ID NO 21 |
| <211> LENGTH: 336 |
| <212> TYPE: DNA |
| <213> ORGANISM: Oncorhynchus mykiss |
| <220> FEATURE: |
| <221> NAME/KEY: CDS |
| <222> LOCATION: (1)..(336) |
| <400> SEQUENCE: 21 |
| ggt ttg atc gga ggc ccc atg gac gca aat atg aac gac caa gga acg 48 |
| Gly Leu Ile Gly Gly Pro Met Asp Ala Asn Met Asn Asp Gln Gly Thr |
| 1 5 10 15 |
| aga gac gcc ctg cag ttc gcg gtg gtc gaa cac aac aag aaa aca aac 96 |
| Arg Asp Ala Leu Gln Phe Ala Val Val Glu His Asn Lys Lys Thr Asn |
| 20 25 30 |
| gac atg ttt gtc agg cag gtg gcc aag gtt gtc aat gca cag aag cag 144 |
| Asp Met Phe Val Arg Gln Val Ala Lys Val Val Asn Ala Gln Lys Gln |
| 35 40 45 |
| gtg gta tct ggg atg aag tac atc ttc aca gtg cag atg ggc agg acc 192 |
| Val Val Ser Gly Met Lys Tyr Ile Phe Thr Val Gln Met Gly Arg Thr |
| 50 55 60 |
| cca tgc agg aag gga ggt gtt gag aag gtc tgc tcc gtg cac aag gac 240 |
| Pro Cys Arg Lys Gly Gly Val Glu Lys Val Cys Ser Val His Lys Asp |
| 65 70 75 80 |
| cca cag atg gct gtg ccc tac aag tgc acc ttc gag gtg tgg agc cgc 288 |
| Pro Gln Met Ala Val Pro Tyr Lys Cys Thr Phe Glu Val Trp Ser Arg |
| 85 90 95 |
| ccc tgg atg agc gat atc cag atg gtc aag aac cag tgt gaa agt taa 336 |
| Pro Trp Met Ser Asp Ile Gln Met Val Lys Asn Gln Cys Glu Ser |
| 100 105 110 |
| <210> SEQ ID NO 22 |
| <211> LENGTH: 111 |
| <212> TYPE: PRT |
| <213> ORGANISM: Oncorhynchus mykiss |
| <400> SEQUENCE: 22 |
| Gly Leu Ile Gly Gly Pro Met Asp Ala Asn Met Asn Asp Gln Gly Thr |
| 1 5 10 15 |
| Arg Asp Ala Leu Gln Phe Ala Val Val Glu His Asn Lys Lys Thr Asn |
| 20 25 30 |
| Asp Met Phe Val Arg Gln Val Ala Lys Val Val Asn Ala Gln Lys Gln |
| 35 40 45 |
| Val Val Ser Gly Met Lys Tyr Ile Phe Thr Val Gln Met Gly Arg Thr |
| 50 55 60 |
| Pro Cys Arg Lys Gly Gly Val Glu Lys Val Cys Ser Val His Lys Asp |
| 65 70 75 80 |
| Pro Gln Met Ala Val Pro Tyr Lys Cys Thr Phe Glu Val Trp Ser Arg |
| 85 90 95 |
| Pro Trp Met Ser Asp Ile Gln Met Val Lys Asn Gln Cys Glu Ser |
| 100 105 110 |
| <210> SEQ ID NO 23 |
| <211> LENGTH: 357 |
| <212> TYPE: DNA |
| <213> ORGANISM: Bos taurus |
| <220> FEATURE: |
| <221> NAME/KEY: CDS |
| <222> LOCATION: (1)..(357) |
| <400> SEQUENCE: 23 |
| cag ggc cct agg aag ggt cgc ctg ctg ggc ggc ctg atg gag gcg gac 48 |
| Gln Gly Pro Arg Lys Gly Arg Leu Leu Gly Gly Leu Met Glu Ala Asp |
| 1 5 10 15 |
| gtc aat gag gag ggc gtg cag gag gcg ctg tcc ttt gcg gtc agc gag 96 |
| Val Asn Glu Glu Gly Val Gln Glu Ala Leu Ser Phe Ala Val Ser Glu |
| 20 25 30 |
| ttc aac aag cgg agc aac gac gct tac cag agc cgc gtg gtg cgc gtg 144 |
| Phe Asn Lys Arg Ser Asn Asp Ala Tyr Gln Ser Arg Val Val Arg Val |
| 35 40 45 |
| gtg cgc gcc cgc aag cag gtc gtg tca ggg atg aac tat ttc ttg gac 192 |
| Val Arg Ala Arg Lys Gln Val Val Ser Gly Met Asn Tyr Phe Leu Asp |
| 50 55 60 |
| gtg gag ctt ggc cgg act aca tgt acc aag tcc cag gcc aac ttt gac 240 |
| Val Glu Leu Gly Arg Thr Thr Cys Thr Lys Ser Gln Ala Asn Phe Asp |
| 65 70 75 80 |
| agc tgt ccc ttc cat aac cag ccg cac ctg aag agg gaa aag ctg tgc 288 |
| Ser Cys Pro Phe His Asn Gln Pro His Leu Lys Arg Glu Lys Leu Cys |
| 85 90 95 |
| tcc ttc cag gtt tac gtc gtc cca tgg atg aac acc atc aac ctg gtg 336 |
| Ser Phe Gln Val Tyr Val Val Pro Trp Met Asn Thr Ile Asn Leu Val |
| 100 105 110 |
| aag ttt agc tgc cag gat taa 357 |
| Lys Phe Ser Cys Gln Asp |
| 115 |
| <210> SEQ ID NO 24 |
| <211> LENGTH: 118 |
| <212> TYPE: PRT |
| <213> ORGANISM: Bos taurus |
| <400> SEQUENCE: 24 |
| Gln Gly Pro Arg Lys Gly Arg Leu Leu Gly Gly Leu Met Glu Ala Asp |
| 1 5 10 15 |
| Val Asn Glu Glu Gly Val Gln Glu Ala Leu Ser Phe Ala Val Ser Glu |
| 20 25 30 |
| Phe Asn Lys Arg Ser Asn Asp Ala Tyr Gln Ser Arg Val Val Arg Val |
| 35 40 45 |
| Val Arg Ala Arg Lys Gln Val Val Ser Gly Met Asn Tyr Phe Leu Asp |
| 50 55 60 |
| Val Glu Leu Gly Arg Thr Thr Cys Thr Lys Ser Gln Ala Asn Phe Asp |
| 65 70 75 80 |
| Ser Cys Pro Phe His Asn Gln Pro His Leu Lys Arg Glu Lys Leu Cys |
| 85 90 95 |
| Ser Phe Gln Val Tyr Val Val Pro Trp Met Asn Thr Ile Asn Leu Val |
| 100 105 110 |
| Lys Phe Ser Cys Gln Asp |
| 115 |
| <210> SEQ ID NO 25 |
| <211> LENGTH: 115 |
| <212> TYPE: DNA |
| <213> ORGANISM: Artificial Sequence |
| <220> FEATURE: |
| <223> OTHER INFORMATION: Description of Artificial Sequence: first of four oligonucleotides used to create a nucleotide coding for modified human cystatin C |
| <400> SEQUENCE: 25 |
| gtatctctcg agaaaagatc ttctccaggt aagccaccaa gattggtcgg tggtccaatg 60 |
| gacgcctctg tcgaggagga gggtgtcaga agagccttgg acttcgccgt cggtg 115 |
| <210> SEQ ID NO 26 |
| <211> LENGTH: 115 |
| <212> TYPE: DNA |
| <213> ORGANISM: Artificial Sequence |
| <220> FEATURE: |
| <223> OTHER INFORMATION: Description of Artificial Sequence: the second of four oligonucleotides used to create a nucleotide coding for synthetic human cystatin C |
| <400> SEQUENCE: 26 |
| caagaagtag ttgacaccgg cgacaatttg ctttctggct ctgacgactt gcaaggctct 60 |
| ggagtggtac atgtcgttag aggccttgtt gtactcaccg acggcgaagt ccaag 115 |
| <210> SEQ ID NO 27 |
| <211> LENGTH: 115 |
| <212> TYPE: DNA |
| <213> ORGANISM: Artificial Sequence |
| <220> FEATURE: |
| <223> OTHER INFORMATION: Description of Artificial Sequence: the third of four oligonucleotides used to create a nucleotide coding for synthetic human cystatin C |
| <400> SEQUENCE: 27 |
| caaattgtcg ccggtgtcaa ctacttcttg gacgttgagt tgggtagaac tacttgtact 60 |
| aagactcaac caaacttgac taactgtcca ttccacgacc aaccacactt gaaga 115 |
| <210> SEQ ID NO 28 |
| <211> LENGTH: 115 |
| <212> TYPE: DNA |
| <213> ORGANISM: Artificial Sequence |
| <220> FEATURE: |
| <223> OTHER INFORMATION: Description of Artificial Sequence: the fourth of four oligonucleotides used to create a nucleotide coding for synthetic human cystatin C |
| <400> SEQUENCE: 28 |
| tgttctagat caggcgtctt gacaagtaga cttagacaaa gtcatagtac cttgccatgg 60 |
| gacggcgtaa atttggaaag aacagaaggc ctttctcttc aagtgtggtt ggtcg 115 |
| <210> SEQ ID NO 29 |
| <211> LENGTH: 30 |
| <212> TYPE: DNA |
| <213> ORGANISM: Artificial Sequence |
| <220> FEATURE: |
| <223> OTHER INFORMATION: Description of Artificial Sequence: forward primer used in site-directed mutagenesis to introduce a glycosylation site at residue 35 of a modified human cystatin C |
| <400> SEQUENCE: 29 |
| ggtgagtaca acaagtcctc taacgacatg 30 |
| <210> SEQ ID NO 30 |
| <211> LENGTH: 30 |
| <212> TYPE: DNA |
| <213> ORGANISM: Artificial Sequence |
| <220> FEATURE: |
| <223> OTHER INFORMATION: Description of Artificial Sequence: reverse primer used in site-directed mutagenesis to introduce a glycosylation site at residue 35 of a modified human cystatin C |
| <400> SEQUENCE: 30 |
| catgtcgtta gaggacttgt tgtactcacc 30 |
| <210> SEQ ID NO 31 |
| <211> LENGTH: 30 |
| <212> TYPE: DNA |
| <213> ORGANISM: Artificial Sequence |
| <220> FEATURE: |
| <223> OTHER INFORMATION: Description of Artificial Sequence: forward primer used in site-directed mutagenesis to introduce a glycosylation site at residue 36 of a modified human cystatin C |
| <400> SEQUENCE: 31 |
| ggtgagtaca acaacgcctc taacgacatg 30 |
| <210> SEQ ID NO 32 |
| <211> LENGTH: 30 |
| <212> TYPE: DNA |
| <213> ORGANISM: Artificial Sequence |
| <220> FEATURE: |
| <223> OTHER INFORMATION: Description of Artificial Sequence: reverse primer used in site-directed mutagenesis to introduce a glycosylation site at residue 36 of a modified human cystatin C |
| <400> SEQUENCE: 32 |
| catgtcgtta gaggcgttgt tgtactcacc 30 |
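The mutagenesis primers in SEQ ID NOs 29 to 32 can be sanity-checked with a short script. The sketch below is illustrative only and is not part of the patent: it tests whether each reverse primer is the reverse complement of its forward primer, and translates the forward primers with the standard genetic code. It assumes that each forward primer is read in frame from its first base; the listing does not state the reading frame. The helper names (`reverse_complement`, `translate`) are hypothetical.

```python
# Primer sequences copied from SEQ ID NOs 29-32 of the sequence listing
# (spaces between the 10-base groups removed).
PRIMERS = {
    29: "ggtgagtacaacaagtcctctaacgacatg",
    30: "catgtcgttagaggacttgttgtactcacc",
    31: "ggtgagtacaacaacgcctctaacgacatg",
    32: "catgtcgttagaggcgttgttgtactcacc",
}

# Standard genetic code, with codons enumerated in t, c, a, g order
# at each of the three positions.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a lowercase DNA string."""
    return dna.translate(str.maketrans("acgt", "tgca"))[::-1]


def translate(dna: str) -> str:
    """Translate a lowercase DNA string in frame 0 (ASSUMED frame)."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


for fwd, rev in ((29, 30), (31, 32)):
    is_pair = reverse_complement(PRIMERS[fwd]) == PRIMERS[rev]
    print(f"SEQ ID NO {fwd}/{rev}: reverse complements = {is_pair}; "
          f"forward primer encodes {translate(PRIMERS[fwd])}")
```

Under the frame assumption, the two forward primers differ in the residues following the first Asn, which is where the residue 35 and residue 36 glycosylation sites named in the primer descriptions would be expected to lie.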
Claims (4)
1. A modified human cystatin C comprising at least one modification of native human cystatin C (SEQ ID NO:2) selected from the group consisting of: Lys (36) Asn, Ala (37) Ser, Ala (37) Thr, Asp (81) Ser, and Asp (81) Thr.
2. The modified human cystatin C of claim 1, wherein said at least one modification increases the heat stability of the modified human cystatin C.
3. The modified human cystatin C of claim 1, wherein said at least one modification is Ala (37) Ser or Ala (37) Thr, and Asp (81) Ser or Asp (81) Thr.
4. The modified human cystatin C of claim 1, wherein said at least one modification is Ala (37) Ser or Asp (81) Thr.
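The substitutions recited in the claims are consistent with creating N-linked glycosylation consensus motifs (Asn-X-Ser/Thr, where X is any residue except Pro). The sketch below is a hedged illustration, not the patentees' method: it parses the claim notation (for example `Lys (36) Asn`), applies it to a short sequence window, and reports where such motifs occur. The ten-residue wild-type window and its starting position of 32 are assumptions inferred from the mutagenesis primers and from the residue identities named in claim 1; they are not copied from SEQ ID NO:2. The function names are hypothetical.

```python
import re

# Three-letter to one-letter codes for the residues named in the claims.
THREE_TO_ONE = {"Ala": "A", "Asn": "N", "Asp": "D",
                "Lys": "K", "Ser": "S", "Thr": "T"}

# ASSUMPTION: wild-type residues 32-41, inferred from the primers
# (SEQ ID NOs 29 and 31) and claim 1; not taken from SEQ ID NO:2.
WINDOW = "GEYNKASNDM"
WINDOW_START = 32


def apply_substitution(window: str, start: int, notation: str) -> str:
    """Apply a substitution written like 'Lys (36) Asn' to the window."""
    match = re.fullmatch(
        r"\s*([A-Z][a-z]{2})\s*\((\d+)\)\s*([A-Z][a-z]{2})\s*", notation)
    if match is None:
        raise ValueError(f"unrecognised notation: {notation!r}")
    old = THREE_TO_ONE[match.group(1)]
    position = int(match.group(2))
    new = THREE_TO_ONE[match.group(3)]
    index = position - start
    if not 0 <= index < len(window):
        raise ValueError(f"residue {position} lies outside the window")
    if window[index] != old:
        raise ValueError(f"residue {position} is {window[index]}, not {old}")
    return window[:index] + new + window[index + 1:]


def sequon_positions(window: str, start: int) -> list:
    """Residue numbers of Asn in Asn-X-Ser/Thr motifs (X is not Pro)."""
    return [start + m.start() for m in re.finditer(r"(?=N[^P][ST])", window)]


print("wild type:", WINDOW, sequon_positions(WINDOW, WINDOW_START))
for notation in ("Lys (36) Asn", "Ala (37) Ser", "Ala (37) Thr"):
    mutant = apply_substitution(WINDOW, WINDOW_START, notation)
    print(notation, "->", mutant, sequon_positions(mutant, WINDOW_START))
```

The Asp (81) substitutions fall outside this assumed window, so `apply_substitution` raises a `ValueError` for them; checking them would require the full SEQ ID NO:2 sequence.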
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US09/775,932 US6534477B2 (en) | 1998-08-05 | 2001-02-02 | Production and use of modified cystatins |
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US9550398P | 1998-08-05 | 1998-08-05 | |
| PCT/CA1999/000717 WO2000008159A2 (en) | 1998-08-05 | 1999-08-05 | Production and use of modified cystatins |
| US09/775,932 US6534477B2 (en) | 1998-08-05 | 2001-02-02 | Production and use of modified cystatins |
Related Parent Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/CA1999/000717 Continuation WO2000008159A2 (en) | 1998-08-05 | 1999-08-05 | Production and use of modified cystatins |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| US20020137671A1 (en) | 2002-09-26 |
| US6534477B2 (en) | 2003-03-18 |
Family
ID=22252301
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US09/775,932 Expired - Fee Related US6534477B2 (en) | 1998-08-05 | 2001-02-02 | Production and use of modified cystatins |
Country Status (5)
| Country | Link |
|---|---|
| US (1) | US6534477B2 (en) |
| EP (1) | EP1123399A2 (en) |
| AU (1) | AU5144299A (en) |
| CA (1) | CA2335344A1 (en) |
| WO (1) | WO2000008159A2 (en) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20050267021A1 (en) * | 2003-10-15 | 2005-12-01 | Schiemann William P | Cystatin C as an antagonist of TGF-beta and methods related thereto |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6436389B1 (en) * | 1998-12-11 | 2002-08-20 | The Salk Institute For Biological Studies | Stimulation of cell proliferation by glycosylated cystatin C |
| GB0009124D0 (en) * | 2000-04-14 | 2000-05-31 | Univ Belfast | Agent against periodontal disease |
| ATE406174T1 (en) | 2000-04-14 | 2008-09-15 | Philip-John Lamey | TREATMENT OF MIGRAINES AND VASODILATION |
| CN104039962B (en) * | 2012-01-09 | 2016-06-22 | 苏州工业园区为真生物医药科技有限公司 | The mark of breast cancer diagnosis and indication |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO1988009384A1 (en) * | 1987-05-22 | 1988-12-01 | Novo-Nordisk A/S | Method of producing cystatin c or modifications hereof and dna-sequence for use when carrying out the method |
| JPH01124389A (en) | 1987-07-24 | 1989-05-17 | Gruenenthal Gmbh | DNA sequences which encode protein having biological property of cystatin C, preparation of these DNA sequences, manifestation of cystatin C and pharmaceutical compound containing it |
| JPH01202287A (en) | 1988-02-05 | 1989-08-15 | Otsuka Pharmaceut Factory Inc | Cystatin C chemical synthesis gene, corresponding plasmid recombinant, corresponding transformant, and method for producing cystatin C |
| US4902509A (en) | 1985-01-16 | 1990-02-20 | Krka, Tovarna Zdravil, N.Sol.O. | Process for the isolation of chicken egg cystatin, antiviral agents containing it and its use as viral protease inhibitor |
| US5432264A (en) | 1987-05-22 | 1995-07-11 | Novo Nordisk A/S | Recombinant 3-des-OH-cystatin C produced by expression in a procaryotic host cell |
| WO1996016173A2 (en) | 1994-11-21 | 1996-05-30 | The University Of Leeds | Modified proteinase inhibitors |
1999
- 1999-08-05: CA application CA002335344A, published as CA2335344A1 (status: Abandoned)
- 1999-08-05: EP application EP99936211A, published as EP1123399A2 (status: Withdrawn)
- 1999-08-05: AU application AU51442/99A, published as AU5144299A (status: Abandoned)
- 1999-08-05: WO application PCT/CA1999/000717, published as WO2000008159A2 (status: Application Discontinuation)

2001
- 2001-02-02: US application US09/775,932, published as US6534477B2 (status: Expired - Fee Related)
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4902509A (en) | 1985-01-16 | 1990-02-20 | Krka, Tovarna Zdravil, N.Sol.O. | Process for the isolation of chicken egg cystatin, antiviral agents containing it and its use as viral protease inhibitor |
| WO1988009384A1 (en) * | 1987-05-22 | 1988-12-01 | Novo-Nordisk A/S | Method of producing cystatin c or modifications hereof and dna-sequence for use when carrying out the method |
| US5432264A (en) | 1987-05-22 | 1995-07-11 | Novo Nordisk A/S | Recombinant 3-des-OH-cystatin C produced by expression in a procaryotic host cell |
| JPH01124389A (en) | 1987-07-24 | 1989-05-17 | Gruenenthal Gmbh | DNA sequences which encode protein having biological property of cystatin C, preparation of these DNA sequences, manifestation of cystatin C and pharmaceutical compound containing it |
| JPH01202287A (en) | 1988-02-05 | 1989-08-15 | Otsuka Pharmaceut Factory Inc | Cystatin C chemical synthesis gene, corresponding plasmid recombinant, corresponding transformant, and method for producing cystatin C |
| WO1996016173A2 (en) | 1994-11-21 | 1996-05-30 | The University Of Leeds | Modified proteinase inhibitors |
Non-Patent Citations (18)
| Title |
|---|
| Abrahamson M. et al. (1988) FEBS Letters 236 (1): 14-18. |
| Barrett AJ. (1987) TIBS 12: 193-196. |
| Barrett AJ. et al. Methods Enzymol. (1981) 80:771-778. |
| Barrett AJ. et al. The Biochemical Journal (1986) Letters 236 (1):312. |
| Ekiel I. et al. J Mol. Biol. (1997) 271(2):266-77. |
| International Search Report WO 00/08159. |
| Li F. et al. Comparative Biochemistry and Physiology. Part B, Biochemistry and Molecular Biology (1998) 121(2):135-43. |
| Nakamura S. et al. FEBS Letters (1996) 383:251-254. |
| Nakamura S. et al., Abstract for poster presented at the Canadian Institute of Food Science and Technology annual conference (dated Aug. 18, 1996). |
| Nakamura S. et al., FEBS Letters (1993) 328(3):259-262. |
| Nakamura S. et al., FEBS Letters (1998) 427(2):252-254. |
| Nakamura S. et al., Journal of Agricultural and Food Chemistry (1998) 46(7):2882-2887. |
| Nakamura S. et al., Journal of Biological Chemistry (1993) 268(17):12706-12712. |
| Ni J. et al., Journal of Biological Chemistry (1997) 272(16):10853-10858. |
| Tsai YJ. et al. Comparative Biochemistry and Physiology. Part B, Biochemistry and Molecular Biology (1996) 113(3):573-580. |
| Turk V. et al. (1991) FEBS 285(2): 213-219. |
| Urwin PE. et al. Plant Journal (1995) 8(1):121-131. |
| Yamashita M. et al. Journal of Biochemistry (1996) 120(3):483-487. |
Cited By (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20050267021A1 (en) * | 2003-10-15 | 2005-12-01 | Schiemann William P | Cystatin C as an antagonist of TGF-beta and methods related thereto |
| WO2005037221A3 (en) * | 2003-10-15 | 2007-05-10 | Nat Jewish Med & Res Center | Cystatin c as an antagonist of tgf-b and methods related thereto |
| US7282477B2 (en) * | 2003-10-15 | 2007-10-16 | National Jewish Medical And Research Center | Cystatin C as an antagonist of TGF-β and methods related thereto |
| US20090093401A1 (en) * | 2003-10-15 | 2009-04-09 | National Jewish Medical And Research Center | Cystatin c as an antagonist of tgf-beta and methods related thereto |
| US7749958B2 (en) | 2003-10-15 | 2010-07-06 | National Jewish Health | Cystatin C as an antagonist of TGF-β and methods related thereto |
| US20100267644A1 (en) * | 2003-10-15 | 2010-10-21 | National Jewish Health | Cystatin C as an Antagonist of TGF-BETA and Methods Related Thereto |
| US8058396B2 (en) | 2003-10-15 | 2011-11-15 | National Jewish Health | Cystatin C as an antagonist of TGF-β and methods related thereto |
| AU2004281152B2 (en) * | 2003-10-15 | 2012-01-19 | National Jewish Health | Cystatin C as an antagonist of tgf-b and methods related thereto |
Also Published As
| Publication number | Publication date |
|---|---|
| AU5144299A (en) | 2000-02-28 |
| WO2000008159A3 (en) | 2000-05-11 |
| US20020137671A1 (en) | 2002-09-26 |
| WO2000008159A2 (en) | 2000-02-17 |
| CA2335344A1 (en) | 2000-02-17 |
| EP1123399A2 (en) | 2001-08-16 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| Rosenthal et al. | Isolation and characterization of a cysteine proteinase gene of Plasmodium falciparum | |
| JP3126975B2 (en) | Proteinase inhibitors, methods for their preparation and drugs containing them | |
| CA2481489A1 (en) | Cysteine protease inhibitor | |
| EP2875826A1 (en) | Composition for preventing or treating sepsis | |
| JPH02500084A (en) | Bactericidal and/or bacteriostatic peptides, their isolation methods, their production and their applications | |
| Murray et al. | Purification of a trypsin inhibitor (PFTI) from pumpkin fruit phloem exudate and isolation of putative trypsin and chymotrypsin inhibitor cDNA clones | |
| Ahn et al. | Olive flounder (Paralichthys olivaceus) cystatin B: cloning, tissue distribution, expression and inhibitory profile of piscine cystatin B | |
| US6534477B2 (en) | Production and use of modified cystatins | |
| US20200353058A1 (en) | Mitrecin A Polypeptide with Antimicrobial Activity | |
| Suthianthong et al. | A double WAP domain-containing protein PmDWD from the black tiger shrimp Penaeus monodon is involved in the controlling of proteinase activities in lymphoid organ | |
| Zhang et al. | Purification, characterization, and cDNA cloning of a Bowman-Birk type trypsin inhibitor from Apios americana Medikus tubers | |
| Saito et al. | Molecular cloning of cDNA for sarcocystatin A and analysis of the expression of the sarcocystatin A gene during development of Sarcophaga peregrina | |
| Kawasaki et al. | Presence of the Periplaneta lectin-related protein family in the American cockroach Periplaneta americana | |
| NZ536338A (en) | Lactoferrin as a cysteine protease inhibitor | |
| JPH04505162A (en) | Peptides that inhibit leukocyte elastase and cathepsin G, DNA, vectors, host organisms and methods of obtaining them, and pharmaceutical preparations containing the peptides | |
| Abe et al. | Isolation, characterization and cDNA cloning of a one-lobed transferrin from the ascidian Halocynthia roretzi | |
| JPH05247093A (en) | Novel cystatin polypeptide, method for producing the same, and enzyme inhibitor containing the polypeptide as an active ingredient | |
| AU773969B2 (en) | Novel tachykinin peptides, precursor peptides thereof and genes encoding the same | |
| US20110200541A1 (en) | Recombinant preparation of bromelain inhibitors and bromelain inhibitor precursor | |
| KR0137519B1 (en) | Elastase inhibitory protein isolated from Korean blood-sucking leech (Hirudo nipponia) and its preparation method | |
| Sukenaga et al. | Purification and molecular cloning of chymase from human tonsils | |
| JPH1080281A (en) | New protein and its production | |
| JPH05308988A (en) | Novel polypeptide, novel DNA, novel vector, novel transformant, novel pharmaceutical composition, and method for producing novel polypeptide | |
| CA2455918A1 (en) | Hagfish cathelin-associated antimicrobial peptides and genes | |
| Tzeng et al. | Expression of soluble thioredoxin fused‐carp (Cyprinus carpio) ovarian cystatin in Escherichia coli |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | AS | Assignment | Owner name: UNIVERSITY OF BRITISH COLUMBIA, THE, CANADA; Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:NAKAMURA, SOICHIRO;OGAWA, MASAHIRO;NAKAI, SHURYO;REEL/FRAME:011824/0467;SIGNING DATES FROM 20010208 TO 20010214 |
| | CC | Certificate of correction | |
| | REMI | Maintenance fee reminder mailed | |
| | LAPS | Lapse for failure to pay maintenance fees | |
| | STCH | Information on status: patent discontinuation | Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362 |
| | FP | Lapsed due to failure to pay maintenance fee | Effective date: 20070318 |