CN102051370A - 文蛤组织蛋白酶b基因及编码蛋白和应用 - Google Patents
文蛤组织蛋白酶b基因及编码蛋白和应用 Download PDFInfo
- Publication number
- CN102051370A CN102051370A CN2009102298052A CN200910229805A CN102051370A CN 102051370 A CN102051370 A CN 102051370A CN 2009102298052 A CN2009102298052 A CN 2009102298052A CN 200910229805 A CN200910229805 A CN 200910229805A CN 102051370 A CN102051370 A CN 102051370A
- Authority
- CN
- China
- Prior art keywords
- clam
- cathepsin
- gly
- gene
- asp
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108090000712 Cathepsin B Proteins 0.000 title claims abstract description 45
- 108090000623 proteins and genes Proteins 0.000 title claims description 14
- 102000004169 proteins and genes Human genes 0.000 title claims description 9
- 230000012010 growth Effects 0.000 claims abstract description 11
- 230000014509 gene expression Effects 0.000 claims abstract description 10
- 230000006798 recombination Effects 0.000 claims abstract description 5
- 238000005215 recombination Methods 0.000 claims abstract description 5
- 235000015170 shellfish Nutrition 0.000 claims description 9
- 235000018102 proteins Nutrition 0.000 claims description 7
- 230000009571 larval growth Effects 0.000 claims description 2
- 239000000654 additive Substances 0.000 claims 1
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- 102000004225 Cathepsin B Human genes 0.000 abstract description 28
- 239000002299 complementary DNA Substances 0.000 abstract description 18
- 108091060211 Expressed sequence tag Proteins 0.000 abstract description 14
- 238000011161 development Methods 0.000 abstract description 12
- 238000005516 engineering process Methods 0.000 abstract description 12
- 230000003321 amplification Effects 0.000 abstract description 8
- 238000000338 in vitro Methods 0.000 abstract description 8
- 238000003199 nucleic acid amplification method Methods 0.000 abstract description 8
- 239000000758 substrate Substances 0.000 abstract description 8
- 238000009395 breeding Methods 0.000 abstract description 7
- 230000001488 breeding effect Effects 0.000 abstract description 7
- 102000005600 Cathepsins Human genes 0.000 abstract description 6
- 108010084457 Cathepsins Proteins 0.000 abstract description 6
- 239000003674 animal food additive Substances 0.000 abstract description 6
- 238000003259 recombinant expression Methods 0.000 abstract description 5
- 238000011160 research Methods 0.000 abstract description 5
- 230000007246 mechanism Effects 0.000 abstract description 4
- 108091036066 Three prime untranslated region Proteins 0.000 abstract description 2
- 238000012215 gene cloning Methods 0.000 abstract description 2
- 235000020639 clam Nutrition 0.000 description 46
- 230000018109 developmental process Effects 0.000 description 11
- 238000006243 chemical reaction Methods 0.000 description 9
- 239000000047 product Substances 0.000 description 9
- 238000000034 method Methods 0.000 description 8
- 238000004925 denaturation Methods 0.000 description 7
- 230000036425 denaturation Effects 0.000 description 7
- 230000000694 effects Effects 0.000 description 7
- 241000282326 Felis catus Species 0.000 description 6
- 238000004519 manufacturing process Methods 0.000 description 6
- 239000013598 vector Substances 0.000 description 6
- 241001123258 Meretrix meretrix Species 0.000 description 5
- 239000012634 fragment Substances 0.000 description 5
- 230000008569 process Effects 0.000 description 5
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 5
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 4
- 102000035195 Peptidases Human genes 0.000 description 4
- 108091005804 Peptidases Proteins 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 238000000137 annealing Methods 0.000 description 4
- 108020004999 messenger RNA Proteins 0.000 description 4
- 235000016709 nutrition Nutrition 0.000 description 4
- 108090000765 processed proteins & peptides Proteins 0.000 description 4
- 238000000746 purification Methods 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- 241000237519 Bivalvia Species 0.000 description 3
- 102000005927 Cysteine Proteases Human genes 0.000 description 3
- 108010005843 Cysteine Proteases Proteins 0.000 description 3
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 3
- 239000004365 Protease Substances 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 239000013604 expression vector Substances 0.000 description 3
- 108010078144 glutaminyl-glycine Proteins 0.000 description 3
- 108010049041 glutamylalanine Proteins 0.000 description 3
- 238000012216 screening Methods 0.000 description 3
- DXQIQUIQYAGRCC-CIUDSAMLSA-N Arg-Asp-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)CN=C(N)N DXQIQUIQYAGRCC-CIUDSAMLSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 108010042407 Endonucleases Proteins 0.000 description 2
- 102000004533 Endonucleases Human genes 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- 108060002716 Exonuclease Proteins 0.000 description 2
- GQGAFTPXAPKSCF-WHFBIAKZSA-N Gly-Ala-Cys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O GQGAFTPXAPKSCF-WHFBIAKZSA-N 0.000 description 2
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 2
- BKPPWVSPSIUXHZ-OSUNSFLBSA-N Ile-Met-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N BKPPWVSPSIUXHZ-OSUNSFLBSA-N 0.000 description 2
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 2
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 2
- 241000237852 Mollusca Species 0.000 description 2
- 101000702488 Rattus norvegicus High affinity cationic amino acid transporter 1 Proteins 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- 108090000190 Thrombin Proteins 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- 108010064997 VPY tripeptide Proteins 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 150000001413 amino acids Chemical group 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 239000004202 carbamide Substances 0.000 description 2
- 210000004027 cell Anatomy 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 108010016616 cysteinylglycine Proteins 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 229940088598 enzyme Drugs 0.000 description 2
- 102000013165 exonuclease Human genes 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 2
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 230000004060 metabolic process Effects 0.000 description 2
- 230000029052 metamorphosis Effects 0.000 description 2
- 235000003715 nutritional status Nutrition 0.000 description 2
- 239000002244 precipitate Substances 0.000 description 2
- 239000013535 sea water Substances 0.000 description 2
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 2
- 230000004083 survival effect Effects 0.000 description 2
- 229960004072 thrombin Drugs 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 1
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 1
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 1
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 1
- XMIAMUXIMWREBJ-HERUPUMHSA-N Ala-Trp-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XMIAMUXIMWREBJ-HERUPUMHSA-N 0.000 description 1
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- SVHRPCMZTWZROG-DCAQKATOSA-N Arg-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N SVHRPCMZTWZROG-DCAQKATOSA-N 0.000 description 1
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 1
- LYJXHXGPWDTLKW-HJGDQZAQSA-N Arg-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O LYJXHXGPWDTLKW-HJGDQZAQSA-N 0.000 description 1
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 1
- HCAUEJAQCXVQQM-ACZMJKKPSA-N Asn-Glu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HCAUEJAQCXVQQM-ACZMJKKPSA-N 0.000 description 1
- PLVAAIPKSGUXDV-WHFBIAKZSA-N Asn-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)N PLVAAIPKSGUXDV-WHFBIAKZSA-N 0.000 description 1
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 1
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 1
- DOURAOODTFJRIC-CIUDSAMLSA-N Asn-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N DOURAOODTFJRIC-CIUDSAMLSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 1
- VZNOVQKGJQJOCS-SRVKXCTJSA-N Asp-Asp-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VZNOVQKGJQJOCS-SRVKXCTJSA-N 0.000 description 1
- RATOMFTUDRYMKX-ACZMJKKPSA-N Asp-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N RATOMFTUDRYMKX-ACZMJKKPSA-N 0.000 description 1
- QCLHLXDWRKOHRR-GUBZILKMSA-N Asp-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N QCLHLXDWRKOHRR-GUBZILKMSA-N 0.000 description 1
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 1
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 1
- WYOSXGYAKZQPGF-SRVKXCTJSA-N Asp-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N WYOSXGYAKZQPGF-SRVKXCTJSA-N 0.000 description 1
- KRQFMDNIUOVRIF-KKUMJFAQSA-N Asp-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N KRQFMDNIUOVRIF-KKUMJFAQSA-N 0.000 description 1
- PWAIZUBWHRHYKS-MELADBBJSA-N Asp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)O)N)C(=O)O PWAIZUBWHRHYKS-MELADBBJSA-N 0.000 description 1
- LTARLVHGOGBRHN-AAEUAGOBSA-N Asp-Trp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O LTARLVHGOGBRHN-AAEUAGOBSA-N 0.000 description 1
- SQIARYGNVQWOSB-BZSNNMDCSA-N Asp-Tyr-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQIARYGNVQWOSB-BZSNNMDCSA-N 0.000 description 1
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- 102000035101 Aspartic proteases Human genes 0.000 description 1
- 108091005502 Aspartic proteases Proteins 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000255789 Bombyx mori Species 0.000 description 1
- 241000251522 Cephalochordata Species 0.000 description 1
- 241000206751 Chrysophyceae Species 0.000 description 1
- BYALSSDCQYHKMY-XGEHTFHBSA-N Cys-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N)O BYALSSDCQYHKMY-XGEHTFHBSA-N 0.000 description 1
- NIPJKKSXHSBEMX-CIUDSAMLSA-N Cys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N NIPJKKSXHSBEMX-CIUDSAMLSA-N 0.000 description 1
- LDIKUWLAMDFHPU-FXQIFTODSA-N Cys-Cys-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LDIKUWLAMDFHPU-FXQIFTODSA-N 0.000 description 1
- CVLIHKBUPSFRQP-WHFBIAKZSA-N Cys-Gly-Ala Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C)C(O)=O CVLIHKBUPSFRQP-WHFBIAKZSA-N 0.000 description 1
- BSFFNUBDVYTDMV-WHFBIAKZSA-N Cys-Gly-Asn Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BSFFNUBDVYTDMV-WHFBIAKZSA-N 0.000 description 1
- UPURLDIGQGTUPJ-ZKWXMUAHSA-N Cys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N UPURLDIGQGTUPJ-ZKWXMUAHSA-N 0.000 description 1
- ZLHPWFSAUJEEAN-KBIXCLLPSA-N Cys-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N ZLHPWFSAUJEEAN-KBIXCLLPSA-N 0.000 description 1
- KGIHMGPYGXBYJJ-SRVKXCTJSA-N Cys-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CS KGIHMGPYGXBYJJ-SRVKXCTJSA-N 0.000 description 1
- SWJYSDXMTPMBHO-FXQIFTODSA-N Cys-Pro-Ser Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SWJYSDXMTPMBHO-FXQIFTODSA-N 0.000 description 1
- VIOQRFNAZDMVLO-NRPADANISA-N Cys-Val-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIOQRFNAZDMVLO-NRPADANISA-N 0.000 description 1
- 108020004414 DNA Proteins 0.000 description 1
- 241000238557 Decapoda Species 0.000 description 1
- HVQCEQTUSWWFOS-WDSKDSINSA-N Gln-Gly-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N HVQCEQTUSWWFOS-WDSKDSINSA-N 0.000 description 1
- WBYHRQBKJGEBQJ-CIUDSAMLSA-N Gln-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CS)C(=O)O WBYHRQBKJGEBQJ-CIUDSAMLSA-N 0.000 description 1
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- PKYAVRMYTBBRLS-FXQIFTODSA-N Glu-Cys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O PKYAVRMYTBBRLS-FXQIFTODSA-N 0.000 description 1
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 1
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 1
- XOFYVODYSNKPDK-AVGNSLFASA-N Glu-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XOFYVODYSNKPDK-AVGNSLFASA-N 0.000 description 1
- QLPYYTDOUQNJGQ-AVGNSLFASA-N Glu-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N QLPYYTDOUQNJGQ-AVGNSLFASA-N 0.000 description 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 1
- GRHXUHCFENOCOS-ZPFDUUQYSA-N Glu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N GRHXUHCFENOCOS-ZPFDUUQYSA-N 0.000 description 1
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- MXXXVOYFNVJHMA-IUCAKERBSA-N Gly-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN MXXXVOYFNVJHMA-IUCAKERBSA-N 0.000 description 1
- YDWZGVCXMVLDQH-WHFBIAKZSA-N Gly-Cys-Asn Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(N)=O YDWZGVCXMVLDQH-WHFBIAKZSA-N 0.000 description 1
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 1
- PEZZSFLFXXFUQD-XPUUQOCRSA-N Gly-Cys-Val Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O PEZZSFLFXXFUQD-XPUUQOCRSA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 1
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 1
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 1
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 1
- RUDRIZRGOLQSMX-IUCAKERBSA-N Gly-Met-Met Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O RUDRIZRGOLQSMX-IUCAKERBSA-N 0.000 description 1
- JNGHLWWFPGIJER-STQMWFEESA-N Gly-Pro-Tyr Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JNGHLWWFPGIJER-STQMWFEESA-N 0.000 description 1
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 1
- PYFIQROSWQERAS-LBPRGKRZSA-N Gly-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(=O)NCC(O)=O)=CNC2=C1 PYFIQROSWQERAS-LBPRGKRZSA-N 0.000 description 1
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- XINDHUAGVGCNSF-QSFUFRPTSA-N His-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XINDHUAGVGCNSF-QSFUFRPTSA-N 0.000 description 1
- HXKZJLWGSWQKEA-LSJOCFKGSA-N His-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CN=CN1 HXKZJLWGSWQKEA-LSJOCFKGSA-N 0.000 description 1
- HRGGKHFHRSFSDE-CIUDSAMLSA-N His-Asn-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N HRGGKHFHRSFSDE-CIUDSAMLSA-N 0.000 description 1
- KWBISLAEQZUYIC-UWJYBYFXSA-N His-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N KWBISLAEQZUYIC-UWJYBYFXSA-N 0.000 description 1
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 1
- CKRJBQJIGOEKMC-SRVKXCTJSA-N His-Lys-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CKRJBQJIGOEKMC-SRVKXCTJSA-N 0.000 description 1
- DRKZDEFADVYTLU-AVGNSLFASA-N His-Val-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DRKZDEFADVYTLU-AVGNSLFASA-N 0.000 description 1
- YBJWJQQBWRARLT-KBIXCLLPSA-N Ile-Gln-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O YBJWJQQBWRARLT-KBIXCLLPSA-N 0.000 description 1
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 1
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 1
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 1
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 1
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 1
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 1
- VHJLVAABSRFDPM-IMJSIDKUSA-N L-1,4-dithiothreitol Chemical compound SC[C@H](O)[C@@H](O)CS VHJLVAABSRFDPM-IMJSIDKUSA-N 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- NFNVDJGXRFEYTK-YUMQZZPRSA-N Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O NFNVDJGXRFEYTK-YUMQZZPRSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- UCXQIIIFOOGYEM-ULQDDVLXSA-N Leu-Pro-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCXQIIIFOOGYEM-ULQDDVLXSA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 1
- CRNNMTHBMRFQNG-GUBZILKMSA-N Lys-Glu-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N CRNNMTHBMRFQNG-GUBZILKMSA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 1
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 1
- AEIIJFBQVGYVEV-YESZJQIVSA-N Lys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCCN)N)C(=O)O AEIIJFBQVGYVEV-YESZJQIVSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- PNDCUTDWYVKBHX-IHRRRGAJSA-N Met-Asp-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PNDCUTDWYVKBHX-IHRRRGAJSA-N 0.000 description 1
- ZRACLHJYVRBJFC-ULQDDVLXSA-N Met-Lys-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZRACLHJYVRBJFC-ULQDDVLXSA-N 0.000 description 1
- XGIQKEAKUSPCBU-SRVKXCTJSA-N Met-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCSC)N XGIQKEAKUSPCBU-SRVKXCTJSA-N 0.000 description 1
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 1
- 102000005741 Metalloproteases Human genes 0.000 description 1
- 108010006035 Metalloproteases Proteins 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 1
- FXYXBEZMRACDDR-KKUMJFAQSA-N Phe-His-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FXYXBEZMRACDDR-KKUMJFAQSA-N 0.000 description 1
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 1
- ZUQACJLOHYRVPJ-DKIMLUQUSA-N Phe-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZUQACJLOHYRVPJ-DKIMLUQUSA-N 0.000 description 1
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 1
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 1
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 1
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- NGNNPLJHUFCOMZ-FXQIFTODSA-N Pro-Asp-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 NGNNPLJHUFCOMZ-FXQIFTODSA-N 0.000 description 1
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 1
- OLTFZQIYCNOBLI-DCAQKATOSA-N Pro-Cys-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O OLTFZQIYCNOBLI-DCAQKATOSA-N 0.000 description 1
- PTLOFJZJADCNCD-DCAQKATOSA-N Pro-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 PTLOFJZJADCNCD-DCAQKATOSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 1
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 1
- QHSSUIHLAIWXEE-IHRRRGAJSA-N Pro-Tyr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O QHSSUIHLAIWXEE-IHRRRGAJSA-N 0.000 description 1
- FZXSYIPVAFVYBH-KKUMJFAQSA-N Pro-Tyr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O FZXSYIPVAFVYBH-KKUMJFAQSA-N 0.000 description 1
- UIUWGMRJTWHIJZ-ULQDDVLXSA-N Pro-Tyr-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O UIUWGMRJTWHIJZ-ULQDDVLXSA-N 0.000 description 1
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 1
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 1
- GRRAECZXRONTEE-UBHSHLNASA-N Ser-Cys-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GRRAECZXRONTEE-UBHSHLNASA-N 0.000 description 1
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- ZFVFHHZBCVNLGD-GUBZILKMSA-N Ser-His-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFVFHHZBCVNLGD-GUBZILKMSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 1
- 108010022999 Serine Proteases Proteins 0.000 description 1
- 102000012479 Serine Proteases Human genes 0.000 description 1
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 1
- VXMHQKHDKCATDV-VEVYYDQMSA-N Thr-Asp-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VXMHQKHDKCATDV-VEVYYDQMSA-N 0.000 description 1
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 description 1
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 1
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- NBIIPOKZPUGATB-BWBBJGPYSA-N Thr-Ser-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O NBIIPOKZPUGATB-BWBBJGPYSA-N 0.000 description 1
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 1
- CJEHCEOXPLASCK-MEYUZBJRSA-N Thr-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=C(O)C=C1 CJEHCEOXPLASCK-MEYUZBJRSA-N 0.000 description 1
- 229920004890 Triton X-100 Polymers 0.000 description 1
- 239000013504 Triton X-100 Substances 0.000 description 1
- XNRJFXBORWMIPY-DCPHZVHLSA-N Trp-Ala-Phe Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XNRJFXBORWMIPY-DCPHZVHLSA-N 0.000 description 1
- QNTBGBCOEYNAPV-CWRNSKLLSA-N Trp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O QNTBGBCOEYNAPV-CWRNSKLLSA-N 0.000 description 1
- KZTLJLFVOIMRAQ-IHPCNDPISA-N Trp-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZTLJLFVOIMRAQ-IHPCNDPISA-N 0.000 description 1
- FNOQJVHFVLVMOS-AAEUAGOBSA-N Trp-Gly-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N FNOQJVHFVLVMOS-AAEUAGOBSA-N 0.000 description 1
- BEWOXKJJMBKRQL-AAEUAGOBSA-N Trp-Gly-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N BEWOXKJJMBKRQL-AAEUAGOBSA-N 0.000 description 1
- WMBFONUKQXGLMU-WDSOQIARSA-N Trp-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WMBFONUKQXGLMU-WDSOQIARSA-N 0.000 description 1
- NLLARHRWSFNEMH-NUTKFTJISA-N Trp-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NLLARHRWSFNEMH-NUTKFTJISA-N 0.000 description 1
- GQNCRIFNDVFRNF-BPUTZDHNSA-N Trp-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O GQNCRIFNDVFRNF-BPUTZDHNSA-N 0.000 description 1
- XHALUUQSNXSPLP-UFYCRDLUSA-N Tyr-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XHALUUQSNXSPLP-UFYCRDLUSA-N 0.000 description 1
- PZXUIGWOEWWFQM-SRVKXCTJSA-N Tyr-Asn-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O PZXUIGWOEWWFQM-SRVKXCTJSA-N 0.000 description 1
- YIKDYZDNRCNFQB-KKUMJFAQSA-N Tyr-His-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O YIKDYZDNRCNFQB-KKUMJFAQSA-N 0.000 description 1
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 1
- PGEFRHBWGOJPJT-KKUMJFAQSA-N Tyr-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O PGEFRHBWGOJPJT-KKUMJFAQSA-N 0.000 description 1
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 1
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 1
- BXJQKVDPRMLGKN-PMVMPFDFSA-N Tyr-Trp-Leu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(O)=O)C1=CC=C(O)C=C1 BXJQKVDPRMLGKN-PMVMPFDFSA-N 0.000 description 1
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- FEFZWCSXEMVSPO-LSJOCFKGSA-N Val-His-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O FEFZWCSXEMVSPO-LSJOCFKGSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 1
- QWCZXKIFPWPQHR-JYJNAYRXSA-N Val-Pro-Tyr Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QWCZXKIFPWPQHR-JYJNAYRXSA-N 0.000 description 1
- SDHZOOIGIUEPDY-JYJNAYRXSA-N Val-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 SDHZOOIGIUEPDY-JYJNAYRXSA-N 0.000 description 1
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 1
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 238000009360 aquaculture Methods 0.000 description 1
- 244000144974 aquaculture Species 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 238000003149 assay kit Methods 0.000 description 1
- NTZOLQRPFQQABU-PMACEKPBSA-N benzyl N-[(2S)-1-[[(2S)-1-[7-amino-2-oxo-4-(trifluoromethyl)chromen-3-yl]-5-(diaminomethylideneamino)-1-oxopentan-2-yl]amino]-5-(diaminomethylideneamino)-1-oxopentan-2-yl]carbamate Chemical compound N([C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)C=1C(OC2=CC(N)=CC=C2C=1C(F)(F)F)=O)C(=O)OCC1=CC=CC=C1 NTZOLQRPFQQABU-PMACEKPBSA-N 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 238000010805 cDNA synthesis kit Methods 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- GLNDAGDHSLMOKX-UHFFFAOYSA-N coumarin 120 Chemical compound C1=C(N)C=CC2=C1OC(=O)C=C2C GLNDAGDHSLMOKX-UHFFFAOYSA-N 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 238000000502 dialysis Methods 0.000 description 1
- 235000001434 dietary modification Nutrition 0.000 description 1
- 239000012154 double-distilled water Substances 0.000 description 1
- 235000013601 eggs Nutrition 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 230000003203 everyday effect Effects 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010050848 glycylleucine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 239000007952 growth promoter Substances 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 210000002429 large intestine Anatomy 0.000 description 1
- 230000001418 larval effect Effects 0.000 description 1
- 108010034529 leucyl-lysine Proteins 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 229940127554 medical product Drugs 0.000 description 1
- 230000009456 molecular mechanism Effects 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 230000035764 nutrition Effects 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 239000000825 pharmaceutical preparation Substances 0.000 description 1
- 229940127557 pharmaceutical product Drugs 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 238000012257 pre-denaturation Methods 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 229940024999 proteolytic enzymes for treatment of wounds and ulcers Drugs 0.000 description 1
- 239000011535 reaction buffer Substances 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 108091006091 regulatory enzymes Proteins 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- FDRCDNZGSXJAFP-UHFFFAOYSA-M sodium chloroacetate Chemical compound [Na+].[O-]C(=O)CCl FDRCDNZGSXJAFP-UHFFFAOYSA-M 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
Images
Landscapes
- Enzymes And Modification Thereof (AREA)
Abstract
本发明涉及分子生物学中文蛤组织蛋白酶基因克隆及体外重组表达技术。本发明利用表达序列标签(EST)技术、3’和5’末端快速扩增技术(RACE)从文蛤幼虫中克隆到组织蛋白酶B基因cDNA全长1647bp,编码框长为1014bp,3’非翻译区长624bp,发现组织蛋白酶B存在封闭环序列。本发明利用体外原核重组表达技术获得了重组文蛤组织蛋白酶B,重组产物对其特异性底物有分解作用。本发明可以为进一步研究文蛤幼体生长发育机制提供基础,并为文蛤的苗种繁育和饲料添加剂奠定基础。
Description
技术领域
本发明涉及文蛤组织蛋白酶B基因克隆及体外重组表达技术,具体地说是从文蛤cDNA文库中克隆出文蛤组织蛋白酶B基因的cDNA全序列并对该序列进行了原核重组表达,还涉及到该基因及其表达产物在贝类幼虫生长发育促进剂、饲料添加剂以及贝类苗种生产中的应用。
背景技术
在无脊椎动物消化营养物质的酶类中,组织蛋白酶类是人们关注的热点(Williamson et al.,2003)。其中,组织蛋白酶是一大类裂解肽键的蛋白水解酶,因其催化中心不同而分为四种类型,分别为半胱氨酸蛋白酶、丝氨酸蛋白酶、天冬氨酸蛋白酶和金属蛋白酶。其中半胱氨酸蛋白酶是十分重要的一类,组织蛋白酶B(EC 3.4.22.1)属于半胱氨酸蛋白酶之一(McGrath,1999)。组织蛋白酶B是唯一一个同时具有外切酶活性和内切酶活性的组织蛋白酶(Omar Quraishi 1999),该特点决定了组织蛋白酶B在多肽的分解中具有重要的作用,因此以该蛋白酶主要研究对象可以成为研究在幼虫营养过程的分子机制的突破口。
文蛤是一种在东亚各国沿海和滩涂地区广泛分布的双壳贝类(Wang et al.,1993)。在我国沿海,文蛤作为一种重要的经济贝类而被广泛养殖(Liu et al.,2006),文蛤苗种的人工繁育成为这一产业发展的关键环节之一。与众多双壳贝类发育过程相似,文蛤幼虫发育也需经过一段浮游阶段然后转入底栖生活。在贝类种苗的人工繁育过程中,早期幼虫营养状况不仅直接影响幼虫的生长发育,还会影响幼虫变态时的大小和变态后幼体的生长率及成活率(Phillips,2002)。所以,养殖业苗种培育过程中,幼虫营养状况会直接影响苗种的成活率和生产效益。组织蛋白酶B的活性特点和生物学功能,决定了其在贝类幼虫生长发育中重要的营养代谢调控功能。组织蛋白酶B作为重要的贝类幼虫营养调控酶类,对促进文蛤等贝类幼虫生长和发育,提高苗种繁育的稳定性和生产效益具有重要的应用潜力。
发明内容
本发明的目的是提供一种文蛤组织蛋白酶B基因及其编码蛋白和该基因体外重组表达的应用,本发明从文蛤中克隆到组织蛋白酶B,并对其进行原核重组及重组产物的活性鉴定,为进一步研究文蛤幼体生长发育机制提供基础,并为文蛤的苗种生产和进一步开发为饲料添加剂和医药产品奠定基础。
为实现上述目的,本发明采用的技术方案为:
利用构建cDNA文库,采用RACE技术以及体外重组表达等技术,从文蛤幼虫中克隆到组织蛋白酶B基因,cDNA(MmeCB)全长1647bp,编码框长为1014bp,3’非翻译区长624bp,发现组织蛋白酶B具有特有的封闭环序列,其对于该蛋白酶结合底物发挥外切酶活性具有重要作用,该基因在文蛤营养代谢方面发挥着重要作用。它具有SEQ ID NO.1所示的序列。
其编码的蛋白成熟肽具有SEQ ID NO.2所示的氨基酸序列,利用pGEX-4T-1表达载体在大肠杆菌BL21(DE3)重组表达了该蛋白,分子量为37.3kDa,等电点为5.50。表达产物对其特异性底物carbobenzoxy-l-arginyl-l-arginyl-7 -amino-4-trifluoromethylcoumarin(Z-Arg-Arg-AFC)有分解作用,其动力学参数Km,Vmax and kcat分别为6.11μM,0.0174μM·min-1和277.57s-1,kcat/Km值是45.4mM-1·s-1。组织蛋白酶包括2个结构域,其中L域由3个有规则的α螺旋组成,R域由C端6个延长的核苷酸链以反向平行形成β折叠,且不闭合。
本发明利用表达序列标签(EST)技术、cDNA末端快速扩增(RACE)技术从文蛤中克隆到组织蛋白酶B基因cDNA全长序列,通过PCR技术,扩增编码组织蛋白酶B成熟肽的基因片段并将其克隆到pGEX-4T-1表达载体中,在大肠杆菌BL21(DE3)实现体外重组表达。重组产物经GSTrap affinity columns纯化,并用凝血酶直接在柱子上切掉GST,获得MmeCB,对其特异性底物Z-Arg-Arg-AFC有分解作用,本发明可以为进一步研究文蛤幼体生长发育机制提供基础,并为文蛤的苗种繁育和饲料添加剂奠定基础,同时为进一步开发医药产品提供可能。
附图说明
图1:GSTrap FF柱纯化重组文蛤组织蛋白酶B,图中1:标准蛋白分子量,2:BSA(1mg/ml),3:酶切后目的蛋白MmeCB;
图2:纯化的GST重组文蛤组织蛋白酶B酶切结果,图中1.Marker;2.酶切后的GST;3.酶切的重组文蛤组织蛋白B;
图3:双倒数作图法研究文蛤组织蛋白酶B的酶学活性,双倒数作图法求其动力学参数Km,Vmax and kcat分别为6.11μM,0.0174μM·min-1和277.57s-1.,kcat/Km值是45.4mM-1s-1。
具体实施方式
下面的实施例中将本发明作进一步阐述,但本发明不限于此。
实施例1.
一种克隆得到的文蛤组织蛋白酶B,具有SEQ ID NO.1所示的序列。本发明中的文蛤组织蛋白酶B的cDNA序列克隆包括下列步骤:
a)文蛤总RNA的提取及mRNA的纯化;
b)文蛤cDNA文库构建;
c)文蛤cDNA文库EST序列的规模测定;
d)文蛤EST序列的同源性分析及组织蛋白酶B基因片段的筛选;
e)RACE扩增获得文蛤组织蛋白酶B的全序列。
具体操作如下:
1.文蛤总RNA的提取及mRNA的纯化:利用Invitrogen公司的Trizol试剂从文蛤幼虫中中提取总RNA,利用QIAGENE公司的Oligotex mRNA纯化试剂盒纯化mRNA。
2.文蛤cDNA文库构建:利用Stratagene公司cDNA Synthesis Kit和Synthesis Kit(Stratagene)进行cDNA的合成,双链cDNA经末端补平、EcoR I接头连接、EcoR I末端磷酸化、Xho I内切酶酶切后利用QIAGEN公司QIAEX II Agarose Gel Extraction Kit对大于100bp的酶切片段进行回收,与Invitrogen公司Uni-ZAP XR vector载体连接,利用Stratagene公司的III Gold Cloning Kit试剂盒进行文库包装,利用Exassist Helper Phage和SOLR菌株从XR Vector上体外切割pBluescript成为质粒文库。
3.文蛤cDNA文库EST序列的规模测定:文库中筛选阳性克隆,使用载体通用引物T3在MegaBACE1000测序仪上进行序列测定,将得到的原始峰图文件(*.abi,*.abd文件)数据经Phred程序处理转化为序列文件(*.seq)和质量文件(*.seq.qual),依据质量文件提供的数值确定获得序列的误差概率,去除低质量的碱基,用cross-mach程序屏蔽数据中的载体序列,从得到的数据中选取连续碱基质量大于Q13(准确率大于95%)且长度大于100bp的序列作为EST数据,具体见《基因表达序列标签(EST)数据分析手册》(胡松年著,浙江大学出版社,2005年)。
4.文蛤EST序列的同源性分析及组织蛋白酶B基因片段的筛选:将获得的全部有效的EST数据进行聚类拼接,生成Contigs和Singletons,分别将所获的Contigs与Singletons在数据库中进行BLASTn和BLASTx分析,结果显示在EST序列中发现了与文昌鱼、北极虾和家蚕组织蛋白酶B相似性较高的序列,根据相似性分析结果确定了文蛤组织蛋白酶B基因的EST序列。
5.文蛤组织蛋白酶B基因cDNA全长序列的克隆:根据与组织蛋白酶B基因同源的EST序列设计特异性引物F1(5′GAGGACCCTACAACAGCC3′)和R1(5′ATCCCTGATGGCTGTTGTAG3′),分别利用载体通用引物T3(5`ATTAACCCTCACTAAAGGGA 3`)和T7(5`TAATACGACTCACTATAGGG)进行3’和5’末端的扩增。PCR产物用1.0%琼脂糖凝胶电泳进行检测,用胶回收试剂盒(Promega,USA)进行PCR产物的回收和纯化,再与pMD-18T载体(大连宝生物工程有限公司)连接,然后转化大肠杆菌感受态细胞Top10,挑选阳性克隆用载体引物测序,所得结果经CLUSTER分析拼接,得到文蛤组织蛋白酶B基因cDNA全长序列见SEQ ID NO.1。
3’RACE扩增所用反应体系及反应条件:
25μl反应体系,包含:
扩增所用PCR反应程序:94℃变性4min,1个循环;94℃变性50s,54℃退火50s,72℃延伸1min,35个循环;72℃延伸10min,1个循环。
5’RACE扩增所用反应体系及反应条件:
25μl反应体系,包含:
扩增所用PCR反应程序:94℃变性4min,1个循环;94℃变性50s,54℃退火50s,72℃延伸1min,35个循环;72℃延伸10min,1个循环;4℃保温。
阳性克隆的PCR筛选条件为:
25μl反应体系,包含:
扩增所用PCR反应程序:94℃变性4min,1个循环;94℃变性50s,54℃退火50s,72℃延伸1min,35个循环;72℃延伸10min,1个循环。
实施例2.
根据SEQ ID NO.1所示cDNA序列,设计含有限制性内切酶BamH I和Sal I酶切位点的特异性引物F2:(5′GCCGGATCCTACAGGTTCGACTTCCATG 3′)和R2(5′CGCGTCGAC TTATTCAAGAACCATCATTCC 3′),通过PCR技术扩增编码组织蛋白酶B成熟肽的基因片段,反应在PTC-100中进行,反应条件为:94℃预变性5min;然后进行35循环包含94℃变性50s,54℃退火50s,72℃延伸50s;最后72℃延伸10min。然后将其克隆到pGEX-4T-1表达载体中,转化大肠杆菌BL21(DE3),测序确认表达框与SEQ ID NO.1的序列一致,确认表达阅读框正确。接种阳性克隆到LB培养基中,37℃振荡培养至O.D.600=0.6-0.8,加入IPTG至终浓度为1mM诱导3h后离心收集菌体。菌体在冰浴条件下用超声波500W处理30min,每次1s,间隔2s。离心收集沉淀,用BuferA(50mmol/L Tris-HCl,5mmol/L乙二胺四乙酸,0.1%Triton X-100,pH 8.0)洗涤沉淀2次,再用Bufer B(50mmol/L Tris-HCI,5mmol/LEDTA,2mol/L urea,pH 8.0)洗涤2次,将洗涤后的沉淀溶于5mL BuferC(0.1mol/L Tris-HCl,10mmol/L二硫苏糖醇,8mol/L urea,pH 8.0)中,37℃快摇40min,用Bufer C将样品稀释到500μg/mL,在10倍体积透析液(0.1mol/L Tris-HCI,5mmol/L EDTA,5mmol/L Cysteine,pH8.0)中透析使蛋白质复性,中间换2次透析液。利用Amersham公司的GSTrap affinity columns纯化重组产物,获得纯化的GST-MmeCB融合蛋白如图1,经质谱鉴定,两段序列(SGGPLGGHAIK,DLPDTFDAR)与文蛤组织蛋白酶B的相应氨基酸序列一致,因此该重组蛋白为重组文蛤组织蛋白酶B(GST-MmeCB),用凝血酶直接在柱子上酶切掉GST(4℃裂解过夜),获得文蛤重组组织蛋白酶B(图2),用于活性验证。
重组文蛤组织蛋白酶B的酶学活性验证:重组文蛤组织蛋白酶B,加入底物7-氨基-4-甲基香豆素(Z-Arg-Arg-AFC),加入50μl反应缓冲液(Cathepsin B assay kit),然后用双蒸水调至终体积为100μl,底物浓度在0.001--0.05mM之间。37℃孵育10min,加入1ml氯代乙酸钠(100mM,pH4.3)。F-4500荧光分光光度计在400nm激发波长下,505nm检测底物的荧光值。双倒数作图法求其动力学参数Km,Vmax and kcat分别为6.11μM,0.0174μM·min-1和277.57s-1.,kcat/Km值是45.4mM-1s-1(图3)。
本发明利用体外原核重组表达技术获得了重组的文蛤组织蛋白酶B,重组产物对其特异性底物有分解作用。本发明可以为进一步研究文蛤幼体生长发育机制提供基础,并为文蛤的苗种繁育和饲料添加剂奠定基础。
实施例3:
重组蛋白可提高文蛤幼虫的生长速度,为苗种生产中的营养调控和作为饲料添加剂研制提供基础。育苗生产中选取成熟的文蛤亲贝,放入28℃的海水中诱导排放。精子和卵子在水体中自然授精后,约20小时发育至D型幼虫。文蛤幼虫密度为5个/ml水体,海水温度为27℃,盐度为25。培育期间每日换水100%,投喂金藻2-5万细胞/ml水体。幼虫培育过程中水体中添加实施例2的表达产物重组组织蛋白酶B,能够有效促进文蛤幼虫的生长。
组织蛋白酶
SEQUENCE LISTING
<110>中国科学院海洋研究所
<120>文蛤组织蛋白酶B基因及编码蛋白和应用
<130>
<160>2
<170>PatentIn version 3.5
<210>1
<211>1647
<212>DNA
<213>文蛤(Meretrix Meretrix)
<220>
<221>CDS
<222>(55)..(1023)
<400>1
cagttcaaca tgaaggcatt actggtgctg gtgtttgttg gtgcagcctg gagt tac 57
Tyr
1
agg ttc gac ttc cat gat gac tac ttc agt gag gct ttt gta aac tac 105
Arg Phe Asp Phe His Asp Asp Tyr Phe Ser Glu Ala Phe Val Asn Tyr
5 10 15
cac aac agt cgg gat gac gtg tcc tgg aag gct act act gag aac ttc 153
His Asn Ser Arg Asp Asp Val Ser Trp Lys Ala Thr Thr Glu Asn Phe
20 25 30
aag aat gtg cca tac aag ggt agg atg gac tat gtc aag agt cta tgt 201
Lys Asn Val Pro Tyr Lys Gly Arg Met Asp Tyr Val Lys Ser Leu Cys
35 40 45
ggt gcc aat cct gct cct cca gag atg aaa ttc cct gtc aag gag att 249
Gly Ala Asn Pro Ala Pro Pro Glu Met Lys Phe Pro Val Lys Glu Ile
50 55 60 65
gaa gta ccc aag gat cta cct gat acc ttt gat gct cgt acc cag tgg 297
Glu Val Pro Lys Asp Leu Pro Asp Thr Phe Asp Ala Arg Thr Gln Trp
70 75 80
cca gac tgc ccc tct ctg aaa gaa gtt agg gat cag gga gcc tgt gga 345
Pro Asp Cys Pro Ser Leu Lys Glu Val Arg Asp Gln Gly Ala Cys Gly
85 90 95
tca tgc tgg gca ttt ggt tgt gtt gag gct gcc act gac aga ctg tgt 393
Ser Cys Trp Ala Phe Gly Cys Val Glu Ala Ala Thr Asp Arg Leu Cys
100 105 110
ata cag agc aag gga ata gta aat gca cat ctg tcg gct gaa gat ctt 441
Ile Gln Ser Lys Gly Ile Val Asn Ala His Leu Ser Ala Glu Asp Leu
115 120 125
acc tca tgt tgt cgt acc tgt gga aat ggt tgt aat ggt ggt ttc cta 489
Thr Ser Cys Cys Arg Thr Cys Gly Asn Gly Cys Asn Gly Gly Phe Leu
130 135 140 145
gag gga gct tgg aat tac ctg aaa agg gac ggt att gtt aca gga gga 537
Glu Gly Ala Trp Asn Tyr Leu Lys Arg Asp Gly Ile Val Thr Gly Gly
150 155 160
ccc tac aac agc cat cag gga tgt ctt cca tac gaa atc aaa gcc tgt 585
Pro Tyr Asn Ser His Gln Gly Cys Leu Pro Tyr Glu Ile Lys Ala Cys
165 170 175
gat cac cat gtc gtt ggg aaa ctt cag cca tgc aaa gga gat gga cct 633
Asp His His Val Val Gly Lys Leu Gln Pro Cys Lys Gly Asp Gly Pro
180 185 190
aca cca agg tgt aag aaa gag tgt gaa tct gga tat aac aat acc tac 681
Thr Pro Arg Cys Lys Lys Glu Cys Glu Ser Gly Tyr Asn Asn Thr Tyr
195 200 205
agt aag gac gaa cat cat gca aaa aca gta cac gct gtt gaa gga gta 729
Ser Lys Asp Glu His His Ala Lys Thr Val His Ala Val Glu Gly Val
210 215 220 225
gaa cag att atg aca gaa att atg aca aat ggc cct gtg gag gca gct 777
Glu Gln Ile Met Thr Glu Ile Met Thr Asn Gly Pro Val Glu Ala Ala
230 235 240
ttt acc gtt tac tca gat ttc cca act tac aag tca ggc gtc tac gag 825
Phe Thr Val Tyr Ser Asp Phe Pro Thr Tyr Lys Ser Gly Val Tyr Glu
245 250 255
cac aaa tca ggt ggt ccc ctc gga ggc cat gcc atc aag act ctc ggc 873
His Lys Ser Gly Gly Pro Leu Gly Gly His Ala Ile Lys Thr Leu Gly
260 265 270
tgg gga aat gaa gac ggc aaa gat tat tgg ctt gtt gcc aac tcc tgg 921
Trp Gly Asn Glu Asp Gly Lys Asp Tyr Trp Leu Val Ala Asn Ser Trp
275 280 285
aac ccc gac tgg gga gat aac ggt ttc ttc aag atc ctt cgt gga cga 969
Asn Pro Asp Trp Gly Asp Asn Gly Phe Phe Lys Ile Leu Arg Gly Arg
290 295 300 305
gat gag tgt ggt att gag tcc aac att gtc gct gga atg atg gtt ctt 1017
Asp Glu Cys Gly Ile Glu Ser Asn Ile Val Ala Gly Met Met Val Leu
310 315 320
gaa taa ctttgtaaaa aaaccatgtg atcatttaaa cattccttat tgaaaaaaga 1073
Glu
gtttattttt tcaaatattt aatcaaaaag accagatgat aaaaatttat gcttttttaa 1133
acaacgaata tgtatataat gaaacgtatc tcaagttttg ctcagattgt gaccaaaaaa 1193
agattgtaaa tagactattt ttctacatca gaagaaagct ttttctttct tgctttgtgt 1253
aaatcctgcc ttaggacctg ctaacacaat ctactcaaaa tatatctcct gtagcctaac 1313
acttatagtt tctggactat aaaactgaaa agcgtatctg tgagacattt gtacatagta 1373
aactttgagt cgactttcca ctgtaatccg taatcctgag gagactttat gatttaggta 1433
attacattag agcatgattc tgcggagcca tatgtaacag tattggcttt cacccttctc 1493
tgcctattaa ataattttct gacttatatc tcgaaagcaa acattctgat atatttctcc 1553
atgaaaaaaa atcttcatca ttcgaattgt gttggttaat ctattgaaaa aaaatgagag 1613
caataaattt atttttgtac aaaaaaaaaa aaaa 1647
<210>2
<211>322
<212>PRT
<213>文蛤(Meretrix Meretrix)
<400>2
Tyr Arg Phe Asp Phe His Asp Asp Tyr Phe Ser Glu Ala Phe Val Asn
1 5 10 15
Tyr His Asn Ser Arg Asp Asp Val Ser Trp Lys Ala Thr Thr Glu Asn
20 25 30
Phe Lys Asn Val Pro Tyr Lys Gly Arg Met Asp Tyr Val Lys Ser Leu
35 40 45
Cys Gly Ala Asn Pro Ala Pro Pro Glu Met Lys Phe Pro Val Lys Glu
50 55 60
Ile Glu Val Pro Lys Asp Leu Pro Asp Thr Phe Asp Ala Arg Thr Gln
65 70 75 80
Trp Pro Asp Cys Pro Ser Leu Lys Glu Val Arg Asp Gln Gly Ala Cys
85 90 95
Gly Ser Cys Trp Ala Phe Gly Cys Val Glu Ala Ala Thr Asp Arg Leu
100 105 110
Cys Ile Gln Ser Lys Gly Ile Val Asn Ala His Leu Ser Ala Glu Asp
115 120 125
Leu Thr Ser Cys Cys Arg Thr Cys Gly Asn Gly Cys Asn Gly Gly Phe
130 135 140
Leu Glu Gly Ala Trp Asn Tyr Leu Lys Arg Asp Gly Ile Val Thr Gly
145 150 155 160
Gly Pro Tyr Asn Ser His Gln Gly Cys Leu Pro Tyr GluIle Lys Ala
165 170 175
Cys Asp His His Val Val Gly Lys Leu Gln Pro Cys Lys Gly Asp Gly
180 185 190
Pro Thr Pro Arg Cys Lys Lys Glu Cys Glu Ser Gly Tyr Asn Asn Thr
195 200 205
Tyr Ser Lys Asp Glu His His Ala Lys Thr Val His Ala Val Glu Gly
210 215 220
Val Glu Gln Ile Met Thr Glu Ile Met Thr Asn Gly Pro Val Glu Ala
225 230 235 240
Ala Phe Thr Val Tyr Ser Asp Phe Pro Thr Tyr Lys Ser Gly Val Tyr
245 250 255
Glu His Lys Ser Gly Gly Pro Leu Gly Gly His Ala Ile Lys Thr Leu
260 265 270
Gly Trp Gly Asn Glu Asp Gly Lys Asp Tyr Trp Leu Val Ala Asn Ser
275 280 285
Trp Asn Pro Asp Trp Gly Asp Asn Gly Phe Phe Lys Ile Leu Arg Gly
290 295 300
Arg Asp Glu Cys Gly Ile Glu Ser Asn Ile Val Ala Gly Met Met Val
305 310 315 320
Leu Glu
Claims (3)
1.文蛤组织蛋白酶B基因,其特征在于:具有序列表SEDIQ No.1中的碱基序列。
2.一种要求1所述的文蛤组织蛋白酶B基因编码的蛋白,其特征在于:具有序列表SEDIQ No.2中氨基酸序列。
3.一种权利要求1所述文蛤组织蛋白酶B基因的应用,其特征在于:文蛤组织蛋白酶B基因的重组表达产物在贝类幼虫生长发育促进剂、饲料添加剂以及贝类苗种生产中的应用。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2009102298052A CN102051370A (zh) | 2009-10-30 | 2009-10-30 | 文蛤组织蛋白酶b基因及编码蛋白和应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2009102298052A CN102051370A (zh) | 2009-10-30 | 2009-10-30 | 文蛤组织蛋白酶b基因及编码蛋白和应用 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN102051370A true CN102051370A (zh) | 2011-05-11 |
Family
ID=43956169
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2009102298052A Pending CN102051370A (zh) | 2009-10-30 | 2009-10-30 | 文蛤组织蛋白酶b基因及编码蛋白和应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102051370A (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103849624A (zh) * | 2012-12-05 | 2014-06-11 | 中国科学院海洋研究所 | 文蛤DNA结合抑制因子mmID2基因及其编码蛋白和应用 |
CN106811475A (zh) * | 2017-03-27 | 2017-06-09 | 中国科学院海洋研究所 | 文蛤酚氧化酶基因及其编码蛋白和应用 |
CN113249356A (zh) * | 2021-04-25 | 2021-08-13 | 天津师范大学 | 青蛤丝裂原活化蛋白激酶p38 MAPK的cDNA全长序列及其应用 |
-
2009
- 2009-10-30 CN CN2009102298052A patent/CN102051370A/zh active Pending
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103849624A (zh) * | 2012-12-05 | 2014-06-11 | 中国科学院海洋研究所 | 文蛤DNA结合抑制因子mmID2基因及其编码蛋白和应用 |
CN103849624B (zh) * | 2012-12-05 | 2016-06-29 | 中国科学院海洋研究所 | 文蛤DNA结合抑制因子mmID2基因及其编码蛋白和应用 |
CN106811475A (zh) * | 2017-03-27 | 2017-06-09 | 中国科学院海洋研究所 | 文蛤酚氧化酶基因及其编码蛋白和应用 |
CN113249356A (zh) * | 2021-04-25 | 2021-08-13 | 天津师范大学 | 青蛤丝裂原活化蛋白激酶p38 MAPK的cDNA全长序列及其应用 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
He et al. | A goose-type lysozyme gene in Japanese scallop (Mizuhopecten yessoensis): cDNA cloning, mRNA expression and promoter sequence analysis | |
CN109797155B (zh) | 三疣梭子蟹甘露糖结合凝集素PtMBL基因及其编码蛋白和应用 | |
CN101182360B (zh) | 一种具有抗菌功能的融合蛋白及其应用 | |
CN102653756A (zh) | 一种定向改造动物基因组特定基因的方法及其应用 | |
CN110343703A (zh) | 三疣梭子蟹C型凝集素PtCLec1基因及其编码蛋白和应用 | |
CN102051370A (zh) | 文蛤组织蛋白酶b基因及编码蛋白和应用 | |
Qian et al. | Identification of a small pacifastin protease inhibitor from Nasonia vitripennis venom that inhibits humoral immunity of host (Musca domestica) | |
CN107937406A (zh) | 一种三疣梭子蟹新型Crustin基因及其重组蛋白的应用 | |
CN101550183A (zh) | 一种抗菌肽及其构建和应用 | |
CN110408624A (zh) | 一种菲律宾蛤仔c型凝集素蛋白及其制备方法与应用 | |
CN102051363A (zh) | 文蛤铁蛋白基因及编码蛋白和其体外重组表达产物的应用 | |
CN101525617A (zh) | 中华绒螯蟹Crustin-1基因及体外重组表达 | |
CN102094023A (zh) | 一种小菜蛾葛佬素基因及编码的蛋白、相应的表达系统和应用 | |
CN104117059B (zh) | 三疣梭子蟹丝氨酸蛋白酶基因的应用 | |
Enault et al. | A complex set of sex pheromones identified in the cuttlefish Sepia officinalis | |
CN106479987B (zh) | 一种可溶性家蝇MdproPO1重组蛋白的制备方法及其应用 | |
CN109021088A (zh) | 一种斑节对虾抗菌肽ALFpm10及其制备方法 | |
CN102337271A (zh) | 三疣梭子蟹抗脂多糖因子PtALF-2基因及其编码蛋白和应用 | |
CN106811475B (zh) | 文蛤酚氧化酶基因及其编码蛋白和应用 | |
CN101565703A (zh) | 中华绒螯蟹Crustin-2基因及其重组蛋白的应用 | |
CN101665788A (zh) | 人工合成猪生长激素基因及其表达纯化方法 | |
CN103484487A (zh) | 一种小菜蛾溶菌酶ⅱ及其制备方法与应用 | |
CN112625118B (zh) | 一种金钱鱼促生殖细胞成熟基因igf3及其应用 | |
CN106755021A (zh) | 文蛤多巴脱羧酶基因及其编码蛋白和应用 | |
Wharam et al. | A Leucine aminopeptidase gene of the Pacific Oyster Crassostrea gigas exhibits an unusually high level of sequence variation, predicted to affect structure, and hence activity, of the enzyme |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20110511 |