CN109652397B - Recombinant acetylated lysine arginine N-terminal protease and preparation method and application thereof - Google Patents
Recombinant acetylated lysine arginine N-terminal protease and preparation method and application thereof Download PDFInfo
- Publication number
- CN109652397B CN109652397B CN201710942714.8A CN201710942714A CN109652397B CN 109652397 B CN109652397 B CN 109652397B CN 201710942714 A CN201710942714 A CN 201710942714A CN 109652397 B CN109652397 B CN 109652397B
- Authority
- CN
- China
- Prior art keywords
- recombinant
- lysine arginine
- terminal protease
- protease
- arginine
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 title claims abstract description 193
- 239000004472 Lysine Substances 0.000 title claims abstract description 191
- 239000004475 Arginine Substances 0.000 title claims abstract description 188
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 title claims abstract description 188
- 101800000021 N-terminal protease Proteins 0.000 title claims abstract description 176
- 238000002360 preparation method Methods 0.000 title abstract description 6
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 57
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 47
- 108090000631 Trypsin Proteins 0.000 claims abstract description 31
- 102000004142 Trypsin Human genes 0.000 claims abstract description 31
- 239000012588 trypsin Substances 0.000 claims abstract description 31
- 108091005804 Peptidases Proteins 0.000 claims abstract description 20
- 239000004365 Protease Substances 0.000 claims abstract description 17
- 238000011160 research Methods 0.000 claims abstract description 12
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 claims description 30
- 239000000243 solution Substances 0.000 claims description 30
- WFDIJRYMOXRFFG-UHFFFAOYSA-N Acetic anhydride Chemical compound CC(=O)OC(C)=O WFDIJRYMOXRFFG-UHFFFAOYSA-N 0.000 claims description 24
- 238000006640 acetylation reaction Methods 0.000 claims description 20
- 238000000855 fermentation Methods 0.000 claims description 19
- 230000004151 fermentation Effects 0.000 claims description 19
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 19
- 230000021736 acetylation Effects 0.000 claims description 18
- 238000001819 mass spectrum Methods 0.000 claims description 16
- 239000007983 Tris buffer Substances 0.000 claims description 15
- 239000011780 sodium chloride Substances 0.000 claims description 15
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 claims description 15
- 238000010828 elution Methods 0.000 claims description 14
- 239000013604 expression vector Substances 0.000 claims description 13
- 230000004048 modification Effects 0.000 claims description 13
- 238000012986 modification Methods 0.000 claims description 13
- 238000000034 method Methods 0.000 claims description 12
- 238000006243 chemical reaction Methods 0.000 claims description 11
- 238000003776 cleavage reaction Methods 0.000 claims description 11
- 239000000872 buffer Substances 0.000 claims description 10
- 239000007853 buffer solution Substances 0.000 claims description 10
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 claims description 10
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 10
- 230000007017 scission Effects 0.000 claims description 10
- 238000000108 ultra-filtration Methods 0.000 claims description 10
- 150000001413 amino acids Chemical group 0.000 claims description 9
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 claims description 9
- 241000588724 Escherichia coli Species 0.000 claims description 7
- 241001052560 Thallis Species 0.000 claims description 6
- 238000001042 affinity chromatography Methods 0.000 claims description 6
- 238000005571 anion exchange chromatography Methods 0.000 claims description 6
- 238000011068 loading method Methods 0.000 claims description 6
- 125000002777 acetyl group Chemical group [H]C([H])([H])C(*)=O 0.000 claims description 5
- 230000004913 activation Effects 0.000 claims description 5
- 239000001110 calcium chloride Substances 0.000 claims description 5
- 229910001628 calcium chloride Inorganic materials 0.000 claims description 5
- 239000012149 elution buffer Substances 0.000 claims description 5
- 229920001184 polypeptide Polymers 0.000 claims description 5
- 238000003259 recombinant expression Methods 0.000 claims description 5
- 102100029727 Enteropeptidase Human genes 0.000 claims description 4
- 108010013369 Enteropeptidase Proteins 0.000 claims description 4
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 claims description 4
- 230000003213 activating effect Effects 0.000 claims description 4
- 238000004587 chromatography analysis Methods 0.000 claims description 4
- 238000012258 culturing Methods 0.000 claims description 4
- 239000006228 supernatant Substances 0.000 claims description 4
- RYHBNJHYFVUHQT-UHFFFAOYSA-N 1,4-Dioxane Chemical compound C1COCCO1 RYHBNJHYFVUHQT-UHFFFAOYSA-N 0.000 claims description 3
- 238000011033 desalting Methods 0.000 claims description 3
- 239000006167 equilibration buffer Substances 0.000 claims description 3
- 229910052751 metal Inorganic materials 0.000 claims description 3
- 239000002184 metal Substances 0.000 claims description 3
- 239000013598 vector Substances 0.000 claims description 3
- 239000013522 chelant Substances 0.000 claims description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 claims description 2
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 claims 1
- 238000007865 diluting Methods 0.000 claims 1
- 239000001963 growth medium Substances 0.000 claims 1
- 238000011534 incubation Methods 0.000 claims 1
- 238000004519 manufacturing process Methods 0.000 claims 1
- 238000012807 shake-flask culturing Methods 0.000 claims 1
- 102000004190 Enzymes Human genes 0.000 abstract description 33
- 108090000790 Enzymes Proteins 0.000 abstract description 33
- 108010033276 Peptide Fragments Proteins 0.000 abstract description 29
- 102000007079 Peptide Fragments Human genes 0.000 abstract description 29
- 102000035195 Peptidases Human genes 0.000 abstract description 16
- 230000000694 effects Effects 0.000 abstract description 16
- 230000029087 digestion Effects 0.000 abstract description 15
- 125000000539 amino acid group Chemical group 0.000 abstract description 3
- 150000002500 ions Chemical class 0.000 description 22
- 238000001514 detection method Methods 0.000 description 13
- 238000004949 mass spectrometry Methods 0.000 description 12
- 238000001228 spectrum Methods 0.000 description 11
- 239000000758 substrate Substances 0.000 description 11
- 238000001962 electrophoresis Methods 0.000 description 10
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 10
- 125000003275 alpha amino acid group Chemical group 0.000 description 9
- 238000012163 sequencing technique Methods 0.000 description 8
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- 239000002609 medium Substances 0.000 description 6
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 6
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 5
- 210000004027 cell Anatomy 0.000 description 5
- 238000001976 enzyme digestion Methods 0.000 description 5
- 239000012634 fragment Substances 0.000 description 5
- 239000001301 oxygen Substances 0.000 description 5
- 229910052760 oxygen Inorganic materials 0.000 description 5
- 230000004481 post-translational protein modification Effects 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- 238000000746 purification Methods 0.000 description 5
- 108700026244 Open Reading Frames Proteins 0.000 description 4
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 239000007640 basal medium Substances 0.000 description 4
- 239000007788 liquid Substances 0.000 description 4
- 239000012266 salt solution Substances 0.000 description 4
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 3
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 3
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 3
- 102100031780 Endonuclease Human genes 0.000 description 3
- 108010042407 Endonucleases Proteins 0.000 description 3
- 239000007995 HEPES buffer Substances 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 238000002835 absorbance Methods 0.000 description 3
- 125000000637 arginyl group Chemical group N[C@@H](CCCNC(N)=N)C(=O)* 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 238000001641 gel filtration chromatography Methods 0.000 description 3
- 230000011987 methylation Effects 0.000 description 3
- 238000007069 methylation reaction Methods 0.000 description 3
- 108020004707 nucleic acids Proteins 0.000 description 3
- 102000039446 nucleic acids Human genes 0.000 description 3
- 150000007523 nucleic acids Chemical class 0.000 description 3
- 238000000926 separation method Methods 0.000 description 3
- 210000005253 yeast cell Anatomy 0.000 description 3
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 2
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 2
- DPLFNLDACGGBAK-KKUMJFAQSA-N Arg-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N DPLFNLDACGGBAK-KKUMJFAQSA-N 0.000 description 2
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 2
- DWOSGXZMLQNDBN-FXQIFTODSA-N Asp-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CS)C(=O)O DWOSGXZMLQNDBN-FXQIFTODSA-N 0.000 description 2
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- 108020004414 DNA Proteins 0.000 description 2
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- 241000205276 Methanosarcina Species 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- 108010026552 Proteome Proteins 0.000 description 2
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 2
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 2
- 238000003277 amino acid sequence analysis Methods 0.000 description 2
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 239000004202 carbamide Substances 0.000 description 2
- 239000012153 distilled water Substances 0.000 description 2
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 238000000265 homogenisation Methods 0.000 description 2
- 230000007062 hydrolysis Effects 0.000 description 2
- 238000006460 hydrolysis reaction Methods 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- PGLTVOMIXTUURA-UHFFFAOYSA-N iodoacetamide Chemical compound NC(=O)CI PGLTVOMIXTUURA-UHFFFAOYSA-N 0.000 description 2
- -1 ion ions Chemical class 0.000 description 2
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 2
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 2
- 238000001294 liquid chromatography-tandem mass spectrometry Methods 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 239000012071 phase Substances 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 2
- 239000011535 reaction buffer Substances 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 238000003756 stirring Methods 0.000 description 2
- 229910021654 trace metal Inorganic materials 0.000 description 2
- 239000003643 water by type Substances 0.000 description 2
- 229910003208 (NH4)6Mo7O24·4H2O Inorganic materials 0.000 description 1
- FZIPCQLKPTZZIM-UHFFFAOYSA-N 2-oxidanylpropane-1,2,3-tricarboxylic acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O.OC(=O)CC(O)(C(O)=O)CC(O)=O FZIPCQLKPTZZIM-UHFFFAOYSA-N 0.000 description 1
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 1
- CXZFXHGJJPVUJE-CIUDSAMLSA-N Ala-Cys-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O)N CXZFXHGJJPVUJE-CIUDSAMLSA-N 0.000 description 1
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 1
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 1
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 1
- AAWLEICNDUHIJM-MBLNEYKQSA-N Ala-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C)N)O AAWLEICNDUHIJM-MBLNEYKQSA-N 0.000 description 1
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 1
- RIPMDCIXRYWXSH-KNXALSJPSA-N Ala-Trp-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N RIPMDCIXRYWXSH-KNXALSJPSA-N 0.000 description 1
- USFZMSVCRYTOJT-UHFFFAOYSA-N Ammonium acetate Chemical compound N.CC(O)=O USFZMSVCRYTOJT-UHFFFAOYSA-N 0.000 description 1
- 239000005695 Ammonium acetate Substances 0.000 description 1
- VHUUQVKOLVNVRT-UHFFFAOYSA-N Ammonium hydroxide Chemical compound [NH4+].[OH-] VHUUQVKOLVNVRT-UHFFFAOYSA-N 0.000 description 1
- KJGNDQCYBNBXDA-GUBZILKMSA-N Arg-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N KJGNDQCYBNBXDA-GUBZILKMSA-N 0.000 description 1
- DXQIQUIQYAGRCC-CIUDSAMLSA-N Arg-Asp-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)CN=C(N)N DXQIQUIQYAGRCC-CIUDSAMLSA-N 0.000 description 1
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 1
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 1
- VIINVRPKMUZYOI-DCAQKATOSA-N Arg-Met-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIINVRPKMUZYOI-DCAQKATOSA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- OKZOABJQOMAYEC-NUMRIWBASA-N Asn-Gln-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OKZOABJQOMAYEC-NUMRIWBASA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 1
- PPCORQFLAZWUNO-QWRGUYRKSA-N Asn-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N PPCORQFLAZWUNO-QWRGUYRKSA-N 0.000 description 1
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 1
- ILJQISGMGXRZQQ-IHRRRGAJSA-N Asp-Arg-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ILJQISGMGXRZQQ-IHRRRGAJSA-N 0.000 description 1
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 1
- PMEHKVHZQKJACS-PEFMBERDSA-N Asp-Gln-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PMEHKVHZQKJACS-PEFMBERDSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- YVHGKXAOSVBGJV-CIUDSAMLSA-N Asp-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N YVHGKXAOSVBGJV-CIUDSAMLSA-N 0.000 description 1
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- 101100354809 Caenorhabditis elegans pxl-1 gene Proteins 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 102100023336 Chymotrypsin-like elastase family member 3B Human genes 0.000 description 1
- 229910021580 Cobalt(II) chloride Inorganic materials 0.000 description 1
- CVLIHKBUPSFRQP-WHFBIAKZSA-N Cys-Gly-Ala Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C)C(O)=O CVLIHKBUPSFRQP-WHFBIAKZSA-N 0.000 description 1
- BLGNLNRBABWDST-CIUDSAMLSA-N Cys-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BLGNLNRBABWDST-CIUDSAMLSA-N 0.000 description 1
- SWJYSDXMTPMBHO-FXQIFTODSA-N Cys-Pro-Ser Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SWJYSDXMTPMBHO-FXQIFTODSA-N 0.000 description 1
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 1
- 101100326341 Drosophila melanogaster brun gene Proteins 0.000 description 1
- 102000010911 Enzyme Precursors Human genes 0.000 description 1
- 108010062466 Enzyme Precursors Proteins 0.000 description 1
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 1
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 1
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 1
- LGWNISYVKDNJRP-FXQIFTODSA-N Gln-Ser-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGWNISYVKDNJRP-FXQIFTODSA-N 0.000 description 1
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 1
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 1
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 1
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 1
- UMHRCVCZUPBBQW-GARJFASQSA-N Glu-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UMHRCVCZUPBBQW-GARJFASQSA-N 0.000 description 1
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 1
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- ZALGPUWUVHOGAE-GVXVVHGQSA-N Glu-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZALGPUWUVHOGAE-GVXVVHGQSA-N 0.000 description 1
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 1
- WJZLEENECIOOSA-WDSKDSINSA-N Gly-Asn-Gln Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)O WJZLEENECIOOSA-WDSKDSINSA-N 0.000 description 1
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 1
- ZRZILYKEJBMFHY-BQBZGAKWSA-N Gly-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN ZRZILYKEJBMFHY-BQBZGAKWSA-N 0.000 description 1
- JMQFHZWESBGPFC-WDSKDSINSA-N Gly-Gln-Asp Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JMQFHZWESBGPFC-WDSKDSINSA-N 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- SIYTVHWNKGIGMD-HOTGVXAUSA-N Gly-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)CN SIYTVHWNKGIGMD-HOTGVXAUSA-N 0.000 description 1
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 1
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 1
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 1
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 1
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 1
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 1
- HQKADFMLECZIQJ-HVTMNAMFSA-N His-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N HQKADFMLECZIQJ-HVTMNAMFSA-N 0.000 description 1
- CSTNMMIHMYJGFR-IHRRRGAJSA-N His-His-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CN=CN1 CSTNMMIHMYJGFR-IHRRRGAJSA-N 0.000 description 1
- KAFZDWMZKGQDEE-SRVKXCTJSA-N His-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KAFZDWMZKGQDEE-SRVKXCTJSA-N 0.000 description 1
- WZBLRQQCDYYRTD-SIXJUCDHSA-N His-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N WZBLRQQCDYYRTD-SIXJUCDHSA-N 0.000 description 1
- LNVILFYCPVOHPV-IHPCNDPISA-N His-Trp-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O LNVILFYCPVOHPV-IHPCNDPISA-N 0.000 description 1
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 1
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 1
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- YBJWJQQBWRARLT-KBIXCLLPSA-N Ile-Gln-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O YBJWJQQBWRARLT-KBIXCLLPSA-N 0.000 description 1
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- IGJWJGIHUFQANP-LAEOZQHASA-N Ile-Gly-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N IGJWJGIHUFQANP-LAEOZQHASA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- MGUTVMBNOMJLKC-VKOGCVSHSA-N Ile-Trp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C(C)C)C(=O)O)N MGUTVMBNOMJLKC-VKOGCVSHSA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- 229910021578 Iron(III) chloride Inorganic materials 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical class NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 1
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 1
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 1
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 1
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 1
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 1
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 1
- NDSNUWJPZKTFAR-DCAQKATOSA-N Lys-Cys-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCCN NDSNUWJPZKTFAR-DCAQKATOSA-N 0.000 description 1
- VSRXPEHZMHSFKU-IUCAKERBSA-N Lys-Gln-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VSRXPEHZMHSFKU-IUCAKERBSA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 1
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 1
- ZJSZPXISKMDJKQ-JYJNAYRXSA-N Lys-Phe-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=CC=C1 ZJSZPXISKMDJKQ-JYJNAYRXSA-N 0.000 description 1
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 1
- 229910021380 Manganese Chloride Inorganic materials 0.000 description 1
- GLFNIEUTAYBVOC-UHFFFAOYSA-L Manganese chloride Chemical compound Cl[Mn]Cl GLFNIEUTAYBVOC-UHFFFAOYSA-L 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- PNDCUTDWYVKBHX-IHRRRGAJSA-N Met-Asp-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PNDCUTDWYVKBHX-IHRRRGAJSA-N 0.000 description 1
- CHQWUYSNAOABIP-ZPFDUUQYSA-N Met-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N CHQWUYSNAOABIP-ZPFDUUQYSA-N 0.000 description 1
- OOSPRDCGTLQLBP-NHCYSSNCSA-N Met-Glu-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OOSPRDCGTLQLBP-NHCYSSNCSA-N 0.000 description 1
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 1
- ZACMJPCWVSLCNS-JYJNAYRXSA-N Met-Phe-Met Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCSC)C(O)=O)CC1=CC=CC=C1 ZACMJPCWVSLCNS-JYJNAYRXSA-N 0.000 description 1
- PHKBGZKVOJCIMZ-SRVKXCTJSA-N Met-Pro-Arg Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PHKBGZKVOJCIMZ-SRVKXCTJSA-N 0.000 description 1
- FNYBIOGBMWFQRJ-SRVKXCTJSA-N Met-Pro-Met Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N FNYBIOGBMWFQRJ-SRVKXCTJSA-N 0.000 description 1
- CKAVKDJBSNTJDB-SRVKXCTJSA-N Met-Val-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCSC CKAVKDJBSNTJDB-SRVKXCTJSA-N 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- 101150040546 PXL1 gene Proteins 0.000 description 1
- DPUOLKQSMYLRDR-UBHSHLNASA-N Phe-Arg-Ala Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 DPUOLKQSMYLRDR-UBHSHLNASA-N 0.000 description 1
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 1
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 1
- MMYUOSCXBJFUNV-QWRGUYRKSA-N Phe-Gly-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N MMYUOSCXBJFUNV-QWRGUYRKSA-N 0.000 description 1
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 1
- YOFKMVUAZGPFCF-IHRRRGAJSA-N Phe-Met-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O YOFKMVUAZGPFCF-IHRRRGAJSA-N 0.000 description 1
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 1
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 1
- BSTPNLNKHKBONJ-HTUGSXCWSA-N Phe-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O BSTPNLNKHKBONJ-HTUGSXCWSA-N 0.000 description 1
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 1
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 1
- INXAPZFIOVGHSV-CIUDSAMLSA-N Pro-Asn-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 INXAPZFIOVGHSV-CIUDSAMLSA-N 0.000 description 1
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- LPGSNRSLPHRNBW-AVGNSLFASA-N Pro-His-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 LPGSNRSLPHRNBW-AVGNSLFASA-N 0.000 description 1
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 1
- NAIPAPCKKRCMBL-JYJNAYRXSA-N Pro-Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=CC=C1 NAIPAPCKKRCMBL-JYJNAYRXSA-N 0.000 description 1
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 1
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 1
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 1
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 1
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- IOVBCLGAJJXOHK-SRVKXCTJSA-N Ser-His-His Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IOVBCLGAJJXOHK-SRVKXCTJSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 1
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 1
- LSHUNRICNSEEAN-BPUTZDHNSA-N Ser-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CO)N LSHUNRICNSEEAN-BPUTZDHNSA-N 0.000 description 1
- 239000012505 Superdex™ Substances 0.000 description 1
- PZBFGYYEXUXCOF-UHFFFAOYSA-N TCEP Chemical compound OC(=O)CCP(CCC(O)=O)CCC(O)=O PZBFGYYEXUXCOF-UHFFFAOYSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 1
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 1
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 1
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 1
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 1
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- VEIKMWOMUYMMMK-FCLVOEFKSA-N Thr-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VEIKMWOMUYMMMK-FCLVOEFKSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- XGFOXYJQBRTJPO-PJODQICGSA-N Trp-Pro-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XGFOXYJQBRTJPO-PJODQICGSA-N 0.000 description 1
- CYLQUSBOSWCHTO-BPUTZDHNSA-N Trp-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N CYLQUSBOSWCHTO-BPUTZDHNSA-N 0.000 description 1
- XLMDWQNAOKLKCP-XDTLVQLUSA-N Tyr-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XLMDWQNAOKLKCP-XDTLVQLUSA-N 0.000 description 1
- WPXKRJVHBXYLDT-JUKXBJQTSA-N Tyr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N WPXKRJVHBXYLDT-JUKXBJQTSA-N 0.000 description 1
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 1
- BYAKMYBZADCNMN-JYJNAYRXSA-N Tyr-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYAKMYBZADCNMN-JYJNAYRXSA-N 0.000 description 1
- YSGAPESOXHFTQY-IHRRRGAJSA-N Tyr-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N YSGAPESOXHFTQY-IHRRRGAJSA-N 0.000 description 1
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 1
- 101710100170 Unknown protein Proteins 0.000 description 1
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 1
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 1
- RSGHLMMKXJGCMK-JYJNAYRXSA-N Val-Met-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N RSGHLMMKXJGCMK-JYJNAYRXSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- LNWSJGJCLFUNTN-ZOBUZTSGSA-N Val-Trp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LNWSJGJCLFUNTN-ZOBUZTSGSA-N 0.000 description 1
- SSKKGOWRPNIVDW-AVGNSLFASA-N Val-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SSKKGOWRPNIVDW-AVGNSLFASA-N 0.000 description 1
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 description 1
- WHNSHJJNWNSTSU-BZSNNMDCSA-N Val-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 WHNSHJJNWNSTSU-BZSNNMDCSA-N 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 238000002299 affinity electrophoresis Methods 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 229910021529 ammonia Inorganic materials 0.000 description 1
- 229940043376 ammonium acetate Drugs 0.000 description 1
- 235000019257 ammonium acetate Nutrition 0.000 description 1
- 235000011114 ammonium hydroxide Nutrition 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- 238000005349 anion exchange Methods 0.000 description 1
- 239000002518 antifoaming agent Substances 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 229960000074 biopharmaceutical Drugs 0.000 description 1
- DUEPRVBVGDRKAG-UHFFFAOYSA-N carbofuran Chemical compound CNC(=O)OC1=CC=CC2=C1OC(C)(C)C2 DUEPRVBVGDRKAG-UHFFFAOYSA-N 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 229910052927 chalcanthite Inorganic materials 0.000 description 1
- 150000001793 charged compounds Chemical class 0.000 description 1
- 239000012295 chemical reaction liquid Substances 0.000 description 1
- 239000012501 chromatography medium Substances 0.000 description 1
- 238000004440 column chromatography Methods 0.000 description 1
- 239000013065 commercial product Substances 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 230000000593 degrading effect Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000009585 enzyme analysis Methods 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 238000010230 functional analysis Methods 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 125000000291 glutamic acid group Chemical group N[C@@H](CCC(O)=O)C(=O)* 0.000 description 1
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- XLYOFNOQVPJJNP-ZSJDYOACSA-N heavy water Substances [2H]O[2H] XLYOFNOQVPJJNP-ZSJDYOACSA-N 0.000 description 1
- 230000003301 hydrolyzing effect Effects 0.000 description 1
- 238000001597 immobilized metal affinity chromatography Methods 0.000 description 1
- 239000011261 inert gas Substances 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- RBTARNINKXHZNM-UHFFFAOYSA-K iron trichloride Chemical compound Cl[Fe](Cl)Cl RBTARNINKXHZNM-UHFFFAOYSA-K 0.000 description 1
- 108010034529 leucyl-lysine Proteins 0.000 description 1
- 238000009630 liquid culture Methods 0.000 description 1
- 239000007791 liquid phase Substances 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 239000011565 manganese chloride Substances 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000013365 molecular weight analysis method Methods 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 235000019796 monopotassium phosphate Nutrition 0.000 description 1
- 230000009871 nonspecific binding Effects 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- GNSKLFRGEWLPPA-UHFFFAOYSA-M potassium dihydrogen phosphate Chemical compound [K+].OP(O)([O-])=O GNSKLFRGEWLPPA-UHFFFAOYSA-M 0.000 description 1
- LWIHDJKSTIGBAC-UHFFFAOYSA-K potassium phosphate Substances [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 125000001500 prolyl group Chemical group [H]N1C([H])(C(=O)[*])C([H])([H])C([H])([H])C1([H])[H] 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 230000009145 protein modification Effects 0.000 description 1
- 230000017854 proteolysis Effects 0.000 description 1
- 238000004445 quantitative analysis Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000004366 reverse phase liquid chromatography Methods 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000014616 translation Effects 0.000 description 1
- 108010060175 trypsinogen activation peptide Proteins 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 108010072147 type V-S pancreatic trypsin Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 230000034512 ubiquitination Effects 0.000 description 1
- 238000010798 ubiquitination Methods 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/48—Hydrolases (3) acting on peptide bonds (3.4)
- C12N9/50—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25)
- C12N9/52—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from bacteria or Archaea
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/96—Stabilising an enzyme by forming an adduct or a composition; Forming enzyme conjugates
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/68—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
- G01N33/6803—General methods of protein analysis not limited to specific proteins or families of proteins
- G01N33/6818—Sequencing of polypeptides
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Organic Chemistry (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Wood Science & Technology (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Medicinal Chemistry (AREA)
- Physics & Mathematics (AREA)
- Hematology (AREA)
- Biophysics (AREA)
- Urology & Nephrology (AREA)
- Immunology (AREA)
- Food Science & Technology (AREA)
- Pathology (AREA)
- General Physics & Mathematics (AREA)
- Analytical Chemistry (AREA)
- Bioinformatics & Computational Biology (AREA)
- Cell Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Plant Pathology (AREA)
- Enzymes And Modification Thereof (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The invention provides a recombinant acetylated lysine arginine N-terminal protease, a preparation method thereof and application thereof in proteomics research. The recombinant acetylated lysine arginine N-terminal protease reduces self-digestion characteristics, improves self stability and has higher enzyme activity. The recombinant acetylated lysine arginine N-terminal protease can be used independently or in combination with other proteases, improves the coverage of protein identification sequences and the coverage of trypsin digestion on the sequence of the amino acid residues readable by the mirrored peptide fragments and the identified peptide fragments, and is a potential proteomic tool enzyme.
Description
Technical Field
The invention belongs to the field of proteomics, and particularly relates to protease used in proteomics research.
The invention also relates to a preparation method of the protease and application of the protease in proteomics research.
Background
Proteomics is the sum of all proteins in a cell, tissue or organ and is the final result of gene transcription, translation and post-translational modifications. Studying the abundance, function, interaction, localization and regulation of proteins in cells is crucial to our understanding of the mysteries of life.
In the most common proteomics research based on mass spectrum at present, the identification of protein and post-translational modification requires that a protein sample is firstly enzymolyzed into a peptide fragment, a primary spectrogram is obtained through mass spectrum detection, then fragment ions are generated through collision of peptide fragment ions and inert gas in a mass spectrum trap, and a secondary spectrogram is obtained through mass spectrum detection. Then matching the experimental spectrogram with a theoretical spectrogram deduced based on an existing protein and genome database (called searching a library) to deduce the amino acid sequence of the peptide fragment to be detected. And utilizing the identified peptide fragment to reversely deduce the protein to be identified. This method is also known as shotgun (shotgun). The step of proteolysis into peptide fragments is the core step of proteomics research at present.
Trypsin is widely used in practice as the most commonly used protease in proteomics studies. On the one hand, the enzyme cutting efficiency and the specificity are high, and enzyme cutting hydrolysis can be carried out on the carboxyl terminals of lysine and arginine of a protein substrate specifically (but when the amino acid behind the lysine and the arginine is proline, the hydrolysis cannot happen). On the other hand, the short peptide ending in arginine or lysine formed by enzyme digestion is more suitable for ionization and fragmentation of peptide fragments to form stronger y and weaker b series ion pairs, and is suitable for separation and identification. However, since many peptide fragments formed by trypsin digestion cannot be identified by mass spectrometry, the overall coverage is low [ Swaney DL, Wenger CD & Coon JJ Value of using multiple proteins for large-scale mass spectrometry. J. protein Res 2010,9, 1323-; on the other hand, trypsin is easy to form missed cuts on enzyme cutting sites with posttranslational modifications (such as methylation, phosphorylation, ubiquitination and the like), so that the length of a peptide segment is increased, the ionization efficiency is reduced, the identification of the modification sites is influenced, and the functional analysis of important proteins is not facilitated.
The discovery, introduction and multi-enzyme combination of the novel protease can obviously improve the coverage of the proteome sequence and improve the identification efficiency of the post-translational modification site. For example, a more complete digestion of peptide fragments can be achieved by cascade digestion with lysyl endonuclease (Lys-C) capable of specifically hydrolyzing the carboxy-terminal lysine peptide bond of proteins and trypsin [ Sanders WS, Brudges SM, McCarthy FM, et al.preparation of peptides by mass spectrometry applied at the experimental set level. BMC biologicals, 2007,8Suppl 7: S23], greatly improving the efficiency of specific digestion. Glutamyl endonuclease Glu-C can be well applied to identification of phosphorylated peptides [ Seeley EH, Riggs LD, Regener FE. reduction of non-specific binding in Ga (III) immobilized metal affinity chromatography for peptides by using endogenous enzymes Glu-C as the differential enzyme J chromatography B,2005,817(1): 81-88 ], and in combination with trypsin, can effectively improve the sequence coverage of protein identification and increase the reliability of mass spectrometry detection [ precursor S, Tepp W, DatsGuta BR. Botuling of nucleic acid type B and E: purpose, limited nucleic acid enzyme and C and T-D of nucleic acid of reaction, and D-D.
The discovery of lysine arginine N-terminal protease (Lysagina) with complementary cleavage site and trypsin [ Huesgen PF, Lange PF, Rogers LD, et al, LysargNase mirrors trypsin for protein C-terminal and methylation-site identification. Nat Methods,2015,12(1): 55-58 ] provides a new powerful tool for proteomic research. LysagiNase specifically cleaves the N-terminus of lysine and arginine residues in proteins, mirroring the cleavage site of trypsin, and is called "mirror enzyme". The two enzymes are used together, so that a large number of peptide fragments with overlapped sequences can be identified, and as basic amino acid (K or R) in the peptide fragments cut by the LysagiNase is positioned at the N end, ions are easily distributed on the fragments of the series of molecular ions at the N end, the formed spectrogram has a b series ion peak which is stronger than that of the y series ions, and the b series ion peak is complementary with the stronger characteristic of the y series ion peak obtained by digesting the peptide fragments by trypsin. The method can not only complement the fragment ions which are possibly lost in the ion ions and improve the integrity of the fragment ions, but also ensure that the ion series has certain directionality, is convenient to identify and read, and can more effectively judge the accuracy of the identification sequence. The characteristics are fully utilized to develop a protein de novo sequencing technology (de novo sequencing) based on mass spectrum, and finally get rid of the dependence of shotgun proteomics on an annotated genome database, and the method has wide application prospects in the fields of unknown protein, posttranslational modification, identification of mutation-induced amino acid mutation sites and the like.
The characteristics of the protease determine the self-digestion characteristics of the protease, so that the instability of the protease is increased, the activity of the protease is reduced, and meanwhile, the formed enzymolysis peptide fragment can also increase the noise in subsequent mass spectrometry. Methylation and acetylation of lysine residues have been found to significantly reduce trypsin self-digestion and increase trypsin stability (Sri Ram J, Terminallo L, Bier M, et al. on the mechanism of enzyme action. LVIII. Acetyl-trypsin, a stable trypsin derivative. Arch Biochem Biophys,1954,52: 464. 477; Rice RH, Means GE & Brown WD.stabilization of trypsin by reaction Acta,1977,492: 316. SP. 321; Heat M, Wu F.,. Xu P., expression of protease produced by reaction 120. mu. Protein, 2015,116. mu. Protein). However, no modification has been made to increase the stability of the lysine arginine N-terminal protease.
Disclosure of Invention
It is an object of the present invention to provide an acetylated recombinant lysine arginine N-terminal protease with higher stability and enzyme activity than the same enzyme that has not been acetylated.
Another objective of the invention is to provide a method for preparing an acetylated recombinant lysine arginine N-terminal protease.
Still another object of the present invention is to provide the use of an acetylated recombinant lysine arginine N-terminal protease in proteomic research.
According to one aspect of the invention, a recombinant acetylated lysine arginine N-terminal protease comprises a polypeptide having an amino acid sequence shown as SEQ ID No.1, wherein a lysine residue side chain of the polypeptide is modified by 1-5 acetyl groups, and the acetylated lysine arginine N-terminal protease has a molecular weight selected from the group consisting of: about 29048.121Da, about 29090.004Da, about 29132.211Da, about 29173.834Da, and about 29216.844 Da.
Lysine arginine N-terminal protease is the mirror image enzyme of trypsin, and can cleave the N-terminus of lysine and arginine residues of proteins; but also cleave its own lysine and arginine residues, which is responsible for its instability and low enzymatic activity.
Modification of the lysine as well as arginine of proteases is one direction of research to prevent self-digestion. Methylation modification (Rice et al, 1977) and acetylation modification (Sri Ram et al, 1954; Zhao et al, 2015) of trypsin effectively reduce its autodigestion, increasing enzyme stability without affecting its activity. However, lysine arginine N-terminal protease still cleaves efficiently at the methylation-modified lysine and arginine sites (Huesgen et al, 2015). Experiments show that acetylation can effectively prevent the self digestion of the N-terminal protease of lysine arginine and increase the stability of the N-terminal protease. Acetylation modification occurs on the free epsilon-amino group of protein lysine residues, and the enzyme activity and stability can be improved by acetylation modification of 1, 2, 3, 4,5, 6, 7 and/or 8 lysine residues of lysine arginine N-terminal protease, preferably acetylation modification of 1-5 amino acid residues.
The commercial recombinant N-terminal protease of lysine arginine can be used for acetylation modification, but experiments show that the recombinant N-terminal protease of lysine arginine obtained by the expression vector and the purification method constructed by the invention has better enzyme activity and stability.
The invention adopts molecular weight analysis of high resolution mass spectrum to represent acetylation of the enzyme, and mass spectrum analysis data shows that acetylation modification can occur on 1-5 lysine residues of the N-terminal protease of lysine arginine.
According to another aspect of the invention, the system of the invention establishes a method for preparing an acetylated recombinant lysine arginine N-terminal protease, comprising:
1) operably linking a gene expressing the lysine arginine N-terminal protease to an expression vector; transferring the expression vector into escherichia coli to obtain an escherichia coli strain for expressing recombinant lysine arginine N-terminal protease;
2) culturing the strain of step 1) and collecting the thalli;
3) crushing the thallus obtained in the step 2) to obtain recombinant lysine arginine N-terminal protease and purifying;
4) activating the recombinant lysine arginine N-terminal protease obtained in the step 3);
5) repurifying the recombinant lysine arginine N-terminal protease activated in the step 4);
6) performing acetylation modification on the recombinant lysine arginine N-terminal protease re-purified in the step 5) to obtain the recombinant acetylated lysine arginine N-terminal protease.
According to a further aspect of the present invention, it relates to the use of the recombinant acetylated lysine arginine N-terminal protease of the invention in proteomic studies. Comprises the steps of singly using acetylated recombinant lysine arginine N-terminal protease to cleave the lysine arginine N-terminal of a target protein, jointly using trypsin for cleaving the lysine arginine C-terminal of the target protein, and jointly using other proteases such as lysyl endonuclease.
Has the advantages that: the purified recombinant lysine arginine N-terminal protease prepared by the invention has high enzyme activity and stability; can provide suitable tool enzyme for proteomics research.
Drawings
FIG. 1 shows the N-terminal protease gene coding sequence and the corresponding amino acid sequence of lysine arginine of Methanosarcina acetophaga;
the grey shaded portion of the figure is the portion that is excised upon activation of the enzyme, and the introduced 6 × His tag and the enterokinase cleavage site are marked with solid and dashed lines, respectively.
FIG. 2 shows recombinant lysine arginine N-terminal protease zymogen His affinity chromatography and electrophoresis detection;
a, affinity chromatography chromatogram, wherein the left ordinate is ultraviolet absorbance at 280nm, the right ordinate is buffer solution B linear gradient (%), and the abscissa is volume (mL);
and b, SDS-PAGE detection, wherein a lane M is a protein molecular weight standard with the molecular weight of 10-220kD, and lanes super, flow through, wash and elution are respectively thalli lysis supernatant, sample flow-through, washing and elution samples.
FIG. 3 is a SDS-PAGE electrophoresis of recombinant lysine arginine N-terminal pro-protease activation;
lane M is a protein molecular weight standard with a molecular weight of 10-220kD, lane Nie is recombinant lysine arginine N-terminal pro-protease after His affinity chromatography, lane G25 is desalted recombinant lysine arginine N-terminal pro-protease, lanes 0h, 4h, 8h and O/N are recombinant lysine arginine N-terminal protease samples activated for 0h, 4h, 8h and 16 h;
FIG. 4 shows the anion exchange column chromatography and electrophoresis detection of the recombinant lysine arginine N-terminal protease;
a: anion exchange chromatography;
the left ordinate is the ultraviolet absorbance at 280nm, the right ordinate is the conductivity, and the abscissa is the volume (mL);
b: SDS-PAGE electrophoresis detection;
lane M is a protein molecular weight standard with a molecular weight of 10-220kD, and lanes 1-5 are elution peak samples collected at different times, respectively;
FIG. 5 gel filtration chromatography and electrophoresis detection of recombinant lysine arginine N-terminal protease;
a: chromatogram, with the ordinate being the ultraviolet absorbance at 280nm and the abscissa being the volume (mL);
b: SDS-PAGE detection, lane M is a protein molecular weight standard with a molecular weight of 10-220kD, lane R29 is purified recombinant lysine arginine N-terminal protease;
FIG. 6 SDS-PAGE electrophoresis profiles of recombinant lysine arginine N-terminal protease and recombinant acetylated lysine arginine N-terminal protease;
lane M is a protein molecular weight standard with a molecular weight of 10-220kD, Lane r-LysagiNase is a recombinant lysine arginine N-terminal protease, Lane r-Ac-LysagiNase is a recombinant acetylated lysine arginine N-terminal protease;
FIG. 7 Mass spectrometric detection of recombinant lysine arginine N-terminal protease and acetylated forms thereof;
absolute intensity: absolute signal strength;
a: recombinant lysine arginine N-terminal protease;
b: recombinant acetylated lysine arginine N-terminal protease;
FIG. 8 comparison of the activity of the product enzyme of the present invention and the commercial product enzyme;
r-lysargrnase: recombinant lysine arginine N-terminal protease, r-Ac-lysargina ase: recombinant acetylated lysine arginine N-terminal protease, Comm: a commercial recombinant lysine arginine N-terminal protease; TCL is total protein of yeast cells, 1:50, 1:200, 1:800 and 1:1600 represent the ratio (w: w) of enzyme amount to TCL substrate amount; lane M is a protein molecular weight standard with a molecular weight of 10-220 kD;
FIG. 9. enzyme stability assay for the products of the invention;
r-lysargrnase: recombinant lysine arginine N-terminal protease, r-Ac-lysargina ase: recombinant acetylated lysine arginine N-terminal protease;
a: SDS-PAGE electrophoresis detection;
b: separating and identifying self-digestion characteristic peptide fragment 'RSDEVDDTPNQADPN';
c: quantitative comparison of signal intensity of self-digested peptide fragment "RSDEVDDTPNQADPN";
d: separating and identifying self-digestion characteristic peptide fragment 'KIPVVVH';
e: quantitative comparison of signal intensity of self-digested peptide fragment "KIPVVVH".
FIG. 10. mirror image of the peptide fragment based de novo sequencing technique provides complete fragment ions, identifying the direction of the majority of the ions;
FIG. 11. ion coverage distribution of Trypsin spectrum (Trypsin), acetylated lysine arginine N-terminal protease (Ac-lysargiNase) spectrum and two-way enzyme digestion Mirror image spectrum (Mirror);
FIG. 12 comparison of the proportion of correctly resolved spectra of amino acid sequences obtained by the double restriction mapping strategy (pNovoM) with two trypsin-based single-cleavage methods (pNovo + and PEAKS);
the histogram on the left characterizes the number of mirror spectra each mirror peptide fragment contains.
Detailed Description
The present invention is described in detail below with reference to examples, but the present invention is not limited to the claims. The materials used in the present invention are publicly available.
Example 1 construction of recombinant lysine arginine N-terminal protease E.coli expression vector
The gene sequence of the recombinant lysine arginine N-terminal protease is derived from Methanosarcina acetovorans (GenBank No. AE010299), and is shown in SEQ ID No. 2. We introduced a 6 XHis tag and an enterokinase cleavage site DDDDDDK (15 amino acids added in total) at their N-terminus to obtain an Open Reading Frame (ORF) encoding 357 amino acids, i.e. the sequence shown in SEQ ID NO.3, and the corresponding protein sequence shown in SEQ ID NO.4, the gene coding sequence and the corresponding protein sequence also being shown in FIG. 1. The sequence shown in SEQ ID NO.3 was inserted between Nco I/Xho I cleavage sites of pET28a vector (Novagen, cat # 69864-3CN) to obtain recombinant expression vector of recombinant lysine arginine N-terminal protease, named pET-LysargiNase.
pET-LysargNase was introduced into competent cells of Escherichia coli BL21(DE3) by chemical transformation to obtain a recombinant strain containing a recombinant lysine arginine N-terminal protease gene, and this strain was named recombinant Escherichia coli BL21(DE3) -LysargNase (hereinafter abbreviated as BL 21-LysargNase strain).
Example 2 fermentation and Collection of cells of the [ BL21-LysargiNase Strain ]
The recombinant strain BL 21-LysargNase strain was activated on a solid LB/Amp plate to obtain a monoclonal of BL 21-LysargNase strain. A single clone of BL 21-LysargNase strain was inoculated into 50mL of LB liquid medium (ampicillin concentration 100. mu.g/mL) and shake-cultured at 37 ℃. OD of LB liquid culture System600Seed liquid of BL 21-LysargNase strain was obtained at 1.5-2.0 (LB liquid medium as a blank).
The seed solution of BL 21-LysargNase strain was inoculated alone into a 14L fermentor (BioFlo 310, New Brun) containing 5L of a high-density fermentation basal medium autoclaved at 121 ℃ by a wet-heat methodswick Scientific Co.), OD of the culture system after inoculation6000.01 (blank with high-density fermentation basal medium). Initial fermentation parameters: the culture temperature was 37 ℃, the stirring speed was 400rpm, the air flow was 6L/min, and the pH was 7.0. The pH of the fermentation medium was controlled by 30% ammonia (volume percent concentration). When Dissolved Oxygen (DO) (relative dissolved oxygen)<When the concentration is 15%, the rotation speed is linked, the maximum rotation speed is 1200rpm, and the dissolved oxygen content is kept to be 15% (relative dissolved oxygen content). When OD of the initial fermentation System600>20 (blank in high-density fermentation minimal medium), BL21-LysargiNase strain fermentation broth 1 was obtained.
BL 21-LysargNase strain fermentation broth 1 was fed at a rate of 16-18 mL/h. OD of fermentation system when supplemented600At 45 (blank with high-density fermentation basal medium with feed), the temperature was lowered to 32 ℃ and IPTG was added to a final concentration of 1mM for 10h of induction culture. The induction culture parameters are as follows: the culture temperature was 32 ℃, the stirring speed was 400rpm, the air flow was 6L/min, the pH was 7.0, and the pH of the fermentation medium was controlled by 30% ammonia water (volume percentage concentration). The dissolved oxygen amount and the rotation speed were controlled as above to obtain fermentation broth 2 of BL21-LysargiNase strain.
And centrifuging BL 21-LysargNase strain fermentation liquor 2 for 30min at 4000rpm by adopting a horizontal rotor to collect thalli, thereby obtaining BL 21-LysargNase thalli.
The medium and solution components were as follows:
high-density fermentation basal medium: 26.7g of glycerol, 40.0g of yeast powder, 1.8g of monopotassium phosphate, 1.8g of citric acid, 5mL of salt solution, 1mL of trace metal salt solution and 1mL of defoaming agent 204, distilled water is added to make up the volume of the solution to 1L, and the pH value is 7.2.
The salt solution consists of water and solutes, the solutes and their concentrations being: 250g/L MgCl2·6H2O、100g/L CaCl2·2H2O, 100g/L KCl and 2M Citric acid (Citric acid).
The trace metal salt solution: 6.8g ZnCl2、54.0g FeCl3·6H2O、16.2g MnCl2·4H2O、2.2g CuSO4·5H2O、4.8g CoCl2·6H2O、0.024g(NH4)6Mo7O24·4H2O, 0.2g KI, 119mL of 37% (vol.% concentration) concentrated hydrochloric acid, and distilled water to make up the volume of 1L.
The feed consists of water and solutes at concentrations: 275g/L of glycerol and 225g/L of yeast powder.
Example 3 [ purification of recombinant lysine arginine N-terminal protease ]
200g of BL21-LysargiNase cells obtained in example 2 were resuspended in 20mM Tris pH 7.5, 150mM NaCl buffer at a ratio of 1:10(w/v), homogenized with a high-speed tissue cutter, and disrupted by high-pressure homogenization (APV-1000 homogenizer, SPX FLOW, Inc.). The high pressure homogenization pressure was set at 800bar, 13000g after 3 successive disruptions, centrifugation at 4 ℃ for 30min, and the supernatant collected was subjected to metal chelate affinity chromatography on AKTA Purifier (GE healthcare) (HisTrapFF 5mL pre-column, GE healthcare, cat # 17-5255-01).
The column was pre-equilibrated with buffer A (20mM Tris,150mM NaCl, pH 7.5) and sample loaded at a flow rate of 5 mL/min. After the loading was completed, the hetero-protein was eluted with 10 column volumes of buffer A and 10 column volumes of 5% buffer B (20mM Tris,150mM NaCl,0.5M imidazole, pH 7.5), followed by linear gradient elution with 15 column volumes of 5-80% buffer B. All purification work was carried out at 20-25 ℃. The purified recombinant lysine arginine N-terminal protease solution was stored at 4 ℃. The results of the chromatographic and electrophoretic measurements are shown in FIG. 2.
Example 4 [ activation to recombinant lysine arginine N-terminal protease ]
The purified recombinant lysine arginine N-terminal protease solution obtained in example 3 was desalted by G-25 (HiPrep desalting column, GE healthcare, cat # 17508701, buffer 20mM Tris, pH 7.5), then concentrated by ultrafiltration through a 10kD ultrafiltration tube (Amicon Ultra-15mL,10kDa Centrifugal Filter Unit) to obtain a recombinant lysine arginine N-terminal protease solution with a protein concentration of 0.5-1mg CaCl/mL, and then added with CaCl2To 10mM, activated at 30 ℃ for 0h, 4h, 8h and 16h, and sampled respectively for SDS-PAGE electrophoresis detection. The results are shown in FIG. 3, activation resultsThe recombinant lysine arginine N-terminal protease of (3) has a molecular weight of 29.0 kD.
Example 5 [ purification of recombinant lysine arginine N-terminal protease ]
The activated recombinant lysine arginine N-terminal protease solution obtained in example 4 was purified by anion exchange chromatography (HiTrappQ HP 5mL pre-packed column, GE healthcare Cathe # 17-1154-01). The column was pre-equilibrated with equilibration buffer (20mM Tris, pH 7.5) and sample loaded. After the completion of the sample loading, the sample was eluted in a linear elution system using an elution buffer (20mM Tris,1M NaCl, pH 7.5) containing 1M sodium chloride, wherein the elution gradient was set to a linear gradient of 25mM to 500mM sodium chloride at a flow rate of 5mL/min for 20min (FIG. 4 a).
Elution peaks containing the target protein were pooled based on SDS-PAGE (FIG. 4b), and concentrated by ultrafiltration through a 10kD ultrafiltration tube (Amicon Ultra-15mL,10kD Centrifugal Filter Unit) to give a recombinant lysine arginine N-terminal protease solution at a concentration of 5-10 mg/mL. Further, purification was carried out by gel filtration chromatography (HiLoad Superdex 75PG, GE healthcare, cat. No. 28989334). The eluate was chromatographed by gel filtration (20mM Tris,150mM NaCl,10mM CaCl)2pH 7.5) was used. The separation range of the chromatography medium used for gel filtration chromatography covers 3-70 kD. And collecting the elution peak of the activated recombinant lysine arginine N-terminal protease to obtain a pure recombinant lysine arginine N-terminal protease (figure 5).
Example 6 [ acetylation of recombinant lysine arginine N-terminal protease ]
Adding acetic anhydride with the concentration of 800mM (6mL/L reaction volume) into the recombinant lysine arginine N-terminal protease solution prepared in the example 5 to obtain a recombinant lysine arginine N-terminal protease acetylation reaction solution, and reacting at 20-25 ℃ for 30min to obtain the recombinant acetylated lysine arginine N-terminal protease. Wherein acetic anhydride (product of Beijing Hua Kaiyuan chemical Co., Ltd., catalog No. 10000318) is diluted with dioxane (1,4-dioxane, carbofuran scientific product, catalog No. 156496). In the reaction process, 0.5M sodium hydroxide is used for adjusting the pH value of the acetylation reaction liquid to be stable at 6.2-7.2.
The recombinant acetylated lysine arginine N-terminal protease solution was concentrated to 3mg/mL by ultrafiltration through a 10kD ultrafiltration tube (Amicon Ultra-15mL,10kDa Centrifugal Filter Unit).
Example 7 identification of recombinant lysine arginine N-terminal protease and recombinant acetylated lysine arginine N-terminal protease
The SDS-PAGE results of the recombinant lysine arginine N-terminal protease and the recombinant acetylated lysine arginine N-terminal protease are shown in FIG. 6. The molecular weight was found to be around 29kD, corresponding to the theoretical molecular weight of 29005.272.
Further, the prepared recombinant lysine arginine N-terminal protease and recombinant acetylated lysine arginine N-terminal protease are subjected to mass spectrum identification, and the mass spectrum conditions are as follows:
mass spectrometry: active PLus EMR;
liquid phase: RIGOL L-3000;
mobile phase: 20mM ammonium acetate;
flow rate: 0.3 mL/min;
resolution ratio: 35000.
as shown in FIG. 7a, the molecular weight of the recombinant lysine arginine N-terminal protease is 29006.022Da, which is consistent with its theoretical value of 29005.272 Da. As shown in FIG. 7b, the mass spectrum identification result of the N-terminal protease of the recombinant acetylated lysine arginine showed a total of 5 peaks, wherein the main peak was 29090.004Da and the peak with the largest molecular weight was 29216.844 Da. The molecular weight of the acetyl group is known to be 42Da, and the recombinant acetylated lysine arginine N-terminal protease obtained by us is supposed to be a mixture of 1-5 acetyl modified recombinant lysine arginine N-terminal proteases.
Example 8 [ assay of recombinant lysine arginine N-terminal protease and recombinant acetylated lysine arginine N-terminal protease Activities ]
The activities of the recombinant lysine arginine N-terminal protease and the recombinant acetylated lysine arginine N-terminal protease prepared in the foregoing examples were examined using yeast cell total protein (TCL).
Collecting 20 μ g of TCL, adding 1mM tricarboxymethylphosphonic acid (TCEP, Sigma, C4706), denaturing at 45 deg.C for 30min, and adding enzyme at a ratio of 1:50, 1:200, 1:800, and 1:1600Example, the reaction buffer (50mM HEPES,5mM CaCl) was added thereto and reacted at 45 ℃ for 4 hours21.5M Urea, pH 8.0). After the reaction is finished, the sample is subjected to SDS-PAGE electrophoresis detection. As shown in FIG. 8, the results showed that the digestion efficiency of TCL by the recombinant lysine arginine N-terminal protease and the recombinant acetylated lysine arginine N-terminal protease was equivalent when the substrate amount was 1:50 and 1:200, and that there was substantially no residual protein in the lanes, indicating that the substrate protein was completely degraded and the enzyme activity was high; in contrast, the commercial recombinant lysine arginine N-terminal protease (Structural Biology Unit of CSIC, Spain) has more significant substrate protein remained at the enzyme to substrate amount of 1:50, and more undegraded substrate protein remained at the enzyme to substrate amount of 1:200, indicating that the commercial recombinant lysine arginine N-terminal protease has lower activity. At enzyme to substrate amounts of 1:800 and 1:1600, the acetylated recombinant lysine arginine N-terminal protease lane fades out of residual protein, but the grayscale was significantly lower than the recombinant lysine arginine N-terminal protease digested sample, indicating that the activity of the acetylated recombinant lysine arginine N-terminal protease was significantly higher than the non-acetylated recombinant lysine arginine N-terminal protease.
Example 9 comparison of Mass Spectrometry identification of Yeast proteome samples Using recombinant lysine arginine N-terminal protease and recombinant acetylated lysine arginine N-terminal protease
Furthermore, we compared the activities of the recombinant lysine arginine N-terminal protease and the recombinant acetylated lysine arginine N-terminal protease prepared by the present application by using a liquid phase-mass spectrometry technology (LC-MS/MS) which is commonly used in the current proteomics research. A sample of total yeast cell protein (TCL) was first reductively alkylated with 10mM Dithiothreitol (DTT) and 30mM Iodoacetamide (IAA). The enzyme digestion buffer solution is 50mM HEPES, 0.05% RapiGest and 5mM CaCl2pH8.0, enzyme amount ratio 1:50 (enzyme: protein, w: w), and digestion at 37 ℃ overnight. The analysis was performed using an ultra-high pressure liquid chromatography system (Waters Corporation, Milford, MA, USA) with an LTQ Qrbitrap Velos mass spectrometer (Thermo Electron, San Jose, Calif., USA). Separating with home-made reversed phase chromatography column (75 μm × 15cm,3 μm,200A) for 60min, wherein the mobile phase A is 0.1% FA, 2% AcN, 98% H2O, flowPhase B is 0.1% FA, 98% AcN, 2% H2O; the flow rate was 300 nL/min. And (4) carrying out mass spectrum analysis on the eluted peptide fragments. The mass spectrum scanning range is 300-1600 m/z; the primary and secondary spectral accuracies were 30,000 and 7,500 (at 400m/z), respectively.
Although the electrophoresis result shows that the digestion efficiency of the recombinant acetylated lysine arginine N-terminal protease and the recombinant lysine arginine N-terminal protease on yeast TCL is equivalent when the enzyme-substrate amount is 1:50 (figure 8, example 8), LC-MS/MS identification finds that the recombinant acetylated lysine arginine N-terminal protease is obviously higher than the recombinant lysine arginine N-terminal protease in relevant indexes such as Peptide-Spectrum matching number (PSM), total number of identified Peptide segments, number of K/R specific Peptide segments, protein identification number and the like, thereby having obvious advantages in proteomics research. In addition, a self-digesting peptide fragment was identified in the data for the recombinant lysine arginine N-terminal protease but not in the data for the recombinant acetylated lysine arginine N-terminal protease (table 1). These results indicate that acetylation can significantly improve lysine arginine N-terminal protease stability and activity.
Table 1: comparison of protein Mass Spectrometry identification results
Example 10 [ identification of stability of recombinant acetylated lysine arginine N-terminal protease ]
To investigate the stabilizing effect of acetylation on the recombinant lysine arginine N-terminal protease, the recombinant lysine arginine N-terminal protease and the recombinant acetylated lysine arginine N-terminal protease obtained in the above examples were diluted to 0.5mg/mL, and the same amount of the enzyme (electrophoresis results are shown in FIG. 9 a) was added to a reaction buffer (50mM HEPES,5mM CaCl) respectively21.5M Urea, pH8.0) and reacted at 30 ℃ for 18 hours.
The digestion products were quantitatively assayed for self-digestion-characterized peptides of recombinant lysine arginine N-terminal protease and recombinant acetylated lysine arginine N-terminal protease using an ultra-high pressure liquid chromatography system (Waters Corporation, Milford, MA, USA) in combination with an LTQ Qrbitrap Velos mass spectrometer (Thermo Electron, San Jose, Calif., USA). The sample separation and mass spectrometry conditions were the same as in example 9.
The results showed that after 18 hours of autodigestion, the amount of autodigestion-characteristic peptide stretch "RSDEVDDTPNQADPN" (SEQ No.5) in the recombinant acetylated lysine arginine N-terminal protease was only about 5% of the recombinant lysine arginine N-terminal protease (fig. 9b, 9 c); the quantitative analysis of another self-digestion characteristic peptide fragment "KIPVVVH" (SEQ No.6) also showed that the peptide fragment of the recombinant acetylated lysine arginine N-terminal protease was only about 14% of the recombinant lysine arginine N-terminal protease (FIGS. 9d, 9 e). This result further demonstrates that the recombinant acetylated lysine arginine N-terminal protease is structurally stable, resistant to self-digestion under conventional optimal digestion temperature conditions, and may further enhance its own activity in degrading other protein substrates by its self-digestion resistance.
Example 11 [ establishment of a mirror image peptide fragment-based protein De novo sequencing technology ]
The term "mirror image peptide" as used in this study refers to two peptides (one from trypsin and one from acetylated lysine arginine N-terminal protease) whose sequences are respectively designated A1A2…Al[K/R/-]And [ K/R/-]A1A2…AlWherein A isiAnd can be any of 20 amino acid residues, "-" indicates no amino acid. For example, inGLEWVAR and KGLEWVAAndGLEWVAr andGLEWVAare considered to be mirror image peptide fragments. The mirror image spectrogram refers to a mass spectrum spectrogram corresponding to the mirror image peptide fragment.
Based on the enzyme digestion characteristics of trypsin and acetylated lysine arginine N-terminal protease, the y-series ion peak of a trypsin digestion mass spectrum is stronger, and the b-series ion peak of acetylated lysine arginine N-terminal protease digestion is stronger, so that most ion types can be confirmed; further, conventional trypsin-based cleavage spectra are often limited by the absence of ions, particularly N-terminal ions, as shown in fig. 10, due to the absence of y when trypsin is used alone6Ion, the sequence of two amino acids at the N-terminus isGL or LG are not determinable. But when the corresponding mirror image of the N-terminal protease cleavage of acetylated lysine arginine is introduced, b2The presence of the ion can demonstrate that the N-terminus is GL rather than LG.
By using the strategy of combining trypsin with acetylated lysine arginine N-terminal protease, a new algorithm software pNovoM (in the software copyright application) for mirror image spectrum search and amino acid sequence analysis is developed, and de novo sequencing is carried out on a CHO recombinant expression monoclonal antibody PXL 1.
The result of comparing the amino acid sequence obtained by pnofom analysis with the actual sequence of PXL1 shows that the proportion of the full coverage spectrum of ions can reach 96% by using the matching of the peptide fragment of the double digestion mirror image of trypsin and acetylated lysine arginine N-terminal protease, while the full coverage of ions can be realized by only 48% and 44% of the spectra of trypsin or acetylated lysine arginine N-terminal protease (fig. 11).
Further analysis shows that the proportion of the correct amino acid sequence analysis spectrogram reaches 87 percent (FIG. 12) based on the double-enzyme digestion mirror image spectrogram, and only 4 amino acid errors occur in 416 amino acid sequences obtained by analysis, and the error rate is about 1 percent. If trypsin is used for single digestion, other de novo sequencing software available today, pNovo + (Chi H., Chen H., He K.et al. pNovo +: de novel peptide sequencing HCD and ETD standard Mass spectrum Res.201312: 615) and PEAKS (Ma B., Zhang K., Hendrie C.et al. PEAKS: power functional software for peptide de novel sequencing by Mass spectrometry.Rapid communication Mass Spectrometry.2003,17: 2332347-625) are used, and the proportion of correct amino acid sequence spectra is lower than 60% (FIG. 12). These results indicate that a bi-directional cleavage of trypsin and acetylated lysine arginine N-terminal protease in combination with pNovoM software algorithm can achieve accurate de novo sequencing of a single purified protein.
Sequence listing
<110> Peking proteome research center
<120> recombinant acetylated lysine arginine N-terminal protease, and preparation method and application thereof
<130> BJ1936-17P121691
<160> 6
<170> SIPOSequenceListing 1.0
<210> 1
<211> 342
<212> PRT
<213> Artificial
<220>
<223> amino acid sequence of lysine arginine N-terminal protease
<400> 1
Met Ala Glu Lys Phe Glu Ser Arg Gly Ile Glu Glu Ala Ser Ser Glu
1 5 10 15
Val Pro Thr Gln Arg Arg Cys Gly Ala Met Glu Val His His Arg Leu
20 25 30
Leu Arg Ser Ala Ser Tyr Val Arg Glu Arg Asp Gln Ile Glu Asn Leu
35 40 45
Ala Leu Lys Tyr Lys Gln Gly Phe Arg Ala Ile Ser Arg Met Glu Ile
50 55 60
Val Lys Ile Pro Val Val Val His Val Val Trp Asn Glu Glu Glu Glu
65 70 75 80
Asn Ile Ser Asp Ala Gln Ile Gln Ser Gln Ile Asp Ile Leu Asn Lys
85 90 95
Asp Phe Arg Lys Leu Asn Ser Asp Val Ser Gln Val Pro Ser Val Trp
100 105 110
Ser Asn Leu Ile Ala Asp Leu Gly Ile Glu Phe Phe Leu Ala Thr Lys
115 120 125
Asp Pro Asn Gly Asn Gln Thr Thr Gly Ile Thr Arg Thr Gln Thr Ser
130 135 140
Val Thr Phe Phe Thr Thr Ser Asp Glu Val Lys Phe Ala Ser Ser Gly
145 150 155 160
Gly Glu Asp Ala Trp Pro Ala Asp Arg Tyr Leu Asn Ile Trp Val Cys
165 170 175
His Val Leu Lys Ser Glu Ile Gly Gln Asp Ile Leu Gly Tyr Ala Gln
180 185 190
Phe Pro Gly Gly Pro Ala Glu Thr Asp Gly Val Val Ile Val Asp Ala
195 200 205
Ala Phe Gly Thr Thr Gly Thr Ala Leu Pro Pro Phe Asp Lys Gly Arg
210 215 220
Thr Ala Thr His Glu Ile Gly His Trp Leu Asn Leu Tyr His Ile Trp
225 230 235 240
Gly Asp Glu Leu Arg Phe Glu Asp Pro Cys Ser Arg Ser Asp Glu Val
245 250 255
Asp Asp Thr Pro Asn Gln Ala Asp Pro Asn Phe Gly Cys Pro Ser Tyr
260 265 270
Pro His Val Ser Cys Ser Asn Gly Pro Asn Gly Asp Met Phe Met Asn
275 280 285
Tyr Met Asp Tyr Val Asp Asp Lys Cys Met Val Met Phe Thr Gln Gly
290 295 300
Gln Ala Thr Arg Val Asn Ala Cys Leu Asp Gly Pro Arg Ser Ser Phe
305 310 315 320
Leu Ala Arg Val Glu Glu Thr Glu Lys Lys Glu Ala Pro Ser Lys Arg
325 330 335
Glu Met Pro Met Pro Arg
340
<210> 2
<211> 1029
<212> DNA
<213> Artificial
<220>
<223> DNA sequence encoding lysine arginine N-terminal protease
<400> 2
atggcagaaa aatttgaaag tagagggata gaggaagcgt catctgaagt gccaactcaa 60
cgtagatgcg gagcaatgga agttcaccac aggctgctaa ggtctgcgtc gtacgtcaga 120
gaacgcgatc aaattgaaaa tctggctctc aaatataaac aagggtttcg agcaatatct 180
cggatggaaa ttgtcaagat ccctgtggtt gttcatgtcg tatggaatga agaagaggag 240
aatatttctg acgctcagat ccagagccag atagatattc tcaataaaga cttccgcaag 300
ttgaattccg acgtgtcaca ggtgccttct gtctggagta atctcatagc agacctgggg 360
attgagtttt tcctcgcaac aaaagacccg aatggaaatc agactacagg aataactcgc 420
actcaaactt cagtgacctt tttcactacc agcgatgaag tgaaattcgc atccagtggg 480
ggtgaggatg catggccggc cgatcgttat ctgaatattt gggtatgtca tgtgcttaaa 540
agtgagatag gtcaagacat attgggttac gcgcaatttc caggtgggcc cgctgaaacc 600
gatggtgttg ttattgttga tgcagctttt ggtaccactg gaactgcctt accgccattt 660
gacaagggac gaacggcaac ccatgaaatc ggacactggt taaacctcta ccatatctgg 720
ggtgacgaat tgcgttttga ggatccatgt tcacgttcag atgaggttga tgatactcca 780
aatcaggcag atcctaattt tggctgtcca agttatccac atgtcagctg cagcaatgga 840
ccaaatggcg atatgttcat gaattacatg gattacgtag acgataaatg tatggttatg 900
ttcacacagg gccaggcaac tcgtgtaaat gcatgtcttg acggaccaag atcatcgttc 960
ctggcgagag tggaagaaac agaaaagaaa gaagcaccat ccaagcgtga aatgcctatg 1020
ccaaggtaa 1029
<210> 3
<211> 1074
<212> DNA
<213> Artificial
<220>
<223> open reading frame sequence encoding lysine arginine N-terminal protease
<400> 3
atgggcagca gccatcatca tcatcatcac gatgatgatg acaagatggc agaaaaattt 60
gaaagtagag ggatagagga agcgtcatct gaagtgccaa ctcaacgtag atgcggagca 120
atggaagttc accacaggct gctaaggtct gcgtcgtacg tcagagaacg cgatcaaatt 180
gaaaatctgg ctctcaaata taaacaaggg tttcgagcaa tatctcggat ggaaattgtc 240
aagatccctg tggttgttca tgtcgtatgg aatgaagaag aggagaatat ttctgacgct 300
cagatccaga gccagataga tattctcaat aaagacttcc gcaagttgaa ttccgacgtg 360
tcacaggtgc cttctgtctg gagtaatctc atagcagacc tggggattga gtttttcctc 420
gcaacaaaag acccgaatgg aaatcagact acaggaataa ctcgcactca aacttcagtg 480
acctttttca ctaccagcga tgaagtgaaa ttcgcatcca gtgggggtga ggatgcatgg 540
ccggccgatc gttatctgaa tatttgggta tgtcatgtgc ttaaaagtga gataggtcaa 600
gacatattgg gttacgcgca atttccaggt gggcccgctg aaaccgatgg tgttgttatt 660
gttgatgcag cttttggtac cactggaact gccttaccgc catttgacaa gggacgaacg 720
gcaacccatg aaatcggaca ctggttaaac ctctaccata tctggggtga cgaattgcgt 780
tttgaggatc catgttcacg ttcagatgag gttgatgata ctccaaatca ggcagatcct 840
aattttggct gtccaagtta tccacatgtc agctgcagca atggaccaaa tggcgatatg 900
ttcatgaatt acatggatta cgtagacgat aaatgtatgg ttatgttcac acagggccag 960
gcaactcgtg taaatgcatg tcttgacgga ccaagatcat cgttcctggc gagagtggaa 1020
gaaacagaaa agaaagaagc accatccaag cgtgaaatgc ctatgccaag gtaa 1074
<210> 4
<211> 357
<212> PRT
<213> Artificial
<220>
<223> amino acid sequence corresponding to open reading frame encoding lysine arginine N-terminal protease
<400> 4
Met Gly Ser Ser His His His His His His Asp Asp Asp Asp Lys Met
1 5 10 15
Ala Glu Lys Phe Glu Ser Arg Gly Ile Glu Glu Ala Ser Ser Glu Val
20 25 30
Pro Thr Gln Arg Arg Cys Gly Ala Met Glu Val His His Arg Leu Leu
35 40 45
Arg Ser Ala Ser Tyr Val Arg Glu Arg Asp Gln Ile Glu Asn Leu Ala
50 55 60
Leu Lys Tyr Lys Gln Gly Phe Arg Ala Ile Ser Arg Met Glu Ile Val
65 70 75 80
Lys Ile Pro Val Val Val His Val Val Trp Asn Glu Glu Glu Glu Asn
85 90 95
Ile Ser Asp Ala Gln Ile Gln Ser Gln Ile Asp Ile Leu Asn Lys Asp
100 105 110
Phe Arg Lys Leu Asn Ser Asp Val Ser Gln Val Pro Ser Val Trp Ser
115 120 125
Asn Leu Ile Ala Asp Leu Gly Ile Glu Phe Phe Leu Ala Thr Lys Asp
130 135 140
Pro Asn Gly Asn Gln Thr Thr Gly Ile Thr Arg Thr Gln Thr Ser Val
145 150 155 160
Thr Phe Phe Thr Thr Ser Asp Glu Val Lys Phe Ala Ser Ser Gly Gly
165 170 175
Glu Asp Ala Trp Pro Ala Asp Arg Tyr Leu Asn Ile Trp Val Cys His
180 185 190
Val Leu Lys Ser Glu Ile Gly Gln Asp Ile Leu Gly Tyr Ala Gln Phe
195 200 205
Pro Gly Gly Pro Ala Glu Thr Asp Gly Val Val Ile Val Asp Ala Ala
210 215 220
Phe Gly Thr Thr Gly Thr Ala Leu Pro Pro Phe Asp Lys Gly Arg Thr
225 230 235 240
Ala Thr His Glu Ile Gly His Trp Leu Asn Leu Tyr His Ile Trp Gly
245 250 255
Asp Glu Leu Arg Phe Glu Asp Pro Cys Ser Arg Ser Asp Glu Val Asp
260 265 270
Asp Thr Pro Asn Gln Ala Asp Pro Asn Phe Gly Cys Pro Ser Tyr Pro
275 280 285
His Val Ser Cys Ser Asn Gly Pro Asn Gly Asp Met Phe Met Asn Tyr
290 295 300
Met Asp Tyr Val Asp Asp Lys Cys Met Val Met Phe Thr Gln Gly Gln
305 310 315 320
Ala Thr Arg Val Asn Ala Cys Leu Asp Gly Pro Arg Ser Ser Phe Leu
325 330 335
Ala Arg Val Glu Glu Thr Glu Lys Lys Glu Ala Pro Ser Lys Arg Glu
340 345 350
Met Pro Met Pro Arg
355
<210> 5
<211> 15
<212> PRT
<213> Artificial
<220>
<223> self-digestion characteristic peptide segment of lysine arginine N-terminal protease
<400> 5
Arg Ser Asp Glu Val Asp Asp Thr Pro Asn Gln Ala Asp Pro Asn
1 5 10 15
<210> 6
<211> 7
<212> PRT
<213> Artificial
<220>
<223> another self-digestion characteristic peptide fragment of lysine arginine N-terminal protease
<400> 6
Lys Ile Pro Val Val Val His
1 5
Claims (9)
1. A recombinant acetylated lysine arginine N-terminal protease, wherein the lysine arginine N-terminal protease is a polypeptide with an amino acid sequence shown as SEQ ID No.1, the acetylated lysine arginine N-terminal protease is modified with 1-5 acetyl groups on a lysine residue side chain of the polypeptide, and the acetylated lysine arginine N-terminal protease has a molecular weight selected from the following under the condition of mass spectrum resolution 35000: about 29048.121Da, about 29090.004Da, about 29132.211Da, about 29173.834Da, and about 29216.844 Da;
modifying an acetyl group on a lysine residue side chain of the polypeptide by:
adding acetic anhydride into the activated recombinant lysine arginine N-terminal protease solution for acetylation, and reacting at 20-25 ℃ and pH6.2-7.2 to obtain the recombinant acetylated lysine arginine N-terminal protease.
2. The recombinant acetylated lysine arginine N-terminal protease of claim 1, wherein the recombinant acetylated lysine arginine N-terminal protease is prepared by the method of:
1) operably linking a gene expressing the lysine arginine N-terminal protease to an expression vector; transferring the expression vector into escherichia coli to obtain an escherichia coli strain for expressing recombinant lysine arginine N-terminal protease;
2) culturing the strain of step 1) and collecting the thalli;
3) crushing the thallus obtained in the step 2) to obtain recombinant lysine arginine N-terminal protease and purifying;
4) activating the recombinant lysine arginine N-terminal protease obtained in the step 3);
5) repurifying the recombinant lysine arginine N-terminal protease activated in the step 4);
6) performing acetylation modification on the recombinant lysine arginine N-terminal protease re-purified in the step 5) to obtain the recombinant acetylated lysine arginine N-terminal protease.
3. The recombinant acetylated lysine arginine N-terminal protease of claim 2, wherein the protease is prepared by a process comprising:
1a) introducing a 6 XHis tag and an enterokinase cleavage site DDDDK into the N end of a protease gene sequence expressing the N end of lysine arginine to obtain a recombinant gene sequence, inserting the recombinant gene sequence into a pET28a vector to obtain a recombinant expression vector of the protease gene expressing the N end of the lysine arginine, and transferring the expression vector into a competent cell of escherichia coli BL21(DE3) to obtain a recombinant strain containing a recombinant protease gene expressing the N end of the lysine arginine;
2a) fermenting and culturing the recombinant strain in the step 1a), and collecting;
3a) resuspending the thallus obtained in step 2a), crushing the thallus, centrifuging and collecting supernatant, performing metal chelate affinity chromatography, and purifying to obtain recombinant lysine arginine N-terminal protease;
4a) desalting the purified recombinant lysine arginine N-terminal protease of step 3a), concentrating by ultrafiltration, adding CaCl2Incubation and activation are carried out to obtain activated recombinant lysine arginine N-terminal protease;
5a) subjecting the activated recombinant lysine arginine N-terminal protease of step 4a) to anion exchange chromatography to obtain a purified recombinant lysine arginine N-terminal protease;
6a) adding acetic anhydride solution into the purified recombinant lysine arginine N-terminal protease obtained in the step 5a) for acetylation to obtain the recombinant acetylated lysine arginine N-terminal protease.
4. The recombinant acetylated lysine arginine N-terminal protease of claim 3, wherein the chromatography of step 3a) is performed by equilibrating the column with buffer A containing 20mM Tris,150mM NaCl, pH 7.5, then loading at a flow rate of 5mL/min, eluting the hetero-protein with buffer A and 5% buffer B after loading, and then eluting with a linear gradient of 5-80% buffer B containing 20mM Tris,150mM NaCl,0.5M imidazole, pH 7.5.
5. The recombinant acetylated lysine arginine N-terminal protease of claim 3, wherein the anion exchange chromatography of step 5a) is performed using a HiTrap Q HP 5mL pre-packed column, pre-equilibrated the column with an equilibration buffer, loaded, and eluted linearly with an elution buffer at an elution flow rate of 5mL/min for 20min, wherein the equilibration buffer comprises 20mM Tris, pH 7.5, the elution buffer comprises 20mM Tris,1M NaCl, pH 7.5, and the gradient of the linear elution is 25 mM-500 mM NaCl.
6. A method of making the recombinant acetylated lysine arginine N-terminal protease of claim 1 comprising:
1) operably linking a gene expressing the lysine arginine N-terminal protease to an expression vector; transferring the expression vector into escherichia coli to obtain an escherichia coli strain for expressing recombinant lysine arginine N-terminal protease;
2) culturing the strain of step 1) and collecting the thalli;
3) crushing the thallus obtained in the step 2) to obtain recombinant lysine arginine N-terminal protease and purifying;
4) activating the recombinant lysine arginine N-terminal protease obtained in the step 3);
5) repurifying the recombinant lysine arginine N-terminal protease activated in the step 4);
6) performing acetylation modification on the recombinant lysine arginine N-terminal protease re-purified in the step 5) to obtain the recombinant acetylated lysine arginine N-terminal protease.
7. The method of claim 6, comprising the steps of:
(1) introducing a 6 XHis tag and an enterokinase cleavage site DDDDK into the N end of an expressed lysine arginine N-terminal protease gene to obtain a recombinant gene sequence, and inserting the recombinant gene sequence into a pET28a vector to obtain a recombinant expression vector of the recombinant lysine arginine N-terminal protease; introducing the recombinant expression vector of the recombinant lysine arginine N-terminal protease into competent cells of escherichia coli BL21(DE3) to obtain a recombinant strain containing a recombinant lysine arginine N-terminal protease gene, namely BL21(DE3) -LysargNase;
(2) inoculating BL21(DE3) -LysargNase strain, and performing shake flask culture by using LB culture medium to obtain a seed solution;
inoculating the seed solution in a fermentation tank, performing fed-batch fermentation, performing IPTG induced expression to obtain fermentation liquor, centrifuging the fermentation liquor, and collecting BL21(DE3) -LysargNase thallus;
(3) resuspending the thallus obtained in the step (2), crushing the thallus, centrifuging and collecting supernatant, and performing metal chelating and affinity chromatography;
wherein, in the chromatography process, the chromatography column is balanced by buffer solution A in advance, then the sample is loaded, the flow rate of the sample loading is 5mL/min, the buffer solution A contains 20mM Tris,150mM NaCl and pH 7.5,
after the loading is finished, eluting the hybrid protein by using a buffer solution A and a 5% buffer solution B, and then eluting by using a linear gradient of the 5-80% buffer solution B, wherein the buffer solution B contains 20mM Tris,150mM NaCl,0.5M imidazole and pH 7.5;
(4) desalting, ultrafiltering and concentrating the purified recombinant lysine arginine N-terminal protease solution obtained in the step (3), and adding CaCl2Incubating and activating to 10mM to obtain a recombinant lysine arginine N-terminal protease solution;
(5) carrying out anion exchange chromatography on the activated recombinant lysine arginine N-terminal protease solution obtained in the step (4) to obtain a repurified recombinant lysine arginine N-terminal protease solution;
wherein, anion exchange chromatography adopts a HiTrap Q HP 5mL prepacked column, the chromatographic column is balanced by an equilibrium buffer solution in advance, the sample is loaded, the elution is carried out linearly by an elution buffer solution, the elution flow rate is 5mL/min, the elution time is 20min, the equilibrium buffer solution contains 20mM Tris and pH 7.5, the elution buffer solution contains 20mM Tris,1M NaCl and pH 7.5, and the gradient of linear elution is 25 mM-500 mM sodium chloride;
(6) adding acetic anhydride solution into the repurified recombinant lysine arginine N-terminal protease solution obtained in the step (5) for acetylation to obtain recombinant acetylated lysine arginine N-terminal protease solution, and finally performing ultrafiltration concentration;
the ratio of the purified recombinant lysine arginine N-terminal protease solution to the acetic anhydride solution is 6mL/L reaction volume, the concentration of the acetic anhydride solution is 800mM, the acetic anhydride solution is obtained by diluting with dioxane, the reaction temperature is 20-25 ℃, the reaction lasts for 30min, the pH of the reaction solution is controlled to be 6.2-7.2, and the obtained recombinant acetylated lysine arginine N-terminal protease solution is concentrated by ultrafiltration through an ultrafiltration tube.
8. Use of the recombinant acetylated lysine arginine N-terminal protease of claim 1 in proteomics research.
9. The use of claim 8, wherein the recombinant acetylated lysine arginine N-terminal protease is used in combination with trypsin.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710942714.8A CN109652397B (en) | 2017-10-11 | 2017-10-11 | Recombinant acetylated lysine arginine N-terminal protease and preparation method and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710942714.8A CN109652397B (en) | 2017-10-11 | 2017-10-11 | Recombinant acetylated lysine arginine N-terminal protease and preparation method and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109652397A CN109652397A (en) | 2019-04-19 |
CN109652397B true CN109652397B (en) | 2022-02-25 |
Family
ID=66109133
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710942714.8A Active CN109652397B (en) | 2017-10-11 | 2017-10-11 | Recombinant acetylated lysine arginine N-terminal protease and preparation method and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109652397B (en) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2020113552A1 (en) * | 2018-12-07 | 2020-06-11 | 北京蛋白质组研究中心 | Recombinant acetylated lysine arginine n-terminal protease, preparation method therefor, and application thereof |
CN112285265B (en) * | 2019-07-24 | 2022-02-01 | 中国科学院大连化学物理研究所 | Novel method for protein methylation modification reverse enrichment based on mirror image enzyme orthogonality principle and application |
CN111574591B (en) * | 2020-04-29 | 2021-09-21 | 西安交通大学医学院第一附属医院 | Polypeptide and synthetic method thereof |
CN112489721B (en) * | 2020-11-25 | 2021-11-12 | 清华大学 | Mirror image protein information storage and coding technology |
CN115947815A (en) * | 2022-10-10 | 2023-04-11 | 天津科技大学 | Recombinant antibacterial peptide PMAP-37 and preparation method and application thereof |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104694522A (en) * | 2015-02-16 | 2015-06-10 | 中国人民解放军军事医学科学院放射与辐射医学研究所 | Preparation method and application of recombinant acetylation cationoid trypsin |
-
2017
- 2017-10-11 CN CN201710942714.8A patent/CN109652397B/en active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104694522A (en) * | 2015-02-16 | 2015-06-10 | 中国人民解放军军事医学科学院放射与辐射医学研究所 | Preparation method and application of recombinant acetylation cationoid trypsin |
Non-Patent Citations (3)
Title |
---|
LysargiNase mirrors trypsin for protein C-terminal and methylation-site identification;Huesgen P.F.等;《Nature Methods》;20141124;第12卷(第1期);55-62 * |
Molecular analysis of ulilysin, the structural prototype of a new family of metzincin metalloproteases;Tallant C.等;《The Journal of Biological Chemistry》;20060630;第281卷(第26期);17920-17928 * |
Recombinant acetylated trypsin demonstrates superior stability and higher activity than commercial products in quantitative proteomics studies;Feilin Wu 等;《Rapid Commun.Mass Spectrom》;20160430;第30卷(第8期);1059-1066 * |
Also Published As
Publication number | Publication date |
---|---|
CN109652397A (en) | 2019-04-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109652397B (en) | Recombinant acetylated lysine arginine N-terminal protease and preparation method and application thereof | |
EP3233884B1 (en) | Microbial transglutaminases, substrates therefor and methods for the use thereof | |
CN113286810A (en) | Protein purification method | |
CN108796041B (en) | Signal amplification system based on bioluminescence resonance energy transfer and detection method thereof | |
WO2020125707A1 (en) | Mtu δi-cm intein variant and the use thereof | |
CN109486800B (en) | Novel lysyl endopeptidase and preparation method thereof | |
CN109439643B (en) | Novel lysine specific endonuclease and preparation method thereof | |
Strydom et al. | An Angiogenic Protein from Bovine Serum and Milk—Purification and Primary Structure of Angiogenin‐2 | |
Kotik et al. | Purification and characterisation of a novel enantioselective epoxide hydrolase from Aspergillus niger M200 | |
KR101527528B1 (en) | Method for production, extraction and purification of soluble recombinant protein | |
Li et al. | Single-step affinity and cost-effective purification of recombinant proteins using the Sepharose-binding lectin-tag from the mushroom Laetiporus sulphureus as fusion partner | |
CN114839282B (en) | Method for quantitatively analyzing impurities in polypeptide drug sample based on enzymolysis method | |
WO2020113552A1 (en) | Recombinant acetylated lysine arginine n-terminal protease, preparation method therefor, and application thereof | |
Zhang et al. | Recombinant expression, purification and characterization of acetylated LysargiNase from Escherichia coli with high activity and stability | |
CN111705050B (en) | Preparation method and application of novel halophilic archaea extracellular protease | |
CN109880840B (en) | In vivo biotinylation labeling system for recombinant protein escherichia coli | |
Feeney et al. | Novel protein purification system utilizing an N-terminal fusion protein and a caspase-3 cleavable linker | |
US10150803B2 (en) | Method of preparing glucagon-like peptide-2 (GLP-2) analog | |
US7604980B2 (en) | Protein expression system | |
CN112961848B (en) | Novel aminopeptidase and soluble expression method thereof | |
CN114446392B (en) | Method for confirming key essential amino acid residue site of protein combined with aptamer | |
CN114574466B (en) | Chimeric capping enzyme and preparation method and application thereof | |
WO2015126077A1 (en) | Method for production, purification and analysis of stable isotope-labeled protein | |
CN107286223B (en) | Recombinant protein for detecting histone site acetylation and application thereof | |
CN116410961A (en) | Tobacco etching virus protease fusion protein and preparation method and application thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20240204 Address after: 100850 No. 27 Taiping Road, Beijing, Haidian District Patentee after: ACADEMY OF MILITARY MEDICAL SCIENCES Country or region after: China Address before: Building 1, No.33, kekeyuan Road, Changping District, Beijing Patentee before: BEIJING PROTEOME RESEARCH CENTER Country or region before: China |