WO2022094146A1 - Methods of converting precursor proteins to mature proteins using kex2 proteases - Google Patents
Methods of converting precursor proteins to mature proteins using kex2 proteases Download PDFInfo
- Publication number
- WO2022094146A1 WO2022094146A1 PCT/US2021/057142 US2021057142W WO2022094146A1 WO 2022094146 A1 WO2022094146 A1 WO 2022094146A1 US 2021057142 W US2021057142 W US 2021057142W WO 2022094146 A1 WO2022094146 A1 WO 2022094146A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- seq
- protein
- insulin
- fusion
- fusion protein
- Prior art date
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 157
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 157
- 238000000034 method Methods 0.000 title claims abstract description 70
- 239000002243 precursor Substances 0.000 title claims abstract description 59
- 101001007681 Candida albicans (strain WO-1) Kexin Proteins 0.000 title claims abstract description 52
- 108020001507 fusion proteins Proteins 0.000 claims abstract description 67
- 102000037865 fusion proteins Human genes 0.000 claims abstract description 67
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 25
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 25
- 239000002157 polynucleotide Substances 0.000 claims abstract description 25
- 239000013598 vector Substances 0.000 claims abstract description 16
- 229940088597 hormone Drugs 0.000 claims abstract description 13
- 239000005556 hormone Substances 0.000 claims abstract description 12
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 claims description 89
- 108090001061 Insulin Proteins 0.000 claims description 76
- 102000004877 Insulin Human genes 0.000 claims description 75
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 51
- 229940125396 insulin Drugs 0.000 claims description 40
- 229960002869 insulin glargine Drugs 0.000 claims description 40
- 108010057186 Insulin Glargine Proteins 0.000 claims description 35
- 230000004927 fusion Effects 0.000 claims description 35
- COCFEDIXXNGUNL-RFKWWTKHSA-N Insulin glargine Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@H]1CSSC[C@H]2C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@H](C(N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=3C=CC(O)=CC=3)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=3C=CC(O)=CC=3)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=3C=CC(O)=CC=3)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=3NC=NC=3)NC(=O)[C@H](CO)NC(=O)CNC1=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C(=O)NCC(O)=O)=O)CSSC[C@@H](C(N2)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](NC(=O)CN)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C=CC=CC=1)C(C)C)C1=CN=CN1 COCFEDIXXNGUNL-RFKWWTKHSA-N 0.000 claims description 33
- 238000003776 cleavage reaction Methods 0.000 claims description 33
- 230000007017 scission Effects 0.000 claims description 32
- 101150045458 KEX2 gene Proteins 0.000 claims description 16
- 150000001413 amino acids Chemical group 0.000 claims description 16
- 108010076181 Proinsulin Proteins 0.000 claims description 14
- 125000003729 nucleotide group Chemical group 0.000 claims description 14
- 239000002773 nucleotide Substances 0.000 claims description 12
- 238000000338 in vitro Methods 0.000 claims description 8
- 239000004026 insulin derivative Substances 0.000 claims description 6
- 238000010367 cloning Methods 0.000 claims description 4
- 102100029054 Homeobox protein notochord Human genes 0.000 claims 1
- 101000634521 Homo sapiens Homeobox protein notochord Proteins 0.000 claims 1
- 235000018102 proteins Nutrition 0.000 description 98
- 125000003275 alpha amino acid group Chemical group 0.000 description 38
- 102000004196 processed proteins & peptides Human genes 0.000 description 36
- 229920001184 polypeptide Polymers 0.000 description 33
- 210000004027 cell Anatomy 0.000 description 30
- 125000000539 amino acid group Chemical group 0.000 description 25
- 235000001014 amino acid Nutrition 0.000 description 14
- 102000039446 nucleic acids Human genes 0.000 description 13
- 108020004707 nucleic acids Proteins 0.000 description 13
- 150000007523 nucleic acids Chemical class 0.000 description 13
- 108020004705 Codon Proteins 0.000 description 10
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 10
- 239000011347 resin Substances 0.000 description 10
- 229920005989 resin Polymers 0.000 description 10
- 238000006243 chemical reaction Methods 0.000 description 9
- 230000014509 gene expression Effects 0.000 description 9
- 210000003000 inclusion body Anatomy 0.000 description 9
- JEPNOJXFZHBCTB-ZCUALFGZSA-N insulin c chain Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)NCC(=O)NCC(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N1[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CCC1 JEPNOJXFZHBCTB-ZCUALFGZSA-N 0.000 description 9
- 230000004048 modification Effects 0.000 description 9
- 238000012986 modification Methods 0.000 description 9
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 7
- 238000006467 substitution reaction Methods 0.000 description 7
- 108020004414 DNA Proteins 0.000 description 6
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- -1 linker amino acids Chemical class 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 239000007983 Tris buffer Substances 0.000 description 5
- 108090000631 Trypsin Proteins 0.000 description 5
- 102000004142 Trypsin Human genes 0.000 description 5
- 238000007792 addition Methods 0.000 description 5
- 239000000872 buffer Substances 0.000 description 5
- 239000006227 byproduct Substances 0.000 description 5
- 239000012634 fragment Substances 0.000 description 5
- 108010066381 preproinsulin Proteins 0.000 description 5
- 238000000746 purification Methods 0.000 description 5
- 239000000243 solution Substances 0.000 description 5
- 239000012588 trypsin Substances 0.000 description 5
- 239000004471 Glycine Substances 0.000 description 4
- 102100039620 Granulocyte-macrophage colony-stimulating factor Human genes 0.000 description 4
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 4
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 4
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 4
- 239000003153 chemical reaction reagent Substances 0.000 description 4
- 230000013595 glycosylation Effects 0.000 description 4
- 238000006206 glycosylation reaction Methods 0.000 description 4
- 229930027917 kanamycin Natural products 0.000 description 4
- 229960000318 kanamycin Drugs 0.000 description 4
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 4
- 229930182823 kanamycin A Natural products 0.000 description 4
- 239000008188 pellet Substances 0.000 description 4
- 230000026731 phosphorylation Effects 0.000 description 4
- 238000006366 phosphorylation reaction Methods 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 125000006239 protecting group Chemical group 0.000 description 4
- 239000013603 viral vector Substances 0.000 description 4
- 108010039209 Blood Coagulation Factors Proteins 0.000 description 3
- 102000015081 Blood Coagulation Factors Human genes 0.000 description 3
- 108010051696 Growth Hormone Proteins 0.000 description 3
- 102000018997 Growth Hormone Human genes 0.000 description 3
- 241000282412 Homo Species 0.000 description 3
- 101000976075 Homo sapiens Insulin Proteins 0.000 description 3
- 101500021084 Locusta migratoria 5 kDa peptide Proteins 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 239000003114 blood coagulation factor Substances 0.000 description 3
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 3
- 230000009089 cytolysis Effects 0.000 description 3
- 239000000122 growth hormone Substances 0.000 description 3
- 238000004128 high performance liquid chromatography Methods 0.000 description 3
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 3
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 3
- 238000011534 incubation Methods 0.000 description 3
- PBGKTOXHQIOBKM-FHFVDXKLSA-N insulin (human) Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@H]1CSSC[C@H]2C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@H](C(N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=3C=CC(O)=CC=3)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=3C=CC(O)=CC=3)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=3C=CC(O)=CC=3)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=3NC=NC=3)NC(=O)[C@H](CO)NC(=O)CNC1=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O)=O)CSSC[C@@H](C(N2)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](NC(=O)CN)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C=CC=CC=1)C(C)C)C1=CN=CN1 PBGKTOXHQIOBKM-FHFVDXKLSA-N 0.000 description 3
- 239000000178 monomer Substances 0.000 description 3
- 239000013612 plasmid Substances 0.000 description 3
- 229920000642 polymer Polymers 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- 108010064733 Angiotensins Proteins 0.000 description 2
- 102000015427 Angiotensins Human genes 0.000 description 2
- 102000055006 Calcitonin Human genes 0.000 description 2
- 108060001064 Calcitonin Proteins 0.000 description 2
- 239000000055 Corticotropin-Releasing Hormone Substances 0.000 description 2
- 102000012289 Corticotropin-Releasing Hormone Human genes 0.000 description 2
- 108010022152 Corticotropin-Releasing Hormone Proteins 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 102000004269 Granulocyte Colony-Stimulating Factor Human genes 0.000 description 2
- 108010017080 Granulocyte Colony-Stimulating Factor Proteins 0.000 description 2
- 108010017213 Granulocyte-Macrophage Colony-Stimulating Factor Proteins 0.000 description 2
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 2
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- 102000014150 Interferons Human genes 0.000 description 2
- 108010050904 Interferons Proteins 0.000 description 2
- UBQYURCVBFRUQT-UHFFFAOYSA-N N-benzoyl-Ferrioxamine B Chemical compound CC(=O)N(O)CCCCCNC(=O)CCC(=O)N(O)CCCCCNC(=O)CCC(=O)N(O)CCCCCN UBQYURCVBFRUQT-UHFFFAOYSA-N 0.000 description 2
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 2
- 102000002808 Pituitary adenylate cyclase-activating polypeptide Human genes 0.000 description 2
- 108010004684 Pituitary adenylate cyclase-activating polypeptide Proteins 0.000 description 2
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 2
- 108010056088 Somatostatin Proteins 0.000 description 2
- 102000005157 Somatostatin Human genes 0.000 description 2
- 101000857870 Squalus acanthias Gonadoliberin Proteins 0.000 description 2
- 108010003205 Vasoactive Intestinal Peptide Proteins 0.000 description 2
- 102400000015 Vasoactive intestinal peptide Human genes 0.000 description 2
- DZBUGLKDJFMEHC-UHFFFAOYSA-N acridine Chemical compound C1=CC=CC2=CC3=CC=CC=C3N=C21 DZBUGLKDJFMEHC-UHFFFAOYSA-N 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 239000000427 antigen Substances 0.000 description 2
- 108091007433 antigens Proteins 0.000 description 2
- 102000036639 antigens Human genes 0.000 description 2
- KBZOIRJILGZLEJ-LGYYRGKSSA-N argipressin Chemical compound C([C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CSSC[C@@H](C(N[C@@H](CC=2C=CC(O)=CC=2)C(=O)N1)=O)N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(N)=O)C1=CC=CC=C1 KBZOIRJILGZLEJ-LGYYRGKSSA-N 0.000 description 2
- 229960004015 calcitonin Drugs 0.000 description 2
- BBBFJLBPOGFECG-VJVYQDLKSA-N calcitonin Chemical compound N([C@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N1[C@@H](CCC1)C(N)=O)C(C)C)C(=O)[C@@H]1CSSC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1 BBBFJLBPOGFECG-VJVYQDLKSA-N 0.000 description 2
- 239000004202 carbamide Substances 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 229940041967 corticotropin-releasing hormone Drugs 0.000 description 2
- KLVRDXBAMSPYKH-RKYZNNDCSA-N corticotropin-releasing hormone (human) Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(N)=O)[C@@H](C)CC)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](C)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1N=CNC=1)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1N(CCC1)C(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO)[C@@H](C)CC)C(C)C)C(C)C)C1=CNC=N1 KLVRDXBAMSPYKH-RKYZNNDCSA-N 0.000 description 2
- 229960000958 deferoxamine Drugs 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 229940088598 enzyme Drugs 0.000 description 2
- 229940105423 erythropoietin Drugs 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 125000000524 functional group Chemical group 0.000 description 2
- BTCSSZJGUNDROE-UHFFFAOYSA-N gamma-aminobutyric acid Chemical compound NCCCC(O)=O BTCSSZJGUNDROE-UHFFFAOYSA-N 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 239000005090 green fluorescent protein Substances 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 229940047124 interferons Drugs 0.000 description 2
- VBUWHHLIZKOSMS-RIWXPGAOSA-N invicorp Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC=1NC=NC=1)C(C)C)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=C(O)C=C1 VBUWHHLIZKOSMS-RIWXPGAOSA-N 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 229940060975 lantus Drugs 0.000 description 2
- 239000003446 ligand Substances 0.000 description 2
- 210000004962 mammalian cell Anatomy 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 2
- 230000011987 methylation Effects 0.000 description 2
- 238000007069 methylation reaction Methods 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- OXCMYAYHXIHQOA-UHFFFAOYSA-N potassium;[2-butyl-5-chloro-3-[[4-[2-(1,2,4-triaza-3-azanidacyclopenta-1,4-dien-5-yl)phenyl]phenyl]methyl]imidazol-4-yl]methanol Chemical compound [K+].CCCCC1=NC(Cl)=C(CO)N1CC1=CC=C(C=2C(=CC=CC=2)C2=N[N-]N=N2)C=C1 OXCMYAYHXIHQOA-UHFFFAOYSA-N 0.000 description 2
- 239000000047 product Substances 0.000 description 2
- 239000012460 protein solution Substances 0.000 description 2
- XNSAINXGIQZQOO-SRVKXCTJSA-N protirelin Chemical compound NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H]1NC(=O)CC1)CC1=CN=CN1 XNSAINXGIQZQOO-SRVKXCTJSA-N 0.000 description 2
- ZCCUUQDIBDJBTK-UHFFFAOYSA-N psoralen Chemical compound C1=C2OC(=O)C=CC2=CC2=C1OC=C2 ZCCUUQDIBDJBTK-UHFFFAOYSA-N 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- FSYKKLYZXJSNPZ-UHFFFAOYSA-N sarcosine Chemical compound C[NH2+]CC([O-])=O FSYKKLYZXJSNPZ-UHFFFAOYSA-N 0.000 description 2
- NHXLMOGPVYXJNR-ATOGVRKGSA-N somatostatin Chemical compound C([C@H]1C(=O)N[C@H](C(N[C@@H](CO)C(=O)N[C@@H](CSSC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@@H](CC=2C3=CC=CC=C3NC=2)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N1)[C@@H](C)O)NC(=O)CNC(=O)[C@H](C)N)C(O)=O)=O)[C@H](O)C)C1=CC=CC=C1 NHXLMOGPVYXJNR-ATOGVRKGSA-N 0.000 description 2
- 229960000553 somatostatin Drugs 0.000 description 2
- 229940034199 thyrotropin-releasing hormone Drugs 0.000 description 2
- 238000011282 treatment Methods 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- 210000005253 yeast cell Anatomy 0.000 description 2
- DDYAPMZTJAYBOF-ZMYDTDHYSA-N (3S)-4-[[(2S)-1-[[(2S)-1-[[(2S)-5-amino-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-4-amino-1-[[(2S,3R)-1-[[(2S)-6-amino-1-[[(2S)-1-[[(2S)-4-amino-1-[[(2S)-1-[[(2S)-4-amino-1-[[(2S)-4-amino-1-[[(2S,3S)-1-[[(1S)-1-carboxyethyl]amino]-3-methyl-1-oxopentan-2-yl]amino]-1,4-dioxobutan-2-yl]amino]-1,4-dioxobutan-2-yl]amino]-5-carbamimidamido-1-oxopentan-2-yl]amino]-1,4-dioxobutan-2-yl]amino]-5-carbamimidamido-1-oxopentan-2-yl]amino]-1-oxohexan-2-yl]amino]-3-hydroxy-1-oxobutan-2-yl]amino]-1,4-dioxobutan-2-yl]amino]-4-methylsulfanyl-1-oxobutan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-3-(1H-indol-3-yl)-1-oxopropan-2-yl]amino]-1,5-dioxopentan-2-yl]amino]-3-methyl-1-oxobutan-2-yl]amino]-1-oxo-3-phenylpropan-2-yl]amino]-3-[[(2S)-5-amino-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-6-amino-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S,3R)-2-[[(2S)-2-[[(2S,3R)-2-[[2-[[(2S)-5-amino-2-[[(2S)-2-[[(2S)-2-amino-3-(1H-imidazol-4-yl)propanoyl]amino]-3-hydroxypropanoyl]amino]-5-oxopentanoyl]amino]acetyl]amino]-3-hydroxybutanoyl]amino]-3-phenylpropanoyl]amino]-3-hydroxybutanoyl]amino]-3-hydroxypropanoyl]amino]-3-carboxypropanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-hydroxypropanoyl]amino]hexanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-4-methylpentanoyl]amino]-3-carboxypropanoyl]amino]-3-hydroxypropanoyl]amino]-5-carbamimidamidopentanoyl]amino]-5-carbamimidamidopentanoyl]amino]propanoyl]amino]-5-oxopentanoyl]amino]-4-oxobutanoic acid Chemical class [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DDYAPMZTJAYBOF-ZMYDTDHYSA-N 0.000 description 1
- UKAUYVFTDYCKQA-UHFFFAOYSA-N -2-Amino-4-hydroxybutanoic acid Natural products OC(=O)C(N)CCO UKAUYVFTDYCKQA-UHFFFAOYSA-N 0.000 description 1
- VXGRJERITKFWPL-UHFFFAOYSA-N 4',5'-Dihydropsoralen Natural products C1=C2OC(=O)C=CC2=CC2=C1OCC2 VXGRJERITKFWPL-UHFFFAOYSA-N 0.000 description 1
- 101710169336 5'-deoxyadenosine deaminase Proteins 0.000 description 1
- SLXKOJJOQWFEFD-UHFFFAOYSA-N 6-aminohexanoic acid Chemical compound NCCCCCC(O)=O SLXKOJJOQWFEFD-UHFFFAOYSA-N 0.000 description 1
- 102100036664 Adenosine deaminase Human genes 0.000 description 1
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 1
- 102000013455 Amyloid beta-Peptides Human genes 0.000 description 1
- 108010090849 Amyloid beta-Peptides Proteins 0.000 description 1
- 102400000068 Angiostatin Human genes 0.000 description 1
- 108010079709 Angiostatins Proteins 0.000 description 1
- 102400000059 Arg-vasopressin Human genes 0.000 description 1
- 101800001144 Arg-vasopressin Proteins 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 101800004538 Bradykinin Proteins 0.000 description 1
- 102400000967 Bradykinin Human genes 0.000 description 1
- QCMYYKRYFNMIEC-UHFFFAOYSA-N COP(O)=O Chemical class COP(O)=O QCMYYKRYFNMIEC-UHFFFAOYSA-N 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 1
- 101800000414 Corticotropin Proteins 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- 150000008574 D-amino acids Chemical class 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 102400001047 Endostatin Human genes 0.000 description 1
- 108010079505 Endostatins Proteins 0.000 description 1
- 102000002045 Endothelin Human genes 0.000 description 1
- 108050009340 Endothelin Proteins 0.000 description 1
- 102400001368 Epidermal growth factor Human genes 0.000 description 1
- 101800003838 Epidermal growth factor Proteins 0.000 description 1
- 102000003951 Erythropoietin Human genes 0.000 description 1
- 102100031939 Erythropoietin Human genes 0.000 description 1
- 108090000394 Erythropoietin Proteins 0.000 description 1
- 102000003974 Fibroblast growth factor 2 Human genes 0.000 description 1
- 108090000379 Fibroblast growth factor 2 Proteins 0.000 description 1
- 108010088406 Glucagon-Like Peptides Proteins 0.000 description 1
- 108010017544 Glucosylceramidase Proteins 0.000 description 1
- 102000004547 Glucosylceramidase Human genes 0.000 description 1
- 108010053070 Glutathione Disulfide Proteins 0.000 description 1
- NMJREATYWWNIKX-UHFFFAOYSA-N GnRH Chemical compound C1CCC(C(=O)NCC(N)=O)N1C(=O)C(CC(C)C)NC(=O)C(CC=1C2=CC=CC=C2NC=1)NC(=O)CNC(=O)C(NC(=O)C(CO)NC(=O)C(CC=1C2=CC=CC=C2NC=1)NC(=O)C(CC=1NC=NC=1)NC(=O)C1NC(=O)CC1)CC1=CC=C(O)C=C1 NMJREATYWWNIKX-UHFFFAOYSA-N 0.000 description 1
- 239000000579 Gonadotropin-Releasing Hormone Substances 0.000 description 1
- QXZGBUJJYSLZLT-UHFFFAOYSA-N H-Arg-Pro-Pro-Gly-Phe-Ser-Pro-Phe-Arg-OH Natural products NC(N)=NCCCC(N)C(=O)N1CCCC1C(=O)N1C(C(=O)NCC(=O)NC(CC=2C=CC=CC=2)C(=O)NC(CO)C(=O)N2C(CCC2)C(=O)NC(CC=2C=CC=CC=2)C(=O)NC(CCCN=C(N)N)C(O)=O)CCC1 QXZGBUJJYSLZLT-UHFFFAOYSA-N 0.000 description 1
- HTTJABKRGRZYRN-UHFFFAOYSA-N Heparin Chemical compound OC1C(NC(=O)C)C(O)OC(COS(O)(=O)=O)C1OC1C(OS(O)(=O)=O)C(O)C(OC2C(C(OS(O)(=O)=O)C(OC3C(C(O)C(O)C(O3)C(O)=O)OS(O)(=O)=O)C(CO)O2)NS(O)(=O)=O)C(C(O)=O)O1 HTTJABKRGRZYRN-UHFFFAOYSA-N 0.000 description 1
- 102000007625 Hirudins Human genes 0.000 description 1
- 108010007267 Hirudins Proteins 0.000 description 1
- 108091006054 His-tagged proteins Proteins 0.000 description 1
- 101000904173 Homo sapiens Progonadoliberin-1 Proteins 0.000 description 1
- 108010000521 Human Growth Hormone Proteins 0.000 description 1
- 102000000589 Interleukin-1 Human genes 0.000 description 1
- 108010002352 Interleukin-1 Proteins 0.000 description 1
- 102000000588 Interleukin-2 Human genes 0.000 description 1
- 108010002350 Interleukin-2 Proteins 0.000 description 1
- 102000015696 Interleukins Human genes 0.000 description 1
- 108010063738 Interleukins Proteins 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- SNDPXSYFESPGGJ-BYPYZUCNSA-N L-2-aminopentanoic acid Chemical compound CCC[C@H](N)C(O)=O SNDPXSYFESPGGJ-BYPYZUCNSA-N 0.000 description 1
- AHLPHDHHMVZTML-BYPYZUCNSA-N L-Ornithine Chemical compound NCCC[C@H](N)C(O)=O AHLPHDHHMVZTML-BYPYZUCNSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- RHGKLRLOHDJJDR-BYPYZUCNSA-N L-citrulline Chemical compound NC(=O)NCCC[C@H]([NH3+])C([O-])=O RHGKLRLOHDJJDR-BYPYZUCNSA-N 0.000 description 1
- UKAUYVFTDYCKQA-VKHMYHEASA-N L-homoserine Chemical compound OC(=O)[C@@H](N)CCO UKAUYVFTDYCKQA-VKHMYHEASA-N 0.000 description 1
- SNDPXSYFESPGGJ-UHFFFAOYSA-N L-norVal-OH Natural products CCCC(N)C(O)=O SNDPXSYFESPGGJ-UHFFFAOYSA-N 0.000 description 1
- LRQKBLKVPFOOQJ-YFKPBYRVSA-N L-norleucine Chemical compound CCCC[C@H]([NH3+])C([O-])=O LRQKBLKVPFOOQJ-YFKPBYRVSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- 239000006142 Luria-Bertani Agar Substances 0.000 description 1
- 239000012901 Milli-Q water Substances 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- RHGKLRLOHDJJDR-UHFFFAOYSA-N Ndelta-carbamoyl-DL-ornithine Natural products OC(=O)C(N)CCCNC(N)=O RHGKLRLOHDJJDR-UHFFFAOYSA-N 0.000 description 1
- 108010025020 Nerve Growth Factor Proteins 0.000 description 1
- 102000007072 Nerve Growth Factors Human genes 0.000 description 1
- 108090000189 Neuropeptides Proteins 0.000 description 1
- VEQPNABPJHWNSG-UHFFFAOYSA-N Nickel(2+) Chemical compound [Ni+2] VEQPNABPJHWNSG-UHFFFAOYSA-N 0.000 description 1
- 241000208125 Nicotiana Species 0.000 description 1
- AHLPHDHHMVZTML-UHFFFAOYSA-N Orn-delta-NH2 Natural products NCCCC(N)C(O)=O AHLPHDHHMVZTML-UHFFFAOYSA-N 0.000 description 1
- UTJLXEIPEHZYQJ-UHFFFAOYSA-N Ornithine Natural products OC(=O)C(C)CCCN UTJLXEIPEHZYQJ-UHFFFAOYSA-N 0.000 description 1
- 102400000050 Oxytocin Human genes 0.000 description 1
- 101800000989 Oxytocin Proteins 0.000 description 1
- XNOPRXBHLZRZKH-UHFFFAOYSA-N Oxytocin Natural products N1C(=O)C(N)CSSCC(C(=O)N2C(CCC2)C(=O)NC(CC(C)C)C(=O)NCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(CCC(N)=O)NC(=O)C(C(C)CC)NC(=O)C1CC1=CC=C(O)C=C1 XNOPRXBHLZRZKH-UHFFFAOYSA-N 0.000 description 1
- 102000003982 Parathyroid hormone Human genes 0.000 description 1
- 108090000445 Parathyroid hormone Proteins 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 108010051456 Plasminogen Proteins 0.000 description 1
- 102000013566 Plasminogen Human genes 0.000 description 1
- 102100027467 Pro-opiomelanocortin Human genes 0.000 description 1
- 102100024028 Progonadoliberin-1 Human genes 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 108010077895 Sarcosine Proteins 0.000 description 1
- 241000256251 Spodoptera frugiperda Species 0.000 description 1
- 101000996723 Sus scrofa Gonadotropin-releasing hormone receptor Proteins 0.000 description 1
- 108700005078 Synthetic Genes Proteins 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-N Thiophosphoric acid Chemical class OP(O)(S)=O RYYWUUFWQRZTIU-UHFFFAOYSA-N 0.000 description 1
- 108090000190 Thrombin Proteins 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 102000002070 Transferrins Human genes 0.000 description 1
- 108010015865 Transferrins Proteins 0.000 description 1
- 102000009618 Transforming Growth Factors Human genes 0.000 description 1
- 108010009583 Transforming Growth Factors Proteins 0.000 description 1
- 206010067584 Type 1 diabetes mellitus Diseases 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 108010059993 Vancomycin Proteins 0.000 description 1
- 108010019530 Vascular Endothelial Growth Factors Proteins 0.000 description 1
- 102000005789 Vascular Endothelial Growth Factors Human genes 0.000 description 1
- GXBMIBRIOWHPDT-UHFFFAOYSA-N Vasopressin Natural products N1C(=O)C(CC=2C=C(O)C=CC=2)NC(=O)C(N)CSSCC(C(=O)N2C(CCC2)C(=O)NC(CCCN=C(N)N)C(=O)NCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(CCC(N)=O)NC(=O)C1CC1=CC=CC=C1 GXBMIBRIOWHPDT-UHFFFAOYSA-N 0.000 description 1
- 108010004977 Vasopressins Proteins 0.000 description 1
- 102000002852 Vasopressins Human genes 0.000 description 1
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- 230000010933 acylation Effects 0.000 description 1
- 238000005917 acylation reaction Methods 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 239000000556 agonist Substances 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 229960003767 alanine Drugs 0.000 description 1
- 239000002168 alkylating agent Substances 0.000 description 1
- 230000009435 amidation Effects 0.000 description 1
- 238000007112 amidation reaction Methods 0.000 description 1
- 229960002684 aminocaproic acid Drugs 0.000 description 1
- 230000001772 anti-angiogenic effect Effects 0.000 description 1
- 230000002785 anti-thrombosis Effects 0.000 description 1
- 230000000840 anti-viral effect Effects 0.000 description 1
- 239000003146 anticoagulant agent Substances 0.000 description 1
- 229960003272 asparaginase Drugs 0.000 description 1
- FZCSTZYAHCUGEM-UHFFFAOYSA-N aspergillomarasmine B Natural products OC(=O)CNC(C(O)=O)CNC(C(O)=O)CC(O)=O FZCSTZYAHCUGEM-UHFFFAOYSA-N 0.000 description 1
- 230000001746 atrial effect Effects 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000008033 biological extinction Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 229940019700 blood coagulation factors Drugs 0.000 description 1
- 108010006025 bovine growth hormone Proteins 0.000 description 1
- QXZGBUJJYSLZLT-FDISYFBBSA-N bradykinin Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(=O)NCC(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@@H](CO)C(=O)N2[C@@H](CCC2)C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CCC1 QXZGBUJJYSLZLT-FDISYFBBSA-N 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 125000002680 canonical nucleotide group Chemical group 0.000 description 1
- 150000004657 carbamic acid derivatives Chemical class 0.000 description 1
- 239000006143 cell culture medium Substances 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 239000002738 chelating agent Substances 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 229960002173 citrulline Drugs 0.000 description 1
- 235000013477 citrulline Nutrition 0.000 description 1
- 238000012411 cloning technique Methods 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 229910017052 cobalt Inorganic materials 0.000 description 1
- 239000010941 cobalt Substances 0.000 description 1
- GUTLYIVDDKVIGB-UHFFFAOYSA-N cobalt atom Chemical compound [Co] GUTLYIVDDKVIGB-UHFFFAOYSA-N 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 229910052802 copper Inorganic materials 0.000 description 1
- 239000010949 copper Substances 0.000 description 1
- IDLFZVILOHSSID-OVLDLUHVSA-N corticotropin Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](C(C)C)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)NC(=O)[C@@H](N)CO)C1=CC=C(O)C=C1 IDLFZVILOHSSID-OVLDLUHVSA-N 0.000 description 1
- 229960000258 corticotropin Drugs 0.000 description 1
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 206010012601 diabetes mellitus Diseases 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 238000003113 dilution method Methods 0.000 description 1
- NAGJZTKCGNOGPW-UHFFFAOYSA-N dithiophosphoric acid Chemical class OP(O)(S)=S NAGJZTKCGNOGPW-UHFFFAOYSA-N 0.000 description 1
- 238000007876 drug discovery Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 239000002532 enzyme inhibitor Substances 0.000 description 1
- 229940116977 epidermal growth factor Drugs 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 229960003692 gamma aminobutyric acid Drugs 0.000 description 1
- 238000012248 genetic selection Methods 0.000 description 1
- YPZRWBKMTBYPTK-BJDJZHNGSA-N glutathione disulfide Chemical compound OC(=O)[C@@H](N)CCC(=O)N[C@H](C(=O)NCC(O)=O)CSSC[C@@H](C(=O)NCC(O)=O)NC(=O)CC[C@H](N)C(O)=O YPZRWBKMTBYPTK-BJDJZHNGSA-N 0.000 description 1
- XLXSAKCOAKORKW-AQJXLSMYSA-N gonadorelin Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N1[C@@H](CCC1)C(=O)NCC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC=1N=CNC=1)NC(=O)[C@H]1NC(=O)CC1)C1=CC=C(O)C=C1 XLXSAKCOAKORKW-AQJXLSMYSA-N 0.000 description 1
- XLXSAKCOAKORKW-UHFFFAOYSA-N gonadorelin Chemical compound C1CCC(C(=O)NCC(N)=O)N1C(=O)C(CCCN=C(N)N)NC(=O)C(CC(C)C)NC(=O)CNC(=O)C(NC(=O)C(CO)NC(=O)C(CC=1C2=CC=CC=C2NC=1)NC(=O)C(CC=1NC=NC=1)NC(=O)C1NC(=O)CC1)CC1=CC=C(O)C=C1 XLXSAKCOAKORKW-UHFFFAOYSA-N 0.000 description 1
- 229940035638 gonadotropin-releasing hormone Drugs 0.000 description 1
- 239000003102 growth factor Substances 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 229960002897 heparin Drugs 0.000 description 1
- 229920000669 heparin Polymers 0.000 description 1
- 229940006607 hirudin Drugs 0.000 description 1
- WQPDUTSPKFMPDP-OUMQNGNKSA-N hirudin Chemical compound C([C@@H](C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC(OS(O)(=O)=O)=CC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCCCN)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H]1NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(O)=O)NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]2CSSC[C@@H](C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(=O)N[C@H](C(NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N2)=O)CSSC1)C(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H]1NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CO)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CC=2C=CC(O)=CC=2)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)C(C)C)[C@@H](C)O)CSSC1)C(C)C)[C@@H](C)O)[C@@H](C)O)C1=CC=CC=C1 WQPDUTSPKFMPDP-OUMQNGNKSA-N 0.000 description 1
- 239000003668 hormone analog Substances 0.000 description 1
- 230000002163 immunogen Effects 0.000 description 1
- 230000001771 impaired effect Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 102000006495 integrins Human genes 0.000 description 1
- 108010044426 integrins Proteins 0.000 description 1
- 229940079322 interferon Drugs 0.000 description 1
- 229940047122 interleukins Drugs 0.000 description 1
- 231100000518 lethal Toxicity 0.000 description 1
- 230000001665 lethal effect Effects 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 239000003055 low molecular weight heparin Substances 0.000 description 1
- 229940127215 low-molecular weight heparin Drugs 0.000 description 1
- 239000012139 lysis buffer Substances 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 229910021645 metal ion Inorganic materials 0.000 description 1
- CWWARWOPSKGELM-SARDKLJWSA-N methyl (2s)-2-[[(2s)-2-[[2-[[(2s)-2-[[(2s)-2-[[(2s)-5-amino-2-[[(2s)-5-amino-2-[[(2s)-1-[(2s)-6-amino-2-[[(2s)-1-[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carbonyl]amino]hexanoyl]pyrrolidine-2-carbonyl]amino]-5-oxopentanoyl]amino]-5 Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)OC)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCCCN)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CCCN=C(N)N)C1=CC=CC=C1 CWWARWOPSKGELM-SARDKLJWSA-N 0.000 description 1
- 239000003607 modifier Substances 0.000 description 1
- 108010032806 molgramostim Proteins 0.000 description 1
- 229960003063 molgramostim Drugs 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 239000002833 natriuretic agent Substances 0.000 description 1
- 229910052759 nickel Inorganic materials 0.000 description 1
- 229910001453 nickel ion Inorganic materials 0.000 description 1
- 229960003104 ornithine Drugs 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- YPZRWBKMTBYPTK-UHFFFAOYSA-N oxidized gamma-L-glutamyl-L-cysteinylglycine Natural products OC(=O)C(N)CCC(=O)NC(C(=O)NCC(O)=O)CSSCC(C(=O)NCC(O)=O)NC(=O)CCC(N)C(O)=O YPZRWBKMTBYPTK-UHFFFAOYSA-N 0.000 description 1
- XNOPRXBHLZRZKH-DSZYJQQASA-N oxytocin Chemical compound C([C@H]1C(=O)N[C@H](C(N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CSSC[C@H](N)C(=O)N1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(N)=O)=O)[C@@H](C)CC)C1=CC=C(O)C=C1 XNOPRXBHLZRZKH-DSZYJQQASA-N 0.000 description 1
- 229960001723 oxytocin Drugs 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 239000000199 parathyroid hormone Substances 0.000 description 1
- 229960001319 parathyroid hormone Drugs 0.000 description 1
- 238000000059 patterning Methods 0.000 description 1
- 230000003285 pharmacodynamic effect Effects 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- UFTCZKMBJOPXDM-XXFCQBPRSA-N pituitary adenylate cyclase-activating polypeptide Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CN=CN1 UFTCZKMBJOPXDM-XXFCQBPRSA-N 0.000 description 1
- 108010094020 polyglycine Proteins 0.000 description 1
- 229920000232 polyglycine polymer Polymers 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 238000000751 protein extraction Methods 0.000 description 1
- 230000012743 protein tagging Effects 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 239000003488 releasing hormone Substances 0.000 description 1
- 238000004007 reversed phase HPLC Methods 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 238000005185 salting out Methods 0.000 description 1
- 229940043230 sarcosine Drugs 0.000 description 1
- 108010038379 sargramostim Proteins 0.000 description 1
- 229960002530 sargramostim Drugs 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- DFVFTMTWCUHJBL-BQBZGAKWSA-N statine Chemical compound CC(C)C[C@H](N)[C@@H](O)CC(O)=O DFVFTMTWCUHJBL-BQBZGAKWSA-N 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 229960004072 thrombin Drugs 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 229910052723 transition metal Inorganic materials 0.000 description 1
- 150000003624 transition metals Chemical class 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- GPRLSGONYQIRFK-MNYXATJNSA-N triton Chemical compound [3H+] GPRLSGONYQIRFK-MNYXATJNSA-N 0.000 description 1
- 208000001072 type 2 diabetes mellitus Diseases 0.000 description 1
- 238000005199 ultracentrifugation Methods 0.000 description 1
- VBEQCZHXXJYVRD-GACYYNSASA-N uroanthelone Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C(C)C)[C@@H](C)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)CNC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CS)NC(=O)CNC(=O)[C@H]1N(CCC1)C(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O)C(C)C)[C@@H](C)CC)C1=CC=C(O)C=C1 VBEQCZHXXJYVRD-GACYYNSASA-N 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- MYPYJXKWCTUITO-LYRMYLQWSA-N vancomycin Chemical compound O([C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1OC1=C2C=C3C=C1OC1=CC=C(C=C1Cl)[C@@H](O)[C@H](C(N[C@@H](CC(N)=O)C(=O)N[C@H]3C(=O)N[C@H]1C(=O)N[C@H](C(N[C@@H](C3=CC(O)=CC(O)=C3C=3C(O)=CC=C1C=3)C(O)=O)=O)[C@H](O)C1=CC=C(C(=C1)Cl)O2)=O)NC(=O)[C@@H](CC(C)C)NC)[C@H]1C[C@](C)(N)[C@H](O)[C@H](C)O1 MYPYJXKWCTUITO-LYRMYLQWSA-N 0.000 description 1
- 229960003165 vancomycin Drugs 0.000 description 1
- MYPYJXKWCTUITO-UHFFFAOYSA-N vancomycin Natural products O1C(C(=C2)Cl)=CC=C2C(O)C(C(NC(C2=CC(O)=CC(O)=C2C=2C(O)=CC=C3C=2)C(O)=O)=O)NC(=O)C3NC(=O)C2NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(CC(C)C)NC)C(O)C(C=C3Cl)=CC=C3OC3=CC2=CC1=C3OC1OC(CO)C(O)C(O)C1OC1CC(C)(N)C(O)C(C)O1 MYPYJXKWCTUITO-UHFFFAOYSA-N 0.000 description 1
- 229960003726 vasopressin Drugs 0.000 description 1
- 238000011179 visual inspection Methods 0.000 description 1
- 239000002023 wood Substances 0.000 description 1
- 229910052725 zinc Inorganic materials 0.000 description 1
- 239000011701 zinc Substances 0.000 description 1
- NWONKYPBYAMBJT-UHFFFAOYSA-L zinc sulfate Chemical compound [Zn+2].[O-]S([O-])(=O)=O NWONKYPBYAMBJT-UHFFFAOYSA-L 0.000 description 1
- 229910000368 zinc sulfate Inorganic materials 0.000 description 1
- 239000011686 zinc sulphate Substances 0.000 description 1
- 235000009529 zinc sulphate Nutrition 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/575—Hormones
- C07K14/62—Insulins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/48—Hydrolases (3) acting on peptide bonds (3.4)
- C12N9/50—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25)
- C12N9/64—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from animal tissue
- C12N9/6421—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from animal tissue from mammals
- C12N9/6424—Serine endopeptidases (3.4.21)
- C12N9/6454—Dibasic site splicing serine proteases, e.g. kexin (3.4.21.61); furin (3.4.21.75) and other proprotein convertases
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y304/00—Hydrolases acting on peptide bonds, i.e. peptidases (3.4)
- C12Y304/21—Serine endopeptidases (3.4.21)
- C12Y304/21061—Kexin (3.4.21.61), i.e. proprotein convertase subtilisin/kexin type 9
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/50—Fusion polypeptide containing protease site
Definitions
- Insulin is a protein hormone that enables the regulation of blood sugar in the body. Impaired function or lack of insulin, as observed with patients suffering from type I or type II diabetes, could lead to lethal consequences.
- the insulin molecule is composed of two peptide chains, A and B, held together by several covalent bonds formed by cysteine residues.
- the insulin molecule begins as a prohormone, containing a connecting chain C, that holds the A and B chains together to enable the folding of the protein and the formation of correct disulfide bonds.
- the A and B chains of insulin held by C chain referred to as the insulin prohormone, or proinsulin, requires a conversion step to release the mature insulin from the connecting C chain. This process happens in the pancreas in vivo.
- insulin analogs have been developed.
- One of these analogs is the dibasic insulin glargine, which is sold under the trade name of LANTUS® insulin, among others.
- Two modifications to the wildtype insulin were introduced to make the insulin glargine molecule.
- One modification is the substitution of Asn with Gly at position A21 at the end of the A chain, and the other is the addition of two Arg residues at the end of the B chain.
- the present disclosure generally relates to methods, compositions, and kits that are useful for processing precursor proteins (e.g. , prohormones) to obtain mature proteins (e.g., mature hormones) using KEX2 proteases.
- precursor proteins e.g. , prohormones
- mature proteins e.g., mature hormones
- the disclosure provides, in various embodiments, a method of converting a precursor protein to a mature protein.
- the method comprises providing a precursor protein that is to be converted to a mature protein, wherein the precursor protein comprises at least one cleavage site for a KEX2 protease, and contacting the precursor protein with a KEX2 protease comprising an amino acid sequence having at least 70% sequence identity to SEQ ID NO: 1 under conditions in which cleavage of the precursor protein by the KEX2 protease occurs to provide a mature protein.
- a method of obtaining a target protein comprising providing a precursor protein that comprises a target protein and at least one cleavage site for a KEX2 protease; and contacting the precursor protein with a KEX2 protease comprising an amino acid sequence having at least 70% sequence identity to SEQ ID NO: 1 under conditions in which cleavage of the precursor protein by the KEX2 protease occurs and releases the target protein.
- the method further comprises isolating the target protein.
- the present disclosure further provides a fusion protein comprising a DEEP fusion tag, a target protein, and at least one KEX2 protease cleavage site, as well as polynucleotides, vectors, and host cells encoding and/or expressing the fusion protein.
- a target protein obtained by a method of the disclosure is a target protein obtained by a method of the disclosure.
- the present disclosure further provides, in various embodiments, a kit comprising a polynucleotide that comprises a nucleotide sequence encoding a DEEP fusion tag and a cloning site for introducing a nucleotide sequence encoding a target protein to form a fusion protein.
- the kit also comprises a KEX2 protease that comprises an amino acid sequence having at least 70% sequence identity to SEQ ID NO: 1.
- FIG. 1 depicts an alignment of example DEEP-glargine precursor proteins disclosed herein.
- Box 1 indicates linker amino acids.
- Boxes 2, 4, and 5 indicate KEX2 protease cleavage sites.
- Box 3 indicates glargine B chain polypeptide (SEQ ID NO:24).
- Box 6 indicates glargine A chain polypeptide (SEQ ID NO:23).
- the unbounded regions of SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28 and SEQ ID NO:29, located between boxes 4 and 5, correspond to DEEP fusion tags having amino acid sequences of SEQ ID NO:5, SEQ ID NO:9, SEQ ID NO: 13 and SEQ ID NO: 17, respectively.
- Protein “Protein,” “peptide” and “polypeptide” are used interchangeably herein to denote a polymer of at least two amino acids covalently linked by an amide bond, regardless of length or post-translational modification (e.g., glycosylation or phosphorylation).
- a protein, peptide or polypeptide can comprise any suitable L-and/or D-amino acid, for example, common a-amino acids (e.g., alanine, glycine, valine), non-a-amino acids (e.g., 0-alanine, 4- aminobutyric acid, 6-aminocaproic acid, sarcosine, statine), and unusual amino acids (e.g., citrulline, homocitruline, homoserine, norleucine, norvaline, ornithine).
- the amino, carboxyl and/or other functional groups on a peptide can be free (e.g., unmodified) or protected with a suitable protecting group.
- Suitable protecting groups for amino and carboxyl groups, and methods for adding or removing protecting groups are known in the art and are disclosed in, for example, Green and Wuts, “Protecting Groups in Organic Synthesis, ” John Wiley and Sons, 1991.
- the functional groups of a protein, peptide or polypeptide can also be derivatized (e.g., alkylated) or labeled (e.g., with a detectable label, such as a fluorogen or a hapten) using methods known in the art.
- a protein, peptide or polypeptide can comprise one or more modifications (e.g., amino acid linkers, acylation, acetylation, amidation, methylation, terminal modifiers (e.g., cyclizing modifications), A-methyl-a-amino group substitution), if desired.
- modifications e.g., amino acid linkers, acylation, acetylation, amidation, methylation, terminal modifiers (e.g., cyclizing modifications), A-methyl-a-amino group substitution
- a protein, peptide or polypeptide can be an analog of a known and/or naturally-occurring peptide, for example, a peptide analog having conservative amino acid residue substitution(s).
- nucleic acid is used herein to refer to a polymer comprising multiple nucleotide monomers (e.g., ribonucleotide monomers or deoxyribonucleotide monomers).
- Nucleic acid includes, for example, DNA (e.g., cDNA), RNA, and DNA-RNA hybrid molecules. Nucleic acid molecules can be naturally occurring, recombinant, or synthetic. In addition, nucleic acid molecules can be single-stranded, double-stranded or triple-stranded. In some aspects, nucleic acid molecules can be modified.
- Nucleic acid modifications include, for example, methylation, substitution of one or more of the naturally occurring nucleotides with a nucleotide analog, internucleotide modifications such as uncharged linkages (e.g. , methyl phosphonates, phosphotriesters, phosphoamidates, carbamates, and the like), charged linkages (e.g., phosphorothioates, phosphorodithioates, and the like), pendent moi eties (e.g., polypeptides), intercalators (e.g., acridine, psoralen, and the like), chelators, alkylators, and 301
- nucleic acid can refer to either or both strands of the molecule.
- the present disclosure is based, at least in part, on the discovery that cleavage of a prohormone of an insulin analog by KEX2 protease can facilitate production of the mature form of the insulin analog, with high efficiency and without producing undesired byproducts that result from alternative known methods.
- the present disclosure provides a method of converting a precursor protein into a mature protein by contacting the precursor protein with a KEX2 protease.
- the method comprises providing a precursor protein that is to be converted to a mature protein, wherein the precursor protein comprises at least one cleavage site for a KEX2 protease, and contacting (e.g., incubating in vitro) the precursor protein with a KEX2 protease under conditions in which cleavage of the precursor protein by the KEX2 protease occurs to provide a mature protein.
- the methods of the disclosure are useful for converting a prohormone to a mature hormone using KEX2 protease (e.g. , a yeast KEX2 protease).
- the prohormone is insulin prohormone, also known as proinsulin.
- the prohormone is insulin glargine prohormone, also referred to as proinsulin glargine or glargine proinsulin or proglargine.
- the present disclosure provides a method of obtaining a target protein by contacting a precursor protein with a KEX2 protease.
- the method comprises providing a precursor protein that comprises a target protein and at least one cleavage site for a KEX2 protease, and contacting the precursor protein with a KEX2 protease under conditions in which cleavage of the precursor protein by the KEX2 protease occurs and releases the target protein.
- the method further comprises isolating the target protein.
- the precursor protein is a fusion protein comprising the target protein.
- target protein refers to a peptide or polypeptide whose expression in a host system is desired. Such proteins are also referred to herein as proteins of interest, or POIs. Examples of such proteins that can be included in precursor or fusion proteins of the disclosure are green fluorescent protein (GFP), amyloid beta (AP) polypeptide, Trp Cage protein, LS3 polypeptide, insulin A chain polypeptide, and insulin B chain polypeptide.
- GFP green fluorescent protein
- AP amyloid beta
- Trp Cage protein Trp Cage protein
- LS3 polypeptide insulin A chain polypeptide
- insulin B chain polypeptide insulin B chain polypeptide.
- a further example of a target protein that can be included in precursor or fusion proteins of the disclosure is insulin, or a fragment thereof, such as an 301
- WO 2022/094146 PCT/US2021/057142 insulin A chain polypeptide, an insulin B chain polypeptide or an insulin C chain polypeptide, or a combination thereof.
- Another example of a target protein that can be included in the precursor or fusion proteins of the disclosure is glargine, or a fragment thereof.
- the disclosure provides a target protein obtained by a method disclosed herein.
- Suitable KEX2 proteases for use in the methods disclosed herein include the KEX2 protease from S. cerevisiae, NCBI Reference Sequence: NP 014161.1 (SEQ ID NO: 1), and KEX2 proteases comprising the amino acid sequence of SEQ ID NO: 1, or a variant amino acid sequence thereof (e.g., a variant amino acid sequence having at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or greater amino acid sequence identity to wild type S. cerevisiae KEX2 protease).
- KEX2 proteases can be naturally occurring (e.g., isolated, purified, extracted from a natural source, such as S. cerevisiae or another organism that expresses a KEX2 protease endogenously), recombinant, or synthetic.
- Recombinant yeast KEX2 proteases, including recombinant S. cerevisiae KEX2 protease can be obtained commercially from a variety of sources.
- a KEX2 comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 1.
- sequence identity means that two nucleotide or amino acid sequences, when optimally aligned, such as by the programs GAP or BESTFIT using default gap weights, share at least, e.g., 70% sequence identity, or at least 80% sequence identity, or at least 85% sequence identity, or at least 90% sequence identity, or at least 95% sequence identity or more.
- sequence comparison For sequence comparison, one sequence acts as a reference sequence (e.g., parent sequence) to which one or more test sequences are compared.
- the sequence identity comparison can be examined throughout the entire length of a given protein, or within a desired fragment of a given protein.
- test and reference sequences are input into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated.
- sequence comparison algorithm calculates the percent sequence identity for the test sequence(s) relative to the reference sequence, based on the designated program parameters.
- Optimal alignment of sequences for comparison can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appl. Math. 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J. Mol. Biol. 48:443 (1970), by the 301
- a precursor protein is contacted with a KEX2 protease under conditions suitable for converting the prohormone to a mature hormone.
- Conditions suitable for converting the prohormone to a mature hormone using a KEX2 protease can be readily ascertained by a person of ordinary skill in the art to which the present disclosure pertains, and includes those described herein, as well as those described in product sheets supplied with commercially available KEX2 proteases.
- the conversion is performed in vitro. In other embodiments, the conversion is performed in vivo (e.g., in eukaryotic cells, such as yeast cells, that express a KEX2 protease, either endogenously or exogenously).
- eukaryotic cells such as yeast cells
- precursor protein refers to any protein that can be processed to yield a mature (e.g., active) protein, including preproteins and prohormones. Precursor proteins encompasses both naturally occurring proteins and artificial proteins, such as fusion proteins.
- a precursor protein can be synthetic, semi-synthetic or recombinant.
- the precursor protein includes the sequence of a mature protein.
- the precursor protein is a preprotein.
- the precursor protein is a prohormone.
- the precursor protein is a prohormone of insulin, also referred to as insulin prohormone or proinsulin, or an analog thereof.
- the precursor protein is a prohormone of glargine, also referred to as proinsulin 301
- Precursor proteins can be selected from the following non-limiting example classes of proteins: transcription factors, ligands for cellular receptors, hormones and extracellular binding peptides.
- precursor proteins include enkephlin, LHRH, neuropeptides, glycoincretins, integrin, glucagons and glucagon-like peptides, antithrombotic peptides, cytokines and interleukins, transferrins, interferons, endothelins, natriuretic hormones, extracellular kinase ligands, angiotensin enzyme inhibitors, peptide antiviral compounds, thrombin, substance P, substance G, somatotropin, somatostatin, GnRH, bradykinin, vasopressin, insulin, and growth factors.
- Examples of mature proteins that can be generated from a precursor protein include, without limitation, growth hormones (GH), particularly human and bovine growth hormone, growth hormone-releasing hormones; interferon including a-, P-, or y-interferons, interleukin-I, interleukin-II, erythropoietin, including a- and P-erythropoietin (EPO), granulocyte colony stimulating factor (GCSF), granulocyte macrophage colony stimulating factor (GM-CSF), anti-angiogenic proteins (e.g., angiostatin, endostatin), PACAP polypeptide (pituitary adenylate cyclase activating polypeptide), vasoactive intestinal peptide (VIP), thyrotrophin releasing hormone (TRH), corticotropin releasing hormone (CRH), vasopressin, arginine vasopressin (A VP),
- KEX2 cleavage sites are known in the art. Examples of KEX2 cleavage sites include -Lys-Arg-
- the KEX2 cleavage site can be naturally occurring in a precursor protein, or can be added to the precursor protein (e.g., in a peptide linker or other heterologous peptide sequence), for example, using molecular cloning and recombinant protein expression. 301
- any hormone e.g., any insulin hormone or insulin analog
- the methods disclosed herein can be applied to any hormone (e.g., any insulin hormone or insulin analog) that requires in vitro processing to convert a prohormone to a mature hormone, provided that the prohormone comprises at least one (e.g., 2, 3, 4, 5 or more) KEX2 cleavage site.
- Insulin is transcribed as a 110-amino acid chain, sometimes referred to as preproinsulin.
- the amino acid sequence of human preproinsulin has been assigned UniProt Accession No. P01308 (SEQ ID NO: 18). Removal of the signal peptide of preproinsulin, consisting of amino acid residues 1-24 (SEQ ID NO: 19), produces proinsulin.
- Biologically active insulin results from removal of amino acid residues 57-87 of SEQ ID NO: 18, corresponding to the amino acid sequence of the insulin C chain (SEQ ID NO:21), and formation of disulfide bonds between the A and B chains. Thus, biologically active insulin comprises just 51 amino acids of the original translation product.
- insulin encompasses preproinsulin, proinsulin and biologically active insulin.
- the insulin is biologically active insulin.
- the insulin is proinsulin.
- the insulin is preproinsulin.
- insulin A chain polypeptide can be naturally occurring or non- naturally occurring (e.g., engineered).
- the insulin A chain polypeptide can be recombinant or synthetic, and unmodified or modified (e.g., post-translationally modified, as by glycosylation or phosphorylation, for example).
- insulin A chain polypeptides that are suitable for use in the fusion proteins and methods described herein are known in the art and include variants of naturally occurring insulin A chain polypeptides (e.g., variants having at least about 70%, about 75%, about 80%, about 85%, about 90%, about 95%, about 96%, about 97%, about 98% or about 99% identity to a naturally occurring insulin A chain polypeptide), such as an insulin A chain polypeptide from humans.
- an insulin A chain polypeptide is a polypeptide having the amino acid sequence of amino acid residues 90-110 of human insulin assigned UniProt Accession No.
- P01308 (SEQ ID NO: 18), or a variant thereof having at least about 70% (c.g, about 75%, about 80%, about 85%, about 90%, about 95%, about 96%, about 97%, about 98% or about 99%) identity to amino acid residues 90-110 of the amino acid sequence of SEQ ID NO: 18.
- the amino acid sequence of SEQ ID NO:22 corresponds to amino acid residues 90-110 of the amino acid sequence of SEQ ID NO: 18.
- an insulin A chain polypeptide is a polypeptide having the amino acid sequence of SEQ ID NO:22, or a variant thereof having at least about 70% (e.g., about 75%, about 80%, about 85%, about 90%, about 95%, about 96%, about 97%, about 98% or about 99%) identity to the amino acid sequence of SEQ ID NO:22.
- “insulin B chain polypeptide” can be naturally occurring or non- naturally occurring (e.g., engineered).
- the insulin B chain polypeptide can be recombinant or synthetic, and unmodified or modified (e.g., post-translationally modified, as by glycosylation or phosphorylation, for example).
- insulin B chain polypeptides that are suitable for use in the fusion proteins and methods described herein are known in the art and include variants of naturally occurring insulin B chain polypeptides (e.g., variants having at least about 70%, about 75%, about 80%, about 85%, about 90, about 95%, about 96%, about 97%, about 98% or about 99% identity to a naturally occurring insulin B chain polypeptide), such as an insulin B chain polypeptide from humans.
- an insulin B chain polypeptide is a polypeptide having the amino acid sequence of amino acid residues 25-54 of human insulin assigned UniProt Accession No.
- P01308 (SEQ ID NO: 18), or a variant thereof having at least about 70% (e.g., about 75%, about 80%, about 85%, about 90%, about 95%, about 96%, about 97%, about 98% or about 99%) identity to amino acid residues 25-54 of the amino acid sequence of SEQ ID NO: 18.
- the amino acid sequence of SEQ ID NO:20 corresponds to amino acid residues 25-54 of the amino acid sequence of SEQ ID NO: 18.
- an insulin B chain polypeptide is a polypeptide having the amino acid sequence of SEQ ID NO:20, or a variant thereof having at least about 70% (e.g., about 75%, about 80%, about 85%, about 90%, about 95%, about 96%, about 97%, about 98% or about 99%) identity to the amino acid sequence of SEQ ID NO:20.
- “insulin C chain polypeptide,” can be naturally occurring or non- naturally occurring (e.g., engineered).
- the insulin C chain polypeptide can be recombinant or synthetic, and unmodified or modified (e.g., post-translationally modified, as by glycosylation or phosphorylation, for example).
- insulin C chain polypeptides that are suitable for use in the fusion proteins and methods described herein are known in the art and include variants of naturally occurring insulin C chain polypeptides (e.g., variants having at least about 70%, about 75%, about 80%, about 85%, about 90, about 95%, about 96%, about 97%, about 98% or about 99% identity to a naturally occurring insulin C chain polypeptide), such as an insulin C chain polypeptide from humans.
- an insulin C chain polypeptide is a polypeptide having the amino acid sequence of amino acid residues 57-87 of human insulin assigned UniProt Accession No. P01308 (SEQ ID NO: 14), or a variant thereof having at least about 70% (e.g., about 75%, about 80%, about 85%, about
- amino acid sequence of SEQ ID NO:21 corresponds to amino acid residues 57-87 of the amino acid sequence of SEQ ID NO: 18.
- U.S. Application Publication No. US 2018/0194827 describes insulin peptides and single chain insulin peptide agonists that include insulin A chains and insulin B chains containing a variety of substitutions, additions and/or modifications compared to native or naturally-occurring insulin A and B chains.
- the teachings of US 2018/0194827 relevant to insulin derivatives and analogs, for example, insulin A chain polypeptides and insulin B chain polypeptides, are incorporated herein by reference in their entirety.
- examples of insulin A chain polypeptides and insulin B chain polypeptides include the insulin A chains and insulin B chains disclosed in US 2018/0194827.
- Glargine is an insulin analog. Two modifications to the wildtype insulin were introduced to make the insulin glargine molecule. One modification is the substitution of Asn with Gly at position A21 at the end of the insulin A chain, and the other is the addition of two Arg residues at the end of the insulin B chain.
- glargine A chain polypeptide is the sequence of insulin A chain with a substitution of Asn with Gly at position A21 at the end of the insulin A chain.
- a glargine A chain polypeptide is a polypeptide having the amino acid sequence of SEQ ID NO:23, or a variant thereof having at least about 70% (e.g., about 75%, about 80%, about 85%, about 90%, about 95%, about 96%, about 97%, about 98% or about 99%) identity to amino acid of SEQ ID NO:23.
- glargine B chain polypeptide is the sequence of insulin B chain with the addition of two Arg residues at the end of the insulin B chain.
- a glargine B chain polypeptide is a polypeptide having the amino acid sequence of SEQ ID 301
- the precursor protein is a fusion protein (e.g., a fusion protein comprising a target protein).
- the precursor protein is a fusion protein that comprises a tag (e.g., a His-tag) to enhance expression, folding and/or purification.
- the precursor protein comprises a De novo Expression Enhancer Protein (DEEP) fusion tag. Examples of DEEP fusion tags that are suitable for inclusion in prohormones capable of being processed by the methods disclosed herein are described herein below, and in International Application No. PCT/US2018/044156, published as International Publication No. WO 2019/023616 Al, the contents of which are incorporated herein by reference in their entirety.
- DEEP De novo Expression Enhancer Protein
- the present disclosure further provides, in various embodiments, a fusion protein comprising a DEEP fusion tag, a target protein, and at least one KEX2 protease cleavage site.
- fusion protein refers to a synthetic, semi -synthetic or recombinant single protein molecule that comprises all or a portion of two or more different proteins and/or peptides.
- DEEP fusion tag refers to a polypeptide having at least two (e.g., 2, 3, 4, 5 or 6) a-helices, wherein each a-helix comprises a binary patterned sequence of seven amino acid residues, or heptad sequence, defined by [PNPPNNP]n, where each “P” is independently selected from the polar amino acid residues Lys (K), His (H), Glu (E), Gin (Q), Asp (D), Asn (N), Thr (T) and Ser (S), each “N” is independently selected from the nonpolar amino acid residues Phe (F), Leu (L), He (I), Met (M), Vai (V) and Trp (W), and n is an integer from 2 to 10 (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10).
- n 3.
- the heptad sequences in an a-helix containing more than one heptad sequence can be identical (i.e., repeats of the same heptad sequence) or they can be different (i.e., each PNPPNNP heptad sequence within the same a-helix can have a different amino acid sequence).
- the amino acid composition of the a-helices in a DEEP fusion tag can vary from helix to helix such that, for example, each of the a-helices in the tag will have a different amino acid sequence.
- a DEEP fusion tag can also include additional amino acid residues, for example, N-terminal to the first a-helix and/or C-terminal to the last a-helix in the tag.
- a DEEP fusion tag will also include interhelical turns between the a-helices, wherein each interhelical turn includes, for example, 4, 5 or 6 amino acid residues encoded by the degenerate DNA codon VAN (V: A, G, or C; N: A, G, C, or T) (e.g., Gly (G), His (H), Gin (Q), Asn (N), Asp (D), Glu (E) and Lys (K)).
- a DEEP fusion tag is generally at least about 70 amino acid residues in length (e.g., 74 amino acid residues). In a particular embodiment, a DEEP fusion tag is at least about 100 amino acid residues in length (e.g., 102 amino acid residues). Typically, a DEEP fusion tag is less than about 500 amino acid residues in length, for example, less than about 450 amino acid residues in length or less than about 420 amino acid residues in length.
- a DEEP fusion tag includes a plurality of histidine residues that are exposed at the surface in a properly folded fusion protein.
- each a-helix in a DEEP fusion tag includes at least one (e.g., 1, 2, 3, 4, 5, 6, or more, for example, 12) histidine residue(s).
- the fusion can be an TV-terminal fusion (with respect to the DEEP fusion tag), a C- terminal fusion (with respect to the DEEP fusion tag) or an internal fusion (with respect to the DEEP fusion tag and/or the target protein).
- polypeptides that are suitable for use as DEEP fusion tags in the present disclosure, as well as methods of designing and making such polypeptides, are described in the following publications, each of which is incorporated by reference herein in its entirety:
- the DEEP fusion tag comprises, consists essentially of, or consists of (e.g., comprises) the amino acid sequence of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16 or SEQ ID NO: 17, or a functional fragment thereof, such as a fragment lacking the N-terminal methionine residue.
- the DEEP fusion tag comprises a variant amino acid sequence of any of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO:14, SEQ ID NO: 15, SEQ ID NO: 16 or SEQ ID NO: 17 having, for example, at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more identity to the amino acid sequence of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO:
- the fusion protein further comprises at least one linker sequence.
- linker amino acid sequences are known in the art and can be used in the invention.
- a linker sequence includes one or more amino acid residues selected from Gly, Ser, Thr, His, Asp, Glu, Asn, Gin, Lys and Arg.
- a linker sequence includes a polyglycine sequence (e.g., a 6X glycine sequence).
- Other examples of linkers include GSAGSAAGSG (SEQ ID NO: 12), GGGGGGSR (SEQ ID NO: 13), KR and RR.
- a linker sequence includes a cleavage site. 301
- the fusion protein comprises at least one KEX2 cleavage site in the target protein. In some embodiments, the fusion protein additionally or alternatively comprises at least one KEX2 cleavage site in a linker sequence.
- the fusion protein comprises, consists essentially of or consists of SEQ ID NO:27.
- the target protein comprises or consists of a hormone and the fusion protein comprises or consists of a prohormone.
- the target protein comprises insulin or an insulin analog.
- the fusion protein comprises proinsulin.
- the target protein comprises glargine.
- the fusion protein comprises proglargine.
- the fusion protein comprises or consists of SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, or SEQ ID NO:29, or a variant of any of the foregoing having at least about 70% amino acid sequence identity to one or more of SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, or SEQ ID NO:29.
- the fusion protein comprises or consists of SEQ ID NO:27.
- fusion proteins of the disclosure can be produced recombinantly or synthetically, using routine methods and reagents that are well known in the art.
- a fusion protein of the disclosure can be produced recombinantly in a suitable host cell (e.g., bacteria, yeast, insect cells, mammalian cells) according to methods known in the art. See, e.g., Current Protocols in Molecular Biology, Second Edition, Ausubel et al. eds., John Wiley & Sons, 1992; and Molecular Cloning: a Laboratory Manual, 2nd edition, Sambrook et al., 1989, Cold Spring Harbor Laboratory Press.
- a nucleic acid molecule comprising a nucleotide sequence encoding a fusion protein described herein can be introduced and expressed in suitable host cells (e.g., E. colt), and the expressed fusion protein can be isolated/purified from the host cells (e.g., in inclusion bodies) using routine methods and readily available reagents.
- suitable host cells e.g., E. colt
- the expressed fusion protein can be isolated/purified from the host cells (e.g., in inclusion bodies) using routine methods and readily available reagents.
- Methods for introducing DNA constructs encoding fusion proteins into host cells are well known in the art and include, for example, standard transformation and transfection techniques (e.g., electroporation, chemical transformation).
- standard transformation and transfection techniques e.g., electroporation, chemical transformation.
- a person of ordinary skill in the field of the disclosure can readily select an appropriate method for introducing a DNA construct into host cells.
- a variety of methods for expressing proteins in host cells are well known in the art (e.g., IPTG-induced expression in E. colt).
- a person of ordinary skill in the field of the disclosure can readily select an appropriate method for expressing a fusion protein of the disclosure in host cells.
- An expressed fusion protein can be isolated from host cells using known methods and reagents including, e.g., lysozyme treatment, sonication, filtration, salting-out, ultracentrifugation, and chromatography.
- a recombinantly-expressed fusion protein can be recovered from host cells and/or the host cell culture medium. Once released from the cells, the fusion protein can be purified from cell lysates by binding to an affinity resin using standard techniques and reagents.
- the fusion protein is isolated by binding of the DEEP fusion tag in the fusion protein to an affinity resin (e.g., on a solid support).
- the DEEP fusion tag comprises a plurality of surface- exposed histidine residues, enabling purification by methods typically used for His-tagged proteins.
- a fusion protein of the disclosure can be isolated using immobilized metal ion affinity chromatography (IMAC).
- IMAC resins containing immobilized transition metals for IMAC applications are known in the art and are commercially available (e.g., TALON® Superfl owTM resins, HisTrapTM High Performance resins, GE Healthcare Life Sciences), and include, e.g., immobilized nickel resins, immobilized cobalt resins, immobilized copper resins, and immobilized zinc resins.
- a fusion protein of the disclosure is purified using an affinity resin comprising immobilized nickel ions.
- the present disclosure further provides, in various embodiments, a polynucleotide encoding a fusion protein disclosed herein.
- the polynucleotide is a DNA polynucleotide.
- the polynucleotide is a RNA polynucleotide.
- the polynucleotide can be in the form of an insert (e.g., for cloning into a vector).
- the polynucleotide can be linear or circular.
- the polynucleotide comprises one or more of a non-canonical nucleotide and a modified nucleotide (e.g., a nucleotide comprising a chemical modification).
- the polynucleotide can be isolated, recombinant, synthetic or semi -synthetic.
- a nucleic acid encoding a fusion protein of the disclosure includes a nucleotide sequence that has been optimized for expression in a particular type of host cell (e.g., through codon optimization).
- Codon optimization refers to a process in which a polynucleotide encoding a protein of interest is modified to replace particular codons in that polynucleotide with codons that encode the same amino acid(s), but are more commonly used/recognized in the host cell in which the nucleic acid is being expressed.
- the polynucleotides encoding a fusion protein of the disclosure are codon optimized for expression in coli.
- the disclosure also provides, in various embodiments, a vector comprising a polynucleotide of the disclosure.
- the vector is an expression vector.
- the vector is a viral vector (e.g., lentiviral vector, adenoviral vector, AAV).
- the vector is a non-viral vector (e.g., plasmid, cloning vector).
- a variety of vectors, including expression vectors, viral vectors and non-viral vectors are known in the art and are commercially available.
- the disclosure further provides, in various embodiments, a host cell comprising a polynucleotide disclosed herein or a vector disclosed herein.
- a host cell refers to a suitable host for expressing a nucleic acid encoding a fusion protein comprising a DEEP fusion tag.
- the host cells are cells that have been transformed or transfected with vectors constructed using recombinant DNA techniques known in the art.
- suitable host cells include yeast cells (e.g., Pichia pastoris and Saccharomyces cerevisiae), insect cells (e.g., Spodoptera frugiperda Sf9 cells), mammalian cells (e.g., CHO cells), and bacterial cells (e.g., E. coli and B. sublHis Agrobacterium tumefaciens). Further examples of suitable host cells include plant cells (e.g., Nicotiana benlhamiana . In a particular aspect, the host cell is E. coli.
- the disclosure additional provides a kit comprising one or more polynucleotides comprising a nucleotide sequence encoding a DEEP fusion tag and a cloning site for introducing a nucleotide sequence encoding a target protein to form a fusion protein, and a KEX2 protease comprising an amino acid sequence having at least 70% sequence identity to SEQ ID NO: 1 (e.g., in separate containers).
- the KEX2 protease comprises or consists of SEQ ID NO:1.
- the polynucleotide further comprises a sequence encoding a peptide linker, a sequence providing a KEX2 protease cleavage site, or a combination thereof.
- the kit further comprises instructions for use.
- a synthetic gene encoding a DEEP-insulin glargine fusion protein (SEQ ID NO:27) was constructed using E. coll codon optimized gBlocks (Integrated DNA Technologies), and amplified with forward and reverse primers containing Xbal and Hindlll restriction sites.
- the sequence encoding DEEP -glargine was introduced into pET30 vector carrying kanamycin resistance to yield pET30DEEP- glargine.
- the plasmids were transformed into competent BL21DE3 cells and plated on LB agar supplemented with 30 mg/L kanamycin. The next day, a fresh colony was picked and inoculated into 5mL of LB kanamycin, and incubated overnight at 37°C with continuous shaking at 200 rpm. 4 ml of the overnight culture were inoculated into IL of LB supplemented with kanamycin.
- the culture was induced with 0.5mM IPTG, after a sample of the culture measured an OD600 of 0.8, the culture was further incubated for 3h. Cells were collected using a centrifuge, operating at 4,500xg. Cell pellets were kept at -80°C until further use.
- n is the moles of protein in the peak
- F is the flow rate
- a is the molar extinction coefficient
- d is the pathlength
- Refolding was performed by the rapid dilution method.
- the concentration of the protein in resuspended inclusion bodies was adjusted to -5-10 mg/ml, and further diluted 10- 20 times into the refolding buffer.
- the refolding was performed with solutions pre-chilled on ice or at 4°C. Analysis of the refolding process was performed using an HPLC. Before loading the samples, the reactions were quenched with acetonitrile and TFA, at final concentrations of 10% and 0.5% (v/v), respectively. Refolding yields were calculated based on AUC of the refolded protein divided by the AUC of the fully reduced and denatured one.
- the solution was supplemented with 50mM tris and 300mM NaCl (final concentration), ImM oxidized glutathione, and the pH was adjusted to 8.
- the protein solution was purified on the FPLC (AKTA pure) equipped with a HisTrap column, buffer A (50mM tris, 300mM NaCl pH8), and buffer B same as A supplemented with 500mM imidazole at pH8. Fractions containing the elution peak were combined, and immediately desalted using PD-10 column into lOOmM tris pH8. Aliquots of protein solution were supplemented with various concentrations of CaC12 and KEX2 (PeproTech).
- the cleavage reaction was incubated at 37°C, monitoring the progress using RP-HPLC equipped with C18 column (Zorbax, Agilent). Cleaved insulin glargine was precipitated by the addition of lOmM ZnSO4, and adjusting the pH to 6. Instantly the solution became cloudy, and was further incubated for 12h at 4°C with gentle stirring. The precipitated protein was spun down at 5000Xg, for 15min and the pellet was dissolved in 6M GdnHCl, 20% MeCN, 1% TFA. The protein sample was further purified on the HPLC equipped with a semi-prep Cl 8 301
- DEEP-glargine protein was expressed, extracted and purified as described above.
- lOOul samples were supplemented with CaC12 and KEX2 (Peprotech, US) (FIG. 2). The samples were further incubated at 37°C overnight.
- insulin glargine was expressed as a prohormone comprising a DEEP fusion tag, referred to herein as “DEEP- Glargine,” and having the amino acid sequence of SEQ ID NO:27, and converted in vitro to mature glargine.
- DEEP-Glargine in which the C-chain of proglargine was replaced with a DEEP protein, was expressed in E. coh. extracted, refolded, and converted to mature glargine in vitro by incubation with a recombinant KEX2 protease as described above. The conversion of DEEP-Glargine using KEX2 protease was observed to proceed to near completion without generating any detectable cleavage byproducts.
Abstract
The disclosure provides, in various embodiments, methods of converting a precursor protein ( e.g., a prohormone) to a mature protein (e.g, a hormone), and methods of obtaining a target protein (e.g, from a fusion protein), using a KEX2 protease. In various embodiments, the disclosure further provides fusion proteins, polynucleotides, vectors, host cells and kits that are useful for performing the methods of the disclosure.
Description
METHODS OF CONVERTING PRECURSOR PROTEINS TO MATURE PROTEINS USING KEX2 PROTEASES
RELATED APPLICATION
[0001] This application claims the benefit of U.S. Provisional Application No. 63/106,676, filed on October 28, 2020. The entire teachings of this application are incorporated herein by reference.
INCORPORATION BY REFERENCE OF MATERIAL IN ASCII TEXT FILE
[0002] This application incorporates by reference the Sequence Listing contained in the following ASCII text file being submitted concurrently herewith: a) File name: 53911030001SequenceListing.txt; created October 28, 2021, 33,000 Bytes in size.
BACKGROUND
[0003] It was estimated in 2018 that approximately 7.4 million Americans with diabetes required the administration of insulin. Insulin is a protein hormone that enables the regulation of blood sugar in the body. Impaired function or lack of insulin, as observed with patients suffering from type I or type II diabetes, could lead to lethal consequences.
[0004] The insulin molecule is composed of two peptide chains, A and B, held together by several covalent bonds formed by cysteine residues. During the process of production, the insulin molecule begins as a prohormone, containing a connecting chain C, that holds the A and B chains together to enable the folding of the protein and the formation of correct disulfide bonds. The A and B chains of insulin held by C chain, referred to as the insulin prohormone, or proinsulin, requires a conversion step to release the mature insulin from the connecting C chain. This process happens in the pancreas in vivo.
[0005] To accommodate specific pharmacokinetic and pharmacodynamic needs, several insulin analogs have been developed. One of these analogs is the dibasic insulin glargine, which is sold under the trade name of LANTUS® insulin, among others. Two modifications to the wildtype insulin were introduced to make the insulin glargine molecule. One modification is the substitution of Asn with Gly at position A21 at the end of the A chain, and the other is the addition of two Arg residues at the end of the B chain.
- 1 -
[0006] The most utilized enzyme in the in vitro conversion of proinsulin to insulin is trypsin. Trypsin recognize and cleaves at the C-terminus of Lys and Arg residues. The current state of the art process for producing glargine involves chemical blockage of certain amino acids, enzymatic processing using trypsin, and finally de-blocking. However, since the end of the B chain of glargine contains the sequence KTRR-COOH, (i.e., Lys-Thr-Arg-Arg-COOH), in vitro conversion of glargine with trypsin is inefficient and yields several byproducts, such as desB32R and desB30TRR. Thus, the overall yields achieved by this tedious process are low, and require purification to eliminate byproducts. To increase the efficiency of trypsin cleavage and improve yields of mature glargine, it has been proposed to chemically block B29K. Although yields of insulin glargine can be increased by doing so, this chemical blockage does not prevent the formation of undesirable byproducts.
[0007] Accordingly, there is a need for efficient methods for processing proinsulin, glargine, and other prohormones.
SUMMARY
[0008] The present disclosure generally relates to methods, compositions, and kits that are useful for processing precursor proteins (e.g. , prohormones) to obtain mature proteins (e.g., mature hormones) using KEX2 proteases.
[0009] Accordingly, the disclosure provides, in various embodiments, a method of converting a precursor protein to a mature protein. The method comprises providing a precursor protein that is to be converted to a mature protein, wherein the precursor protein comprises at least one cleavage site for a KEX2 protease, and contacting the precursor protein with a KEX2 protease comprising an amino acid sequence having at least 70% sequence identity to SEQ ID NO: 1 under conditions in which cleavage of the precursor protein by the KEX2 protease occurs to provide a mature protein.
[0010] Also provided herein, in various embodiments, is a method of obtaining a target protein, comprising providing a precursor protein that comprises a target protein and at least one cleavage site for a KEX2 protease; and contacting the precursor protein with a KEX2 protease comprising an amino acid sequence having at least 70% sequence identity to SEQ ID NO: 1 under conditions in which cleavage of the precursor protein by the KEX2 protease occurs and releases the target protein. In some embodiments, the method further comprises isolating the target protein.
[0011] In various embodiments, the present disclosure further provides a fusion protein comprising a DEEP fusion tag, a target protein, and at least one KEX2 protease cleavage site, as well as polynucleotides, vectors, and host cells encoding and/or expressing the fusion protein.
[0012] Also provided herein, in various embodiments, is a target protein obtained by a method of the disclosure.
[0013] The present disclosure further provides, in various embodiments, a kit comprising a polynucleotide that comprises a nucleotide sequence encoding a DEEP fusion tag and a cloning site for introducing a nucleotide sequence encoding a target protein to form a fusion protein. The kit also comprises a KEX2 protease that comprises an amino acid sequence having at least 70% sequence identity to SEQ ID NO: 1.
BRIEF DESCRIPTION OF THE DRAWINGS
[0014] The foregoing will be apparent from the following more particular description of example embodiments.
[0015] FIG. 1 depicts an alignment of example DEEP-glargine precursor proteins disclosed herein. Box 1 indicates linker amino acids. Boxes 2, 4, and 5 indicate KEX2 protease cleavage sites. Box 3 indicates glargine B chain polypeptide (SEQ ID NO:24). Box 6 indicates glargine A chain polypeptide (SEQ ID NO:23). The unbounded regions of SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28 and SEQ ID NO:29, located between boxes 4 and 5, correspond to DEEP fusion tags having amino acid sequences of SEQ ID NO:5, SEQ ID NO:9, SEQ ID NO: 13 and SEQ ID NO: 17, respectively.
[0016] FIG. 2 is a graph showing the concentration of processed (mature active) insulin glargine after treatment of a DEEP-glargine precursor (SEQ ID NO:27) with various molar ratios of KEX2 enzyme at varying CaCl2 concentrations. Concentration was calculated based on injection of 0.03mg/ml LANTUS® insulin (AUC=386).
DETAILED DESCRIPTION
[0017] A description of example embodiments follows.
[0018] When introducing elements disclosed herein, the articles “a,” “an,” “the,” and “said” are intended to mean that there are one or more of the elements. The terms “comprising,” “having” and “including” are intended to be open-ended and mean that there may be additional elements other than the listed elements.
- 3 -
[0019] The term “about,” when referring to a measurable value, such as an amount, refers to variations of ± 20%, e.g., in some embodiments, ± 10%, ± 5%, ± 1% or ± 0.1% from the specified value.
[0020] “Protein,” “peptide” and “polypeptide” are used interchangeably herein to denote a polymer of at least two amino acids covalently linked by an amide bond, regardless of length or post-translational modification (e.g., glycosylation or phosphorylation). A protein, peptide or polypeptide can comprise any suitable L-and/or D-amino acid, for example, common a-amino acids (e.g., alanine, glycine, valine), non-a-amino acids (e.g., 0-alanine, 4- aminobutyric acid, 6-aminocaproic acid, sarcosine, statine), and unusual amino acids (e.g., citrulline, homocitruline, homoserine, norleucine, norvaline, ornithine). The amino, carboxyl and/or other functional groups on a peptide can be free (e.g., unmodified) or protected with a suitable protecting group. Suitable protecting groups for amino and carboxyl groups, and methods for adding or removing protecting groups are known in the art and are disclosed in, for example, Green and Wuts, “Protecting Groups in Organic Synthesis, ” John Wiley and Sons, 1991. The functional groups of a protein, peptide or polypeptide can also be derivatized (e.g., alkylated) or labeled (e.g., with a detectable label, such as a fluorogen or a hapten) using methods known in the art. A protein, peptide or polypeptide can comprise one or more modifications (e.g., amino acid linkers, acylation, acetylation, amidation, methylation, terminal modifiers (e.g., cyclizing modifications), A-methyl-a-amino group substitution), if desired. In addition, a protein, peptide or polypeptide can be an analog of a known and/or naturally-occurring peptide, for example, a peptide analog having conservative amino acid residue substitution(s).
[0021] The term “nucleic acid” is used herein to refer to a polymer comprising multiple nucleotide monomers (e.g., ribonucleotide monomers or deoxyribonucleotide monomers). “Nucleic acid” includes, for example, DNA (e.g., cDNA), RNA, and DNA-RNA hybrid molecules. Nucleic acid molecules can be naturally occurring, recombinant, or synthetic. In addition, nucleic acid molecules can be single-stranded, double-stranded or triple-stranded. In some aspects, nucleic acid molecules can be modified. Nucleic acid modifications include, for example, methylation, substitution of one or more of the naturally occurring nucleotides with a nucleotide analog, internucleotide modifications such as uncharged linkages (e.g. , methyl phosphonates, phosphotriesters, phosphoamidates, carbamates, and the like), charged linkages (e.g., phosphorothioates, phosphorodithioates, and the like), pendent moi eties (e.g., polypeptides), intercalators (e.g., acridine, psoralen, and the like), chelators, alkylators, and
301
WO 2022/094146 PCT/US2021/057142 modified linkages (e.g., alpha anomeric nucleic acids, and the like). In the case of a doublestranded polymer, “nucleic acid” can refer to either or both strands of the molecule.
[0022] Methods of the Disclosure
[0023] The present disclosure is based, at least in part, on the discovery that cleavage of a prohormone of an insulin analog by KEX2 protease can facilitate production of the mature form of the insulin analog, with high efficiency and without producing undesired byproducts that result from alternative known methods.
[0024] Accordingly, in various embodiments, the present disclosure provides a method of converting a precursor protein into a mature protein by contacting the precursor protein with a KEX2 protease. The method comprises providing a precursor protein that is to be converted to a mature protein, wherein the precursor protein comprises at least one cleavage site for a KEX2 protease, and contacting (e.g., incubating in vitro) the precursor protein with a KEX2 protease under conditions in which cleavage of the precursor protein by the KEX2 protease occurs to provide a mature protein. In particular embodiments, the methods of the disclosure are useful for converting a prohormone to a mature hormone using KEX2 protease (e.g. , a yeast KEX2 protease). In some embodiments, the prohormone is insulin prohormone, also known as proinsulin. In some embodiments, the prohormone is insulin glargine prohormone, also referred to as proinsulin glargine or glargine proinsulin or proglargine.
[0025] In various other embodiments, the present disclosure provides a method of obtaining a target protein by contacting a precursor protein with a KEX2 protease. The method comprises providing a precursor protein that comprises a target protein and at least one cleavage site for a KEX2 protease, and contacting the precursor protein with a KEX2 protease under conditions in which cleavage of the precursor protein by the KEX2 protease occurs and releases the target protein. In some embodiments, the method further comprises isolating the target protein. In some embodiments, the precursor protein is a fusion protein comprising the target protein. As used herein, the term “target protein” refers to a peptide or polypeptide whose expression in a host system is desired. Such proteins are also referred to herein as proteins of interest, or POIs. Examples of such proteins that can be included in precursor or fusion proteins of the disclosure are green fluorescent protein (GFP), amyloid beta (AP) polypeptide, Trp Cage protein, LS3 polypeptide, insulin A chain polypeptide, and insulin B chain polypeptide. A further example of a target protein that can be included in precursor or fusion proteins of the disclosure is insulin, or a fragment thereof, such as an
301
WO 2022/094146 PCT/US2021/057142 insulin A chain polypeptide, an insulin B chain polypeptide or an insulin C chain polypeptide, or a combination thereof. Another example of a target protein that can be included in the precursor or fusion proteins of the disclosure is glargine, or a fragment thereof.
[0026] In some embodiments, the disclosure provides a target protein obtained by a method disclosed herein.
[0027] Suitable KEX2 proteases for use in the methods disclosed herein include the KEX2 protease from S. cerevisiae, NCBI Reference Sequence: NP 014161.1 (SEQ ID NO: 1), and KEX2 proteases comprising the amino acid sequence of SEQ ID NO: 1, or a variant amino acid sequence thereof (e.g., a variant amino acid sequence having at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or greater amino acid sequence identity to wild type S. cerevisiae KEX2 protease). KEX2 proteases can be naturally occurring (e.g., isolated, purified, extracted from a natural source, such as S. cerevisiae or another organism that expresses a KEX2 protease endogenously), recombinant, or synthetic. Recombinant yeast KEX2 proteases, including recombinant S. cerevisiae KEX2 protease, can be obtained commercially from a variety of sources. In various embodiments, a KEX2 comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 1.
[0028] As used herein, the term “sequence identity” means that two nucleotide or amino acid sequences, when optimally aligned, such as by the programs GAP or BESTFIT using default gap weights, share at least, e.g., 70% sequence identity, or at least 80% sequence identity, or at least 85% sequence identity, or at least 90% sequence identity, or at least 95% sequence identity or more.
[0029] For sequence comparison, one sequence acts as a reference sequence (e.g., parent sequence) to which one or more test sequences are compared. The sequence identity comparison can be examined throughout the entire length of a given protein, or within a desired fragment of a given protein. When using a sequence comparison algorithm, test and reference sequences are input into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. The sequence comparison algorithm then calculates the percent sequence identity for the test sequence(s) relative to the reference sequence, based on the designated program parameters.
[0030] Optimal alignment of sequences for comparison can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appl. Math. 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J. Mol. Biol. 48:443 (1970), by the
301
WO 2022/094146 PCT/US2021/057142 search for similarity method of Pearson & Lipman, Proc. Nat'l. Acad. Sci. USA 85:2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, Wis.), or by visual inspection (see generally Ausubel et al., Current Protocols in Molecular Biology). One example of algorithm that is suitable for determining percent sequence identity and sequence similarity is the BLAST algorithm, which is described in Altschul et al., J. Mol. Biol. 215:403 (1990). Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (publicly accessible through the National Institutes of Health NCBI internet server). Typically, default program parameters can be used to perform the sequence comparison, although customized parameters can also be used. For amino acid sequences, the BLASTP program uses as defaults a wordlength (W) of 3, an expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA 89: 10915 (1989)).
[0031] In the methods of the disclosure, a precursor protein is contacted with a KEX2 protease under conditions suitable for converting the prohormone to a mature hormone. Conditions suitable for converting the prohormone to a mature hormone using a KEX2 protease can be readily ascertained by a person of ordinary skill in the art to which the present disclosure pertains, and includes those described herein, as well as those described in product sheets supplied with commercially available KEX2 proteases.
[0032] In certain embodiments, the conversion is performed in vitro. In other embodiments, the conversion is performed in vivo (e.g., in eukaryotic cells, such as yeast cells, that express a KEX2 protease, either endogenously or exogenously).
[0033] The term “precursor protein” refers to any protein that can be processed to yield a mature (e.g., active) protein, including preproteins and prohormones. Precursor proteins encompasses both naturally occurring proteins and artificial proteins, such as fusion proteins. A precursor protein can be synthetic, semi-synthetic or recombinant. In some embodiments, the precursor protein includes the sequence of a mature protein. In some embodiments, the precursor protein is a preprotein. In some embodiments, the precursor protein is a prohormone. In certain embodiments, the precursor protein is a prohormone of insulin, also referred to as insulin prohormone or proinsulin, or an analog thereof. In particular embodiments, the precursor protein is a prohormone of glargine, also referred to as proinsulin
301
WO 2022/094146 PCT/US2021/057142 glargine or glargine proinsulin or proglargine. When the precursor protein is a prohormone, the corresponding mature protein is generally the mature hormone form of the prohormone. [0034] Precursor proteins can be selected from the following non-limiting example classes of proteins: transcription factors, ligands for cellular receptors, hormones and extracellular binding peptides. Examples of precursor proteins include enkephlin, LHRH, neuropeptides, glycoincretins, integrin, glucagons and glucagon-like peptides, antithrombotic peptides, cytokines and interleukins, transferrins, interferons, endothelins, natriuretic hormones, extracellular kinase ligands, angiotensin enzyme inhibitors, peptide antiviral compounds, thrombin, substance P, substance G, somatotropin, somatostatin, GnRH, bradykinin, vasopressin, insulin, and growth factors.
[0035] Examples of mature proteins that can be generated from a precursor protein (e.g., by the methods disclosed herein) include, without limitation, growth hormones (GH), particularly human and bovine growth hormone, growth hormone-releasing hormones; interferon including a-, P-, or y-interferons, interleukin-I, interleukin-II, erythropoietin, including a- and P-erythropoietin (EPO), granulocyte colony stimulating factor (GCSF), granulocyte macrophage colony stimulating factor (GM-CSF), anti-angiogenic proteins (e.g., angiostatin, endostatin), PACAP polypeptide (pituitary adenylate cyclase activating polypeptide), vasoactive intestinal peptide (VIP), thyrotrophin releasing hormone (TRH), corticotropin releasing hormone (CRH), vasopressin, arginine vasopressin (A VP), angiotensin, calcitonin, atrial naturetic factor, somatostatin, adrenocorticotropin, gonadotropin releasing hormone, oxytocin, insulin, somatotropin, plasminogen tissue activator, coagulation factors including coagulation factors VIII and IX, glucosylceramidase, sargramostim, lenograstin, filgrastin, dornase-a, molgramostim, PEG-L-asparaginase, PEG- adenosine deaminase, hirudin, eptacog-a (human blood coagulation factor Vila), nerve growth factors, transforming growth factor, epidermal growth factor, basic fibroblast growth factor, VEGF, heparin including low molecular weight heparin, calcitonin, antigens, monoclonal antibodies, vancomycin, desferrioxamine (DFO), parathyroid hormone, an immunogen or antigen, an antibody such as a monoclonal antibody.
[0036] KEX2 cleavage sites are known in the art. Examples of KEX2 cleavage sites include -Lys-Arg-|-Xaa- and -Arg-Arg-|-Xaa-, where Xaa can be any amino acid. The KEX2 cleavage site can be naturally occurring in a precursor protein, or can be added to the precursor protein (e.g., in a peptide linker or other heterologous peptide sequence), for example, using molecular cloning and recombinant protein expression.
301
WO 2022/094146 PCT/US2021/057142
[0037] Without wishing to be bound by theory, it is believed that the methods disclosed herein can be applied to any hormone (e.g., any insulin hormone or insulin analog) that requires in vitro processing to convert a prohormone to a mature hormone, provided that the prohormone comprises at least one (e.g., 2, 3, 4, 5 or more) KEX2 cleavage site.
[0038] Insulin is transcribed as a 110-amino acid chain, sometimes referred to as preproinsulin. The amino acid sequence of human preproinsulin has been assigned UniProt Accession No. P01308 (SEQ ID NO: 18). Removal of the signal peptide of preproinsulin, consisting of amino acid residues 1-24 (SEQ ID NO: 19), produces proinsulin. Biologically active insulin results from removal of amino acid residues 57-87 of SEQ ID NO: 18, corresponding to the amino acid sequence of the insulin C chain (SEQ ID NO:21), and formation of disulfide bonds between the A and B chains. Thus, biologically active insulin comprises just 51 amino acids of the original translation product. Unless otherwise indicated, “insulin,” as used herein, encompasses preproinsulin, proinsulin and biologically active insulin. In some embodiments, the insulin is biologically active insulin. In some embodiments, the insulin is proinsulin. In some embodiments, the insulin is preproinsulin. [0039] As used herein, “insulin A chain polypeptide,” can be naturally occurring or non- naturally occurring (e.g., engineered). The insulin A chain polypeptide can be recombinant or synthetic, and unmodified or modified (e.g., post-translationally modified, as by glycosylation or phosphorylation, for example). Examples of insulin A chain polypeptides that are suitable for use in the fusion proteins and methods described herein are known in the art and include variants of naturally occurring insulin A chain polypeptides (e.g., variants having at least about 70%, about 75%, about 80%, about 85%, about 90%, about 95%, about 96%, about 97%, about 98% or about 99% identity to a naturally occurring insulin A chain polypeptide), such as an insulin A chain polypeptide from humans. In some embodiments, an insulin A chain polypeptide is a polypeptide having the amino acid sequence of amino acid residues 90-110 of human insulin assigned UniProt Accession No. P01308 (SEQ ID NO: 18), or a variant thereof having at least about 70% (c.g, about 75%, about 80%, about 85%, about 90%, about 95%, about 96%, about 97%, about 98% or about 99%) identity to amino acid residues 90-110 of the amino acid sequence of SEQ ID NO: 18. The amino acid sequence of SEQ ID NO:22 corresponds to amino acid residues 90-110 of the amino acid sequence of SEQ ID NO: 18. Accordingly, in some embodiments, an insulin A chain polypeptide is a polypeptide having the amino acid sequence of SEQ ID NO:22, or a variant thereof having at
least about 70% (e.g., about 75%, about 80%, about 85%, about 90%, about 95%, about 96%, about 97%, about 98% or about 99%) identity to the amino acid sequence of SEQ ID NO:22. [0040] As used herein, “insulin B chain polypeptide,” can be naturally occurring or non- naturally occurring (e.g., engineered). The insulin B chain polypeptide can be recombinant or synthetic, and unmodified or modified (e.g., post-translationally modified, as by glycosylation or phosphorylation, for example). Examples of insulin B chain polypeptides that are suitable for use in the fusion proteins and methods described herein are known in the art and include variants of naturally occurring insulin B chain polypeptides (e.g., variants having at least about 70%, about 75%, about 80%, about 85%, about 90, about 95%, about 96%, about 97%, about 98% or about 99% identity to a naturally occurring insulin B chain polypeptide), such as an insulin B chain polypeptide from humans. In some embodiments, an insulin B chain polypeptide is a polypeptide having the amino acid sequence of amino acid residues 25-54 of human insulin assigned UniProt Accession No. P01308 (SEQ ID NO: 18), or a variant thereof having at least about 70% (e.g., about 75%, about 80%, about 85%, about 90%, about 95%, about 96%, about 97%, about 98% or about 99%) identity to amino acid residues 25-54 of the amino acid sequence of SEQ ID NO: 18. The amino acid sequence of SEQ ID NO:20 corresponds to amino acid residues 25-54 of the amino acid sequence of SEQ ID NO: 18. Accordingly, in some embodiments, an insulin B chain polypeptide is a polypeptide having the amino acid sequence of SEQ ID NO:20, or a variant thereof having at least about 70% (e.g., about 75%, about 80%, about 85%, about 90%, about 95%, about 96%, about 97%, about 98% or about 99%) identity to the amino acid sequence of SEQ ID NO:20. [0041] As used herein, “insulin C chain polypeptide,” can be naturally occurring or non- naturally occurring (e.g., engineered). The insulin C chain polypeptide can be recombinant or synthetic, and unmodified or modified (e.g., post-translationally modified, as by glycosylation or phosphorylation, for example). Examples of insulin C chain polypeptides that are suitable for use in the fusion proteins and methods described herein are known in the art and include variants of naturally occurring insulin C chain polypeptides (e.g., variants having at least about 70%, about 75%, about 80%, about 85%, about 90, about 95%, about 96%, about 97%, about 98% or about 99% identity to a naturally occurring insulin C chain polypeptide), such as an insulin C chain polypeptide from humans. In some embodiments, an insulin C chain polypeptide is a polypeptide having the amino acid sequence of amino acid residues 57-87 of human insulin assigned UniProt Accession No. P01308 (SEQ ID NO: 14), or a variant thereof having at least about 70% (e.g., about 75%, about 80%, about 85%, about
- 10 -
90%, about 95%, about 96%, about 97%, about 98% or about 99%) identity to amino acid residues 57-87 of the amino acid sequence of SEQ ID NO: 18. The amino acid sequence of SEQ ID NO:21 corresponds to amino acid residues 57-87 of the amino acid sequence of SEQ ID NO: 18.
[0042] U.S. Application Publication No. US 2018/0194827 describes insulin peptides and single chain insulin peptide agonists that include insulin A chains and insulin B chains containing a variety of substitutions, additions and/or modifications compared to native or naturally-occurring insulin A and B chains. The teachings of US 2018/0194827 relevant to insulin derivatives and analogs, for example, insulin A chain polypeptides and insulin B chain polypeptides, are incorporated herein by reference in their entirety. Thus, examples of insulin A chain polypeptides and insulin B chain polypeptides include the insulin A chains and insulin B chains disclosed in US 2018/0194827.
[0043] Mathieu, C., Gillard, P. and Benhalima, K., Nature Reviews Endocrinology 13, 385-399 (2017) (Mathieu et al.) and Zaykov, A.N., Mayer, J.P. and DiMarchi, R.D., Nature Reviews Drug Discovery 15, 425-439 (2016) (Zaykov et al.) describe insulin analogues. The teachings of Mathieu et al. and Zaykov et al. relevant to insulin derivatives and analogues, for example, insulin A chain polypeptides and insulin B chain polypeptides, are incorporated herein by reference in their entirety. Thus, examples of insulin A chain polypeptides and insulin B chain polypeptides include the insulin A chains and insulin B chains disclosed in Mathieu et al. and Zaykov et al.
[0044] Glargine is an insulin analog. Two modifications to the wildtype insulin were introduced to make the insulin glargine molecule. One modification is the substitution of Asn with Gly at position A21 at the end of the insulin A chain, and the other is the addition of two Arg residues at the end of the insulin B chain.
[0045] As used herein, “glargine A chain polypeptide,” is the sequence of insulin A chain with a substitution of Asn with Gly at position A21 at the end of the insulin A chain. In some embodiments, a glargine A chain polypeptide is a polypeptide having the amino acid sequence of SEQ ID NO:23, or a variant thereof having at least about 70% (e.g., about 75%, about 80%, about 85%, about 90%, about 95%, about 96%, about 97%, about 98% or about 99%) identity to amino acid of SEQ ID NO:23.
[0046] As used herein, “glargine B chain polypeptide,” is the sequence of insulin B chain with the addition of two Arg residues at the end of the insulin B chain. In some embodiments, a glargine B chain polypeptide is a polypeptide having the amino acid sequence of SEQ ID
301
WO 2022/094146 PCT/US2021/057142
NO:24, or a variant thereof having at least about 70% (e.g., about 75%, about 80%, about 85%, about 90%, about 95%, about 96%, about 97%, about 98% or about 99%) identity to amino acid of SEQ ID NO:24.
[0047] In some embodiments, the precursor protein is a fusion protein (e.g., a fusion protein comprising a target protein). In certain embodiments, the precursor protein is a fusion protein that comprises a tag (e.g., a His-tag) to enhance expression, folding and/or purification. In some embodiments, the precursor protein comprises a De novo Expression Enhancer Protein (DEEP) fusion tag. Examples of DEEP fusion tags that are suitable for inclusion in prohormones capable of being processed by the methods disclosed herein are described herein below, and in International Application No. PCT/US2018/044156, published as International Publication No. WO 2019/023616 Al, the contents of which are incorporated herein by reference in their entirety.
[0048] Fusion Proteins of the Disclosure
[0049] The present disclosure further provides, in various embodiments, a fusion protein comprising a DEEP fusion tag, a target protein, and at least one KEX2 protease cleavage site. [0050] The term “fusion protein” refers to a synthetic, semi -synthetic or recombinant single protein molecule that comprises all or a portion of two or more different proteins and/or peptides.
[0051] As used herein, the term “De novo Expression Enhancer Protein fusion tag” or “DEEP fusion tag” refers to a polypeptide having at least two (e.g., 2, 3, 4, 5 or 6) a-helices, wherein each a-helix comprises a binary patterned sequence of seven amino acid residues, or heptad sequence, defined by [PNPPNNP]n, where each “P” is independently selected from the polar amino acid residues Lys (K), His (H), Glu (E), Gin (Q), Asp (D), Asn (N), Thr (T) and Ser (S), each “N” is independently selected from the nonpolar amino acid residues Phe (F), Leu (L), He (I), Met (M), Vai (V) and Trp (W), and n is an integer from 2 to 10 (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10). In particular embodiments of a DEEP fusion tag, n = 3.
[0052] The heptad sequences in an a-helix containing more than one heptad sequence can be identical (i.e., repeats of the same heptad sequence) or they can be different (i.e., each PNPPNNP heptad sequence within the same a-helix can have a different amino acid sequence). Furthermore, the amino acid composition of the a-helices in a DEEP fusion tag can vary from helix to helix such that, for example, each of the a-helices in the tag will have a different amino acid sequence.
301
WO 2022/094146 PCT/US2021/057142
[0053] A DEEP fusion tag can also include additional amino acid residues, for example, N-terminal to the first a-helix and/or C-terminal to the last a-helix in the tag. Typically, a DEEP fusion tag will also include interhelical turns between the a-helices, wherein each interhelical turn includes, for example, 4, 5 or 6 amino acid residues encoded by the degenerate DNA codon VAN (V: A, G, or C; N: A, G, C, or T) (e.g., Gly (G), His (H), Gin (Q), Asn (N), Asp (D), Glu (E) and Lys (K)).
[0054] A DEEP fusion tag is generally at least about 70 amino acid residues in length (e.g., 74 amino acid residues). In a particular embodiment, a DEEP fusion tag is at least about 100 amino acid residues in length (e.g., 102 amino acid residues). Typically, a DEEP fusion tag is less than about 500 amino acid residues in length, for example, less than about 450 amino acid residues in length or less than about 420 amino acid residues in length.
[0055] In particular embodiments, a DEEP fusion tag includes a plurality of histidine residues that are exposed at the surface in a properly folded fusion protein. In a further embodiment, each a-helix in a DEEP fusion tag includes at least one (e.g., 1, 2, 3, 4, 5, 6, or more, for example, 12) histidine residue(s).
[0056] The fusion can be an TV-terminal fusion (with respect to the DEEP fusion tag), a C- terminal fusion (with respect to the DEEP fusion tag) or an internal fusion (with respect to the DEEP fusion tag and/or the target protein).
[0057] Examples of polypeptides that are suitable for use as DEEP fusion tags in the present disclosure, as well as methods of designing and making such polypeptides, are described in the following publications, each of which is incorporated by reference herein in its entirety:
• Zarzhitsky, Shlomo, et al. (2020), Harnessing synthetic biology to enhance heterologous protein expression. Protein Science 29, 1698-1706.
• Wei Y, Liu T, Sazinsky SL, Moffet DA, Pelczer I, and Hecht MH (2003), Stably folded de novo proteins from a designed combinatorial library. Protein Science 12, 92-102 (see, e.g., Figure 2, proteins designated 86, n86, S-23, S-213, S-285, S-824 and S-836);
• Kamtekar S, Schiffer JM, Xiong H, Babik JM & Hecht MH (1993), Protein Design by Binary Patterning of Polar and Non-Polar Amino Acids. Science 262, 1680-1685;
• Wei Y, Kim S, Fela D, Baum J, & Hecht MH (2003), Solution Structure of a De Novo Protein From a Designed Combinatorial Library. Proc. Natl. Acad. Sci. (USA) 100, 13270-13273;
301
WO 2022/094146 PCT/US2021/057142
• Hecht MH, Das A, Go A, Bradley LH & Wei Y (2004), De Novo Proteins from Designed Combinatorial Libraries. Protein Science 13. 1711-1723;
• Go A, Kim S, Baum J, & Hecht MH (2008), Structure and Dynamics of De novo Proteins from a Designed Superfamily of 4-Helix Bundles. Protein Science 17, 821- 832; and
• Bradley LH, Kleiner RE, Wang AF, Hecht MH & Wood DW (2005), An Intein-Based Genetic Selection Enables Construction of a High-Quality Library of Binary Patterned De Novo Sequences. Protein Engineering, Design & Selection (PEDS) 18, 201-207.
[0058] In particular embodiments, the DEEP fusion tag comprises, consists essentially of, or consists of (e.g., comprises) the amino acid sequence of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16 or SEQ ID NO: 17, or a functional fragment thereof, such as a fragment lacking the N-terminal methionine residue.
In other embodiments, the DEEP fusion tag comprises a variant amino acid sequence of any of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO:14, SEQ ID NO: 15, SEQ ID NO: 16 or SEQ ID NO: 17 having, for example, at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more identity to the amino acid sequence of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO:15, SEQ ID NO: 16 or SEQ ID NO: 17, respectively.
[0059] In some embodiments, the fusion protein further comprises at least one linker sequence. A variety of linker amino acid sequences are known in the art and can be used in the invention. In some embodiments, a linker sequence includes one or more amino acid residues selected from Gly, Ser, Thr, His, Asp, Glu, Asn, Gin, Lys and Arg. In some embodiments, a linker sequence includes a polyglycine sequence (e.g., a 6X glycine sequence). Other examples of linkers include GSAGSAAGSG (SEQ ID NO: 12), GGGGGGSR (SEQ ID NO: 13), KR and RR. In certain embodiments, a linker sequence includes a cleavage site.
301
WO 2022/094146 PCT/US2021/057142
[0060] In some embodiments, the fusion protein comprises at least one KEX2 cleavage site in the target protein. In some embodiments, the fusion protein additionally or alternatively comprises at least one KEX2 cleavage site in a linker sequence.
[0061] In certain embodiments, the fusion protein comprises, consists essentially of or consists of SEQ ID NO:27.
[0062] In some embodiments, the target protein comprises or consists of a hormone and the fusion protein comprises or consists of a prohormone.
[0063] In some embodiments, the target protein comprises insulin or an insulin analog.
[0064] In some embodiments, the fusion protein comprises proinsulin.
[0065] In particular embodiments, the target protein comprises glargine.
[0066] In certain embodiments, the fusion protein comprises proglargine.
[0067] In some embodiments, the fusion protein comprises or consists of SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, or SEQ ID NO:29, or a variant of any of the foregoing having at least about 70% amino acid sequence identity to one or more of SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, or SEQ ID NO:29.
[0068] In some embodiments, the fusion protein comprises or consists of SEQ ID NO:27.
[0069] The fusion proteins of the disclosure can be produced recombinantly or synthetically, using routine methods and reagents that are well known in the art. For example, a fusion protein of the disclosure can be produced recombinantly in a suitable host cell (e.g., bacteria, yeast, insect cells, mammalian cells) according to methods known in the art. See, e.g., Current Protocols in Molecular Biology, Second Edition, Ausubel et al. eds., John Wiley & Sons, 1992; and Molecular Cloning: a Laboratory Manual, 2nd edition, Sambrook et al., 1989, Cold Spring Harbor Laboratory Press. For example, a nucleic acid molecule comprising a nucleotide sequence encoding a fusion protein described herein can be introduced and expressed in suitable host cells (e.g., E. colt), and the expressed fusion protein can be isolated/purified from the host cells (e.g., in inclusion bodies) using routine methods and readily available reagents.
[0070] Methods for introducing DNA constructs encoding fusion proteins into host cells are well known in the art and include, for example, standard transformation and transfection techniques (e.g., electroporation, chemical transformation). A person of ordinary skill in the field of the disclosure can readily select an appropriate method for introducing a DNA construct into host cells.
301
WO 2022/094146 PCT/US2021/057142
[0071] A variety of methods for expressing proteins in host cells are well known in the art (e.g., IPTG-induced expression in E. colt). A person of ordinary skill in the field of the disclosure can readily select an appropriate method for expressing a fusion protein of the disclosure in host cells.
[0072] An expressed fusion protein can be isolated from host cells using known methods and reagents including, e.g., lysozyme treatment, sonication, filtration, salting-out, ultracentrifugation, and chromatography. A recombinantly-expressed fusion protein can be recovered from host cells and/or the host cell culture medium. Once released from the cells, the fusion protein can be purified from cell lysates by binding to an affinity resin using standard techniques and reagents. In a particular embodiment, the fusion protein is isolated by binding of the DEEP fusion tag in the fusion protein to an affinity resin (e.g., on a solid support). In some embodiments, the DEEP fusion tag comprises a plurality of surface- exposed histidine residues, enabling purification by methods typically used for His-tagged proteins. For example, a fusion protein of the disclosure can be isolated using immobilized metal ion affinity chromatography (IMAC). Suitable IMAC resins containing immobilized transition metals for IMAC applications are known in the art and are commercially available (e.g., TALON® Superfl ow™ resins, HisTrap™ High Performance resins, GE Healthcare Life Sciences), and include, e.g., immobilized nickel resins, immobilized cobalt resins, immobilized copper resins, and immobilized zinc resins. In a particular embodiment, a fusion protein of the disclosure is purified using an affinity resin comprising immobilized nickel ions.
[0073] Nucleic Acids, Vectors, Host Cells, and Kits of the Disclosure
[0074] The present disclosure further provides, in various embodiments, a polynucleotide encoding a fusion protein disclosed herein. In some embodiments, the polynucleotide is a DNA polynucleotide. In some embodiments, the polynucleotide is a RNA polynucleotide. The polynucleotide can be in the form of an insert (e.g., for cloning into a vector). The polynucleotide can be linear or circular. In some embodiments, the polynucleotide comprises one or more of a non-canonical nucleotide and a modified nucleotide (e.g., a nucleotide comprising a chemical modification). The polynucleotide can be isolated, recombinant, synthetic or semi -synthetic.
[0075] Although the genetic code is degenerate in that most amino acids are represented by several codons (called “synonyms” or “synonymous” codons), it is understood in the art
301
WO 2022/094146 PCT/US2021/057142 that codon usage by particular organisms is nonrandom and biased towards particular codon triplets. Accordingly, in a particular embodiment, a nucleic acid encoding a fusion protein of the disclosure includes a nucleotide sequence that has been optimized for expression in a particular type of host cell (e.g., through codon optimization). Codon optimization refers to a process in which a polynucleotide encoding a protein of interest is modified to replace particular codons in that polynucleotide with codons that encode the same amino acid(s), but are more commonly used/recognized in the host cell in which the nucleic acid is being expressed. In some embodiments, the polynucleotides encoding a fusion protein of the disclosure are codon optimized for expression in
coli.
[0076] The disclosure also provides, in various embodiments, a vector comprising a polynucleotide of the disclosure. In some embodiments, the vector is an expression vector. In certain embodiments, the vector is a viral vector (e.g., lentiviral vector, adenoviral vector, AAV). In other embodiments, the vector is a non-viral vector (e.g., plasmid, cloning vector). A variety of vectors, including expression vectors, viral vectors and non-viral vectors are known in the art and are commercially available.
[0077] The disclosure further provides, in various embodiments, a host cell comprising a polynucleotide disclosed herein or a vector disclosed herein. As used herein, the term “host cell” refers to a suitable host for expressing a nucleic acid encoding a fusion protein comprising a DEEP fusion tag. In some embodiments, the host cells are cells that have been transformed or transfected with vectors constructed using recombinant DNA techniques known in the art. Examples of suitable host cells include yeast cells (e.g., Pichia pastoris and Saccharomyces cerevisiae), insect cells (e.g., Spodoptera frugiperda Sf9 cells), mammalian cells (e.g., CHO cells), and bacterial cells (e.g., E. coli and B. sublHis Agrobacterium tumefaciens). Further examples of suitable host cells include plant cells (e.g., Nicotiana benlhamiana . In a particular aspect, the host cell is E. coli.
[0078] In various embodiments, the disclosure additional provides a kit comprising one or more polynucleotides comprising a nucleotide sequence encoding a DEEP fusion tag and a cloning site for introducing a nucleotide sequence encoding a target protein to form a fusion protein, and a KEX2 protease comprising an amino acid sequence having at least 70% sequence identity to SEQ ID NO: 1 (e.g., in separate containers). In some embodiments, the KEX2 protease comprises or consists of SEQ ID NO:1.
301
WO 2022/094146 PCT/US2021/057142
[0079] In some embodiments, the polynucleotide further comprises a sequence encoding a peptide linker, a sequence providing a KEX2 protease cleavage site, or a combination thereof.
[0080] In certain embodiments, the kit further comprises instructions for use.
Exemplification
[0081] The following materials and methods were used in the experiments described in FIGs. 1 and 2 herein.
[0082] Construction of plasmids and strains
[0083] A synthetic gene encoding a DEEP-insulin glargine fusion protein (SEQ ID NO:27) was constructed using E. coll codon optimized gBlocks (Integrated DNA Technologies), and amplified with forward and reverse primers containing Xbal and Hindlll restriction sites.
[0084] Using standard genetic cloning techniques, the sequence encoding DEEP -glargine was introduced into pET30 vector carrying kanamycin resistance to yield pET30DEEP- glargine. For overexpression, the plasmids were transformed into competent BL21DE3 cells and plated on LB agar supplemented with 30 mg/L kanamycin. The next day, a fresh colony was picked and inoculated into 5mL of LB kanamycin, and incubated overnight at 37°C with continuous shaking at 200 rpm. 4 ml of the overnight culture were inoculated into IL of LB supplemented with kanamycin. The culture was induced with 0.5mM IPTG, after a sample of the culture measured an OD600 of 0.8, the culture was further incubated for 3h. Cells were collected using a centrifuge, operating at 4,500xg. Cell pellets were kept at -80°C until further use.
[0085] Lysis and Extraction of Inclusion Bodies (IBs)
[0086] Cell pellets were resuspended in lysis buffer containing 50mM Tris, 300mM NaCl (TBS), 4mM EDTA and lysed using Emulsiflex. Immediately after lysis the samples were spundown at 15,000xg for 30 min. The supernatant was discarded, and the pellet containing inclusion bodies was washed 3 times with: 1. TBS, 1% Triton, 2M urea, 4mM EDTA, 2. TBS, and 3.milli-q water. The washed inclusion bodies were eventually resuspended at 4°C in a buffer containing 8M urea, 50mM glycine pH=10.5. After overnight incubation, the resuspended inclusion bodies were spun down at 35,000xg for 30min. Lysis and protein extraction from inclusion bodies was performed with solutions either kept on ice or at 4°C. To determine protein concentration, an aliquot of resuspended IBs was fully reduced and
301
WO 2022/094146 PCT/US2021/057142 denatured by xlO dilution into 8M GdnHCl lOOmM DTT. After 20min of incubation at room temperature the reactions were quenched with acetonitrile and TFA, at final concentrations of 10% and 0.5% (v/v), respectively. This fully reduced and denatured protein sample was loaded on the HPLC equipped with Zorbax C-18 analytical column (Agilent). The concentration of the protein was calculated based on the area under the curve (AUC) using the following equation:
[0088] where n is the moles of protein in the peak, F is the flow rate, a is the molar extinction coefficient, and d is the pathlength.
[0089] Refolding
[0090] Refolding was performed by the rapid dilution method. The concentration of the protein in resuspended inclusion bodies was adjusted to -5-10 mg/ml, and further diluted 10- 20 times into the refolding buffer. The refolding buffer was 50mM glycine pH=10.5 supplemented with different ratios of P-mercaptoethanol to protein’s cys residues. The refolding was performed with solutions pre-chilled on ice or at 4°C. Analysis of the refolding process was performed using an HPLC. Before loading the samples, the reactions were quenched with acetonitrile and TFA, at final concentrations of 10% and 0.5% (v/v), respectively. Refolding yields were calculated based on AUC of the refolded protein divided by the AUC of the fully reduced and denatured one.
[0091] Purification and KEX2 digest
[0092] After 48h of refolding at 4°C, the solution was supplemented with 50mM tris and 300mM NaCl (final concentration), ImM oxidized glutathione, and the pH was adjusted to 8. The protein solution was purified on the FPLC (AKTA pure) equipped with a HisTrap column, buffer A (50mM tris, 300mM NaCl pH8), and buffer B same as A supplemented with 500mM imidazole at pH8. Fractions containing the elution peak were combined, and immediately desalted using PD-10 column into lOOmM tris pH8. Aliquots of protein solution were supplemented with various concentrations of CaC12 and KEX2 (PeproTech). The cleavage reaction was incubated at 37°C, monitoring the progress using RP-HPLC equipped with C18 column (Zorbax, Agilent). Cleaved insulin glargine was precipitated by the addition of lOmM ZnSO4, and adjusting the pH to 6. Instantly the solution became cloudy, and was further incubated for 12h at 4°C with gentle stirring. The precipitated protein was spun down at 5000Xg, for 15min and the pellet was dissolved in 6M GdnHCl, 20% MeCN, 1% TFA. The protein sample was further purified on the HPLC equipped with a semi-prep Cl 8
301
WO 2022/094146 PCT/US2021/057142 column. The eluting fractions containing insulin were collected, lyophilized, and stored at - 80°C until further use.
[0093] Purification of protein
[0094] DEEP-glargine protein was expressed, extracted and purified as described above. The combined fractions of the Ni-IMAC peak (~7mg/ml in 7.5ml) were further desalted using PD10, into lOOmM Tris pH = 8 (~5mg/ml in 10.5 ml). lOOul samples were supplemented with CaC12 and KEX2 (Peprotech, US) (FIG. 2). The samples were further incubated at 37°C overnight.
[0095] Results
[0096] To verify experimentally the methods disclosed herein, insulin glargine was expressed as a prohormone comprising a DEEP fusion tag, referred to herein as “DEEP- Glargine,” and having the amino acid sequence of SEQ ID NO:27, and converted in vitro to mature glargine. Specifically, DEEP-Glargine, in which the C-chain of proglargine was replaced with a DEEP protein, was expressed in E. coh. extracted, refolded, and converted to mature glargine in vitro by incubation with a recombinant KEX2 protease as described above. The conversion of DEEP-Glargine using KEX2 protease was observed to proceed to near completion without generating any detectable cleavage byproducts.
[0097] The teachings of all patents, published applications and references cited herein are incorporated by reference in their entirety.
[0098] While example embodiments have been particularly shown and described, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the scope of the embodiments encompassed by the disclosure herein.
Claims
1. A method of converting a precursor protein to a mature protein comprising: providing a precursor protein that is to be converted to a mature protein, wherein the precursor protein comprises at least one cleavage site for a KEX2 protease; and contacting the precursor protein with a KEX2 protease comprising an amino acid sequence having at least 70% sequence identity to SEQ ID NO: 1 under conditions in which cleavage of the precursor protein by the KEX2 protease occurs, thereby converting the precursor protein to a mature protein.
2. A method of obtaining a target protein, comprising: providing a precursor protein that comprises a target protein and at least one cleavage site for a KEX2 protease; contacting the precursor protein with a KEX2 protease comprising an amino acid sequence having at least 70% sequence identity to SEQ ID NO: 1 under conditions in which cleavage of the precursor protein by the KEX2 protease occurs and releases the target protein; and isolating the target protein, thereby obtaining the target protein.
3. The method of claim 1 or 2, wherein the precursor protein is a prohormone.
4. The method of claim 3, wherein the prohormone is a prohormone of insulin or an analog thereof.
5. The method of claim 3, wherein the prohormone is a prohormone of glargine.
6. The method of any one of claims 1-5, wherein the precursor protein is a fusion protein.
7. The method of claim 6, wherein the fusion protein comprises a DEEP fusion tag.
- 21 -
The method of claim 7, wherein the DEEP fusion tag comprises an amino acid sequence having at least 70% sequence identity to one or more of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NOTO, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO:15, SEQ ID NO: 16 or SEQ ID NO: 17. The method of any one of claims 6-8, wherein the fusion protein further comprises at least one linker sequence. The method of claim 9, wherein at least one KEX2 cleavage site is in the linker sequence. The method of any one of claims 2-9, wherein at least one KEX2 cleavage site is in the target protein. The method of any one of claims 1-11, wherein the precursor protein comprises SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, or SEQ ID NO:29. The method of any one of claims 1-11, wherein the precursor protein comprises SEQ ID NO:27. The method of any one of claims 1-13, wherein the method is performed in vitro. A fusion protein comprising a DEEP fusion tag, a target protein, and at least one KEX2 protease cleavage site. The fusion protein of claim 15, wherein the fusion protein further comprises at least one linker sequence. The fusion protein of claim 15 or 16, wherein at least one KEX2 cleavage site is in the target protein.
The fusion protein of any one of claims 15-17, wherein at least one KEX2 cleavage site is in the linker sequence. The fusion protein of claim 15, 16 or 17, comprising SEQ ID NO:27. The fusion protein of any one of claims 15-19, wherein the target protein comprises a hormone and wherein the fusion protein comprises a prohormone. The fusion protein of any one of claims 15-20, wherein the target protein comprises insulin or an insulin analog. The fusion protein of any one of claims 15-21, wherein the fusion protein comprises proinsulin. The fusion protein of any one of claims 15-20, wherein the target protein comprises glargine. The fusion protein of any one of claims 15-20 and 23, wherein the fusion protein comprises proglargine. A polynucleotide encoding the fusion protein of any one of claims 15-24. The polynucleotide of claim 25, wherein the polynucleotide is a DNA polynucleotide. A vector comprising the polynucleotide of claim 25 or 26. A host cell comprising the polynucleotide of claim 25 or 26, or the vector of claim 27. A KEX2 protease for use in the method of any one of claims 1-14. A target protein obtained by the method of any one of claims 2-14.
101
WO 2022/094146 PCT/US2021/057142 A kit comprising: a polynucleotide comprising a nucleotide sequence encoding a DEEP fusion tag and a cloning site for introducing a nucleotide sequence encoding a target protein to form a fusion protein; and a KEX2 protease comprising an amino acid sequence having at least 70% sequence identity to SEQ ID NO: 1. The kit of claim 31, wherein the polynucleotide further comprises a sequence encoding a peptide linker. The kit of claim 31 or 32, wherein the polynucleotide further comprises a sequence encoding a KEX2 protease cleavage site. The method of any one of claims 1-14, or the kit of any one of claims 31-33, wherein the KEX2 protease comprises SEQ ID NO: 1.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202063106676P | 2020-10-28 | 2020-10-28 | |
US63/106,676 | 2020-10-28 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022094146A1 true WO2022094146A1 (en) | 2022-05-05 |
Family
ID=78771190
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2021/057142 WO2022094146A1 (en) | 2020-10-28 | 2021-10-28 | Methods of converting precursor proteins to mature proteins using kex2 proteases |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2022094146A1 (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008062010A2 (en) * | 2006-11-22 | 2008-05-29 | Novo Nordisk A/S | Method for making activated carboxypeptidases |
US20150118710A1 (en) * | 2012-03-29 | 2015-04-30 | Biocon Limited | Secretion of functional insulin glargine directly into the culture medium through over expression of kex2p intracellularly |
US20180194827A1 (en) | 2013-03-14 | 2018-07-12 | Indiana University Research And Technology Corporation | Insulin-incretin conjugates |
WO2019023616A1 (en) | 2017-07-28 | 2019-01-31 | The Trustees Of Princeton University | Fusion tags for recombinant protein expression |
-
2021
- 2021-10-28 WO PCT/US2021/057142 patent/WO2022094146A1/en active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008062010A2 (en) * | 2006-11-22 | 2008-05-29 | Novo Nordisk A/S | Method for making activated carboxypeptidases |
US20150118710A1 (en) * | 2012-03-29 | 2015-04-30 | Biocon Limited | Secretion of functional insulin glargine directly into the culture medium through over expression of kex2p intracellularly |
US20180194827A1 (en) | 2013-03-14 | 2018-07-12 | Indiana University Research And Technology Corporation | Insulin-incretin conjugates |
WO2019023616A1 (en) | 2017-07-28 | 2019-01-31 | The Trustees Of Princeton University | Fusion tags for recombinant protein expression |
Non-Patent Citations (22)
Title |
---|
"Current Protocols in Molecular Biology", 1992, JOHN WILEY & SONS |
"NCBI", Database accession no. NP 014161.1 |
"UniProt", Database accession no. P01308 |
ALTSCHUL ET AL., J. MOL. BIOL., vol. 215, 1990, pages 403 |
AUSUBEL ET AL., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY |
BRADLEY LHKLEINER REWANG AFHECHT MHWOOD DW: "An Intein-Based Genetic Selection Enables Construction of a High-Quality Library of Binary Patterned De Novo Sequences", PROTEIN ENGINEERING, DESIGN & SELECTION (PEDS, vol. 18, 2005, pages 201 - 207 |
GO AKIM SBAUM JHECHT MH: "Structure and Dynamics of De novo Proteins from a Designed Superfamily of 4-Helix Bundles", PROTEIN SCIENCE, vol. 17, 2008, pages 821 - 832 |
GREENWUTS: "Protecting Groups in Organic Synthesis", 1991, JOHN WILEY AND SONS |
HECHT MHDAS AGO ABRADLEY LHWEI Y: "De Novo Proteins from Designed Combinatorial Libraries", PROTEIN SCIENCE, vol. 13, 2004, pages 1711 - 1723 |
HENIKOFFHENIKOFF, PROC. NATL. ACAD. SCI. USA, vol. 89, 1989, pages 10915 |
KAMTEKAR SSCHIFFER JMXIONG HBABIK JMHECHT MH: "Protein Design by Binary Patterning of Polar and Non-Polar Amino Acids", SCIENCE, vol. 262, 1993, pages 1680 - 1685, XP001182910, DOI: 10.1126/science.8259512 |
MATHIEU, C.GILLARD, P.BENHALIMA, K., NATURE REVIEWS ENDOCRINOLOGY, vol. 13, 2017, pages 385 - 399 |
NEEDLEMANWUNSCH, J. MOL. BIOL., vol. 48, 1970, pages 443 |
PEARSONLIPMAN, PROC. NAT'L. ACAD. SCI. USA, vol. 85, 1988, pages 2444 |
SAMBROOK ET AL.: "Molecular Cloning: a Laboratory Manual", COLD SPRING HARBOR LABORATORY PRESS |
SMITHWATERMAN, ADV. APPL. MATH., vol. 2, 1981, pages 482 |
SREENIVAS SUMA ET AL: "Enhancement in production of recombinant two-chain Insulin Glargine by over-expression of Kex2 protease inPichia pastoris", APPLIED MICROBIOLOGY AND BIOTECHNOLOGY, SPRINGER BERLIN HEIDELBERG, BERLIN/HEIDELBERG, vol. 99, no. 1, 20 September 2014 (2014-09-20), pages 327 - 336, XP035422293, ISSN: 0175-7598, [retrieved on 20140920], DOI: 10.1007/S00253-014-6052-5 * |
WEI YKIM SFELA DBAUM JHECHT MH: "Solution Structure of a De Novo Protein From a Designed Combinatorial Library", PROC. NATL. ACAD. SCI. (USA), vol. 100, 2003, pages 13270 - 13273 |
WEI YLIU TSAZINSKY SLMOFFET DAPELCZER IHECHT MH: "Stably folded de novo proteins from a designed combinatorial library", PROTEIN SCIENCE, vol. 12, 2003, pages 92 - 102 |
ZARZHITSKY SHLOMO ET AL: "Harnessing synthetic biology to enhance heterologous protein expression", PROTEIN SCIENCE, vol. 29, no. 8, 13 July 2020 (2020-07-13), US, pages 1698 - 1706, XP055877135, ISSN: 0961-8368, Retrieved from the Internet <URL:https://onlinelibrary.wiley.com/doi/full-xml/10.1002/pro.3907> DOI: 10.1002/pro.3907 * |
ZARZHITSKY, SHLOMO ET AL.: "Harnessing synthetic biology to enhance heterologous protein expression", PROTEIN SCIENCE, vol. 29, 2020, pages 1698 - 1706 |
ZAYKOV, A.N.MAYER, J.P.DIMARCHI, R.D., NATURE REVIEWS DRUG DISCOVERY, vol. 15, 2016, pages 425 - 439 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8796431B2 (en) | Efficient production of peptides | |
WO2013142859A2 (en) | Fusion proteins of superfolder green fluorescent protein and use thereof | |
Liew et al. | Preparation of recombinant thioredoxin fused N-terminal proCNP: analysis of enterokinase cleavage products reveals new enterokinase cleavage sites | |
US20160083713A1 (en) | Novel peptidyl alpha-hydroxyglycine alpha-amidating lyases | |
AU2018307960B2 (en) | Fusion tags for recombinant protein expression | |
US7892787B2 (en) | Method for production of recombinant growth hormone in form of hybrid protein | |
EP4067373A1 (en) | Multifunctional multispecific multimeric biomolecule polymer having prolonged in-vivo duration | |
WO2022094146A1 (en) | Methods of converting precursor proteins to mature proteins using kex2 proteases | |
WO2020187270A1 (en) | Fusion protein containing fluorescent protein fragments and uses thereof | |
JP2003511014A5 (en) | ||
CN103998606B (en) | The Enterpeptidase light chain of modification | |
WO2019143193A9 (en) | N-terminal fusion partner for producing recombinant polypeptide, and method for producing recombinant polypeptide using same | |
CN112105635A (en) | Leader sequences for higher expression of recombinant proteins | |
CN111073925B (en) | High-efficiency polypeptide-polypeptide coupling system and method based on disordered protein coupling enzyme | |
Park et al. | Optimization of the hydroxylamine cleavage of an expressed fusion protein to produce a recombinant antimicrobial peptide | |
CA2529282C (en) | Recombinant igf expression systems | |
WO2009005973A2 (en) | Synthetic gene for enhanced expression in e.coli | |
KR100305726B1 (en) | The Method for Expressing Polypeptides in E.coli Using Carboxypeptidase Y propeptide as a Fusion partner, Recombinant Vector Used thereby and Transformant thereof | |
Oh et al. | Use of carboxypeptidase Y propeptide as a fusion partner for expression of small polypeptides in Escherichia coli |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21814998 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 21814998 Country of ref document: EP Kind code of ref document: A1 |