CN102676563B - Method for preparing human serum albumin-human parathyroid hormone - Google Patents
Method for preparing human serum albumin-human parathyroid hormone Download PDFInfo
- Publication number
- CN102676563B CN102676563B CN 201210134869 CN201210134869A CN102676563B CN 102676563 B CN102676563 B CN 102676563B CN 201210134869 CN201210134869 CN 201210134869 CN 201210134869 A CN201210134869 A CN 201210134869A CN 102676563 B CN102676563 B CN 102676563B
- Authority
- CN
- China
- Prior art keywords
- yps1
- seq
- ser
- leu
- ala
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 108090000445 Parathyroid hormone Proteins 0.000 title claims abstract description 34
- 102000003982 Parathyroid hormone Human genes 0.000 title claims abstract description 32
- 239000000199 parathyroid hormone Substances 0.000 title claims abstract description 32
- 238000000034 method Methods 0.000 title claims abstract description 26
- 210000002966 serum Anatomy 0.000 title claims abstract description 18
- 108091006905 Human Serum Albumin Proteins 0.000 title claims abstract description 17
- 102000008100 Human Serum Albumin Human genes 0.000 title claims abstract description 17
- 229960001319 parathyroid hormone Drugs 0.000 title claims abstract description 12
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 35
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims abstract description 31
- 101150008621 YPS1 gene Proteins 0.000 claims abstract description 20
- 101100166113 Candida albicans (strain SC5314 / ATCC MYA-2876) SAP9 gene Proteins 0.000 claims abstract description 18
- 101000788159 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) Aspartic proteinase 3 Proteins 0.000 claims abstract description 14
- 241000235648 Pichia Species 0.000 claims description 27
- 230000004927 fusion Effects 0.000 claims description 17
- 230000003321 amplification Effects 0.000 claims description 13
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 13
- 230000002950 deficient Effects 0.000 claims description 12
- 108091008146 restriction endonucleases Proteins 0.000 claims description 12
- 238000011144 upstream manufacturing Methods 0.000 claims description 11
- 230000006801 homologous recombination Effects 0.000 claims description 9
- 238000002744 homologous recombination Methods 0.000 claims description 9
- 239000013598 vector Substances 0.000 claims description 9
- 108010084455 Zeocin Proteins 0.000 claims description 8
- CWCMIVBLVUHDHK-ZSNHEYEWSA-N phleomycin D1 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC[C@@H](N=1)C=1SC=C(N=1)C(=O)NCCCCNC(N)=N)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C CWCMIVBLVUHDHK-ZSNHEYEWSA-N 0.000 claims description 8
- 108091005804 Peptidases Proteins 0.000 claims description 7
- 230000002797 proteolythic effect Effects 0.000 claims description 6
- 238000013461 design Methods 0.000 claims description 5
- 239000013604 expression vector Substances 0.000 claims description 5
- 239000004365 Protease Substances 0.000 claims description 4
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 claims description 4
- 230000008521 reorganization Effects 0.000 claims description 4
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 3
- 238000012216 screening Methods 0.000 claims description 3
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 claims description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 claims description 2
- 230000029087 digestion Effects 0.000 claims description 2
- 238000012163 sequencing technique Methods 0.000 claims description 2
- 238000001890 transfection Methods 0.000 claims description 2
- 101100268670 Caenorhabditis elegans acc-3 gene Proteins 0.000 claims 1
- 102000004169 proteins and genes Human genes 0.000 abstract description 23
- 230000008569 process Effects 0.000 abstract description 5
- 238000000746 purification Methods 0.000 abstract description 5
- 238000000926 separation method Methods 0.000 abstract description 4
- 230000015556 catabolic process Effects 0.000 abstract description 3
- 238000006731 degradation reaction Methods 0.000 abstract description 3
- 238000004519 manufacturing process Methods 0.000 abstract description 3
- 208000001132 Osteoporosis Diseases 0.000 abstract description 2
- 102000037865 fusion proteins Human genes 0.000 abstract 2
- 108020001507 fusion proteins Proteins 0.000 abstract 2
- 101001091368 Rattus norvegicus Glandular kallikrein-7, submandibular/renal Proteins 0.000 abstract 1
- 101000898773 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) Saccharopepsin Proteins 0.000 abstract 1
- 229940079593 drug Drugs 0.000 abstract 1
- 239000003814 drug Substances 0.000 abstract 1
- 238000005215 recombination Methods 0.000 abstract 1
- 230000006798 recombination Effects 0.000 abstract 1
- 230000001131 transforming effect Effects 0.000 abstract 1
- 241000282326 Felis catus Species 0.000 description 25
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 22
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 18
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 18
- 235000018102 proteins Nutrition 0.000 description 17
- 108020004414 DNA Proteins 0.000 description 16
- 102000004190 Enzymes Human genes 0.000 description 14
- 108090000790 Enzymes Proteins 0.000 description 14
- 150000001413 amino acids Chemical group 0.000 description 13
- 229940024606 amino acid Drugs 0.000 description 12
- 235000001014 amino acid Nutrition 0.000 description 12
- OGBMKVWORPGQRR-UMXFMPSGSA-N teriparatide Chemical compound C([C@H](NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)[C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 OGBMKVWORPGQRR-UMXFMPSGSA-N 0.000 description 12
- LWIHDJKSTIGBAC-UHFFFAOYSA-K tripotassium phosphate Chemical compound [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 12
- 239000000047 product Substances 0.000 description 10
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 9
- 229930003756 Vitamin B7 Natural products 0.000 description 9
- 229910052757 nitrogen Inorganic materials 0.000 description 9
- 239000011735 vitamin B7 Substances 0.000 description 9
- 235000011912 vitamin B7 Nutrition 0.000 description 9
- 101710180012 Protease 7 Proteins 0.000 description 8
- 239000001888 Peptone Substances 0.000 description 7
- 108010080698 Peptones Proteins 0.000 description 7
- 229940041514 candida albicans extract Drugs 0.000 description 7
- 238000006243 chemical reaction Methods 0.000 description 7
- 239000007788 liquid Substances 0.000 description 7
- 235000019319 peptone Nutrition 0.000 description 7
- 239000012138 yeast extract Substances 0.000 description 7
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 6
- 101150047086 arm gene Proteins 0.000 description 6
- 210000004027 cell Anatomy 0.000 description 6
- 230000008859 change Effects 0.000 description 6
- 238000001962 electrophoresis Methods 0.000 description 6
- 229930182817 methionine Natural products 0.000 description 6
- 229940093916 potassium phosphate Drugs 0.000 description 6
- 229910000160 potassium phosphate Inorganic materials 0.000 description 6
- 235000011009 potassium phosphates Nutrition 0.000 description 6
- MEIRRNXMZYDVDW-MQQKCMAXSA-N (2E,4E)-2,4-hexadien-1-ol Chemical compound C\C=C\C=C\CO MEIRRNXMZYDVDW-MQQKCMAXSA-N 0.000 description 5
- 241000235058 Komagataella pastoris Species 0.000 description 5
- 230000009977 dual effect Effects 0.000 description 5
- 238000011534 incubation Methods 0.000 description 5
- 239000002609 medium Substances 0.000 description 5
- 229920000936 Agarose Polymers 0.000 description 4
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 4
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 4
- 241000880493 Leptailurus serval Species 0.000 description 4
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 230000005611 electricity Effects 0.000 description 4
- 239000012634 fragment Substances 0.000 description 4
- 239000008103 glucose Substances 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- 235000015097 nutrients Nutrition 0.000 description 4
- 239000013612 plasmid Substances 0.000 description 4
- 229920001817 Agar Polymers 0.000 description 3
- 101000925662 Enterobacteria phage PRD1 Endolysin Proteins 0.000 description 3
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 3
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 3
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 3
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 3
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 3
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 3
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 3
- XUYPXLNMDZIRQH-LURJTMIESA-N N-acetyl-L-methionine Chemical compound CSCC[C@@H](C(O)=O)NC(C)=O XUYPXLNMDZIRQH-LURJTMIESA-N 0.000 description 3
- 102000035195 Peptidases Human genes 0.000 description 3
- 101100184165 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) MKC7 gene Proteins 0.000 description 3
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 3
- 239000008272 agar Substances 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 239000012530 fluid Substances 0.000 description 3
- 238000010353 genetic engineering Methods 0.000 description 3
- 229960002989 glutamic acid Drugs 0.000 description 3
- 235000011187 glycerol Nutrition 0.000 description 3
- 230000013595 glycosylation Effects 0.000 description 3
- 238000006206 glycosylation reaction Methods 0.000 description 3
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 3
- 229960000310 isoleucine Drugs 0.000 description 3
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 3
- 238000002156 mixing Methods 0.000 description 3
- 238000001426 native polyacrylamide gel electrophoresis Methods 0.000 description 3
- 238000003259 recombinant expression Methods 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 2
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 2
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 2
- 101100219555 Candida albicans (strain SC5314 / ATCC MYA-2876) SAP10 gene Proteins 0.000 description 2
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 2
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 2
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 2
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 2
- 102000003960 Ligases Human genes 0.000 description 2
- 108090000364 Ligases Proteins 0.000 description 2
- 239000006142 Luria-Bertani Agar Substances 0.000 description 2
- CFVQPNSCQMKDPB-CIUDSAMLSA-N Lys-Cys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N CFVQPNSCQMKDPB-CIUDSAMLSA-N 0.000 description 2
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 2
- 101100160515 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) YPS1 gene Proteins 0.000 description 2
- 101100160516 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) YPS3 gene Proteins 0.000 description 2
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 2
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 2
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 2
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 2
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 2
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 2
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 108010005233 alanylglutamic acid Proteins 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 235000003704 aspartic acid Nutrition 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 239000012228 culture supernatant Substances 0.000 description 2
- 108010004073 cysteinylcysteine Proteins 0.000 description 2
- 238000013467 fragmentation Methods 0.000 description 2
- 238000006062 fragmentation reaction Methods 0.000 description 2
- 108010049041 glutamylalanine Proteins 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 238000011081 inoculation Methods 0.000 description 2
- JVTAAEKCZFNVCJ-UHFFFAOYSA-N lactic acid Chemical compound CC(O)C(O)=O JVTAAEKCZFNVCJ-UHFFFAOYSA-N 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 2
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 230000017854 proteolysis Effects 0.000 description 2
- 230000028327 secretion Effects 0.000 description 2
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- NNRFRJQMBSBXGO-CIUDSAMLSA-N (3s)-3-[[2-[[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]amino]acetyl]amino]-4-[[(1s)-1-carboxy-2-hydroxyethyl]amino]-4-oxobutanoic acid Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NNRFRJQMBSBXGO-CIUDSAMLSA-N 0.000 description 1
- 101150084750 1 gene Proteins 0.000 description 1
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 1
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 1
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- YEVZMOUUZINZCK-LKTVYLICSA-N Ala-Glu-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O YEVZMOUUZINZCK-LKTVYLICSA-N 0.000 description 1
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 1
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- 108010025188 Alcohol oxidase Proteins 0.000 description 1
- 102100036826 Aldehyde oxidase Human genes 0.000 description 1
- MCYJBCKCAPERSE-FXQIFTODSA-N Arg-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N MCYJBCKCAPERSE-FXQIFTODSA-N 0.000 description 1
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 1
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 1
- NXDXECQFKHXHAM-HJGDQZAQSA-N Arg-Glu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NXDXECQFKHXHAM-HJGDQZAQSA-N 0.000 description 1
- UPKMBGAAEZGHOC-RWMBFGLXSA-N Arg-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O UPKMBGAAEZGHOC-RWMBFGLXSA-N 0.000 description 1
- LCBSSOCDWUTQQV-SDDRHHMPSA-N Arg-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LCBSSOCDWUTQQV-SDDRHHMPSA-N 0.000 description 1
- FOQFHANLUJDQEE-GUBZILKMSA-N Arg-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CS)C(=O)O FOQFHANLUJDQEE-GUBZILKMSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 1
- BHQQRVARKXWXPP-ACZMJKKPSA-N Asn-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BHQQRVARKXWXPP-ACZMJKKPSA-N 0.000 description 1
- VWJFQGXPYOPXJH-ZLUOBGJFSA-N Asn-Cys-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)C(=O)N VWJFQGXPYOPXJH-ZLUOBGJFSA-N 0.000 description 1
- LUVODTFFSXVOAG-ACZMJKKPSA-N Asn-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N LUVODTFFSXVOAG-ACZMJKKPSA-N 0.000 description 1
- PQAIOUVVZCOLJK-FXQIFTODSA-N Asn-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PQAIOUVVZCOLJK-FXQIFTODSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 1
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 1
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 1
- MYVBTYXSWILFCG-BQBZGAKWSA-N Asn-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N MYVBTYXSWILFCG-BQBZGAKWSA-N 0.000 description 1
- OMSMPWHEGLNQOD-UWVGGRQHSA-N Asn-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OMSMPWHEGLNQOD-UWVGGRQHSA-N 0.000 description 1
- RTFWCVDISAMGEQ-SRVKXCTJSA-N Asn-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N RTFWCVDISAMGEQ-SRVKXCTJSA-N 0.000 description 1
- UOUHBHOBGDCQPQ-IHPCNDPISA-N Asn-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CC(=O)N)N UOUHBHOBGDCQPQ-IHPCNDPISA-N 0.000 description 1
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 1
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 1
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 1
- NURJSGZGBVJFAD-ZLUOBGJFSA-N Asp-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O NURJSGZGBVJFAD-ZLUOBGJFSA-N 0.000 description 1
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 1
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 1
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 1
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 1
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 1
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 1
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 1
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 1
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 102000005572 Cathepsin A Human genes 0.000 description 1
- 108010059081 Cathepsin A Proteins 0.000 description 1
- QFMCHXSGIZPBKG-ZLUOBGJFSA-N Cys-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N QFMCHXSGIZPBKG-ZLUOBGJFSA-N 0.000 description 1
- HYKFOHGZGLOCAY-ZLUOBGJFSA-N Cys-Cys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O HYKFOHGZGLOCAY-ZLUOBGJFSA-N 0.000 description 1
- LWTTURISBKEVAC-CIUDSAMLSA-N Cys-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N LWTTURISBKEVAC-CIUDSAMLSA-N 0.000 description 1
- MUZAUPFGPMMZSS-GUBZILKMSA-N Cys-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N MUZAUPFGPMMZSS-GUBZILKMSA-N 0.000 description 1
- LYSHSHHDBVKJRN-JBDRJPRFSA-N Cys-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CS)N LYSHSHHDBVKJRN-JBDRJPRFSA-N 0.000 description 1
- PDRMRVHPAQKTLT-NAKRPEOUSA-N Cys-Ile-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O PDRMRVHPAQKTLT-NAKRPEOUSA-N 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- UIKLEGZPIOXFHJ-DLOVCJGASA-N Cys-Phe-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O UIKLEGZPIOXFHJ-DLOVCJGASA-N 0.000 description 1
- JLZCAZJGWNRXCI-XKBZYTNZSA-N Cys-Thr-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O JLZCAZJGWNRXCI-XKBZYTNZSA-N 0.000 description 1
- KXHAPEPORGOXDT-UWJYBYFXSA-N Cys-Tyr-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O KXHAPEPORGOXDT-UWJYBYFXSA-N 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 241000588722 Escherichia Species 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 108010010803 Gelatin Proteins 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- REJJNXODKSHOKA-ACZMJKKPSA-N Gln-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N REJJNXODKSHOKA-ACZMJKKPSA-N 0.000 description 1
- KZKBJEUWNMQTLV-XDTLVQLUSA-N Gln-Ala-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZKBJEUWNMQTLV-XDTLVQLUSA-N 0.000 description 1
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 1
- INFBPLSHYFALDE-ACZMJKKPSA-N Gln-Asn-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O INFBPLSHYFALDE-ACZMJKKPSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- ODBLJLZVLAWVMS-GUBZILKMSA-N Gln-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N ODBLJLZVLAWVMS-GUBZILKMSA-N 0.000 description 1
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 1
- IXFVOPOHSRKJNG-LAEOZQHASA-N Gln-Asp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IXFVOPOHSRKJNG-LAEOZQHASA-N 0.000 description 1
- QFTRCUPCARNIPZ-XHNCKOQMSA-N Gln-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N)C(=O)O QFTRCUPCARNIPZ-XHNCKOQMSA-N 0.000 description 1
- IKFZXRLDMYWNBU-YUMQZZPRSA-N Gln-Gly-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N IKFZXRLDMYWNBU-YUMQZZPRSA-N 0.000 description 1
- MTCXQQINVAFZKW-MNXVOIDGSA-N Gln-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MTCXQQINVAFZKW-MNXVOIDGSA-N 0.000 description 1
- HHQCBFGKQDMWSP-GUBZILKMSA-N Gln-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HHQCBFGKQDMWSP-GUBZILKMSA-N 0.000 description 1
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 1
- MLCPTRRNICEKIS-FXQIFTODSA-N Glu-Asn-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLCPTRRNICEKIS-FXQIFTODSA-N 0.000 description 1
- MXPBQDFWIMBACQ-ACZMJKKPSA-N Glu-Cys-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O MXPBQDFWIMBACQ-ACZMJKKPSA-N 0.000 description 1
- ZXQPJYWZSFGWJB-AVGNSLFASA-N Glu-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N ZXQPJYWZSFGWJB-AVGNSLFASA-N 0.000 description 1
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 1
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 1
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 1
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- FCKPEGOCSVZPNC-WHOFXGATSA-N Gly-Ile-Phe Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FCKPEGOCSVZPNC-WHOFXGATSA-N 0.000 description 1
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- MKIAPEZXQDILRR-YUMQZZPRSA-N Gly-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN MKIAPEZXQDILRR-YUMQZZPRSA-N 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 1
- DFHVLUKTTVTCKY-PBCZWWQYSA-N His-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N)O DFHVLUKTTVTCKY-PBCZWWQYSA-N 0.000 description 1
- OEROYDLRVAYIMQ-YUMQZZPRSA-N His-Gly-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O OEROYDLRVAYIMQ-YUMQZZPRSA-N 0.000 description 1
- TTYKEFZRLKQTHH-MELADBBJSA-N His-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O TTYKEFZRLKQTHH-MELADBBJSA-N 0.000 description 1
- PUFNQIPSRXVLQJ-IHRRRGAJSA-N His-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N PUFNQIPSRXVLQJ-IHRRRGAJSA-N 0.000 description 1
- 101000928314 Homo sapiens Aldehyde oxidase Proteins 0.000 description 1
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 1
- VQUCKIAECLVLAD-SVSWQMSJSA-N Ile-Cys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VQUCKIAECLVLAD-SVSWQMSJSA-N 0.000 description 1
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 1
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 1
- UQXADIGYEYBJEI-DJFWLOJKSA-N Ile-His-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N UQXADIGYEYBJEI-DJFWLOJKSA-N 0.000 description 1
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 1
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 1
- WKSHBPRUIRGWRZ-KCTSRDHCSA-N Ile-Trp-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N WKSHBPRUIRGWRZ-KCTSRDHCSA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- FOEHRHOBWFQSNW-KATARQTJSA-N Leu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N)O FOEHRHOBWFQSNW-KATARQTJSA-N 0.000 description 1
- BOFAFKVZQUMTID-AVGNSLFASA-N Leu-Gln-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BOFAFKVZQUMTID-AVGNSLFASA-N 0.000 description 1
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- POMXSEDNUXYPGK-IHRRRGAJSA-N Leu-Met-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N POMXSEDNUXYPGK-IHRRRGAJSA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 1
- HQBOMRTVKVKFMN-WDSOQIARSA-N Leu-Trp-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O HQBOMRTVKVKFMN-WDSOQIARSA-N 0.000 description 1
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 1
- GQUDMNDPQTXZRV-DCAQKATOSA-N Lys-Arg-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GQUDMNDPQTXZRV-DCAQKATOSA-N 0.000 description 1
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 1
- ZAWOJFFMBANLGE-CIUDSAMLSA-N Lys-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N ZAWOJFFMBANLGE-CIUDSAMLSA-N 0.000 description 1
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 1
- HEWWNLVEWBJBKA-WDCWCFNPSA-N Lys-Gln-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN HEWWNLVEWBJBKA-WDCWCFNPSA-N 0.000 description 1
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 1
- CRNNMTHBMRFQNG-GUBZILKMSA-N Lys-Glu-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N CRNNMTHBMRFQNG-GUBZILKMSA-N 0.000 description 1
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- OWRUUFUVXFREBD-KKUMJFAQSA-N Lys-His-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O OWRUUFUVXFREBD-KKUMJFAQSA-N 0.000 description 1
- GNLJXWBNLAIPEP-MELADBBJSA-N Lys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCCN)N)C(=O)O GNLJXWBNLAIPEP-MELADBBJSA-N 0.000 description 1
- RIJCHEVHFWMDKD-SRVKXCTJSA-N Lys-Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RIJCHEVHFWMDKD-SRVKXCTJSA-N 0.000 description 1
- KFSALEZVQJYHCE-AVGNSLFASA-N Lys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N KFSALEZVQJYHCE-AVGNSLFASA-N 0.000 description 1
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 1
- TUSOIZOVPJCMFC-FXQIFTODSA-N Met-Asp-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O TUSOIZOVPJCMFC-FXQIFTODSA-N 0.000 description 1
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 1
- VQILILSLEFDECU-GUBZILKMSA-N Met-Pro-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O VQILILSLEFDECU-GUBZILKMSA-N 0.000 description 1
- NBEFNGUZUOUGFG-KKUMJFAQSA-N Met-Tyr-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NBEFNGUZUOUGFG-KKUMJFAQSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 101150029183 PEP4 gene Proteins 0.000 description 1
- 102100036893 Parathyroid hormone Human genes 0.000 description 1
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 1
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 description 1
- MPFGIYLYWUCSJG-AVGNSLFASA-N Phe-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MPFGIYLYWUCSJG-AVGNSLFASA-N 0.000 description 1
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- PMKIMKUGCSVFSV-CQDKDKBSSA-N Phe-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PMKIMKUGCSVFSV-CQDKDKBSSA-N 0.000 description 1
- FXYXBEZMRACDDR-KKUMJFAQSA-N Phe-His-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FXYXBEZMRACDDR-KKUMJFAQSA-N 0.000 description 1
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 1
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 1
- 102000004245 Proteasome Endopeptidase Complex Human genes 0.000 description 1
- 108090000708 Proteasome Endopeptidase Complex Proteins 0.000 description 1
- 101100160519 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) YPS7 gene Proteins 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- NIOYDASGXWLHEZ-CIUDSAMLSA-N Ser-Met-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O NIOYDASGXWLHEZ-CIUDSAMLSA-N 0.000 description 1
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 1
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 1
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 1
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 1
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- 108010049264 Teriparatide Proteins 0.000 description 1
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 1
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 1
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 1
- DSLHSTIUAPKERR-XGEHTFHBSA-N Thr-Cys-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O DSLHSTIUAPKERR-XGEHTFHBSA-N 0.000 description 1
- RJBFAHKSFNNHAI-XKBZYTNZSA-N Thr-Gln-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O RJBFAHKSFNNHAI-XKBZYTNZSA-N 0.000 description 1
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- HPQHHRLWSAMMKG-KATARQTJSA-N Thr-Lys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N)O HPQHHRLWSAMMKG-KATARQTJSA-N 0.000 description 1
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 1
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 1
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 1
- IJKNKFJZOJCKRR-GBALPHGKSA-N Thr-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 IJKNKFJZOJCKRR-GBALPHGKSA-N 0.000 description 1
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 1
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 1
- AIISTODACBDQLW-WDSOQIARSA-N Trp-Leu-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 AIISTODACBDQLW-WDSOQIARSA-N 0.000 description 1
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 1
- DANHCMVVXDXOHN-SRVKXCTJSA-N Tyr-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DANHCMVVXDXOHN-SRVKXCTJSA-N 0.000 description 1
- KOVXHANYYYMBRF-IRIUXVKKSA-N Tyr-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KOVXHANYYYMBRF-IRIUXVKKSA-N 0.000 description 1
- GIOBXJSONRQHKQ-RYUDHWBXSA-N Tyr-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GIOBXJSONRQHKQ-RYUDHWBXSA-N 0.000 description 1
- JJNXZIPLIXIGBX-HJPIBITLSA-N Tyr-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JJNXZIPLIXIGBX-HJPIBITLSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- CWVHKVVKAQIJKY-ACRUOGEOSA-N Tyr-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N CWVHKVVKAQIJKY-ACRUOGEOSA-N 0.000 description 1
- PHKQVWWHRYUCJL-HJOGWXRNSA-N Tyr-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PHKQVWWHRYUCJL-HJOGWXRNSA-N 0.000 description 1
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 1
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 1
- CLEGSEJVGBYZBJ-MEYUZBJRSA-N Tyr-Thr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CLEGSEJVGBYZBJ-MEYUZBJRSA-N 0.000 description 1
- OBKOPLHSRDATFO-XHSDSOJGSA-N Tyr-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OBKOPLHSRDATFO-XHSDSOJGSA-N 0.000 description 1
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 1
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 description 1
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 1
- ZQGPWORGSNRQLN-NHCYSSNCSA-N Val-Asp-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZQGPWORGSNRQLN-NHCYSSNCSA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- ZIGZPYJXIWLQFC-QTKMDUPCSA-N Val-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N)O ZIGZPYJXIWLQFC-QTKMDUPCSA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 1
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 1
- IOETTZIEIBVWBZ-GUBZILKMSA-N Val-Met-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N IOETTZIEIBVWBZ-GUBZILKMSA-N 0.000 description 1
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 description 1
- YKZVPMUGEJXEOR-JYJNAYRXSA-N Val-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N YKZVPMUGEJXEOR-JYJNAYRXSA-N 0.000 description 1
- 108010011164 acein 1 Proteins 0.000 description 1
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 101150010487 are gene Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 210000000481 breast Anatomy 0.000 description 1
- 230000004087 circulation Effects 0.000 description 1
- 238000005336 cracking Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 239000013583 drug formulation Substances 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 239000008273 gelatin Substances 0.000 description 1
- 229920000159 gelatin Polymers 0.000 description 1
- 235000019322 gelatine Nutrition 0.000 description 1
- 235000011852 gelatine desserts Nutrition 0.000 description 1
- 238000003209 gene knockout Methods 0.000 description 1
- 108010040856 glutamyl-cysteinyl-alanine Proteins 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 230000012447 hatching Effects 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 230000005847 immunogenicity Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 210000003000 inclusion body Anatomy 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 235000014655 lactic acid Nutrition 0.000 description 1
- 239000004310 lactic acid Substances 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 230000005923 long-lasting effect Effects 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 150000007523 nucleic acids Chemical class 0.000 description 1
- 230000001009 osteoporotic effect Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 150000003905 phosphatidylinositols Chemical group 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 238000004064 recycling Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 238000012882 sequential analysis Methods 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 101150076304 spp gene Proteins 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 230000033772 system development Effects 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 210000003934 vacuole Anatomy 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Images
Landscapes
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
Abstract
The invention provides a method for preparing human serum albumin-human parathyroid hormone. By means of transforming a yeast host SMD1168, yapson1 coded sequences on the SMD 1168 can be knocked out. On the basis of the SMD 1168, coded gene YPS1 of yapson1 is further knocked out, namely double knockouts of a proteinase A and yapsin1 are achieved, and the transformed host is named as SMD1168yps1 delta. By the aid of the transformed host, a recombination expression of fusion proteins of human serum albumin-human parathyroid hormone (1-34) is optimized, so that the degradation of interest proteins in the expressing process can be reduced, and downstream mass production costs and difficulties of separation and purification are lowered. According to the method, the optimized the human serum albumin-human parathyroid hormone (1-34) fusion proteins can be applied to prepare parathyroid hormone (PTH) (1-34) medicines which are used for treating osteoporosis.
Description
Technical field
The invention belongs to biological technical field, particularly, the structure that relates to pichia pastoris protein enzyme A and the yapsin 1 dual genetic engineering bacterium that knocks out, and this host is used for corresponding proteolytic ferment responsive type heterologous protein---human serum albumin-human parathyroid hormone (1-34) fusion rotein recombinant expressed.
Technical background
Pichia pastoris phaff (Pichia pastors) expression system is fitst water at present, one of most widely used exogenous protein expression system.It not only overcome protein, expressed proteins multiform that escherichia expression system can not the expression structure complexity become insoluble inclusion body, background albumen many, express defective such as yield poorly, remedy cells of mamma animals, insect cell expression system complicated operation, expression level is low, industrialization production involves great expense deficiency, also had an incomparable superior part of other yeast expression systems.The advantage of this expression system mainly contain following some: (1) has strong alcohol oxidase promotor (P
AOX1), expression that can strict regulation and control foreign protein; (2) high-density large-scale that is easy to be implemented in the synthetic medium is cultivated, pichia spp self be secreted in the substratum albumen seldom, the foreign protein of high secretion easily separates from protein free substratum.(3) degree of glycosylation is low, compares with yeast saccharomyces cerevisiae, and pichia spp does not produce excessive glycosylation; The immunogenicity of secreted glycoprotein is lower, is more conducive to clinical application.Utilize at present this expression system successful expression nearly 500 kinds of heterologous proteins.
Yet, the degraded of recombinant expressed foreign protein, the proteolytic degradation that is caused by the multiple protein lytic enzyme that exists in the pichia spp born of the same parents particularly, usually make the expression amount of foreign protein reduce greatly, because the physico-chemical property of degraded product and intact proteins is very close, also the separation and purification in downstream is had higher requirement simultaneously.This also is the bottleneck problem of pichia yeast expression system development.When the optimization culture condition can't reach the purpose that reduces the foreign protein degraded, usually can consider to utilize genetic engineering means that active proteolytic ferment is knocked out, SMD1168 as the use of commercialization, the PEP4 gene of proteins encoded enzyme A is knocked out in this strain gene group, cause the protease A loss of activity, also partly reduce the activity of proteolytic enzyme B and carboxypeptidase y simultaneously, made expression product avoid the degraded of this proteinoid enzyme.
Yet host SMD1168 can only be effective to some specific albumen, and the degraded that not all exogenous protein expression produces can both be effectively suppressed.Early stage research is according to secretion and the where-used of proteolytic enzyme, the intracellular protein enzyme of yeast is divided into three major types: the proteolytic enzyme of vacuole protein enzyme, cell matrix proteasome, Secretory Pathway.Yapsins (corresponding encoding gene be called YPS) is that a newfound class is present in proteolytic ferment on the yeast emiocytosis approach in recent years, belong to aspartic acid proteolysis enzyme family, it is characterized in that having one section glycosylation phosphatidylinositols structure (GPI), it is anchored on the cytolemma.This proteinoid lytic enzyme is the enzymolysis site with basic aminoacids sequence single or paired on the substrate mainly.First identified out and study more deep be yapsin 1 in the yeast saccharomyces cerevisiae.In other fungies, also identified successively afterwards and had multiple yapsin member.Yapsin 1 in the pichia spp identifies in 2005 first by Wertem etc., and regrettably, they find that GS115 bacterial strain that YPS1 knocks out does not improve effect for the degraded of gelatin.But more and more evidences shows that the yapsins defective has significant improvement effect to the degraded of some specific protein in yeast saccharomyces cerevisiae or the lactic acid Crewe dimension genus yeast.As the yeast saccharomyces cerevisiae of using yapsin 1 defective can reduce the degraded of reorganization PTH greatly.People such as Schutter had announced pichia spp gene order-checking result in 2009, through sequential analysis, find except YPS1, in the pichia spp born of the same parents, also to have 6 YPS members, and be respectively YPS2, YPS2 ", YPS3, YPS3 " and, YPS7 and MKC7.
Summary of the invention
The purpose of this invention is to provide a kind of method for preparing human serum albumin-human parathyroid hormone.At first, the yeast host SMD1168 that is bought by Invitrogen is transformed, namely by the method for homologous recombination, knock out the encoding sequence (SEQ ID NO.11) of yapsin 1 on the SMD1168 genome.SMD1168 is that a kind of protease A knocks out the type host, and it is the gene knockout gained with proteins encoded enzyme A among the host GS115.The present invention further knocks out the encoding gene YPS1 of yapsin 1 on the basis of SMD1168 and has realized that namely protease A and the dual of yapsin 1 knock out, and should transform host's called after SMD1168yps1 Δ.The above-mentioned transformation host of recycling optimizes recombinant expressed human serum albumin-human parathyroid hormone (1-34) fusion rotein, thereby reduces the degraded of the target protein in the expression process.
The present invention realizes by following steps: (a) 5 ' the end homology arm of design YPS1 and 3 ' end homology arm amplimer, and from the pichia spp genome, effectively amplify this two homology arm sequences; (b) these two homology arm sequences that obtain in the step (a) are inserted among the carrier pPICZ α B according to sequencing, obtain to be used for the recombinant vectors pPICZ α B-YPS1 Δ of subsequent homo reorganization; (c) with the recombinant vectors that makes up in the step (b) with single digestion with restriction enzyme after transfection SMD1168, cultivation screening recon with the selective pressure that contains Zeocin, with the exactness of corresponding positive primer and negative primer checking homologous recombination, correct defective type recon is designated as the SMD1168yps1 Δ; (d) utilizing the protease-deficient host SMD1168yps1 Δ that obtains in the step (c) to express corresponding proteolytic ferment responsive type recombinant protein---human serum albumin-human parathyroid hormone (1-34) fusion rotein, SEQ ID NO:13 are gene and the protein sequence of this fusion rotein.
In the step (a), the invention provides the primer sequence for YPS15 ' end homology arm and 3 ' end homology arm amplification, the gene order (SEQ ID NO.9) of YPS15 ' end homology arm is provided simultaneously, the gene order (SEQ ID NO.10) of YPS13 ' end homology arm and coding yapsin 1 gene order YPS1 (SEQ ID NO.11).Wherein, being used for amplification YPS15 ' holds the upstream primer of homology arm sequence to be: SEQ ID NO.1:5 '-aagc
CtgcagAgctccattg cgccaacccc-3 ', the line part is Pst I restriction enzyme site, downstream primer is: SEQ ID NO.2:5 '-cggc
GtcgacAatctggctg agcggaaagt ttga-3 ', the line part is the SalI restriction enzyme site.The upstream primer that is used for amplification YPS13 ' end homology arm sequence is: SEQ ID NO.3:5 '-ccc
AgatctC acatttcgcg cctgccttcc t-3 ', the line part is the BglII restriction enzyme site, downstream primer is: SEQ ID NO.4:5 '-aaa
CtgcagG ggtgagcctg tctggccct-3 ', the line part is Pst I restriction enzyme site.
In the step (b), the invention provides the recombinant vectors that carries YPS15 ' end homology arm and 3 ' end homology arm dna sequence dna, determine that by PCR checking and dna sequencing above-mentioned two homology arms are inserted among the carrier pPICZ α B in correct mode, are designated as pPICZ α B-YPS1 Δ (SEQ ID NO.12) with recombinant vectors.
In the step (c), the present invention is according to the target homologous recombination zone change that sequence takes place before and after homologous recombination of yeast, design a pair of negative primer (SEQ ID NO.5:5 '-ctccttccac cgccacggc-3 ', SEQ ID NO.6:5 '-accacttgag gcgtccttgg a-3 ') determines that the encoding sequence of YPS1 in the recon is knocked out, further a pair of positive primer of design (SEQ ID NO.7:5 '-atggaggagt tgaggacc-3 ', SEQ ID NO.8:5 '-gcaaatggca ttctgacatc c-3 ') is proved conclusively the exactness of homologous recombination position again.
Utilize the known method in this area can finish the structure of homologous recombination vector, the extraction of yeast genes group can be with reference to (A. Adam Si etc., " yeast genetics method experiment guide ", Science Press, 2000).The acquisition of YPS15 ' end homology arm and 3 ' end homology arm can be used the method for PCR Science.239:487-491 such as (, 1988) Saiki or synthetic.As the method by PCR, introduce the restriction enzyme enzyme recognition site in the both sides of target sequence, cut the generation sticky end by enzyme, can be cloned into required carrier easily.The molecular cloning process of used standard is seen (J. Sa nurse Brooker etc., " molecular cloning experiment guide " second edition, Science Press, 1995.).
Yeast host GS115, SMD1168 and expression vector pPICZ α B can buy (Invitrogen Corp.San Diego.California.USA) from company.Preferable methods is that 5 ' end and 3 ' end homology arm are cloned into carrier pPICZ α B.The Zeocin resistant gene that this carrier carries can be used as the colony screening mark after the subsequent homo reorganization.
Transform required nucleic acid and to host cell, go the available common method that gets, as electroporation, the competent spheroplast of preparation etc.The success cell transformed can be identified by the technology that people know, and through collecting and cracking, extracts DNA as cell, and PCR method is identified then.Specifically can be with reference to implementing regulations 3.
Human serum albumin-human parathyroid hormone (1-34) fusion rotein (HSA/PTH (1-34)) utilizes genetic engineering means that both genes are merged, last a kind of long-acting human parathyroid hormone (1-34) form of in pichia spp, expressing acquisition, this albumen is a kind of very promising pharmaceutical protein that can treat osteoporosis, sees granted patent (application number 200410066459.8).The applicant finds when adopting pichia spp GS115 as the expressive host of this albumen in earlier stage, produce the human serum albumin-human parathyroid hormone fusion rotein with biologic activity though utilize GS115 to express, but there is serious signs of degradation in expression product, cause the expression amount of intact proteins not high, and just can obtain highly purified intact proteins through more purification procedures, the productive rate of recombinant protein is extremely low.Use commercial defective type host SMD1168 also can only partly suppress the to degrade generation of band instead, prompting also has other proteolytic ferments to play a leading role in the recombinant expressed process of HSA/PTH (1-34) except protease A.This experiment of later stage is by making up 7 kinds of single defective hosts that knock out of YPS, it is the GS115yps1 Δ, the GS115yps2 Δ, GS115yps2 " Δ, GS115yps3 Δ, GS115yps3 " Δ, the GS115yps7 Δ, the GS115mkc7 Δ, only in the host that YPS1 knocks out, the degraded of HSA/PTH (1-34) obviously reduces in discovery.According to this result, we have determined that yapsin 1 is another principal element that causes HSA/PTH (1-34) degraded.A kind of method that makes up protease A and the yapsin 1 dual protease-deficient pichia spp host who knocks out is provided, utilize protease A and the yapsin 1 dual pichia spp that knocks out as expressive host, and utilize this host to optimize the recombinant expressed of human serum albumin-Rat parathyroid hormone 1-34 (1-34) fusion rotein, reduce the signs of degradation of human serum albumin-human parathyroid hormone (1-34) fusion rotein in the expression process, thereby reduce downstream scale operation cost and separation and purification difficulty, when improving expression amount, also be conducive to separation and purification, make this albumen when industrial amplification production, reduce difficulty, reduce cost.Lay the foundation for further studying treatment osteoporotic PTH (1-34) long lasting drug formulations.
Figure of description
Fig. 1 is the structure synoptic diagram of recombinant plasmid pPICZ alpha B-YPS1 Δ.
Fig. 2 is the PCR proof diagram of recombinant plasmid pPICZ alpha B-YPS1 Δ.
Fig. 3 identifies synoptic diagram for structure synoptic diagram and the PCR that homologous recombination method knocks out the YPS1 gene.
Fig. 4 is that the PCR of host SMD1168yps1 Δ identifies electrophorogram.
The PAGE that Fig. 5 expresses in normal host GS115 and protease-deficient host SMD1168 and SMD1168yps1 Δ for fusion rotein analyzes.
Embodiment
The present invention is described further with specific embodiment by reference to the accompanying drawings.
The amplification of embodiment 1:YPS15 ' end homology arm and 3 ' end homology arm
5 ' the end homology arm fragment and 3 ' that obtains the YPS1 sequence with PCR method from the yeast genes group is held the homology arm fragment, and used primer YPS1_N up (SEQ ID NO.1), YPS1_N dn (SEQ ID NO.2) and YPS1_C up (SEQ ID NO.3), YPS1_C dn (SEQ ID NO.4) synthesize with the oligonucleotide synthesizer.The upstream and downstream primer that is used for 5 ' end homology arm amplification is introduced PstI and Sa1I site and protection base respectively, and the upstream and downstream primer that is used for 3 ' end homology arm amplification is introduced BglII and PstI site and protection base respectively, and line place is the restriction endonuclease recognition sequence.
YPS1_N up:5’-aagc
ctgcag agctccattg cgccaacccc-3’
YPS1_N dn:5’-cggc
gtcgac aatctggctg agcggaaagt ttga-3’
YPS1_C up:5’-ccc
agatctc acatttcgcg cctgccttcc t-3’
YPS1_C dn:5’-aaa
ctgcagg ggtgagcctg tctggccct-3’
PCR reaction conditions: in the 50 μ L reaction systems, add 1 μ L yeast genes group, each 1 μ L of the upstream and downstream primer of 10 μ mol/L, the dNTP mixture 4 μ L of 2.5mmol/L, 5 * Buffer (Mg
2+Plus) 10 μ L, PrimeSTAR archaeal dna polymerase 0.5 μ L, residue ddH
2O supplies.With (model is Matstercycler Gradient) PCR instrument of EPPENDORF company, the PCR reaction conditions is 98 ℃ of pre-sex change 5 minutes; 98 ℃ of sex change 10s; 55 ℃ of annealing 15min; 72 ℃ are extended 30s, and after totally 30 circulations, 72 ℃ are extended 10min again.Obtain being contemplated to 5 ' the end homology arm sequence of 257bp and 3 ' the end homology arm sequence of 318bp by 1.5% gel electrophoresis analysis.
Embodiment 2: the structure of recombinant vectors pPICZ α B-YPS1 Δ
Reclaim purifying purpose band with reclaiming test kit with dna fragmentation behind 3 ' the end homology arm PCR product electrophoresis purifying that obtains among the embodiment 1.Behind the purifying 3 ' end homology arm PCR product is cut with the BglII/PstI enzyme, and electrophoresis reclaims; Plasmid pPICZ α B cuts with the BglII/PstI enzyme equally, and electrophoresis reclaims the linearizing fragment.3 ' end homology arm and carrier pPICZ α B after enzyme cut mix with suitable proportion, connect in 16 ℃ of water-baths with the T4 ligase enzyme and spend the night, formation pPICZ α B-C.After being transformed into DH5 α, be laid on the LB agar plate that contains 25 μ g/mL Zeocin, 37 ℃ of overnight incubation.Select on the plate several clone's inoculations 5mL and contain in the LB liquid nutrient medium of 25 μ g/mL Zeocin 37 ℃ of overnight incubation.With primer YPS1_C up/YPS1_C dn is carried out the PCR checking and obtain positive colony pPICZ α B-C/DH5 α (see figure 2), it is correct that positive colony is served the further sequence verification of Hai Shenggong.
Reclaim purifying purpose band with reclaiming test kit with dna fragmentation behind 5 ' the end homology arm PCR product electrophoresis purifying that obtains among the embodiment 1.Behind the purifying 5 ' end homology arm PCR product is cut with the PstI/SalI enzyme, and electrophoresis reclaims; Plasmid pPICZ α B-C cuts with the PstI/SalI enzyme equally, and electrophoresis reclaims the linearizing fragment.5 ' end homology arm and carrier pPICZ α B-C after enzyme cut mix with suitable proportion, connect in 16 ℃ of water-baths with the T4 ligase enzyme and spend the night, formation pPICZ α B-YPS1 Δ.After being transformed into DH5 α, be laid on the LB agar plate that contains 25 μ g/mL Zeocin, 37 ℃ of overnight incubation.Select on the plate several clone's inoculations 5mL and contain in the LB liquid nutrient medium of 25 μ g/mLZeocin 37 ℃ of overnight incubation.With primer YPS1_C up/YPS1_N dn is carried out the PCR checking and obtain positive colony pPICZ α B-YPS1 Δ/DH5 α (see figure 2), it is correct that positive colony is served the further sequence verification of Hai Shenggong.
Embodiment 3: homologous recombination method knocks out the YPS1 gene among the host SMD1168
Therefore pichia spp host SMD1168 is protease A defective type host, on the basis of SMD1168 aspartic acid lytic enzyme 1 gene is further knocked out and can obtain protease A and the yapsin 1 dual pichia spp host who knocks out.
The mode that pPICZ α B-YPS1 Δ is transformed with electricity after with the PstI linearizing is transformed into pichia spp SMD1168 (the yeast expression test kit is provided by Invitrogen Corp.USA), hatching 2h in the YPD liquid nutrient medium after, SMD1168 after the conversion is laid on YPD Agar plate (the 1%yeast extract that contains 50 μ g/mL Zeocin and 1M sorbyl alcohol, 2%peptone, 2% glucose, the 1M sorbyl alcohol, 2% agarose) on, cultivated 3-5 days for 30 ℃.Several clones are inoculated in 5mL and contain in the YPD liquid nutrient medium of 50 μ g/mL Zeocin 30 ℃ of overnight incubation on the picking plate.PCR verifies positive colony, concrete mode is seen Fig. 3, Fig. 4: design a pair of negative primer (yps1 negative up (SEQ ID NO.5), yps1 negative dn (SEQ ID NO.6)), a pair of positive primer (yps1 positive up (SEQ ID NO.7), yps1 positive dn (SEQ ID NO.8)), the positive colony that the correct YPS1 that recombinates knocks out is designated as the SMD1168yps1 Δ, can amplify the 833bp band with positive primer, can not amplify any band with negative primer.
yps1 negative up:5’-ctccttccac cgccacggc-3’
yps1 negative dn:5’-accacttgag gcgtccttgg a-3’
yps1 positive up:5’-atggaggagt tgaggacc-3’
yps1 positive dn:5’-gcaaatggca ttctgacatc c-3’
Embodiment 4: the expression of fusion rotein HSA/PTH (1-34) in host GS115
Fusion rotein HSA/PTH (1-34) (SEQ ID NO.13) applies for a patent and authorizes (application number 200410066459.8) by this laboratory.Described in this patent 200410066459.8, the mode that recombinant expression vector pPIC9-HSA/PTH (1-34) is transformed with electricity after with Sal I linearizing is transformed into pichia spp GS115, and detailed electric transform mode can be with reference to Invitrogen yeast laboratory manual.Conversion fluid is laid on RDB agar plate (1mol/L sorbyl alcohol, 1% glucose, 4*10
-5The % vitamin H, 1.34% no amino acid yeast nitrogen (YNB, Difeocorp.), 0.005% amino acid mixing liquid (L-glutamic acid, methionine(Met), Methionin, leucine, Isoleucine each 0.005%), 2% agarose) on, 3~5d, picking His cultivated for 30 ℃
+The clone is seeded to 20mL BMGY substratum (1%yeast extract, 2%peptone, 100mmol/L potassiumphosphate, pH6.0,4*10 is housed
-5% vitamin H, 1.34% no amino acid yeast nitrogen, 1% glycerine) the 100mL triangular flask, 250 rev/mins, 30 ℃ be cultured to OD=2~6 after, change 20mL BMMY substratum (1%yeast extract, 2%peptone into, the 100mmol/L potassiumphosphate, pH6.0,4*10
-5% vitamin H, 1.34% no amino acid yeast nitrogen, 1% methyl alcohol), after this add 1% methyl alcohol/24h, induce centrifuging and taking supernatant behind the 3d, native-PAGE analyzes expression product, and the result finds that referring to Fig. 5 there is serious degraded in the fusion rotein in the culture supernatant.
Embodiment 5: the expression of fusion rotein HSA/PTH (1-34) in host SMD1168
Described in patent No. zl200410066459.8, the mode that recombinant expression vector pPIC9-HSA/PTH (1-34) is transformed with electricity after with Sal I linearizing is transformed into pichia spp SMD1168, and detailed electric transform mode can be with reference to Invitrogen yeast laboratory manual.Conversion fluid is laid on RDB agar plate (1mol/L sorbyl alcohol, 1% glucose, 4*10
-5The % vitamin H, 1.34% no amino acid yeast nitrogen (YNB, Difeocorp.), 0.005% amino acid mixing liquid (L-glutamic acid, methionine(Met), Methionin, leucine, Isoleucine each 0.005%), 2% agarose) on, 3~5d, picking His cultivated for 30 ℃
+The clone is seeded to 20mL BMGY substratum (1%yeast extract, 2%peptone, 100mmol/L potassiumphosphate, pH6.0,4*10 is housed
-5% vitamin H, 1.34% no amino acid yeast nitrogen, 1% glycerine) the 100mL triangular flask, 250 rev/mins, 30 ℃ be cultured to OD=2~6 after, change 20mL BMMY substratum (1%yeast extract, 2%peptone into, the 100mmol/L potassiumphosphate, pH6.0,4*10
-5The % vitamin H, 1.34% no amino acid yeast nitrogen, 1% methyl alcohol), after this add 1% methyl alcohol/24h, induce centrifuging and taking supernatant behind the 3d, native-PAGE analyzes expression product, the result is referring to Fig. 5, though find that with respect to host GS115 the degraded of the target protein that host SMD1168 produces is alleviated to some extent, the percentage composition of degraded band in culture supernatant is still greater than 50% in SMD1168.
The optimization expression of embodiment: 6: fusion rotein HSA/PTH (1-34) in host SMD1168yps1 Δ of the present invention
The mode that recombinant expression vector pPIC9-HSA/PTH (1-34) described in the patent No. z1200410066459.8 is transformed with electricity after with Sal I linearizing is transformed into pichia spp SMD1168 yps1 Δ, and detailed electric transform mode can be with reference to Invitrogen yeast laboratory manual.Conversion fluid is laid on RDB agar plate (1mol/L sorbyl alcohol, 1% glucose, 4*10
-5The % vitamin H, 1.34% no amino acid yeast nitrogen (YNB, Difeocorp.), 0.005% amino acid mixing liquid (L-glutamic acid, methionine(Met), Methionin, leucine, Isoleucine each 0.005%), 2% agarose) on, 3~5d, picking His cultivated for 30 ℃
+The clone is seeded to 20mL BMGY substratum (1%yeast extract, 2%peptone, 100mmol/L potassiumphosphate, pH6.0,4*10 is housed
-5% vitamin H, 1.34% no amino acid yeast nitrogen, 1% glycerine) the 100mL triangular flask, 250 rev/mins, 30 ℃ be cultured to OD=2~6 after, change 20mL BMMY substratum (1%yeastextract, 2%peptone into, the 100mmol/L potassiumphosphate, pH6.0,4*10
-5% vitamin H, 1.34% no amino acid yeast nitrogen, 1% methyl alcohol), after this add 1% methyl alcohol/24h, induce centrifuging and taking supernatant behind the 3d, native-PAGE analyzes expression product.The result is referring to Fig. 5, and than host GS115 and SMD1168, in host SMD1168 yps1 Δ, the degraded band has obtained maximum inhibition, illustrates that the SMD1168yps1 Δ is optimum relatively a kind of host.
<110〉Zhejiang University
<120〉a kind of method for preparing human serum albumin-human parathyroid hormone
<160> 13
<210> 1
<211> 30
<212> DNA
<213〉artificial sequence
<220>
<223〉the used upstream primer of amplification YPS1 5' homology arm gene order from pichia spp GS115 genome
<440> 1
aagcctgcag agctccattg cgccaacccc 30
<210> 2
<211> 34
<212> DNA
<213〉artificial sequence
<220>
<223〉the used downstream primer of amplification YPS1 5' homology arm gene order from pichia spp GS115 genome
<440> 2
cggcgtcgac aatctggctg agcggaaagt ttga 34
<210> 3
<211> 31
<212> DNA
<213〉artificial sequence
<220>
<223〉the used upstream primer of amplification YPS1 3' homology arm gene order from pichia spp GS115 genome
<440> 3
cccagatctc acatttcgcg cctgccttcc t 31
<210> 4
<211> 29
<212> DNA
<213〉artificial sequence
<220>
<223〉the used downstream primer of amplification YPS1 3' homology arm gene order from pichia spp GS115 genome
<440> 4
aaactgcagg ggtgagcctg tctggccct 29
<210> 5
<211> 19
<212> DNA
<213〉artificial sequence
<220>
<223〉for the identification of the negative upstream primer of SMD1168 yps1 △ recon
<440> 5
ctccttccac cgccacggc 19
<210> 6
<211> 21
<212> DNA
<213〉artificial sequence
<220>
<223〉for the identification of the negative downstream primer of SMD1168 yps1 △ recon
<440> 6
accacttgag gcgtccttgg a 21
<210> 7
<211> 18
<212> DNA
<213〉artificial sequence
<220>
<223〉for the identification of the positive upstream primer of SMD1168 yps1 △ recon
<440> 7
atggaggagt tgaggacc 18
<210> 8
<211> 21
<212> DNA
<213〉artificial sequence
<220>
<223〉for the identification of the positive downstream primer of SMD1168 yps1 △ recon
<440> 8
gcaaatggca ttctgacatc c 21
<210> 9
<211> 237
<212> DNA
<213> Pichia pastoris
<220>
<223〉pichia spp GS115 YPS1 5' homology arm gene order
<440> 9
agctccattg cgccaacccc cgctctccag actccttcgt tatccagcat tcagcatgga 60
caggttcaaa aaataaaatt tcttgatatg ggtccacttc aaacatgcgc ctacctgtag 120
gaaaaaaaaa gagaacataa atatgccgcg aacagaaaac gtaatgtact gttctatata 180
taaactgttc agatcaatca taaattctca gtttcaaact ttccgctcag ccagatt 237
<210> 10
<211> 300
<212> DNA
<213> Pichia pastoris
<220>
<223〉pichia spp GS115 YPS1 3' homology arm gene order
<440> 10
cacatttcgc gcctgccttc ctttaggttc tttgaatcat catcaatcgt cgccgtctac 60
atcagagcag gacttatctt tgccttcccc aaaaattgcc actccgtcaa atagattctt 120
ttgaatcctt gactattttt gcctaaatag gtttttgtta gtttttcttc aaagcccaaa 180
agaaactcta tttagattca tccagaaaca atctttttct caccccattt cgaagtgccg 240
tggagcacag acataaaaag atgactaccg ttcaacctac agggccagac aggctcaccc 300
<210> 11
<211> 1800
<212> DNA
<213> Pichia pastoris
<220>
<221> CDS
<222> (1)~(1797)
<223〉pichia spp GS115 YPS1 sequence
<440> 11
gt ttt tag ctc cat tgc gcc aac ccc cgc tct -240
cca gac tcc ttc gtt atc cag cat tca gca tgg aca ggt tca aaa aat aaa att tct tga -180
tat ggg tcc act tca aac atg cgc cta cct gta gga aaa aaa aag aga aca taa ata tgc -120
cgc gaa cag aaa acg taa tgt act gtt cta tat ata aac tgt tca gat caa tca taa att -60
ctc agt ttc aaa ctt tcc gct cag cca gat ttt att cgt aaa gaa cgc atc att ggc tct 0
atg ttg aag gat cag ttc ttg tta tgg gtt gct ttg ata gcg agc gta ccg gtt tcc ggc 60
Met Leu Lys Asp Gln Phe Leu Leu Trp Val Ala Leu Ile Ala Ser Val Pro Val Ser Gly
1 5 10 15 20
gtg atg gca gct cct agc gag tcc ggg cat aac acg gtt gaa aaa cga gat gcc aaa aac 120
Val Met Ala Ala Pro Ser Glu Ser Gly His Asn Thr Val Glu Lys Gln Asp Ala Lys Asn
21 25 30 35 40
gtt gtt ggc gtt cga cag ttg gac ttc agc gtt ctg agg ggt gat tcc ttc gaa agt gcc 180
Val Val Gly Val Arg Gln Leu Asp Phe Ser Val Leu Arg Gly Asp Ser Phe Glu Ser Ala
41 45 50 55 60
tct tca gag aac gtg cct cgg ctt gtg agg aga gat gac acg cta gaa gct gag cta atc 240
Ser Ser Glu Asn Val Pro Arg Leu Val Arg arg Asp Asp Thr Leu Glu Ala Glu Leu Ile
61 65 70 75 80
aac cag caa tca ttc tac ttg tca cga ctg aaa gtt gga tca cat caa gcg gat att gga 300
Asn Gln Gln Ser Phe Tyr Leu Ser Arg Leu Lys Val Gly Ser His Gln Ala Asp Ile Gly
81 85 90 95 100
atc cta gtg gac aca gga tcc tct gat tta tgg gta atg gac tcg gta aac cca tac tgc 360
Ile Leu Val Asp Thr Gly Ser Ser Asp Leu Trp Val Met Asp Ser Val Asn Pro Tyr Cys
101 105 110 115 120
agt agc cgt tcc cgc gtg aag aga gat ata cac gat gag aag atc gcc gaa tgg gat ccc 420
Ser Ser Arg Ser Arg Val Lys Arg Asp Ile His Asp Glu Lys Ile Ala Glu Trp Asp Pro
121 125 130 135 140
atc aat ctc aag aaa aat gaa act tct cag aat aaa aat ttt tgg gat tgg ctc gtt gga 480
Ile Asn Leu Lys Lys Asn Glu Thr Ser Gln Asn Lys Asn Phe Trp Asp Trp Leu Val Gly
141 145 150 155 160
act agc act agt tct cct tcc acc gcc acg gca act ggt agt ggt agt ggt agt ggt agt 540
Thr Ser Thr Ser Ser Pro Ser Thr Ala Thr Ala Thr Gly Ser Gly Ser Gly Ser Gly Ser
161 165 170 175 180
ggt agt ggt agt ggt agt gct gcc aca gcc gta tcg gta agt tct gca cag gca aca ttg 600
Gly Ser Gly Ser Gly Ser Ala Ala Thr Ala Val Ser Val Ser Ser Ala Gln Ala Thr Leu
181 185 190 195 200
gat tgc tct acg tat gga acg ttt gat cac gct gat tcc tcg acg ttc cat gac aat aat 660
Asp Cys Ser Thr Tyr Gly Thr Phe Asp His Ala Asp Ser Ser Thr Phe His Asp Asn Asn
201 205 210 215 220
aca gac ttt ttc atc tca tac gct gat acc act ttt gct tca gga atc tgg ggt tat gac 720
Thr Asp Phe Phe Ile Ser Tyr Ala Asp Thr Thr Phe Ala Ser Gly Ile Trp Gly Tyr Asp
221 225 230 235 240
gac gtc att atc gac ggc ata gag gtg aaa gaa ctt tcc ttc gcc gtt gca gac atg acc 780
Asp Val Ile Ile Asp Gly Ile Glu Val Lys Glu Leu Ser Phe Ala Val Ala Asp Met Thr
241 245 250 255 260
aat tcc tct att ggt gtg tta ggt att gga ctg aaa ggc cta gaa tcc aca tat gct agt 840
Asn Ser Ser Ile Gly Val Leu Gly Ile Gly Leu Lys Gly Leu Glu Ser Thr Tyr Ala Ser
261 265 270 275 280
gca tct tcg gtc agt gaa atg tat cag tat gac aat ttg cca gcc aag atg gtc acc gat 900
Ala Ser Ser Val Ser Glu Met Tyr Gln Tyr Asp Asn Leu Pro Ala Lys Met Val Thr Asp
281 285 290 295 300
ggg ttg atc aac aaa aat gca tac tcc ttg tac ttg aac tcc aag gac gcc tca agt ggt 960
Gly Leu Ile Asn Lys Asn Ala Tyr Ser Leu Tyr Leu Asn Ser Lys Asp Ala Ser Ser Gly
301 305 310 315 320
tcc atc ctc ttt gga ggt gtg gat cat gaa aaa tat tcg gga caa ttg ttg aca gtt cca 1020
Ser Ile Leu Phe Gly Gly Val Asp His Glu Lys Tyr Ser Gly Gln Leu Leu Thr Val Pro
321 325 330 335 340
gtc atc aac aca ctc gct tcc agt ggt tac aga gag gca att cgt tta caa att act tta 1080
Val Ile Asn Thr Leu Ala Ser Ser Gly Tyr Arg Glu Ala Ile Arg Leu Gln Ile Thr Leu
341 345 350 355 360
aat gga ata gat gtg aaa aag ggt tct gac cag gga act ctt tta caa ggg aga ttt gct 1140
Asn Gly Ile Asp Val Lys Lys Gly Ser Asp Gln Gly Thr Leu Leu Gln Gly Arg Phe Ala
361 365 370 375 380
gca tta ttg gac tct gga gct acg cta acg tat gct cct tct tct gtt tta aat tca att 1200
Ala Leu Leu Asp Ser Gly Ala Thr Leu Thr Tyr Ala Pro Ser Ser Val Leu Asn Ser Ile
381 385 390 395 400
ggc cgg aac ctg ggc ggc tcc tat gat tcg tca aga caa gct tat acc att cgt tgt gtt 1260
Gly Arg Asn Leu Gly Gly Ser Tyr Asp Ser Ser Arg Gln Ala Tyr Thr Ile Arg Cys Val
402 405 410 415 420
tct gca tca gat acc act tct ctg gta ttc aat ttt ggg ggt gct aca gtg gaa gtt tcc 1320
Ser Ala Ser Asp Thr Thr Ser Leu Val Phe Asn Phe Gly Gly Ala Thr Val Glu Val Ser
421 425 430 435 440
ctg tac gat cta cag att gca aca tat tac acc ggg gga agt gcc acg caa tgt ctt att 1380
Leu Tyr Asp Leu Gln Ile Ala Thr Tyr Tyr Thr Gly Gly Ser Ala Thr Gln Cys Leu Ile 460
441 445 450 455 460
gga ata ttc agc tct gga agt gat gag ttt gtg ctc ggt gat acc ttc ttg agg tca gcc 1440
Gly Ile Phe Ser Ser Gly Ser Asp Glu Phe Val Leu Gly Asp Thr Phe Leu Arg Ser Ala 480
461 465 470 475 480
tac gtg gtt tac gat ctt gat ggg ctt gaa gtg tcg ctt gcc caa gcc aac ttc aac gaa 1500
Tyr Val Val Tyr Asp Leu Asp Gly Leu Glu Val Ser Leu Ala Gln Ala Asn Phe Asn Glu 500
481 485 490 495 500
acc gat tct gat gtt gag gct att acc tcc agt gta cct tcc gct act cgt gca tcc gga 1560
Thr Asp Ser Asp Val Glu Ala Ile Thr Ser Ser Val Pro Ser Ala Thr Arg Ala Ser Gly 520
501 505 510 515 520
tac agt tct aca tgg tct ggt tct gcc agc ggt aca gtt tac act tcg gtt cag atg gaa 1620
Tyr Ser Ser Thr Trp Ser Gly Ser Ala Ser Gly Thr Val Tyr Thr Ser Val Gln Met Glu 540
521 525 530 535 540
tcc ggt gct gct tcc agc tcc aac tct tct gga tcg aat atg ggt tcc tct tcc tca tcg 1680
Ser Gly Ala Ala Ser Ser Ser Asn Ser Ser Gly Ser Asn Met Gly Ser Ser Ser Ser Ser 560
541 545 550 555 560
tcc tct tca tcg tcc tcg act tcc agt gga gac gaa gaa gga ggg agc tcc gcc aac agg 1740
Ser Ser Ser Ser Ser Ser Thr Ser Ser Gly Asp Glu Glu Gly Gly Ser Ser Ala Asn Arg 580
561 565 570 575 580
gtc ccc ttc agc tac ctt tct ctc tgt ttg gta gtt att ctc ggc gtg tgt ata gta tag 1800
Val Pro Phe Ser Tyr Leu Ser Leu Cys Leu Val Val Ile Leu Gly Val Cys Ile Val 599
581 585 590 595
tat aaa agg gcc tac att gga tag gct tca aca ttc ctc aat aaa caa aca tcc aac atc 1860
gcg cat tcc gca ttt cgc att tca cat ttc gcg cct gcc ttc ctt tag gtt ctt tga atc 1920
atc atc aat cgt cgc cgt cta cat cag agc agg act tat ctt tgc ctt ccc caa aaa ttg 1980
cca ctc cgt caa ata gat tct ttt gaa tcc ttg act att ttt gcc taa ata ggt ttt tgt 2040
tag ttt ttc ttc aaa gcc caa aag aaa ctc tat tta gat tca tcc aga aac aat ctt ttt 2100
ctc acc cca ttt cga agt gcc gtg gag cac aga cat aaa aag atg act acc gtt caa cct 2160
aca ggg cca gac agg ctc acc ctg ccg cat att cta ctg gaa ttc aac gat ggc tcc tcg 2220
cag cat gca gtg atc gag cta agc atg aac gag ggg att aat ata tcc acc cat gag tgg 2280
<210> 12
<211> 2829
<212> DNA
<213〉artificial sequence
<220>
<223〉dna sequence dna of recombinant vectors pPICZ α B-YPS1 △
<440> 12
agatctcaca tttcgcgcct gccttccttt aggttctttg aatcatcatc aatcgtcgcc 60
gtctacatca gagcaggact tatctttgcc ttccccaaaa attgccactc cgtcaaatag 120
attcttttga atccttgact atttttgcct aaataggttt ttgttagttt ttcttcaaag 180
cccaaaagaa actctattta gattcatcca gaaacaatct ttttctcacc ccatttcgaa 240
gtgccgtgga gcacagacat aaaaagatga ctaccgttca acctacaggg ccagacaggc 300
tcacccctgc agagctccat tgcgccaacc cccgctctcc agactccttc gttatccagc 360
attcagcatg gacaggttca aaaaataaaa tttcttgata tgggtccact tcaaacatgc 420
gcctacctgt aggaaaaaaa aagagaacat aaatatgccg cgaacagaaa acgtaatgta 480
ctgttctata tataaactgt tcagatcaat cataaattct cagtttcaaa ctttccgctc 540
agccagattg tcgaccatca tcatcatcat cattgagttt gtagccttag acatgactgt 600
tcctcagttc aagttgggca cttacgagaa gaccggtctt gctagattct aatcaagagg 660
atgtcagaat gccatttgcc tgagagatgc aggcttcatt tttgatactt ttttatttgt 720
aacctatata gtataggatt ttttttgtca ttttgtttct tctcgtacga gcttgctcct 780
gatcagccta tctcgcagct gatgaatatc ttgtggtagg ggtttgggaa aatcattcga 840
gtttgatgtt tttcttggta tttcccactc ctcttcagag tacagaagat taagtgagac 900
cttcgtttgt gcggatcccc cacacaccat agcttcaaaa tgtttctact ccttttttac 960
tcttccagat tttctcggac tccgcgcatc gccgtaccac ttcaaaacac ccaagcacag 1020
catactaaat tttccctctt tcttcctcta gggtgtcgtt aattacccgt actaaaggtt 1080
tggaaaagaa aaaagagacc gcctcgtttc tttttcttcg tcgaaaaagg caataaaaat 1140
ttttatcacg tttctttttc ttgaaatttt tttttttagt ttttttctct ttcagtgacc 1200
tccattgata tttaagttaa taaacggtct tcaatttctc aagtttcagt ttcatttttc 1260
ttgttctatt acaacttttt ttacttcttg ttcattagaa agaaagcata gcaatctaat 1320
ctaaggggcg gtgttgacaa ttaatcatcg gcatagtata tcggcatagt ataatacgac 1380
aaggtgagga actaaaccat ggccaagttg accagtgccg ttccggtgct caccgcgcgc 1440
gacgtcgccg gagcggtcga gttctggacc gaccggctcg ggttctcccg ggacttcgtg 1500
gaggacgact tcgccggtgt ggtccgggac gacgtgaccc tgttcatcag cgcggtccag 1560
gaccaggtgg tgccggacaa caccctggcc tgggtgtggg tgcgcggcct ggacgagctg 1620
tacgccgagt ggtcggaggt cgtgtccacg aacttccggg acgcctccgg gccggccatg 1680
accgagatcg gcgagcagcc gtgggggcgg gagttcgccc tgcgcgaccc ggccggcaac 1740
tgcgtgcact tcgtggccga ggagcaggac tgacacgtcc gacggcggcc cacgggtccc 1800
aggcctcgga gatccgtccc ccttttcctt tgtcgatatc atgtaattag ttatgtcacg 1860
cttacattca cgccctcccc ccacatccgc tctaaccgaa aaggaaggag ttagacaacc 1920
tgaagtctag gtccctattt atttttttat agttatgtta gtattaagaa cgttatttat 1980
atttcaaatt tttctttttt ttctgtacag acgcgtgtac gcatgtaaca ttatactgaa 2040
aaccttgctt gagaaggttt tgggacgctc gaaggcttta atttgcaagc tggagaccaa 2100
catgtgagca aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt 2160
tttccatagg ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg 2220
gcgaaacccg acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg 2280
ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag 2340
cgtggcgctt tctcaatgct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc 2400
caagctgggc tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa 2460
ctatcgtctt gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg 2520
taacaggatt agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc 2580
taactacggc tacactagaa ggacagtatt tggtatctgc gctctgctga agccagttac 2640
cttcggaaaa agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg 2700
tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt 2760
gatcttttct acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt 2820
catgagatc 2829
<210> 13
<211> 1878
<212> DNA
<213〉artificial sequence
<220>
<221> CDS
<222> (1)~(1878)
<223〉HSA/PTH(1-34) gene order of fusion rotein and protein sequence
<400> 13
gat gca cac aag agt gag gtt gct cat cgg ttt aaa gat ttg gga gaa 48
Asp Ala His Lys Ser Glu Val Ala His Arg Phe Lys Asp Leu Gly Glu
1 5 10 15
gaa aat ttc aaa gcc ttg gtg ttg att gcc ttt gct cag tat ctt cag 96
Glu Asn Phe Lys Ala Leu Val Leu Ile Ala Phe Ala Gln Tyr Leu Gln
20 25 30
cag tgt cca ttt gaa gat cat gta aaa tta gtg aat gaa gta act gaa 144
Gln Cys Pro Phe Glu Asp His Val Lys Leu Val Asn Glu Val Thr Glu
35 40 45
ttt gca aaa aca tgt gtt gct gat gag tca gct gaa aat tgt gac aaa 192
Phe Ala Lys Thr Cys Val Ala Asp Glu Ser Ala Glu Asn Cys Asp Lys
50 55 60
tca ctt cat acc ctt ttt gga gac aaa tta tgc aca gtt gca act ctt 240
Ser Leu His Thr Leu Phe Gly Asp Lys Leu Cys Thr Val Ala Thr Leu
65 70 75 80
cgt gaa acc tat ggt gaa atg gct gac tgc tgt gca aaa caa gaa cct 288
Arg Glu Thr Tyr Gly Glu Met Ala Asp Cys Cys Ala Lys Gln Glu Pro
85 90 95
gag aga aat gaa tgc ttc ttg caa cac aaa gat gac aac cca aac ctc 336
Glu Arg Asn Glu Cys Phe Leu Gln His Lys Asp Asp Asn Pro Asn Leu
100 105 110
ccc cga ttg gtg aga cca gag gtt gat gtg atg tgc act gct ttt cat 384
Pro Arg Leu Val Arg Pro Glu Val Asp Val Met Cys Thr Ala Phe His
115 120 125
gac aat gaa gag aca ttt ttg aaa aaa tac tta tat gaa att gcc aga 432
Asp Asn Glu Glu Thr Phe Leu Lys Lys Tyr Leu Tyr Glu Ile Ala Arg
130 135 140
aga cat cct tac ttt tat gcc ccg gaa ctc ctt ttc ttt gct aaa agg 480
Arg His Pro Tyr Phe Tyr Ala Pro Glu Leu Leu Phe Phe Ala Lys Arg
145 150 155 160
tat aaa gct gct ttt aca gaa tgt tgc caa gct gct gat aaa gct gcc 528
Tyr Lys Ala Ala Phe Thr Glu Cys Cys Gln Ala Ala Asp Lys Ala Ala
165 170 175
tgc ctg ttg cca aag ctc gat gaa ctt cgg gat gaa ggg aag gct tcg 576
Cys Leu Leu Pro Lys Leu Asp Glu Leu Arg Asp Glu Gly Lys Ala Ser
180 185 190
tct gcc aaa cag aga ctc aag tgt gcc agt ctc caa aaa ttt gga gaa 624
Ser Ala Lys Gln Arg Leu Lys Cys Ala Ser Leu Gln Lys Phe Gly Glu
195 200 205
aga gct ttc aaa gca tgg gca gta gct cgc ctg agc cag aga ttt ccc 672
Arg Ala Phe Lys Ala Trp Ala Val Ala Arg Leu Ser Gln Arg Phe Pro
210 215 220
aaa gct gag ttt gca gaa gtt tcc aag tta gtg aca gat ctt acc aaa 720
Lys Ala Glu Phe Ala Glu Val Ser Lys Leu Val Thr Asp Leu Thr Lys
225 230 235 240
gtc cac acg gaa tgc tgc cat gga gat ctg ctt gaa tgt gct gat gac 768
Val His Thr Glu Cys Cys His Gly Asp Leu Leu Glu Cys Ala Asp Asp
245 250 255
agg gcg gac ctt gcc aag tat atc tgt gaa aat caa gat tcg atc tcc 816
Arg Ala Asp Leu Ala Lys Tyr Ile Cys Glu Asn Gln Asp Ser Ile Ser
260 265 270
agt aaa ctg aag gaa tgc tgt gaa aaa cct ctg ttg gaa aaa tcc cac 864
Ser Lys Leu Lys Glu Cys Cys Glu Lys Pro Leu Leu Glu Lys Ser His
275 280 285
tgc att gcc gaa gtg gaa aat gat gag atg cct gct gac ttg cct tca 912
Cys Ile Ala Glu Val Glu Asn Asp Glu Met Pro Ala Asp Leu Pro Ser
290 295 300
tta gct gct gat ttt gtt gaa agt aag gat gtt tgc aaa aac tat gct 960
Leu Ala Ala Asp Phe Val Glu Ser Lys Asp Val Cys Lys Asn Tyr Ala
305 310 315 320
gag gca aag gat gtc ttc ctg ggc atg ttt ttg tat gaa tat gca aga 1008
Glu Ala Lys Asp Val Phe Leu Gly Met Phe Leu Tyr Glu Tyr Ala Arg
325 330 335
agg cat cct gat tac tct gtc gtg ctg ctg ctg aga ctt gcc aag aca 1056
Arg His Pro Asp Tyr Ser Val Val Leu Leu Leu Arg Leu Ala Lys Thr
340 345 350
tat gaa acc act cta gag aag tgc tgt gcc gct gca gat cct cat gaa 1104
Tyr Glu Thr Thr Leu Glu Lys Cys Cys Ala Ala Ala Asp Pro His Glu
355 360 365
tgc tat gcc aaa gtg ttc gat gaa ttt aaa cct ctt gtg gaa gag cct 1152
Cys Tyr Ala Lys Val Phe Asp Glu Phe Lys Pro Leu Val Glu Glu Pro
370 375 380
cag aat tta atc aaa caa aat tgt gag ctt ttt gag cag ctt gga gag 1200
Gln Asn Leu Ile Lys Gln Asn Cys Glu Leu Phe Glu Gln Leu Gly Glu
385 390 395 400
tac aaa ttc cag aat gcg cta tta gtt cgt tac acc aag aaa gta ccc 1248
Tyr Lys Phe Gln Asn Ala Leu Leu Val Arg Tyr Thr Lys Lys Val Pro
405 410 415
caa gtg tca act cca act ctt gta gag gtc tca aga aac cta gga aaa 1296
Gln Val Ser Thr Pro Thr Leu Val Glu Val Ser Arg Asn Leu Gly Lys
420 425 430
gtg ggc agc aaa tgt tgt aaa cat cct gaa gca aaa aga atg ccc tgt 1344
Val Gly Ser Lys Cys Cys Lys His Pro Glu Ala Lys Arg Met Pro Cys
435 440 445
gca gaa gac tat cta tcc gtg gtc ctg aac cag tta tgt gtg ttg cat 1392
Ala Glu Asp Tyr Leu Ser Val Val Leu Asn Gln Leu Cys Val Leu His
450 455 460
gag aaa acg cca gta agt gac aga gtc acc aaa tgc tgc aca gaa tcc 1440
Glu Lys Thr Pro Val Ser Asp Arg Val Thr Lys Cys Cys Thr Glu Ser
465 470 475 480
ttg gtg aac agg cga cca tgc ttt tca gct ctg gaa gtc gat gaa aca 1488
Arg Pro Cys Phe Ser Ala Leu Glu Val Asp Glu Thr Tyr Val Pro Lys
485 490 495
tac gtt ccc aaa gag ttt aat gct gaa aca ttc acc ttc cat gca gat 1536
Leu Val Asn Arg Glu Phe Asn Ala Glu Thr Phe Thr Phe His Ala Asp
500 505 510
ata tgc aca ctt tct gag aag gag aga caa atc aag aaa caa act gca 1584
Ile Cys Thr Leu Ser Glu Lys Glu Arg Gln Ile Lys Lys Gln Thr Ala
515 520 525
ctt gtt gag ctc gtg aaa cac aag ccc aag gca aca aaa gag caa ctg 1632
Leu Val Glu Leu Val Lys His Lys Pro Lys Ala Thr Lys Glu Gln Leu
530 535 540
aaa gct gtt atg gat gat ttc gca gct ttt gta gag aag tgc tgc aag 1680
Lys Ala Val Met Asp Asp Phe Ala Ala Phe Val Glu Lys Cys Cys Lys
545 550 555 560
gct gac gat aag gag acc tgc ttt gcc gag gag ggt aaa aaa ctt gtt 1728
Ala Asp Asp Lys Glu Thr Cys Phe Ala Glu Glu Gly Lys Lys Leu Val
565 570 575
gct gca agt caa gct gcc tta ggc tta gga tcc ggt ggt ggt ggt agt 1776
Ala Ala Ser Gln Ala Ala Leu Gly Leu Gly Ser Gly Gly Gly Gly Ser
580 585 590
tct gtt tct gaa atc cag ctg atg cac aac ctt ggt aaa cac ctg aac 1824
Ser Val Ser Glu Ile Gln Leu Met His Asn Leu Glu Lys His Leu Asn
595 600 605
tcc atg gaa cgt gtt gaa tgg ctg cgt aag aaa ctg cag gat gtc cat 1872
Ser Met Glu Arg Val Glu Trp Leu Arg Lys Lys Leu Gln Asp Val His
610 615 620
aac ttc
Asn Phe
625
Claims (3)
1. method for preparing human serum albumin-human parathyroid hormone, realize by following steps:
(a) design
YPS15 ' end homology arm and 3 ' end homology arm amplimer, and from the pichia spp genome, effectively amplify this two homology arm sequences; Wherein be used for amplification
YPS1Upstream primer sequence SEQ ID the NO.1:5 '-aagc of 5 ' end homology arm sequence
CtgcagAgctccattg cgccaacccc-3 ', the line part is
PstThe I restriction enzyme site, downstream primer sequence SEQ ID NO.2:5 '-cggc
GtcgacAatctggctg agcggaaagt ttga-3 ', the line part is
SalThe I restriction enzyme site is used for amplification
YPS1Upstream primer sequence SEQ ID the NO.3:5 '-ccc of 3 ' end homology arm sequence
AgatctC acatttcgcg cctgccttcc t-3 ', the line part is
BglThe II restriction enzyme site, downstream primer sequence SEQ ID NO.4:5 '-aaa
CtgcagG ggtgagcctg tctggccct-3 ', the line part is
PstThe I restriction enzyme site;
(b) these two homology arm sequences that obtain in the step (a) are inserted among the carrier pPICZ α B according to sequencing, obtain to be used for the recombinant vectors pPICZ α B-of subsequent homo reorganization
YPS1△, sequence is SEQ ID NO.12;
(c) with the recombinant vectors that makes up in the step (b) with single digestion with restriction enzyme after transfection SMD1168, cultivation screening recon with the selective pressure that contains Zeocin, with the exactness of corresponding positive primer and negative primer checking homologous recombination, correct defective type recon is designated as SMD1168
Yps1△; Wherein negative primer sequence is SEQ ID NO.5:5 '-ctc ctt cca ccg cca cgg c-3 ', SEQ ID NO.6:5 '-acc act tga ggc gtc ctt gga-3 ', positive primer sequence is SEQ ID NO.7:5 '-atg gag gag ttg agg acc-3 ', SEQ ID NO.8:5 '-gca aat ggc att ctg aca tcc-3 ';
(d) utilize the protease-deficient host SMD1168 that obtains in the step (c)
Yps1△ expresses corresponding proteolytic ferment responsive type recombinant protein: human serum albumin-human parathyroid hormone (1-34) fusion rotein, SEQ ID NO:13 is the dna sequence dna of this fusion rotein.
2. a kind of method for preparing human serum albumin-human parathyroid hormone according to claim 1 is characterized in that, in the step (a), provides
YPS1The gene order SEQ ID NO.9 of 5 ' end homology arm,
YPS1The gene order of the gene order SEQ ID NO.10 of 3 ' end homology arm and coding yapsin 1
YPS1SEQ ID NO.11.
3. a kind of method for preparing human serum albumin-human parathyroid hormone according to claim 1, it is characterized in that, in the step (b), yeast host GS115, SMD1168 and expression vector pPICZ α B buy from Invitrogen Corp.San Diego.California. USA company.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 201210134869 CN102676563B (en) | 2012-05-03 | 2012-05-03 | Method for preparing human serum albumin-human parathyroid hormone |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 201210134869 CN102676563B (en) | 2012-05-03 | 2012-05-03 | Method for preparing human serum albumin-human parathyroid hormone |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102676563A CN102676563A (en) | 2012-09-19 |
CN102676563B true CN102676563B (en) | 2013-09-18 |
Family
ID=46809115
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 201210134869 Expired - Fee Related CN102676563B (en) | 2012-05-03 | 2012-05-03 | Method for preparing human serum albumin-human parathyroid hormone |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102676563B (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103173477A (en) * | 2012-11-23 | 2013-06-26 | 江南大学 | Method for stably expressing recombined human serum albumin and human parathyroid hormone (1-34)ab diad fusion protein by using saccharomyces cerevisiae |
US10337022B2 (en) * | 2014-11-04 | 2019-07-02 | Agresearch Limited | Methods of increasing root biomass in plants |
US10647975B2 (en) | 2017-10-03 | 2020-05-12 | Bolt Threads, Inc. | Modified strains for the production of recombinant silk |
AU2017434920B2 (en) * | 2017-10-03 | 2022-02-10 | Bolt Threads, Inc . | Modified strains for the production of recombinant silk |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1597965A (en) * | 2004-09-14 | 2005-03-23 | 浙江大学 | Man's serum albumin with man's parathormone (1-34) fusion protein and its application |
CN101063125A (en) * | 2007-04-17 | 2007-10-31 | 江南大学 | Preparation method for fused protein of human parathyroid hormone and human serum albumins and its product |
-
2012
- 2012-05-03 CN CN 201210134869 patent/CN102676563B/en not_active Expired - Fee Related
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1597965A (en) * | 2004-09-14 | 2005-03-23 | 浙江大学 | Man's serum albumin with man's parathormone (1-34) fusion protein and its application |
CN101063125A (en) * | 2007-04-17 | 2007-10-31 | 江南大学 | Preparation method for fused protein of human parathyroid hormone and human serum albumins and its product |
Non-Patent Citations (4)
Title |
---|
Engineering of protein secretion in yeast: strategies and impact on protein production;Idiris A等;《APPLIED MICROBIOLOGY AND BIOTECHNOLOGY》;20100301;第86卷(第2期);403-417 * |
Idiris A等.Engineering of protein secretion in yeast: strategies and impact on protein production.《APPLIED MICROBIOLOGY AND BIOTECHNOLOGY》.2010,第86卷(第2期),403-417. |
人血清白蛋白与PTH(1-34)融合蛋白在毕赤酵母中的分泌表达及鉴定;陈静等;《浙江大学学报(医学版)》;20080325;第37卷(第02期);126-133 * |
陈静等.人血清白蛋白与PTH(1-34)融合蛋白在毕赤酵母中的分泌表达及鉴定.《浙江大学学报(医学版)》.2008,第37卷(第02期),126-133. |
Also Published As
Publication number | Publication date |
---|---|
CN102676563A (en) | 2012-09-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Hallewell et al. | Amino terminal acetylation of authentic human Cu, Zn superoxide dismutase produced in yeast | |
Lisowsky | Dual function of a new nuclear gene for oxidative phosphorylation and vegetative growth in yeast | |
CN102676563B (en) | Method for preparing human serum albumin-human parathyroid hormone | |
Hashimoto et al. | Expression of the bovine heart mitochondrial ADP/ATP carrier in yeast mitochondria: significantly enhanced expression by replacement of the N-terminal region of the bovine carrier by the corresponding regions of the yeast carriers | |
CN101084233A (en) | Methods for reducing or eliminating alpha-mannosidase resistant glycans for the production of glycoproteins | |
KR20160002692A (en) | Recombinant Micro-organism For Use In Method With Increased Product Yield | |
ES2464740T3 (en) | Method to generate a genetically modified microbe | |
CN102782130B (en) | High-expression promoter derived from kluyveromyces marxianus | |
CN109706179B (en) | Porcine parvovirus infectious cloning system stably carrying genetic marker and construction method and application thereof | |
CN110042068B (en) | Method for improving enterokinase secretion expression quantity by modifying signal peptide | |
CN111321167B (en) | Construction method and application of heterologous protein expressed rolling circle replication recombinant vector | |
CN110628795B (en) | Cell enrichment technology using inactivated screening agent resistance gene as report system for A.G base substitution and application thereof | |
Chow et al. | Determination of the cDNA sequence for the human mitochondrial 75‐kDa Fe–S protein of NADH‐coenzyme Q reductase | |
CN111363711B (en) | Method for producing lysine by adsorption immobilized fermentation of recombinant corynebacterium glutamicum | |
CN113502297B (en) | Recombinant pichia pastoris for synthesizing guanosine diphosphate fucose, construction method and application thereof | |
Hatanaka et al. | Significant expression of functional human type 1 mitochondrial ADP/ATP carrier in yeast mitochondria | |
CN108517321B (en) | Corynebacterium inducible promoter, expression vector containing same and application | |
CN110256577B (en) | Fused albumin nanoparticle and application thereof | |
CN111269930A (en) | Method for detecting genetic stability of filamentous fungus transformation system | |
Monia et al. | A human ubiquitin carboxyl extension protein functions in yeast. | |
WO2020182768A1 (en) | Methods for bioactive lectin production with improved yield | |
ES2905776T3 (en) | Increased production of triacylglycerol in microalgae | |
KR20210120027A (en) | ABC transporter for high-efficiency production of rebaudioside | |
CN110373424A (en) | A kind of production method of the cold-resistant rice containing HVUL5H49059.2 gene | |
CN112105735A (en) | Nucleic acids for mite allergy treatment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20130918 |