CN113005111B - Novel glycerol mono-diacyl ester lipase - Google Patents
Novel glycerol mono-diacyl ester lipase Download PDFInfo
- Publication number
- CN113005111B CN113005111B CN202011499617.4A CN202011499617A CN113005111B CN 113005111 B CN113005111 B CN 113005111B CN 202011499617 A CN202011499617 A CN 202011499617A CN 113005111 B CN113005111 B CN 113005111B
- Authority
- CN
- China
- Prior art keywords
- seq
- val
- ala
- amino acid
- polypeptide
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 title claims description 49
- 150000002148 esters Chemical class 0.000 title claims description 18
- 108090001060 Lipase Proteins 0.000 title description 18
- 102000004882 Lipase Human genes 0.000 title description 18
- 239000004367 Lipase Substances 0.000 title description 18
- 235000019421 lipase Nutrition 0.000 title description 18
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 131
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 126
- 229920001184 polypeptide Polymers 0.000 claims abstract description 124
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract description 70
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 62
- 239000002157 polynucleotide Substances 0.000 claims abstract description 62
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 62
- 239000013604 expression vector Substances 0.000 claims abstract description 36
- 235000019626 lipase activity Nutrition 0.000 claims abstract description 19
- 238000000034 method Methods 0.000 claims description 43
- 239000013612 plasmid Substances 0.000 claims description 34
- 239000002773 nucleotide Substances 0.000 claims description 33
- 125000003729 nucleotide group Chemical group 0.000 claims description 33
- 230000007062 hydrolysis Effects 0.000 claims description 16
- 238000006460 hydrolysis reaction Methods 0.000 claims description 16
- 238000003786 synthesis reaction Methods 0.000 claims description 15
- 230000001105 regulatory effect Effects 0.000 claims description 14
- 230000015572 biosynthetic process Effects 0.000 claims description 13
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 12
- 235000014113 dietary fatty acids Nutrition 0.000 claims description 6
- 229930195729 fatty acid Natural products 0.000 claims description 6
- 239000000194 fatty acid Substances 0.000 claims description 6
- 150000004665 fatty acids Chemical class 0.000 claims description 6
- 238000005809 transesterification reaction Methods 0.000 claims description 6
- OGBUMNBNEWYMNJ-UHFFFAOYSA-N batilol Chemical class CCCCCCCCCCCCCCCCCCOCC(O)CO OGBUMNBNEWYMNJ-UHFFFAOYSA-N 0.000 claims description 5
- LDVVTQMJQSCDMK-UHFFFAOYSA-N 1,3-dihydroxypropan-2-yl formate Chemical compound OCC(CO)OC=O LDVVTQMJQSCDMK-UHFFFAOYSA-N 0.000 claims description 4
- 230000002194 synthesizing effect Effects 0.000 claims description 2
- 238000006467 substitution reaction Methods 0.000 abstract description 25
- 238000007792 addition Methods 0.000 abstract description 12
- 238000012217 deletion Methods 0.000 abstract description 12
- 230000037430 deletion Effects 0.000 abstract description 12
- 102000005398 Monoacylglycerol Lipase Human genes 0.000 abstract 1
- 108020002334 Monoacylglycerol lipase Proteins 0.000 abstract 1
- 102000004190 Enzymes Human genes 0.000 description 50
- 108090000790 Enzymes Proteins 0.000 description 50
- 235000001014 amino acid Nutrition 0.000 description 49
- 210000004027 cell Anatomy 0.000 description 45
- 230000000694 effects Effects 0.000 description 43
- 229940024606 amino acid Drugs 0.000 description 36
- 108090000623 proteins and genes Proteins 0.000 description 32
- 150000001413 amino acids Chemical class 0.000 description 31
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 27
- 108020004414 DNA Proteins 0.000 description 20
- 102000004169 proteins and genes Human genes 0.000 description 20
- 235000018102 proteins Nutrition 0.000 description 19
- 238000012216 screening Methods 0.000 description 18
- 101710197359 Mono- and diacylglycerol lipase Proteins 0.000 description 16
- 238000006243 chemical reaction Methods 0.000 description 16
- 239000002609 medium Substances 0.000 description 16
- 108091028043 Nucleic acid sequence Proteins 0.000 description 14
- 235000011187 glycerol Nutrition 0.000 description 14
- 230000035772 mutation Effects 0.000 description 14
- 150000007523 nucleic acids Chemical class 0.000 description 14
- 238000000855 fermentation Methods 0.000 description 13
- 230000004151 fermentation Effects 0.000 description 13
- 239000000047 product Substances 0.000 description 13
- 239000000243 solution Substances 0.000 description 13
- 241000588724 Escherichia coli Species 0.000 description 12
- 239000013598 vector Substances 0.000 description 12
- 239000012634 fragment Substances 0.000 description 11
- XZKIHKMTEMTJQX-UHFFFAOYSA-N 4-Nitrophenyl Phosphate Chemical compound OP(O)(=O)OC1=CC=C([N+]([O-])=O)C=C1 XZKIHKMTEMTJQX-UHFFFAOYSA-N 0.000 description 10
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 10
- FBVHRDXSCYELMI-PBCZWWQYSA-N His-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O FBVHRDXSCYELMI-PBCZWWQYSA-N 0.000 description 10
- 241000235058 Komagataella pastoris Species 0.000 description 10
- 108010061238 threonyl-glycine Proteins 0.000 description 10
- -1 diglycerides Chemical class 0.000 description 9
- 102000039446 nucleic acids Human genes 0.000 description 9
- 108020004707 nucleic acids Proteins 0.000 description 9
- 239000006228 supernatant Substances 0.000 description 9
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 8
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 8
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 8
- 238000009396 hybridization Methods 0.000 description 8
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 8
- AFSHUZFNMVJNKX-UHFFFAOYSA-N 1,2-di-(9Z-octadecenoyl)glycerol Natural products CCCCCCCCC=CCCCCCCCC(=O)OCC(CO)OC(=O)CCCCCCCC=CCCCCCCCC AFSHUZFNMVJNKX-UHFFFAOYSA-N 0.000 description 7
- AFSHUZFNMVJNKX-LLWMBOQKSA-N 1,2-dioleoyl-sn-glycerol Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OC[C@H](CO)OC(=O)CCCCCCC\C=C/CCCCCCCC AFSHUZFNMVJNKX-LLWMBOQKSA-N 0.000 description 7
- 102220602057 Argininosuccinate lyase_S229R_mutation Human genes 0.000 description 7
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 7
- 229960000723 ampicillin Drugs 0.000 description 7
- 238000010367 cloning Methods 0.000 description 7
- 102220195236 rs1057518512 Human genes 0.000 description 7
- 102200136979 rs879253833 Human genes 0.000 description 7
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 6
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 6
- 125000000539 amino acid group Chemical group 0.000 description 6
- 230000003321 amplification Effects 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 6
- 230000001580 bacterial effect Effects 0.000 description 6
- 239000002585 base Substances 0.000 description 6
- 230000002255 enzymatic effect Effects 0.000 description 6
- 230000001976 improved effect Effects 0.000 description 6
- 238000003199 nucleic acid amplification method Methods 0.000 description 6
- 239000013600 plasmid vector Substances 0.000 description 6
- 238000000746 purification Methods 0.000 description 6
- 238000012163 sequencing technique Methods 0.000 description 6
- 210000005253 yeast cell Anatomy 0.000 description 6
- SCPRYBYMKVYVND-UHFFFAOYSA-N 2-[[2-[[1-(2-amino-4-methylpentanoyl)pyrrolidine-2-carbonyl]amino]-4-methylpentanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)CC(N)C(=O)N1CCCC1C(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(O)=O SCPRYBYMKVYVND-UHFFFAOYSA-N 0.000 description 5
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 5
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 5
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 5
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 5
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 5
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 5
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 5
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 5
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 5
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 5
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 5
- 108091026890 Coding region Proteins 0.000 description 5
- VZKXOWRNJDEGLZ-WHFBIAKZSA-N Cys-Asp-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O VZKXOWRNJDEGLZ-WHFBIAKZSA-N 0.000 description 5
- KXUKWRVYDYIPSQ-CIUDSAMLSA-N Cys-Leu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUKWRVYDYIPSQ-CIUDSAMLSA-N 0.000 description 5
- WZJLBUPPZRZNTO-CIUDSAMLSA-N Cys-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N WZJLBUPPZRZNTO-CIUDSAMLSA-N 0.000 description 5
- BPDVTFBJZNBHEU-HGNGGELXSA-N Glu-Ala-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 BPDVTFBJZNBHEU-HGNGGELXSA-N 0.000 description 5
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 5
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 5
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 5
- DJTXYXZNNDDEOU-WHFBIAKZSA-N Gly-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)C(=O)N DJTXYXZNNDDEOU-WHFBIAKZSA-N 0.000 description 5
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 5
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 5
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 5
- MITYXXNZSZLHGG-OBAATPRFSA-N Ile-Trp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N MITYXXNZSZLHGG-OBAATPRFSA-N 0.000 description 5
- 241001506991 Komagataella phaffii GS115 Species 0.000 description 5
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 5
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 5
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 5
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 5
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 5
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 5
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 5
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 5
- BCRQJDMZQUHQSV-STQMWFEESA-N Met-Gly-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BCRQJDMZQUHQSV-STQMWFEESA-N 0.000 description 5
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 5
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 5
- OYQBFWWQSVIHBN-FHWLQOOXSA-N Phe-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O OYQBFWWQSVIHBN-FHWLQOOXSA-N 0.000 description 5
- DMEYUTSDVRCWRS-ULQDDVLXSA-N Phe-Lys-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DMEYUTSDVRCWRS-ULQDDVLXSA-N 0.000 description 5
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 5
- KUSYCSMTTHSZOA-DZKIICNBSA-N Phe-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N KUSYCSMTTHSZOA-DZKIICNBSA-N 0.000 description 5
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 5
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 5
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 5
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 5
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 5
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 5
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 5
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 5
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 5
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 5
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 5
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 5
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 5
- NWQCKAPDGQMZQN-IHPCNDPISA-N Trp-Lys-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O NWQCKAPDGQMZQN-IHPCNDPISA-N 0.000 description 5
- ARKBYVBCEOWRNR-UBHSHLNASA-N Trp-Ser-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O ARKBYVBCEOWRNR-UBHSHLNASA-N 0.000 description 5
- UUZYQOUJTORBQO-ZVZYQTTQSA-N Trp-Val-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 UUZYQOUJTORBQO-ZVZYQTTQSA-N 0.000 description 5
- BXPOOVDVGWEXDU-WZLNRYEVSA-N Tyr-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXPOOVDVGWEXDU-WZLNRYEVSA-N 0.000 description 5
- KUXCBJFJURINGF-PXDAIIFMSA-N Tyr-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=C(C=C3)O)N KUXCBJFJURINGF-PXDAIIFMSA-N 0.000 description 5
- NMANTMWGQZASQN-QXEWZRGKSA-N Val-Arg-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N NMANTMWGQZASQN-QXEWZRGKSA-N 0.000 description 5
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 5
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 5
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 5
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 5
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 5
- 108010005233 alanylglutamic acid Proteins 0.000 description 5
- 108010077245 asparaginyl-proline Proteins 0.000 description 5
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 5
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 5
- 108010047857 aspartylglycine Proteins 0.000 description 5
- 108010060199 cysteinylproline Proteins 0.000 description 5
- 239000003623 enhancer Substances 0.000 description 5
- 238000000605 extraction Methods 0.000 description 5
- 108010049041 glutamylalanine Proteins 0.000 description 5
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 5
- 108010034529 leucyl-lysine Proteins 0.000 description 5
- 108010057821 leucylproline Proteins 0.000 description 5
- 108010064235 lysylglycine Proteins 0.000 description 5
- 229910052757 nitrogen Inorganic materials 0.000 description 5
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 5
- 238000003259 recombinant expression Methods 0.000 description 5
- 230000006798 recombination Effects 0.000 description 5
- 238000005215 recombination Methods 0.000 description 5
- 239000000126 substance Substances 0.000 description 5
- 108010020532 tyrosyl-proline Proteins 0.000 description 5
- 239000004475 Arginine Substances 0.000 description 4
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 4
- 239000011665 D-biotin Substances 0.000 description 4
- 235000000638 D-biotin Nutrition 0.000 description 4
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 4
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 4
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 4
- 239000001888 Peptone Substances 0.000 description 4
- 108010080698 Peptones Proteins 0.000 description 4
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 4
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 4
- 235000004279 alanine Nutrition 0.000 description 4
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 4
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 4
- 235000011130 ammonium sulphate Nutrition 0.000 description 4
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 4
- 229940098773 bovine serum albumin Drugs 0.000 description 4
- 229940041514 candida albicans extract Drugs 0.000 description 4
- 230000003197 catalytic effect Effects 0.000 description 4
- 238000012258 culturing Methods 0.000 description 4
- 230000032050 esterification Effects 0.000 description 4
- 238000005886 esterification reaction Methods 0.000 description 4
- 210000003527 eukaryotic cell Anatomy 0.000 description 4
- 230000001939 inductive effect Effects 0.000 description 4
- 239000007788 liquid Substances 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- 235000019319 peptone Nutrition 0.000 description 4
- 108091008146 restriction endonucleases Proteins 0.000 description 4
- 239000000523 sample Substances 0.000 description 4
- 239000011780 sodium chloride Substances 0.000 description 4
- 239000001509 sodium citrate Substances 0.000 description 4
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- 230000001131 transforming effect Effects 0.000 description 4
- 230000014616 translation Effects 0.000 description 4
- XPFJYKARVSSRHE-UHFFFAOYSA-K trisodium;2-hydroxypropane-1,2,3-tricarboxylate;2-hydroxypropane-1,2,3-tricarboxylic acid Chemical compound [Na+].[Na+].[Na+].OC(=O)CC(O)(C(O)=O)CC(O)=O.[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O XPFJYKARVSSRHE-UHFFFAOYSA-K 0.000 description 4
- 239000004474 valine Substances 0.000 description 4
- 238000005406 washing Methods 0.000 description 4
- 239000012138 yeast extract Substances 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- 102100036826 Aldehyde oxidase Human genes 0.000 description 3
- BIOCIVSVEDFKDJ-GUBZILKMSA-N Arg-Arg-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O BIOCIVSVEDFKDJ-GUBZILKMSA-N 0.000 description 3
- 108020004705 Codon Proteins 0.000 description 3
- CIVXDCMSSFGWAL-YUMQZZPRSA-N Cys-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N CIVXDCMSSFGWAL-YUMQZZPRSA-N 0.000 description 3
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 3
- 101000928314 Homo sapiens Aldehyde oxidase Proteins 0.000 description 3
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 3
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 3
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 3
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 3
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 3
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 3
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 3
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 3
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 3
- 239000004472 Lysine Substances 0.000 description 3
- 241000235648 Pichia Species 0.000 description 3
- 108020004511 Recombinant DNA Proteins 0.000 description 3
- MOCXXGZHHSPNEJ-AVGNSLFASA-N Tyr-Cys-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O MOCXXGZHHSPNEJ-AVGNSLFASA-N 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- 238000005119 centrifugation Methods 0.000 description 3
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Natural products OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 description 3
- 239000012154 double-distilled water Substances 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 108020001507 fusion proteins Proteins 0.000 description 3
- 102000037865 fusion proteins Human genes 0.000 description 3
- 230000012010 growth Effects 0.000 description 3
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 3
- 229960000310 isoleucine Drugs 0.000 description 3
- 101150037796 mdlA gene Proteins 0.000 description 3
- 229930182817 methionine Natural products 0.000 description 3
- 229920000642 polymer Polymers 0.000 description 3
- 238000004321 preservation Methods 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000010188 recombinant method Methods 0.000 description 3
- 238000002741 site-directed mutagenesis Methods 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 238000001890 transfection Methods 0.000 description 3
- 150000003626 triacylglycerols Chemical class 0.000 description 3
- SBKVPJHMSUXZTA-MEJXFZFPSA-N (2S)-2-[[(2S)-2-[[(2S)-1-[(2S)-5-amino-2-[[2-[[(2S)-1-[(2S)-6-amino-2-[[(2S)-2-[[(2S)-5-amino-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-amino-3-(1H-indol-3-yl)propanoyl]amino]-3-(1H-imidazol-4-yl)propanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-4-methylpentanoyl]amino]-5-oxopentanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]pyrrolidine-2-carbonyl]amino]acetyl]amino]-5-oxopentanoyl]pyrrolidine-2-carbonyl]amino]-4-methylsulfanylbutanoyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@@H](C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CNC=N1 SBKVPJHMSUXZTA-MEJXFZFPSA-N 0.000 description 2
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 2
- OYIFNHCXNCRBQI-UHFFFAOYSA-N 2-aminoadipic acid Chemical compound OC(=O)C(N)CCCC(O)=O OYIFNHCXNCRBQI-UHFFFAOYSA-N 0.000 description 2
- PECYZEOJVXMISF-UHFFFAOYSA-N 3-aminoalanine Chemical compound [NH3+]CC(N)C([O-])=O PECYZEOJVXMISF-UHFFFAOYSA-N 0.000 description 2
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- 101150061183 AOX1 gene Proteins 0.000 description 2
- 229920000936 Agarose Polymers 0.000 description 2
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 2
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 108700010070 Codon Usage Proteins 0.000 description 2
- 102000012410 DNA Ligases Human genes 0.000 description 2
- 108010061982 DNA Ligases Proteins 0.000 description 2
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 2
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 2
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 2
- 239000004471 Glycine Substances 0.000 description 2
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 2
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- PMMYEEVYMWASQN-DMTCNVIQSA-N Hydroxyproline Chemical compound O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 description 2
- SNDPXSYFESPGGJ-BYPYZUCNSA-N L-2-aminopentanoic acid Chemical compound CCC[C@H](N)C(O)=O SNDPXSYFESPGGJ-BYPYZUCNSA-N 0.000 description 2
- AHLPHDHHMVZTML-BYPYZUCNSA-N L-Ornithine Chemical compound NCCC[C@H](N)C(O)=O AHLPHDHHMVZTML-BYPYZUCNSA-N 0.000 description 2
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 2
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 2
- SNDPXSYFESPGGJ-UHFFFAOYSA-N L-norVal-OH Natural products CCCC(N)C(O)=O SNDPXSYFESPGGJ-UHFFFAOYSA-N 0.000 description 2
- LRQKBLKVPFOOQJ-YFKPBYRVSA-N L-norleucine Chemical compound CCCC[C@H]([NH3+])C([O-])=O LRQKBLKVPFOOQJ-YFKPBYRVSA-N 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 2
- 108010038049 Mating Factor Proteins 0.000 description 2
- 102220636067 Olfactory receptor 5D18_N136D_mutation Human genes 0.000 description 2
- AHLPHDHHMVZTML-UHFFFAOYSA-N Orn-delta-NH2 Natural products NCCCC(N)C(O)=O AHLPHDHHMVZTML-UHFFFAOYSA-N 0.000 description 2
- UTJLXEIPEHZYQJ-UHFFFAOYSA-N Ornithine Natural products OC(=O)C(C)CCCN UTJLXEIPEHZYQJ-UHFFFAOYSA-N 0.000 description 2
- 241000228143 Penicillium Species 0.000 description 2
- 108010076504 Protein Sorting Signals Proteins 0.000 description 2
- 108020005091 Replication Origin Proteins 0.000 description 2
- 241000235346 Schizosaccharomyces Species 0.000 description 2
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 2
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 2
- 241000985245 Spodoptera litura Species 0.000 description 2
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- JQOMHZMWQHXALX-FHWLQOOXSA-N Tyr-Tyr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JQOMHZMWQHXALX-FHWLQOOXSA-N 0.000 description 2
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 2
- 238000002835 absorbance Methods 0.000 description 2
- 108010047495 alanylglycine Proteins 0.000 description 2
- QWCKQJZIFLGMSD-UHFFFAOYSA-N alpha-aminobutyric acid Chemical compound CCC(N)C(O)=O QWCKQJZIFLGMSD-UHFFFAOYSA-N 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- 235000009582 asparagine Nutrition 0.000 description 2
- 229960001230 asparagine Drugs 0.000 description 2
- 235000003704 aspartic acid Nutrition 0.000 description 2
- CKLJMWTZIZZHCS-REOHCLBHSA-N aspartic acid group Chemical class N[C@@H](CC(=O)O)C(=O)O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 2
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 2
- 239000012152 bradford reagent Substances 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 238000006555 catalytic reaction Methods 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- 230000002538 fungal effect Effects 0.000 description 2
- 235000013922 glutamic acid Nutrition 0.000 description 2
- 239000004220 glutamic acid Substances 0.000 description 2
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 2
- 239000005090 green fluorescent protein Substances 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 238000010438 heat treatment Methods 0.000 description 2
- 230000003301 hydrolyzing effect Effects 0.000 description 2
- 230000002209 hydrophobic effect Effects 0.000 description 2
- 229960002591 hydroxyproline Drugs 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 210000004897 n-terminal region Anatomy 0.000 description 2
- 229960003104 ornithine Drugs 0.000 description 2
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 2
- 150000003904 phospholipids Chemical class 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 238000011084 recovery Methods 0.000 description 2
- 102200084990 rs199422124 Human genes 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 230000035939 shock Effects 0.000 description 2
- BHZOKUMUHVTPBX-UHFFFAOYSA-M sodium acetic acid acetate Chemical compound [Na+].CC(O)=O.CC([O-])=O BHZOKUMUHVTPBX-UHFFFAOYSA-M 0.000 description 2
- 238000013112 stability test Methods 0.000 description 2
- 239000012089 stop solution Substances 0.000 description 2
- 230000002463 transducing effect Effects 0.000 description 2
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 2
- 241000701161 unidentified adenovirus Species 0.000 description 2
- 241001430294 unidentified retrovirus Species 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- BJBUEDPLEOHJGE-UHFFFAOYSA-N (2R,3S)-3-Hydroxy-2-pyrolidinecarboxylic acid Natural products OC1CCNC1C(O)=O BJBUEDPLEOHJGE-UHFFFAOYSA-N 0.000 description 1
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- YOFPFYYTUIARDI-ZCFIWIBFSA-N (2r)-2-aminooctanedioic acid Chemical compound OC(=O)[C@H](N)CCCCCC(O)=O YOFPFYYTUIARDI-ZCFIWIBFSA-N 0.000 description 1
- BVAUMRCGVHUWOZ-ZETCQYMHSA-N (2s)-2-(cyclohexylazaniumyl)propanoate Chemical compound OC(=O)[C@H](C)NC1CCCCC1 BVAUMRCGVHUWOZ-ZETCQYMHSA-N 0.000 description 1
- NPDBDJFLKKQMCM-SCSAIBSYSA-N (2s)-2-amino-3,3-dimethylbutanoic acid Chemical compound CC(C)(C)[C@H](N)C(O)=O NPDBDJFLKKQMCM-SCSAIBSYSA-N 0.000 description 1
- UKAUYVFTDYCKQA-UHFFFAOYSA-N -2-Amino-4-hydroxybutanoic acid Natural products OC(=O)C(N)CCO UKAUYVFTDYCKQA-UHFFFAOYSA-N 0.000 description 1
- OGNSCSPNOLGXSM-UHFFFAOYSA-N 2,4-diaminobutyric acid Chemical compound NCCC(N)C(O)=O OGNSCSPNOLGXSM-UHFFFAOYSA-N 0.000 description 1
- VCUYQZSUSFMZHI-UHFFFAOYSA-N 4-amino-2-ethylbenzenesulfonic acid Chemical compound CCC1=CC(N)=CC=C1S(O)(=O)=O VCUYQZSUSFMZHI-UHFFFAOYSA-N 0.000 description 1
- XWHHYOYVRVGJJY-UHFFFAOYSA-N 4-fluorophenylalanine Chemical compound OC(=O)C(N)CC1=CC=C(F)C=C1 XWHHYOYVRVGJJY-UHFFFAOYSA-N 0.000 description 1
- 101150005709 ARG4 gene Proteins 0.000 description 1
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 1
- 241000228212 Aspergillus Species 0.000 description 1
- 241000972773 Aulopiformes Species 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000193755 Bacillus cereus Species 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 108091062157 Cis-regulatory element Proteins 0.000 description 1
- 230000006820 DNA synthesis Effects 0.000 description 1
- 241000702421 Dependoparvovirus Species 0.000 description 1
- 102100036912 Desmin Human genes 0.000 description 1
- 108010044052 Desmin Proteins 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 241000305071 Enterobacterales Species 0.000 description 1
- 241000588722 Escherichia Species 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 108090000371 Esterases Proteins 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 229920001917 Ficoll Polymers 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- 101150069554 HIS4 gene Proteins 0.000 description 1
- LCWXJXMHJVIJFK-UHFFFAOYSA-N Hydroxylysine Natural products NCC(O)CC(N)CC(O)=O LCWXJXMHJVIJFK-UHFFFAOYSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- RHGKLRLOHDJJDR-BYPYZUCNSA-N L-citrulline Chemical compound NC(=O)NCCC[C@H]([NH3+])C([O-])=O RHGKLRLOHDJJDR-BYPYZUCNSA-N 0.000 description 1
- FFFHZYDWPBMWHY-VKHMYHEASA-N L-homocysteine Chemical compound OC(=O)[C@@H](N)CCS FFFHZYDWPBMWHY-VKHMYHEASA-N 0.000 description 1
- UKAUYVFTDYCKQA-VKHMYHEASA-N L-homoserine Chemical compound OC(=O)[C@@H](N)CCO UKAUYVFTDYCKQA-VKHMYHEASA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- 241000194036 Lactococcus Species 0.000 description 1
- RHGKLRLOHDJJDR-UHFFFAOYSA-N Ndelta-carbamoyl-DL-ornithine Natural products OC(=O)C(N)CCCNC(N)=O RHGKLRLOHDJJDR-UHFFFAOYSA-N 0.000 description 1
- 101710118186 Neomycin resistance protein Proteins 0.000 description 1
- KNTFCRCCPLEUQZ-VKHMYHEASA-N O-methylserine Chemical compound COC[C@H](N)C(O)=O KNTFCRCCPLEUQZ-VKHMYHEASA-N 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 241000589540 Pseudomonas fluorescens Species 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 241000235070 Saccharomyces Species 0.000 description 1
- 241000607142 Salmonella Species 0.000 description 1
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 1
- 241000710961 Semliki Forest virus Species 0.000 description 1
- 241000191940 Staphylococcus Species 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 238000005377 adsorption chromatography Methods 0.000 description 1
- 239000003513 alkali Substances 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 230000003698 anagen phase Effects 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 150000001483 arginine derivatives Chemical class 0.000 description 1
- 150000001507 asparagine derivatives Chemical class 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 239000003054 catalyst Substances 0.000 description 1
- 238000007036 catalytic synthesis reaction Methods 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 125000003636 chemical group Chemical class 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 229960002173 citrulline Drugs 0.000 description 1
- 235000013477 citrulline Nutrition 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 239000005515 coenzyme Substances 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 239000002537 cosmetic Substances 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 150000001944 cysteine derivatives Chemical class 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 239000007857 degradation product Substances 0.000 description 1
- YSMODUONRAFBET-UHFFFAOYSA-N delta-DL-hydroxylysine Natural products NCC(O)CCC(N)C(O)=O YSMODUONRAFBET-UHFFFAOYSA-N 0.000 description 1
- 238000001212 derivatisation Methods 0.000 description 1
- 210000005045 desmin Anatomy 0.000 description 1
- 229960002086 dextran Drugs 0.000 description 1
- 229960000633 dextran sulfate Drugs 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- 238000004821 distillation Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- YSMODUONRAFBET-UHNVWZDZSA-N erythro-5-hydroxy-L-lysine Chemical compound NC[C@H](O)CC[C@H](N)C(O)=O YSMODUONRAFBET-UHNVWZDZSA-N 0.000 description 1
- GLVVKKSPKXTQRB-UHFFFAOYSA-N ethenyl dodecanoate Chemical compound CCCCCCCCCCCC(=O)OC=C GLVVKKSPKXTQRB-UHFFFAOYSA-N 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 239000004519 grease Substances 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- QJHBJHUKURJDLG-UHFFFAOYSA-N hydroxy-L-lysine Natural products NCCCCC(NO)C(O)=O QJHBJHUKURJDLG-UHFFFAOYSA-N 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 150000007529 inorganic bases Chemical class 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- BBJIPMIXTXKYLZ-UHFFFAOYSA-N isoglutamic acid Chemical compound OC(=O)CC(N)CC(O)=O BBJIPMIXTXKYLZ-UHFFFAOYSA-N 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 238000004811 liquid chromatography Methods 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 201000001441 melanoma Diseases 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 238000009629 microbiological culture Methods 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 239000002808 molecular sieve Substances 0.000 description 1
- 239000006225 natural substrate Substances 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 239000004006 olive oil Substances 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 239000012074 organic phase Substances 0.000 description 1
- 230000003204 osmotic effect Effects 0.000 description 1
- 238000010647 peptide synthesis reaction Methods 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920005862 polyol Polymers 0.000 description 1
- 150000007519 polyprotic acids Chemical class 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000001243 protein synthesis Methods 0.000 description 1
- HNJBEVLQSNELDL-UHFFFAOYSA-N pyrrolidin-2-one Chemical compound O=C1CCCN1 HNJBEVLQSNELDL-UHFFFAOYSA-N 0.000 description 1
- 238000004153 renaturation Methods 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 102220291483 rs745756794 Human genes 0.000 description 1
- 235000019515 salmon Nutrition 0.000 description 1
- 238000005185 salting out Methods 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 150000003354 serine derivatives Chemical class 0.000 description 1
- URGAHOPLAPQHLN-UHFFFAOYSA-N sodium aluminosilicate Chemical compound [Na+].[Al+3].[O-][Si]([O-])=O.[O-][Si]([O-])=O URGAHOPLAPQHLN-UHFFFAOYSA-N 0.000 description 1
- FQENQNTWSFEDLI-UHFFFAOYSA-J sodium diphosphate Chemical compound [Na+].[Na+].[Na+].[Na+].[O-]P([O-])(=O)OP([O-])([O-])=O FQENQNTWSFEDLI-UHFFFAOYSA-J 0.000 description 1
- 239000001488 sodium phosphate Substances 0.000 description 1
- 229910000162 sodium phosphate Inorganic materials 0.000 description 1
- 239000012064 sodium phosphate buffer Substances 0.000 description 1
- 229940048086 sodium pyrophosphate Drugs 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 238000004659 sterilization and disinfection Methods 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 235000019818 tetrasodium diphosphate Nutrition 0.000 description 1
- 239000001577 tetrasodium phosphonato phosphate Substances 0.000 description 1
- 238000009210 therapy by ultrasound Methods 0.000 description 1
- BJBUEDPLEOHJGE-IMJSIDKUSA-N trans-3-hydroxy-L-proline Chemical compound O[C@H]1CC[NH2+][C@@H]1C([O-])=O BJBUEDPLEOHJGE-IMJSIDKUSA-N 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 238000011426 transformation method Methods 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- UFTFJSFQGQCHQW-UHFFFAOYSA-N triformin Chemical compound O=COCC(OC=O)COC=O UFTFJSFQGQCHQW-UHFFFAOYSA-N 0.000 description 1
- RYFMWSXOAZQYPI-UHFFFAOYSA-K trisodium phosphate Chemical compound [Na+].[Na+].[Na+].[O-]P([O-])([O-])=O RYFMWSXOAZQYPI-UHFFFAOYSA-K 0.000 description 1
- DCXXMTOCNZCJGO-UHFFFAOYSA-N tristearoylglycerol Chemical compound CCCCCCCCCCCCCCCCCC(=O)OCC(OC(=O)CCCCCCCCCCCCCCCCC)COC(=O)CCCCCCCCCCCCCCCCC DCXXMTOCNZCJGO-UHFFFAOYSA-N 0.000 description 1
- 238000005199 ultracentrifugation Methods 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/18—Carboxylic ester hydrolases (3.1.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
- C12N15/81—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
- C12N15/815—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts for yeasts other than Saccharomyces
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/64—Fats; Fatty oils; Ester-type waxes; Higher fatty acids, i.e. having at least seven carbon atoms in an unbroken chain bound to a carboxyl group; Oxidised oils or fats
- C12P7/6436—Fatty acid esters
- C12P7/6445—Glycerides
- C12P7/6454—Glycerides by esterification
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y301/00—Hydrolases acting on ester bonds (3.1)
- C12Y301/01—Carboxylic ester hydrolases (3.1.1)
- C12Y301/01023—Acylglycerol lipase (3.1.1.23)
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Mycology (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Oil, Petroleum & Natural Gas (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Medicinal Chemistry (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
The present application provides novel polypeptides having monoglyceride lipase activity comprising an amino acid sequence as set forth in any one of SEQ ID NOs 1 to 3 and 15 or a sequence comprising at least one amino acid substitution, deletion or addition in the above sequence. The application also provides polynucleotides encoding the polypeptides, expression vectors and host cells containing the polynucleotides. Furthermore, the present application relates to the use of the above-described polypeptide having monoglyceride-diacyl lipase activity.
Description
Cross Reference to Related Applications
The present application claims priority from chinese patent application No. 201911329804.5 filed on date 20 and 12 in 2019, which is incorporated herein by reference in its entirety.
Technical Field
The present application is in the field of enzyme engineering, in particular, polypeptides having lipase activity, nucleic acids encoding the same, and expression vectors and host cells comprising the same. The application also relates to a screening method and application of the polypeptide.
Background
Lipase is one of esterases and can catalyze the hydrolysis of ester bonds of triglycerides, diglycerides, monoglycerides, other small molecule esters, polyol esters and polybasic acid esters, fat is a natural substrate of lipase, diglycerides and monoglycerides are produced in the hydrolysis process, and hydrolysis end products are glycerol and fatty acids. The lipase has mild hydrolysis condition, less byproducts and no coenzyme. Fat is mostly a hydrophobic substance, so that hydrolysis occurs at the oil-water interface or in the organic phase. The monoglyceride-diacylglycerol lipase (MDGL) is one of lipases, the MDGL has substrate specificity, only acts on Monoglyceride (MAG) and Diglyceride (DAG), has no catalysis effect on triacylglycerol, and can produce monoglyceride with high industrial value by esterification or transesterification reaction.
MAG is an excellent emulsifier and has wide application in the food, pharmaceutical and cosmetic industries. The traditional synthesis process of MAG is to produce through continuous esterification of grease/Triglyceride (TAG) and glycerin under the condition of nitrogen at high temperature by taking inorganic base as a catalyst, wherein the product is a mixture of MAG, DAG and TAG, and the MAG is obtained through distillation, and the yield of the MAG is 40-50%. Unlike traditional synthesis process, the enzymatic synthesis of MAG takes fatty acid and glycerin as substrates, and the catalytic synthesis of MAG is carried out under mild condition, wherein the conversion rate of fatty acid in enzymatic catalysis is more than 97%, and the MAG accounts for more than 74% of the product. The main problems of the current restriction enzyme method application are lower enzyme activity and higher enzyme cost.
Summary of The Invention
In a first aspect, there is provided a polypeptide having glycerol mono-diacyl ester lipase activity comprising or consisting of a sequence selected from the group consisting of seq id nos:
(a) An amino acid sequence as shown in any one of SEQ ID NO 1-3 and 15; and
(b) A sequence obtained by substituting, deleting or adding at least one amino acid into the sequence of (a); wherein the polypeptide variant obtained from (b) retains monoglyceride-diacyl lipase activity.
In some embodiments, the polypeptides of the present application comprise an amino acid sequence set forth in any one of SEQ ID NOs 1-3 and 15. In a preferred embodiment, the above polypeptide consists of the amino acid sequence shown in any one of SEQ ID NOs 1 to 3 and 15. In a more preferred embodiment, the polypeptide consists of the amino acid sequence shown in SEQ ID NO. 2, 3 or 15.
Herein, the polypeptide consisting of the amino acid sequence shown in SEQ ID NO. 1 is designated MDGLs, the polypeptide consisting of the amino acid sequence shown in SEQ ID NO. 2 is designated MDGLc, the polypeptide consisting of the amino acid sequence shown in SEQ ID NO. 3 is designated MDGLsc, and the polypeptide consisting of the amino acid sequence shown in SEQ ID NO. 15 is designated MDGL-3G3.
In a second aspect, there is provided a polynucleotide encoding the polypeptide of the first aspect comprising or consisting of a sequence selected from the group consisting of seq id nos:
(a) A nucleotide sequence encoding an amino acid sequence shown in any one of SEQ ID NOs 1 to 3 and 15 or a sequence comprising at least one amino acid substitution, deletion or addition in the above sequence; and
(b) A nucleotide sequence which hybridizes under stringent conditions to the nucleotide sequence of (a).
In some embodiments, the polynucleotides of the present application comprise a nucleotide sequence set forth in any one of SEQ ID NOs 4-6 and 16. In a preferred embodiment, the above polynucleotide consists of the nucleotide sequence shown in any one of SEQ ID NOS.4-6 and 16. In a more preferred embodiment, the polynucleotide consists of the nucleotide sequence shown in SEQ ID NO. 5, 6 or 16.
In a third aspect, there is provided an expression vector comprising at least one polynucleotide of the second aspect.
In certain embodiments, the expression vectors of the present application further comprise a regulatory sequence that regulates the expression of the polynucleotide, wherein the polynucleotide is operably linked to the regulatory sequence. In a preferred embodiment, the expression vector is a plasmid vector, such as pPIC9K.
In a fourth aspect, there is provided a host cell comprising the polynucleotide of the second aspect or the expression vector of the third aspect.
In specific embodiments, the host cell is a yeast, such as pichia pastoris, or e.
In a fifth aspect, there is provided the use of a polypeptide according to the first aspect for catalyzing a synthesis, hydrolysis or transesterification reaction of an ester.
In some embodiments, the esters are monoglycerides and/or diglycerides.
In a sixth aspect, there is provided a method of synthesizing a monoglyceride and/or diglyceride comprising contacting a polypeptide of the first aspect with a fatty acid and glycerol.
In a seventh aspect, there is provided a method of screening a polypeptide of the present application comprising:
1) Mutating the nucleotide sequence shown in SEQ ID NO. 7 to obtain a mutant sequence;
2) Expressing the mutated sequence obtained in step 1) to obtain a polypeptide mutated with respect to the polypeptide shown in SEQ ID NO. 8; and
3) Mutant polypeptides with enzyme activity and/or thermal stability higher than those of the polypeptide shown in SEQ ID NO. 8 are screened.
In addition, the application also provides the use of the nucleotide sequence shown in SEQ ID NO. 7 for screening the polypeptide disclosed herein.
The polypeptides obtained by the screening methods disclosed herein have higher lipase activity (in particular glycerol mono-diacyl ester lipase activity) and better thermostability.
Brief description of the drawings
FIG. 1 shows the results of MDGL mutant delta flask fermentation protein production.
FIG. 2 shows the results of the enzymatic yield of MDGL mutant delta flask fermented protein.
FIG. 3 shows the results of the thermostability of MDGL mutants.
FIG. 4 shows the result of SDS-PAGE analysis of MDGL mutants.
In the above figures, s represents MDGLs, c represents MDGLc, and sc represents MDGLsc.
FIG. 5 shows the results of protein production by MDGL-3G3 mutant delta flask fermentation.
FIG. 6 shows the results of the enzyme activity yield and specific enzyme activity of MDGL-3G3 mutant delta flask fermented protein.
FIG. 7 shows the results of thermostability of MDGL-3G3 mutants.
FIG. 8 shows the result of SDS-PAGE analysis of MDGL-3G3 mutant.
Brief description of the sequence
SEQ ID NO. 1: amino acid sequence of MDGLs;
SEQ ID NO. 2: amino acid sequence of MDGLc;
SEQ ID NO. 3: the amino acid sequence of MDGLsc;
SEQ ID NO. 4: coding nucleic acid sequences for MDGLs;
SEQ ID NO. 5: a nucleic acid sequence encoding MDGLc;
SEQ ID NO. 6: a nucleic acid sequence encoding MDGLsc;
SEQ ID NO. 7: an optimized MDGL coding nucleic acid sequence;
SEQ ID NO. 8: an optimized MDGL amino acid sequence;
SEQ ID NO. 9: forward amplification primer MDGLm-F1;
SEQ ID NO. 10: reverse amplification primer MDGLm-R1;
SEQ ID NO. 11: forward amplification primer MDGLm-F2;
SEQ ID NO. 12: reverse amplification primer MDGLm-R2;
SEQ ID NO. 13: forward amplification primer MDGL-ctlm-F;
SEQ ID NO. 14: reverse amplification primer MDGL-ctlm-R;
SEQ ID NO. 15: an amino acid sequence of MDGL-3G 3;
SEQ ID NO. 16: a nucleic acid sequence encoding MDGL-3G 3.
Detailed Description
Polypeptides of the present application
The present application provides polypeptides having lipase activity, in particular glycerol mono-diacyl ester lipase activity, comprising or consisting of a sequence selected from the group consisting of seq id nos:
(a) An amino acid sequence as shown in any one of SEQ ID NO 1-3 and 15; and
(b) A sequence obtained by substituting, deleting or adding at least one amino acid into the sequence of (a); wherein the polypeptide variant obtained from (b) retains lipase activity.
Glycerol mono-diacyl ester lipase (MDGL) is one type of lipase. MDGL has 2 forms, the molecular weight is 37kDa and 39kDa respectively, MDGL has a signal peptide consisting of 26 amino acids, and the mature peptide contains 279 amino acids. For example, MDGL enzyme activity units can be determined using vinyl laurate as a substrate.
In certain embodiments, the number of amino acid substitutions, deletions or additions described above is from 1 to 30, preferably from 1 to 20, more preferably from 1 to 10, wherein the resulting polypeptide variant substantially retains the lipase activity of the unaltered protein.
In preferred embodiments, the above polypeptide variants differ from the amino acid sequence shown in SEQ ID NO. 1 by about 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 amino acid substitutions, deletions and/or additions. In a more preferred embodiment, the above polypeptide variants differ from the amino acid sequence shown in SEQ ID NO. 1 by about 1, 2, 3, 4 or 5 amino acid substitutions, deletions or additions.
In preferred embodiments, the above polypeptide variants differ from the amino acid sequence shown in SEQ ID NO. 2 by about 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 amino acid substitutions, deletions and/or additions. In a more preferred embodiment, the above polypeptide variants differ from the amino acid sequence shown in SEQ ID NO. 2 by about 1, 2, 3, 4 or 5 amino acid substitutions, deletions or additions.
In preferred embodiments, the above polypeptide variants differ from the amino acid sequence shown in SEQ ID NO. 3 by about 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 amino acid substitutions, deletions and/or additions. In a more preferred embodiment, the above polypeptide variants differ from the amino acid sequence shown in SEQ ID NO. 3 by about 1, 2, 3, 4 or 5 amino acid substitutions, deletions or additions.
In preferred embodiments, the above polypeptide variants differ from the amino acid sequence shown in SEQ ID NO. 15 by about 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 amino acid substitutions, deletions and/or additions. In a more preferred embodiment, the above polypeptide variants differ from the amino acid sequence shown in SEQ ID NO. 15 by about 1, 2, 3, 4 or 5 amino acid substitutions, deletions or additions.
In some embodiments, the polypeptides of the present application comprise an amino acid sequence set forth in any one of SEQ ID NOs 1-3 and 15. In a preferred embodiment, the above polypeptide consists of the amino acid sequence shown in any one of SEQ ID NOs 1 to 3 and 15. In a more preferred embodiment, the polypeptide consists of the amino acid sequence shown in SEQ ID NO. 2, 3 or 15.
Herein, the polypeptide consisting of the amino acid sequence shown in SEQ ID NO. 1 is designated MDGLs, the polypeptide consisting of the amino acid sequence shown in SEQ ID NO. 2 is designated MDGLc, the polypeptide consisting of the amino acid sequence shown in SEQ ID NO. 3 is designated MDGLsc, and the polypeptide consisting of the amino acid sequence shown in SEQ ID NO. 15 is designated MDGL-3G3.
The amino acid sequence of SEQ ID NO. 1 of the present application is shown below:
DVSTSELDQFEFWVQYAAASYCEADYTAQVGDKLSCSKGNCPEVEATGATVSYDFSDSTITDTAGYIAVDHTNSAVVLAFRGSYSVRNWVADATFVHTNPGLCDGCLAELGFWSSWKLVRDDIIKELKEVVAQNPNYELVVVGHSLGAAVATLAATDLRGKGYPSAKLYAYASPRVGNAALAKYITAQGNNFRFTHTNDPVPKLPLLSMGYVHVSPEYWITSPNNATVSTSDIKVIDGDVSFDGNTGTGLPLLTDFEAHIWYFVQVDACKGPGLPFKRV
the amino acid sequence of SEQ ID NO. 2 of the present application is shown below:
DVSTSELDQFEFWVQYAAASYYEADYTAQVGDKLSCSKGNCPEVEATGATVSYDFSDSTITDTAGYIAVDHTNSAVVLAFRGSYSVRNWVADATFVHTNPGLCDGCLAELGFWSSWKLVRDDIIKELKEVVAQNPNYELVVVGHSLGAAVATLAATDLRGKGYPSAKLYAYASPRVGNAALAKYITAQGNNFRFTHTNDPVPKLPLLSMGYVHVSPEYWITSPNNATVRRRDIKVIDGDVSFDGNTGTGLPLLTDFEAHIWYFVQVDAGKGPGLPFKRV
the amino acid sequence of SEQ ID NO. 3 of the present application is shown below:
DVSTSELDQFEFWVQYAAASYCEADYTAQVGDKLSCSKGNCPEVEATGATVSYDFSDSTITDTAGYIAVDHTNSAVVLAFRGSYSVRNWVADATFVHTNPGLCDGCLAELGFWSSWKLVRDDIIKELKEVVAQNPNYELVVVGHSLGAAVATLAATDLRGKGYPSAKLYAYASPRVGNAALAKYITAQGNNFRFTHTNDPVPKLPLLSMGYVHVSPEYWITSPNNATVRRRDIKVIDGDVSFDGNTGTGLPLLTDFEAHIWYFVQVDACKGPGLPFKRV
the amino acid sequence of SEQ ID NO. 15 of the present application is shown below:
DVSTSELDQFEFWVQYAAASYCEADYRAQVGDKLSCSKGNCPEVEATGATVSYDFSDSTITDTAGYIAVDHTNSAVVLAFRGSHSVRNWVADATFVHTNPGLCDGCLAELGFWSSWKLVRDDIIKELKEVVAQNPDYELVVVGHSLGAAVATLAATDLRGKGYPSAKLYAYASPRVGNAALAKYITAQGNNFRFTHTNDPVPKLPLLSMGYVHVSPEYWITSPNNATVRRRDIKVIDGDVSFDGNTGTGLPLLTDFEAHIWYFVQVDACKGPGLPFKRV
as used herein, the term "amino acid" refers to naturally occurring and non-naturally occurring amino acids as well as amino acid analogs and mimetics. Naturally occurring amino acids include the 20 (L) -amino acids used in protein biosynthesis, as well as other amino acids, such as 4-hydroxyproline, hydroxylysine, desmin, isodesmin, homocysteine, citrulline, and ornithine. Non-naturally occurring amino acids include, for example, (D) -amino acids, norleucine, norvaline, p-fluorophenylalanine, ethylsulfanilic acid, and the like, as known to those skilled in the art. Amino acid analogs include modified forms of naturally and non-naturally occurring amino acids. Such modifications may include, for example, substitution of chemical groups and moieties on the amino acid, or derivatization of the amino acid. Amino acid mimics include, for example, organic structures that exhibit functionally similar properties, such as charge and charge-space characteristics of amino acids. For example, the organic structure mimicking arginine (Arg or R) has a positively charged moiety located in a similar molecular space and having the same degree of mobility as the e-amino group of the side chain of a naturally occurring Arg amino acid. The mimetic also includes a constraining structure to maintain optimal space and charge interactions of the amino acid or amino acid functional groups. One skilled in the art can determine what structures constitute functionally equivalent amino acid analogs and amino acid mimics.
In some embodiments, the variant of the amino acid sequence shown in SEQ ID NO. 1 has at least 80%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more homology with SEQ ID NO. 1. In a preferred embodiment, the polypeptide variant has more than 99% homology with the sequence shown in SEQ ID NO. 1.
In some embodiments, the variant of the amino acid sequence shown in SEQ ID NO. 2 has at least 80%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more homology with SEQ ID NO. 2. In a preferred embodiment, the polypeptide variant has more than 99% homology with the sequence shown in SEQ ID NO. 2.
In some embodiments, the variant of the amino acid sequence shown in SEQ ID NO. 3 has at least 80%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more homology with SEQ ID NO. 3. In a preferred embodiment, the polypeptide variant has more than 99% homology with the sequence shown in SEQ ID NO. 3.
In some embodiments, the variant of the amino acid sequence shown in SEQ ID NO. 15 has at least 80%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more homology with SEQ ID NO. 15. In a preferred embodiment, the polypeptide variant has more than 99% homology with the sequence shown in SEQ ID NO. 15.
"homology" as used herein is defined as the percentage of identical residues in an amino acid or nucleotide sequence variant after sequence alignment and introduction of gaps, if desired, to achieve a maximum percentage of homology. Methods and computer programs for alignment are well known in the art.
"polypeptide" and "protein" are used interchangeably herein to refer to polymers of amino acid residues and variants and synthetic and naturally occurring analogs thereof. Thus, these terms apply to naturally occurring amino acid polymers and naturally occurring chemical derivatives thereof, as well as amino acid polymers in which one or more amino acid residues are synthetic non-naturally occurring amino acids (such as chemical analogs of the corresponding naturally occurring amino acids). Such derivatives include, for example, post-translational modifications and degradation products, including phosphorylated, glycosylated, oxidized, isomerized and deaminated variants of the polypeptide fragments shown in any of SEQ ID NO 1-3 and 15.
In a preferred embodiment, the sequence of the polypeptide variant is a sequence comprising one or several conservative amino acid substitutions in the amino acid sequence shown in any of SEQ ID NO's 1-3 and 15, wherein the substituted sequence still retains similar lipase catalytic activity, in particular glycerol mono-diacyl ester lipase activity.
Certain amino acid substitutions, known as "conservative amino acid substitutions," can frequently occur in a protein without altering the conformation or function of the protein, which is an established rule in protein chemistry.
Conservative amino acid substitutions in this application include, but are not limited to, substitution of any one of glycine (G), alanine (a), isoleucine (I), valine (V), and leucine (L) for any other one of these aliphatic amino acids; substitution of serine (S) for threonine (T) and vice versa; substitution of aspartic acid (D) for glutamic acid (E) and vice versa; substitution of asparagine (N) with glutamine (Q) and vice versa; substitution of arginine (R) with lysine (K) and vice versa; substitution of any other of these aromatic amino acids with phenylalanine (F), tyrosine (Y) and tryptophan (W); and substitution of cysteine (C) with methionine (M) and vice versa. Other substitutions may also be considered conservative, depending on the particular amino acid environment and its role in the three-dimensional structure of the protein. For example, glycine (G) and alanine (a) are often interchangeable as are alanine (a) and valine (V). Methionine (M), which is relatively hydrophobic, can often be exchanged with leucine and isoleucine, and sometimes with valine. Lysine (K) and arginine (R) are often interchanged at the following positions: wherein the important feature of the amino acid residues is their charge and the different pK of the two amino acid residues is not apparent. Under certain circumstances, there are still other variations that may be considered "conservative" (see, e.g., BIOCHEMISTRIY at pp.13-15,2 nd ed.Lubert Stryer ed.(Stanford University);Henikoff et al.,Proc.Nat’l Acad.Sci.USA(1992)89:10915-10919;Lei et al.,J.Biol.Chem.(1995)270(20):11882-11886)。
Hereinafter, amino acid residues are exemplified by a classification of substitutable residues, but the substitutable amino acid residues are not limited to the residues described below:
group A: leucine, isoleucine, norleucine, valine, norvaline, alanine, 2-aminobutyric acid, methionine, O-methylserine, t-butylglycine and cyclohexylalanine;
group B: aspartic acid, glutamic acid, isoaspartic acid, isoglutamic acid, 2-aminoadipic acid, and 2-aminosuberic acid;
group C: asparagine and glutamine;
group D: lysine, arginine, ornithine, 2, 4-diaminobutyric acid, i.e. 2, 3-diaminopropionic acid;
group E: proline, 3-hydroxyproline and 4-hydroxyproline;
group F: serine, threonine and homoserine;
group G: phenylalanine and tyrosine.
In other specific embodiments, the C-terminal or N-terminal region of a polypeptide of the present application, e.g., a polypeptide set forth in any one of SEQ ID NOs 1-3 and 15, may also be truncated by about 1, 2,3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25 or more amino acids, while still retaining similar lipase catalytic activity, particularly monoglyceride-diacyl lipase activity.
In further embodiments, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25 or more amino acids may also be added to the C-terminal or N-terminal regions of a polypeptide of the present application, e.g., a polypeptide represented by any one of SEQ ID NOs 1-3 and 15, resulting in a polypeptide variant that still retains similar lipase catalytic activity, particularly monoglyceride-diacyl lipase activity.
Furthermore, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25 or more amino acids may be added or deleted in a region other than the C-terminus or N-terminus of a polypeptide, such as any of the sequences set forth in SEQ ID NOs 1-3 and 15, provided that the altered polypeptide substantially retains similar lipase catalytic activity, particularly monoglyceride-diacyl lipase activity.
In certain embodiments, a polypeptide of the present application, such as a polypeptide set forth in any one of SEQ ID NOS: 1-3 and 15, or a variant thereof, is fused to a heterologous polypeptide. In some embodiments, the fusion protein substantially retains the lipase activity of the polypeptide shown in any one of SEQ ID NOs 1-3 and 15. In certain embodiments, the heterologous polypeptide is linked to the N-terminus of the polypeptide shown in any one of SEQ ID NOs 1-3 and 15. In certain embodiments, the heterologous polypeptide is linked to the C-terminus of the polypeptide shown in any one of SEQ ID NOs 1-3 and 15. In these embodiments, the heterologous polypeptide may be selected from a purification tag (which may include, for example, but not limited to, GST, MBP), an epitope tag (which may include, for example, but not limited to, myc, FLAG), a targeting sequence, a signal peptide, and the like. In a specific embodiment, the fusion protein comprises a polypeptide as set forth in any one of SEQ ID NO 1-3 and 15 and a tag, typically a peptide tag, that binds to the C-terminus or N-terminus of the polypeptide as set forth in any one of SEQ ID NO 1-3 and 15. The tag is typically a peptide or amino acid sequence that can be used to isolate and purify the fusion protein.
Polynucleotide
The present application provides polynucleotides encoding polypeptides disclosed herein comprising or consisting of a sequence selected from the group consisting of seq id nos: (a) A nucleotide sequence encoding an amino acid sequence shown in any one of SEQ ID NOs 1 to 3 and 15 or a sequence comprising at least one amino acid substitution, deletion or addition in the above sequence; and
(b) A nucleotide sequence which hybridizes under stringent conditions to the nucleotide sequence of (a).
In certain specific embodiments, the polynucleotides of the present application encode polypeptides represented by any one of SEQ ID NOs 1-3 and 15, as well as functionally equivalent variants thereof. In some embodiments, the polynucleotides of the present application have at least 80%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% homology to polynucleotides encoding the polypeptides set forth in any of SEQ ID NOs 1-3 and 15, and functionally equivalent variants thereof.
In certain embodiments, polynucleotides of the present application comprise nucleotide sequences having at least 80%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% homology to the nucleotide sequences set forth in any one of SEQ ID NOs 4-6 and 16. In a preferred embodiment, the polynucleotides of the present application comprise a nucleotide sequence set forth in any one of SEQ ID NOs 4-6 and 16.
The nucleotide sequence of SEQ ID NO. 4 is shown below:
GATGTCTCCACTTCCGAACTGGACCAGTTCGAGTTCTGGGTTCAATACGCAGCCGCCTCTTACTGTGAGGCTGATTACACCGCACAGGTTGGTGATAAGCTGTCCTGCTCTAAGGGTAACTGCCCAGAAGTTGAAGCAACCGGTGCAACTGTGTCTTACGACTTCTCCGATTCCACGATCACTGACACCGCAGGTTACATCGCAGTTGATCACACCAACTCCGCAGTGGTACTGGCATTCCGTGGTTCTTACTCCGTACGTAACTGGGTTGCTGATGCTACTTTCGTCCATACCAACCCAGGTCTGTGTGATGGTTGTCTGGCTGAGCTGGGTTTCTGGTCTTCCTGGAAGCTGGTTCGTGATGATATTATCAAAGAACTGAAAGAAGTGGTGGCACAGAACCCAAACTATGAACTGGTGGTCGTGGGCCACTCCCTGGGTGCTGCTGTGGCTACTCTGGCTGCTACCGACCTGCGTGGTAAAGGTTATCCATCTGCTAAACTGTACGCTTACGCTTCCCCTCGTGTTGGCAACGCAGCCCTGGCCAAATATATCACCGCCCAGGGCAACAACTTCCGTTTCACCCACACCAATGACCCAGTACCTAAACTGCCACTGCTGTCTATGGGCTATGTACATGTTTCTCCTGAATATTGGATCACCTCTCCTAACAACGCCACTGTTTCTACCTCTGACATCAAAGTCATTGACGGCGACGTATCTTTTGACGGCAATACCGGCACGGGCCTGCCTCTGCTGACGGACTTTGAAGCCCACATTTGGTACTTTGTACAGGTTGACGCCTGCAAAGGTCCTGGCCTGCCATTCAAACGTGTTTAA
the nucleotide sequence of SEQ ID NO. 5 is shown below:
GATGTCTCCACTTCCGAACTGGACCAGTTCGAGTTCTGGGTTCAATACGCAGCCGCCTCTTACTACGAGGCTGATTACACCGCACAGGTTGGTGATAAGCTGTCCTGCTCTAAGGGTAACTGCCCAGAAGTTGAAGCAACCGGTGCAACTGTGTCTTACGACTTCTCCGATTCCACGATCACTGACACCGCAGGTTACATCGCAGTTGATCACACCAACTCCGCAGTGGTACTGGCATTCCGTGGTTCTTACTCCGTACGTAACTGGGTTGCTGATGCTACTTTCGTCCATACCAACCCAGGTCTGTGTGATGGTTGTCTGGCTGAGCTGGGTTTCTGGTCTTCCTGGAAGCTGGTTCGTGATGATATTATCAAAGAACTGAAAGAAGTGGTGGCACAGAACCCAAACTATGAACTGGTGGTCGTGGGCCACTCCCTGGGTGCTGCTGTGGCTACTCTGGCTGCTACCGACCTGCGTGGTAAAGGTTATCCATCTGCTAAACTGTACGCTTACGCTTCCCCTCGTGTTGGCAACGCAGCCCTGGCCAAATATATCACCGCCCAGGGCAACAACTTCCGTTTCACCCACACCAATGACCCAGTACCTAAACTGCCACTGCTGTCTATGGGCTATGTACATGTTTCTCCTGAATATTGGATCACCTCTCCTAACAACGCCACTGTTCGTAGACGTGACATCAAAGTCATTGACGGCGACGTATCTTTTGACGGCAATACCGGCACGGGCCTGCCTCTGCTGACGGACTTTGAAGCCCACATTTGGTACTTTGTACAGGTTGACGCCGGCAAAGGTCCTGGCCTGCCATTCAAACGTGTTTAA
the nucleotide sequence of SEQ ID NO. 6 is shown below:
GATGTCTCCACTTCCGAACTGGACCAGTTCGAGTTCTGGGTTCAATACGCAGCCGCCTCTTACTGTGAGGCTGATTACACCGCACAGGTTGGTGATAAGCTGTCCTGCTCTAAGGGTAACTGCCCAGAAGTTGAAGCAACCGGTGCAACTGTGTCTTACGACTTCTCCGATTCCACGATCACTGACACCGCAGGTTACATCGCAGTTGATCACACCAACTCCGCAGTGGTACTGGCATTCCGTGGTTCTTACTCCGTACGTAACTGGGTTGCTGATGCTACTTTCGTCCATACCAACCCAGGTCTGTGTGATGGTTGTCTGGCTGAGCTGGGTTTCTGGTCTTCCTGGAAGCTGGTTCGTGATGATATTATCAAAGAACTGAAAGAAGTGGTGGCACAGAACCCAAACTATGAACTGGTGGTCGTGGGCCACTCCCTGGGTGCTGCTGTGGCTACTCTGGCTGCTACCGACCTGCGTGGTAAAGGTTATCCATCTGCTAAACTGTACGCTTACGCTTCCCCTCGTGTTGGCAACGCAGCCCTGGCCAAATATATCACCGCCCAGGGCAACAACTTCCGTTTCACCCACACCAATGACCCAGTACCTAAACTGCCACTGCTGTCTATGGGCTATGTACATGTTTCTCCTGAATATTGGATCACCTCTCCTAACAACGCCACTGTTCGTAGACGTGACATCAAAGTCATTGACGGCGACGTATCTTTTGACGGCAATACCGGCACGGGCCTGCCTCTGCTGACGGACTTTGAAGCCCACATTTGGTACTTTGTACAGGTTGACGCCTGCAAAGGTCCTGGCCTGCCATTCAAACGTGTTTAA
the nucleotide sequence of SEQ ID NO. 16 is shown below:
GACGTTTCTACTTCCGAGTTGGACCAATTCGAGTTCTGGGTTCAATACGCTGCTGCTTCTTACTGTGAGGCTGATTACAGAGCTCAAGTTGGAGATAAGTTGTCTTGTTCTAAGGGTAACTGTCCTGAAGTTGAGGCTACTGGTGCTACTGTTTCTTACGATTTCTCCGATTCTACTATCACTGATACTGCCGGTTACATTGCTGTTGATCACACTAACTCTGCTGTTGTTTTGGCTTTCAGAGGTTCTCACTCTGTTCGTAACTGGGTTGCTGACGCTACTTTTGTTCACACTAACCCAGGTTTGTGTGATGGTTGTTTGGCTGAGTTGGGTTTCTGGTCCTCCTGGAAGTTGGTTAGAGATGACATTATTAAGGAGTTGAAGGAGGTTGTCGCTCAAAACCCTGACTACGAGTTGGTTGTTGTTGGTCACTCTTTGGGTGCTGCCGTTGCTACCTTGGCTGCCACTGACTTGAGAGGTAAAGGTTACCCATCTGCTAAGTTGTACGCTTACGCTTCTCCAAGAGTTGGTAACGCTGCTTTGGCTAAGTACATTACCGCTCAAGGTAACAACTTCAGATTCACTCACACTAACGATCCAGTTCCAAAGTTGCCATTGTTGTCCATGGGTTACGTTCACGTTTCTCCAGAGTACTGGATTACTTCCCCAAACAACGCTACTGTTCGTAGACGTGACATTAAGGTTATCGATGGAGATGTTTCTTTCGACGGTAACACTGGTACCGGTTTGCCATTGTTGACTGACTTCGAGGCTCACATCTGGTACTTCGTTCAAGTTGATGCTTGTAAGGGTCCAGGTTTGCCATTCAAGAGAGTTTAA
in a preferred embodiment, the polynucleotides of the present application consist of nucleotide sequences having 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% homology to the nucleotide sequences shown in any of SEQ ID NO. 4-6 and 16. In a more preferred embodiment, the polynucleotides of the present application consist of the nucleotide sequence shown in any one of SEQ ID NOs 4 to 6 and 16. In a most preferred embodiment, the polynucleotide consists of the nucleotide sequence shown in SEQ ID NO. 5, 6 or 16.
In a specific embodiment, the polynucleotide shown in SEQ ID NO. 4 encodes the amino acid sequence shown in SEQ ID NO. 1; the polynucleotide shown in SEQ ID NO. 5 codes for an amino acid sequence shown in SEQ ID NO. 2; the polynucleotide shown in SEQ ID NO. 6 encodes an amino acid sequence shown in SEQ ID NO. 3; the polynucleotide shown in SEQ ID NO. 16 encodes the amino acid sequence shown in SEQ ID NO. 15.
The term "polynucleotide" or "nucleic acid" as used herein refers to mRNA, RNA, cRNA, cDNA or DNA, including DNA in single-and double-stranded form. The term generally refers to a polymeric form of nucleotides of at least 10 bases in length, which are ribonucleotides or deoxynucleotides or modified forms of either type of nucleotide.
In certain embodiments, the polynucleotides of the present application comprise or consist of nucleotide sequences that specifically hybridize to nucleotide sequences encoding polypeptides set forth in any of SEQ ID NO:1-3 and 15 and functionally equivalent variants thereof under stringent conditions.
Stringent conditions for DNA hybridization can be routinely selected by those skilled in the art. Longer probes typically require higher temperatures for proper annealing, while shorter probes require lower temperatures. Hybridization generally depends on the re-annealing ability of denatured DNA when the complementary strand is in an environment below its melting temperature. The higher the degree of homology between the probe and the hybridizable sequence, the higher the relative temperature that can be used. Thus, higher relative temperatures tend to make the reaction conditions more stringent, while at lower temperatures, the severity is lower. For a detailed description of the stringent conditions of hybridization reactions, see Ausubel et al Current Protocols in Molecular Biology, wiley Interscience Publishers (1995).
In certain embodiments, the stringent conditions employed for DNA hybridization include: 1) Washing with low ionic strength and high temperature, e.g., 0.015M sodium chloride/0.0015M sodium citrate/0.1% sodium dodecyl sulfate at 50deg.C; 2) For hybridization, denaturing agents such as formamide (e.g., 50% (v/v) formamide plus 0.1% bovine serum albumin/0.1% Ficoll/0.1% polydiene pyrrolidone/50 mM sodium phosphate buffer pH 6.5 at 42℃and 750mM sodium chloride, 75mM sodium citrate; or (3) hybridization overnight at 42℃with 50% formamide, 5 XSSC (0.75M sodium chloride, 0.075M sodium citrate 1), 50mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 XDenhardt's solution, sonicated salmon sperm DNA (50.mg/mL), 0.1% SDS and 10% dextran sulfate, followed by washing in 0.2 XSSC (sodium chloride/sodium citrate) at 42℃for 10 minutes, and further high stringency washing in 0.1 XSSC with EDTA at 55 ℃. Moderately stringent conditions may be as described in Sambrook et al, molecular Cloning: a Laboratory Manual, new York: cold Spring Harbor Press, 1989. Medium stringency conditions include the use of wash solutions and hybridization conditions (e.g., temperature, ionic strength, and SDS percentage) that are less stringent than those described above. For example, moderately stringent conditions include hybridization with at least about 16% v/v to at least about 30% v/v formamide and at least about 0.5M to at least about 0.9M salt at 42℃and washing with at least about 0.1M to at least about 0.2M salt at 55 ℃. Moderately stringent conditions can also include the use of 1% Bovine Serum Albumin (BSA), 1mM EDTA, 0.5M NaHPO 4 (pH 7.2), 7% SDS at 65℃and (i) 2 XSSC, 0.1% SDS; or (ii) 0.5% BSA, 1mM EDTA, 40mM NaHPO 4 (pH 47.2)、5% SDS was washed at 60-65 ℃. The practitioner will adjust the temperature, ionic strength, etc. based on factors such as probe length. The stringency at which nucleic acids hybridize depends on the length and degree of complementarity of the nucleic acid molecules, as well as other variables well known in the art. The greater the similarity or homology between two nucleotide sequences, the greater the Tm of the nucleic acid hybrids containing these sequences. The relative stability of nucleic acid hybridization (corresponding to higher Tm) decreases in the following order: RNA, DNA, RNA and DNA. Preferably, the minimum length of the hybridizable nucleic acid is at least about 12 nucleotides, preferably at least about 16, more preferably at least about 24, and most preferably at least about 36 nucleotides.
In some embodiments, the inventors herein have codon optimized the sequence encoding the mature protein based on the mdlA gene derived from Penicilliurn camembertii U-150, based on the codon preference of pichia pastoris, and the synthesized gene expressed using pichia pastoris GS115 or mc1-1H as the expression host.
In a specific embodiment, the present application provides a nucleotide sequence (SEQ ID NO: 7) encoding a mature peptide of a monoglyceride-diglyceride lipase (SEQ ID NO: 8) obtained by codon optimization.
The nucleotide sequence of SEQ ID NO. 7 is shown below:
GATGTCTCCACTTCCGAACTGGACCAGTTCGAGTTCTGGGTTCAATACGCAGCCGCCTCTTACTACGAGGCTGATTACACCGCACAGGTTGGTGATAAGCTGTCCTGCTCTAAGGGTAACTGCCCAGAAGTTGAAGCAACCGGTGCAACTGTGTCTTACGACTTCTCCGATTCCACGATCACTGACACCGCAGGTTACATCGCAGTTGATCACACCAACTCCGCAGTGGTACTGGCATTCCGTGGTTCTTACTCCGTACGTAACTGGGTTGCTGATGCTACTTTCGTCCATACCAACCCAGGTCTGTGTGATGGTTGTCTGGCTGAGCTGGGTTTCTGGTCTTCCTGGAAGCTGGTTCGTGATGATATTATCAAAGAACTGAAAGAAGTGGTGGCACAGAACCCAAACTATGAACTGGTGGTCGTGGGCCACTCCCTGGGTGCTGCTGTGGCTACTCTGGCTGCTACCGACCTGCGTGGTAAAGGTTATCCATCTGCTAAACTGTACGCTTACGCTTCCCCTCGTGTTGGCAACGCAGCCCTGGCCAAATATATCACCGCCCAGGGCAACAACTTCCGTTTCACCCACACCAATGACCCAGTACCTAAACTGCCACTGCTGTCTATGGGCTATGTACATGTTTCTCCTGAATATTGGATCACCTCTCCTAACAACGCCACTGTTTCTACCTCTGACATCAAAGTCATTGACGGCGACGTATCTTTTGACGGCAATACCGGCACGGGCCTGCCTCTGCTGACGGACTTTGAAGCCCACATTTGGTACTTTGTACAGGTTGACGCCGGCAAAGGTCCTGGCCTGCCATTCAAACGTGTTTAA
the amino acid sequence of SEQ ID NO. 8 is shown below:
DVSTSELDQFEFWVQYAAASYYEADYTAQVGDKLSCSKGNCPEVEATGATVSYDFSDSTITDTAGYIAVDHTNSAVVLAFRGSYSVRNWVADATFVHTNPGLCDGCLAELGFWSSWKLVRDDIIKELKEVVAQNPNYELVVVGHSLGAAVATLAATDLRGKGYPSAKLYAYASPRVGNAALAKYITAQGNNFRFTHTNDPVPKLPLLSMGYVHVSPEYWITSPNNATVSTSDIKVIDGDVSFDGNTGTGLPLLTDFEAHIWYFVQVDAGKGPGLPFKRV
the polynucleotides disclosed herein may be combined with other DNA sequences, such as promoters, polyadenylation signals, other restriction sites, multiple cloning sites, other coding segments, and the like, such that their total lengths may vary significantly. It is therefore contemplated that polynucleotide fragments of nearly any length may be utilized; the overall length is preferably limited by the ease of preparation and use in contemplated recombinant DNA protocols.
Polynucleotides and their fusions may be prepared, manipulated and/or expressed using any of a variety of maturation techniques known and available in the art. For example, polynucleotide sequences encoding the polypeptides of the present application, or variants thereof, may be used in recombinant DNA molecules to direct expression of the polypeptides in an appropriate host cell. Because of the degeneracy inherent in the genetic code, other DNA sequences encoding substantially identical or functionally equivalent amino acid sequences may also be used in the present application, and these sequences may be used to clone and express a given polypeptide.
In addition, polynucleotide sequences of the present application can be engineered using methods well known in the art, including but not limited to cloning, processing, expression, and/or alteration of activity of gene products.
In certain embodiments, polynucleotides of the present application are produced by artificial synthesis, e.g., direct chemical synthesis or enzymatic synthesis. In alternative embodiments, the polynucleotides described above are produced by recombinant techniques.
In a specific embodiment, the MDGL-encoding gene mdlA used in the present application is derived from Penicillium carpentry (Penicilliurn camembertii) U-150 (GeneBank: BAA 14345.1), the codon is optimized according to the codon preference of Pichia pastoris, the optimized gene is synthesized by the company Shanghai, inc., and the synthesized gene is ligated into pUC 57.
In certain embodiments, the sequence of the obtained polynucleotide may be determined by conventional methods, preferably by the dideoxy chain termination method (Sanger et al PNAS,1977, 74:5463-5467). Such polynucleotide sequencing can also be accomplished using commercially available sequencing kits.
Expression vector
The present application provides expression vectors comprising the polynucleotides of the present application.
An "expression vector" as described herein is a recombinantly or synthetically produced nucleic acid construct having a series of specific nucleic acid elements that permit transcription of a particular nucleic acid in a host cell. The expression vector used in the present application may be a plasmid vector such as pPIC9K, pUC, pET-24a (+), pIRES2-EGFP, pcDNA3.1, pCI-neo, pDC516, pVAC, pcDNA4.0, pGEM-T, pDC315, or a viral vector such as adenovirus, adeno-associated virus, retrovirus, semliki forest virus (sFv) vector, or other vectors well known in the art.
In certain embodiments, polynucleotide sequences encoding the polypeptides set forth in any one of SEQ ID NOs 1-3 and 15 and variants thereof are cloned into vectors to form recombinant vectors comprising the polynucleotides described herein.
In a preferred embodiment, the expression vector used to clone the polynucleotide is a plasmid vector. In a more preferred embodiment, the plasmid vector is pPIC9K or pUC57.
In particular embodiments, the above-described expression vectors further comprise a regulatory sequence that regulates the expression of the polynucleotide, wherein the polynucleotide is operably linked to the regulatory sequence.
The term "regulatory sequence" as used herein refers to a polynucleotide sequence required to effect expression of a coding sequence to which it is linked. The nature of such regulatory sequences varies with the host organism. In prokaryotes, such regulatory sequences typically include promoters, ribosome binding sites and terminators; in eukaryotes, such regulatory sequences typically include promoters, terminators and, in some cases, enhancers. Thus, the term "regulatory sequence" includes all sequences whose presence is minimal necessary for expression of the gene of interest, but may also include other sequences whose presence is advantageous for expression of the gene of interest, such as a leader sequence.
The term "operably linked" as used herein refers to the following circumstance: the sequences involved are in a relationship that allows them to function in the desired manner. Thus, for example, a regulatory sequence "operably linked" to a coding sequence allows expression of the coding sequence under conditions compatible with the regulatory sequence.
In certain embodiments, expression vectors comprising nucleotide sequences encoding polypeptides represented by any one of SEQ ID NOs 1-3 and 15, and variants thereof, and suitable transcriptional/translational regulatory elements are constructed using methods well known to those skilled in the art. These methods include in vitro recombinant DNA techniques, DNA synthesis techniques, in vivo recombinant techniques, and the like (Sambrook, et al molecular Cloning, a Laboratory Manual, cold Spring Harbor laboratory New York, 1989). The nucleotide sequence is operably linked to an appropriate promoter in the expression vector to direct mRNA synthesis. Representative examples of such promoters include: the lac or trp promoter of E.coli; PL promoter of lambda phage; eukaryotic promoters include the CMV immediate early promoter, the HSV thymidine kinase promoter, the early and late SV40 promoters, LTRs from retroviruses, and other known promoters that control the expression of genes in prokaryotic or eukaryotic cells or viruses thereof. Expression vectors also include ribosome binding sites for translation initiation, transcription terminators, and the like. Insertion of the enhancer sequence into the vector will result in enhanced transcription in higher eukaryotic cells. Enhancers are cis-acting elements of DNA expression, usually about 10 to 300 base pairs, that act on a promoter to increase transcription of a gene. Examples include the SV40 enhancer 100 to 270 base pairs late in the replication origin, the polyoma enhancer late in the replication origin, and adenovirus enhancers, among others.
In addition, the expression vector preferably comprises one or more selectable marker genes to provide phenotypic traits for selection of transformed host cells, such as dihydrofolate reductase, neomycin resistance and Green Fluorescent Protein (GFP) for eukaryotic cell culture, or tetracycline or ampicillin resistance for E.coli, and the like.
Host cells
The present application provides host cells comprising the polynucleotides or expression vectors disclosed herein.
In certain embodiments, polynucleotides encoding the polypeptides and variants thereof set forth in any one of SEQ ID NOs 1-3 and 15 or expression vectors containing the polynucleotides are transformed or transduced into host cells to obtain genetically engineered host cells containing the polynucleotides or expression vectors.
The host cell used herein may be any host cell known to those skilled in the art, including prokaryotic cells, eukaryotic cells, such as bacterial cells, fungal cells, yeast cells, mammalian cells, insect cells or plant cells, and the like. Exemplary bacterial cells include any of the genera escherichia, bacillus, streptomyces, salmonella, pseudomonas, and staphylococcus, including, for example, escherichia coli, lactococcus, bacillus subtilis, bacillus cereus, salmonella typhimurium, pseudomonas fluorescens. Exemplary fungal cells include any of the species Aspergillus. Exemplary yeast cells include any of the genera pichia, lager, schizosaccharomyces, or saccharomyces, including pichia, lager, or schizosaccharomyces. Exemplary insect cells include any of the species spodoptera litura or drosophila, including drosophila S2 and spodoptera litura Sf9. Exemplary animal cells include CHO, COS or melanoma or any mouse or human cell line. The selection of an appropriate host is within the ability of those skilled in the art.
In a specific embodiment, the host cell used herein is E.coli. In a preferred embodiment, an expression vector carrying a polynucleotide sequence of the present application is transformed into E.coli DH 5. Alpha. Strain for inducible expression. In another specific embodiment, an expression vector carrying a polynucleotide sequence of the present application is transformed into pichia cells for expression. Pichia pastoris useful in the present application includes GS115 (purchased from American Type Culture Collection (ATCC)), mc1-1H (which was deposited by the applicant of the present application at China general microbiological culture Collection center (CGMCC, address: institute of microorganisms, national academy of sciences of China, no. 3, north Chen West Lu 1, the area of Korea, beijing, and post code: 100101), and has classification designations Pichia pastoris, accession number CGMCC 16668 and KM71 (purchased from American Type Culture Collection (ATCC)), all of which have HIS4 auxotroph markers. GS115 strain has AOX1 gene, is mut+, i.e. normal type of methanol utilization; the mc1-1H strain has an AOX1 gene and is mut+, namely normal methanol utilization; the AOX1 locus of KM71 strain is inserted by ARG4 gene, the phenotype is Muts, namely slow methanol utilization, and all three strains are suitable for a general yeast transformation method.
The expression vector may be introduced into the host cell using any technique known in the art, including transformation, transduction, transfection, viral infection, gene gun, or Ti-mediated gene transfer. Specific methods include calcium phosphate transfection, DEAE-dextran mediated transfection, lipofection or electroporation, and the like (Davis, L., dibner, M., battey, I., basic Methods in Molecular Biology, (1986)). As an example, when the host is a prokaryote such as E.coli, competent cells can be harvested after the exponential growth phase using CaCl as is well known in the art 2 The method is used for transformation.
In a specific embodiment, recombinant expression vectors containing polynucleotides encoding MDGL mutants are digested and linearized with SalI restriction enzymes, and then transformed into Pichia pastoris GS115 or mc1-1H competent cells by electric shock, and plated onto selection media MGYS screening plates for culture. And (3) picking the transformant to culture on a glycerol dioleate flat plate, and finally picking the transformant with the largest hydrolysis ring for shake flask fermentation test.
Screening and preparation method of polypeptide
The polypeptides of the present application may be screened and prepared by any suitable method known to those of skill in the art.
In some embodiments, the polypeptides disclosed herein are obtained by site-directed mutagenesis of MDGL, and the results can be validated and also used to direct directed engineering of MDGL.
In some embodiments, the polypeptides of the present application may also be produced by recombinant techniques, or chemically synthesized. Methods for producing recombinant peptides are known in the art. Methods of chemical synthesis of peptides are also well known to those skilled in the art, and for example, the polypeptides of the present application and variants thereof can be produced by directed peptide synthesis using solid phase techniques (Merrifield, J.am. Chem. Soc.85:2149-2154 (1963)). Protein synthesis may be performed manually or by automation. For example, automated synthesis can be accomplished using a 431A peptide synthesizer (Perkin Elmer) of Applied Biosystems. Alternatively, the different fractions may be synthesized chemically and combined chemically, respectively, to produce the desired molecule.
In a specific embodiment, the mdlA gene is subjected to site-directed mutagenesis using a rapid recombination cloning method, and PCR and recombination are performed using TOYOBO KOD enzyme and primers to obtain a plasmid containing the mutated sequence. The resulting fragment is cloned into a plasmid vector, e.g., pPIC9K, via avrili and EcoRI cleavage sites, and the resulting vector is transformed into a host cell, e.g., e. After the mutant is cultured in a medium such as ampicillin, the plasmid is extracted, linearized with SalI, and the amplified recombinant vector (e.g., a fragment of about 10 kb) is recovered and then introduced into yeast cells. The yeast cells into which the recombinant vector is introduced are cultured on a screening medium plate, and mutants with high enzyme activity and/or high heat stability are screened by measuring the enzyme activity of the supernatant medium.
In some embodiments, the selected mutants may also be validated, e.g., fermentation validated. In some embodiments, the validated mutants are sequenced to determine the sequence after mutation.
In some embodiments, the mutated sequences obtained by screening are cloned into vectors, e.g., plasmid vectors, and then introduced into yeast cells, e.g., transformed into competent cells of pichia pastoris GS115 or mc1-1H strain by electrotransformation, and fermentation to verify the enzymatic activity and thermostability of the polypeptide.
In some embodiments, a method of screening for a polypeptide disclosed herein comprises:
1) Mutating the nucleotide sequence shown in SEQ ID NO. 7 to obtain a mutant sequence;
2) Expressing the mutated sequence obtained in step 1) to obtain a polypeptide mutated with respect to the polypeptide shown in SEQ ID NO. 8; and
3) Mutant polypeptides with enzyme activity and/or thermal stability higher than those of the polypeptide shown in SEQ ID NO. 8 are screened.
In certain embodiments, the mutation performed in the above method on the nucleotide sequence set forth in SEQ ID NO. 7 is a site-directed mutation.
In certain embodiments, the expression described in the above methods comprises transforming an expression vector comprising the mutant sequence into a yeast strain. In a specific embodiment, the expression comprises cloning the mutated sequence obtained in step 1) onto an expression vector, and then transforming or transducing into a suitable host cell; and culturing the host cell, recovering the recombinant expression vector, and then transforming the product into a yeast strain for culturing.
In certain embodiments, the mutation site of the mutant polypeptide is selected from any one or a combination of the following: 22, 25, 27, 84, 116, 136, 209, 229, 230, 231, 260, and 269.
In certain embodiments, the mutant polypeptide comprises any one or a combination of the following mutation sites: Y22C, D25R, T R, Y H/R, W116A/H/F, N136D, M209A/V, S229R, T230R, S231R, I E/R/H and G269C.
In certain embodiments, the mutation site comprises Y22C and G269C. In certain embodiments, the mutation site comprises S229R, T230R and S231R. In certain embodiments, the mutation site comprises Y22C, S229R, T230R, S R and G269C. In certain embodiments, the mutation site comprises Y22C, T27R, Y H, N35136D, S229R, T230R, S231R and G269C.
In a specific embodiment, the mutation sites of MDGLs (SEQ ID NO: 1) are Y22C and G269C. In a specific embodiment, the mutation sites of MDGLc (SEQ ID NO: 2) are S229R, T230R and S231R. In a specific embodiment, the mutation sites of MDGLsc (SEQ ID NO: 3) are Y22C, S229R, T230R, S231R and G269C. In a specific embodiment, the mutation sites of MDGL-3G3 (SEQ ID NO: 15) are Y22C, T27R, Y84H, N136D, S229R, T230R, S R and G269C.
In exemplary embodiments, the method of screening for a polypeptide of the present application comprises one or more of the following steps:
1) Mutating the nucleotide sequence shown in SEQ ID NO. 7 to obtain a mutant sequence;
2) Cloning the mutated sequence obtained in step 1) onto an expression vector and then transforming or transducing into a suitable host cell;
3) Culturing the host cell in step 2), recovering the recombinant expression vector, and then converting the product into a yeast strain for culturing; and
4) Mutant polypeptides with enzyme activity and/or thermal stability higher than those of the polypeptide shown in SEQ ID NO. 8 are screened.
Suitable host cells are those suitable for expression of the expression vector or polynucleotide of interest. Suitable media refer to media suitable for growth or inducible expression of the host cell.
In certain embodiments, various conventional media may be selected depending on the host cell used. The culture is carried out under conditions suitable for the growth of the host cell. Preferably, the engineered host cells may be cultured in conventional nutrient media modified as appropriate for activating promoters to screen transformants or amplify the polynucleotides of the present application. After transformation of the appropriate host cell and growth of the host cell to the appropriate cell density, the selected promoter is induced by a suitable method (e.g., temperature shift or chemical induction) and the cell is cultured for an additional period of time to allow production of the polypeptide of interest or fragment thereof.
In certain embodiments, the polypeptide produced by the host cell may be coated in the cell, or expressed on a cell membrane, or secreted out of the cell. If desired, the recombinant proteins can be isolated and purified by various separation methods using their physical, chemical and other properties. For example, the expressed polypeptide or fragment thereof may be recovered and purified from recombinant cell culture by the following methods well known in the art: conventional renaturation treatment, protein precipitant treatment (salting-out method), centrifugation, osmotic sterilization, ultrasonic treatment, ultracentrifugation, molecular sieve chromatography (gel filtration), adsorption chromatography, ion exchange chromatography, high Performance Liquid Chromatography (HPLC), and other various liquid chromatography techniques and combinations of these methods.
In specific embodiments, the polypeptides disclosed herein are secreted into the medium outside the host cell, e.g., outside the yeast cell. In some embodiments, the cultured bacterial fluid is centrifuged and the supernatant is taken for use in determining the activity of the polypeptide, e.g., the ability to hydrolyze pNPP.
Use of polypeptides having monoglyceride-diglyceride lipase activity
The polypeptide of the application is a polypeptide with lipase activity, in particular a polypeptide with glycerol mono-diacyl ester lipase (MDGL) activity, and can utilize esterification or transesterification reaction to produce glycerol monoacyl ester (MAG) with higher industrial value.
The polypeptides disclosed herein have higher enzyme activity and/or thermostability. In some embodiments, the polypeptide disclosed herein (e.g., a polypeptide set forth in any one of SEQ ID NOS: 1-3 and 15) has a higher pNPP hydrolysis activity than wild-type MDGL (e.g., a control polypeptide set forth in SEQ ID NO: 8). In some embodiments, the polypeptides disclosed herein (e.g., the polypeptides set forth in SEQ ID NO:1, 3, or 15) have a thermostability that is superior to wild-type MDGL. In some embodiments, the polypeptides disclosed herein (e.g., the polypeptides shown in SEQ ID NO:2, 3, or 15) have a higher specific enzymatic activity than wild-type MDGL (e.g., the control polypeptide shown in SEQ ID NO: 8). In some embodiments, the MDGL mutants disclosed herein (e.g., the polypeptides shown in SEQ ID NO:3 or 15) have significantly higher enzyme activity and thermostability than wild-type MDGL (e.g., the control polypeptides shown in SEQ ID NO: 8).
The polypeptides disclosed herein can be used for MAG synthesis, MAG/DAG hydrolysis, esterification of fatty acids, and MAG/DAG transesterification, among others. The present application provides for the use of the polypeptides disclosed herein to catalyze synthesis, hydrolysis or transesterification reactions of esters, preferably, the esters are monoglycerides and/or diglycerides. In some embodiments, the use of the enzymes disclosed herein can significantly reduce the amount of enzyme added and the cost of production.
In this specification and claims, the words "comprise", "comprising" and "include" mean "including but not limited to", and are not intended to exclude other moieties, additives, components or steps.
It should be understood that features, characteristics, components or steps described in particular aspects, embodiments or examples of the present application may be applied to any other aspects, embodiments or examples described herein unless contradicted by context.
The foregoing disclosure generally describes the present application, which is further illustrated by the following examples. These examples are described for illustration only and are not intended to limit the scope of the present application. Although specific terms and values are used herein, these terms and values are likewise to be understood as exemplary and do not limit the scope of the present application.
Examples
Example 1: synthesis and cloning of MDGL sequences
SEQ ID NO:7 (the coding sequence of which is shown as polypeptide 28-3 or MDGL of SEQ ID NO: 8) was synthesized by the division of biological engineering (Shanghai) Co.
The synthesized MDGL sequence was cloned in plasmid pUC57, stored in E.coli DH 5. Alpha. And the plasmid was digested with AvrII and EcoRI from NEB, and the plasmid was digested with AvrII and EcoRI together with the inducible promoter P AOX1 And the yeast alpha mating factor plasmid pPIC9K, then using Axygen company gel recovery kit purification. The MDGL sequence enzyme fragment and the plasmid enzyme fragment are connected by using T4 DNA ligase of Fermentas company according to the product specification, and the connection heat shock method is transferred into a large scaleIn enterobacteria DH 5. Alpha. Were cultured overnight on LB plates containing ampicillin. The following day, the monoclonals were picked up and cultured in LB liquid medium, plasmids were extracted using the Axygen company plasmid extraction kit and submitted to Shanghai Biotechnology Co.Ltd for sequencing.
The strain with correct sequencing result is used for extracting plasmid, after the plasmid is linearized by SalI, the Pichia pastoris GS115 is transformed by adopting an electric shock method, the transformant is screened on a screening medium without histidine, and only the recombinant Pichia pastoris transformed with exogenous gene fragments can grow on the screening medium. Transferring the colony on the screening culture medium to a BMMY-olive oil screening culture medium plate, selecting the clone according to the size of a hydrolysis circle, and performing shake flask fermentation and activity detection to obtain recombinant Pichia pastoris GS115-MDGL.
Example 2: MDGL site-directed mutagenesis
The plasmid pPIC9K-MDGL containing the polynucleotide encoding MDGL was amplified using the primer pair MDGLm-F1 (CTCTTACTGTGAGGCTGATTACACC, SEQ ID NO: 9) and MDGLm-R1 (CCTTTGCAGGCGTCAACCTGTACAAAGTACC, SEQ ID NO: 10), and the primer pair MDGLm-F2 (TTGACGCCTGCAAAGGTCCTGGCCT, SEQ ID NO: 11) and MDGLm-R2 (CAGCCTCACAGTAAGAGGCGGCTGC, SEQ ID NO: 12), and PCR was performed using the KOD enzyme of TOYOBO Co., as follows: mu.L of a 25. Mu. L, dNTP mixture of 2 Xbuffer (2.5 mM each), 1.5. Mu.L of each primer, 1. Mu.L of plasmid template 1. Mu. L, KOD enzyme, and double distilled water was added to 50. Mu.L. The PCR reaction procedure was as follows: 98℃for 10s, 56℃for 20s, 68℃for 1kb/min, 68℃for 5min,30 cycles. The PCR product was purified using the PCR purification kit from Axygen. The purified PCR products were ligated by using a rapid recombination kit (Tiangen Biochemical technology (Beijing) Co., ltd.), the resulting vector was transformed into E.coli DH 5. Alpha. Strain, the positive clones were sequenced, the strain containing the correct mutated sequence was inoculated into LB medium containing 100mg/mL ampicillin, cultured at 37℃for 16 hours, and plasmids were extracted using an Axygen Minipre plasmid extraction kit to obtain plasmid pPIC9K-MDGLs containing the mutated sequence. The nucleotide sequence of the mutant sequence is shown as SEQ ID NO:4, the coded MDGL mutant MDGLs has an amino acid sequence shown in SEQ ID NO: 1.
The plasmid pPIC9K-MDGL containing the polynucleotide encoding MDGL was amplified using primer pairs MDGL-ctlm-F (CGTAGACGTGACATCAAAGTCATTGACGGCG, SEQ ID NO: 13) and MDGL-ctlm-R (GATGTCACGTCTACGAACAGTGGCGTTGTTAGGAGAGG, SEQ ID NO: 14), and PCR was performed using KOD enzyme from TOYOBO Co., ltd. In the following reaction system: mu.L of a 25. Mu. L, dNTP mixture of 2 Xbuffer (2.5 mM each), 1.5. Mu.L of each primer, 1. Mu.L of plasmid template 1. Mu. L, KOD enzyme, and double distilled water was added to 50. Mu.L. The PCR reaction procedure was as follows: 98℃for 10s, 56℃for 20s, 68℃for 10min, 68℃for 5min,30 cycles. The PCR product was purified using the PCR purification kit from Axygen. The purified PCR products were ligated by using a rapid recombination kit (Tiangen Biochemical technology (Beijing) Co., ltd.), the resulting vector was transformed into E.coli DH 5. Alpha. Strain, the positive clones were sequenced, the strain containing the correct mutated sequence was inoculated into LB medium containing 100mg/mL ampicillin, cultured at 37℃for 16 hours, and plasmids were extracted using an Axygen Minipre plasmid extraction kit to obtain plasmid pPIC9K-MDGLc containing the mutated sequence. The nucleotide sequence of the mutant sequence is shown as SEQ ID NO:5, the coded MDGL mutant MDGLc has an amino acid sequence shown in SEQ ID NO: 2.
The plasmid pPIC9K-MDGLs containing the polynucleotide encoding MDGLs was amplified using primer pairs MDGL-ctlm-F and MDGL-ctlm-R, and PCR was performed using KOD enzyme from TOYOBO Co., ltd. The reaction system is as follows: mu.L of a 25. Mu. L, dNTP mixture of 2 Xbuffer (2.5 mM each), 1.5. Mu.L of each primer, 1. Mu.L of plasmid template 1. Mu. L, kKOD enzyme, and double distilled water was added to 50ul. The PCR reaction procedure was as follows: 98℃for 10s, 56℃for 20s, 68℃for 10min, 68℃for 5min,30 cycles. The PCR product was purified using the PCR purification kit from Axygen. The purified PCR products were ligated by using a rapid recombination kit (Tiangen Biochemical technology (Beijing) Co., ltd.), the resulting vector was transformed into E.coli DH 5. Alpha. Strain, the positive clones were sequenced, the strain containing the correct mutated sequence was inoculated into LB medium containing 100mg/mL ampicillin, cultured at 37℃for 16 hours, and plasmids were extracted using an Axygen Minipre plasmid extraction kit to obtain plasmid pPIC9K-MDGLsc containing the mutated sequence. The nucleotide sequence of the mutant sequence is shown as SEQ ID NO:6, the coded MDGL mutant MDGLsc has an amino acid sequence shown in SEQ ID NO: 3.
Example 3: mutant transformed pichia pastoris and panel screening
Recombinant expression vectors containing polynucleotides encoding MDGL mutants (MDGLs, MDGLc or MDGLsc) and wild type MDGL (polypeptide 28-3) which were sequenced correctly were digested and linearized with SalI restriction enzyme, and then transformed into Pichia pastoris GS115 competent cells by electric shock, plated onto selection medium MGYS screening plates, and cultured at 28℃for three days. Transformants were picked up on glycerol dioleate plates (1% YNB, 2% phospholipid, 2% agarose, 2% methanol, 0.5% glycerol dioleate) and incubated at 28℃for 1 day, and transformants with the largest hydrolytic circles were picked up for shake flask fermentation testing.
Example 4: fermentation verification of strain producing monoglyceride-diacyl ester lipase
The larger transparent circle of the glycerol dioleate plate of example 3 was picked up and cultured in 50mL BMGY medium (1% yeast extract, 2% peptone, 1.34% ammonium sulfate-containing yeast nitrogen source base (YNB) without amino acid, 2% glycerol, 4X 10-5% D-biotin, 100mM citric acid-sodium citrate buffer, pH 6.6), 28℃and 200rpm for 1 day, and OD was measured 600 Taking a certain volume of culture solution, centrifuging at 8000rpm for 2 min, and re-suspending into 250mL triangular flask containing 50mL BMMY (1% yeast extract, 2% peptone, 1.34% yeast nitrogen source alkali (YNB) containing ammonium sulfate and no amino acid, 2% methanol, 4×10-5% D-biotin, 100mM citric acid-sodium citrate buffer solution, pH 6.6), and the initial bacterial liquid OD 600 The culture was carried out at 28℃and 200rpm, 500. Mu.L of methanol was added every 12 hours, and the culture was carried out for 3 days. After completion of the culture, all the bacterial solutions were collected, centrifuged at 8000rpm for 5 minutes, and the total supernatant was collected and ultrafiltered with a 10K filter membrane (Millipore) and the protein concentration was measured using a Bradford reagent (manufactured and bioengineered (Shanghai) Co., ltd.). The protein concentrations of MDGLs, MDGLc, MDGLsc and 28-3 were 0.16mg/mL, 0.13mg/mL and 0.45mg/mL, respectively, as shown in FIG. 1.
Example 5: enzyme activity yield of MDGL detected by pNPP method
To 400. Mu.L of the pNPP reaction solution (3 mg/mL, dissolved in isopropyl alcohol, 1mL was mixed with 9mL of a 0.2M sodium acetate-acetic acid solution before use), 5. Mu.L of the enzyme solution was added, the reaction was continued at 37℃for 20 minutes at 1000rpm, 400. Mu.L of a stop solution (200 mM Tris-HCl, 5% triton-100, pH 8.5) was added, the reaction was stopped, centrifugation was performed at 12000rpm for 5 minutes, and the supernatant was collected and the absorbance at 405nm was measured.
Enzyme activity unit=0.1935 od 405 *V 1 * Dilution factor/T/V 2
V 1 : volume of reaction solution (mL), V 2 : enzyme volume (mL), T: time (min).
The measurement results were as follows: 28-3 was 7.3U/mL, MDGLs was 2.5U/mL, MDGLc was 0.9U/mL, and MDGLsc was 1U/mL, see FIG. 2.
Example 6: MDGL mutant thermal stability test
Taking 50 mu L of fermentation supernatant, preserving heat at 40-50 ℃ for 30 minutes, immediately measuring pNPP hydrolysis capacity after the heat preservation is finished, and calculating enzyme activity residual rate and enzyme activity residual rate by taking the supernatant which is not subjected to heat treatment as a referenceThe results show that, compared to polypeptide 28-3, MDGLs is +.>About 1℃increase in MDGLsc +.>An increase of about 2 ℃ (fig. 3), which indicates an increase in the thermal stability of MDGLs and MDGLsc, especially an increase in the thermal stability of MDGLsc is more pronounced.
Example 7: SDS-PAGE (sodium dodecyl sulfate polyacrylamide gel electrophoresis) analysis of MDGL mutant
In order to exclude the effect of the impurity proteins, 3 mutants (MDGLs, MDGLc and MDGLsc) and wild type MDGL (28-3) were subjected to SDS-PAGE analysis (FIG. 4), all samples were protein with pNPP hydrolysis activity of 16U, the band of MDGLs was thicker than that of wild type MDGL, which indicated that the specific enzyme activity of MDGLs was reduced, the band of MDGLc was significantly thinner than that of wild type MDGL, which indicated that the specific enzyme activity of MDGLc was significantly improved, and the band of MDGLsc was thinner than that of wild type MDGL, which indicated that the specific enzyme activity of MDGLsc was improved.
Example 8: synthesis and cloning of MDGL-3G3 sequence
SEQ ID NO:16 (the coding sequence of which is shown as the polypeptide MDGL-3G3 shown in SEQ ID NO: 15) is synthesized by the division of biological engineering (Shanghai) Co.
The synthesized MDGL-3G3 sequence was cloned in plasmid pUC57, stored in E.coli DH 5. Alpha. And the plasmid was digested with AvrII and EcoRI from NEB, and the plasmid was digested with AvrII and EcoRI together with the inducible promoter P AOX1 And the yeast alpha mating factor plasmid pPIC9K, then using Axygen company gel recovery kit purification. The MDGL-3G3 sequence fragment and the plasmid enzyme fragment were ligated using Fermentas T4 DNA ligase according to the product instructions, and the ligation was transferred into E.coli DH 5. Alpha. By heat shock and cultured overnight on LB plates containing ampicillin. The following day, the monoclonals were picked up and cultured in LB liquid medium, plasmids were extracted using the Axygen company plasmid extraction kit and submitted to Shanghai Biotechnology Co.Ltd for sequencing.
Example 9: transformation of Pichia pastoris and Panel screening with the mutant MDGL-3G3
The recombinant expression vector containing the polynucleotide for encoding MDGL mutant (MDGL-3G 3) with correct sequencing result is digested and linearized by SalI restriction enzyme, and then pichia pastoris mc1-1H competent cells are transformed by adopting an electric shock method, spread on a selection medium MGYS screening plate and cultured for three days at 28 ℃. Transformants were picked up on glycerol dioleate plates (1% YNB, 2% phospholipid, 2% agarose, 2% methanol, 0.5% glycerol dioleate) and incubated at 28℃for 1 day, and transformants with the largest hydrolytic circles were picked up for shake flask fermentation testing. The control strain was sequence-optimized MDGL with no mutation in amino acid sequence.
Example 10: fermentation verification of strain producing monoglyceride-diacyl ester lipase
5 large transparent circles of the glycerol dioleate plates of example 9 were picked up and placed in 50mL BMGY medium (1% yeast extract, 2% peptone, 1.34% ammonium sulfate-containing and amino acid-free yeast nitrogen source base (YNB), 2% glycerol, 4X 1)0-5% D-biotin, 100mM citric acid-sodium citrate buffer, pH 6.6), 28℃and 200rpm for 1 day, and OD was measured 600 . The culture broth was centrifuged at 8000rpm for 2 minutes and resuspended in a 250mL Erlenmeyer flask containing 50mL BMMY (1% yeast extract, 2% peptone, 1.34% Yeast Nitrogen Base (YNB) with ammonium sulfate without amino acids, 2% methanol, 4X 10-5% D-biotin, 100mM citric acid-sodium citrate buffer, pH 6.6), incubated at 28℃at 200rpm with 500. Mu.L of methanol added every 12h for 3 days. After completion of the culture, all the bacterial solutions were collected, centrifuged at 8000rpm for 5 minutes, and the total supernatant was collected and ultrafiltered with a 10K filter membrane (Millipore) and the protein concentration was measured using a Bradford reagent (manufactured and bioengineered (Shanghai) Co., ltd.). The protein concentration of MDGL-3G3-1 to MDGL-3G3-5 was 0.22-0.64mg/mL, and the protein yield of the control strain was 0.69mg/mL, see FIG. 5.
Example 11: detection of enzyme activity yield of MDGL-3G3 mutant by pNPP method
To 400. Mu.L of the pNPP reaction solution (3 mg/mL, dissolved in isopropyl alcohol, 1mL was mixed with 9mL of a 0.2M sodium acetate-acetic acid solution before use), 5. Mu.L of the enzyme solution was added, the reaction was carried out at 37℃for 10 minutes at 1000rpm, 400. Mu.L of a stop solution (200 mM Tris-HCl, 5% triton-100, pH 8.5) was added to terminate the reaction, centrifugation was carried out at 12000rpm for 5 minutes, and the supernatant was collected and the absorbance at 405nm was measured.
Enzyme activity unit=0.1935 od 405 *V 1 * Dilution factor/T/V 2
V 1 : volume of reaction solution (mL), V 2 : enzyme volume (mL), T: time (min).
Specific enzyme activity = enzyme activity yield/protein concentration
The measurement results were as follows: the enzyme activity yield of the wild MDGL is 14.1U/mL, the specific enzyme activity is 20.51U/mg, the enzyme activity yield of the MDGL-3G3-2 is 25.1U/mL, the specific enzyme activity is 43.1U/mg, and the specific enzyme activity is improved by 110.1% compared with the wild MDGL; the enzyme activity yield of MDGL-3G3-3 is 19.0U/mL, the specific enzyme activity is 29.9U/mg, and the specific enzyme activity is improved by 45.9% compared with that of the wild MDGL, see FIG. 6.
Example 12: MDGL-3G3 mutant thermal stability test
Taking 50 mu L of each of the MDGL-3G3-3 and MDGL fermentation liquid, preserving the temperature at 37-62 ℃ for 30 minutes, immediately measuring the pNPP hydrolysis capability after the heat preservation, and calculating the enzyme activity residual rate by taking the supernatant which is not subjected to heat treatment as a reference. As a result, the residual activity ratio of MDGL-3G3 after heat preservation treatment at 41.8-52.2 ℃ for 30min is obviously higher than that of MDGL, which shows that the heat stability of MDGL-3G3 is obviously improved, as shown in FIG. 7.
Example 13: SDS-PAGE analysis of MDGL-3G3 mutant and MDGL (sodium dodecyl sulfate polyacrylamide gel electrophoresis)
A large amount of non-target proteins are present in the fermentation broth, and SDS-PAGE analysis is performed on two mutants of MDGL-3G3-2 and MDGL-3G3-3 and wild type MDGL (FIG. 8), and all samples are proteins with pNPP hydrolysis activity of 150U, and the bands of MDGL-3G3-2 and MDGL-3G3-3 are finer than that of the wild type MDGL, which indicates that the specific enzyme activities of MDGL-3G3-2 and MDGL-3G3 are improved.
It should be understood that while the application is illustrated in some form, the application is not limited to what has been shown and described in this specification. It will be apparent to those skilled in the art that various changes can be made without departing from the scope of the application. Such variations are within the scope of the claims of the present application.
Sequence listing
<110> Feng Yi (Shanghai) Biotechnology research and development center Co., ltd
<120> novel monoglyceride-diacyl ester lipase
<150> CN 201911329804.5
<151> 2019-12-20
<160> 16
<170> CNIPASequenceListing 1.0
<210> 1
<211> 279
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 1
Asp Val Ser Thr Ser Glu Leu Asp Gln Phe Glu Phe Trp Val Gln Tyr
1 5 10 15
Ala Ala Ala Ser Tyr Cys Glu Ala Asp Tyr Thr Ala Gln Val Gly Asp
20 25 30
Lys Leu Ser Cys Ser Lys Gly Asn Cys Pro Glu Val Glu Ala Thr Gly
35 40 45
Ala Thr Val Ser Tyr Asp Phe Ser Asp Ser Thr Ile Thr Asp Thr Ala
50 55 60
Gly Tyr Ile Ala Val Asp His Thr Asn Ser Ala Val Val Leu Ala Phe
65 70 75 80
Arg Gly Ser Tyr Ser Val Arg Asn Trp Val Ala Asp Ala Thr Phe Val
85 90 95
His Thr Asn Pro Gly Leu Cys Asp Gly Cys Leu Ala Glu Leu Gly Phe
100 105 110
Trp Ser Ser Trp Lys Leu Val Arg Asp Asp Ile Ile Lys Glu Leu Lys
115 120 125
Glu Val Val Ala Gln Asn Pro Asn Tyr Glu Leu Val Val Val Gly His
130 135 140
Ser Leu Gly Ala Ala Val Ala Thr Leu Ala Ala Thr Asp Leu Arg Gly
145 150 155 160
Lys Gly Tyr Pro Ser Ala Lys Leu Tyr Ala Tyr Ala Ser Pro Arg Val
165 170 175
Gly Asn Ala Ala Leu Ala Lys Tyr Ile Thr Ala Gln Gly Asn Asn Phe
180 185 190
Arg Phe Thr His Thr Asn Asp Pro Val Pro Lys Leu Pro Leu Leu Ser
195 200 205
Met Gly Tyr Val His Val Ser Pro Glu Tyr Trp Ile Thr Ser Pro Asn
210 215 220
Asn Ala Thr Val Ser Thr Ser Asp Ile Lys Val Ile Asp Gly Asp Val
225 230 235 240
Ser Phe Asp Gly Asn Thr Gly Thr Gly Leu Pro Leu Leu Thr Asp Phe
245 250 255
Glu Ala His Ile Trp Tyr Phe Val Gln Val Asp Ala Cys Lys Gly Pro
260 265 270
Gly Leu Pro Phe Lys Arg Val
275
<210> 2
<211> 279
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 2
Asp Val Ser Thr Ser Glu Leu Asp Gln Phe Glu Phe Trp Val Gln Tyr
1 5 10 15
Ala Ala Ala Ser Tyr Tyr Glu Ala Asp Tyr Thr Ala Gln Val Gly Asp
20 25 30
Lys Leu Ser Cys Ser Lys Gly Asn Cys Pro Glu Val Glu Ala Thr Gly
35 40 45
Ala Thr Val Ser Tyr Asp Phe Ser Asp Ser Thr Ile Thr Asp Thr Ala
50 55 60
Gly Tyr Ile Ala Val Asp His Thr Asn Ser Ala Val Val Leu Ala Phe
65 70 75 80
Arg Gly Ser Tyr Ser Val Arg Asn Trp Val Ala Asp Ala Thr Phe Val
85 90 95
His Thr Asn Pro Gly Leu Cys Asp Gly Cys Leu Ala Glu Leu Gly Phe
100 105 110
Trp Ser Ser Trp Lys Leu Val Arg Asp Asp Ile Ile Lys Glu Leu Lys
115 120 125
Glu Val Val Ala Gln Asn Pro Asn Tyr Glu Leu Val Val Val Gly His
130 135 140
Ser Leu Gly Ala Ala Val Ala Thr Leu Ala Ala Thr Asp Leu Arg Gly
145 150 155 160
Lys Gly Tyr Pro Ser Ala Lys Leu Tyr Ala Tyr Ala Ser Pro Arg Val
165 170 175
Gly Asn Ala Ala Leu Ala Lys Tyr Ile Thr Ala Gln Gly Asn Asn Phe
180 185 190
Arg Phe Thr His Thr Asn Asp Pro Val Pro Lys Leu Pro Leu Leu Ser
195 200 205
Met Gly Tyr Val His Val Ser Pro Glu Tyr Trp Ile Thr Ser Pro Asn
210 215 220
Asn Ala Thr Val Arg Arg Arg Asp Ile Lys Val Ile Asp Gly Asp Val
225 230 235 240
Ser Phe Asp Gly Asn Thr Gly Thr Gly Leu Pro Leu Leu Thr Asp Phe
245 250 255
Glu Ala His Ile Trp Tyr Phe Val Gln Val Asp Ala Gly Lys Gly Pro
260 265 270
Gly Leu Pro Phe Lys Arg Val
275
<210> 3
<211> 279
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 3
Asp Val Ser Thr Ser Glu Leu Asp Gln Phe Glu Phe Trp Val Gln Tyr
1 5 10 15
Ala Ala Ala Ser Tyr Cys Glu Ala Asp Tyr Thr Ala Gln Val Gly Asp
20 25 30
Lys Leu Ser Cys Ser Lys Gly Asn Cys Pro Glu Val Glu Ala Thr Gly
35 40 45
Ala Thr Val Ser Tyr Asp Phe Ser Asp Ser Thr Ile Thr Asp Thr Ala
50 55 60
Gly Tyr Ile Ala Val Asp His Thr Asn Ser Ala Val Val Leu Ala Phe
65 70 75 80
Arg Gly Ser Tyr Ser Val Arg Asn Trp Val Ala Asp Ala Thr Phe Val
85 90 95
His Thr Asn Pro Gly Leu Cys Asp Gly Cys Leu Ala Glu Leu Gly Phe
100 105 110
Trp Ser Ser Trp Lys Leu Val Arg Asp Asp Ile Ile Lys Glu Leu Lys
115 120 125
Glu Val Val Ala Gln Asn Pro Asn Tyr Glu Leu Val Val Val Gly His
130 135 140
Ser Leu Gly Ala Ala Val Ala Thr Leu Ala Ala Thr Asp Leu Arg Gly
145 150 155 160
Lys Gly Tyr Pro Ser Ala Lys Leu Tyr Ala Tyr Ala Ser Pro Arg Val
165 170 175
Gly Asn Ala Ala Leu Ala Lys Tyr Ile Thr Ala Gln Gly Asn Asn Phe
180 185 190
Arg Phe Thr His Thr Asn Asp Pro Val Pro Lys Leu Pro Leu Leu Ser
195 200 205
Met Gly Tyr Val His Val Ser Pro Glu Tyr Trp Ile Thr Ser Pro Asn
210 215 220
Asn Ala Thr Val Arg Arg Arg Asp Ile Lys Val Ile Asp Gly Asp Val
225 230 235 240
Ser Phe Asp Gly Asn Thr Gly Thr Gly Leu Pro Leu Leu Thr Asp Phe
245 250 255
Glu Ala His Ile Trp Tyr Phe Val Gln Val Asp Ala Cys Lys Gly Pro
260 265 270
Gly Leu Pro Phe Lys Arg Val
275
<210> 4
<211> 840
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 4
gatgtctcca cttccgaact ggaccagttc gagttctggg ttcaatacgc agccgcctct 60
tactgtgagg ctgattacac cgcacaggtt ggtgataagc tgtcctgctc taagggtaac 120
tgcccagaag ttgaagcaac cggtgcaact gtgtcttacg acttctccga ttccacgatc 180
actgacaccg caggttacat cgcagttgat cacaccaact ccgcagtggt actggcattc 240
cgtggttctt actccgtacg taactgggtt gctgatgcta ctttcgtcca taccaaccca 300
ggtctgtgtg atggttgtct ggctgagctg ggtttctggt cttcctggaa gctggttcgt 360
gatgatatta tcaaagaact gaaagaagtg gtggcacaga acccaaacta tgaactggtg 420
gtcgtgggcc actccctggg tgctgctgtg gctactctgg ctgctaccga cctgcgtggt 480
aaaggttatc catctgctaa actgtacgct tacgcttccc ctcgtgttgg caacgcagcc 540
ctggccaaat atatcaccgc ccagggcaac aacttccgtt tcacccacac caatgaccca 600
gtacctaaac tgccactgct gtctatgggc tatgtacatg tttctcctga atattggatc 660
acctctccta acaacgccac tgtttctacc tctgacatca aagtcattga cggcgacgta 720
tcttttgacg gcaataccgg cacgggcctg cctctgctga cggactttga agcccacatt 780
tggtactttg tacaggttga cgcctgcaaa ggtcctggcc tgccattcaa acgtgtttaa 840
<210> 5
<211> 840
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 5
gatgtctcca cttccgaact ggaccagttc gagttctggg ttcaatacgc agccgcctct 60
tactacgagg ctgattacac cgcacaggtt ggtgataagc tgtcctgctc taagggtaac 120
tgcccagaag ttgaagcaac cggtgcaact gtgtcttacg acttctccga ttccacgatc 180
actgacaccg caggttacat cgcagttgat cacaccaact ccgcagtggt actggcattc 240
cgtggttctt actccgtacg taactgggtt gctgatgcta ctttcgtcca taccaaccca 300
ggtctgtgtg atggttgtct ggctgagctg ggtttctggt cttcctggaa gctggttcgt 360
gatgatatta tcaaagaact gaaagaagtg gtggcacaga acccaaacta tgaactggtg 420
gtcgtgggcc actccctggg tgctgctgtg gctactctgg ctgctaccga cctgcgtggt 480
aaaggttatc catctgctaa actgtacgct tacgcttccc ctcgtgttgg caacgcagcc 540
ctggccaaat atatcaccgc ccagggcaac aacttccgtt tcacccacac caatgaccca 600
gtacctaaac tgccactgct gtctatgggc tatgtacatg tttctcctga atattggatc 660
acctctccta acaacgccac tgttcgtaga cgtgacatca aagtcattga cggcgacgta 720
tcttttgacg gcaataccgg cacgggcctg cctctgctga cggactttga agcccacatt 780
tggtactttg tacaggttga cgccggcaaa ggtcctggcc tgccattcaa acgtgtttaa 840
<210> 6
<211> 840
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 6
gatgtctcca cttccgaact ggaccagttc gagttctggg ttcaatacgc agccgcctct 60
tactgtgagg ctgattacac cgcacaggtt ggtgataagc tgtcctgctc taagggtaac 120
tgcccagaag ttgaagcaac cggtgcaact gtgtcttacg acttctccga ttccacgatc 180
actgacaccg caggttacat cgcagttgat cacaccaact ccgcagtggt actggcattc 240
cgtggttctt actccgtacg taactgggtt gctgatgcta ctttcgtcca taccaaccca 300
ggtctgtgtg atggttgtct ggctgagctg ggtttctggt cttcctggaa gctggttcgt 360
gatgatatta tcaaagaact gaaagaagtg gtggcacaga acccaaacta tgaactggtg 420
gtcgtgggcc actccctggg tgctgctgtg gctactctgg ctgctaccga cctgcgtggt 480
aaaggttatc catctgctaa actgtacgct tacgcttccc ctcgtgttgg caacgcagcc 540
ctggccaaat atatcaccgc ccagggcaac aacttccgtt tcacccacac caatgaccca 600
gtacctaaac tgccactgct gtctatgggc tatgtacatg tttctcctga atattggatc 660
acctctccta acaacgccac tgttcgtaga cgtgacatca aagtcattga cggcgacgta 720
tcttttgacg gcaataccgg cacgggcctg cctctgctga cggactttga agcccacatt 780
tggtactttg tacaggttga cgcctgcaaa ggtcctggcc tgccattcaa acgtgtttaa 840
<210> 7
<211> 840
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 7
gatgtctcca cttccgaact ggaccagttc gagttctggg ttcaatacgc agccgcctct 60
tactacgagg ctgattacac cgcacaggtt ggtgataagc tgtcctgctc taagggtaac 120
tgcccagaag ttgaagcaac cggtgcaact gtgtcttacg acttctccga ttccacgatc 180
actgacaccg caggttacat cgcagttgat cacaccaact ccgcagtggt actggcattc 240
cgtggttctt actccgtacg taactgggtt gctgatgcta ctttcgtcca taccaaccca 300
ggtctgtgtg atggttgtct ggctgagctg ggtttctggt cttcctggaa gctggttcgt 360
gatgatatta tcaaagaact gaaagaagtg gtggcacaga acccaaacta tgaactggtg 420
gtcgtgggcc actccctggg tgctgctgtg gctactctgg ctgctaccga cctgcgtggt 480
aaaggttatc catctgctaa actgtacgct tacgcttccc ctcgtgttgg caacgcagcc 540
ctggccaaat atatcaccgc ccagggcaac aacttccgtt tcacccacac caatgaccca 600
gtacctaaac tgccactgct gtctatgggc tatgtacatg tttctcctga atattggatc 660
acctctccta acaacgccac tgtttctacc tctgacatca aagtcattga cggcgacgta 720
tcttttgacg gcaataccgg cacgggcctg cctctgctga cggactttga agcccacatt 780
tggtactttg tacaggttga cgccggcaaa ggtcctggcc tgccattcaa acgtgtttaa 840
<210> 8
<211> 279
<212> PRT
<213> Ka Meng's Penicillium (Penicilliurn camembertii)
<400> 8
Asp Val Ser Thr Ser Glu Leu Asp Gln Phe Glu Phe Trp Val Gln Tyr
1 5 10 15
Ala Ala Ala Ser Tyr Tyr Glu Ala Asp Tyr Thr Ala Gln Val Gly Asp
20 25 30
Lys Leu Ser Cys Ser Lys Gly Asn Cys Pro Glu Val Glu Ala Thr Gly
35 40 45
Ala Thr Val Ser Tyr Asp Phe Ser Asp Ser Thr Ile Thr Asp Thr Ala
50 55 60
Gly Tyr Ile Ala Val Asp His Thr Asn Ser Ala Val Val Leu Ala Phe
65 70 75 80
Arg Gly Ser Tyr Ser Val Arg Asn Trp Val Ala Asp Ala Thr Phe Val
85 90 95
His Thr Asn Pro Gly Leu Cys Asp Gly Cys Leu Ala Glu Leu Gly Phe
100 105 110
Trp Ser Ser Trp Lys Leu Val Arg Asp Asp Ile Ile Lys Glu Leu Lys
115 120 125
Glu Val Val Ala Gln Asn Pro Asn Tyr Glu Leu Val Val Val Gly His
130 135 140
Ser Leu Gly Ala Ala Val Ala Thr Leu Ala Ala Thr Asp Leu Arg Gly
145 150 155 160
Lys Gly Tyr Pro Ser Ala Lys Leu Tyr Ala Tyr Ala Ser Pro Arg Val
165 170 175
Gly Asn Ala Ala Leu Ala Lys Tyr Ile Thr Ala Gln Gly Asn Asn Phe
180 185 190
Arg Phe Thr His Thr Asn Asp Pro Val Pro Lys Leu Pro Leu Leu Ser
195 200 205
Met Gly Tyr Val His Val Ser Pro Glu Tyr Trp Ile Thr Ser Pro Asn
210 215 220
Asn Ala Thr Val Ser Thr Ser Asp Ile Lys Val Ile Asp Gly Asp Val
225 230 235 240
Ser Phe Asp Gly Asn Thr Gly Thr Gly Leu Pro Leu Leu Thr Asp Phe
245 250 255
Glu Ala His Ile Trp Tyr Phe Val Gln Val Asp Ala Gly Lys Gly Pro
260 265 270
Gly Leu Pro Phe Lys Arg Val
275
<210> 9
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 9
ctcttactgt gaggctgatt acacc 25
<210> 10
<211> 31
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 10
cctttgcagg cgtcaacctg tacaaagtac c 31
<210> 11
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 11
ttgacgcctg caaaggtcct ggcct 25
<210> 12
<211> 25
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 12
cagcctcaca gtaagaggcg gctgc 25
<210> 13
<211> 31
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 13
cgtagacgtg acatcaaagt cattgacggc g 31
<210> 14
<211> 38
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 14
gatgtcacgt ctacgaacag tggcgttgtt aggagagg 38
<210> 15
<211> 279
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 15
Asp Val Ser Thr Ser Glu Leu Asp Gln Phe Glu Phe Trp Val Gln Tyr
1 5 10 15
Ala Ala Ala Ser Tyr Cys Glu Ala Asp Tyr Arg Ala Gln Val Gly Asp
20 25 30
Lys Leu Ser Cys Ser Lys Gly Asn Cys Pro Glu Val Glu Ala Thr Gly
35 40 45
Ala Thr Val Ser Tyr Asp Phe Ser Asp Ser Thr Ile Thr Asp Thr Ala
50 55 60
Gly Tyr Ile Ala Val Asp His Thr Asn Ser Ala Val Val Leu Ala Phe
65 70 75 80
Arg Gly Ser His Ser Val Arg Asn Trp Val Ala Asp Ala Thr Phe Val
85 90 95
His Thr Asn Pro Gly Leu Cys Asp Gly Cys Leu Ala Glu Leu Gly Phe
100 105 110
Trp Ser Ser Trp Lys Leu Val Arg Asp Asp Ile Ile Lys Glu Leu Lys
115 120 125
Glu Val Val Ala Gln Asn Pro Asp Tyr Glu Leu Val Val Val Gly His
130 135 140
Ser Leu Gly Ala Ala Val Ala Thr Leu Ala Ala Thr Asp Leu Arg Gly
145 150 155 160
Lys Gly Tyr Pro Ser Ala Lys Leu Tyr Ala Tyr Ala Ser Pro Arg Val
165 170 175
Gly Asn Ala Ala Leu Ala Lys Tyr Ile Thr Ala Gln Gly Asn Asn Phe
180 185 190
Arg Phe Thr His Thr Asn Asp Pro Val Pro Lys Leu Pro Leu Leu Ser
195 200 205
Met Gly Tyr Val His Val Ser Pro Glu Tyr Trp Ile Thr Ser Pro Asn
210 215 220
Asn Ala Thr Val Arg Arg Arg Asp Ile Lys Val Ile Asp Gly Asp Val
225 230 235 240
Ser Phe Asp Gly Asn Thr Gly Thr Gly Leu Pro Leu Leu Thr Asp Phe
245 250 255
Glu Ala His Ile Trp Tyr Phe Val Gln Val Asp Ala Cys Lys Gly Pro
260 265 270
Gly Leu Pro Phe Lys Arg Val
275
<210> 16
<211> 840
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 16
gacgtttcta cttccgagtt ggaccaattc gagttctggg ttcaatacgc tgctgcttct 60
tactgtgagg ctgattacag agctcaagtt ggagataagt tgtcttgttc taagggtaac 120
tgtcctgaag ttgaggctac tggtgctact gtttcttacg atttctccga ttctactatc 180
actgatactg ccggttacat tgctgttgat cacactaact ctgctgttgt tttggctttc 240
agaggttctc actctgttcg taactgggtt gctgacgcta cttttgttca cactaaccca 300
ggtttgtgtg atggttgttt ggctgagttg ggtttctggt cctcctggaa gttggttaga 360
gatgacatta ttaaggagtt gaaggaggtt gtcgctcaaa accctgacta cgagttggtt 420
gttgttggtc actctttggg tgctgccgtt gctaccttgg ctgccactga cttgagaggt 480
aaaggttacc catctgctaa gttgtacgct tacgcttctc caagagttgg taacgctgct 540
ttggctaagt acattaccgc tcaaggtaac aacttcagat tcactcacac taacgatcca 600
gttccaaagt tgccattgtt gtccatgggt tacgttcacg tttctccaga gtactggatt 660
acttccccaa acaacgctac tgttcgtaga cgtgacatta aggttatcga tggagatgtt 720
tctttcgacg gtaacactgg taccggtttg ccattgttga ctgacttcga ggctcacatc 780
tggtacttcg ttcaagttga tgcttgtaag ggtccaggtt tgccattcaa gagagtttaa 840
Claims (10)
1. A polypeptide having glycerol mono-diacyl ester lipase activity, which consists of an amino acid sequence shown in any one of SEQ ID NOs 1 to 3 and 15.
2. A polynucleotide encoding the polypeptide of claim 1.
3. The polynucleotide according to claim 2, which consists of the nucleotide sequence shown in any one of SEQ ID NOs 4 to 6 and 16.
4. An expression vector comprising at least one polynucleotide of claim 2 or 3.
5. The expression vector of claim 4, further comprising a regulatory sequence that regulates expression of the polynucleotide, wherein the polynucleotide is operably linked to the regulatory sequence.
6. The expression vector of claim 5, wherein the expression vector is a plasmid.
7. A host cell comprising the polynucleotide of claim 2 or 3, or the expression vector of any one of claims 4-6.
8. The host cell of claim 7, wherein the host cell is yeast or e.
9. Use of the polypeptide of claim 1 for catalyzing synthesis, hydrolysis or transesterification of monoglycerides and/or diglycerides.
10. A method of synthesizing a monoglyceride and/or diglyceride comprising contacting the polypeptide of claim 1 with a fatty acid and glycerol.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2019113298045 | 2019-12-20 | ||
CN201911329804 | 2019-12-20 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113005111A CN113005111A (en) | 2021-06-22 |
CN113005111B true CN113005111B (en) | 2024-04-12 |
Family
ID=76383519
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011499617.4A Active CN113005111B (en) | 2019-12-20 | 2020-12-17 | Novel glycerol mono-diacyl ester lipase |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113005111B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112574975B (en) * | 2020-09-30 | 2022-04-01 | 华南理工大学 | Glyceride lipase mutant G28C-P206C, and coding gene and application thereof |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106811449A (en) * | 2015-11-27 | 2017-06-09 | 丰益(上海)生物技术研发中心有限公司 | A kind of new lipase B M19 |
CN108251400A (en) * | 2016-12-29 | 2018-07-06 | 丰益(上海)生物技术研发中心有限公司 | Lipase and its application |
CN109929820A (en) * | 2017-12-19 | 2019-06-25 | 丰益(上海)生物技术研发中心有限公司 | Novel Mono-and diacylglycerol lipase |
-
2020
- 2020-12-17 CN CN202011499617.4A patent/CN113005111B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106811449A (en) * | 2015-11-27 | 2017-06-09 | 丰益(上海)生物技术研发中心有限公司 | A kind of new lipase B M19 |
CN108251400A (en) * | 2016-12-29 | 2018-07-06 | 丰益(上海)生物技术研发中心有限公司 | Lipase and its application |
CN109929820A (en) * | 2017-12-19 | 2019-06-25 | 丰益(上海)生物技术研发中心有限公司 | Novel Mono-and diacylglycerol lipase |
Also Published As
Publication number | Publication date |
---|---|
CN113005111A (en) | 2021-06-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108431218B (en) | Efficient zinc ion-independent phospholipase C mutants | |
CN109929820B (en) | Novel glycerol mono-diacyl ester lipase | |
CN108239648B (en) | Method for efficiently expressing rhizomucor miehei lipase | |
CN113005111B (en) | Novel glycerol mono-diacyl ester lipase | |
CN111032872A (en) | Modified lipase and use thereof | |
CA2368953A1 (en) | Proteins related to gaba metabolism | |
WO2018021324A1 (en) | Polypeptide having lipase activity with excellent stability | |
CN113025595A (en) | Acid-tolerant lipases | |
CN113025596A (en) | Methanol-tolerant lipase | |
CN111378633B (en) | High-enzyme-activity phospholipase C mutant | |
CN113122520A (en) | Novel glycerol mono-diacyl ester lipase | |
KR20200015012A (en) | Heat-resisting Carbonic Anhydrase Variants and Composition for Capturing Carbon Dioxide | |
KR20150007393A (en) | Mutant polypepetide, nucleic acid molecule encoding the same, recombinant vector, transformant and production method of 3-hydroxypropionate by using the same | |
CN106811449B (en) | Lipase BM19 | |
CN105779412B (en) | Novel lipase | |
CN105779418B (en) | Novel lipase | |
CN105669841B (en) | Polypeptide, fusion enzyme, preparation method and application thereof | |
WO2018174231A1 (en) | Method for producing protease in yeast | |
CN105779417B (en) | Novel lipase | |
CN105779415B (en) | Novel lipase | |
CN105779416B (en) | Novel lipase | |
CN113481183B (en) | Marine microorganism lipase chimera and construction method and application thereof | |
CN105779413B (en) | Novel lipase | |
CN105331593B (en) | A kind of lipase | |
CN113913414B (en) | Double-base enzyme Kex2 mutant with high stability and high catalytic efficiency |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |