CN115074339A - Coding sequence of peroxidase POD1 and application thereof - Google Patents
Coding sequence of peroxidase POD1 and application thereof Download PDFInfo
- Publication number
- CN115074339A CN115074339A CN202110271147.4A CN202110271147A CN115074339A CN 115074339 A CN115074339 A CN 115074339A CN 202110271147 A CN202110271147 A CN 202110271147A CN 115074339 A CN115074339 A CN 115074339A
- Authority
- CN
- China
- Prior art keywords
- pod1
- coding sequence
- peroxidase
- seq
- sequence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 101000957299 Homo sapiens Coronin-7 Proteins 0.000 title claims abstract description 43
- 101000800546 Homo sapiens Transcription factor 21 Proteins 0.000 title claims abstract description 43
- 102100033121 Transcription factor 21 Human genes 0.000 title claims abstract description 43
- 108091026890 Coding region Proteins 0.000 title claims abstract description 37
- 102000003992 Peroxidases Human genes 0.000 title claims abstract description 24
- 108040007629 peroxidase activity proteins Proteins 0.000 title claims abstract description 24
- 229960004191 artemisinin Drugs 0.000 claims abstract description 29
- 229930101531 artemisinin Natural products 0.000 claims abstract description 28
- BLUAFEHZUWYNDE-NNWCWBAJSA-N artemisinin Chemical compound C([C@](OO1)(C)O2)C[C@H]3[C@H](C)CC[C@@H]4[C@@]31[C@@H]2OC(=O)[C@@H]4C BLUAFEHZUWYNDE-NNWCWBAJSA-N 0.000 claims abstract description 27
- 229920001184 polypeptide Polymers 0.000 claims abstract description 10
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 10
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 10
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract 9
- JYGAZEJXUVDYHI-DGTMBMJNSA-N dihydroartemisinic acid Chemical compound C1CC(C)=C[C@@H]2[C@H]([C@@H](C)C(O)=O)CC[C@@H](C)[C@@H]21 JYGAZEJXUVDYHI-DGTMBMJNSA-N 0.000 claims description 29
- 230000015572 biosynthetic process Effects 0.000 claims description 24
- 238000003786 synthesis reaction Methods 0.000 claims description 24
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 claims description 15
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 claims description 15
- JYGAZEJXUVDYHI-UHFFFAOYSA-N dihydroartemisininic acid Natural products C1CC(C)=CC2C(C(C)C(O)=O)CCC(C)C21 JYGAZEJXUVDYHI-UHFFFAOYSA-N 0.000 claims description 15
- 239000013604 expression vector Substances 0.000 claims description 11
- 230000009465 prokaryotic expression Effects 0.000 claims description 9
- 241000588724 Escherichia coli Species 0.000 claims description 7
- 230000014509 gene expression Effects 0.000 claims description 4
- 230000001939 inductive effect Effects 0.000 claims description 3
- 230000001105 regulatory effect Effects 0.000 claims description 3
- 239000002773 nucleotide Substances 0.000 claims description 2
- 125000003729 nucleotide group Chemical group 0.000 claims description 2
- 239000000758 substrate Substances 0.000 abstract description 5
- 230000000694 effects Effects 0.000 abstract description 4
- 229960002521 artenimol Drugs 0.000 abstract 1
- BJDCWCLMFKKGEE-ISOSDAIHSA-N artenimol Chemical compound C([C@](OO1)(C)O2)C[C@H]3[C@H](C)CC[C@@H]4[C@@]31[C@@H]2O[C@H](O)[C@@H]4C BJDCWCLMFKKGEE-ISOSDAIHSA-N 0.000 abstract 1
- 229930016266 dihydroartemisinin Natural products 0.000 abstract 1
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 12
- 239000000047 product Substances 0.000 description 11
- 150000001413 amino acids Chemical group 0.000 description 10
- 150000001875 compounds Chemical class 0.000 description 10
- 150000002500 ions Chemical group 0.000 description 9
- 101100520672 Arabidopsis thaliana POD1 gene Proteins 0.000 description 8
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 8
- 108090000623 proteins and genes Proteins 0.000 description 7
- 235000001405 Artemisia annua Nutrition 0.000 description 6
- 240000000011 Artemisia annua Species 0.000 description 6
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 6
- 238000010586 diagram Methods 0.000 description 6
- 102000004169 proteins and genes Human genes 0.000 description 6
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 5
- 108020004414 DNA Proteins 0.000 description 5
- 230000003197 catalytic effect Effects 0.000 description 5
- 238000002474 experimental method Methods 0.000 description 5
- 239000007788 liquid Substances 0.000 description 5
- 241000049464 Artemisia apiacea Species 0.000 description 4
- 235000011570 Artemisia caruifolia var apiacea Nutrition 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- XLYOFNOQVPJJNP-ZSJDYOACSA-N heavy water Substances [2H]O[2H] XLYOFNOQVPJJNP-ZSJDYOACSA-N 0.000 description 4
- 239000011780 sodium chloride Substances 0.000 description 4
- 230000002194 synthesizing effect Effects 0.000 description 4
- 102000016680 Dioxygenases Human genes 0.000 description 3
- 108010028143 Dioxygenases Proteins 0.000 description 3
- 241000196324 Embryophyta Species 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- 239000002299 complementary DNA Substances 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- -1 cyclic hydroperoxides Chemical class 0.000 description 3
- 239000012149 elution buffer Substances 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 238000000034 method Methods 0.000 description 3
- 229910052759 nickel Inorganic materials 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- 102000003960 Ligases Human genes 0.000 description 2
- 108090000364 Ligases Proteins 0.000 description 2
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 238000006555 catalytic reaction Methods 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 239000000945 filler Substances 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 230000014759 maintenance of location Effects 0.000 description 2
- 238000001819 mass spectrum Methods 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 229930000044 secondary metabolite Natural products 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 239000011534 wash buffer Substances 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 1
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 1
- ZEXDYVGDZJBRMO-ACZMJKKPSA-N Ala-Asn-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZEXDYVGDZJBRMO-ACZMJKKPSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 1
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 1
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 1
- GIVWETPOBCRTND-DCAQKATOSA-N Arg-Gln-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GIVWETPOBCRTND-DCAQKATOSA-N 0.000 description 1
- NXDXECQFKHXHAM-HJGDQZAQSA-N Arg-Glu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NXDXECQFKHXHAM-HJGDQZAQSA-N 0.000 description 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 1
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 1
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 1
- JJGRJMKUOYXZRA-LPEHRKFASA-N Asn-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O JJGRJMKUOYXZRA-LPEHRKFASA-N 0.000 description 1
- SNAKIVFVLVUCKB-UHFFFAOYSA-N Asn-Glu-Ala-Lys Natural products NCCCCC(C(O)=O)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(N)CC(N)=O SNAKIVFVLVUCKB-UHFFFAOYSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- SOYOSFXLXYZNRG-CIUDSAMLSA-N Asp-Arg-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O SOYOSFXLXYZNRG-CIUDSAMLSA-N 0.000 description 1
- NZWDWXSWUQCNMG-GARJFASQSA-N Asp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)C(=O)O NZWDWXSWUQCNMG-GARJFASQSA-N 0.000 description 1
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 1
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 1
- BOXNGMVEVOGXOJ-UBHSHLNASA-N Asp-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N BOXNGMVEVOGXOJ-UBHSHLNASA-N 0.000 description 1
- ZZZCUOFIHGPKAK-UHFFFAOYSA-N D-erythro-ascorbic acid Natural products OCC1OC(=O)C(O)=C1O ZZZCUOFIHGPKAK-UHFFFAOYSA-N 0.000 description 1
- VOLVNCMGXWDDQY-LPEHRKFASA-N Gln-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O VOLVNCMGXWDDQY-LPEHRKFASA-N 0.000 description 1
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 1
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 1
- VZIFYHYNQDIPLI-HJWJTTGWSA-N Ile-Arg-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N VZIFYHYNQDIPLI-HJWJTTGWSA-N 0.000 description 1
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 1
- BSWLQVGEVFYGIM-ZPFDUUQYSA-N Ile-Gln-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BSWLQVGEVFYGIM-ZPFDUUQYSA-N 0.000 description 1
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 1
- 102100034343 Integrase Human genes 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 1
- BAJIJEGGUYXZGC-CIUDSAMLSA-N Leu-Asn-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N BAJIJEGGUYXZGC-CIUDSAMLSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 1
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 1
- AHFOKDZWPPGJAZ-SRVKXCTJSA-N Lys-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N AHFOKDZWPPGJAZ-SRVKXCTJSA-N 0.000 description 1
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 1
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 1
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 1
- WRXOPYNEKGZWAZ-FXQIFTODSA-N Met-Ser-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O WRXOPYNEKGZWAZ-FXQIFTODSA-N 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- ACFIXJIJDZMPPO-NNYOXOHSSA-N NADPH Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](OP(O)(O)=O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 ACFIXJIJDZMPPO-NNYOXOHSSA-N 0.000 description 1
- 102000004316 Oxidoreductases Human genes 0.000 description 1
- 108090000854 Oxidoreductases Proteins 0.000 description 1
- FMMIYCMOVGXZIP-AVGNSLFASA-N Phe-Glu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O FMMIYCMOVGXZIP-AVGNSLFASA-N 0.000 description 1
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 1
- RFEXGCASCQGGHZ-STQMWFEESA-N Phe-Gly-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O RFEXGCASCQGGHZ-STQMWFEESA-N 0.000 description 1
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 1
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 1
- WPQKSRHDTMRSJM-CIUDSAMLSA-N Pro-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 WPQKSRHDTMRSJM-CIUDSAMLSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 1
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 1
- KMWFXJCGRXBQAC-CIUDSAMLSA-N Ser-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N KMWFXJCGRXBQAC-CIUDSAMLSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- JLPMFVAIQHCBDC-CIUDSAMLSA-N Ser-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N JLPMFVAIQHCBDC-CIUDSAMLSA-N 0.000 description 1
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 1
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 108020005038 Terminator Codon Proteins 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 1
- QFCQNHITJPRQTB-IEGACIPQSA-N Thr-Lys-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O QFCQNHITJPRQTB-IEGACIPQSA-N 0.000 description 1
- ARPONUQDNWLXOZ-KKUMJFAQSA-N Tyr-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ARPONUQDNWLXOZ-KKUMJFAQSA-N 0.000 description 1
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 1
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 1
- OHOVFPKXPZODHS-SJWGOKEGSA-N Tyr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OHOVFPKXPZODHS-SJWGOKEGSA-N 0.000 description 1
- VYQQQIRHIFALGE-UWJYBYFXSA-N Tyr-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VYQQQIRHIFALGE-UWJYBYFXSA-N 0.000 description 1
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 1
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 1
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 1
- 229930003268 Vitamin C Natural products 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- 230000000078 anti-malarial effect Effects 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 238000007036 catalytic synthesis reaction Methods 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 239000002537 cosmetic Substances 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 239000008367 deionised water Substances 0.000 description 1
- 229910021641 deionized water Inorganic materials 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000000502 dialysis Methods 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 238000000227 grinding Methods 0.000 description 1
- GPRLSGONYQIRFK-UHFFFAOYSA-N hydron Chemical compound [H+] GPRLSGONYQIRFK-UHFFFAOYSA-N 0.000 description 1
- 125000001867 hydroperoxy group Chemical group [*]OO[H] 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 238000011031 large-scale manufacturing process Methods 0.000 description 1
- 108010034529 leucyl-lysine Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 201000004792 malaria Diseases 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 150000002978 peroxides Chemical class 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 229930000223 plant secondary metabolite Natural products 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 239000012460 protein solution Substances 0.000 description 1
- 239000012264 purified product Substances 0.000 description 1
- 229930004725 sesquiterpene Natural products 0.000 description 1
- 150000004354 sesquiterpene derivatives Chemical class 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 239000011550 stock solution Substances 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 239000013598 vector Substances 0.000 description 1
- 235000019154 vitamin C Nutrition 0.000 description 1
- 239000011718 vitamin C Substances 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0065—Oxidoreductases (1.) acting on hydrogen peroxide as acceptor (1.11)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/18—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms containing at least two hetero rings condensed among themselves or condensed with a common carbocyclic ring system, e.g. rifamycin
- C12P17/181—Heterocyclic compounds containing oxygen atoms as the only ring heteroatoms in the condensed system, e.g. Salinomycin, Septamycin
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y111/00—Oxidoreductases acting on a peroxide as acceptor (1.11)
- C12Y111/01—Peroxidases (1.11.1)
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A50/00—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE in human health protection, e.g. against extreme weather
- Y02A50/30—Against vector-borne diseases, e.g. mosquito-borne, fly-borne, tick-borne or waterborne diseases whose impact is exacerbated by climate change
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- General Chemical & Material Sciences (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Plant Pathology (AREA)
- Medicinal Chemistry (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
The application discloses a coding sequence of peroxidase POD1, which is characterized in that the amino acid sequence coded by the coding sequence comprises an amino acid sequence shown in SEQ ID NO.2 or a functional variant thereof, and the functional variant has homology of more than 70% with the amino acid sequence shown in SEQ ID NO. 2. The application also discloses a polypeptide coded by the coding sequence of peroxidase POD1, which can synthesize artemisinin by using dihydroartemisinin as a substrate and has an important effect on the scale production of artemisinin.
Description
Technical Field
The application relates to the field of biosynthesis, in particular to a coding sequence of peroxidase POD1 and application thereof.
Background
The content of secondary metabolites generated in natural plants is extremely low, and the chemical synthesis method is complex in process flow and high in cost, and a lot of secondary metabolites are added into the food and cosmetic industry, so that the natural plant materials are safer to use. Artemisinin is an effective substance for treating malaria at present, and is mainly extracted from plants, but the content of artemisinin in wild type artemisia apiacea is only 0.1% -1% of the dry weight, so that the actual requirement is difficult to meet, and researchers are dedicated to exploring other methods for improving the content of plant secondary metabolites; peroxide bridge containing compounds include cyclic hydroperoxides, chain peroxides, and hydroperoxy compounds, where the cyclic hydroperoxides possess multiple biological activities. The compound with intra-ring peroxy-bridged sesquiterpene is most remarkable as artemisinin compound, and the anti-malarial activity of artemisinin is inseparable from that of "peroxy-bridged". The in-loop peroxybridge synthetase has been reported to a lesser extent, while the mechanism of in-loop peroxybridge synthesis is rather unclear. Therefore, obtaining a synthetase that catalyzes the synthesis of an intra-domain peroxybridge of artemisinin is of great significance to the synthesis pathway of compounds containing a peroxybridge as well as to the heterologous synthesis of compounds containing a peroxybridge.
Disclosure of Invention
In order to overcome the defects in the prior art, the application provides a peroxidase POD1 coding sequence and a protein coded by the coding sequence, and application of the POD1 gene coding sequence and the protein coded by the coding sequence.
In one aspect, the present application provides a coding sequence for peroxidase POD1, wherein the coding sequence encodes an amino acid sequence comprising the amino acid sequence set forth in SEQ ID No.2, or a functional variant thereof, which has greater than 70% homology with the amino acid sequence set forth in SEQ ID No. 2.
In certain embodiments, the functional variant has greater than 90% homology with the amino acid sequence set forth in SEQ ID No. 2.
In certain embodiments, the functional variant has greater than 99% homology with the amino acid sequence set forth in SEQ ID No. 2.
In certain embodiments, the amino acid sequence set forth in SEQ ID No.2, or a functional variant thereof, is capable of catalyzing the synthesis of an artemisinin intra-ring peroxybridge.
In some embodiments, the amino acid sequence shown in SEQ ID No.2 or a functional variant thereof is capable of synthesizing artemisinin with dihydroartemisinic acid as substrate.
In certain embodiments, the functional variant is derived from artemisia apiacea.
In certain embodiments, the coding sequence encodes an amino acid sequence as set forth in SEQ ID No. 2.
In certain embodiments, the nucleotide sequence of the coding sequence is set forth in SEQ ID No. 1.
In another aspect, the present application also provides a polypeptide encoded by the coding sequence of the peroxidase POD1 described herein.
In certain embodiments, the sequence of the polypeptide is set forth in SEQ ID No. 2.
In another aspect, the present application also provides a coding sequence for a peroxidase POD1 as described herein, and the use of a polypeptide as described herein for catalyzing the synthesis of an intra-ring peroxidic bridge.
In certain embodiments, the application comprises the steps of:
connecting a coding sequence of peroxidase POD1 to an escherichia coli prokaryotic expression regulatory sequence pET30a, and constructing a prokaryotic expression vector pET30a-POD 1;
step two, transferring the expression vector pET30a-POD1 in the step one into escherichia coli Rosetta;
step three, inducing the expression of the recombinant protein, and purifying the recombinant protein;
step four, catalyzing the synthesis of the intra-ring peroxide bridge bond by using the recombinant protein obtained in the step three.
In another aspect, the present application also provides the coding sequence of peroxidase POD1 described herein, and the use of the polypeptide described herein in the synthesis of artemisinin using dihydroartemisinic acid.
In certain embodiments, the application comprises the steps of:
connecting a coding sequence of peroxidase POD1 to an escherichia coli prokaryotic expression regulatory sequence pET30a, and constructing a prokaryotic expression vector pET30a-POD 1;
step two, transferring the expression vector pET30a-POD1 in the step one into escherichia coli Rosetta;
step three, inducing the expression of the recombinant protein, and purifying the recombinant protein;
step four, catalyzing the synthesis of the intra-ring peroxide bridge bond by using the recombinant protein obtained in the step three.
In certain embodiments, the fourth step comprises:
a) UPLC-QQQ-MS analyzes the catalytic products; and/or
b) UPLC-QTOF-MS analysis of the catalytic products.
Technical effects
A catalytic experiment in the application shows that a catalytic synthesis product is analyzed by UPLC-QTOF-MS, a compound which has the same peak time with an artemisinin standard appears in a product which catalyzes the synthesis of dihydroartemisinic acid by a coding protein of a POD1 coding sequence, a characteristic ion fragment of the compound is consistent with the artemisinin standard, and the result shows that POD1 can synthesize the artemisinin containing the intra-ring peroxygen bridge bond by using the dihydroartemisinic acid as a substrate; meanwhile, the saccharomyces cerevisiae is used for synthesizing dihydroartemisinic acid, the POD1 coding sequence and the protein coded by the coding sequence play an important role in the large-scale production of artemisinin and the synthesis of artemisinin by using yeast, and have important significance in the synthesis way of compounds containing peroxide bridge bonds and the heterologous synthesis of compounds containing peroxide bridge bonds.
The conception, specific structure and technical effects of the present application will be further described in conjunction with the accompanying drawings to fully understand the purpose, characteristics and effects of the present application.
Drawings
FIG. 1 is a diagram showing the results of SDS-PAGE detection of the purified POD1 recombinant protein in the present application.
Fig. 2 shows the product of UPLC-QQQ-MS analysis of the dioxygenase POD1 catalyzed synthesis of dihydroartemisinic acid in the present application, wherein fig. 2A shows an extracted ion flow diagram of an artemisinin standard, fig. 2B shows an extracted ion flow diagram of POD1 catalyzed synthesis of artemisinin from dihydroartemisinic acid, and fig. 2C shows an extracted ion flow diagram of a boiled POD1 recombinant protein control, wherein the ordinate in the figure represents relative ionic strength and the abscissa represents retention time.
FIG. 3 shows UPLC-QTOF-MS analysis that POD1 catalyzes the synthesis of artemisinin from dihydroarteannuic acid, wherein FIG. 3A shows an ion flow diagram of artemisinin standard extraction; FIG. 3B shows an ion flow diagram of artemisinin extraction by POD1 catalyzed synthesis of dihydroartemisinic acid, FIG. 3C shows mass spectrum results of artemisinin standard, and FIG. 3D shows mass spectrum results of artemisinin by POD1 catalyzed synthesis of dihydroartemisinic acid; in FIGS. 3A and 3B, the ordinate represents relative ion intensity, and the abscissa represents retention time; the ordinate in fig. 3C and 3D indicates relative ion intensity, and the abscissa indicates mass-to-charge ratio m/z.
Detailed Description
The present application will now be further described with reference to examples, which are intended to be illustrative only, and the present application may be embodied in many different forms of embodiments and should not be construed as limited to the embodiments set forth herein.
Experimental procedures without specific conditions noted in the following examples, generally followed by conventional conditions, such as molecular cloning in Sambrook et al: the conditions described in the Laboratory Manual (New York: Cold Spring Harbor Laboratory Press,1989), or according to the conditions recommended by the relevant Material manufacturers.
Example 1 cloning of the Artemisia annua oxidase POD1 Gene
1. Extraction of total RNA of sweet wormwood genome
Taking sweet wormwood leaf tissue, placing the sweet wormwood leaf tissue in liquid nitrogen for grinding, adding the sweet wormwood leaf tissue into a 1.5mL Eppendorf (EP) centrifuge tube containing lysis solution, fully oscillating, and extracting total RNA according to the instruction of a TIANGEN kit. The total RNA quality was determined by agarose gel electrophoresis and the RNA content was determined on a spectrophotometer.
2. Cloning of the Artemisia apiacea POD1 Gene
Synthesizing cDNA under the action of PowerScript reverse transcriptase by taking the extracted total RNA as a template; gene-specific primers were designed based on the sequence of the POD1 gene, and the POD1 gene was amplified from the total cDNA by PCR and sequenced.
Through the steps, a coding sequence (shown as SEQ ID NO: 1) with the POD1 gene length of 1053bp in the sweet wormwood herb is obtained, wherein the initiation codon is ATG, the termination codon is TAA, and the coding amino acid sequence is further obtained as shown in SEQ ID NO: 2, respectively.
TABLE 1 PCR primers
Primer name | Primer sequence (5 '→ 3') | SEQ ID NO | |
POD1- | ATGGGGCTTGTTTCATTTCTCT | 3 | |
POD1-RP | TCACAACTTGAGTTTTCTGAAAT | 4 |
TABLE 2 reaction System for PCR
Artemisia | 1μL | |
10×KOD Plus Buffer | 5μL | |
dNTP | 5μL | |
MgSO 4 | 2μL | |
POD1-FP | 1μL | |
POD1-RP | 1μL | |
KOD Plus | 1μL | |
ddH 2 O | 34μL | |
Total volume | 50μL |
Example 2 construction of prokaryotic expression vector containing POD1 Gene
The POD1 gene was constructed on prokaryotic expression vector pET30a to obtain expression vector pET30a-POD1, in order to facilitate the construction of the expression vector, BamHI cleavage site was introduced into the forward primer, XhoI cleavage site was introduced into the reverse primer, and the primers are shown in Table 3.
TABLE 3 PCR primers for pET30a-AOP1 vector construction
Example 3 Induction of expression of recombinant POD1 protein
1. The constructed prokaryotic expression vector pET30a-POD1 is transferred into Rosetta (purchased from Shanghai Weidi Biotechnology Co., Ltd.);
2. the activated E.coli was transferred to 200mL of LB liquid medium (containing 100mg/L kanamycin), cultured at 37 ℃ until OD600 became 0.6, added with 0.5mM of IPTG (stock solution 1M/mL), and induced at 16 ℃ overnight;
3. the cells were collected by centrifugation, and 1/10 volumes (20mL) of lysine buffer (7.8g of NaH2PO4.2H2O, 17.54g of NaCl and 0.68g of Imidazole, pH 8.0) were added to the cells, followed by sonication;
4. centrifuging at 4 deg.C and 13000rpm for 20min, sucking supernatant, and storing on ice;
5. shaking up nickel column filler (purchased from assist in shanghai saint biotechnology limited, cat # 20502ES10), sucking 1-2mL to the filter column, dripping off the liquid in the filler, and washing the column with 2.5-5mL of deionized water;
6. equilibrating the nickel column with 5-10mL lysine buffer (7.8g NaH2PO4.2H2O, 17.54g NaCl and 0.68g Imidazole, pH 8.0), adding the supernatant and filtering;
7. the column was washed with 10mL wash buffer (7.8g NaH2PO4.2H2O, 17.54g NaCl and 1.36g Imidazole, pH 8.0);
8. 250uL of Elution Buffer (7.8g of NaH2PO4.2H2O, 17.54g of NaCl and 17.0g of Imidazole, pH 8.0) was added first, and the wash Buffer in the nickel column was squeezed; then adding 2mL of Elution Buffer, and collecting effluent protein liquid;
9. adding 2-3mL of Elution Buffer again, and collecting effluent protein liquid;
10. the purified protein solution was placed in a dialysis bag and dialyzed into PBS solution.
11. The purified product was checked by SDS-PAGE.
As shown in FIG. 1, the size of the purified POD1 recombinant protein was 38.9 kDa.
Example 4 POD1 in vitro catalysis experiment
Mu.g of purified POD1 recombinant protein and a substrate dihydroartemisinic acid are respectively added into a catalytic reaction system (100mM PBS buffer solution (pH7.0),10mM vitamin C, 10mM MgCl2, 200nM NADPH as a hydrogen ion donor, 0.1mM DTT), a magnetic stirrer is continuously stirred, the reaction is carried out for 24 hours in a dark place at 30 ℃, and boiled purified POD1 recombinant protein is used as a negative control. Extracting the reaction product with n-hexane for three times, drying the extract liquor, dissolving the extract liquor with methanol, analyzing the catalytic product by UPLC-QQQ-MS, and further analyzing the catalytic product by UPLC-QTOF-MS.
The experimental results showed that UPLC-QQQ-MS showed that the product of dioxygenase POD1 catalyzing the synthesis of dihydroartemisinic acid contained artemisinin, while the boiled POD1 recombinant protein control group showed no presence of artemisinin (fig. 2); the results of the UPLC-QTOF-MS experiments show that the POD1 catalyzes a compound with the peak time consistent with that of an artemisinin standard product in the product synthesized by dihydroartemisinic acid, and the characteristic ion fragments of the compound are consistent with that of the artemisinin standard product (figure 3). This result demonstrates that the dioxygenase POD1 is capable of synthesizing artemisinin using dihydroartemisinic acid as substrate.
The foregoing detailed description of the preferred embodiments of the present application. It should be understood that numerous modifications and variations can be devised by those skilled in the art in light of the present teachings without departing from the inventive concepts. Therefore, the technical solutions available to those skilled in the art through logic analysis, reasoning and limited experiments based on the concepts of the present application should be within the scope of protection defined by the claims.
Sequence listing
<110> Shanghai university of transportation
<120> coding sequence of peroxidase POD1 and application thereof
<130> CN084-21002PICN
<160> 6
<170> PatentIn version 3.5
<210> 1
<211> 1053
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<220>
<223> coding sequence for peroxidase POD1
<400> 1
atggggcttg tttcatttct ctcaactgtc ccaactttgc ttccaaatgc ttcatttccc 60
tcaacttaca ttccttccaa atgcccaggt attattagtt gcaaatttga gaataatgct 120
ggtgatagca atgagacaaa tggtgttaaa aggagggatg ttcttaactg ctttggagct 180
gccattagcg tggaactggt ggcaagctca agcccaatct cgagcccttt cattgaggtg 240
gcaaatgctg ctgatttgat acaaagaaga caacgttccg actttttatc aagtataaag 300
acaaccctct atacggccat taaggcaaat caggagctca ttccatcaat attaacttta 360
gctcttaatg attctatgac atatgataag ccaacaaaaa ctggaggccc aaatggatcc 420
atacgcttta gctcggagat taacagacca gaaaacaaag gactttctgc agctttatct 480
ttggttgagg aagcaaaaaa agagatcgat tcttactcga aaggtggacc gatctcatac 540
tcagatctca tccagttagc agctcaaagt gctcttaaag ctacattttt ggcttcggca 600
attaaaaaat gtggtggaaa tgaggaaaaa gggaacttgt tatactccgc ttatggttca 660
agtggacagt ggggattgtt cgataggcaa tttggaaggt cggatagtca agaacctgat 720
cctgagggaa gagtaccaga ctggagtaca gcaagtgtcc aagaaatgaa agataagttt 780
acagccatcg gcttcggtcc tcgtcagcta gcagtaatgt cggcattctt gggtcccgat 840
caattagcaa acgaagcaaa attagcaacg gataaagatg ttactaaatg ggtagagaaa 900
taccaacgga gcagagaaac agtctcacag acggactatg aggttgattt gatcacgacc 960
cttacgaaaa tgagttgttt gggtcaaaac attaactacg aggcatactc atacgctgtc 1020
cccaagattg atttcagaaa actcaagttg tga 1053
<210> 2
<211> 350
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<220>
<223> peroxidase POD1 amino acid sequence
<400> 2
Met Gly Leu Val Ser Phe Leu Ser Thr Val Pro Thr Leu Leu Pro Asn
1 5 10 15
Ala Ser Phe Pro Ser Thr Tyr Ile Pro Ser Lys Cys Pro Gly Ile Ile
20 25 30
Ser Cys Lys Phe Glu Asn Asn Ala Gly Asp Ser Asn Glu Thr Asn Gly
35 40 45
Val Lys Arg Arg Asp Val Leu Asn Cys Phe Gly Ala Ala Ile Ser Val
50 55 60
Glu Leu Val Ala Ser Ser Ser Pro Ile Ser Ser Pro Phe Ile Glu Val
65 70 75 80
Ala Asn Ala Ala Asp Leu Ile Gln Arg Arg Gln Arg Ser Asp Phe Leu
85 90 95
Ser Ser Ile Lys Thr Thr Leu Tyr Thr Ala Ile Lys Ala Asn Gln Glu
100 105 110
Leu Ile Pro Ser Ile Leu Thr Leu Ala Leu Asn Asp Ser Met Thr Tyr
115 120 125
Asp Lys Pro Thr Lys Thr Gly Gly Pro Asn Gly Ser Ile Arg Phe Ser
130 135 140
Ser Glu Ile Asn Arg Pro Glu Asn Lys Gly Leu Ser Ala Ala Leu Ser
145 150 155 160
Leu Val Glu Glu Ala Lys Lys Glu Ile Asp Ser Tyr Ser Lys Gly Gly
165 170 175
Pro Ile Ser Tyr Ser Asp Leu Ile Gln Leu Ala Ala Gln Ser Ala Leu
180 185 190
Lys Ala Thr Phe Leu Ala Ser Ala Ile Lys Lys Cys Gly Gly Asn Glu
195 200 205
Glu Lys Gly Asn Leu Leu Tyr Ser Ala Tyr Gly Ser Ser Gly Gln Trp
210 215 220
Gly Leu Phe Asp Arg Gln Phe Gly Arg Ser Asp Ser Gln Glu Pro Asp
225 230 235 240
Pro Glu Gly Arg Val Pro Asp Trp Ser Thr Ala Ser Val Gln Glu Met
245 250 255
Lys Asp Lys Phe Thr Ala Ile Gly Phe Gly Pro Arg Gln Leu Ala Val
260 265 270
Met Ser Ala Phe Leu Gly Pro Asp Gln Leu Ala Asn Glu Ala Lys Leu
275 280 285
Ala Thr Asp Lys Asp Val Thr Lys Trp Val Glu Lys Tyr Gln Arg Ser
290 295 300
Arg Glu Thr Val Ser Gln Thr Asp Tyr Glu Val Asp Leu Ile Thr Thr
305 310 315 320
Leu Thr Lys Met Ser Cys Leu Gly Gln Asn Ile Asn Tyr Glu Ala Tyr
325 330 335
Ser Tyr Ala Val Pro Lys Ile Asp Phe Arg Lys Leu Lys Leu
340 345 350
<210> 3
<211> 22
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<220>
<223> POD1-FP primer sequence
<400> 3
atggggcttg tttcatttct ct 22
<210> 4
<211> 23
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<220>
<223> POD1-RP primer sequence
<400> 4
tcacaacttg agttttctga aat 23
<210> 5
<211> 37
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<220>
<223> BamHI-POD1-FP primer sequence
<400> 5
gctgatatcg gatccatggg gcttgtttca tttctct 37
<210> 6
<211> 35
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<220>
<223> XhoI-POD1-RP primer sequences
<400> 6
gtggtggtgc tcgagcaact tgagttttct gaaat 35
Claims (10)
1. A coding sequence for peroxidase POD1, wherein the amino acid sequence encoded by said coding sequence comprises the amino acid sequence shown in SEQ ID No.2 or a functional variant thereof, said functional variant having more than 70% homology with the amino acid sequence shown in SEQ ID No. 2.
2. The coding sequence of the peroxidase POD1 according to claim 1, wherein the functional variant has more than 90% homology with the amino acid sequence depicted in SEQ ID No. 2.
3. The coding sequence of the peroxidase POD1 of any one of claims 1 to 2, wherein the amino acid sequence depicted in SEQ ID No.2, or a functional variant thereof, is capable of catalyzing the synthesis of an intra-artemisinin oxo-peroxide bridge.
4. The coding sequence for the peroxidase POD1 according to claim 4, wherein the coding sequence encodes an amino acid sequence as set forth in SEQ ID No. 2.
5. The coding sequence for peroxidase POD1 according to claim 1, wherein the nucleotide sequence of the coding sequence is set forth in SEQ ID No. 1.
6. A polypeptide encoded by the coding sequence of peroxidase POD1 of claims 1-5.
7. The polypeptide of claim 6, wherein the sequence of the polypeptide is as set forth in SEQ ID No. 2.
8. Use of the coding sequence for a peroxidase POD1 according to claims 1-5, the polypeptide according to claims 6-7, for catalyzing the synthesis of an intra-loop peroxide bridge.
9. The application of claim 8, wherein the application comprises the steps of:
connecting a coding sequence of peroxidase POD1 to an escherichia coli prokaryotic expression regulatory sequence pET30a, and constructing a prokaryotic expression vector pET30a-POD 1;
step two, transferring the expression vector pET30a-POD1 in the step one into escherichia coli Rosetta;
step three, inducing the expression of the recombinant protein, and purifying the recombinant protein;
step four, catalyzing the synthesis of the intra-ring peroxide bridge bond by using the recombinant protein obtained in the step three.
10. Use of the coding sequence of the peroxidase POD1 according to claims 1-5, the polypeptide according to claims 6-7 for the synthesis of artemisinin using dihydroartemisinic acid.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110271147.4A CN115074339B (en) | 2021-03-12 | 2021-03-12 | Coding sequence of peroxidase POD1 and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110271147.4A CN115074339B (en) | 2021-03-12 | 2021-03-12 | Coding sequence of peroxidase POD1 and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN115074339A true CN115074339A (en) | 2022-09-20 |
CN115074339B CN115074339B (en) | 2024-06-21 |
Family
ID=83241216
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110271147.4A Active CN115074339B (en) | 2021-03-12 | 2021-03-12 | Coding sequence of peroxidase POD1 and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115074339B (en) |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110106209A (en) * | 2019-05-09 | 2019-08-09 | 山东大学 | A method of synthesis terpenoid is positioned using Yarrowia lipolytica approach |
-
2021
- 2021-03-12 CN CN202110271147.4A patent/CN115074339B/en active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110106209A (en) * | 2019-05-09 | 2019-08-09 | 山东大学 | A method of synthesis terpenoid is positioned using Yarrowia lipolytica approach |
Non-Patent Citations (2)
Title |
---|
PWA81314.1: "thylakoid lumenal 29 kDa protein [Artemisia annua]", GENBANK * |
张超: "基于植物基因工程的方法合成青蒿素的研究进展", 中国中药杂志, vol. 44, no. 19, pages 4285 - 4292 * |
Also Published As
Publication number | Publication date |
---|---|
CN115074339B (en) | 2024-06-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Ezaki et al. | Expression of a moderately anionic peroxidase is induced by aluminum treatment in tobacco cells: possible involvement of peroxidase isozymes in aluminum ion stress | |
Miller et al. | First isolation of an isoprene synthase gene from poplar and successful expression of the gene in Escherichia coli | |
Wang et al. | Purification, characterization, and cloning of an extracellular laccase with potent dye decolorizing ability from white rot fungus Cerrena unicolor GSM-01 | |
Einax et al. | Oligonucleotide primers for the universal amplification of β-tubulin genes facilitate phylogenetic analyses in the regnum Fungi | |
US6093563A (en) | Chondroitin lyase enzymes | |
Coton et al. | Histidine carboxylase of Leuconostoc oenos 9204: purification, kinetic properties, cloning and nucleotide sequence of the hdc gene | |
Klonowska et al. | LAC3, a new low redox potential laccase from Trametes sp. strain C30 obtained as a recombinant protein in yeast | |
Chen et al. | Cloning and expression of a new manganese peroxidase from Irpex lacteus F17 and its application in decolorization of reactive black 5 | |
TWI410492B (en) | Novel fungal laccases and uses thereof | |
CN104928310B (en) | Methionine lyases and its encoding gene and biological synthesis method | |
US7381810B2 (en) | Polyphenol oxidase genes from lettuce | |
CN115074339A (en) | Coding sequence of peroxidase POD1 and application thereof | |
DK2550293T3 (en) | BACILLUS PUMILUS BILIRUBIN OXIDASE AND APPLICATIONS THEREOF | |
EP1748073B1 (en) | Recombinant d-amino acid oxidase | |
Nagasaki et al. | Purification, characterization, and gene cloning of Ceriporiopsis sp. strain MD-1 peroxidases that decolorize human hair melanin | |
Rebrikov et al. | Laccase of the lignolytic fungus Trametes hirsuta: purification and characterization of the enzyme, and cloning and primary structure of the gene | |
Mura et al. | Catalase and antiquitin from Euphorbia characias: Two proteins involved in plant defense? | |
US9005939B2 (en) | Protoilludene synthase | |
CN115074337B (en) | Application of alpha-ketoglutarate dependent dioxygenase in catalytic synthesis of cyclic peroxy bridge | |
US7700358B2 (en) | Coffee mannanase | |
Johansson et al. | Isozymes of the manganese (II)-dependent peroxidase in the white-rot basidiomycete Trametes versicolor | |
CN109790553B (en) | Production method of sesaminol or sesaminol glucoside | |
Zhang et al. | Molecular cloning of a classical plant peroxidase from Artemisia annua and its effect on the biosynthesis of artemisinin in vitro | |
Preger et al. | Ascorbate-independent electron transfer between cytochrome b 561 and a 27 kDa ascorbate peroxidase of bean hypocotyls | |
US6627794B1 (en) | Polyphenyl oxidase genes from banana |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant |