WO2015188742A2 - Group of glycosyltransferases and use thereof - Google Patents

Group of glycosyltransferases and use thereof Download PDF

Info

Publication number
WO2015188742A2
WO2015188742A2 PCT/CN2015/081111 CN2015081111W WO2015188742A2 WO 2015188742 A2 WO2015188742 A2 WO 2015188742A2 CN 2015081111 W CN2015081111 W CN 2015081111W WO 2015188742 A2 WO2015188742 A2 WO 2015188742A2
Authority
WO
WIPO (PCT)
Prior art keywords
ggt29
polypeptide
glycosyl
compound
ginsenoside
Prior art date
Application number
PCT/CN2015/081111
Other languages
French (fr)
Chinese (zh)
Other versions
WO2015188742A3 (en
Inventor
周志华
严兴
王平平
魏勇军
魏维
许云鹏
李晓东
杨成帅
Original Assignee
中国科学院上海生命科学研究院
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中国科学院上海生命科学研究院 filed Critical 中国科学院上海生命科学研究院
Publication of WO2015188742A2 publication Critical patent/WO2015188742A2/en
Publication of WO2015188742A3 publication Critical patent/WO2015188742A3/en

Links

Images

Classifications

    • AHUMAN NECESSITIES
    • A01AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
    • A01HNEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
    • A01H4/00Plant reproduction by tissue culture techniques ; Tissue culture techniques therefor
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/415Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P19/00Preparation of compounds containing saccharide radicals
    • C12P19/18Preparation of compounds containing saccharide radicals produced by the action of a glycosyl transferase, e.g. alpha-, beta- or gamma-cyclodextrins
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P33/00Preparation of steroids

Definitions

  • the present invention relates to the field of biotechnology and plant biology, and in particular, the present invention relates to a novel set of glycosyltransferases and uses thereof.
  • Ginsenoside is a general term for saponins isolated from ginseng and its genus (such as Panax notoginseng, American ginseng, etc.) and belongs to triterpenoid saponins, which is the main active ingredient in ginseng. At present, at least 60 saponins have been isolated from ginseng, and some of them have been proven to have a wide range of physiological functions and medicinal properties: including anti-tumor, immune regulation, anti-fatigue, heart protection, liver protection and other functions.
  • the ginsenosides Rg3, Rf and Rg2 are all rare ginsenosides, each having a very strong physiological activity as described above.
  • ginsenoside Rg3 has excellent anti-tumor activity, can induce tumor cell apoptosis, and inhibit tumor cell metastasis. It can be combined with radiotherapy and chemotherapy to enhance the effects of radiotherapy and chemotherapy.
  • Ginsenoside Rf has anti-tumor and anti-fatigue effects, can reduce uterine contraction, and has an analgesic effect associated with brain nerve cells.
  • ginsenoside Rg2 has protective effect on the brain of rats with Alzheimer's disease, can enhance the learning and memory ability of rats, and has a repairing effect on myocardial injury. In addition, ginsenoside Rg2 also protects cells from UV damage. The role.
  • the method for producing such a rare ginsenoside is to start from a large amount of ginseng saponins in ginseng, and to perform extraction and purification by selectively hydrolyzing a glycosyl group.
  • the saponins of the genus Panax species or the original ginseng diol saponins are used as raw materials, and are chemically and enzymatically transformed, separated and extracted. Due to the large loss of raw materials in the chemical preparation method, the operation is cumbersome, and there are many by-products, which leads to an increase in cost and difficulty in improving the yield.
  • the acquisition of ginseng total saponins depends on the cultivation of ginseng, the market price of rare ginsenosides produced by conventional methods is high.
  • an in vitro glycosylation method comprising the steps of:
  • glycosyl donor glycosyl group is transferred to the following sites of the tetracyclic triterpenoid in the presence of a glycosyltransferase:
  • glycosyltransferase is a glycosyltransferase as shown in SEQ ID NO.: 61 or a polypeptide derived therefrom.
  • the extended sugar chain comprises a direct extension or a displacement extension.
  • the direct extension is to add a sugar group to the first sugar group at the C-3 and/or C-6 position to extend the sugar chain.
  • the substitution is extended to replace the terminal glycosyl group of the C-3 and/or C-6 sugar chain with a different glycosyl group from the C-3 and/or C-6 position.
  • the sugar chain is extended on the first glycosyl group.
  • the derivative polypeptide is selected from the group consisting of:
  • the amino acid sequence has a homology of SEQ ID NO.: 61 amino acid sequence ⁇ 85% (preferably ⁇ 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%), and a derivative polypeptide having glycosyltransferase activity;
  • glycosyltransferase activity refers to the ability to transfer a glycosyl donor glycosyl group to the first glycosyl group of the tetracyclic triterpenoid C-3 and/or C-6 to extend the activity of the sugar chain .
  • the derived polypeptide comprises a SEQ ID NO.: 26, 28, 55, 57, 59, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, The sequence shown in any of 92, 93, 94 or 95.
  • polypeptide selected from the group consisting of:
  • glycosyltransferase activity refers to the ability to transfer a glycosyl donor glycosyl group to the first glycosyl group of the tetracyclic triterpenoid C-3 and/or C-6 to extend the activity of the sugar chain .
  • sequence (c) is a fusion protein formed by adding a tag sequence, a signal sequence or a secretion signal sequence to (a) or (b).
  • the derived polypeptide comprises a SEQ ID NO.: 26, 28, 55, 57, 59, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, A polypeptide of the amino acid sequence shown in 92, 93, 94 or 95.
  • an isolated polynucleotide having a sequence selected from the group consisting of:
  • the nucleotide sequence is SEQ ID NO.: 25, 27, 54, 56, 58, 60, 71, 73, 75, 77, 79, 81, 83, 85, 87 , 89, or 91.
  • the sequence is as shown in SEQ ID NO.: 25, 27, 54, 56, 58, 60, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, or 91.
  • the polynucleotide encoding amino acid sequences are SEQ ID NO.: 26, 28, 55, 57, 59, 61, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 93, respectively.
  • a vector comprising the polynucleotide of the third aspect.
  • the vector comprises an expression vector, a shuttle vector, an integration vector.
  • a fifth aspect of the invention there is provided the use of the isolated polypeptide of the first or second aspect of the invention, which is used to catalyze one or more of the following reactions, or is used to prepare one or more of the following Catalytic preparation for the reaction: transferring a glycosyl group derived from a glycosyl donor to the first glycosyl group at the C-3 position or the C6 position of the tetracyclic triterpenoid to extend the sugar chain; preferably,
  • the glycosyl donor comprises a nucleoside diphosphate selected from the group consisting of UDP-glucose, ADP-glucose, TDP-glucose, CDP-glucose, GDP-glucose, UDP-acetyl Glucose, ADP-acetylglucose, TDP-acetylglucose, CDP-acetylglucose, GDP-acetylglucose, UDP-xylose, ADP-xylose, TDP-xylose, CDP-xylose, GDP-xylose , UDP-xylose, UDP-galacturonic acid, ADP-galacturonic acid, TDP-galacturonic acid, CDP-galacturonic acid, GDP-galacturonic acid, UDP-galactose, ADP-galactose , TDP-galactose, CDP-galactose, GDP-galactose, U
  • the glycosyl donor comprises a uridine diphosphate (UDP) sugar selected from the group consisting of UDP-glucose, UDP-xylose, UDP-galacturonic acid, UDP-galactose, UDP-arabinose, UDP-rhamnose, or other uridine diphosphate hexose or uridine pentose diphosphate, or a combination thereof.
  • UDP uridine diphosphate
  • the isolated polypeptide is used to catalyze one or more of the following reactions or to prepare a catalytic formulation that catalyzes one or more of the following reactions:
  • R1 is a glycosyl group
  • R2 and R3 are OH or H
  • R4 is a glycosyl group or H
  • R5 is a glycosyl group
  • R5-R1-O is a C3 first glycosyl group-derived glycosyl group, said polypeptide being selected from the group consisting of a polypeptide represented by SEQ ID NO.: 61 or a polypeptide derived therefrom; preferably, selected from the group consisting of 26, 28, 55, 57, 59, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 93, 94 or 95 or a polypeptide derived therefrom.
  • R1 is a glucosyl group
  • R2 is H
  • R3 is OH
  • R4 is H
  • the compound of the formula (I) is Rh2.
  • R1 is a glucosyl group
  • R2 is H
  • R3 is OH
  • R4 is a glucosyl group
  • the compound of formula (I) is F2.
  • the substrate (I) compound is Rh2, then the product of formula (II) is Rg3; and the substrate (I) compound is F2, then the product of formula (II) is Rd;
  • the substrate (I) compound is Rh2, and the product of formula (II) is 3-O- ⁇ -(D-xylopyranosyl)- ⁇ -(D-glucopyranosyl)- PPD; the substrate (I) compound is F2, and the product of formula (II) is 3-O- ⁇ -(D-xylopyranosyl)- ⁇ -(D-glucopyranosyl)-CK.
  • R1 and R2 are H or a glycosyl group
  • R3 and R4 are a glycosyl group
  • R3-R4-O is the first glycosyl-derived glycosyl group of C6, and the polypeptide is selected from the polypeptide of SEQ ID NO.: 61 or a polypeptide derived therefrom, preferably SEQ ID NO.: 55, 57, 59, 78, 82, 92, 94 or 95 or a polypeptide derived therefrom.
  • the substrate (III) compound is Rh1
  • the product of formula (IV) is Rf
  • the substrate (III) compound is Rh1
  • the product of formula (IV) is Rg2.
  • R1 is a glycosyl group
  • R2 and R3 are OH or H
  • R4 is a glycosyl group or H
  • R5 is a glycosyl group
  • R5-R1-O is a glycosyl group derived from the first glycosyl group of C3
  • R6 is a glycosyl group
  • R6 -R1-O is the first glycosyl-derived glycosyl group of C3, said polypeptide being selected from the polypeptide of SEQ ID NO.: 61 or a polypeptide derived therefrom, preferably SEQ ID NO.: 26, 28, 59, 76 a polypeptide as shown in 84, 86 or 88.
  • R1 is two glucosyl groups, R2 is H, R3 is OH, R4 is H, and the compound of formula (V) is Rg3.
  • R1 is two glucosyl groups
  • R2 is H
  • R3 is OH
  • R4 is a glucosyl group
  • the compound of formula (V) is Rd.
  • the substrate (V) compound is Rg3
  • the product of formula (VI) is 3-O- ⁇ -(D-xylopyranosyl)- ⁇ -(D-glucopyranosyl)- PPD
  • the substrate (V) compound is Rd
  • the product of formula (VI) is 3-O- ⁇ -(D-xylopyranosyl)- ⁇ -(D-glucopyranosyl)-CK.
  • the glycosyl group is selected from the group consisting of: glucosyl, galacturonic acid, xylose, galactosyl, arabinose, rhamnosyl, and other hexose or pentose base.
  • the compound of the formula (I) or (III) includes, but is not limited to, a dammarane type tetracyclic triterpenoid of the S configuration or the R configuration, a lanolinane type Tetracyclic triterpenoids, apotirucallane type tetracyclic triterpenes, ganthanane type tetracyclic triterpenoids, cycloalkane (cycloaltenane) type tetracyclic triterpenoids, cucurbitane tetracyclic triterpenoids a compound or a decane type tetracyclic triterpenoid.
  • a dammarane type tetracyclic triterpenoid of the S configuration or the R configuration a lanolinane type Tetracyclic triterpenoids, apotirucallane type tetracyclic triterpenes, ganthanane type tetracyclic triterpenoids, cycloalkane (cycl
  • polypeptide is selected from the group consisting of:
  • polynucleotide encoding the nucleotide of the polypeptide is a sequence selected from the group consisting of:
  • (C) a nucleotide sequence as shown in SEQ ID NO.: 25, 27, 54, 56, 58, 60, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, or 91 ;
  • (F) a nucleotide sequence that is complementary (preferably fully complementary) to the nucleotide sequence of any of (A)-(E).
  • sequence of the nucleotide is SEQ ID NO.: 25, 27, 54, 56, 58, 60, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, or 91.
  • the sequence is as shown in SEQ ID NO.: 25, 27, 54, 56, 58, 60, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, or 91.
  • the polynucleotide encoding amino acid sequences are SEQ ID NO.: 26, 28, 55, 57, 59, 61, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 93, respectively.
  • a method of performing a glycosyl transfer catalytic reaction comprising the steps of performing a glycosyl transfer catalytic reaction in the presence of the polypeptide of the second aspect of the invention or a polypeptide derived therefrom.
  • the method further includes the steps of:
  • the method further comprises separately adding the polypeptide and the polypeptide derived therefrom to a catalytic reaction; and/or
  • the polypeptide and its derived polypeptide are simultaneously added to the catalytic reaction.
  • the method further comprises ligating a nucleotide sequence encoding a glycosyltransferase with a key gene in a anabolic pathway of dammar diol and/or protopanaxadiol and/or protosol. / or other glycosyltransferase gene is co-expressed in a host cell to obtain the compound of formula (II), (IV), or (VI).
  • the host cell is a yeast or Escherichia coli.
  • the polypeptide has SEQ ID NO.: 26, 28, 55, 57, 59, 61, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, A polypeptide of the amino acid sequence shown in 92, 93, 94 or 95 and a polypeptide derived therefrom.
  • the nucleotide sequence encoding the polypeptide is SEQ ID NO.: 25, 27, 54, 56, 58, 60, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, or 91 is shown.
  • the method further comprises: providing an additive for regulating enzyme activity to the reaction system.
  • the additive for regulating enzyme activity is an additive that increases enzyme activity or inhibits enzyme activity.
  • the additive for regulating enzyme activity is selected from the group consisting of Ca 2+ , Co 2+ , Mn 2+ , Ba 2+ , Al 3+ , Ni 2+ , Zn 2+ . , or Fe 2+ .
  • the additive for regulating enzyme activity is: Ca 2+ , Co 2+ , Mn 2+ , Ba 2+ , Al 3+ , Ni 2+ , Zn 2+ , Or a substance of Fe 2+ .
  • the glycosyl donor is a nucleoside diphosphate sugar selected from the group consisting of UDP-glucose, ADP-glucose, TDP-glucose, CDP-glucose, GDP-glucose, UDP-acetyl Glucose, ADP-acetylglucose, TDP-acetylglucose, CDP-acetylglucose, GDP-acetylglucose, UDP-xylose, ADP-xylose, TDP-xylose, CDP-xylose, GDP-xylose , UDP-xylose, UDP-galacturonic acid, ADP-galacturonic acid, TDP-galacturonic acid, CDP-galacturonic acid, GDP-galacturonic acid, UDP-galactose, ADP-galactose , TDP-galactose, CDP-galactose, GDP-galactose,
  • the glycosyl donor is uridine diphosphate, selected from the group consisting of UDP-glucose, UDP-xylose, UDP-galacturonic acid, UDP-galactose, UDP-Arabic Sugar, UDP-rhamnose, or other uridine diphosphate hexose or uridine pentose diphosphate, or a combination thereof.
  • the pH of the reaction system is: pH 4.0 to 10.0, preferably pH 5.5 to 9.0.
  • the temperature of the reaction system is from 10 ° C to 105 ° C, preferably from 20 ° C to 50 ° C.
  • the key genes in the darumadiol anabolic pathway include, but are not limited to, the damasenediol synthase gene.
  • the key genes in the proto-ginsengdiol anabolic pathway include, but are not limited to, a dammarenediol synthase gene, a cytochrome P450 CYP716A47 gene, and a reductase gene thereof, or Its combination. .
  • the key genes in the proto-ginsolic triol anabolic pathway include, but are not limited to, a dammarenediol synthase gene, a cytochrome P450 CYP716A47 gene, a reductase gene thereof, and a cell. Pigment P450 CYP716A53V2 gene and its reductase gene, or a combination thereof.
  • the key genes in the ginsenoside Rh2 anabolic pathway include, but are not limited to, a dammarene diol synthase gene, a cytochrome P450 CYP716A47 gene and a reductase gene thereof, and a tetracyclic three ⁇ C-3 position glycosyltransferase gene UGTPg45, or a combination thereof.
  • the key genes in the ginsenoside Rh1 anabolic pathway include, but are not limited to, a dammarene diol synthase gene, a cytochrome P450 CYP716A47 gene, a reductase gene thereof, and a cytochrome.
  • the substrate of the glycosyl catalyzed reaction is a compound of the formula (I) or (III), respectively, and the product is a compound of (II) or (IV), respectively;
  • the compound of formula (I) is ginsenoside Rh2, and the compound of formula (II) is ginsenoside Rg3;
  • the compound of the formula (I) is ginsenoside Rh2, and the compound of the formula (II) is 3-O- ⁇ -(D-xylopyranosyl)- ⁇ -(D-glucopyranosyl)-PPD;
  • the compound of formula (I) is ginsenoside F2, and the compound of formula (II) is ginsenoside Rd;
  • the compound of the formula (I) is ginsenoside F2, and the compound of the formula (II) is 3-O- ⁇ -(D-xylopyranosyl)- ⁇ -(D-glucopyranosyl)-CK;
  • the compound of the formula (III) is ginsenoside Rh1, and the compound of the formula (IV) is ginsenoside Rf;
  • the compound of formula (III) is ginsenoside Rh1, and the compound of formula (IV) is ginsenoside Rg2;
  • the compound of the formula (V) is ginsenoside Rg3, and the compound of the formula (VI) is 3-O- ⁇ -(D-xylopyranosyl)- ⁇ -(D-glucopyranosyl)-PPD;
  • the compound of formula (V) is ginsenoside Rd
  • the compound of formula (IV) is 3-O- ⁇ -(D-xylopyranosyl)- ⁇ -(D-glucopyranosyl)-CK.
  • a genetically engineered host cell comprising the vector of the fourth aspect of the invention, or a genome thereof, which integrates the polynucleoside of the third aspect of the invention acid.
  • the glycosyltransferase is a polypeptide as described in the second aspect of the invention or a polypeptide derived therefrom.
  • nucleotide sequence encoding the glycosyltransferase is as described in the third aspect of the invention.
  • the cell is a prokaryotic cell or a eukaryotic cell.
  • the host cell is a eukaryotic cell, such as a yeast cell or a plant cell.
  • the host cell is a Saccharomyces cerevisiae cell.
  • the host cell is a prokaryotic cell, such as E. coli.
  • the host cell is a ginseng cell.
  • the host cell is not a cell that naturally produces a compound of formula (II), (IV), (VI), (VIII), (II), (IIII).
  • the host cell is not naturally producing a rare ginsenoside Rg3 and/or a rare ginsenoside Rf and/or a rare ginsenoside Rg2, and/or a new ginsenoside 3-O- ⁇ -(D- Cells such as xylopyranosyl)- ⁇ -(D-glucopyranosyl)-PPD and 3-O- ⁇ -(D-xylopyranosyl)- ⁇ -(D-glucopyranosyl)-CK.
  • a rare ginsenoside Rg3 and/or a rare ginsenoside Rf and/or a rare ginsenoside Rg2 and/or a new ginsenoside 3-O- ⁇ -(D- Cells such as xylopyranosyl)- ⁇ -(D-glucopyranosyl)-PPD and 3-O- ⁇ -(D-xylopyranosyl)- ⁇ -(D-glucopyranosy
  • the key genes in the darumadiol anabolic pathway include, but are not limited to, the damasenediol synthase gene.
  • the host cell contains key genes in the proto-glycol diol anabolic pathway including, but not limited to, the dammarenediol synthase gene, the cytochrome P450 CYP716A47 gene, and its reductase gene. , or a combination thereof.
  • the host cell contains key genes in the original ginseng triol anabolic pathway including, but not limited to, a dammarenediol synthase gene, a cytochrome P450 CYP716A47 gene, and a reductase thereof, P450 CYP716A47 reductase gene and its gene, or a combination thereof. .
  • the key genes in the ginsenoside Rh2 anabolic pathway include, but are not limited to, a dammarene diol synthase gene, a cytochrome P450 CYP716A47 gene and a reductase gene thereof, and a tetracyclic three ⁇ C-3 position glycosyltransferase gene UGTPg45, or a combination thereof.
  • the key genes in the ginsenoside Rh1 anabolic pathway include, but are not limited to, a dammarene diol synthase gene, a cytochrome P450 CYP716A47 gene, a reductase gene thereof, and a cytochrome.
  • the use of the host cell of the eighth aspect is provided for the preparation of an enzyme catalytic reagent, or for the production of a glycosyltransferase, or as a catalytic cell, or for the production of formula (II), (IV) Or (VI).
  • the host cell is used to produce a novel saponin 3-O- ⁇ -(D) by glycosylation of ginsenoside Rh2, F2, Rg3, Rd and/or ginsenoside Rh1, Rg1.
  • a method of producing a transgenic plant comprising the steps of: regenerating the genetically engineered host cell of the eighth aspect into a plant, and wherein the genetically engineered host cell is a plant cell.
  • the genetically engineered host cell is a ginseng cell.
  • Figure 1 shows agarose gel electrophoresis patterns of (a) gGT29/gGT29-3 gene and (b) gGT29-4/gGT29-5/gGT29-6 and gGT29-7 gene PCR products.
  • Lane 1 nucleic acid Marker; Lane 2, gGT29/gGT29-3 gene PCR product;
  • Lane 1 Lane 1, gGT29-4/gGT29-5/gGT29-6 gene PCR product;
  • Lane 2 gGT29-7 gene PCR Product; Lane 3, Nucleic Acid Marker.
  • Figure 2 shows SDS-PAGE detection of gGT29 and gGT29-3 expression in Saccharomyces cerevisiae; Lane 1, lysate supernatant of empty vector pYES2 recombinant; Lane 2, lysate supernatant of gGT29-pYES2 yeast recombinant; Lane 3 , lysate supernatant of gGT29-3-pYES2 yeast recombinant.
  • Figure 3 shows Western Blot detection of gGT29 and gGT29-3 expression in Saccharomyces cerevisiae; Lane 1, lysate supernatant of empty vector pYES2 recombinant; Lane 2, lysate supernatant of gGT29-pYES2 yeast recombinant; Lane 3, Lysate supernatant of gGT29-3-pYES2 yeast recombinant.
  • Figure 4 shows TLC detection profiles of glycosyltransferases gGT29 and gGT29-3 catalyzed products of ginsenoside Rh2 and F2; Lane 1, PPD and PPD type saponin mixed standards, Lane 2, gGT29 crude enzyme solution (gGT29-pYES2 yeast recombination) The lysate supernatant of the bacterium catalyzes the formation of Rg3 by Rh2, Lane 3, gGT29 crude enzyme solution catalyzes the Rh2 control, and adds pYES2 empty plasmid yeast recombinant lysate instead of the enzyme solution; Lane 4, gGT29 catalyzes the formation of Rd by F2, Lane 5, gGT29 catalysis F2 control, adding pYES2 empty plasmid yeast recombinant lysate instead of enzyme solution; Lane 6, gGT29-3 crude enzyme solution (gGT29-3-pYES2 yeast lysate supernatant) catalyzes Rh2
  • Figure 5 shows TLC detection of glycotransferase gGT29 and BvUGT73C10 or gGT29 and UGTPg45 in combination with PPD products;
  • gGT29 and BvUGT73C10 combined with catalytic PPD, lane 1, PPD and PPD type saponin mixed standards;
  • lane 2, BvUGT73C10 catalytic PPD Generation of Rh2;
  • Lane 3, gGT29 catalyzes the formation of Rg3 by Rh2;
  • Lane 4, BvUGT73C10 and gGT29 combine to catalyze the formation of Rg3 by PPD;
  • GGT29 and UGTPg45 combined with catalytic PPD, Lane 1, PPD and PPD type saponin mixed standards;
  • Lane 4, UGTPg45 and gGT29 combined with catalytic PPD to generate Rg3.
  • Figure 6 shows that the glycosyltransferases BvUGT73C10 and gGT29 catalyze the TLC detection of 20(R)-PPD and 20(R)-PPD and the combined catalyst 20(R)-PPD, respectively; Lane 1, BvUGT73C10 catalyzes 20(R)- PPD produces 20(R)-Rh2; Lane 2, gGT29 catalyzes the formation of 20(R)-Rh2 by 20(R)-Rg3; Lane 3, BvUGT73C10 and gGT29 combine to catalyze the formation of 20(R)-PP3 by 20(R)-PPD.
  • Figure 7 shows the results of HPLC detection of glycotransferases gGT29 and BvUGT73C10 or gGT29 and UGTPg45 in combination with catalytic PPD products.
  • the first row Rg3, Rh2 and PPD mixed standard samples; the second row, gGT29 and BvUGT73C10 combined catalytic PPD, the third row, gGT29 and UGTPg45 combined catalytic PPD.
  • Figure 8 shows the results of LC/MS detection of the products of glycosyltransferases gGT29 and BvUGT73C10 or gGT29 and UGTPg45 in combination with PPD.
  • the mass spectrum of the standard sample Rg3 and the mass spectrum of the P1 peak (product of gGT29 and BvUGT73C10 in combination with catalytic PPD) and the P2 peak (product product peak of gGT29 and UGTPg45 combined catalytic PPD) are shown in Fig. 7.
  • Figure 9 shows the results of HPLC detection of Rg3 yeast engineering strain A2 cell lysate extract, the first line of samples: original ginseng diol (PPD), dammar diol (DM), ginsenoside Rh2 and Rg3 mixing standards Sample; second row of samples: Rg3 yeast engineering bacteria A2 cell lysate extract.
  • Figure 10 shows the expression of gGT29-4, gGT29-5, gGT29-6, gGT29-7 in recombinant E. coli by SDS-PAGE.
  • Lane 1 gGT29-4-pET28a recombinant E. coli lysed total protein; lane 2, gGT29-4-pET28a recombinant E. coli lysate supernatant; Lane 3, gGT29-5-pET28a recombinant E. coli lysed total protein; Lane 4, gGT29-5-pET28a recombinant E. coli lysate supernatant; Lane 5, gGT29-6-pET28a recombinant E.
  • Figure 11 shows the expression of gGT29-4, gGT29-5, gGT29-6, gGT29-7 in recombinant E. coli by Western Blot.
  • Lane 1 gGT29-4-pET28a recombinant E. coli lysed total protein
  • Lane 2 gGT29-4-pET28a recombinant E. coli lysate supernatant
  • Lane 3 gGT29-5-pET28a recombinant E. coli lysed total protein
  • Lane 4 gGT29-5-pET28a recombinant E. coli lysate supernatant
  • Lane 5 gGT29-6-pET28a recombinant E.
  • Figure 12 shows that the glycosyltransferases gGT29-4, gGT29-5, gGT29-6, gGT29-7 catalyze the TLC detection of the products of Rh2 and F2, respectively.
  • Lane Rh2 indicates the use of saponin Rh2 as a substrate; and lane F2 indicates the use of saponin F2 as a substrate.
  • gGT29-4, gGT29-5, gGT29-6, gGT29-7 indicate that the catalytic reaction was carried out with different enzyme solutions.
  • Figure 13 shows that the glycosyltransferases gGT29-4, gGT29-5, gGT29-6, gGT29-7 catalyze the TLC detection of the product of Rh1, respectively.
  • Lanes 1, 2 and 3 represent the glycosyltransferases gGT29-4, gGT29-5 and gGT29-6 respectively catalyze the product of Rh1, and lane 4 represents the original ginsenotriol-type saponin mixed standard;
  • Lane 1 The product representing Rh1 is catalyzed by the glycosyltransferase gGT29-7, and the lane 2 represents the original ginseng triol type saponin mixed standard.
  • Figure 14 shows the results of the detection of Rh1 by the glycosyltransferase gGT29-7 and its mutant proteins gGT29-7-N343G, gGT29-7-A359P and gGT29-7-N343G/A359P (using both UDP-glucose and UDP-rhamnet) Sugar as a glycosyl donor).
  • Rg1, Rf, Rg2 and Rh1 are mixed with standard samples; the second row, gGT29-7 catalyzes Rh1; the third row, gGT29-7-N343G catalyzes Rh1; the fourth row, gGT29-7-A359P catalyzes Rh1; Five lines, gGT29-7-N343G/A359P catalyze Rh1.
  • Figure 15 shows TLC detection of glycosyltransferases gGT29-3 and gGT29-14 with UDP-xylose as a glycosyl donor, catalyzing the ginsenoside Rh2, Rg3 and Rd products;
  • A lysis of Enterobacteriaceae expressing the empty vector pET28a The supernatant was catalyzed as an enzyme solution;
  • B catalyzed by the supernatant of Enterobacter cloaca expressing pET28a-gGT29-3 as an enzyme solution;
  • C the supernatant of Enterobacter cloaca expressing pET28a-gGT29-14 was used as The enzyme solution is catalyzed.
  • Lanes Rh2, Rg3 and Rd represent the saponins Rh2, Rg3 and Rd as substrates, respectively, and lane M represents the original ginsengdiol-type saponin mixing standard.
  • glycosyltransferase gGT29-7 (SEQ ID NO.: 61) and its derived polypeptides for the first time, such as gGT29 (SEQ ID NO.: 26), gGT29-3 (SEQ ID NO) .: 28), gGT29-4 (SEQ ID NO.: 55), gGT29-5 (SEQ ID NO.: 57), gGT29-6 (SEQ ID NO.: 59), gGT29-8 (SEQ ID NO.: 72), gGT29-9 (SEQ ID NO.: 74), gGT29-10 (SEQ ID NO.: 76), gGT29-11 (SEQ ID NO.: 78), gGT29-12 (SEQ ID NO.: 80) , gGT29-13 (SEQ ID NO.: 82) gGT29-14 (SEQ ID NO.: 84), gGT29-15 (SEQ ID NO.: 86), gGT29-16 (SEQ ID NO.:
  • the glycosyltransferase of the present invention is capable of specifically and efficiently catalyzing the tetracyclic triterpenoid substrate and/or transferring a glycosyl group derived from a glycosyl donor to the C-3 position of a tetracyclic triterpenoid or
  • the first sugar group of C-6 is extended with a sugar chain.
  • ginseng Rh2 can be converted into rare ginsenoside Rg3 with anticancer activity
  • ginsenoside F2 can be converted into ginsenoside Rd
  • ginsenoside Rh1 can be converted into anti-tumor, anti-fatigue saponin Rf
  • ginsenoside Rh1 Transformation of rare ginsenoside Rg2 with neuroprotective effects and UV protection.
  • the invention also provides methods of transformation and catalysis.
  • the glycosyltransferase of the present invention may also be a key enzyme in the anabolic pathway of dammarane diol and/or protopanaxadiol or protopanaxatriol, and a glycosyltransferase of the tetracyclic triterpenoid C-3 or C-6 position.
  • the enzyme is co-expressed in the host cell or used in the preparation of genetically engineered cells of ginsenoside Rh2 and ginsenoside Rh1 for the construction of artificially synthesized ginsenoside Rg3 and Rf.
  • glycosyltransferase of the present invention may also be a key enzyme in the anabolic pathway of dammarane diol and/or protopanaxadiol or protopanaxadiol, and a glycosyltransferase at the C-6 position and a synthetic UDP-mouse.
  • the key enzyme of plum sugar is co-expressed in host cells and used to construct a strain of artificially synthesized ginsenoside Rg2. The present invention has been completed on this basis.
  • active polypeptide refers to glycosyltransferase gGT29-7 (SEQ ID NO.: 61) or a polypeptide derived therefrom.
  • preferred derivative polypeptides include gGT29 (SEQ ID NO.: 26), gGT29-3 (SEQ ID NO.: 28), gGT29-4 (SEQ ID NO.: 55), gGT29-5 (SEQ ID NO.: 57).
  • gGT29-6 SEQ ID NO.: 59
  • gGT29-8 SEQ ID NO.: 72
  • gGT29-9 SEQ ID NO.: 74
  • gGT29-10 SEQ ID NO.: 76
  • gGT29-11 SEQ ID NO.: 78
  • gGT29-12 SEQ ID NO.: 80
  • gGT29-13 SEQ ID NO.: 82
  • gGT29-14 SEQ ID NO.: 84
  • gGT29-15 SEQ ID NO.: 86
  • gGT29-16 SEQ ID NO.: 88
  • gGT29-17 SEQ ID NO.: 90
  • gGT29-18 SEQ ID NO.: 92
  • gGT29-7-N343G SEQ ID NO.: 93
  • gGT29-7-A359P SEQ ID NO.: 94
  • gGT29-7-N343G/A359P SEQ ID NO.: 95
  • the ginsenosides and sapogenins referred to herein are ginsenosides and sapogenins of the C20 position S and/or R configuration.
  • isolated polypeptide means that the polypeptide is substantially free of other proteins, lipids, carbohydrates or other materials with which it is naturally associated.
  • One skilled in the art can purify the polypeptide using standard protein purification techniques. A substantially pure polypeptide produces a single major band on a non-reducing polyacrylamide gel. The purity of the polypeptide can also be further analyzed using amino acid sequences.
  • the active polypeptide of the present invention may be a recombinant polypeptide, a natural polypeptide, or a synthetic polypeptide.
  • the polypeptides of the invention may be naturally purified products, either chemically synthesized or produced recombinantly from prokaryotic or eukaryotic hosts (e.g., bacteria, yeast, plants).
  • the polypeptide of the invention may be glycosylated or may be non-glycosylated, depending on the host used in the recombinant production protocol.
  • Polypeptides of the invention may also or may not include an initial methionine residue.
  • the invention also includes fragments, derivatives and analogs of the polypeptides.
  • fragment refers to a polypeptide that substantially retains the same biological function or activity of the polypeptide.
  • the polypeptide fragment, derivative or analog of the present invention may be (i) a polypeptide having one or more conservative or non-conservative amino acid residues (preferably conservative amino acid residues) substituted, and such substituted amino acid residues It may or may not be encoded by the genetic code, or (ii) a polypeptide having a substituent group in one or more amino acid residues, or (iii) a mature polypeptide and another compound (such as a compound that extends the half-life of the polypeptide, for example Polyethylene glycol) a polypeptide formed by fusion, or (iv) a polypeptide formed by fused an additional amino acid sequence to the polypeptide sequence (such as a leader or secretion sequence or a sequence or proprotein sequence used to purify the polypeptide, or A fusion protein for the formation of an antigenic IgG fragment).
  • a polypeptide having one or more conservative or non-conservative amino acid residues preferably conservative amino acid residues
  • substituted amino acid residues It
  • the active polypeptide of the present invention has glycosyltransferase activity and is capable of catalyzing one or more of the following reactions:
  • R1 is a glycosyl group
  • R2 and R3 are OH or H
  • R4 is a glycosyl group or H
  • R5 is a glycosyl group
  • said polypeptide is selected from the group consisting of SEQ ID NO.: 26, 28, 55, 57, 59, 61, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 93, 94 or 95 or a polypeptide derived therefrom.
  • R1 is a glucosyl group
  • R2 is H
  • R3 is OH
  • R4 is H
  • the compound of the formula (I) is Rh2.
  • R1 is a glucosyl group
  • R2 is H
  • R3 is OH
  • R4 is a glucosyl group
  • the compound of formula (I) is F2.
  • the substrate (I) compound is Rh2, then the product of formula (II) is Rg3; and the substrate (I) compound is F2, then the product of formula (II) is Rd;
  • the substrate (I) compound is Rh2, and the product of formula (II) is 3-O- ⁇ -(D-xylopyranosyl)- ⁇ -(D-glucopyranosyl)- PPD; the substrate (I) compound is F2, and the product of formula (II) is 3-O- ⁇ -(D-xylopyranosyl)- ⁇ -(D-glucopyranosyl)-CK.
  • R1 and R2 are H or a glycosyl group
  • R3 and R4 are a glycosyl group
  • R3-R4-O is the first glycosyl-derived glycosyl group of C6, and the polypeptide is selected from the group consisting of SEQ ID NO.: 55, 57, 59, 61, 78, 82, 92, 94 or 95 or a polypeptide derived therefrom.
  • the substrate (IIII) compound is Rh1
  • the product of formula (IV) is Rf
  • the substrate (IIII) compound is Rh1
  • the product of formula (IV) is Rg2;
  • R1 is a glycosyl group
  • R2 and R3 are OH or H
  • R4 is a glycosyl group or H
  • R5 is a glycosyl group
  • R5-R1-O is a glycosyl group derived from the first glycosyl group of C3
  • R6 is a glycosyl group
  • R6 -R1-O is the first glycosyl-derived glycosyl group of C3
  • the polypeptide is selected from the group consisting of SEQ ID NO.: 26, 28, 59, 76, 84, 86 or 88 or a polypeptide derived therefrom.
  • R1 is two glucosyl groups, R2 is H, R3 is OH, R4 is H, and the compound of formula (V) is Rg3.
  • R1 is two glucosyl groups
  • R2 is H
  • R3 is OH
  • R4 is a glucosyl group
  • the compound of formula (V) is Rd.
  • the substrate (V) compound is Rg3
  • the product of formula (VI) is 3-O- ⁇ -(D-xylopyranosyl)- ⁇ -(D-glucopyranosyl)- PPD
  • the substrate (V) compound is Rd
  • the product of formula (VI) is 3-O- ⁇ -(D-xylopyranosyl)- ⁇ -(D-glucopyranosyl)-CK.
  • the polypeptide sequence is SEQ ID NO.: 61 or a derivative thereof, preferably a derivative polypeptide 26, 28, 55, 57, 59, 72, 74, 76, 78, 80, 82, 84, 86, 88, a polypeptide as shown in 90, 92, 93, 94 or 95, the term further comprising SEQ ID NO.: 26, 28, 55, 57, 59, 61, 72, 74, 76 having the same function as the polypeptide shown.
  • variations include (but are not limited to): one or more (usually 1-50, Preferably 1-30, more preferably 1-20, optimally 1-10) amino acid deletions, insertions and/or substitutions, and addition of one or several at the C-terminus and/or N-terminus (usually It is an amino acid of 20 or less, preferably 10 or less, more preferably 5 or less.
  • amino acid deletions usually 1-50, Preferably 1-30, more preferably 1-20, optimally 1-10) amino acid deletions, insertions and/or substitutions, and addition of one or several at the C-terminus and/or N-terminus (usually It is an amino acid of 20 or less, preferably 10 or less, more preferably 5 or less.
  • the function of the protein is generally not altered.
  • the addition of one or several amino acids at the C-terminus and/or N-terminus will generally not alter the function of the protein.
  • the term also encompasses active fragments and active derivatives of the proteins of the invention.
  • the invention also provides analogs of the poly
  • the difference between these analogs and the natural polypeptide may be a difference in amino acid sequence, a difference in the modification form which does not affect the sequence, or a combination thereof.
  • These polypeptides include natural or induced genetic variants. Induced variants can be obtained by a variety of techniques, such as random mutagenesis by irradiation or exposure to a mutagen, or by site-directed mutagenesis or other techniques known to molecular biology. Analogs also include analogs having residues other than the native L-amino acid (such as D-amino acids), as well as analogs having non-naturally occurring or synthetic amino acids (such as beta, gamma-amino acids). It is to be understood that the polypeptide of the present invention is not limited to the representative polypeptides exemplified above.
  • Modifications include chemically derived forms of the polypeptide, such as acetylation or carboxylation, in vivo or in vitro. Modifications also include glycosylation, such as those produced by glycosylation modifications in the synthesis and processing of the polypeptide or in further processing steps. Such modification can be accomplished by exposing the polypeptide to an enzyme that performs glycosylation, such as a mammalian glycosylation enzyme or a deglycosylation enzyme. Modified forms also include sequences having phosphorylated amino acid residues such as phosphotyrosine, phosphoserine, phosphothreonine. Also included are polypeptides modified to increase their resistance to proteolytic properties or to optimize solubility properties.
  • the gGT29-7 polypeptide of the present invention or a polypeptide derived therefrom for example, preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29
  • the carboxy terminus may also contain one or more polypeptide fragments as a protein tag.
  • the tags can be FLAG, HA, HA1, c-Myc, Poly-His, Poly-Arg, Strep-TagII, AU1, EE, T7, 4A6, ⁇ , B, gE, and Ty1. These tags can be used to purify proteins. Table 1 lists some of these tags and their sequences.
  • the gGT29-7 polypeptide or a derivative thereof preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29, may also be used.
  • amino acid amino terminus of the amino acid is added with a signal peptide sequence, such as a pelB signal peptide.
  • the signal peptide can be cleaved off during secretion of the polypeptide from the cell.
  • the polynucleotide of the present invention may be in the form of DNA or RNA.
  • DNA forms include cDNA, genomic DNA or synthetic DNA.
  • DNA can be single-stranded or double-stranded.
  • the DNA can be a coding strand or a non-coding strand.
  • the coding region sequence encoding the mature polypeptide may be the same as the coding region sequence shown in SEQ ID NO.: 60 or a degenerate variant.
  • a "degenerate variant" in the present invention refers to a polypeptide having SEQ ID NO.: 61 or a derivative thereof, preferably SEQ ID NO.: 26, 28, 55, 57, 59, 72, 74.
  • a protein of 76, 78, 80, 82, 84, 86, 88, 90, 92, 93, 94 or 95 but with the coding sequence of SEQ ID NO.: 60 or a derivative thereof, preferably SEQ ID NO.: A nucleic acid sequence having a sequence different from 25, 27, 54, 56, 58, 60, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, or 91.
  • Polynucleotides of mature polypeptides of 90, 92, 93, 94 or 95 include: coding sequences encoding only mature polypeptides; coding sequences for mature polypeptides and various additional coding sequences; coding sequences for mature polypeptides (and optional additional coding) Sequence) and non-coding sequences.
  • polynucleotide encoding a polypeptide can be a polynucleotide comprising the polypeptide, or a polynucleotide further comprising additional coding and/or non-coding sequences.
  • the invention also relates to variants of the above polynucleotides which encode fragments, analogs and derivatives of polypeptides or polypeptides having the same amino acid sequence as the invention.
  • Variants of this polynucleotide may be naturally occurring allelic variants or non-naturally occurring variants. These nucleotide variants include substitution variants, deletion variants, and insertion variants.
  • an allelic variant is an alternative form of a polynucleotide that may be a substitution, deletion or insertion of one or more nucleotides, but does not substantially alter the function of the polypeptide encoded thereby. .
  • the invention also relates to polynucleotides which hybridize to the sequences described above and which have at least 50%, preferably at least 70%, more preferably at least 80% identity between the two sequences.
  • the invention particularly relates to polynucleotides that hybridize to the polynucleotides of the invention under stringent conditions (or stringent conditions).
  • stringent conditions means: (1) hybridization and elution at a lower ionic strength and higher temperature, such as 0.2 x SSC, 0.1% SDS, 60 ° C; or (2) hybridization a denaturing agent such as 50% (v/v) formamide, 0.1% calf serum / 0.1% Ficoll, 42 ° C, etc.; or (3) at least 90% identity between the two sequences, more It is good that hybridization occurs more than 95%.
  • polypeptide encoded by the hybridizable polynucleotide is SEQ ID NO.: 26, 28, 55, 57, 59, 61, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92
  • the mature polypeptides shown in 93, 94 or 95 have the same biological function and activity.
  • nucleic acid fragments that hybridize to the sequences described above.
  • a "nucleic acid fragment” is at least 15 nucleotides in length, preferably at least 30 nucleotides, more preferably at least 50 nucleotides, and most preferably at least 100 nucleotides or more.
  • Nucleic acid fragments can be used in nucleic acid amplification techniques (eg, PCR) to identify and/or isolate encoding gGT29-7 polypeptides or derived polypeptides thereof, preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29 -8, gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7-N343G, gGT29 Polynucleotide of -7-A359P, or gGT29-7-N343G/A359P protein.
  • PCR nucleic acid amplification techniques
  • polypeptides and polynucleotides of the invention are preferably provided in isolated form, more preferably purified to homogeneity.
  • the gGT29-7 polypeptide of the present invention or a polypeptide derived therefrom preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29- 12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7-N343G
  • the full-length nucleotide sequence of gGT29-7-A359P or gGT29-7-N343G/A359P or a fragment thereof can usually be obtained by a PCR amplification method, a recombinant method or a synthetic method.
  • primers can be designed in accordance with the disclosed nucleotide sequences, particularly open reading frame sequences, and can be prepared using commercially available cDNA libraries or conventional methods known to those skilled in the art.
  • the library is used as a template to amplify the relevant sequences. When the sequence is long, it is often necessary to perform two or more PCR amplifications, and then the amplified fragments are spliced together in the correct order.
  • the recombinant sequence can be used to obtain the relevant sequences in large quantities. This is usually done by cloning it into a vector, transferring it to a cell, and then isolating the relevant sequence from the proliferated host cell by conventional methods.
  • synthetic sequences can be used to synthesize related sequences, especially when the fragment length is short.
  • a long sequence of fragments can be obtained by first synthesizing a plurality of small fragments and then performing the ligation.
  • DNA sequence encoding the protein of the present invention (or a fragment thereof, or a derivative thereof) completely by chemical synthesis.
  • the DNA sequence can then be introduced into various existing DNA molecules (or vectors) and cells known in the art.
  • mutations can also be introduced into the protein sequences of the invention by chemical synthesis.
  • a method of amplifying DNA/RNA using PCR technology is preferably used to obtain the gene of the present invention.
  • RACE method RACE-cDNA end rapid amplification method
  • primers for PCR can be appropriately selected according to the sequence information of the present invention disclosed herein. And can be synthesized by conventional methods.
  • the amplified DNA/RNA fragment can be isolated and purified by conventional methods such as by gel electrophoresis.
  • the invention also relates to a vector comprising a polynucleotide of the invention, and a vector of the invention or a gGT29-7 polypeptide or a derivative thereof, preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7-N343G, A host cell produced by genetic engineering of the gGT29-7-A359P or gGT29-7-N343G/A359P protein coding sequence, and a method of producing a polypeptide of the present invention by recombinant techniques.
  • the polynucleotide sequence of the present invention can be used to express or produce a recombinant gGT29-7 polypeptide or a derivative thereof by conventional recombinant DNA techniques, preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29- 7-N343G, gGT29-7-A359P or gGT29-7-N343G/A359P polypeptide.
  • gGT29-7 polypeptide of the present invention or a polypeptide derived therefrom, preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7-N343G, gGT29-7-A359P or gGT29-7-N343G/ A polynucleotide (or variant) of the A359P polypeptide, or a recombinant host vector comprising the polynucleotide, which is transformed or transduced into a suitable host cell;
  • the gGT29-7 polypeptide or a polypeptide derived therefrom is preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29. -12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7-N343G, gGT29-7-A359P or gGT29-7-N343G/A359P polynucleotide sequence It can be inserted into a recombinant expression vector.
  • recombinant expression vector refers to bacterial plasmids, phage, yeast plasmids, plant cell diseases well known in the art. Toxic, mammalian cell viruses such as adenoviruses, retroviruses or other vectors. Any plasmid and vector can be used as long as it can replicate and stabilize in the host.
  • An important feature of expression vectors is that they typically contain an origin of replication, a promoter, a marker gene, and a translational control element.
  • gGT29-7-containing polypeptide or a derivative thereof preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7-N343G, gGT29-7-A359P or gGT29- 7-N343G/A359P An expression vector encoding a DNA sequence and a suitable transcription/translation control signal.
  • the DNA sequence can be operably linked to an appropriate promoter in an expression vector to direct mRNA synthesis.
  • promoters are: lac or trp promoter of E. coli; lambda phage PL promoter; eukaryotic promoters include CMV immediate early promoter, HSV thymidine kinase promoter, early and late SV40 promoter, anti- Promoters for transcription of viral LTRs and other known controllable genes in prokaryotic or eukaryotic cells or their viruses.
  • the expression vector also includes a ribosome binding site for translation initiation and a transcription terminator.
  • the expression vector preferably comprises one or more selectable marker genes to provide phenotypic traits for selection of transformed host cells, such as dihydrofolate reductase for eukaryotic cell culture, neomycin resistance, and green Fluorescent protein (GFP), or tetracycline or ampicillin resistance for E. coli.
  • selectable marker genes to provide phenotypic traits for selection of transformed host cells, such as dihydrofolate reductase for eukaryotic cell culture, neomycin resistance, and green Fluorescent protein (GFP), or tetracycline or ampicillin resistance for E. coli.
  • Vectors comprising the appropriate DNA sequences described above, as well as appropriate promoters or control sequences, can be used to transform appropriate host cells to enable expression of the protein.
  • the host cell can be a prokaryotic cell, such as a bacterial cell; or a lower eukaryotic cell, such as a yeast cell; or a higher eukaryotic cell, such as a mammalian cell.
  • a prokaryotic cell such as a bacterial cell
  • a lower eukaryotic cell such as a yeast cell
  • a higher eukaryotic cell such as a mammalian cell.
  • Representative examples are: Escherichia coli, Streptomyces; bacterial cells of Salmonella typhimurium; fungal cells such as yeast; plant cells; insect cells of Drosophila S2 or Sf9; CHO, COS, 293 cells, or Bowes melanoma cells Animal cells, etc.
  • an enhancer sequence is inserted into the vector.
  • An enhancer is a cis-acting factor of DNA, usually about 10 to 300 base pairs, acting on a promoter to enhance transcription of the gene.
  • Usable examples include a 100 to 270 base pair SV40 enhancer on the late side of the replication initiation point, a polyoma enhancer on the late side of the replication initiation site, and an adenovirus enhancer.
  • Transformation of host cells with recombinant DNA can be carried out using conventional techniques well known to those skilled in the art.
  • the host is a prokaryote such as E. coli
  • competent cells capable of absorbing DNA can be harvested after the exponential growth phase and treated by the CaCl 2 method, and the procedures used are well known in the art.
  • Another method is to use MgCl 2.
  • Conversion can also be carried out by electroporation if desired.
  • the host is a eukaryote, the following DNA transfection methods can be used: calcium phosphate coprecipitation, conventional mechanical methods such as microinjection, electroporation, liposome packaging, and the like.
  • the obtained transformant can be cultured by a conventional method to express the polypeptide encoded by the gene of the present invention.
  • the medium used in the culture may be selected from various conventional media depending on the host cell used.
  • the cultivation is carried out under conditions suitable for the growth of the host cell.
  • the selected promoter is induced by a suitable method (such as temperature conversion or chemical induction) and the cells are cultured for a further period of time.
  • the recombinant polypeptide in the above method can be expressed intracellularly, or on the cell membrane, or secreted outside the cell.
  • the recombinant protein can be isolated and purified by various separation methods using its physical, chemical, and other properties. These methods are well known to those skilled in the art. Examples of these methods include but Not limited to: conventional renaturation treatment, treatment with protein precipitant (salting method), centrifugation, osmotic bacteria, super treatment, ultracentrifugation, molecular sieve chromatography (gel filtration), adsorption chromatography, ion exchange chromatography, High performance liquid chromatography (HPLC) and various other liquid chromatography techniques and combinations of these methods.
  • the active polypeptide or peptidyl transferase gGT29-7 polypeptide or derivative thereof according to the present invention is preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, gGT29-9, gGT29- 10.
  • N343G/A359P Uses of N343G/A359P include, but are not limited to, the specific and efficient transfer of a glycosyl group from a glycosyl donor to the first glycosyl group at the C-3 and C-6 positions of the tetracyclic triterpenoid To extend the sugar chain.
  • ginsenoside Rh2 can be converted into rare ginsenoside Rg3 with better anticancer activity; ginsenoside F2 can be converted into ginsenoside Rd; ginsenoside Rh1 can be converted into rare ginsenoside Rf having antitumor and anti-fatigue activity; Ginsenoside Rh1 transforms rare ginsenoside Rg2 with neuroprotective effect and UV protection; the glycosyltransferase of the present invention can also synthesize ginsenoside Rh2, Rg3 and Rd into a novel saponin 3-O- ⁇ - (not previously reported) D-xylopyranosyl)- ⁇ -(D-glucopyranosyl)-PPD and (3-O- ⁇ -(D-xylopyranosyl)- ⁇ -(D-glucopyranosyl)-CK.
  • the tetracyclic triterpene compound includes, but is not limited to, a dammarane type, a lanolin type, a ganthanane type, a cycloalkane (cycloaltenane) type in the S configuration or the R configuration, a tetracyclic triterpenoid such as apotirucallane type, cucurbitane or decane type.
  • the present invention provides an industrial catalytic method comprising: using a gGT29-7 polypeptide of the present invention or a derivative thereof, preferably gGT29, gGT29-3, gGT29-4, gGT29-5, under conditions providing a glycosyl donor , gGT29-6, gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29
  • a compound of formula (II), (IV) or formula (VI) is obtained from -7-N343G, gGT29-7-A359P or gGT29-7-N343G/A359P active polypeptide or peptidyl transferase.
  • the polypeptide used in the (A) reaction is selected from the polypeptide represented by SEQ ID NO.: 61 or a polypeptide derived therefrom, preferably SEQ ID NO.: 26, 28, 55, 57, 59, 72, 74
  • the polypeptide used in the (B) reaction is selected from the group consisting of SEQ ID NO.: 55 , 57, 59, 61, 78, 82, 92, 94 or 95
  • the polypeptide used in the (C) reaction is selected from the polypeptide of SEQ ID NO.: 61 or a polypeptide derived therefrom, preferably SEQ ID NO.: 26, 28, 59, 76, 84, 86 or 88.
  • the glycosyl donor is a nucleoside diphosphate sugar selected from the group consisting of UDP-glucose, ADP-glucose, TDP-glucose, CDP-glucose, GDP-glucose, UDP-acetylglucose, ADP-acetylglucose , TDP-acetylglucose, CDP-acetylglucose, GDP-acetylglucose, UDP-xylose, ADP-xylose, TDP-xylose, CDP-xylose, UDP-xylose, GDP-xylose, UDP -galacturonic acid, ADP-galacturonic acid, TDP-galacturonic acid, CDP-galacturonic acid, GDP-galacturonic acid, UDP-galactose, ADP-galactose, TDP-galactose, CDP - Galactose, GDP-galactose,
  • the glycosyl donor is preferably uridine diphosphate, selected from the group consisting of UDP-glucose, UDP-xylose, UDP-rhamnose, UDP-galacturonic acid, UDP-galactose, UDP-Arabic Sugar, or other uridine diphosphate Acid hexose or uridine pentose diphosphate, or a combination thereof.
  • uridine diphosphate selected from the group consisting of UDP-glucose, UDP-xylose, UDP-rhamnose, UDP-galacturonic acid, UDP-galactose, UDP-Arabic Sugar, or other uridine diphosphate Acid hexose or uridine pentose diphosphate, or a combination thereof.
  • an enzyme active additive (an additive that increases enzyme activity or inhibits enzyme activity) may also be added.
  • the enzyme activity additive may be selected from the group consisting of Ca 2+ , Co 2+ , Mn 2+ , Ba 2+ , Al 3+ , Ni 2+ , Zn 2+ , or Fe 2+ ; a substance of Ca 2+ , Co 2+ , Mn 2+ , Ba 2+ , Al 3+ , Ni 2+ , Zn 2+ , or Fe 2+ .
  • the pH conditions of the process are: pH 4.0-10.0, preferably pH 6.0-pH 8.5, more preferably 8.5.
  • the temperature conditions of the process are from 10 ° C to 105 ° C, preferably from 25 ° C to 35 ° C, more preferably 35 ° C.
  • the present invention also provides a composition comprising an effective amount of the active polypeptide or peptidyl transferase gGT29-7 polypeptide of the present invention or a polypeptide derived therefrom, preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29- 7-N343G, gGT29-7-A359P or gGT29-7-N343G/A359P, and a food or industrially acceptable carrier or excipient.
  • Such carriers include, but are not limited to, water, buffer, dextrose, water, glycerol, ethanol, and combinations thereof.
  • Substances which modulate the glycosyltransferase activity of the present invention may also be added to the composition. Any substance having a function of increasing the activity of the enzyme is available. Preferably, the substance which increases the glycosyltransferase activity of the present invention is selected from the group consisting of mercaptoethanol.
  • many substances can reduce enzyme activity, including but not limited to: Ca 2+ , Co 2+ , Mn 2+ , Ba 2+ , Al 3+ , Ni 2+ , Zn 2+ , and Fe 2+ ;
  • Substrate can be hydrolyzed to form Ca 2+ , Co 2+ , Mn 2+ , Ba 2+ , Al 3+ , Ni 2+ , Zn 2+ and Fe 2+ .
  • the enzyme can be conveniently applied by a person skilled in the art to function as a transglycosyl group, particularly for the transglycosylation of dammar diol, protopanaxadiol and protopanaxatriol.
  • rare ginsenosides there are also provided two methods for forming rare ginsenosides, one of which comprises: using the gGT29-7 polypeptide of the present invention or a derivative thereof, preferably gGT29, gGT29-3, gGT29- 4.
  • the gGT29-7 polypeptide or a polypeptide derived therefrom is used under the condition of pH 3.5-10.
  • gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7-N343G or gGT29-7- A359P or gGT29-7-N343G/A359P enzyme treats the substrate to be transglycosylated.
  • the gGT29-7 polypeptide or a polypeptide derived therefrom is used at a temperature of 30-105 °C.
  • the second method comprises: the gGT29-7 polypeptide of the present invention or a polypeptide derived therefrom, preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, gGT29-9, gGT29 -10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7-N343G, gGT29-7-A359P or gGT29-7 -N343G/A359P gene transfer can be synthesized In the engineered bacteria of saponin Rh2 or Rh1 (for example, yeast or Escherichia coli engineering bacteria), or the gGT29-7 polypeptide or a polypeptide derived therefrom, preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29- 6,
  • the gGT29-7 polypeptide or a polypeptide derived therefrom preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29- 12.
  • gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7-N343G, or gGT29-7-A359P, or gGT29-7-N343G/A359P gene Key enzymes in the anabolic pathway of methenediol and/or protopanaxadiol or protosol ginseng, and glycosyltransferases at the C-6 position of tetracyclic triterpenes and key enzymes for the synthesis of UDP-rhamnose in host cells Expression, applied to the construction of recombinant strains of synthetic ginsenoside R
  • the key genes in the dammar diol anabolic pathway include, but are not limited to, the dammarene diol synthase gene.
  • the key genes in the proto-ginsengdiol anabolic pathway include, but are not limited to, a dammarenediol synthase gene, a cytochrome P450 CYP716A47 gene, and a reductase gene thereof, or combination. Or isozymes of various enzymes and combinations thereof.
  • dammarene diol synthase converts squalene (Saccharomyces cerevisiae self-synthesis) into dammarene diol
  • cytochrome P450 CYP716A47 and its reductase convert dammarene diol into protocanthodiol .
  • the key genes in the proto-ginsolic triol anabolic pathway include, but are not limited to, a dammarenediol synthase gene, a cytochrome P450 CYP716A47 gene, a reductase gene thereof, and a cell. Pigment P450 CYP716A53V2 gene and its reductase gene, or a combination thereof. Or isozymes of various enzymes and combinations thereof.
  • dammarene diol synthase converts squalene (Saccharomyces cerevisiae self-synthesis) into dammarene diol
  • cytochrome P450 CYP716A47 and its reductase convert dammarene diol into protocanthodiol
  • the cytochrome P450 CYP716A53v2 (JII036031) and its reductase further convert the original ginseng diol to the original ginseng triol.
  • the key genes in the ginsenoside Rh2 anabolic pathway include, but are not limited to, a dammarene diol synthase gene, a cytochrome P450 CYP716A47 gene and a reductase gene thereof, and a tetracyclic three ⁇ C-3 position glycosyltransferase UGTPg45 or a combination thereof. Or isozymes of various enzymes and combinations thereof.
  • dammarene diol synthase converts squalene (Saccharomyces cerevisiae self-synthesis) into dammarene diol
  • cytochrome P450 CYP716A47 and its reductase convert dammarene diol into protocanthodiol
  • the glycosyltransferase UGTPg45 can further convert protoglycan diol to Rh2 (Wang et. al, Metabolic Engineering, 2015, 29.97-105).
  • the key genes in the ginsenoside Rh1 anabolic pathway include, but are not limited to, a dammarene diol synthase gene, a cytochrome P450 CYP716A47 gene, a reductase gene thereof, and a cytochrome.
  • dammarene diol synthase converts squalene (Saccharomyces cerevisiae self-synthesis) into dammarane diol
  • cytochrome P450 CYP716A47 and its reductase convert dammarene diol into proto-ginseng diol
  • cytochrome P450 CYP716A53v2 JII036031
  • reductase to further convert proto-ginseng diol into proto-ginstriol
  • glycosyltransferase UGTPg100 The original ginseng triol was further converted to Rh1 (Wei et. al, Molecular Plant, 2015, 15. doi: 10.1016/j.molp. 2015.05.010).
  • the glycosyltransferase of the present invention can specifically and efficiently transfer a glycosyl group derived from a glycosyl donor to the first sugar at the C-3 position and/or the C-6 position of the tetracyclic triterpenoid Extended sugar chain
  • the glycosyltransferase of the present invention is particularly capable of converting ginsenoside Rh2 and Rh1 into rare ginsenoside Rg3 having better anticancer activity, and ginsenoside Rf having antitumor and anti-fatigue effects, respectively, and having neuroprotection
  • ginsenoside Rg2 in the role of UV protection
  • a synthetic route of ginsenosides (Damadiol, Protopanaxadiol, and Protopanaxatriol) or a synthetic route of rare ginsenoside Rh2 or Rh1 was constructed in yeast to achieve a monosaccharide such as glucose. Fermentation with yeast to produce rare ginsenosides Rg3 and Rf.
  • the synthetic route of rare ginsenoside Rh1 and the route of synthesizing UDP-rhamnose were constructed in yeast, and yeast was used to ferment and produce rare ginsenoside Rg2. This not only solves the problem of the source of raw materials for saponin production, but also greatly reduces the production cost of the rare saponins Rg3, Rg2 and Rf.
  • More than 100 cDNA sequences predicted to be glycosyltransferases were extracted from the published expression data of Panax species, and 60 full-length cDNA sequences were cloned and expressed and transglycosylated. 17 of them were analyzed. The expression product has transglycosylation activity on ginsenosides and saponins.
  • Ginseng RNA was extracted and reverse transcribed to obtain cDNA of ginseng.
  • PCR amplification was carried out using this cDNA as a template, using primer pair 1 (SEQ ID NO.: 7, 8); primer pair 2 (SEQ ID NO.: 9, 10); primer pair 3 (SEQ ID NO.: 11) , 12); Primer Pair 5 (SEQ ID NO.: 34, 35); Primer Pair 7 (SEQ ID NO.: 46, 47); Primer Pair 8 (SEQ ID NO.: 62, 63); Primer Pair 9 ( SEQ ID NO.: 64, 65) All obtained amplification products.
  • DNA polymerase uses the high-fidelity KOD DNA polymerase from Biotech Engineering Co., Ltd.
  • the PCR product was detected by agarose gel electrophoresis (Fig. 1). Irradiate in the ultraviolet and cut off the target DNA band. The amplified DNA fragment was then recovered from the agarose gel using an AIIygen Gel EIItraction Kit (AEYGEN). This DNA fragment was ligated with the commercially available cloning vector pMD18-T Vector after the end of A with the rTaq DNA polymerase of Biosciences Co., Ltd., and the ligated product was transformed into a commercially available E. coli EPI300 competent cell, and the transformed large intestine was transformed.
  • a commercially available cloning vector pMD18-T Vector after the end of A with the rTaq DNA polymerase of Biosciences Co., Ltd.
  • the Bacillus solution was coated on an LB plate supplemented with ampicillin 50 ug/mL, IPTG 0.5 mM, II-Gal 25 ⁇ g/mL, and the recombinant clone was further verified by PCR and restriction enzyme digestion.
  • One of the clones was selected to extract the recombinant plasmid and then sequenced.
  • the open reading frame (ORF) was searched using the BESTORF software. By sequence alignment, the ORF encodes a PSPG cassette of the first functional conserved domain of the glycosyltransferase, indicating a glycosyltransferase gene.
  • the gene obtained with primer pair 5 (SEQ ID NO.: 34, 35) has the nucleotide sequence shown by SEQ ID NO.: 25, 27, and is named gGT29 and gGT29-3, respectively. Among them, the corresponding genetic information is shown in Table 2.
  • the nucleotide sequence of the gGT29-7-derived polypeptide having the gene shown in Table 3 was obtained using primer pair 7 (SEQ ID NO.: 62, 63) and designated as gGT29-4, gGT29-5, gGT29-6, gGT29-8. , gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-17 and gGT29-18.
  • the corresponding genetic information is shown in Table 3.
  • the gene obtained with primer pair 8 (SEQ ID NO.: 64, 65) had the nucleotide sequences shown in Table 4 and designated as gGT29-7, gGT29-15 and gGT29-16. Among them, the corresponding genetic information is shown in Table 4.
  • the target genes were amplified using the plasmids gGT29-pMD18T and gGT29-3-pMD18T containing the gGT29 and gGT29-3 genes constructed in Example 1 as templates.
  • the forward primers used for gGT29 were (SEQ ID NO.: 36), and the Kpn I recognition site was added to the 5' end: GGATCC; the reverse primers used were all (SEQ ID NO.: 37), and the 5' end was added with IIhoI.
  • the forward primers used for gGT29-3 were (SEQ ID NO.: 38), and the Kpn I recognition site was added to the 5' end: GGATCC; the reverse primers used were all (SEQ ID NO.: 39), and the 5' end The IIhoI recognition site was added: CTCGAG, and the reverse primer introduced 6-His Tag for Western Blot detection expression and purification.
  • DNA polymerase was selected from Toyobo's high-fidelity DNA polymerase kod, and the PCR program was set up according to the instructions: 94 ° C for 2 min; 94 ° C for 15 s, 58 ° C for 30 s, 68 ° C for 1.5 min, for a total of 30 cycles; 68 ° C for 10 min; 10 ° C Keep warm.
  • the PCR product was detected by agarose gel electrophoresis, and the band of the same size as the target DNA was cut under ultraviolet light.
  • the DNA fragment was then recovered from the agarose gel using AIIYGEN's AIIyPrep DNA Gel EIItraction Kit. Using Takara The DNA fragments recovered by double-cutting of the QuickCut restriction enzymes Kpn I and IIba I were used for 30 min, and the digested products were cleaned and recovered using AIIYGEN's AIIyPrep PCR Cleanup Kit.
  • the digested product was ligated with S. cerevisiae expression plasmid pYES2 (also cut with Kpn I and IIba I and tapped and recovered) using NEB T4 DNA ligase for 2 h at 25 °C. The ligation product was transformed into E.
  • the constructed expression plasmids gGT29-pYES2 and gGT29-3-pYES2 were transformed into Saccharomyces cerevisiae by electroporation and plated on SC-Ura (0.67% yeast amino acid-free basic nitrogen source, 2% glucose). ). Yeast recombinants were verified by colony PCR. Yeast recombinant colonies were picked up in 10 mL of SC-Ura (2% glucose) medium and incubated at 30 ° C for 200 h at 200 rpm.
  • the cells were collected by centrifugation at 3500 g at 4 ° C, the cells were washed twice with sterile deionized water, and the cells were resuspended in induction medium SC-Ura (2% galactose) and inoculated into 50 mL of induction medium to make OD 600. At about 0.4, induction was initiated at 30 ° C at 200 rpm. The cells induced to express for 12 hours were collected by centrifugation at 3500 g at 4 ° C, and the cells were washed twice with sterile deionized water and resuspended in yeast lysis buffer to give an OD 600 between 50 and 100.
  • induction medium SC-Ura 2% galactose
  • the yeast cells were disrupted by shaking with a Fastprep cell disrupter, and the cell debris was removed by centrifugation at 12000 g for 10 min at 4 ° C, and the supernatant of the cell lysate was collected. The appropriate amount of lysate supernatant was subjected to SDS-PAGE electrophoresis. Compared with the pYES2 empty vector recombinant, the gGT29-pYES2 and gGT29-3-pYES2 recombinants showed no significant banding (Fig. 2). The expression was detected by anti-6-His Tag Western Blot.
  • Saccharomyces cerevisiae expressing gGT29 and gGT29-3 showed strong Western Blot signal, indicating that both gGT29 and gGT29-3 were soluble in yeast, and transferred to pYES2 empty vector. The recombinant did not have an anti-6-His Tag Western Blot signal (Figure 3).
  • the recombinant yeast cleavage supernatant expressing gGT29 and gGT29-3 was used as an enzyme solution to catalyze the transglycosylation reaction of ginsenoside Rh2 and F2, and the recombinant yeast cleavage supernatant expressing empty vector was used as a control.
  • the 100 ⁇ L reaction system is shown in Table 3. The reaction was carried out at 35 ° C for 12 h, then 100 ⁇ L of butanol was added to terminate the reaction, and the product was extracted. The product was dried under vacuum and dissolved in methanol.
  • the reaction product was first detected by thin layer chromatography (TLC), and the yeast host cleavage supernatant solution expressing gGT29 and gGT29-3 can be further extended into a glycosyl group at the 3-position glycosyl group of ginsenoside Rh2 and F2 to be converted into ginsenoside. Rg3 and Rd ( Figure 4).
  • the catalytic activity of gGT29 and gGT29-3 is not affected by the glycosylation or hydroxyl configuration of ginsenoside 20, and 20(R)-Rh2 can be converted to 20(R)-Rg3 (Fig. 6).
  • the E. coli host cleavage supernatant expressing BvUGT73C10 (JQ291613) or UGTPg45 (KM401918) and the yeast host cleavage supernatant expressing gGT29 were used as an enzyme solution to co-catalyze the original ginseng diol (PPD).
  • the 100 ⁇ L reaction system is shown in Table 3. 40 ⁇ L of the 73.4 ⁇ L enzyme solution was the large intestine host lysis supernatant of BvUGT73C10, and the remaining 33.4 ⁇ L was the yeast host cleavage supernatant expressing gGT29.
  • the reaction was carried out at 35 ° C for 12 h, then 100 ⁇ L of butanol was added to terminate the reaction, and the product was extracted. The product was dried under vacuum and dissolved in methanol. The reaction product was first detected by thin layer chromatography (TLC) (Fig. 5). It can be seen that the combination of glycosyltransferase BvUGT73C10 and gGT29 or UGTPg45 and gGT29 can convert PPD to Rg3.
  • TLC thin layer chromatography
  • Glycosyltransferase BvUGT73C10 and gGT29 or a combination of 3GT2 and gGT29 can catalyze 20(R)-PPD to form 20(R)-Rg3 ( Figure 6).
  • cytochrome P450 reductase ATR2-1 (NP_849472.2) was integrated into the chromosome trp1 gene locus in the S. cerevisiae BY4742 chromosome (GAL1 promoter, using trp1 original terminator) to construct recombinant yeast A2.
  • Recombinant yeast needs to be supplemented
  • the corresponding amino acid was added (0.01% tryptophan, 0.01% leucine, 0.01% lysine).
  • the recombinant yeast A2 lysate was transferred to a 2 mL EP tube, 1 mL each tube, and an equal volume (1 mL) of n-butanol was added for about 30 min and then centrifuged at 12000 g for 10 min. Pipette the supernatant into a new EP tube. The n-butanol was evaporated to dryness at 45 ° C under vacuum. It was dissolved in 100 ⁇ L of methanol and used for HPLC detection.
  • the cell lysate of recombinant yeast A2 contained dammarene diol, protopanaxadiol (PPD) and ginsenoside active metabolite Rg3 ( Figures 8 and 9) by HPLC and LC-MS analysis.
  • the method of 6.2 is the same as 6.1, except that the glycosyltransferase BvUGT73C10 is used instead of UGTPg45 to obtain recombinant yeast A6.
  • the cell lysate of recombinant yeast A6 also contained dammarene diol, protopanaxadiol (PPD) and ginsenoside active metabolite Rg3 by HPLC analysis.
  • Example 7 Construction of Escherichia coli recombinant expression vector for glycosyltransferase genes gGT29-4, gGT29-5, gGT29-6 and gGT29-7
  • the plasmids gGT29-4-pMD18T, gGT29-5-pMD18T, gGT29-6-pMD18T and gGT29-7-pMD18T containing the gGT29-4, gGT29-5, gGT29-6 and gGT29-7 genes constructed in Example 1 were used as templates. Amplify the target gene.
  • the forward primer used for the gGT29-5 and gGT29-6 genes is shown in SEQ ID NO.: 66, and the sequence homologous to the vector pET28a is added to the 5' end: CTGGTGCCGCGCGGCAGC; the reverse primer used is SEQ ID NO.: As shown at 68, a sequence homologous to the vector pET28a was added to the 5' end: TGCGGCCGCAAGCTTGTC.
  • the forward primer used for the gGT29-4 and gGT29-7 genes is SEQ ID NO.: 67, and the sequence homologous to the vector pET28a is added to the 5' end: CTGGTGCCGCGCGGCAGC; the reverse primer used is SEQ ID NO.: 68, which is 5 The 18-base fragment homologous to the vector pET28a was added to the 'end: TGCGGGCCGAGAGCTTGTC.
  • the gGT29-4, gGT29-5, gGT29-6 and gGT29-7 genes were amplified by a PCR method using the above primers.
  • the amplified gene was selected from NEB's Q5 high-fidelity DNA polymerase.
  • the PCR program was set up according to the instructions: 98 ° C 30 s; 98 ° C 15 s, 58 ° C 30 s, 72 ° C 1 min, a total of 35 cycles; 72 ° C 2 min; 10 ° C insulation .
  • the linearized vector pET28a was obtained using SEQ ID NO.: 69 and SEQ ID NO.: 70 as the forward and reverse primer amplification vectors pET28a, respectively.
  • the linearized vector of amplified pET28a was also selected from NEB's Q5 high-fidelity DNA polymerase.
  • the PCR procedure was set up according to the instructions: 98 ° C for 30 s; 98 ° C for 15 s, 58 ° C for 30 s, 72 ° C for 3 min for 35 cycles; 72 ° C for 2 min. ; 10 ° C insulation.
  • the gGT29-4, gGT29-5, gGT29-6 and gGT29-7 gene PCR products and the linearized vector pET28a were detected by agarose gel electrophoresis, and the bands corresponding to the target DNA size were cut under ultraviolet light. The DNA fragment was then recovered from the agarose gel using Axygen's AxyPrep DNA Gel Extraction Kit.
  • BGclonart Seamless Cloning Kit Instructions of Nuojing Biotechnology Co., Ltd., the linearized pET28a vector fragment recovered, the recovered gGT29-4, gGT29-5, gGT29-6 and gGT29-7 gene fragments and Nuojing Biotechnology
  • the BGclonart seamless cloning reaction solution of the company was mixed in an appropriate ratio for a total of 20 ⁇ l. After mixing, the mixture was incubated at 50 ° C for 30 minutes, and then the mixed reaction solution was transferred to ice. E. coli EPI300 competent cells were transformed with 5 ⁇ l of the reaction solution and plated on LB plates supplemented with 50 ⁇ g/mL kanamycin.
  • E. coli expression vectors gGT29-4-pET28a, gGT29-5-pET28a, gGT29-6-pET28a and gGT29-7-pET28a constructed in Example 19 were transformed into commercially available E. coli BL21.
  • One recombinant was inoculated into LB medium, cultured at 30 ° C, 200 rpm to an OD 600 of about 0.6-0.8, the bacterial solution was cooled to 4 ° C, IPTG was added to a final concentration of 50 ⁇ M, and expression was induced for 15 h at 18 ° C at 200 rpm.
  • the cells were collected by centrifugation at 4 ° C, and the cells were sonicated, and the supernatant of the cell lysate was collected by centrifugation at 12,000 g at 4 ° C, and the sample was taken for SDS-PAGE electrophoresis ( FIG. 10 ).
  • gGT29-4-pET28a, gGT29-5-pET28a, gGT29-6-pET28a, gGT29-7-pET28a and gGT29-7-pET28a recombinant lysates and total protein and supernatant all have distinct protein bands (approximately 50KD), respectively representing sugar Base transferases gGT29-4, gGT29-5, gGT29-6 and gGT29-7. From the results of Western Blot (Fig. 11), it was also confirmed that the target proteins gGT29-4, gGT29-5, gGT29-6 and gGT29-7 achieved soluble expression in the host.
  • the recombinant yeast cleavage supernatant expressing gGT29-4, gGT29-5, gGT29-6 and gGT29-7 was used as an enzyme solution to catalyze the transglycosylation reaction of ginsenoside Rh2 and F2.
  • the 100 ⁇ L reaction system is shown in Table 6. The reaction was carried out at 35 ° C for 12 h, then 100 ⁇ L of butanol was added to terminate the reaction, and the product was extracted. The product was dried under vacuum and dissolved in methanol.
  • the reaction product was detected by thin layer chromatography (TLC).
  • TLC thin layer chromatography
  • the crude enzyme solution of gGT29-6 can further extend a sugar group on the 3-position glycosyl group of ginsenoside Rh2 and F2 to form ginsenoside Rg3 and Rd, respectively (Fig. 12
  • the crude enzyme solution of gGT29-4, gGT29-5 and gGT29-7 can further extend a glycosyl group at the 3-position glycosyl group of ginsenoside F2 to form saponin Rd, but they cannot catalyze the saponin Rh2 (Fig. 12).
  • the crude enzyme solution of gGT29-4, gGT29-5, gGT29-6 and gGT29-7 can also be extended on the C-6 glycosyl group of the original ginseng triol type saponin Rh1. Glycosyl group formed ginsenoside Rf (Fig. 13) in which gGT29-4, gGT29-5 and gGT29-6 were relatively weak, while gGT29-7 had relatively strong activity (Table 7).
  • Example 10 Glycosyltransferase genes gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29- Construction of 18 E. coli recombinant expression vector, expression in E. coli and identification of products
  • coli expression vector for gGT29-18 (gGT29-8-pET28a, gGT29-9-pET28a, gGT29-10-pET28a, gGT29-11-pET28a, gGT29-12-pET28a, gGT29-13-pET28a, gGT29-14-pET28a , gGT29-15-pET28a, gGT29-16-pET28a, gGT29-17-pET28a, gGT29-18-pET28a), and achieve soluble expression in E. coli.
  • the glycosyltransferase gGT29-7 can be a saccharide-based donor with UDP-glucose, and the C6-O-Glc of ginsenoside Rh1 is catalyzed to extend a glucose to produce ginsenoside Rf.
  • the specificity of the glycosyl donor was altered by site-directed mutagenesis of the glycosyl donor binding region (PSPG) of the glycosyltransferase gGT29-7.
  • Amino acid sequence alignment of rhamnosyltransferase which can be used to catalyze other glycosylation receptors, with gGT29-7, and screening and finally selecting two sites on the gGT29-7PSPG cassette for site-directed mutagenesis (N343G and A359P).
  • the PCR amplification procedure was: 95 ° C for 30 s; 95 ° C for 5 s, 56 ° C for 10 s, and 72 ° C for 1.5 min for a total of 30 cycles; to 10 ° C.
  • the PCR product was digested with DpnI and reacted in a water bath at 37 ° C for 2 h. Then, 3 ⁇ L of the recombinant enzyme EIInase in the kit was used for reconstitution at 37 ° C for 30 min to transform E. coli TOP 10 competent state. The monoclonal was picked and the plasmid was extracted, and the mutation site was verified by sequencing.
  • the obtained plasmid was named pET28a-gGT29-7-N343G.
  • the plasmid pET28a-gGT29-7-N343G was transformed into E. coli BL21 (DE3), and the recombinant strain pET28a-gGT29-7-N343G-BL21 was constructed, and the procedure for inducing expression was the same as in Example 8.
  • the A359 locus of gGT29-7 was subjected to point mutation according to conventional design and synthesis of two primers, and the mutant plasmid pET28a-gGT29-7-A359P and the recombinant strain gGT29-7-N343G-BL21 were constructed, and the procedure was the same as above.
  • the procedure for inducing expression was the same as in Example 8.
  • the single mutant protein gGT29-7-N343G catalyzes the activity of C6-O-Glc of Rh1 to extend a glucose to produce Rf
  • gGT29-7-A359P still retains C6 which catalyzes Rh1.
  • -O-Glc extends the enzyme activity of a glucose.
  • the double mutant protein gGT29-7-N343G/A359P not only retains the wild-type protein catalyzes the C6-O-Glc of Rh1 to extend the activity of a glucose-producing Rf, but also obtains a rhamnose that catalyzes the extension of C6-O-Glc of Rh1.
  • the enzyme activity of Rg2 is generated.
  • the recombinant E. coli supernatant expressing gGT29-10 and gGT29-14 was used as an enzyme solution to catalyze the transglycosylation reaction of ginsenoside Rh2 and F2.
  • the 100 ⁇ L reaction system is shown in Table 6, except that UDP-xylose was used instead of UDP-glucose as a glycosyl donor.
  • the reaction was carried out at 35 ° C for 12 h, then 100 ⁇ L of butanol was added to terminate the reaction, and the product was extracted. The product was dried under vacuum and dissolved in methanol.
  • reaction product was detected by thin layer chromatography (TLC).
  • TLC thin layer chromatography
  • gGT29-10 and gGT29-14 were able to extend a xylose in the C-3 glycosyl group of ginsenoside Rh2 to form a new triterpenoid saponin, (3-O- ⁇ ).
  • gGT29-10 and gGT29-14 can replace the second glucose of ginsenoside Rg3C-3 with xylose to form a new triterpenoid saponin ( 3-O- ⁇ -(D-xylopyranosyl)- ⁇ -(D-glucopyranosyl)-PPD); gGT29-10 and gGT29-14 can also replace the second glucose of ginsenoside Rd C-3 with xylose A new trisaponin (3-O- ⁇ -(D-xylopyranosyl)- ⁇ -(D-glucopyranosyl)-CK) (Fig. 15).
  • glycosyltransferases gGT29, gGT29-4, gGT29-5, gGT29-6 and gGT29-7gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-15, gGT29-16, gGT29-17 and gGT29-18, with UDP-xylose as a glycosyl donor, catalyze the activities of Rh2, Rg3 and Rd as shown in Table 8.
  • Recombinant yeast A5 was constructed by chromosomal trp1 gene locus in the yeast BY4742 chromosome (GAL1 promoter, using trp1 original terminator). Recombinant yeast requires additional amino acids (0.01% tryptophan, 0.01% leucine, 0.01% lysine).
  • the recombinant yeast A7 lysate was transferred to a 2 mL EP tube, 1 mL per tube, and an equal volume (1 mL) of n-butanol was added for about 30 min and then centrifuged at 12000 g for 10 min. Pipette the supernatant into a new EP tube. The n-butanol was evaporated to dryness at 45 ° C under vacuum. It was dissolved in 100 ⁇ L of methanol and used for HPLC detection.
  • the cell lysate of recombinant yeast A7 contained protopanaxatriol (PPT) and ginsenoside active metabolites Rh1 and Rf by HPLC analysis.
  • PPT protopanaxatriol
  • Rh1 and Rf ginsenoside active metabolites
  • GAL1 promoter using trp1 original terminator
  • UDP-L-rhamnose synthase RHM2 GQ292791.1
  • GAL10 promoter, FBA1 terminator GAL10 promoter, FBA1 terminator
  • the recombinant yeast A8 lysate was transferred to a 2 mL EP tube, 1 mL each tube, and an equal volume (1 mL) of n-butanol was added for about 30 min and then centrifuged at 12000 g for 10 min. Pipette the supernatant into a new EP tube. The n-butanol was evaporated to dryness at 45 ° C under vacuum. It was dissolved in 100 ⁇ L of methanol and used for HPLC detection.
  • the cell lysate of recombinant yeast A8 contained protopanaxatriol (PPT) and ginsenoside active metabolites Rh1 and Rg2 by HPLC analysis.

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Genetics & Genomics (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Biotechnology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Biochemistry (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • Developmental Biology & Embryology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Biophysics (AREA)
  • Medicinal Chemistry (AREA)
  • Botany (AREA)
  • General Chemical & Material Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Plant Pathology (AREA)
  • Cell Biology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Environmental Sciences (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Enzymes And Modification Thereof (AREA)

Abstract

The present invention relates to a group of glycosyltransferases and a use thereof. Specifically, the use of glycosyltransferase gGT29-7 and a polypeptide derived therefrom in catalytic glycosylation of a terpenoid and in synthesizing new saponins is provided. The glycosyltransferase is able to specifically and highly efficiently transfer a glycosyl from a glycosyl donor to a first glycosyl at the C-3 position and/or the C-6 position of a 4-ring triterpenoid, so as to extend the carbohydrate chain. The gylcosyltransferases of the present invention may also be used in constructing artificially synthesized ginsenosides and a variety of new ginsenosides and derivatives thereof.

Description

一组糖基转移酶及其应用A group of glycosyltransferases and their applications 技术领域Technical field
本发明涉及生物技术和植物生物学领域,具体地,本发明涉及一组新的糖基转移酶及其应用。The present invention relates to the field of biotechnology and plant biology, and in particular, the present invention relates to a novel set of glycosyltransferases and uses thereof.
背景技术Background technique
人参皂苷是从人参及其同属植物(如三七、西洋参等)中分离到的皂苷的总称,属于三萜类皂苷,是人参中的主要有效成份。目前,已经从人参中分离出了至少60种皂苷,其中一些人参皂苷被证实具有广泛的生理功能和药用价值:包括抗肿瘤、免疫调节、抗疲劳、护心、护肝等功能。Ginsenoside is a general term for saponins isolated from ginseng and its genus (such as Panax notoginseng, American ginseng, etc.) and belongs to triterpenoid saponins, which is the main active ingredient in ginseng. At present, at least 60 saponins have been isolated from ginseng, and some of them have been proven to have a wide range of physiological functions and medicinal properties: including anti-tumor, immune regulation, anti-fatigue, heart protection, liver protection and other functions.
人参皂苷Rg3、Rf和Rg2均为稀有人参皂苷,分别具有非常强大的上述生理活性。例如,人参皂苷Rg3具有很好的抗肿瘤活性,可以诱导肿瘤细胞凋亡,抑制肿瘤细胞转移。它与放疗和化疗结合试验,可以增强放疗和化疗的效果;人参皂苷Rf具有抗肿瘤,抗疲劳的功效,能够减少子宫收缩,且起到与脑神经细胞有关联的镇痛作用,同时具有调节磷脂蛋白代谢的生理功能;人参皂苷Rg2对阿尔茨海默病大鼠大脑具有保护作用,可以增强大鼠学习记忆能力,同时对心肌损伤具有修复作用,此外人参皂苷Rg2还具有保护细胞受到紫外线伤害的作用。The ginsenosides Rg3, Rf and Rg2 are all rare ginsenosides, each having a very strong physiological activity as described above. For example, ginsenoside Rg3 has excellent anti-tumor activity, can induce tumor cell apoptosis, and inhibit tumor cell metastasis. It can be combined with radiotherapy and chemotherapy to enhance the effects of radiotherapy and chemotherapy. Ginsenoside Rf has anti-tumor and anti-fatigue effects, can reduce uterine contraction, and has an analgesic effect associated with brain nerve cells. The physiological function of phospholipid protein metabolism; ginsenoside Rg2 has protective effect on the brain of rats with Alzheimer's disease, can enhance the learning and memory ability of rats, and has a repairing effect on myocardial injury. In addition, ginsenoside Rg2 also protects cells from UV damage. The role.
然而,由于这些低度糖基化的稀有人参皂苷在天然人参中非常容易被糖基转移酶修饰并产生含复杂糖链的人参皂苷,因此,这些具有高生物活性的人参皂苷在人参中含量极低。However, since these low-glycosylated dilute ginsenosides are very susceptible to modification by glycosyltransferases in natural ginseng and produce ginsenosides containing complex sugar chains, these highly bioactive ginsenosides are extremely high in ginseng. low.
目前,生产这类稀有人参皂苷的方法是从人参中的大量人参皂苷出发,通过选择性水解糖基的方法进行转化后再进行提取和纯化。以人参属植物的总皂苷或原人参二醇类皂苷为原料,通过化学,酶法转化、分离和提取。由于化学制备方法原料损失较大,操作繁琐,副产物多,从而导致成本增加,而且难以提高产率。此外,由于人参总皂苷的获得依赖于人参的种植,所以导致用传统方法生产的稀有人参皂苷单体市场价格高昂。At present, the method for producing such a rare ginsenoside is to start from a large amount of ginseng saponins in ginseng, and to perform extraction and purification by selectively hydrolyzing a glycosyl group. The saponins of the genus Panax species or the original ginseng diol saponins are used as raw materials, and are chemically and enzymatically transformed, separated and extracted. Due to the large loss of raw materials in the chemical preparation method, the operation is cumbersome, and there are many by-products, which leads to an increase in cost and difficulty in improving the yield. In addition, since the acquisition of ginseng total saponins depends on the cultivation of ginseng, the market price of rare ginsenosides produced by conventional methods is high.
目前本领域尚缺乏一种有效的生产稀有人参皂苷Rg3、Rf和Rg2的方法,因此迫切需要开发多种特异高效的糖基转移酶。At present, there is a lack of an effective method for producing rare ginsenosides Rg3, Rf and Rg2 in the field, and thus it is urgent to develop a variety of specific and efficient glycosyltransferases.
发明内容Summary of the invention
本发明的目的就是提供一组糖基转移酶及其应用。It is an object of the present invention to provide a group of glycosyltransferases and uses thereof.
在本发明的第一方面,提供了一种体外糖基化方法,包括步骤:In a first aspect of the invention, there is provided an in vitro glycosylation method comprising the steps of:
在糖基转移酶存在下,将糖基供体的糖基转移到四环三萜类化合物的以下位点上:The glycosyl donor glycosyl group is transferred to the following sites of the tetracyclic triterpenoid in the presence of a glycosyltransferase:
C-3和/或C-6位的第一个糖基上;On the first glycosyl group at the C-3 and/or C-6 position;
从而形成糖基化的四环三萜类化合物;Thereby forming a glycosylated tetracyclic triterpenoid;
其中,所述的糖基转移酶为如SEQ ID NO.:61所示的糖基转移酶或其衍生多肽。Wherein the glycosyltransferase is a glycosyltransferase as shown in SEQ ID NO.: 61 or a polypeptide derived therefrom.
在另一优选例中,所述的延伸糖链包括直接延伸或置换延伸。In another preferred embodiment, the extended sugar chain comprises a direct extension or a displacement extension.
在另一优选例中,所述的直接延伸为在C-3和/或C-6位的第一个糖基上添加一个糖基以延伸糖链。 In another preferred embodiment, the direct extension is to add a sugar group to the first sugar group at the C-3 and/or C-6 position to extend the sugar chain.
在另一优选例中,所述的置换延伸为将C-3和/或C-6位糖链的末端糖基置换成不同的糖基,从在C-3和/或C-6位的第一个糖基上延伸糖链。In another preferred embodiment, the substitution is extended to replace the terminal glycosyl group of the C-3 and/or C-6 sugar chain with a different glycosyl group from the C-3 and/or C-6 position. The sugar chain is extended on the first glycosyl group.
在另一优选例中,所述的衍生多肽选自:In another preferred embodiment, the derivative polypeptide is selected from the group consisting of:
将SEQ ID NO.:61所示氨基酸序列的多肽经过一个或几个氨基酸残基的取代、缺失或添加而形成的、或是添加信号肽序列后形成的、并具有糖基转移酶活性的衍生多肽;或Derivatization of a polypeptide of the amino acid sequence of SEQ ID NO.: 61 by substitution, deletion or addition of one or several amino acid residues, or addition of a signal peptide sequence, and having a glycosyltransferase activity Polypeptide; or
氨基酸序列与SEQ ID NO.:61氨基酸序列的同源性≥85%(较佳地≥90%、91%、92%、93%、94%、95%、96%、97%、98%、99%),并具有糖基转移酶活性的衍生多肽;The amino acid sequence has a homology of SEQ ID NO.: 61 amino acid sequence ≥ 85% (preferably ≥ 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%), and a derivative polypeptide having glycosyltransferase activity;
其中,所述糖基转移酶活性指能将糖基供体的糖基转移到四环三萜类化合物C-3和/或C-6位的第一个糖基上以延伸糖链的活性。Wherein the glycosyltransferase activity refers to the ability to transfer a glycosyl donor glycosyl group to the first glycosyl group of the tetracyclic triterpenoid C-3 and/or C-6 to extend the activity of the sugar chain .
在另一优选例中,所述的衍生多肽包括选自SEQ ID NO.:26、28、55、57、59、72、74、76、78、80、82、84、86、88、90、92、93、94或95的任一所示序列。In another preferred embodiment, the derived polypeptide comprises a SEQ ID NO.: 26, 28, 55, 57, 59, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, The sequence shown in any of 92, 93, 94 or 95.
本发明第二方面,提供了一种分离的多肽,所述的多肽选自下组:In a second aspect of the invention, an isolated polypeptide is provided, the polypeptide being selected from the group consisting of:
(a)具有SEQ ID NO.:61所示氨基酸序列的多肽;(a) a polypeptide having the amino acid sequence of SEQ ID NO.: 61;
(b)将SEQ ID NO.:61所示氨基酸序列的多肽经过一个或几个氨基酸残基的取代、缺失或添加而形成的、或是添加信号肽序列后形成的、并具有糖基转移酶活性的衍生多肽;(b) forming a polypeptide of the amino acid sequence of SEQ ID NO.: 61 by substitution, deletion or addition of one or several amino acid residues, or by adding a signal peptide sequence, and having a glycosyltransferase Active derivative polypeptide;
(c)序列中含有(a)或(b)中所述多肽序列的衍生多肽;(c) a derivative polypeptide comprising a polypeptide sequence as described in (a) or (b);
(d)氨基酸序列与SEQ ID NO.:61氨基酸序列的同源性≥85%(较佳地≥90%、91%、92%、93%、94%、95%、96%、97%、98%或99%),并具有糖基转移酶活性的衍生多肽;(d) homology of the amino acid sequence to the amino acid sequence of SEQ ID NO.: 61 ≥ 85% (preferably ≥ 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%), a derivative polypeptide having glycosyltransferase activity;
其中,所述糖基转移酶活性指能将糖基供体的糖基转移到四环三萜类化合物C-3和/或C-6位的第一个糖基上以延伸糖链的活性。Wherein the glycosyltransferase activity refers to the ability to transfer a glycosyl donor glycosyl group to the first glycosyl group of the tetracyclic triterpenoid C-3 and/or C-6 to extend the activity of the sugar chain .
在另一优选例中,所述的序列(c)为由(a)或(b)添加了标签序列、信号序列或分泌信号序列后所形成的融合蛋白。In another preferred embodiment, the sequence (c) is a fusion protein formed by adding a tag sequence, a signal sequence or a secretion signal sequence to (a) or (b).
在另一优选例中,所述的衍生多肽包括选自SEQ ID NO.:26、28、55、57、59、72、74、76、78、80、82、84、86、88、90、92、93、94或95所示氨基酸序列的多肽。In another preferred embodiment, the derived polypeptide comprises a SEQ ID NO.: 26, 28, 55, 57, 59, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, A polypeptide of the amino acid sequence shown in 92, 93, 94 or 95.
在本发明的第三方面,提供了一种分离的多核苷酸,所述的多核苷酸为选自下组的序列:In a third aspect of the invention, there is provided an isolated polynucleotide, the polynucleotide being a sequence selected from the group consisting of:
(A)编码权利要求3所述多肽的核苷酸序列;(A) a nucleotide sequence encoding the polypeptide of claim 3;
(B)编码如SEQ ID NO.:61所示多肽或其衍生多肽的核苷酸序列;(B) a nucleotide sequence encoding the polypeptide of SEQ ID NO.: 61 or a polypeptide derived therefrom;
(C)如SEQ ID NO.:60所示的核苷酸序列;(C) a nucleotide sequence as shown in SEQ ID NO.: 60;
(D)与SEQ ID NO.:60所示序列的同源性≥90%(较佳地≥91%、92%、93%、94%、95%、96%、97%、98%或99%)的核苷酸序列;(D) homology to the sequence set forth in SEQ ID NO.: 60 ≥ 90% (preferably ≥ 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99) %) nucleotide sequence;
(E)在SEQ ID NO.:60所示核苷酸序列的5’端和/或3’端截短或添加1-60个(较佳地1-30,更佳地1-10个)核苷酸所形成的核苷酸序列;(E) truncating or adding 1-60 (preferably 1-30, more preferably 1-10) at the 5' end and/or the 3' end of the nucleotide sequence shown by SEQ ID NO. a nucleotide sequence formed by a nucleotide;
(F)与(A)-(E)任一所述的核苷酸序列互补的核苷酸序列。(F) a nucleotide sequence complementary to the nucleotide sequence of any of (A) to (E).
在另一优选例中,所述的核苷酸的序列如SEQ ID NO.:25、27、54、56、58、60、71、73、75、77、79、81、83、85、87、89、或91所示。 In another preferred embodiment, the nucleotide sequence is SEQ ID NO.: 25, 27, 54, 56, 58, 60, 71, 73, 75, 77, 79, 81, 83, 85, 87 , 89, or 91.
在另一优选例中,序列如SEQ ID NO.:25、27、54、56、58、60、71、73、75、77、79、81、83、85、87、89、或91所示的多核苷酸编码氨基酸序列分别如SEQ ID NO.:26、28、55、57、59、61、72、74、76、78、80、82、84、86、88、90、92、93、94或95所示的多肽。In another preferred embodiment, the sequence is as shown in SEQ ID NO.: 25, 27, 54, 56, 58, 60, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, or 91. The polynucleotide encoding amino acid sequences are SEQ ID NO.: 26, 28, 55, 57, 59, 61, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 93, respectively. The polypeptide shown in 94 or 95.
在本发明的第四方面,提供了一种载体,所述的载体含有第三方面所述的多核苷酸。较佳地,所述载体包括表达载体、穿梭载体、整合载体。In a fourth aspect of the invention, a vector comprising the polynucleotide of the third aspect is provided. Preferably, the vector comprises an expression vector, a shuttle vector, an integration vector.
在本发明的第五方面,提供了本发明第一或第二方面所述分离的多肽的用途,它被用于催化以下一种或多种反应,或被用于制备催化以下一种或多种反应的催化制剂:将来自糖基供体的糖基转移到四环三萜类化合物的C-3位或C6位的第一个糖基上以延伸糖链;优选地,In a fifth aspect of the invention, there is provided the use of the isolated polypeptide of the first or second aspect of the invention, which is used to catalyze one or more of the following reactions, or is used to prepare one or more of the following Catalytic preparation for the reaction: transferring a glycosyl group derived from a glycosyl donor to the first glycosyl group at the C-3 position or the C6 position of the tetracyclic triterpenoid to extend the sugar chain; preferably,
它被用于催化以下一种或多种体外反应,或被用于制备催化以下一种或多种反应的催化制剂:It is used to catalyze one or more of the following in vitro reactions, or to prepare a catalytic formulation that catalyzes one or more of the following reactions:
(i)将来自糖基供体的糖基转移到四环三萜类化合物的C-3位的第一个糖基上,延伸糖链;(i) transferring a glycosyl group derived from a glycosyl donor to a first glycosyl group at the C-3 position of the tetracyclic triterpenoid, extending the sugar chain;
(ii)将来自糖基供体的糖基转移到四环三萜类化合物的C-6位的第一个糖基上,延伸糖链;(ii) transferring a glycosyl group derived from a glycosyl donor to a first glycosyl group at the C-6 position of the tetracyclic triterpenoid, extending the sugar chain;
(iii)将来之糖基供体的糖基与四环三萜类化合物的C-6位糖链的末端糖基进行置换,从在C-3和/或C-6位的第一个糖基上延伸糖链。(iii) replacement of the glycosyl group of the glycosyl donor in the future with the terminal glycosyl group of the C-6 sugar chain of the tetracyclic triterpenoid, from the first sugar at the C-3 and/or C-6 position The sugar chain is extended on the base.
在另一优选例中,所述的糖基供体包括选自下组的核苷二磷酸糖:UDP-葡萄糖,ADP-葡萄糖,TDP-葡萄糖,CDP-葡萄糖,GDP-葡萄糖,UDP-乙酰基葡萄糖,ADP-乙酰基葡萄糖,TDP-乙酰基葡萄糖,CDP-乙酰基葡萄糖,GDP-乙酰基葡萄糖,UDP-木糖,ADP-木糖,TDP-木糖,CDP-木糖,GDP-木糖,UDP-木糖,UDP-半乳糖醛酸,ADP-半乳糖醛酸,TDP-半乳糖醛酸,CDP-半乳糖醛酸,GDP-半乳糖醛酸,UDP-半乳糖,ADP-半乳糖,TDP-半乳糖,CDP-半乳糖,GDP-半乳糖,UDP-阿拉伯糖,ADP-阿拉伯糖,TDP-阿拉伯糖,CDP-阿拉伯糖,GDP-阿拉伯糖,UDP-鼠李糖,ADP-鼠李糖,TDP-鼠李糖,CDP-鼠李糖,GDP-鼠李糖,或其他核苷二磷酸己糖或核苷二磷酸戊糖,或其组合。In another preferred embodiment, the glycosyl donor comprises a nucleoside diphosphate selected from the group consisting of UDP-glucose, ADP-glucose, TDP-glucose, CDP-glucose, GDP-glucose, UDP-acetyl Glucose, ADP-acetylglucose, TDP-acetylglucose, CDP-acetylglucose, GDP-acetylglucose, UDP-xylose, ADP-xylose, TDP-xylose, CDP-xylose, GDP-xylose , UDP-xylose, UDP-galacturonic acid, ADP-galacturonic acid, TDP-galacturonic acid, CDP-galacturonic acid, GDP-galacturonic acid, UDP-galactose, ADP-galactose , TDP-galactose, CDP-galactose, GDP-galactose, UDP-arabinose, ADP-arabinose, TDP-arabinose, CDP-arabinose, GDP-arabinose, UDP-rhamnose, ADP-rat Liose, TDP-rhamnose, CDP-rhamnose, GDP-rhamnose, or other nucleoside diphosphate hexose or nucleoside pentose pentose, or a combination thereof.
在另一优选例中,所述的糖基供体包括选自下组的尿苷二磷酸(UDP)糖:UDP-葡萄糖,UDP-木糖,UDP-半乳糖醛酸,UDP-半乳糖,UDP-阿拉伯糖,UDP-鼠李糖,或其他尿苷二磷酸己糖或尿苷二磷酸戊糖,或其组合。In another preferred embodiment, the glycosyl donor comprises a uridine diphosphate (UDP) sugar selected from the group consisting of UDP-glucose, UDP-xylose, UDP-galacturonic acid, UDP-galactose, UDP-arabinose, UDP-rhamnose, or other uridine diphosphate hexose or uridine pentose diphosphate, or a combination thereof.
在另一优选例中,所述分离的多肽用于催化下述一种或多种反应或被用于制备催化下述一种或多种反应的催化制剂:In another preferred embodiment, the isolated polypeptide is used to catalyze one or more of the following reactions or to prepare a catalytic formulation that catalyzes one or more of the following reactions:
(A)(A)
Figure PCTCN2015081111-appb-000001
Figure PCTCN2015081111-appb-000001
其中,R1为糖基;R2和R3为OH或者H;R4为糖基或者H;R5为糖基,R5-R1-O为C3第一个糖基衍生的糖基,所述的多肽选自SEQ ID NO.:61所示的多肽或其衍生多肽;优选地,选自26、28、55、57、59、72、74、76、78、80、82、84、86、88、90、92、93、94或95或其衍生多肽。Wherein R1 is a glycosyl group; R2 and R3 are OH or H; R4 is a glycosyl group or H; R5 is a glycosyl group, and R5-R1-O is a C3 first glycosyl group-derived glycosyl group, said polypeptide being selected from the group consisting of a polypeptide represented by SEQ ID NO.: 61 or a polypeptide derived therefrom; preferably, selected from the group consisting of 26, 28, 55, 57, 59, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 93, 94 or 95 or a polypeptide derived therefrom.
R1-R4经取代后的化合物如下表所示:The substituted compounds of R1-R4 are shown in the following table:
底物Substrate R1R1 R2R2 R3R3 R4R4
Rh2Rh2 糖基Glycosyl HH OHOH HH
F2F2 糖基Glycosyl HH OHOH 糖基Glycosyl
即当R1为葡萄糖基;R2为H,R3为OH,R4为H,式(I)化合物为Rh2。That is, when R1 is a glucosyl group; R2 is H, R3 is OH, R4 is H, and the compound of the formula (I) is Rh2.
R1为葡萄糖基;R2为H,R3为OH,R4为葡萄糖基,式(I)化合物为F2。R1 is a glucosyl group; R2 is H, R3 is OH, R4 is a glucosyl group, and the compound of formula (I) is F2.
当以UDP-葡糖糖作为糖基供体时,底物(I)化合物为Rh2,则产物式(II)化合物为Rg3;底物(I)化合物为F2,则产物式(II)化合物为Rd;When UDP-glucose is used as the glycosyl donor, the substrate (I) compound is Rh2, then the product of formula (II) is Rg3; and the substrate (I) compound is F2, then the product of formula (II) is Rd;
当以UDP-木糖作为糖基供体时,底物(I)化合物为Rh2,则产物式(II)化合物为3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-PPD;底物(I)化合物为F2,则产物式(II)化合物为3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-CK。When UDP-xylose is used as the glycosyl donor, the substrate (I) compound is Rh2, and the product of formula (II) is 3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)- PPD; the substrate (I) compound is F2, and the product of formula (II) is 3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-CK.
(B)(B)
Figure PCTCN2015081111-appb-000002
Figure PCTCN2015081111-appb-000002
其中,R1和R2为H或者糖基,R3和R4为糖基。R3-R4-O为C6第一个糖基衍生的糖基,所述的多肽选自SEQ ID NO.:61所示多肽或其衍生多肽,优选地,为SEQ ID NO.:55,57,59,78,82,92,94或者95或其衍生多肽。Wherein R1 and R2 are H or a glycosyl group, and R3 and R4 are a glycosyl group. R3-R4-O is the first glycosyl-derived glycosyl group of C6, and the polypeptide is selected from the polypeptide of SEQ ID NO.: 61 or a polypeptide derived therefrom, preferably SEQ ID NO.: 55, 57, 59, 78, 82, 92, 94 or 95 or a polypeptide derived therefrom.
当R1和R2为H,R3为葡萄糖基时,式(III)化合物为Rh1。When R1 and R2 are H and R3 is a glucosyl group, the compound of formula (III) is Rh1.
当以UDP-葡糖糖作为糖基供体时,底物(III)化合物为Rh1,则产物式(IV)化合物为Rf;When UDP-glucose is used as the glycosyl donor, the substrate (III) compound is Rh1, then the product of formula (IV) is Rf;
当以UDP-鼠李糖作为糖基供体时,底物(III)化合物为Rh1,则产物式(IV)化合物为Rg2。When UDP-rhamnose is used as the glycosyl donor, the substrate (III) compound is Rh1, and the product of formula (IV) is Rg2.
(C)(C)
Figure PCTCN2015081111-appb-000003
Figure PCTCN2015081111-appb-000003
Figure PCTCN2015081111-appb-000004
Figure PCTCN2015081111-appb-000004
其中,R1为糖基;R2和R3为OH或者H;R4为糖基或者H;R5为糖基,R5-R1-O为C3第一个糖基衍生的糖基;R6为糖基,R6-R1-O为C3第一个糖基衍生的糖基,所述的多肽选自SEQ ID NO.:61所示多肽或其衍生多肽,优选为SEQ ID NO.:26、28、59、76、84、86或88所示的多肽。Wherein R1 is a glycosyl group; R2 and R3 are OH or H; R4 is a glycosyl group or H; R5 is a glycosyl group, R5-R1-O is a glycosyl group derived from the first glycosyl group of C3; R6 is a glycosyl group, R6 -R1-O is the first glycosyl-derived glycosyl group of C3, said polypeptide being selected from the polypeptide of SEQ ID NO.: 61 or a polypeptide derived therefrom, preferably SEQ ID NO.: 26, 28, 59, 76 a polypeptide as shown in 84, 86 or 88.
R1为两个葡萄糖基,R2为H,R3为OH,R4为H,式(V)化合物为Rg3。R1 is two glucosyl groups, R2 is H, R3 is OH, R4 is H, and the compound of formula (V) is Rg3.
R1为两个葡萄糖基,R2为H,R3为OH,R4为葡萄糖基,式(V)化合物为RdR1 is two glucosyl groups, R2 is H, R3 is OH, R4 is a glucosyl group, and the compound of formula (V) is Rd.
当以UDP-木糖作为糖基供体时,底物(V)化合物为Rg3,则产物式(VI)化合物为3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-PPD;底物(V)化合物为Rd,则产物式(VI)化合物为3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-CK。When UDP-xylose is used as the glycosyl donor, the substrate (V) compound is Rg3, and the product of formula (VI) is 3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)- PPD; the substrate (V) compound is Rd, and the product of formula (VI) is 3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-CK.
在另一优选例中,所述的糖基选自:葡萄糖基、半乳糖醛酸基、木糖糖基,半乳糖基、阿拉伯糖基、鼠李糖基,以及其他己糖基或戊糖基。In another preferred embodiment, the glycosyl group is selected from the group consisting of: glucosyl, galacturonic acid, xylose, galactosyl, arabinose, rhamnosyl, and other hexose or pentose base.
在另一优选例中,所述反应式(I)、或(III)化合物包括(但不限于):S构型或R构型的达玛烷型四环三萜类化合物、羊毛脂烷型四环三萜类化合物、apotirucallane型四环三萜、甘遂烷型四环三萜类化合物、环阿屯烷(环阿尔廷烷)型四环三萜类化合物、葫芦烷四环三萜类化合物、或楝烷型四环三萜类化合物。In another preferred embodiment, the compound of the formula (I) or (III) includes, but is not limited to, a dammarane type tetracyclic triterpenoid of the S configuration or the R configuration, a lanolinane type Tetracyclic triterpenoids, apotirucallane type tetracyclic triterpenes, ganthanane type tetracyclic triterpenoids, cycloalkane (cycloaltenane) type tetracyclic triterpenoids, cucurbitane tetracyclic triterpenoids a compound or a decane type tetracyclic triterpenoid.
在另一优选例中,所述的多肽选自下组:In another preferred embodiment, the polypeptide is selected from the group consisting of:
(a)具有SEQ ID NO.:26、28、55、57、59、61、72、74、76、78、80、82、84、86、88、90、92、93、94或95中任一条所示氨基酸序列的多肽;(a) having any of SEQ ID NO.: 26, 28, 55, 57, 59, 61, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 93, 94 or 95 a polypeptide of the indicated amino acid sequence;
(b)将SEQ ID NO.:26、28、55、57、59、61、72、74、76、78、80、82、84、86、88、90、92、93、94或95中任一条所示氨基酸序列的多肽经过一个或多个氨基酸残基的取代、缺失或添加而形成的、或是添加信号肽序列后形成的、并具有糖基转移酶活性的衍生多肽;(b) SEQ ID NO.: 26, 28, 55, 57, 59, 61, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 93, 94 or 95 a polypeptide of the indicated amino acid sequence formed by substitution, deletion or addition of one or more amino acid residues, or a derivative polypeptide formed by adding a signal peptide sequence and having glycosyltransferase activity;
(c)序列中含有(a)或(b)中所述多肽序列的衍生多肽;(c) a derivative polypeptide comprising a polypeptide sequence as described in (a) or (b);
(d)氨基酸序列与SEQ ID NO:26、28、55、57、59、61、72、74、76、78、80、82、84、86、88、90、92、93、94或95中任一条中所示氨基酸序列的同源性≥85%或≥90%(较佳地≥95%),并具有糖基转移酶活性的衍生多肽。(d) Amino acid sequence and SEQ ID NO: 26, 28, 55, 57, 59, 61, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 93, 94 or 95 A derivative polypeptide having a homologity of the amino acid sequence shown in any one of ≥85% or ≥90% (preferably ≥95%) and having glycosyltransferase activity.
在另一优选例中,编码所述多肽的核苷酸所述的多核苷酸为选自下组的序列:In another preferred embodiment, the polynucleotide encoding the nucleotide of the polypeptide is a sequence selected from the group consisting of:
(A)编码本发明第二方面所述多肽的核苷酸序列;(A) a nucleotide sequence encoding the polypeptide of the second aspect of the invention;
(B)编码如SEQ ID NO.:26、28、55、57、59、61、72、74、76、78、80、82、84、86、88、90、92、93、94或95所示多肽的核苷酸序列;(B) encoded as SEQ ID NO.: 26, 28, 55, 57, 59, 61, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 93, 94 or 95 Showing the nucleotide sequence of the polypeptide;
(C)如SEQ ID NO.:25、27、54、56、58、60、71、73、75、77、79、81、83、85、87、89、或91所示的核苷酸序列;(C) a nucleotide sequence as shown in SEQ ID NO.: 25, 27, 54, 56, 58, 60, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, or 91 ;
(D)与SEQ ID NO.:25、27、54、56、58、60、71、73、75、77、79、81、83、85、87、89、或91所示序列的同源性≥85%(较佳地较佳地≥90%、91%、92%、93%、94%、95%、96%、97%、98%或99%)的核苷酸序列;(D) Homology to the sequence set forth in SEQ ID NO.: 25, 27, 54, 56, 58, 60, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, or 91 a nucleotide sequence of ≥ 85% (preferably preferably ≥ 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%);
(E)在SEQ ID NO.:25、27、54、56、58、60、71、73、75、77、79、81、83、85、87、89、或91所示核苷酸序列的5’端和/或3’端截短或添加1-60个(较佳地1-30,更佳地1-10个)核苷酸所形成的核苷酸序列;(E) a nucleotide sequence represented by SEQ ID NO.: 25, 27, 54, 56, 58, 60, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, or 91 a nucleotide sequence formed by truncating the 5' end and/or the 3' end or adding 1-60 (preferably 1-30, more preferably 1-10) nucleotides;
(F)与(A)-(E)任一所述的核苷酸序列互补(较佳地完全互补)的核苷酸序列。(F) a nucleotide sequence that is complementary (preferably fully complementary) to the nucleotide sequence of any of (A)-(E).
在另一优选例中,所述的核苷酸的序列如SEQ ID NO.:25、27、54、56、58、 60、71、73、75、77、79、81、83、85、87、89、或91所示。In another preferred embodiment, the sequence of the nucleotide is SEQ ID NO.: 25, 27, 54, 56, 58, 60, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, or 91.
在另一优选例中,序列如SEQ ID NO.:25、27、54、56、58、60、71、73、75、77、79、81、83、85、87、89、或91所示的多核苷酸编码氨基酸序列分别如SEQ ID NO.:26、28、55、57、59、61、72、74、76、78、80、82、84、86、88、90、92、93、94或95所示的多肽。In another preferred embodiment, the sequence is as shown in SEQ ID NO.: 25, 27, 54, 56, 58, 60, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, or 91. The polynucleotide encoding amino acid sequences are SEQ ID NO.: 26, 28, 55, 57, 59, 61, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 93, respectively. The polypeptide shown in 94 or 95.
在本发明的第六方面,提供了一种进行糖基转移催化反应的方法,包括步骤:在本发明第二方面所述的多肽或其衍生多肽存在的条件下,进行糖基转移催化反应。In a sixth aspect of the invention, there is provided a method of performing a glycosyl transfer catalytic reaction comprising the steps of performing a glycosyl transfer catalytic reaction in the presence of the polypeptide of the second aspect of the invention or a polypeptide derived therefrom.
在另一优选例中,所述的方法还包括步骤:In another preferred embodiment, the method further includes the steps of:
在糖基供体以及如本发明第二方面所述多肽及其衍生多肽的存在下,或式(I)化合物转化为所述的式(II)化合物,或式(III)化合物转化为所述的式(IV)化合物,或式(V)化合物转化为所述的式(VI)化合物;In the presence of a glycosyl donor and a polypeptide according to the second aspect of the invention and a polypeptide derived therefrom, or a compound of formula (I) is converted to said compound of formula (II), or a compound of formula (III) is converted to said a compound of formula (IV), or a compound of formula (V), is converted to the compound of formula (VI);
在另一优选例中,所述的方法还包括将所述的多肽及其衍生多肽分别加入催化反应;和/或In another preferred embodiment, the method further comprises separately adding the polypeptide and the polypeptide derived therefrom to a catalytic reaction; and/or
将所述的多肽及其衍生多肽同时加入催化反应。The polypeptide and its derived polypeptide are simultaneously added to the catalytic reaction.
在另一优选例中,所述的方法还包括将编码糖基转移酶的核苷酸序列与达玛稀二醇和/或原人参二醇和/或原人参三醇合成代谢途径中的关键基因和/或其他糖基转移酶基因在宿主细胞中共表达,从而获得所述的式(II)、(IV)、或(VI)化合物。In another preferred embodiment, the method further comprises ligating a nucleotide sequence encoding a glycosyltransferase with a key gene in a anabolic pathway of dammar diol and/or protopanaxadiol and/or protosol. / or other glycosyltransferase gene is co-expressed in a host cell to obtain the compound of formula (II), (IV), or (VI).
在另一优选例中,所述的宿主细胞为酵母菌或大肠杆菌。In another preferred embodiment, the host cell is a yeast or Escherichia coli.
在另一优选例中,所述的多肽为具有SEQ ID NO.:26、28、55、57、59、61、72、74、76、78、80、82、84、86、88、90、92、93、94或95所示氨基酸序列的多肽及其衍生多肽。In another preferred embodiment, the polypeptide has SEQ ID NO.: 26, 28, 55, 57, 59, 61, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, A polypeptide of the amino acid sequence shown in 92, 93, 94 or 95 and a polypeptide derived therefrom.
在另一优选例中,编码所述多肽的核苷酸序列如SEQ ID NO.:25、27、54、56、58、60、71、73、75、77、79、81、83、85、87、89、或91所示。In another preferred embodiment, the nucleotide sequence encoding the polypeptide is SEQ ID NO.: 25, 27, 54, 56, 58, 60, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, or 91 is shown.
在另一优选例中,所述方法还包括:向反应体系中提供用于调节酶活性的添加物。In another preferred embodiment, the method further comprises: providing an additive for regulating enzyme activity to the reaction system.
在另一优选例中,所述的用于调节酶活性的添加物是:提高酶活性或抑制酶活性的添加物。In another preferred embodiment, the additive for regulating enzyme activity is an additive that increases enzyme activity or inhibits enzyme activity.
在另一优选例中,所述的用于调节酶活性的添加物选自下组:Ca2+、Co2+、Mn2+、Ba2+、Al3+、Ni2+、Zn2+、或Fe2+In another preferred embodiment, the additive for regulating enzyme activity is selected from the group consisting of Ca 2+ , Co 2+ , Mn 2+ , Ba 2+ , Al 3+ , Ni 2+ , Zn 2+ . , or Fe 2+ .
在另一优选例中,所述的用于调节酶活性的添加物是:可以生成Ca2+、Co2+、Mn2+、Ba2+、Al3+、Ni2+、Zn2+、或Fe2+的物质。In another preferred embodiment, the additive for regulating enzyme activity is: Ca 2+ , Co 2+ , Mn 2+ , Ba 2+ , Al 3+ , Ni 2+ , Zn 2+ , Or a substance of Fe 2+ .
在另一优选例中,所述的糖基供体是核苷二磷酸糖,选自下组:UDP-葡萄糖,ADP-葡萄糖,TDP-葡萄糖,CDP-葡萄糖,GDP-葡萄糖,UDP-乙酰基葡萄糖,ADP-乙酰基葡萄糖,TDP-乙酰基葡萄糖,CDP-乙酰基葡萄糖,GDP-乙酰基葡萄糖,UDP-木糖,ADP-木糖,TDP-木糖,CDP-木糖,GDP-木糖,UDP-木糖,UDP-半乳糖醛酸,ADP-半乳糖醛酸,TDP-半乳糖醛酸,CDP-半乳糖醛酸,GDP-半乳糖醛酸,UDP-半乳糖,ADP-半乳糖,TDP-半乳糖,CDP-半乳糖,GDP-半乳糖,UDP-阿拉伯糖,ADP-阿拉伯糖,TDP-阿拉伯糖,CDP-阿拉伯糖,GDP-阿拉伯糖,UDP-鼠李糖,ADP-鼠李糖,TDP-鼠李糖,CDP-鼠李糖,GDP-鼠李糖,或其他核苷二磷酸己糖或核苷二磷酸戊糖,或其组合。 In another preferred embodiment, the glycosyl donor is a nucleoside diphosphate sugar selected from the group consisting of UDP-glucose, ADP-glucose, TDP-glucose, CDP-glucose, GDP-glucose, UDP-acetyl Glucose, ADP-acetylglucose, TDP-acetylglucose, CDP-acetylglucose, GDP-acetylglucose, UDP-xylose, ADP-xylose, TDP-xylose, CDP-xylose, GDP-xylose , UDP-xylose, UDP-galacturonic acid, ADP-galacturonic acid, TDP-galacturonic acid, CDP-galacturonic acid, GDP-galacturonic acid, UDP-galactose, ADP-galactose , TDP-galactose, CDP-galactose, GDP-galactose, UDP-arabinose, ADP-arabinose, TDP-arabinose, CDP-arabinose, GDP-arabinose, UDP-rhamnose, ADP-rat Liose, TDP-rhamnose, CDP-rhamnose, GDP-rhamnose, or other nucleoside diphosphate hexose or nucleoside pentose pentose, or a combination thereof.
在另一优选例中,所述的糖基供体是尿苷二磷酸糖,选自下组:UDP-葡萄糖,UDP-木糖,UDP-半乳糖醛酸,UDP-半乳糖,UDP-阿拉伯糖,UDP-鼠李糖,或其他尿苷二磷酸己糖或尿苷二磷酸戊糖,或其组合。In another preferred embodiment, the glycosyl donor is uridine diphosphate, selected from the group consisting of UDP-glucose, UDP-xylose, UDP-galacturonic acid, UDP-galactose, UDP-Arabic Sugar, UDP-rhamnose, or other uridine diphosphate hexose or uridine pentose diphosphate, or a combination thereof.
在另一优选例中,反应体系的pH为:pH4.0-10.0,优选pH为5.5-9.0。In another preferred embodiment, the pH of the reaction system is: pH 4.0 to 10.0, preferably pH 5.5 to 9.0.
在另一优选例中,反应体系的温度为:10℃-105℃,优选20℃-50℃。In another preferred embodiment, the temperature of the reaction system is from 10 ° C to 105 ° C, preferably from 20 ° C to 50 ° C.
在另一优选例中,所述的达玛稀二醇合成代谢途径中的关键基因包括(但不限于):达玛烯二醇合成酶基因。In another preferred embodiment, the key genes in the darumadiol anabolic pathway include, but are not limited to, the damasenediol synthase gene.
在另一优选例中,所述的原人参二醇合成代谢途径中的关键基因包括(但不限于):达玛烯二醇合成酶基因、细胞色素P450 CYP716A47基因和其的还原酶基因,或其组合。。In another preferred embodiment, the key genes in the proto-ginsengdiol anabolic pathway include, but are not limited to, a dammarenediol synthase gene, a cytochrome P450 CYP716A47 gene, and a reductase gene thereof, or Its combination. .
在另一优选例中,所述的原人参三醇合成代谢途径中的关键基因包括(但不限于):达玛烯二醇合成酶基因、细胞色素P450 CYP716A47基因及其还原酶基因、及细胞色素P450 CYP716A53V2基因及其还原酶基因,或其组合。In another preferred embodiment, the key genes in the proto-ginsolic triol anabolic pathway include, but are not limited to, a dammarenediol synthase gene, a cytochrome P450 CYP716A47 gene, a reductase gene thereof, and a cell. Pigment P450 CYP716A53V2 gene and its reductase gene, or a combination thereof.
在另一优选例中,所述的人参皂苷Rh2合成代谢途径中的关键基因包括(但不限于):达玛烯二醇合成酶基因、细胞色素P450 CYP716A47基因及其还原酶基因、四环三萜C-3位糖基转移酶基因UGTPg45,或其组合。In another preferred embodiment, the key genes in the ginsenoside Rh2 anabolic pathway include, but are not limited to, a dammarene diol synthase gene, a cytochrome P450 CYP716A47 gene and a reductase gene thereof, and a tetracyclic three萜C-3 position glycosyltransferase gene UGTPg45, or a combination thereof.
在另一优选例中,所述的人参皂苷Rh1合成代谢途径中的关键基因包括(但不限于):达玛烯二醇合成酶基因、细胞色素P450 CYP716A47基因及其还原酶基因、及细胞色素P450 CYP716A53V2基因及其还原酶基因、四环三萜C-6位的糖基转移酶基因UGTPg100,或其组合。In another preferred embodiment, the key genes in the ginsenoside Rh1 anabolic pathway include, but are not limited to, a dammarene diol synthase gene, a cytochrome P450 CYP716A47 gene, a reductase gene thereof, and a cytochrome. P450 CYP716A53V2 gene and its reductase gene, tetracyclic triterpenoid C-6 glycosyltransferase gene UGTPg100, or a combination thereof.
在另一优选例中,所述糖基催化反应的底物分别为式(I)、或(III)化合物,且所述的产物分别为(II)、或(IV)化合物;In another preferred embodiment, the substrate of the glycosyl catalyzed reaction is a compound of the formula (I) or (III), respectively, and the product is a compound of (II) or (IV), respectively;
在另一优选例中,所述的式(I)化合物为人参皂苷Rh2,并且式(II)化合物为人参皂苷Rg3;In another preferred embodiment, the compound of formula (I) is ginsenoside Rh2, and the compound of formula (II) is ginsenoside Rg3;
或,所述的式(I)化合物为人参皂苷Rh2,并且式(II)化合物为3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-PPD;Or, the compound of the formula (I) is ginsenoside Rh2, and the compound of the formula (II) is 3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-PPD;
或,所述的式(I)化合物为人参皂苷F2,并且式(II)化合物为人参皂苷Rd;Or the compound of formula (I) is ginsenoside F2, and the compound of formula (II) is ginsenoside Rd;
或,所述的式(I)化合物为人参皂苷F2,并且式(II)化合物为3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-CK;Or, the compound of the formula (I) is ginsenoside F2, and the compound of the formula (II) is 3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-CK;
或,所述的式(III)化合物为人参皂苷Rh1,并且式(IV)化合物为人参皂苷Rf;Or the compound of the formula (III) is ginsenoside Rh1, and the compound of the formula (IV) is ginsenoside Rf;
或,所述的式(III)化合物为人参皂苷Rh1,并且式(IV)化合物为人参皂苷Rg2;Or, the compound of formula (III) is ginsenoside Rh1, and the compound of formula (IV) is ginsenoside Rg2;
或,所述的式(V)化合物为人参皂苷Rg3,并且式(VI)化合物为3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-PPD;Or, the compound of the formula (V) is ginsenoside Rg3, and the compound of the formula (VI) is 3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-PPD;
或,所述的式(V)化合物为人参皂苷Rd,并且式(IV)化合物为3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-CK。在本发明的第七方面,提供了一种遗传工程化的宿主细胞,所述的宿主细胞含有本发明第四方面所述的载体,或其基因组中整合本发明第三方面所述的多核苷酸。Alternatively, the compound of formula (V) is ginsenoside Rd, and the compound of formula (IV) is 3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-CK. In a seventh aspect of the invention, a genetically engineered host cell comprising the vector of the fourth aspect of the invention, or a genome thereof, which integrates the polynucleoside of the third aspect of the invention acid.
在另一优选例中,所述的糖基转移酶为如本发明第二方面中所述的多肽或其衍生多肽。In another preferred embodiment, the glycosyltransferase is a polypeptide as described in the second aspect of the invention or a polypeptide derived therefrom.
在另一优选例中,编码所述糖基转移酶的核苷酸序列如本发明第三方面所述。In another preferred embodiment, the nucleotide sequence encoding the glycosyltransferase is as described in the third aspect of the invention.
在另一优选例中,所述的细胞为原核细胞或真核细胞。In another preferred embodiment, the cell is a prokaryotic cell or a eukaryotic cell.
在另一优选例中,所述的宿主细胞为真核细胞,如酵母细胞或植物细胞。 In another preferred embodiment, the host cell is a eukaryotic cell, such as a yeast cell or a plant cell.
在另一优选例中,所述的宿主细胞为酿酒酵母细胞。In another preferred embodiment, the host cell is a Saccharomyces cerevisiae cell.
在另一优选例中,所述的宿主细胞原核细胞,如大肠杆菌。In another preferred embodiment, the host cell is a prokaryotic cell, such as E. coli.
在另一优选例中,所述的宿主细胞为人参细胞。In another preferred embodiment, the host cell is a ginseng cell.
在另一优选例中,所述的宿主细胞不是天然产生式(II),(IV),(VI),(VIII),(II),(IIII)化合物的细胞。In another preferred embodiment, the host cell is not a cell that naturally produces a compound of formula (II), (IV), (VI), (VIII), (II), (IIII).
在另一优选例中,所述的宿主细胞不是天然产生稀有人参皂苷Rg3和/或稀有人参皂苷Rf和/或稀有人参皂苷Rg2,和/或新的人参皂苷3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-PPD和3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-CK等的细胞。In another preferred embodiment, the host cell is not naturally producing a rare ginsenoside Rg3 and/or a rare ginsenoside Rf and/or a rare ginsenoside Rg2, and/or a new ginsenoside 3-O-β-(D- Cells such as xylopyranosyl)-β-(D-glucopyranosyl)-PPD and 3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-CK.
在另一优选例中,所述的达玛稀二醇合成代谢途径中的关键基因包括(但不限于):达玛烯二醇合成酶基因。In another preferred embodiment, the key genes in the darumadiol anabolic pathway include, but are not limited to, the damasenediol synthase gene.
在另一优选例中,所述的宿主细胞含有原人参二醇合成代谢途径中的关键基因包括(但不限于):达玛烯二醇合成酶基因、细胞色素P450 CYP716A47基因及其还原酶基因,或其组合。In another preferred embodiment, the host cell contains key genes in the proto-glycol diol anabolic pathway including, but not limited to, the dammarenediol synthase gene, the cytochrome P450 CYP716A47 gene, and its reductase gene. , or a combination thereof.
在另一优选例中,所述的宿主细胞含有原人参三醇合成代谢途径中的关键基因包括(但不限于):达玛烯二醇合成酶基因、细胞色素P450 CYP716A47基因及其还原酶、P450 CYP716A47的还原酶基因及其基因,或其组合。。In another preferred embodiment, the host cell contains key genes in the original ginseng triol anabolic pathway including, but not limited to, a dammarenediol synthase gene, a cytochrome P450 CYP716A47 gene, and a reductase thereof, P450 CYP716A47 reductase gene and its gene, or a combination thereof. .
在另一优选例中,所述的人参皂苷Rh2合成代谢途径中的关键基因包括(但不限于):达玛烯二醇合成酶基因、细胞色素P450 CYP716A47基因及其还原酶基因、四环三萜C-3位糖基转移酶基因UGTPg45,或其组合。In another preferred embodiment, the key genes in the ginsenoside Rh2 anabolic pathway include, but are not limited to, a dammarene diol synthase gene, a cytochrome P450 CYP716A47 gene and a reductase gene thereof, and a tetracyclic three萜C-3 position glycosyltransferase gene UGTPg45, or a combination thereof.
在另一优选例中,所述的人参皂苷Rh1合成代谢途径中的关键基因包括(但不限于):达玛烯二醇合成酶基因、细胞色素P450 CYP716A47基因及其还原酶基因、及细胞色素P450 CYP716A53V2基因及其还原酶基因、四环三萜C-6位的糖基转移酶基因UGTPg100,或其组合。In another preferred embodiment, the key genes in the ginsenoside Rh1 anabolic pathway include, but are not limited to, a dammarene diol synthase gene, a cytochrome P450 CYP716A47 gene, a reductase gene thereof, and a cytochrome. P450 CYP716A53V2 gene and its reductase gene, tetracyclic triterpenoid C-6 glycosyltransferase gene UGTPg100, or a combination thereof.
在本发明的第八方面,提供了第八方面所述的宿主细胞的用途,用于制备酶催化试剂,或生产糖基转移酶、或作为催化细胞、或产生式(II)、(IV)或(VI)。In an eighth aspect of the invention, the use of the host cell of the eighth aspect is provided for the preparation of an enzyme catalytic reagent, or for the production of a glycosyltransferase, or as a catalytic cell, or for the production of formula (II), (IV) Or (VI).
在另一优选例中,所述的宿主细胞用于通过对人参皂苷Rh2、F2、Rg3、Rd和/或人参皂苷Rh1、Rg1的糖基化反应,生产新皂苷3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-PPD和3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-CKII和/或稀有人参皂苷Rg3和/或稀有人参皂苷Rf和/或稀有人参皂苷Rg2。In another preferred embodiment, the host cell is used to produce a novel saponin 3-O-β-(D) by glycosylation of ginsenoside Rh2, F2, Rg3, Rd and/or ginsenoside Rh1, Rg1. -xylopyranosyl)-β-(D-glucopyranosyl)-PPD and 3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-CKII and/or rare ginsenoside Rg3 and/or rare ginsenoside Rf and / or rare ginsenoside Rg2.
在本发明的第九方面,提供了一种产生转基因植物的方法,包括步骤:将第八方面所述的遗传工程化的宿主细胞再生为植物,并且所述的遗传工程化的宿主细胞为植物细胞。In a ninth aspect of the invention, a method of producing a transgenic plant, comprising the steps of: regenerating the genetically engineered host cell of the eighth aspect into a plant, and wherein the genetically engineered host cell is a plant cell.
在另一优选例中,所述的遗传工程化的宿主细胞为人参细胞。In another preferred embodiment, the genetically engineered host cell is a ginseng cell.
应理解,在本发明范围内中,本发明的上述各技术特征和在下文(如实施例)中具体描述的各技术特征之间都可以互相组合,从而构成新的或优选的技术方案。限于篇幅,在此不再一一累述。It is to be understood that within the scope of the present invention, the various technical features of the present invention and the various technical features specifically described hereinafter (as in the embodiments) may be combined with each other to constitute a new or preferred technical solution. Due to space limitations, we will not repeat them here.
附图说明DRAWINGS
下列附图用于说明本发明的具体实施方案,而不用于限定由权利要求书所界 定的本发明范围。The following figures are used to illustrate specific embodiments of the invention and are not intended to be limited by the scope of the claims The scope of the invention is intended.
图1显示(a)gGT29/gGT29-3基因和(b)gGT29-4/gGT29-5/gGT29-6和gGT29-7基因PCR产物的琼脂糖凝胶电泳图。(b)泳道1,核酸Marker;泳道2,gGT29/gGT29-3基因PCR产物;(b)泳道1,gGT29-4/gGT29-5/gGT29-6基因PCR产物;泳道2,gGT29-7基因PCR产物;泳道3,核酸Marker。Figure 1 shows agarose gel electrophoresis patterns of (a) gGT29/gGT29-3 gene and (b) gGT29-4/gGT29-5/gGT29-6 and gGT29-7 gene PCR products. (b) Lane 1, nucleic acid Marker; Lane 2, gGT29/gGT29-3 gene PCR product; (b) Lane 1, gGT29-4/gGT29-5/gGT29-6 gene PCR product; Lane 2, gGT29-7 gene PCR Product; Lane 3, Nucleic Acid Marker.
图2显示SDS-PAGE检测gGT29和gGT29-3在酿酒酵母中的表达;泳道1,空载体pYES2重组子的裂解液上清;泳道2,gGT29-pYES2酵母重组子的裂解液上清;泳道3,gGT29-3-pYES2酵母重组子的裂解液上清。Figure 2 shows SDS-PAGE detection of gGT29 and gGT29-3 expression in Saccharomyces cerevisiae; Lane 1, lysate supernatant of empty vector pYES2 recombinant; Lane 2, lysate supernatant of gGT29-pYES2 yeast recombinant; Lane 3 , lysate supernatant of gGT29-3-pYES2 yeast recombinant.
图3显示Western Blot检测gGT29和gGT29-3在酿酒酵母中的表达;泳道1,空载体pYES2重组子的裂解液上清;泳道2,gGT29-pYES2酵母重组子的裂解液上清;泳道3,gGT29-3-pYES2酵母重组子的裂解液上清。Figure 3 shows Western Blot detection of gGT29 and gGT29-3 expression in Saccharomyces cerevisiae; Lane 1, lysate supernatant of empty vector pYES2 recombinant; Lane 2, lysate supernatant of gGT29-pYES2 yeast recombinant; Lane 3, Lysate supernatant of gGT29-3-pYES2 yeast recombinant.
图4显示糖基转移酶gGT29和gGT29-3催化人参皂苷Rh2和F2的产物的TLC检测图谱;泳道1,PPD及PPD类型皂苷混合标样,泳道2,gGT29粗酶液(gGT29-pYES2酵母重组子的裂解液上清)催化Rh2生成Rg3,泳道3,gGT29粗酶液催化Rh2对照,加入pYES2空质粒酵母重组子裂解液替代酶液;泳道4,gGT29催化F2生成Rd,泳道5,gGT29催化F2对照,加入pYES2空质粒酵母重组子裂解液替代酶液;泳道6,gGT29-3粗酶液(gGT29-3-pYES2酵母重组子的裂解液上清)催化Rh2生成Rg3;泳道7,gGT29-3粗酶液催化F2生成Rd。Figure 4 shows TLC detection profiles of glycosyltransferases gGT29 and gGT29-3 catalyzed products of ginsenoside Rh2 and F2; Lane 1, PPD and PPD type saponin mixed standards, Lane 2, gGT29 crude enzyme solution (gGT29-pYES2 yeast recombination) The lysate supernatant of the bacterium catalyzes the formation of Rg3 by Rh2, Lane 3, gGT29 crude enzyme solution catalyzes the Rh2 control, and adds pYES2 empty plasmid yeast recombinant lysate instead of the enzyme solution; Lane 4, gGT29 catalyzes the formation of Rd by F2, Lane 5, gGT29 catalysis F2 control, adding pYES2 empty plasmid yeast recombinant lysate instead of enzyme solution; Lane 6, gGT29-3 crude enzyme solution (gGT29-3-pYES2 yeast lysate supernatant) catalyzes Rh2 to produce Rg3; Lane 7, gGT29- 3 The crude enzyme solution catalyzes the formation of Rd by F2.
图5显示糖基转移酶gGT29和BvUGT73C10或gGT29和UGTPg45联合催化PPD产物的TLC检测;(a)gGT29和BvUGT73C10联合催化PPD,泳道1,PPD及PPD类型皂苷混合标样;泳道2,BvUGT73C10催化PPD生成Rh2;泳道3,gGT29催化Rh2生成Rg3;泳道4,BvUGT73C10和gGT29联合催化PPD生成Rg3;(b)gGT29和UGTPg45联合催化PPD,泳道1,PPD及PPD类型皂苷混合标样;泳道2,UGTPg45催化PPD生成Rh2;泳道3,PPD;泳道4,UGTPg45和gGT29联合催化PPD生成Rg3。Figure 5 shows TLC detection of glycotransferase gGT29 and BvUGT73C10 or gGT29 and UGTPg45 in combination with PPD products; (a) gGT29 and BvUGT73C10 combined with catalytic PPD, lane 1, PPD and PPD type saponin mixed standards; lane 2, BvUGT73C10 catalytic PPD Generation of Rh2; Lane 3, gGT29 catalyzes the formation of Rg3 by Rh2; Lane 4, BvUGT73C10 and gGT29 combine to catalyze the formation of Rg3 by PPD; (b) GGT29 and UGTPg45 combined with catalytic PPD, Lane 1, PPD and PPD type saponin mixed standards; Lane 2, UGTPg45 Catalytic PPD to produce Rh2; Lane 3, PPD; Lane 4, UGTPg45 and gGT29 combined with catalytic PPD to generate Rg3.
图6显示糖基转移酶BvUGT73C10和gGT29分别催化20(R)-PPD和20(R)-PPD以及联合催化20(R)-PPD的产物TLC检测图谱;泳道1,BvUGT73C10催化20(R)-PPD生成20(R)-Rh2;泳道2,gGT29催化20(R)-Rh2生成20(R)-Rg3;泳道3,BvUGT73C10和gGT29联合催化20(R)-PPD生成20(R)-Rg3。Figure 6 shows that the glycosyltransferases BvUGT73C10 and gGT29 catalyze the TLC detection of 20(R)-PPD and 20(R)-PPD and the combined catalyst 20(R)-PPD, respectively; Lane 1, BvUGT73C10 catalyzes 20(R)- PPD produces 20(R)-Rh2; Lane 2, gGT29 catalyzes the formation of 20(R)-Rh2 by 20(R)-Rg3; Lane 3, BvUGT73C10 and gGT29 combine to catalyze the formation of 20(R)-PP3 by 20(R)-PPD.
图7显示糖基转移酶gGT29和BvUGT73C10或gGT29和UGTPg45联合催化PPD产物的的HPLC检测结果。第一行,Rg3,Rh2和PPD混合标准样品;第二行,gGT29和BvUGT73C10联合催化PPD,第三行,gGT29和UGTPg45联合催化PPD。Figure 7 shows the results of HPLC detection of glycotransferases gGT29 and BvUGT73C10 or gGT29 and UGTPg45 in combination with catalytic PPD products. The first row, Rg3, Rh2 and PPD mixed standard samples; the second row, gGT29 and BvUGT73C10 combined catalytic PPD, the third row, gGT29 and UGTPg45 combined catalytic PPD.
图8显示糖基转移酶gGT29和BvUGT73C10或gGT29和UGTPg45联合催化PPD的产物LC/MS检测结果。显示了标准样品Rg3的质谱图和图7中P1峰(gGT29和BvUGT73C10联合催化PPD的产物)和P2峰(gGT29和UGTPg45联合催化PPD的产物产物峰)的质谱图。Figure 8 shows the results of LC/MS detection of the products of glycosyltransferases gGT29 and BvUGT73C10 or gGT29 and UGTPg45 in combination with PPD. The mass spectrum of the standard sample Rg3 and the mass spectrum of the P1 peak (product of gGT29 and BvUGT73C10 in combination with catalytic PPD) and the P2 peak (product product peak of gGT29 and UGTPg45 combined catalytic PPD) are shown in Fig. 7.
图9显示产Rg3酵母工程菌A2细胞裂解液抽提物的HPLC检测结果,第一行样品:原人参二醇(PPD)、达玛烯二醇(DM),人参皂苷Rh2和Rg3的混合标准样品;第二行样品:产Rg3酵母工程菌A2细胞裂解液抽提物。Figure 9 shows the results of HPLC detection of Rg3 yeast engineering strain A2 cell lysate extract, the first line of samples: original ginseng diol (PPD), dammar diol (DM), ginsenoside Rh2 and Rg3 mixing standards Sample; second row of samples: Rg3 yeast engineering bacteria A2 cell lysate extract.
图10显示SDS-PAGE检测gGT29-4,gGT29-5,gGT29-6,gGT29-7在重组大肠杆菌中的表达。泳道1,gGT29-4-pET28a重组大肠杆菌体裂解总蛋白;泳道 2,gGT29-4-pET28a重组大肠杆菌体裂解上清;泳道3,gGT29-5-pET28a重组大肠杆菌体裂解总蛋白;泳道4,gGT29-5-pET28a重组大肠杆菌体裂解上清;泳道5,gGT29-6-pET28a重组大肠杆菌体裂解总蛋白;泳道6,gGT29-6-pET28a重组大肠杆菌体裂解上清;泳道7,gGT29-7-pET28a重组大肠杆菌体裂解总蛋白;泳道8,gGT29-7-pET28a重组大肠杆菌体裂解上清;泳道9,蛋白分子量Marker。Figure 10 shows the expression of gGT29-4, gGT29-5, gGT29-6, gGT29-7 in recombinant E. coli by SDS-PAGE. Lane 1, gGT29-4-pET28a recombinant E. coli lysed total protein; lane 2, gGT29-4-pET28a recombinant E. coli lysate supernatant; Lane 3, gGT29-5-pET28a recombinant E. coli lysed total protein; Lane 4, gGT29-5-pET28a recombinant E. coli lysate supernatant; Lane 5, gGT29-6-pET28a recombinant E. coli lysed total protein; Lane 6, gGT29-6-pET28a recombinant E. coli lysate supernatant; Lane 7, gGT29-7-pET28a recombinant E. coli lysed total protein; Lane 8, gGT29- 7-pET28a recombinant E. coli lysate supernatant; Lane 9, protein molecular weight Marker.
图11显示Western Blot检测gGT29-4,gGT29-5,gGT29-6,gGT29-7在重组大肠杆菌中的表达。泳道1,gGT29-4-pET28a重组大肠杆菌体裂解总蛋白;泳道2,gGT29-4-pET28a重组大肠杆菌体裂解上清;泳道3,gGT29-5-pET28a重组大肠杆菌体裂解总蛋白;泳道4,gGT29-5-pET28a重组大肠杆菌体裂解上清;泳道5,gGT29-6-pET28a重组大肠杆菌体裂解总蛋白;泳道6,gGT29-6-pET28a重组大肠杆菌体裂解上清;泳道7,gGT29-7-pET28a重组大肠杆菌体裂解总蛋白;泳道8,gGT29-7-pET28a重组大肠杆菌体裂解上清。Figure 11 shows the expression of gGT29-4, gGT29-5, gGT29-6, gGT29-7 in recombinant E. coli by Western Blot. Lane 1, gGT29-4-pET28a recombinant E. coli lysed total protein; Lane 2, gGT29-4-pET28a recombinant E. coli lysate supernatant; Lane 3, gGT29-5-pET28a recombinant E. coli lysed total protein; Lane 4 , gGT29-5-pET28a recombinant E. coli lysate supernatant; Lane 5, gGT29-6-pET28a recombinant E. coli lysed total protein; Lane 6, gGT29-6-pET28a recombinant E. coli lysate supernatant; Lane 7, gGT29 -7-pET28a recombinant E. coli lysed total protein; Lane 8, gGT29-7-pET28a recombinant E. coli lysate supernatant.
图12显示糖基转移酶gGT29-4,gGT29-5,gGT29-6,gGT29-7分别催化Rh2和F2的产物TLC检测图谱。泳道Rh2表示用皂苷Rh2为底物;泳道F2表示用皂苷F2为底物。gGT29-4,gGT29-5,gGT29-6,gGT29-7表示用不同的酶液进行催化反应。Figure 12 shows that the glycosyltransferases gGT29-4, gGT29-5, gGT29-6, gGT29-7 catalyze the TLC detection of the products of Rh2 and F2, respectively. Lane Rh2 indicates the use of saponin Rh2 as a substrate; and lane F2 indicates the use of saponin F2 as a substrate. gGT29-4, gGT29-5, gGT29-6, gGT29-7 indicate that the catalytic reaction was carried out with different enzyme solutions.
图13显示糖基转移酶gGT29-4,gGT29-5,gGT29-6,gGT29-7分别催化Rh1的产物TLC检测图谱。(a)泳道1,2和3分别代表糖基转移酶gGT29-4,gGT29-5和gGT29-6分别催化Rh1的产物,泳道4代表原人参三醇型皂苷混合标样;(b)泳道1代表糖基转移酶gGT29-7催化Rh1的产物,泳道2代表原人参三醇型皂苷混合标样。Figure 13 shows that the glycosyltransferases gGT29-4, gGT29-5, gGT29-6, gGT29-7 catalyze the TLC detection of the product of Rh1, respectively. (a) Lanes 1, 2 and 3 represent the glycosyltransferases gGT29-4, gGT29-5 and gGT29-6 respectively catalyze the product of Rh1, and lane 4 represents the original ginsenotriol-type saponin mixed standard; (b) Lane 1 The product representing Rh1 is catalyzed by the glycosyltransferase gGT29-7, and the lane 2 represents the original ginseng triol type saponin mixed standard.
图14显示了显示糖基转移酶gGT29-7与其突变蛋白gGT29-7-N343G,gGT29-7-A359P和gGT29-7-N343G/A359P催化Rh1的检测结果(同时使用UDP-葡萄糖和UDP-鼠李糖作为糖基供体)。第一行,Rg1,Rf,Rg2和Rh1混合标准样品;第二行,gGT29-7催化Rh1;第三行,gGT29-7-N343G催化Rh1;第四行,gGT29-7-A359P催化Rh1;第五行,gGT29-7-N343G/A359P催化Rh1。Figure 14 shows the results of the detection of Rh1 by the glycosyltransferase gGT29-7 and its mutant proteins gGT29-7-N343G, gGT29-7-A359P and gGT29-7-N343G/A359P (using both UDP-glucose and UDP-rhamnet) Sugar as a glycosyl donor). In the first row, Rg1, Rf, Rg2 and Rh1 are mixed with standard samples; the second row, gGT29-7 catalyzes Rh1; the third row, gGT29-7-N343G catalyzes Rh1; the fourth row, gGT29-7-A359P catalyzes Rh1; Five lines, gGT29-7-N343G/A359P catalyze Rh1.
图15显示糖基转移酶gGT29-3和gGT29-14以UDP-xylose为糖基供体,催化人参皂苷Rh2,Rg3和Rd产物的TLC检测图谱;(A)以表达空载体pET28a的肠杆菌裂解液上清作为酶液进行催化;(B)以表达pET28a-gGT29-3的肠杆菌裂解液上清作为酶液进行催化;(C)以表达pET28a-gGT29-14的肠杆菌裂解液上清作为酶液进行催化。泳道Rh2,Rg3和Rd分别表示用皂苷Rh2,Rg3和Rd作为底物,泳道M代表原人参二醇型皂苷混合标准。Figure 15 shows TLC detection of glycosyltransferases gGT29-3 and gGT29-14 with UDP-xylose as a glycosyl donor, catalyzing the ginsenoside Rh2, Rg3 and Rd products; (A) lysis of Enterobacteriaceae expressing the empty vector pET28a The supernatant was catalyzed as an enzyme solution; (B) catalyzed by the supernatant of Enterobacter cloaca expressing pET28a-gGT29-3 as an enzyme solution; (C) the supernatant of Enterobacter cloaca expressing pET28a-gGT29-14 was used as The enzyme solution is catalyzed. Lanes Rh2, Rg3 and Rd represent the saponins Rh2, Rg3 and Rd as substrates, respectively, and lane M represents the original ginsengdiol-type saponin mixing standard.
具体实施方式detailed description
本发明人经过广泛而深入的研究,首次提供糖基转移酶gGT29-7(SEQ ID NO.:61)及其衍生多肽,如gGT29(SEQ ID NO.:26),gGT29-3(SEQ ID NO.:28),gGT29-4(SEQ ID NO.:55),gGT29-5(SEQ ID NO.:57),gGT29-6(SEQ ID NO.:59),gGT29-8(SEQ ID NO.:72),gGT29-9(SEQ ID NO.:74),gGT29-10(SEQ ID NO.:76),gGT29-11(SEQ ID NO.:78),gGT29-12(SEQ ID NO.:80),gGT29-13(SEQ ID NO.:82)gGT29-14(SEQ ID NO.:84)、gGT29-15(SEQ ID NO.:86)、gGT29-16(SEQ ID NO.:88)、gGT29-17(SEQ ID NO.:90)、gGT29-18(SEQ ID NO.:92)、 gGT29-7-N343G(SEQ ID NO.:93)、gGT29-7-A359P(SEQ ID NO.:94)、gGT29-7-N343G/A359P(SEQ ID NO.:95)在萜类化合物糖基化催化及新皂苷合成中的应用。具体地,本发明的糖基转移酶能特异和高效地催化四环三萜化合物底物的和/或将来自糖基供体的糖基转移到四环三萜类化合物的C-3位或C-6的第一个糖基上以延伸糖链。特别是能够将人参Rh2转化为具有抗癌活性的稀有人参皂苷Rg3,将人参皂苷F2转化为人参皂苷Rd,将人参皂苷Rh1转化为抗肿瘤,抗疲劳功效的稀有人参皂苷Rf,将人参皂苷Rh1转化具有神经保护作用和紫外线保护作用的稀有人参皂苷Rg2。本发明还提供了转化和催化方法。本发明的糖基转移酶还可与达玛烯二醇和/或原人参二醇或原人参三醇合成代谢途径中的关键酶以及四环三萜C-3或者C-6位的糖基转移酶在宿主细胞中共表达,或者应用于制备人参皂苷Rh2和人参皂苷Rh1的遗传工程细胞中,应用于构建人工合成稀有人参皂苷Rg3和Rf。此外,本发明的糖基转移酶还可与达玛烯二醇和/或原人参二醇或原人参三醇合成代谢途径中的关键酶以及C-6位的糖基转移酶以及合成UDP-鼠李糖的关键酶在宿主细胞中共表达,应用于构建人工合成稀有人参皂苷Rg2的菌株。在此基础上完成了本发明。The inventors have provided extensive and intensive research to provide glycosyltransferase gGT29-7 (SEQ ID NO.: 61) and its derived polypeptides for the first time, such as gGT29 (SEQ ID NO.: 26), gGT29-3 (SEQ ID NO) .: 28), gGT29-4 (SEQ ID NO.: 55), gGT29-5 (SEQ ID NO.: 57), gGT29-6 (SEQ ID NO.: 59), gGT29-8 (SEQ ID NO.: 72), gGT29-9 (SEQ ID NO.: 74), gGT29-10 (SEQ ID NO.: 76), gGT29-11 (SEQ ID NO.: 78), gGT29-12 (SEQ ID NO.: 80) , gGT29-13 (SEQ ID NO.: 82) gGT29-14 (SEQ ID NO.: 84), gGT29-15 (SEQ ID NO.: 86), gGT29-16 (SEQ ID NO.: 88), gGT29- 17 (SEQ ID NO.: 90), gGT29-18 (SEQ ID NO.: 92), gGT29-7-N343G (SEQ ID NO.: 93), gGT29-7-A359P (SEQ ID NO.: 94), gGT29-7-N343G/A359P (SEQ ID NO.: 95) glycosylation of terpenoids Catalytic and new saponin synthesis applications. Specifically, the glycosyltransferase of the present invention is capable of specifically and efficiently catalyzing the tetracyclic triterpenoid substrate and/or transferring a glycosyl group derived from a glycosyl donor to the C-3 position of a tetracyclic triterpenoid or The first sugar group of C-6 is extended with a sugar chain. In particular, ginseng Rh2 can be converted into rare ginsenoside Rg3 with anticancer activity, ginsenoside F2 can be converted into ginsenoside Rd, and ginsenoside Rh1 can be converted into anti-tumor, anti-fatigue saponin Rf, ginsenoside Rh1 Transformation of rare ginsenoside Rg2 with neuroprotective effects and UV protection. The invention also provides methods of transformation and catalysis. The glycosyltransferase of the present invention may also be a key enzyme in the anabolic pathway of dammarane diol and/or protopanaxadiol or protopanaxatriol, and a glycosyltransferase of the tetracyclic triterpenoid C-3 or C-6 position. The enzyme is co-expressed in the host cell or used in the preparation of genetically engineered cells of ginsenoside Rh2 and ginsenoside Rh1 for the construction of artificially synthesized ginsenoside Rg3 and Rf. In addition, the glycosyltransferase of the present invention may also be a key enzyme in the anabolic pathway of dammarane diol and/or protopanaxadiol or protopanaxadiol, and a glycosyltransferase at the C-6 position and a synthetic UDP-mouse. The key enzyme of plum sugar is co-expressed in host cells and used to construct a strain of artificially synthesized ginsenoside Rg2. The present invention has been completed on this basis.
定义definition
如本文所用,术语“活性多肽”、“本发明的多肽及其衍生多肽”、“本发明的酶”、“糖基转移酶”、“本发明的gGT29或其衍生多肽”或“本发明的糖基转移酶”,均指糖基转移酶gGT29-7(SEQ ID NO.:61)或其衍生多肽。其中优选的衍生多肽包括gGT29(SEQ ID NO.:26)、gGT29-3(SEQ ID NO.:28)、gGT29-4(SEQ ID NO.:55)、gGT29-5(SEQ ID NO.:57)、gGT29-6(SEQ ID NO.:59)、gGT29-8(SEQ ID NO.:72)、gGT29-9(SEQ ID NO.:74),gGT29-10(SEQ ID NO.:76),gGT29-11(SEQ ID NO.:78),gGT29-12(SEQ ID NO.:80),gGT29-13(SEQ ID NO.:82)gGT29-14(SEQ ID NO.:84)、gGT29-15(SEQ ID NO.:86)、gGT29-16(SEQ ID NO.:88)、gGT29-17(SEQ ID NO.:90)、gGT29-18(SEQ ID NO.:92)、gGT29-7-N343G(SEQ ID NO.:93)、gGT29-7-A359P(SEQ ID NO.:94)、gGT29-7-N343G/A359P(SEQ ID NO.:95)。The term "active polypeptide", "polypeptide of the invention and derivatives thereof", "enzyme of the invention", "glycosyltransferase", "gGT29 of the invention or a polypeptide derived therefrom" or "the invention", as used herein, "Glycosyltransferase", both refers to glycosyltransferase gGT29-7 (SEQ ID NO.: 61) or a polypeptide derived therefrom. Among the preferred derivative polypeptides include gGT29 (SEQ ID NO.: 26), gGT29-3 (SEQ ID NO.: 28), gGT29-4 (SEQ ID NO.: 55), gGT29-5 (SEQ ID NO.: 57). ), gGT29-6 (SEQ ID NO.: 59), gGT29-8 (SEQ ID NO.: 72), gGT29-9 (SEQ ID NO.: 74), gGT29-10 (SEQ ID NO.: 76), gGT29-11 (SEQ ID NO.: 78), gGT29-12 (SEQ ID NO.: 80), gGT29-13 (SEQ ID NO.: 82) gGT29-14 (SEQ ID NO.: 84), gGT29-15 (SEQ ID NO.: 86), gGT29-16 (SEQ ID NO.: 88), gGT29-17 (SEQ ID NO.: 90), gGT29-18 (SEQ ID NO.: 92), gGT29-7-N343G (SEQ ID NO.: 93), gGT29-7-A359P (SEQ ID NO.: 94), gGT29-7-N343G/A359P (SEQ ID NO.: 95).
如非特别说明,本文所说的人参皂苷和皂苷元,是的C20位S和/或R构型的人参皂苷和皂苷元。Unless otherwise specified, the ginsenosides and sapogenins referred to herein are ginsenosides and sapogenins of the C20 position S and/or R configuration.
如本文所用,“分离的多肽”是指所述多肽基本上不含天然与其相关的其它蛋白、脂类、糖类或其它物质。本领域的技术人员能用标准的蛋白质纯化技术纯化所述多肽。基本上纯的多肽在非还原聚丙烯酰胺凝胶上能产生单一的主带。所述多肽的纯度还可以用氨基酸序列进行进一步分析。As used herein, "isolated polypeptide" means that the polypeptide is substantially free of other proteins, lipids, carbohydrates or other materials with which it is naturally associated. One skilled in the art can purify the polypeptide using standard protein purification techniques. A substantially pure polypeptide produces a single major band on a non-reducing polyacrylamide gel. The purity of the polypeptide can also be further analyzed using amino acid sequences.
本发明的活性多肽可以是重组多肽、天然多肽、合成多肽。本发明的多肽可以是天然纯化的产物,或是化学合成的产物,或使用重组技术从原核或真核宿主(例如,细菌、酵母、植物)中产生。根据重组生产方案所用的宿主,本发明的多肽可以是糖基化的,或可以是非糖基化的。本发明的多肽还可包括或不包括起始的甲硫氨酸残基。The active polypeptide of the present invention may be a recombinant polypeptide, a natural polypeptide, or a synthetic polypeptide. The polypeptides of the invention may be naturally purified products, either chemically synthesized or produced recombinantly from prokaryotic or eukaryotic hosts (e.g., bacteria, yeast, plants). The polypeptide of the invention may be glycosylated or may be non-glycosylated, depending on the host used in the recombinant production protocol. Polypeptides of the invention may also or may not include an initial methionine residue.
本发明还包括所述多肽的片段、衍生物和类似物。如本文所用,术语“片段”、“衍生物”和“类似物”是指基本上保持所述多肽相同的生物学功能或活性的多肽。 The invention also includes fragments, derivatives and analogs of the polypeptides. As used herein, the terms "fragment," "derivative," and "analog" refer to a polypeptide that substantially retains the same biological function or activity of the polypeptide.
本发明的多肽片段、衍生物或类似物可以是(i)有一个或多个保守或非保守性氨基酸残基(优选保守性氨基酸残基)被取代的多肽,而这样的取代的氨基酸残基可以是也可以不是由遗传密码编码的,或(ii)在一个或多个氨基酸残基中具有取代基团的多肽,或(iii)成熟多肽与另一个化合物(比如延长多肽半衰期的化合物,例如聚乙二醇)融合所形成的多肽,或(iv)附加的氨基酸序列融合到此多肽序列而形成的多肽(如前导序列或分泌序列或用来纯化此多肽的序列或蛋白原序列,或与抗原IgG片段的形成的融合蛋白)。根据本文的教导,这些片段、衍生物和类似物属于本领域熟练技术人员公知的范围。The polypeptide fragment, derivative or analog of the present invention may be (i) a polypeptide having one or more conservative or non-conservative amino acid residues (preferably conservative amino acid residues) substituted, and such substituted amino acid residues It may or may not be encoded by the genetic code, or (ii) a polypeptide having a substituent group in one or more amino acid residues, or (iii) a mature polypeptide and another compound (such as a compound that extends the half-life of the polypeptide, for example Polyethylene glycol) a polypeptide formed by fusion, or (iv) a polypeptide formed by fused an additional amino acid sequence to the polypeptide sequence (such as a leader or secretion sequence or a sequence or proprotein sequence used to purify the polypeptide, or A fusion protein for the formation of an antigenic IgG fragment). These fragments, derivatives and analogs are within the purview of those skilled in the art in light of the teachings herein.
在本发明的活性多肽具有糖基转移酶活性,并且能够催化以下一种或多种反应:The active polypeptide of the present invention has glycosyltransferase activity and is capable of catalyzing one or more of the following reactions:
(A)(A)
Figure PCTCN2015081111-appb-000005
Figure PCTCN2015081111-appb-000005
其中,R1为糖基;R2和R3为OH或者H;R4为糖基或者H;R5为糖基,所述的多肽选自SEQ ID NO.:26、28、55、57、59、61、72、74、76、78、80、82、84、86、88、90、92、93、94或95或其衍生多肽。Wherein R1 is a glycosyl group; R2 and R3 are OH or H; R4 is a glycosyl group or H; R5 is a glycosyl group, and said polypeptide is selected from the group consisting of SEQ ID NO.: 26, 28, 55, 57, 59, 61, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 93, 94 or 95 or a polypeptide derived therefrom.
R1-R4经取代后的化合物如下表所示:The substituted compounds of R1-R4 are shown in the following table:
底物Substrate R1R1 R2R2 R3R3 R4R4
Rh2Rh2 糖基Glycosyl HH OHOH HH
F2F2 糖基Glycosyl HH OHOH 糖基Glycosyl
即当R1为葡萄糖基;R2为H,R3为OH,R4为H,式(I)化合物为Rh2。That is, when R1 is a glucosyl group; R2 is H, R3 is OH, R4 is H, and the compound of the formula (I) is Rh2.
R1为葡萄糖基;R2为H,R3为OH,R4为葡萄糖基,式(I)化合物为F2。R1 is a glucosyl group; R2 is H, R3 is OH, R4 is a glucosyl group, and the compound of formula (I) is F2.
当以UDP-葡糖糖作为糖基供体时,底物(I)化合物为Rh2,则产物式(II)化合物为Rg3;底物(I)化合物为F2,则产物式(II)化合物为Rd;When UDP-glucose is used as the glycosyl donor, the substrate (I) compound is Rh2, then the product of formula (II) is Rg3; and the substrate (I) compound is F2, then the product of formula (II) is Rd;
当以UDP-木糖作为糖基供体时,底物(I)化合物为Rh2,则产物式(II)化合物为3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-PPD;底物(I)化合物为F2,则产物式(II)化合物为3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-CK。When UDP-xylose is used as the glycosyl donor, the substrate (I) compound is Rh2, and the product of formula (II) is 3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)- PPD; the substrate (I) compound is F2, and the product of formula (II) is 3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-CK.
(B) (B)
Figure PCTCN2015081111-appb-000006
Figure PCTCN2015081111-appb-000006
其中,R1和R2为H或者糖基,R3和R4为糖基。R3-R4-O为C6第一个糖基衍生的糖基,所述的多肽选自SEQ ID NO.:55,57,59,61,78,82,92,94或者95或其衍生多肽。Wherein R1 and R2 are H or a glycosyl group, and R3 and R4 are a glycosyl group. R3-R4-O is the first glycosyl-derived glycosyl group of C6, and the polypeptide is selected from the group consisting of SEQ ID NO.: 55, 57, 59, 61, 78, 82, 92, 94 or 95 or a polypeptide derived therefrom.
当R1和R2为H,R3为葡萄糖基时,式(III)化合物为Rh1。When R1 and R2 are H and R3 is a glucosyl group, the compound of formula (III) is Rh1.
当以UDP-葡糖糖作为糖基供体时,底物(IIII)化合物为Rh1,则产物式(IV)化合物为Rf;When UDP-glucose is used as the glycosyl donor, the substrate (IIII) compound is Rh1, and the product of formula (IV) is Rf;
当以UDP-鼠李糖作为糖基供体时,底物(IIII)化合物为Rh1,则产物式(IV)化合物为Rg2;When UDP-rhamnose is used as the glycosyl donor, the substrate (IIII) compound is Rh1, and the product of formula (IV) is Rg2;
(C)(C)
Figure PCTCN2015081111-appb-000007
Figure PCTCN2015081111-appb-000007
其中,R1为糖基;R2和R3为OH或者H;R4为糖基或者H;R5为糖基,R5-R1-O为C3第一个糖基衍生的糖基;R6为糖基,R6-R1-O为C3第一个糖基衍生的糖基,所述的多肽选自SEQ ID NO.:26、28、59、76、84、86或88或其衍生多肽。Wherein R1 is a glycosyl group; R2 and R3 are OH or H; R4 is a glycosyl group or H; R5 is a glycosyl group, R5-R1-O is a glycosyl group derived from the first glycosyl group of C3; R6 is a glycosyl group, R6 -R1-O is the first glycosyl-derived glycosyl group of C3, and the polypeptide is selected from the group consisting of SEQ ID NO.: 26, 28, 59, 76, 84, 86 or 88 or a polypeptide derived therefrom.
R1为两个葡萄糖基,R2为H,R3为OH,R4为H,式(V)化合物为Rg3。R1 is two glucosyl groups, R2 is H, R3 is OH, R4 is H, and the compound of formula (V) is Rg3.
R1为两个葡萄糖基,R2为H,R3为OH,R4为葡萄糖基,式(V)化合物为RdR1 is two glucosyl groups, R2 is H, R3 is OH, R4 is a glucosyl group, and the compound of formula (V) is Rd.
当以UDP-木糖作为糖基供体时,底物(V)化合物为Rg3,则产物式(VI)化合物为3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-PPD;底物(V)化合物为Rd,则产物式(VI)化合物为3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-CK。When UDP-xylose is used as the glycosyl donor, the substrate (V) compound is Rg3, and the product of formula (VI) is 3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)- PPD; the substrate (V) compound is Rd, and the product of formula (VI) is 3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-CK.
所述的多肽序列为SEQ ID NO.:61或其衍生多肽,其优选的衍生多肽26、28、55、57、59、72、74、76、78、80、82、84、86、88、90、92、93、94或95所示的多肽,该术语还包括具有与所示多肽具有相同功能的SEQ ID NO.:26、28、55、57、59、61、72、74、76、78、80、82、84、86、88、90、92、93、94或者95序列的变异形式。这些变异形式包括(但并不限于):一个或多个(通常为1-50个, 较佳地1-30个,更佳地1-20个,最佳地1-10个)氨基酸的缺失、插入和/或取代,以及在C末端和/或N末端添加一个或数个(通常为20个以内,较佳地为10个以内,更佳地为5个以内)氨基酸。例如,在本领域中,用性能相近或相似的氨基酸进行取代时,通常不会改变蛋白质的功能。又比如,在C末端和/或N末端添加一个或数个氨基酸通常也不会改变蛋白质的功能。该术语还包括本发明蛋白的活性片段和活性衍生物。本发明还提供所述多肽的类似物。这些类似物与天然多肽的差别可以是氨基酸序列上的差异,也可以是不影响序列的修饰形式上的差异,或者兼而有之。这些多肽包括天然或诱导的遗传变异体。诱导变异体可以通过各种技术得到,如通过辐射或暴露于诱变剂而产生随机诱变,还可通过定点诱变法或其他已知分子生物学的技术。类似物还包括具有不同于天然L-氨基酸的残基(如D-氨基酸)的类似物,以及具有非天然存在的或合成的氨基酸(如β、γ-氨基酸)的类似物。应理解,本发明的多肽并不限于上述例举的代表性的多肽。The polypeptide sequence is SEQ ID NO.: 61 or a derivative thereof, preferably a derivative polypeptide 26, 28, 55, 57, 59, 72, 74, 76, 78, 80, 82, 84, 86, 88, a polypeptide as shown in 90, 92, 93, 94 or 95, the term further comprising SEQ ID NO.: 26, 28, 55, 57, 59, 61, 72, 74, 76 having the same function as the polypeptide shown. A variant of the 78, 80, 82, 84, 86, 88, 90, 92, 93, 94 or 95 sequence. These variations include (but are not limited to): one or more (usually 1-50, Preferably 1-30, more preferably 1-20, optimally 1-10) amino acid deletions, insertions and/or substitutions, and addition of one or several at the C-terminus and/or N-terminus (usually It is an amino acid of 20 or less, preferably 10 or less, more preferably 5 or less. For example, in the art, when substituted with amino acids of similar or similar properties, the function of the protein is generally not altered. As another example, the addition of one or several amino acids at the C-terminus and/or N-terminus will generally not alter the function of the protein. The term also encompasses active fragments and active derivatives of the proteins of the invention. The invention also provides analogs of the polypeptides. The difference between these analogs and the natural polypeptide may be a difference in amino acid sequence, a difference in the modification form which does not affect the sequence, or a combination thereof. These polypeptides include natural or induced genetic variants. Induced variants can be obtained by a variety of techniques, such as random mutagenesis by irradiation or exposure to a mutagen, or by site-directed mutagenesis or other techniques known to molecular biology. Analogs also include analogs having residues other than the native L-amino acid (such as D-amino acids), as well as analogs having non-naturally occurring or synthetic amino acids (such as beta, gamma-amino acids). It is to be understood that the polypeptide of the present invention is not limited to the representative polypeptides exemplified above.
修饰(通常不改变一级结构)形式包括:体内或体外的多肽的化学衍生形式如乙酰化或羧基化。修饰还包括糖基化,如那些在多肽的合成和加工中或进一步加工步骤中进行糖基化修饰而产生的多肽。这种修饰可以通过将多肽暴露于进行糖基化的酶(如哺乳动物的糖基化酶或去糖基化酶)而完成。修饰形式还包括具有磷酸化氨基酸残基(如磷酸酪氨酸,磷酸丝氨酸,磷酸苏氨酸)的序列。还包括被修饰从而提高了其抗蛋白水解性能或优化了溶解性能的多肽。Modifications (usually without altering the primary structure) include chemically derived forms of the polypeptide, such as acetylation or carboxylation, in vivo or in vitro. Modifications also include glycosylation, such as those produced by glycosylation modifications in the synthesis and processing of the polypeptide or in further processing steps. Such modification can be accomplished by exposing the polypeptide to an enzyme that performs glycosylation, such as a mammalian glycosylation enzyme or a deglycosylation enzyme. Modified forms also include sequences having phosphorylated amino acid residues such as phosphotyrosine, phosphoserine, phosphothreonine. Also included are polypeptides modified to increase their resistance to proteolytic properties or to optimize solubility properties.
本发明的gGT29-7多肽或其衍生多肽,例如优选为gGT29、gGT29-3、gGT29-4、gGT29-5、gGT29-6、gGT29-8、gGT29-9、gGT29-10、gGT29-11、gGT29-12、gGT29-13、gGT29-14、gGT29-15、gGT29-16、gGT29-17、gGT29-18、gGT29-7-N343G、gGT29-7-A359P或gGT29-7-N343G/A359P蛋白的氨基端或羧基端还可含有一个或多个多肽片段,作为蛋白标签。任何合适的标签都可以用于本发明。例如,所述的标签可以是FLAG、HA、HA1、c-Myc、Poly–His、Poly-Arg、Strep-TagII、AU1、EE、T7、4A6、ε、B、gE、以及Ty1。这些标签可用于对蛋白进行纯化。表1列出了其中的一些标签及其序列。The gGT29-7 polypeptide of the present invention or a polypeptide derived therefrom, for example, preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29 The amino terminus of -12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7-N343G, gGT29-7-A359P or gGT29-7-N343G/A359P protein Or the carboxy terminus may also contain one or more polypeptide fragments as a protein tag. Any suitable label can be used in the present invention. For example, the tags can be FLAG, HA, HA1, c-Myc, Poly-His, Poly-Arg, Strep-TagII, AU1, EE, T7, 4A6, ε, B, gE, and Ty1. These tags can be used to purify proteins. Table 1 lists some of these tags and their sequences.
表1Table 1
Figure PCTCN2015081111-appb-000008
Figure PCTCN2015081111-appb-000008
为了使翻译的蛋白分泌表达(如分泌到细胞外),还可在所述gGT29-7多肽或其衍生多肽,优选为gGT29、gGT29-3、gGT29-4、gGT29-5、gGT29-6、gGT29-8、gGT29-9、gGT29-10、gGT29-11、gGT29-12、gGT29-13、gGT29-14、gGT29-15、gGT29-16、gGT29-17、gGT29-18、gGT29-7-N343G、gGT29-7-A359P或gGT29-7-N343G/A359P 的氨基酸氨基末端添加上信号肽序列,如pelB信号肽等。信号肽在多肽从细胞内分泌出来的过程中可被切去。In order to secrete the translated protein (eg, secreted extracellularly), the gGT29-7 polypeptide or a derivative thereof, preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29, may also be used. -8, gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7-N343G, gGT29 -7-A359P or gGT29-7-N343G/A359P The amino acid amino terminus of the amino acid is added with a signal peptide sequence, such as a pelB signal peptide. The signal peptide can be cleaved off during secretion of the polypeptide from the cell.
本发明的多核苷酸可以是DNA形式或RNA形式。DNA形式包括cDNA、基因组DNA或人工合成的DNA。DNA可以是单链的或是双链的。DNA可以是编码链或非编码链。编码成熟多肽的编码区序列可以与SEQ ID NO.:60所示的编码区序列相同或者是简并的变异体。如本文所用,“简并的变异体”在本发明中是指编码具有SEQ ID NO.:61或其衍生多肽,优选为SEQ ID NO.:26、28、55、57、59、72、74、76、78、80、82、84、86、88、90、92、93、94或者95的蛋白质,但与SEQ ID NO.:60或其衍生蛋白的编码序列,优选为SEQ ID NO.:25、27、54、56、58、60、71、73、75、77、79、81、83、85、87、89、或91所示的序列有差别的核酸序列。The polynucleotide of the present invention may be in the form of DNA or RNA. DNA forms include cDNA, genomic DNA or synthetic DNA. DNA can be single-stranded or double-stranded. The DNA can be a coding strand or a non-coding strand. The coding region sequence encoding the mature polypeptide may be the same as the coding region sequence shown in SEQ ID NO.: 60 or a degenerate variant. As used herein, a "degenerate variant" in the present invention refers to a polypeptide having SEQ ID NO.: 61 or a derivative thereof, preferably SEQ ID NO.: 26, 28, 55, 57, 59, 72, 74. a protein of 76, 78, 80, 82, 84, 86, 88, 90, 92, 93, 94 or 95, but with the coding sequence of SEQ ID NO.: 60 or a derivative thereof, preferably SEQ ID NO.: A nucleic acid sequence having a sequence different from 25, 27, 54, 56, 58, 60, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, or 91.
编码SEQ ID NO.:61所示多肽或其衍生多肽,优选为SEQ ID NO.:26、28、55、57、59、72、74、76、78、80、82、84、86、88、90、92、93,94或95的成熟多肽的多核苷酸包括:只编码成熟多肽的编码序列;成熟多肽的编码序列和各种附加编码序列;成熟多肽的编码序列(和任选的附加编码序列)以及非编码序列。Encoding the polypeptide of SEQ ID NO.: 61 or a polypeptide derived therefrom, preferably SEQ ID NO.: 26, 28, 55, 57, 59, 72, 74, 76, 78, 80, 82, 84, 86, 88, Polynucleotides of mature polypeptides of 90, 92, 93, 94 or 95 include: coding sequences encoding only mature polypeptides; coding sequences for mature polypeptides and various additional coding sequences; coding sequences for mature polypeptides (and optional additional coding) Sequence) and non-coding sequences.
术语“编码多肽的多核苷酸”可以是包括编码此多肽的多核苷酸,也可以是还包括附加编码和/或非编码序列的多核苷酸。The term "polynucleotide encoding a polypeptide" can be a polynucleotide comprising the polypeptide, or a polynucleotide further comprising additional coding and/or non-coding sequences.
本发明还涉及上述多核苷酸的变异体,其编码与本发明有相同的氨基酸序列的多肽或多肽的片段、类似物和衍生物。此多核苷酸的变异体可以是天然发生的等位变异体或非天然发生的变异体。这些核苷酸变异体包括取代变异体、缺失变异体和插入变异体。如本领域所知的,等位变异体是一个多核苷酸的替换形式,它可能是一个或多个核苷酸的取代、缺失或插入,但不会从实质上改变其编码的多肽的功能。The invention also relates to variants of the above polynucleotides which encode fragments, analogs and derivatives of polypeptides or polypeptides having the same amino acid sequence as the invention. Variants of this polynucleotide may be naturally occurring allelic variants or non-naturally occurring variants. These nucleotide variants include substitution variants, deletion variants, and insertion variants. As is known in the art, an allelic variant is an alternative form of a polynucleotide that may be a substitution, deletion or insertion of one or more nucleotides, but does not substantially alter the function of the polypeptide encoded thereby. .
本发明还涉及与上述的序列杂交且两个序列之间具有至少50%,较佳地至少70%,更佳地至少80%相同性的多核苷酸。本发明特别涉及在严格条件(或严紧条件)下与本发明所述多核苷酸可杂交的多核苷酸。在本发明中,“严格条件”是指:(1)在较低离子强度和较高温度下的杂交和洗脱,如0.2×SSC,0.1%SDS,60℃;或(2)杂交时加有变性剂,如50%(v/v)甲酰胺,0.1%小牛血清/0.1%Ficoll,42℃等;或(3)仅在两条序列之间的相同性至少在90%以上,更好是95%以上时才发生杂交。并且,可杂交的多核苷酸编码的多肽与SEQ ID NO.:26、28、55、57、59、61、72、74、76、78、80、82、84、86、88、90、92、93、94或者95所示的成熟多肽有相同的生物学功能和活性。The invention also relates to polynucleotides which hybridize to the sequences described above and which have at least 50%, preferably at least 70%, more preferably at least 80% identity between the two sequences. The invention particularly relates to polynucleotides that hybridize to the polynucleotides of the invention under stringent conditions (or stringent conditions). In the present invention, "stringent conditions" means: (1) hybridization and elution at a lower ionic strength and higher temperature, such as 0.2 x SSC, 0.1% SDS, 60 ° C; or (2) hybridization a denaturing agent such as 50% (v/v) formamide, 0.1% calf serum / 0.1% Ficoll, 42 ° C, etc.; or (3) at least 90% identity between the two sequences, more It is good that hybridization occurs more than 95%. Furthermore, the polypeptide encoded by the hybridizable polynucleotide is SEQ ID NO.: 26, 28, 55, 57, 59, 61, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92 The mature polypeptides shown in 93, 94 or 95 have the same biological function and activity.
本发明还涉及与上述的序列杂交的核酸片段。如本文所用,“核酸片段”的长度至少含15个核苷酸,较好是至少30个核苷酸,更好是至少50个核苷酸,最好是至少100个核苷酸以上。核酸片段可用于核酸的扩增技术(如PCR)以确定和/或分离编码gGT29-7多肽或其衍生多肽,优选为gGT29、gGT29-3、gGT29-4、gGT29-5、gGT29-6、gGT29-8、gGT29-9、gGT29-10、gGT29-11、gGT29-12、gGT29-13、gGT29-14、gGT29-15、gGT29-16、gGT29-17、gGT29-18、gGT29-7-N343G、gGT29-7-A359P、或gGT29-7-N343G/A359P蛋白的多聚核苷酸。The invention also relates to nucleic acid fragments that hybridize to the sequences described above. As used herein, a "nucleic acid fragment" is at least 15 nucleotides in length, preferably at least 30 nucleotides, more preferably at least 50 nucleotides, and most preferably at least 100 nucleotides or more. Nucleic acid fragments can be used in nucleic acid amplification techniques (eg, PCR) to identify and/or isolate encoding gGT29-7 polypeptides or derived polypeptides thereof, preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29 -8, gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7-N343G, gGT29 Polynucleotide of -7-A359P, or gGT29-7-N343G/A359P protein.
本发明中的多肽和多核苷酸优选以分离的形式提供,更佳地被纯化至均质。The polypeptides and polynucleotides of the invention are preferably provided in isolated form, more preferably purified to homogeneity.
本发明的gGT29-7多肽或其衍生多肽,优选为gGT29、gGT29-3、gGT29-4、gGT29-5、gGT29-6、gGT29-8、gGT29-9、gGT29-10、gGT29-11、gGT29-12、gGT29-13、gGT29-14、gGT29-15、gGT29-16、gGT29-17、gGT29-18、gGT29-7-N343G、 gGT29-7-A359P或gGT29-7-N343G/A359P核苷酸全长序列或其片段通常可以用PCR扩增法、重组法或人工合成的方法获得。对于PCR扩增法,可根据本发明所公开的有关核苷酸序列,尤其是开放阅读框序列来设计引物,并用市售的cDNA库或按本领域技术人员已知的常规方法所制备的cDNA库作为模板,扩增而得有关序列。当序列较长时,常常需要进行两次或多次PCR扩增,然后再将各次扩增出的片段按正确次序拼接在一起。The gGT29-7 polypeptide of the present invention or a polypeptide derived therefrom, preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29- 12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7-N343G, The full-length nucleotide sequence of gGT29-7-A359P or gGT29-7-N343G/A359P or a fragment thereof can usually be obtained by a PCR amplification method, a recombinant method or a synthetic method. For PCR amplification, primers can be designed in accordance with the disclosed nucleotide sequences, particularly open reading frame sequences, and can be prepared using commercially available cDNA libraries or conventional methods known to those skilled in the art. The library is used as a template to amplify the relevant sequences. When the sequence is long, it is often necessary to perform two or more PCR amplifications, and then the amplified fragments are spliced together in the correct order.
一旦获得了有关的序列,就可以用重组法来大批量地获得有关序列。这通常是将其克隆入载体,再转入细胞,然后通过常规方法从增殖后的宿主细胞中分离得到有关序列。Once the relevant sequences are obtained, the recombinant sequence can be used to obtain the relevant sequences in large quantities. This is usually done by cloning it into a vector, transferring it to a cell, and then isolating the relevant sequence from the proliferated host cell by conventional methods.
此外,还可用人工合成的方法来合成有关序列,尤其是片段长度较短时。通常,通过先合成多个小片段,然后再进行连接可获得序列很长的片段。In addition, synthetic sequences can be used to synthesize related sequences, especially when the fragment length is short. Usually, a long sequence of fragments can be obtained by first synthesizing a plurality of small fragments and then performing the ligation.
目前,已经可以完全通过化学合成来得到编码本发明蛋白(或其片段,或其衍生物)的DNA序列。然后可将该DNA序列引入本领域中已知的各种现有的DNA分子(或如载体)和细胞中。此外,还可通过化学合成将突变引入本发明蛋白序列中。At present, it has been possible to obtain a DNA sequence encoding the protein of the present invention (or a fragment thereof, or a derivative thereof) completely by chemical synthesis. The DNA sequence can then be introduced into various existing DNA molecules (or vectors) and cells known in the art. In addition, mutations can also be introduced into the protein sequences of the invention by chemical synthesis.
应用PCR技术扩增DNA/RNA的方法被优选用于获得本发明的基因。特别是很难从文库中得到全长的cDNA时,可优选使用RACE法(RACE-cDNA末端快速扩增法),用于PCR的引物可根据本文所公开的本发明的序列信息适当地选择,并可用常规方法合成。可用常规方法如通过凝胶电泳分离和纯化扩增的DNA/RNA片段。A method of amplifying DNA/RNA using PCR technology is preferably used to obtain the gene of the present invention. Particularly, when it is difficult to obtain a full-length cDNA from a library, RACE method (RACE-cDNA end rapid amplification method) can be preferably used, and primers for PCR can be appropriately selected according to the sequence information of the present invention disclosed herein. And can be synthesized by conventional methods. The amplified DNA/RNA fragment can be isolated and purified by conventional methods such as by gel electrophoresis.
本发明也涉及包含本发明的多核苷酸的载体,以及用本发明的载体或gGT29-7多肽或其衍生多肽,优选为gGT29、gGT29-3、gGT29-4、gGT29-5、gGT29-6、gGT29-8、gGT29-9、gGT29-10、gGT29-11、gGT29-12、gGT29-13、gGT29-14、gGT29-15、gGT29-16、gGT29-17、gGT29-18、gGT29-7-N343G、gGT29-7-A359P或gGT29-7-N343G/A359P蛋白编码序列经基因工程产生的宿主细胞,以及经重组技术产生本发明所述多肽的方法。The invention also relates to a vector comprising a polynucleotide of the invention, and a vector of the invention or a gGT29-7 polypeptide or a derivative thereof, preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7-N343G, A host cell produced by genetic engineering of the gGT29-7-A359P or gGT29-7-N343G/A359P protein coding sequence, and a method of producing a polypeptide of the present invention by recombinant techniques.
通过常规的重组DNA技术,可利用本发明的多聚核苷酸序列可用来表达或生产重组的gGT29-7多肽或其衍生多肽,优选为gGT29、gGT29-3、gGT29-4、gGT29-5、gGT29-6、gGT29-8、gGT29-9、gGT29-10、gGT29-11、gGT29-12、gGT29-13、gGT29-14、gGT29-15、gGT29-16、gGT29-17、gGT29-18、gGT29-7-N343G、gGT29-7-A359P或gGT29-7-N343G/A359P多肽。一般来说有以下步骤:The polynucleotide sequence of the present invention can be used to express or produce a recombinant gGT29-7 polypeptide or a derivative thereof by conventional recombinant DNA techniques, preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29- 7-N343G, gGT29-7-A359P or gGT29-7-N343G/A359P polypeptide. Generally there are the following steps:
(1).用本发明的编码gGT29-7多肽或其衍生多肽,优选为gGT29、gGT29-3、gGT29-4、gGT29-5、gGT29-6、gGT29-8、gGT29-9、gGT29-10、gGT29-11、gGT29-12、gGT29-13、gGT29-14、gGT29-15、gGT29-16、gGT29-17、gGT29-18、gGT29-7-N343G、gGT29-7-A359P或gGT29-7-N343G/A359P多肽的多核苷酸(或变异体),或用含有该多核苷酸的重组表达载体转化或转导合适的宿主细胞;(1) using the gGT29-7 polypeptide of the present invention or a polypeptide derived therefrom, preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7-N343G, gGT29-7-A359P or gGT29-7-N343G/ A polynucleotide (or variant) of the A359P polypeptide, or a recombinant host vector comprising the polynucleotide, which is transformed or transduced into a suitable host cell;
(2).在合适的培养基中培养的宿主细胞;(2) a host cell cultured in a suitable medium;
(3).从培养基或细胞中分离、纯化蛋白质。(3). Separating and purifying the protein from the culture medium or the cells.
本发明中,gGT29-7多肽或其衍生多肽,优选为gGT29、gGT29-3、gGT29-4、gGT29-5、gGT29-6、gGT29-8、gGT29-9、gGT29-10、gGT29-11、gGT29-12、gGT29-13、gGT29-14、gGT29-15、gGT29-16、gGT29-17、gGT29-18、gGT29-7-N343G、gGT29-7-A359P或gGT29-7-N343G/A359P多核苷酸序列可插入到重组表达载体中。术语“重组表达载体”指本领域熟知的细菌质粒、噬菌体、酵母质粒、植物细胞病 毒、哺乳动物细胞病毒如腺病毒、逆转录病毒或其他载体。只要能在宿主体内复制和稳定,任何质粒和载体都可以用。表达载体的一个重要特征是通常含有复制起点、启动子、标记基因和翻译控制元件。In the present invention, the gGT29-7 polypeptide or a polypeptide derived therefrom is preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29. -12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7-N343G, gGT29-7-A359P or gGT29-7-N343G/A359P polynucleotide sequence It can be inserted into a recombinant expression vector. The term "recombinant expression vector" refers to bacterial plasmids, phage, yeast plasmids, plant cell diseases well known in the art. Toxic, mammalian cell viruses such as adenoviruses, retroviruses or other vectors. Any plasmid and vector can be used as long as it can replicate and stabilize in the host. An important feature of expression vectors is that they typically contain an origin of replication, a promoter, a marker gene, and a translational control element.
本领域的技术人员熟知的方法能用于构建含gGT29-7多肽或其衍生多肽,优选为gGT29、gGT29-3、gGT29-4、gGT29-5、gGT29-6、gGT29-8、gGT29-9、gGT29-10、gGT29-11、gGT29-12、gGT29-13、gGT29-14、gGT29-15、gGT29-16、gGT29-17、gGT29-18、gGT29-7-N343G、gGT29-7-A359P或gGT29-7-N343G/A359P编码DNA序列和合适的转录/翻译控制信号的表达载体。这些方法包括体外重组DNA技术、DNA合成技术、体内重组技术等。所述的DNA序列可有效连接到表达载体中的适当启动子上,以指导mRNA合成。这些启动子的代表性例子有:大肠杆菌的lac或trp启动子;λ噬菌体PL启动子;真核启动子包括CMV立即早期启动子、HSV胸苷激酶启动子、早期和晚期SV40启动子、反转录病毒的LTRs和其他一些已知的可控制基因在原核或真核细胞或其病毒中表达的启动子。表达载体还包括翻译起始用的核糖体结合位点和转录终止子。Methods well known to those skilled in the art can be used to construct a gGT29-7-containing polypeptide or a derivative thereof, preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7-N343G, gGT29-7-A359P or gGT29- 7-N343G/A359P An expression vector encoding a DNA sequence and a suitable transcription/translation control signal. These methods include in vitro recombinant DNA techniques, DNA synthesis techniques, in vivo recombinant techniques, and the like. The DNA sequence can be operably linked to an appropriate promoter in an expression vector to direct mRNA synthesis. Representative examples of such promoters are: lac or trp promoter of E. coli; lambda phage PL promoter; eukaryotic promoters include CMV immediate early promoter, HSV thymidine kinase promoter, early and late SV40 promoter, anti- Promoters for transcription of viral LTRs and other known controllable genes in prokaryotic or eukaryotic cells or their viruses. The expression vector also includes a ribosome binding site for translation initiation and a transcription terminator.
此外,表达载体优选地包含一个或多个选择性标记基因,以提供用于选择转化的宿主细胞的表型性状,如真核细胞培养用的二氢叶酸还原酶、新霉素抗性以及绿色荧光蛋白(GFP),或用于大肠杆菌的四环素或氨苄青霉素抗性。Furthermore, the expression vector preferably comprises one or more selectable marker genes to provide phenotypic traits for selection of transformed host cells, such as dihydrofolate reductase for eukaryotic cell culture, neomycin resistance, and green Fluorescent protein (GFP), or tetracycline or ampicillin resistance for E. coli.
包含上述的适当DNA序列以及适当启动子或者控制序列的载体,可以用于转化适当的宿主细胞,以使其能够表达蛋白质。Vectors comprising the appropriate DNA sequences described above, as well as appropriate promoters or control sequences, can be used to transform appropriate host cells to enable expression of the protein.
宿主细胞可以是原核细胞,如细菌细胞;或是低等真核细胞,如酵母细胞;或是高等真核细胞,如哺乳动物细胞。代表性例子有:大肠杆菌,链霉菌属;鼠伤寒沙门氏菌的细菌细胞;真菌细胞如酵母;植物细胞;果蝇S2或Sf9的昆虫细胞;CHO、COS、293细胞、或Bowes黑素瘤细胞的动物细胞等。The host cell can be a prokaryotic cell, such as a bacterial cell; or a lower eukaryotic cell, such as a yeast cell; or a higher eukaryotic cell, such as a mammalian cell. Representative examples are: Escherichia coli, Streptomyces; bacterial cells of Salmonella typhimurium; fungal cells such as yeast; plant cells; insect cells of Drosophila S2 or Sf9; CHO, COS, 293 cells, or Bowes melanoma cells Animal cells, etc.
本发明的多核苷酸在高等真核细胞中表达时,如果在载体中插入增强子序列时将会使转录得到增强。增强子是DNA的顺式作用因子,通常大约有10到300个碱基对,作用于启动子以增强基因的转录。可举的例子包括在复制起始点晚期一侧的100到270个碱基对的SV40增强子、在复制起始点晚期一侧的多瘤增强子以及腺病毒增强子等。When a polynucleotide of the present invention is expressed in higher eukaryotic cells, transcription will be enhanced if an enhancer sequence is inserted into the vector. An enhancer is a cis-acting factor of DNA, usually about 10 to 300 base pairs, acting on a promoter to enhance transcription of the gene. Usable examples include a 100 to 270 base pair SV40 enhancer on the late side of the replication initiation point, a polyoma enhancer on the late side of the replication initiation site, and an adenovirus enhancer.
本领域一般技术人员都清楚如何选择适当的载体、启动子、增强子和宿主细胞。It will be apparent to one of ordinary skill in the art how to select appropriate vectors, promoters, enhancers and host cells.
用重组DNA转化宿主细胞可用本领域技术人员熟知的常规技术进行。当宿主为原核生物如大肠杆菌时,能吸收DNA的感受态细胞可在指数生长期后收获,用CaCl2法处理,所用的步骤在本领域众所周知。另一种方法是使用MgCl2。如果需要,转化也可用电穿孔的方法进行。当宿主是真核生物,可选用如下的DNA转染方法:磷酸钙共沉淀法,常规机械方法如显微注射、电穿孔、脂质体包装等。Transformation of host cells with recombinant DNA can be carried out using conventional techniques well known to those skilled in the art. When the host is a prokaryote such as E. coli, competent cells capable of absorbing DNA can be harvested after the exponential growth phase and treated by the CaCl 2 method, and the procedures used are well known in the art. Another method is to use MgCl 2. Conversion can also be carried out by electroporation if desired. When the host is a eukaryote, the following DNA transfection methods can be used: calcium phosphate coprecipitation, conventional mechanical methods such as microinjection, electroporation, liposome packaging, and the like.
获得的转化子可以用常规方法培养,表达本发明的基因所编码的多肽。根据所用的宿主细胞,培养中所用的培养基可选自各种常规培养基。在适于宿主细胞生长的条件下进行培养。当宿主细胞生长到适当的细胞密度后,用合适的方法(如温度转换或化学诱导)诱导选择的启动子,将细胞再培养一段时间。The obtained transformant can be cultured by a conventional method to express the polypeptide encoded by the gene of the present invention. The medium used in the culture may be selected from various conventional media depending on the host cell used. The cultivation is carried out under conditions suitable for the growth of the host cell. After the host cell has grown to the appropriate cell density, the selected promoter is induced by a suitable method (such as temperature conversion or chemical induction) and the cells are cultured for a further period of time.
在上面的方法中的重组多肽可在细胞内、或在细胞膜上表达、或分泌到细胞外。如果需要,可利用其物理的、化学的和其它特性通过各种分离方法分离和纯化重组的蛋白。这些方法是本领域技术人员所熟知的。这些方法的例子包括但并 不限于:常规的复性处理、用蛋白沉淀剂处理(盐析方法)、离心、渗透破菌、超处理、超离心、分子筛层析(凝胶过滤)、吸附层析、离子交换层析、高效液相层析(HPLC)和其它各种液相层析技术及这些方法的结合。The recombinant polypeptide in the above method can be expressed intracellularly, or on the cell membrane, or secreted outside the cell. If desired, the recombinant protein can be isolated and purified by various separation methods using its physical, chemical, and other properties. These methods are well known to those skilled in the art. Examples of these methods include but Not limited to: conventional renaturation treatment, treatment with protein precipitant (salting method), centrifugation, osmotic bacteria, super treatment, ultracentrifugation, molecular sieve chromatography (gel filtration), adsorption chromatography, ion exchange chromatography, High performance liquid chromatography (HPLC) and various other liquid chromatography techniques and combinations of these methods.
应用application
本发明涉及的活性多肽或肽基转移酶gGT29-7多肽或其衍生多肽,优选为gGT29、gGT29-3、gGT29-4、gGT29-5、gGT29-6、gGT29-8、gGT29-9、gGT29-10、gGT29-11、gGT29-12、gGT29-13、gGT29-14、gGT29-15、gGT29-16、gGT29-17、gGT29-18、gGT29-7-N343G、gGT29-7-A359P或gGT29-7-N343G/A359P的用途包括(但不限于):特异和高效地将来自糖基供体的糖基转移到四环三萜类化合物的C-3位和C-6位的第一个糖基上以延伸糖链。特别是能够将人参皂苷Rh2转化为抗癌活性更优良的稀有人参皂苷Rg3;将人参皂苷F2转化为人参皂苷Rd;将人参皂苷Rh1转化为具有抗肿瘤和抗疲劳活性的稀有人参皂苷Rf;将人参皂苷Rh1转化具有神经保护作用和紫外线保护作用的稀有人参皂苷Rg2;本发明的糖基转移酶还能将人参皂苷Rh2,Rg3和Rd合成之前未见报道的新皂苷3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-PPD和(3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-CK。The active polypeptide or peptidyl transferase gGT29-7 polypeptide or derivative thereof according to the present invention is preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, gGT29-9, gGT29- 10. gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7-N343G, gGT29-7-A359P or gGT29-7- Uses of N343G/A359P include, but are not limited to, the specific and efficient transfer of a glycosyl group from a glycosyl donor to the first glycosyl group at the C-3 and C-6 positions of the tetracyclic triterpenoid To extend the sugar chain. In particular, ginsenoside Rh2 can be converted into rare ginsenoside Rg3 with better anticancer activity; ginsenoside F2 can be converted into ginsenoside Rd; ginsenoside Rh1 can be converted into rare ginsenoside Rf having antitumor and anti-fatigue activity; Ginsenoside Rh1 transforms rare ginsenoside Rg2 with neuroprotective effect and UV protection; the glycosyltransferase of the present invention can also synthesize ginsenoside Rh2, Rg3 and Rd into a novel saponin 3-O-β- (not previously reported) D-xylopyranosyl)-β-(D-glucopyranosyl)-PPD and (3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-CK.
所述的四环三萜化合物包括(但不限于):S构型或R构型的达玛烷型、羊毛脂烷型、甘遂烷型、环阿屯烷(环阿尔廷烷)型、apotirucallane型、葫芦烷、楝烷型等四环三萜类化合物。The tetracyclic triterpene compound includes, but is not limited to, a dammarane type, a lanolin type, a ganthanane type, a cycloalkane (cycloaltenane) type in the S configuration or the R configuration, a tetracyclic triterpenoid such as apotirucallane type, cucurbitane or decane type.
本发明提供了一种工业催化方法,包括:在提供糖基供体的条件下,用本发明的gGT29-7多肽或其衍生多肽,优选为gGT29、gGT29-3、gGT29-4、gGT29-5、gGT29-6、gGT29-8、gGT29-9、gGT29-10、gGT29-11、gGT29-12、gGT29-13、gGT29-14、gGT29-15、gGT29-16、gGT29-17、gGT29-18、gGT29-7-N343G、gGT29-7-A359P或gGT29-7-N343G/A359P活性多肽或肽基转移酶获得式(II)、(IV)或式(VI)化合物。具体是,所述的(A)反应中所用的多肽选自SEQ ID NO.:61所示多肽或其衍生多肽,优选为SEQ ID NO.:26、28、55、57、59、72、74、76、78、80、82、84、86、88、90、92、93、94或95所示氨基酸序列的活性多肽;所述(B)反应中所用的多肽选自SEQ ID NO.:55,57,59,61,78,82,92,94或者95;所述(C)反应中所用的多肽选自SEQ ID NO.:61所示多肽或其衍生多肽,优选为SEQ ID NO.:26、28、59、76、84、86或88。The present invention provides an industrial catalytic method comprising: using a gGT29-7 polypeptide of the present invention or a derivative thereof, preferably gGT29, gGT29-3, gGT29-4, gGT29-5, under conditions providing a glycosyl donor , gGT29-6, gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29 A compound of formula (II), (IV) or formula (VI) is obtained from -7-N343G, gGT29-7-A359P or gGT29-7-N343G/A359P active polypeptide or peptidyl transferase. Specifically, the polypeptide used in the (A) reaction is selected from the polypeptide represented by SEQ ID NO.: 61 or a polypeptide derived therefrom, preferably SEQ ID NO.: 26, 28, 55, 57, 59, 72, 74 An active polypeptide of the amino acid sequence shown in 76, 78, 80, 82, 84, 86, 88, 90, 92, 93, 94 or 95; the polypeptide used in the (B) reaction is selected from the group consisting of SEQ ID NO.: 55 , 57, 59, 61, 78, 82, 92, 94 or 95; the polypeptide used in the (C) reaction is selected from the polypeptide of SEQ ID NO.: 61 or a polypeptide derived therefrom, preferably SEQ ID NO.: 26, 28, 59, 76, 84, 86 or 88.
所述的糖基供体是核苷二磷酸糖,选自下组:UDP-葡萄糖,ADP-葡萄糖,TDP-葡萄糖,CDP-葡萄糖,GDP-葡萄糖,UDP-乙酰基葡萄糖,ADP-乙酰基葡萄糖,TDP-乙酰基葡萄糖,CDP-乙酰基葡萄糖,GDP-乙酰基葡萄糖,UDP-木糖,ADP-木糖,TDP-木糖,CDP-木糖,UDP-木糖,GDP-木糖,UDP-半乳糖醛酸,ADP-半乳糖醛酸,TDP-半乳糖醛酸,CDP-半乳糖醛酸,GDP-半乳糖醛酸,UDP-半乳糖,ADP-半乳糖,TDP-半乳糖,CDP-半乳糖,GDP-半乳糖,UDP-阿拉伯糖,ADP-阿拉伯糖,TDP-阿拉伯糖,CDP-阿拉伯糖,GDP-阿拉伯糖,UDP-鼠李糖,ADP-鼠李糖,TDP-鼠李糖,CDP-鼠李糖,GDP-鼠李糖,或其他核苷二磷酸己糖或核苷二磷酸戊糖,或其组合。The glycosyl donor is a nucleoside diphosphate sugar selected from the group consisting of UDP-glucose, ADP-glucose, TDP-glucose, CDP-glucose, GDP-glucose, UDP-acetylglucose, ADP-acetylglucose , TDP-acetylglucose, CDP-acetylglucose, GDP-acetylglucose, UDP-xylose, ADP-xylose, TDP-xylose, CDP-xylose, UDP-xylose, GDP-xylose, UDP -galacturonic acid, ADP-galacturonic acid, TDP-galacturonic acid, CDP-galacturonic acid, GDP-galacturonic acid, UDP-galactose, ADP-galactose, TDP-galactose, CDP - Galactose, GDP-galactose, UDP-arabinose, ADP-arabinose, TDP-arabinose, CDP-arabinose, GDP-arabinose, UDP-rhamnose, ADP-rhamnose, TDP-rham Sugar, CDP-rhamnose, GDP-rhamnose, or other nucleoside hexose phosphate or nucleoside pentose pentose, or a combination thereof.
所述的糖基供体优选是尿苷二磷酸糖,选自下组:UDP-葡萄糖,UDP-木糖,UDP-鼠李糖,UDP-半乳糖醛酸,UDP-半乳糖,UDP-阿拉伯糖,或其他尿苷二磷 酸己糖或尿苷二磷酸戊糖,或其组合。The glycosyl donor is preferably uridine diphosphate, selected from the group consisting of UDP-glucose, UDP-xylose, UDP-rhamnose, UDP-galacturonic acid, UDP-galactose, UDP-Arabic Sugar, or other uridine diphosphate Acid hexose or uridine pentose diphosphate, or a combination thereof.
在所述方法中,还可以添加酶活性添加物(提高酶活性或抑制酶活性的添加物)。所述酶活性的添加物可以选自下组:Ca2+、Co2+、Mn2+、Ba2+、Al3+、Ni2+、Zn2+、或Fe2+;或为可以生成Ca2+、Co2+、Mn2+、Ba2+、Al3+、Ni2+、Zn2+、或Fe2+的物质。In the method, an enzyme active additive (an additive that increases enzyme activity or inhibits enzyme activity) may also be added. The enzyme activity additive may be selected from the group consisting of Ca 2+ , Co 2+ , Mn 2+ , Ba 2+ , Al 3+ , Ni 2+ , Zn 2+ , or Fe 2+ ; a substance of Ca 2+ , Co 2+ , Mn 2+ , Ba 2+ , Al 3+ , Ni 2+ , Zn 2+ , or Fe 2+ .
所述方法的pH条件为:pH4.0-10.0,优选pH6.0-pH8.5,更优选8.5。The pH conditions of the process are: pH 4.0-10.0, preferably pH 6.0-pH 8.5, more preferably 8.5.
所述方法的温度条件为:10℃-105℃,优选25℃-35℃,更优选35℃。The temperature conditions of the process are from 10 ° C to 105 ° C, preferably from 25 ° C to 35 ° C, more preferably 35 ° C.
本发明还提供了一种组合物,它含有有效量的本发明的活性多肽或肽基转移酶gGT29-7多肽或其衍生多肽,优选为gGT29、gGT29-3、gGT29-4、gGT29-5、gGT29-6、gGT29-8、gGT29-9、gGT29-10、gGT29-11、gGT29-12、gGT29-13、gGT29-14、gGT29-15、gGT29-16、gGT29-17、gGT29-18、gGT29-7-N343G、gGT29-7-A359P或者gGT29-7-N343G/A359P,以及食品学上或工业上可接受的载体或赋形剂。这类载体包括(但并不限于):水、缓冲液、葡萄糖、水、甘油、乙醇、及其组合。The present invention also provides a composition comprising an effective amount of the active polypeptide or peptidyl transferase gGT29-7 polypeptide of the present invention or a polypeptide derived therefrom, preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29- 7-N343G, gGT29-7-A359P or gGT29-7-N343G/A359P, and a food or industrially acceptable carrier or excipient. Such carriers include, but are not limited to, water, buffer, dextrose, water, glycerol, ethanol, and combinations thereof.
所述的组合物中还可添加调节本发明糖基转移酶活性的物质。任何具有提高酶活性功能的物质均是可用的。较佳地,所述的提高本发明的糖基转移酶活性的物质选自巯基乙醇。此外,很多物质可以降低酶活性,包括但不限于:Ca2+、Co2+、Mn2+、Ba2+、Al3+、Ni2+、Zn2+和Fe2+;或在添加至底物后可水解形成Ca2+、Co2+、Mn2+、Ba2+、Al3+、Ni2+、Zn2+和Fe2+的物质。Substances which modulate the glycosyltransferase activity of the present invention may also be added to the composition. Any substance having a function of increasing the activity of the enzyme is available. Preferably, the substance which increases the glycosyltransferase activity of the present invention is selected from the group consisting of mercaptoethanol. In addition, many substances can reduce enzyme activity, including but not limited to: Ca 2+ , Co 2+ , Mn 2+ , Ba 2+ , Al 3+ , Ni 2+ , Zn 2+ , and Fe 2+ ; Substrate can be hydrolyzed to form Ca 2+ , Co 2+ , Mn 2+ , Ba 2+ , Al 3+ , Ni 2+ , Zn 2+ and Fe 2+ .
在获得了本发明的gGT29-7多肽或其衍生多肽,优选为gGT29、gGT29-3、gGT29-4、gGT29-5、gGT29-6、gGT29-8、gGT29-9、gGT29-10、gGT29-11、gGT29-12、gGT29-13、gGT29-14、gGT29-15、gGT29-16、gGT29-17、gGT29-18、gGT29-7-N343G、gGT29-7-A359P或gGT29-7-N343G/A359P后,本领域人员可以方便地应用该酶来发挥转糖基的作用,特别是对达玛稀二醇、原人参二醇和原人参三醇的转糖基作用。作为本发明的优选方式,还提供了二种形成稀有人参皂苷的方法,该方法之一包含:用本发明所述的gGT29-7多肽或其衍生多肽,优选为gGT29、gGT29-3、gGT29-4、gGT29-5、gGT29-6、gGT29-8、gGT29-9、gGT29-10、gGT29-11、gGT29-12、gGT29-13、gGT29-14、gGT29-15、gGT29-16、gGT29-17、gGT29-18、gGT29-7-N343G、gGT29-7-A359P或gGT29-7-N343G/A359P处理待转糖基的底物,所述的底物包括达玛稀二醇、原人参二醇和原人参三醇及其衍生物等四环三萜类化合物。较佳地,在pH3.5-10条件下,用所述的gGT29-7多肽或其衍生多肽,优选为gGT29、gGT29-3、gGT29-4、gGT29-5、gGT29-6、gGT29-8、gGT29-9、gGT29-10、gGT29-11、gGT29-12、gGT29-13、gGT29-14、gGT29-15、gGT29-16、gGT29-17、gGT29-18、gGT29-7-N343G或gGT29-7-A359P或gGT29-7-N343G/A359P酶处理待转糖基的底物。较佳地,在温度30-105℃条件下,用所述的gGT29-7多肽或其衍生多肽,优选为gGT29、gGT29-3、gGT29-4、gGT29-5、gGT29-6、gGT29-8、gGT29-9、gGT29-10、gGT29-11、gGT29-12、gGT29-13、gGT29-14、gGT29-15、gGT29-16、gGT29-17、gGT29-18、gGT29-7-N343G、gGT29-7-A359P、gGT29-7-N343G/A359P、gGT29-3酶处理待转糖基的底物。The gGT29-7 polypeptide of the present invention or a polypeptide derived therefrom, which is preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, gGT29-9, gGT29-10, gGT29-11, is obtained. , gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7-N343G, gGT29-7-A359P or gGT29-7-N343G/A359P, The enzyme can be conveniently applied by a person skilled in the art to function as a transglycosyl group, particularly for the transglycosylation of dammar diol, protopanaxadiol and protopanaxatriol. As a preferred mode of the present invention, there are also provided two methods for forming rare ginsenosides, one of which comprises: using the gGT29-7 polypeptide of the present invention or a derivative thereof, preferably gGT29, gGT29-3, gGT29- 4. gGT29-5, gGT29-6, gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7-N343G, gGT29-7-A359P or gGT29-7-N343G/A359P treats the substrate to be transglycosylated, said substrate comprising dammar diol, protopanaxadiol and protocorm a tetracyclic triterpenoid such as a triol or a derivative thereof. Preferably, the gGT29-7 polypeptide or a polypeptide derived therefrom, preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, is used under the condition of pH 3.5-10. gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7-N343G or gGT29-7- A359P or gGT29-7-N343G/A359P enzyme treats the substrate to be transglycosylated. Preferably, the gGT29-7 polypeptide or a polypeptide derived therefrom, preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, is used at a temperature of 30-105 °C. gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7-N343G, gGT29-7- A359P, gGT29-7-N343G/A359P, gGT29-3 enzymes treat the substrate to be transglycosylated.
该方法之二包含:将本发明所述的gGT29-7多肽或其衍生多肽,优选为gGT29、gGT29-3、gGT29-4、gGT29-5、gGT29-6、gGT29-8、gGT29-9、gGT29-10、gGT29-11、gGT29-12、gGT29-13、gGT29-14、gGT29-15、gGT29-16、gGT29-17、gGT29-18、gGT29-7-N343G、gGT29-7-A359P或gGT29-7-N343G/A359P基因转入可以合成人 参皂苷Rh2或者Rh1的工程菌(例如,酵母或大肠杆菌工程菌)中,或者,将gGT29-7多肽或其衍生多肽,优选为gGT29、gGT29-3、gGT29-4、gGT29-5、gGT29-6、gGT29-8、gGT29-9、gGT29-10、gGT29-11、gGT29-12、gGT29-13、gGT29-14、gGT29-15、gGT29-16、gGT29-17、gGT29-18、gGT29-7-N343G、gGT29-7-A359P、或gGT29-7-N343G/A359P基因与达玛稀二醇、原人参二醇和原人参三醇合成代谢途径中的关键基因以及四环三萜C-3和/或C-6位的糖基转移酶于宿主细胞(例如酵母细胞或大肠杆菌)中共表达,获得直接生产稀有人参皂苷Rg3或Rf的重组菌。或者,将gGT29-7多肽或其衍生多肽,优选为gGT29、gGT29-3、gGT29-4、gGT29-5、gGT29-6、gGT29-8、gGT29-9、gGT29-10、gGT29-11、gGT29-12、gGT29-13、gGT29-14、gGT29-15、gGT29-16、gGT29-17、gGT29-18、gGT29-7-N343G、或gGT29-7-A359P、或gGT29-7-N343G/A359P基因与达玛烯二醇和/或原人参二醇或原人参三醇合成代谢途径中的关键酶以及四环三萜C-6位的糖基转移酶以及合成UDP-鼠李糖的关键酶在宿主细胞中共表达,应用于构建人工合成稀有人参皂苷Rg2的重组菌株。The second method comprises: the gGT29-7 polypeptide of the present invention or a polypeptide derived therefrom, preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, gGT29-9, gGT29 -10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7-N343G, gGT29-7-A359P or gGT29-7 -N343G/A359P gene transfer can be synthesized In the engineered bacteria of saponin Rh2 or Rh1 (for example, yeast or Escherichia coli engineering bacteria), or the gGT29-7 polypeptide or a polypeptide derived therefrom, preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29- 6, gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7- N343G, gGT29-7-A359P, or gGT29-7-N343G/A359P genes and key genes in the anabolic pathway of dammar diol, protopanaxadiol and protosol ginseng, and tetracyclic triterpenoid C-3 and/or The glycosyltransferase at position C-6 is co-expressed in a host cell (e.g., yeast cell or Escherichia coli) to obtain a recombinant strain that directly produces rare ginsenoside Rg3 or Rf. Alternatively, the gGT29-7 polypeptide or a polypeptide derived therefrom, preferably gGT29, gGT29-3, gGT29-4, gGT29-5, gGT29-6, gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29- 12. gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29-18, gGT29-7-N343G, or gGT29-7-A359P, or gGT29-7-N343G/A359P gene Key enzymes in the anabolic pathway of methenediol and/or protopanaxadiol or protosol ginseng, and glycosyltransferases at the C-6 position of tetracyclic triterpenes and key enzymes for the synthesis of UDP-rhamnose in host cells Expression, applied to the construction of recombinant strains of synthetic ginsenoside Rg2.
所述的达玛稀二醇合成代谢途径中的关键基因包括(但不限于):达玛烯二醇合成酶基因。The key genes in the dammar diol anabolic pathway include, but are not limited to, the dammarene diol synthase gene.
在另一优选例中,所述的原人参二醇合成代谢途径中的关键基因包括(但不限于):达玛烯二醇合成酶基因、细胞色素P450 CYP716A47基因及其还原酶基因,或其组合。或者以上各种酶的同功酶及其组合。其中,达玛烯二醇合成酶将环氧角鲨烯(酿酒酵母自身合成)转化为达玛烯二醇,细胞色素P450 CYP716A47及其还原酶再将达玛烯二醇转化为原人参二醇。(Han et.al,plant&cell physiology,2011,52.2062-73)In another preferred embodiment, the key genes in the proto-ginsengdiol anabolic pathway include, but are not limited to, a dammarenediol synthase gene, a cytochrome P450 CYP716A47 gene, and a reductase gene thereof, or combination. Or isozymes of various enzymes and combinations thereof. Among them, dammarene diol synthase converts squalene (Saccharomyces cerevisiae self-synthesis) into dammarene diol, and cytochrome P450 CYP716A47 and its reductase convert dammarene diol into protocanthodiol . (Han et.al, plant&cell physiology, 2011, 52.2062-73)
在另一优选例中,所述的原人参三醇合成代谢途径中的关键基因包括(但不限于):达玛烯二醇合成酶基因、细胞色素P450 CYP716A47基因及其还原酶基因、及细胞色素P450 CYP716A53V2基因及其还原酶基因,或其组合。或者以上各种酶的同功酶及其组合。其中,达玛烯二醇合成酶将环氧角鲨烯(酿酒酵母自身合成)转化为达玛烯二醇,细胞色素P450 CYP716A47及其还原酶再将达玛烯二醇转化为原人参二醇,细胞色素P450 CYP716A53v2(JII036031)及其还原酶再进一步将原人参二醇转化为原人参三醇。(Han et.al,plant&cell physiology,2012,53.1535-45)In another preferred embodiment, the key genes in the proto-ginsolic triol anabolic pathway include, but are not limited to, a dammarenediol synthase gene, a cytochrome P450 CYP716A47 gene, a reductase gene thereof, and a cell. Pigment P450 CYP716A53V2 gene and its reductase gene, or a combination thereof. Or isozymes of various enzymes and combinations thereof. Among them, dammarene diol synthase converts squalene (Saccharomyces cerevisiae self-synthesis) into dammarene diol, and cytochrome P450 CYP716A47 and its reductase convert dammarene diol into protocanthodiol The cytochrome P450 CYP716A53v2 (JII036031) and its reductase further convert the original ginseng diol to the original ginseng triol. (Han et.al, plant & cell physiology, 2012, 53.1535-45)
在另一优选例中,所述的人参皂苷Rh2合成代谢途径中的关键基因包括(但不限于):达玛烯二醇合成酶基因、细胞色素P450 CYP716A47基因及其还原酶基因、四环三萜C-3位糖基转移酶UGTPg45或其组合。或者以上各种酶的同功酶及其组合。其中,达玛烯二醇合成酶将环氧角鲨烯(酿酒酵母自身合成)转化为达玛烯二醇,细胞色素P450 CYP716A47及其还原酶再将达玛烯二醇转化为原人参二醇,糖基转移酶UGTPg45可以进一步把原人参二醇转化为Rh2(Wang et.al,Metabolic Engineering,2015,29.97-105)。In another preferred embodiment, the key genes in the ginsenoside Rh2 anabolic pathway include, but are not limited to, a dammarene diol synthase gene, a cytochrome P450 CYP716A47 gene and a reductase gene thereof, and a tetracyclic three萜C-3 position glycosyltransferase UGTPg45 or a combination thereof. Or isozymes of various enzymes and combinations thereof. Among them, dammarene diol synthase converts squalene (Saccharomyces cerevisiae self-synthesis) into dammarene diol, and cytochrome P450 CYP716A47 and its reductase convert dammarene diol into protocanthodiol The glycosyltransferase UGTPg45 can further convert protoglycan diol to Rh2 (Wang et. al, Metabolic Engineering, 2015, 29.97-105).
在另一优选例中,所述的人参皂苷Rh1合成代谢途径中的关键基因包括(但不限于):达玛烯二醇合成酶基因、细胞色素P450 CYP716A47基因及其还原酶基因、及细胞色素P450 CYP716A53V2基因及其还原酶基因,及C-6糖基转移酶UGTPg100或其组合。或者以上各种酶的同功酶及其组合。其中,达玛烯二醇合成酶将环氧角鲨烯(酿酒酵母自身合成)转化为达玛烯二醇,细胞色素P450  CYP716A47及其还原酶再将达玛烯二醇转化为原人参二醇,细胞色素P450 CYP716A53v2(JII036031)及其还原酶再进一步将原人参二醇转化为原人参三醇,糖基转移酶UGTPg100可以进一步把原人参三醇转化为Rh1(Wei et.al,Molecular Plant,2015,15.doi:10.1016/j.molp.2015.05.010)。In another preferred embodiment, the key genes in the ginsenoside Rh1 anabolic pathway include, but are not limited to, a dammarene diol synthase gene, a cytochrome P450 CYP716A47 gene, a reductase gene thereof, and a cytochrome. P450 CYP716A53V2 gene and its reductase gene, and C-6 glycosyltransferase UGTPg100 or a combination thereof. Or isozymes of various enzymes and combinations thereof. Among them, dammarene diol synthase converts squalene (Saccharomyces cerevisiae self-synthesis) into dammarane diol, cytochrome P450 CYP716A47 and its reductase convert dammarene diol into proto-ginseng diol, cytochrome P450 CYP716A53v2 (JII036031) and its reductase to further convert proto-ginseng diol into proto-ginstriol, glycosyltransferase UGTPg100 The original ginseng triol was further converted to Rh1 (Wei et. al, Molecular Plant, 2015, 15. doi: 10.1016/j.molp. 2015.05.010).
本发明的主要优点:The main advantages of the invention:
(1)本发明的糖基转移酶可以特异性和高效地将来自糖基供体的糖基转移到四环三萜类化合物的C-3位和/或C-6位的第一个糖基上以延伸糖链(1) The glycosyltransferase of the present invention can specifically and efficiently transfer a glycosyl group derived from a glycosyl donor to the first sugar at the C-3 position and/or the C-6 position of the tetracyclic triterpenoid Extended sugar chain
(2)本发明的糖基转移酶特别能够分别将人参皂苷Rh2和Rh1转化为具有更好抗癌活性的稀有人参皂苷Rg3和具有抗肿瘤,抗疲劳的功效的稀有人参皂苷Rf以及具有神经保护中的作用和紫外线保护作用的稀有人参皂苷Rg2(2) The glycosyltransferase of the present invention is particularly capable of converting ginsenoside Rh2 and Rh1 into rare ginsenoside Rg3 having better anticancer activity, and ginsenoside Rf having antitumor and anti-fatigue effects, respectively, and having neuroprotection The role of ginsenoside Rg2 in the role of UV protection
(3)在酵母中构建了人参皂苷元(达玛稀二醇、原人参二醇和原人参三醇)的合成途径或稀有人参皂苷Rh2或Rh1的合成途径,从而实现以葡萄糖等单糖为底物,用酵母来发酵生产稀有人参皂苷Rg3和Rf。在酵母中构建了稀有人参皂苷Rh1的合成途径以及合成UDP-鼠李糖的途径,用酵母来发酵生产稀有人参皂苷Rg2。这不但可以解决皂苷生产的原料来源问题,而且可以大幅降低稀有皂苷Rg3、Rg2和Rf的生产成本。(3) A synthetic route of ginsenosides (Damadiol, Protopanaxadiol, and Protopanaxatriol) or a synthetic route of rare ginsenoside Rh2 or Rh1 was constructed in yeast to achieve a monosaccharide such as glucose. Fermentation with yeast to produce rare ginsenosides Rg3 and Rf. The synthetic route of rare ginsenoside Rh1 and the route of synthesizing UDP-rhamnose were constructed in yeast, and yeast was used to ferment and produce rare ginsenoside Rg2. This not only solves the problem of the source of raw materials for saponin production, but also greatly reduces the production cost of the rare saponins Rg3, Rg2 and Rf.
下面结合具体实施例,进一步阐述本发明。应理解,这些实施例仅用于说明本发明而不用于限制本发明的范围。下列实施例中未注明具体条件的实验方法,通常按照常规条件如Sambrook等人,分子克隆:实验室手册(New York:Cold Spring Harbor Laboratory Press,1989)中所述的条件,或按照制造厂商所建议的条件。The invention is further illustrated below in conjunction with specific embodiments. It is to be understood that the examples are not intended to limit the scope of the invention. The experimental methods in the following examples which do not specify the specific conditions are usually carried out according to the conditions described in conventional conditions such as Sambrook et al., Molecular Cloning: Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989), or according to the manufacturer. The suggested conditions.
实施例1 糖基转移酶及其编码基因的分离Example 1 Isolation of glycosyltransferase and its coding gene
从发表的人参属植物表达谱数据提取了100多条预测为糖基转移酶的cDNA序列,从中克隆了60条cDNA全长序列并对它们进行了表达及转糖基反应分析,其中有17种表达产物对人参皂苷元及皂苷具有转糖基活性。More than 100 cDNA sequences predicted to be glycosyltransferases were extracted from the published expression data of Panax species, and 60 full-length cDNA sequences were cloned and expressed and transglycosylated. 17 of them were analyzed. The expression product has transglycosylation activity on ginsenosides and saponins.
提取人参RNA并进行反转录,获得人参的cDNA。以该cDNA为模板进行PCR扩增,使用其中引物对1(SEQ ID NO.:7、8);引物对2(SEQ ID NO.:9、10);引物对3(SEQ ID NO.:11,12);引物对5(SEQ ID NO.:34、35);引物对7(SEQ ID NO.:46、47);引物对8(SEQ ID NO.:62、63);引物对9(SEQ ID NO.:64、65)全部获得扩增产物。DNA聚合酶选用宝生物工程有限公司的高保真的KOD DNA聚合酶。PCR产物经琼脂糖凝胶电泳检测(图1)。在紫外下照射,切下目标DNA条带。然后采用AIIygen Gel EIItraction Kit(AEYGEN公司)从琼脂糖凝胶中回收DNA即为扩增出的DNA片段。将此DNA片段用宝生物工程有限公司的rTaq DNA聚合酶在末端加A后与市售的克隆载体pMD18-T Vector连接,连接产物转化市售的大肠杆菌EPI300感受态细胞,将转化后的大肠杆菌菌液涂布在添加氨苄青霉素50ug/mL、IPTG 0.5mM、II-Gal 25μg/mL的LB平板上,并进一步通过PCR和酶切验证重组克隆。分别选取其中一个克隆提取重组质粒后进行测序。用BESTORF软件寻找开放阅读框(ORF)。通过序列比对,ORF编码了糖基转移酶第1家族保守功能域PSPG盒,表明是糖基转移酶基因。 Ginseng RNA was extracted and reverse transcribed to obtain cDNA of ginseng. PCR amplification was carried out using this cDNA as a template, using primer pair 1 (SEQ ID NO.: 7, 8); primer pair 2 (SEQ ID NO.: 9, 10); primer pair 3 (SEQ ID NO.: 11) , 12); Primer Pair 5 (SEQ ID NO.: 34, 35); Primer Pair 7 (SEQ ID NO.: 46, 47); Primer Pair 8 (SEQ ID NO.: 62, 63); Primer Pair 9 ( SEQ ID NO.: 64, 65) All obtained amplification products. DNA polymerase uses the high-fidelity KOD DNA polymerase from Biotech Engineering Co., Ltd. The PCR product was detected by agarose gel electrophoresis (Fig. 1). Irradiate in the ultraviolet and cut off the target DNA band. The amplified DNA fragment was then recovered from the agarose gel using an AIIygen Gel EIItraction Kit (AEYGEN). This DNA fragment was ligated with the commercially available cloning vector pMD18-T Vector after the end of A with the rTaq DNA polymerase of Biosciences Co., Ltd., and the ligated product was transformed into a commercially available E. coli EPI300 competent cell, and the transformed large intestine was transformed. The Bacillus solution was coated on an LB plate supplemented with ampicillin 50 ug/mL, IPTG 0.5 mM, II-Gal 25 μg/mL, and the recombinant clone was further verified by PCR and restriction enzyme digestion. One of the clones was selected to extract the recombinant plasmid and then sequenced. The open reading frame (ORF) was searched using the BESTORF software. By sequence alignment, the ORF encodes a PSPG cassette of the first functional conserved domain of the glycosyltransferase, indicating a glycosyltransferase gene.
用引物对5(SEQ ID NO.:34,35)获得基因具有SEQ ID NO.:25,27所示的核苷酸序列,分别命名为gGT29和gGT29-3。其中,相应的基因信息见表2。The gene obtained with primer pair 5 (SEQ ID NO.: 34, 35) has the nucleotide sequence shown by SEQ ID NO.: 25, 27, and is named gGT29 and gGT29-3, respectively. Among them, the corresponding genetic information is shown in Table 2.
表2Table 2
Figure PCTCN2015081111-appb-000009
Figure PCTCN2015081111-appb-000009
用引物对7(SEQ ID NO.:62、63)获得基因具有表3所示的gGT29-7衍生多肽的核苷酸序列,命名为gGT29-4、gGT29-5、gGT29-6、gGT29-8、gGT29-9、gGT29-10、gGT29-11、gGT29-12、gGT29-13、gGT29-14、gGT29-17和gGT29-18。其中相应的基因信息见表3。The nucleotide sequence of the gGT29-7-derived polypeptide having the gene shown in Table 3 was obtained using primer pair 7 (SEQ ID NO.: 62, 63) and designated as gGT29-4, gGT29-5, gGT29-6, gGT29-8. , gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-17 and gGT29-18. The corresponding genetic information is shown in Table 3.
表3table 3
Figure PCTCN2015081111-appb-000010
Figure PCTCN2015081111-appb-000010
Figure PCTCN2015081111-appb-000011
Figure PCTCN2015081111-appb-000011
用引物对8(SEQ ID NO.:64、65)获得基因具有表4所示的核苷酸序列,命名为gGT29-7、gGT29-15和gGT29-16。其中,相应的基因信息见表4。The gene obtained with primer pair 8 (SEQ ID NO.: 64, 65) had the nucleotide sequences shown in Table 4 and designated as gGT29-7, gGT29-15 and gGT29-16. Among them, the corresponding genetic information is shown in Table 4.
表4Table 4
Figure PCTCN2015081111-appb-000012
Figure PCTCN2015081111-appb-000012
所涉及的蛋白及其特性如5表所示:The proteins involved and their properties are shown in Table 5:
表5table 5
Figure PCTCN2015081111-appb-000013
Figure PCTCN2015081111-appb-000013
Figure PCTCN2015081111-appb-000014
Figure PCTCN2015081111-appb-000014
实施例2 糖基转移酶基因gGT29和gGT29-3的酵母重组表达载体的构建Example 2 Construction of a yeast recombinant expression vector for glycosyltransferase genes gGT29 and gGT29-3
分别以实施例1构建的含有gGT29和gGT29-3基因的质粒gGT29-pMD18T和gGT29-3-pMD18T为模板扩增目标基因。The target genes were amplified using the plasmids gGT29-pMD18T and gGT29-3-pMD18T containing the gGT29 and gGT29-3 genes constructed in Example 1 as templates.
gGT29所用正向引物均为(SEQ ID NO.:36),其5’端添加Kpn I识别位点:GGATCC;所用反向引物均为(SEQ ID NO.:37),其5’端添加IIhoI识别位点:CTCGAG,反向引物引入6-His Tag以便进行Western Blot检测表达和纯化。The forward primers used for gGT29 were (SEQ ID NO.: 36), and the Kpn I recognition site was added to the 5' end: GGATCC; the reverse primers used were all (SEQ ID NO.: 37), and the 5' end was added with IIhoI. Identification site: CTCGAG, reverse primer introduced 6-His Tag for Western Blot detection expression and purification.
gGT29-3所用正向引物均为(SEQ ID NO.:38),其5’端添加Kpn I识别位点:GGATCC;所用反向引物均为(SEQ ID NO.:39),其5’端添加IIhoI识别位点:CTCGAG,反向引物引入6-His Tag以便进行Western Blot检测表达和纯化。The forward primers used for gGT29-3 were (SEQ ID NO.: 38), and the Kpn I recognition site was added to the 5' end: GGATCC; the reverse primers used were all (SEQ ID NO.: 39), and the 5' end The IIhoI recognition site was added: CTCGAG, and the reverse primer introduced 6-His Tag for Western Blot detection expression and purification.
以质粒gGT29-pMD18T和gGT29-3-pMD18T为模板,利用上述引物通过PCR方法扩增gGT29和gGT29-3的基因。DNA聚合酶选用Toyobo公司的高保真DNA聚合酶kod,参考其说明书设定PCR程序:94℃ 2min;94℃ 15s,58℃ 30s,68℃1.5min,共30个循环;68℃ 10min;10℃保温。PCR产物经琼脂糖凝胶电泳检测,在紫外光下,切下与目标DNA大小一致的条带。然后采用AIIYGEN公司的AIIyPrep DNA Gel EIItraction Kit从琼脂糖凝胶中回收DNA片段。用Takara公司 的的QuickCut限制性内切酶Kpn I和IIba I双酶切回收的DNA片段30min,用AIIYGEN公司的AIIyPrep PCR Cleanup Kit对酶切产物进行清洁回收。利用NEB公司的T4 DNA连接酶将酶切产物与酿酒酵母表达质粒pYES2(同样以Kpn I和IIba I酶切并割胶回收)25℃连接2h。连接产物转化E.coli TOP 10感受态细胞,并涂布于添加100μg/mL氨苄青霉素的LB平板上。通过菌落PCR验证阳性转化子,并测序进一步验证表达质粒gGT29-pYES2和gGT29-3-pYES2构建成功。Using the plasmids gGT29-pMD18T and gGT29-3-pMD18T as templates, the genes of gGT29 and gGT29-3 were amplified by PCR using the above primers. DNA polymerase was selected from Toyobo's high-fidelity DNA polymerase kod, and the PCR program was set up according to the instructions: 94 ° C for 2 min; 94 ° C for 15 s, 58 ° C for 30 s, 68 ° C for 1.5 min, for a total of 30 cycles; 68 ° C for 10 min; 10 ° C Keep warm. The PCR product was detected by agarose gel electrophoresis, and the band of the same size as the target DNA was cut under ultraviolet light. The DNA fragment was then recovered from the agarose gel using AIIYGEN's AIIyPrep DNA Gel EIItraction Kit. Using Takara The DNA fragments recovered by double-cutting of the QuickCut restriction enzymes Kpn I and IIba I were used for 30 min, and the digested products were cleaned and recovered using AIIYGEN's AIIyPrep PCR Cleanup Kit. The digested product was ligated with S. cerevisiae expression plasmid pYES2 (also cut with Kpn I and IIba I and tapped and recovered) using NEB T4 DNA ligase for 2 h at 25 °C. The ligation product was transformed into E. coli TOP 10 competent cells and plated on LB plates supplemented with 100 μg/mL ampicillin. Positive transformants were verified by colony PCR and sequenced to further verify that the expression plasmids gGT29-pYES2 and gGT29-3-pYES2 were successfully constructed.
实施例3 糖基转移酶基因gGT29和gGT29-3在酿酒酵母中的表达Example 3 Expression of glycosyltransferase genes gGT29 and gGT29-3 in Saccharomyces cerevisiae
通过电转化方法将构建好的表达质粒gGT29-pYES2和gGT29-3-pYES2转化到酿酒酵母(Saccharomyces cerevisiae)中,涂布于筛选平板SC-Ura(0.67%酵母无氨基酸基本氮源,2%葡萄糖)。通过菌落PCR验证酵母重组子。挑酵母重组子菌落于10mL SC-Ura(2%葡萄糖)培养基中,30℃ 200rpm培养20h。4℃ 3500g离心收集菌体,用无菌去离子水清洗菌体两次,用诱导培养基SC-Ura(2%半乳糖)重悬菌体,并接种到50mL诱导培养基中,使OD600在0.4左右,30℃ 200rpm开始诱导表达。4℃ 3500g离心收集诱导表达12h的菌体,用无菌去离子水清洗菌体两次,重悬于酵母裂解缓冲液中,使OD600在50-100之间。用Fastprep细胞破碎仪震荡破碎酵母细胞,4℃ 12000g离心10min去除细胞碎片,收集细胞裂解液上清。取适量裂解液上清进行SDS-PAGE电泳检测,与pYES2空载体的重组子相比,gGT29-pYES2和gGT29-3-pYES2重组子没有明显的条带表征(图2)。采用anti-6-His Tag Western Blot检测表达情况,表达gGT29和gGT29-3的酿酒酵母显示很强的Western Blot信号,表明gGT29和gGT29-3在酵母中均可溶表达,而转pYES2空载体的重组子则没有anti-6-His Tag Western Blot信号(图3)。The constructed expression plasmids gGT29-pYES2 and gGT29-3-pYES2 were transformed into Saccharomyces cerevisiae by electroporation and plated on SC-Ura (0.67% yeast amino acid-free basic nitrogen source, 2% glucose). ). Yeast recombinants were verified by colony PCR. Yeast recombinant colonies were picked up in 10 mL of SC-Ura (2% glucose) medium and incubated at 30 ° C for 200 h at 200 rpm. The cells were collected by centrifugation at 3500 g at 4 ° C, the cells were washed twice with sterile deionized water, and the cells were resuspended in induction medium SC-Ura (2% galactose) and inoculated into 50 mL of induction medium to make OD 600. At about 0.4, induction was initiated at 30 ° C at 200 rpm. The cells induced to express for 12 hours were collected by centrifugation at 3500 g at 4 ° C, and the cells were washed twice with sterile deionized water and resuspended in yeast lysis buffer to give an OD 600 between 50 and 100. The yeast cells were disrupted by shaking with a Fastprep cell disrupter, and the cell debris was removed by centrifugation at 12000 g for 10 min at 4 ° C, and the supernatant of the cell lysate was collected. The appropriate amount of lysate supernatant was subjected to SDS-PAGE electrophoresis. Compared with the pYES2 empty vector recombinant, the gGT29-pYES2 and gGT29-3-pYES2 recombinants showed no significant banding (Fig. 2). The expression was detected by anti-6-His Tag Western Blot. Saccharomyces cerevisiae expressing gGT29 and gGT29-3 showed strong Western Blot signal, indicating that both gGT29 and gGT29-3 were soluble in yeast, and transferred to pYES2 empty vector. The recombinant did not have an anti-6-His Tag Western Blot signal (Figure 3).
实施例4 酵母表达产物gGT29和gGT29-3转糖基反应和产物鉴定Example 4 Transfection reaction and product identification of yeast expression products gGT29 and gGT29-3
以表达gGT29和gGT29-3的重组酵母裂解上清做为酶液来催化人参皂苷Rh2和F2的转糖基反应,表达空载体的重组酵母裂解上清为对照。100μL反应体系如表3所示。反应在35℃下进行12h,然后加入100μL丁醇终止反应,并抽提产物。产物真空干燥后,用甲醇溶解。The recombinant yeast cleavage supernatant expressing gGT29 and gGT29-3 was used as an enzyme solution to catalyze the transglycosylation reaction of ginsenoside Rh2 and F2, and the recombinant yeast cleavage supernatant expressing empty vector was used as a control. The 100 μL reaction system is shown in Table 3. The reaction was carried out at 35 ° C for 12 h, then 100 μL of butanol was added to terminate the reaction, and the product was extracted. The product was dried under vacuum and dissolved in methanol.
反应产物先用薄层层析(TLC)进行检测,表达gGT29和gGT29-3的酵母宿主裂解上清酶液可以在人参皂苷Rh2和F2的3位糖基再延伸一个糖基,转化为人参皂苷Rg3和Rd(图4)。gGT29和gGT29-3的催化活性不受人参皂苷20位糖基或者羟基构型的影响,可以将20(R)-Rh2转化为20(R)-Rg3(图6)。The reaction product was first detected by thin layer chromatography (TLC), and the yeast host cleavage supernatant solution expressing gGT29 and gGT29-3 can be further extended into a glycosyl group at the 3-position glycosyl group of ginsenoside Rh2 and F2 to be converted into ginsenoside. Rg3 and Rd (Figure 4). The catalytic activity of gGT29 and gGT29-3 is not affected by the glycosylation or hydroxyl configuration of ginsenoside 20, and 20(R)-Rh2 can be converted to 20(R)-Rg3 (Fig. 6).
实施例5 糖基转移酶BvUGT73C10/UGTPg45和gGT29的联合转糖基反应和产物鉴定Example 5 Combined transglycosylation reaction and product identification of glycosyltransferase BvUGT73C10/UGTPg45 and gGT29
以表达BvUGT73C10(JQ291613)或UGTPg45(KM401918)的大肠杆菌宿主裂解上清和以表达gGT29的酵母宿主裂解上清做为酶液来共同催化原人参二醇(PPD)。100μL反应体系如表3所示。73.4μL的酶液中40μL为BvUGT73C10的大肠宿主裂解上清,剩余33.4μL为表达gGT29的酵母宿主裂解上清。反应在35℃下进行12h,然后加入100μL丁醇终止反应,并抽提产物。产物真空干燥后,用甲醇溶解。反应产物先用薄层层析(TLC)进行检测(图5),可以看到糖基转移酶BvUGT73C10和gGT29或者UGTPg45和gGT29联合使用都可将PPD转化为Rg3。 The E. coli host cleavage supernatant expressing BvUGT73C10 (JQ291613) or UGTPg45 (KM401918) and the yeast host cleavage supernatant expressing gGT29 were used as an enzyme solution to co-catalyze the original ginseng diol (PPD). The 100 μL reaction system is shown in Table 3. 40 μL of the 73.4 μL enzyme solution was the large intestine host lysis supernatant of BvUGT73C10, and the remaining 33.4 μL was the yeast host cleavage supernatant expressing gGT29. The reaction was carried out at 35 ° C for 12 h, then 100 μL of butanol was added to terminate the reaction, and the product was extracted. The product was dried under vacuum and dissolved in methanol. The reaction product was first detected by thin layer chromatography (TLC) (Fig. 5). It can be seen that the combination of glycosyltransferase BvUGT73C10 and gGT29 or UGTPg45 and gGT29 can convert PPD to Rg3.
糖基转移酶BvUGT73C10和gGT29或者3GT2和gGT29联合使用都可以催化20(R)-PPD,生成20(R)-Rg3(图6)。Glycosyltransferase BvUGT73C10 and gGT29 or a combination of 3GT2 and gGT29 can catalyze 20(R)-PPD to form 20(R)-Rg3 (Figure 6).
实施例6 产Rg3酵母工程菌的构建与产物鉴定Example 6 Construction and Product Identification of Rg3 Yeast Engineering Bacteria
6.1在pESC-HIS质粒((Stratagene,Agilent)上,同时组装达玛烯二醇合成酶(Dammarenediol synthase)(ACZ71036.1)(GAL1/GAL10GAL10侧启动子,ADH1终止子)、细胞色素P450 CYP716A47(AEY75213.1)(FBA1启动子,CYC1终止子)以及糖基转移酶UGTPg45和gGT29(GAL1/GAL10GAL1侧启动子,TDH2终止子),构成游离型质粒,转化酿酒酵母BY4742,并将拟南芥来源的细胞色素P450还原酶ATR2-1(NP_849472.2)整合于酿酒酵母BY4742染色体中染色体trp1基因位点(GAL1启动子,利用trp1原有终止子),构建了重组酵母A2。重组酵母菌需补加的相应氨基酸(0.01%色氨酸,0.01%亮氨酸,0.01%赖氨酸)。6.1 On the pESC-HIS plasmid ((Stratagene, Agilent), simultaneously assemble Dammarenediol synthase (ACZ71036.1) (GAL1/GAL10GAL10 side promoter, ADH1 terminator), cytochrome P450 CYP716A47 ( AEY75213.1) (FBA1 promoter, CYC1 terminator) and glycosyltransferases UGTPg45 and gGT29 (GAL1/GAL10GAL1 side promoter, TDH2 terminator), constitute an episomal plasmid, transform S. cerevisiae BY4742, and source Arabidopsis thaliana The cytochrome P450 reductase ATR2-1 (NP_849472.2) was integrated into the chromosome trp1 gene locus in the S. cerevisiae BY4742 chromosome (GAL1 promoter, using trp1 original terminator) to construct recombinant yeast A2. Recombinant yeast needs to be supplemented The corresponding amino acid was added (0.01% tryptophan, 0.01% leucine, 0.01% lysine).
将重组酵母A2裂解液转移到2mL EP管中,每个管装1mL,加入等体积(1mL)的正丁醇抽提约30min后12000g离心10min。吸取上清至一新的EP管中。45℃并真空条件下使正丁醇蒸干。用100μL甲醇溶解后用于HPLC检测。The recombinant yeast A2 lysate was transferred to a 2 mL EP tube, 1 mL each tube, and an equal volume (1 mL) of n-butanol was added for about 30 min and then centrifuged at 12000 g for 10 min. Pipette the supernatant into a new EP tube. The n-butanol was evaporated to dryness at 45 ° C under vacuum. It was dissolved in 100 μL of methanol and used for HPLC detection.
通过HPLC和LC-MS分析,重组酵母A2的细胞裂解液中含有达玛烯二醇、原人参二醇(PPD)和人参皂苷活性代谢物Rg3(图8和图9)。The cell lysate of recombinant yeast A2 contained dammarene diol, protopanaxadiol (PPD) and ginsenoside active metabolite Rg3 (Figures 8 and 9) by HPLC and LC-MS analysis.
6.2方法同6.1,区别在于用糖基转移酶BvUGT73C10代替UGTPg45,得到重组酵母A6。通过HPLC分析,重组酵母A6的细胞裂解液中也含有达玛烯二醇、原人参二醇(PPD)及人参皂苷活性代谢物Rg3。The method of 6.2 is the same as 6.1, except that the glycosyltransferase BvUGT73C10 is used instead of UGTPg45 to obtain recombinant yeast A6. The cell lysate of recombinant yeast A6 also contained dammarene diol, protopanaxadiol (PPD) and ginsenoside active metabolite Rg3 by HPLC analysis.
实施例7 糖基转移酶基因gGT29-4,gGT29-5,gGT29-6和gGT29-7的大肠杆菌重组表达载体的构建Example 7 Construction of Escherichia coli recombinant expression vector for glycosyltransferase genes gGT29-4, gGT29-5, gGT29-6 and gGT29-7
以实施例1构建的含有gGT29-4,gGT29-5,gGT29-6和gGT29-7基因的质粒gGT29-4-pMD18T,gGT29-5-pMD18T,gGT29-6-pMD18T和gGT29-7-pMD18T为模板扩增目标基因。The plasmids gGT29-4-pMD18T, gGT29-5-pMD18T, gGT29-6-pMD18T and gGT29-7-pMD18T containing the gGT29-4, gGT29-5, gGT29-6 and gGT29-7 genes constructed in Example 1 were used as templates. Amplify the target gene.
gGT29-5和gGT29-6基因所用的正向引物如SEQ ID NO.:66所示,其5’端添加了与载体pET28a同源的序列:CTGGTGCCGCGCGGCAGC;所用反向引物为如SEQ ID NO.:68所示,其5’端添加了与载体pET28a同源的序列:TGCGGCCGCAAGCTTGTC。The forward primer used for the gGT29-5 and gGT29-6 genes is shown in SEQ ID NO.: 66, and the sequence homologous to the vector pET28a is added to the 5' end: CTGGTGCCGCGCGGCAGC; the reverse primer used is SEQ ID NO.: As shown at 68, a sequence homologous to the vector pET28a was added to the 5' end: TGCGGCCGCAAGCTTGTC.
gGT29-4和gGT29-7基因所用正向引物为SEQ ID NO.:67,其5’端添加了与载体pET28a同源的序列:CTGGTGCCGCGCGGCAGC;所用反向引物为SEQ ID NO.:68,其5’端添加了与载体pET28a同源的18个碱基片段:TGCGGCCGCAAGCTTGTC。The forward primer used for the gGT29-4 and gGT29-7 genes is SEQ ID NO.: 67, and the sequence homologous to the vector pET28a is added to the 5' end: CTGGTGCCGCGCGGCAGC; the reverse primer used is SEQ ID NO.: 68, which is 5 The 18-base fragment homologous to the vector pET28a was added to the 'end: TGCGGGCCGAGAGCTTGTC.
利用上述引物通过PCR方法扩增gGT29-4,gGT29-5,gGT29-6和gGT29-7基因。扩增基因选用NEB公司的Q5高保真DNA聚合酶,参考其说明书设定PCR程序:98℃ 30s;98℃ 15s,58℃ 30s,72℃ 1min,共35个循环;72℃ 2min;10℃保温。The gGT29-4, gGT29-5, gGT29-6 and gGT29-7 genes were amplified by a PCR method using the above primers. The amplified gene was selected from NEB's Q5 high-fidelity DNA polymerase. The PCR program was set up according to the instructions: 98 ° C 30 s; 98 ° C 15 s, 58 ° C 30 s, 72 ° C 1 min, a total of 35 cycles; 72 ° C 2 min; 10 ° C insulation .
同时,使用SEQ ID NO.:69和SEQ ID NO.:70分别作为正向和反向引物扩增载体pET28a,获得线性化的载体pET28a。扩增pET28a线性化载体也选用NEB公司的Q5高保真DNA聚合酶,参考其说明书设定PCR程序:98℃ 30s;98℃ 15s,58℃ 30s,72℃ 3min,共35个循环;72℃ 2min;10℃保温。 Meanwhile, the linearized vector pET28a was obtained using SEQ ID NO.: 69 and SEQ ID NO.: 70 as the forward and reverse primer amplification vectors pET28a, respectively. The linearized vector of amplified pET28a was also selected from NEB's Q5 high-fidelity DNA polymerase. The PCR procedure was set up according to the instructions: 98 ° C for 30 s; 98 ° C for 15 s, 58 ° C for 30 s, 72 ° C for 3 min for 35 cycles; 72 ° C for 2 min. ; 10 ° C insulation.
上述gGT29-4,gGT29-5,gGT29-6和gGT29-7基因PCR产物以及线性化的载体pET28a,经琼脂糖凝胶电泳检测后,在紫外光下切下与目标DNA大小一致的条带。然后采用AXYGEN公司的AxyPrep DNA Gel Extraction Kit从琼脂糖凝胶中回收DNA片段。参考诺晶生物科技有限公司的BGclonart无缝克隆试剂盒说明书,将回收的线性化的pET28a载体片段,回收的gGT29-4,gGT29-5,gGT29-6和gGT29-7基因片段及诺晶生物科技有限公司的BGclonart无缝克隆反应液以适当比例进行混合,共20μl。混匀后在50℃孵育30分种,然后将混合反应液转移到冰上。使用5μl反应液转化E.coli EPI300感受态细胞,并涂布于添加50μg/mL卡那霉素的LB平板上。通过菌落PCR验证阳性转化子,并测序进一步验证表达质粒gGT29-4-pET28a,gGT29-5-pET28a,gGT29-6-pET28a和gGT29-7-pET28a构建成功。The gGT29-4, gGT29-5, gGT29-6 and gGT29-7 gene PCR products and the linearized vector pET28a were detected by agarose gel electrophoresis, and the bands corresponding to the target DNA size were cut under ultraviolet light. The DNA fragment was then recovered from the agarose gel using Axygen's AxyPrep DNA Gel Extraction Kit. Refer to the BGclonart Seamless Cloning Kit Instructions of Nuojing Biotechnology Co., Ltd., the linearized pET28a vector fragment recovered, the recovered gGT29-4, gGT29-5, gGT29-6 and gGT29-7 gene fragments and Nuojing Biotechnology The BGclonart seamless cloning reaction solution of the company was mixed in an appropriate ratio for a total of 20 μl. After mixing, the mixture was incubated at 50 ° C for 30 minutes, and then the mixed reaction solution was transferred to ice. E. coli EPI300 competent cells were transformed with 5 μl of the reaction solution and plated on LB plates supplemented with 50 μg/mL kanamycin. Positive transformants were verified by colony PCR and sequenced to further verify that the expression plasmids gGT29-4-pET28a, gGT29-5-pET28a, gGT29-6-pET28a and gGT29-7-pET28a were successfully constructed.
实施例8 糖基转移酶基因gGT29-4,gGT29-5,gGT29-6和gGT29-7在大肠杆菌中的表达Example 8 Expression of glycosyltransferase genes gGT29-4, gGT29-5, gGT29-6 and gGT29-7 in Escherichia coli
以实施例19构建的大肠杆菌表达载体gGT29-4-pET28a,gGT29-5-pET28a,gGT29-6-pET28a和gGT29-7-pET28a转化到市售的E.coli BL21中。接种一个重组子到LB培养基中,30℃ 200rpm培养至OD600约0.6-0.8,使菌液降温至4℃,加入终浓度为50μM的IPTG,18℃ 200rpm诱导表达15h。4℃离心收集菌体,超声破碎细胞,4℃ 12000g离心收集细胞裂解液上清,取样品进行SDS-PAGE电泳(图10)。gGT29-4-pET28a,gGT29-5-pET28a,gGT29-6-pET28a和gGT29-7-pET28a重组子裂解液和总蛋白和上清中都有明显的目的蛋白条带(大约50KD),分别表征糖基转移酶gGT29-4,gGT29-5,gGT29-6和gGT29-7。从Western Blot的结果看(图11),也证明目标蛋白gGT29-4,gGT29-5,gGT29-6和gGT29-7在宿主中实现了可溶表达。The E. coli expression vectors gGT29-4-pET28a, gGT29-5-pET28a, gGT29-6-pET28a and gGT29-7-pET28a constructed in Example 19 were transformed into commercially available E. coli BL21. One recombinant was inoculated into LB medium, cultured at 30 ° C, 200 rpm to an OD 600 of about 0.6-0.8, the bacterial solution was cooled to 4 ° C, IPTG was added to a final concentration of 50 μM, and expression was induced for 15 h at 18 ° C at 200 rpm. The cells were collected by centrifugation at 4 ° C, and the cells were sonicated, and the supernatant of the cell lysate was collected by centrifugation at 12,000 g at 4 ° C, and the sample was taken for SDS-PAGE electrophoresis ( FIG. 10 ). gGT29-4-pET28a, gGT29-5-pET28a, gGT29-6-pET28a, gGT29-7-pET28a and gGT29-7-pET28a recombinant lysates and total protein and supernatant all have distinct protein bands (approximately 50KD), respectively representing sugar Base transferases gGT29-4, gGT29-5, gGT29-6 and gGT29-7. From the results of Western Blot (Fig. 11), it was also confirmed that the target proteins gGT29-4, gGT29-5, gGT29-6 and gGT29-7 achieved soluble expression in the host.
实施例9 大肠杆菌表达产物gGT29-4,gGT29-5,gGT29-6和gGT29-7转糖基反应和产物的鉴定Example 9 Identification of E. coli expression products gGT29-4, gGT29-5, gGT29-6 and gGT29-7 transglycosylation and products
以表达gGT29-4,gGT29-5,gGT29-6和gGT29-7的重组酵母裂解上清做为酶液来催化人参皂苷Rh2和F2的转糖基反应。100μL反应体系如表6所示。反应在35℃下进行12h,然后加入100μL丁醇终止反应,并抽提产物。产物真空干燥后,用甲醇溶解。The recombinant yeast cleavage supernatant expressing gGT29-4, gGT29-5, gGT29-6 and gGT29-7 was used as an enzyme solution to catalyze the transglycosylation reaction of ginsenoside Rh2 and F2. The 100 μL reaction system is shown in Table 6. The reaction was carried out at 35 ° C for 12 h, then 100 μL of butanol was added to terminate the reaction, and the product was extracted. The product was dried under vacuum and dissolved in methanol.
表6Table 6
9%吐温209% Tween 20 11.1μL11.1μL
50mM UDP-葡萄糖50mM UDP-glucose 10μL10μL
1M Tris-HCl pH8.51M Tris-HCl pH 8.5 5μL5μL
100mM底物(乙醇溶解)100 mM substrate (ethanol dissolution) 0.5μL0.5μL
酶液Enzyme solution 73.4μL73.4μL
反应产物用薄层层析(TLC)进行检测,gGT29-6的粗酶液可以在人参皂苷Rh2和F2的3位糖基上再延伸一个糖基,分别生成为人参皂苷Rg3和Rd(图12);gGT29-4,gGT29-5和gGT29-7的粗酶液可以在人参皂苷F2的3位糖基再延伸一个糖基生成皂苷Rd,但是它们不能催化皂苷Rh2(图12)。gGT29-4,gGT29-5,gGT29-6和gGT29-7的粗酶液还可以在原人参三醇型皂苷Rh1的C-6位糖基上再延伸一个 糖基,形成人参皂苷Rf(图13),其中,gGT29-4,gGT29-5和gGT29-6的活性比较弱,而gGT29-7则具有比较强的活性(表7)。The reaction product was detected by thin layer chromatography (TLC). The crude enzyme solution of gGT29-6 can further extend a sugar group on the 3-position glycosyl group of ginsenoside Rh2 and F2 to form ginsenoside Rg3 and Rd, respectively (Fig. 12 The crude enzyme solution of gGT29-4, gGT29-5 and gGT29-7 can further extend a glycosyl group at the 3-position glycosyl group of ginsenoside F2 to form saponin Rd, but they cannot catalyze the saponin Rh2 (Fig. 12). The crude enzyme solution of gGT29-4, gGT29-5, gGT29-6 and gGT29-7 can also be extended on the C-6 glycosyl group of the original ginseng triol type saponin Rh1. Glycosyl group formed ginsenoside Rf (Fig. 13) in which gGT29-4, gGT29-5 and gGT29-6 were relatively weak, while gGT29-7 had relatively strong activity (Table 7).
实施例10 糖基转移酶基因gGT29-8、gGT29-9、gGT29-10、gGT29-11、gGT29-12、gGT29-13、gGT29-14、gGT29-15、gGT29-16、gGT29-17、gGT29-18大肠杆菌重组表达载体的构建、在大肠杆菌中的表达以及产物鉴定Example 10 Glycosyltransferase genes gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, gGT29- Construction of 18 E. coli recombinant expression vector, expression in E. coli and identification of products
方法同实施例7-8,构建了gGT29-8、gGT29-9、gGT29-10、gGT29-11、gGT29-12、gGT29-13、gGT29-14、gGT29-15、gGT29-16、gGT29-17、gGT29-18的大肠杆菌表达载体(gGT29-8-pET28a,gGT29-9-pET28a,gGT29-10-pET28a,gGT29-11-pET28a,gGT29-12-pET28a,gGT29-13-pET28a,gGT29-14-pET28a,gGT29-15-pET28a,gGT29-16-pET28a,gGT29-17-pET28a,gGT29-18-pET28a),并在大肠杆菌中实现可溶性表达。根据实施例9所示方法的鉴定,其产物与其他gGT29系列糖基转移酶对底物不同位置(C3或C6)第一个糖基的延伸活性见表7,其中,“+"表示在该位置有活性,“++"表示在该位置有比较强的活性。Methods were the same as those in Examples 7-8, and gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-14, gGT29-15, gGT29-16, gGT29-17, E. coli expression vector for gGT29-18 (gGT29-8-pET28a, gGT29-9-pET28a, gGT29-10-pET28a, gGT29-11-pET28a, gGT29-12-pET28a, gGT29-13-pET28a, gGT29-14-pET28a , gGT29-15-pET28a, gGT29-16-pET28a, gGT29-17-pET28a, gGT29-18-pET28a), and achieve soluble expression in E. coli. According to the identification of the method shown in Example 9, the extension activity of the product and other gGT29 series glycosyltransferase to the first glycosyl group at different positions (C3 or C6) of the substrate is shown in Table 7, wherein "+" indicates The position is active, and "++" indicates a relatively strong activity at this position.
表7Table 7
Figure PCTCN2015081111-appb-000015
Figure PCTCN2015081111-appb-000015
实施例11 通过氨基酸定点突变改变gGT29-7对糖基供体专一性Example 11 Alteration of glycosyl donor specificity by gGT29-7 by site-directed mutagenesis of amino acids
糖基转移酶gGT29-7可以以UDP-葡萄糖为糖基供体,催化人参皂苷Rh1的C6-O-Glc延伸一个葡萄糖生成人参皂苷Rf。通过对对糖基转移酶gGT29-7的糖基供体结合区域(PSPG)进行了定点突变来改变它对糖基供体的专一性。对已有研究报道的能够催化其他糖基受体的鼠李糖基转移酶与gGT29-7进行氨基酸序列比对,筛选并最后选取了gGT29-7PSPG盒上两个位点进行定点突变(N343G和A359P)。The glycosyltransferase gGT29-7 can be a saccharide-based donor with UDP-glucose, and the C6-O-Glc of ginsenoside Rh1 is catalyzed to extend a glucose to produce ginsenoside Rf. The specificity of the glycosyl donor was altered by site-directed mutagenesis of the glycosyl donor binding region (PSPG) of the glycosyltransferase gGT29-7. Amino acid sequence alignment of rhamnosyltransferase, which can be used to catalyze other glycosylation receptors, with gGT29-7, and screening and finally selecting two sites on the gGT29-7PSPG cassette for site-directed mutagenesis (N343G and A359P).
根据常规设计并合成两条引物,以质粒pET28a-gGT29-7为模板,利用Yeasen公司的Hieff MutTM site-directed mutagenesis kit进行gGT29-7的N343位点分别进行定点突变。 The conventional design and synthesis of two primers, plasmid pET28a-gGT29-7 as a template, Yeasen's Hieff Mut TM site-directed mutagenesis kit gGT29-7 sites were N343, respectively site-directed mutagenesis.
PCR扩增程序为:95℃ 30s;95℃ 5s,56℃ 10s,72℃ 1.5min,共30个循环;降至10℃。PCR产物用DpnI酶切,37℃水浴反应2h。再取3μL用试剂盒中的重组酶EIInase进行重组37℃ 30min,转化大肠杆菌TOP 10感受态。挑取单克隆并抽提质粒,测序验证突变位点,所获得的质粒命名为pET28a-gGT29-7-N343G。The PCR amplification procedure was: 95 ° C for 30 s; 95 ° C for 5 s, 56 ° C for 10 s, and 72 ° C for 1.5 min for a total of 30 cycles; to 10 ° C. The PCR product was digested with DpnI and reacted in a water bath at 37 ° C for 2 h. Then, 3 μL of the recombinant enzyme EIInase in the kit was used for reconstitution at 37 ° C for 30 min to transform E. coli TOP 10 competent state. The monoclonal was picked and the plasmid was extracted, and the mutation site was verified by sequencing. The obtained plasmid was named pET28a-gGT29-7-N343G.
将质粒pET28a-gGT29-7-N343G转化到大肠杆菌BL21(DE3),构建重组菌株pET28a-gGT29-7-N343G-BL21,诱导表达的步骤同实施例8。The plasmid pET28a-gGT29-7-N343G was transformed into E. coli BL21 (DE3), and the recombinant strain pET28a-gGT29-7-N343G-BL21 was constructed, and the procedure for inducing expression was the same as in Example 8.
根据常规设计并合成两条引物对gGT29-7的A359位点进行点突变并构建突变质粒pET28a-gGT29-7-A359P和重组菌株gGT29-7-N343G-BL21,步骤同上。诱导表达的步骤同实施例8。以质粒pET28a-gGT29-7-N343G为模板,29-7-A359P-F和29-7-A359P-R为引物对A359进行点突变构建双突变质粒pET28a-gGT29-7-N343G/A359P和重组菌株gGT29-7-N343G/A359P-BL21,步骤同上。诱导表达的步骤同实施例8,转糖基反应和产物的鉴定同实施例9,反应体系如表6所示,但是使用按照1/1的比例混合的UDP-葡萄糖/UDP-鼠李糖代替UDP-葡萄糖作为糖基供体。The A359 locus of gGT29-7 was subjected to point mutation according to conventional design and synthesis of two primers, and the mutant plasmid pET28a-gGT29-7-A359P and the recombinant strain gGT29-7-N343G-BL21 were constructed, and the procedure was the same as above. The procedure for inducing expression was the same as in Example 8. Using the plasmid pET28a-gGT29-7-N343G as a template, 29-7-A359P-F and 29-7-A359P-R as primers to mutate A359 to construct double mutant plasmid pET28a-gGT29-7-N343G/A359P and recombinant strain gGT29-7-N343G/A359P-BL21, the procedure is the same as above. The procedure for inducing expression was the same as in Example 8, the transglycosylation reaction and the identification of the product were the same as in Example 9, and the reaction system was as shown in Table 6, but using UDP-glucose/UDP-rhamnose mixed in a ratio of 1/1. UDP-glucose is used as a glycosyl donor.
结果如图14所示,单突变蛋白gGT29-7-N343G催化Rh1的C6-O-Glc延伸一个葡萄糖生成Rf的酶活力已经基本检测不到,而gGT29-7-A359P仍然保留了催化Rh1的C6-O-Glc延伸一个葡萄糖的酶活力。双突变蛋白gGT29-7-N343G/A359P不仅保留了野生型蛋白的催化Rh1的C6-O-Glc延伸一个葡萄糖生成Rf的酶活力,还获得了催化Rh1的C6-O-Glc延伸一个鼠李糖生成Rg2的酶活力。As a result, as shown in Fig. 14, the single mutant protein gGT29-7-N343G catalyzes the activity of C6-O-Glc of Rh1 to extend a glucose to produce Rf, and gGT29-7-A359P still retains C6 which catalyzes Rh1. -O-Glc extends the enzyme activity of a glucose. The double mutant protein gGT29-7-N343G/A359P not only retains the wild-type protein catalyzes the C6-O-Glc of Rh1 to extend the activity of a glucose-producing Rf, but also obtains a rhamnose that catalyzes the extension of C6-O-Glc of Rh1. The enzyme activity of Rg2 is generated.
实施例12 以UDP-xylose为糖基供体,糖基转移酶大肠杆菌表达产物的转糖基反应和产物的鉴定Example 12 Transglycosylation reaction and product identification of glycosyltransferase E. coli expression product with UDP-xylose as glycosyl donor
以表达gGT29-10和gGT29-14的重组大肠杆菌上清做为酶液来催化人参皂苷Rh2和F2的转糖基反应。100μL反应体系如表6所示,但是用UDP-木糖代替UDP-葡萄糖作为糖基供体。反应在35℃下进行12h,然后加入100μL丁醇终止反应,并抽提产物。产物真空干燥后,用甲醇溶解。The recombinant E. coli supernatant expressing gGT29-10 and gGT29-14 was used as an enzyme solution to catalyze the transglycosylation reaction of ginsenoside Rh2 and F2. The 100 μL reaction system is shown in Table 6, except that UDP-xylose was used instead of UDP-glucose as a glycosyl donor. The reaction was carried out at 35 ° C for 12 h, then 100 μL of butanol was added to terminate the reaction, and the product was extracted. The product was dried under vacuum and dissolved in methanol.
反应产物用薄层层析(TLC)进行检测,gGT29-10和gGT29-14能够在人参皂苷Rh2的C-3糖基延伸一个木糖生成一种新的三萜皂苷,(3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-PPD;gGT29-10和gGT29-14能够将人参皂苷Rg3C-3位的第二个葡萄糖置换为木糖生成一种新的三萜皂苷(3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-PPD);gGT29-10和gGT29-14还能够将人参皂苷Rd C-3位的第二个葡萄糖置换为木糖生成一种新的三萜皂苷(3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-CK)(图15)。The reaction product was detected by thin layer chromatography (TLC). gGT29-10 and gGT29-14 were able to extend a xylose in the C-3 glycosyl group of ginsenoside Rh2 to form a new triterpenoid saponin, (3-O-β). -(D-xylopyranosyl)-β-(D-glucopyranosyl)-PPD; gGT29-10 and gGT29-14 can replace the second glucose of ginsenoside Rg3C-3 with xylose to form a new triterpenoid saponin ( 3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-PPD); gGT29-10 and gGT29-14 can also replace the second glucose of ginsenoside Rd C-3 with xylose A new trisaponin (3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-CK) (Fig. 15).
其它糖基转移酶gGT29,gGT29-4,gGT29-5,gGT29-6和,gGT29-7gGT29-8,gGT29-9,gGT29-10,gGT29-11,gGT29-12,gGT29-13,gGT29-15,gGT29-16,gGT29-17和gGT29-18,以UDP-木糖为糖基供体,催化Rh2,Rg3和Rd的活性如表8所示。Other glycosyltransferases gGT29, gGT29-4, gGT29-5, gGT29-6 and gGT29-7gGT29-8, gGT29-9, gGT29-10, gGT29-11, gGT29-12, gGT29-13, gGT29-15, gGT29-16, gGT29-17 and gGT29-18, with UDP-xylose as a glycosyl donor, catalyze the activities of Rh2, Rg3 and Rd as shown in Table 8.
表8Table 8
Figure PCTCN2015081111-appb-000016
Figure PCTCN2015081111-appb-000016
Figure PCTCN2015081111-appb-000017
Figure PCTCN2015081111-appb-000017
实施例13 产Rf酵母工程菌的构建与产物鉴定Example 13 Construction and Product Identification of Rf Yeast Engineering Bacteria
在pESC-HIS质粒((Stratagene,Agilent)上,同时组装达玛烯二醇合成酶(Dammarenediol synthase)(ACZ71036.1)(GAL1/GAL10GAL10侧启动子,ADH1终止子)、细胞色素P450 CYP716A47(AEY75213.1)(FBA1启动子,CYC1终止子)、细胞色素P450 CYP716A53V2基因(ENO2启动子,CYC1终止子)以及糖基转移酶基因UGTPg100(KP795113)(GAL1/GAL10GAL1侧启动子,TDH2终止子)和糖基转移酶gGT29-7(TEF1启动子,FBA1终止子),构成游离型质粒,转化酿酒酵母BY4742,并将拟南芥来源的细胞色素P450还原酶ATR2-1(NP_849472.2)整合于酿酒酵母BY4742染色体中染色体trp1基因位点(GAL1启动子,利用trp1原有终止子),构建了重组酵母A5。重组酵母菌需补加的相应氨基酸(0.01%色氨酸,0.01%亮氨酸,0.01%赖氨酸)。Simultaneous assembly of Dammarenediol synthase (ACZ71036.1) (GAL1/GAL10GAL10 side promoter, ADH1 terminator) and cytochrome P450 CYP716A47 (AEY75213) on pESC-HIS plasmid (Stratagene, Agilent) .1) (FBA1 promoter, CYC1 terminator), cytochrome P450 CYP716A53V2 gene (ENO2 promoter, CYC1 terminator) and glycosyltransferase gene UGTPg100 (KP795113) (GAL1/GAL10GAL1 side promoter, TDH2 terminator) and Glycosyltransferase gGT29-7 (TEF1 promoter, FBA1 terminator), constitutes an episomal plasmid, transforms S. cerevisiae BY4742, and integrates Arabidopsis-derived cytochrome P450 reductase ATR2-1 (NP_849472.2) into winemaking Recombinant yeast A5 was constructed by chromosomal trp1 gene locus in the yeast BY4742 chromosome (GAL1 promoter, using trp1 original terminator). Recombinant yeast requires additional amino acids (0.01% tryptophan, 0.01% leucine, 0.01% lysine).
将重组酵母A7裂解液转移到2mL EP管中,每个管装1mL,加入等体积(1mL)的正丁醇抽提约30min后12000g离心10min。吸取上清至一新的EP管中。45℃并真空条件下使正丁醇蒸干。用100μL甲醇溶解后用于HPLC检测。The recombinant yeast A7 lysate was transferred to a 2 mL EP tube, 1 mL per tube, and an equal volume (1 mL) of n-butanol was added for about 30 min and then centrifuged at 12000 g for 10 min. Pipette the supernatant into a new EP tube. The n-butanol was evaporated to dryness at 45 ° C under vacuum. It was dissolved in 100 μL of methanol and used for HPLC detection.
通过HPLC分析,重组酵母A7的细胞裂解液中含有原人参三醇(PPT)及人参皂苷活性代谢物Rh1和Rf。The cell lysate of recombinant yeast A7 contained protopanaxatriol (PPT) and ginsenoside active metabolites Rh1 and Rf by HPLC analysis.
实施例14 产Rg2酵母工程菌的构建与产物鉴定Example 14 Construction and Product Identification of Rg2 Yeast Engineering Bacteria
在pESC-HIS质粒((Stratagene,Agilent)上,同时组装达玛烯二醇合成酶(Dammarenediol synthase)(ACZ71036.1)(GAL1/GAL10GAL10侧启动子,ADH1终止子)、细胞色素P450 CYP716A47(AEY75213.1)(FBA1启动子,CYC1终止子)、细胞色素P450 CYP716A53V2基因(ENO2启动子,CYC1终止子)以及两个糖基转移酶基因UGTPg100(KP795113)(GAL1/GAL10GAL1侧启动子,TDH2终止子)和gGT29-7-N343G/A359P(TEF1启动子,FBA1终止子),构成游离型质粒,转化酿酒酵母BY4742,并将拟南芥来源的细胞色素P450还原酶ATR2-1(NP_849472.2)(GAL1启动子,利用trp1原有终止子)和UDP-L-鼠李糖合成酶RHM2(GQ292791.1)(GAL10启动子,FBA1终止子)整合于酿酒酵母BY4742染 色体中染色体trp1基因位点,构建了重组酵母A8。重组酵母菌需补加的相应氨基酸(0.01%色氨酸,0.01%亮氨酸,0.01%赖氨酸)。Simultaneous assembly of Dammarenediol synthase (ACZ71036.1) (GAL1/GAL10GAL10 side promoter, ADH1 terminator) and cytochrome P450 CYP716A47 (AEY75213) on pESC-HIS plasmid (Stratagene, Agilent) .1) (FBA1 promoter, CYC1 terminator), cytochrome P450 CYP716A53V2 gene (ENO2 promoter, CYC1 terminator) and two glycosyltransferase genes UGTPg100 (KP795113) (GAL1/GAL10GAL1 side promoter, TDH2 terminator) And gGT29-7-N343G/A359P (TEF1 promoter, FBA1 terminator), constitute an episomal plasmid, transform S. cerevisiae BY4742, and Arabidopsis-derived cytochrome P450 reductase ATR2-1 (NP_849472.2) GAL1 promoter, using trp1 original terminator) and UDP-L-rhamnose synthase RHM2 (GQ292791.1) (GAL10 promoter, FBA1 terminator) integrated into Saccharomyces cerevisiae BY4742 Recombinant yeast A8 was constructed by chromosomal trp1 gene locus in the chromosome. Recombinant yeast requires additional amino acids (0.01% tryptophan, 0.01% leucine, 0.01% lysine).
将重组酵母A8裂解液转移到2mL EP管中,每个管装1mL,加入等体积(1mL)的正丁醇抽提约30min后12000g离心10min。吸取上清至一新的EP管中。45℃并真空条件下使正丁醇蒸干。用100μL甲醇溶解后用于HPLC检测。通过HPLC分析,重组酵母A8的细胞裂解液中含有原人参三醇(PPT)及人参皂苷活性代谢物Rh1和Rg2。The recombinant yeast A8 lysate was transferred to a 2 mL EP tube, 1 mL each tube, and an equal volume (1 mL) of n-butanol was added for about 30 min and then centrifuged at 12000 g for 10 min. Pipette the supernatant into a new EP tube. The n-butanol was evaporated to dryness at 45 ° C under vacuum. It was dissolved in 100 μL of methanol and used for HPLC detection. The cell lysate of recombinant yeast A8 contained protopanaxatriol (PPT) and ginsenoside active metabolites Rh1 and Rg2 by HPLC analysis.
在本发明提及的所有文献都在本申请中引用作为参考,就如同每一篇文献被单独引用作为参考那样。此外应理解,在阅读了本发明的上述讲授内容之后,本领域技术人员可以对本发明作各种改动或修改,这些等价形式同样落于本申请所附权利要求书所限定的范围。 All documents mentioned in the present application are hereby incorporated by reference in their entirety in their entireties in the the the the the the the the In addition, it should be understood that various modifications and changes may be made by those skilled in the art in the form of the appended claims.
Figure PCTCN2015081111-appb-000018
Figure PCTCN2015081111-appb-000018

Claims (9)

  1. (C)如SEQ ID NO.:60所示的核苷酸序列;(C) a nucleotide sequence as shown in SEQ ID NO.: 60;
    (D)与SEQ ID NO.:60所示序列的同源性≥90%(较佳地≥91%、92%、93%、94%、95%、96%、97%、98%或99%)的核苷酸序列;(D) homology to the sequence set forth in SEQ ID NO.: 60 ≥ 90% (preferably ≥ 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99) %) nucleotide sequence;
    (E)在SEQ ID NO.:60所示核苷酸序列的5’端和/或3’端截短或添加1-60个(较佳地1-30,更佳地1-10个)核苷酸所形成的核苷酸序列;(E) truncating or adding 1-60 (preferably 1-30, more preferably 1-10) at the 5' end and/or the 3' end of the nucleotide sequence shown by SEQ ID NO. a nucleotide sequence formed by a nucleotide;
    (F)与(A)-(E)任一所述的核苷酸序列互补的核苷酸序列。(F) a nucleotide sequence complementary to the nucleotide sequence of any of (A) to (E).
  2. 一种载体,其特征在于,所述的载体含有权利要求4所述的多核苷酸。A vector comprising the polynucleotide of claim 4.
  3. 权利要求3所述分离的多肽的用途,其特征在于,它被用于催化以下一种或多种体外反应,或被用于制备催化以下一种或多种反应的催化制剂:Use of the isolated polypeptide of claim 3, characterized in that it is used to catalyze one or more of the following in vitro reactions or to prepare a catalytic preparation that catalyzes one or more of the following reactions:
    (i)将来自糖基供体的糖基转移到四环三萜类化合物的C-3位的第一个糖基上,延伸糖链;(i) transferring a glycosyl group derived from a glycosyl donor to a first glycosyl group at the C-3 position of the tetracyclic triterpenoid, extending the sugar chain;
    (ii)将来自糖基供体的糖基转移到四环三萜类化合物的C-6位的第一个糖基上,延伸糖链;(ii) transferring a glycosyl group derived from a glycosyl donor to a first glycosyl group at the C-6 position of the tetracyclic triterpenoid, extending the sugar chain;
    (iii)将来之糖基供体的糖基与四环三萜类化合物的C-6位糖链的末端糖基进行置换,从在C-3和/或C-6位的第一个糖基上延伸糖链。(iii) replacement of the glycosyl group of the glycosyl donor in the future with the terminal glycosyl group of the C-6 sugar chain of the tetracyclic triterpenoid, from the first sugar at the C-3 and/or C-6 position The sugar chain is extended on the base.
  4. 如权利要求6所述的用途,其特征在于,所述分离的多肽用于催化下述一种或多种反应或被用于制备催化下述一种或多种反应的催化制剂:The use according to claim 6, wherein the isolated polypeptide is used to catalyze one or more of the following reactions or to prepare a catalytic preparation that catalyzes one or more of the following reactions:
    (A)(A)
    Figure PCTCN2015081111-appb-100001
    Figure PCTCN2015081111-appb-100001
    其中,R1为糖基;R2和R3为OH或者H;R4为糖基或者H;R5为糖基,R5-R1-O为C3第一个糖基衍生的糖基,所述的多肽选自SEQ ID NO.:61所示的多肽或其衍生多肽,优选地,选自SEQ ID NO.:26、28、55、57、59、72、74、76、78、80、82、84、86、88、90、92、93、94或95所示的多肽;Wherein R1 is a glycosyl group; R2 and R3 are OH or H; R4 is a glycosyl group or H; R5 is a glycosyl group, and R5-R1-O is a C3 first glycosyl group-derived glycosyl group, said polypeptide being selected from the group consisting of The polypeptide of SEQ ID NO.: 61 or a polypeptide derived therefrom, preferably selected from the group consisting of SEQ ID NO.: 26, 28, 55, 57, 59, 72, 74, 76, 78, 80, 82, 84, 86 a polypeptide as shown in 88, 90, 92, 93, 94 or 95;
    (B) (B)
    Figure PCTCN2015081111-appb-100002
    Figure PCTCN2015081111-appb-100002
    其中,R1和R2为H或者糖基,R3和R4为糖基。所述的多肽选自SEQ ID NO.:61或其衍生多肽,优选地,选自55,57,59,78,82,92,94或者95或其衍生多肽;Wherein R1 and R2 are H or a glycosyl group, and R3 and R4 are a glycosyl group. The polypeptide is selected from the group consisting of SEQ ID NO.: 61 or a polypeptide derived therefrom, preferably selected from 55, 57, 59, 78, 82, 92, 94 or 95 or a polypeptide derived therefrom;
    (C)(C)
    Figure PCTCN2015081111-appb-100003
    Figure PCTCN2015081111-appb-100003
    其中,R1为糖基;R2和R3为OH或者H;R4为糖基或者H;R5为糖基,R5-R1-O为C3第一个糖基衍生的糖基;R6为糖基,R6-R1-O为C3第一个糖基衍生的糖基,所述的多肽选自SEQ ID NO.:61所示多肽或其衍生多肽,优选为SEQ ID NO.:26、28、59、76、84、86或88所示的多肽。Wherein R1 is a glycosyl group; R2 and R3 are OH or H; R4 is a glycosyl group or H; R5 is a glycosyl group, R5-R1-O is a glycosyl group derived from the first glycosyl group of C3; R6 is a glycosyl group, R6 -R1-O is the first glycosyl-derived glycosyl group of C3, said polypeptide being selected from the polypeptide of SEQ ID NO.: 61 or a polypeptide derived therefrom, preferably SEQ ID NO.: 26, 28, 59, 76 a polypeptide as shown in 84, 86 or 88.
  5. 一种进行糖基催化反应的方法,其特征在于,包括步骤:在权利要求3所述的多肽或其衍生多肽存在的条件下,进行糖基催化反应。A method for performing a glycosyl-catalyzed reaction, comprising the steps of performing a glycosyl-catalyzed reaction in the presence of the polypeptide of claim 3 or a polypeptide derived therefrom.
  6. 如权利要求8所述的方法,其特征在于,所述糖基催化反应的底物为式(I)、(III)、或(V)化合物,且所述的产物为(II)、(IV)、或(VI)化合物;较佳地,The method according to claim 8, wherein the substrate of the glycosyl catalyzed reaction is a compound of the formula (I), (III), or (V), and the product is (II), (IV) Or a compound of (VI); preferably,
    或,所述的式(I)化合物为人参皂苷Rh2,并且式(II)化合物为人参皂苷Rg3;Or, the compound of formula (I) is ginsenoside Rh2, and the compound of formula (II) is ginsenoside Rg3;
    或,所述的式(I)化合物为人参皂苷Rh2,并且式(II)化合物为3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-PPD;或,所述的式(I)化合物为人参皂苷F2,并且式(II)化合物为人参皂苷Rd;Or, the compound of the formula (I) is ginsenoside Rh2, and the compound of the formula (II) is 3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-PPD; or (I) the compound is ginsenoside F2, and the compound of formula (II) is ginsenoside Rd;
    或,所述的式(I)化合物为人参皂苷F2,并且式(II)化合物为 3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-CK;Or, the compound of formula (I) is ginsenoside F2, and the compound of formula (II) is 3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-CK;
    或,所述的式(III)化合物为人参皂苷Rh1,并且式(IV)化合物为人参皂苷Rf;Or the compound of the formula (III) is ginsenoside Rh1, and the compound of the formula (IV) is ginsenoside Rf;
    或,所述的式(III)化合物为人参皂苷Rh1,并且式(IV)化合物为人参皂苷Rg2;Or, the compound of formula (III) is ginsenoside Rh1, and the compound of formula (IV) is ginsenoside Rg2;
    或,所述的式(V)化合物为人参皂苷Rg3,并且式(VI)化合物为3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-PPD;Or, the compound of the formula (V) is ginsenoside Rg3, and the compound of the formula (VI) is 3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-PPD;
    或,所述的式(V)化合物为人参皂苷Rd,并且式(IV)化合物为3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-CK。Alternatively, the compound of formula (V) is ginsenoside Rd, and the compound of formula (IV) is 3-O-β-(D-xylopyranosyl)-β-(D-glucopyranosyl)-CK.
  7. 一种遗传工程化的宿主细胞,其特征在于,所述的宿主细胞含有权利要求5所述的载体,或其基因组中整合有权利要求4所述的多核苷酸。A genetically engineered host cell, comprising the vector of claim 5, or the polynucleotide of claim 4 integrated in the genome.
  8. 权利要求10所述的宿主细胞的用途,其特征在于,用于制备酶催化试剂,或生产糖基转移酶、或作为催化细胞、或产生糖基化的四环三萜类化合物。Use of the host cell according to claim 10, characterized in that it is used for the preparation of an enzyme catalytic reagent, or for the production of a glycosyltransferase, or as a catalytic cell, or a glycosylated tetracyclic triterpenoid.
  9. 一种产生转基因植物的方法,其特征在于,包括步骤:将权利要求10所述的遗传工程化的宿主细胞再生为植物,并且所述的遗传工程化的宿主细胞为植物细胞。 A method of producing a transgenic plant, comprising the steps of: regenerating the genetically engineered host cell of claim 10 into a plant, and said genetically engineered host cell is a plant cell.
PCT/CN2015/081111 2014-06-09 2015-06-09 Group of glycosyltransferases and use thereof WO2015188742A2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201410254021.6 2014-06-09
CN201410254021 2014-06-09

Publications (2)

Publication Number Publication Date
WO2015188742A2 true WO2015188742A2 (en) 2015-12-17
WO2015188742A3 WO2015188742A3 (en) 2016-01-28

Family

ID=54834503

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/081111 WO2015188742A2 (en) 2014-06-09 2015-06-09 Group of glycosyltransferases and use thereof

Country Status (2)

Country Link
CN (3) CN115094046A (en)
WO (1) WO2015188742A2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111892639A (en) * 2020-07-10 2020-11-06 山东省药学科学院 Novel cycloartane-type saponin compound in camptosorus sibiricus, preparation method and application thereof

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106544310B (en) * 2016-09-27 2019-07-16 南开大学 The engineering bacteria and its construction method of a kind of gynostemma pentaphylla glycosyl transferase and application
CN106701647B (en) * 2016-09-27 2019-12-24 南开大学 Application of gynostemma pentaphylla glycosyltransferase in synthesizing rare ginsenoside
CN108728423B (en) * 2017-04-13 2021-12-07 中国医学科学院药物研究所 Bacillus subtilis glycosyltransferase and application thereof
CN108866020A (en) * 2017-05-16 2018-11-23 中国科学院上海生命科学研究院 Glycosyl transferase, mutant and its application
CN108949711A (en) 2017-05-19 2018-12-07 中国科学院上海生命科学研究院 The UDP- glycosyl transferase and its application that one group of catalysis sugar chain extends
CN107868115B (en) * 2017-11-17 2021-02-26 中国科学院天津工业生物技术研究所 Non-natural ginsenoside Rd12, and preparation method and application thereof
CN110438099B (en) * 2018-05-04 2022-04-15 中国科学院天津工业生物技术研究所 Application of glycosyltransferase and related materials thereof in construction of engineering bacteria for producing ginsenosides Rb1 and Rg1
CN115838754A (en) * 2018-12-27 2023-03-24 中国医学科学院药物研究所 Recombinant bacterium for producing dammarenediol-II glucoside and application thereof
CN112831481B (en) * 2019-11-22 2024-01-19 生合万物(上海)生物科技有限公司 Glycosyltransferase and method for catalyzing sugar chain extension
CN110872594B (en) * 2019-12-09 2021-04-06 四川农业大学 Blackberry glycosyltransferase gene and application thereof
CN114807075B (en) * 2021-01-27 2023-11-21 东北林业大学 Glycosyltransferase PpUGT73E5 and application thereof in paris polyphylla saponin synthesis
CN115678952A (en) * 2021-07-30 2023-02-03 生合万物(苏州)生物科技有限公司 Highly specific glycosyltransferase for rhamnose and application thereof

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
IL128193A (en) * 1999-01-22 2008-03-20 Yeda Res & Dev Rhamnosyl-transferase gene and uses thereof
KR101479608B1 (en) * 2012-09-27 2015-01-06 한국과학기술원 Novel UDP-glycosyltransferase derived from ginseng and use thereof
KR101479615B1 (en) * 2012-09-27 2015-01-06 한국과학기술원 Novel UDP-glycosyltransferase derived from ginseng and use thereof
KR20190126434A (en) * 2012-12-06 2019-11-11 상하이 인스티튜츠 포 바이올로지컬 사이언시스, 차이니즈 아카데미 오브 사이언시스 Group of glycosyltransferases and use thereof
CN104232723B (en) * 2013-06-07 2021-03-26 中国科学院分子植物科学卓越创新中心 Group of glycosyltransferases and application thereof

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111892639A (en) * 2020-07-10 2020-11-06 山东省药学科学院 Novel cycloartane-type saponin compound in camptosorus sibiricus, preparation method and application thereof
CN111892639B (en) * 2020-07-10 2022-06-28 山东省药学科学院 Novel cycloartane type saponin compound in camptosorus sibiricus, preparation method and application thereof

Also Published As

Publication number Publication date
CN115094046A (en) 2022-09-23
CN115341008A (en) 2022-11-15
WO2015188742A3 (en) 2016-01-28
CN105177100A (en) 2015-12-23

Similar Documents

Publication Publication Date Title
WO2015188742A2 (en) Group of glycosyltransferases and use thereof
JP6479084B2 (en) A series of glycosyltransferases and their applications
US20230203458A1 (en) Group of udp-glycosyltransferase for catalyzing carbohydrate chain elongation and application thereof
CN104232723B (en) Group of glycosyltransferases and application thereof
CN105087739B (en) A kind of new catalyst system and its application for preparing rare ginsenoside
JP7086107B2 (en) Glycosyltransferases, mutants and their use
WO2016119756A1 (en) Mutant protein of glycosyltransferase and uses thereof
CN109796516B (en) A method for synthesizing natural and unnatural protopanaxatriol type ginsenoside
WO2023006109A1 (en) Highly specific glycosyltransferase for rhamnose, and use thereof
CN112831481B (en) Glycosyltransferase and method for catalyzing sugar chain extension
CN113444703B (en) Glycosyltransferase mutant for catalyzing sugar chain extension and application thereof
CN111926000B (en) Gynostemma pentaphylla glycosyltransferase and application thereof

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15805949

Country of ref document: EP

Kind code of ref document: A2

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 28.03.2017)

122 Ep: pct application non-entry in european phase

Ref document number: 15805949

Country of ref document: EP

Kind code of ref document: A2