WO2021135796A1 - D-木酮糖4-差向异构酶、其突变体及其用途 - Google Patents

D-木酮糖4-差向异构酶、其突变体及其用途 Download PDF

Info

Publication number
WO2021135796A1
WO2021135796A1 PCT/CN2020/133395 CN2020133395W WO2021135796A1 WO 2021135796 A1 WO2021135796 A1 WO 2021135796A1 CN 2020133395 W CN2020133395 W CN 2020133395W WO 2021135796 A1 WO2021135796 A1 WO 2021135796A1
Authority
WO
WIPO (PCT)
Prior art keywords
polypeptide
seq
positions
amino acid
xylulose
Prior art date
Application number
PCT/CN2020/133395
Other languages
English (en)
French (fr)
Inventor
张以恒
周伟
Original Assignee
中国科学院天津工业生物技术研究所
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中国科学院天津工业生物技术研究所 filed Critical 中国科学院天津工业生物技术研究所
Priority to BR112022012915A priority Critical patent/BR112022012915A2/pt
Priority to EP20909683.3A priority patent/EP4083199A4/en
Priority to JP2022540978A priority patent/JP2023509176A/ja
Priority to KR1020227025377A priority patent/KR20220119128A/ko
Priority to MX2022008199A priority patent/MX2022008199A/es
Priority to US17/790,417 priority patent/US20230220433A1/en
Priority to CA3163520A priority patent/CA3163520A1/en
Publication of WO2021135796A1 publication Critical patent/WO2021135796A1/zh

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/90Isomerases (5.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P19/00Preparation of compounds containing saccharide radicals
    • C12P19/02Monosaccharides
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P19/00Preparation of compounds containing saccharide radicals
    • C12P19/24Preparation of compounds containing saccharide radicals produced by the action of an isomerase, e.g. fructose
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y501/00Racemaces and epimerases (5.1)
    • C12Y501/03Racemaces and epimerases (5.1) acting on carbohydrates and derivatives (5.1.3)
    • C12Y501/03004L-Ribulose-5-phosphate 4-epimerase (5.1.3.4)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y501/00Racemaces and epimerases (5.1)
    • C12Y501/03Racemaces and epimerases (5.1) acting on carbohydrates and derivatives (5.1.3)
    • C12Y501/03005UDP-arabinose 4-epimerase (5.1.3.5)

Definitions

  • the present disclosure belongs to the field of biocatalysis and synthetic biology.
  • the present disclosure relates to novel polypeptides (enzymes) capable of reversibly catalyzing the mutual conversion between D-xylulose and L-ribulose and their mutants, named D-xylulose 4-epimerase (Xu4E); And by constructing multiple artificial enzyme pathways, using this enzyme to produce L-pentose (ie L-arabinose, L-ribose, L-ribulose, L-xylone) from the most abundant pentose D-xylose in nature Sugar, L-xylose and L-lyxose) new methods.
  • L-pentose ie L-arabinose, L-ribose, L-ribulose, L-xylone
  • Pentose (or pentose monosaccharide), with the chemical formula C 5 H 10 O 5 , is a monosaccharide with five carbon atoms. They can be divided into two categories: aldose and ketulose; there are eight types of aldose and four types of ketopentose, and each ketopentose corresponds to two types of aldose. The twelve types of pentoses can also be divided into L-type sugars and D-type sugars, and each type of sugar has four aldoses and two ketopentoses. D-xylose, D-ribose and L-arabinose are natural sugars, but other pentose sugars are rare sugars that exist in very small amounts in nature. D-xylose is the most abundant pentose in nature. L-pentose has attracted a lot of attention due to its medical and health applications, especially its great potential as a precursor of many important medicines.
  • D-xylose is the most abundant pentose in nature. It can be separated from lignocellulose and is called wood sugar. D-xylose is the main component of hemicellulose xylan. Now D-xylose is mainly separated from the acidic or alkaline hydrolysates of corn cob and beet pulp, and most of the xylose is used to convert xylitol into a zero-calorie sweetener.
  • L-arabinose is a zero-calorie natural sweetener with 50% sweetness of sucrose approved by the US Food and Drug Administration (FDA) (Antila et al. 1997, Boku et al. 2001). More importantly, adding 3-4% L-arabinose to sucrose can inhibit the activity of invertase, prevent sucrose hydrolysis and prevent sucrose absorption, so L-arabinose is considered a sucrose neutralizer, according to calories Ingestion (Morimoto et al. 2001). At the same time, unused sugar in the intestine is a prebiotic that promotes the growth of beneficial bacteria, thereby inhibiting the growth of harmful microorganisms in the large intestine. L-arabinose is also used as a starting material for synthetic drugs and a biochemical product widely used in molecular biology experiments and industrial fermentation.
  • L-arabinose exists in the hemicellulose of higher plants in the form of arabinan, arabinoxylan, arabinogalactan, etc.
  • L-arabinose is obtained by extracting hemicellulose from corn fiber (Boku et al. 2001), gum arabic, and sugar beet pulp (Antila et al. 1997) by alkaline method and then performing acid hydrolysis.
  • L-arabinose is a by-product of the acid hydrolysis of corncobs to produce D-xylose.
  • the high price and limited supply of L-arabinose greatly limit its wide application.
  • L-ribose does not exist widely in nature. It is the precursor of many new nucleotide analogues, used to produce antiviral drugs, such as drugs against human immunodeficiency virus, hepatitis virus and cytomegalovirus (Kim et al. 2014). L-ribose can also be used as a competitive inhibitor of glucose dehydrogenase (Beerens et al. 2012). Previously, the production method of L-ribose was a two-step microbial transformation using ribitol as an intermediate product.
  • L-ribose has been catalyzed by a two-step enzyme reaction: L-arabinose isomerase (L-AI) and L-ribose isomerase (L-RI, EC 5.3.1.B3) or mannose 6-phosphate isomerase (MPI, EC 5.3.1.8) converts L-ribulose to L-ribose (Kim et al. 2014).
  • L-AI L-arabinose isomerase
  • L-RI L-ribose isomerase
  • MPI mannose 6-phosphate isomerase
  • L-ribulose is the starting material for the synthesis of L-ribose and L-arabinose. Its 5'-phosphate product L-ribulose 5-phosphate is an important metabolite of the pentose phosphate pathway.
  • L-xylulose can be used as an inhibitor of ⁇ -glucosidase and can be used to lower blood sugar.
  • L-xylulose can also be used to produce other important rare sugars, such as L-ribose for the production of antiviral drugs and L-xylose as an indicator of hepatitis or liver cirrhosis.
  • L-lyxose is a component of averamycin A, an antibiotic used in animals, and is a potential L-fucosidase inhibitor.
  • L-xylose is the starting material for the synthesis of anti-hepatitis B virus (HBV) nucleosides and the synthesis of L-ribose furanose and derivatives.
  • HBV anti-hepatitis B virus
  • Epimerase is a type of isomerase that catalyzes the configuration change of an asymmetric carbon atom in a substrate containing multiple asymmetric centers.
  • People have been looking for pentylose 4-epimerase for a long time, but it has never been reported (Beerens et al. 2017).
  • 4-epimerase in nature such as L-ribulose 5-phosphate 4-epimerase (EC 5.1.3.4) and UDP-D-xylose 4-epimerase (EC 5.1. .3.5)
  • the pentose substrate is required to have a phosphoric acid or uridine diphosphate (UDP) group.
  • the present disclosure provides a wild-type polypeptide (enzyme) and mutants thereof, which have the chemical reaction ability to catalyze the mutual conversion of D-xylulose and L-ribulose ( Figure 2).
  • the wild-type D-xylulose 4-epimerase (Xu4E) and its mutants in the present disclosure can use the most abundant pentose in nature, namely D-xylose, as a raw material, Production of L-pentose.
  • the exemplary L-pentose is selected from six types of L-pentose, namely L-arabinose, L-ribose, L-ribulose, L-xylulose, L-xylose and L-lyxose.
  • the present disclosure provides a method for preparing the aforementioned Xu4E mutant, which uses molecular biology and genetic engineering methods to prepare the Xu4E mutant.
  • the present disclosure provides the use of the aforementioned wild-type Xu4E and its mutants, which can be used to produce L-pentose.
  • the present disclosure provides a new method for producing L-pentose.
  • the method includes a method for producing L-pentose using D-xylose or D-xylulose as a raw material.
  • the present disclosure provides a new method for producing L-pentose, the method comprising the step of converting D-xylulose into L-ribulose using Xu4E or a mutant thereof .
  • the present disclosure provides a method for further optimizing the aforementioned method for producing L-pentose.
  • polypeptide has D-xylulose 4-epimerase activity, wherein the polypeptide is selected from any one of the group consisting of (a)-(d):
  • a polypeptide which is a mutant of the polypeptide shown in any one of SEQ ID NO: 2-32, and the mutant includes substitution, duplication, deletion, or addition of one or more at one or more positions. Amino acids, and the polypeptide still has D-xylulose 4-epimerase activity; and
  • polypeptide according to (1) wherein the polypeptide is a mutant, and compared with the polypeptide shown in any one of SEQ ID NO: 2-32, the polypeptide has at least 95%, at least 96% %, at least 97%, at least 98%, or at least 99% sequence identity.
  • polypeptide according to any one of (1) to (2) wherein the polypeptide is a mutant of the polypeptide shown in any one of SEQ ID NO: 2-32, and the mutant is at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9 positions containing amino acid mutations, and the polypeptide still has D- Xylulose 4-epimerase activity.
  • one or more amino acids in the amino acid sequence of the polypeptide contain mutations at positions corresponding to the following positions, wherein the positions are selected from 102, One or more of the group consisting of 125, 131, 161, 163, 266, 267, 297, 306, 318, 337, 394, 402 and 403;
  • one or more amino acids in the amino acid sequence of the polypeptide contain mutations at positions corresponding to the following positions, wherein the positions are selected from 102, 125 One or more of the group consisting of, 131, 161, 163, 266, 267, 297, 306, 318, 337, 394, 402 and 403;
  • one or more amino acids in the amino acid sequence of the polypeptide contain mutations at positions corresponding to the following positions, wherein the positions are selected from 102, 125 One or more of the group consisting of, 131, 161, 163, 266, 267, 297, 306, 318, 337, 394, 402 and 403;
  • one or more amino acids in the amino acid sequence of the polypeptide contain mutations at positions corresponding to the following positions, wherein the positions are selected from 105, 128 One or more of the group consisting of, 134, 164, 166, 270, 271, 301, 310, 322, 341, 398, 406, 407;
  • one or more amino acids in the amino acid sequence of the polypeptide contain mutations at positions corresponding to the following positions, wherein the positions are selected from 105, 128 One or more of the group consisting of, 134, 164, 166, 269, 270, 300, 309, 321, 340, 397, 405, 406;
  • one or more amino acids in the amino acid sequence of the polypeptide contain mutations at positions corresponding to the following positions, wherein the positions are selected from 117, 140 One or more of the group consisting of, 146, 176, 178, 285, 286, 316, 325, 337, 355, 412, 420, 421;
  • one or more amino acids in the amino acid sequence of the polypeptide contain mutations at positions corresponding to the following positions, wherein the positions are selected from 125, 148 One or more of the group consisting of, 154, 184, 186, 293, 294, 324, 333, 345, 363, 420, 428, 429;
  • one or more amino acids in the amino acid sequence of the polypeptide contain mutations at positions corresponding to the following positions, wherein the positions are selected from 124, 147 One or more of the group consisting of, 153, 183, 185, 297, 298, 328, 337, 349, 368, 425, 433, 434;
  • one or more amino acids in the amino acid sequence of the polypeptide contain mutations at positions corresponding to the following positions, wherein the positions are selected from 108, 131 One or more of the group consisting of, 137, 167, 169, 276, 277, 307, 316, 328, 346, 403, 411, 412;
  • one or more amino acids in the amino acid sequence of the polypeptide contain mutations at positions corresponding to the following positions, wherein the positions are selected from 115, 138 One or more of the group consisting of, 144, 174, 176, 280, 281, 311, 320, 332, 351, 408, 416, 417;
  • one or more amino acids in the amino acid sequence of the polypeptide contain mutations at positions corresponding to the following positions, wherein the positions are selected from 107, 130 One or more of the group consisting of, 136, 166, 168, 272, 273, 303, 312, 324, 343, 400, 408, 409;
  • one or more amino acids in the amino acid sequence of the polypeptide contain mutations at positions corresponding to the following positions, wherein the positions are selected from 109, 132 One or more of the group consisting of, 138, 168, 170, 275, 276, 306, 315, 327, 346, 403, 411, 412;
  • one or more amino acids in the amino acid sequence of the polypeptide contain mutations at positions corresponding to the following positions, wherein the positions are selected from 103, 126 One or more of the group consisting of, 132, 162, 164, 267, 268, 298, 307, 319, 338, 395, 403, 404;
  • one or more amino acids in the amino acid sequence of the polypeptide contain mutations at positions corresponding to the following positions, wherein the positions are selected from 105, 128 One or more of the group consisting of, 134, 164, 166, 271, 272, 302, 311, 323, 342, 399, 407, 408;
  • one or more amino acids in the amino acid sequence of the polypeptide contain mutations at positions corresponding to the following positions, wherein the positions are selected from 64, 88 One or more of the group consisting of, 94, 123, 125, 236, 237, 267, 274, 286, 373, 381, 382;
  • one or more amino acids in the amino acid sequence of the polypeptide contain mutations at positions corresponding to the following positions, wherein the positions are selected from 110, 133 One or more of the group consisting of, 139, 169, 171, 271, 272, 302, 311, 323, 342, 399, 407, 408;
  • one or more amino acids in the amino acid sequence of the polypeptide contain mutations at positions corresponding to the following positions, wherein the positions are selected from 102, 125 One or more of the group consisting of, 131, 161, 163, 266, 267, 297, 306, 318, 337, 394, 402, 403;
  • one or more amino acids in the amino acid sequence of the polypeptide contain mutations at positions corresponding to the following positions, wherein the positions are selected from 121, 144 One or more of the group consisting of, 150, 180, 182, 289, 290, 320, 329, 341, 359, 416, 424, 425;
  • one or more amino acids in the amino acid sequence of the polypeptide contain mutations at positions corresponding to the following positions, wherein the positions are selected from 107, 130 One or more of the group consisting of, 136, 166, 168, 273, 274, 304, 313, 325, 344, 401, 409, 410;
  • one or more amino acids in the amino acid sequence of the polypeptide contain mutations at positions corresponding to the following positions, wherein the positions are selected from 21, 48 One or more of the group consisting of, 54, 84, 86, 182, 183, 213, 222, 234, 260, 324, 332, 333;
  • one or more amino acids in the amino acid sequence of the polypeptide contain mutations at positions corresponding to the following positions, wherein the positions are selected from 30, 55 One or more of the group consisting of, 61, 91, 93, 202, 203, 233, 242, 254, 273, 330, 338, 339.
  • polypeptide a polypeptide corresponding to SEQ ID NO: 2 with mutations as shown below:
  • polypeptide according to (5) wherein the polypeptide is a mutated polypeptide at any of the following positions corresponding to SEQ ID NO: 2:
  • amino acid corresponding to position 131 of SEQ ID NO: 2 is mutated from arginine (R) to aspartic acid (D), threonine (T), glutamic acid (E) or serine ( S);
  • amino acid at position 161 corresponding to SEQ ID NO: 2 is mutated from aspartic acid (D) to alanine (A);
  • amino acid corresponding to position 163 of SEQ ID NO: 2 is mutated from valine (V) to lysine (K), arginine (R), serine (S), isoleucine ( I) or methionine (M);
  • amino acid at position 267 corresponding to SEQ ID NO: 2 is mutated from valine (V) to leucine (L), methionine (M) or isoleucine (I);
  • amino acid corresponding to position 297 of SEQ ID NO: 2 is mutated from asparagine (N) to phenylalanine (F), tyrosine (Y) or lysine (K);
  • amino acid at position 306 corresponding to SEQ ID NO: 2 is mutated from tryptophan (W) to methionine (M), serine (S) or threonine (T);
  • amino acid at position 394 corresponding to SEQ ID NO: 2 is mutated from aspartic acid (D) to methionine (M);
  • amino acid corresponding to position 402 on SEQ ID NO: 2 is mutated from serine (S) to valine (V), leucine (L), phenylalanine (F), and cysteine (C) or Tyrosine (Y);
  • amino acid corresponding to position 403 on SEQ ID NO: 2 is mutated from tyrosine (Y) to tryptophan (W), threonine (T), isoleucine (I) or phenylalanine Acid (F).
  • polypeptide derived from Thermotoga maritima, Thermotoga neapolitana, Thermotoga neapolitana, Thermotoga sp), Thermotoga caldifontis, Pseudothermotoga lettingae, Halanaerobium congolense, Thermosediminibacter litoriperuensis, Rhodothermus marinus, Gracilibacillus timonensis, Thermotogaebacterium, Thermotogaebacterium, Thermotogaebacterium albicans (Candidatus Acetothermia bacterium), Pseudothermotoga thermarum, antipyretic thermophilic anaerobic bacteria (Thermoanaerobacterium thermosaccharolyticum), Thermofilum adornatus, Thermoanaerobacter italic
  • polypeptide according to (1) wherein the polypeptide includes one or more amino acid residues missing at the N-terminal or middle part or C-terminal of the polypeptide shown in SEQ ID NO: 2-32.
  • polypeptide according to (8) wherein the polypeptide is selected from the following group:
  • amino acids 196-236 of the polypeptide shown in SEQ ID NO: 2 there are 1-41 amino acids missing, preferably 1-30, more preferably 1-20, and most preferably 1-10 amino acids. It is formed and has the activity of catalyzing the conversion of D-xylulose to L-ribulose.
  • polypeptide (10) The polypeptide according to any one of (8) to (9), wherein the polypeptide is selected from the following group:
  • polypeptide corresponding to SEQ ID NO: 2 lacks amino acids 196-236, and has the activity of catalyzing the conversion of D-xylulose to L-ribulose; or
  • polypeptide corresponding to SEQ ID NO: 2 lacks amino acids 1-86 and amino acids 196-236, and has the activity of catalyzing the conversion of D-xylulose into L-ribulose.
  • polypeptide according to any one of (1) to (10), wherein the polypeptide comprises an amino acid sequence having at least 96% sequence identity with the sequence shown in SEQ ID NO: 33-122 or is composed of Its composition; optionally, the polypeptide contains at least 98.3%, at least 98.5%, at least 98.7%, at least 98.9%, at least 99.1%, and any one of the polypeptides encoded by SEQ ID NO: 33-122.
  • polynucleotide wherein the polynucleotide comprises a nucleotide sequence encoding the polypeptide according to any one of (1) to (13).
  • the nucleotide sequence encodes the amino acid shown in any one of SEQ ID NO: 33-122.
  • a nucleic acid construct comprising the polynucleotide according to (14) or (15), the polynucleotide being operably linked to one or more regulatory sequences that direct the polypeptide in Produced in an expression host.
  • a recombinant host cell comprising the nucleic acid construct according to (16) or the recombinant expression vector according to (17).
  • the method further includes the following steps:
  • a method for converting D-xylulose into L-ribulose comprising using the polypeptide according to any one of (1) to (13) to perform a catalytic reaction.
  • a method for preparing L-pentose comprising the following steps:
  • the method further includes:
  • D-xylulose 4-epimerase is selected from any one of (1)-(13) Polypeptide; preferably, the D-xylulose 4-epimerase is encoded by the amino acid sequence shown in any one of SEQ ID NO: 2-122, or the enzyme is encoded by the amino acid sequence shown in SEQ ID NO: Enzyme encoded by the sequence shown in any one of 2-122.
  • reaction system of the method further comprises an enzyme reaction solution; preferably, the enzyme reaction solution contains metal ions; more preferably The metal ion can be Co 2+ , Zn 2+ , Mg 2+ , Mn 2+ , Ca 2+ , Ni 2+ , Cu 2+ , Fe 2+ , Sr 2+ , Ba 2+ , Cd 2+ One or more of, Pb 2+ , Fe 3+ , Al 3+ , Bi 3+ , Ag + , Li +.
  • reaction vessel is selected from the group consisting of a batch-feed bioreactor, a packed bed bioreactor containing immobilized enzyme, an enzyme or cell recycling bioreactor, and a membrane-containing separation One or more of the bioreactor and continuous feed bioreactor.
  • polypeptide in the preparation of L-pentose, wherein the polypeptide is selected from the polypeptide according to any one of (1) to (13).
  • L-pentose is selected from the group consisting of L-arabinose, L-ribose, L-ribulose, L-xylulose, L-xylose and L- One or more of lyxose.
  • polypeptide as an enzyme having D-xylulose 4-epimerase activity, the polypeptide being selected from the polypeptide according to any one of (1) to (13).
  • the present disclosure has discovered a wild-type D-xylulose 4-epimerase (Xu4E) and its mutations that have the chemical reaction ability to catalyze the mutual conversion of D-xylulose and L-ribulose body.
  • Xu4E D-xylulose 4-epimerase
  • the Xu4E mutant provided in the present disclosure has improved properties, such as improved physical and/or chemical properties, compared to wild-type Xu4E.
  • the specific enzyme activity of the Xu4E mutant is increased compared with the wild-type Xu4E; in another specific embodiment, the Xu4E mutant has a reaction rate compared with the wild-type Xu4E Increase; In another specific embodiment, the Xu4E mutant has a lower K m compared to wild-type Xu4E.
  • the present disclosure has discovered a new method for preparing L-pentose. Compared with the traditional production method in the prior art, the production process is simpler and the cost of producing L-pentose is simpler. decline.
  • Figure 1 shows the Izumoring diagram for mutual conversion of all pentoses in the prior art.
  • Figure 2 shows a diagram of the mutual conversion between D-xylulose and L-ribulose catalyzed by D-xylulose 4-epimerase (Xu4E).
  • Figure 3 shows an artificial multi-enzyme pathway based on Xu4E to convert D-xylose into six L-pentoses.
  • D-XI D-xylose isomerase (EC 5.3.1.5); L-AI: L-arabinose isomerase (EC 5.3.1.4); L-RI: L-ribose isomerase (EC 5.3.1.4) 5.3.1.B3); MPI: mannose phosphate isomerase (EC 5.3.1.8);
  • D-LI D-ribose isomerase (EC 5.3.1.15); Ru3E: L-ribulose 3-epi Isomerase (EC 5.1.3.31); L-Ful: L-fucose isomerase (EC 5.3.1.25); D-AI: D-arabinose isomerase (EC 5.3.1.3) and L-RaI : L-rhamnose isomerase (EC 5.3.1.14).
  • Figure 4 shows the results of HPLC chromatographic separation of four rare sugars.
  • (a) shows the use of a Bio-Rad Aminex HPX-87H hydrogen ion exchange column. The separation conditions are: column temperature 60°C, mobile phase 5mM sulfuric acid, and flow rate 0.6mL/min.
  • (b) shows the use of a Bio-Rad Aminex HPX-87P lead ion exchange column. The separation conditions are as follows: column temperature is 60°C, mobile phase is deionized water, and flow rate is 0.6 mL/min.
  • (c) shows the use of a Waters Sugar Pak I calcium ion exchange column, and the separation conditions are: column temperature 80°C, mobile phase deionized water, and flow rate 0.5 mL/min.
  • (d) shows the use of a Shodex Sugar KS-801 sodium ion exchange column. The separation conditions are: column temperature 70°C, mobile phase deionized water, and flow rate 0.5 mL/min.
  • Figure 5 shows the comparison of the specific enzyme activity of wild-type Xu4E and 8 representative Xu4E mutants obtained by directed evolution under different reaction conditions.
  • Figure 6 shows the effect of a single amino acid residue change based on wild-type TmXu4E on its specific activity.
  • FIG. 7 shows the SDS-PAGE analysis of three thermostable enzymes (ie D-XI, Xu4E and L-AI) expressed in E. coli BL21 (DE3) purified by heat treatment.
  • T total cell lysate
  • S supernatant
  • H heat-treated cell lysate.
  • Figure 8 shows the one-pot production of L-arabinose from 50 mM D-xylose.
  • L-arabinose is produced in 100 mM HEPES buffer (pH 8.0) containing 1 U/mL XI and 1 U/mL AI, 0.2 mM Co 2+ , 1 mM Mn 2+ , 1 mg/mL Xu4E (wild type or mutant). The reaction was carried out at 50°C under anaerobic conditions.
  • Figure 9 shows the one-pot production of L-arabinose from 500 mM D-xylose.
  • the reaction was carried out in 100 mM HEPES buffer (pH 8.0) containing 10 U/mL XI and 10 U/mL AI, 0.2 mM Co 2+ , 1 mM Mn 2+, and 10 mg/mL Xu4E M87.
  • the reaction was carried out at 50°C under anaerobic conditions.
  • Figure 10 shows a comparison diagram of the production and separation (b) of L-arabinose produced by bioconversion and simulated moving bed (SMB) separation (a) and industrialized high fructose corn syrup (HFCS).
  • SMB simulated moving bed
  • HFCS industrialized high fructose corn syrup
  • Figure 11 shows the HPLC separation chart of the product (L-ribulose) produced from the substrate (D-xylulose) catalyzed by Xu4E. Among them, the peaks separated by HPLC were verified by the primary mass spectrum (Figure 12) and the secondary mass spectrum (Figure 13).
  • Figure 12 shows the first-order mass spectra of the substrate (D-xylulose) and product (L-ribulose) peaks separated by HPLC.
  • Figure 13 shows the secondary mass spectrum of the substrate (D-xylulose) and product (L-ribulose) peaks separated by HPLC.
  • Figures 14A and 14B show the comparison results of amino acid sequence analysis of polypeptides with Xu4E activity from different species.
  • the term "about” means that a value includes the standard deviation of the error of the device or method used to determine the value. Exemplarily, the aforementioned standard deviation is generally within a range of 20-30% of the original value.
  • converting refers to the chemical conversion from one molecule to another catalyzed by one or more polypeptides (enzymes);
  • the yield refers to the ratio (in %) between the molar amount of the product and the molar amount of the substrate.
  • the term "monosaccharide” refers to any type of sugar (such as D-glucose, pentose, D-xylose, L-arabinose), which cannot be hydrolyzed to give simpler sugars, and has no chemical Group modification, such as phosphate group or UDP group.
  • pentose or "pentanose” refers to any type of monosaccharide whose molecule contains five carbon atoms, such as D-xylose and L-arabinose.
  • “monosaccharides” can be labeled as “D-", or "L-”.
  • the division of the aforementioned two series is based on the structure of glyceraldehyde as the comparison standard, and is determined according to the configuration of the lowest asymmetric carbon atom in the Fisher projection formula. It is stipulated that the right-handed glyceraldehyde is defined as the isomer with the hydroxyl group on the right in the Fischer projection formula, called D-isomer; the left-handed glyceraldehyde is defined as the glyceraldehyde with the hydroxyl group on the left, called the L-isomer .
  • the chiral carbon atom of the monosaccharide is the same as D-glyceraldehyde and the hydroxyl group is located at the right end, it is labeled as D-monosaccharide; if it is the same as L-glyceraldehyde and the hydroxyl group is located at the left end, it is labeled as L-monosaccharide sugar.
  • the term "4-epimerase” refers to an enzyme capable of exchanging the hydroxyl group at carbon 4 of sugar.
  • “4-epimerase” is an enzyme capable of exchanging the hydroxyl group at carbon 4 of D-tagatose and D-fructose, and can exchange the hydroxyl group at carbon 4 of D-xylulose and L-ribulose.
  • polypeptide As used in the present disclosure, the terms “polypeptide”, “enzyme”, “polypeptide or enzyme”, and “polypeptide/enzyme” have the same meaning, and they are interchangeable in the present disclosure.
  • the aforementioned term refers to a polymer composed of peptide bonds with many amino acids, which may or may not contain modifications such as phosphate groups and formyl groups.
  • D-xylulose 4-epimerase and its abbreviated name “Xu4E” means that it can catalyze D-xylulose and L- A polypeptide (enzyme) that interconverts ribulose.
  • 1 "enzyme activity unit (U)" is defined as the amount of enzyme required to generate 1 ⁇ mol product per minute through an enzyme-catalyzed reaction of a substrate.
  • telomere activity is also expressed as “specific activity” or “specific activity”, which has the same meaning in the present disclosure and can be used interchangeably. It refers to the enzyme activity (U/mg) per milligram of polypeptide (enzyme).
  • sequence identity or “percent identity” in the comparison of two nucleic acids or polypeptides refers to when using a nucleotide or amino acid residue sequence comparison algorithm or as measured by visual inspection, When comparing and comparing with the greatest correspondence, they are the same or have the same specific percentage of the sequence.
  • the identity of a nucleotide or amino acid sequence can be defined by the following ratio, which is the way to compare two or more nucleotide or amino acid sequences according to the number of identical nucleotides or amino acids to the maximum , And add gaps as needed to compare the number of nucleotides or amino acids that are identical during the comparison, and the ratio of the total number of nucleotides or amino acids in the comparison part.
  • sequence identity between two or more polynucleotides or polypeptides can be determined by the following method: aligning the nucleotide or amino acid sequence of the polynucleotide or polypeptide and aligning the aligned Score the number of positions containing the same nucleotide or amino acid residue in the polynucleotide or polypeptide, and compare it with the number of positions containing different nucleotide or amino acid residues in the aligned polynucleotide or polypeptide .
  • Polynucleotides may differ at one position, for example, by containing different nucleotides or missing nucleotides.
  • Polypeptides may differ at one position, for example, by containing different amino acids or missing amino acids.
  • Sequence identity can be calculated by dividing the number of positions containing the same nucleotide or amino acid residue by the total number of amino acid residues in the polynucleotide or polypeptide.
  • percent identity can be calculated by dividing the number of positions containing the same nucleotide or amino acid residue by the total number of nucleotide or amino acid residues in the polynucleotide or polypeptide and multiplying by 100.
  • two or more sequences or subsequences when using sequence comparison algorithms or measuring by visual inspection to compare and align with the greatest correspondence, two or more sequences or subsequences have at least 40%, 50%, 60% %, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% of the "sequence of nucleotide or amino acid residues" Identity” or "Percent Identity”.
  • Identity or “Percent Identity”.
  • Percent Identity The judgment/calculation of "sequence identity” or “percent identity” can be based on any suitable region of the sequence.
  • a region of at least about 50 residues in length a region of at least about 100 residues, a region of at least about 200 residues, a region of at least about 400 residues, or a region of at least about 500 residues.
  • the sequence is substantially the same over the entire length of any one or two compared biopolymers (i.e., nucleic acids or polypeptides).
  • the basis for corresponding numbers of nucleotides or amino acids of different sequences is that when using nucleotide or amino acid residue sequence comparison algorithms or by visual inspection and measurement, the comparison is made with the greatest correspondence. Or comparison to determine the "sequence identity" or “percent identity", the number of the target nucleotide or target amino acid compared to the reference nucleotide or reference amino acid.
  • sequence shown in SEQ ID NO: 5 is numbered according to the sequence shown in SEQ ID NO: 2
  • sequence shown in SEQ ID NO: 5 means that the sequence shown in SEQ ID NO: 5 (corresponding to the target amino acid )
  • sequence shown in SEQ ID NO: 2 (equivalent to the reference amino acid) for the judgment of "sequence identity” or "identity percentage”
  • sequence shown in SEQ ID NO: 5 shall be the most corresponding to SEQ ID
  • the number corresponding to the sequence shown in SEQ ID NO: 2 is the number of the sequence shown in SEQ ID NO: 5.
  • amino acid mutation or “nucleotide mutation” includes “substitution, duplication, deletion or addition of one or more amino acids or nucleotides.”
  • mutation refers to a change in nucleotide sequence or amino acid sequence. In a specific embodiment, the term “mutation” refers to "substitution.”
  • the "mutation" of the present disclosure may be selected from “conservative mutations".
  • the term “conservative mutation” refers to a mutation that can normally maintain the function of a protein. Representative examples of conservative mutations are conservative substitutions.
  • the term "conservative substitution” involves replacing an amino acid residue with an amino acid residue having a similar side chain.
  • the art has defined families of amino acid residues with similar side chains, and include those with basic side chains (such as lysine, arginine, and histidine), acidic side chains (such as aspartic acid and glutamic acid). ), non-polar side chains (such as glycine, asparagine, glutamine, serine, threonine, tyrosine, and cysteine), non-polar side chains (such as alanine, valine) Acid, leucine, isoleucine, proline, phenylalanine, methionine and tryptophan), ⁇ -branched chains (e.g. threonine, valine and isoleucine) and aromatic side chains (E.g. tyrosine, phenylalanine, tryptophan and histidine).
  • basic side chains such as lysine, arginine, and histidine
  • substitutions generally exchange an amino acid at one or more positions in a protein. This substitution can be conservative. As the substitutions considered as conservative substitutions, specifically, the substitution of Ala to Ser or Thr, the substitution of Arg to Gln, His or Lys, the substitution of Asn to Glu, Gln, Lys, His or Asp, the substitution of Asp to Asn, Glu or Gln substitution, Cys to Ser or Ala, Gln to Asn, Glu, Lys, His, Asp or Arg, Glu to Gly, Asn, Gln, Lys or Asp, Gly to Pro Replacement, His to Asn, Lys, Gln, Arg or Tyr, Ile to Leu, Met, Val or Phe, Leu to Ile, Met, Val or Phe, Lys to Asn, Glu, Gln, His or Arg substitution, Met to Ile, Leu, Val or Phe, Phe to Trp, Tyr, Met, Ile or Leu, Trp, Tyr, Met, Ile or
  • polynucleotide refers to a polymer composed of nucleotides.
  • Polynucleotides can be in the form of individual fragments, or they can be a component of a larger nucleotide sequence structure, which is derived from nucleotide sequences that have been separated at least once in number or concentration, and can pass standards Molecular biology methods (for example, using cloning vectors) identify, manipulate, and restore sequences and their component nucleotide sequences.
  • a nucleotide sequence is represented by a DNA sequence (ie A, T, G, C)
  • this also includes an RNA sequence (ie A, U, G, C), where "U” replaces "T”.
  • polynucleotide refers to a polymer of nucleotides removed from other nucleotides (individual fragments or entire fragments), or can be a part or component of a larger nucleotide structure, such as expression Vector or polycistronic sequence. Polynucleotides include DNA, RNA and cDNA sequences. "Recombinant polynucleotide” belongs to a type of "polynucleotide”.
  • the term "recombinant polynucleotide” refers to a polynucleotide having a sequence that is not linked together in nature.
  • the recombinant polynucleotide can be included in a suitable vector, and the vector can be used to transform into a suitable host cell. Host cells containing recombinant polynucleotides are referred to as "recombinant host cells”.
  • the polynucleotide is then expressed in a recombinant host cell to produce, for example, a "recombinant polypeptide".
  • expression includes any step involved in the production of a polypeptide, including but not limited to: transcription, post-transcriptional modification, translation, post-translational modification, and secretion.
  • expression vector refers to a linear or circular DNA molecule that contains a polynucleotide encoding a polypeptide and the polynucleotide is operatively linked to a control sequence for its expression.
  • the term "recombinant expression vector” refers to a DNA structure used to express, for example, a polynucleotide encoding a desired polypeptide.
  • Recombinant expression vectors may include, for example, i) a collection of genetic elements that have a regulatory effect on gene expression, such as promoters and enhancers; ii) structures or coding sequences that are transcribed into mRNA and translated into proteins; and iii) appropriate transcription And the transcription subunits of translation initiation and termination sequences.
  • the recombinant expression vector is constructed in any suitable manner. The nature of the vector is not important, and any vector can be used, including plasmids, viruses, phages, and transposons.
  • Possible vectors for use in the present disclosure include, but are not limited to, chromosomal, non-chromosomal and synthetic DNA sequences, such as bacterial plasmids, phage DNA, yeast plasmids, and vectors derived from combinations of plasmids and phage DNA, such as vaccinia, adenovirus, chicken DNA of viruses such as pox, baculovirus, SV40 and pseudorabies.
  • chromosomal, non-chromosomal and synthetic DNA sequences such as bacterial plasmids, phage DNA, yeast plasmids, and vectors derived from combinations of plasmids and phage DNA, such as vaccinia, adenovirus, chicken DNA of viruses such as pox, baculovirus, SV40 and pseudorabies.
  • operably linked refers to a configuration in which a regulatory sequence is placed in an appropriate position relative to the coding sequence of a polynucleotide, so that the regulatory sequence directs the expression of the coding sequence.
  • the regulatory sequence may be selected from sequences encoded by promoters and/or enhancers.
  • nucleic acid construct includes a polynucleotide encoding a polypeptide or domain or module operably linked to a suitable regulatory sequence that is useful for performing polynucleotide control in selected cells or strains. Expression is required.
  • endogenous refers to polynucleotides, polypeptides, or other compounds that are naturally expressed or produced in organisms or cells. That is, endogenous polynucleotides, polypeptides or other compounds are not exogenous. For example, when the cell is initially isolated from nature, there is an “endogenous" polynucleotide or polypeptide in the cell.
  • exogenous refers to any polynucleotide or polypeptide that is naturally found or expressed in a particular cell or organism that needs to be expressed. Exogenous polynucleotides, polypeptides or other compounds are not endogenous.
  • wild-type refers to an object that can be found in nature.
  • a polypeptide or polynucleotide sequence that exists in an organism can be isolated from a source in nature and has not been intentionally modified by humans in the laboratory, is naturally occurring.
  • naturally occurring and “wild-type” are synonymous.
  • mutant refers to a polynucleotide or polypeptide that contains an alteration (i.e., several) at one or more (e.g., several) positions relative to the "wild-type", or “compared” polynucleotide or polypeptide.
  • substitution refers to replacing a nucleotide or amino acid occupying a position with a different nucleotide or amino acid.
  • Deletion refers to the removal of nucleotides or amino acids occupying a certain position.
  • Insertion refers to the addition of nucleotides or amino acids adjacent to and immediately following the nucleotide or amino acid occupying a position.
  • the "mutant" in the present disclosure is a polypeptide that still has D-xylulose 4-epimerase (Xu4E) activity.
  • overexpressed recombinant genes produce more RNA and/or protein than the corresponding naturally occurring genes in the microorganism. Methods for measuring RNA and protein quality are known in the art. Overexpression can also be determined by measuring protein activity, such as enzyme activity. According to an embodiment of the present disclosure, "overexpression" is an amount of at least 3%, at least 5%, at least 10%, at least 20%, at least 25%, or at least 50% or more.
  • the overexpressed polynucleotide is usually a polynucleotide inherent to the host cell, and the production amount of its product is greater than the normal production amount in the host cell. For example, but not limitation, overexpression is achieved by operably linking the polynucleotide to a promoter other than the natural promoter of the polynucleotide, or introducing additional copies of the polynucleotide into the host cell.
  • fragment means a polypeptide or a catalytic or carbohydrate binding module in which one or more (eg, several) amino acids have been deleted from the amino and/or carboxyl terminus of a mature polypeptide or domain .
  • the fragment has D-xylulose 4-epimerase (Xu4E) activity.
  • isolated means a substance in a form or environment that does not exist in nature.
  • isolated substances include (1) any non-naturally occurring substance, (2) any substance including but not limited to any enzyme, mutant, nucleic acid, protein, peptide or cofactor, which substance is at least partially derived from One or more or all naturally occurring components related to its nature are removed; (3) Any substance that is artificially modified relative to a substance found in nature; or (4) By adding a substance relative to other components that are naturally related to it Any substance (such as recombinant production in a host cell; multiple copies of the gene encoding the substance; and the use of a promoter that is stronger than the promoter naturally associated with the gene encoding the substance).
  • the separated material may be present in the fermentation broth sample.
  • the host cell can be genetically modified to express the polypeptide of the present disclosure.
  • the fermentation broth from the host cell will contain the isolated polypeptide.
  • the separated substance may be present in the sample of the biotransformation fluid.
  • the target product L-arabinose can be separated from the enzyme-catalyzed polysaccharide mixed liquid.
  • high stringency conditions means that for probes with a length of at least 100 nucleotides, follow standard Southern blotting procedures at 42°C at 5X SSPE (saline sodium phosphate EDTA) , 0.3% SDS, 200 ⁇ g/ml sheared and denatured salmon sperm DNA and 50% formamide pre-hybridization and hybridization for 12 to 24 hours. Finally, the carrier material was washed three times with 2X SSC and 0.2% SDS at 65°C for 15 minutes each time.
  • 5X SSPE saline sodium phosphate EDTA
  • very high stringency conditions means that for probes with a length of at least 100 nucleotides, following standard Southern blotting procedures, at 42°C at 5X SSPE (saline sodium phosphate EDTA) ), 0.3% SDS, 200 ⁇ g/ml sheared and denatured salmon sperm DNA and 50% formamide pre-hybridization and hybridization for 12 to 24 hours. Finally, the carrier material was washed three times with 2X SSC and 0.2% SDS at 70°C for 15 minutes each time.
  • 5X SSPE saline sodium phosphate EDTA
  • free enzyme refers to an enzyme that does not contain living organisms.
  • the free enzyme of the present disclosure can be partially or highly purified after lysing the cells on which it is expressed, suspended in a solution, soluble or bound to an insoluble matrix.
  • immobilized enzyme refers to an enzyme that has a catalytic effect in a certain space and can be used repeatedly and continuously. Usually enzyme-catalyzed reactions are carried out in aqueous solutions, while immobilized enzymes are treated by physical or chemical methods to make water-soluble enzymes insoluble in water but still in a state of enzymatic activity.
  • the term "host cell” means any cell type that is easily transformed, transfected, transduced, etc. with a nucleic acid construct or expression vector containing the polynucleotide of the present disclosure.
  • the term “host cell” encompasses any progeny of a parent cell that differs from the parent cell due to mutations that occur during replication.
  • the term "whole cell microorganism” refers to a whole cell that does not completely lyse the cell membrane.
  • the whole cell microorganism containing the enzyme can be used directly, or immobilized to maintain stability and recyclability, or the whole cell can be permeabilized to obtain a rapid reaction rate.
  • catalytic reaction refers to a chemical reaction carried out under the action of a catalyst.
  • a catalyst can only selectively accelerate a specific reaction, which may cause the chemical reaction to proceed in one of several thermodynamically possible directions.
  • the catalyst and the reactants are in the same phase as homogeneous catalytic reaction (Homogeneous Catalytic Reaction), and those in different phases are called heterogeneous catalytic reaction (or heterogeneous catalytic reaction) (Heterogeneous Catalytic Reaction).
  • the reaction involving the biocatalyst-enzyme is called Enzymic Catalytic Reaction.
  • the catalytic reaction can be catalyzed by enzymes or multiple enzymes outside the whole cell.
  • the aforementioned catalytic reaction can also be called “enzyme-catalyzed reaction”, which refers to the process of chemical transformation using enzymes as catalysts. This reaction process is also called biotransformation or biocatalysis.
  • the catalytic reaction can be carried out in a living body/cell, and the aforementioned catalytic reaction can also be referred to as an "intracellular catalytic reaction".
  • intracellular catalytic reaction can also be referred to as “whole cell biocatalytic reaction”, which refers to the use of intact biological organisms (ie whole cells, tissues or even individuals) as a catalyst for chemical transformation process.
  • Organic catalysts commonly used in whole-cell biocatalytic reactions are mainly microorganisms, and their essence is to use one or more enzymes in one or more microbial cells for catalysis.
  • methods of biotransformation using animal cells, plant cells and even individual organisms have also been developed. Commonly used methods of whole-cell biotransformation are fixation of cells on the reaction plane, suspension of microspheres, and porous solid-phase carriers.
  • the term "fermentation product” refers to a preparation that is produced by cell fermentation, does not undergo or undergoes minimal recovery and/or purification.
  • the fermentation product may contain the unfractionated or fractionated content of the fermented material obtained at the end of the fermentation.
  • the fermentation product is unfractionated and contains spent medium and cell debris present after the microbial cells (eg, filamentous fungal cells) have been removed, for example, by centrifugation.
  • the fermentation product contains spent cell culture medium, extracellular enzymes, and viable and/or non-viable microbial cells.
  • biocatalytic product refers to a preparation produced by a biocatalyst (polypeptide or enzyme or whole cell) for biocatalysis that does not undergo or undergo minimal recovery and/or purification.
  • Biocatalysis is carried out in an aqueous buffer catalyzed by a biocatalyst, which contains metal ions.
  • the biocatalyst includes a free enzyme for reaction, a cell lysate containing the enzyme, a whole cell organism containing the enzyme, immobilized the enzyme, and cross-linked aggregates of the enzyme.
  • the term "bioreactor” is a device system that utilizes the biological functions of enzymes or organisms (such as microorganisms) to perform biotransformation reactions. It is a biological function simulator, such as a fermentation tank, immobilized enzyme or immobilized cell reactor, etc.
  • the specific enzyme activity of the Xu4E mutant is increased compared with the wild-type Xu4E; in another specific embodiment, the Xu4E mutant has a reaction rate compared with the wild-type Xu4E Increase; In another specific embodiment, the Xu4E mutant has a lower K m compared to wild-type Xu4E.
  • the new enzymes disclosed in the present disclosure naturally exist in various organisms. Although specific enzymes with desired activities are used in the examples, the present disclosure is not limited to these enzymes because other enzymes may have similar activities and can be used. For example, it may be discovered that some new polypeptides can also catalyze the mutual conversion of D-xylulose and L-ribulose. Other reactions described in the present disclosure can be catalyzed by enzymes not described in the embodiments of this embodiment, and are also included in this embodiment.
  • mutants of these enzymes can be used in the present disclosure in which the catalytic activity has been altered, for example to make them more active and stable under acidic or alkaline conditions.
  • the amino acid sequence mutants of the polypeptide include substitution, insertion or deletion mutants, and the mutants may be substantially homologous or substantially identical to the unmodified enzyme.
  • the mutant retains at least some of the biological activity of the enzyme, such as catalytic activity.
  • Other mutants include enzyme mutants that retain at least about 10%, preferably at least about 50%, more preferably at least about 75%, and most preferably at least about 90% of the biological activity.
  • a polypeptide or polynucleotide derived from an organism contains one or more modifications to the natural amino acid sequence or nucleotide sequence, and compared with the natural enzyme, if not better, exhibits similar activities (for example, It is at least 10%, at least 30%, at least 50%, at least 70%, at least 80%, at least 90%, at least 100% or at least 110% of the natural enzyme activity level, or even higher enzyme activity).
  • the enzyme activity is improved by directed evolution of the parent/naturally occurring sequence.
  • the enzyme coding sequence may be mutated to obtain the desired performance.
  • the "required performance" is selected from the group consisting of better thermal stability, increased reaction rate, change in optimal pH, or preference for metal cofactors, and the like.
  • the free enzyme used in the present disclosure or the cell lysate containing the enzyme is water-soluble. It is usually best to use immobilized enzymes. Immobilized enzymes are generally more stable and durable. Immobilized enzymes are also easier to recover and use in multiple catalytic cycles, thereby reducing the cost of the production process. In the art, many methods of enzyme immobilization are known. Enzymes can also be cross-linked to form a cross-linked enzyme aggregate (CLEA), which is generally more stable and easier to recycle and reuse. Many enzymes are present in organisms. They can be used as biocatalysts to produce rare sugars, but they can also be expressed heterologously in engineered microorganisms and then can be used as biocatalysts.
  • CLMA cross-linked enzyme aggregate
  • the recombinase used in the present disclosure can be retained in the whole cell without complete cell lysis.
  • the whole cell contains one or more enzymes. It is usually best to use immobilized whole cells.
  • Whole cells can be permeabilized by many techniques, such as organic solvent treatment, chemical reagent treatment or heat treatment. Immobilized cells are also easier to recover and reuse in multiple catalytic cycles, thereby reducing the cost of the production process.
  • many methods for whole cell permeability and whole cell immobilization are known.
  • the present disclosure relates to methods of immobilization and crosslinking of whole cells that catalyze the reactions described in the present disclosure.
  • Error-prone PCR is the use of DNA polymerase to amplify the promoter sequence by adjusting the reaction conditions, such as increasing the concentration of magnesium ions, adding manganese ions, changing the concentration of the four dNTPs in the system, or using low-fidelity DNA polymerase, etc.
  • the method is to change the mutation frequency in the DNA amplification process, thereby randomly introducing mutations into the target DNA sequence with a higher mutation frequency, and obtaining random mutants for the target sequence.
  • compositions of the present disclosure can be adapted to various conventional fermentation or enzyme-catalyzed bioreactors (e.g., batch, batch feed, cell or enzyme recycling and continuous fermentation or continuous enzyme catalysis).
  • bioreactors e.g., batch, batch feed, cell or enzyme recycling and continuous fermentation or continuous enzyme catalysis.
  • the amount of biocatalytic product formed per unit time is generally a function of the catalytic activity conditions of the enzyme (for example, pH, temperature, metal ion) and the amount of enzyme present in the catalytic process.
  • the solution containing the metal ions may contain one or more metal ions.
  • the solution containing metal ions may be selected from solutions containing CuCl 2 , FeCl 3 , ZnCl 2 , CaCl 2 , MgCl 2 , CoCl 2 , NiCl 2 or MnCl 2 .
  • Some of the key parameters of a highly efficient microbial-catalyzed fermentation process include the ability to grow microorganisms to a larger cell density, increase the yield of the desired product, increase the amount of volumetric productivity, remove undesirable co-metabolites, and improve cheap carbon and nitrogen sources. Utilize and adapt to change fermentation tank conditions, increase bacterial production, increase recombinant enzyme synthesis, increase tolerance to acidic conditions, increase tolerance to alkaline conditions, increase tolerance to organic solvents, and increase Tolerance to high salt conditions and increased tolerance to high or low temperature.
  • multiple enzymes may exist in one or more forms of a free enzyme, a cell lysate containing the enzyme, a whole cell containing the enzyme, and an immobilized enzyme , Biocatalysis is carried out in the reaction liquid containing the pentose substrate, so that the converted product is generated into the reaction liquid.
  • the final product of enzyme catalysis can be separated from the reaction liquid using any suitable method known in the art.
  • L-pentose can be separated from multiple enzymes, reactants, reaction intermediates and biocatalytic products, and various methods known in the art can be used to recover and/or purify the biocatalytic products from the reactants and reaction intermediates.
  • the biocatalytic product is recovered from the bioreactor.
  • the microorganisms are disrupted, and the culture medium or lysate is centrifuged to remove particulate cell debris and separate cell membranes to obtain a soluble protein fraction including the enzyme, and the multienzyme can catalyze the production of L-pentose.
  • the separation and purification methods of L-pentose include, but are not limited to chromatography, simulated moving bed chromatography, crystallization, adsorption and release based on ionic, hydrophobic and size exclusion resins, filtration, microfiltration, ultrafiltration, nanofiltration , Centrifugation, extraction, salt or solvent precipitation, drying or a combination thereof.
  • the required separation is not limited to enzyme removal/recovery, but also includes the recovery of some or all of the remaining products and reactants (including D-xylose, D-xylulose, L-pentose and metal ions) mixtures; required The separation may not require further purification. With or without the recovery of D-xylose, D-xylulose and L-ribulose, as well as the purification, immobilization and recycling of enzymes are further embodiments included in the present disclosure.
  • the enzymes that catalyze some or all of the reactions described in the present disclosure can be expressed in non-natural, engineered heterologous organisms. Specifically, genes encoding pathway enzymes can be isolated, inserted into expression vectors used for transformation of production organisms, can be incorporated into the genome, and the enzymes can be directly expressed. In this field, methods for manipulating microorganisms are known, such as “Modern Methods of Molecular Biology” (Online ISBN: 9780471142720, John Wiley and Sons, Inc.), “Microbial Metabolism Engineering: Methods and Procedures” (Qiong Cheng Ed., Springer) and “System Metabolic Engineering: Methods and Procedures” (Hal S. Alper Ed., Springer) and other publications.
  • Mutants can be constructed by up-regulating or down-regulating the expression of the polynucleotide using methods well known in the art, such as insertion, destruction, substitution, or deletion.
  • the polynucleotide to be modified or inactivated may be a coding region or a part thereof necessary for activity, or a regulatory element required for the expression of the coding region.
  • An example of such a regulatory or control sequence may be a promoter sequence or a functional part thereof, that is, a part sufficient to influence the expression of the polynucleotide.
  • Other control sequences that can be modified include, but are not limited to: leader, polyadenylation sequence, propeptide sequence, signal peptide sequence, transcription terminator, and transcription activator.
  • multiple enzymes are mixed to form an artificial multi-enzyme pathway, which can convert raw materials, such as D-xylose or other intermediate products (D-xylulose) into L-pentose, and Recovery of L-pentose.
  • the biological reaction process can be carried out under aerobic, micro-aerobic or anaerobic conditions.
  • the biocatalytic reaction is performed under anaerobic conditions (ie, there is no detectable oxygen).
  • the biological reaction process is carried out under the condition of 30°C-90°C. In some specific embodiments of the present disclosure, the biological reaction process is carried out under the condition of 40°C-80°C. In some more specific embodiments of the present disclosure, the biological reaction process is carried out under the condition of 50°C-70°C. In some more specific embodiments of the present disclosure, the biological reaction process is carried out under the condition of 60°C-70°C.
  • the preparation method of the pET plasmid carrying the corresponding protein coding gene is as follows.
  • L-ribulose 5-phosphate-4-epimerase that may possess the enzyme activity of Xu4E
  • they are derived from T. maritima, E. coli, and spores.
  • the rp4e genes of Bacillus subtilis 168 and Geobacillus stearothermophilus were amplified from their corresponding genomes.
  • POE-PCR extended overlap extension PCR
  • reaction conditions for POE-PCR are as follows: 250ng pET20b plasmid backbone and equimolar target gene fragments, 0.2mM various dNTPs, and 0.02U/ ⁇ l Q5 DNA polymerase.
  • PCR amplification conditions 98°C for 1min; 98°C for 20s, 60°C for 20s, 72°C for 72s, 30 cycles; 72°C for 5min.
  • R (SEQ ID NO: 124): GGTGGTGGTGCTCGAGCCCCTCCAGCAGATCCACGTGCC-3'.
  • the uxaE gene was amplified from the genome of Thermus marinus by PCR method.
  • R (SEQ ID NO: 126): 5'-CTTTCAAGACCATATGGGTATATCTCCTTCTTAAAG-3'.
  • the multimer plasmid was amplified and transformed into E. coli TOP10 to obtain the plasmid pET28a-tm_UxaE.
  • L-arabinose isomerase L-AI
  • DNA sequence of AI derived from the thermostable bacterium Geobacillus stearothermophilus was codon-optimized and synthesized by General Biology (Anhui, China). Plasmid pET20b-BsAI.
  • thermostable xylose isomerase D-XI
  • all recombinases have histidine fusion tags and are purified by affinity adsorption using nickel ion resin.
  • the pET plasmid carrying the gene encoding the target protein was cultured using E.coli BL21 cells in 250 ml of LB medium at a temperature of 37°C. When the cell absorbance A 600 reaches ⁇ 0.6-0.8, add 0.1mM IPTG to induce protein expression. Protein expression was performed at 37°C for 6 hours or at 18°C for 16 hours. After the cells were collected by centrifugation, the pellet was resuspended in 50 mM HEPES buffer (pH 7.5) containing 0.1 M sodium chloride and 10 mM imidazole.
  • the cell membrane was broken by ultrasound, and after centrifugation, the supernatant sample containing the enzyme was loaded onto a nickel ion resin purification column.
  • the target enzyme was eluted and purified with 50 mM HEPES buffer (pH 7.5) containing 0.1 M sodium chloride and 150-500 mM imidazole.
  • the enzyme concentration can be determined by the Bradford measurement method, and bovine serum albumin is used as the standard protein.
  • the expression level of the recombinant protein and the purity of the protein were detected by sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE), and the abundance using Image Lab software (Böhler, Hercules, California, USA) The analysis function performs quantification.
  • thermostable enzymes used in the synthesis of L-arabinose are D-xylose isomerase derived from Thermus thermophiles, wild-type Xu4E and Xu4E derived from T. maritima Mutant M8 and L-arabinose isomerase derived from Geobacillus stearothermophilus can be purified by heat treatment (50-80°C, 10-60 minutes). The cell lysate is heat-treated and centrifuged, and the supernatant containing the above three enzymes is mixed, which can be used for the conversion from D-xylose to L-arabinose.
  • the screening plasmid pGS-Xu4E contains the mCherry gene under the control of the P BAD promoter, the wild-type araC gene under the control of the P AraC promoter, and the xu4e gene under the control of the P tac promoter.
  • the Xu4E positive mutants can produce more L-arabinose, thereby inducing E. coli cells to express higher levels of mCherry fluorescent protein and produce stronger fluorescent signals.
  • the screening plasmid is constructed using standard DNA assembly techniques.
  • E. coli JZ919 (TOP10 ⁇ xylB::araA) was constructed as a screening host and used together with the screening plasmid pGS-Xu4E.
  • the screening plasmid pGS-Xu4E carries a genetic sensor that can detect the concentration of L-arabinose in the cell and display the mCherry fluorescence signal.
  • L-arabinose two genes related to the utilization of D-xylulose and L-ribulose in E. coli host cells were knocked out, and the araA gene was inserted into the genome of E. coli. Starting from E. coli Top10 ( ⁇ araABCD), the xylB gene in the genome is replaced by the araA gene, and the knockout and insertion are performed simultaneously.
  • the xu4e gene mutant library was established by error-prone PCR (ep-PCR) with low mutation rate.
  • the primers are: MUxaE-IF (SEQ ID NO: 127): 5'-CCATATGGTCTTGAA-3'; MUxaE-IR (SEQ ID NO: 128): 5'-GGTGGTGGTGCTCGAGCCCCTCCAGCAGATCCACGTGCC-3' .
  • MUxaE-IF is a 5'-end phosphorylated primer.
  • the last 28 bp sequence at the 5'end of MUxaE-I R is a homologous complementary sequence to the sequence of the plasmid backbone.
  • 50 ⁇ l ep-PCR reaction system includes 1ng/ ⁇ l plasmid pGS-Xu4E, 0.2mM dATP, 0.2mM dGTP, 1mM dCTP, 1mM dTTP, 5mM MgCl 2 , 0.05mM MnCl 2 , 0.4 ⁇ M primers (MUxaE-IF and MUxaE-IR) And 0.05U/ ⁇ l NEB Taq polymerase.
  • E. coli JZ919 The chemically competent cells of E. coli JZ919 can be prepared according to the prior art, for example, by the method described in the "Molecular Biology Experiment Guide”. Further, E. coli JZ919 cells carrying the uxaE mutant library were cultured on LB solid medium containing D-xylose. After culturing at 37°C for 12 hours, the color of the colony was observed every 4 hours. Select positive clones by eyeball observation or by UV radiation to detect the fluorescence intensity of the colony. The clones showing stronger fluorescence intensity were selected and cultured in a 96 deep-well plate containing 0.5 ml of LB medium. D-xylose was added to the LB medium and cultured at 37°C for 12 hours.
  • the fluorescence signal of the cell culture medium in the 96-well plate was used SynergyMx multi-function microplate detector (Berton, Vermont, USA).
  • the fluorescence excitation scan was performed at 589 nm, and the emission scan was performed at 610 nm.
  • D-xylose/D-xylulose mixture Preparation of D-xylose/D-xylulose mixture.
  • the D-xylose/D-xylulose mixture was prepared in 1 ml of 50mM HEPES buffer (pH 7.5), the buffer containing 1M xylose, 5mM MgCl 2 and 50mg immobilized D-XI, XI was purchased from Sigma -Aldrich (G4166). After an overnight reaction at 70°C, the immobilized XI was removed by centrifugation.
  • the D-xylose/D-xylulose mixture contains approximately 700 mM xylose and 300 mM xylulose.
  • the enzyme activity determination of Xu4E uses a step-by-step enzyme activity determination method.
  • the reaction solution is a 50mM Tris buffer containing 70mM xylose, 30mM xylulose and 0.2mM Co 2+ or 2mM Zn 2+ .
  • the Xu4E enzyme activity determination is carried out at 30-80°C for 15 minutes to 24 hours, and the enzyme protein solubility is 0.001-10g/L.
  • draw 65 ⁇ L of the reaction solution and mix it with 35 ⁇ L of 1.88M HClO 4 , and add 13.5 ⁇ L of 5M KOH to neutralize the mixture.
  • the second step of the conversion reaction was carried out in 50 mM HEPES buffer (pH 7.5) containing 10 U/mL (excess) L-AI and 1 mM Mn 2+.
  • the supernatant containing L-ribulose was removed. It is converted into L-arabinose, and the enzyme reaction is carried out at 50°C for 15 minutes.
  • the L-arabinose concentration was measured with Megazyme L-arabinose/D-galactose assay kit (K-ARGA, Borre, Ireland).
  • the enzyme activity of Xu4E was measured in 50 mM Tris buffer (pH 8.5) containing 0.2 mM Co 2+ at 50°C.
  • the total concentration of D-xylose and D-xylulose is 1.5 to 1000 mM, and the concentration of D-xylulose is 0.5 to 300 mM.
  • Enzyme activity determination was carried out at 50°C for 15 minutes.
  • the apparent K m and k cat constants of Xu4E for D-xylulose are based on the nonlinear fitting of the Michaelis equation and calculated using GraphPad Prism 5 software (Graphpad Software Company, Los Angeles, California, USA).
  • L-arabinose from 50mM D-xylose is carried out in a 1ml reaction system, which is 100mM HEPES buffer (pH 8.0) including 0.2mM Co 2+ , 1mM Mn 2+ , 1g/L Xu4E (wild type , Its DNA sequence SEQ ID No: 1, and representative mutant M8, its amino acid sequence SEQ ID No: 40), 1U/mL D-XI and 1U/mL L-AI.
  • concentration of Xu4E increased to 10g/L
  • D-XI and L-AI also increased to 10U/mL.
  • the reaction solution was reacted at 50°C after mixing the three enzymes.
  • the concentration of L-arabinose was measured using Megazyme L-arabinose/D-galactose assay kit (K-ARGA, Borre, Ireland), while the concentration of D-xylose, D-xylulose and L-ribulose
  • K-ARGA Megazyme L-arabinose/D-galactose assay kit
  • D-xylose, D-xylulose and L-ribulose concentration was detected by Shimadzu High Performance Liquid Chromatography equipped with a differential refractive index detector and separated by Bó Lè Aminex HPLC HPX-87H liquid column.
  • the product L-ribulose obtained was analyzed by LC-ESI-QTOF-MS.
  • the 1mL reaction system includes 50mM Tris buffer (pH 8.5), 10mM D-xylulose, 0.2mM Co 2+ and 1g/L Xu4E. After the reaction was carried out at 50°C for 1 hour, 538 ⁇ L of HClO 4 was added to terminate the reaction. The mixture was neutralized by adding 207 ⁇ L of 5M KOH. Centrifugation to remove inactivated proteins and precipitates. The samples were separated by Shimadzu high performance liquid chromatography.
  • the products were detected by QTOF (compact QTOF, Bruker, Germany) equipped with electrospray ionization (ESI) quadrupole time-of-flight tandem mass spectrometry. . .
  • Waters Sugar Pak I calcium ion exchange column (300 x 6.5 mm, 10 ⁇ m particle size) was used as a stationary phase for sample separation (Waters Co, Milford, Massachusetts, USA).
  • the mobile phase is deionized water, the flow rate is 0.5mL/min, the column temperature is 80°C, and the sample volume is 20 ⁇ l.
  • ESI uses negative ion mode.
  • the capillary voltage is 4500V
  • the sprayer pressure is 2bar
  • the drying heater is 200°C
  • the drying airflow is 8L/min.
  • the present disclosure also relates to fermentation products or cell lysates containing the polypeptides of the present disclosure.
  • the fermentation product further contains additional components used in the fermentation process, such as whole cells (including host cells containing genes encoding the polypeptides of the present disclosure, which are used to produce the polypeptides of interest), or cell lysates.
  • the composition is a whole culture medium containing killed whole cells containing the enzyme, a cell lysate containing the enzyme, and a culture medium containing the whole cell killing of the enzyme.
  • Simulated moving bed is a mass transfer device that uses the principle of adsorption to perform liquid separation operations. It is performed in a countercurrent continuous operation mode.
  • Industrialized SMB has been increasingly used to separate low value-added biological products, such as organic acids, amino acids, and rare sugars. Combining enzyme immobilization and SMB separation can effectively reduce the production cost of the required L-pentose and improve the utilization efficiency of the substrate.
  • the SMB resin includes Shodex Sugar KS-801 sodium ion exchange column, Waters Sugar Pak I calcium ion exchange column, Bio-Rad Aminex HPX-87P lead ion exchange column or Bio-Rad Aminex HPX-87H hydrogen ion exchange column. The one that arrives, or other similar resins, or a series combination of them.
  • SEQ ID NO: 1 shows the wild-type tagatonic acid carbon 3-epimerase gene of Thermotoga maritima MSB8 (NCBI reference sequence: WP_004081526.1, KEGG ID TM0440) Nucleotide sequence
  • SEQ ID NO: 2 shows the wild-type tagatonic acid carbon 3-epimerase gene of Thermotoga maritima MSB8 (NCBI reference sequence: WP_004081526.1, KEGG ID TM0440) Amino acid sequence
  • SEQ ID NO: 3 shows the amino acid sequence of the tagatonic acid carbon 3-epimerase gene (NCBI reference sequence: WP_015918744.1) of Thermotoga neapolitana;
  • SEQ ID NO: 4 shows the amino acid sequence of the tagatonic acid carbon 3-epimerase gene (NCBI reference sequence: WP_101512888.1) of Thermotoga sp SG1 (Thermotoga sp SG1);
  • SEQ ID NO: 5 shows the amino acid sequence of the tagatonic acid carbon 3-epimerase gene of Thermotoga caldifontis (NCBI reference sequence: WP_041077375.1);
  • SEQ ID NO: 6 shows the amino acid sequence of the tagatonic acid carbon 3-epimerase gene (NCBI reference sequence: WP_012002872.1) of Pseudothermotoga lettingae;
  • SEQ ID NO: 7 shows the amino acid sequence of the tagatonic acid carbon 3-epimerase gene (NCBI reference sequence: WP_081374543.1) of Halanaerobium congolense;
  • SEQ ID NO: 8 shows the amino acid sequence of the 4-epimerase gene (NCBI reference sequence: TYP53248.1) of Thermosediminibacter litoriperuensis;
  • SEQ ID NO: 9 shows the amino acid sequence of the 4-epimerase gene of Rhodothermus marinus (NCBI reference sequence: WP_012844026.1);
  • SEQ ID NO: 10 shows the amino acid sequence of the 4-epimerase gene of Gracilibacillus timonensis (NCBI reference sequence: WP_066188474.1);
  • SEQ ID NO: 11 shows the amino acid sequence of the 4-epimerase gene (NCBI reference sequence: HCZ06146.1) of Thermotogae bacterium;
  • SEQ ID NO: 12 shows the amino acid sequence of the 4-epimerase gene of Thermotogae bacterium (NCBI reference sequence: RKX45454.1);
  • SEQ ID NO: 13 shows the amino acid sequence of the 4-epimerase gene (NCBI reference sequence: HAF71394.1) of Candidatus Acetothermia bacterium;
  • SEQ ID NO: 14 shows the amino acid sequence of the 4-epimerase gene (Kegg ID: Theth_1083) of Pseudothermotoga thermorum;
  • SEQ ID NO: 15 shows the amino acid sequence of the 4-epimerase gene (Kegg ID: Tthe_2391) of Thermoanaerobacterium thermosaccharolyticum DSM 571;
  • SEQ ID NO: 16 shows the amino acid sequence of the 4-epimerase gene (Kegg ID: TCARB_0828) of Thermofilum adornatus 1505;
  • SEQ ID NO: 17 shows the amino acid sequence of the 4-epimerase gene (Kegg ID: Thit_1746) of Thermoanaerobacter italicus;
  • SEQ ID NO: 18 shows the amino acid sequence of the 4-epimerase gene (Kegg ID: Tnap_0222) of Thermotoga naphthophila;
  • SEQ ID NO: 19 shows the amino acid sequence of the 4-epimerase gene (Kegg ID: Cst_c08510) of Thermoclostridium stercorarium DSM 8532;
  • SEQ ID NO: 20 shows the amino acid sequence of the 4-epimerase gene (Kegg ID: DICTH_1923) of Dictyoglomus thermophilum;
  • SEQ ID NO: 21 shows the amino acid sequence of the 4-epimerase gene (Kegg ID: STHERM_c04350) of Spirochaeta thermophila DSM 6192;
  • SEQ ID NO: 22 shows the amino acid sequence of the 4-epimerase gene (Kegg ID: Sinac_2806) of Singulisphaera acidiphila;
  • SEQ ID NO: 23 shows the amino acid sequence of the D-xylulose 5-phosphate 4-epimerase gene (Kegg ID: TM0283) of Thermotoga maritima MSB8;
  • SEQ ID NO: 24 shows the amino acid sequence of the D-xylulose 5-phosphate 4-epimerase gene (NCBI Reference Sequence: WP_041077291.1) of Thermotoga caldifontis;
  • SEQ ID NO: 25 shows the amino acid sequence of the D-xylulose 5-phosphate 4-epimerase gene (GenBank: ACM22577.1) of Thermotoga neapolitana DSM 4359;
  • SEQ ID NO: 26 shows the amino acid sequence of class II aldolase (GenBank: KUK21094.1) of Pseudothermotoga lettingae;
  • SEQ ID NO: 27 shows the amino acid sequence of the D-xylulose 5-phosphate 4-epimerase gene (Kegg ID: BSU28780) of Bacillus subtilis;
  • SEQ ID NO: 28 shows the amino acid sequence of the D-xylulose 5-phosphate 4-epimerase gene (NCBI Reference Sequence: WP_060788488.1) of Geobacillus zalihae;
  • SEQ ID NO: 29 shows the amino acid sequence of the D-xylulose 5-phosphate 4-epimerase gene (GenBank: KFL15052.1) of Geobacillus stearothermophilus;
  • SEQ ID NO: 30 shows the amino acid sequence of the D-xylulose 5-phosphate 4-epimerase gene (NCBI Reference Sequence: WP_042385633.1) of Parageobacillus thermoglucosidasius;
  • SEQ ID NO: 31 shows the amino acid sequence of the D-xylulose 5-phosphate 4-epimerase gene (NCBI Reference Sequence: WP_094043878.1) of Thermoanaerobacterium thermosaccharolyticum;
  • SEQ ID NO: 32 shows the amino acid sequence of the D-xylulose 5-phosphate 4-epimerase gene (Kegg ID: b0061) of Escherichia coli K-12MG1655.
  • SEQ ID NO: 33-122 shows the mutant constructed by the inventor.
  • nucleotide sequence corresponding to the amino acid sequence shown in SEQ ID NO: 3-32 is also known to those skilled in the art. Content.
  • D-xylose, D-xylulose, D-ribulose, and L-arabinose are separated by using any of the methods described in (1)-(4) below:
  • the concentration can be an island equipped with a differential refractive index detector. Jin High Performance Liquid Chromatography for detection.
  • the natural enzyme with the activity of Xu4E enzyme is the enzyme encoded by the sequence shown in SEQ ID NO: 23-32.
  • Example 2 Considering the similarity of the substrate structure and the possible enzyme catalytic mechanism, we adopted a method similar to that in Example 2, and selected the possible possibility from the tagatonic acid 3-epimerase (UxaE, EC 5.1.2.7) A new enzyme with the function of converting D-xylulose into L-ribulose. We cloned multiple uxae genes from different microorganisms and cloned them into the pET plasmid. E. coli BL21 (DE3) carrying the expression plasmid is cultured and the recombinant protein is expressed.
  • UxaE tagatonic acid 3-epimerase
  • the obtained recombinant protein is detected.
  • the taganoic acid 3-epimerase (Tm0440) from Thermus marinus showed specific enzyme activity under the optimal reaction conditions (70°C, 5mM Zn 2+ ). It is about 0.012U/mg.
  • the natural enzyme with the activity of the Xu4E enzyme was the enzyme encoded by the sequence shown in SEQ ID NO: 2-22.
  • TmXu4E tagononic acid 3-epimerase
  • the mutant library inserted into the plasmid pGS-Xu4E was transferred to the host E. coli JZ919 and spread on the solid plate of LB medium containing D-xylose. Approximately 10,000 clones were selected on the plate, and the positive mutants were picked out and inoculated into 96-well plates and cultured in LB medium containing D-xylose.
  • "Xu4E mutant library of high-throughput screening" method described in the present disclosure we are using a microplate reader fluorescence signal of the cell culture microplate is detected, a positive acknowledgment mutants. Several positive mutants were selected in each round.
  • mutant M4 containing S125D.
  • specific enzyme activity of mutant M4 As a result, it was found that M4 showed a 25% increase in specific enzyme activity relative to the natural enzyme (Figure 5).
  • mutant M2 selected in Example 4 we used error-prone PCR with low mutation rate (that is, ⁇ 1 mutation site per gene) to construct the xu4e mutant library.
  • the mutant library inserted into the plasmid pGS-Xu4E was transferred to the host E. coli JZ919 and spread on the solid plate of LB medium containing D-xylose. Approximately 10,000 clones were selected on the plate, and the positive mutants were picked out and inoculated into 96-well plates and cultured in LB medium containing D-xylose. We use a microplate reader to detect the fluorescence signal of the cell culture medium in the microplate to confirm the positive mutant.
  • mutant M47 selected in Example 5 we used a low mutation rate error-prone PCR (that is, ⁇ 1 mutation site per gene) to construct the xu4e mutant library.
  • the mutant library inserted into the plasmid pGS-Xu4E was transferred to the host E. coli JZ919 and spread on the solid plate of LB medium containing D-xylose. Approximately 10,000 clones were selected on the plate, and the positive mutants were picked out and inoculated into 96-well plates and cultured in LB medium containing D-xylose. We use a microplate reader to detect the fluorescence signal of the cell culture medium in the microplate to confirm the positive mutant.
  • mutant M57 selected in Example 6 we used error-prone PCR with low mutation rate (that is, ⁇ 1 mutation site per gene) to construct the xu4e mutant library.
  • the mutant library inserted into the plasmid pGS-Xu4E was transferred to the host E. coli JZ919 and spread on the solid plate of LB medium containing D-xylose. Approximately 20,000 clones were selected on the plate, and the positive mutants were picked out and inoculated into 96-well plates and cultured in LB medium containing D-xylose. We use a microplate reader to detect the fluorescence signal of the cell culture medium in the microplate to confirm the positive mutant.
  • mutant M61 selected in Example 7 we used error-prone PCR with low mutation rate (that is, ⁇ 1 mutation site per gene) to construct the xu4e mutant library.
  • the mutant library inserted into the plasmid pGS-Xu4E was transferred to the host E. coli JZ919 and spread on a solid plate of LB medium containing D-xylose. Approximately 15,000 clones were screened on the plate, and the positive mutants were picked and inoculated into 96 deep-well plates and cultured in LB medium containing D-xylose. We use a microplate reader to detect the fluorescence signal of the cell culture medium in the microplate to confirm the positive mutant.
  • mutant M64 which contains five amino acid mutations S125D/V163K/V267I/N297F/Y403W.
  • mutant M64 selected in Example 8 we used error-prone PCR with low mutation rate (that is, ⁇ 1 mutation site per gene) to construct the xu4e mutant library.
  • the mutant library inserted into the plasmid pGS-Xu4E was transferred to the host E. coli JZ919 and spread on the solid plate of LB medium containing D-xylose. Approximately 12,000 clones were selected on the plate, and the positive mutants were picked out into 96-well plates and cultured in LB medium containing D-xylose. We use a microplate reader to detect the fluorescence signal of the cell culture medium in the microplate to confirm the positive mutant.
  • mutant M72 selected in Example 9 we used error-prone PCR with low mutation rate (that is, ⁇ 1 mutation site per gene) to construct the xu4e mutant library.
  • the mutant library inserted into the plasmid pGS-Xu4E was transferred to the host E. coli JZ919 and spread on a solid plate of LB medium with D-xylose. Approximately 18,000 clones were screened on the plate, and the positive mutants were picked out into 96-well plates and cultured in LB medium containing D-xylose. We use a microplate reader to detect the fluorescence signal of the cell culture medium in the microplate to confirm the positive mutant.
  • mutant M75 containing seven amino acid mutations S125D/V163K/V267I/N297F/W306M/S402V/Y403W.
  • mutant M72 selected in Example 9 we used error-prone PCR with low mutation rate (that is, 1-2 mutation sites per gene) to construct the xu4e mutant library.
  • the mutant library inserted into the plasmid pGS-Xu4E was transferred to the host E. coli JZ919 and spread on a solid plate of LB medium with D-xylose. Approximately 25,000 clones were screened on the plate, and the positive mutants were picked out into 96 deep-well plates and cultured in LB medium containing D-xylose. We use a microplate reader to detect the fluorescence signal of the cell culture medium in the microplate to confirm the positive mutant.
  • mutant M87 containing nine amino acid mutations S125D/R131S/V163K/V267I/N297F/W306M/Q318K/S402V/Y403W.
  • the amino acid sequence of the aforementioned mutant M87 is as shown in SEQ ID NO: 119.
  • mutant M87 is about 2U/mg.
  • mutant has a significant increase in specific enzyme activity.
  • the amino acid sequence of the aforementioned wild-type TmXu4E is the sequence shown in SEQ ID NO: 2, and the nucleotide sequence encoding the aforementioned amino acid is the sequence shown in SEQ ID NO: 1.
  • mutant M87 Nine amino acid mutation sites are selected from mutant M87, which are serine at position 125, arginine at position 131, valine at position 161, valine at position 267, and day at position 297. Paraffin, tryptophan at position 306, glutamine at position 318, serine at position 402, or tyrosine at position 403.
  • the 9 mutant libraries were inserted into the plasmid pGS-Xu4E of the host E. coli JZ919, and then they were spread on each LB medium containing D-xylose.
  • the positive mutants were screened by the "High-throughput Screening of Xu4E Mutant Library" method described in the present disclosure. Compared with the wild-type enzyme, the positive mutant was verified in a 96-well microtiter plate and sequenced by DNA sequencing.
  • mutants obtained from the screening From the mutants obtained from the screening, we selected the following 9 mutants: S125D, R131S, V163K, V267I, N297F, W306M, Q318K, S402V and Y403W. Among them, the amino acid numbers in the aforementioned mutants are based on SEQ ID NO : 2 for numbering.
  • the above 9 mutants were overexpressed in E. coli BL21 (DE3) and purified by affinity adsorption on a nickel ion-containing resin.
  • the specific activities of the Xu4E enzymes of mutants S125D, V163K, V267I, N297F, W306M, Q318K, S402V and Y403W are shown in Figure 6. Among them, the specific activity of eight mutants was higher than that of wild-type TmXu4E, and the specific activity of one mutant R131S was slightly lower than that of wild-type TmXu4E.
  • M68 (six mutations): V163K/V267I/N297F/W306M/S402V/Y403W;
  • M78 (seven mutations): R131S/V163K/V267I/N297F/W306M/S402V/Y403W.
  • the enzyme activities of M41, M46, M50, M58, M68, and M78 are approximately 0.30U/mg, 0.21U/mg, 0.18U/mg, 0.33U/mg, 0.41U/mg, 0.57U/ mg.
  • the Xu4E mutants we obtained are as follows (the aforementioned Xu4E mutants are all numbered according to SEQ ID NO: 2):
  • the enzyme activities of M13, M19, M1, M30, and M31 were approximately 0.008U/mg, 0.013U/mg, 0.03U/mg, 0.06U/mg, 0.04U/mg, respectively.
  • D-xylose isomerase derived from T. thermophiles, Xu4E derived from T. maritima, and L derived from G. stearothermophilus -Arabinose isomerase is heterologously expressed in E. coli BL21 (DE3). After the cells were collected and disrupted, the supernatant of the cell lysate was subjected to heat treatment (70° C., 20 min). After centrifugation, the supernatant containing three thermostable enzymes was mixed and used for the bioconversion of D-xylose to L-arabinose. The expression level of the target protein and the purity of the purified protein were detected by SDS-PGAE ( Figure 7).
  • D-xylose isomerase D-XI
  • Xu4E mutant M87 a substrate of 50mM D-xylose.
  • the reaction mixture includes 100 mM HEPES buffer (pH 8.0), 0.2 mM Co 2+ , 1 mM Mn 2+ , 1 g/L Xu4E M8 and 1 U/mL XI.
  • the reaction solution was mixed gently and reacted at 50°C under anaerobic conditions.
  • D-xylose isomerase D-XI
  • Xu4E mutant M87 D-xylose isomerase
  • L-AI L-arabinose synthase
  • the reaction mixture is 100mM Tris buffer (pH 8.0) including 0.2mM Co 2+ , 1mM Mn 2+ , 1g/L Xu4E (where Xu4E is selected from wild-type or M87 mutant), 1U/mL D-XI and 1U/ mL L-AI.
  • the three-enzyme system containing the Xu4E mutant M87 produced 21 mM L-arabinose after 8 hours of reaction, while the three-enzyme system containing the wild type only produced 1.25 mM L-arabinose after 24 hours of reaction ( Figure 8).
  • D-xylose isomerase D-XI
  • Xu4E mutant M87 D-xylose isomerase
  • L-AI L-arabinose synthase
  • the three-enzyme system containing M87 produced 175mM L-arabinose after 4 hours of reaction. Its specific volume production rate reaches 6.56g L-arabinose/liter/hour. After 8 hours of reaction, when the reaction reaches equilibrium, the substrate D-xylose is 207mM, the intermediate products D-xylulose and L-ribulose are 55 and 42mM, respectively, and the product L-arabinose is 196mM ( Figure 9) .
  • D-xylose isomerase D-XI
  • Xu4E mutant M87 mannose phosphate isomerase
  • MPI mannose phosphate isomerase
  • the mpi gene derived from G. thermodenitrificans was cloned and inserted into the pET plasmid (Kim et al. 2014).
  • the reaction mixture is a 50 mM Tris buffer (pH 8.0) including 0.2 mM Co 2+ , 1 mM Mn 2+ , 1 g/L Xu4E (M87 mutant), 1 U/mL XI, and 1 U/mL MPI.
  • Co-immobilization of multiple enzymes not limited to purified recombinase or crude cell lysate containing overexpressed recombinase, prolongs the service life of the enzyme, facilitates the separation of enzymes and products/intermediates, and reduces the use of enzyme cost.
  • immobilizing the whole microbial cell containing the enzyme to improve the stability of the biocatalyst and the reusability of the biocatalyst is an alternative solution.
  • permeabilizing the whole microbial cells is also an optional solution.
  • a simulated moving bed (SMB)
  • the relevant packing resin is not limited to Shodex Sugar KS-801 sodium ion exchange column, Waters Sugar Pak I calcium ion exchange column, Bio-Rad Aminex HPX-87P lead ion exchange Column, Bio-Rad Aminex HPX-87H hydrogen ion exchange column, or chromatographic separation column with similar functions.
  • a simulated moving bed can be used to separate L-arabinose from unused D-xylose and the intermediate products L-ribulose and D-xylulose ( Figure 10).
  • the wild-type Xu4E and Xu4E mutant M87 were used to catalyze the reaction of the substrate D-xylulose.
  • the wild-type Xu4E was performed at 50°C for 24 hours, and the Xu4E mutant M87 was performed at 50°C for 10 minutes.
  • the substrate D-xylulose and the product L-ribulose are separated by HPLC equipped with a Waters Sugar Pak column.
  • the HPLC chromatogram is shown in Figure 11.
  • the retention time of the product (L-ribulose) is a new peak (black solid line) at 15.49 minutes, which is the same as the retention time of the L-ribulose standard.
  • the product peaks separated by HPLC were characterized by the primary mass spectrum (Figure 12) and the secondary mass spectrum ( Figure 13).
  • Example 3 Based on bioinformatics methods, we analyzed and compared the sequences of different wild-type Xu4E obtained in Example 3, Example 14 and Example 15. Among them, * shows the conservative amino acid positions of wild-type Xu4E from different sources.
  • the experimental results show that the activity of the protein after the sequence truncation still maintains the activity of Xu4E, which is 90% (removing amino acids 1-86), 87% (removing amino acids 196-236) or 85% of the enzyme activity of wild-type Xu4E respectively. Remove the 1-86 amino acids and 196-236 amino acids at the same time).
  • Example 25 Statistics of enzymes with Xu4E activity in the present disclosure
  • thermostable L-rhamnose isomerase from Thermotoga maritima ATCC 43589 and its application in the production of L-lyxose and L-mannose. Biotechnol. Lett. 32:1947-1953.

Abstract

本公开涉及D-木酮糖4-差向异构酶、其突变体及其用途。具体来说,本公开涉及一种具有D-木酮糖4-差向异构酶活性的多肽,前述多肽的制备方法,以及前述多肽在以D-木糖或D-木酮糖为原料,制备L-戊糖中的用途。本公开发现的一种新的制备L-戊糖的方法,其相较于现有技术中传统的生产方法,生产工艺更加简单,并且使得生产L-戊糖的成本下降。

Description

D-木酮糖4-差向异构酶、其突变体及其用途 技术领域
本公开属于生物催化和合成生物学领域。本公开涉及能够可逆催化D-木酮糖和L-核酮糖之间相互转化的新型多肽(酶)以及它们的突变体,命名D-木酮糖4-差向异构酶(Xu4E);以及通过构建多条人工酶途径,利用该酶,从自然界最丰富的戊糖D-木糖生产L-戊糖(即L-阿拉伯糖、L-核糖、L-核酮糖、L-木酮糖、L-木糖和L-来苏糖)的新方法。
背景技术
戊糖(或戊单糖),化学式为C 5H 10O 5,是五个碳原子的单糖。它们可分为两大类:戊醛糖和戊酮糖;一共有八种戊醛糖和四种戊酮糖,每一个戊酮糖对应两种戊醛糖。十二种戊糖也可分为L-型糖和D-型糖,并且每一型的糖具有四种戊醛糖和两种戊酮糖。D-木糖、D-核糖和L-阿拉伯糖是天然糖,但是其它戊糖是自然界中极少量存在的稀少糖。D-木糖为自然界中最丰富的戊糖。L-戊糖由于其在医疗和健康的应用,特别是作为许多重要医药前体的巨大潜能,引起很多的关注。
D-木糖为自然界中最丰富的戊糖,可以自木质纤维素中分离,被称为木糖(wood sugar)。D-木糖是半纤维素木聚糖的主要组成部分。现在D-木糖主要从玉米芯和甜菜浆的酸性或碱性水解物中分离,大部分木糖用来转化为零卡路里的甜味剂木糖醇。
L-阿拉伯糖为一种美国食品和药物管理局(FDA)批准的具有蔗糖50%甜度的、零卡路里天然甜味剂(Antila等1997,Boku等2001)。更重要的是,添加3-4%的L-阿拉伯糖到蔗糖,可以抑制蔗糖酶的活性,防止蔗糖水解,阻止蔗糖吸收,所以L-阿拉伯糖被认为是一种蔗糖中和剂,根据卡路里摄入(Morimoto等2001)。同时,在肠道中未利用的糖是一种促进有益细菌生长的益生元,从而抑制大肠中有害微生物的生长。L-阿拉伯糖也是一种用作合成药物的起始原料和一种广泛应用于分子生物学实验和工业发酵的生化产品。
L-阿拉伯糖以阿拉伯聚糖、阿拉伯木聚糖、阿拉伯半乳聚糖等形式存在于高等植物的半纤维素中。在日本,L-阿拉伯糖是利用碱法提取玉米纤维(Boku等2001)、阿拉伯树胶、甜菜浆(Antila等1997)等中的半纤维素,然后进行酸水解制得的。在中国,L-阿拉伯糖是对玉米芯进行酸水解生产D-木糖的副产物。L-阿拉伯糖的价格高、供应量有限大大地限制了其广泛的应用。
L-核糖在自然界中不广泛存在。它是许多新型核苷酸类似物的前体,用来生产抗病毒药物,如抗人类免疫缺陷病毒、肝炎病毒和巨细胞病毒的药物(Kim等2014)。L-核糖也可作为葡萄糖脱氢酶的竞争性抑制剂(Beerens等2012)。以前,L-核糖的生产方法是以通过核糖醇为中间产物的两步法微生物转化。最近,L-核糖的生物合成通过两步酶催化反应:通过L-阿拉伯糖异构酶(L-AI)和L-核糖异构酶(L-RI,EC 5.3.1.B3)或甘露糖6-磷酸异构酶(MPI,EC 5.3.1.8)将L-核酮糖转化为L-核糖(Kim等2014)。
L-核酮糖是合成L-核糖和L-阿拉伯糖的起始原料。它的5’-磷酸产物L-核酮糖5-磷酸是戊糖磷酸途径的重要代谢产物。
L-木酮糖可以作为α-葡萄糖苷酶的抑制剂,可以用来降低血糖。L-木酮糖也可用于生产其他重要的稀少糖,如用于生产抗病毒药物的L-核糖和作为肝炎或肝硬化指标的L-木糖。
L-来苏糖是用于动物的抗生素阿维拉霉素A的成分,是一种潜在的L-岩藻糖苷酶抑制剂。
L-木糖是合成抗乙肝病毒(HBV)核苷和合成L-核糖呋喃糖以及衍生物的起始原料。
表1不同种类的L-戊糖的应用
Figure PCTCN2020133395-appb-000001
差向异构酶是一类催化含多个不对称中心的底物中某一个不对称碳原子发生构型变化的异构酶。人们寻找戊单糖4-差向异构酶已有很长时间,但是从来没有被报道过(Beerens等2017)。自然界中的4-差向异构酶,如L-核酮糖5-磷酸4-差向异构酶(EC 5.1.3.4)和UDP-D-木糖4-差向异构酶(EC 5.1.3.5),需要其戊糖底物有磷酸或尿苷二磷酸(UDP)基团。
Ken Izumori教授提出了一个用于稀少糖生物合成的完整策略(如本公开的图1所示)。从D-木糖开始,通过使用L-核酮糖3-差向异构酶(或戊酮糖3-差向异构酶)、醛糖异构酶、醛糖还原酶和多元醇脱氢酶可制得其他11种戊糖。现有技术以D-木糖为原料,生产六种L-戊糖时,必须经过生产木糖醇或核糖醇的步骤,因此,需要使用两种基于辅酶NAD(P)的氧化还原酶——醛糖还原酶和多元醇脱氢酶。因为要使用昂贵且不稳定的辅酶(NAD(P)),对可逆的平衡反应中的产物以及有关中间体进行复杂分离,导致L-戊糖生产成本很高。
发明内容
发明要解决的问题
由于现有L-戊糖生产技术存在成本高、生产工艺复杂的缺陷,需要提供一种新的生产L-戊糖的方法。
在一个实施方式中,本公开提供一种野生型多肽(酶)及其突变体,其具有催化D-木酮糖和L-核酮糖相互转化的化学反应能力(图2)。本公开中的野生型D-木酮糖4-差向异构酶(D-xylulose 4-epimerase,Xu4E)及其突变体可以利用自然界中最丰富的戊糖,即D-木糖作为原料,生产L-戊糖。在一个实施方式中,示例性的L-戊糖选自六种L-戊单糖,即L-阿拉伯糖、L-核糖、L-核酮糖、L-木酮糖、L-木糖和L-来苏糖。
在另一个实施方式中,本公开提供一种制备前述Xu4E突变体的方法,其通过分子生物学和基因工程方法,制备Xu4E突变体。
在另一个实施方式中,本公开提供前述野生型Xu4E及其突变体的用途,其可以用于生产L-戊糖。
在另一个实施方式中,本公开提供一种新的用于生产L-戊糖的方法,所述方法包括以D-木糖或D-木酮糖为原料,生产L-戊糖的方法。
在另一个实施方式中,本公开提供一种新的用于生产L-戊糖的方法,所述方法包括利用Xu4E或其突变体,将D-木酮糖转化为L-核酮糖的步骤。
在一个具体的实施方式中,本公开提供了一种对于前述生产L-戊糖的方法进行进一步优化的方法。
用于解决问题的方案
本公开用于解决前述技术问题的技术方案如下。
(1)所述多肽具有D-木酮糖4-差向异构酶活性,其中,所述多肽选自由(a)-(d)组成的组中的任一项:
(a)由与SEQ ID NO:2-32任一项所示的序列具有至少60%、至少70%、至少80%或至少90%的序列同一性的序列编码的多肽;
(b)由多核苷酸编码的多肽,所述多核苷酸在非常高严格条件下与(i)或(ii)所示的多核苷酸杂交:
(i)编码如SEQ ID NO:2-32任一项所示的氨基酸序列的多核苷酸;
(ii)(i)的全长互补多核苷酸;
(c)多肽,所述多肽为如SEQ ID NO:2-32任一项所示的多肽的突变体,所述突变体在一个或多个位置处包含取代、重复、缺失或添加一个或多个氨基酸,并且所述多肽仍然具有D-木酮糖4-差向异构酶活性;以及
(d)由(a)、(b)、(c)所示的多肽的片段,并且所述片段具有D-木酮糖4-差向异构酶活性。
(2)根据(1)所述的多肽,其中,所述多肽为突变体,并且所述多肽与SEQ ID NO:2-32任一项所示的多肽相比,具有至少95%、至少96%、至少97%、至少98%或至少99%的序列同一性。
(3)根据(1)-(2)任一项所述的多肽,其中,所述多肽为如SEQ ID NO:2-32任一项所示的多肽的突变体,所述突变体在至少1个、至少2个、至少3个、至少4个、至少5个、至少6个、至少7个、至少8个、至少9个位点处包含氨基酸的突变,并且所述多肽仍然具有D-木酮糖4-差向异构酶活性。
(4)根据(1)-(3)任一项所述的多肽,所述多肽为如下所示的多肽:
(a)与SEQ ID NO:2所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点处包含突变,其中,所述位点为选自由102、125、131、161、163、266、267、297、306、318、337、394、402和403组成的组中的一种或多种;
(b)与SEQ ID NO:3所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由102、125、131、161、163、266、267、297、306、318、337、394、402和403组成的组中的一种或多种;
(c)与SEQ ID NO:4所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由102、125、131、161、163、266、267、297、306、318、337、394、402和403组成的组中的一种或多种;
(d)与SEQ ID NO:5所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由105、128、134、164、166、270、271、301、310、322、341、398、406、407组成的组中的一种或多种;
(e)与SEQ ID NO:6所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨 基酸在对应于以下位点包含突变,其中,所述位点为选自由105、128、134、164、166、269、270、300、309、321、340、397、405、406组成的组中的一种或多种;
(f)与SEQ ID NO:7所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由117、140、146、176、178、285、286、316、325、337、355、412、420、421组成的组中的一种或多种;
(g)与SEQ ID NO:8所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由125、148、154、184、186、293、294、324、333、345、363、420、428、429组成的组中的一种或多种;
(h)与SEQ ID NO:9所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由124、147、153、183、185、297、298、328、337、349、368、425、433、434组成的组中的一种或多种;
(i)与SEQ ID NO:10所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由108、131、137、167、169、276、277、307、316、328、346、403、411、412组成的组中的一种或多种;
(j)与SEQ ID NO:11所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由115、138、144、174、176、280、281、311、320、332、351、408、416、417组成的组中的一种或多种;
(k)与SEQ ID NO:12所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由107、130、136、166、168、272、273、303、312、324、343、400、408、409组成的组中的一种或多种;
(l)与SEQ ID NO:13所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由109、132、138、168、170、275、276、306、315、327、346、403、411、412组成的组中的一种或多种;
(m)与SEQ ID NO:14所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由103、126、132、162、164、267、268、298、307、319、338、395、403、404组成的组中的一种或多种;
(n)与SEQ ID NO:15所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由105、128、134、164、166、271、272、302、311、323、342、399、407、408组成的组中的一种或多种;
(o)与SEQ ID NO:16所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由64、88、94、123、125、236、237、267、274、286、373、381、382组成的组中的一种或多种;
(p)与SEQ ID NO:17所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由110、133、139、169、171、271、272、302、311、323、342、399、407、408组成的组中的一种或多种;
(q)与SEQ ID NO:18所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由102、125、131、161、163、266、267、297、306、318、337、394、402、403组成的组中的一种或多种;
(r)与SEQ ID NO:19所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由121、144、150、180、182、289、290、320、329、341、359、416、424、425组成的组中的一种或多种;
(s)与SEQ ID NO:20所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由107、130、136、166、168、273、274、304、313、325、344、401、409、410组成的组中的一种或多种;
(t)与SEQ ID NO:21所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨 基酸在对应于以下位点包含突变,其中,所述位点为选自由21、48、54、84、86、182、183、213、222、234、260、324、332、333组成的组中的一种或多种;
(u)与SEQ ID NO:22所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由30、55、61、91、93、202、203、233、242、254、273、330、338、339组成的组中的一种或多种。
(5)根据(1)-(4)任一项所述的多肽,其中,所述多肽为在对应于SEQ ID NO:2上的,存在如下所示的突变的多肽:
(a)在102位上的突变;
(b)在125位上的突变;
(c)在131位上的突变;
(d)在161位上的突变;
(e)在163位上的突变;
(f)在266位上的突变;
(g)在267位上的突变;
(h)在297位上的突变;
(i)在306位上的突变;
(j)在318位上的突变;
(k)在337位上的突变;
(l)在394位上的突变;
(m)在402位上的突变;
(n)在403位上的突变
(o)在267和297位上的组合突变;
(p)在306和403位上的组合突变;
(q)在125和297位上的组合突变;
(r)在163、267、403位上的组合突变;
(s)在125、267和297位上的组合突变;
(t)在163、267、297和403位上的组合突变;
(u)在125、163、267和297位上的组合突变;
(v)在125、163、267、297和403位上的组合突变;
(w)在125、163、267、297、402和403位上的组合突变;
(x)在163、267、297、306、402和403位上的组合突变;
(y)在125、163、267、297、306、402和403位上的组合突变;
(z)在125、131、163、267、297、306、402和403位上的组合突变;
(aa)
(bb)在125、163、267、297、306、318、402和403位上的组合突变;
(cc)在125、131、163、267、297、306、318、402和403位上的组合突变。
(6)根据(5)所述的多肽,其中,所述多肽为在对应于SEQ ID NO:2上的以下任一种位点的突变的多肽:
(a)对应于SEQ ID NO:2上的位点102的氨基酸由甘氨酸(G)突变为亮氨酸(L);
(b)对应于SEQ ID NO:2上的位点125的氨基酸由丝氨酸(S)突变为天门冬氨酸(D)、半胱氨酸(C)、酪氨酸(Y)、谷氨酰胺(Q)、谷氨酸(E)、苏氨酸(T)或天冬酰胺(N);
(c)对应于SEQ ID NO:2上的位点131的氨基酸由精氨酸(R)突变为天门冬氨酸(D)、苏氨酸(T)、谷氨酸(E)或丝氨酸(S);
(d)对应于SEQ ID NO:2上的位点161的氨基酸由天门冬氨酸(D)突变为丙氨酸(A);
(e)对应于SEQ ID NO:2上的位点163的氨基酸由缬氨酸(V)突变为赖氨酸(K)、精氨酸(R)、丝氨酸(S)、异亮氨酸(I)或蛋氨酸(M);
(f)对应于SEQ ID NO:2上的位点266的氨基酸由谷氨酸(E)突变为丙氨酸(A);
(g)对应于SEQ ID NO:2上的位点267的氨基酸由缬氨酸(V)突变为亮氨酸(L)、蛋氨酸(M)或异亮氨酸(I);
(h)对应于SEQ ID NO:2上的位点297的氨基酸由天冬酰胺(N)突变为苯丙氨酸(F)、酪氨酸(Y)或赖氨酸(K);
(i)对应于SEQ ID NO:2上的位点306的氨基酸由色氨酸(W)突变为蛋氨酸(M)、丝氨酸(S)或苏氨酸(T);
(j)对应于SEQ ID NO:2上的位点318的氨基酸由谷氨酰胺(Q)突变为赖氨酸(K);
(k)对应于SEQ ID NO:2上的位点337的氨基酸由赖氨酸(K)突变为天门冬氨酸(D);
(l)对应于SEQ ID NO:2上的位点394的氨基酸由天门冬氨酸(D)突变为蛋氨酸(M);
(m)对应于SEQ ID NO:2上的位点402的氨基酸由丝氨酸(S)突变为缬氨酸(V)、亮氨酸(L)、苯丙氨酸(F)、半胱氨酸(C)或酪氨酸(Y);
(n)对应于SEQ ID NO:2上的位点403的氨基酸由酪氨酸(Y)突变为色氨酸(W)、苏氨酸(T)、异亮氨酸(I)或苯丙氨酸(F)。
(7)根据权利要求1-6任一项所述的多肽,其中,所述多肽来源于海栖热袍菌(Thermotoga maritima),新阿波罗栖热袍菌(Thermotoga neapolitana),栖热袍菌(Thermotoga sp),Thermotoga caldifontis,产假假单胞菌(Pseudothermotoga lettingae),牙合嗜盐菌(Halanaerobium congolense),Thermosediminibacter litoriperuensis,Rhodothermus marinus,Gracilibacillus timonensis,栖热袍菌(Thermotogae bacterium),Thermotogae bacterium,醋酸假丝酵母(Candidatus Acetothermia bacterium),Pseudothermotoga thermarum,解热嗜热厌氧杆菌(Thermoanaerobacterium thermosaccharolyticum),Thermofilum adornatus,Thermoanaerobacter italicus,Thermotoga naphthophila,Thermoclostridium stercorarium,Dictyoglomus thermophilum,Spirochaeta thermophila,Singulisphaera acidiphila,Thermotoga caldifontis,Pseudothermotoga lettingae,枯草杆菌(Bacillus subtilis),Geobacillus zalihae,Geobacillus stearothermophilus,Parageobacillus thermoglucosidasius,Thermoanaero-bacterium thermosaccharolyticum或大肠杆菌(Escherichia coli)。
(8)根据(1)所述的多肽,其中,所述多肽包括在如SEQ ID NO:2-32所示的多肽的N端或中间部位或C端缺失一个或超过一个氨基酸残基。
(9)根据(8)所述的多肽,其中,所述多肽选自下组:
(i)对应于如SEQ ID NO:2所示的多肽的N端起,缺失1-100个氨基酸,优选1-90个氨酸酸,更优选1-86个,更优选1-50个,更优选1-30个,最优选1-10个氨基酸而形成,并且具有催化D-木酮糖转化为L-核酮糖活性;或
(ii)对应于SEQ ID NO:2所示的多肽的第196-236位氨基酸中,缺失1-41个氨基酸,优选1-30个,更优选1-20个,最优选1-10个氨基酸而形成的,且具有催化D-木酮糖转化为L-核酮糖活性。
(10)根据(8)-(9)任一项所述的多肽,其中,所述多肽选自下组:
(i)对应于如SEQ ID NO:2所示的多肽中,缺失第1-86位氨基酸,并且具有催化D-木酮糖转化为L-核酮糖活性;
(ii)对应于SEQ ID NO:2所示的多肽中,缺失第196-236位氨基酸,并且具有催化D-木酮糖转化为L-核酮糖活性;或
(iii)对应于SEQ ID NO:2所示的多肽中,缺失第1-86位氨基酸和第196-236位氨基酸,并且具有催化D-木酮糖转化为L-核酮糖活性。
(11)根据(1)-(10)任一项所述的多肽,其中,所述多肽包含与SEQ ID NO:33-122所示的序列具有至少96%的序列同一性的氨基酸序列或由其组成;可选的,所述多肽包含与如SEQ ID NO:33-122所编码的多肽中的任一项具有至少98.3%、至少98.5%、至少98.7%、至少98.9%、至少99.1%、至少99.3%、至少99.5%、至少99.7%或100%的序列同一性的氨基酸序列或由其组成。
(12)根据(1)-(11)任一项所述的多肽,其中,所述多肽和如SEQ ID NO:2所示的多肽相比,具有改进的D-木酮糖4-差向异构酶活性。
(13)根据(1)-(12)任一项所述的多肽,其中,所述多肽由包含如SEQ ID NO:33-122所示的序列编码,或者所述多肽为由如SEQ ID NO:33-122所示的序列编码的多肽。
(14)一种分离的多核苷酸,其中,所述多核苷酸包含编码根据(1)-(13)任一项所述多肽的核苷酸序列。
(15)根据(14)所述的分离的多核苷酸,其在编码如SEQ ID NO:2-32任一项所示的氨基酸的核苷酸中包含至少一个突变;优选的,所述多核苷酸序列编码如SEQ ID NO:33-122任一项所示的氨基酸。
(16)一种核酸构建体,其包含根据(14)或(15)所述的多核苷酸,所述多核苷酸与一个或多个调控序列可操作地连接,所述调控序列指导多肽在表达宿主中产生。
(17)一种重组表达载体,其包含根据(16)所述的核酸构建体。
(18)一种重组宿主细胞,其包含根据(16)所述的核酸构建体或根据(17)所述的重组表达载体。
(19)一种用于产生根据(1)-(13)中任一项所述的多肽的方法,其中,所述方法包括如下步骤:(a)在有助于所述多肽产生的条件下培养细胞或菌株,所述细胞或菌株产生所述多肽;其中,所述细胞或菌株包含根据(16)所述的核酸构建体或(17)所述的重组表达载体,所述核酸构建体或所述重组表达载体包含编码所述多肽的核苷酸序列;
可选的,所述方法还包括如下步骤:
(b)纯化或回收所述多肽。
(20)一种D-木酮糖转化为L-核酮糖的方法,其中,所述方法包括利用根据(1)-(13)任一项所述多肽进行催化反应。
(21)根据(20)所述的方法,其中,所述多肽由包含如SEQ ID NO:2-122任一项所示的氨基酸序列编码,或者所述多肽为由如SEQ ID NO:2-122任一项所示的序列编码的多肽。
(22)一种制备L-戊糖的方法,其中,所述制备L-戊糖的方法包括如下步骤:
(a)采用D-木酮糖4-差向异构酶,将D-木酮糖转化为L-核酮糖;
可选的,所述方法还包括:
(b)采用D-木糖异构酶,将D-木糖转化为D-木酮糖。
(23)根据(22)所述的方法,其中,所述方法进一步包括,采用L-阿拉伯糖异构酶,将L-核酮糖转化为L-阿拉伯糖,所述L-戊糖为L-阿拉伯糖。
(24)根据(22)所述的方法,其中,所述方法进一步包括,采用L-核糖异构酶或甘露糖6-磷酸异构酶或它们的组合,将L-核酮糖转化为L-核糖,所述L-戊糖为L-核糖。
(25)根据(22)所述的方法,其中,所述方法进一步包括,采用L-核酮糖3-差向异构酶,将L-核酮糖转化为L-木酮糖,所述L-戊糖为L-木酮糖。
(26)根据(22)所述的方法,其中,所述方法进一步包括,采用L-核酮糖3-差向异构酶,将L-核酮糖转化为L-木酮糖,采用L-岩藻糖异构酶或D-阿拉伯糖异构酶或L-鼠李糖异构酶,将L-木酮糖或L-木酮糖和L-核酮糖的组合转化为L-木糖,所述L-戊糖为L-木糖。
(27)根据(22)所述的方法,其中,所述方法进一步包括,采用L-核酮糖3-差向异构酶,将L-核酮糖转化为L-木酮糖,采用L-鼠李糖异构酶将L-木酮糖转化为L-木糖,所述L-戊糖为L-来苏糖。
(28)根据(22)-(27)任一项所述的方法,其中,所述D-木酮糖4-差向异构酶选自(1)-(13)任一项所述的多肽;优选的,所述D-木酮糖4-差向异构酶由包含如SEQ ID NO:2-122任一项所示的氨基酸序列编码,或者所述酶为由如SEQ ID NO:2-122任一项所示的序列编码的酶。
(29)根据(22)-(28)任一项所述的方法,其中,所述方法中还包含纯化和/或分离所述L-戊糖的步骤。
(30)根据(29)所述的方法,其中,所述分离步骤包含使用模拟移动床(SMB)分离的步骤。
(31)根据(22)-(30)任一项所述的方法,其中,所述方法的反应体系中还包含酶反应液;优选的,所述酶反应液中含有金属离子;更优选的,所述金属离子可以为Co 2+、Zn 2+、Mg 2+、Mn 2+、Ca 2+、Ni 2+、Cu 2+、Fe 2+、Sr 2+、Ba 2+、Cd 2+、Pb 2+、Fe 3+、Al 3+、Bi 3+、Ag +、Li +中的一种或多种。
(32)根据(22)-(31)任一项所述的方法,其中,所述反应在有氧、微有氧或无氧条件下进行。
(33)根据(22)-(32)任一项所述的方法,其中,所述反应在30℃-90℃温度下进行;优选的,所述反应在40℃-80℃温度下进行。
(34)根据(22)-(33)任一项所述的方法,其中,所述反应在pH为3.0至11.0的范围内进行;优选的,所述反应在pH为4.0至10.0的范围内进行。
(35)根据(22)-(34)任一项所述的方法,其中,所述反应在无氧、45-55℃温度、pH为8.0、金属离子为Co 2+或Mg 2+或Mn 2+或它们的组合的的条件下进行。
(36)根据(22)-(35)任一项所述的方法,其中,所述反应包括体外催化反应或全细胞生物催化反应。
(37)根据(36)所述的方法,其中,所述反应为体外催化反应,所述体外催化反应可以分步进行或同时进行。
(38)根据(37)所述的方法,其中,所述体外催化反应分步进行时,其在一个反应容器中或一个以上串联的反应容器中进行。
(39)根据(38)所述的方法,其中,所述反应容器选自分批进料生物反应器、含固定化酶的填充床生物反应器、酶或细胞再循环生物反应器、含膜分离的生物反应器和连续进料生物反应器中的一种或多种。
(40)根据(22)-(39)任一项所述的方法,其中,所述体外催化反应中的酶以游离酶、含有所述酶的细胞裂解液、含有所述酶的全细胞、固定化酶中的一种或多种形式存在。
(41)根据(36)所述的方法,其中,所述全细胞生物催化反应的反应方式为利用细胞工厂进行反应,所述细胞中包含根据(16)所述的核酸构建体或根据(17)所述的重组表达载体。
(42)多肽在制备L-戊糖中的应用,所述多肽选自根据(1)-(13)中任一项所述的多肽。
(43)根据(42)所述的应用,其中,所述L-戊糖选自L-阿拉伯糖、L-核糖、L-核酮糖、L-木酮糖、L-木糖和L-来苏糖中的一种或多种。
(44)多肽作为具有D-木酮糖4-差向异构酶活性的酶的用途,所述多肽选自根据(1)-(13)中任一项所述的多肽。
发明的效果
在一个实施方式中,本公开发现了具有催化D-木酮糖和L-核酮糖相互转化的化学反应能力的野生型D-木酮糖4-差向异构酶(Xu4E)及其突变体。
在一个具体的实施方式中,本公开提供的Xu4E突变体相较于野生型Xu4E,具有改进的性质,例如改进的物理和/或化学性质。示例性的,在一个具体的实施方式中,Xu4E突变体相较于野生型Xu4E,其比酶活增加;在另一个具体的实施方式中,Xu4E突变体相较于野生型Xu4E,其反应速率增加;在另一个具体的实施方式中,Xu4E突变体相较于野生型Xu4E,其K m降低。
在另一个实施方式中,本公开发现了一种新的制备L-戊糖的方法,其相较于现有技术中传统的生产方法,生产工艺更加简单,并且使得生产L-戊糖的成本下降。
附图说明
图1示出了现有技术中将所有戊糖之间相互转化的Izumoring图。
图2示出了由D-木酮糖4-差向异构酶(Xu4E)催化的D-木酮糖和L-核酮糖之间相互转化的图。
图3示出了基于Xu4E将D-木糖转化为六种L-戊糖的人工多酶途径。其中,D-XI:D-木糖异构酶(EC 5.3.1.5);L-AI:L-阿拉伯糖异构酶(EC 5.3.1.4);L-RI:L-核糖异构酶(EC 5.3.1.B3);MPI:甘露糖磷酸异构酶(EC 5.3.1.8);D-LI:D-核糖异构酶(EC 5.3.1.15);Ru3E:L-核酮糖3-差向异构酶(EC 5.1.3.31);L-Ful:L-岩藻糖异构酶(EC 5.3.1.25);D-AI:D-阿拉伯糖异构酶(EC 5.3.1.3)和L-RaI:L-鼠李糖异构酶(EC 5.3.1.14)。
图4示出了四种稀少糖的HPLC色谱分离的结果。其中,(a)示出了利用Bio-Rad Aminex HPX-87H氢离子交换柱,其分离条件为:柱温60℃,流动相为5mM硫酸,流速为0.6mL/min。(b)示出了利用Bio-Rad Aminex HPX-87P铅离子交换柱,其分离条件为:柱温60℃,流动相为去离子水,流速为0.6mL/min。(c)示出了利用Waters Sugar Pak I钙离子交换柱,其分离条件为:柱温80℃,流动相为去离子水,流速为0.5mL/min。(d)示出了利用Shodex Sugar KS-801钠离子交换柱,其分离条件为:柱温70℃,流动相为去离子水,流速为0.5mL/min。
图5示出了在不同反应条件下野生型Xu4E与通过定向进化获得的8个代表性的Xu4E突变体的比酶活的比较。
图6示出了基于野生型TmXu4E的单个氨基酸残基变化对其比活性的影响。
图7示出了通过热处理纯化的在大肠杆菌E.coli BL21(DE3)中表达的三种热稳酶(即D-XI、Xu4E和L-AI)的SDS-PAGE分析。T:总细胞裂解液,S:上清,H:热处理细胞裂解液。
图8示出了由50mM D-木糖一锅法生产L-阿拉伯糖。L-阿拉伯糖在包含1U/mL XI和1U/mL AI、0.2mM Co 2+、1mM Mn 2+、1mg/mL Xu4E(野生型或突变体)的100mM HEPES缓冲液(pH 8.0)中生产。反应在50℃无氧条件下进行。
图9示出了从500mM D-木糖一锅法生产L-阿拉伯糖。反应在包含10U/mL XI和10U/mL AI、0.2mM Co 2+、1mM Mn 2+、10mg/mL Xu4E M87的100mM HEPES缓冲液(pH 8.0)中进行。反应在50℃无氧条件下进行。
图10示出了通过生物转化和模拟移动床(SMB)分离(a)生产L-阿拉伯糖与工业化高果糖玉米糖浆(HFCS)生产和分离(b)的比较图。
图11示出了由Xu4E催化的底物(D-木酮糖)生产产物(L-核酮糖)的HPLC分离图。其中,HPLC分离的峰由一级质谱(图12)和二级质谱(图13)验证。
图12示出了HPLC分离的底物(D-木酮糖)和产物(L-核酮糖)峰的一级质谱图。
图13示出了HPLC分离的底物(D-木酮糖)和产物(L-核酮糖)峰的二级质谱图。
图14A和图14B示出了来自不同物种的具有Xu4E活性的多肽的氨基酸序列分析比较结果。
具体实施方式
定义
在本公开的权利要求和/或说明书中,除非上下文中另有说明,否则例如“一个/种(a,an)”、“所述(said)”或“所述(the)”等指示对象旨在支持单数和/或复数情况二者。
如在权利要求和说明书中所使用的,词语“包含”、“具有”、“包括”、“拥有”、或“含有”是指包括在内的或开放式的,并不排除额外的、未引述的元件或方法步骤。
如本公开所使用的,术语“约”表示:一个数值包括测定该数值所使用的装置或方法的误差的标准偏差。示例性的,前述标准偏差一般为原始数值相差20-30%的范围之内。
虽然所公开的内容支持术语“或”的定义仅为替代物以及“和/或”,但除非明确表示仅为替代物或替代物之间相互排斥外,权利要求中的术语“或”是指“和/或”。
如本公开所使用的,尽管可以使用其他有机或无机催化剂,但术语“转化(converting)”是指主要由一种或多种的多肽(酶)催化从一个分子到另一个分子的化学转化;其得率是指产物的摩尔量与底物的摩尔量之间的比率(以%为单位)。
如本公开所使用的,术语“单糖”指任何一类糖(如D-葡萄糖、戊单糖、D-木糖、L-阿拉伯糖),其不能水解得到更简单的糖,并且没有化学基团修饰,如磷酸基团或UDP基团。
如本公开所使用的,术语“戊糖”或“戊单糖”指分子含有五个碳原子的任何一类单糖,如D-木糖和L-阿拉伯糖。
如本公开所使用的,“单糖”可按“D-”、或“L-”标记。前述两个系列的划分是以甘油醛的结构为比较标准,并根据费歇尔投影式中最下面一个不对称碳原子的构型决定。规定将右旋的甘油醛定为费歇尔投影式中羟基在右的异构体,称为D-异构体;将左旋的定为羟基在左的甘油醛,称为L-异构体。也就是说,若单糖的该手性碳原子与D-甘油醛相同,羟基位于右端,则标记为D-单糖;若与L-甘油醛相同,羟基位于左端,则标记为L-单糖。
如本公开所使用的,术语“4-差向异构酶”是指,能交换糖的碳4位羟基的酶。示例性的,“4-差向异构酶”能够交换D-塔格糖和D-果糖的碳4位羟基的酶、能够交换D-木酮糖和L-核酮糖的碳4位羟基的酶、能够交换D-葡萄糖和D-半乳糖的碳4位羟基的酶、能够交换D-木糖和L-阿拉伯糖的碳4位羟基的酶。
如本公开所使用的,术语“多肽”、“酶”、“多肽或酶”、“多肽/酶”具有相同的含义,其在本公开中可以互换。前述术语是指一种由和很多氨基酸通过肽键组成的聚合物,其可能含有或可能不含有如磷酸基和甲酰基的修饰。
如本公开所使用的,术语“D-木酮糖4-差向异构酶”(D-xylulose 4-epimerase)及其缩写名称“Xu4E”是指,能够催化D-木酮糖和L-核酮糖相互转化的多肽(酶)。
如本公开所使用的,1个“酶活力单位(U)”定义为每分钟底物通过酶催化反应生成1μmol产物所需要的酶量。
如本公开所使用的,术语“比酶活”也被表述为“比活力”或“比活度”,其在本公开中具有相同的含义,可以互换使用。其是指每毫克多肽(酶)的酶活力(U/mg)。
如本公开所使用的,在两种核酸或多肽比较中的术语“序列同一性”或“同一性百分比”,是指当使用核苷酸或氨基酸残基序列比较算法或通过目视检查测量,以最大的对应性进行比较和比对时,它们是具有相同的或具有相同序列特定百分比数。也就是说,核苷酸或者氨基酸序列的同一性可以利用下述比例来定义,该比例是将两个或多个核苷酸 或氨基酸序列按照一致的核苷酸或氨基酸数比较达到最大的方式,并根据需要加入空位来进行比对时一致的核苷酸数或氨基酸数,在比对部分的全部核苷酸或氨基酸数中的比例。
如本公开所使用的,两个或更多个多核苷酸或多肽之间的序列同一性可通过以下方法测定:将多核苷酸或多肽的核苷酸或氨基酸序列对准且对经对准的多核苷酸或多肽中含有相同核苷酸或氨基酸残基的位置数目进行评分,且将其与经对准的多核苷酸或多肽中含有不同核苷酸或氨基酸残基的位置数目进行比较。多核苷酸可例如通过含有不同核苷酸或缺失核苷酸而在一个位置处不同。多肽可例如通过含有不同氨基酸或缺失氨基酸而在一个位置处不同。序列同一性可通过用含有相同核苷酸或氨基酸残基的位置数目除以多核苷酸或多肽中氨基酸残基的总数来计算。举例而言,可通过用含有相同核苷酸或氨基酸残基的位置数目除以多核苷酸或多肽中核苷酸或氨基酸残基的总数且乘以100来计算同一性百分比。
示例性的,在本公开中,当使用序列比较算法或通过目视检查测量以最大的对应性进行比较和比对时,两个或多个序列或子序列具有至少40%、50%、60%、70%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%或99%核苷酸或氨基酸残基的“序列同一性”或“同一性百分比”。“序列同一性”或“同一性百分比”的判断/计算可以基于序列任何合适的区域上。例如,长度至少约50个残基的区域、至少约100个残基的区域,至少约200个残基的区域,至少约400个残基的区域,或至少约500个残基的区域。在某些实施方案中,所述序列在任一或两个相比较的生物聚合物(就是核酸或多肽)的整个长度上基本相同。
如本公开所使用的,将不同序列的核苷酸或者氨基酸的编号进行对应的依据为,当使用核苷酸或氨基酸残基序列比较算法或通过目视检查测量,以最大的对应性进行比较或比对,进而判断“序列同一性”或“同一性百分比”时,目标核苷酸或者目标氨基酸相较于基准核苷酸或者基准氨基酸的编号。示例性的,在本公开中,“SEQ ID NO:5所示的序列,依据SEQ ID NO:2所示的序列进行编号”是指,SEQ ID NO:5所示的序列(相当于目标氨基酸)和SEQ ID NO:2所示的序列(相当于基准氨基酸)进行“序列同一性”或“同一性百分比”判断时,将SEQ ID NO:5所示的序列以最大的对应性对于SEQ ID NO:2所示的序列进行比较或比对时,SEQ ID NO:2所示的序列所对应的编号,即为SEQ ID NO:5所示的序列的编号。
如本公开所使用的,术语“氨基酸突变”或“核苷酸突变”,包括“取代、重复、缺失或添加一个或多个氨基酸或核苷酸”。在本公开中,术语“突变”是指核苷酸序列或者氨基酸序列的改变。在一个具体的实施方式中,术语“突变”是指“取代”。
在一个实施方式中,本公开的“突变”可以选自“保守突变”。在本公开中,术语“保守突变”是指可正常维持蛋白质的功能的突变。保守突变的代表性例子为保守置换。
如本公开所使用的,术语“保守置换”涉及用具有类似侧链的氨基酸残基替换氨基酸残基。本领域已经定义了具有类似侧链的氨基酸残基家族,并且包括具有碱性侧链(例如赖氨酸、精氨酸和组氨酸)、酸性侧链(例如天冬氨酸和谷氨酸)、不带电极性侧链(例如甘氨酸、天冬酰胺、谷氨酰胺、丝氨酸、苏氨酸、酪氨酸、和半胱氨酸)、非极性侧链(例如丙氨酸、缬氨酸、亮氨酸、异亮氨酸、脯氨酸、苯丙氨酸、蛋氨酸和色氨酸)、β-支链(例如苏氨酸、缬氨酸和异亮氨酸)和芳香侧链(例如酪氨酸、苯丙氨酸、色氨酸和组氨酸)。
如本公开所使用的,“保守置换”通常在蛋白质的一个或多个位点上交换一种氨基酸。这种取代可以是保守的。作为被视作保守置换的置换,具体而言,可以举出Ala向Ser或Thr的置换、Arg向Gln、His或Lys的置换、Asn向Glu、Gln、Lys、His或Asp的置换、Asp向Asn、Glu或Gln的置换、Cys向Ser或Ala的置换、Gln向Asn、Glu、Lys、His、Asp或Arg 的置换、Glu向Gly、Asn、Gln、Lys或Asp的置换、Gly向Pro的置换、His向Asn、Lys、Gln、Arg或Tyr的置换、Ile向Leu、Met、Val或Phe的置换、Leu向Ile、Met、Val或Phe的置换、Lys向Asn、Glu、Gln、His或Arg的置换、Met向Ile、Leu、Val或Phe的置换、Phe向Trp、Tyr、Met、Ile或Leu的置换、Ser向Thr或Ala的置换、Thr向Ser或Ala的置换、Trp向Phe或Tyr的置换、Tyr向His、Phe或Trp的置换、及Val向Met、Ile或Leu的置换。此外,保守突变还包括起因于基因所来源的个体差异、株、种的差异等天然产生的突变。
如本公开所使用的,术语“多核苷酸”指由核苷酸组成的聚合物。多核苷酸可以是单独片段的形式,也可以是更大的核苷酸序列结构的一个组成部分,其是从至少在数量或浓度上分离一次的核苷酸序列衍生而来的,能够通过标准分子生物学方法(例如,使用克隆载体)识别、操纵以及恢复序列及其组分核苷酸序列。当一个核苷酸序列通过一个DNA序列(即A、T、G、C)表示时,这也包括一个RNA序列(即A、U、G、C),其中“U”取代“T”。换句话说,“多核苷酸”指从其他核苷酸(单独的片段或整个片段)中去除的核苷酸聚合物,或者可以是一个较大核苷酸结构的组成部分或成分,如表达载体或多顺反子序列。多核苷酸包括DNA、RNA和cDNA序列。“重组多核苷酸”属于“多核苷酸”中的一种。
如本公开所使用的,术语“重组多核苷酸”指具有在自然界中不连接在一起的序列的多核苷酸。重组多核苷酸可包括在合适的载体中,且该载体可用于转化至合适的宿主细胞。含有重组多核苷酸的宿主细胞被称为“重组宿主细胞”。然后多核苷酸在重组宿主细胞中表达以产生例如“重组多肽”。
如本公开所使用的,术语“表达”包括涉及多肽产生的任何步骤,包括但不限于:转录、转录后修饰、翻译、翻译后修饰、和分泌。
如本公开所使用的,术语“表达载体”是指线状或环状DNA分子,该分子包含编码多肽的多核苷酸并且该多核苷酸有效地连接于供用于其表达的控制序列。
如本公开所使用的,术语“重组表达载体”指用于表达例如编码所需多肽的多核苷酸的DNA结构。重组表达载体可包括,例如包含i)对基因表达具有调控作用的遗传元素的集合,例如启动子和增强子;ii)转录成mRNA并翻译成蛋白质的结构或编码序列;以及iii)适当的转录和翻译起始和终止序列的转录亚单位。重组表达载体以任何合适的方式构建。载体的性质并不重要,并可以使用任何载体,包括质粒、病毒、噬菌体和转座子。用于本公开的可能载体包括但不限于染色体、非染色体和合成DNA序列,例如细菌质粒、噬菌体DNA、酵母质粒以及从质粒和噬菌体DNA的组合中衍生的载体,来自如牛痘、腺病毒、鸡痘、杆状病毒、SV40和伪狂犬病等病毒的DNA。
如本公开所使用的,术语“可操作地连接”是指如下的构造:调控序列相对于多核苷酸的编码序列安置在适当位置,从而使得该调控序列指导该编码序列的表达。示例性的,所述调控序列可以选自启动子和/或增强子编码的序列。
如本公开所使用的,术语“核酸构建体”包含与适合的调控序列有效地连接的编码多肽或结构域或模块的多核苷酸,该调控序列对于在所选细胞或者菌株进行多核苷酸的表达是必需的。
如本公开所使用的,术语“内源的”指在生物体或细胞内自然表达或产生的多核苷酸、多肽或其他化合物。也就是说,内源性多核苷酸、多肽或其他化合物不是外源的。例如,当细胞最初从自然界分离时,细胞中存在一种“内源性”多核苷酸或多肽。
如本公开所使用的,术语“外源的”指在需要表达的特定细胞或有机体中天然发现或表达的任何多核苷酸或多肽。外源多核苷酸、多肽或其他化合物不是内源性的。
如本公开所使用的,术语“野生型的”指在自然界中可以找到的对象。例如,一种存在于生物体中,可以从自然界的一个来源中分离出来并且在实验室中没有被人类有意修改的多肽或多核苷酸序列是天然存在的。如本公开所用的,“天然存在的”和“野生型的” 是同义词。
如本公开所使用的,术语“突变体”是指相对于“野生型”,或者“相比较的”多核苷酸或多肽,在一个或多个(例如,若干个)位置处包含改变(即,取代、插入和/或缺失)的多核苷酸或多肽,其中,取代是指用不同的核苷酸或氨基酸置换占用一个位置的核苷酸或氨基酸。缺失是指去除占据某一位置的核苷酸或氨基酸。插入是指在邻接并且紧随占据位置的核苷酸或氨基酸之后添加核苷酸或氨基酸。示例性的,本公开中的“突变体”为仍然具有D-木酮糖4-差向异构酶(Xu4E)活性的多肽。
如本公开所使用的,“过表达的”重组基因比微生物中相应的天然存在的基因产生更多的RNA和/或蛋白质。本领域测量RNA和蛋白质量的方法是已知的。过表达也可以通过测量蛋白质活性,如酶活性来测定。根据本公开的实施方案,“过表达”是至少3%、至少5%、至少10%、至少20%、至少25%或至少50%以上的量。过表达的多核苷酸通常是宿主细胞固有的多核苷酸,其产物的生成量大于宿主细胞中的正常生成量。例如但并非限制,过表达通过可操作性地将多核苷酸连接到不同于多核苷酸的天然启动子的启动子,或将多核苷酸的附加拷贝引入宿主细胞实现。
如本公开所使用的,术语“片段”意指从成熟多肽或结构域的氨基和/或羧基末端缺失一个或多个(例如,若干个)氨基酸的一种多肽或一个催化或碳水化合物结合模块。在本公开的技术方案中,所述片段具有D-木酮糖4-差向异构酶(Xu4E)活性。
如本公开所使用的,术语“分离的”意指处于自然界中不存在的形式或环境中的物质。分离的物质的非限制性实例包括(1)任何非天然存在的物质,(2)包括但不限于任何酶、突变体、核酸、蛋白质、肽或辅因子的任何物质,该物质至少部分地从与其本质相关的一种或多种或所有天然存在的成分中去除;(3)相对于天然发现的物质通过人工修饰的任何物质;或(4)通过相对于与其天然相关的其他组分增加物质的量而修饰的任何物质(例如宿主细胞中的重组产生;编码该物质的基因的多个拷贝;以及使用比与编码该物质的基因天然相关的启动子更强的启动子)。分离的物质可以存在于发酵液样品中。例如宿主细胞可以被遗传修饰以表达本公开的多肽。来自宿主细胞的发酵液将包含分离的多肽。分离的物质可以存在于生物转化液的样品中。例如目标产物L-阿拉伯糖可以从酶催化的多糖混合液体中分离出来。
如本公开所使用的,术语“高严格条件”是指,对于长度为至少100个核苷酸的探针而言,遵循标准DNA印迹程序,在42℃处在5X SSPE(saline sodium phosphate EDTA)、0.3%SDS、200微克/ml剪切并变性的鲑精DNA和50%甲酰胺中预杂交和杂交12至24小时。最后在65℃处使用2X SSC、0.2%SDS将载体材料洗涤三次,每次15分钟。
如本公开所使用的,术语“非常高严格条件”是指,对于长度为至少100个核苷酸的探针而言,遵循标准DNA印迹程序,在42℃处在5X SSPE(saline sodium phosphate EDTA)、0.3%SDS、200微克/ml剪切并变性的鲑精DNA和50%甲酰胺中预杂交和杂交12至24小时。最后在70℃处使用2X SSC、0.2%SDS将载体材料洗涤三次,每次15分钟。
如本公开所使用的,术语“游离酶”指不含活生物体的酶。本公开的游离酶可在经过裂解其所表达的细胞,是部分或高度纯化之后,悬浮在溶液中,是可溶的或结合到不溶性基质上。
如本公开所使用的,术语“固定化酶”,是指在一定的空间范围内起催化作用,并能反复和连续使用的酶。通常酶催化反应都是在水溶液中进行的,而固定化酶是将水溶性酶用物理或化学方法处理,使之成为不溶于水的,但仍具有酶活性的状态。
如本公开所使用的,术语“宿主细胞”意指易于用包含本公开的多核苷酸的核酸构建体或表达载体转化、转染、转导等的任何细胞类型。术语“宿主细胞”涵盖由于复制过程中发生的突变而与亲本细胞不同的亲本细胞的任何后代。
如本公开所使用的,术语“全细胞微生物”指没有完全裂解细胞膜的全细胞。含有所 述酶的全细胞微生物可以直接使用,或固定化以保持稳定性和可回收再利用,或者可以对全细胞进行通透性处理,以获得快速反应速率。
如本公开所使用的,术语“催化反应”是指在催化剂作用下进行的化学反应。一种催化剂只能选择性地加速特定的反应,从而可能使化学反应朝着几个热力学可能的方向之一进行。催化剂与反应物处于同一相的称均相催化反应(Homogeneous Catalytic Reaction),处于不同相者称异相催化反应(或多相催化反应)(Heterogeneous Catalytic Reaction)。由生物催化剂-酶参加的反应称酶催化反应(Enzymic Catalytic Reaction)。
在一个具体的实施方式中,所述催化反应可以在全细胞体外由酶或多酶进行催化,前述催化反应也可以被称为“酶催化反应”,指利用酶作为催化剂进行化学转化的过程,这种反应过程又称为生物转化(biotransformation)或生物催化(biocatalysis)。
在另一个具体的实施方式中,所述催化反应可以在生物体内/细胞内进行,前述催化反应也可以被称为“胞内催化反应”。
如本公开所使用的,术语“胞内催化反应”也可以被称为“全细胞生物催化反应”,其是指利用完整的生物有机体(即全细胞、组织甚至个体)作为催化剂进行化学转化的过程。全细胞生物催化反应中常用的有机催化体主要是微生物,其本质是利用一个或多个微生物细胞内的一个酶或多个酶进行催化。现在也发展出利用动物细胞、植物细胞甚至生物个体等进行的生物转化方式。常用的全细胞生物转化方式是将细胞固定于反应平面、微球体悬浮、多孔固相载体等方式。
如本公开所使用的,术语“发酵产物”是指由细胞发酵产生、不经历或经历最低限的回收和/或纯化的制剂。发酵产物可以含有在发酵结束时得到的发酵材料的未分级的或分级的内容物。典型地,发酵产物是未分级的并且包含用过的培养基以及例如通过离心去除微生物细胞(例如,丝状真菌细胞)之后存在的细胞碎片。在一些实施方式中,发酵产物含有用过的细胞培养基、胞外酶以及有活力的和/或无活力的微生物细胞。
如本公开所使用的,术语“生物催化产物”是指由生物催化剂(多肽或酶或全细胞)进行生物催化产生的不经历或经历最低限的回收和/或纯化的制剂。生物催化是在生物催化剂催化的水相缓冲液中进行,包含金属离子。在一些实施方式中,生物催化剂包括游离酶进行反应、含有所述酶的细胞裂解液、含有所述酶的全细胞生物、固定化所述酶、和交联所述酶的聚集体。
如本公开所使用的,术语“生物反应器”是利用酶或生物体(如微生物)所具有的生物功能,进行生物转化反应的装置系统。它是一种生物功能模拟机,如发酵罐、固定化酶或固定化细胞反应器等。
除非另外定义或由背景清楚指示,否则在本公开中的全部技术与科学术语具有如本公开所属领域的普通技术人员通常理解的相同含义。
D-木酮糖4-差向异构酶及突变体
在一个实施方式中,我们发现了从未报道过的戊糖4-差向异构酶,其可实现D-木酮糖和L-核酮糖之间的相互转化。我们将其命名为D-木酮糖4-差向异构酶(Xu4E)(图2)。
在一个具体的实施方式中,我们首次从两个酶家族的一些酶中发现了有具有Xu4E的酶活性:塔格糖酮酸3-差向异构酶(EC 5.1.2.7)和L-核酮糖5-磷酸4-差向异构酶(EC 5.1.3.4)。
在另一个实施方式中,我们利用野生型Xu4E,制造了xu4e的DNA突变体库,并从中鉴定了具有改变的物理化学性质的Xu4E突变体。
示例性的,在一个具体的实施方式中,Xu4E突变体相较于野生型Xu4E,其比酶活增加;在另一个具体的实施方式中,Xu4E突变体相较于野生型Xu4E,其反应速率增加;在另一个具体的实施方式中,Xu4E突变体相较于野生型Xu4E,其K m降低。
用于L-戊糖生产的人工酶途径
我们设计了无需辅酶NAD(P)的、基于Xu4E将D-木糖转化为6种L-戊糖的人工多酶催化途径(表2)。借助于Xu4E及其突变体,我们设计了人工多酶催化途径,通过4-差向异构酶、3-差向异构酶和醛糖异构酶(图3和表2),从D-木糖制备6种L-戊糖(即L-阿拉伯糖、L-核糖、L-核酮糖、L-木糖、L-来苏糖和L-木酮糖),这些人工多酶途径不需要昂贵的两个NAD(P)依赖的氧化还原酶(即醛糖还原酶和多元醇脱氢酶)。
示例性的,在本公开所示的L-戊糖的生产过程中,不需要通过图1所示的经过木糖醇或核糖醇的途径。
表2.基于Xu4E从D-木糖生产L-戊糖的人工多酶催化途径
Figure PCTCN2020133395-appb-000002
酶和/或其突变体
本公开公开的新酶自然存在于各种生物体中。虽然在实施例中使用具有所需活性的特定酶,但本公开不限于这些酶,因为其他酶可能具有类似的活性并且可以使用。例如,可能发现一些新的多肽也可以催化D-木酮糖和L-核酮糖的相互转化。本公开中描述的其他反应可由本实施的方案中未描述的酶催化,也被包括在本实施方案中。
在某些实施方案中,可在本公开中使用这些酶的突变体,其中催化活性已被改变,例如使其在酸性或碱性条件下更为活跃和稳定。多肽的氨基酸序列突变体包括替换、插入或缺失突变体,并且突变体可与未经修饰的酶基本上同源或基本相同。在某些实施方案中,突变体至少保留酶的一些生物活性,例如催化活性。其他突变体包括保留至少约10%,优选至少约50%、更优选至少约75%、最优选至少约90%生物活性的酶突变体。
一种来源于生物体的多肽或多核苷酸含有对天然氨基酸序列或核苷酸序列的一个或多个修饰,并与天然酶相比,如果不是更好的话,表现出类似的活性(例如,是天然酶活性水平的至少10%、至少30%、至少50%、至少70%、至少80%、至少90%、至少100%或者至少110%,甚至更高的酶活性)。例如,在某些情况下,通过亲本/天然存在序列的定向进化改善酶活性。另外可选地,对酶编码序列进行突变以获得所需的性能。示例性 的,所述“所需的性能”选自热稳定性更好、反应速率增加、最适pH改变或金属辅因子偏好性等。
酶的形式
本公开中所使用的游离酶或含有所述酶的细胞裂解液是水溶性的。通常最好使用固定化酶。固定化酶通常更稳定和耐用。固定化酶也更容易回收并在多个催化循环中使用,从而降低了生产过程的成本。在本领域,许多酶固定化的方法是已知的。酶也可以交叉连接以形成交联酶聚集体(Cross-linked enzyme aggregate,CLEA),其通常更稳定并且更易于回收和再利用。许多酶都存在于生物体中,它们可以作为生物催化剂生产稀少糖,但是它们也可以在经工程改造的微生物中进行异源表达,然后可以用作生物催化剂。
本公开中所使用的重组酶可保留在整个细胞中,而无需完全的细胞裂解。整个细胞含有一种或多种酶。通常最好使用固定化的全细胞。全细胞可以通过许多技术进行通透性处理,如有机溶剂处理、化学试剂处理或热处理。固定化细胞也更容易回收并在多个催化循环中反复使用,从而降低了生产过程的成本。在本领域,许多全细胞通透性和全细胞固定化的方法是已知的。本公开涉及催化本公开所述反应的全细胞的固定化和交联方法。
易错PCR
易错PCR是在采用DNA聚合酶进行启动子序列扩增时,通过调整反应条件,如提高镁离子浓度、加入锰离子、改变体系中四种的dNTPs浓度或运用低保真度DNA聚合酶等方法,来改变DNA扩增过程中的突变频率,从而以较高突变频率向目标DNA序列中随机引入突变,获得针对目标序列的随机突变体。
生产L-戊糖的过程
(1)生产/分离/纯化过程
本公开的方法和组合物可以适于各种常规发酵或酶催化生物反应器(例如分批、分批进料、细胞或酶再循环和连续发酵或连续酶催化)。
在本公开的实施例中,每单位时间形成的生物催化产物的量一般来说是酶的催化活性条件(例如pH、温度、金属离子)和催化工艺中所存在的酶的量的函数。
在本公开的实施方式中,含有所述金属离子的溶液中可以含有一种或者多种金属离子。示例性的,所述含有金属离子的溶液可以选自含有CuCl 2、FeCl 3、ZnCl 2、CaCl 2、MgCl 2、CoCl 2、NiCl 2或MnCl 2的溶液。
高效微生物催化的发酵工艺的一些关键参数包括能够使微生物生长到较大细胞密度、增加所要产物的产率、增加容积生产率的量、去除不合需要的共代谢物、改进便宜的碳和氮来源的利用、调适以改变发酵槽条件、增加菌体的生产、增加重组酶的合成、增加对酸性条件的耐受性、增加对碱性条件的耐受性、增加对有机溶剂的耐受性、增加对高盐条件的耐受性和增加对高温或低温的耐受性。
在一些实施例中,如本文中提供的多个酶,它们可以以游离酶、含有所述酶的细胞裂解液、含有所述酶的全细胞、固定化酶中的一种或多种形式存在,在含戊糖底物的反应液中进行生物催化,使转化的产物生成到反应液中。在一个实施例中,酶催化的最终产物可以使用本领域中已知的任何合适方法从反应液中分离。
L-戊糖可从多酶、反应物、反应中间物以及生物催化产物中分离,并使用本领域已知的各种方法从反应物以及反应中间物回收和/或纯化生物催化产物。在一些实施例中,将生物催化产物从生物反应器中回收。在一个实施例中,使微生物破裂,并且将培养基或溶解物离心以去除微粒细胞碎片和分离细胞膜,得到可溶蛋白质部分包括所述酶,将所述多酶可以催化生产L-戊糖。L-戊糖的分离纯化方法包括,但不限于色谱法、模拟移动床色谱法、结晶法、基于离子型、疏水性和尺寸排除树脂的吸附和释放、过滤、微滤、超滤、纳滤、离心、萃取、盐或溶剂沉淀、干燥或其组合。所需的分离不限于酶的去除/ 回收,也包括回收一些或所有剩余产物和反应物(包括D-木糖、D-木酮糖、L-戊糖和金属离子)的混合物;所需的分离可能无需进一步纯化。具有或不具有D-木糖、D-木酮糖和L-核酮糖回收、以及酶的纯化、固定化和回收利用是包括在本公开进一步的实施方案。
(2)在经改造的微生物细胞生产多肽(酶)
本公开所述的催化一些或所有反应的酶可在非天然的、经工程改造的异源生物体中表达。具体地,编码途径用酶的基因可被分离,插入到用于转化生产的生物体的表达载体中,可被并入至基因组,并直接表达所述酶。在本领域,用于操纵微生物的方法是已知的,如《分子生物学现代方法》(Online ISBN:9780471142720,John Wiley and Sons,Inc.)、《微生物代谢工程:方法和规程》(Qiong Cheng Ed.,Springer)和《系统代谢工程:方法和规程》(Hal S.Alper Ed.,Springer)等出版物中被解释。
可以使用本领域熟知的方法通过上调或下调该多核苷酸的表达来构建突变体,例如插入、破坏、替换、或缺失。例如,待修饰或失活的多核苷酸可以是活性必需的编码区或其部分,或编码区的表达所需的调控元件。此类调节或控制序列的实例可以是启动子序列或其功能部分,即足以影响多核苷酸表达的部分。可修饰的其他控制序列包括但不限于:前导子、多腺苷酸化序列、前肽序列、信号肽序列、转录终止子、和转录激活因子。
本领域技术人员可培养经工程改造的微生物细胞生产所述酶。微生物细胞生产重组酶的指南和规程可在以下出版物中找到,如《发酵和生物化学工程手册:原理、过程设计和仪器》(2nd Edition,Henry C.Vogel and Celeste L.Todaro,Noyes Publications 1997)和《发酵技术原理》(2nd Edition,P.F.Stanbury et.al.,Butterworth Heineman,2003)。
(3)生物反应条件
在本公开的一些实施例中,将多个酶进行混合组成人工多酶途径,可以将原料,如D-木糖或其他中间产物(D-木酮糖),转化为L-戊糖,并回收L-戊糖。生物反应过程可以在有氧、微氧或无氧条件下进行。在本公开的另一些实施例中,生物催化反应在无氧条件下(即没有可检测的氧气)中进行。
在本公开的一些实施例中,生物反应过程在30℃-90℃的条件下进行反应。在本公开的一些具体实施例中,生物反应过程在40℃-80℃的条件下进行反应。在本公开的一些更具体实施例中,生物反应过程在50℃-70℃的条件下进行反应。在本公开的一些更具体实施例中,生物反应过程在60℃-70℃的条件下进行反应。
质粒以及重组蛋白制备
所有重组蛋白的过表达均使用大肠杆菌E.coli BL21(DE3)进行。本公开中涉及的所有重组蛋白的表达/过表达方法,均可以根据《分子生物学实验指南》记载的技术方案进行。
示例性的,携带有相应蛋白编码基因的pET质粒制备方式如下所述。
为制备可能拥有Xu4E酶活的L-核酮糖5-磷酸-4-差向异构酶(RP4E),它们来源于海栖热孢菌(T.maritima)、大肠杆菌(E.coli)、芽孢杆菌(Bacillus subtilis 168)以及嗜热脂肪芽孢杆菌(Geobacillus stearothermophilus)的rp4e基因被从它们相应的基因组中扩增获得。通过基于延长重叠延伸PCR(POE-PCR)的简单克隆技术(You,C.,X.-Z.Zhang and Y.-H.P.Zhang(2012)."Simple Cloning:direct transformation of PCR product(DNA multimer)to Escherichia coli and Bacillus subtilis."Appl.Environ.Microbiol.78:1593-1595.),它们被插入到pET20b质粒,构建得到相应质粒pET20b-TmRP4E、pET20b-EcRP4E、pET20b-BsRP4E和pET20b-GsRP4E。POE-PCR的反应条件如下:250ng pET20b质粒骨架及等摩尔的目的基因片段,0.2mM各种dNTPs,以及0.02U/μl Q5 DNA聚合酶。PCR扩增条件:98℃ 1min;98℃ 20s,60℃ 20s,72℃ 72s,30个循环;72℃ 5min。
为制备可能拥有Xu4E酶活的野生型D-木酮糖4-差向异构酶(Xu4E),利用一对引物F_UxaE(F)和R_UxaE(R):
F(SEQ ID NO:123):5’-GAGATATACCCATATGGTCTTGAAAGTGTTCAAAGACC-3’;
R(SEQ ID NO:124):GGTGGTGGTGCTCGAGCCCCTCCAGCAGATCCACGTGCC-3’。
通过PCR方法,将uxaE基因从海栖热孢菌基因组中扩增得到。
基于pET28a,利用一对引物F_pET28a(F)和R_pET28a(R)进行扩增:
F(SEQ ID NO:125):5’-GCTGGAGGGGCTCGAGCACCACCACCACCACCACTG-3’;
R(SEQ ID NO:126):5’-CTTTCAAGACCATATGGGTATATCTCCTTCTTAAAG-3’。
通过POE-PCR方法,扩增得到多聚体质粒,被转入到大肠杆菌E.coli TOP10,得到质粒pET28a-tm_UxaE。
为制备L-阿拉伯糖异构酶(L-AI),来源于耐热菌嗜热脂肪芽孢杆菌(Geobacillus stearothermophilus)的AI的DNA序列经过密码子优化并由通用生物(安徽,中国)合成,得到质粒pET20b-BsAI。
编码来源于嗜热栖热菌(Thermus thermophiles)的耐热木糖异构酶(D-XI)的质粒pET20b-TtcXI,是从参考文献得到(Wu等2018)。
除特别注明,所有重组酶均有拥有组氨酸融合标签,利用镍离子树脂通过亲和吸附进行纯化。携带有编码目的蛋白基因的pET质粒利用大肠杆菌E.coli BL21细胞在250毫升LB培养基中培养,培养温度为37℃。当细胞吸光度A 600达到~0.6-0.8时,添加0.1mM IPTG诱导蛋白表达。蛋白表达在37℃进行6小时或在18℃进行16小时。通过离心收集细胞后,使用包含有0.1M氯化钠以及10mM咪唑的50mM HEPES缓冲液(pH 7.5)重悬沉淀。超声破碎细胞膜,离心后,将含所述酶的上清液样品上样到镍离子树脂纯化柱上。使用包含有0.1M氯化钠以及150-500mM咪唑的50mM HEPES缓冲液(pH 7.5)洗脱纯化目的酶。酶浓度可以被Bradford测量方法进行测定,牛血清蛋白用作标准品蛋白。重组蛋白的表达水平以及蛋白的纯度使用十二烷基硫酸钠聚丙烯酰胺凝胶电泳(SDS-PAGE)进行检测,并使用Image Lab软件(伯乐,赫拉克勒斯,加利福尼亚,美国)的丰度分析功能进行定量。
用于L-阿拉伯糖合成的热稳酶,是来源于嗜热栖热菌(Thermus thermophiles)的D-木糖异构酶、来源于海栖热孢菌(T.maritima)的野生型Xu4E及Xu4E突变体M8、以及来源于嗜热脂肪芽孢杆菌(Geobacillus stearothermophilus)的L-阿拉伯糖异构酶,它们可以通过热处理(50-80℃,10-60分钟)进行纯化。细胞裂解液经过热处理和离心,将包含上述三个酶的上清液混合,可以用于从D-木糖到L-阿拉伯糖的转化。
构建筛选质粒pGS-Xu4E及筛选宿主大肠杆菌JZ919
筛选质粒pGS-Xu4E包含在P BAD启动子控制下的mCherry基因、在P AraC启动子控制下的野生型的araC基因,以及在P tac启动子控制下的xu4e基因。在大肠杆菌细胞中,Xu4E阳性突变体可产生更多的L-阿拉伯糖,从而诱导大肠杆菌细胞表达更高水平的mCherry荧光蛋白,产生更强的荧光信号。筛选质粒利用标准DNA组装技术进行构建。
大肠杆菌E.coli JZ919(TOP10ΔxylB::araA)被构建用作筛选宿主,与筛选质粒pGS-Xu4E共同使用。筛选质粒pGS-Xu4E携带有基因传感器能对胞内的L-阿拉伯糖浓度进行检测,显示mCherry荧光信号。为积累L-阿拉伯糖,大肠杆菌宿主细胞中与D-木酮糖和L-核酮糖利用相关的两个基因被敲除,同时araA基因被插入到大肠杆菌的基因组。从大肠杆菌E.coli Top10(ΔaraABCD)出发,基因组中的xylB基因被araA基因替代,敲除与插入同步进行。
构建Xu4E突变体文库
xu4e基因的突变体文库利用低突变率的易错PCR(ep-PCR)建立。
以质粒pET28a-UxaE作为DNA模板,引物为:MUxaE-IF(SEQ ID NO:127): 5’-CCATATGGTCTTGAA-3’;MUxaE-IR(SEQ ID NO:128):5’-GGTGGTGGTGCTCGAGCCCCTCCAGCAGATCCACG TGCC-3’。
通过PCR扩增得到突变体文库。MUxaE-IF为5’-末端磷酸化引物。MUxaE-I R 5’末端的最后28bp序列与质粒骨架的序列为同源互补序列。50μl ep-PCR反应体系包括1ng/μl质粒pGS-Xu4E、0.2mM dATP、0.2mM dGTP、1mM dCTP、1mM dTTP、5mM MgCl 2、0.05mM MnCl 2、0.4μM引物(MUxaE-IF和MUxaE-IR)以及0.05U/μl NEB Taq聚合酶。
Xu4E突变体文库的高通量筛选
E.coli JZ919的化学感受态细胞可以根据现有技术,例如通过《分子生物学实验指南》中记载的方法进行制备。进一步的,将携带uxaE突变体文库的E.coli JZ919细胞在含有D-木糖的LB固体培养基上进行培养。在37℃培养12小时后,每4小时观察菌落颜色。通过眼球观察或者通过UV辐射检测菌落的荧光强度挑选阳性克隆。显示更强荧光强度的克隆被挑选并在含有0.5毫升LB培养基的96深孔板培养,LB培养基中添加D-木糖,在37℃培养12小时。96孔板中细胞培养基的荧光信号利用SynergyMx多功能微孔板检测仪(伯腾,佛蒙特,美国)。荧光激发扫描在589nm进行,发射扫描在610nm进行。
判断多肽/酶是否具有Xu4E酶活性的测定方法
D-木糖/D-木酮糖混合液的准备。D-木糖/D-木酮糖混合液在1毫升50mM HEPES缓冲液(pH 7.5)中制备,缓冲液中含有1M木糖、5mM MgCl 2和50mg固定化的D-XI,XI购自Sigma-Aldrich(G4166)。在70℃,过夜反应后,通过离心除去固定化的XI。D-木糖/D-木酮糖混合液包含大约700mM木糖和300mM木酮糖。
Xu4E的酶活性测定是使用分步酶活测定方法。反应液是50mM Tris缓冲液包括含有70mM木糖,30mM木酮糖和0.2mM Co 2+或2mM Zn 2+。除特殊注明,Xu4E酶活测定在30-80℃进行15分钟到24小时,酶蛋白溶度是0.001-10g/L。Xu4E催化反应后,吸取65μL反应液与35μL 1.88M HClO 4混合,混合液加入13.5μL 5M KOH中和。离心去掉沉淀后,第二步转化反应在含有10U/mL(过量的)L-AI和1mM Mn 2+的50mM HEPES缓冲液(pH 7.5)中进行将含有L-核酮糖的上清液被转化为L-阿拉伯糖,酶反应在50℃进行15min。L-阿拉伯糖浓度用Megazyme L-阿拉伯糖/D-半乳糖测定试剂盒(K-ARGA,博雷,爱尔兰)测定。
为测定动力学参数,Xu4E的酶活测定在含有0.2mM Co 2+的50mM Tris缓冲液(pH 8.5)中在50℃进行。D-木糖及D-木酮糖的总浓度为1.5到1000mM,而D-木酮糖的浓度为0.5到300mM。酶活测定在50℃进行15min。Xu4E对于D-木酮糖的表观K m及k cat常数基于米氏方程的非线性拟合,使用GraphPad Prism 5软件计算(Graphpad软件公司,洛杉矶,加利福尼亚,美国)。
利用D-木糖生产L-阿拉伯糖
从50mM D-木糖生产L-阿拉伯糖在1毫升的反应体系中进行,反应体系是100mM HEPES缓冲液(pH 8.0)包括0.2mM Co 2+、1mM Mn 2+、1g/L Xu4E(野生型,它的DNA序列SEQ ID No:1,和代表性突变体M8,它的氨基酸序列SEQ ID No:40)、1U/mL D-XI和1U/mL L-AI。从500mM D-木糖生产L-阿拉伯糖时,Xu4E的浓度增加为10g/L,而D-XI和L-AI的浓度也增加为10U/mL。反应液在三酶混合后在50℃反应。L-阿拉伯糖的浓度使用Megazyme L-阿拉伯糖/D-半乳糖测定试剂盒(K-ARGA,博雷,爱尔兰)测定,而D-木糖、D-木酮糖和L-核酮糖的浓度使用配有示差折光检测器的岛津高效液相色谱进行检测,使用伯乐Aminex HPLC HPX-87H液相柱进行分离。
确认Xu4E利用D-木酮糖生产的产物为L-核酮糖
通过野生型或者突变型Xu4E,得到的产物L-核酮糖使用LC-ESI-QTOF-MS分析。1mL反应体系中包括50mM Tris缓冲液(pH 8.5)、10mM D-木酮糖、0.2mM Co 2+和1g/L Xu4E。反应在50℃进行1小时后添加538μL HClO 4终止反应。混合液添加207μL 5M KOH中和。 离心去除失活的蛋白及沉淀,样品使用岛津高效液相色谱进行分离,产品使用配有电喷雾离子化(ESI)的四级杆飞行时间串联质谱QTOF(compact QTOF,Bruker,德国)进行检测。。沃特世Sugar Pak I钙离子交换柱(300x 6.5mm,10μm粒径尺寸)被用作固定相用于样品分离(Waters Co,米尔福德,马塞诸萨,美国)。流动相为去离子水,流速为0.5mL/min,柱温为80℃,上样量为20μl。ESI使用负离子模式。毛细管电压为4500V,喷雾器压力位2bar,干燥加热器为200℃,干燥气流为8L/min。
发酵产物或细胞裂解液
本公开还涉及包含本公开的多肽的发酵产物或细胞裂解液。发酵产物进一步包含在发酵过程中使用的另外的成分,例如全细胞(包括含有编码本公开的多肽的基因的宿主细胞,这些宿主细胞被用于产生感兴趣的多肽)、或细胞裂解液。在一些实施例中,该组合物是含有杀灭的含所述酶的全细胞、含所述酶的细胞裂解液以及含培养基的含所述酶的细胞杀灭的全培养液。
模拟移动床分离
模拟移动床(SMB)是一种利用吸附原理进行液体分离操作的传质设备,它是以逆流连续操作方式进行。工业化的SMB已经被越来越多的用于分离低附加值的生物产品,比如有机酸、氨基酸和稀少糖。结合酶固定化及SMB分离可有效降低所需L-戊糖的生产成本并提高底物的利用效率。示例性的,SMB树脂包括Shodex Sugar KS-801钠离子交换柱、Waters Sugar Pak I钙离子交换柱、Bio-Rad Aminex HPX-87P铅离子交换柱或Bio-Rad Aminex HPX-87H氢离子交换柱所用到的一种,或其他类似的树脂,或者它们的系列组合。
实施例
本公开的其他目的、特征和优点将从以下详细描述中变得明显。但是,应当理解的是,详细描述和具体实施例(虽然表示本公开的具体实施方式)仅为解释性目的而给出,因为在阅读该详细说明后,在本公开的精神和范围内所作出的各种改变和修饰,对于本领域技术人员来说将变得显而易见。
实施例中采用的所有试剂,除非另有强调,否则均可以通过商业途径购买获得。
材料与方法
药品与材料
除特殊注明,所有药品均为分析纯或更高纯度,采购自Sigma-Aldrich(圣路易斯,密苏里州,美国)或中国国药集团(上海,中国)。海栖热孢菌(Thermotoga maritima)MSB8以及风产液菌(Aquifex aeolicus)的基因组DNA采购自美国菌种保藏中心American Type Culture Collection(马纳萨斯,弗吉尼亚,美国)。大肠杆菌E.coli TOP10和DH5α(赛默飞世尔科技,沃尔瑟姆,马塞诸塞,美国)被用于DNA操作以及质粒扩增。大肠杆菌E.coli BL21(DE3)(英潍捷基生命技术有限公司,卡尔斯巴德,加利福尼亚,美国)被用于重组蛋白的表达。
在本公开的技术方案中,说明书核苷酸和氨基酸序列表的编号所代表的含义如下所示:
SEQ ID NO:1所示的是海栖热袍菌(Thermotoga maritima)MSB8的野生型塔格糖酮酸碳3-差向异构酶基因(NCBI参考序列:WP_004081526.1,KEGG ID TM0440)的核苷酸序列;
SEQ ID NO:2所示的是海栖热袍菌(Thermotoga maritima)MSB8的野生型塔格糖酮酸碳3-差向异构酶基因(NCBI参考序列:WP_004081526.1,KEGG ID TM0440)的氨基酸序列;
SEQ ID NO:3所示的是新阿波罗栖热袍菌(Thermotoga neapolitana)的塔格糖酮酸碳3-差向异构酶基因(NCBI参考序列:WP_015918744.1)的氨基酸序列;
SEQ ID NO:4所示的是栖热袍菌SG1(Thermotoga sp SG1)的塔格糖酮酸碳3-差向异构酶基因(NCBI参考序列:WP_101512888.1)的氨基酸序列;
SEQ ID NO:5所示的是Thermotoga caldifontis的塔格糖酮酸碳3-差向异构酶基因(NCBI参考序列:WP_041077375.1)的氨基酸序列;
SEQ ID NO:6所示的是产假假单胞菌(Pseudothermotoga lettingae)的塔格糖酮酸碳3-差向异构酶基因(NCBI参考序列:WP_012002872.1)的氨基酸序列;
SEQ ID NO:7所示的是牙合嗜盐菌(Halanaerobium congolense)的塔格糖酮酸碳3-差向异构酶基因(NCBI参考序列:WP_081374543.1)的氨基酸序列;
SEQ ID NO:8所示的是Thermosediminibacter litoriperuensis的4-差向异构酶基因(NCBI参考序列:TYP53248.1)的氨基酸序列;
SEQ ID NO:9所示的是Rhodothermus marinus的4-差向异构酶基因(NCBI参考序列:WP_012844026.1)的氨基酸序列;
SEQ ID NO:10所示的是Gracilibacillus timonensis的4-差向异构酶基因(NCBI参考序列:WP_066188474.1)的氨基酸序列;
SEQ ID NO:11所示的是栖热袍菌(Thermotogae bacterium)的4-差向异构酶基因(NCBI参考序列:HCZ06146.1)的氨基酸序列;
SEQ ID NO:12所示的是Thermotogae bacterium的4-差向异构酶基因(NCBI参考序列:RKX45454.1)的氨基酸序列;
SEQ ID NO:13所示的是醋酸假丝酵母(Candidatus Acetothermia bacterium)的4-差向异构酶基因(NCBI参考序列:HAF71394.1)的氨基酸序列;
SEQ ID NO:14所示的是Pseudothermotoga thermarum的4-差向异构酶基因(Kegg ID:Theth_1083)的氨基酸序列;
SEQ ID NO:15所示的是解热嗜热厌氧杆菌(Thermoanaerobacterium thermosaccharolyticum)DSM 571的4-差向异构酶基因(Kegg ID:Tthe_2391)的氨基酸序列;
SEQ ID NO:16所示的是Thermofilum adornatus 1505的4-差向异构酶基因(Kegg ID:TCARB_0828)的氨基酸序列;
SEQ ID NO:17所示的是Thermoanaerobacter italicus的4-差向异构酶基因(Kegg ID:Thit_1746)的氨基酸序列;
SEQ ID NO:18所示的是Thermotoga naphthophila的4-差向异构酶基因(Kegg ID:Tnap_0222)的氨基酸序列;
SEQ ID NO:19所示的是Thermoclostridium stercorarium DSM 8532的4-差向异构酶基因(Kegg ID:Cst_c08510)的氨基酸序列;
SEQ ID NO:20所示的是Dictyoglomus thermophilum的4-差向异构酶基因(Kegg ID:DICTH_1923)的氨基酸序列;
SEQ ID NO:21所示的是Spirochaeta thermophila DSM 6192的4-差向异构酶基因(Kegg ID:STHERM_c04350)的氨基酸序列;
SEQ ID NO:22所示的是Singulisphaera acidiphila的4-差向异构酶基因(Kegg ID:Sinac_2806)的氨基酸序列;
SEQ ID NO:23所示的是Thermotoga maritima MSB8的D-木酮糖5-磷酸4-差向异构酶基因(Kegg ID:TM0283)的氨基酸序列;
SEQ ID NO:24所示的是Thermotoga caldifontis的D-木酮糖5-磷酸4-差向异构酶基因(NCBI Reference Sequence:WP_041077291.1)的氨基酸序列;
SEQ ID NO:25所示的是Thermotoga neapolitana DSM 4359的D-木酮糖5-磷酸4-差向异构酶基因(GenBank:ACM22577.1)的氨基酸序列;
SEQ ID NO:26所示的是Pseudothermotoga lettingae的II类醛缩酶(GenBank:KUK21094.1)的氨基酸序列;
SEQ ID NO:27所示的是枯草芽孢杆菌(Bacillus subtilis)的D-木酮糖5-磷酸4-差向异构酶基因(Kegg ID:BSU28780)的氨基酸序列;
SEQ ID NO:28所示的是Geobacillus zalihae的D-木酮糖5-磷酸4-差向异构酶基因(NCBI Reference Sequence:WP_060788488.1)的氨基酸序列;
SEQ ID NO:29所示的是Geobacillus stearothermophilus的D-木酮糖5-磷酸4-差向异构酶基因(GenBank:KFL15052.1)的氨基酸序列;
SEQ ID NO:30所示的是Parageobacillus thermoglucosidasius的D-木酮糖5-磷酸4-差向异构酶基因(NCBI Reference Sequence:WP_042385633.1)的氨基酸序列;
SEQ ID NO:31所示的是Thermoanaerobacterium thermosaccharolyticum的D-木酮糖5-磷酸4-差向异构酶基因(NCBI Reference Sequence:WP_094043878.1)的氨基酸序列;
SEQ ID NO:32所示的是大肠艾希氏菌(Escherichia coli)K-12MG1655的D-木酮糖5-磷酸4-差向异构酶基因(Kegg ID:b0061)的氨基酸序列。
SEQ ID NO:33-122所示的是发明人构建的突变体。所述突变体的具体突变位点参见本公开中表3的记载。
需要说明的是,根据现有技术的数据库(例如GenBank)中公开的内容,与如SEQ ID NO:3-32所示的氨基酸序列所对应的核苷酸序列,也属于本领域技术人员已知的内容。
实施例1.戊糖的分离和检测
D-木糖、D-木酮糖、D-核酮糖和L-阿拉伯糖,通过使用如下(1)-(4)中所述的任一种方法进行分离:
(1)伯乐(Bio-Rad)Aminex HPLC HPX-87H液相离子交换柱进行分离,其分离条件为:柱温为60℃,流动相为5mM硫酸,流速为0.6mL/min。
(2)伯乐Aminex HPX-87P铅离子交换柱,柱温为80℃,流动相为去离子水,流速为0.6mL/min;
(3)沃特世Sugar Pak I钙离子交换柱,柱温为80℃,流动相为去离子水,流速为0.5mL/min;以及
(4)Shodex Sugar KS-801钠离子交换柱,柱温为70℃,流动相为去离子水,流速为0.5mL/min。
对于通过上述(1)-(4)的方法,分离得到的D-木糖、D-木酮糖、D-核酮糖和L-阿拉伯糖,其浓度可以使用配有示差折光检测器的岛津高效液相色谱进行检测。
实验结果:对于D-木糖、D-木酮糖、D-核酮糖和L-阿拉伯糖,利用以上(1)-(4)所示的HPLC进行色谱分离的效果如图4所示。其中,(1)所示的HPLC分离条件具有最好的分离效果。
实施例2.从L-核酮糖-5-磷酸4-差向异构酶中挖掘具有Xu4E功能的酶
考虑到底物结构的相似性以及可能的酶催化机理,我们从L-核酮糖-5-磷酸4-差向异构酶家族(RP4E,EC 5.1.3.4)中选择了有可能具备将D-木酮糖转化为L-核酮糖功能的Xu4E酶。我们分别从枯草芽孢杆菌(Bacillus subtilis 168)、嗜热脂肪芽孢杆菌(Geobacillus stearothermophilus)、大肠杆菌(Escherichia coli)以及海栖热孢菌(T.maritima)中克隆了四个RP4E,并把它们克隆至pET质粒。携带有表达质粒的E.coli BL21(DE3)被培养并且表达重组蛋白。
通过亲和吸附纯化出带有His-tag的蛋白后,通过本公开中记载的用于“ 判断多肽/酶 是否具有Xu4E酶活性的测定方法”,对得到的重组蛋白进行检测。从检测结果来看,三个来源于海栖热孢菌、枯草芽孢杆菌以及嗜热脂肪芽孢杆菌的RP4E表现出一定的Xu4E 活性,其比活力大约为0.0002~0.0003U/mg,而E.coli来源的RP4E的比活力低于0.0001U/mg。
基于上述实验结果,我们从更多具有L-核酮糖-5-磷酸4-差向异构酶家族(RP4E,EC5.1.3.4)的微生物中克隆了相应的rp4e基因,并把它们克隆至pET质粒。携带有前述表达质粒的E.coli BL21(DE3)被培养并且表达重组蛋白。进一步的,通过本公开中记载的用于“ 判断多肽/酶是否具有Xu4E酶活性的测定方法”,对得到的重组蛋白进行检测。
通过前述实验方法进行检测,发现具有Xu4E酶的活性的天然酶为由如SEQ ID NO:23-32所示的序列编码的酶。
实施例3.从塔格糖酮酸3-差向异构酶中挖掘具有Xu4E功能的新酶
考虑到底物结构的相似性以及可能的酶催化机理,我们采用和实施例2类似的方法,从塔格糖酮酸3-差向异构酶(UxaE,EC 5.1.2.7)中选择了有可能具备将D-木酮糖转化为L-核酮糖功能的新酶。我们分别从不同的微生物中克隆了多个uxae基因,并把它们克隆至pET质粒。携带有表达质粒的E.coli BL21(DE3)被培养并且表达重组蛋白。
通过亲和吸附纯化出带有His-tag的蛋白后,通过本公开中记载的用于“ 判断多肽/酶 是否具有Xu4E酶活性的测定方法”,对得到的重组蛋白进行检测。从检测结果来看,来自海栖热孢菌的塔格糖酮酸3-差向异构酶(Tm0440)在没有最优化的反应条件(70度,5mM Zn 2+),显示出比酶活大约是0.012U/mg。
通过前述实验方法进行检测,发现具有Xu4E酶的活性的天然酶为由如SEQ ID NO:2-22所示的序列编码的酶。
实施例4.增强活性的Xu4E突变体M4
我们使用来源于海栖热孢菌,具备较高热稳定性以及较高催化混杂活性的塔格糖酮酸3-差向异构酶(TmXu4E)作为定向进化出发的天然酶。其中,前述TmXu4E的氨基酸序列为如SEQ ID NO:2所示的序列,编码前述氨基酸的核苷酸序列为如SEQ ID NO:1所示的序列。
我们使用低突变率的易错PCR(即每基因产生~1个突变位点)去构建xu4e突变体文库。插入到质粒pGS-Xu4E的突变体文库被转入到宿主E.coli JZ919中,并涂布到含有D-木糖的LB培养基固体平板上。大约10,000克隆在平板上被筛选,阳性突变体被挑出接种到96孔板中,在含有D-木糖的LB培养基中培养。利用本公开中记载的“ Xu4E突变体文库 的高通量筛选”方法,我们使用酶标仪对微孔板中的细胞培养液的荧光信号进行检测,确认阳性突变体。每轮几个阳性突变体被挑选。
从筛选得到的突变体中,我们从中选择了含有S125D的突变体M4。我们对突变体M4的比酶活进行了检测。结果发现,相对于天然酶,M4表现出25%比酶活的提高(图5)。
实施例5.继续增强活性的Xu4E突变体M47
从实施例4中选择的突变体M2出发,我们使用低突变率的易错PCR(即每基因产生~1个突变位点)去构建xu4e突变体文库。插入到质粒pGS-Xu4E的突变体文库被转入到宿主E.coli JZ919中,并涂布到含有D-木糖的LB培养基固体平板上。大约10,000克隆在平板上被筛选,阳性突变体被挑出接种到96孔板中,在含有D-木糖的LB培养基中培养。我们使用酶标仪对微孔板中的细胞培养液的荧光信号进行检测,确认阳性突变体。
从筛选得到的突变体中,我们从中选择了包含有两个氨基酸突变S125D/N297F的突变体M47。我们对前述突变体M47的比酶活进行了检测。结果发现,M47表现出更高的比酶活(图5)。
实施例6.继续增强活性的Xu4E突变体M57
从实施例5中选择的突变体M47出发,我们使用低突变率的易错PCR(即每基因产生~1个突变位点)去构建xu4e突变体文库。插入到质粒pGS-Xu4E的突变体文库被转入到宿主E.coli JZ919中,并涂布到含有D-木糖的LB培养基固体平板上。大约10,000克隆在平板 上被筛选,阳性突变体被挑出接种到96孔板中,在含有D-木糖的LB培养基中培养。我们使用酶标仪对微孔板中的细胞培养液的荧光信号进行检测,确认阳性突变体。
从筛选得到的突变体中,我们从中选择了包含有三个氨基酸突变S125D/V267I/N297F的突变体M57。我们对前述突变体M57的比酶活进行了检测。结果发现,M57表现出更高的比酶活(图5)。
实施例7.继续增强活性的Xu4E突变体M61
从实施例6中选择的突变体M57出发,我们使用低突变率的易错PCR(即每基因产生~1个突变位点)去构建xu4e突变体文库。插入到质粒pGS-Xu4E的突变体文库被转入到宿主E.coli JZ919中,并涂布到含有D-木糖的LB培养基固体平板上。大约20,000克隆在平板上被筛选,阳性突变体被挑出接种到96孔板中,在含有D-木糖的LB培养基中培养。我们使用酶标仪对微孔板中的细胞培养液的荧光信号进行检测,确认阳性突变体。
从筛选得到的突变体中,我们从中选择了包含有四个氨基酸突变S125D/V163K/V267I/N297F的突变体M61。我们对前述突变体M61的比酶活进行了检测。结果发现,M61表现出更高的比酶活(图5)。
实施例8.继续增强活性的Xu4E突变体M64
从实施例7中选择的突变体M61出发,我们使用低突变率的易错PCR(即每基因产生~1个突变位点)去构建xu4e突变体文库。插入到质粒pGS-Xu4E的突变体文库被转入到宿主E.coli JZ919中并涂布到含有D-木糖的LB培养基固体平板上。大约15,000克隆在平板上被筛选,阳性突变体被挑出接种到96深孔板中,在含有D-木糖的LB培养基中培养。我们使用酶标仪对微孔板中的细胞培养液的荧光信号进行检测,确认阳性突变体。
从筛选得到的突变体中,我们从中选择了突变体M64,其包含有五个氨基酸突变S125D/V163K/V267I/N297F/Y403W。我们对前述突变体M64的比酶活进行了检测。结果发现,M64表现出更高的比酶活(图5)。
实施例9.继续增强活性的Xu4E突变体M72
从实施例8中选择的突变体M64出发,我们使用低突变率的易错PCR(即每基因产生~1个突变位点)去构建xu4e突变体文库。插入到质粒pGS-Xu4E的突变体文库被转入到宿主E.coli JZ919中,并涂布到含有D-木糖的LB培养基固体平板上。大约12,000克隆在平板上被筛选,阳性突变体被挑出到96孔板中,在含有D-木糖的LB培养基中培养。我们使用酶标仪对微孔板中的细胞培养液的荧光信号进行检测,确认阳性突变体。
从筛选得到的突变体中,我们从中选择了包含有六个氨基酸突变S125D/V163K/V267I/N297F/S402V/Y403W的突变体M72。我们对前述突变体M72的比酶活进行了检测。结果发现,M72表现出更高的比酶活(图5)。
实施例10.继续增强活性的Xu4E突变体M75
从实施例9中选择的突变体M72出发,我们使用低突变率的易错PCR(即每基因产生~1个突变位点)去构建xu4e突变体文库。插入到质粒pGS-Xu4E的突变体文库被转入到宿主E.coli JZ919中并涂布到加入D-木糖的LB培养基固体平板上。大约18,000克隆在平板上被筛选,阳性突变体被挑出到96孔板中,在含有D-木糖的LB培养基中培养。我们使用酶标仪对微孔板中的细胞培养液的荧光信号进行检测,确认阳性突变体。
从筛选得到的突变体中,我们从中选择了包含有七个氨基酸突变S125D/V163K/V267I/N297F/W306M/S402V/Y403W的突变体M75。我们对前述突变体M75的比酶活进行了检测。结果发现,M75表现出更高的比酶活(图5)。
实施例11.继续增强活性的Xu4E突变体M87
从实施例9中选择的突变体M72出发,我们使用低突变率的易错PCR(即每基因产生1-2个突变位点)去构建xu4e突变体文库。插入到质粒pGS-Xu4E的突变体文库被转入到宿主E.coli JZ919中并涂布到加入D-木糖的LB培养基固体平板上。大约25,000克隆在平板 上被筛选,阳性突变体被挑出到96深孔板中,在含有D-木糖的LB培养基中培养。我们使用酶标仪对微孔板中的细胞培养液的荧光信号进行检测,确认阳性突变体。
从筛选得到的突变体中,我们从中选择了包含有九个氨基酸突变S125D/R131S/V163K/V267I/N297F/W306M/Q318K/S402V/Y403W的突变体M87。前述突变体M87的氨基酸序列为如SEQ ID NO:119所示的序列。
实施例12.M87突变体的比酶活测定
我们将实施例11得到的突变体M87的比酶活与野生型Xu4E进行了对比(图5)。
从比较的结果来看,突变体M87的比酶活大约为2U/mg。也就是说,突变体相较于野生型Xu4E,其比酶活有了明显的提高。
实施例13.单点氨基酸突变的突变体制备
从野生型TmXu4E开始,通过本公开中记载的“ 构建Xu4E突变体文库”的方法,我们制备了九个单一氨基酸位点的突变体文库,它们是通过定点饱和突变在野生型TmXu4E上改变一个氨基酸残基。其中,前述野生型TmXu4E的氨基酸序列为如SEQ ID NO:2所示的序列,编码前述氨基酸的核苷酸序列为如SEQ ID NO:1所示的序列。
九个氨基酸突变位点选自突变体M87,它们分别是在位置125的丝氨酸,在位置131的精氨酸,在位置161的缬氨酸,在位置267的缬氨酸,在位置297的天冬酰胺,在位置306的色氨酸,位置318的谷氨酰胺,位置402的丝氨酸,或位置403的酪氨酸。将9个突变体文库插入宿主大肠杆菌JZ919的质粒pGS-Xu4E,再将它们涂布在每个含有D-木糖的LB培养基。通过本公开记载的“ Xu4E突变体文库的高通量筛选”方法筛选阳性突变体。与野生型酶相比,阳性突变体在96孔微孔板中得到验证,并通过DNA测序进行测序。
从筛选得到的突变体中,我们从中选择了如下9个突变体:S125D,R131S,V163K,V267I,N297F,W306M,Q318K,S402V和Y403W,其中,前述突变体中氨基酸的编号均根据SEQ ID NO:2进行编号。
将上述9个突变体在大肠杆菌BL21(DE3)中过表达,并通过在含镍离子树脂上的亲和吸附进行纯化。突变体S125D,V163K,V267I,N297F,W306M,Q318K,S402V和Y403W的Xu4E酶的比活度在图6所示。其中,八个突变体的比活性高于野生型TmXu4E,而一个突变体R131S变异体的比活度比野生型TmXu4E略低。
实施例14.多点氨基酸Xu4E突变体的制备和比酶活测定
通过和实施例4-实施例11相同的方法,我们对Xu4E突变体进行了进一步的筛选。
经过筛选,我们得到基于Xu4E的多点氨基酸突变体如下(其中,前述Xu4E突变体均根据SEQ ID NO:2进行编号):
M41(双突变):V267I/N297F;
M46(双突变):W306M/Y403W;
M50(三突变):V163K/V267I/Y403W;
M58(四突变):V163K/V267I/N297F/Y403W;
M68(六突变):V163K/V267I/N297F/W306M/S402V/Y403W;
M78(七突变):R131S/V163K/V267I/N297F/W306M/S402V/Y403W。
通过本公开记载的“ Xu4E酶活测定”方法,对上述突变体的比酶活进行检测。
从实验结果来看,M41,M46,M50,M58,M68,M78酶活分别大约为0.30U/mg,0.21U/mg,0.18U/mg,0.33U/mg,0.41U/mg,0.57U/mg。
实施例15.单点氨基酸Xu4E突变体的制备和比酶活测定
通过和实施例4相同的方法,我们得到了Xu4E突变体库。
我们得到的Xu4E突变体如下(其中,前述Xu4E突变体均根据SEQ ID NO:2进行编号):
M13(单突变):D161A;
M19(单突变):E266A;
M1(单突变):G102L;
M30(单突变):K337D;
M31(单突变):D394M。
从实验结果来看,M13,M19,M1,M30,M31酶活分别大约为0.008U/mg,0.013U/mg,0.03U/mg,0.06U/mg,0.04U/mg。
从上述实验结果来看,虽然上述Xu4E突变体的比酶活和野生型TmXu4E(0.09U/mg)相比,存在一定程度的降低,但是,其仍然具有Xu4E酶活性。
实施例16.重组热稳酶的简单纯化
来源于嗜热栖热菌(T.thermophiles)的D-木糖异构酶,来源于海栖热孢菌(T.maritima)的Xu4E,以及来源于嗜热脂肪芽孢杆菌(G.stearothermophilus)的L-阿拉伯糖异构酶在大肠杆菌E.coli BL21(DE3)中异源表达。收集细胞并破碎后,细胞裂解液的上清液通过热处理(70℃,20min)。离心后,将包含三个热稳酶的上清液混合,用于D-木糖到L-阿拉伯糖的生物转化。目标蛋白的表达水平及纯化后的蛋白纯度使用SDS-PGAE检测(图7)。
实施例17.从D-木糖合成L-核酮糖
我们构建了一个包括D-木糖异构酶(D-XI)和Xu4E突变体M87的双酶体系,底物为50mM D-木糖。反应混合物包括100mM HEPES缓冲液(pH 8.0)、0.2mM Co 2+、1mM Mn 2+、1g/L Xu4E M8和1U/mL XI。反应液温和混合后在无氧条件下50℃进行反应。
经过12小时反应后,L-核酮糖使用HPLC检测,成功获得L-核酮糖。证明该人工路线结果与设计一致。
实施例18.利用50mM D-木糖合成L-阿拉伯糖
我们构建了一个包括D-木糖异构酶(D-XI)、Xu4E突变体M87和L-阿拉伯糖合成酶(L-AI)的三酶体系,底物为50mM D-木糖。反应混合物是100mM Tris缓冲液(pH 8.0)包括0.2mM Co 2+、1mM Mn 2+、1g/L Xu4E(其中,Xu4E选自野生型或M87突变体)、1U/mL D-XI和1U/mL L-AI。三酶和含底物的反应液混合后,在50℃进行反应。
含有Xu4E突变体M87的三酶体系,在8小时反应后生产了21mM L-阿拉伯糖,而包含了野生型的三酶体系24小时反应仅仅生产了1.25mM L-阿拉伯糖(图8)。
实施例19.利用500mM D-木糖合成L-阿拉伯糖
我们构建了一个包括D-木糖异构酶(D-XI)、Xu4E突变体M87和L-阿拉伯糖合成酶(L-AI)的三酶体系,底物为500mM D-木糖。反应混合物包括100mM Tris缓冲液(pH 8.0)、0.2mM Co 2+和1mM Mn 2+。Xu4E的浓度为15g/L,XI及AI的浓度提升为20U/mL。
含有M87的三酶体系4小时反应后生产了175mM L-阿拉伯糖。它的比体积生产速率达到6.56g L-阿拉伯糖/升/小时。反应8小时后,当反应达到平衡时,底物D-木糖为207mM,中间产物D-木酮糖及L-核酮糖分别为55及42mM,产物L-阿拉伯糖为196mM(图9)。
实施例20.利用D-木糖生产L-核糖
我们构建了一个包括D-木糖异构酶(D-XI)、Xu4E突变体M87和磷酸甘露糖异构酶(MPI)的三酶体系,底物为50mM D-木糖。来源于嗜热脱氮芽孢杆菌(G.thermodenitrificans)的mpi基因被克隆并插入到pET质粒(Kim等2014)。反应混合物是50mM Tris缓冲液(pH 8.0)包括0.2mM Co 2+、1mM Mn 2+、1g/L Xu4E(M87突变体)、1U/mL XI和1U/mL MPI。三酶和含底物的反应液混合后,催化反应在下40℃进行。
24小时反应后,L-核糖使用HPLC检测,证明该人工路线结果与设计一致。
实施例21.L-戊糖的生产及分离
由于差向异构酶及异构酶所催化的反应存在反应平衡,将目的底物与酶及底物/中间产物进行有效分离非常重要。比如,利用D-葡萄糖生产高果糖浆就是利用模拟移动床 (SMB)对D-葡萄糖和果糖浆进行分离,得到含高果糖的糖浆,而没有被利用D-葡萄糖被进一步循环利用(图10)。固定化的木糖异构酶以及包含木糖异构酶的固定化微生物全细胞已经被广泛用于工业上生产高果糖浆(图10)。将多个酶进行共固定,不限于纯化后的重组酶或含过表达重组酶的细胞粗裂解液,延长了酶的使用寿命,便于酶及产物/中间产物的分离,并且降低了酶的使用成本。此外,对包含所述酶的微生物全细胞进行固定化,提高生物催化剂的稳定性及生物催化剂的可重复利用性,是可选择的解决方案。此外,为提高反应速度,对微生物全细胞进行透化处理,也是可选择的解决方案。
在本实施例中,我们采用了模拟移动床(SMB),有关填充树脂不限于Shodex Sugar KS-801钠离子交换柱、Waters Sugar Pak I钙离子交换柱、Bio-Rad Aminex HPX-87P铅离子交换柱、Bio-Rad Aminex HPX-87H氢离子交换柱、或有着相似功能的色谱分离柱。
以L-阿拉伯糖的生产为例,模拟移动床(SMB)可用于将L-阿拉伯糖与未被利用的D-木糖及中间产物L-核酮糖和D-木酮糖进行分离(图10)。
实施例22.Xu4E催化的产物L-核酮糖的表征
用野生型Xu4E和Xu4E突变体M87催化底物D-木酮糖的反应。野生型Xu4E在50℃下进行24小时,Xu4E突变体M87在50℃下进行10分钟。通过配备有Waters Sugar Pak柱的HPLC分离底物D-木酮糖和产物L-核酮糖。
HPLC色谱图如图11所示。产物(L-核酮糖)的保留时间是15.49分钟的新峰(黑色实线),与L-核酮糖标准品的保留时间相同。与D-木酮糖和L-核酮糖的标准品相比,HPLC分离的产物峰由一级质谱(图12)和二级质谱(图13)表征。
实验结果清楚地证明,野生型Xu4E和Xu4E突变体M87能够催化从D-木酮糖到L-核酮糖的酶反应。
实施例23.不同来源的野生型Xu4E的突变体的功能检测
基于生物信息学的方法,我们将实施例3、实施例14和实施例15获得的不同野生型Xu4E的序列进行了分析比较。其中,*所示的是不同来源的野生型Xu4E的保守氨基酸位点。
与此同时,我们通过如本申请实施例记载的试验方法,按照与如SEQ ID NO:2所示的序列相比,计算序列同一性时的编号,对前述不同野生型的Xu4E中的氨基酸进行编号。
我们发现,如图14A和图14B所示,对于前述不同的野生型的Xu4E,在其对应于前述如SEQ ID NO:2所示的编号进行编号时,其对应于SEQ ID NO:2所示的编号的如下位置的氨基酸,进行突变后,仍然具有Xu4E活性:G102、S125、R131、D161、V163、E266、V267、N297、W306、Q318、K337、D394、S402和Y403。
实施例24.构建序列缩短的Xu4E突变体
我们将编码野生型Xu4E进行截短,将编码1-86位及196-236位的氨基酸序列分别或一起去除,并通过本公开记载的方法,判断前述截短后的Xu4E突变体的生物活性。
实验结果显示,序列截短后蛋白的活性仍保持Xu4E的活性,分别为野生型Xu4E酶活的90%(去除1-86位氨基酸),87%(去除196-236位氨基酸)或85%(同时去除1-86位氨基酸和196-236位氨基酸)。
实施例25.本公开中具有Xu4E活性的酶的统计
通过本公开实施例4-16所记载的方法,我们得到了具有Xu4E活性的酶的序列,以及其酶活的实验结果。
实验结果如表3所示。
表3本公开中具有Xu4E活性的酶的实验结果
Figure PCTCN2020133395-appb-000003
本说明书公开的所有技术特征都可以任何组合方式进行组合。本说明所公开的每个特征也可以被其它具有相同、相等或相似作用的特征所替换。因此,除非特殊说明,所公开的每一特征仅仅是一系列相等或相似特征的实例。
此外,从上述描述中,本领域技术人员可从本公开中很容易清楚本公开的关键特征,在不脱离本公开的精神及范围的情况下,可对发明进行很多修改以适应各种不同的使用目的及条件,因此这类修改也旨在落入所附权利要求书的范围内。
参考文献
Antila,J.,V.Ravanko and P.Walliander(1997).Method of preparing L-arabinose from sugar beet pulp.US6506897B1.
Bhosale,S.H.,M.B.Rao and V.V.Deshpande(1996)."Molecular and industrial aspects of glucose isomerase." Microbiological Reviews 60(2):280.
Boku,T.,I.Kusakabe,N.Matsuo and e.al.(2001).Method for isolating l-arabinose.JP2002262899A.
Izumori,K.,S.Sugimoto and B.Kraska(1980)."Induction of D-ribose isomerase by L-ribose in Mycobacterium smegmatis." Agric.Biol.Chem.44:223-225.
Izumori,K.,Y.Ueda and K.Yamanaka(1978)."Pentose metabolism in Mycobacterium smegmatis:comparison of L-arabinose isomerase induced by L-arabinose and D-galactose." J.Bacteriol.133:413-414.
Kim,K.-R.,E.-S.Seo and D.-K.Oh(2014)."l-Ribose Production from l-Arabinose by Immobilized Recombinant Escherichia coli Co-expressing the l-Arabinose Isomerase and Mannose-6-Phosphate Isomerase Genes from Geobacillus thermodenitrificans." Appl.Biochem.Biotechnol.172(1):275-288.
Kim,Y.S.,K.C.Shin,Y.R.Lim and D.K.Oh(2013)."Characterization of a recombinant L-rhamnose isomerase from Dictyoglomus turgidum and its application for L-rhamnulose production." Biotechnol.Lett.35:259-264.
Morimoto,A.,K.Mukai,Y.Nisikawa,H.Tanaka and G.Yoshikawa(2001).Process for producing l-arabinose,l-arabinose-containing enzymatically processed products,diet foods,diabetic diet foods and fruit or vegetable juices and process for producing the same.EP1167536A1.
Mortlock,R.P.(1966)."D-Arabinose isomerase." Methods Enzymol.9:583-585.
Park,C.S.,S.J.Yeom,Y.R.Lim,Y.S.Kim and D.K.Oh(2010)."Characterization of a recombinant thermostable L-rhamnose isomerase from Thermotoga maritima ATCC 43589 and its application in the production of L-lyxose and L-mannose." Biotechnol.Lett.32:1947-1953.
Rodionova,I.A.,D.A.Scott,N.V.Grishin,A.L.Osterman and D.A.Rodionov(2012)."Tagaturonate–fructuronate epimerase UxaE,a novel enzyme in the hexuronate catabolic network in Thermotoga maritima." Environ.Microbiol.14(11):2920-2934.
Uechi,K.,G.Takata,Y.Fukai,A.Yoshihara and K.Morimoto(2013)."Gene Cloning and Characterization of L-Ribulose 3-epimerase from Mesorhizobium loti and Its Application to Rare Sugar Production." Bioscience,Biotechnology,and Biochemistry 77(3):511-515.
Wu,R.,C.Ma,Y.-H.P.Zhang and Z.Zhu(2018)."Complete oxidation of xylose for bioelectricity generation by reconstructing a bacterial xylose utilization pathway in vitro." ChemCatChem 10(9):2030-2035.
Yang,S.J.,Y.H.Kim,S.B.Kim,S.W.Park,I.H.PARK,M.H.Kim and Y.M.Lee(2014).Production method for tagatose.WO2014196811A1,US20160138053A1..
Yang,S.J.,Y.H.Kim,I.H.PARK,Y.M.Lee,H.K.CHO,S.B.Kim and S.W.Park(2015).Hexuronate C4-epimerase mutant with improved conversion activity,and method for producing D-tagatose by using same.US10196626B2.

Claims (44)

  1. 一种多肽,所述多肽具有D-木酮糖4-差向异构酶活性,其中,所述多肽选自由(a)-(d)组成的组中的任一项:
    (a)由与SEQ ID NO:2-32任一项所示的序列具有至少60%、至少70%、至少80%或至少90%的序列同一性的序列编码的多肽;
    (b)由多核苷酸编码的多肽,所述多核苷酸在非常高严格条件下与(i)或(ii)所示的多核苷酸杂交:
    (i)编码如SEQ ID NO:2-32任一项所示的氨基酸序列的多核苷酸;
    (ii)(i)的全长互补多核苷酸;
    (c)多肽,所述多肽为如SEQ ID NO:2-32任一项所示的多肽的突变体,所述突变体在一个或多个位置处包含取代、重复、缺失或添加一个或多个氨基酸,并且所述多肽仍然具有D-木酮糖4-差向异构酶活性;以及
    (d)由(a)、(b)、(c)所示的多肽的片段,并且所述片段具有D-木酮糖4-差向异构酶活性。
  2. 根据权利要求1所述的多肽,其中,所述多肽为突变体,并且所述多肽与SEQ ID NO:2-32任一项所示的多肽相比,具有至少95%、至少96%、至少97%、至少98%或至少99%的序列同一性。
  3. 根据权利要求1-2任一项所述的多肽,其中,所述多肽为如SEQ ID NO:2-32任一项所示的多肽的突变体,所述突变体在至少1个、至少2个、至少3个、至少4个、至少5个、至少6个、至少7个、至少8个、至少9个位点处包含氨基酸的突变,并且所述多肽仍然具有D-木酮糖4-差向异构酶活性。
  4. 根据权利要求1-3任一项所述的多肽,其中,所述多肽为如下所示的多肽:
    (a)与SEQ ID NO:2所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点处包含突变,其中,所述位点为选自由102、125、131、161、163、266、267、297、306、318、337、394、402和403组成的组中的一种或多种;
    (b)与SEQ ID NO:3所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由102、125、131、161、163、266、267、297、306、318、337、394、402和403组成的组中的一种或多种;
    (c)与SEQ ID NO:4所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由102、125、131、161、163、266、267、297、306、318、337、394、402和403组成的组中的一种或多种;
    (d)与SEQ ID NO:5所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由105、128、134、164、166、270、271、301、310、322、341、398、406、407组成的组中的一种或多种;
    (e)与SEQ ID NO:6所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由105、128、134、164、166、269、270、300、309、321、340、397、405、406组成的组中的一种或多种;
    (f)与SEQ ID NO:7所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由117、140、146、176、178、285、286、316、325、337、355、412、420、421组成的组中的一种或多种;
    (g)与SEQ ID NO:8所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由125、148、154、184、186、293、294、324、333、345、363、420、428、429组成的组中的一种或多种;
    (h)与SEQ ID NO:9所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由124、147、153、183、185、297、298、328、337、349、368、425、433、434组成的组中的一种或多种;
    (i)与SEQ ID NO:10所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由108、131、137、167、169、276、277、307、316、328、346、403、411、412组成的组中的一种或多种;
    (j)与SEQ ID NO:11所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由115、138、144、174、176、280、281、311、320、332、351、408、416、417组成的组中的一种或多种;
    (k)与SEQ ID NO:12所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由107、130、136、166、168、272、273、303、312、324、343、400、408、409组成的组中的一种或多种;
    (l)与SEQ ID NO:13所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由109、132、138、168、170、275、276、306、315、327、346、403、411、412组成的组中的一种或多种;
    (m)与SEQ ID NO:14所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由103、126、132、162、164、267、268、298、307、319、338、395、403、404组成的组中的一种或多种;
    (n)与SEQ ID NO:15所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由105、128、134、164、166、271、272、302、311、323、342、399、407、408组成的组中的一种或多种;
    (o)与SEQ ID NO:16所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由64、88、94、123、125、236、237、267、274、286、373、381、382组成的组中的一种或多种;
    (p)与SEQ ID NO:17所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由110、133、139、169、171、271、272、302、311、323、342、399、407、408组成的组中的一种或多种;
    (q)与SEQ ID NO:18所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由102、125、131、161、163、266、267、297、306、318、337、394、402、403组成的组中的一种或多种;
    (r)与SEQ ID NO:19所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由121、144、150、180、182、289、290、320、329、341、359、416、424、425组成的组中的一种或多种;
    (s)与SEQ ID NO:20所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由107、130、136、166、168、273、274、304、313、325、344、401、409、410组成的组中的一种或多种;
    (t)与SEQ ID NO:21所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由21、48、54、84、86、182、183、213、222、234、260、324、332、333组成的组中的一种或多种;
    (u)与SEQ ID NO:22所示的序列相比,所述多肽的氨基酸序列中的一个或多个氨基酸在对应于以下位点包含突变,其中,所述位点为选自由30、55、61、91、93、202、203、233、242、254、273、330、338、339组成的组中的一种或多种。
  5. 根据权利要求1-4任一项所述的多肽,其中,所述多肽为在对应于SEQ ID NO:2上的,存在如下所示的突变的多肽:
    (a)在102位上的突变;
    (b)在125位上的突变;
    (c)在131位上的突变;
    (d)在161位上的突变;
    (e)在163位上的突变;
    (f)在266位上的突变;
    (g)在267位上的突变;
    (h)在297位上的突变;
    (i)在306位上的突变;
    (j)在318位上的突变;
    (k)在337位上的突变;
    (l)在394位上的突变;
    (m)在402位上的突变;
    (n)在403位上的突变
    (o)在267和297位上的组合突变;
    (p)在306和403位上的组合突变;
    (q)在125和297位上的组合突变;
    (r)在163、267、403位上的组合突变;
    (s)在125、267和297位上的组合突变;
    (t)在163、267、297和403位上的组合突变;
    (u)在125、163、267和297位上的组合突变;
    (v)在125、163、267、297和403位上的组合突变;
    (w)在125、163、267、297、402和403位上的组合突变;
    (x)在163、267、297、306、402和403位上的组合突变;
    (y)在125、163、267、297、306、402和403位上的组合突变;
    (z)在125、131、163、267、297、306、402和403位上的组合突变;
    (aa)在125、163、267、297、306、318、402和403位上的组合突变;
    (bb)在125、131、163、267、297、306、318、402和403位上的组合突变。
  6. 根据权利要求1-5任一项所述的多肽,其中,所述多肽为在对应于SEQ ID NO:2上的以下任一种位点的突变的多肽:
    (a)对应于SEQ ID NO:2上的位点102的氨基酸由甘氨酸(G)突变为亮氨酸(L);
    (b)对应于SEQ ID NO:2上的位点125的氨基酸由丝氨酸(S)突变为天门冬氨酸(D)、半胱氨酸(C)、酪氨酸(Y)、谷氨酰胺(Q)、谷氨酸(E)、苏氨酸(T)或天冬酰胺(N);
    (c)对应于SEQ ID NO:2上的位点131的氨基酸由精氨酸(R)突变为天门冬氨酸(D)、苏氨酸(T)、谷氨酸(E)或丝氨酸(S);
    (d)对应于SEQ ID NO:2上的位点161的氨基酸由天门冬氨酸(D)突变为丙氨酸(A);
    (e)对应于SEQ ID NO:2上的位点163的氨基酸由缬氨酸(V)突变为赖氨酸(K)、精氨酸(R)、丝氨酸(S)、异亮氨酸(I)或蛋氨酸(M);
    (f)对应于SEQ ID NO:2上的位点266的氨基酸由谷氨酸(E)突变为丙氨酸(A);
    (g)对应于SEQ ID NO:2上的位点267的氨基酸由缬氨酸(V)突变为亮氨酸(L)、蛋氨酸(M)或异亮氨酸(I);
    (h)对应于SEQ ID NO:2上的位点297的氨基酸由天冬酰胺(N)突变为苯丙氨酸(F)、酪氨酸(Y)或赖氨酸(K);
    (i)对应于SEQ ID NO:2上的位点306的氨基酸由色氨酸(W)突变为蛋氨酸(M)、丝氨酸(S)或苏氨酸(T);
    (j)对应于SEQ ID NO:2上的位点318的氨基酸由谷氨酰胺(Q)突变为赖氨酸(K);
    (k)对应于SEQ ID NO:2上的位点337的氨基酸由赖氨酸(K)突变为天门冬氨酸(D);
    (l)对应于SEQ ID NO:2上的位点394的氨基酸由天门冬氨酸(D)突变为蛋氨酸(M);
    (m)对应于SEQ ID NO:2上的位点402的氨基酸由丝氨酸(S)突变为缬氨酸(V)、亮氨酸(L)、苯丙氨酸(F)、半胱氨酸(C)或酪氨酸(Y);
    (n)对应于SEQ ID NO:2上的位点403的氨基酸由酪氨酸(Y)突变为色氨酸(W)、苏氨酸(T)、异亮氨酸(I)或苯丙氨酸(F)。
  7. 根据权利要求1-6任一项所述的多肽,其中,所述多肽来源于海栖热袍菌(Thermotoga maritima),新阿波罗栖热袍菌(Thermotoga neapolitana),栖热袍菌(Thermotoga sp),Thermotoga caldifontis,产假假单胞菌(Pseudothermotoga lettingae),牙合嗜盐菌(Halanaerobium congolense),Thermosediminibacter litoriperuensis,Rhodothermus marinus,Gracilibacillus timonensis,栖热袍菌(Thermotogae bacterium),Thermotogae bacterium,醋酸假丝酵母(Candidatus Acetothermia bacterium),Pseudothermotoga thermarum,解热嗜热厌氧杆菌(Thermoanaerobacterium thermosaccharolyticum),Thermofilum adornatus,Thermoanaerobacter italicus,Thermotoga naphthophila,Thermoclostridium stercorarium,Dictyoglomus thermophilum,Spirochaeta thermophila,Singulisphaera acidiphila,Thermotoga caldifontis,Pseudothermotoga lettingae,枯草杆菌(Bacillus subtilis),Geobacillus zalihae,Geobacillus stearothermophilus,Parageobacillus thermoglucosidasius,Thermoanaero-bacterium thermosaccharolyticum或大肠杆菌(Escherichia coli)。
  8. 根据权利要求1所述的多肽,其中,所述多肽包括在如SEQ ID NO:2-32所示的多肽的N端或中间部位或C端缺失一个或超过一个氨基酸残基。
  9. 根据权利要求8所述的多肽,其中,所述多肽选自下组:
    (i)对应于如SEQ ID NO:2所示的多肽的N端起,缺失1-100个氨基酸,优选1-90个氨酸酸,更优选1-86个,更优选1-50个,更优选1-30个,最优选1-10个氨基酸而形成,并且具有催化D-木酮糖转化为L-核酮糖活性;或
    (ii)对应于SEQ ID NO:2所示的多肽的第196-236位氨基酸中,缺失1-41个氨基酸,优选1-30个,更优选1-20个,最优选1-10个氨基酸而形成的,且具有催化D-木酮糖转化为L-核酮糖活性。
  10. 根据权利要求8-9任一项所述的多肽,其中,所述多肽选自下组:
    (i)对应于如SEQ ID NO:2所示的多肽中,缺失第1-86位氨基酸,并且具有催化D-木酮糖转化为L-核酮糖活性;
    (ii)对应于SEQ ID NO:2所示的多肽中,缺失第196-236位氨基酸,并且具有催化D-木酮糖转化为L-核酮糖活性;或
    (iii)对应于SEQ ID NO:2所示的多肽中,缺失第1-86位氨基酸和第196-236位氨基酸,并且具有催化D-木酮糖转化为L-核酮糖活性。
  11. 根据权利要求1-10任一项所述的多肽,其中,所述多肽包含与SEQ ID NO:33-122所示的序列具有至少96%的序列同一性的氨基酸序列或由其组成;可选的,所述多肽包含与如SEQ ID NO:33-122所编码的多肽中的任一项具有至少98.3%、至少98.5%、至少98.7%、至少98.9%、至少99.1%、至少99.3%、至少99.5%、至少99.7%或100%的序列同一性的氨基酸序列或由其组成。
  12. 根据权利要求1-11任一项所述的多肽,其中,所述多肽和如SEQ ID NO:2所示的多肽相比,具有改进的D-木酮糖4-差向异构酶活性。
  13. 根据权利要求1-12任一项所述的多肽,其中,所述多肽由包含如SEQ ID NO:33-122所示的序列编码,或者所述多肽为由如SEQ ID NO:33-122所示的序列编码的多肽。
  14. 一种分离的多核苷酸,其中,所述多核苷酸包含编码根据权利要求1-13任一项所述多肽的核苷酸序列。
  15. 根据权利要求14所述的分离的多核苷酸,其在编码如SEQ ID NO:2-32任一项所 示的氨基酸的核苷酸中包含至少一个突变;优选的,所述多核苷酸序列编码如SEQ ID NO:33-122任一项所示的氨基酸。
  16. 一种核酸构建体,其包含根据权利要求14或15所述的多核苷酸,所述多核苷酸与一个或多个调控序列可操作地连接,所述调控序列指导多肽在表达宿主中产生。
  17. 一种重组表达载体,其包含根据权利要求16所述的核酸构建体。
  18. 一种重组宿主细胞,其包含根据权利要求16所述的核酸构建体或根据权利要求17所述的重组表达载体。
  19. 一种用于产生根据权利要求1-13中任一项所述多肽的方法,其中,所述方法包括如下步骤:(a)在有助于所述多肽产生的条件下培养细胞或菌株,所述细胞或菌株产生所述多肽;其中,所述细胞或菌株包含根据权利要求16所述的核酸构建体或权利要求17所述的重组表达载体,所述核酸构建体或所述重组表达载体包含编码所述多肽的核苷酸序列;
    可选的,所述方法还包括如下步骤:
    (b)纯化或回收所述多肽。
  20. 一种D-木酮糖转化为L-核酮糖的方法,其中,所述方法包括利用根据权利要求1-13任一项所述多肽进行催化反应。
  21. 根据权利要求20所述的方法,其中,所述多肽选自权利要求1-13任一项所述的多肽;优选的,所述多肽由包含如SEQ ID NO:2-122任一项所示的氨基酸序列编码,或者所述多肽为由如SEQ ID NO:2-122任一项所示的序列编码的多肽。
  22. 一种制备L-戊糖的方法,其中,所述制备L-戊糖的方法包括如下步骤:
    (a)采用D-木酮糖4-差向异构酶,将D-木酮糖转化为L-核酮糖;
    可选的,所述方法还包括:
    (b)采用D-木糖异构酶,将D-木糖转化为D-木酮糖。
  23. 根据权利要求22所述的方法,其中,所述方法进一步包括,采用L-阿拉伯糖异构酶,将L-核酮糖转化为L-阿拉伯糖,所述L-戊糖为L-阿拉伯糖。
  24. 根据权利要求22所述的方法,其中,所述方法进一步包括,采用L-核糖异构酶或甘露糖磷酸异构酶或它们的组合,将L-核酮糖转化为L-核糖,所述L-戊糖为L-核糖。
  25. 根据权利要求22所述的方法,其中,所述方法进一步包括,采用L-核酮糖3-差向异构酶,将L-核酮糖转化为L-木酮糖,所述L-戊糖为L-木酮糖。
  26. 根据权利要求22所述的方法,其中,所述方法进一步包括,采用L-核酮糖3-差向异构酶,将L-核酮糖转化为L-木酮糖,采用L-岩藻糖异构酶或D-阿拉伯糖异构酶或L-鼠李糖异构酶,将L-木酮糖转化为L-木糖,所述L-戊糖为L-木糖。
  27. 根据权利要求22所述的方法,其中,所述方法进一步包括,采用L-核酮糖3-差向异构酶,将L-核酮糖转化为L-木酮糖,采用L-鼠李糖异构酶将L-木酮糖转化为L-来苏糖,所述L-戊糖为L-来苏糖。
  28. 根据权利要求22-27任一项所述的方法,其中,所述D-木酮糖4-差向异构酶选自权利要求1-13任一项所述的多肽;优选的,所述D-木酮糖4-差向异构酶由包含如SEQ ID NO:2-122任一项所示的氨基酸序列编码,或者所述酶为由如SEQ ID NO:2-122任一项所示的序列编码的酶。
  29. 根据权利要求22-28任一项所述的方法,其中,所述方法中还包含纯化和/或分离所述L-戊糖的步骤。
  30. 根据权利要求29所述的方法,其中,所述分离步骤包含使用模拟移动床(SMB)分离的步骤。
  31. 根据权利要求22-30任一项所述的方法,其中,所述方法的反应体系中还包含酶反应液;优选的,所述酶反应液中含有金属离子;更优选的,所述金属离子可以为Co 2+、 Zn 2+、Mg 2+、Mn 2+、Ca 2+、Ni 2+、Cu 2+、Fe 2+、Sr 2+、Ba 2+、Cd 2+、Pb 2+、Fe 3+、Al 3+、Bi 3+、Ag +、Li +中的一种或多种。
  32. 根据权利要求22-31任一项所述的方法,其中,所述反应在有氧、微有氧或无氧条件下进行。
  33. 根据权利要求22-32任一项所述的方法,其中,所述反应在30℃-90℃温度下进行;优选的,所述反应在40℃-80℃温度下进行。
  34. 根据权利要求22-33任一项所述的方法,其中,所述反应在pH为3.0至11.0的范围内进行;优选的,所述反应在pH为4.0至10.0的范围内进行。
  35. 根据权利要求22-34任一项所述的方法,其中,所述反应在无氧、45-55℃温度、pH为8.0、金属离子为Co 2+或Mg 2+或Mn 2+或它们的组合的条件下进行。
  36. 根据权利要求22-35任一项所述的方法,其中,所述反应包括体外催化反应或全细胞生物催化反应。
  37. 根据权利要求36所述的方法,其中,所述反应为体外催化反应,所述体外催化反应可以分步进行或同时进行。
  38. 根据权利要求37所述的方法,其中,所述体外催化反应分步进行时,其在一个反应容器中或一个以上串联的反应容器中进行。
  39. 根据权利要求38所述的方法,其中,所述反应容器选自分批进料生物反应器、含固定化酶的填充床生物反应器、酶或细胞再循环生物反应器、含膜分离的生物反应器和连续进料生物反应器中的一种或多种。
  40. 根据权利要求22-39任一项所述的方法,其中,所述体外催化反应中的酶以游离酶、含有所述酶的细胞裂解液、含有所述酶的全细胞、固定化酶中的一种或多种形式存在。
  41. 根据权利要求36所述的方法,其中,所述全细胞生物催化反应的反应方式为利用细胞工厂进行反应,所述细胞中包含根据权利要求16所述的核酸构建体或根据权利要求17所述的重组表达载体。
  42. 多肽在制备L-戊糖中的应用,所述多肽选自根据权利要求1-13中任一项所述的多肽。
  43. 根据权利要求42所述的应用,其中,所述L-戊糖选自L-阿拉伯糖、L-核糖、L-核酮糖、L-木酮糖、L-木糖和L-来苏糖中的一种或多种。
  44. 多肽作为具有D-木酮糖4-差向异构酶活性的酶的用途,所述多肽选自根据权利要求1-13中任一项所述的多肽。
PCT/CN2020/133395 2019-12-31 2020-12-02 D-木酮糖4-差向异构酶、其突变体及其用途 WO2021135796A1 (zh)

Priority Applications (7)

Application Number Priority Date Filing Date Title
BR112022012915A BR112022012915A2 (pt) 2019-12-31 2020-12-02 Polipeptídeo que tem atividade de d-xilulose 4-epimerase, método para produzi-lo, uso deste, polinucleotídeo isolado, construto de ácido nucleico, vetor de expressão recombinante, célula hospedeira recombinante, método para converter d-xilulose em l-ribulose, método e uso de um polipeptídeo para preparar l-pentose
EP20909683.3A EP4083199A4 (en) 2019-12-31 2020-12-02 D-XYLULOSE-4-EPIMERASE, MUTANT THEREOF AND ITS USE
JP2022540978A JP2023509176A (ja) 2019-12-31 2020-12-02 D-キシルロース4-エピメラーゼ、その変異体及びそれらの使用
KR1020227025377A KR20220119128A (ko) 2019-12-31 2020-12-02 D-자일룰로오스 4-에피머라아제, 그 변이체 및 용도
MX2022008199A MX2022008199A (es) 2019-12-31 2020-12-02 D-xilulosa-4-epimerasa, mutantes y usos de los mismos.
US17/790,417 US20230220433A1 (en) 2019-12-31 2020-12-02 D-xylulose 4-epimerase, mutants and uses thereof
CA3163520A CA3163520A1 (en) 2019-12-31 2020-12-02 D-xylulose 4-epimerase, mutant thereof and use thereof

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201911404948.2 2019-12-31
CN201911404948.2A CN113122528A (zh) 2019-12-31 2019-12-31 D-木酮糖4-差向异构酶、其突变体及其用途

Publications (1)

Publication Number Publication Date
WO2021135796A1 true WO2021135796A1 (zh) 2021-07-08

Family

ID=76686438

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/133395 WO2021135796A1 (zh) 2019-12-31 2020-12-02 D-木酮糖4-差向异构酶、其突变体及其用途

Country Status (9)

Country Link
US (1) US20230220433A1 (zh)
EP (1) EP4083199A4 (zh)
JP (1) JP2023509176A (zh)
KR (1) KR20220119128A (zh)
CN (1) CN113122528A (zh)
BR (1) BR112022012915A2 (zh)
CA (1) CA3163520A1 (zh)
MX (1) MX2022008199A (zh)
WO (1) WO2021135796A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116254279A (zh) * 2022-09-30 2023-06-13 山东大学 一种利用双酶级联重组大肠杆菌催化木糖醇生物合成l-木糖的方法

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105431541A (zh) * 2013-06-05 2016-03-23 Cj第一制糖株式会社 塔格糖的生产方法
KR101610911B1 (ko) * 2013-05-09 2016-04-08 주식회사 삼양사 L-리불로스 5-인산 4-에피머화 효소를 이용한 과당에서 타가토스 생산
CN108368498A (zh) * 2016-07-29 2018-08-03 Cj第制糖株式会社 具有改良的d-塔格糖转化活性的己糖醛酸酯c4-差向异构酶变异体和用它制造d-塔格糖的方法

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101675069A (zh) * 2004-07-02 2010-03-17 梅坦诺米克斯有限公司 产生精细化学品的方法
DE102007016534B4 (de) * 2007-04-05 2019-03-28 Butalco Gmbh Neuer Expressionsvektor von Codon-optimierten Genen eines Arabinose-Stoffwechselweges zur Arabinose-Umsetzung in Hefe
WO2016201110A1 (en) * 2015-06-09 2016-12-15 Kembiotix Llc Method for producing carbohydrates from dihydroxyacetone
CN106754776B (zh) * 2016-12-26 2019-12-20 江南大学 一种催化木糖的比酶活提高的葡萄糖脱氢酶突变体

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101610911B1 (ko) * 2013-05-09 2016-04-08 주식회사 삼양사 L-리불로스 5-인산 4-에피머화 효소를 이용한 과당에서 타가토스 생산
CN105431541A (zh) * 2013-06-05 2016-03-23 Cj第一制糖株式会社 塔格糖的生产方法
CN108368498A (zh) * 2016-07-29 2018-08-03 Cj第制糖株式会社 具有改良的d-塔格糖转化活性的己糖醛酸酯c4-差向异构酶变异体和用它制造d-塔格糖的方法
CN108884454A (zh) * 2016-07-29 2018-11-23 Cj第制糖株式会社 具有改良的d-塔格糖转化活性的己糖醛酸酯c4-差向异构酶变异体和用它制造d-塔格糖的方法
CN109415715A (zh) * 2016-07-29 2019-03-01 Cj第制糖株式会社 具有改良的d-塔格糖转化活性的己糖醛酸酯c4-差向异构酶变异体和用它制造d-塔格糖的方法

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
HENRY C. VOGELCELESTE L. TODARO: "Handbook of Fermentation and Biochemical Engineering: Principles, Process Design and Instrumentation", 1997, JOHN WILEY AND SONS, INC
KIM JAE EUNG: "In Vitro Synthetic Biology Platform and Protein Engineering for Biorefinery", DISSERTATION FOR PHD IN BIOLOGICAL SYSTEMS ENGINEERING AT VIRGINIA POLYTECHNIC INSTITUTE AND STATE UNIVERSITY, 17 July 2017 (2017-07-17), XP055826050, Retrieved from the Internet <URL:https://vtechworks.lib.vt.edu/bitstream/handle/10919/86645/Kim_J_D_2017.pdf?sequence=1&isAllowed=y> *
P.P. STANBURY: "Principles of Fermentation Technology", 2003, BUTTERWORTH HEINEMAN

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116254279A (zh) * 2022-09-30 2023-06-13 山东大学 一种利用双酶级联重组大肠杆菌催化木糖醇生物合成l-木糖的方法
CN116254279B (zh) * 2022-09-30 2024-04-05 山东大学 一种利用双酶级联重组大肠杆菌催化木糖醇生物合成l-木糖的方法

Also Published As

Publication number Publication date
BR112022012915A2 (pt) 2022-09-06
MX2022008199A (es) 2022-08-04
CA3163520A1 (en) 2021-07-08
EP4083199A1 (en) 2022-11-02
KR20220119128A (ko) 2022-08-26
EP4083199A4 (en) 2023-06-14
CN113122528A (zh) 2021-07-16
JP2023509176A (ja) 2023-03-07
US20230220433A1 (en) 2023-07-13

Similar Documents

Publication Publication Date Title
CN111254129B (zh) 一种多聚磷酸激酶突变体及其应用
US9102922B2 (en) Trehalose phosphorylases and their use in the biocatalytic production of trehalose-analogues and glycosyl phosphates
CN113025592B (zh) 一种高性能多聚磷酸激酶突变体及其应用
JP2022544085A (ja) プシコース3-エピメラーゼミュータント、それを発現するための遺伝子工学菌、その固定化酵素及び固定化方法
CN113265382A (zh) 多聚磷酸激酶突变体
Patel et al. Substrate specificity of the Bacillus licheniformis lyxose isomerase YdaE and its application in in vitro catalysis for bioproduction of lyxose and glucose by two-step isomerization
JP7404537B2 (ja) アルロースエピマー化酵素変異体、その製造方法及びそれを利用したアルロースの製造方法
WO2021135796A1 (zh) D-木酮糖4-差向异构酶、其突变体及其用途
KR101057873B1 (ko) 알도헥소오스 에피머라아제 및 이를 이용한 알도헥소오스 에피머의 효소적 제조 방법
EP4273224A1 (en) Recombinant microorganism, preparation method therefor, and application of recombinant microorganism in production of tagatose
US20120021468A1 (en) Production of l-ribose and other rare sugars
CA2619488A1 (en) Production of l-ribose and other rare sugars
TWI719140B (zh) 新型多磷酸鹽依存性葡萄糖激酶與使用其製備葡萄糖-6-磷酸的方法
JP7094449B2 (ja) アルロースエピマー化酵素変異体、その製造方法及びこれを用いたアルロースの製造方法
US20240052389A1 (en) Allulose epimerase variant with excellent thermal stability, preparation method therefor, and preparation method for allulose using same
JP7445947B2 (ja) アラビノースイソメラーゼ変異体
ES2906870T3 (es) Nuevas glucosa isomerasas mejoradas
CN108220308B (zh) β-半乳糖苷酶及其基因和载体及菌株及含该酶的产品和将乳糖和果糖转化为乳果糖的方法
WO2015048452A2 (en) Methods and enzymes for producing hydroxymethylfurfural
CN117603931A (zh) 一种糖醇氧化酶及其应用
KR20180101310A (ko) 신규 폴리포스페이트-의존형 포도당인산화효소 및 이를 이용한 포도당-6-인산 제조방법

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20909683

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2022540978

Country of ref document: JP

Kind code of ref document: A

Ref document number: 3163520

Country of ref document: CA

REG Reference to national code

Ref country code: BR

Ref legal event code: B01A

Ref document number: 112022012915

Country of ref document: BR

ENP Entry into the national phase

Ref document number: 20227025377

Country of ref document: KR

Kind code of ref document: A

ENP Entry into the national phase

Ref document number: 2020909683

Country of ref document: EP

Effective date: 20220726

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 112022012915

Country of ref document: BR

Kind code of ref document: A2

Effective date: 20220628