WO2000077181A2 - Dna manipulation methods, applications for synthetic enzymes and use for polyketide production - Google Patents

Dna manipulation methods, applications for synthetic enzymes and use for polyketide production Download PDF

Info

Publication number
WO2000077181A2
WO2000077181A2 PCT/GB2000/002286 GB0002286W WO0077181A2 WO 2000077181 A2 WO2000077181 A2 WO 2000077181A2 GB 0002286 W GB0002286 W GB 0002286W WO 0077181 A2 WO0077181 A2 WO 0077181A2
Authority
WO
WIPO (PCT)
Prior art keywords
dna
enzyme
recognition sequence
domains
units
Prior art date
Application number
PCT/GB2000/002286
Other languages
French (fr)
Other versions
WO2000077181A3 (en
Inventor
Anand Ranganathan
Original Assignee
Qxyz Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qxyz Limited filed Critical Qxyz Limited
Priority to CA002376559A priority Critical patent/CA2376559A1/en
Priority to AU55457/00A priority patent/AU5545700A/en
Priority to EP00940533A priority patent/EP1190045A2/en
Publication of WO2000077181A2 publication Critical patent/WO2000077181A2/en
Publication of WO2000077181A3 publication Critical patent/WO2000077181A3/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034Isolating an individual clone by screening libraries
    • C12N15/1093General methods of preparing gene libraries, not provided for in other subgroups
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/52Genes encoding for enzymes or proenzymes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/64General methods for preparing the vector, for introducing it into the cell or for selecting the vector-containing host
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/66General methods for inserting a gene into a vector to form a recombinant vector using cleavage and ligation; Use of non-functional linkers or adaptors, e.g. linkers containing the sequence for a restriction endonuclease
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/87Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
    • C12N15/90Stable introduction of foreign DNA into chromosome
    • C12N15/902Stable introduction of foreign DNA into chromosome using homologous recombination
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P17/00Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
    • C12P17/02Oxygen as only ring hetero atoms
    • C12P17/06Oxygen as only ring hetero atoms containing a six-membered hetero ring, e.g. fluorescein
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P17/00Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
    • C12P17/02Oxygen as only ring hetero atoms
    • C12P17/08Oxygen as only ring hetero atoms containing a hetero ring of at least seven ring members, e.g. zearalenone, macrolide aglycons

Definitions

  • PKSs include examples of both type I (multifunctional enzyme) and type II (dissociable complex) organisation.
  • Taxol The isolation of the genes coding for the proteins that make the highly potent anti-cancer compound Taxol, has not as yet been reported.
  • the resulting choice for obtaining Taxol is either to cut down 200 Pacific Yew trees to obtain enough taxol for one chemotherapy session, or to make the drug chemically using one of the many exceedingly expensive and long chemical routes that have appeared recently in the literature.
  • the relocation of the thioesterase domain at the end of DEBS1 was the first example demonstrating the efficacy of repositioning domains in type I modular systems. Since then, numerous such experiments have been carried out in order to probe further the efficacy of these multienzymes.
  • the TE domain has been relocated at the end of module 5 as well as module 3 respectively (Kao et al., 1995, 1996). In both cases, the predicted compounds were produced that resulted from truncation of the progressing polyketide chain. Release of the 12-membered product in the former case showed that the thioesterase domain can indeed catalyse ring closure even for less energetically favourable reactions.
  • two products were produced, one of them thought to be resulting from spontaneous decarboxylation.
  • the first example of a chimaeric polyketide synthase constructed from a domain taken from a second PKS was demonstrated by Oliynyk ef al. (1996).
  • An acyltransferase domain (AT) from module 2 of the rapamycin polyketide synthase was used to replace the AT of module 1 in the DEBS1- TE system.
  • the resulting triketide lactone had a methyl group missing at position 5 of the six-membered ring. This was expected since the AT of module 2 of rap PKS (unlike the AT of module 1 of DEBS1 ) incorporates a malonyl-CoA extender unit, instead of a methylmalonyl-CoA unit.
  • PKS polyketide synthases
  • PKS type i polyketide synthases
  • the invention provides a method of assembling several DNA units in sequence in a DNA construct.
  • This method comprises the steps of: a) providing each DNA unit with a restriction enzyme recognition sequence at it's 5' end and with a recognition sequence for the same restriction enzyme at its 3' end that is combined with a recognition site for a DNA modification enzyme, b) providing a starting DNA construct having an accessible restriction site for the same or a compatible restriction enzyme and cleaving the starting DNA construct with such a restriction enzyme, c) inserting the desired DNA unit and bringing the ligated product into contact with a DNA modification enzyme such that the restriction site at the 3' end of the inserted DNA unit is abolished, d) cleaving the ligated product at an accessible unmodified recognition site for the same or a compatible restriction enzyme, e) repeating steps c) and d) to introduce each desired DNA unit to give a DNA construct containing all the desired units in sequence.
  • DNA units can be any desired DNA sequence, though usually they encode enzyme domains or modules of two or more enzyme domains.
  • the recognition sequences are usually positioned at the ends of the DNA unit once the DNA unit has been cut with the relevant enzyme, by this it is meant that the recognition sequences are adjacent to the coding sequence, or that they flank the said sequence.
  • An accessible rest ⁇ ction site is herein defined as a restriction site which is unmodified, such that it can be cleaved by a restriction enzyme that normally recognises the sequence of the site.
  • the accessible restriction site is preferably a unique site in the DNA unit or ligated product.
  • the DNA modification enzyme employed in the method can be a methylase for example the dam methylase of Esche chia coli. Other methylases such as dcm are also envisaged.
  • a particular method comprises the steps of a) providing each DNA unit with an Xba ⁇ recognition sequence
  • 5'XXTCTAGA3' (where XX is not GA) at it's 5' end and with an Xba ⁇ recognition sequence 5'GATCTAGA3' at its 3' end.
  • the recognition sequences for the restriction enzyme and the DNA modification enzyme employed in the method can be created in the DNA units prior to cutting with the restriction enzyme, for example by means of a primer extension reaction.
  • the preferred DNA construct made by the method can be an expression vector capable of facilitating expression of the protein encoded by the desired DNA units.
  • DNA modification can be removed and the restriction site re-established by replicating the ligated product in a dam- strain of E. coli by means of suitable vectors as known in the art.
  • the invention also encompasses DNA unit assemblies where any given restriction enzyme recognition site can be modified by addition of a certain combination of nucleotide bases in order for it to be protected.
  • the invention provides a method of making an assembly of several DNA units in sequence which method comprises the steps of: a) providing a first DNA unit with a recognition sequence for a first restriction enzyme at its 3' end, and cleaving the said first DNA unit with said first restriction enzyme, b) providing each other DNA unit with a recognition sequence at its 5' end for a second rest ⁇ ction enzyme which has a compatible ligation sequence with that of the first restriction enzyme, and an upstream recognition sequence for said first restriction enzyme and a downstream recognition sequence for a third restriction enzyme at its 3' end, and cleaving each said other DNA unit with the second and third restriction enzymes, c) ligating the said first DNA unit with a desired other DNA unit to form a ligated product such that the ligation of the two units abolishes the recognition site for the first restriction enzyme at the ligation junction, and cleaving the ligated product with said first restriction enzyme, d) ligating the product from c) with a desired DNA unit from b) to
  • a particular method comprises the steps of: a) providing a first DNA unit with an Xbal recognition sequence
  • the assembly can occur via stepwise addition of fragments to a vector.
  • the first DNA unit can be attached to the solid phase for use in step c). This permits the solid phase to be split and mixed between steps c), d), and e) to make several different assemblies.
  • Methods of attaching DNA units to the solid phase are well know in the art.
  • Preferred solid phase elements are beads attached to the DNA units via a biotinylated nucleotide, as known in the art.
  • the recognition sequences in one or more of the DNA units are preferably introduced by means of extension primers, as known in the art, though other methods such as the ligation of the required sequences or in vitro mutagenesis can also be employed.
  • the assembly of several DNA units can be inserted into an expression vector and thus used to transform a host capable of expressing the protein encoded by the insert of the vector.
  • the method is particularly useful where one or more of the DNA units encodes a catalytic or transport protein domain for example a ketoreductase domain from a PKS enzyme or an ACP domain from a hybrid poiyketide/peptide synthesising enzyme.
  • a catalytic or transport protein domain for example a ketoreductase domain from a PKS enzyme or an ACP domain from a hybrid poiyketide/peptide synthesising enzyme.
  • Such domains can be derived from enzyme domain DNA sequences from, for example, polyketide synthesising enzymes, peptide synthesising enzymes, hybrid peptide polyketide synthesising enzymes, fatty acid synthesising enzymes or other enzyme domains known in the art.
  • the DNA units used in the methods of the invention can encode modules comprising one or more catalytic or transport domains. Usually a module contains all of the domains required to complete one condensation step in the synthesis of a target molecule.
  • Alternative aspects of the invention resulting from the methods of the invention include: DNA constructs or vectors incorporating a DNA assembly encoding synthetic enzymes, synthetic enzymes encoded by such DNA assemblies, hosts expressing synthetic enzymes, hybrids of transformed hosts expressing synthetic enzymes, and compounds produced by the synthetic enzymes. Where the product produced by the synthetic enzyme exhibits toxicity to a host stain, this can be worked around e.g. by means of choosing a different strain or mutating the original strain to provide mutants which are more tolerant.
  • the diversity of compounds produced by hosts transformed with the synthetic enzymes of the invention can be further increased by using known methods of using different feedstocks in the fermentation to provide different starter units for the desired product. Where yield of desired synthetic enzyme product is low, routine steps e.g. mutation and selection, can be taken to improve this,
  • the synthetic enzymes of the invention can also be used in cell-free systems to produce the desired target molecule in vitro as known in the art, for example, see Carreras and Khosla (1998).
  • the invention provides a method of synthesising a target molecule comprising the steps of a) examining the composition and stereochemistry of a target molecule, b) determining which catalytic and transport domains need to be present in a synthetic enzyme in order to catalyse the synthesis of the target molecule, c) using any one of the methods of the invention to assemble the required DNA units encoding the catalytic and transport domains into a DNA assembly that encodes said synthetic enzyme which is capable of synthesising the target molecule. d) placing the DNA assembly into a vector to allow expression of the synthetic enzyme in a host capable of synthesising the target molecule after transformation with said vector.
  • Target molecules are generally bio-active molecules, usually having a predominantly carbon based backbone and usually are macromolecules comprised of condensed units.
  • the transformed host can be tested for the presence of the target molecule after step d). If yields of the desired compound are low then conventional methods of improving product yield from, for example Streptomycetes, can be employed.
  • Transformed hosts which result from the methods of the invention and their use in producing target molecules are also aspects of the invention.
  • Hosts suitable for transformation with the DNA assemblies of the invention are known in the art and include insect or mammalian cells, though more usually suitable are bacterial cells, for example, the improved host strains described by Ziermann and Betlach (1999).
  • the synthetic enzyme can be used in a cell-free system to produce the target molecule in vitro.
  • a further aspect of the invention is a method of making a synthetic enzyme to catalyse the synthesis of a target molecule comprising the steps of a) examining the composition and stereochemistry of a target molecule, b) determining which catalytic and transport domains need to be present in the synthetic enzyme in order to catalyse synthesis of the target molecule, c) using any one of the methods of the invention to assemble the required DNA units encoding the catalytic and transport domains into a DNA assembly that encodes an enzyme which is capable of synthesising the target molecule, d) expressing the DNA assembly in a suitable host to produce the enzyme.
  • each DNA unit has a recognition sequence for a restriction enzyme at it's 5'-end and a second recognition sequence for the same or a compatible enzyme at it's 3'-end which incorporates a recognition sequence for a DNA modifying enzyme.
  • each DNA unit has an Xbal recognition sequence 5'XXTCTAGA3' (where XX is not GA) at it's 5'-end and an Xbal recognition sequence 5'GATCTAGA3' at it's 3'-end
  • each DNA unit has a recognition sequence at its 5' end for a first restriction enzyme, and a downstream recognition sequence for a second restriction enzyme followed by a downstream recognition sequence for a third restriction enzyme at its 3' end, such that the DNA units, once restricted by the first and second restriction enzymes can be ligated together to abolish the restriction sites at the ligation junction.
  • each DNA unit has a Spel recognition sequence 5 ⁇ CTAGT3' at its 5'-end, and a downstream Xbal recognition sequence 5TCTAGA3' followed by a downstream Smal recognition sequence 5'CCCGGG3' at it's 3'-end
  • Catalytic or transport protein domains can be derived from any enzyme, for example those listed above.
  • Particularly envisaged are libraries in which the DNA units encode polyketide synthetic domains, comprising two KS domains, at least two AT domains, two KR domains, two DH domains, two ER domains, an ACP domain and a TE domain.
  • modules comprising a DNA sequence encoding a functional set of polyketide synthetic domains wherein the module has a recognition sequence for a restriction enzyme at it's 5'-end and a second recognition sequence for the same or a compatible enzyme at it's 3'-end which incorporates a recognition sequence for a DNA modifying enzyme.
  • An envisaged module has an Xbal recognition sequence 5'XXTCTAGA3' (where XX is not GA) at it's 5'-end and an Xbal recognition sequence 5'GATCTAGA3' at it's 3'-end
  • a module comprising a DNA sequence encoding a functional set of polyketide synthetic domains can have a recognition sequence at its 5' end for a first restriction enzyme, and a downstream recognition sequence for a second restriction enzyme followed by a downstream recognition sequence for a third restriction enzyme at its 3' end, such that the DNA units, once restricted by the first and second restriction enzymes can be ligated together to abolish the restriction sites at the ligation junction.
  • the module has a Spel recognition sequence 5 ⁇ CTAGT3' at its 5'-end, and an upstream Xbal recognition sequence 5TCTAGA3' and a downstream Smal recognition sequence 5'CCCGGG3' at it's 3'-end.
  • modules wherein the DNA units encode polyketide synthetic domains, comprising two KS domains, at least two AT domains, two KR domains, two DH domains, two ER domains, an ACP domain and a TE domain. It is also envisaged that other non-polyketide enzyme domains can be included in the modules provided by the invention.
  • vectors containing one or more modules are also provided. Particularly useful are vectors in which a non-functional recA gene is also present. Such vectors prevent unwanted homologous recombination occurring between domains within the vector upon integration into a suitable host by abolishing the recA gene activity in that host.
  • the invention also provides a method of transforming a host with one or more synthetic DNA assemblies encoding enzyme domains which comprises the steps of: a) Inserting said DNA assembly into a vector containing a mutated internal fragment of a recA gene sequence such that the vector is capable of undergoing homologous recombination with the recA gene of the host, b) bringing said vector into contact with a host chromosome under conditions which permit homologous recombination to take place, c) disrupting the host recA gene by the integration of the DNA of said vector into the chromosome.
  • the expression vector can be used to transform a Steptomyces host.
  • the DNA assemblies contained in the vector can be modules as described herein. Also envisaged are transformed hosts which prior to transformation with a vector containing one or more modules according to the invention, were already lacking a recA function.
  • kits containing DNA units, DNA modules, vectors, DNA manipulation hosts, DNA modification hosts, expression hosts, or solid phase elements for use in the methods of the invention.
  • a kit might contain a first DNA unit which is a vector suitable for transforming a suitable host, a library of modules for insertion into that vector, both the first DNA unit and the library having the necessary recognition sites for use in the methods of the invention, together with host strains suitable for the manipulation and expression of the DNA assemblies of the invention.
  • a de novo "domain-by-domain" reconstruction of a hybrid multienzyme from the erythromycin-producing PKS has been achieved by the inventors by assembling DNA units corresponding to the constituent domains. The assembled gene was expressed in S. erythraea and the expected compounds were isolated from the bacterial broth.
  • Application of this methodology, or variations of this methodology for making combinatorial assemblies of complex and aromatic PKSs allows for the rapid generation of novel or altered PKS or other synthetic multienzymes and paves the way for a quick and inexpensive synthesis of potentially bio- active molecules.
  • One alternative to chemical syntheses is to carry out a 'retrobiosynthetic analysis' of the desired molecule, by pinpointing the exact number and type of synthetic enzyme domains that are required for every chemical step, and then assembling the DNA units that encode these enzymes in order to make a hybrid synthetic enzyme.
  • the aim is therefore, to assemble these domains or even modules in a manner as desired, so that the linked enzymes can carry out a progressive synthesis of a desired target molecule.
  • flanking both ends of the DNA of the desired DNA unit (domain or module) with a recognition sequence that is cleaved on one end by Xbal, and on the other end by a restriction enzyme that is compatible with Xbal (e.g. Spel) is possible.
  • This strategy makes use of selective recognition of the restriction enzyme site by the restriction enzyme Xbal, depending upon the sequence adjacent to the restriction enzyme site and upon the strain used (dam + or dam " ) during the assembly process.
  • the method has been shown to be successful, and by using this methodology to assemble modules, the complete erythromycin-producing PKS (comprising of six modules coded by three large open reading frames) can be built in under 10 days. Even though this time-period is small compared to what it would take to assemble the ery PKS genes using conventional methodologies, using a variation of the above mentioned methodology, complete gene-clusters, like the 33 kbp erythromycin PKS, can be built within a matter of hours.
  • the methodology thus outlined requires DNA units to be modified so that they contain the appropriate 5'and 3' ends (X and X d respectively). These units are then progressively assembled to achieve the desired gene length. The vector containing the assembled or reconstructed gene is then used to transform an expression system to achieve protein expression. This methodology has been shown to work effectively - the hybrid multienzyme DEBS1-TE was reconstructed by assembling de novo the ten constituent domains. The assembled gene, when expressed in S. erythraea gave the expected six-membered triketide lactones.
  • inter-modular recombination events within the reconstituted PKS or other synthetic enzyme gene may preclude the use of identical PKS or other enzyme domain DNA units in a set of modules. It might be expected that, for example ( Figure 2) the ACP * DNA in module 1 to recombine with the identical ACP * DNA in module 3. This event can take place, for example, when the expression vector that possesses the assembled gene containing numerous identical PKS DNA units is used to transform a streptomyces host for polyketide production.
  • the inventors have developed a strategy that can circumvent this problem, therefore making it possible to construct large synthetic enzyme gene clusters using identical domains or modules repeatedly. This translates into a less expensive route towards synthetic enzyme gene construction (one would not require to have a start-up library of 200 or so to cover all possibilities), as the set of 12 domains, or similar functional arrangements of domains, are true "off-the-shelf components for the assembly of PKS genes or genes for other hybrid synthetic enzymes.
  • the inventors provide methods of DNA assembly that pave the way for a cheap and fast synthesis of a host of bio-active molecules, e.g. the anti- cancer drug Discodermolide.
  • Figure 1 shows the chemical/stereochemical choices that each PKS domain can make. A total of 12 domains are required for every conceivable polyketide reaction.
  • Figure 2 shows integration of a plasmid containing more than one identical DNA unit (ACP * ). After the plasmid has integrated in the streptomyces host through homologous recombination with TE, internal recombination can occur to yield truncated PKS genes. This is because the host is recA + .
  • Figures 3A and 3B show a schematic representation of the assembly process.
  • the de novo construction of DEBS1-TE DNA fragments (units) encoding for the constituent domains of the multienzyme DEBS1-TE were inserted sequentially into the expression plasmid pCJR24.
  • the final plasmid pAR10 was then expressed in S. erythraea/JC2 to yield the expected triketide lactone products that are synthesised by the schematically shown re-assembled DEBS1-TE synthase.
  • the amino acid changes made within the linker regions between domains are shown below the actual amino acid sequence.
  • Figure 4 shows the methodology of the assembly of DNA units using Xbal/dam methylase technology.
  • transformation of a Dam ' strain with plasmid (as it is a dam ' strain, even X d would be cleaved by Xbal) is effected.
  • Cutting is achieved by Xbal and the DNA unit purified on a gel.
  • Figure 5 shows the procedure for the assembly of DNA units using Xbal/dam methylase technology.
  • Figure 6 shows how an Xbal site can be made sensitive to methylation.
  • the RE cuts at the sites shown by arrows.
  • the boxed sequence is methylated in a dam + strain thereby altering the Xbal recognition site.
  • the sequence however is not methylated in a dam strain, and so can still be cleaved by Xbal.
  • the Xbal recognition sequence (5 CTAGA3') can therefore be selectively cleaved by Xbal. Assembly of DNA units uses only one restriction enzyme - Xbal.
  • Figure 7 shows the methodology of the in vitro assembly of DNA units - I using solid phase beads with the enzymes Xbal, Spel and Smal (other Xbal - compatible REs may be used).
  • Figures 8 and 9 show how the methodology of the in vitro assembly of DNA units - II would proceed to the point of placing the DNA assembly into an expression vector for transforming and appropriate host.
  • Figure 10 shows how in one single ligation, 16 ongoing assemblies are generated. This cascade can obtain exponential proportions.
  • the gene library can be increased by increasing the diversity of the incoming unit.
  • Figure 11 shows the integration of an expression plasmid into a streptomyces host, using a mutated internal fragment of the recA gene as the region for homologous recombination.
  • the resulting PKS gene can now contain more than one identical DNA units as the strain has been made recA minus.
  • Figure 12 shows the assembled PKS recADEBS1-TE.
  • the second module is composed of domains that normally belong to the first module.
  • Figures 14A and 14B show a DNA sequence alignment of the recA gene S. lividans (S.I) and S. ambofaciens (S.a). Start of the gene is from 'ATG' and stop is TGA'. Percent similarity: 94.713, percent identity: 94.713.
  • Figure 15 shows how an Xbal/Spel system might be used instead of an Xbal/dam methylase system to assemble DNA units, a strategy involving compatible restriction enzymes flanking either end of a DNA unit.
  • An example of compatible REs would be Xbal and Spel.
  • the recognition sequence of Xbal is - 5TCTAGA3' and that for Spel is 5 ⁇ CTAGT3'. After Xbal and Spel have cleaved the DNA at their respective sites, the DNA unit can be ligated together as the overhanging is complementary. The junction where any two units are joined is now recognised by either Xbal or Spel.
  • Figure 16 is a schematic representation of the compatibility of Xbal- and Spel- digested DNA overhangs. It shows the compatibility of the sticky ends produced by Xbal and Spel and how re-ligation abolishes both sites.
  • Figure 17 shows a schematic representation of the erythromycin-producing polyketide synthase; primary organisation of the genes and their corresponding protein domains.
  • the multienzymes deoxyerythronolide B synthase 1 (DEBS1), DEBS2 and DEBS3 each have two modules, each of which processes one cycle of polyketide chain extension. Each of the six modules is constituted by covalently-linked enzymatic domains. Exploitation of such an enzymatic hierarchy as "of-the-shelf reagents can lead to synthesis of important chemical compounds.
  • Figure 18 shows the structure of the anticancer drug discodermolide (top) and the 'retrobiosynthetic approach' towards synthesising a target molecule (a discodermolide).
  • a discodermolide a target molecule
  • Such an approach would involve opening up the structure (a.), identifying the number and type of polyketide carbon units that would make the discodermolide carbon skeleton (b.), and choosing the PKS DNA units (modules/domains) responsible for the uptake and subsequent processing of the carbon units (c).
  • Figure 19 shows the anti-tumour compound octalactin and the strategy behind the retrobiosynthetic approach towards synthesising bio-active molecules.
  • the strategy comprises the steps of: 21a
  • Figure 20 shows a schematic representation of they hypothetical polyketide synthase for synthesising octalactin B, assembled from enzyme units that belong to various PKSs in the public domain.
  • Figure 21 shows a schematic representation of the hypothetical decarestrictine polyketide synthase for synthesising the anti-cholesterol compound decarestrictine J, assembled from enzyme units that belong to various PKSs in the public domain.
  • Example 1 Vectorial assembly of DNA units
  • DNA units that are to be assembled contain the Xbal recognition sequence at either end of the unit.
  • two nucleotides are arranged at the 5' end of the Xbal recognition sequence (thus making it 5'GATCTAGA3'). This is achieved by first incorporating the Xbal recognition sequences in the oligonucleotide primers and then amplifying the desired DNA unit by PCR. The PCR products are then ligated to a pUC-18 vector, used to transform a dam + strain of E. coli, and the clones isolated and sequenced for possible errors in the PCR products. A dam + strain of E.
  • coli- like DH10BTM - methylate the nucleotide A in the sequence GATCTAGA, as 5'GATC3' is a sequence that is recognised by the product of the Dam methylase gene (Fujimoto ef a/.,1965; Geier et al., 1979). This makes only one end of the DNA unit cleavable by Xbal.
  • the vector is then used to transform a dam " strain of E. coli (e.g. ET12567 - MacNeil et al. (1992)) and the plasmid DNA isolated. This DNA is now cleavable at bofb ends of the DNA unit by Xbal.
  • DEBS1-TE a multienzyme that has the first of the three bimodular erythromycin DEBS enzymes (DEBS1), fused with the erythromycin thioesterase (Cortes et al., 1995) was constructed in a de novo fashion.
  • DEBS1-TE a multienzyme that has the first of the three bimodular erythromycin DEBS enzymes (DEBS1), fused with the erythromycin thioesterase (Cortes et al., 1995) was constructed in a de novo fashion.
  • the ten inherent PKS domains in DEBS1-TE namely, loading module (itself composed of an AT and an ACP), KS1 (ketosynthase of module 1), AT1 , KR1 , ACP1 , KS2 - 23 -
  • the DNA for all ten domains was amplified by PCR to incorporate the two aforementioned recognition sequences for Xbal (5TCTAGA3' and 5'GATCTAGA3') at the 5' and 3' ends of the DNA unit respectively.
  • the PCR products were cloned in pUC18 vector, sequenced, and then used to transform the dam " E. co// ET12567 strain.
  • the DNA unit for TE was inserted into S. erythraea expression vector pCJR24 (Rowe et al., 1998) which has a unique Xbal site. This vector also contains a thiostrepton-resistance gene as a marker for identifying successful integrands.
  • the ligated products were used to transform the dam + E. co// DH10BTM strain and the plasmid DNA isolated.
  • This plasmid (pAR1) can only be singly cleaved with Xbal, despite possessing two Xbal recognition sequences, as one of the sites (situated at the 3' end of the TE unit) has been methylated by the E. coli Dam methylase.
  • the next DNA unit (ACP2 from module 2 of DEBS1) was then ligated to the Xbal-cut pAR1 , the ligation mixture used to transform DH10B cells and the plasmid DNA isolated.
  • Plasmid pAR10 was then used to transform S. erythraea/JC2 - a mutant strain of the wild- type S. erythraea NRRL2338 that lacks the DEBS genes except for the TE DNA fragment (Rowe et al., 1998).
  • Thiostrepton-resistant colonies were selected upon integration of the vector into the S. erythraea chromosome. Single transformants were grown on selective media, as described in the methods section. The fermentation broth was extracted with ethyl acetate - 24 -
  • E. coli dam + DH10BTM strain was purchased from Gibco BRL, USA.
  • Pfu DNA polymerase was purchased from Boeringer, Germany. Construction of the final expression plasmid pAR10 was carried out in several steps, as follows. The ten PKS DNA units were amplified by PCR using pfu DNA polymerase. The respective regions of eryAI gene, as well as the oligonucleotides used for each PCR are outlined: LM - segment of ety /gene (Bevitt et al., 1992) extending from nucleotide (N) 588 to N 2389;
  • KR1 - segment of eryAI gene extending from N 4808 to N 6316; 5'GGTCTAGAGTCGGTGCACCTGGGCACCGGAGCACGCCGGGTGCCC
  • TE - segment of eryAIII gene (Donadio et al. 1991) extending from N 8753 to N 9602; 5'GGTCTAGACAGCGGGACTCCCGCCCGGGAAGCG3' and 5'GGGCTAGCTCTAGATCATGAATTCCCTCCGCCCAGCCAGGCGTC3'. All PCR products were 5' phosphorylated and ligated to Smal-cut, dephosphorylated pUC18 vector and used to transform E. co// DH10B electrocompetent cells. The desired plasmids - containing the amplified DNA fragments were isolated and sequenced using standard pUC forward and reverse primers. No mistakes in the amplified products we.e detected.
  • Plasmid pAR1 was isolated, digested with Xbal, and ligated to the ACP2 fragment, and ligation products treated as mentioned above.
  • the other DNA fragments namely, KR2, AT2, KS2, ACP1 , KR1 , AT1 and KS1 were sequentially added to finally yield plasmid pAR10.
  • This plasmid was then digested with ⁇ / el and Xbal restriction enzymes and ligated with the LM fragment previously digested with the same two enzymes. The ligated products were used to transform E. coli DH10B electrocompetent cells and the final expression plasmid pAR10 isolated. Plasmid pAR10 was then used to transform S. erythraea/JC2 strain and colonies carrying the expression plasmid were selected through resistance to thiostrepton upon integration of the plasmid into the S. erythraea chromosome. Single transformants were picked and grown on - 28 -
  • Figure 7 outlines the strategy for the in vitro assembly of PKS DNA units.
  • the inventors have constructed the multienzyme DEBS1 -TE.
  • the in vivo construction of the gene for DEBS1 -TE took 12 days to complete.
  • the in vitro assembly on the other hand was completed in 2 days.
  • LM, KS1 , KR1 , AT1 , ACP1 , KS2, AT2, KR2, ACP2 and TE were amplified by means of PCR.
  • the forward primer in all cases, except the LM contained the Spel recognition sequence 5 ⁇ CTAGT3' while the reverse primer was engineered in such a way that it contained the Xbal recognition sequence 5' TCTAGA3' and Smal recognition sequence 5'CCCGGG3' downstream of the Xbal site ( Figure 7).
  • the amplification of the LM was carried out using a biotinylated forward primer and a reverse primer that contained the Xbal recognition sequence (5 CTAGA3').
  • PCR products were cloned in pUC-18 vector and the resulting plasmids sequenced to detect possible errors introduced by PCR. All plasmids, except the one containing the LM unit were then digested with Spel and Smal, dephosphorylated in order to remove the 5' phosphate group and the appropriate fragments isolated and eluted.
  • the LM unit was cleaved with Xbal and attached to a bead that was coated with streptavidin (following the manufacturer's instructions) as shown in figure 7.
  • the assembly process was initiated by adding DNA ligase to the tube containing a large excess of the first unit (KS1) and LM-bead.
  • the reason for having a large excess of the KS1 unit compared to the LM-bead unit is to favour the LM-bead ligating to the incoming unit, as opposed to the self-ligation of the LM-bead (see figure 7).
  • the ligation of the two DNA fragments is unidirectional as only the Spel-cut end of KS1 complements the Xbal-cut end of the LM-bead.
  • the desired product of the ligation reaction namely 'bead-LM-KS1' was separated from the reaction mixture and washed. This product was then cleaved with Xbal, in order to activate the 3' end of KS1.
  • the beads were washed again to remove the small Xbal-Smal DNA fragment that was - 30 -
  • a strategy employing the invention in order to construct the highly potent anti-breast cancer drug discodermolide, the anticholesterol compound decarestrictine, and the antitumour compound octalacin using polyketide synthase domains/modules is outlined below. - 31 -
  • the drug discodermolide ( Figure 18), isolated from the marine sponge 'Discodermia disoluta', has been identified as a highly potent anti-cancer compound and 80 times more effective than the well known anticancer drug Taxol (TerHarr et al., 1996). It has the same mechanism of action as Taxol, even though it is structurally different from the latter.
  • discodermolide is a polyketide and can therefore be constructed from a system that has the basic enzymatic building blocks (domains and modules) that make other polyketides like erythromycin and rapamycin. Having predicted that approximately 45 domains housed in 12 modules would be required in order to carry out the chemistry that accounts for the functionalities on the carbon skeleton of discodermolide, one can now begin to construct such a system. All one has to do is to identify the type and nature of the domains/modules that one requires to generate the observed functionalities, and then assemble these units in the desired order ( Figure 18). The resulting DNA assembly can then be put into a bacterial strain that makes a functional polyketide synthase.
  • discodermolide can be made available through chemical synthesis - there have been a few chemical routes reported in literature recently (Marshall and Johns, 1998 and references therein). However, as is the case with most other complex molecules, large scale production of discodermolide, using the chemical route would turn out to be excessively expensive. Chemists have been using the retrosynthetic analysis approach towards total synthesis of important bioactive molecules. This approach breaks the target compound into many smaller pieces - easily synthesised - which are then re- assembled.
  • the unit-DNA segments are amplified using the polymerase-chain-reaction (PCR) - from
  • Suitable vectors have an antibiotic resistance marker (for selection of this vector on an antibiotic-rich media) and an "origin-of -replication" (ori). Ori is essential for
  • vectors for the expression of the synthetic enzymes of the invention are the actinomycete vectors described by Rowe et al. (1998).
  • the strain is then grown in a media that is supplemented with the
  • Figures 4 and 5 show how the assembly proceeds.
  • Octalactin A and B are natural products isolated from the marine gorgonian octocoral 'Pacifigorgia sp.' (Tapiolas et. al., 1991).
  • Octalactin A shows very strong cytotoxicity toward B-16-F-10 murine melanoma and HCT-116 human colon tumour cell lines and is a promising drug candidate, while octalactine B displayed no such activity (Tapiolas et. al., 1991).
  • Total syntheses of both octalactin A and B have been reported in literature. One such synthesis (Buszek, et.
  • the molecule decarestrictine J can be synthesised using the retrebiosynthetic approach.
  • Decarestrictine J is a ten-membered lactone that comes from the family of decarestrictines, shown to display strong anti- cholesterol activity (Grabley et. al., 1992). The total synthesis of Decarestrictine J has been reported and involves numerous chemical steps (Yamada et. al., 1995).
  • the target molecule (figure 21) can be conceived to be formed by assembly of five acetate polyketide units. Using the retrobiosynthetic approach, one can identify the PKS domains/modules that - 37 -
  • decarestrcitine PKS is shown in figure 21.
  • the loading module, as well as the four internal modules along with the TE domains can be conveniently assembled using the invention.
  • the assembled 'decarestrictine gene' can then be expressed in a suitable host in order to check for the production of decarestrictine J.
  • the retrobiosynthetic approach involves the following steps; a). Identification of the number and nature of carbon units that make up the target molecule b). Identification of the modules/domains from libraries of polyketide/peptide synthetase/fatty acid/etc. encoding units that are responsible for the uptake of the said carbon units and the nature and degree of functionalisation of the carbon chain c). Assembly of the said modules/domains using the methods of the invention d). Expression of the assembled gene in a suitable expression host.
  • Example 4 Transforming strains with DNA encoding similar synthetic enzyme domains
  • recA E. coli strain
  • the vector, into which the assembled gene is being constructed contains a portion of a streptomyces recA gene.
  • This recA fragment carries a mutation.
  • the vector is used to transform a streptomyces host (e.g. S. lividans or S. erythraea).
  • the fragment of recA gene carrying a mutation recombines with the recA gene of the streptomyces host, abolishing the functional recA gene and making the strain recombination minus ( Figure 11).
  • the 1.0 kbp recA fragment, flanked at both ends by an Xbal site was then inserted in the expression vector pCJR24 that has a unique Xbal site.
  • the ligation mixture was used to transform E. co// DH10B cells and the desired plasmid DNA isolated.
  • the resulting plasmid (pARecA24) contains a non- methylated Xbal site at the 5' end of the recA gene fragment.
  • the ten PKS DNA units, namely, TE, two each of ACP1 , KR1 , AT1 & KS1 , and LM were inserted into the plasmid pARecA24 to finally yield the expression plasmid pfiecADITE.
  • This plasmid was used to transform wild-type S. lividans protoplasts, and thiostrepton resistant colonies were grown in defined liquid media as described above.
  • the compound ( Figure 12) was isolated from the bacterial broth and chemically character
  • the first gene in the biosynthesis of the polyketide antibiotic TA of Myxococcus xanthus codes for a unique PKS module coupled to a peptide synthetase. J. Mol. Biol. 286,465-474.
  • Discodermolide a cytotoxic marine agent that stabilizes microtubules more potently than taxol. Biochemistry 35, 243-250.

Landscapes

  • Genetics & Genomics (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Microbiology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Plant Pathology (AREA)
  • General Chemical & Material Sciences (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Mycology (AREA)
  • Cell Biology (AREA)
  • Enzymes And Modification Thereof (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

The invention comprises a method of assembling several DNA units in sequence in a DNA construct and all derivatives of this method. In particular the production of synthetic enzymes is contemplated. Each DNA unit is provided with the same restriction enzyme recognition site at its 5' and 3' ends. The restriction recognition site at its 3' end being combined with a recognition site for a DNA modification enzyme. A DNA construct having the same or a compatible accessible restriction site, as provided in the DNA unit, is cleaved at the restriction site by the appropriate restriction enzyme. The desired DNA unit is then inserted into the DNA construct, this ligated product subsequently being brought into contact with a DNA modification enzyme such that the restriction site at the 3' end of the inserted DNA unit is abolished. The ligated product is then cleaved at the remaining unmodified restriction recognition site and a subsequent DNA unit is inserted. This process is repeated introducing each desired DNA unit to give a DNA construct containing all the desired units in sequence.

Description

DNA MANIPULATION METHODS AND APPLICATIONS FOR SYNTHETIC ENZYMES.
Background
Polyketides, including the valuable drugs avermectin, erythromycin and rapamycin, are natural products that are synthesised by stepwise condensation of acetate, propionate and occasionally butyrate units. The enzymes that take part in the biosynthesis of polyketide chains are collectively known as the polyketide synthase (PKS). PKSs include examples of both type I (multifunctional enzyme) and type II (dissociable complex) organisation. The sequencing of the gene clusters encoding the erythromycin- (ery) and rapamycin- (rap) producing polyketide synthases has shown that each cycle of polyketide chain extension is catalysed by a different set or 'module' of enzyme activities, housed in a few very large multienzyme polypeptides. The basic building blocks of modules are enzymatic 'domains' that are covalently linked together. The ability of these domains to act upon the carbon chain and remove/add functionalities is reminiscent of a molecule being acted upon by chemical reagents in a chemical synthesis. The aim is therefore to assemble these domains or even modules in a manner as desired, so that the linked enzymes can carry out efficient synthesis of any target molecule. Until now, it has however not been possible to find a versatile methodology to assemble these PKS units. The whole area of polyketide research is at a stage where the flexibility of the whole enzymatic machinery is understood, despite the lack of any X-ray crystal structure data on these giant enzymes, but it remains difficult to "re-assemble" the enzymes de novo. A de novo synthesis is desirable for two reasons. Firstly, one does not need to change the structure of, for example, an antibiotic using tedious chemical methodologies that are time-consuming and expensive. Engineering an synthetic enzyme at the genetic level is much easier, faster and cheaper. As more and more antibiotics are rendered useless, simply because the bacteria they were active against have developed ways in which to become resistant to these drugs, there is an urgency to keep developing altered drug structures. Secondly, there is an ever-growing need for new drugs, more potent in their action than their predecessors. Whilst nature provides a large proportion of the new molecules that are, for example, antibiotic, anticholesterol, antifungal, or anti-cancer, the complicated structures of these drugs (for example the anti-cancer Taxol) makes it increasingly difficult for chemists to carry out conventional syntheses. The problem is made more difficult by the fact that the genes that make these drugs cannot always be isolated.
The isolation of the genes coding for the proteins that make the highly potent anti-cancer compound Taxol, has not as yet been reported. The resulting choice for obtaining Taxol is either to cut down 200 Pacific Yew trees to obtain enough taxol for one chemotherapy session, or to make the drug chemically using one of the many exceedingly expensive and long chemical routes that have appeared recently in the literature.
With the isolation, cloning and sequencing of the genes coding for the erythromycin polyketide synthases, a model for the functioning of modular type I PKSs began to emerge. It was clear that such a system is genetically programmed to carry out the necessary catalytic activities needed for processing of the polyketide chain. It is hypothesised that each domain acts independently on the progressing carbon skeleton and there is a correlation between the structure of the growing chain and the enzymatic activities carried out by the enzymes.
The first conclusive proof of such an arrangement came from experiments done by Donadio et al. (1991 , 1993). One such experiment (1991 ) involved an in-frame deletion in the ORF3 segment of erythromycin chromosome. This deletion eliminated the entire 183 amino acids of the ketoreductase domain of ery PKS module 5, along with some of the flanking region (a total of 271 amino acids) and resulted in the production of 5,6-dideoxy-3-α-mycarosyl-5-oxo-erythronolide B, the structure of which was confirmed by X-ray crystallography. Replacement of two amino acids in the putative NAD(P)H-binding motif of the enoylreductase domain encoded by ORF2 resulted in a new macrolide Δ6,7-anhydroerythromycin C being produced albeit in low yield. These results demonstrated that erythromycin PKS can be genetically reprogrammed to produce novel macrolides that would otherwise be difficult to get via chemical means. During the analysis of the fermentation products produced by a strain of S. erythraea that was genetically engineered to produce an analogue of 6dEB, it was found that a minor component of the fermentation was 3,5-dihydroxy-2,4-dimethyl-n-heptanoic acid δ-lactone (Donadio et al., 1991). This product was predicted to result from premature release of the chain from either the ACP of module 2 or the KS of module 3. A greater yield of this triketide product was obtained by heterologous over-expression of ORF1 in Streptomyces coelicolor (Kao et al., 1994), which also showed that DEBS1 can function autonomously. More recently (Cortes et al., 1995), a six-membered lactone was produced through genetically engineering the PKS. By repositioning the TE (cyclase) domain from module 6 to the C- terminus of module 2 (end of DEBS1 ), it was found that the yield of the lactone is increased by five-fold to 10-15 mg/L as compared to 1-3 mg/L obtained by Kao et al.
The relocation of the thioesterase domain at the end of DEBS1 was the first example demonstrating the efficacy of repositioning domains in type I modular systems. Since then, numerous such experiments have been carried out in order to probe further the efficacy of these multienzymes. The TE domain has been relocated at the end of module 5 as well as module 3 respectively (Kao et al., 1995, 1996). In both cases, the predicted compounds were produced that resulted from truncation of the progressing polyketide chain. Release of the 12-membered product in the former case showed that the thioesterase domain can indeed catalyse ring closure even for less energetically favourable reactions. In the second experiment, two products were produced, one of them thought to be resulting from spontaneous decarboxylation.
The first example of a chimaeric polyketide synthase constructed from a domain taken from a second PKS was demonstrated by Oliynyk ef al. (1996). An acyltransferase domain (AT) from module 2 of the rapamycin polyketide synthase was used to replace the AT of module 1 in the DEBS1- TE system. The resulting triketide lactone had a methyl group missing at position 5 of the six-membered ring. This was expected since the AT of module 2 of rap PKS (unlike the AT of module 1 of DEBS1 ) incorporates a malonyl-CoA extender unit, instead of a methylmalonyl-CoA unit.
Thus, it has been shown that not only can domains residing within a particular PKS be interchanged or destroyed, analogous domains can be derived from other synthases for the same purpose or for achieving the required synthetic goal. Such a strategy immediately provides a glimpse of the manner in which "designer" polyketides can be constructed through using "off-the-shelf" gene products.
More recently, another hybrid system has been constructed (Marsden et al., 1998) wherein a complete loading module from the avermectin PKS has been swapped with the erythromycin loading module, while keeping the rest of the DEBS modules intact. As expected, incorporation of butyryl-CoA as well as 2-methylisobutyryl-CoA was seen and in both cases, the end products contained the above mentioned residues. A closely-related experiment has been reported by Kuhstoss ef al. (1996) in which the loading module from the platenolide PKS was replaced with the loading module from tylactone PKS to yield the expected polyketide product.
It is very clear from the various engineering efforts outlined above that the aim must now be to exploit the potential for genetic manipulation of type I (modular) polyketide synthases (PKS) to produce hybrid synthases that might catalyse the formation of novel secondary metabolites in a predictable way.
What might be a giant step towards the realisation of this aim, would be to investigate whether these enzymes might be constructed de novo, as an essential step in developing a truly combinatorial biosynthesis of complex polyketides.
The 'assembly line' nature of type i polyketide synthases (PKS) that contain sets (called modules) of structurally similar but functionally different enzymatic activities (domains) suggests their potential as a source of "off- the-shelf" enzymatic reagents which can be used to synthesise new and complex polyketide molecules. Outlined below are methodologies for the rapid assembly of DNA units encoding such enzyme domains or modules of enzyme domains.
There are over 40 gene sequences for polyketides that are available from various databases. In addition there are numerous domains known from other synthetic enzymes such as, for example, fatty acid synthase (Joshi and Smith, 1993), peptide synthetases (Eisner et al., 1997) and hybrid polyketide/peptide synthesising enzymes (Paitan et al., 1999; Shen et al., 1999). This amounts to a vast library of domains and modules that cater for a chemical reaction (e.g. stereospecific condensation, dehydration, etc), or in the case of a module, a set of chemical reactions. In order to obtain analogues of a bio-active molecule, research efforts till now have been focused on strategies that involve either chromosomally altering the PKS genes that make the particular molecule (McDaniel et al., 1999) or feeding synthetic intermediates to the PKS (Jacobsen et a/.,1997) Because of the simplified nature of such experiments, these strategies will remain a fast route towards obtaining a wide variety of drug analogues. However, in the case of compounds like the highly potent anti-cancer discodermolide (TerHaar et al., 1996) the only possible means of obtaining sufficient quantities of the drug is through chemical synthesis. This is because in such cases, the genes responsible for making these bio-active molecules have not been isolated. The chemical synthesis of large molecules having numerous chiral centres like for example discodermolide, howsoever elegant, is tedious and expensive to scale-up (Marshall and Johns, 1998).
Abbreviations
In addition to those listed in Biochem. J. (1986) 233, 1-24, the following abbreviations have been used:
6-dEB 6-deoxyerythronolide B
6-MSA 6-methylsalicylic acid
6-MSAS 6-methylsalicylic acid synthase
ACP acyl carrier protein
AT β-keto acyl transferase bp base pair(s) of DNA
DEBS 6-deoxyerythronolide B synthase
DH β-hydroxyacyl-ACP dehydratase (dehydratase)
ER enoyl reductase
FAS fatty acid synthase kbp kilobase pair(s)
KR β-ketoacyl reductase
KS β-ketoacyl synthase
ORF open reading frame
PKS polyketide synthase
RAPS rapamycin synthase
TE thioesterase The Invention
In one aspect the invention provides a method of assembling several DNA units in sequence in a DNA construct. This method comprises the steps of: a) providing each DNA unit with a restriction enzyme recognition sequence at it's 5' end and with a recognition sequence for the same restriction enzyme at its 3' end that is combined with a recognition site for a DNA modification enzyme, b) providing a starting DNA construct having an accessible restriction site for the same or a compatible restriction enzyme and cleaving the starting DNA construct with such a restriction enzyme, c) inserting the desired DNA unit and bringing the ligated product into contact with a DNA modification enzyme such that the restriction site at the 3' end of the inserted DNA unit is abolished, d) cleaving the ligated product at an accessible unmodified recognition site for the same or a compatible restriction enzyme, e) repeating steps c) and d) to introduce each desired DNA unit to give a DNA construct containing all the desired units in sequence. DNA units can be any desired DNA sequence, though usually they encode enzyme domains or modules of two or more enzyme domains. The recognition sequences are usually positioned at the ends of the DNA unit once the DNA unit has been cut with the relevant enzyme, by this it is meant that the recognition sequences are adjacent to the coding sequence, or that they flank the said sequence. An accessible restπction site is herein defined as a restriction site which is unmodified, such that it can be cleaved by a restriction enzyme that normally recognises the sequence of the site. The accessible restriction site is preferably a unique site in the DNA unit or ligated product. Where there is more than one accessible site present, it is possible to perform a partial digest, as known in the art, to obtain digested products in which only the required site is cleaved in the DNA unit. The DNA modification enzyme employed in the method can be a methylase for example the dam methylase of Esche chia coli. Other methylases such as dcm are also envisaged.
A particular method comprises the steps of a) providing each DNA unit with an Xba\ recognition sequence
5'XXTCTAGA3' (where XX is not GA) at it's 5' end and with an Xba\ recognition sequence 5'GATCTAGA3' at its 3' end. b) providing a starting DNA construct having an accessible Xbal .site and cleaving the starting DNA construct with Xbal, c) inserting the desired DNA unit and using a resulting ligated product to transform a dam+ strain of E. coli, d) recovering a resulting plasmid and cleaving the plasmid at an accessible Xbal site with Xbal, e) repeating steps c) and d) to introduce each desired DNA unit to give a DNA construct containing all the desired units in sequence.
The recognition sequences for the restriction enzyme and the DNA modification enzyme employed in the method can be created in the DNA units prior to cutting with the restriction enzyme, for example by means of a primer extension reaction. The preferred DNA construct made by the method can be an expression vector capable of facilitating expression of the protein encoded by the desired DNA units.
It is also envisaged that the DNA modification can be removed and the restriction site re-established by replicating the ligated product in a dam- strain of E. coli by means of suitable vectors as known in the art. The invention also encompasses DNA unit assemblies where any given restriction enzyme recognition site can be modified by addition of a certain combination of nucleotide bases in order for it to be protected.
In a further aspect, the invention provides a method of making an assembly of several DNA units in sequence which method comprises the steps of: a) providing a first DNA unit with a recognition sequence for a first restriction enzyme at its 3' end, and cleaving the said first DNA unit with said first restriction enzyme, b) providing each other DNA unit with a recognition sequence at its 5' end for a second restπction enzyme which has a compatible ligation sequence with that of the first restriction enzyme, and an upstream recognition sequence for said first restriction enzyme and a downstream recognition sequence for a third restriction enzyme at its 3' end, and cleaving each said other DNA unit with the second and third restriction enzymes, c) ligating the said first DNA unit with a desired other DNA unit to form a ligated product such that the ligation of the two units abolishes the recognition site for the first restriction enzyme at the ligation junction, and cleaving the ligated product with said first restriction enzyme, d) ligating the product from c) with a desired DNA unit from b) to form a ligated product and cleaving the ligated product with said first restriction enzyme e) repeating step d) with each other DNA unit in turn so as to assemble the DNA units in sequence.
A particular method comprises the steps of: a) providing a first DNA unit with an Xbal recognition sequence
5TCTAGA3' at its 3' end, and cleaving the said first DNA unit with Xbal, b) providing each other DNA unit with a Spel recognition sequence 5ΑCTAGT3' at its 5' end, and a downstream Xbal recognition sequence 5TCTAGA3' followed by a downstream Smal recognition sequence 5'CCCGGG3' at its 3' end, cleaving each said other DNA unit with Spel and Smal, and dephosphorylating the 5' end of the cleaved DNA unit, c) ligating the said first DNA unit with a desired other DNA unit to form a ligated product and cleaving the ligated product with Xbal, d) ligating the product from c) with a desired DNA unit from b) to form a ligated product and cleaving the ligated product with Xbal e) repeating step d) with each other DNA unit in turn so as to assemble the DNA units in sequence.
In one embodiment the assembly can occur via stepwise addition of fragments to a vector. In an alternative embodiment the first DNA unit can be attached to the solid phase for use in step c). This permits the solid phase to be split and mixed between steps c), d), and e) to make several different assemblies. Methods of attaching DNA units to the solid phase are well know in the art. Preferred solid phase elements are beads attached to the DNA units via a biotinylated nucleotide, as known in the art.
The recognition sequences in one or more of the DNA units are preferably introduced by means of extension primers, as known in the art, though other methods such as the ligation of the required sequences or in vitro mutagenesis can also be employed. The assembly of several DNA units can be inserted into an expression vector and thus used to transform a host capable of expressing the protein encoded by the insert of the vector.
The method is particularly useful where one or more of the DNA units encodes a catalytic or transport protein domain for example a ketoreductase domain from a PKS enzyme or an ACP domain from a hybrid poiyketide/peptide synthesising enzyme. Such domains can be derived from enzyme domain DNA sequences from, for example, polyketide synthesising enzymes, peptide synthesising enzymes, hybrid peptide polyketide synthesising enzymes, fatty acid synthesising enzymes or other enzyme domains known in the art.
The DNA units used in the methods of the invention can encode modules comprising one or more catalytic or transport domains. Usually a module contains all of the domains required to complete one condensation step in the synthesis of a target molecule. Alternative aspects of the invention resulting from the methods of the invention include: DNA constructs or vectors incorporating a DNA assembly encoding synthetic enzymes, synthetic enzymes encoded by such DNA assemblies, hosts expressing synthetic enzymes, hybrids of transformed hosts expressing synthetic enzymes, and compounds produced by the synthetic enzymes. Where the product produced by the synthetic enzyme exhibits toxicity to a host stain, this can be worked around e.g. by means of choosing a different strain or mutating the original strain to provide mutants which are more tolerant. The diversity of compounds produced by hosts transformed with the synthetic enzymes of the invention can be further increased by using known methods of using different feedstocks in the fermentation to provide different starter units for the desired product. Where yield of desired synthetic enzyme product is low, routine steps e.g. mutation and selection, can be taken to improve this,
The synthetic enzymes of the invention can also be used in cell-free systems to produce the desired target molecule in vitro as known in the art, for example, see Carreras and Khosla (1998).
In a further aspect, the invention provides a method of synthesising a target molecule comprising the steps of a) examining the composition and stereochemistry of a target molecule, b) determining which catalytic and transport domains need to be present in a synthetic enzyme in order to catalyse the synthesis of the target molecule, c) using any one of the methods of the invention to assemble the required DNA units encoding the catalytic and transport domains into a DNA assembly that encodes said synthetic enzyme which is capable of synthesising the target molecule. d) placing the DNA assembly into a vector to allow expression of the synthetic enzyme in a host capable of synthesising the target molecule after transformation with said vector. Target molecules are generally bio-active molecules, usually having a predominantly carbon based backbone and usually are macromolecules comprised of condensed units. The transformed host can be tested for the presence of the target molecule after step d). If yields of the desired compound are low then conventional methods of improving product yield from, for example Streptomycetes, can be employed. Transformed hosts which result from the methods of the invention and their use in producing target molecules are also aspects of the invention. Hosts suitable for transformation with the DNA assemblies of the invention are known in the art and include insect or mammalian cells, though more usually suitable are bacterial cells, for example, the improved host strains described by Ziermann and Betlach (1999).
As stated previously, it is also envisaged that the synthetic enzyme can be used in a cell-free system to produce the target molecule in vitro.
A further aspect of the invention is a method of making a synthetic enzyme to catalyse the synthesis of a target molecule comprising the steps of a) examining the composition and stereochemistry of a target molecule, b) determining which catalytic and transport domains need to be present in the synthetic enzyme in order to catalyse synthesis of the target molecule, c) using any one of the methods of the invention to assemble the required DNA units encoding the catalytic and transport domains into a DNA assembly that encodes an enzyme which is capable of synthesising the target molecule, d) expressing the DNA assembly in a suitable host to produce the enzyme. In a further aspect the invention provides a library of DNA units encoding catalytic or transport protein domains, wherein each DNA unit has a recognition sequence for a restriction enzyme at it's 5'-end and a second recognition sequence for the same or a compatible enzyme at it's 3'-end which incorporates a recognition sequence for a DNA modifying enzyme. In a particular embodiment of such a library, each DNA unit has an Xbal recognition sequence 5'XXTCTAGA3' (where XX is not GA) at it's 5'-end and an Xbal recognition sequence 5'GATCTAGA3' at it's 3'-end
Also provided by the invention is a library of DNA units encoding catalytic or transport protein domains, wherein each DNA unit has a recognition sequence at its 5' end for a first restriction enzyme, and a downstream recognition sequence for a second restriction enzyme followed by a downstream recognition sequence for a third restriction enzyme at its 3' end, such that the DNA units, once restricted by the first and second restriction enzymes can be ligated together to abolish the restriction sites at the ligation junction. In one embodiment of this aspect of the invention each DNA unit has a Spel recognition sequence 5ΑCTAGT3' at its 5'-end, and a downstream Xbal recognition sequence 5TCTAGA3' followed by a downstream Smal recognition sequence 5'CCCGGG3' at it's 3'-end Catalytic or transport protein domains can be derived from any enzyme, for example those listed above. Particularly envisaged are libraries in which the DNA units encode polyketide synthetic domains, comprising two KS domains, at least two AT domains, two KR domains, two DH domains, two ER domains, an ACP domain and a TE domain. Also provided by the invention are modules comprising a DNA sequence encoding a functional set of polyketide synthetic domains wherein the module has a recognition sequence for a restriction enzyme at it's 5'-end and a second recognition sequence for the same or a compatible enzyme at it's 3'-end which incorporates a recognition sequence for a DNA modifying enzyme. An envisaged module has an Xbal recognition sequence 5'XXTCTAGA3' (where XX is not GA) at it's 5'-end and an Xbal recognition sequence 5'GATCTAGA3' at it's 3'-end
Alternatively a module comprising a DNA sequence encoding a functional set of polyketide synthetic domains can have a recognition sequence at its 5' end for a first restriction enzyme, and a downstream recognition sequence for a second restriction enzyme followed by a downstream recognition sequence for a third restriction enzyme at its 3' end, such that the DNA units, once restricted by the first and second restriction enzymes can be ligated together to abolish the restriction sites at the ligation junction. In one particular example, the module has a Spel recognition sequence 5ΑCTAGT3' at its 5'-end, and an upstream Xbal recognition sequence 5TCTAGA3' and a downstream Smal recognition sequence 5'CCCGGG3' at it's 3'-end.
Particularly envisaged are modules wherein the DNA units encode polyketide synthetic domains, comprising two KS domains, at least two AT domains, two KR domains, two DH domains, two ER domains, an ACP domain and a TE domain. It is also envisaged that other non-polyketide enzyme domains can be included in the modules provided by the invention.
Also provided by the invention are vectors containing one or more modules. Particularly useful are vectors in which a non-functional recA gene is also present. Such vectors prevent unwanted homologous recombination occurring between domains within the vector upon integration into a suitable host by abolishing the recA gene activity in that host. Thus the invention also provides a method of transforming a host with one or more synthetic DNA assemblies encoding enzyme domains which comprises the steps of: a) Inserting said DNA assembly into a vector containing a mutated internal fragment of a recA gene sequence such that the vector is capable of undergoing homologous recombination with the recA gene of the host, b) bringing said vector into contact with a host chromosome under conditions which permit homologous recombination to take place, c) disrupting the host recA gene by the integration of the DNA of said vector into the chromosome. The expression vector can be used to transform a Steptomyces host. The DNA assemblies contained in the vector can be modules as described herein. Also envisaged are transformed hosts which prior to transformation with a vector containing one or more modules according to the invention, were already lacking a recA function.
In a further aspect the invention provides kits containing DNA units, DNA modules, vectors, DNA manipulation hosts, DNA modification hosts, expression hosts, or solid phase elements for use in the methods of the invention. For example, one such kit might contain a first DNA unit which is a vector suitable for transforming a suitable host, a library of modules for insertion into that vector, both the first DNA unit and the library having the necessary recognition sites for use in the methods of the invention, together with host strains suitable for the manipulation and expression of the DNA assemblies of the invention.
A de novo "domain-by-domain" reconstruction of a hybrid multienzyme from the erythromycin-producing PKS has been achieved by the inventors by assembling DNA units corresponding to the constituent domains. The assembled gene was expressed in S. erythraea and the expected compounds were isolated from the bacterial broth. Application of this methodology, or variations of this methodology for making combinatorial assemblies of complex and aromatic PKSs allows for the rapid generation of novel or altered PKS or other synthetic multienzymes and paves the way for a quick and inexpensive synthesis of potentially bio- active molecules.
One alternative to chemical syntheses is to carry out a 'retrobiosynthetic analysis' of the desired molecule, by pinpointing the exact number and type of synthetic enzyme domains that are required for every chemical step, and then assembling the DNA units that encode these enzymes in order to make a hybrid synthetic enzyme. The aim is therefore, to assemble these domains or even modules in a manner as desired, so that the linked enzymes can carry out a progressive synthesis of a desired target molecule. Until now, it has not been possible to find a methodology to assemble these PKS DNA units using restriction enzymes and DNA ligase to cut and join the DNA pieces together - one of the limiting factors being the non-availability of appropriate restriction enzyme sites in the DNA sequence of the enzymes which synthesise these polyketide drugs. There exist very few unique restriction enzyme sites and even fewer restriction enzymes that do not cut in the polyketide DNA sequence (i.e. are "non- cutters"). However, the restπction enzyme Xbal, because of its TA-rich recognition sequence (5TCTAGA3'), does not cleave the majority of GC- rich polyketide gene clusters. Thus, flanking both ends of the DNA of the desired DNA unit (domain or module) with a recognition sequence that is cleaved on one end by Xbal, and on the other end by a restriction enzyme that is compatible with Xbal (e.g. Spel) is possible. A vectorial assembly, where such units are progressively joined, leaves one end of the unit that has been constructed by the ligation of Xbal and Spel-cut DNA ends, not recognisable by either of the two enzymes, thus making further addition of units possible at only one of the two ends.
This strategy makes use of selective recognition of the restriction enzyme site by the restriction enzyme Xbal, depending upon the sequence adjacent to the restriction enzyme site and upon the strain used (dam+ or dam") during the assembly process. The method has been shown to be successful, and by using this methodology to assemble modules, the complete erythromycin-producing PKS (comprising of six modules coded by three large open reading frames) can be built in under 10 days. Even though this time-period is small compared to what it would take to assemble the ery PKS genes using conventional methodologies, using a variation of the above mentioned methodology, complete gene-clusters, like the 33 kbp erythromycin PKS, can be built within a matter of hours.
Also described herein, is an approach wherein the assembly of the units itself can also be carried out in vitro without the need for an in vivo DNA modification step. Furthermore, employing the in vitro assembly methodology described below, one is now able to not only construct predetermined PKS genes, but also a randomly constructed combinatorial library of shuffled domains from one or more known synthetic enzymes. This has immediate and important implications for drug-discovery.
The methodology thus outlined requires DNA units to be modified so that they contain the appropriate 5'and 3' ends (X and Xd respectively). These units are then progressively assembled to achieve the desired gene length. The vector containing the assembled or reconstructed gene is then used to transform an expression system to achieve protein expression. This methodology has been shown to work effectively - the hybrid multienzyme DEBS1-TE was reconstructed by assembling de novo the ten constituent domains. The assembled gene, when expressed in S. erythraea gave the expected six-membered triketide lactones.
However, in the case of larger molecules like discodermolide, one would require a vectorial assembly of some 50 or so PKS units (if domains). A hypothetical PKS that would make a molecule as large as discodermolide would require 12 modules, each possessing the appropriate KS, AT, ACP and a set of reductive domains (e.g. KR, DH or ER). One would find that some of the domains in this group of 50 would be required to carry out the same catalytic function. For example, if all the hydroxy groups resulting from the ketoreductase activity from all 12 modules are of the same configuration, in effect 12 KRs that function in an identical fashion are required. Also, all 12 ACPs would, of course have the same catalytic function. It would therefore logically be more convenient, and less time-consuming if, to achieve ketoreduction from every one of the 12 modules, one used only one KR domain instead of 12 different ones in all the modules, or one ACP instead of 12 different ACPs. In fact, one can calculate that for every possible chemical reaction that can be carried out using PKS domains, one requires a set of only 12 domains, that in theory can be used repeatedly (Figure 1).
It is possible that inter-modular recombination events within the reconstituted PKS or other synthetic enzyme gene, may preclude the use of identical PKS or other enzyme domain DNA units in a set of modules. It might be expected that, for example (Figure 2) the ACP* DNA in module 1 to recombine with the identical ACP* DNA in module 3. This event can take place, for example, when the expression vector that possesses the assembled gene containing numerous identical PKS DNA units is used to transform a streptomyces host for polyketide production.
The inventors have developed a strategy that can circumvent this problem, therefore making it possible to construct large synthetic enzyme gene clusters using identical domains or modules repeatedly. This translates into a less expensive route towards synthetic enzyme gene construction (one would not require to have a start-up library of 200 or so to cover all possibilities), as the set of 12 domains, or similar functional arrangements of domains, are true "off-the-shelf components for the assembly of PKS genes or genes for other hybrid synthetic enzymes.
The inventors provide methods of DNA assembly that pave the way for a cheap and fast synthesis of a host of bio-active molecules, e.g. the anti- cancer drug Discodermolide.
The examples that follow are better described with reference to the following figures:
Figure 1 shows the chemical/stereochemical choices that each PKS domain can make. A total of 12 domains are required for every conceivable polyketide reaction.
Figure 2 shows integration of a plasmid containing more than one identical DNA unit (ACP*). After the plasmid has integrated in the streptomyces host through homologous recombination with TE, internal recombination can occur to yield truncated PKS genes. This is because the host is recA+.
Figures 3A and 3B show a schematic representation of the assembly process. The de novo construction of DEBS1-TE. DNA fragments (units) encoding for the constituent domains of the multienzyme DEBS1-TE were inserted sequentially into the expression plasmid pCJR24. The final plasmid pAR10 was then expressed in S. erythraea/JC2 to yield the expected triketide lactone products that are synthesised by the schematically shown re-assembled DEBS1-TE synthase. The amino acid changes made within the linker regions between domains are shown below the actual amino acid sequence. Construction of the expression plasmid pAR10 and structural characterisation of the two triketide lactones shown in the above figure is described in the methods section. X - Xbal restriction enzyme recognition sequence (5TCTAGA3'), Xd - Xbal and Dam methylase recognition sequence (5 GATCTAGA3 )
Figure 4 shows the methodology of the assembly of DNA units using Xbal/dam methylase technology. During the second last stage of assembly, indicated as transform and cut in the figure, transformation of a Dam'strain with plasmid (as it is a dam'strain, even Xd would be cleaved by Xbal) is effected. Cutting is achieved by Xbal and the DNA unit purified on a gel.
Figure 5 shows the procedure for the assembly of DNA units using Xbal/dam methylase technology.
Figure 6 shows how an Xbal site can be made sensitive to methylation.
The RE cuts at the sites shown by arrows. The boxed sequence is methylated in a dam+strain thereby altering the Xbal recognition site. The sequence however is not methylated in a dam strain, and so can still be cleaved by Xbal. The Xbal recognition sequence (5 CTAGA3') can therefore be selectively cleaved by Xbal. Assembly of DNA units uses only one restriction enzyme - Xbal.
Figure 7 shows the methodology of the in vitro assembly of DNA units - I using solid phase beads with the enzymes Xbal, Spel and Smal (other Xbal - compatible REs may be used). Figures 8 and 9 show how the methodology of the in vitro assembly of DNA units - II would proceed to the point of placing the DNA assembly into an expression vector for transforming and appropriate host. In vitro assembly of DNA units (domains) from the first multienzyme of erythromycin - producing PKS.
Figure 10 shows how in one single ligation, 16 ongoing assemblies are generated. This cascade can obtain exponential proportions. The gene library can be increased by increasing the diversity of the incoming unit.
Figure 11 shows the integration of an expression plasmid into a streptomyces host, using a mutated internal fragment of the recA gene as the region for homologous recombination. The resulting PKS gene can now contain more than one identical DNA units as the strain has been made recA minus.
Figure 12 shows the assembled PKS recADEBS1-TE. The second module is composed of domains that normally belong to the first module.
Figure 13 shows the amino acid sequence alignment of the recA protein of S. lividans (S.I.) and S. ambofaciens (S.a). Percent similarity: 96.496, percent identity: 95.418. Match display thresholds for the aiignment(s): I = identity : = 2 . = 1
Figures 14A and 14B show a DNA sequence alignment of the recA gene S. lividans (S.I) and S. ambofaciens (S.a). Start of the gene is from 'ATG' and stop is TGA'. Percent similarity: 94.713, percent identity: 94.713.
Figure 15 shows how an Xbal/Spel system might be used instead of an Xbal/dam methylase system to assemble DNA units, a strategy involving compatible restriction enzymes flanking either end of a DNA unit. An example of compatible REs would be Xbal and Spel. The recognition sequence of Xbal is - 5TCTAGA3' and that for Spel is 5ΑCTAGT3'. After Xbal and Spel have cleaved the DNA at their respective sites, the DNA unit can be ligated together as the overhanging is complementary. The junction where any two units are joined is now recognised by either Xbal or Spel.
Figure 16 is a schematic representation of the compatibility of Xbal- and Spel- digested DNA overhangs. It shows the compatibility of the sticky ends produced by Xbal and Spel and how re-ligation abolishes both sites. Figure 17 shows a schematic representation of the erythromycin-producing polyketide synthase; primary organisation of the genes and their corresponding protein domains. The multienzymes deoxyerythronolide B synthase 1 (DEBS1), DEBS2 and DEBS3 each have two modules, each of which processes one cycle of polyketide chain extension. Each of the six modules is constituted by covalently-linked enzymatic domains. Exploitation of such an enzymatic hierarchy as "of-the-shelf reagents can lead to synthesis of important chemical compounds.
Figure 18 shows the structure of the anticancer drug discodermolide (top) and the 'retrobiosynthetic approach' towards synthesising a target molecule (a discodermolide). Such an approach would involve opening up the structure (a.), identifying the number and type of polyketide carbon units that would make the discodermolide carbon skeleton (b.), and choosing the PKS DNA units (modules/domains) responsible for the uptake and subsequent processing of the carbon units (c).
Figure 19 shows the anti-tumour compound octalactin and the strategy behind the retrobiosynthetic approach towards synthesising bio-active molecules. The strategy comprises the steps of: 21a
Identify polyketide units - e.g. whether acetate, propionate, etc,
Break-up and identify - break up the carbon skeleton and identify how many such carbon units are present. Eight units would mean one requires eight modules to make a PKS.
Choose - choose the modules or domains that would be required, form an existing library of such PKS modules and domains.
Assemble - assemble the DNA units (modules/domains/using the invention.
Express - express the assembled gene in a host and check for compound production.
Figure 20 shows a schematic representation of they hypothetical polyketide synthase for synthesising octalactin B, assembled from enzyme units that belong to various PKSs in the public domain.
Figure 21 shows a schematic representation of the hypothetical decarestrictine polyketide synthase for synthesising the anti-cholesterol compound decarestrictine J, assembled from enzyme units that belong to various PKSs in the public domain.
- 22 -
Examples
Example 1 : Vectorial assembly of DNA units
DNA units that are to be assembled contain the Xbal recognition sequence at either end of the unit. At one of the ends, two nucleotides (GA) are arranged at the 5' end of the Xbal recognition sequence (thus making it 5'GATCTAGA3'). This is achieved by first incorporating the Xbal recognition sequences in the oligonucleotide primers and then amplifying the desired DNA unit by PCR. The PCR products are then ligated to a pUC-18 vector, used to transform a dam+ strain of E. coli, and the clones isolated and sequenced for possible errors in the PCR products. A dam+ strain of E. coli- like DH10B™ - methylate the nucleotide A in the sequence GATCTAGA, as 5'GATC3' is a sequence that is recognised by the product of the Dam methylase gene (Fujimoto ef a/.,1965; Geier et al., 1979). This makes only one end of the DNA unit cleavable by Xbal. The vector is then used to transform a dam" strain of E. coli (e.g. ET12567 - MacNeil et al. (1992)) and the plasmid DNA isolated. This DNA is now cleavable at bofb ends of the DNA unit by Xbal. When a library of units has been constructed using this strategy, and both ends of these units have been cleaved by Xbal, they are progressively inserted into a vector that has a unique Xbal site and the ligated products are used always to transform a dam+ strain of E. coli, thereby making sure that one end of the DNA unit is always protected from cleavage by Xbal through methylation. When the assembly of such units is completed, the final plasmid is integrated into a streptomyces strain for the production of the desired polyketide.
Using this methodology, the polyketide synthase DEBS1-TE, a multienzyme that has the first of the three bimodular erythromycin DEBS enzymes (DEBS1), fused with the erythromycin thioesterase (Cortes et al., 1995) was constructed in a de novo fashion. The ten inherent PKS domains in DEBS1-TE, namely, loading module (itself composed of an AT and an ACP), KS1 (ketosynthase of module 1), AT1 , KR1 , ACP1 , KS2 - 23 -
(ketosynthase of module 2), AT2, KR2, ACP2 and TE function in conjunction to catalyse the synthesis of (2R,3S,4S,5R)-2,4-dimethyl-3,5- dihydroxy-n-hexanoic acid δ-lactone (2), figure 3.
The DNA for all ten domains was amplified by PCR to incorporate the two aforementioned recognition sequences for Xbal (5TCTAGA3' and 5'GATCTAGA3') at the 5' and 3' ends of the DNA unit respectively. The PCR products were cloned in pUC18 vector, sequenced, and then used to transform the dam" E. co// ET12567 strain. To initiate the assembly process, the DNA unit for TE was inserted into S. erythraea expression vector pCJR24 (Rowe et al., 1998) which has a unique Xbal site. This vector also contains a thiostrepton-resistance gene as a marker for identifying successful integrands. The ligated products were used to transform the dam+ E. co// DH10B™ strain and the plasmid DNA isolated. This plasmid (pAR1) can only be singly cleaved with Xbal, despite possessing two Xbal recognition sequences, as one of the sites (situated at the 3' end of the TE unit) has been methylated by the E. coli Dam methylase. The next DNA unit (ACP2 from module 2 of DEBS1) was then ligated to the Xbal-cut pAR1 , the ligation mixture used to transform DH10B cells and the plasmid DNA isolated. Likewise, the other eight DNA units were successively added to pAR1 to finally yield the expression plasmid pAR10 containing the reconstituted DEBS1-TE gene (Figure 3). The junctions where these domains were joined were chosen in the linker regions that lie between these domains, so as to cause minimum disturbance of the structural features of these domains, that might in turn affect the proficiency of the domains themselves (Figure 3). Plasmid pAR10 was then used to transform S. erythraea/JC2 - a mutant strain of the wild- type S. erythraea NRRL2338 that lacks the DEBS genes except for the TE DNA fragment (Rowe et al., 1998). Thiostrepton-resistant colonies were selected upon integration of the vector into the S. erythraea chromosome. Single transformants were grown on selective media, as described in the methods section. The fermentation broth was extracted with ethyl acetate - 24 -
and a sample of the organic extract was analysed by gas chromatography- mass spectroscopy (GC-MS). Two peaks were observed, corresponding to molecular massess 158 and 172, indicating the presence of the expected acetate- and propionate- derived polyketides (2R,3S,4S,5R)-2,4-dimethyl- 3,5-dihydroxy-n-pentanoic acid d-lactone (1) and (2R,3S,4S,5R)-2,4- dimethyl-3,5-dihydroxy-n-hexanoic acid d-lactone (2). Both compounds were isolated and fully characterised by high-pressure liquid chromatography (HPLC), 1H 1 D and 2D NMR, 13C NMR, FT-ICR spectrometry, and by comparison with a synthetic standard of (2) (Brown et al., 1995). One litre of fermentation broth produces 24 mg of (1) and 56 mg of (2) - yields that are comparable to those reported elsewhere (Lau et al., 1999). It can therefore be asserted that the ten newly constructed inter- domain junctions have not in any way dimmed the catalytic proficiency of the DEBS1-TE synthase. In the absence of any crystal-structure data on PKS domains, all genetic engineering efforts known in the art have been based on trial-and- error methods of experimenting with where to join two such domains. As a result, the yield of the synthesised polyketide products have varied depending upon the position in the polypeptide chain at which the domains or modules have been linked (McDaniel et al., 1999; Ruan et al., 1997). The successful functioning of the reconstructed polyketide synthase described above has supplied new information about the inter-domain junction sites. Using this information, and the described methodology for the rapid assembly of these enzyme units, it is now possible to carry out a 'retrobiosynthetic analysis' of target molecules and then to use polyketide and other biosynthetic enzyme domains as truly 'off-the-shelf reagents to achieve a stereospecific synthesis. There is also the possibility of using this methodology for randomly combining DNA units that encode catalytic e.g. DH or transport e.g. ACP protein domains to generate combinatorial libraries of hybrid synthases. By using a suitable assay system to test for biological activity of the compounds that are generated by such means, it is - 25 -
possible to go back and isolate the hybrid synthetic gene resposible for the production of these compounds.
From 6-methylsalicilic acid to maitotoxin, nature displays a staggering diversity in compounds that are synthesised by means of 'combinatorial gene-shuffling'. This methodology, or variations of this methodology can be used as effective tools towards harnessing the combinatorial potential of discrete enzymatic units or their sets that are the feature of multi-functional PKS and other systems.
A similar system to the XbaMdam system described above, uses the restriction enzyme Fok\ which has the recognition site:
5'GGATG(N)9j3' 3'CCTAC(N)13T5' with the dcm methylase of E.coli. Adding CCA or CCT to the 5' end of the For recognition site would make the site dcm sensitive. Furthermore, if the sequence TCTAGA were inserted into the redundant section of the Fol restriction site, then the enzyme could be used to generate 'Xbal-cut ends'. Methods
E. coli dam+ DH10B™ strain was purchased from Gibco BRL, USA.. Pfu DNA polymerase was purchased from Boeringer, Germany. Construction of the final expression plasmid pAR10 was carried out in several steps, as follows. The ten PKS DNA units were amplified by PCR using pfu DNA polymerase. The respective regions of eryAI gene, as well as the oligonucleotides used for each PCR are outlined: LM - segment of ety /gene (Bevitt et al., 1992) extending from nucleotide (N) 588 to N 2389;
5'GGCATATGGCGGACCTGTCAAAGCTCTCCGACAGT3' and 5'GGTCTAGATCCCAGCCGCGGTCGGTCGGCAGTCCCG3', KS1 - segment of eryAI gene extending from N 2384 to N 3769; 5'GGTCTAGACTCGCTGTTCCACCCCGACCCCACGCGCTCGGGCACC GCGCACCA3' and - 26 -
5'GGTCTAGATCGCGCAGCGCGGCGGACTCGTCGACGGGGGCGAAG
GCGG3',
AT1 - segment of eryAI gene extending from N 3764 to N 4813;
5'GGTCTAGACGGTCTCGCGACGGGAAACGCCGACGGTGCCGCCGTT GGAA3' and
5'GGTCTAGATCCACCGCGACACCGGCGGCGAACGCGCGGGAGAGC
GCTTCGC3',
KR1 - segment of eryAI gene extending from N 4808 to N 6316; 5'GGTCTAGAGTCGGTGCACCTGGGCACCGGAGCACGCCGGGTGCCC
TT3' and
5'GGTCTAGATCGTCGAAGAGCCTGGTCGGGCGCTGCGCGGTGTA3',
ACP1 - segment of eryAI gene extending from N 6311 to N 6679; 5'GGTCTAGACGACGCGCGGCGGGCTGCGCCGCAGGCGCCGGCCGA
ACCGCGGG3' and
5'GGTCTAGATCGGCCGTGG-TCGCCGGTGCCGCCTGCTCGGCT3\
KS2 - segment of eryAI gene extending from N 6674 to N 8200; 5'GGTCTAGACGAGCCGATCGCGATCGTCGGCATGGCGTGC-
CGGCTGC3' and
5'GGTCTAGATCGTGCACGGCCTCGGCGGTGTCGGCGGCGAGC-
ACCGCGGCCCGCTCCTC3', AT2 - segment of eryAI gene extending from N 8195 to N 9340;
5'GGTCTAGAGGCGGTGGCCGACGGCGCGGTGGTT3' and
5'GGTCTAGATCGTCACGAGGGGTGGTGCGGTCCGGCAGCAGCCAGA
A3', KR2 - segment of eryAI gene extending from N 9335 to N 10639;
5'GGTCTAGACGGCTGGTTCTACC-GGGTCGACTGGACCGAG3' - 27 -
and
5'GGTCTAGATCCGGCCGGGGCCGGGCGGCGG-TGTAGGACT3\ ACP2 - segment of eryAI gene extending from N 10634 to N 10966; 5'GGTCTAGACCGCATCGTCACGACCGCGCCGAGCGA3' and
5'GGTCTAGATCG-GCGTCGAGGAAA3',
TE - segment of eryAIII gene (Donadio et al. 1991) extending from N 8753 to N 9602; 5'GGTCTAGACAGCGGGACTCCCGCCCGGGAAGCG3' and 5'GGGCTAGCTCTAGATCATGAATTCCCTCCGCCCAGCCAGGCGTC3'. All PCR products were 5' phosphorylated and ligated to Smal-cut, dephosphorylated pUC18 vector and used to transform E. co// DH10B electrocompetent cells. The desired plasmids - containing the amplified DNA fragments were isolated and sequenced using standard pUC forward and reverse primers. No mistakes in the amplified products we.e detected. All ten plasmids were then used to transform the E.coli ET12567 dam' strain. Isolated DNA was digested with Xbal restriction enzyme and desired fragments isolated and purified. The TE unit was then ligated to Xbal-cut pCJR24 vector and the ligation products used to transform E. coli DH10B electrocompetent cells. Plasmid pAR1 was isolated, digested with Xbal, and ligated to the ACP2 fragment, and ligation products treated as mentioned above. The other DNA fragments, namely, KR2, AT2, KS2, ACP1 , KR1 , AT1 and KS1 were sequentially added to finally yield plasmid pAR10. This plasmid was then digested with Λ/ el and Xbal restriction enzymes and ligated with the LM fragment previously digested with the same two enzymes. The ligated products were used to transform E. coli DH10B electrocompetent cells and the final expression plasmid pAR10 isolated. Plasmid pAR10 was then used to transform S. erythraea/JC2 strain and colonies carrying the expression plasmid were selected through resistance to thiostrepton upon integration of the plasmid into the S. erythraea chromosome. Single transformants were picked and grown on - 28 -
tap-water medium plates supplemented with thiostrepton, following which single transformants were grown in 5X200ml of SM3 liquid media supplemented with 5 ug/ml of thiostrepton for seven days (Rowe et al., 1998). Cells were removed by centrifugation, the supernatant was saturated with NaCI and extracted three times with equal volumes of ethyl acetate at pH 4.0. The solvent was evaporated to yield 1.12 g of crude product. A sample of this crude product was analysed by GC-MS. Two peaks were observed, corresponding to molecular masses 158 and 172, indicating the presence of the expected acetate- and propionate- derived polyketides (2R,3S,4S,5R)-2,4-dimethyl-3,5-dihydroxy-n-pentanoic acid δ- lactone (1) and (2R,3S,4S,5R)-2,4-dimethyl-3,5-dihydroxy-n-hexanoic acid δ-lactone (2). Compounds (1) and (2) were found to be structurally identical to those reported previously (Cortes et a/., 1995). Characterisation of (2R,3S,4S,5R)-2,4-dimethyl-3, 5-dihydroxy-n-pentanoic acid δ-lactone (1)
1H NMR (CDCI3l 500 MHz) δH 4.45-4.35 (1 H, dq, J = 6.56 and 1.62 Hz, C5- H), 3.8 (1 H, dd, J = 10.15 and 4.17 Hz C3-H), 2.45-2.70 (1 H, br, O-H), 2.42 (1 H, dq, J = 10.0 and 6.97 Hz C2-H), 2.05 (1 H, m, C4-H), 1.37 (3H, d, J = 7.17 Hz, C2-CH3), 1.32 (3H, d, J = 6.74 Hz, C5-CH3), 0.95 (3H, d, J = 7.20 Hz, C4-CH3) ppm. 13C NMR (CDCI3, 250 MHz) δ 174.20, 76.15, 73.62, 39.42, 38.14, 18.11 , 14.24, 4.48.
Characterisation of (2R,3S,4S,5R)-2,4-dimethyl-3,5-dihydroxy-n-hexanoic acid δ-lactone (2) 1H NMR (CDCI3, 500 MHz) dH 4.13 (1 H, ddd, J = 8.12, 5.93 and 2.19 Hz, C5-H), 3.82 (1 H, m, C3-H), 2.42-2.50 (1 H, dq, J = 10.17 and 7.08 Hz, C2-H), 2.12-2.19 (1 H, m, C4-H), 1.77-1.86 (1 H, m, one of C6-H2), 1.52-1.61 (1 H, m, one of C6-H2), 1.4 (3H, d, J = 7.09 Hz, C2-CH3), 1.0 (3H, t, J = 7.42 Hz, C6- CH3), 0.97 (3H, d, J = 6.96 Hz, C4-CH3) ppm. 13C NMR (CDCI3, 250 MHz) d 173.56, 81.34, 73.96, 40.08, 36.76, 25.27, 14.27, 9.88, 4.37.
Example 2: in vitro assembly of DNA units - 29 -
Figure 7 outlines the strategy for the in vitro assembly of PKS DNA units. The inventors have constructed the multienzyme DEBS1 -TE. The in vivo construction of the gene for DEBS1 -TE, it should be noted, took 12 days to complete. The in vitro assembly on the other hand was completed in 2 days.
All ten domains, namely, LM, KS1 , KR1 , AT1 , ACP1 , KS2, AT2, KR2, ACP2 and TE were amplified by means of PCR. The forward primer in all cases, except the LM contained the Spel recognition sequence 5ΑCTAGT3' while the reverse primer was engineered in such a way that it contained the Xbal recognition sequence 5' TCTAGA3' and Smal recognition sequence 5'CCCGGG3' downstream of the Xbal site (Figure 7). The amplification of the LM was carried out using a biotinylated forward primer and a reverse primer that contained the Xbal recognition sequence (5 CTAGA3'). All the PCR products were cloned in pUC-18 vector and the resulting plasmids sequenced to detect possible errors introduced by PCR. All plasmids, except the one containing the LM unit were then digested with Spel and Smal, dephosphorylated in order to remove the 5' phosphate group and the appropriate fragments isolated and eluted. The LM unit was cleaved with Xbal and attached to a bead that was coated with streptavidin (following the manufacturer's instructions) as shown in figure 7.
The assembly process was initiated by adding DNA ligase to the tube containing a large excess of the first unit (KS1) and LM-bead. The reason for having a large excess of the KS1 unit compared to the LM-bead unit is to favour the LM-bead ligating to the incoming unit, as opposed to the self-ligation of the LM-bead (see figure 7). The ligation of the two DNA fragments is unidirectional as only the Spel-cut end of KS1 complements the Xbal-cut end of the LM-bead. After the ligation was complete, the desired product of the ligation reaction, namely 'bead-LM-KS1' was separated from the reaction mixture and washed. This product was then cleaved with Xbal, in order to activate the 3' end of KS1. The beads were washed again to remove the small Xbal-Smal DNA fragment that was - 30 -
released from the 3' end of KS1 as a result of RE cleavage. The 'activated' bead-LM-KS1 unit was then ligated with Spel, S al-cut and 5' dephosphorylated AT1. The Spel-cut 5' end of AT1 complemented the Xbal-cut 3' end of KS1 to give bead-LM-KS1-AT1 as shown in figure 8. This product was separated from the reaction mixture and washed as before. The 3' end of AT1 in this product was then 'activated' through cleavage by Xbal, and the assembly process continued.
Finally, Spel, Smal-cut and 5' dephosphorylated TE unit was ligated with the DNA fragment that was now bead-LM-KS1 -AT1 -KR1 -ACP1 -KS2- AT2-KR2-ACP2 as shown in figure 9. The 3' end of the latter fragment was 'activated' by digesting it with Xbal. The assembled DEBS1 -TE gene was then inserted in the expression plasmid pCJR24 and the resulting plasmid used to transform a streptomyces strain. The expected triketide lactone products were isolated and structurally characterised. Use of the in vitro technology described above drastically reduces the time it takes to assemble predetermined or randomly shuffled genes. Also, the possibility of continuing with the assembly process while having numerous different assembly arrays attached to the beads, and splitting and mixing the beads between each unit/module addition from a library of units/modules, results finally in the generation of a cascade of different assemblies (Figure 10). These assembled genes can then be cloned simultaneously and expressed in a suitable host. An assay system can then be used to identify those assembled genes that yield bio-active compounds.
Example 3: Retrobiosynthetic synthesis of a target molecule
A strategy employing the invention in order to construct the highly potent anti-breast cancer drug discodermolide, the anticholesterol compound decarestrictine, and the antitumour compound octalacin using polyketide synthase domains/modules is outlined below. - 31 -
Discodermolide
The drug discodermolide (Figure 18), isolated from the marine sponge 'Discodermia disoluta', has been identified as a highly potent anti-cancer compound and 80 times more effective than the well known anticancer drug Taxol (TerHarr et al., 1996). It has the same mechanism of action as Taxol, even though it is structurally different from the latter.
One can infer from its structure (Figure 18) that discodermolide is a polyketide and can therefore be constructed from a system that has the basic enzymatic building blocks (domains and modules) that make other polyketides like erythromycin and rapamycin. Having predicted that approximately 45 domains housed in 12 modules would be required in order to carry out the chemistry that accounts for the functionalities on the carbon skeleton of discodermolide, one can now begin to construct such a system. All one has to do is to identify the type and nature of the domains/modules that one requires to generate the observed functionalities, and then assemble these units in the desired order (Figure 18). The resulting DNA assembly can then be put into a bacterial strain that makes a functional polyketide synthase.
Until now, it would have been exceedingly difficult, if not impossible to assemble 45 or so pieces of DNA in the wanted order, for several reasons. Firstly, one would have to look for two different restriction enzymes every time one needed to assemble two DNA segments. This is because if one uses just one restriction enzyme at either end of the - 32 -
domain, the already-assembled piece/pieces of DNA would be cleaved from the assembly every time one decided to insert a new domain. Secondly, in GC-rich DNA like the polyketide synthase producing Streptomyces strain, unique restriction enzyme sites are few and far between. To a molecular biologist, the task of assembling 40 pieces of DNA with the limitations mentioned above, would seem an insurmountable one. One would rather attempt to isolate the genes that make the drug at the first place than consider carrying out "step-by-step" reconstruction of the gene itself. In the case of discodermolide, even the last possibility is in the realms of fantasy. The organism within the marine sponge that makes the drug has not been identified. The only way discodermolide can be made available is through chemical synthesis - there have been a few chemical routes reported in literature recently (Marshall and Johns, 1998 and references therein). However, as is the case with most other complex molecules, large scale production of discodermolide, using the chemical route would turn out to be outrageously expensive. Chemists have been using the retrosynthetic analysis approach towards total synthesis of important bioactive molecules. This approach breaks the target compound into many smaller pieces - easily synthesised - which are then re- assembled.
The type of polyketide or other synthetic enzyme domains required in order to construct the target molecule from the starting units are identified using a "retrobiosynthetic analysis" approach for discodermolide, - 33 -
by matching which molecules need to be condensed to form the
macromolecule with the enzyme domains that carry out the required catalysis to build the macromolecule.
Having identified the enzyme units that are required, the unit-DNA segments are amplified using the polymerase-chain-reaction (PCR) - from
the library of existing polyketide synthase unit-DNA, and the appropriate recognition sequences are attached to each unit-DNA fragment. All of the unit fragments are then replicated in a dam" strain whereby both the
unmodified and modified sequences (5TCTAGA3' and 5'GATCTAGA3' respectively) are cleaved by the restriction enzyme Xbal.
Having constructed this library of appropriate PKS or other synthetic enzyme units, the corresponding DNA units are then assembled. The assembled DNA piece is then placed in a vector, so that it can be inserted in a bacterial strain to yield the desired synthetic protein. Suitable vectors have an antibiotic resistance marker (for selection of this vector on an antibiotic-rich media) and an "origin-of -replication" (ori). Ori is essential for
the independent growth of the vector in any strain. Particularly suitable
vectors for the expression of the synthetic enzymes of the invention are the actinomycete vectors described by Rowe et al. (1998).
The strain is then grown in a media that is supplemented with the
antibiotic, the resistance gene for which is present in the vector.
Figures 4 and 5 show how the assembly proceeds. The first domain
is inserted into a vector that is cut by cleavage with Xbal. After the ligation - 34 -
of the domain has taken place with the vector, the DNA is put in a bacterial strain that is dam+ and grown. Finally, bacterial colonies that have the desired vector-domain DNA are identified and DNA isolated from them. The whole procedure is cheap and fast. Only one restriction enzyme (Xbal) is made use of, routine cloning technology is employed, the desired DNA fragment is obtained, which can then be expressed in a Streptomyces strain to yield the polyketide synthase.
The in vivo "domain-by domain" construction of the discodermolide producing polyketide synthase would take approximately 55 days via this method. In comparison, assembly of modules would take less time, as one would need to assemble fewer pieces. Most importantly, once the synthase is shown to be functionally active, a large fermentation of the bacterial strain can be carried out, and the drug isolated in however much quantity one requires - unlike the chemical route where the starting materials have to be freshly synthesised every time one requires the target compound. Employing such a strategy would lead to a quick and inexpensive synthesis of important bioactive molecules like discodermolide. Retrobiosynthetic analysis
The whole approach (retrobiosynthetic analysis followed by identification of PKS units, followed by assembly of PKS units) is made clearer in the following two examples. - 35 -
Octalactin
A new addition to the rare class of eight-membered lactone natural products is the family of Octalactin. Octalactin A and B (Figure 20) are natural products isolated from the marine gorgonian octocoral 'Pacifigorgia sp.' (Tapiolas et. al., 1991). Octalactin A shows very strong cytotoxicity toward B-16-F-10 murine melanoma and HCT-116 human colon tumour cell lines and is a promising drug candidate, while octalactine B displayed no such activity (Tapiolas et. al., 1991). Total syntheses of both octalactin A and B have been reported in literature. One such synthesis (Buszek, et. al., 1994) typically involves more than 12 chemical steps in leading to the target molecules. Clearly, large-scale production of octalactins using chemical synthesis is industrially not viable. On the other hand, the genes that code for the enzymes that make octalactins have not be identified or isolated. This means that at present, modified octalactins can only be made using chemical synthesis. A gene is constructed from the available PKS spare parts - that would code for the enzymes that would make octalactin B. Octalactin B can then be converted into the cytotoxic octalactin A by one-step stereospecific epoxidation. Also, once the gene for octalactin B is constructed and shown to make the octalactin PKS, genetic engineering on this gene would yield modified octalactin PKSs that in turn would synthesise octalactin analogues.
Clearly, a polyketide, the carbon skeleton of octalactin B (Figure 19) can be seen to be assembled by acetate and propionate units. The uptake - 36 -
and assembly of these units in the prescribed sequence, as well as the functionalities that decorate the carbon chain of octalactin can be assigned to various PKS modules (see figure 19). Once a decision has been made regarding the type and nature of PKS modules, they can be strung together to make a gene using the invention. This gene can then be expressed in a suitable host in order to look for octalactin B production. The retrobiosynthetic approach towards octalactin is shown in detail in figure 19. A choice of what modules to select from the PKS module library is followed by amplification of the modular DNA fragments using the oligonucleotides such that the 5' and the 3' ends of every DNA fragment have the restriction enzyme recognition sites stated under the description of the invention. The choice of modules that, when assembled, would make the Octalactin gene' is displayed as a schematic representation in figure 20. Decarestrictine J
The molecule decarestrictine J can be synthesised using the retrebiosynthetic approach. Decarestrictine J is a ten-membered lactone that comes from the family of decarestrictines, shown to display strong anti- cholesterol activity (Grabley et. al., 1992). The total synthesis of Decarestrictine J has been reported and involves numerous chemical steps (Yamada et. al., 1995). The target molecule (figure 21) can be conceived to be formed by assembly of five acetate polyketide units. Using the retrobiosynthetic approach, one can identify the PKS domains/modules that - 37 -
would be required for the carbon skeleton of decarestrictine J. A hypothetical decarestrcitine PKS is shown in figure 21. The loading module, as well as the four internal modules along with the TE domains can be conveniently assembled using the invention. The assembled 'decarestrictine gene' can then be expressed in a suitable host in order to check for the production of decarestrictine J.
In summary, the retrobiosynthetic approach involves the following steps; a). Identification of the number and nature of carbon units that make up the target molecule b). Identification of the modules/domains from libraries of polyketide/peptide synthetase/fatty acid/etc. encoding units that are responsible for the uptake of the said carbon units and the nature and degree of functionalisation of the carbon chain c). Assembly of the said modules/domains using the methods of the invention d). Expression of the assembled gene in a suitable expression host.
Example 4: Transforming strains with DNA encoding similar synthetic enzyme domains
A method for transforming expression strains with DNA encoding similar synthetic enzyme domains has been devised. Instead of using the TE PKS DNA fragment as a region of integration from the assembled gene into a streptomyces host (S. erythraea 'JC2, Rowe et al., 1998), a mutated recA gene fragment from streptomyces is used. The assembly process is carried - 38 -
out in a recA' E. coli strain (e.g. DH10B) as previously described. As this strain is recA", one can assemble any number of identical DNA units. The vector, into which the assembled gene is being constructed, contains a portion of a streptomyces recA gene. This recA fragment carries a mutation. After the synthetic enzyme gene has been assembled, the vector is used to transform a streptomyces host (e.g. S. lividans or S. erythraea). The fragment of recA gene carrying a mutation recombines with the recA gene of the streptomyces host, abolishing the functional recA gene and making the strain recombination minus (Figure 11). This means that an event, such as the one described in figure 2 is now not possible. The strain is then grown to look for the encoded enzyme product. This strategy is tested by assembling a functional PKS gene having more than one type of identical DNA units (Figure 12). Construction of the PKS multienzyme recDEBS1-TE RecA protein has been characterised as a multifunctional enzyme that is essential for homologous recombination, DNA repair, SOS response and DNA rearrangements (Miller and Kokjohn, 1990). Most of the routinely used strains of E. coli are recAX The gene for recA has been identified from many streptomyces strains. The first streptomyces recA gene to be characterised and isolated was from S. lividans (NuBbaumer and Wohlleben, 1994) RecA mutants have since been generated in S. ambofaciens (Aigle et al., 1997). The streptomyces recA protein has approximately 372 amino acid residues (Figure 13). DNA sequence analysis suggests a coding region of 1 122 bp, and is found to be highly conserved within streptomyces (Figure 14). In fact the recA mutants of S. ambofaciens were generated by integrating a mutated portion of the S. lividans recA gene into the S. ambofaciens host. It was found that a recA mutant lacking 30 aa from the C-terminus of the protein inhibited recombination events in S. ambofaciens (Aigle et al., 1997). A recA mutant of the streptomyces host that is used for expression of the assembled gene was generated. - 39 -
The oligonucleotides:
5'- GGTCTAGAATTCGGCAAGGGCGCCGGTCATGCGCAT-3' and 5'- GG TCTAGA TCTGCGGCGTCGGCCGGGGCGGCGGAGGCG-3' were used as the forward and reverse primers respectively and the 1000 bp internal region of S. lividans recA gene (Nuβbaumer and Wohlleben, 1994) was amplified using pfu polymerase. An additional nucleotide (C) was incorporated into the forward primer to generate a frame shift in the amplified recA gene fragment. The PCR product was cloned- in pUC-18 vector and sequenced to detect for possible errors during PCR. The 1.0 kbp recA fragment, flanked at both ends by an Xbal site was then inserted in the expression vector pCJR24 that has a unique Xbal site. The ligation mixture was used to transform E. co// DH10B cells and the desired plasmid DNA isolated. The resulting plasmid (pARecA24) contains a non- methylated Xbal site at the 5' end of the recA gene fragment. The ten PKS DNA units, namely, TE, two each of ACP1 , KR1 , AT1 & KS1 , and LM were inserted into the plasmid pARecA24 to finally yield the expression plasmid pfiecADITE. This plasmid was used to transform wild-type S. lividans protoplasts, and thiostrepton resistant colonies were grown in defined liquid media as described above. The compound (Figure 12) was isolated from the bacterial broth and chemically characterised.
Thus, it has been shown that a gene carrying interspaced DNA units that are identical in structure as well as function does not lead to internal recombination events, as the native recA gene of the streptomyces host has been disrupted. Furthermore, it has been shown that it is possible to use identical domains to reach the objective of generating hybrid synthetic enzyme systems. This strategy will greatly reduce the number of domains that otherwise have to be employed for the purposes of de novo PKS gene assembly that yields the desired chemical compounds. The inventors have established a set of 12 domains that are capable of functioning robustly and are independent of flexibility and spacial constraints - problems that beset the choice of domains and modules previously. - 40 -
References
Aigle, B., Holl, A-C, Angulo, J.F., Leblond, P. and Decaris, B. (1997) Characterization of two Streptomyces ambofaciens recA mutants: identification of the recA protein by immunoblotting. FEMS Microbiol. lett., 149, 181-187.
Bevitt, D.J., Cortes, J., Haydock, S.F. and Leadlay, P.F. - (1992) 6- Deoxyerythronolide B synthase 2 from Saccharopolyspora erythraea. Cloning of the structural gene, sequence analysis and inferred domain structure of the multifunctional enzyme. Eur. J. Biochem., 204, 38-49.
Brown, M.J.B., Cortes, J., Cutter, A.L., Leadlay, P.F. and Staunton, J. (1995) A mutant generated by expression of an engineered DEBS1 protein from the erythromycin-producing polyketide synthase (PKS) in Streptomyces coelicolor produces the triketide as a lactone, but the major product is the nor-analogue derived from acetate as starter acid. J. Chem. Soc, Chem. Commun., 1517-1518.
Buszek, KR., Sato, N. and Jeong, Y.M. (1994) Total synthesis of octalactin- A and octalactin-B. J. Amer. Chem. Soc. 116, 5511 -5512.
Carreras C. and Khosla C. (1998) Purification and in vitro reconstitution of the essential protein components of an aromatic polyketide synthase. Biochemistry 37,2084-2088.
Cortes, J., Wiesmann, K.E.H., Roberts, G.A., Brown, M.J.B., Staunton, J. and Leadlay, P.F. (1995) Repositioning of a domain in a modular polyketide synthase to promote specific chain cleavage. Science, 268, 1487-1489. 41
Donadio, S., McAlpine, J.B., Sheldon, P.J., Jackson, M. and Katz, L. (1993) Proc. Natl. Acad. Sci. USA, 90, 7119-7123.
Donadio, S., Staver, M.J., Mcalpine, J.B., Swanson, S.J. and Katz, L. (1991) Modular organization of genes required for complex polyketide biosynthesis. Science, 252, 675-679
Eisner, A., Engert, H., Saenger, W., Hamoen, L., Venema, G. and Bemhard, F. (1997) Substrate specificity of hybrid molecules from peptide synthetases. J. Biol. Chem. 272, 4814-4819.
Fujimoto, D., Srinivasan, P.R. and Borek, E. (1965) On the nature of the deoxyribonucleic acid methylases. Biological evidence for the multiple nature of the enzymes. Biochemistry 4, 2849-2855.
Geier, G. E. and Modrich, P. (1979) Recognition sequence of the dam methylase of Escherichia coli K12 and mode of cleavage of Dpn I endonuclease. J. Biol. Chem, 254, 1408-1413.
Grabley, S., Granzer, E., Hutter, K., Ludwig, D., Mayer, M., Thiericke, R., Till, G., Wink, J., Phillips, S. and Zeeck, A. (1992) J. Antibiot. 45, 56-65.
Jacobsen, J.R., Hutchinson, C.R., Cane, D.E. and Khosla, C. Precursor- directed biosynthesis of erythromycin analogs by an engineered polyketide synthase. Science 277, 367-369 (1997)
Joshi, A.K. and Smith S. (1993) Construction of a cDNA encoding the multifunctional animal fatty acid synthase and expression in Spodoptera frugiperda cells using baculoviral vectors. Biochem J.,296, 143-149. 42 -
Kao, CM., Luo, G.L, Katz, L, Cane, D.E. and Khosla, C. (1995) J. Am. Chem. Soc, 117, 9105-9106.
Kao, CM., Luo, G.L., Katz, L, Cane, D.E. and Khosla, C. (1996) J. Am. Chem. Soc, 118, 9184-9185.
Kao, CM., Luo, G.L, Katz, L, Cane, D.E. and Khosla, C, (1994) J. Am. Chem. Soc, 116, 11612-11613.
Kuhstoss, S., Huber, M., Turner, J.R., Paschal, J.W. and Rao, R.N. (1996) Gene, 183, 231-236.
Lau, J., Fu, H., Cane, D. E. and Khosla, C. (1999) Dissecting the role of Acyltransferase domains of modular polyketide synthases in the choice and stereochemical fate of extender units. Biochemistry, 38, 1643-1651.
MacNeil, D.J., Gewain, K.M., Ruby, C.L., Dezeny, G., Gibbons, P.H. and MacNeil, T. (1992) Analysis of Streptomyces avermitilis genes required for avermectin biosynthesisutilizing a novel integration vector. Gene 111 , 61- 68.
Marsden, A.F.A., Wilkinson, B., Cortes, J., Dunster, N.J., Staunton, J. and Leadlay, P.F. (1998) Science, 279, 199-202.
Marshall, J.A. and Johns, B.A. (1998) Total synthesis of (+)- discodermolide. J. Org. Chem. 63, 7885-7892.
McDaniel, R. et al. (1999) and references therein. Multiple genetic modifications of the erythromycin polyketide synthase to produce a library of novel "unnatural" natural products. Proc. Natl. Acad. Sci. USA, 96, 1846- 1851. 43 -
Miller, RN. and Kokjohn, T.A. (1990) General microbiology of recA: Environmental and evolutionary significance. Annu. Rev. Microbioi, 44, 365-394.
NuBbaumer, B. and Wohlleben, W. (1994) Identification, isolation and sequencing of the recA gene of Streptomyces lividans TK24. FEMS Microbioi. lett, 118, 51-56.
Oliynyk, M., Brown, M.J.B., Cortes, J., Staunton, J. and Leadlay, P.F.
(1996) Chem. Biol., 3, 833-839.
Paitan, Y., Alon, G., Orr, E., Ron, E.Z., and Rosenberg, E. (1999) The first gene in the biosynthesis of the polyketide antibiotic TA of Myxococcus xanthus codes for a unique PKS module coupled to a peptide synthetase. J. Mol. Biol. 286,465-474.
Rowe, C.J., Cortes, J., Gaisser, S., Staunton, J., Leadlay, P.F. (1998) Construction of new vectors for regulated high-level expression in actinomycetes. Gene, 216, 215-223.
Ruan, X., Pereda, A., Stassi, D.L., Zeidner, D., Summers, R.G., Jackson, M., Shivakumar, A., Kakavas, S., Staver, M.J., Donadio, S. and Katz, L.
(1997) Acyltransferase domain substitutions in erythromycin polyketide synthase yield novel erythromycin derivatives. J. Bacteriol. 179, 6416-6425.
Shen, B., Du, L., Sanchez, C, Chen, M. and Edwards, D.J. (1999) Bleomycin biosynthesis in Streptomyces verticillus ATCC15003: A model of hybrid peptide and polyketide biosynthesis. Bioorganic Chemistry 27, 123- 129.
Tapiolas, D.M., Roman, M., Fenical, W., Stout, TJ. and Clardy, J. (1991) - 44 -
Octalactin-A and Octalactin-B - cytotoxic 8-membered-ring lactones from a marine bacterium, Streptomyces sp. J. Amer. Chem. Soc. 113, 4682-4683.
TerHaar, E., Kowalski, R.J., Hamel, E., Lin, CM., Longley, R.E., Gunasekera, S.P., Rosenkranz, H.S. and Day, B.W. (1996)
Discodermolide, a cytotoxic marine agent that stabilizes microtubules more potently than taxol. Biochemistry 35, 243-250.
Yamada, S., Tanaka, A. and Oritani, T. (1995) Total synthesis of Decarestrictine-J. Biosci. Biotech. & Biochem. 59, 1657-1660
Ziermann, R. and Betlach, M.C. (1999) Recombinant Polyketide Synthesis in Streptomyces: Engineering of improved host strains. BioTechniques 26, 106-110.

Claims

45CLAIMS
1. A method of assembling several DNA units in sequence in a
DNA construct, which method comprises the steps of
a) providing each DNA unit with a restriction enzyme -recognition sequence at it's 5' end and with a recognition sequence for the same restriction enzyme at its 3' end that is combined with a recognition site for a DNA modification enzyme.
b) providing a starting DNA construct having an accessible restπction site for the same or a compatible restriction enzyme and cleaving the starting DNA construct with such a restriction enzyme,
c) inserting the desired DNA unit and bringing the ligated product into contact with a DNA modification enzyme such that the restriction site at the 3' end of the inserted DNA unit is abolished
d) cleaving the ligated product at an accessible unmodified recognition site for the same or a compatible restriction enzyme,
e) repeating steps c) and d) to introduce each desired DNA unit to give a DNA construct containing all the desired units in sequence.
2. The method of claim 1 wherein the DNA modification enzyme is a methylase.
3. The method of claim 2 wherein the methylase is the dam methylase of Escherichia coli. - 46 -
4. The method of claim 3 which compπses the steps of
a) providing each DNA unit with an Xbal recognition sequence 5'XXTCTAGA3' (where XX is not GA) at it's 5' end and with an Xbal recognition sequence 5'GATCTAGA3' at its 3' end.
b) providing a starting DNA construct having an accessible Xbal site and cleaving the starting DNA construct with Xbal,
c) inserting the desired DNA unit and using a resulting ligated product to transform a dam+ strain of E. coli,
d) recovering a resulting plasmid and cleaving the plasmid at an accessible Xbal site with Xbal,
e) repeating steps c) and d) to introduce each desired DNA unit to give a DNA construct containing all the desired units in sequence.
5. The method of any one of claims 1 to 4, wherein the recognition sequences for the restriction enzyme and the DNA modification enzyme are created in the DNA units prior to cutting with the restriction enzyme.
6. The method of claim 5 wherein the restriction sites are created in the fragment by means of a primer extension reaction.
7. The method of any one of claims 1 to 6, wherein the DNA construct is an expression vector capable of facilitating expression of the protein encoded by the desired DNA units 47 -
8. The method of claim 3 or claim 4, wherein the DNA modification is removed and the restriction site re-established by replicating the ligated product in a dam- strain of E. coli by means of a suitable vector.
9. A method of making an assembly of several DNA units in sequence which method comprises the steps of:
a) providing a first DNA unit with a recognition sequence for a first restriction enzyme at its 3' end, and cleaving the said first DNA unit with said first restriction enzyme,
b) providing each other DNA unit with a recognition sequence at its 5' end for a second restriction enzyme which has a compatible ligation sequence with that of the first restriction enzyme, and a downstream recognition sequence for said first restriction enzyme followed by a downstream recognition sequence for a third restriction enzyme at its 3' end, and cleaving each said other DNA unit with the second and third restriction enzymes,
c) ligating the said first DNA unit with a desired other DNA unit to form a ligated product such that the ligation of the two units abolishes the recognition site for the first restriction enzyme at the ligation junction, and cleaving the ligated product with said first restriction enzyme,
d) ligating the product from c) with a desired DNA unit from b) to form a ligated product and cleaving the ligated product with said first restriction enzyme
e) repeating step d) with each other DNA unit in turn so as to assemble the DNA units in sequence. - 48 -
10. The method of claim 9 which method comprises the steps of:
a) providing a first DNA unit with an Xbal recognition sequence 5TCTAGA3' at its 3' end, and cleaving the said first DNA unit with Xbal,
b) providing each other DNA unit with a Spel recognition sequence 5ΑCTAGT3' at its 5' end, and a downstream Xbal recognition sequence 5TCTAGA3' followed by a downstream Smal recognition sequence 5'CCCGGG3' at its 3' end, cleaving each said other DNA unit with Spel and Smal, and dephosphorylating the 5' end of the cleaved DNA unit,
c) ligating the said first DNA unit with a desired other DNA unit to form a ligated product and cleaving the ligated product with Xbal,
d) ligating the product from c) with a desired DNA unit from b) to form a ligated product and cleaving the ligated product with Xbal
e) repeating step d) with each other DNA unit in turn so as to assemble the DNA units in sequence.
11. The method of claim 9 or claim 10 wherein the assembly occurs via stepwise addition of fragments to a vector
12. The method of claim 9 or claim 10 wherein the said first DNA unit is attached to the solid phase for use in step c)
13. The method of claim 12, wherein the solid phase is split and mixed between steps c), d), and e) to make several different assemblies. 49
14. The method of any one of claims 9-13, wherein the recognition sequences in one or more of the DNA units are introduced by means of extension primers.
15. The method of any one of claims 9-14 wherein the assembly of several DNA units is inserted in to an expression vector which is used to transform a host capable of expressing the protein encoded by the vector
16. The method of any one of claims 1 -15, wherein one or more of the DNA units encodes a catalytic or transport protein domain, (see
Kleinkauf peptide/polyketide systems paper)
17. The method of claim 16 wherein one or more of the DNA units are derived from polyketide synthesising enzyme domain DNA sequences.
18. The method of claim 16 wherein one or more of the DNA units are derived from peptide synthesising enzyme domain DNA sequences.
19. The method of claim 16 wherein one or more of the DNA units are derived from hybrid peptide polyketide enzyme domain DNA sequences.
20. The method of claim 16 wherein one or more of the DNA units are derived from fatty acid synthesising enzyme domain DNA sequences
21. The method of claim 16 wherein one or more of the DNA units encode modules comprising one or more catalytic or transport domains 50
22. DNA constructs incorporating one or more DNA assemblies encoding synthetic enzymes made by any one of the methods of claims 1-21.
23. Synthetic enzymes encoded by one or more DNA assemblies made by the methods of anyone of claims 1-21
24. Hosts expressing DNA constructs encoding one or more synthetic enzymes made by any one of the methods of claims 1 -21.
25. Hybrids of transformed hosts expressing one or more DNA constructs encoding synthetic enzymes incorporating a DNA assembly made by any one of the methods of claims 1-21.
26. Compounds produced by synthetic enzymes encoded by DNA assemblies made by any one of the methods of claims 1-21.
27. A method of synthesising a target molecule comprising the steps of
a) examining the composition and stereochemistry of a target molecule,
b) determining which catalytic and transport domains need to be present in a synthetic enzyme in order to catalyse the synthesis of the target molecule,
c) using any one of the methods of claims 1-21 to assemble the required DNA units encoding the catalytic and transport domains into a 51 -
DNA assembly that encodes said synthetic enzyme which is capable of synthesising the target molecule.
d) placing the DNA assembly into a vector to allow expression of the synthetic enzyme in a host capable of synthesising the target molecule after transformation with said vector.
28. The method of claim 27 wherein the transformed host is tested for the presence of the target molecule after step d).
29. The transformed host of claim 27.
30. Use of transformed host of claim 27 to produce said target molecule.
31. A method of making a synthetic enzyme to catalyse the synthesis of a target molecule comprising the steps of
a) examining the composition and stereochemistry of a target molecule,
b) determining which catalytic and transport domains need to be present in the synthetic enzyme in order to catalyse the synthesis of the target molecule,
c) using any one of the methods of claims 1-21 to assemble the required DNA units encoding the catalytic and transport domains into a DNA assembly that encodes an enzyme which is capable of synthesising the target molecule. 52
d) expressing the DNA assembly in a suitable host to produce the enzyme.
32. A library of DNA units encoding catalytic or transport protein domains, wherein each DNA unit has a recognition sequence for a restriction enzyme at it's 5'-end and a second recognition sequence for the same or a compatible enzyme at it's 3'-end which incorporates a recognition sequence for a DNA modifying enzyme.
33. The library of claim 32, wherein each DNA unit has an Xbal recognition sequence 5'XXTCTAGA3' (where XX is not GA) at it's 5'-end and an Xbal recognition sequence 5'GATCTAGA3' at it's 3'-end
34. A library of DNA units encoding catalytic or transport protein domains, wherein each DNA unit has a recognition sequence at its 5' end for a first restriction enzyme, and a downstream recognition sequence for a second restriction enzyme followed by a downstream recognition sequence for a third restriction enzyme at its 3' end, such that the DNA units, once restricted by the first and second restriction enzymes can be ligated together to abolish the restriction sites at the ligation junction.
35. The library of claim 34, wherein each DNA unit has a Spel recognition sequence 5ΑCTAGT3' at its 5'-end, and a downstream Xbal recognition sequence 5TCTAGA3' followed by a downstream Smal recognition sequence 5'CCCGGG3' at it's 3'-end
34. The library of claim 32 or claim 34, wherein the DNA units encode polyketide synthetic domains, comprising two KS domains, at least two AT domains, two KR domains, two DH domains, two ER domains, an ACP domain and a TE domain. - 53 -
35. A module comprising a DNA sequence encoding a functional set of polyketide synthetic domains wherein the module has a recognition sequence for a restriction enzyme at it's 5'-end and a second recognition sequence for the same or a compatible enzyme at it's 3'-end which incorporates a recognition sequence for a DNA modifying enzyme
36. The module as claimed in claim 35, wherein the module has an Xbal recognition sequence 5'XXTCTAGA3' (where XX is not GA) at it's 5'-end and an Xbal recognition sequence 5'GATCTAGA3' at it's 3'-end
37. A module comprising a DNA sequence encoding a functional set of polyketide synthetic domains wherein the module has a recognition sequence at its 5' end for a first restriction enzyme, and a downstream recognition sequence for a second restriction enzyme followed a downstream recognition sequence for a third restriction enzyme at its 3' end, such that the DNA units, once restricted by the first and second restriction enzymes can be ligated together to abolish the restriction sites at the ligation junction
38. The module as claimed in claim 37, wherein the module has a
Spel recognition sequence 5ΑCTAGT3' at its 5'-end, and a downstream Xbal recognition sequence 5TCTAGA3' followed by a downstream Smal recognition sequence 5'CCCGGG3' at it's 3'-end
39. A module as claimed in claim 35 or claim 37, wherein the
DNA units encode polyketide synthetic domains, comprising two KS domains, at least two AT domains, two KR domains, two DH domains, two ER domains, an ACP domain and a TE domain
40. A vector containing one or more modules as claimed in claim
35 or claim 37. - 54 -
41. The vector as claimed in claim 40, wherein a non-functional recA gene is also present.
42. A method of transforming a host with one or more synthetic
DNA assemblies encoding enzyme domains which comprises the steps of:
a) Inserting said DNA assembly into a vector containing a mutated internal fragment of a recA gene sequence such that the vector is capable of undergoing homologous recombination with the recA gene of the host,
b) bringing said vector into contact with a host chromosome under conditions which permit homologous recombination to take place,
c) disrupting the host recA gene by the integration of the DNA of said vector into the chromosome.
43. The method of claim 42 wherein the expression vector is used to transform a Steptomyces host.
44. The method of claim 42 or claim 43, wherein the DNA assemblies are modules according to claim 35 or claim 37.
45. A host lacking a recA function, transformed with a vector containing one or more modules according to claim 35 or 37.
46. A kit containing DNA units, DNA modules, vectors, DNA manipulation hosts, DNA modification hosts, expression hosts, or solid phase elements for use in the methods claimed herein.
PCT/GB2000/002286 1999-06-11 2000-06-12 Dna manipulation methods, applications for synthetic enzymes and use for polyketide production WO2000077181A2 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
CA002376559A CA2376559A1 (en) 1999-06-11 2000-06-12 Dna manipulation methods, applications for synthetic enzymes and use for polyketide production
AU55457/00A AU5545700A (en) 1999-06-11 2000-06-12 Dna manipulation methods and applications for synthetic enzymes
EP00940533A EP1190045A2 (en) 1999-06-11 2000-06-12 Dna manipulation methods , applications for synthetic enzymes and use for polyketide production

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GBGB9913694.7A GB9913694D0 (en) 1999-06-11 1999-06-11 DNA manipulation methods and applications for synthetic enzymes
GB9913694.7 1999-06-11

Publications (2)

Publication Number Publication Date
WO2000077181A2 true WO2000077181A2 (en) 2000-12-21
WO2000077181A3 WO2000077181A3 (en) 2001-05-10

Family

ID=10855226

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/GB2000/002286 WO2000077181A2 (en) 1999-06-11 2000-06-12 Dna manipulation methods, applications for synthetic enzymes and use for polyketide production

Country Status (5)

Country Link
EP (1) EP1190045A2 (en)
AU (1) AU5545700A (en)
CA (1) CA2376559A1 (en)
GB (1) GB9913694D0 (en)
WO (1) WO2000077181A2 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001081564A2 (en) * 2000-04-26 2001-11-01 Actinodrug Pharmaceuticals Gmbh Method for producing dna encoding polypeptides that are composed of several sections, and for producing polypeptides by expressing the dna thus obtained
US8999679B2 (en) 2008-12-18 2015-04-07 Iti Scotland Limited Method for assembly of polynucleic acid sequences
US9777305B2 (en) 2010-06-23 2017-10-03 Iti Scotland Limited Method for the assembly of a polynucleic acid sequence
CN113728130A (en) * 2019-04-01 2021-11-30 国立大学法人神户大学 Construction method of chimeric plasmid library

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4713337A (en) * 1985-01-03 1987-12-15 Massachusetts Institute Of Technology Method for deletion of a gene from a bacteria
US4963487A (en) * 1985-01-03 1990-10-16 Massachusetts Institute Of Technology Method for deletion of a gene from a bacteria
WO1996040968A1 (en) * 1995-06-07 1996-12-19 The Leland Stanford Junior University Recombinant production of novel polyketides
WO1997028282A1 (en) * 1996-01-29 1997-08-07 Stratagene Improved primer-mediated polynucleotide synthesis and manipulation techniques
WO1998017811A1 (en) * 1996-10-24 1998-04-30 Chromaxome Corporation Methods for generating and screening novel metabolic pathways
EP0841402A2 (en) * 1996-09-26 1998-05-13 National Institute Of Agrobiological Resources, Ministry Of Agriculture, Forestry And Fisheries High capacity binary shuttle vector
WO1998038326A1 (en) * 1997-02-28 1998-09-03 Nature Technology Corporation Self-assembling genes, vectors and uses thereof
WO1998049315A2 (en) * 1997-04-30 1998-11-05 Kosan Biosciences, Inc. Combinatorial polyketide libraries produced using a modular pks gene cluster as scaffold
US5863730A (en) * 1995-09-15 1999-01-26 Centre National De La Recherche Scientifique - Cnrs Procedure for the polymerization of nucleic acid sequences and its applications
WO2000063360A1 (en) * 1999-04-16 2000-10-26 Celltech Therapeutics Limited Combinatorial method for producing nucleic acids

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4713337A (en) * 1985-01-03 1987-12-15 Massachusetts Institute Of Technology Method for deletion of a gene from a bacteria
US4963487A (en) * 1985-01-03 1990-10-16 Massachusetts Institute Of Technology Method for deletion of a gene from a bacteria
WO1996040968A1 (en) * 1995-06-07 1996-12-19 The Leland Stanford Junior University Recombinant production of novel polyketides
US5863730A (en) * 1995-09-15 1999-01-26 Centre National De La Recherche Scientifique - Cnrs Procedure for the polymerization of nucleic acid sequences and its applications
WO1997028282A1 (en) * 1996-01-29 1997-08-07 Stratagene Improved primer-mediated polynucleotide synthesis and manipulation techniques
EP0841402A2 (en) * 1996-09-26 1998-05-13 National Institute Of Agrobiological Resources, Ministry Of Agriculture, Forestry And Fisheries High capacity binary shuttle vector
WO1998017811A1 (en) * 1996-10-24 1998-04-30 Chromaxome Corporation Methods for generating and screening novel metabolic pathways
WO1998038326A1 (en) * 1997-02-28 1998-09-03 Nature Technology Corporation Self-assembling genes, vectors and uses thereof
WO1998049315A2 (en) * 1997-04-30 1998-11-05 Kosan Biosciences, Inc. Combinatorial polyketide libraries produced using a modular pks gene cluster as scaffold
WO2000063360A1 (en) * 1999-04-16 2000-10-26 Celltech Therapeutics Limited Combinatorial method for producing nucleic acids

Non-Patent Citations (8)

* Cited by examiner, † Cited by third party
Title
MCDANIEL R ET AL: "Multiple genetic modifications of the erythromycin polyketide synthase to produce a library of novel unnatural natural products" PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF USA,NATIONAL ACADEMY OF SCIENCE. WASHINGTON,US, vol. 96, no. 5, March 1999 (1999-03), pages 1846-1851, XP002143433 ISSN: 0027-8424 *
MUTH G ET AL: "Mutational analysis of Streptomyces lividans recA gene suggests that only mutants with residual activity remain viable." MOLECULAR & GENERAL GENETICS, vol. 255, no. 4, 1997, pages 420-428, XP002160032 ISSN: 0026-8925 *
NERENBERG J B ET AL: "TOTAL SYNTHESIS OF THE IMMUNOSUPPRESSIVE AGENT (-)-DISCODERMOLIDE" JOURNAL OF THE AMERICAN CHEMICAL SOCIETY,US,AMERICAN CHEMICAL SOCIETY, WASHINGTON, DC, vol. 115, no. 26, 1993, pages 12621-12622, XP000652058 ISSN: 0002-7863 *
RANGANATHAN ANAND ET AL: "Knowledge-based design of bimodular and trimodular polyketide synthases based on domain and module swaps: A route to simple statin analogues." CHEMISTRY & BIOLOGY (LONDON), vol. 6, no. 10, October 1999 (1999-10), pages 731-741, XP000971117 ISSN: 1074-5521 *
ROWE C J ET AL: "Construction of new vectors for high-level expression in actinomycetes" GENE,NL,ELSEVIER BIOMEDICAL PRESS. AMSTERDAM, vol. 216, no. 1, August 1998 (1998-08), pages 215-223, XP004149299 ISSN: 0378-1119 cited in the application *
TAPIOLAS D M ET AL: "OCTALACTINS A AND B CYTOTOXIC EIGHT-MEMBERED-RING LACTONES FROM A MARINE BACTERIUM STREPTOMYCES-SP" JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, vol. 113, no. 12, 1991, pages 4682-4683, XP002154630 ISSN: 0002-7863 cited in the application *
TER HAAR ERNST ET AL: "Discodermolide, a cytotoxic marine agent that stabilizes microtubules more potently than taxol." BIOCHEMISTRY, vol. 35, no. 1, 1996, pages 243-250, XP002154629 ISSN: 0006-2960 cited in the application *
YAMADA SHINYA ET AL: "Total synthesis of (-)-decarestrictine J." BIOSCIENCE BIOTECHNOLOGY AND BIOCHEMISTRY, vol. 59, no. 9, 1995, pages 1657-1660, XP002154631 ISSN: 0916-8451 cited in the application *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001081564A2 (en) * 2000-04-26 2001-11-01 Actinodrug Pharmaceuticals Gmbh Method for producing dna encoding polypeptides that are composed of several sections, and for producing polypeptides by expressing the dna thus obtained
WO2001081564A3 (en) * 2000-04-26 2002-04-25 Florian Schauwecker Method for producing DNA encoding polypeptides that are composed of several sections, and for producing polypeptides by expressing the DNA thus obtained
US8999679B2 (en) 2008-12-18 2015-04-07 Iti Scotland Limited Method for assembly of polynucleic acid sequences
US9777305B2 (en) 2010-06-23 2017-10-03 Iti Scotland Limited Method for the assembly of a polynucleic acid sequence
CN113728130A (en) * 2019-04-01 2021-11-30 国立大学法人神户大学 Construction method of chimeric plasmid library
EP3951028A4 (en) * 2019-04-01 2023-01-18 National University Corporation Kobe University Method for constructing chimeric plasmid library
US11643648B2 (en) 2019-04-01 2023-05-09 National University Corporation Kobe University Method for constructing chimeric plasmid library

Also Published As

Publication number Publication date
WO2000077181A3 (en) 2001-05-10
EP1190045A2 (en) 2002-03-27
AU5545700A (en) 2001-01-02
CA2376559A1 (en) 2000-12-21
GB9913694D0 (en) 1999-08-11

Similar Documents

Publication Publication Date Title
Bedford et al. Expression of a functional fungal polyketide synthase in the bacterium Streptomyces coelicolor A3 (2)
Oliynyk et al. A hybrid modular polyketide synthase obtained by domain swapping
Yoon et al. Generation of multiple bioactive macrolides by hybrid modular polyketide synthases in Streptomyces venezuelae
EP0910633B1 (en) Hybrid polyketide synthase I gene
JP3633629B2 (en) Cell-free synthesis of polyketides
Rodriguez et al. Rapid engineering of polyketide overproduction by gene transfer to industrially optimized strains
JP4489947B2 (en) Polyketides and their synthesis
CA2347412A1 (en) Recombinant oleandolide polyketide synthase
US6838265B2 (en) Overproduction hosts for biosynthesis of polyketides
Rodriguez et al. Heterologous production of polyketides in bacteria
US20060269528A1 (en) Production detection and use of transformant cells
EP1190045A2 (en) Dna manipulation methods , applications for synthetic enzymes and use for polyketide production
US20070059689A1 (en) Hybrid glycosylated products and their production and use
EP1414969B1 (en) Biosynthetic genes for butenyl-spinosyn insecticide production
WO2002097082A2 (en) Engineered biosynthesis of novel polyenes
US6828126B2 (en) Methods for introducing hydroxyl or epoxide groups into polyketides using OleP
US20040087003A1 (en) Methods and cells for improved production of polyketides
Zhang et al. Expanding Catalytic Versatility of Modular Polyketide Synthases for Alcohol Biosynthesis
US20050208629A1 (en) Plasmids for polyketide production
US20050233369A1 (en) Biosynthetic gene cluster for jerangolids
Bogdan et al. Molecular Biology of Polyketide Biosynthesis
AU2002305118A1 (en) Biosynthetic genes for butenyl-spinosyn insecticide production
WO2005118797A2 (en) Biosynthetic gene cluster for tautomycetin

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
AK Designated states

Kind code of ref document: A3

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

ENP Entry into the national phase

Ref document number: 2376559

Country of ref document: CA

Ref document number: 2376559

Country of ref document: CA

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: IN/PCT/2001/01144/DE

Country of ref document: IN

WWE Wipo information: entry into national phase

Ref document number: 2000940533

Country of ref document: EP

712F Gb: determination of foreign entitlement (section 12(1)/1977)
713D Gb: proceedings under sect. 13(1) pat. act 1977 ** application filed
WWP Wipo information: published in national office

Ref document number: 2000940533

Country of ref document: EP

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

NENP Non-entry into the national phase

Ref country code: JP

DPE2 Request for preliminary examination filed before expiration of 19th month from priority date (pct application filed from 20040101)