AU2013203045B2 - System for capturing and modifying large pieces of genomic DNA and constructing organisms with synthetic chloroplasts - Google Patents

System for capturing and modifying large pieces of genomic DNA and constructing organisms with synthetic chloroplasts Download PDF

Info

Publication number
AU2013203045B2
AU2013203045B2 AU2013203045A AU2013203045A AU2013203045B2 AU 2013203045 B2 AU2013203045 B2 AU 2013203045B2 AU 2013203045 A AU2013203045 A AU 2013203045A AU 2013203045 A AU2013203045 A AU 2013203045A AU 2013203045 B2 AU2013203045 B2 AU 2013203045B2
Authority
AU
Australia
Prior art keywords
yeast
dna
vectors
vector
chloroplast
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
AU2013203045A
Other versions
AU2013203045A1 (en
Inventor
O'niell Bryan
Mike Mendez
Kari Mikkelson
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sapphire Energy Inc
Original Assignee
Sapphire Energy Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from AU2008307471A external-priority patent/AU2008307471B2/en
Application filed by Sapphire Energy Inc filed Critical Sapphire Energy Inc
Priority to AU2013203045A priority Critical patent/AU2013203045B2/en
Publication of AU2013203045A1 publication Critical patent/AU2013203045A1/en
Application granted granted Critical
Publication of AU2013203045B2 publication Critical patent/AU2013203045B2/en
Ceased legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

The functional analysis of genes frequently requires the manipulation of large genomic regions. A yeast-bacteria shuttle vector is described that can be used to clone large regions of DNA by homologous recombination. Also described is a method for isolating entire genomes, including 5 chloroplast genomes, or large portions thereof, and manipulating the same. Also described are methods for determining minimal genomes, minimal pathway requirements, and minimal organelle genomes.

Description

P100/011 Regulation 3.2 AUSTRALIA Patents Act 1990 COMPLETE SPECIFICATION STANDARD PATENT Invention Title: System for capturing and modifying large pieces of genomic DNA and constructing organisms with synthetic chloroplasts The following statement is a full description of this invention, including the best method of performing it known to us: SYSTEM FOR CAPTURING AND MODIFYING LARGE PIECES OF GENOMIC DNA AND CONSTRUCTING ORGANISMS WITH SYNTHETIC CHLOROPLASTS CROSS-REFERENCE 100011 This application is a divisional of Australian patent application no. 2008307471, 5 which claims priority to and benefit of U.S. Provisional Application No. 60/978,024 (filed October 5, 2007), which application is incorporated herein by reference. BACKGROUND OF THE INVENTION [0001a] Reference to any prior art in the specification is not, and should not be taken as, an acknowledgment or any form of suggestion that this prior art forms part of the common general 10 knowledge in Australia or any other jurisdiction or that this prior art could reasonably be expected to be ascertained, understood and regarded as relevant by a person skilled in the art. [00021 For the functional analysis of many genes, investigators need to isolate and manipulate large DNA fragments. The advent of genomics and the study of genomic regions of DNA have generated a need for vectors capable of carrying large DNA regions. 15 [00031 In general, two types of yeast vector systems are presently available. The first type of vector is one capable of transferring small insert DNA between yeast and bacteria. A second type of vector is a fragmenting vector which creates interstitial or terminal deletions in yeast artificial chromosomes (YACs). The small insert shuttle vectors are able to recombine with and recover homologous sequences. They are centromere-based and replicate stably and autonomously in yeast, 20 but also contain a high-copy origin of replication for maintenance as bacterial plasmids. However, these vectors are limited by their small insert capacity. The second type of vector (also known as fragmenting vectors) has recombinogenic sequences, but is unable to transfer the recovered insert DNA to bacteria for large preparations of DNA. 100041 Researchers use fragmentation techniques to narrow down the region of interest in 25 YACs. However, isolating sufficient quantities of YAC DNA from agarose gels for microinjection or electroporation remains cumbersome. Purification remains a problem when the YAC comigrates with an endogenous chromosome. In addition, YACs may be chimeric or contain additional DNA regions that are not required for the particular functional study. [00051 Types of vectors available for cloning large fragments in bacteria are cosmids, PIs 30 and bacterial artificial chromosomes (BACs). These vectors are limited to bacteria and cannot be 1 A 1000643322 shuttled to yeast for modification by homologous recombination. Bacterial vectors are also limited in their use for transforming plants and algae. For example, though chloroplasts are thought to originate from the endosymbiosis of photosynthetic bacteria into eukaryotic hosts translation of chloroplasts in more complex. Adding to the complexity of genetically engineering plants and algae is the presence of multiple chloroplasts with multiple copies of the chloroplast genome. Thus, there exists a need for developing a method to express proteins from large fragments of DNA in the chloroplasts of plants and algae. SUMMARY OF THE INVENTION [0006] The present invention relates to compositions and methods of isolating, characterizing, and/or modifying large DNA, including entire genomes of bacteria and chloroplasts. The compositions include shuttle vectors into which target DNA may be inserted. The methods include modifying or manipulating target DNA by removing, adding or rearranging portions and introducing the modified DNA into a host. [0006A] The present invention relates to an assembled chloroplast genome comprising all chloroplast genes necessary for photosynthesis, a yeast DNA replication element, a bacterial DNA replication element, and a yeast stability element comprising at least one yeast selection marker, wherein the assembled chloroplast genome is a circular construct. [0006B] The present invention relates to a plurality of vectors suitable for assembling into a chloroplast genome, said plurality of vectors collectively comprising all chloroplast genes necessary for photosynthesis; wherein each vector in said plurality of vectors comprises two regions for homologous recombination with one or two other vectors in said plurality of vectors such that said vector can be assembled into a circular construct; wherein at least one vector of said plurality of vectors comprises a yeast DNA replication element, at least one vector of said plurality of vectors comprises a bacterial DNA replication element; and 1B 1000643322 wherein at least one vector of said plurality of vectors comprises a yeast stability element comprising at least one yeast selection marker and said at least one vector comprising a yeast stability element does not contain a yeast DNA replication element. [0006C] In one embodiment, the present invention provides a method of assembling a chloroplast genome, comprising: obtaining a plurality of vectors collectively comprising all chloroplast genes necessary to carry out photosynthesis; wherein each vector in said plurality of vectors comprises two regions for homologous recombination with one or two other vectors in said plurality of vectors, wherein at least one vector of said plurality of vectors comprises a yeast DNA replication element and at least one vector of said plurality of vectors comprises a bacterial DNA replication element, wherein at least one vector of said plurality of vectors comprises a yeast stability element comprising at least one yeast selection marker and said at least one vector comprises a yeast stability element does not contain a yeast DNA replication element; introducing said plurality of vectors into yeast to assemble said plurality of vectors by homologous recombination to produce an assembled chloroplast genome; and selecting for complete assembly of said chloroplast genome by selecting for the presence of said at least one yeast stability element. [0007] Another aspect of the present invention provides an isolated vector comprising a yeast element, a bacterial origin of replication, and at least 20 kb genomic DNA. In some vectors, the yeast element is a yeast centromere, a yeast autonomous replicating sequence, yeast auxotrophic marker, or a combination thereof. The DNA may be from ic a non-vascular photosynthetic organism, for example a macroalgae, microalgae, Ch. vulgaris, C. reinhardtii, D. salina, S. quadricanda or H. pluvalis. In some embodiments, the genomic DNA is modified, for example by insertion of a heterologous or homologous polynucleotide, deletion of one or more nucleic acid bases, mutation of one or more nucleic acid bases, rearrangement of one or more polynucleotides, or a combination thereof. In some 5 instances, the modification is synthetic. Vectors of the present invention, when transformed into a host cell, may result in production of a product not naturally produced by the host cell. Some examples of such products include biomass-degrading enzymes, a fatty acids, terpenes or terpenoids. In some host cells, expression of the vector results in an increase production of a product naturally produced by said host cell, for example, a biomass-degrading' enzyme, a terpene or a terpenoid. The vectors of the present invention may further comprise one or more selection 10 markers, for example, a yeast marker, a yeast antibiotic resistance marker, a yeast auxotrophic marker, a bacterial marker, a bacterial antibiotic resistance marker, a bacterial auxotrophic marker, an algae marker, an algae antibiotic resistance marker, an algae auxotrophic marker, or a combination thereof. Vectors of the present invention may also contain chloroplast genomic DNA which comprises 1) 1-200 genes; 2) all essential chloroplast genes;and/or 3) 30 400 kb. 15 100081 Also described herein is a host cell comprising the vectors described herein. Exemplary host cells may be naturally non-photosynthetic or photosynthetic and include, for example, Saccharomyces cerevisiae, Escherichia coli, macroalgae, microalgae, Ch. vulgaris, C. reinhardii, D. salina, S. quadricanda or H. pluvalis. [00091 In another aspect of the invention, a method for producing a vector is provided where the method involves inserting targeting DNA into a vector - where the vector comprises a yeast centromere, a yeast autonomous 20 replicating sequence, and a bacterial origin of replication, transforming an organism with the vector and capturing a portion of a chloroplast genome, thus producing a vector with a portion of a chloroplast genome. In some instances, the targeting DNA is chloroplast genomic DNA. This method may be used to capture a portion of a genome which is 10-400 kb in length. In some instances, the capturing step occurs by recombination. The captured portion of a chloroplast genome may be co-transformed into an organism with a vector, thus the recombination step may occur 25 in vivo. Organisms used to practice methods disclosed herein may be eukaryotic and/or photosynthetic. In some instances, the organism is a non-vascular photosynthetic organism, for example a macroalgae, microalgae, Ch. vulgaris, C. reinhardtii, D. salina, S. quadricanda or H. pluvalis. Organisms used to practice methods disclosed herein may also be non-photosynthetic, for example yeast. In some instances, a non-photosynthetic organism may contain exogenous chloroplast DNA. In some embodiments, an additional step of modifying a portion of a 30 chloroplast genome is utilized. A modification may be achieved through homologous recombination. Such recombination may occur in an organism, for example a eukaryotic and/or photosynthetic organism. In some instances, the organism is a non-vascular photosynthetic organism, for example a macroalgae, microalgae, Ch. vulgaris, C. reinhardiii, D. salina, S. quadricanda or H. pluvalis. In other instances, the organism may be non photosynthetic, for example a yeast. In embodiments with a modification step, the step may comprise addition of a 35 polynucleotide, deletion of one or more nucleic acid bases, mutation of one or more nucleic acid bases, rearrangement or a polynucleotide, or combination thereof. 100101 Further disclosed herein is an isolated vector comprising essential chloroplast genes, a selectable marker and a manipulation in one or more nucleic acids in the vector. In some instances, essential chloroplast genes are cloned from a non-vascular photoshynthetic organism such as macroalgae, microalgae, Ch. vulgaris, C. reinhardtii, 40 D. salina, S. quadricanda or H. pluvalis. Essential chloroplast genes for use in the vectors described herein may be synthetic. The vectors described herein may further comprise an expression cassette, which may further comprise a region for integration into target DNA, for example organelle DNA. The vectors described herein may also contain 2 one or more selection markers, for example, an auxotrophic marker, an antibiotic resistance marker, a chloroplast marker, or combinations thereof. In some instances, the essential chloroplast genes are those required for chloroplast function, photosynthesis, carbon fixation, production of one or more hydrocarbons, or a combination thereof: Essential chloroplast genes may comprise up to 200 genes and/or consist of up to 400 kb. In some of the 5 vectors described herein a manipulation in one or more nucleic acids is an addition, deletion, mutation, or rearrangement. In some instances, expression of the vector in a host cell produces a product not naturally produced by said host cell. In other instances, expression of a vector of the present invention results in an increase production of a product naturally produced by said host cell. Examples of such products are biomass degrading enzymes, fatty acids, terpenes or terpenoids. 10 100111 As described herein, one aspect of the present invention is an isolated chloroplast comprising a vector of the present invention. In another aspect, a host cell comprising a vector of the present invention is provided. Host cells useful in the present invention may be naturally non-photosynthetic or naturally photosynthetic. Examples of organisms useful for the present invention include Saccharomyces cerevisiae, Escherichia coli, macroalgae, microalgae, Ch. vulgaris, C. reinhardiji, D. salina, S. quadricanda or H. pluvalis. 15 [00121 In another aspect of the present invention, a method for transforming a cell or organism is provided where the method comprises inserting into said cell or organism a vector comprising all essential chloroplast genes and optionally one or more genes not naturally occurring in said cell or organism. In some embodiments, the method further comprises the step of eliminating substantially all chloroplast genomes in said cell or organism. A cell or organism useful for this method may be photosynthetic, non-photosynthetic and/or eukaryotic. A cell or organism 20 useful for this method may be non-vascular. In some instances, the vector for use in this method may also comprise an expression cassette and the expression cassette may be capable of integrating into non-nuclear DNA. In one embodiment the one or more genes not naturally occurring in the cell or organism is a gene in the isoprenoid pathway, MVA pathway, or MEP pathway. In another embodiment, the essential chloroplast genes are those that are required for chloroplast function, photosynthesis, carbon fixation, production of one or more hydrocarbons, or a 25 combination thereof. 100131 Further provided herein is a method for modifying an organism comprising the steps of transforming the organism with a vector comprising one or more polynucleotides sufficient to perform chloroplast function. In some instances, a vector useful for this method further comprises a sequence for production or secretion of a compound from said organism. In some instances, the compound is an isoprenoid. In other instances, the vector comprises all 30 essential chloroplast genes. In still other instances, the essential chloroplast genes are rearranged or mutated. An organism useful for some embodiments comprises essentially no chloroplast genome prior to transformation. [00141 Yet another method provided herein is a method for making a product from an organism comprising the step of transforming said organism with a vector comprising at least 20 kb of genomic DNA and one or more of the following: (i) a gene not naturally occurring in said organism; (ii) a deletion in a gene naturally occurring in said 35 organism; (iii) a rearrangement of genes naturally occurring in said organism; and (iv) a mutation in a gene naturally occurring in said organism. In some instances, the organism is naturally photosynthetic. In other instances, the additional genes encode enzymes in the isoprenoid pathway, MVA pathway, or MEP pathway. In still another embodiment, the present disclosure provides a method for transforming a cell or organism comprising inserting into said cell or organism a chloroplast and a vector comprising all essential chloroplast genes. 40 [0015] The present disclosure also provides a method of producing an artificial chloroplast genome comprising the steps of: (a) providing a vector comprising one or more essential chloroplast genes; (b) adding to said vector a DNA fragment; (c) transforming a cell or organism with the vector produced by step (b); and (d) determining whether 3 chloroplast function exists with said added DNA fragment. In some instances, the added DNA fragments comprises one or more coding regions for an enzyme in the isoprenoid, MVA or MEP pathway. [0016] The present disclosure also provides a shuttle vector comprising a chloroplast genome. A genome may be modified. Also provided herein is a vector comprising an isolated, functional chloroplast genome. A chloroplast 5 genome useful in such a vector may be modified. [00171 Further provided herein is a method of producing an artificial chloroplast genome comprising the steps of: (a) providing a vector comprising all essential chloroplast genes; and (b) removing, adding, mutating, or rearranging DNA from the chloroplast genome. Such a method may further comprise the steps of transforming a redacted genome into a host organism; and (d) determining chloroplast function in the host organism. In some instances, 10 steps (b), (c), and (d) are repeated. In still other instances, the chloroplast genome is from an organism selected from the group consisting of: macroalgae, microalgae, Ch. vulgaris, C. reinhardtii, D. salina, S. quadricanda or H. pluvalis. In other instances, the host organism is selected from the group consisting of: macroalgae, microalgae, Ch. vulgaris, C. reinhardtii, D. salina, S. quadricanda or H. pluvalis. For some embodiments, the method may further comprise the step of removing redundant DNA from a chloroplast genome. In other embodiments, the vector 15 comprises all or substantially all of a chloroplast genome. A chloroplast genome useful in the present invention may be cloned from a photosynthetic organism or may be a synthetic chloroplast genome. In some instances, the vector further comprises a gene not naturally occurring in the host organism, for example a gene from the isoprenoid pathway, MVA pathway, or MEP pathway. 100181 Yet another method provided herein is a method of producing an artificial chloroplast genome comprising 20 the steps of: (a) providing a vector comprising an entire chloroplast genome; (b) deleting a portion of said entire chloroplast genome; and (c) determining whether chloroplast function exists without said deleted portion. In another aspect of the present invention, a composition comprising an isolated and functional chloroplast genome is provided. In some instances, a composition comprises a modification to said chloroplast genome. [0019] Further provided herein is an ex vivo vector comprising a nucleic acid comprising at least about 10% of a 25 chloroplast genome and a manipulation in one or more nucleic acids in the vector. In some instances, the nucleic acid is cloned from a non-vascular photosynthetic organism, for example a macroalgae, microalgae, Ch. vulgaris, C. reinhardtii, D. salina, S. quadricanda or H. pluvalis. In some instances, the nucleic acid is synthetic. A vector of the present invention may further comprise an expression cassette and an expression cassette may further comprise a region for integration into target DNA. In some instances, the target DNA is organelle DNA. A vector useful in the 30 present invention may further comprise one or more selection markers, for example an auxotrophic marker, an antibiotic resistance marker, a chloroplast marker, or combinations thereof In some embodiments, a manipulation in one or more nucleic acids in a vector may be an addition, deletion, mutation, or rearrangement. Expression of the vector may result in production of a product not naturally produced by a host cell and/or an increase production of a product naturally produced by a host cell. Examples of some products of the present invention include a terpene, 35 terpenoid, fatty acid, or biomass degrading enzyme. [00201 Also provided herein is an ex vivo vector comprising a nucleic acid comprising at least about 20 kilobases of a chloroplast genome and a manipulation in one or more nucleic acids in said vector. In some instances, the nucleic acid is cloned from a non-vascular photosynthetic organism, for example a macroalgae, microalgae, Ch. vulgaris, C. reinhardtii, D. salina, S. quadricanda or H. pluvalis. In some instances, the nucleic acid is synthetic. A 40 vector of the present invention may further comprise an expression cassette and an expression cassette may further comprise a region for integration into target DNA. In some instances, the target DNA is organelle DNA. A vector useful in the present invention may further comprise one or more selection markers, for example an auxotrophic 4 marker, an antibiotic resistance marker, a chloroplast marker, or combinations thereof. In some embodiments, a manipulation in one or more nucleic acids in a vector may be an addition, deletion, mutation, or rearrangement. Expression of the vector may result in production of a product not naturally produced by a host cell and/or an increase production of a product naturally produced by a host cell. Examples of some products of the present 5 invention include a terpene, terpenoid, fatty acid, or biomass degrading enzyme. 100211 Further provided herein is a method of producing a vector containing a reconstructed genome, comprising: introducing two or more vectors into a host cell, wherein the vectors comprise fragments of a genome, recombining the vectors into a single vector comprising at least about 90% of a genome, thereby producing a vector containing a reconstructed genome. In some instances, the host cell is eukaryotic, for example, S. cerevisiae. In other instances, 10 the genome is an organelle genome. The organelle may be a chloroplast, for example a chloroplast from an alga, particularly a microalga such as Ch. vulgaris, C. reinhardii, D. salina, S. quadricanda or H. pluvalis. In some instances, the two or more vectors comprise a selectable marker. In other instances, at least one of said fragments is synthetic. In still other instances, a further step comprising modifying a portion of the genome is useful in this method. Such a modification may comprise an addition, deletion, mutation, or rearrangement. In other 15 embodiments, the modification is the addition of an exogenous nucleic acid which results in the production or increased production of a terpene, terpenoid, fatty acid or biomass degrading enzyme. INCORPORATION BY REFERENCE 100221 All publications and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication or patent application was specifically and individually indicated 20 to be incorporated by reference. BRIEF DESCRIPTION OF THE DRAWINGS [00231 The novel features of the invention are set forth with particularity in the appended claims. A better understanding of the features and advantages of the present invention will be obtained by reference to the following detailed description that sets forth illustrative embodiments, in which the principles of the invention are utilized, and 25 the accompanying drawings of which: [00241 In the figures, the following abbreviations are used: HIS3: yeast HIS3 gene; TRPJ: yeast TRPI gene; URA3: yeast URA3 gene; ADE2: yeast ADE2 gene; LYS2: yeast LYS2 gene; TEL: yeast telomere; CEN: yeast centromere; ARS: autonomously replicating sequences, yeast origin of replication; 5FOA: 5-fluoroorotic acid; Kan: kanamycin resistance gene; P I plasmid rep: P1 plasmid replicon; p I lytic rep: p1 lytic replicon. 30 100251 Figure 1 provides a general description of a hybrid vector of the present invention. Figure IA) Vector schematic. Figure IB) DNA shuttling between organisms. 100261 Figure 2 is a schematic showing construction of a hybrid vector. 100271 Figure 3 is a schematic showing sites of integration in chloroplast genome DNA. Circled numbers indicate target sites for modification. The box indicates the site targeted by the hybrid gap-filling vector 35 [00281 Figures 4 A, B, and C is a schematic of selectable markers for modification and/or stabilization. 100291 Figure 5 is a schematic for introduction of hybrid vector into chloroplast genome DNA. 100301 Figure 6 is PCR data showing integration of hybrid vector (and stabilization vector) in algae. [00311 Figure 7 shows analysis of captured DNA. Figure 7A) Restriction digest with EcoRI of isolated vectors containing chloroplast(L, ladder; C, parent hybrid vector; 1, Clone 1; and 2; Clone 2). Figure 7B) Restriction digest 40 with EcoRl of isolates of Clone I that were passaged through yeast (L, ladder; C,; 1, Clone 1; and A-M; yeast 5 isolates) Figure 7C) Southern analysis of Clones 1 and 2 digested with EcoRl and probed with radioactive HindlYl-digested total DNA from C reinhardtii. [0032] Figures 8 A and B is a schematic showing architecture of isolated ex vivo vectors containing chloroplast genome DNA. 5 [00331 Figure 9 shows growth of parent and transformed algae cells under various selection conditions. [00341 Figure 10 is a schematic of restriction analysis for manipulation vectors Figure 10A) Schematic of vector architecture Figure 10B) Analysis of modified vector by restriction analysis with EcoRI. 10 [00351 Figure 11 shows modification of a chloroplast genome to produce a biomass degrading enzyme Figure 11 A) PCR screen of isolated transformants Figure 11 B) Endoxylanase activity from isolated transformants. [00361 Figures 12 shows a PCR screen of isolated transformants modified to produce isoprenoids with the FPP synthase expression cassette targeted to site 3 (Figure 12A) or site 4 15 (Figure 12B). 100371 Figure 13 is a schematic showing capture of a partial chloroplast genome using recombination in yeast. 10038] Figure 14 is a schematic showing capture of an entire chloroplast genome using recombination in yeast. 20 [00391 Figure 15 is a schematic showing reassembly of an entire chloroplast genome using recombination in yeast. DETAILED DESCRIPTION OF THE INVENTION [00401 While preferred embodiments of the present invention have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided 25 by way of example only Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the invention It should be understood that various alternatives to the embodiments of the invention described herein may be employed in practicing 6 the invention It is intended that the following claims define the scope of the invention and that methods and structures within the scope of these claims and their equivalents be covered thereby. [0040a] As used herein, except where the context requires otherwise the term 'comprise' and variations of the term, such as 'comprising', 'comprises' and 'comprised', are not intended 5 to exclude other additives, components, integers or steps. [00411 Technical and scientific terms used herein have the meanings commonly understood by one of ordinary skill in the art to which the instant invention pertains, unless otherwise defined Reference is made herein to various materials and methodologies known to those of skill in the art Standard reference works setting forth the general principles of 10 recombinant DNA technology include Sambrook et al , "Molecular Cloning A Laboratory Manual", 2d ed , Cold Spring Harbor Laboratory Press, Plainview, N Y , 1989, Kaufman et al, eds, "Handbook of Molecular and Cellular Methods in Biology and Medicine", CRC Press, Boca Raton, 1995, and McPherson, ed, "Directed Mutagenesis A Practical Approach", IRL Press, Oxford, 1991 Standard reference literature teaching general methodologies and principles of 15 yeast genetics useful for selected aspects of the invention include Sherman et al "Laboratory Course Manual Methods in Yeast Genetics", Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1986 and Guthrie et al, "Guide to Yeast Genetics and Molecular Biology", Academic, New York, 1991. [00421 Any suitable materials and/or methods known to those of skill can be utilized in 20 carrying out the instant invention Materials and/or methods for practicing the instant invention are described Materials, reagents and the like to which reference is made in the following description and examples are obtainable from commercial sources, unless otherwise noted This invention teaches methods and describes tools for capturing and modifying large pieces of DNA It is especially useful for modifying genomic DNA, including the whole genome of an organism 25 or organelle, or a part thereof Novel prophetic uses of the invention are also described The invention relates to the manipulation and delivery of large nucleic acids The invention further relates to recombinational cloning vectors and systems and to methods of using the same. 1004311 Contemporary methods for genetically engineering genomes (eg, chloroplast genomes) are time intensive (> 1 month) and allow for only a limited number of manipulations at 30 a time If multiple modifications to a target 6a genome are desired, the process must be iterated, further increasing the time required to generate a desired strain. Because metabolic engineering and/or synthetic biology require numerous modifications to a genome, these technologies are not amenable to rapid introduction of modifications to a genome. Thus, a new technology that allows for multiple modification of the chloroplast genome in a short amount of time will enable the application of 5 metabolic engineering and/or synthetic biology to chloroplast genomes. The disclosure herein describes such technology. 100441 The instant invention provides a versatile, recombinational approach to the capture, cloning, and manipulation of large nucleic acids from target cells and organelles (e.g., chloroplasts). One aspect of the present invention provides a recombinational cloning system. More specifically, the invention provides vectors, which in 10 some embodiments, rely on homologous recombination technologies to mediate the isolation and manipulation of large nucleic acid segments. Another aspect of the present invention provides methods for using such recombinational cloning vectors to clone, to manipulate and to deliver large nucleic acids to target cells and/or organelles such as chloroplasts. [00451 In one preferred embodiment, homologous recombination is performed in vitro. In a particularly preferred 15 embodiment of the invention, homologous recombination is performed in vivo. In a more preferred embodiment, homologous recombination occurs in an algae cell. In another more preferred embodiment, homologous recombination occurs in a yeast cell. In one preferred embodiment, homologous recombination occurs in Saccharomyces cerevisiae or Saccharomycespombe. In yeast, the combination of efficient recombination processes and the availability of numerous selectable markers provides for rapid and complex engineering of target DNA 20 sequences. Once all of the modifications are made to an ex vivo vector containing chloroplast genome DNA, the entire vector can be introduced into a chloroplast in a single transformation step. Thus, employing yeast technology will enable the application of metabolic engineering and/or synthetic biology to chloroplast genomes. One aspect of the present invention provides an isolated vector comprising a yeast element, a bacterial origin of replication, and at least 20 kb genomic DNA. In some vectors, the yeast element is a yeast centromere, a yeast autonomous replicating 25 sequence, or a combination thereof. The DNA may be from a non-vascular photosynthetic organism, for example a macroalgae, microalgae, Ch. vulgaris, C. reinhardtii, D. salina, S. quadricanda or H pluvalis. In some embodiments, the genomic DNA is modified, for example by insertion of a heterologous or homologous polynucleotide, deletion of one or more nucleic acid bases, mutation of one or more nucleic acid bases, rearrangement of one or more polynucleotides, or a combination thereof. In some instances, the modification is 30 synthetic. Vectors of the present invention, when transformed into a host cell, may result in production of a product not naturally produced by the host cell. Some examples of such products include biomass-degrading enzymes, a fatty acids, terpenes or terpenoids. In some host cells, expression of the vector results in an increase production of a product naturally produced by said host cell, for example, a biomass-degrading enzyme, a terpene or a terpenoid. The vectors of the present invention may further comprise one or more selection markers, for example, a yeast 35 marker, a yeast antibiotic resistance marker, a bacterial marker, a bacterial antibiotic resistance marker, an algae marker, an algae antibiotic resistance marker or a combination thereof. Vectors of the present invention may also contain chloroplast genomic DNA which comprises 1) 1-200 genes; 2) all essential chloroplast genes;and/or 3) 30 400 kb. [0046] Also described herein is a host cell comprising the vectors described herein. Exemplary host cells may be 40 naturally non-photosynthetic or photosynthetic and include, for example, Saccharomyces cerevisiae, Escherichia coli, macroalgae, microalgae, Ch. vulgaris, C. reinhardiji, D. salina, S. quadricanda or H. pluvalis. 7 100471 In another aspect of the invention, a method for producing a vector is provided where the method involves inserting targeting DNA into a vector - where the vector comprises a yeast centromere, a yeast autonomous replicating sequence, and a bacterial origin of replication, transforming an organism with the vector and capturing a portion of a chloroplast genome, thus producing a vector with a portion of a chloroplast genome. In some instances, 5 the targeting DNA is chloroplast genomic DNA. This method may be used to capture a portion of a genome which is 10-400 kb in length. In some instances, the capturing step occurs by recombination. The captured portion of a chloroplast genome may be co-transformed into an organism with a vector, thus the recombination step may occur in vivo. Organisms used to practice methods disclosed herein may be eukaryotic and/or photosynthetic. In some instances, the organism is a non-vascular photosynthetic organism, for example a macroalgae, microalgae, Ch. 10 vulgaris, C. reinhardtii, D. salina, S. quadricanda or H. pluvalis. Organisms used to practice methods disclosed herein may also be non-photosynthetic, for example yeast. In some instances, a non-photosynthetic organism may contain exogenous chloroplast DNA. In some embodiments, an additional step of modifying a portion of a chloroplast genome is utilized. A modification may be achieved through homologous recombination. Such recombination may occur in an organism, for example a eukaryotic and/or photosynthetic organism. In some 15 instances, the organism is a non-vascular photosynthetic organism, for example a macroalgae, microalgae, Ch. vulgaris, C. reinhardtii, D. salina, S. quadricanda or H. pluvalis. In other instances, the organism may be non photosynthetic, for example a yeast. In embodiments with a modification step, the step may comprise addition of a polynucleotide, deletion of one or more nucleic acid bases, mutation of one or more nucleic acid bases, rearrangement or a polynucleotide, or combination thereof. 20 100481 Further disclosed herein is an isolated vector comprising essential chloroplast genes, a selectable marker and a manipulation in one or more nucleic acids in the vector. In some instances, essential chloroplast genes are cloned from a non-vascular photoshynthetic organism such as macroalgae, microalgae, Ch. vulgaris, C. reinhardtii, D. salina, S. quadricanda or H. pluvalis. Essential chloroplast genes for use in the vectors described herein may be synthetic. The vectors described herein may further comprise an expression cassette, which may further comprise a 25 region for integration into target DNA, for example organelle DNA. The vectors described herein may also contain one or more selection markers, for example, an auxotrophic marker, an antibiotic resistance marker, a chloroplast marker, or combinations thereof. In some instances, the essential chloroplast genes are those required for chloroplast function, photosynthesis, carbon fixation, production of one or more hydrocarbons, or a combination thereof. Essential chloroplast genes may comprise up to 200 genes and/or consist of up to 400 kb. In some of the 30 vectors described herein a manipulation in one or more nucleic acids is an addition, deletion, mutation, or rearrangement. In some instances, expression of the vector in a host cell produces a product not naturally produced by said host cell. In other instances, expression of a vector of the present invention results in an increase production of a product naturally produced by said host cell. Examples of such products are biomass degrading enzymes, fatty acids, terpenes or terpenoids. 35 100491 As described herein, one aspect of the present invention is an isolated chloroplast comprising a vector of the present invention. In another aspect, a host cell comprising a vector of the present invention is provided. Host cells useful in the present invention may be naturally non-photosynthetic or naturally photosynthetic. Examples of organisms useful for the present invention include Saccharomyces cerevisiae, Escherichia coli, macroalgae, microalgae, Ch. vulgaris, C. reinhardiii, D. salina, S. quadricanda or H. pluvalis. 40 [00501 In another aspect of the present invention, a method for transforming a cell or organism is provided where the method comprises inserting into said cell or organism a vector comprising all essential chloroplast genes and optionally one or more genes not naturally occurring in said cell or organism. In some embodiments, the method 8 further comprises the step of eliminating substantially all chloroplast genomes in said cell or organism. A cell or organism useful for this method may be photosynthetic, non-photosynthetic and/or eukaryotic. A cell or organism useful for this method may be non-vascular. In some instances, the vector for use in this method may also comprise an expression cassette and the expression cassette may be capable of integrating into non-nuclear DNA. In one 5 embodiment the one or more genes not naturally occurring in the cell or organism is a gene in the isoprenoid pathway, MVA pathway, or MEP pathway. In another embodiment, the essential chloroplast genes are those that are required for chloroplast function, photosynthesis, carbon fixation, production of one or more hydrocarbons, or a combination thereof. [00511 Further provided herein is a method for modifying an organism comprising the steps of transforming the 10 organism with a vector comprising one or more polynucleotides sufficient to perform chloroplast function. In some instances, a vector useful for this method further comprises a sequence for production or secretion of a compound from said organism. In some instances, the compound is an isoprenoid. In other instances, the vector comprises all essential chloroplast genes. In still other instances, the essential chloroplast genes are rearranged or mutated. An organism useful for some embodiments comprises essentially no chloroplast genome prior to transformation. 15 [00521 Yet another method provided herein is a method for making a product from an organism comprising the step of transforming said organism with a vector comprising at least 20 kb of genomic DNA and one or more of the following: (i) a gene not naturally occurring in said organism; (ii) a deletion in a gene naturally occurring in said organism; (iii) a rearrangement of genes naturally occurring in said organism; and (iv) a mutation in a gene naturally occurring in said organism. In some instances, the organism is naturally photosynthetic. In other instances, the 20 additional genes encode enzymes in the isoprenoid pathway, MVA pathway, or MEP pathway. In still another embodiment, the present disclosure provides a method for transforming a cell or organism comprising inserting into said cell or organism a chloroplast and a vector comprising all essential chloroplast genes. [00531 The present disclosure also provides a method of producing an artificial chloroplast genome comprising the steps of: (a) providing a vector comprising one or more essential chloroplast genes; (b) adding to said vector a DNA 25 fragment; (c) transforming a cell or organism with the vector produced by step (b); and (d) determining whether chloroplast function exists with said added DNA fragment. In some instances, the added DNA fragments comprises one or more coding regions for an enzyme in the isoprenoid, MVA or MEP pathway. 100541 The present disclosure also provides a shuttle vector comprising a chloroplast genome. A genome may be modified. Also provided herein is a vector comprising an isolated, functional chloroplast genome. A chloroplast 30 genome useful in such a vector may be modified. [0055] Further provided herein is a method of producing an artificial chloroplast genome comprising the steps of: (a) providing a vector comprising all essential chloroplast genes; and (b) removing, adding, mutating, or rearranging DNA from the chloroplast genome. Such a method may further comprise the steps of transforming a redacted genome into a host organism; and (d) determining chloroplast function in the host organism. In some instances, 35 steps (b), (c), and (d) are repeated. In still other instances, the chloroplast genome is from an organism selected from the group consisting of: macroalgae, microalgae, Ch. vulgaris, C. reinhardtii, D. salina, S. quadricanda or H. pluvalis. In other instances, the host organism is selected from the group consisting of: macroalgae, microalgae, Ch. vulgaris, C. reinhardtii, D. salina, S. quadricanda or H. pluvalis. For some embodiments, the method may further comprise the step of removing redundant DNA from a chloroplast genome. In other embodiments, the vector 40 comprises all or substantially all of a chloroplast genome. A chloroplast genome useful in the present invention may be cloned from a photosynthetic organism or may be a synthetic chloroplast genome. In some instances, the vector 9 further comprises a gene not naturally occurring in the host organism, for example a gene from the isoprenoid pathway, MVA pathway, or MEP pathway. [00561 Yet another method provided herein is a method of producing an artificial chloroplast genome comprising the steps of: (a) providing a vector comprising an entire chloroplast genome; (b) deleting a portion of said entire 5 chloroplast genome; and (c) determining whether chloroplast function exists without said deleted portion. In another aspect of the present invention, a composition comprising an isolated and functional chloroplast genome is provided. In some instances, a composition comprises a modification to said chloroplast genome. 100571 Further provided herein is an ex vivo vector comprising a nucleic acid comprising at least about 10% of a chloroplast genome and a manipulation in one or more nucleic acids in the vector. In some instances, the nucleic 10 acid is cloned from a non-vascular photosynthetic organism, for example a macroalgae, microalgae, Ch. vulgaris, C. reinhardtii, D. salina, S. quadricanda or H. pluvalis. In some instances, the nucleic acid is synthetic. A vector of the present invention may further comprise an expression cassette and an expression cassette may further comprise a region for integration into target DNA. In some instances, the target DNA is organelle DNA. A vector useful in the present invention may further comprise one or more selection markers, for example an auxotrophic marker, an 15 antibiotic resistance marker, a chloroplast marker, or combinations thereof. In some embodiments, a manipulation in one or more nucleic acids in a vector may be an addition, deletion, mutation, or rearrangement. Expression of the vector may result in production of a product not naturally produced by a host cell and/or an increase production of a product naturally produced by a host cell. Examples of some products of the present invention include a terpene, terpenoid, fatty acid, or biomass degrading enzyme. 20 [00581 Also provided herein is an ex vivo vector comprising a nucleic acid comprising at least about 20 kilobases of a chloroplast genome and a manipulation in one or more nucleic acids in said vector. In some instances, the nucleic acid is cloned from a non-vascular photosynthetic organism, for example a macroalgae, microalgae, Ch. vulgaris, C. reinhardtii, D. salina, S. quadricanda or H. pluvalis. In some instances, the nucleic acid is synthetic. A vector of the present invention may further comprise an expression cassette and an expression cassette may further 25 comprise a region for integration into target DNA. In some instances, the target DNA is organelle DNA. A vector useful in the present invention may further comprise one or more selection markers, for example an auxotrophic marker, an antibiotic resistance marker, a chloroplast marker, or combinations thereof. In some embodiments, a manipulation in one or more nucleic acids in a vector may be an addition, deletion, mutation, or rearrangement. Expression of the vector may result in production of a product not naturally produced by a host cell and/or an 30 increase production of a product naturally produced by a host cell. Examples of some products of the present invention include a terpene, terpenoid, fatty acid, or biomass degrading enzyme. 100591 Further provided herein is a method of producing a vector containing a reconstructed genome, comprising: introducing two or more vectors into a host cell, wherein the vectors comprise fragments of a genome, recombining the vectors into a single vector comprising at least about 90% of a genome, thereby producing a vector containing a 35 reconstructed genome. In some instances, the host cell is eukaryotic, for example, S. cerevisiae. In other instances, the genome is an organelle genome. The organelle may be a chloroplast, for example a chloroplast from an alga, particularly a microalga such as Ch. vulgaris, C. reinhardtii, D. salina, S. quadricanda or H. pluvalis. In some instances, the two or more vectors comprise a selectable marker. In other instances, at least one of said fragments is synthetic. In still other instances, a further step comprising modifying a portion of the genome is useful in this 40 method. Such a modification may comprise an addition, deletion, mutation, or rearrangement. In other embodiments, the modification is the addition of an exogenous nucleic acid which results in the production or increased production of a terpene, terpenoid, fatty acid or biomass degrading enzyme. 10 100601 Large DNA Cloning and Content 100611 An advantage of this invention is that it provides for the cloning, manipulation, and delivery of a vector containing chloroplast genome DNA consisting of up to all chloroplast genes (or sequences). The chloroplast genome DNA contained in the vector can be obtained by recombination of a hybrid cloning vector with one 5 contiguous fragment of DNA or by recombination of two or more contiguous fragments of DNA. 10062] The methods and compositions of the present invention may include captured and/or modified large pieces of DNA may comprise DNA from an organelle, such as mitochondrial DNA or plastid DNA (e.g., chloroplast DNA). The captured and/or modified large pieces of DNA may also comprise the entirety of an organelle's genome, e.g., a chloroplast genome. In other embodiments, the captured and/or modified large pieces of DNA 10 comprise a portion of a chloroplast genome. A chloroplast genome may originate from any vascular or non-vascular plant, including algae, bryophytes (e.g., mosses, ferns), gymnosperms (e.g., conifers), and angiosperms (e.g., flowering plants - trees, grasses, herbs, shrubs). A chloroplast genome, or essential portions thereof, may comprise synthetic DNA, rearranged DNA, deletions, additions, and/or mutations. A chloroplast genome, or portions thereof, may comprise a one or more deletions, additions, mutations, and/or rearrangements. The deletions, additions, 15 mutations, and/or rearrangements may be naturally found in an organism, for example a naturally occurring mutation, or may not be naturally found in nature. The chloroplast or plastid genomes of a number of organisms are widely available, for example, at the public database from the NCBI Organelle Genomes section available at http://www.ncbi.nlm.nih.gov/genomes/static/euko.html. [00631 The target DNA sequence described herein may comprise at least 1, 2, 3, 4, or 5 deletions, additions, 20 mutations, and/or rearrangements as compared to a control sequence (naturally occurring sequence). In some embodiments, the mutations may be functional or nonfunctional. For example, a functional mutation may have an effect on a cellular function when the mutation is present in a host cell as compared to a control cell without the mutation. A non-functional mutation may be silent in function, for example, there is no discernable difference in phenotype of a host cell without the mutation as compared to a cell with the mutation. 25 100641 Captured and/or modified large pieces of DNA (e.g., target DNA), may comprise a minimal or minimized chloroplast genome (e.g., the minimum number of genes and/or DNA fragment, required for chloroplast functionality). The captured and/or modified DNA may comprise the essential chloroplast genes, it may comprise a portion or all, or substantially all of the essential chloroplast genes. An essential gene may be a gene that is essential for one or more metabolic processes or biochemical pathways. An essential gene may be a gene required for 30 chloroplast function, such as photosynthesis, carbon fixation, or hydrocarbon production. An essential gene may also be a gene that is essential for gene expression, such as transcription, translation, or other process(es) that affect gene expression. The essential genes may comprise mutations or rearrangements. Essential genes may also comprise a minimally functional set of genes to perform a function. For example, a particular function (e.g., photosynthesis) may be performed inefficiently by a set of genes/gene products, however, this set would still 35 comprise essential genes because the function is still performed. [00651 Modified DNA may comprise at least 5, 10, 15, 20, 25, 30, 40, or 50 essential genes. In some embodiments, the DNA may comprise essential chloroplast genomic sequence of up to 150kb in length. The DNA may comprise essential chloroplast genes as well as non-essential chloroplast gene sequences. The DNA may be single stranded or double stranded, linear or circular, relaxed or supercoiled. The DNA may also be in the form of 40 an expression cassette. For example, an expression cassette may comprise an essential gene to be expressed in a host cell. The expression cassette may comprise one or more essential genes as well as DNA sequences that promote the expression of the essential genes. The expression cassette may also comprise a region for integration 11 into target DNA of a host. The expression cassette may also comprise one or more essential genes and one or more genes not naturally occurring in a host cell comprising the expression cassette. One of ordinary skill in the arts will easily ascertain various combinations of the aforementioned aspects of the expression cassettes. [00661 In other instances, captured and/or modified pieces of DNA may comprise the entire genome of a plastid or 5 organelle. For example, about 10%, 11%, 12%, 13%, 14%, 15%, 16%, 17%, 18%, 19%,20%,30%,31%,32%, 33%, 34%, 35%, 36%, 37%, 38%, 39%, 40%, 41%, 42%, 43%, 44%, 45%, 46%, 47%, 48%, 49%, 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% of a plastid genome, or more. In one embodiment the captured and/or 10 modified large pieces of DNA may comprise 10-100%, 20-100%, 30-100%, 40-100%, 50-100%, 60-100%, 70 100%, 80-100%, or 90-100% of the entire genome of a plastid or cell. 100671 In still other instances, the captured and/or modified large pieces of DNA may comprise about 10 kb, I 1 kb, 12 kb, 13 kb, 14 kb, 15 kb, 16 kb, 17 kb, 18 kb, 19 kb, 20 kb, 21 kb, 22 kb, 23 kb, 24 kb, 25 kb, 26 kb, 27 kb, 28 kb, 29 kb, 30 kb, 31 kb, 32 kb, 33 kb, 34 kb, 35 kb, 36 kb, 37 kb, 38 kb, 39 kb, 40 kb, 41 kb, 42 kb, 43 kb, 44 kb, 45 15 kb, 46 kb, 47 kb, 48 kb, 49 kb, 50 kb, 51 kb, 52 kb, 53 kb, 54 kb, 55 kb, 56 kb, 57 kb, 58 kb, 59 kb, 60 kb, 61 kb, 62 kb, 63 kb, 64 kb, 65 kb, 66 kb, 67 kb, 68 kb, 69 kb, 70 kb, 71 kb, 72 kb, 73 kb, 74 kb, 75 kb, 76 kb, 77 kb, 78 kb, 79 kb, 80 kb, 81 kb, 82 kb, 83 kb, 84 kb, 85 kb, 86 kb, 87 kb, 88 kb, 89 kb, 90 kb, 91 kb, 92 kb, 93 kb, 94 kb, 95 kb, 96 kb, 97 kb, 98 kb, 99 kb, 100 kb, 101 kb, 102 kb, 103 kb, 104 kb, 105 kb, 106 kb, 107 kb, 108 kb, 109 kb, 110 kb, I Ikb, 112 kb, 113 kb, 114 kb, 115 kb, 116 kb, 117 kb, 118 kb, 119 kb, 120 kb, 121 kb,122 kb, 123 kb, 124 kb, 125 20 kb, 126 kb, 127 kb, 128 kb, 129 kb, 130 kb, 131 kb, 132 kb, 133 kb, 134 kb, 135 kb, 136 kb, 137 kb, 138 kb, 139 kb, 140 kb, 141 kb, 142 kb, 143 kb, 144 kb, 145 kb, 146 kb, 147 kb, 148 kb, 149 kb, 150 kb, 151 kb, 152 kb, 153 kb, 154 kb, 155 kb, 156 kb, 157 kb, 158 kb, 159 kb, 160 kb, 161 kb, 162 kb, 163 kb, 164 kb, 165 kb, 166 kb, 167 kb, 168 kb, 169 kb, 170 kb, 171 kb, 172 kb, 173 kb, 174 kb, 175 kb, 176 kb, 177 kb, 178 kb, 179 kb, 180 kb, 181 kb, 182 kb, 183 kb, 184 kb, 185 kb, 186 kb, 187 kb, 188 kb, 189 kb, 190 kb, 191 kb, 192 kb, 193 kb, 194 kb, 195 kb, 25 196 kb, 197 kb, 198 kb, 199 kb, 200 kb, 201 kb, 202 kb, 203 kb, 204 kb, 205 kb, 206 kb, 207 kb, 208 kb, 209 kb, 210 kb, 211 kb, 212 kb, 213 kb, 214 kb, 215 kb, 216 kb, 217 kb, 218 kb, 219 kb, 220 kb, 221 kb, 222 kb, 223 kb, 224 kb, 225 kb, 226 kb, 227 kb, 228 kb, 229 kb, 230 kb, 231 kb, 232 kb, 233 kb, 234 kb, 235 kb, 236 kb, 237 kb, 238 kb, 239 kb, 240 kb, 241 kb, 242 kb, 243 kb, 244 kb, 245 kb, 246 kb, 247 kb, 248 kb, 249 kb, 50 kb, 51 kb, 252 kb, 253 kb, 254 kb, 255 kb, 256 kb, 257 kb, 258 kb, 259 kb, 260 kb, 261 kb, 262 kb, 263 kb, 264 kb, 265 kb, 266 kb, 30 267 kb, 268 kb, 269 kb, 270 kb, 271 kb, 272 kb, 273 kb, 274 kb, 275 kb, 276 kb, 277 kb, 278 kb, 279 kb, 280 kb, 281 kb, 282 kb, 283 kb, 284 kb, 285 kb, 286 kb, 287 kb, 288 kb, 289 kb, 290 kb, 291 kb, 292 kb, 293 kb, 294 kb, 295 kb, 296 kb, 297 kb, 298 kb, 299 kb, 300 kb, 301 kb, 302 kb, 303 kb, 304 kb, 305 kb, 306 kb, 307 kb, 308 kb, 309 kb, 310 kb, 311 kb, 312 kb, 313 kb, 314 kb, 315 kb, 316 kb, 317 kb, 318 kb, 319 kb, 320 kb, 321 kb, 322 kb, 323 kb, 324 kb, 325 kb, 326 kb, 327 kb, 328 kb, 329 kb, 330 kb, 331 kb, 332 kb, 333 kb, 334 kb, 335 kb, 336 kb, 35 337 kb, 338 kb, 339 kb, 340 kb, 341 kb, 342 kb, 343 kb, 344 kb, 345 kb, 346 kb, 347 kb, 348 kb, 349 kb, 350 kb, 351 kb, 352 kb, 353 kb, 354 kb, 355 kb, 356 kb, 357 kb, 358 kb, 359 kb, 360 kb, 361 kb, 362 kb, 363 kb, 364 kb, 365 kb, 366 kb, 367 kb, 368 kb, 369 kb, 370 kb, 371 kb, 372 kb, 373 kb, 374 kb, 375 kb, 376 kb, 377 kb, 378 kb, 379 kb, 380 kb, 381 kb, 382 kb, 383 kb, 384 kb, 385 kb, 386 kb, 387 kb, 388 kb, 389 kb, 390 kb, 391 kb, 392 kb, 393 kb, 394 kb, 395 kb, 396 kb, 397 kb, 398 kb, 399 kb, 400 kb or more genomic (e.g., nuclear or organelle) DNA. 40 In some embodiments the captured and or modified large pieces of DNA may comprise about 10-400 kb, 50-350 kb, 100-300 kb, 100-200 kb, 200-300 kb, 150-200 kb, 200-250 kb genomic DNA 12 100681 In still other instances, the captured and or modified large pieces of DNA may comprise about 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90,91, 92,93, 94, 95,96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 5 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121,122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149,150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161,162, 163, 164, 165, 166, 167,168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182,1 83, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194,195, 196, 197, 198, 199, 200,201,202,203,204,205,206,207,208,209,210, 211,212,213,214,215,216,217,218,219,220, 221, 222, 10 223, 224, 225, 226,227,228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240,241, 242, 243, 244, 245, 246, 247, 248, 249, 50, 51, 252, 253, 254, 255, 256, 257, 258, 259, 260, 261,262,263,264, 265, 266, 267,268, 269, 270, 271, 272, 273,274,275,276,277,278,279,280,281,282,283,284,285,286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 299, 300 or more open reading frames, partial open reading frames, pseudogenes and/or repeating sequences. 15 100691 The present invention also provides vectors comprising a cassette-able chloroplast genome or portion thereof (e.g., a removable DNA fragment comprising a chloroplast genome or functional portion thereof). A vector of the present invention may comprise functional chloroplast units (e.g., a unit essential for metabolic processes, photosynthesis, gene expression, photosystem I, photosystem II). Vectors of the present invention may comprise a transplantable chloroplast genome or portion thereof. Additionally, the vectors of the present invention may 20 comprise a transferable chloroplast genome or portion thereof. In other embodiments, the vectors comprise: 1) one or more large pieces of modified DNA; 2) all genes necessary to carry out photosynthesis; 3) all genes required for chloroplast survival and/or function; 4) essential chloroplast genes; and/or 5) sufficient naturally occurring or modified chloroplast genes to perform one or more chloroplast functions, such as photosynthesis. A vector may comprise a portion, substantially all, or all of the essential chloroplast genes. A vector may comprise at least 5, 10, 25 15, 20, 25, 30, 40, 50, 60, 70, 80, 90, 100 or more essential genes. 100701 A vector may comprise chloroplast DNA of 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250 kb or more in length. A vector may comprise essential chloroplast genes as well as non-essential chloroplast gene sequences. A vector may comprise one or more, or all, essential chlioroplast genes and/or one or more genes not naturally occurring in a host cell comprising a vector. In some 30 embodiments, a vector may comprise chloroplast genes and genes not naturally occurring in the chloroplast. A vector may comprise one or more essential chloroplast genes and/or one or more DNA sequences or genes involved in chloroplast function, photosynthesis, carbon fixation, and/or hydrocarbon production. For example, a vector may comprise a sequence required for photosynthesis and a sequence involved in the isoprenoid production, MVA, and/or MEP pathways, such as a DNA sequence encoding a terpene synthase, or other polypeptide that produces a 35 hydrocarbon, such as a terpene or isoprenoid. The invention further provides methods for cloning, manipulating and delivering a large target nucleic acid to a cell or particle, such as, for example, yeasts or bacteria. Certain embodiments of this method make use of a hybrid yeast-bacteria cloning system (See, e.g., U.S. Patent Nos. 5,866,404 and 7,083,971 and Hokanson et al., (2003) Human Gene Ther.: 14: 329-339). The vectors herein (e.g., cloning system) is comprised of a shuttle vector that contains elements for function and replication in both yeast and 40 bacteria, allowing it to stably function and replicate in either organism. This composition of functional and replicative elements yields a hybrid system which enjoys the benefits of both genetic engineering systems. The 13 genetics of yeast (e.g., S. cerevisiae) are well understood and a powerful assortment of molecular biology tools exists for genetic engineering in yeast. [00711 In another aspect the invention produces a gap-filled vector by homologous recombination among the two arms and the target nucleic acid. In still another embodiment, at least one arm further comprises an origin of 5 replication. In another embodiment of the invention, each arm further comprises a rare restriction endonuclease recognition site. Homologous recombination may be performed in vitro or in vivo, for example, in a fungal cell (e.g., S. cerevisiae, Sz. pombe or U maydis). The invention also provides a eukaryotic host cell harboring the recombinational cloning system or vector according to the invention, for example, in a fungal cell (e.g., S. cerevisiae, Sz. pombe or U maydis). 10 [00721 A gap-filled linear vector may be converted to a circular vector in vitro (e.g. using T4 ligase) or in vivo, for example, in a bacterium. The circular vectors of interest can be amplified, purified, cut and used to recover sufficient amounts of DNA to be introduced either directly into a cell or into a suitable delivery system for subsequent delivery to a target cell. The methodology offers great versatility to clone and to modify any large bacterial or non-bacterial genome, and easily facilitate the use thereof as recombinational vectors. Direct delivery of 15 a gap-filled vector into a cell may be performed by methods well known in the field such as, for example, calcium phosphate transformation methodologies or electroporation (see Sambrook et al., supra). [00731 Accordingly, the invention provides a method for producing a recombinant delivery unit including the steps of: (a) producing a gap-filled vector containing a target sequence; (b) optionally circularizing the gap-filled vector segments of the invention; (c) propagating the vector, and (d) introducing the gap-filled vector in a delivery unit. 20 [0074] Bacterial systems are useful for amplifying and purifying DNA, and for functionally testing the genetic modifications and their effect on pathways. One embodiment of the present invention provides cloning system will aid in the cloning and modifying of any large genome and easily facilitate the cloning and introduction of pathways. With the ability to deliver whole pathways, certain embodiments of the present invention allow for a system biological approach to problem solving. 25 [0075] In general, target DNA (e.g., genomic DNA) may be captured by creating sites allowing for homologous recombination in the vector. For example, such sites may be created by, but not limited to, gap-repair cloning, wherein a gap is created in the vector, usually by restrictive enzyme digestion prior to transformation into the yeast. When the target DNA is mixed with the vector, the target DNA is recombined into the vector. This operation is called "gap filling." This recombination can occur in bacteria, yeast, the original host organism, another organism, 30 or in vitro. In some embodiments, recombination is performed in yeast because of the high rate of homologous recombination. Once captured, the target DNA can be modified in many ways including adding, altering, or removing DNA sequences. In some embodiments, the target DNA is genomic DNA. In other embodiments, the target DNA is organelle (e.g., mitochondria or chloroplast) DNA. [00761 In some embodiments, target DNA is modified by adding, altering or removing genes, coding sequences, 35 partial coding sequences, regulatory elements, positive and/or negative selection markers, recombination sites, restriction sites, and/or codon bias sites. For example, the target DNA sequence may be codon biased for expression in the organism being transformed. The skilled artisan is well aware of the "codon-bias" exhibited by a specific host cell in usage of nucleotide codons to specify a given amino acid. Without being bound by theory, by using a host cell's preferred codons, the rate of translation may be greater. Therefore, when synthesizing a gene for improved 40 expression in a host cell, it may be desirable to design the gene such that its frequency of codon usage approaches the frequency of preferred codon usage of the host cell. The codons of the present invention may be A/T rich, for example, A/TI' rich in the third nucleotide position of the codons. Typically, the A/T rich codon bias is used for 14 algae. In some embodiments, at least 50% of the third nucleotide position of the codons are A or T. In other embodiments, at least 60%, 70%, 80%, 90%, or 99% of the third nucleotide position of the codons are A or T. (see also U.S. Publication No. 2004/0014174). 100771 Such manipulations are well known in the art and can be performed in numerous ways. In some 5 embodiments, the modifications may be performed using cloned sequences. In other embodiments, the modifications may be performed using synthetic DNA. [00781 Genetic manipulations include cloning large pieces of target DNA (e.g., chromosomes, genomes) and/or dividing and reorganizing target DNA based on functional relations between genes, such as metabolic pathways or operons. Genetic manipulations also include introducing and removing metabolic pathways, recombining DNA into 10 functional units (e.g., metabolic pathways, synthetic operons), and/or determining sites of instability in large pieces of DNA (e.g., sites where a native or non-native host tends to delete or recombine a sequence of DNA). [00791 Target DNA may be DNA from a prokaryote. Target DNA may also be genomic DNA, mitochondrial DNA, or chloroplast DNA from a eukaryote. Examples of such organisms from which genomic and/or organelle DNA may serve as target DNA include, but are not limited to Z. mobilis, algae (e.g., macroalgae or microalgae, such 15 as Chlamydomonas reinhardtii ), a rhodophyte, a chlorophyte, a heterokontophyte, a tribophyte, a glaucophyte, a chlorarachniophyte, a euglenoid, a haptophyte, a cryptomonad, a dinoflagellum, or a phytoplankton. One of skill in the art will recognize that these organisms are listed only as examples and that the methods disclosed herein are applicable to the large DNA from any organism, including bacteria, plants, fungi, protists, and animals. Genetic manipulations of the present invention may include stabilizing large pieces of DNA by removing or inserting 20 sequences that force transformed cells to preserve certain sequences of DNA and to stably maintain the sequences in its progeny. Genetic manipulations may also include altering codons of the target DNA, vector DNA, and/or synthetic DNA to reflect any codon bias of the host organism. Additionally, genetic manipulations of the present invention may include determining the minimal set of genes required for an organism to be viable. In another embodiment, the genetic manipulations of the present invention include determining the minimal set of genes 25 required for a certain metabolic pathway to be created or maintained. 100801 The genetic manipulations of the present invention may include determining redundant genes both within a genome, and between two genomes (e.g., redundancy between the nuclear and chloroplast genome). Additionally, the genetic manipulations of the present invention may include determining a functional sequence of DNA that could be artificially synthesized (e.g. the genes in a certain metabolic pathway, the genes of a functional genome). 30 In another embodiment, the genetic manipulations of the present invention include creating DNA and genomes packaged into cassettes (e.g., sites within a vector where genes can be easily inserted or removed). The genetic manipulations of the present invention may also include creating a nuclear or organelle genome that is viable in multiple species (e.g. a transplantable chloroplast genome). Furthermore, the genetic manipulations of the present invention may include a method for testing the viability of any of these manipulations or creations (e.g., transferring 35 a shuttle vector back into a host system and assaying for survival). 100811 Vectors, Markers and Transformation 100821 A vector or other recombinant nucleic acid molecule may include a nucleotide sequence encoding a selectable marker. The term or "selectable marker" or "selection marker" refers to a polynucleotide (or encoded polypeptide) that confers a detectable phenotype. A selectable marker generally encodes a detectable polypeptide, 40 for example, a green fluorescent protein or an enzyme such as luciferase, which, when contacted with an appropriate agent (a particular wavelength of light or luciferin, respectively) generates a signal that can be detected by eye or using appropriate instrumentation (Giacomin, Plant Sci. 116:59-72, 1996; Scikantha, J. Bacteriol. 178:121, 1996; 15 Gerdes, FEBS Lett. 389:44-47, 1996; see, also, Jefferson, EMBO J. 6:3901-3907, 1997, fl-glucuronidase). A selectable marker generally is a molecule that, when present or expressed in a cell, provides a selective advantage (or disadvantage) to the cell containing the marker, for example, the ability to grow in the presence of an agent that otherwise would kill the cell. 5 [00831 A selectable marker can provide a means to obtain prokaryotic cells or plant cells or both that express the marker and, therefore, can be useful as a component of a vector of the invention (see, for example, Bock, supra, 200 1). Examples of selectable markers include, but are not limited to, those that confer antimetabolite resistance, for example, dihydrofolate reductase, which confers resistance to methotrexate (Reiss, Plant Physiol. (Life Sci. Adv.) 13:143-149, 1994); neomycin phosphotransferase, which confers resistance to the aminoglycosides neomycin, 10 kanamycin and paromycin (Herrera-Estrella, EMBO J. 2:987-995, 1983), hygro, which confers resistance to hygromycin (Marsh, Gene 32:481-485, 1984), trpB, which allows cells to utilize indole in place of tryptophan; hisD, which allows cells to utilize histinol in place of histidine (Hartman, Proc. Nall. Acad. Sci., USA 85:8047, 1988); mannose-6-phosphate isomerase which allows cells to utilize mannose (WO 94/20627); ornithine decarboxylase, which confers resistance to the ornithine decarboxylase inhibitor, 2-(difluoromethyl)-DL-omithine (DFMO; 15 McConlogue, 1987, In: Current Communications in Molecular Biology, Cold Spring Harbor Laboratory ed.); and deaminase from Aspergillus terreus, which confers resistance to Blasticidin S (Tamura, Biosci. Biotechnol. Biochem. 59:2336-2338, 1995). Additional selectable markers include those that confer herbicide resistance, for example, phosphinothricin acetyltransferase gene, which confers resistance to phosphinothricin (White et al., Nucl. Acids Res. 18:1062, 1990; Spencer et al., Theor. Apple. Genet. 79:625-631, 1990), a mutant EPSPV-synthase, which 20 confers glyphosate resistance (Hinchee et al., BioTechnology 91:915-922, 1998), a mutant acetolactate synthase, which confers imidazolione or sulfonylurea resistance (Lee et al., EMBO J. 7:1241-1248, 1988), a mutant psbA, which confers resistance to atrazine (Smeda et al., Plant Physiol. 103:911-917, 1993), or a mutant protoporphyrinogen oxidase (see U.S. Pat. No. 5,767,373), or other markers conferring resistance to an herbicide such as glufosinate. Selectable markers include polynucleotides that confer dihydrofolate reductase (DHFR) or 25 neomycin resistance for eukaryotic cells and tetracycline; ampicillin resistance for prokaryotes such as E. coli; and bleomycin, gentamycin, glyphosate, hygromycin, kanamycin, methotrexate, phleomycin, phosphinotricin, spectinomycin, streptomycin, sulfonamide and sulfonylurea resistance in plants (see, for example, Maliga et al., Methods in Plant Molecular Biology, Cold Spring Harbor Laboratory Press, 1995, page 39). [00841 Methods for nuclear and plastid transformation are routine and well known for introducing a 30 polynucleotide into a plant cell chloroplast (see U.S. Pat. Nos. 5,451,513, 5,545,817, and 5,545,818; WO 95/16783; McBride et al., Proc. Nall. Acad. Sci., USA 91:7301-7305, 1994). In some embodiments, chloroplast transformation involves introducing regions of chloroplast DNA flanking a desired nucleotide sequence, allowing for homologous recombination of the exogenous DNA into the target chloroplast genome. In some instances one to 1.5 kb flanking nucleotide sequences of chloroplast genomic DNA may be used. Using this method, point mutations in the 35 chloroplast 16S rRNA and rps12 genes, which confer resistance to spectinomycin and streptomycin, can be utilized as selectable markers for transformation (Svab et al., Proc. Nail. Acad. Sci., USA 87:8526-8530, 1990), and can result in stable homoplasmic transformants, at a frequency of approximately one per 100 bombardments of target leaves. [00851 Microprojectile mediated transformation also can be used to introduce a polynucleotide into a plant cell 40 chloroplast (Klein et al., Nature 327:70-73, 1987). This method utilizes microprojectiles such as gold or tungsten, which are coated with the desired polynucleotide by precipitation with calcium chloride, spermidine or polyethylene glycol. The microprojectile particles are accelerated at high speed into a plant tissue using a device such as the 16 BIOLISTIC PD-1000 particle gun (BioRad; Hercules Calif.). Methods for the transformation using biolistic methods are well known in the art (see, e.g.; Christou, Trends in Plant Science 1:423-431, 1996). Microprojectile mediated transformation has been used, for example, to generate a variety of transgenic plant species, including cotton, tobacco, corn, hybrid poplar and papaya. Important cereal crops such as wheat, oat, barley, sorghum and rice also 5 have been transformed using microprojectile mediated delivery (Duan et al., Nature Biotech. 14:494-498, 1996; Shimamoto, Curr. Opin. Biotech. 5:158-162, 1994). The transformation of most dicotyledonous plants is possible with the methods described above. Transformation of monocotyledonous plants also can be transformed using, for example, biolistic methods as described above, protoplast transformation, electroporation of partially permeabilized cells, introduction of DNA using glass fibers, the glass bead agitation method, and the like. 10 [00861 Transformation frequency may be increased by replacement of recessive rRNA or r-protein antibiotic resistance genes with a dominant selectable marker, including, but not limited to the bacterial aadA gene (Svab and Maliga, Proc. Nat. Acad. Sci., USA 90:913-917, 1993). Approximately 15 to 20 cell division cycles following transformation are generally required to reach a homoplastidic state. It is apparent to one of skill in the art that a chloroplast may contain multiple copies of its genome, and therefore, the term "homoplasmic" or "homoplasmy" 15 refers to the state where all copies of a particular locus of interest are substantially identical. Plastid expression, in which genes are inserted by homologous recombination into all of the several thousand copies of the circular plastid genome present in each plant cell, takes advantage of the enormous copy number advantage over nuclear-expressed genes to permit expression levels that can readily exceed 10% of the total soluble plant protein. 100871 Any of the nucleotide sequences of target DNA, vector DNA, or synthetic DNA on the vectors of the 20 invention can further include codons biased for expression of the nucleotide sequences in the organism transformed. In some instances, codons in the nucleotide sequences are AfT rich in a third nucleotide position of the codons. For example, at least 50% of the third nucleotide position of the codons may be A or T. In other instances, the codons are G/C rich, for example at least 50% of the third nucleotide positions of the codons may be G or C. 100881 The nucleotide sequences of the shuttle vectors of the present invention can be adapted for chloroplast 25 expression. For example, a nucleotide sequence herein can comprise a chloroplast specific promoter or chloroplast specific regulatory control region. The nucleotide sequences can also be adapted for nuclear expression. For example, a nucleotide sequence can comprise a nuclear specific promoter or nuclear specific regulatory control regions. The nuclear sequences can encode a protein with a targeting sequence that encodes a chloroplast targeting protein (e.g., a chloroplast transit peptide), or a signal peptide that directs a protein to the endomembrane system for 30 deposition in the endoplasmic reticulum or plasma membrane. [00891 In embodiments where a vector encodes genes capable of fuel production, fuel products are produced by altering the enzymatic content of the cell to increase the biosynthesis of specific fuel molecules. For example, nucleotides sequences (e.g., an ORF isolated from an exogenous source) encoding biosynthetic enzymes can be introduced into the chloroplast of a photosynthetic organism. Nucleotide sequences encoding fuel biosynthetic 35 enzymes can also be introduced into the nuclear genome of the photosynthetic organisms. Nucleotide sequences introduced into the nuclear genome can direct accumulation of the biosynthetic enzyme in the cytoplasm of the cell, or may direct accumulation of the biosynthetic enzyme in the chloroplast of the photosynthetic organism. 100901 Any of the nucleotide sequences herein may further comprise a regulatory control sequence. Regulatory control sequences can include one or more of the following: a promoter, an intron, an exon, processing elements, 3' 40 untranslated region, 5' untranslated region, RNA stability elements, or translational enhancers A promoter may be one or more of the following: a promoter adapted for expression in the organism, an algal promoter, a chloroplast promoter, and a nuclear promoter, any of which may be a native or synthetic promoters. A regulatory control 17 sequence can be inducible or autoregulatable. A regulatory control sequence can include autologous and/or heterologous sequences. In some cases, control sequences can be flanked by a first homologous sequence and a second homologous sequence. The first and second homologous sequences can each be at least 500 nucleotides in length. The homologous sequences can allow for either homologous recombination or can act to insulate the 5 heterologous sequence to facilitate gene expression. 100911 Vectors may also comprise sequences involved in producing products useful as biopharmaceuticals, such as, but not limited to, antibodies (including functional portions thereof), interleukins and other immune modulators, and antibiotics. See, e.g., Mayfield et al., (2003) Proc. Nat'lAcad. Sci.: 100 (438-42) and U.S. Pub. No. 2004/0014174. 10 [00921 Vectors of the present invention may comprise a cassette-able bacterial genome or portion thereof (e.g., a removable DNA fragment comprising a bacterial genome or functional portion thereof). Additionally, vectors of the present invention may comprise functional genomic units (e.g., a unit essential for metabolic processes, biochemical pathways, gene expression). Vectors of the present invention may comprise a transplantable bacterial genome or portion thereof. Vectors of the present invention may comprise a transferable bacterial genome or portion thereof. 15 100931 In some embodiments, the large piece of target DNA is modified. The modified DNA may comprise all genes necessary to carry out ethanologenesis, all genes required for the Entner-Duodorff pathway, the glucose tolerance pathway, the ethanol tolerance pathway, the carboxylic acid byproduct resistance pathway, the acetic acid tolerance pathway, the sugar transport pathway, sugar fermentation pathways, and/or the cellulose and hemicellulose digestive pathways. 20 [00941 Hybrid cloning systems and methods of the invention combine the high versatility of yeast as a system for the capture and manipulation of a given nucleic acid and the high efficiency of bacterial systems for the amplification of such nucleic acid. Recombinational vectors relying on homologous recombination to mediate the isolation, manipulation and delivery of large nucleic acid fragments were constructed. The invention described herein also provides methods for using such recombinational cloning vectors to clone, to manipulate and to deliver 25 large nucleic acids. Finally, the invention provides methods for using such recombinational cloning systems as potentiators of biochemical pathway analysis, organelle analysis, and synthetic chloroplast construction. [00951 The vectors of the present invention may be introduced into yeast. The yeast may be a suitable strain of Saccharomyces cerevisiae; however, other yeast models may be utilized. Introduction of vectors into yeast may allow for genetic manipulation of the vectors. Yeast vectors have been described extensively in the literature and 30 methods of manipulating the same also are well known as discussed hereinafter (see e.g., Ketner et al. (1994) Proc. Natl. Acad. Sci. (USA) 91:6186 6190). 100961 Following genetic manipulation, the cloning system may allow for the transition to a bacterial environment, suitable for the preparation of larger quantities of nucleic acids. Representative examples of a bacterial type vectors include, but are not limited to, the P1 artificial chromosome, bacterial artificial chromosome (BAC) and single copy 35 plasmid F factors (Shizuya et al. (1992) Proc. Nat!. Acad. Sci. 89:8794 8797). Similarly, bacterial vectors are well known in the art (e.g., Ioannou et al. (1994) Nature 6:84 89). The invention also provides a shuttle vector comprising a yeast selectable marker, a bacterial selectable marker, a telomere, a centromere, a yeast origin of replication, and/or a bacterial origin of replication. 100971 Shuttle vectors of the present invention may enable homologous recombination in yeast to capture and to 40 integrate in a vector of interest a target nucleic acid of interest. Shuttle vectors may allow for the manipulation of target DNA in any of the hosts to which the vectors can be introduced. In some embodiments, after desired manipulations, shuttle vector components may be removed, leaving just the modified target DNA. Such extraction 18 of vector sequences can be performed using standard methodologies and may occur in any host cell. The target nucleic acid of interest can be a large nucleic acid, and can include, for example, a vector, such as a viral vector, including the foreign gene of interest contained therein. The target nucleic acid can also be a bacterial (including archaebacteria and eubacteria), viral, fungal, protist, plant or animal genome, or a portion thereof. For example, the 5 target nucleic acid of one embodiment of the present invention comprises an entire prokaryotic genome. As an additional example, a target nucleic acid of the present invention may comprises the chloroplast genome of a eukaryotic organism. [00981 Shuttle vectors according to the invention may comprise an appropriately oriented DNA that functions as a telomere in yeast and a centromere. Any suitable telomere may be used. Suitable telomeres include without 10 limitation telomeric repeats from many organisms, which can provide telomeric function in yeast. The terminal repeat sequence in humans (TTAGGG)N, is identical to that in trypanosomes and similar to that in yeast ((TG)1.
3 )N and Tetrahymena (TTGGG)N (Szostak & Blackburn (1982) Cell 29:245 255; Brown (1988) EMBOJ. 7:2377 2385; and Moyzis et al. (1988) Proc. NaIl. Acad. Sci. 85:6622 6626). 100991 The term "centromere" is used herein to identify a nucleic acid, which mediates the stable replication and 15 precise partitioning of the vectors of the invention at mciosis and at mitosis thereby ensuring proper segregation into daughter cells. Suitable centromeres include, without limitation, the yeast centromere, CEN4, which confers mitotic and meiotic stability on large linear plasmids (Murray & Szostak (1983) Nature 305:189 193; Carbon (1984) Cell 37:351 353; and Clark et al. (1990) Nature 287:504 509)). 1001001 In some embodiments, at least one of the two segments of the circular vector according to the invention 20 includes at least one replication system that is functional in a host cell/particle of choice. As it will become apparent hereinafter, one of skill will realize that the manipulation, amplification and/or delivery of a target nucleic acid of choice may entail the use of more than one host cell/particle. Accordingly, more than one replication system functional in each host cell/particle of choice may be included. 1001011 When a host cell(s) is a prokaryote, particularly E. coli, replication systems) include those which are 25 functional in prokaryotes, such as, for example, P1 plasmid replicon, ori, P1 lytic replicon, ColEI, BAC, single copy plasmid F factors and the like. Either one or both segments, and/or the circular vector, may further include a yeast origin of replication capable of supporting the replication of large nucleic acids. Non-limiting examples of replication regions according to the invention include the autonomously replicating sequence or "ARS element." ARS elements were identified as yeast sequences that conferred high-frequency transformation. Tetrahymena DNA 30 termini have been used as ARS elements in yeast along with ARS 1 and ARS4 (Kiss et al. (1981) Mol. Cell BioL. 1:535 543; Stinchcomb et al. (1979) Nature 282:39; and Barton & Smith (1986) MoL. Cell BioL. 6:2354). For each segment (e.g., those corresponding to the yeast and bacterial elements of the gap-filling shuttle vector) there may be two or more origins of replication. [001021 The first and/or the second segment according to an aspect of the invention may be joined in a circularized 35 vector form (e.g., plasmid form). Circularization can occur in vivo or in vitro using the segment of interest. Alternatively, a circular vector of interest can be used. As used herein, the term "vector" designates a plasmid or phage DNA or other nucleic acid into which DNA or other nucleic acid may be cloned. The vector may replicate autonomously in a host cell and may be characterized further by one or a small number of restriction endonuclease recognition sites at which such nucleic acids may be cut in a determinable fashion and into which nucleic acid 40 fragments may be inserted. The vector further may contain a selectable marker suitable for the identification of cells transformed with the vector. 19 1001031 Target nucleic acids of the invention may vary considerably in complexity. The target nucleic acid may include viral, prokaryotic or eukaryotic DNA, cDNA, exonic (coding), and/or intronic (noncoding) sequences. Hence, the target nucleic acid of the invention may include one or more genes. A target nucleic acid may be a chromosome, genome, or operon and/or a portion of a chromosome, genome or operon. A target nucleic acid may 5 comprise coding sequences for all the genes in a pathway, the minimum complement of genes necessary for survival of an organelle, and/or the minimum complement of genes necessary for survival of an organism. A target nucleic acid may comprise Zymomonas mobilis DNA sequence, including, but not limited to genomic DNA and/or cDNA. A target nucleic acid may comprise eukaryotic chloroplast DNA sequence, including but not limited to, chloroplast genome DNA and/or cDNA. A target nucleic acid may comprise cyanobacteria DNA, including but not limited to 10 genomic DNA and/or cDNA. The target nucleic acid also may be of any origin and/or nature. 1001041 It may be desirable for the gene to also comprise a promoter operably linked to the coding sequence in order to effectively promote transcription. Enhancers, repressors and other regulatory sequences may also be included in order to modulate activity of the gene, as is well known in the art. A gene as provided herein can refer to a gene that is found in the genome of the individual host cell (i.e., endogenous) or to a gene that is not found in the 15 genome of the individual host cell (i.e., exogenous or a "foreign gene"). Foreign genes may be from the same species as the host or from different species. For transfection of a cell using DNA containing a gene with the intent that the gene will be expressed in the cell, the DNA may contain any control sequences necessary for expression of the gene in the required orientation for expression. The term "intron" as used herein, refers to a DNA sequence present in a given gene which is not translated into protein and is generally found between exons. 20 1001051 Genetic elements, or polynucleotides comprising a region that encodes a polypeptide or a region that regulates transcription or translation or other processes important to expression of the polypeptide in a host cell, or a polynucleotide comprising both a region that encodes a polypeptide and a region operably linked thereto that regulates expression. Genetic elements may be comprised within a vector that replicates as an episomal element; that is, as a molecule physically independent of the host cell genome. They may be comprised within mini 25 chromosomes, such as those that arise during amplification of transfected DNA by methotrexate selection in eukaryotic cells. Genetic elements also may be comprised within a host cell genome; not in their natural state but, rather, following manipulation such as isolation, cloning and introduction into a host cell in the form of purified DNA or in a vector, among others. [001061 Vectors of the present invention may contain sufficient linear identity or similarity (homology) to have the 30 ability to hybridize to a portion of a target nucleic acid made or which is single-stranded, such as a gene, a transcriptional control element or intragenic DNA. Without being bound to theory, such hybridization is ordinarily the result of base-specific hydrogen bonding between complementary strands, preferably to form Watson-Crick base pairs. As a practical matter, such homology can be inferred from the observation of a homologous recombination event. In some embodiments, such homology is from about 8 to about 1000 bases of the linear nucleic acid. In 35 other embodiments, such homology is from about 12 to about 500 bases. One skilled in the art will appreciate that homology may extend over longer stretches of nucleic acids. 1001071 Homologous recombination is a type of genetic recombination, a process of physical rearrangement occurring between two strands of DNA. Homologous recombination involves the alignment of similar sequences, a crossover between the aligned DNA strands, and breaking and repair of the DNA to produce an exchange of 40 material between the strands. The process homologous recombination naturally occurs in organisms and is also utilized as a molecular biology technique for introducing genetic changes into organism. 20 [001081 The vectors of the invention may be modified further to include functional entities other than the target sequence which may find use in the preparation of the construct(s), amplification, transformation or transfection of a host cell, and--if applicable--for integration in a host cell. For example, the vector may comprise regions for integration into host DNA. Integration may be into nuclear DNA of a host cell. In some embodiments, integration 5 may be into non-nuclear DNA, such as chloroplast DNA. Other functional entities of the vectors may include, but are not limited to, markers, linkers and restriction sites. [001091 A target nucleic acid may include a regulatory nucleic acid. This refers to any sequence or nucleic acid which modulates (either directly or indirectly, and either up or down) the replication, transcription and/or expression of a nucleic acid controlled thereby. Control by such regulatory nucleic acid may make a nucleic acid constitutively 10 or inducibly transcribed and/or translated. Any of the nucleotide sequences herein may further comprise a regulatory control sequence. Examples of regulatory control sequences can include, without limitation, one or more of the following: a promoter, an intron, an exon, processing elements, 3' untranslated region, 5' untranslated region, RNA stability elements, or translational enhancers. A promoter may be one or more of the following: a promoter adapted for expression in the organism (e.g., bacterial, fungal, viral, plant, mammalian, or protist), an algal 15 promoter, a chloroplast (or other plastid) promoter, a mitochondrial promoter, and a nuclear promoter, any of which may be a native or synthetic promoters. A regulatory control sequence can be inducible or autoregulatable. A regulatory control sequence can include autologous and/or heterologous sequences. In some cases, control sequences can be flanked by a first homologous sequence and a second homologous sequence. The first and second homologous sequences can each be at least 500 nucleotides in length. The homologous sequences can allow for 20 either homologous recombination or can act to insulate the heterologous sequence to facilitate gene expression. [001101 In some instances, target DNA, vector DNA or other DNA present in a shuttle vector of the present invention does not result in production of a polypeptide product but rather allows for secretion of the product from the cell. In these cases, the nucleotide sequence may encode a protein that enhances or initiates or increases the rate of secretion of a product from an organism to the external environment. Thus, segments of vectors and/or vectors of 25 the invention may include a transcriptional regulatory region such as, for example, a transcriptional initiation region. One skilled in the art will appreciate that a multitude of transcriptional initiation sequences have been isolated and are available, including thymidine kinase promoters, beta-actin promoters, immunoglobin promoters, methallothionein promoters, human cytomegalovirus promoters and SV40 promoters. (001111 One embodiment of the invention provides a method of producing a gap-filled vector. A gap-filled vector 30 may undergo homologous recombination and insertion of a target nucleic acid according to the invention by filling in the region (gap) between the sequences homologous to the 5' and the 3' regions of the target nucleic acid. Hence, in some embodiments, one would contact the instant cloning system with a target nucleic acid under conditions that allow homologous recombination. [001121 Another method combines: (i) a first segment including a first nucleic acid homologous to the 5' terminus 35 of a target nucleic acid, a first selectable marker and a first cyclization element; (ii) a target nucleic acid; and (iii) a second segment including a second nucleic acid homologous to the 3' terminus of a target nucleic acid, a second selectable marker and a second cyclization element, under conditions which allow homologous recombination. One embodiment of the invention produces a gap-filled vector by homologous recombination between the two arms and the target nucleic acid. The exchange between the homologous regions found in the arms and the target nucleic acid 40 is effected by homologous recombination at any point between the homologous nucleic acids. With respect to a circular vector of the present invention, the "gap filling" essentially is insertion (i.e., subcloning) of the target sequence into the vector. 21 1001131 Homologous recombination may be effected in vitro according to methodologies well known m the art. For example, the method of the invention can be practiced using yeast lysate preparations. Homologous recombination may take place in vivo. Hence, the method of the invention may be practiced using any host cell capable of supporting homologous recombination events such as, for example, bacteria, yeast and mammalian cells. 5 One skilled in the art will appreciate that the choice of a suitable host depends on the particular combination of selectable markers used in the cloning system of the method. 1001141 Techniques that may be used to introduce the vector into a host cell of interest include calcium phosphate/DNA coprecipitation, electroporation, bacterial-protoplast fusion, microinjection of DNA into the nucleus and so on. One of skill will appreciate that a number of protocols may be used virtually interchangeably, for 10 example, to transfect mammalian cells, as set forth for example in Keown et al. (Meth. Enzymol. 185:527 537, 1990). 100115) Transformation may be achieved by using a soil bacterium, such as Agrobacterium tumefaciens. Agrobacterium tumefaciens may carry an engineered plasmid vector, or carrier of selected extra genes. Plant tissue, such as leaves, are cut in small pieces, eg. IOxOmm, and soaked for 10 minutes in a fluid containing suspended 15 Agrobacterium. Some cells along the cut is transformed by the bacterium, that inserts its DNA into the cell. Placed on selectable rooting and shooting media, the plants will regrow. Some plants species can be transformed just by dipping the flowers into suspension of Agrobacteria and then planting the seeds in a selective medium. [001161 Another methodology is use of a "gene gun" approach. The gene gun is part of a method called the biolistic (also known as bioballistic) method, and under certain conditions, DNA (or RNA) become "sticky," 20 adhering to biologically inert particles such as metal atoms (usually tungsten or gold). By accelerating this DNA particle complex in a partial vacuum and placing the target tissue within the acceleration path, DNA is effectively introduced. Uncoated metal particles could also be shot through a solution containing DNA surrounding the cell thus picking up the genetic material and proceeding into the living cell. A perforated plate stops the shell cartridge but allows the slivers of metal to pass through and into the living cells on the other side. The cells that take up the 25 desired DNA, identified through the use of a marker gene (in plants the use of GUS is most common), are then cultured to replicate the gene and possibly cloned. The biolistic method is most useful for inserting genes into plant cells such as pesticide or herbicide resistance. Different methods have been used to accelerate the particles: these include pneumatic devices; instruments utilizing a mechanical impulse or macroprojectile; centripetal, magnetic or electrostatic forces; spray or vaccination guns; and apparatus based on acceleration by shock wave, such as electric 30 discharge (for example, see Christou and McCabe, 1992, Agracetus, Inc. Particle Gun Transformation of Crop Plants Using Electric Discharge (ACCELLTM Technology)). [001171 Transformation can be performed, for example, according to the method of Cohen et al. (Proc. Natl. Acad. Sci. USA, 69:2110 (1972)), the protoplast method (Mol. Gen. Genet., 168:111 (1979)), or the competent method (J. Mol. Biol., 56:209 (1971)) when the hosts are bacteria (E. coli, Bacillus subtilis, and such), the method of Hinnen et 35 al. (Proc. Natl. Acad. Sci. USA, 75:1927 (1978)), or the lithium method (J. Bacteriol., 153:163 (1983)) when the host is S. cerevisiae, the method of Graham (Virology, 52:456 (1973)) when the hosts are animal cells, and the method of Summers et al. (Mol. Cell. Biol., 3:2156-2165 (1983)) when the hosts are insect cells. Typically, following a transformation event, potential transformants are plated on nutrient media for selection and/or cultivation. 40 [001181 The nutrient media preferably comprises a carbon source, an inorganic nitrogen source, or an organic nitrogen source necessary for the growth of host cells (transformants). Examples of the carbon source are glucose, dextran, soluble starch, and sucrose, and examples of the inorganic or organic nitrogen source are ammonium salts, 22 nitrates, amino acids, corn steep liquor, peptone, casein, meat extract, soy bean cake, and potato extract. If desired, the media may comprise other nutrients (for example, an inorganic salt (for example, calcium chloride, sodium dihydrogenphosphate, and magnesium chloride), vitamins, antibiotics (for example, tetracycline, neomycin, ampicillin, kananmycin, etc.). Media for some photosynthetic organisms may not require a carbon source as such 5 organisms may be photoautotrophs and, thus, can produce their own carbon sources. 1001191 Cultivation and/or selection are performed by methods known in the art. Cultivation and selection conditions such as temperature, pH of the media, and cultivation time are selected appropriately for the vectors, host cells and methods of the present invention. One of skill in the art will recognize that there are numerous specific media and cultivation/selection conditions which can be used depending on the type of host cell (transformant) and 10 the nature of the vector (e.g., which selectable markers are present). The media herein are merely described by way of example and are not limiting. 1001201 When the hosts are bacteria, actinomycetes, yeast, or filamentous fungi, media comprising the nutrient source(s) mentioned above are appropriate. When the host is E. coli, examples of preferable media are LB media, M9 media (Miller et al. Exp. Mol. Genet., Cold Spring Harbor Laboratory, p.431 (1972)), and so on. When the host 15 is yeast, an example of medium is Burkhoter minimal medium (Bostian, Proc. Nail. Acad. Sci. USA, 77:4505 (1980)). (001211 The selection of vectors in yeast may be accomplished by the use of yeast selectable markers. Examples include, but are not limited to, HIS3, TRP1, URA3, LEU2 and ADE markers. In some embodiments of the invention, a vector or segment thereof may comprise two or more selectable markers. Thus, in one embodiment, a segment of 20 a vector of the present invention may comprise an ADE marker to be lost upon homologous recombination with the target nucleic acid, and a HIS3 marker. The other segment may comprise a TRPI marker. Selection is achieved by growing transformed cells on a suitable drop-out selection media (see e.g., Watson et al. (1992) Recombinant DNA, 2.sup.nd ed., Freeman and Co., New York, N.Y.). For example, HIS3 allows for selection of cells containing the first segment. TRPl allows for selection of cells containing the second segment. ADE allows screening and 25 selection of clones in which homologous recombination took place. ADE enables color selection (red). [001221 Recombinant yeast cells may be selected using the selectable markers described herein according to methods well known in the art. Hence, one skilled in the art will appreciate that recombinant yeast cells harboring a gap-filled vector of the invention may be selected on the basis of the selectable markers included therein. For example, recombinant vectors carrying HIS3 and TRPI may be selected by growing transformed yeast cells in the 30 presence of drop-out selection media lacking histidine and tryptophan. Isolated positive clones may be purified further and analyzed to ascertain the presence and structure of the recombinant vector of the invention by, e.g., restriction analysis, electrophoresis, Southern blot analysis, polymerase chain reaction or the like. The invention further provides gap-filled vectors engineered according to the method of the invention. Such a vector is the product of homologous recombination between the segments or vectors of the invention and a target nucleic acid of choice. 35 The invention also provides a prokaryotic cell and/or a eukaryotic host cell harboring the cloning system or vector according to the invention. The organism can be unicellular or multicellular. The organism may be naturally photosynthetic or naturally non-photosynthetic. Other examples of organisms that can be transformed include vascular and non-vascular organisms. When hosts, such as plant, yeast, animal, algal, or insect cells are used, a vector of the present invention may contain, at least, a promoter, an initiation codon, the polynucleotide encoding a 40 protein, and a termination codon. The vectors of the present invention may also contain, if required, a polynucleotide for gene amplification (marker) that is usually used. 1001231 Products 23 1001241 The vectors of the present invention may comprise sequences that result in production of a product naturally, or not naturally, produced in the organism comprising the vector. In some instances the product encoded by one or more sequences on a vector is a polypeptide, for example an enzyme. Enzymes utilized in practicing the present invention may be encoded by nucleotide sequences derived from any organism, including bacteria, plants, 5 fungi and animals. Vectors may also comprise nucleotide sequences that affect the production or secretion of a product from the organism. In some instances, such nucleotide sequence(s) encode one or more enzymes that function in isoprenoid biosynthetic pathway. Examples of polypeptides in the isoprenoid biosynthetic pathway include synthases such as C5, C10, C15, C20, C30, and C40 synthases. In some instances, the enzymes are isoprenoid producing enzymes. In some instances, an isoprenoid producing enzyme produces isoprenoids with two 10 phosphate groups (e.g., GPP synthase, FPP synthase, DMAPP synthase). In other instances, isoprenoid producing enzymes produce isoprenoids with zero, one, three or more phosphates or may produce isoprenoids with other functional groups. Polynucleotides encoding enzymes and other proteins useful in the present invention may be isolated and/or synthesized by any means known in the art, including, but not limited to cloning, sub-cloning, and PCR. 15 1001251 An isoprenoid producing enzyme for use in the present invention may also be botryococcene synthase, p caryophyllene synthase, germacrene A synthase, 8-epicedrol synthase, valencene synthase, (+)-S-cadinene synthase, germacrene C synthase, (E)-s-farnesene synthase, casbene synthase, vetispiradiene synthase, 5-epi-aristolochene synthase, aristolchene synthase, ot-humulene, (E,E)-a-farnesene synthase, (- )-p-pinene synthase, y-terpinene synthase, limonene cyclase, linalool synthase, (+)-bornyl diphosphate synthase, levopimaradiene synthase, 20 isopimaradiene synthase, (E)-y-bisabolene synthase, copalyl pyrophosphate synthase, kaurene synthase, longifolene synthase, y-humulene synthase, 6-selinene synthase, P- phellandrene synthase, terpinolene synthase, (+)-3-carene synthase, syn-copalyl diphosphate synthase, a-terpineol synthase, syn-pimara-7,15-diene synthase, ent sandaaracopimaradiene synthase, sterner- 13 -ene synthase, E-s-ocimene, S-linalool synthase, geraniol synthase, 'y terpinene synthase, linalool synthase, E-P-ocimene synthase, epi-cedrol synthase, a-zingiberene synthase, 25 guaiadiene synthase, cascarilladiene synthase, cis-muuroladiene synthase, aphidicolan- 1 6b-ol synthase, elizabethatriene synthase, sandalol synthase, patchoulol synthase, zinzanol synthase, cedrol synthase, scareol synthase, copalol synthase, or manool synthase. 1001261 Other enzymes which may be produced by vectors of the present invention include biomass-degrading enzymes. Non-limiting examples of biomass-degrading enzymes include: cellulolytic enzymes, hemicellulolytic 30 enzymes, pectinolytic enzymes, xylanases, ligninolytic enzymes, cellulases, cellobiases, softening enzymes (e.g., endopolygalacturonase), amylases, lipases, proteases, RNAses, DNAses, inulinase, lysing enzymes, phospholipases, pectinase, pullulanase, glucose isomerase, endoxylanase, beta-xylosidase, alpha-L-arabinofuranosidase, alpha glucoronidase, alpha-galactosidase, acetylxylan esterase, and feruloyl esterase. Examples of genes that encode such enzymes include, but are not limited to, amylases, cellulases, hemicellulases, (e.g., p-glucosidase, endocellulase, 35 exocellulase), exo-o-glucanase, endo-o-glucanase and xylanse (endoxylanase and exoxylanse). Examples of ligninolytic enzymes include, but are not limited to, lignin peroxidase and manganese peroxidase from Phanerochaete chryososporium. One of skill in the art will recognize that these enzymes are only a partial list of enzymes which could be used in the present invention. [001271 The present invention contemplates making enzymes that contribute to the production of fatty acids, lipids 40 or oils by transforming host cells (e.g., alga cells such as C. reinhardtii, D. salina, H. pluvalis and cyanobacterial cells) and/or organisms comprising host cells with nucleic acids encoding one or more different enzymes. In some 24 embodiments the enzymes that contribute to the production of fatty acids, lipids or oils are anabolic enzymes. Some examples of anabolic enzymes that contribute to the synthesis of fatty acids include, but are not limited to, acetyl CoA carboxylase, ketoreductase, thioesterase, malonyltransferase, dehydratase, acyl-CoA ligase, ketoacylsynthase, enoylreductase and a desaturase. In some embodiments the enzymes are catabolic or biodegrading enzymes. In 5 some embodiments, a single enzyme is produced. (001281 Some host cells may be transformed with multiple genes encoding one or more enzymes. For example, a single transformed cell may contain exogenous nucleic acids encoding enzymes that make up an entire fatty acid synthesis pathway. One example of a pathway might include genes encoding an acetyl CoA carboxylase, a malonyltransferase, a ketoacylsynthase, and a thioesterase. Cells transformed with entire pathways and/or enzymes 10 extracted from them, can synthesize complete fatty acids or intermediates of the fatty acid synthesis pathway. In some embodiments constructs may contain multiple copies of the same gene, and/or multiple genes encoding the same enzyme from different organisms, and/or multiple genes with mutations in one or more parts of the coding sequences. 1001291 In some instances, a product (e.g. fuel, fragrance, insecticide) is a hydrocarbon-rich molecule, e.g. a 15 terpene. A terpene (classified by the number of isoprene units) can be a hemiterpene, monoterpene, sesquiterpene, diterpene, triterpene, or tetraterpene. In specific embodiments the terpene is a terpenoid (aka isoprenoid), such as a steroid or carotenoid. Subclasses of carotenoids include carotenes and xanthophylls. In specific embodiments, a fuel product is limonene, 1, 8-cineole, a-pinene, camphene, (+)-sabinene, myrcene, abietadiene, taxadiene, farnesyl pyrophosphate, amorphadiene, (E)-a-bisabolene, beta carotene, alpha carotene, lycopene, or diapophytoene. Some 20 of these terpenes are pure hydrocarbons (e.g. limonene) and others are hydrocarbon derivatives (e.g. cineole). [001301 Examples of fuel products include petrochemical products and their precursors and all other substances that may be useful in the petrochemical industry. Fuel products include, for example, petroleum products, and precursors of petroleum, as well as petrochemicals and precursors thereof. The fuel product may be used for generating substances, or materials, useful in the petrochemical industry, including petroleum products and petrochemicals. 25 The fuel or fuel products may be used in a combustor such as a boiler, kiln, dryer or furnace. Other examples of combustors are internal combustion engines such as vehicle engines or generators, including gasoline engines, diesel engines, jet engines, and others. Fuel products may also be used to produce plastics, resins, fibers, elastomers, lubricants, and gels. 1001311 Examples of products contemplated herein include hydrocarbon products and hydrocarbon derivative 30 products. A hydrocarbon product is one that consists of only hydrogen molecules and carbon molecules. A hydrocarbon derivative product is a hydrocarbon product with one or more heteroatoms, wherein the heteroatom is any atom that is not hydrogen or carbon. Examples of heteroatoms include, but not limited to, nitrogen, oxygen, sulfur, and phosphorus. Some products are hydrocarbon-rich, wherein as least 50%, 60%, 70%, 80%, 90%, or 95% of the product by weight is made up carbon and hydrogen. 35 1001321 Fuel products, such as hydrocarbons, may be precursors or products conventionally derived from crude oil, or petroleum, such as, but not limited to, liquid petroleum gas, naptha (ligroin), gasoline, kerosene, diesel, lubricating oil, heavy gas, coke, asphalt, tar, and waxes. For example, fuel products may include small alkanes (for example, I to approximately 4 carbons) such as methane, ethane, propane, or butane, which may be used for heating (such as in cooking) or making plastics. Fuel products may also include molecules with a carbon backbone of 40 approximately 5 to approximately 9 carbon atoms, such as naptha or ligroin, or their precursors. Other fuel products may be about 5 to about 12 carbon atoms or cycloalkanes used as gasoline or motor fuel. Molecules and aromatics of approximately 10 to approximately 18 carbons, such as kerosene, or its precursors, may also be fuel products. 25 Fuel products may also include molecules, or their precursors, with more than 12 carbons, such as used for lubricating oil. Other fuel products include heavy gas or fuel oil, or their precursors, typically containing alkanes, cycloalkanes, and aromatics of approximately 20 to approximately 70 carbons. Fuel products also includes other residuals from crude oil, such as coke, asphalt, tar, and waxes, generally containing multiple rings with about 70 or 5 more carbons, and their precursors. 1001331 Host Cells and Organisms [001341 Examples of organisms that can be transformed using the vectors and methods herein include vascular and non-vascular organisms. The organism can be prokaroytic or eukaroytic. The organism can be unicellular or multicellular. 10 [00135] Eukaryotic cells, such as a fungal cell (e.g., Saccharomyces cerevisiae, Schizosaccharomyces pombe or Ustilago maydis) may be transformed using the methods and compositions of the present invention. Methods for introducing nucleic acids in a fungal/yeast cells are well known in the art. Hence, such a step may be accomplished by conventional transformation methodologies. Non-limiting examples of suitable methodologies include electroporation, alkali cations protocols and spheroplast transformation. 15 1001361 Examples of non-vascular photosynthetic organisms include bryophtyes, such as marchantiophytes or anthocerotophytes. In some instances the organism is a cyanobacteria. In some instances, the organism is algae (e.g., macroalgae or microalgae). The algae can be unicellular or multicellular algae. In some instances the organism is a rhodophyte, chlorophyte, heterokontophyte, tribophyte, glaucophyte, chlorarachniophyte, euglenoid, haptophyte, cryptomonad, dinoflagellum, or phytoplankton. 20 1001371 The methods of the present invention are exemplified using the microalga, C. reinhardtii. The use of microalgae to express a polypeptide or protein complex according to a method of the invention provides the advantage that large populations of the microalgae can be grown, including commercially (Cyanotech Corp.; Kailua Kona HI), thus allowing for production and, if desired, isolation of large amounts of a desired product. However, the ability to express, for example, functional polypeptides, including protein complexes, in the chloroplasts of any plant 25 and/or modify the chloroplasts or any plant allows for production of crops of such plants and, therefore, the ability to conveniently produce large amounts of the polypeptides. Accordingly, the methods of the invention can be practiced using any plant having chloroplasts, including, for example, macroalgae, for example, marine algae and seaweeds, as well as plants that grow in soil, for example, corn (Zea mays), Brassica sp. (e.g., B. napus, B. rapa, B. juncea), particularly those Brassica species useful as sources of seed oil, alfalfa (Medicago saliva), rice (Oryza 30 saliva), rye (Secale cereale), sorghum (Sorghum bicolor, Sorghum vulgare), millet (e.g., pearl millet (Pennisetum glaucum), proso millet (Panicum miliaceum), foxtail millet (Setaria italica), finger millet (Eleusine coracana)), sunflower (Helianthus annuus), safflower (Carthamus tinclorius), wheat (Triticum aestivum), soybean (Glycine max), tobacco (Nicotiana tabacum), potato (Solanum tuberosum), peanuts (Arachis hypogaea), cotton (Gossypium barbadense, Gossypium hirsutum), sweet potato (ipomoea batatus), cassaya (Manihot esculenta), coffee (Cofea 35 spp.), coconut (Cocos nucifera), pineapple (Ananas comosus), citrus trees (Citrus spp.), cocoa (Theobroma cacao), tea (Camellia sinensis), banana (Musa spp.), avocado (Persea ultilane), fig (Ficus casica), guava (Psidium guajava), mango (Mangifera indica), olive (Olea europaea), papaya (Carica papaya), cashew (Anacardium occidentale), macadamia (Macadamia integrifolia), almond (Prunus amygdalus), sugar beets (Beta vulgaris), sugar cane (Saccharum spp.), oats, duckweed (Lemna), barley, tomatoes (Lycopersicon esculentum), lettuce (e.g., Lactuca 40 saliva), green beans (Phaseolus vulgaris), lima beans (Phaseolus limensis), peas (Lathyrus spp.), and members of the genus Cucumis such as cucumber (C. sativus), cantaloupe (C. cantalupensis), and musk melon (C. melo). Ornamentals such as azalea (Rhododendron spp.), hydrangea (Macrophylla hydrangea), hibiscus (Hibiscus 26 rosasanensis), roses (Rosa spp.), tulips (Tulipa spp.), daffodils (Narcissus spp.), petunias (Petunia hybrida), carnation (Dianthus caryophyllus), poinsettia (Euphorbia pulcherrima), and chrysanthemum are also included. Additional ornamentals useful for practicing a method of the invention include impatiens, Begonia, Pelargonium, Viola, Cyclamen, Verbena, Vinca, Tagetes, Primula, Saint Paulia, Agertum, Amaranthus, Antihirrhinum, Aquilegia, 5 Cineraria, Clover, Cosmo, Cowpea, Dahlia, Datura, Delphinium, Gerbera, Gladiolus, Gloxinia, Hippeastrum, Mesembryanthemum, Salpiglossos, and Zinnia. Conifers that may be employed in practicing the present invention include, for example, pines such as loblolly pine (Pinus taeda), slash pine (Pinus elliotii), ponderosa pine (Pinus ponderosa), lodgepole pine (Pinus contorta), and Monterey pine (Pinus radiata), Douglas-fir (Pseudotsuga menziesi); Western hemlock (Tsuga ultilane); Sitka spruce (Picea glauca); redwood (Sequoia sempervirens); true 10 firs such as silver fir (Abies amabilis) and balsam fir (Abies balsamea); and cedars such as Western red cedar (Thuja plicata) and Alaska yellow-cedar (Chamaecyparis noolkatensis). 1001381 Leguminous plants useful for practicing a method of the invention include beans and peas. Beans include guar, locust bean, fenugreek, soybean, garden beans, cowpea, mung bean, lima bean, fava bean, lentils, chickpea, etc. Legumes include, but are not limited to, Arachis, e.g., peanuts, Vicia, e.g., crown vetch, hairy vetch, adzuki 15 bean, mung bean, and chickpea, Lupinus, e.g., lupine, trifolium, Phaseolus, e.g., common bean and lima bean, Pisum, e.g., field bean, Melilotus, e.g., clover, Medicago, e.g., alfalfa, Lotus, e.g., trefoil, lens, e.g., lentil, and false indigo. Preferred forage and turf grass for use in the methods of the invention include alfalfa, orchard grass, tall fescue, perennial ryegrass, creeping bent grass, and redtop. Other plants useful in the invention include Acacia, aneth, artichoke, arugula, blackberry, canola, cilantro, clementines, escarole, eucalyptus, fennel, grapefruit, honey 20 dew, jicama, kiwifruit, lemon, lime, mushroom, nut, okra, orange, parsley, persimmon, plantain, pomegranate, poplar, radiata pine, radicchio, Southern pine, sweetgum, tangerine, triticale, vine, yams, apple, pear, quince, cherry, apricot, melon, hemp, buckwheat, grape, raspberry, chenopodium, blueberry, nectarine, peach, plum, strawberry, watermelon, eggplant, pepper, cauliflower, Brassica, e.g., broccoli, cabbage, ultilan sprouts, onion, carrot, leek, beet, broad bean, celery, radish, pumpkin, endive, gourd, garlic, snapbean, spinach, squash, turnip, ultilane, chicory, 25 groundnut and zucchini. Thus, the compositions contemplated herein include host organisms comprising any of the above nucleic acids. The host organism can be any chloroplast-containing organism. 100139] The term "plant" is used broadly herein to refer to a eukaryotic organism containing plastids, particularly chloroplasts, and includes any such organism at any stage of development, or to part of a plant, including a plant cutting, a plant cell, a plant cell culture, a plant organ, a plant seed, and a plantlet. A plant cell is the structural and 30 physiological unit of the plant, comprising a protoplast and a cell wall. A plant cell can be in the form of an isolated single cell or a cultured cell, or can be part of higher organized unit, for example, a plant tissue, plant organ, or plant. Thus, a plant cell can be a protoplast, a gamete producing cell, or a cell or collection of cells that can regenerate into a whole plant. As such, a seed, which comprises multiple plant cells and is capable of regenerating into a whole plant, is considered plant cell for purposes of this disclosure. A plant tissue or plant organ can be a 35 seed, protoplast, callus, or any other groups of plant cells that is organized into a structural or functional unit. Particularly useful parts of a plant include harvestable parts and parts useful for propagation of progeny plants. A harvestable part of a plant can be any useful part of a plant, for example, flowers, pollen, seedlings, tubers, leaves, stems, fruit, seeds, roots, and the like. A part of a plant useful for propagation includes, for example, seeds, fruits, cuttings, seedlings, tubers, rootstocks, and the like. 40 1001401 Eukaryotic host cells may be a fungal cell (e.g., S. cerevisiae, Sz. pombe or U maydis). Examples of prokaryotic host cells include E. coli and B. subtilis, cyanobacteria and photosynthetic bacteria (e.g. species of the genus Synechocystis or the genus Synechococcus or the genus Athrospira). Examples of non-vascular plants which 27 may be a host organism (or the source of target DNA) include bryophtyes, such as marchantiophytes or anthocerotophytes. In some instances, the organism is algae (e.g., macroalgae or microalgae, such as Chlamydomonas reinhardtii, Chorella vulgaris, Dunaliella salina. Haematococcus pluvalis, Scenedesmus ssp.). The algae can be unicellular or multicellular algae. In some instances the organism is a rhodophyte, chlorophyte, 5 heterokontophyte, tribophyte, glaucophyte, chlorarachniophyte, euglenoid, haptophyte, cryptomonad, dinoflagellum, or phytoplankton. In other instancesOne of skill in the art will recognize that these organisms are given merely as examples and other organisms may be substituted where appropriate positive and negative selectable markers are available. 1001411 Although the foregoing invention has been described in some detail by way of illustration and example for 10 purposes of clarity of understanding, it is readily apparent to those of ordinary skill in the art in light of the teachings of this invention that certain changes and modifications may be made thereto without departing from the spirit or scope of the appended claims. EXAMPLES Example 1: DNA Purification and Analysis 15 1001421 DNA is isolated and analyzed according to methods known in the art. 1001431 To prepare DNA from Chlamydomonas reinhardiii to use as a template for PCR, 106 algae cells (from agar plate or liquid culture) are suspended in 10 mM EDTA and heated to 95'C for 10 minutes, then cooled to near 23'C. The solution is added to the PCR mixture directly. [001441 To prepare purified chloroplast DNA from Chlamydomonas reinhardtii, 5x108 algae cells are collected 20 from liquid culture by centrifugation at 3000 x g for 10 min, washed once with water, centrifuged at 3000 x g for 10 min, resuspended in 10 mL of lysis solution (10 mM Tris pH=8.0, 10 mM EDTA, 150 mM NaCl, 2% SDS, 2% Sarkosyl, and 25 ug/mL Pronase (Roche)), and incubated at 37'C for 1 hour. The lysate is then gently extracted with phenol/chloroform followed by two chloroform washes. Total DNA is isolated by ethanol precipitation and resuspension in resuspension buffer (10 mM Tris pH=7.4, 1 mM EDTA, and 0.1 mg/mL RNase). Chloroplast DNA 25 can be purified by adding of denaturing solution (200 mM NaOH and 1% SDS (w/v)) is added to the resuspended DNA and inverted several times. Neutralizing solution (3.0 M potassium acetate, pH=5.5) is added, mixed, incubated on ice for 10 min and centrifuged at 15000 RPM for 30 min. The supernatant is decanted and applied to a QIAGEN-tip 500 and the DNA is isolated according to the QIAGEN Plasmid Maxi Kit. 1001451 An alternative method for preparing purified chloroplast DNA from Chlamydomonas reinhardtii involves 30 embedding algae cells or purified chloroplasts in low-melt agarose plugs to prevent shearing of the DNA. Chloroplasts are isolated by lysing whole cells in a nitrogen decompression chamber and separating intact cells and debris from the chloroplasts by percoll density gradient centrifugation. To lyse the cells and/or chloroplasts, the plugs are incubated at 55'C for 36 hours in lysis buffer (0.5 M EDTA, 1% Sarkosyl, 0.2 mg/mL proteinase K). When lysis is complete, plugs are washed 3 times with TE, and then stored in storage buffer (10 mM Tris pH=7.4, 1 35 mM EDTA). To release chloroplast DNA into solution, the plugs are washed 3 times with 30 mM NaCI, and melted at 65'C for 10 min. The melted plugs are shifted to 42'C and treated with D-agarase (New England Biolabs) for I hour. The solution of DNA can be used directly for downstream applications or ethanol precipitated to concentrate the sample. [00146] To prepare DNA from yeast to use as a template for PCR, 106 yeast cells (from agar plate or liquid culture) 40 are suspended in lysis buffer (6 mM KHPO4, pH=7.5, 6 mM NaCl, 3% glycerol, I U/mL zymolyase) and heated to 37'C for 30 min, 95'C for 10 minutes, then cooled to near 23'C. The solution is added to the PCR mixture directly. 28 {001471 To prepare plasmid DNA from yeast, desired clones are grown in selective liquid media (e.g., CSM-Trp) to saturation at 30'C. Cells are collected by centrifugation at 3006 x g for 10 minutes and resuspended in 150 uL of lysis buffer (1 M sorbitol, 0.1 M sodium citrate, 0.06 M EDTA pH=7.0, 100 mM beta-mercaptoethanol, and 2.5 mg/mL zymolyase). The solution is incubated for 1 hr at 37'C. 300 uL of denaturing solution (1% SDS and 0.2N 5 NaOH) is added and solution is incubated at 60'C for 15 min. 150 uL of neutralizing solution (3M potassium acetate, pH=4.8) is added and the solution is incubated on ice for 10 min. The solution is centrifuged at 14,000 RPM for 10 min and the supernatant is transferred to another tube. I mL of isopropanol is added, the mixture is gently mixed and centrifuged at 14,000 RPM for 10 min. The pellet is washed once with 1 mL of 70% ethanol and centrifuged at 14,000 RPM for 10 min. The DNA pellet is air-dried and resuspended in 60 uL of resuspension 10 buffer (10 mM Tris pH=7.4, 1 mM EDTA, and 0.1 mg/mL RNase). 1001481 To prepare plasmid DNA from bacteria, cells are grown to saturation at 37'C in LB containing the appropriate antibiotic (Kan or Amp). If the DNA of interest contains standard replication elements, cells are harvested by centrifugation. If the DNA of interest contains P1 replication elements, saturated cell cultures are diluted 1:20 in LB+Kan+IPTG and grown for 4 hours at 37'C, then harvested. The Plasmid Maxi kit (QIAGEN) is 15 used to prepare plasmid DNA from the cell pellets. 1001491 For illustrative purposes, and without limiting the invention to the specific methods described, DNA samples prepared from algae, yeast, or bacteria (in plugs or in solution) are analyzed by pulse-field gel electrophoresis (PFGE), or digested with the appropriate restriction endonuclease (e.g., SmaI) and analyzed by PFGE, conventional agarose gel electrophoresis, and/or Southern blot. Standard protocols useful for these purposes 20 are fully described in Gemmill et al. (in "Advances in Genome Biology", Vol. 1, "Unfolding The Genome," pp 217 251, edited by Ram S. Verma). 1001501 One of skill will appreciate that many other methods known in the art may be substituted in lieu of the ones specifically described or referenced. Example 2: Transformation methods 25 1001511 E. coli strains DHIOB or Genehog are made electrocompetent by growing the cells to an OD 00 of 0.7, then collected and washed twice with ice-cold 10% glycerol, flash frozen in a dry-ice ethanol bath and kept at -80*C. Total yeast or algae DNA is prepared and electroporated into E. coli by using, for example, a 0.1 cm cuvette at 1,800 V, 200 ohms and 25 mF in a Bio-Rad Gene Pulsar Electroporator. Cells are allowed to recover and clones are selected on agar growth media containing one or more antibiotics, such as kanamycin (50 gg/mL), ampicillin (100 30 tg/mL), gentamycin (50 pig/mL), tetracycline (51 pg/mL), or chloramphenicol (34 l.g/mL). 1001521 Yeast strains YPH857, YPH858 or AB 1380 may be transformed by the lithium acetate method as described in Sheistl & Geitz (Curr. Genet. 16:339 346, 1989) and Sherman et al., "Laboratory Course Manual Methods in Yeast Genetics" (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 1986) or a spheroplast method such as the one described by Sipiczki et al., Curr. Microbiol., 12(3):169-173 (1985). Yeast transformants are selected and 35 screened on agar media lacking amino and/or nucleic acids, such as tryptophan, leucine, or uracil. Standard methods for yeast growth and phenotype testing are employed as described by Sherman et al., supra. 1001531 Algae strains cscl37c (mt+), W1.1, or Wl-1 may be transformed by particle bombardment. Cells are grown to late log phase (approximately 7 days) in TAP medium in the presence or absence of 0.5 mM 5 fluorodeoxyuridine (Gorman and Levine, Proc. Natl. Acad. Sci., USA 54:1665-1669, 1965, which is incorporated 40 herein by reference) at 23"C under constant illumination of 450 Lux on a rotary shaker set at 100 rpm. Fifty ml of cells are harvested by centrifugation at 4,000xg at 23*C for 5 min. The supernatant is decanted and cells resuspended 29 in 4 ml TAP medium for subsequent chloroplast transformation by particle bombardment (Cohen et al., supra, 1998). [00154] One of skill will appreciate that many other transformation methods known in the art may be substituted in lieu of the ones specifically described or referenced herein. 5 Example 3: A hybrid ga-filling vector to capture a chloroolast Renome 100155) In this example, a system is established using a hybrid gap-filling vector to capture chloroplast DNA (FIG. 1). The hybrid gap filling vector backbone contains yeast elements that allow it to function as a yeast artificial plasmid (YAP) and bacterial elements that allow it to function as a plasmid artificial chromosome (PAC). The yeast elements include a yeast selection marker sequence (e.g. TRPJ or LEU2), a yeast centromere sequence (CEN), and a 10 yeast autonomously replicating nucleotide sequence (ARS). Bacterial elements include a P1 or bacterial origin of replication sequence and a bacterial selection maker sequence (e.g. Kan'). [001561 To manipulate the hybrid gap-filling vector, the vector pDOCI (SEQ ID NO. 1) was generated. Portions of pTRP-AU (FIG. 2) were amplified using PCR primer pairs that anneal to sites surrounding the region encompassing TEL, ADE2, and URA3. One pair amplifies a region within the yeast elements (SEQ ID NOs. 21 and 22) and the 15 other pair amplifies a region within the bacterial elements (SEQ ID NOs. 23 and 24). The PCR products were assembled into a single DNA fragment by PCR assembly using a single primer pair (SEQ ID NOs. 21 and 24). The assembled product was digested with NoiI and ligated to NotI-digested pUC-SE (SEQ ID NO. 2) to form pDOCI (SEQ ID NO. 1). [001571 To adapt the hybrid gap-filling vector to capture chloroplast DNA, the vector pDOCI-10 (SEQ ID NO. 3) 20 was generated. Portions of the C. reinhardtii chloroplast genome were PCR amplified using two primer pairs specific for two adjacent regions (SEQ ID NOs. 25 and 26 and SEQ ID NOs. 27 and 28) near the psbD locus. (Fig. 3; indicated by number 10 surrounded by a box). Each PCR product was digested with NodI and I-Scel and ligated to pDOCI (SEQ ID NO. 1) that was digested with I-Scel to form pDOCI-10 (SEQ ID NO. 3). 100158] To adapt pDOCI-10 to confer antibiotic resistance in algae, a selection marker was cloned. pSE-3HB-Kan 25 (SEQ ID NO. 4), which contains a kanamycin resistance encoding gene from bacteria, which is regulated by the 5' UTR and promoter sequence for the atpA gene from C. reinhardiii and the 3' UTR sequence for the rbcL gene from from C. reinhardtii, was digested with SnaBI , which liberated the kanamycin resistance cassette. The cassett was ligated to Snafl-digested pDOCI-10 to form pDOCI- I 0-Kan (SEQ ID NO. 5). [00159) The hybrid gap-filling for capturing chloroplast DNA, pTRP-10-Kan (SEQ ID NO. 6), was constructed 30 using recombination in yeast (Fig. 2). Briefly, pDOCI-1 0-Kan was digested with Pac and AscI to liberate the cassette that introduces chloroplast genome-specific elements into the hybrid gap-filling vector. This cassette was transformed along with pTRP-AU into the yeast strain YPH858 using the lithium acetate method. Homologous recombination takes place in vivo in the transformed yeast cells. Transformants that correctly integrated with cassette were isolated based on growth on CSM-Trp agar media containing 5-fluorooratic acid (5-FOA) and by red 35 color. 5-FOA selects for clones that lack a functional URA3 gene and the red color results when the ADE2 gene is eliminated. Plasmid DNA was isolated from yeast clones that were grown in CSM-Trp liquid media and transformed into E. coil (DHlOB). To generate large amounts of pTRP-10-Kan, DH1OB cells harboring pTRP-10 Kan were grown to saturation at 37'C in LB+Kan (50 ug/mL), and then diluted 1:20 in LB+Kan+IPTG and grown for 4 hours at 37'C. DNA was prepared from the bacterial culture using the Plasmid Maxi kit (QIAGEN). 40 [00160] The composition of the vector was verified by DNA sequencing of the entire plasmid. Example 4: Vectors to stabilize and/or modify chloroplast genome DNA in an exogenous host 30 [001611 Often, large pieces of heterologous DNA are instable in host organisms such as yeast or bacteria. This may be due to multiple factors, including, but not limited to, the presence of toxic gene products or codon bias and/or lack of selective pressure. Therefore, the target DNA within the shuttle vector may be altered within yeast or bacteria. For example, certain portions of a target DNA sequence (e.g., coding regions or promoters) may be deleted 5 or moved by recombination within the host organism. In a similar way, when a shuttle vector carrying the target DNA is transferred back to the organism (or a closely related species) that donated the target DNA, the target DNA can become unstable. [001621 Such sites of instability and susceptible sequences are readily discovered by determining which pieces of genomic DNA elude capture or disappear over time. These sites can be detected by comparing initially isolated gap 10 filling target DNA plasmids with plasmids isolated from transformed strains which have been sequentially passaged in the laboratory under conditions which select for the presence of the plasmid-encoded selectable marker (e.g., TRP 1) or by comparison with native target DNA (e.g., C. reinhardtii chloroplast DNA). Such comparison may be performed, for example, by restriction fragment length polymorphism (RFLP) analysis or direct sequencing. One of skill in the art will recognize that there are multiple other protocols and methods to determine such differences. 15 1001631 Once identified, such sites and sequences can be forced to remain through selection of markers. Fig. 4 shows examples of arrangements of selectable markers in the multiple cloning site of a vector. [001641 To generate a stabilization vector containing yeast and algae stability elements, the region of DNA in pTrp AU encompassing the ADE2 and URA3 genes was liberated by digestion with Sfil followed by gel purification of the desired fragment. The fragment was treated with Klenow fragment to create blunt ends and ligated to PmlI 20 degested pSE-3HB-Strep (SEQ ID NO. 7), creating pSE-3HB-Strep-AU (Fig. 4B). pSE-3HB-Strep is a vector that targets a streptomycin resistance encoding gene from bacteria, which is regulated by the 5' UTR and promoter sequence for the atpA gene from C. reinhardtii and the 3' UTR sequence for the rbcL gene from C. reinhardtii, to the 3HB locus of the C. reinhardtii chloroplast genome. 1001651 To generate vectors that target other regions of the chloroplast genome, 800-1000 bp regions were 25 amplified using PCR primer pairs that anneal to 5' and 3' regions flanking the sites indicated by numbered circles in Fig. 3 (site 1-5', SEQ ID NOs. 29 and 30; site 1-3', SEQ ID NOs. 31 and 32; site 3-5', SEQ ID NOs. 33 and 34; site 3-3', SEQ ID NOs. 35 and 36; site 4-5', SEQ ID NOs. 37 and 38; site 4-3', SEQ ID NOs. 39 and 40; site 5-5', SEQ ID NOs. 41 and 42; site 5-3', SEQ ID NOs. 43 and 44; and site 7-5', SEQ ID NOs. 45 and 46; site 7-3', SEQ ID NOs. 47 and 48). Each pair of PCR products was digested with NotI and I-Scel, mixed, and ligated to Nod-digested 30 pUC-SE (SEQ ID NO. 2), producing plasmids pUCI, pUC3, pUC4, pUC5 and pUC7 (named for their position of integration). [001661 Pairs of yeast selection markers were constructed so that multiple stabilization sites could be employed simultaneously (FIG. 4B). Each marker pair contains the URA3 gene (SEQ ID NO. 8), which was PCR amplified from pRS416. Each marker pair also contains the LEU2 gene (SEQ ID NO. 9) amplified from pRS415, the HIS3 35 gene (SEQ ID NO. 10) amplified from pRS413, the ADE2 gene (SEQ ID NO. 11) amplified from pTrp-AU, the LYS2 gene (SEQ ID NO. 12) amplified from S. cerevisiae genomic DNA, or the kanMX6 gene (SEQ ID NO. 13) from pFA6a-kanMX6, which confers resistance to the antifungal agent G418. The primers used for the URA3 gene add the Xmal restriction site to the 5' end (SEQ ID NO. 49) and Sall and SacII to the 3' end (SEQ ID NO. 50). The primers used for the LEU2, HJS3, ADE2, LYS2, and G418' genes add the Xmal restriction site to the 5' end (SEQ I1) 40 NO. 51 for LEU2, SEQ ID NO. 52 for HIS3, SEQ ID NO. 53 for ADE2, SEQ ID NO. 54 for LYS2, and SEQ ID NO. 55 for G418') and Sall, Fsel, and Spel sites to the 3' end (SEQ ID NO. 56 for LEU2, SEQ ID NO. 57 for HIS3, SEQ ID NO. 58 for ADE2, SEQ ID NO. 59 for LYS2, and SEQ ID NO. 60 for G418'). Each PCR product was digested 31 with XmaI and Sall, mixed pairwise, and ligated into the desired integration vector, resulting in pUCl-URA3/ADE2, pUC3-URA3/LEU2, pUC4-URA3/HIS3, pUC5-URA3/ADE2, and pUC7-URA3/LYS2. URA3 is used in each case because it allows for positive and negative selection. Marker pairs can be introduced based on selection for either gene (in the case of a single modification), the non-URA3 gene in the case of two or more modification. Then the 5 markers can be removed by introducing DNA with terminal sequences homologous to those surrounding the marker pairs and selecting for growth on minimal media containing 5-FOA. 1001671 To promote sequence stability in bacteria, antibiotic resistance markers were cloned into the yeast selection marker pairs. The bacterial stability markers include, but are not limited to, the ampicillin resistance gene (Amp', SEQ ID NO. 14) amplified from pET-21 a, the tetracycline resistance gene (Tee, SEQ ID NO. 15) amplified from 10 pBR322, the chloramphenicol resistance gene (Camr, SEQ ID NO. 16) amplified from pETcoco-1, and the gentamycin resistance gene (Gent, SEQ ID NO. 17) amplified from pJQ200. For each gene, primer pairs (SEQ ID NOs. 61 and 62 for Ampr, SEQ ID NOs. 63 and 64 for Tet, SEQ ID NOs. 65 and 66 for Cam', and SEQ ID NOs. 67 and 68 for Gent') that add Xmal sites to both the 5' and 3' ends were used to PCR amplify the antibiotic resistance fragment. Each PCR product was digested with XmaI and ligated into Xmal-digested pUCl-URA3/ADE2, pUC3 15 URA3/LEU2, pUC4-URA3/HIS3, pUC5-URA3/ADE2, and pUC7-URA3/LYS2. Fig. 4C shows the arrangement of the yeast and bacterial stability markers. Example 5: Introduction of hybrid and stabilization vectors into a chloroolast genome [001681 To generate a C. reinhardtii chloroplast genome that contains a hybrid vector with or without a stabilization vector, pTRP-10-Kan and pSE-3HB-Strep-AU were transformed into algae cells. pTRP-10-Kan was digested with 20 Noti to linearize the vector such that the chloroplast targeting elements on are on each end (FIG. 5). pSE-3HB Strep-AU was transformed as circular DNA. 1001691 For these experiments, all transformations are carried out on C. reinhardtii strain 137c (mt+). Cells are grown to late log phase (approximately 7 days) in the presence of 0.5 mM 5-fluorodeoxyuridine in TAP medium (Gorman and Levine, Proc. Nail. Acad. Sci., USA 54:1665-1669, 1965, which is incorporated herein by reference) at 25 23*C under constant illumination of 450 Lux on a rotary shaker set at 100 rpm. Fifty ml of cells are harvested by centrifugation at 4,000xg at 23*C for 5 min. The supernatant is decanted and cells resuspended in 4 ml TAP medium for subsequent chloroplast transformation by particle bombardment (Cohen et al., supra, 1998). All transformations are carried out under kanamycin selection (100 pg/ml) in which resistance is conferred by the gene encoded by the segment in Fig. 5 labeled "Kan." (Chlamydomonas Stock Center, Duke University). 30 100170) PCR is used to identify transformed strains. For PCR analysis, 106 algae cells (from agar plate or liquid culture) are suspended in 10 mM EDTA and heated to 95*C for 10 minutes, then cooled to near 23'C. A PCR cocktail consisting of reaction buffer, MgCt2, dNTPs, PCR primer pair(s), DNA polymerase, and water is prepared. Algae lysate in EDTA is added to provide a template for the reaction. The magnesium concentration is varied to compensate for amount and concentration of algae lysate and EDTA added. Annealing temperature gradients are 35 employed to determine optimal annealing temperature for specific primer pairs. [001711 Fig. 6 shows examples of isolated algae strains that contain pTrp-10-Kan with and without pSE-3HB Strep-AU. Strains that integrated pTRP-10-Kan were identified using any of a set of primer pairs that amplify different regions in the vector backbone (Lane 1, SEQ ID NOs. 69 and 70; Lane 2, SEQ ID NOs. 71 and 72; Lane 3, SEQ ID NOs. 73 and 74; Lane 4, SEQ ID NOs. 75 and 76; Lane 5, SEQ ID NOs. 77 and 78; Lane 6, SEQ ID NOs. 40 79 and 80; Lane 7, SEQ ID NOs. 81 and 82; Lane 8, SEQ ID NOs. 83 and 84; and Lane 9, SEQ ID NOs. 85 and 86) or primer pairs that span the junction between pTrp- 1 0-Kan DNA and chloroplast genome DNA (Lane 10, SEQ ID NOs. 87 and 88; Lane 11, SEQ ID NOs. 89 and 90). To identify strains that have integrated pSE-3HB-Strep-AU, 32 primer pairs (SEQ ID NOs. 91 and 92) were used which amplify regions within the ADE2 gene. Desired clones are those that yield PCR products of expected size. Example 6: Capture of a chloroplast genome into an exogenous host [001721 In this example, C. reinhardtii chloroplast DNA is isolated using standard techniques as described above. 5 C. reinhardti chloroplast DNA containing the pTRP-10-Kan vector with or without pSE-3HB-Strep-AU was used to transform bacteria. Electrocompetent E. coli strains DH 101B or Genehog were transformed and selected on LB agar growth medium with kanamycin (50 mg/I). DNA from individual clones was isolated by growing the cells to saturation at 37'C in LB liquid growth medium with kanamycin (50 mg/I). Saturated cell cultures are diluted 1:20 in LB+Kan+IPTG and grown at 37'C for 4 hours. The Plasmid Maxi kit (QIAGEN) is used to prepare plasmid DNA 10 from the isolated clones. 1001731 Fig. 7A shows the restriction analysis of two types of clones obtained from bacterial transformation (Clone I and Clone 2) compared to the parent hybrid vector (Clone C). Clones 1 and 2 are composed of >100 kb of DNA while the parent hybrid vector is composed of only 23 kb of DNA, demonstrating that large portions of DNA were indeed captured.by the hybrid vector. Restriction mapping indicates that Clone I comprises approximately half of 15 the chloroplast genome (FIG. 8A). DNA sequencing of the regions flanking the hybrid vector indicate that a recombination event occurred between the 3' UTR of the C. reinhardtii kanamycin resistance cassette (which is the 3' UTR from the rbcL gene in C. reinhardtii) and the 3' UTR of the C. reinhardtii streptomycin resistance cassette (which is the 3' UTR from the rbcL gene in C. reinhardt). Clone I retained the C. reinhardtii kanamycin resistance cassette, but lost the C. reinhardtii streptomycin resistance cassette. Restriction mapping indicates that 20 Clone 2 also comprises approximately half of the chloroplast genome (FIG. 8B). However, DNA sequencing of the regions flanking the hybrid vector indicate that a different recombination event occurred to give rise to Clone 2. The 5' UTR of the C. reinhardtii kanamycin resistance cassette (which is the 5' UTR from the alpA gene in C. reinhardlii) recombined with the 5' UTR of the atpA gene in the C. reinhardtii chloroplast genome and the inverted repeat B (IR-B in FIG. 8) recombined with inverted repeat A (IR-A in FIG. 8). Clone 2 lost both the C. reinhardtii 25 kanamycin resistance cassette and the C. reinhardtii streptomycin resistance cassette. 1001741 To demonstrate that the hybrid vector would support stable replication in yeast, Clone I was transformed into the yeast strain AB 1380 by the lithium acetate method and transfomants were selected on CSM-Trp agar media. Transformants were PCR screened using a primer pair (SEQ ID NOs 97 and 98) that amplifies a region with the chloroplast genome DNA of Clone 1. Desired clones are those that give rise to a PCR product of expected size. 30 Individual PCR-positive clones were streaked to generate multiple clones per isolate. DNA was prepared from the isolated yeast clones and transformed into bacteria. Bacteria were PCR screened using a primer pair (SEQ ID NOs 97 and 98) that amplifies a region within the chloroplast genome DNA of Clone 1. Desired clones are those that give PCR products of expected size. DNA was prepared from the isolated bacterial clones and analyzed by restriction digest with EcoRI. Fig. 7B shows that all isolated clones have restriction maps that are identical to the 35 originally isolated Clone 1. Thus, the hybrid vector supports stable replication in yeast. 1001751 To determine if the captured DNA is indeed chloroplast genome DNA, a Southern blot was performed. Clones I and 2 prepared from bacteria were digested with EcoRI, separated by gel electrophoresis, transferred to a membrane, and probed with radioactive HindIBl-digested total DNA from C. reinhardtii. Fig. 7C shows that the DNA in Clones I and 2 give rise to a signal, thus indicating that the captured DNA is chloroplast genome DNA. 40 Example 7: Reintroduction of chloroplast genome DNA into aleae 1001761 For these experiments, all transformations are carried out on either C. reinhardtii strain WI. 1, which expresses SAA from the endogenous psbA loci, or WI-1, which expresses LuxAB from the endogenous psbA loci. 33 Both W 1.1 and WI-I are resistant to spectinomycin by virtue of transformation of p 2 2 8 , which introduces a mutation in the 16S rRNA. Cells are grown to late log phase (approximately 7 days) in TAP medium (Gorman and Levine, Proc. NaIl. Acad. Sci., USA 54:1665-1669, 1965, which is incorporated herein by reference) at 23*C under constant illumination of 450 Lux on a rotary shaker set at 100 rpm. Fifty ml of cells are harvested by centrifugation 5 at 4,000xg at 23*C for 5 min. The supernatant is decanted and cells resuspended in 4 ml HSM medium for subsequent chloroplast transformation by particle bombardment (Cohen et al., supra, 1998). All transformations are carried out on HSM agar. 1001771 Transformants were selected by growth on HSM, indicating that function of the psbA gene locus was restored. Clones were also subsequently checked for growth on TAP, TAP with spectinomycin (150 pg/ml),TAP 10 with kanamycin (100 pg/ml), HSM, and HSM with kanamycin (100 pg/ml). Fig. 9 shows that Wi-I transformed with Clone I is able to grow on all media types. Fig. 9 also shows that W1-I transformed with Clone 2 is unable to grow on media containing kanamycin, which is expected since Clone 2 does not contain a Kan resistance marker. Example 8: Modification of Chloronlasts for Producing biornass-derading enzymes 1001781 To modify captured chloroplast genome DNA for production of a xylanase, an algae expression cassette 15 was cloned into the yeast marker vectors described in EXAMPLE 4. Briefly, a C. reinhardtii chloroplast expression vector was digested with Spel to liberate a fragment of DNA (SEQ ID NO. 18) with xylanase from T. reesei regulated by the 5' UTR for the psbD gene from C. reinhardtii and the 3' UTR for the psbA gene from C. reinhardtii. The fragment was treated with Klenow fragment to create blunt ends and cloned into SmaI (Xmal) site between the yeast markers in pUCl-URA3/ADE2, pUC3-URA3/LEU2, and pUC4-URA3/HIS3. Fig. 10A shows the 20 arrangement of the various elements in the new vectors. 1001791 Chloroplast genome DNA was modified by transforming the modification vector into yeast that harbored the captured DNA. For transformation, all modification vectors were linearized by digestion with Nod. Yeast harboring Clone I (from EXAMPLE 6) were grown to saturation in CSM-Trp media at 30'C, then diluted 1:20 in YPAD and grown for 4 hours at 30'C. Yeast were transformed using the lithium-acetate method and transformants 25 were selected for by growth on CSM-Ura media. Transformants were propagated on CSM-Trp-Ura media and then PCR screened using primers specific for the xylanase expression cassette (SEQ ID NOs 95 and 96) and a region within the chloroplast genome DNA (SEQ ID NOs 97 and 98). Desired clones are those that give PCR products of expected size for both reactions. DNA was prepared from the isolated yeast clones and transformed into bacteria. Bacteria were PCR screened using primers specific for the xylanase expression cassette (SEQ ID NOs 95 and 96) 30 and a region within the chloroplast genome DNA (SEQ ID NOs 97 and 98). Desired clones are those that give PCR products of expected size for both reactions. DNA was prepared from the isolated bacterial clones and analyzed by restriction digest with EcoRI. Fig. 1OB shows that clones were isolated that have restriction maps that are consistent with integration of the xylanase expression cassette in the desired position. [001801 For these experiments, all transformations are carried out on either C. reinhardtii strain W1.1, which 35 expresses SAA from the endogenous psbA loci, or WI-1, which expresses LuxAB from the endogenous psbA loci. Both W1.1 and WI-I are resistant to spectinomycin by virtue of transformation of p228, which introduces a mutation in the 16S rRNA. Cells are grown to late log phase (approximately 7 days) in TAP medium (Gorman and Levine, Proc. Nail. Acad. Sci., USA 54:1665-1669, 1965, which is incorporated herein by reference) at 23*C under constant illumination of 450 Lux on a rotary shaker set at 100 rpm. Fifty ml of cells are harvested by centrifugation 40 at 4,000xg at 2 3 *C for 5 min. The supernatant is decanted and cells resuspended in 4 ml HSM medium for subsequent chloroplast transformation by particle bombardment (Cohen et al., supra, 1998). All transformations are carried out on HSM agar. 34 1001811 Transformants were identified by growth on HSM, indicating that function of the psbA gene locus was restored. PCR is used to identify transformants that also contain the endoxylanase expression cassette. For PCR analysis, 106 algae cells (from agar plate or liquid culture) are suspended in 10 mM EDTA and heated to 95'C for 10 minutes, then cooled to near 23'C. A PCR cocktail consisting of reaction buffer, MgCl2, dNTPs, PCR primer 5 pair(s) (Table 2), DNA polymerase, and water is prepared. Algae lysate in EDTA is added to provide a template for the reaction. The magnesium concentration is varied to compensate for amount and concentration of algae lysate and EDTA added. Annealing temperature gradients are employed to determine optimal annealing temperature for specific primer pairs. [001821 To identify strains that have the endoxylanase expression cassette, a primer pair (SEQ ID NOs 95 and 96) 10 was used which amplifies a region within the endoxylanase expression cassette. Desired clones are those that yield PCR products of expected size. Fig. I1A shows that multiple algae strains were obtained that contain the endoxylanase expression cassette (PCR products indicated with asterisk). (001831 To determine whether functional xylanase is expressed, enzyme activity is examined. Patches of cells (approximately 2mg per patch) from TAP agar plates containing kanamycin (100 pg/mL)were resuspended in 50 ul 15 of 100mM sodium acetate pH 4.8 in a round bottom 96 well plate (Corning). Resuspended cells were lysed by addition of 20ul of BugBuster Protein Extraction Reagent (Novagen) and shaken for five minutes at room temperature. Cell lysate (50ul) was transferred to a black 96 well plate, and the chlorophyll fluorescence of the resulting wells was measured in a SpectraMax M2 microplate reader (Molecular Devices), with an excitation wavelength of 440 n and an emission wavelength of 740 nm, with a 695 rn cutoff filter. The measured 20 chlorophyll signal in RFUs (relative fluorescence units) was used to normalize the xylanase activity signal to the amount of biomass added to the reaction. [001841 After measurement of the chlorophyll fluorescence, xylanase substrate was added. EnzCheck Ultra Xylanase substrate (Invitrogen) was dissolved at a concentration of 50ug/mi in 100mM sodium acetate pH 4.8, and 50ul of substrate was added to each well of the microplate. The fluorescent signal was measured in a SpectraMax 25 M2 microplate reader (Molecular Devices), with an excitation wavelength of 360 nm and an emission wavelength of 460 nm, without a cutoff filter and with the plate chamber set to 42 degrees Celsius. The fluorescence signal was measured for 15 minutes, and the enzyme velocity was calculated with Sofbnax Pro v5.2 (Molecular Devices). Enzyme velocities were recorded as RFU/minute. Enzyme specific activities were calculated as milliRFU per minute per RFU of chlorophyll fluorescence. Fig. 11B shows that multiple algae strains containing the xylanase 30 expression cassette at site 1, 3, and 4 (see FIGS. 3 and 10) were obtained that produce functional xylanase enzyme. Example 9: Modification of a chloroplast genome to produce ternenes [001851 To modify captured chloroplast genome DNA for production of an FPP synthase, an algae expression cassette was cloned into the yeast marker vectors described in EXAMPLE 4. Briefly, a C. reinhardtii chloroplast expression vector was digested with Spel to liberate a fragment of DNA (SEQ ID NO. 19) with FPP synthase from 35 G. gallus regulated by the 5' UTR for the psbD gene from C. reinhardtii and the 3' UTR for the psbA gene from C. reinhardtii. The fragment was treated with Klenow fragment to create blunt ends and cloned into Smal (Xmal) site between the yeast markers in pUCl-URA3/ADE2, pUC3-URA3/LEU2, and pUC4-URA3/HIS3. Fig. 10A shows the arrangement of the various elements in the new vectors. (001861 Chloroplast genome DNA was modified by transforming the modification vector into yeast that harbored 40 the captured DNA. For transformation, all modification vectors were linearized by digestion with NotI. Yeast harboring Clone I (from EXAMPLE 6) were grown to saturation in CSM-Trp media at 30'C, then diluted 1:20 in YPAD and grown for 4 hours at 30'C. Yeast were transformed using the lithium-acetate method and transformants 35 were selected for by growth on CSM-Ura media. Transformants were propagated on CSM-Trp-Ura media and then PCR screened using primers specific for the FPP synthase expression cassette (SEQ ID NOs 95 and 99) and a region within the chloroplast genome DNA (SEQ ID NOs 97 and 98). Desired clones are those that gave PCR products of expected size for both reactions. DNA was prepared from the isolated yeast clones and transformed into bacteria. 5 Bacteria were PCR screened using primers specific for the FPP synthase expression cassette (SEQ ID NOs 95 and 99) and a region within the chloroplast genome DNA (SEQ IDs 97 and 98). Desired clones are those that gave PCR products of expected size for both reactions. DNA was prepared from the isolated bacterial clones and analyzed by restriction digest. Fig. 10C shows that clones were isolated that have restriction maps that are consistent with integration of the FPP synthase expression cassette in the desired position. 10 100 187] For these experiments, all transformations are carried out on either C. reinhardtii strain W1. 1, which expresses SAA from the endogenous psbA loci, or WI -1, which expresses LuxAB from the endogenous psbA loci. Both W 1.1 and WI-I are resistant to spectinomycin by virtue of transformation of p228, which introduces a mutation in the 16S rRNA. Cells are grown to late log phase (approximately 7 days) in TAP medium (Gorman and Levine, Proc. Nail. Acad. Sci., USA 54:1665-1669, 1965, which is incorporated herein by reference) at 23*C under 15 constant illumination of 450 Lux on a rotary shaker set at 100 rpm. Fifty ml of cells are harvested by centrifugation at 4,000xg at 23"C for 5 min. The supernatant is decanted and cells resuspended in 4 ml HSM medium for subsequent chloroplast transformation by particle bombardment (Cohen et al., supra, 1998). All transformations are carried out on HSM agar. 100188] Transformants were identified by growth on HSM, indicating that function of the psbA gene locus was 20 restored. PCR is used to identify transformants that also contain the FPP synthase expression cassette. For PCR analysis, 10' algae cells (from agar plate or liquid culture) are suspended in 10 mM EDTA and heated to 95'C for 10 minutes, then cooled to near 23'C. A PCR cocktail consisting of reaction buffer, MgCI2, dNTPs, PCR primer pair(s), DNA polymerase, and water is prepared. Algae lysate in EDTA is added to provide a template for the reaction. The magnesium concentration is varied to compensate for amount and concentration of algae lysate and 25 EDTA added. Annealing temperature gradients are employed to determine optimal annealing temperature for specific primer pairs. 1001891 To identify strains that have the FPP synthase expression cassette, a primer pair was used which amplifies a region within the endoxylanase expression cassette. Desired clones are those that yield PCR products of expected size. Multiple algae strains were obtained that contain the FPP synthase expression cassette at (FIG. 12, PCR 30 products indicated with asterisk). Example 11: Gap-filling a partial chloroplast genome 100190] In this example, a system is established using a hybrid gap-filling vector to capture chloroplast DNA by recombination in yeast (FIG. 13). To adapt the hybrid gap-filling vector to capture chloroplast DNA, the vector pDOCI-B5Al0 (SEQ ID NO. 20) was generated. Portions of the C. reinhardili chloroplast genome were PCR 35 amplified using two primer pairs specific for a region adjacent the psbD gene (A10, SEQ ID NOs. 27 and 28) and a region adjacent the near the psbH gene (B5, SEQ ID NOs. 787 and 788). Each PCR product was digested with NotI and I-Scel and ligated to pDOCI (SEQ ID NO. 1) that was digested with I-SceI to form pDOCI-B5Al0 (SEQ ID NO. 20). [00191] The hybrid gap-filling for capturing chloroplast DNA, pTRP-B5Al0, was constructed using recombination 40 in yeast. Briefly, pDOCI-B5A10 (SEQ ID NO. 20) was digested with Pacl and AscI to liberate the cassette that introduces chloroplast genome-specific elements into the hybrid gap-filling vector. This cassette was transformed along with pTRP-AU into the yeast strain YPH858 using the lithium acetate method. Homologous recombination 36 takes place in vivo in the transformed yeast cells. Transformants that correctly integrated with cassette were isolated based on growth on CSM-Trp agar media containing 5-fluorooratic acid (5-FOA) and by red color. 5-FOA selects for clones that lack a functional URA3 gene and the red color results when the ADE2 gene is eliminated. Plasmid DNA was isolated from yeast clones that were grown in CSM-Trp liquid media and transformed into E. coli 5 (DH1OB). To generate large amounts of pTRP-10-Kan, DH1OB cells harboring pTRP-10-Kan were grown to saturation at 37C in LB+Kan (50 ug/mL), and then diluted 1:20 in LB+Kan+IPTG and grown for 4 hours at 37C. DNA was prepared from the bacterial culture using the Plasmid Maxi kit (QIAGEN). 1001921 The desired genomic DNA is captured by using the sites in a gap filling vector that have high homology to the regions of the target genomic DNA (indicated by A10 and B5 in FIG. 13). Linearized gap filling vector DNA 10 and chloroplast genome DNA are used to transform S. cerevisiae using the lithium acetate or spheroplast methods as described in EXAMPLE 2. Homologous recombination takes place in vivo in the transformed yeast cells. Once the target DNA is captured by the vector via homologous recombination, the DNA can be stably replicated in both yeast and bacterial systems. As indicated in FIG. 1, the gap filling vector contains the selectable marker TRP1. Therefore, the yeast cells are plated on tryptophan-free medium to screen for positive transformants. Transformed 15 strains growing on the tryptophan-free medium are screened for the presence of target DNA inserts in the gap filling plasmid. Such screening can be by any known method in the art, for example, restriction enzyme digestion, PCR and/or gel electrophoresis. 1001931 Yeast transformants screened by PCR using primers specific for chloroplast genome DNA (SEQ IDs 89 and 90). Desired clones are those that give a PCR product of expected size. DNA is prepared from the isolated 20 yeast clones and transformed into bacteria. Bacterial transformants are PCR screened using primers specific for chloroplast genome DNA (SEQ IDs 89 and 90). Desired clones are those that give a PCR product of expected size. DNA is prepared from the isolated bacterial clones and analyzed by restriction digest with EcoRI. Example 12: Gap-filling a complete chloroplast genome 1001941 In this example, a system is established using a hybrid gap-filling vector to capture a complete chloroplast 25 genome by recombination in yeast (FIG. 14). The hybrid gap-filling for capturing chloroplast DNA, pTRP-10, was constructed using recombination in yeast. Briefly, pDOCI-10 (described in EXAMPLE 3, SEQ ID NO. 3) was digested with PacI and AscI to liberate the cassette that introduces chloroplast genome-specific elements into the hybrid gap-filling vector. This cassette was transformed along with pTRP-AU into the yeast strain YPH858 using the lithium acetate method. Homologous recombination takes place in vivo in the transformed yeast cells. 30 Transformants that correctly integrated with cassette were isolated based on growth on CSM-Trp agar media containing 5-fluorooratic acid (5-FOA) and by red color. 5-FOA selects for clones that lack a functional URA3 gene and the red color results when the ADE2 gene is eliminated, Plasmid DNA was isolated from yeast clones that were grown in CSM-Trp liquid media and transformed into E. coli (DH OB). To generate large amounts of pTRP-10 Kan, DH1OB cells harboring pTRP-10-Kan were grown to saturation at 37'C in LB+Kan (50 ug/mL), and then 35 diluted 1:20 in LB+Kan+IPTG and grown for 4 hours at 37'C. DNA was prepared from the bacterial culture using the Plasmid Maxi kit (QIAGEN). 100195] The desired genomic DNA is captured by using the sites in a gap filling vector that have high homology to the adjacent regions of the target genomic DNA (indicated by A10 and B10 in FIG. 14). Linearized gap filling vector DNA and chloroplast genome DNA are used to transform S. cerevisiae using the lithium acetate or 40 spheroplast methods as described in EXAMPLE 2. Homologous recombination takes place in vivo in the transformed yeast cells. Once the target DNA is captured by the vector via homologous recombination, the DNA can be stably replicated in both yeast and bacterial systems. As indicated in FIG. 1, the gap filling vector contains 37 the selectable marker TRPI. Therefore, the yeast cells are plated on tryptophan-free medium to screen for positive transformants. Transformed strains growing on the tryptophan-free medium are screened for the presence of target DNA inserts in the gap filling plasmid. Such screening can be by any known method in the art, for example, restriction enzyme digestion, PCR and/or gel electrophoresis. 5 1001961 Yeast transformants screened by PCR using primers specific for chloroplast genome DNA (SEQ IDs 89 and 90). Desired clones are those that give a PCR product of expected size. DNA is prepared from the isolated yeast clones and transformed into bacteria. Bacterial transformants are PCR screened using primers specific for chloroplast genome DNA (SEQ IDs 89 and 90). Desired clones are those that give a PCR product of expected size. DNA is prepared from the isolated bacterial clones and analyzed by restriction digest with EcoRI. 10 [001971 Example 13: Reassembly on a complete chloroplast genome 1001981 In some instances, the chloroplast genome may be divided into different plasmids, which, in total, comprise the entirety of the genome (FIG. 15). In such instances, capture of smaller portions may facilitate rapid and complex modification of multiple positions within the genome. The chloroplast genome fragments are then combined to reform a complete (and possibly modified) chloroplast genome. 15 1001991 In this example, the chloroplast is divided between two vectors. One vector consists of the chloroplast genome DNA from B5 to A10, or nucleotide 76,400 to 176,500 (according to the sequence available from NCBI, NC_005353), respectively. This vector may be obtained by the procedure describe in EXAMPLE 11. The other vector consists of the chloroplast genome DNA from B 10 to A5, or nucleotide 176,500 to 76,400 (according to the sequence available from NCBI, NC_005353), respectively. This vector is the same one as Clone I described in may 20 be obtained by the procedure described in EXAMPLE 6. Both vector share the hybrid vector sequence and can use that as a region of homology for recombination. In this example, the recombination event insert the hybrid vector sequenc between regions A10 and B10 (Fig. 5). However, there is no homology sufficient to facilitate recombination that joins regions A5 and B5 (Fig. 5). Thus, a third vector is added that has selectable markers between sequences homologous to A5 and B5. 25 1002001 The chloroplast genome is reassembled by combining all three vectors and transforming S. cerevisiae using the lithium acetate or spheroplast methods as described in EXAMPLE 2. Homologous recombination takes place in vivo in the transformed yeast cells. Once the target DNA is created via homologous recombination, the DNA can be stably replicated in both yeast and bacterial systems. As indicated in Fig. 15, the third vector contains the selectable markers URA3 and ADE2. Therefore, the yeast cells are plated on tryptophan-free medium to screen for positive 30 transformants. Transformed strains growing on the uracil and/or adenine-free medium are screened for the presence of target DNA inserts in the gap filling plasmid. Such screening can be by any known method in the art, for example, restriction enzyme digestion, PCR and/or gel electrophoresis. 1002011 Yeast transformants screened by PCR using primers specific for chloroplast genome DNA (SEQ IDs 89 and 90). Desired clones are those that give a PCR product of expected size. DNA is prepared from the isolated 35 yeast clones and transformed into bacteria. Bacterial transformants are PCR screened using primers specific for chloroplast genome DNA (SEQ IDs 89 and 90). Desired clones are those that give a PCR product of expected size. DNA is prepared from the isolated bacterial clones and analyzed by restriction digest with EcoRI. Example 14: Vectors and methods to remove regions chloroplast genome DNA in an exogenous host 1002021 To generate vectors that remove regions of chloroplast genome DNA, 800-1000 bp regions of chloroplast 40 DNA are amplified using PCR primer pairs that anneal to 5' and 3' regions flanking the sites indicated in Fig. 4 (site 1-5', SEQ ID NOs. 1079 and 1080; site 1-3', SEQ ID NOs. 1125 and 1126; site 3-5', SEQ ID NOs. 1083 and 1084; site 3-3', SEQ ID NOs. 1129 and 1130; site 4-5', SEQ ID NOs. 1087 and 1088; site 4-3', SEQ ID NOs. 1133 and 38 1134; site 5-5', SEQ ID NOs. 789 and 790; site 5-3', SEQ ID NOs. 787 and 788; and site 7-5', SEQ ID NOs. 1091 and 1092; site 7-3', SEQ ID NOs. 1089 and 1090). Pairs of PCR products from the 5' and 3' regions for different sites are digested with NotI and I-SceI, mixed, and ligated to NotI-digested pUC-SE (SEQ ID NO. 2). Pairs of yeast selection markers (described in EXAMPLE 4) are cloned between the 5' and 3' fragments using Sall. 5 1002031 Regions of chloroplast genome DNA are removed by transforming the deletion vector into yeast that harbor the captured DNA. For transformation, all modification vectors were linearized by digestion with NotI. Yeast harboring the desired clone are grown to saturation in CSM-Trp media at 30C, then diluted 1:20 in YPAD and grown for 4 hours at 30'C. Yeast were transformed using the lithium-acetate method and transformants were selected for by growth on CSM-Ura media. Transformants were propagated on CSM-Trp-Ura media and then PCR 10 screened using primers specific for the targeted region and a region of the chloroplast genome DNA not targeted by the deletion vector. Desired clones are those that give not product for the targeted region, but do give a PCR product of expected size for the untargeted area. DNA is prepared from the isolated yeast clones and transformed into bacteria. Bacteria PCR screened using primers specific for the targeted region and a region of the chloroplast genome DNA not targeted by the deletion vector. Desired clones are those that give not product for the targeted 15 region, but do give a PCR product of expected size for the untargeted area. DNA is prepared from the isolated bacterial clones and analyzed by restriction digest 1002041 Various modifications, processes, as well as numerous structures that may be applicable herein will be apparent. Various aspects, features or embodiments may have been explained or described in relation to understandings, beliefs, theories, underlying assumptions, and/or working or prophetic examples, although it will be 20 understood that any particular understanding, belief, theory, underlying assumption, and/or working or prophetic example is not limiting. Although the various aspects and features may have been described with respect to various embodiments and specific examples herein, it will be understood that any of same is not limiting with respect to the full scope of the appended claims or other claims that may be associated with this application. 39

Claims (30)

1. An assembled chloroplast genome comprising all chloroplast genes necessary for photosynthesis, a yeast DNA replication element, a bacterial DNA replication element, and a yeast stability element comprising at least one yeast selection marker, wherein the assembled chloroplast genome is a circular construct.
2. A plurality of vectors suitable for assembling into a chloroplast genome, said plurality of vectors collectively comprising all chloroplast genes necessary for photosynthesis; wherein each vector in said plurality of vectors comprises two regions for homologous recombination with one or two other vectors in said plurality of vectors such that said vector can be assembled into a circular construct; wherein at least one vector of said plurality of vectors comprises a yeast DNA replication element, at least one vector of said plurality of vectors comprises a bacterial DNA replication element; and wherein at least one vector of said plurality of vectors comprises a yeast stability element comprising at least one yeast selection marker and said at least one vector comprising a yeast stability element does not contain a yeast DNA replication element.
3. The plurality of vectors according to claim 2, wherein said plurality of vectors comprises two vectors and each of said vectors comprise two areas for recombination with the other vector.
4. The assembled chloroplast genome according to claim 1 or the plurality of vectors according to claims 2 or 3, wherein: a) said yeast DNA replication element is a yeast centromere, a yeast autonomously replicating sequence, or both; or b) said bacterial DNA replication element is a P1 replication sequence or an F factor replication sequence.
5. The assembled chloroplast genome or the plurality of vectors according to any one of claims 1 to 4, wherein said assembled chloroplast genome comprises a single yeast DNA replication element.
6. The plurality of vectors according to any one of claims 2 to 5, wherein said yeast DNA replication element and said bacterial DNA replication element are contained in the same vector. 40 1000643322
7. The assembled chloroplast genome according to any one of claims 1, 4 or 5, wherein said assembled chloroplast genome contains at least 140kb of DNA.
8. The plurality of vectors according to any one of claims 2 to 6, wherein at least one of said vectors comprises chemically synthesized DNA.
9. The assembled chloroplast genome according to any one of claims 1, 4, 5 or 7, wherein the assembled chloroplast genome comprises: a) chemically synthesized DNA; or b) genomic chloroplast DNA from a photosynthetic organism.
10. The plurality of vectors according to any one of claims 2 to 6 and 8, wherein at least one of said vectors comprises genomic chloroplast DNA from a photosynthetic organism.
11. The assembled chloroplast genome according to claim 9 or the plurality of vectors according to claim 10, wherein said photosynthetic organism is an alga.
12. The assembled chloroplast genome or the plurality of vectors according to claim 11, wherein said alga is a macroalga or a microalga.
13. The assembled chloroplast genome or the plurality of vectors according to claim 12, wherein said microalgae is a Chlamydomanas sp., Chlorella sp., Dunaliella sp., Haematococcus sp. or Scenedesmus sp.
14. The assembled chloroplast genome according to claim 9 or the plurality of vectors according to claim 10, wherein said genomic chloroplast DNA from a photosynthetic organism comprises at least one modification.
15. The assembled chloroplast genome or the plurality of vectors according to claim 14, wherein said at least one modification is insertion of a heterologous or homologous polynucleotide, deletion of one or more nucleic acid bases, insertion of one or more nucleic acid bases, mutation of one or more nucleic acid bases, rearrangement of one or more polynucleotides, or any combination thereof.
16. The assembled chloroplast genome or the plurality of vectors according to any one of claims 1 to 15, wherein said yeast stability element comprises a pair of yeast selection markers. 41 1000643322
17. The assembled chloroplast genome or the plurality of vectors according to claim 16, wherein: a) said pair of yeast selection markers is a URA3 gene and an additional yeast selection marker; or b) said pair of yeast selection markers is a URA3 gene and an additional yeast selection marker chosen from the group of ADE, LEU, HIS, LYS or G418.
18. The plurality of vectors according to claim 17, wherein for each vector comprising a yeast stability element, said yeast stability element comprises a different pair of yeast selection markers.
19. The assembled chloroplast genome according to any one of claims I to 18, wherein said assembled genome further comprises at least one nucleotide sequence for the production of a terpene, a terpenoid, a fatty acid, or a biomass degrading enzyme.
20. A host cell comprising the assembled chloroplast genome according to claims 9, 11 to 17, and 19.
21. The host cell of claim 20, wherein said host cell: is a) a eukaryote; b) is a yeast; c) is an alga; d) is a microalga; e) is a Chlamydomanas sp., Chlorella sp., Dunaliella sp., Haematococcus sp. or Scenedesmus sp.; or f) was not photosynthetic prior to introduction of the assembled chloroplast genome.
22. A method for producing a product of interest in a photosynthetic organism comprising: obtaining the plurality of vectors according to claim 2; wherein at least one of said vectors comprises at least one heterologous polynucleotide encoding said product of interest; introducing said plurality of vectors into yeast to assemble said plurality of vectors by homologous recombination to produce an assembled chloroplast genome; selecting for complete assembly of said chloroplast genome by selecting for the presence of said at least one yeast stability element; introducing said assembled chloroplast genome into said photosynthetic organism; and expressing said at least one heterologous polynucleotide to produce said product of interest. 42 1000643322
23. The method according to claim 22, wherein said plurality of vectors comprises two vectors and each of said vectors comprise two areas for recombination with the other vector.
24. The method according to claim 22 or 23, wherein: a) said yeast DNA replication element is a yeast centromere, a yeast autonomously replicating sequence, or both; b) said bacterial DNA replication element is a P1 replication sequence or an F factor replication sequence; c) said assembled chloroplast genome comprises a single yeast DNA replication element; or d) said yeast DNA replication element and said bacterial DNA replication element are contained in the same vector.
25. The method according to any one of claims 22 to 24, wherein: a) at least one of said vectors comprises chemically synthesized DNA; b) at least one of said vectors comprises genomic chloroplast DNA; or c) said yeast stability element comprises a pair of yeast selection markers.
26. The method according to claim 25, wherein: a) said pair of yeast selection markers is a URA3 gene and an additional yeast selection marker; or b) said pair of yeast selection markers is a URA3 gene and an additional yeast selection marker chosen from the group of ADE, LEU, HIS, LYS or G418.
27. The method according to claim 26, wherein for each vector comprising a yeast stability element, said yeast stability element comprises a different pair of yeast selection markers.
28. The method according to any one of claims 22 to 27, wherein said photosynthetic organism is: a) a eukaryote; b) an alga; c) a microalga; d) a Chlamydomanas sp., Chlorella sp., Dunaliella sp., Haematococcus sp. or Scenedesmus sp.; e) a prokaryote; or f) a cyanobacterium.
29. The method according to any one of claims 22 to 28, wherein said product of interest is a) a terpene or a terpenoid, b) a fatty acid, c) a biomass degrading enzyme, or d) is not naturally produced by said photosynthetic organism.
30. An assembled chloroplast genome according to claim 1, substantially as hereinbefore described. 43
AU2013203045A 2007-10-05 2013-04-03 System for capturing and modifying large pieces of genomic DNA and constructing organisms with synthetic chloroplasts Ceased AU2013203045B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2013203045A AU2013203045B2 (en) 2007-10-05 2013-04-03 System for capturing and modifying large pieces of genomic DNA and constructing organisms with synthetic chloroplasts

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US60/978,024 2007-10-05
AU2008307471A AU2008307471B2 (en) 2007-10-05 2008-10-06 System for capturing and modifying large pieces of genomic DNA and constructing organisms with synthetic chloroplasts
AU2013203045A AU2013203045B2 (en) 2007-10-05 2013-04-03 System for capturing and modifying large pieces of genomic DNA and constructing organisms with synthetic chloroplasts

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
AU2008307471A Division AU2008307471B2 (en) 2007-10-05 2008-10-06 System for capturing and modifying large pieces of genomic DNA and constructing organisms with synthetic chloroplasts

Publications (2)

Publication Number Publication Date
AU2013203045A1 AU2013203045A1 (en) 2013-05-02
AU2013203045B2 true AU2013203045B2 (en) 2014-06-19

Family

ID=48481339

Family Applications (1)

Application Number Title Priority Date Filing Date
AU2013203045A Ceased AU2013203045B2 (en) 2007-10-05 2013-04-03 System for capturing and modifying large pieces of genomic DNA and constructing organisms with synthetic chloroplasts

Country Status (1)

Country Link
AU (1) AU2013203045B2 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113025556B (en) * 2021-04-26 2023-08-29 东北师范大学 Separation method of wheat protoplast, cytoplasm and chloroplast

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2005108568A1 (en) * 2004-05-10 2005-11-17 Basf Plant Science Gmbh Methods for assembling multiple expression constructs

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2005108568A1 (en) * 2004-05-10 2005-11-17 Basf Plant Science Gmbh Methods for assembling multiple expression constructs

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Gupta M et al, Plant Molecular Biology, 1991, 17(3):361-369 *

Also Published As

Publication number Publication date
AU2013203045A1 (en) 2013-05-02

Similar Documents

Publication Publication Date Title
AU2008307471B2 (en) System for capturing and modifying large pieces of genomic DNA and constructing organisms with synthetic chloroplasts
AU2009302889B2 (en) System for capturing and modifying large pieces of genomic DNA and constructing organisms with synthetic chloroplasts
RU2666916C2 (en) Constructed by engineering techniques platform (etip) for transgene embedding for genetic targeting and trait stacking
NZ581472A (en) High throughput screening of genetically modified photosynthetic organisms
Fuentes et al. Experimental reconstruction of the functional transfer of intron-containing plastid genes to the nucleus
Kindle Nuclear transformation: technology and applications
MX2007014611A (en) Post-transcriptional regulation of gene expression.
US10793867B2 (en) Methods for targeted transgene-integration using custom site-specific DNA recombinases
US20090269816A1 (en) System for capturing and modifying large pieces of genomic DNA and constructing organisms with synthetic chloroplasts
Tungsuchat-Huang et al. Visual spectinomycin resistance (aadA au) gene for facile identification of transplastomic sectors in tobacco leaves
CA2405364A1 (en) Transgenic plants
WO2017123772A1 (en) Glyphosate tolerant plants having modified 5-enolpyruvylshikimate-3-phosphate synthase gene regulation
AU2013203045B2 (en) System for capturing and modifying large pieces of genomic DNA and constructing organisms with synthetic chloroplasts
US20100050301A1 (en) System for capturing and modifying large pieces of genomic dna and constructing vascular plants with synthetic chloroplast genomes
US20150344895A1 (en) Methods of modifying algal cell genomes
Mireau et al. Expression of Barstar as a selectable marker in yeast mitochondria
US11549121B2 (en) Episomal DNA vectors for plant genetic engineering
WO2023247773A1 (en) Induction of meiotic recombination using a crispr system
Robson The construction of an expression vector for the transformation of the grape chloroplast genome
Pootakham et al. Reverse genetics in Chlamydomonas: a platform for isolating insertional mutants
Halfter Transgenic Arabidopsis
Cai et al. Zinc finger nuclease-mediated gene targeting in plants
Yang et al. Development of chemical inducible deletion technologies for citrus marker-free transformation

Legal Events

Date Code Title Description
FGA Letters patent sealed or granted (standard patent)
MK14 Patent ceased section 143(a) (annual fees not paid) or expired