WO2002016944A2 - Synthetic nucleic acid molecule compositions and methods of preparation - Google Patents
Synthetic nucleic acid molecule compositions and methods of preparation Download PDFInfo
- Publication number
- WO2002016944A2 WO2002016944A2 PCT/US2001/026566 US0126566W WO0216944A2 WO 2002016944 A2 WO2002016944 A2 WO 2002016944A2 US 0126566 W US0126566 W US 0126566W WO 0216944 A2 WO0216944 A2 WO 0216944A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- nucleic acid
- acid molecule
- synthetic nucleic
- synthetic
- codons
- Prior art date
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0069—Oxidoreductases (1.) acting on single donors with incorporation of molecular oxygen, i.e. oxygenases (1.13)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/67—General methods for enhancing the expression
Definitions
- RNA molecule Transcription, the synthesis of an RNA molecule from a sequence of DNA is the first step in gene expression.
- Sequences which regulate DNA transcription include promoter sequences, polyadenylation signals, transcription factor binding sites and enhancer elements.
- a promoter is a DNA sequence capable of specific initiation of transcription and consists of three general regions.
- the core promoter is the sequence where the RNA polymerase and its co factors bind to the DNA.
- proximal promoter Immediately upstream of the core promoter is the proximal promoter which contains several transcription factor binding sites that are responsible for the assembly of an activation complex that in turn recruits the polymerase complex.
- the distal promoter located further upstream of the proximal promoter also contains transcription factor binding sites.
- Enhancers are regulatory regions, containing multiple transcription factor binding sites, that can significantly increase the level of transcription from a responsive promoter regardless of the enhancer' s orientation and distance with respect to the promoter as long as the enhancer and promoter are located within the same DNA molecule.
- the amount of transcript produced from a gene may also be regulated by a post-transcriptional mechanism, the most important being RNA splicing that removes intervening sequences (introns) from a primary transcript between splice donor and splice acceptor sequences.
- Natural selection is the hypothesis that genotype-environment interactions occurring at the phenotypic level lead to differential reproductive success of individuals and therefore to modification of the gene pool of a population.
- Some properties of nucleic acid molecules that are acted upon by natural selection include codon usage frequency, RNA secondary structure, the efficiency of intron splicing, and interactions with transcription factors or other nucleic acid binding proteins. Because of the degenerate nature of the genetic code, these properties can be optimized by natural selection without altering the corresponding amino acid sequence.
- altering codon usage may, in turn, result in the unintentional introduction into a synthetic nucleic acid molecule of inappropriate transcription regulatory sequences. This may adversely effect transcription, resulting in anomalous expression of the synthetic DNA.
- Anomalous expression is defined as departure from normal or expected levels of expression. For example, transcription factor binding sites located downstream from a promoter have been demonstrated to effect promoter activity (Michael et al, 1990; Lamb et al., 1998; Johnson et al., 1998; Jones et al., 1997).
- an enhancer element to exert activity and result in elevated levels of DNA transcription in the absence of a promoter sequence or for the presence of transcription regulatory sequences to increase the basal levels of gene expression in the absence of a promoter sequence.
- a method for making synthetic nucleic acid molecules with altered codon usage without also introducing inappropriate or unintended transcription regulatory sequences for expression in a particular host cell.
- the invention provides a synthetic nucleic acid molecule comprising at least 300 nucleotides of a coding region for a polypeptide, having a codon composition differing at more than 25%> of the codons from a wild type nucleic acid sequence encoding a polypeptide, and having at least 3-fold fewer, preferably at least 5-fold fewer, transcription regulatory sequences than would result if the differing codons were randomly selected.
- the synthetic nucleic acid molecule encodes a polypeptide that has an amino acid sequence that is at least 85%, preferably 90%, and most preferably 95% or 99% identical to the amino acid sequence of the naturally-occurring (native or wild type) polypeptide (protein) from which it is derived.
- the amino acid sequence identity is over at least 100 contiguous amino acid residues.
- the codons in the synthetic nucleic acid molecule that differ preferably encode the same amino acids as the corresponding codons in the wild type nucleic acid sequence.
- the transcription regulatory sequences which are reduced in the synthetic nucleic acid molecule include, but are not limited to, any combination of transcription factor binding sequences, intron splice sites, poly(A) addition sites, enhancer sequences and promoter sequences. Transcription regulatory sequences are well known in the art.
- the synthetic nucleic acid molecule of the invention has a codon composition that differs from that of the wild type nucleic acid sequence at more than 30%, 35%, 40% or more than 45%, e.g., 50%, 55%, 60% or more of the codons.
- Preferred codons for use in the invention are those which are employed more frequently than at least one other codon for the same amino acid in a particular organism and, more preferably, are also not low-usage codons in that organism and are not low-usage codons in the organism used to clone or screen for the expression of the synthetic nucleic acid molecule (for example, E. coli).
- preferred codons for certain amino acids may include two or more codons that are employed more frequently than the other (non-preferred) codon(s).
- the presence of codons in the synthetic nucleic acid molecule that are employed more frequently in one organism than in another organism results in a synthetic nucleic acid molecule which, when introduced into the cells of the organism that employs those codons more frequentiy, is expressed in those cells at a level that is greater than the expression of the wild type or parent nucleic acid sequence in those cells.
- the synthetic nucleic acid molecule of the invention is expressed at a level that is at least about 110%, e.g., 150%), 200%, 500% or more (1000%, 5000%, or 10000%) of that of the wild type nucleic acid sequence in a cell or cell extract under identical conditions (such as cell culture conditions, vector backbone, and the like).
- the codons that are different are those employed more frequently in a mammal, while in another embodiment the codons that are different are those employed more frequently in a plant.
- a particular type of mammal e.g., human
- a particular type of plant may have a different set of preferred codons than another type of plant.
- the majority of the codons which differ are ones that are preferred codons in a desired host cell.
- Preferred codons for mammals (e.g., humans) and plants are known to the art (e.g., Wada et al., 1990).
- preferred human codons include, but are not limited to, CGC (Arg), CTG (Leu), TCT (Ser), AGC (Ser), ACC (Thr), CCA (Pro), CCT (Pro), GCC (Ala), GGC (Gly), GTG (Val), ATC (He), ATT (lie), AAG (Lys), AAC (Asn), CAG (Gin), CAC (His), GAG (Glu), GAC (Asp), TAC (Tyr), TGC (Cys) and TTC (Phe) (Wada et al., 1990).
- preferred "humanized" synthetic nucleic acid molecules of the invention have a codon composition which differs from a wild type nucleic acid sequence by having an increased number of the preferred human codons, e.g. CGC, CTG, TCT, AGC, ACC, CCA, CCT, GCC, GGC, GTG, ATC, ATT, AAG, AAC, CAG, CAC, GAG, GAC, TAC, TGC, TTC, or any combination thereof.
- the synthetic nucleic acid molecule of the invention may have an increased number of CTG or TTG leucine-encoding codons, GTG or GTC valine-encoding codons, GGC or GGT glycine-encoding codons, ATC or ATT isoleucine-encoding codons, CCA or CCT proline- encoding codons, CGC or CGT arginine-encoding codons, AGC or TCT serine- encoding codons, ACC or ACT threonine-encoding codon, GCC or GCT alanine-encoding codons, or any combination thereof, relative to the wild type nucleic acid sequence.
- synthetic nucleic acid molecules having an increased number of codons that are employed more frequently in plants have a codon composition which differs from a wild type or parent nucleic acid sequence by having an increased number of the plant codons including, but not limited to, CGC (Arg), CTT (Leu), TCT (Ser), TCC (Ser), ACC (Thr), CCA (Pro), CCT (Pro), GCT (Ser), GGA (Gly), GTG (Val), ATC (He), ATT (He),
- Preferred codons may differ for different types of plants (Wada et al., 1990).
- the choice of codon may be influenced by many factors such as, for example, the desire to have an increased number of nucleotide substitutions or decreased number of transcription regulatory sequences. Under some circumstances (e.g.
- codon pairs are selected based upon the largest number of mismatched bases, as well as the criteria described above.
- a synthetic nucleic acid molecule of the invention may encode a selectable marker protein or a reporter molecule.
- the invention applies to any gene and is not limited to synthetic reporter genes or synthetic selectable marker genes.
- the synthetic nucleic acid molecule encodes a luciferase having a codon composition different than that of a wild type or parent Renilla luciferase or a beetle luciferase nucleic acid sequence.
- a synthetic click beetle luciferase nucleic acid molecule of the invention may optionally encode the amino acid valine at position 224 (i.e., it emits green light), or may optionally encode the amino acid histidine at position 224, histidine at position 247, isoleucine at position 346, glutamine at position 348 or combination thereof (i.e., it emits red light).
- Preferred synthetic luciferase nucleic acid molecules that are related to a wild type Renilla luciferase nucleic acid sequence include, but are not limited to, SEQ ID NO:21 (Rlucver2) or SEQ ID NO:22 (Rluc-final).
- Preferred synthetic luciferase nucleic acid molecules that are related to click beetle luciferase nucleic acid sequences include, but are not limited to, SEQ ID NO:7 (GRver5), SEQ ID NO:8 (GR6), SEQ ID NO:9 (GRver5.1), SEQ TD NO:14 (RDver5), SEQ ID NO:15 (RD7), SEQ ID NO:16 (RDver5.1), SEQ ID NO:17 (RDver5.2) or SEQ LD NO:18 (RD156-1H9).
- the invention also provides an expression cassette.
- the expression cassette of the invention comprises a synthetic nucleic acid molecule of the invention operatively linked to a promoter that is functional in a cell.
- Preferred promoters are those functional in mammalian cells and those functional in plant cells.
- the expression cassette may include other sequences, e.g., restriction enzyme recognition sequences and a Kozak sequence, and be a part of a larger polynucleotide molecule such as a plasmid, cosmid, artificial chromosome or vector, e.g., a viral vector.
- a host cell comprising the synthetic nucleic acid molecule of the invention, an isolated polypeptide (e.g., a fusion polypeptide encoded by the synthetic nucleic acid molecule of the invention), and compositions and kits comprising the synthetic nucleic acid molecule of the invention or the polypeptide encoded thereby in suitable container means and, optionally, instruction means.
- Preferred isolated polypeptides include, but are not limited to, those comprising SEQ ID NO:31 (GRver5.1), SEQ ID NO:226 (Rluc-final), or SEQ ID NO:223 (RD156-1H9).
- the invention also provides a method to prepare a synthetic nucleic acid molecule of the invention by genetically altering a parent (either a wild type or another synthetic) nucleic acid sequence.
- the method may be used to prepare a synthetic nucleic acid molecule encoding a polypeptide comprising at least 100 amino acids.
- One embodiment of the invention is directed to the preparation of synthetic genes encoding reporter or selectable marker proteins.
- the method of the invention may be employed to alter the codon usage frequency and decrease the number of transcription regulatory sequences in any open reading frame or to decrease the number of transcription regulatory sites in a vector backbone.
- the codon usage frequency in the synthetic nucleic acid molecule is altered to reflect that of the host organism desired for expression of that nucleic acid molecule while also decreasing the number of potential transcription regulatory sequences relative to the parent nucleic acid molecule.
- the invention provides a method to prepare a synthetic nucleic acid molecule comprising an open reading frame. The method comprises altering (e.g., decreasing or eliminating) a plurality of transcription regulatory sequences in a parent (wild type or a synthetic) nucleic acid sequence that encodes a polypeptide having at least 100 amino acids to yield a synthetic nucleic acid molecule which has a decreased number of transcription regulatory sequences and which preferably encodes the same amino acids as the parent nucleic acid molecule.
- the transcription regulatory sequences are selected from the group consisting of transcription factor binding sequences, intron splice sites, poly(A) addition sites, enhancer sequences and promoter sequences, and the resulting synthetic nucleic acid molecule has at least 3-fold fewer, preferably 5-fold fewer, transcription regulatory sequences relative to the parent nucleic acid sequence.
- the method also comprises altering greater than 25% of the codons in the synthetic nucleic acid sequence which has a decreased number of transcription regulatory sequences to yield a further synthetic nucleic acid molecule, wherein the codons that are altered encode the same amino acids as those in the corresponding position in the synthetic nucleic acid molecule which has a decreased number of transcription regulatory sequences and/or in the parent nucleic acid sequence.
- the codons which are altered do not result in an increase in transcriptional regulatory sequences.
- the further synthetic nucleic acid molecule encodes a polypeptide that has at least 85%>, preferably 90%, and most preferably 95% or 99%> contiguous amino acid sequence identity to the amino acid sequence of the polypeptide encoded by the parent nucleic acid sequence.
- the method comprises altering greater than 25%o of the codons in a parent nucleic acid sequence which encodes a polypeptide having at least 100 amino acids to yield a codon-altered synthetic nucleic acid molecule, wherein the codons that are altered encode the same amino acids as those present in the corresponding positions in the parent nucleic acid sequence. Then, a plurality of transcription regulatory sequences in the codon-altered synthetic nucleic acid molecule are altered to yield a further synthetic nucleic acid molecule. Preferably, the codons which are altered do not result in an increase in transcriptional regulatory sequences.
- the further synthetic nucleic acid molecule encodes a polypeptide that has at least 85%>, preferably 90%, and most preferably 95% or 99% contiguous amino acid sequence identity to the amino acid sequence of the polypeptide encoded by the parent nucleic acid sequence.
- a synthetic (including a further synthetic) nucleic acid molecule prepared by the methods of the invention.
- luciferase As described hereinbelow, the methods of the invention were employed with click beetle luciferase and Renilla luciferase nucleic acid sequences. While both of these nucleic acid molecules encode luciferase proteins, they are from entirely different families and are widely separated evolutionarily. These proteins have unrelated amino acid sequences, protein stractures, and they utilize dissimilar chemical substrates. The fact that they share the name "luciferase" should not be inte ⁇ reted to mean that they are from the same family, or even largely similar families.
- the methods produced synthetic luciferase nucleic acid molecules which exhibited significantly enhanced levels of mammalian expression without negatively effecting other desirable physical or biochemical properties (including protein half-life) and which were also largely devoid of known transcription regulatory elements.
- the invention also provides at least two synthetic nucleic acid molecules that encode highly related polypeptides, but which synthetic nucleic acid molecules have an increased number of nucleotide differences relative to each other. These differences decrease the recombination frequency between the two synthetic nucleic acid molecules when those molecules are both present in a cell (i.e., they are "codon distinct” versions of a synthetic nucleic acid molecule).
- the invention provides a method for preparing at least two synthetic nucleic acid molecules that are codon distinct versions of a parent nucleic acid sequence that encodes a polypeptide.
- the method comprises altering a parent nucleic acid sequence to yield a first synthetic nucleic acid molecule having an increased number of a first plurality of codons that are employed more frequently in a selected host cell relative to the number of those codons present in the parent nucleic acid sequence.
- the first synthetic nucleic acid molecule also has a decreased number of transcription regulatory sequences relative to the parent nucleic acid sequence.
- the parent nucleic acid sequence is also altered to yield a second synthetic nucleic acid molecule having an increased number of a second plurality of codons that are employed more frequently in the host cell relative to the number of those codons in the parent nucleic acid sequence, wherein the first plurality of codons is different than the second plurality of codons, and wherein the first and the second synthetic nucleic acid molecules preferably encode the same polypeptide.
- the second synthetic nucleic acid molecule has a decreased number of franscription regulatory sequences relative to the parent nucleic acid sequence. Either or both synthetic molecules can then be further modified.
- the present invention has applications with many genes and across many fields of science including, but not limited to, life science research, agrigenetics, genetic therapy, developmental science and pharmaceutical development.
- FIG. 1 A nucleotide sequence comparison of a yellow-green (YG) click beetle luciferase nucleic acid sequence (YG #81-6G01; SEQ ID NO:2) and various synthetic green (GR) click beetle luciferase nucleic acid sequences (GRverl, SEQ ID NO:3; GRver2, SEQ ID NO:4; GRver3, SEQ ID NO:5; GRver4, SEQ ID NO:6; GRver5, SEQ ID NO:7; GR6, SEQ ID NO:8; GRver5.1, SEQ ID NO:9) and various red (RD) click beetle luciferase nucleic acid sequences (RDverl, SEQ ID NO:10; RDver2, SEQ ID NO:ll; RDver3, SEQ ID NO: 12; RDver4, SEQ LD NO: 13; RDver5, SEQ ID NO: 14; RD7, SEQ LD NO:15; RDver5.1, SEQ ID NO
- FIG. 3 An amino acid sequence comparison of a YG click beetle luciferase amino acid sequence (YG#81-6G01, SEQ ID NO:24) and various synthetic GR click beetle luciferase amino acid sequences (GRverl, SEQ ID NO:25; GRver2, SEQ ID NO:26; GRver3, SEQ ID NO:27; GRver4, SEQ ID NO:28; GRver5, SEQ ID NO:29; GR6, SEQ ID NO:30; GRver5.1, SEQ ID NO:31) and various red (RD) click beetle luciferase amino acid sequences (RDverl, SEQ ID NO:32; RDver2, SEQ ID NO:33; RDver3, SEQ ID NO:34; RDver4, SEQ ID NO:218; RDver5, SEQ ID NO:219; RD7, SEQ ID NO:220; RDver5.1, SEQ ID NO:221; RDver5.2, SEQ ID
- AU amino acid sequences are inferred from the corresponding nucleotide sequence.
- the amino acids enclosed in boxes are amino acids that differ from the amino acid present at the homologous position in SEQ ID NO:24.
- Figure 4. Codon usage in YG#81-6G01, GRverl, RDverl, GRver5, and RDver5, and humans (HUM) and relative codon usage in YG#81-6G01, GRver5, RDver5, and humans.
- Codon usage summaries for YG#81-6G01 ( Figure 5 A), and GR/RD synthetic nucleic acid sequences, GRverl ( Figure 5B), RDverl (Figure 5C), GRver2 (Figure 5D), RDver2 (Figure 5E), GRver3 ( Figure 5F), RDver3 ( Figure 5G), GRver4 ( Figure 5H), RDver4 ( Figure 51), GRver5 ( Figure 5J), RDver5 (5K).
- FIG. 7 A nucleotide sequence comparison of a wild type Renilla reniformis luciferase nucleic acid sequence Genbank Accession No. M63501 (RELLUC, SEQ ID NO : 19) and various synthetic Renilla luciferase nucleic acid sequences (Rlucverl, SEQ ID NO:20; Rlucver2, SEQ ID NO:21; Rluc-final, SEQ ID NO:22).
- the nucleotides enclosed in boxes are nucleotides that differ from the nucleotide present at the homologous position in SEQ ID NO: 19.
- FIG. 8 An amino acid sequence comparison of a wild type Renilla reniformis luciferase amino acid sequence (RELLUC, SEQ ID NO:224) and various synthetic Renilla reniformis luciferase amino acid sequences (Rlucverl, SEQ ID NO:225; Rlucver2, SEQ LD NO:226; Rluc-final, SEQ ID NO:227).
- AU amino acid sequences are inferred from the corresponding nucleotide sequence.
- the amino acids enclosed in boxes are amino acids that differ from the amino acid present at the homologous position in SEQ ID NO:224.
- Figure 9 Codon usage in wild-type (A) versus synthetic (B) Renilla luciferase genes. For codon usage in selected organisms, see, e.g., Wada et al., 1990; Sha ⁇ et al., 1988; Aota et al., 1988; and Sha ⁇ et al., 1987, and for plant codons, Murray et al. 1989.
- Figure 10. Oligonucleotides employed to prepare synthetic Renilla luciferase gene (SEQ ID Nos. 246-292).
- FIG. 11 A nucleotide sequence comparison of a wild type yellow- green (YG) click beetle luciferase nucleic acid sequence (LUCPPLYG, SEQ ID NO:l) and the synthetic green click beetle luciferase nucleic acid sequences (GRver5.1, SEQ ID NO:9) and the synthetic red click beetle luciferase nucleic acid sequences (RD156-1H9, SEQ LD NO:18).
- the nucleotides enclosed in boxes are nucleotides ti at differ from the nucleotide present at the homologous position in SEQ ID NO:l.
- Both synthetic sequences have a codon composition that differs from LUCPPLYG at more than 25% of the codons and have at least 3-fold fewer transcription regulatory sequences relative to a random selection of codons at the codons which differ.
- Figure 12. An amino acid sequence comparison of a wild type YG click beetle luciferase amino acid sequence (LUCPPLYG, SEQ ID NO:23) and the synthetic GR click beetle luciferase amino acid sequences (GRver5.1, SEQ ID NO: 31) and the red (RD) click beetle luciferase amino acid sequences (RD156- 1H9, SEQ JJD NO:223). All amino acid sequences are inferred from the corresponding nucleotide sequence. The amino acids enclosed in boxes are amino acids that differ from the amino acid present at the homologous position in SEQ ID NO:23.
- Figure 13 pRL vector series. All of the vectors contain the Renilla wild type or synthetic gene as further described herein.
- Figure 13 A illustrates the Renilla luciferase gene in the pGL3 vectors (Promega Co ⁇ .)
- Figure 13B illustrates the Renilla luciferase co-reporter vector series.
- pRL-TK has the he ⁇ es simplex viras (HSV) tk promoter; pRL-SV40 has the SV40 virus early enhancer/promoter; pRL-CMV has the cytomegalovirus (CMV) enhancer and immediate early promoter; pRL-nuU has MCS (multiple cloning sites) but no promoter or enhancer; pRL-TK(Int " ) has HS V/tk promoter without an intron that is present in the other plasmids; pR-GL3B has the pGL-3 Basic backbone (Promega Co ⁇ .); pR-GL3 TK has the pGL3-Basic backbone with an HSV tk promoter.
- HSV simplex viras
- pRL-SV40 has the SV40 virus early enhancer/promoter
- pRL-CMV has the cytomegalovirus (CMV) enhancer and immediate early promoter
- FIG. 16 High expression from a synthetic Renilla nucleic acid sequence reduces the risk of promoter interference in a co-transfection assay.
- CHO cells were co-transfected with a constant amount (50 ng) of firefly luciferase expression vector (pGL3 control vector, with SV40 promoter and enhancer; Luc+) and a pRL vector having a native (0 ng, 50 ng, 100 ng, 500 ng, 1 ⁇ g or 2 ⁇ g) or synthetic (0 ng, 5 ng, 10 ng, 50 ng, 100 ng or 200 ng) Renilla luciferase gene.
- Figures 17A-B Illustrates the reactions catalyzed by firefly and click beetle (17A), and Renilla (17B) luciferases.
- FIG. 18 Nucleotide and inferred amino acid sequence of click beetle luciferases in GL3 vectors (GRver5.1 in pGL3, SEQ ID NO:297 encoding SEQ ID NO:298; RDver5.1 in pGL3, SEQ ID NO:299 encoding SEQ ID NO:300; and RD156-1H9 in pGL3, SEQ ID NO:301 encoding SEQ ID NO:302).
- an oligonucleotide having an Nco I site at the initiation codon was employed, which resulted in an amino acid substitution at position 2 to valine.
- gene refers to a DNA sequence that comprises coding sequences necessary for the production of a polypeptide or protein precursor.
- the polypeptide can be encoded by a full length coding sequence or by any portion of the coding sequence, as long as the desired protein activity is retained.
- nucleic acid is a covalently linked sequence of nucleotides in which the 3' position of the pentose of one nucleotide is joined by a phosphodiester group to the 5' position of the pentose of the next, and in which the nucleotide residues (bases) are linked in specific sequence, i.e., a linear order of nucleotides.
- a "polynucleotide”, as used herein, is a nucleic acid containing a sequence that is greater than about 100 nucleotides in length.
- An “oligonucleotide”, as used herein, is a short polynucleotide or a portion of a polynucleotide.
- oligonucleotide typically contains a sequence of about two to about one hundred bases.
- the word "oligo” is sometimes used in place of the word “oligonucleotide”.
- Nucleic acid molecules are said to have a "5'-terminus” (5' end) and a
- a terminal nucleotide is the nucleotide at the end position of the 3'- or 5'-terminus.
- DNA molecules are said to have "5' ends” and "3' ends” because mononucleotides are reacted to make oligonucleotides in a manner such that the 5' phosphate of one mononucleotide pentose ring is attached to the 3' oxygen of its neighbor in one direction via a phosphodiester linkage. Therefore, an end of an oligonucleotides referred to as the "5' end” if its 5' phosphate is not linked to the 3' oxygen of a mononucleotide pentose ring and as the "3' end” if its 3' oxygen is not linked to a 5' phosphate of a subsequent mononucleotide pentose ring.
- a nucleic acid sequence even if internal to a larger oligonucleotide or polynucleotide, also may be said to have 5' and 3' ends.
- discrete elements are referred to as being "upstream” or 5' of the "downstream” or 3' elements. This terminology reflects the fact that transcription proceeds in a 5' to 3' fashion along the DNA strand.
- promoter and enhancer elements that direct transcription of a linked gene are generally located 5' or upstream of the coding region. However, enhancer elements can exert their effect even when located 3' of the promoter element and the coding region. Transcription termination and polyadenylation signals are located 3' or downstream of the coding region.
- codon is a basic genetic coding unit, consisting of a sequence of three nucleotides that specify a particular amino acid to be inco ⁇ oration into a polypeptide chain, or a start or stop signal.
- Figure 1 contains a codon table.
- coding region when used in reference to structural gene refers to the nucleotide sequences that encode the amino acids found in the nascent polypeptide as a result of translation of a mRNA molecule.
- the coding region is bounded on the 5' side by the nucleotide triplet "ATG” which encodes the initiator methionine and on the 3' side by a stop codon (e.g., TAA, TAG, TGA).
- ATG nucleotide triplet
- TTG stop codon
- protein and “polypeptide” is meant any chain of amino acids, regardless of length or post-translational modification (e.g., glycosylation or phosphorylation).
- the synthetic genes of the invention may also encode a variant of a naturally-occurring protein or polypeptide fragment thereof.
- a protein polypeptide has an amino acid sequence that is at least 85%, preferably 90%, and most preferably 95% or 99% identical to the amino acid sequence of the naturally-occurring (native) protem from which it is derived.
- Polypeptide molecules are said to have an "amino terminus” (N-terminus) and a “carboxy terminus” (C-terminus) because peptide linkages occur between the backbone amino group of a first amino acid residue and the backbone carboxyl group of a second amino acid residue.
- N-terminal and C-terminal in reference to polypeptide sequences refer to regions of polypeptides including portions of the N-terminal and C-terminal regions of the polypeptide, respectively.
- a sequence that includes a portion of the N-terminal region of polypeptide includes amino acids predominantly from the N-terminal half of the polypeptide chain, but is not limited to such sequences.
- an N-terminal sequence may include an interior portion of the polypeptide sequence including bases from both the N-terminal and C-terminal halves of the polypeptide.
- N-terminal and C-terminal regions may, but need not, include the amino acid defining the ultimate N-terminus and C-terminus of the polypeptide, respectively.
- wild type refers to a gene or gene product that has the characteristics of that gene or gene product isolated from a naturally occurring source.
- a wild type gene is that which is most frequently observed in a population and is thus arbitrarily designated the "wild type” form of the gene.
- mutant refers to a gene or gene product that displays modifications in sequence and/or functional properties (i.e., altered characteristics) when compared to the wild type gene or gene product. It is noted that naturally-occurring mutants can be isolated; these are identified by the fact that they have altered characteristics when compared to the wild type gene or gene product.
- complementarity are used in reference to a sequence of nucleotides related by the base-pairing rales. For example, for the sequence 5' "A-G-T” 3', is complementary to the sequence 3' "T-C-A” 5'. Complementarity may be “partial,” in which only some of the nucleic acids' bases are matched according to the base pairing rules. Or, there may be “complete” or “total” complementarity between the nucleic acids. The degree of complementarity between nucleic acid strands has significant effects on the efficiency and strength of hybridization between nucleic acid strands. This is of particular importance in amplification reactions, as well as detection methods which depend upon hybridization of nucleic acids.
- recombinant protein or "recombinant polypeptide” as used herein refers to a protein molecule expressed from a recombinant DNA molecule.
- native protein is used herein to indicate a protein isolated from a naturally occurring (i.e., a nonrecombinant) source.
- fusion protein and “fusion partner” refer to a chimeric protein containing the protein of interest (e.g., luciferase) joined to an exogenous protein fragment (e.g., a fusion partner which consists of a non-luciferase protein).
- the fusion partner may enhance the solubility of protein as expressed in a host cell, may, for example, provide an affinity tag to allow purification of the recombinant fusion protein from the host cell or culture supernatant, or both. If desired, the fusion partner may be removed from the protein of interest by a variety of enzymatic or chemical means known to the art.
- the terms "cell,” “cell line,” “host cell,” as used herein, are used interchangeably, and all such designations include progeny or potential progeny of these designations.
- transformed cell is meant a cell into which (or into an ancestor of which) has been introduced a DNA molecule comprising a synthetic gene.
- a synthetic gene of the invention may be introduced into a suitable cell line so as to create a stably-fransfected cell line capable of producing the protein or polypeptide encoded by the synthetic gene.
- Vectors , cells, and methods for constructing such cell lines are well known in the art, e.g. in Ausubel, et al. (infra).
- the words "transformants" or “transformed cells” include the primary transformed cells derived from the originally transformed cell without regard to the number of transfers.
- AU progeny may not be precisely identical in DNA content, due to deliberate or inadvertent mutations. Nonetheless, mutant progeny that have the same functionality as screened for in the originally transformed cell are included in the definition of transformants.
- Nucleic acids are known to contain different types of mutations. A
- point mutation refers to an alteration in the sequence of a nucleotide at a single base position from the wild type sequence. Mutations may also refer to insertion or deletion of one or more bases, so that the nucleic acid sequence differs from the wild-type sequence.
- the term "homology” refers to a degree of complementarity. There may be partial homology or complete homology (i.e., identity). Homology is often measured using sequence analysis software (e.g., Sequence Analysis Software Package of the Genetics Computer Group. University of Wisconsin Biotechnology Center. 1710 University Avenue. Madison, WI 53705). Such software matches similar sequences by assigning degrees of homology to various substitutions, deletions, insertions, and other modifications.
- Conservative substitutions typically include substitutions within the following groups: glycine, alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid, asparagine, glutamine; serine, threonine; lysine, arginine; and phenylalanine, tyrosine.
- a “partially complementary” sequence is one that at least partially inhibits a completely complementary sequence from hybridizing to a target nucleic acid is referred to using the functional term "substantially homologous.”
- the inhibition of hybridization of the completely complementary sequence to the target sequence may be examined using a hybridization assay (Southern or Northern blot, solution hybridization and the like) under conditions of low stringency.
- a substantially homologous sequence or probe will compete for and inhibit the binding (i.e., the hybridization) of a completely homologous to a target under conditions of low stringency. This is not to say that conditions of low stringency are such that non-specific binding is permitted; low stringency conditions require that the binding of two sequences to one another be a specific (i.e., selective) interaction.
- the absence of non-specific binding may be tested by the use of a second target which lacks even a partial degree of complementarity (e.g., less than about 30%> identity). In this case, in the absence of non-specific binding, the probe will not hybridize to the second non-complementary target.
- a second target which lacks even a partial degree of complementarity (e.g., less than about 30%> identity).
- the probe will not hybridize to the second non-complementary target.
- substantially homologous refers to any probe which can hybridize to either or both strands of the double-stranded nucleic acid sequence under conditions of low stringency as described herein.
- Probe refers to an oligonucleotide designed to be sufficiently complementary to a sequence in a denatured nucleic acid to be probed (in relation to its length) to be bound under selected stringency conditions.
- Hybridization and “binding” in the context of probes and denature melted nucleic acid are used interchangeably.
- Probes which are hybridized or bound to denatured nucleic acid are base paired to complementary sequences in the polynucleotide. Whether or not a particular probe remains base paired with the polynucleotide depends on the degree of complementarity, the length of the probe, and the stringency of the binding conditions. The higher the stringency, the higher must be the degree of complementarity and/or the longer the probe.
- hybridization is used in reference to the pairing of complementary nucleic acid strands.
- Hybridization and the strength of hybridization is impacted by many factors well known in the art including the degree of complementarity between the nucleic acids, stringency of the conditions involved affected by such conditions as the concentration of salts, the Tm (melting temperature) of the formed hybrid, the presence of other components (e.g., the presence or absence of polyethylene glycol), the molarity of the hybridizing strands and the G:C content of the nucleic acid strands.
- stringency is used in reference to the conditions of temperature, ionic strength, and the presence of other compounds, under which nucleic acid hybridizations are conducted. With “high stringency” conditions, nucleic acid base pairing will occur only between nucleic acid fragments that have a high frequency of complementary base sequences. Thus, conditions of “medium” or “low” stringency are often required when it is desired that nucleic acids which are not completely complementary to one another be hybridized or annealed together. The art knows well that numerous equivalent conditions can be employed to comprise medium or low stringency conditions.
- hybridization conditions are generally evident to one skilled in the art and is usually guided by the pu ⁇ ose of the hybridization, the type of hybridization (DNA-DNA or DNA-RNA), and the level of desired relatedness between the sequences (e.g., Sambrook et al, 1989; Nucleic Acid Hybridization, A Practical Approach, IRL Press, Washington D.C., 1985, for a general discussion of the methods).
- hybridization stringency can be used to maximize or minimize stability of such duplexes.
- Hybridization stringency can be altered by: adjusting the temperature of hybridization; adjusting the percentage of helix destabilizing agents, such as formamide, in the hybridization mix; and adjusting the temperature and/or salt concentration of the wash solutions.
- the final stringency of hybridizations often is determined by the salt concentration and/or temperature used for the post-hybridization washes.
- High stringency conditions when used in reference to nucleic acid hybridization comprise conditions equivalent to binding or hybridization at 42°C in a solution consisting of 5X SSPE (43.8 g/1 NaCl, 6.9 g/1 NaH 2 PO 4 H 2 O and 1.85 g/1 EDTA, pH adjusted to 7.4 withNaOH), 0.5% SDS, 5X Denhardt's reagent and 100 ⁇ g/ml denatured salmon sperm DNA followed by washing in a solution comprising 0.1X SSPE, 1.0% SDS at 42°C when a probe of about 500 nucleotides in length is employed.
- 5X SSPE 43.8 g/1 NaCl, 6.9 g/1 NaH 2 PO 4 H 2 O and 1.85 g/1 EDTA, pH adjusted to 7.4 withNaOH
- SDS 5X Denhardt's reagent
- 100 ⁇ g/ml denatured salmon sperm DNA followed by washing in a solution comprising 0.1X SSPE, 1.0% SDS at 42
- “Medium stringency conditions” when used in reference to nucleic acid hybridization comprise conditions equivalent to binding or hybridization at 42°C in a solution consisting of 5X SSPE (43.8 g/1 NaCl, 6.9 g/1 NaH 2 PO 4 H 2 O and 1.85 g/1 EDTA, pH adjusted to 7.4 with NaOH), 0.5% SDS, 5X Denhardt's reagent and 100 ⁇ g/ml denatured salmon sperm DNA followed by washing in a solution comprising 1.0X SSPE, 1.0% SDS at 42°C when a probe of about 500 nucleotides in length is employed.
- Low stringency conditions comprise conditions equivalent to binding or hybridization at 42°C in a solution consisting of 5X SSPE (43.8 g/1 NaCl, 6.9 g/1 NaH 2 PO 4 H 2 O and 1.85 g/1 EDTA, pH adjusted to 7.4 with NaOH), 0.1% SDS, 5X Denhardt's reagent [50X Denhardt's contains per 500 ml: 5 g Ficoll (Type 400, Pharmacia), 5 g BSA (Fraction V; Sigma)] and 100 g/ml denatured salmon sperm DNA followed by washing in a solution comprising 5X SSPE, 0.1%) SDS at 42°C when a probe of about 500 nucleotides in length is employed.
- 5X SSPE 43.8 g/1 NaCl, 6.9 g/1 NaH 2 PO 4 H 2 O and 1.85 g/1 EDTA, pH adjusted to 7.4 with NaOH
- SDS 5X Denhardt's reagent
- T m is used in reference to the "melting temperature".
- the melting temperature is the temperature at which 50%> of a population of double-stranded nucleic acid molecules becomes dissociated into single strands.
- the equation for calculating the T m of nucleic acids is well-known in the art.
- the Tm of a hybrid nucleic acid is often estimated using a formula adopted from hybridization assays in 1 M salt, and commonly used for calculating Tm for PCR primers: [(number of A + T) x 2°C + (number of G+C) x 4°C]. (C.R. Newton et al., PCR, 2nd Ed., Springer- Verlag (New York, 1997), p. 24).
- T m 81.5 + 0.41(%> G + C), when a nucleic acid is in aqueous solution at 1 M NaCl.
- T m 81.5 + 0.41(%> G + C)
- Anderson and Young Quantitative Filter Hybridization, in Nucleic Acid Hybridization, 1985.
- Other more sophisticated computations exist in the art which take structural as well as sequence characteristics into account for the calculation of T m .
- a calculated T m is merely an estimate; the optimum temperature is commonly determined empirically.
- isolated when used in relation to a nucleic acid, as in “isolated oligonucleotide” or “isolated polynucleotide” refers to a nucleic acid sequence that is identified and separated from at least one contaminant with which it is ordinarily associated in its source. Thus, an isolated nucleic acid is present in a form or setting that is different from that in which it is found in nature. In contrast, non-isolated nucleic acids (e.g., DNA and RNA) are found in the state they exist in nature.
- isolated nucleic acid e.g., DNA and RNA
- a given DNA sequence e.g., a gene
- RNA sequences e.g., a specific mRNA sequence encoding a specific protein
- isolated nucleic acid includes, by way of example, such nucleic acid in cells ordinarily expressing that nucleic acid where the nucleic acid is in a chromosomal location different from that of natural cells, or is otherwise flanked by a different nucleic acid sequence than that found in nature.
- the isolated nucleic acid or oligonucleotide may be present in single-stranded or double-stranded form.
- the oligonucleotide When an isolated nucleic acid or oligonucleotide is to be utilized to express a protein, the oligonucleotide contains at a minimum, the sense or coding strand (i.e., the oligonucleotide may single-stranded), but may contain both the sense and anti-sense strands (i.e., the oligonucleotide may be double-stranded) .
- isolated when used in relation to a polypeptide, as in “isolated protein” or “isolated polypeptide” refers to a polypeptide that is identified and separated from at least one contaminant with which it is ordinarily associated in its source. Thus, an isolated polypeptide is present in a form or setting that is different from that in which it is found in nature. In contrast, non-isolated polypeptides (e.g., proteins and enzymes) are found in the state they exist in nature.
- purified or “to purify” means the result of any process that removes some of a contaminant from the component of interest, such as a protein or nucleic acid. The percent of a purified component is thereby increased in the sample.
- operably linked refers to the linkage of nucleic acid sequences in such a manner that a nucleic acid molecule capable of directing the transcription of a given gene and/or the synthesis of a desired protein molecule is produced.
- the term also refers to the linkage of sequences encoding amino acids in such a manner that a functional (e.g., enzymatically active, capable of binding to a binding partner, capable of inhibiting, etc.) protein or polypeptide is produced.
- recombinant DNA molecule means a hybrid DNA sequence comprising at least two nucleotide sequences not normally found together in nature.
- Prokaryotic expression vectors include a promoter, a ribosome binding site, an origin of replication for autonomous replication in a host cell and possibly other sequences, e.g. an optional operator sequence, optional restriction enzyme sites.
- a promoter is defined as a DNA sequence that directs RNA polymerase to bind to DNA and to initiate RNA synthesis.
- Eukaryotic expression vectors include a promoter, optionally a polyadenlyation signal and optionally an enhancer sequence.
- a polynucleotide having a nucleotide sequence encoding a gene means a nucleic acid sequence comprising the coding region of a gene, or in other words the nucleic acid sequence which encodes a gene product.
- the coding region may be present in either a cDNA, genomic DNA or RNA form.
- the oligonucleotide may be single-stranded (i.e., the sense strand) or double-stranded.
- Suitable control elements such as enhancers/promoters, splice junctions, polyadenylation signals, etc.
- the coding region utilized in the expression vectors of the present invention may contain endogenous enhancers/promoters, splice junctions, intervening sequences, polyadenylation signals, etc.
- the coding region may contain a combination of both endogenous and exogenous control elements.
- transcription regulatory element refers to a genetic element or sequence that controls some aspect of the expression of nucleic acid sequence(s).
- a promoter is a regulatory element that facilitates the initiation of transcription of an operably linked coding region.
- Other regulatory elements include, but are not limited to, transcription factor binding sites, splicing signals, polyadenylation signals, termination signals and enhancer elements.
- Transcriptional control signals in eukaryotes comprise "promoter" and
- Promoters and enhancers consist of short arrays of DNA sequences that interact specifically with cellular proteins involved in transcription (Maniatis et al., 1987). Promoter and enhancer elements have been isolated from a variety of eukaryotic sources including genes in yeast, insect and mammalian cells. Promoter and enhancer elements have also been isolated from virases and analogous control elements, such as promoters, are also found in prokaryotes. The selection of a particular promoter and enhancer depends on the cell type used to express the protein of interest.
- Some eukaryotic promoters and enhancers have a broad host range while others are functional in a limited subset of cell types (for review, see Voss et al., 1986; and Maniatis et al., 1987.
- the SV40 early gene enhancer is very active in a wide variety of cell types from many mammalian species and has been widely used for the expression of proteins in mammalian cells (Dijkema et al., 1985).
- Two other examples of promoter/enhancer elements active in a broad range of mammalian cell types are those from the human elongation factor 1 gene (Uetsuki et al.,
- Rous sarcoma viras Gorman et al., 1982; and the human cytomegalovirus (Boshart et al., 1985).
- promoter/enhancer denotes a segment of DNA containing sequences capable of providing both promoter and enhancer functions (i.e., the functions provided by a promoter element and an enhancer element as described above).
- promoter/promoter may be "endogenous” or “exogenous” or “heterologous.”
- An “endogenous” enhancer/promoter is one that is naturally linked with a given gene in the genome.
- an “exogenous” or “heterologous” enhancer/promoter is one that is placed in juxtaposition to a gene by means of genetic manipulation (i.e., molecular biological techniques) such that transcription of the gene is directed by the linked enhancer/promoter.
- Splicing signals mediate the removal of introns from the primary RNA transcript and consist of a splice donor and acceptor site (Sambrook, et al., Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory Press, New York , 1989, pp. 16.7-16.8).
- a commonly used splice donor and acceptor site is the splice junction from the 16S RNA of SV40. Efficient expression of recombinant DNA sequences in eukaryotic cells requires expression of signals directing the efficient termination and polyadenylation of the resulting transcript.
- Transcription termination signals are generally found downstream of the polyadenylation signal and are a few hundred nucleotides in length.
- the term "poly(A) site” or "poly(A) sequence” as used herein denotes a DNA sequence which directs both the termination and polyadenylation of the nascent RNA transcript. Efficient polyadenylation of the recombinant transcript is desirable, as transcripts lacking a poly(A) tail are ⁇ unstable and are rapidly degraded.
- the poly(A) signal utilized in an expression vector may be "heterologous" or "endogenous.” An endogenous poly(A) signal is one that is found naturally at the 3' end of the coding region of a given gene in the genome.
- a heterologous poly(A) signal is one which has been isolated from one gene and positioned 3' to another gene.
- a commonly used heterologous poly(A) signal is the SV40 poly(A) signal.
- the SV40 poly(A) signal is contained on a 237 bp BamH T/Bcl I restriction fragment and directs both termination and polyadenylation (Sambrook, supra, at 16.6-16.7).
- Eukaryotic expression vectors may also contain "viral replicons "or "viral origins of replication.”
- Viral replicons are viral DNA sequences which allow for the extrachromosomal replication of a vector in a host cell expressing the appropriate replication factors.
- Vectors containing either the SV40 or polyoma viras origin of replication replicate to high copy number (up to 10 4 copies/cell) in cells that express the appropriate viral T antigen.
- vectors containing the replicons from bovine papillomavirus or Epstein-Barr virus replicate extrachromosomally at low copy number (about 100 copies/cell).
- in vitro refers to an artificial environment and to processes or reactions that occur within an artificial environment. In vitro environments include, but are not limited to, test tubes and cell lysates.
- in situ refers to cell culture.
- in vivo refers to the natural environment (e.g., an animal or a cell) and to processes or reaction that occur within a natural environment.
- expression system refers to any assay or system for determining (e.g., detecting) the expression of a gene of interest.
- Those skilled in the field of molecular biology will understand that any of a wide variety of expression systems may be used.
- a wide range of suitable mammalian cells are available from a wide range of source (e.g., the American Type Culture Collection, Rockland, MD).
- the method of transformation or transfection and the choice of expression vehicle will depend on the host system selected. Transformation and transfection methods are described, e.g., in Ausubel, et al., Current Protocols in Molecular Biology. John Wiley & Sons, New York. 1992.
- Expression systems include in vitro gene expression assays where a gene of interest (e.g., a reporter gene) is linked to a regulatory sequence and the expression of the gene is monitored following treatment with an agent that inhibits or induces expression of the gene. Detection of gene expression can be through any suitable means including, but not limited to, detection of expressed mRNA or protein (e.g., a detectable product of a reporter gene) or through a detectable change in the phenotype of a cell expressing the gene of interest. Expression systems may also comprise assays where a cleavage event or other nucleic acid or cellular change is detected.
- a gene of interest e.g., a reporter gene
- Detection of gene expression can be through any suitable means including, but not limited to, detection of expressed mRNA or protein (e.g., a detectable product of a reporter gene) or through a detectable change in the phenotype of a cell expressing the gene of interest.
- Expression systems may also comprise assay
- enzyme refers to molecules or molecule aggregates that are responsible for catalyzing chemical and biological reactions. Such molecules are typically proteins, but can also comprise short peptides, RNAs, ribozymes, antibodies, and other molecules. A molecule that catalyzes chemical and biological reactions is referred to as “having enzyme activity” or “having catalytic activity.” AU amino acid residues identified herein are in the natural
- sequence homology means the proportion of base matches between two nucleic acid sequences or the proportion of amino acid matches between two amino acid sequences.
- sequence homology is expressed as a percentage, e.g., 50%>, the percentage denotes the proportion of matches over the length of sequence from one sequence that is compared to some other sequence. Gaps (in either of the two sequences) are permitted to maximize matching; gap lengths of 15 bases or less are usually used, 6 bases or less are preferred with 2 bases or less more preferred.
- the sequence homology between the target nucleic acid and the oligonucleotide sequence is generally not less than 17 target base matches out of 20 possible oligonucleotide base pair matches (85%); preferably not less than 9 matches out of 10 possible base pair matches (90%), and more preferably not less than 19 matches out of 20 possible base pair matches (95%>).
- Two amino acid sequences are homologous if there is a partial or complete identity between their sequences. For example, 85% homology means that 85%o of the amino acids are identical when the two sequences are aligned for maximum matching. Gaps (in either of the two sequences being matched) are allowed in maximizing matching; gap lengths of 5 or less are preferred with 2 or less being more preferred.
- two protein sequences or polypeptide sequences derived from them of at least 100 amino acids in length
- sequence relationships between two or more polynucleotides are used to describe the sequence relationships between two or more polynucleotides: “reference sequence”, “comparison window”, “sequence identity”, “percentage of sequence identity”, and
- a "reference sequence” is a defined sequence used as a basis for a sequence comparison; a reference sequence may be a subset of a larger sequence, for example, as a segment of a full-length cDNA or gene sequence given in a sequence listing, or may comprise a complete cDNA or gene sequence. Generally, a reference sequence is at least 20 nucleotides in length, frequently at least 25 nucleotides in length, and often at least 50 nucleotides in length.
- two polynucleotides may each (1) comprise a sequence (i.e., a portion of the complete polynucleotide sequence) that is similar between the two polynucleotides, and (2) may further comprise a sequence that is divergent between the two polynucleotides
- sequence comparisons between two (or more) polynucleotides are typically performed by comparing sequences of the two polynucleotides over a "comparison window" to identify and compare local regions of sequence similarity.
- a “comparison window”, as used herein, refers to a conceptual segment of at least 20 contiguous nucleotides and wherein the portion of the polynucleotide sequence in the comparison window may comprise additions or deletions (i.e., gaps) of 20 percent or less as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences.
- Methods of alignment of sequences for comparison are well known in the art. Thus, the determination of percent identity between any two sequences can be accomplished using a mathematical algorithm.
- Such implementations include, but are not limited to: CLUSTAL in the PC/Gene program (available from Intelligenetics, Mountain View, California); the ALIGN program (Version 2.0) and GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Version 8 (available from Genetics Computer Group (GCG), 575 Science Drive, Madison, Wisconsin, USA). Alignments using these programs can be performed using the default parameters.
- the CLUSTAL program is well described by Higgins et al. (1988); Higgins et al. (1989); Co ⁇ et et al. (1988); Huang et al. (1992); and Pearson et al. (1994).
- the ALIGN program is based on the algorithm of Myers and Miller, supra.
- BLAST programs of Altschul et al. (1990), are based on the algorithm of Karlin and Altschul supra.
- Gapped BLAST in BLAST 2.0
- PSI-BLAST in BLAST 2.0
- the default parameters of the respective programs e.g. BLASTN for nucleotide sequences, BLASTX for proteins
- Alignment may also be performed manually by inspection.
- sequence identity means that two polynucleotide sequences are identical (i.e., on a nucleotide-by-nucleotide basis) over the window of comparison.
- percentage of sequence identity means that two polynucleotide sequences are identical (i.e., on a nucleotide-by-nucleotide basis) for the stated proportion of nucleotides over the window of comparison.
- percentage of sequence identity is calculated by comparing two optimally aligned sequences over the window of comparison, determining the number of positions at which the identical nucleic acid base (e.g., A, T, C, G, U, or I) occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison (i.e., the window size), and multiplying the result by 100 to yield the percentage of sequence identity.
- the identical nucleic acid base e.g., A, T, C, G, U, or I
- substantially identical denote a characteristic of a polynucleotide sequence, wherein the polynucleotide comprises a sequence that has at least 60%, preferably at least 65%>, more preferably at least 70%, up to about 85%, and even more preferably at least 90 to 95%, more usually at least 99%, sequence identity as compared to a reference sequence over a comparison window of at least 20 nucleotide positions, frequently over a window of at least 20-50 nucleotides, and preferably at least 300 nucleotides, wherein the percentage of sequence identity is calculated by comparing the reference sequence to the polynucleotide sequence which may include deletions or additions which total 20 percent or less of the reference sequence over the window of comparison.
- the reference sequence may be a subset of a larger sequence.
- substantially identical means that two peptide sequences, when optimally aligned, such as by the programs GAP or BESTFIT using default gap weights, share at least about 85% sequence identity, preferably at least about 90% sequence identity, more preferably at least about 95 % sequence identity, and most preferably at least about 99 % sequence identity.
- the invention provides compositions comprising synthetic nucleic acid molecules, as well as methods for preparing those molecules which yield synthetic nucleic acid molecules that are efficiently expressed as a polypeptide or protem with desirable characteristics including reduced inappropriate or unintended transcription characteristics when expressed in a particular cell type.
- Natural selection is the hypothesis that genotype-environment interactions occurring at the phenotypic level lead to differential reproductive success of individuals and hence to modification of the gene pool of a population. It is generally accepted that the amino acid sequence of a protein found in nature has undergone optimization by natural selection. However, amino acids exist within the sequence of a protein that do not contribute significantly to the activity of the protein and these amino acids can be changed to other amino acids with little or no consequence.
- a protein may be useful outside its natural environment or for pu ⁇ oses that differ from the conditions of its natural selection. In these circumstances, the amino acid sequence can be synthetically altered to better adapt the protein for its utility in various applications.
- nucleic acid sequence that encodes a protein is also optimized by natural selection.
- the relationship between coding DNA and its transcribed RNA is such that any change to the DNA affects the resulting RNA.
- natural selection works on both molecules simultaneously.
- this relationship does not exist between nucleic acids and proteins. Because multiple codons encode the same amino acid, many different nucleotide sequences can encode an identical protein. A specific protein composed of 500 amino acids can theoretically be encoded by more than 10 150 different nucleic acid sequences.
- Natural selection acts on nucleic acids to achieve proper encoding of the corresponding protein. Presumably, other properties of nucleic acid molecules are also acted upon by natural selection. These properties include codon usage frequency, RNA secondary stracture, the efficiency of intron splicing, and interactions with transcription factors or other nucleic acid binding proteins. These other properties may alter the efficiency of protein translation and the resulting phenotype. Because of the redundant nature of the genetic code, these other attributes can be optimized by natural selection without altering the corresponding amino acid sequence.
- reporter genes Because of the need for evolutionary distance, the codon usage of reporter genes often does not correspond to the optimal codon usage of the experimental cells. Examples include ⁇ -galactosidase -gal) and chloramphenicol acetyltransferase (cat) reporter genes that are derived from E. coli and are commonly used in mammalian cells; the ⁇ -glucuronidase (gus) reporter gene that is derived from E.
- reporter genes are usually selected from organisms having unique and distinctive phenotypes. Consequently, these organisms often have widely separated evolutionary histories from the experimental host cells.
- a useful synthetic reporter gene of the invention has a minimal risk of influencing or perturbing intrinsic transcriptional characteristics of the host cell because the structure of that gene has been altered.
- a particularly useful synthetic reporter gene will have desirable characteristics under a new set and/or a wide variety of experimental conditions. To best achieve these characteristics, the structure of the synthetic gene should have minimal potential for interacting with transcription factors within a broad range of host cells and physiological conditions.
- a reporter gene comprising a native nucleotide sequence, based on a genomic or cDNA clone from the original host organism, may interact with transcription factors when expressed in an exogenous host. This risk stems from two circumstances. First, the native nucleotide sequence contains sequences that were optimized through natural selection to influence gene transcription within the native host organism.
- nucleotide sequence might inadvertently interact with transcription factors that were not present in the native host organism, and thus did not participate in its natural selection. The probability of such inadvertent interactions increases with greater evolutionary separation between the experimental cells and the native organism of the reporter gene.
- the invention provides a method for preparing synthetic nucleic acid sequences that reduce the risk of undesirable interactions of the nucleic acid with transcription factors when expressed in a particular host cell, thereby reducing inappropriate or unintended transcriptional characteristics.
- the method yields synthetic genes containing improved codon usage frequencies for a particular host cell and with a reduced occurrence of transcription factor binding sites.
- the invention also provides a method of preparing synthetic genes containing improved codon usage frequencies with a reduced occurrence of transcription factor binding sites and additional beneficial structural attributes.
- additional attributes include the absence of inappropriate RNA splicing junctions, poly(A) addition signals, undesirable restriction sites, ribosomal binding sites, and secondary structural motifs such as hahpin loops.
- the two synthetic genes have a reduced ability to hybridize to a common polynucleotide probe sequence, or have a reduced risk of recombining when present together in living cells.
- PCR amplification of the reporter sequences using primers complementary to flanking sequences and sequencing of the amplified sequences may be employed.
- preferred codons have a relatively high codon usage frequency in a selected host cell, and their introduction results in the introduction of relatively few transcription factor binding sites, relatively few other undesirable structural attributes, and optionally a characteristic that distinguishes the synthetic gene from another gene encoding a highly similar protein.
- the synthetic nucleic acid product obtained by the method of the invention is a synthetic gene with improved level of expression due to improved codon usage frequency, a reduced risk of inappropriate transcriptional behavior due to a reduced number of undesirable transcription regulatory sequences, and optionally any additional characteristic due to other criteria that may be employed to select the synthetic sequence.
- the invention may be employed with any nucleic acid sequence, e.g., a native sequence such as a cDNA or one which has been manipulated in vitro, e.g., to introduce specific alterations such as the introduction or removal of a restriction enzyme recognition site, the alteration of a codon to encode a different amino acid or to encode a fusion protein, or to alter GC or AT content (% of composition) of nucleic acid molecules.
- a native sequence such as a cDNA or one which has been manipulated in vitro, e.g., to introduce specific alterations such as the introduction or removal of a restriction enzyme recognition site, the alteration of a codon to encode a different amino acid or to encode a fusion protein, or to alter GC or AT content (% of composition) of nucleic acid molecules.
- the method of the invention is useful with any gene, but particularly useful for reporter genes as well as other genes associated with the expression of reporter genes, such as selectable markers.
- Preferred genes include, but are not limited to, those encoding lactamase ( ⁇ -gal), neomycin resistance (Neo), CAT, GUS, galactopyranoside, GFP, xylosidase, thymidine kinase, arabinosidase and the like.
- a "marker gene” or “reporter gene” is a gene that imparts a distinct phenotype to cells expressing the gene and thus permits cells having the gene to be distinguished from cells that do not have the gene.
- Such genes may encode either a selectable or screenable marker, depending on whether the marker confers a trait which one can 'select' for by chemical means, i.e., through the use of a selective agent (e.g., a herbicide, antibiotic, or the like), or whether it is simply a "reporter" trait that one can identify through observation or testing, i.e., by 'screening'.
- a selective agent e.g., a herbicide, antibiotic, or the like
- reporter simply a "reporter” trait that one can identify through observation or testing, i.e., by 'screening'.
- Exemplary marker genes include, but are not limited to, a neo gene, a ⁇ - gal gene, a gus gene, a cat gene, a gpt gene, a hyg gene, a hisD gene, a ble gene, a mprt gene, a bar gene, a nitrilase gene, a mutant acetolactate synthase gene (ALS) or acetoacid synthase gene (AAS), a methotrexate-resistant dlifr gene, a dalapon dehalogenase gene, a mutated anthranilate synthase gene that confers resistance to 5-methyl tryptophan (WO 97/26366), an R-locus gene, a ⁇ - lactamase gene, a xylE gene, an ⁇ -amylase gene, a tyrosinase gene, a luciferase (luc) gene, (e.g., a Ren
- secretable proteins fall into a number of classes, including small, diffusible proteins detectable, e.g., by ELISA, and proteins that are inserted or trapped in the cell membrane.
- the method of the invention can be performed by, although it is not limited to, a recursive process.
- the process includes assigning preferred codons to each amino acid in a target molecule, e.g., a native nucleotide sequence, based on codon usage in a particular species, identifying potential transcription regulatory sequences such as transcription factor binding sites in the nucleic acid sequence having preferred codons, e.g., using a database of such binding sites, optionally identifying other undesirable sequences, and substituting an alternative codon (i.e., encoding the same amino acid) at positions where undesirable transcription factor binding sites or other sequences occur.
- alternative preferred codons are substituted in each version.
- nucleotide sequence containing a maximum number of preferred codons and a minimum number of undesired sequences including transcription regulatory sequences or other undesirable sequences.
- desired sequences e.g., restriction enzyme recognition sites, can be introduced.
- the method of the invention comprises identifying a target nucleic acid sequence, such as a vector backbone, a reporter gene or a selectable marker gene, and a host cell of interest, for example, a plant (dicot or monocot), fungus, yeast or mammalian cell.
- a host cell of interest for example, a plant (dicot or monocot), fungus, yeast or mammalian cell.
- Preferred host cells are mammalian host cells such as CHO, COS, 293, Hela, CV-1 and NIH3T3 cells. Based on preferred codon usage in the host cell(s) and, optionally, low codon usage in the host cell(s), e.g., high usage mammalian codons and low usage E. coli and mammalian codons, codons to be replaced are determined.
- alternative preferred codons are introduced to each version.
- one preferred codon is introduced to one version and another preferred codon is introduced to the other version.
- the two codons with the largest number of mismatched bases are identified and one is introduced to one version and the other codon is introduced to the other version.
- desired and undesired sequences such as undesired transcriptional regulatory sequences, in the target sequence are identified.
- sequences can be identified using databases and software such as ⁇ PD, NNPD, R ⁇ BAS ⁇ , TRANSFAC, T ⁇ SS, GenePro, MAR (www.ncgr.org/MAR-search) and BCM Gene Finder, further described herein. After the sequences are identified, the modification(s) are introduced.
- a desired synthetic nucleic acid sequence Once a desired synthetic nucleic acid sequence is obtained, it can be prepared by methods well known to the art (such as PCR with overlapping primers), and its structural and functional properties compared to the target nucleic acid sequence, including, but not limited to, percent homology, presence or absence of certain sequences, for example, restriction sites, percent of codons changed (such as an increased or decreased usage of certain codons) and expression rates.
- the method was used to create synthetic reporter genes encoding Renilla reniformis luciferase, and two click beetle luciferases (one emitting green light and the other emitting red light).
- the synthetic genes support much greater levels of expression than the corresponding native or parent genes for the protein.
- the native and parent genes demonstrated anomalous transcription characteristics when expressed in mammalian cells, which were not evident in the synthetic genes.
- basal expression of the native or parent genes is relatively high.
- the expression is induced to very high levels by an enhancer sequence in the absence of known promoters.
- the synthetic genes show lower basal expression and do not show the anomalous enhancer behavior.
- the enhancer is activating transcriptional elements found in the native genes that are absent in the synthetic genes. The results clearly show that the synthetic nucleic acid sequences exhibit superior performance as reporter genes.
- the synthetic genes of the invention preferably encode the same proteins as their native counte ⁇ art (or nearly so), but have improved codon usage while being largely devoid of known transcription regulatory elements in the coding region. (It is recognized that a small number of amino acid changes may be desired to enhance a property of the native counte ⁇ art protein, e.g. to enhance luminescence of a luciferase.) This increases the level of expression of the protein the synthetic gene encodes and reduces the risk of anomalous expression of the protein. For example, studies of many important events of gene regulation, which may be mediated by weak promoters, are limited by insufficient reporter signals from inadequate expression of the reporter proteins.
- the synthetic luciferase genes described herein permit detection of weak promoter activity because of the large increase in level of expression, which enables increased detection sensitivity. Also, the use of some selectable markers may be limited by the expression of that marker in an exogenous cell. Thus, synthetic selectable marker genes which have improved codon usage for that cell, and have a decrease in other undesirable sequences, (e.g., transcription factor binding sites), can permit the use of those markers in cells that otherwise were undesirable as hosts for those markers.
- Promoter crosstalk is another concern when a co-reporter gene is used to normalize transfection efficiencies.
- the amount of DNA containing strong promoters can be reduced, or DNA containing weaker promoters can be employed, to drive the expression of the co- reporter.
- reporter genes in imaging systems, which can be used for in vivo biological studies or drug screening, is another use for the synthetic genes of the invention. Due to their increased level of expression, the protein encoded by a synthetic gene is more readily detectable by an imaging system. In fact, using a synthetic Renilla luciferase gene, luminescence in transfected CHO cells was detected visually without the aid of instrumentation.
- the synthetic genes may be used to express fusion proteins, for example fusions with secretion leader sequences or cellular localization sequences, to study transcription in difficult-to-transfect cells such as primary cells, and/or to improve the analysis of regulatory pathways and genetic elements.
- Other uses include, but are not limited to, the detection of rare events that require extreme sensitivity (e.g., studying RNA recoding), use with IRES, to improve the efficiency of in vitro translation or in vitro transcription-translation coupled systems such as TNT (Promega Co ⁇ ., Madison, WI), study of reporters optimized to different host organisms (e.g., plants, fungus, and the like), use of multiple genes as co-reporters to monitor drag toxicity, as reporter molecules in multiwell assays, and as reporter molecules in drug screening with the advantage of minimizing possible interference of reporter signal by different signal transduction pathways and other regulatory mechanisms.
- nucleic acid molecules of the invention include fluorescence activated cell sorting (FACS), fluorescent microscopy, to detect and/or measure the level of gene expression in vitro and in vivo, (e.g., to determine promoter strength), subceUular localization or targeting (fusion protein), as a marker, in calibration, in a kit, (e.g., for dual assays), for in vivo imaging, to analyze regulatory pathways and genetic elements, and in multi-well formats.
- FACS fluorescence activated cell sorting
- fluorescent microscopy to detect and/or measure the level of gene expression in vitro and in vivo, (e.g., to determine promoter strength), subceUular localization or targeting (fusion protein), as a marker, in calibration, in a kit, (e.g., for dual assays), for in vivo imaging, to analyze regulatory pathways and genetic elements, and in multi-well formats.
- the use of synthetic click beetle luciferases provides advantages such as the measurement of dual reporters.
- Renilla luciferase is better suited for in vivo imaging (because it does not depend on ATP or Mg 2+ for reaction, unlike firefly luciferase, and because coelenterazine is more permeable to the cell membrane than luciferin)
- the synthetic Renilla luciferase gene can be employed in vivo.
- the synthetic Renilla luciferase has improved fidelity and sensitivity in dual luciferase assays, e.g., for biological analysis or in drag screening platform.
- the reporter genes for click beetle luciferase and Renilla luciferase were used to demonstrate the invention because the reaction catalyzed by the protein they encode are significantly easier to quantify than the product of most genes. However, for the pu ⁇ oses of demonstrating the present invention they represent genes in general.
- the click beetle luciferase and Renilla luciferase genes share the name "luciferase”, this should not be inte ⁇ reted to mean that they originate from the same family of genes.
- the two luciferase proteins are evolutionarily distinct; they have fundamentally different traits and physical stractures, they use vastly different substrates ( Figure 17), and they evolved from completely different families of genes.
- the click beetle luciferase is 61 kD in size, uses luciferin as a substrate and evolved from the CoA synthetases.
- the Renilla luciferase originates from the sea pansy Renilla Reniformis, is 35 kD in size, uses coelenterazine as a substrate and evolved from the ⁇ hydrolases.
- the only shared trait of these two enzymes is that the reaction they catalyze results in light output. They are no more similar for resulting in light output than any other two enzymes would be, for example, simply because the reaction they catalyze results in heat.
- Bioluminescence is the light produced in certain organisms as a result of luciferase-mediated oxidation reactions.
- the luciferase genes e.g., the genes from luminous beetles, sea pansy, and, in particular, the luciferase from Photinus pyralis (the common firefly of North America), are currently the most popular luminescent reporter genes.
- Firefly luciferase and Renilla luciferase are highly valuable as genetic reporters due to the convenience, sensitivity and linear range of the luminescence assay.
- luciferase is used in virtually every type of experimental biological system, including, but not limited to, prokaryotic and eukaryotic cell culture, transgenic plants and animals, and cell-free expression systems.
- the firefly luciferase enzyme is derived from a specific North American beetle, Photinus pyralis.
- the firefly luciferase enzyme and the click beetle luciferase enzyme are monomeric proteins (61 kDa) which generate light through monooxygenation of beetle luciferin utilizing ATP and O (Figure 17A).
- the Renilla luciferase is derived from the sea pansy Renilla reniformis.
- the Renilla luciferase enzyme is a 36 kDa monomeric protein that utilizes O 2 and coelenterazine to generate light (Figure 17B).
- the gene encoding firefly luciferase was cloned from Photinus pyralis, and demonstrated to produce active enzyme in E. coli (de Wet et al., 1987).
- the cDNA encoding firefly luciferase (luc) continues to gain favor as the gene of choice for reporting genetic activity in animal, plant and microbial cells.
- the firefly luciferase reaction modified by the addition of CoA to produce persistent light emission, provides an extremely sensitive and rapid in vitro assay for quantifying firefly luciferase expression in small samples of transfected cells or tissues.
- firefly luciferase or click beetle luciferase as a genetic reporter, extracts of cells expressing the luciferase are mixed with substrates (beetle luciferin, Mg 2+ ATP, and O ), and luminescence is measured immediately.
- substrates beetle luciferin, Mg 2+ ATP, and O
- luminescence is measured immediately.
- the assay is very rapid and sensitive, providing gene expression data with little effort.
- the conventional firefly luciferase assay has been further improved by including coenzyme A in the assay reagent to yield greater enzyme turnover and thus greater luminescence intensity (Promega Luciferase Assay Reagent, Cat.# E1500, Promega Co ⁇ oration, Madison, Wis.).
- luciferase activity can be readily measured in luminometers or scintillation counters. Firefly and click beetle luciferase activity can also be detected in living cells in culture by adding luciferin to the growth medium. This in situ luminescence relies on the ability of beetle luciferin to diffuse through ceUular and peroxisomal membranes and on the intracellular availability of ATP and O 2 in the cytosol and peroxisome.
- reporter genes are widely used to measure franscription events, their utility can be limited by the fidelity and efficiency of reporter expression.
- a firefly luciferase gene (referred to as luc+) was modified to improve the level of luciferase expression. While a higher level of expression was observed, it was not determined that higher expression had improved regulatory control.
- the invention will be further described by the following nonlimiting examples.
- LucPplYG is a wild-type click beetle luciferase that emits yellow-green luminescence (Wood, 1989).
- a mutant of LucPplYG named YG#81-6G01 was envisioned.
- YG#81-6G01 lacks a peroxisome targeting signal, has a lower K M for luciferin and ATP, has increased signal stability and increased temperature stability when compared to the wild type (PCT/WO9914336).
- YG #81-6G01 was mutated to emit green luminescence by changing Ala at position 224 to Val (A224V is a green-shifting mutation), or to emit red luminescence by simultaneously introducing the amino acid substitutions A224H, S247H, N346I, and H348Q (red-shifting mutation set) (PCT/WO9518853)
- YG #81-6G01 a parent gene
- two synthetic gene sequences were designed. One codes for a luciferase emitting green luminescence (GR) and one for a luciferase emitting red luminescence (RD). Both genes were designed to 1) have optimized codon usage for expression in mammalian cells, 2) have a reduced number of transcriptional regulatory sites including mammalian transcription factor binding sites, splice sites, poly(A) addition sites and promoters, as well as prokaryotic (E.
- GR green luminescence
- RD red luminescence
- coli regulatory sites 3) be devoid of unwanted restriction sites, e.g., those which are likely to interfere with standard cloning procedures, and 4) have a low DNA sequence identity compared to each other in order to minimize genetic rearrangements when both are present inside the same cell.
- desired sequences e.g., a Kozak sequence or restriction enzyme recognition sites, may be identified and introduced.
- step 6 S49N, P230S for GR6 and H36Y for RD7 were reversed to create GRver5.1 and RDver5.1.
- RDver5.1 was further modified by changing the arginine codon at position 351 to a glycine codon (R351G) thereby creating RDver5.2 with improved spectral properties compared to RDver5.1.
- 9. RDver5.2 was further mutated to increase luminescence intensity thereby creating RD156-1H9 which encodes four additional amino acid changes (M2I, S349T, K488T, E538V) and three silent single base changes (SEQ ID NO: 18).
- the starting gene sequence for this design step was YG #81-6G01 (SEQ ID NO:
- the strategy was to adapt the codon usage for optimal expression in human cells and at the same time to avoid E. coli low-usage codons. Based on these requirements, the best two codons for expression in human cells for all amino acids with more than two codons were selected (see Wada et al., 1990). In the selection of codon pairs for amino acids with six codons, the selection was biased towards pairs that have the largest number of mismatched bases to allow design of GR and RD genes with minimum sequence identity (codon distinction):
- Gly GGC/GGT Val: GTC/GTG lie: ATC/ATT
- each codon in the two genes was replaced by a codon from the limited list described above in an alternating fashion (e.g., Arg( n ) is CGC in gene 1 and CGT in gene 2, Arg (n+1) is CGT in gene 1 and CGC in gene 2).
- the two output sequences from this first design step were named GRverl (version 1 GR) and RDverl (version 1 RD). Their DNA sequences are 63%> identical (594 mismatches), while the proteins they encode differ only by the 4 amino acids that determine luminescence color (see Figures 2 and 3 for an alignment of the DNA and protein sequences).
- Tables 1 and 2 show, as an example, the codon usage for valine and leucine in human genes, the parent gene YG#81-6G01, the codon-optimized synthetic genes GRverl and RDverl, as well as the final versions of the synthetic genes after completion of step 5 in the design process (GRver5 and RDver5).
- Table 1 Valine
- restriction enzymes were classified as undesired:
- TRANSFAC database http ://transfac. gbf.de/TRANSFAC/index .html holds information on gene regulatory DNA sequences (TF binding sites) and proteins (TFs) that bind to and act through them.
- the SITE table of TRANSFAC Release 3.2 contains 4,401 entries of individual (putative) TF binding sites (including TF binding sites in eukaryotic genes, in artificial sequences resulting from mutagenesis studies and in vitro selection procedures based on random oligonucleotide mixtures or specific theoretical considerations, and consensus binding sequences (from Faisst and Meyer, 1992)).
- the software tool used to locate and display these TF binding sites in the synthetic gene sequences was TESS (Transcription Element Search Software, http://agave.humgen.upenn.edu/tess/index.html).
- TESS Transcription Element Search Software, http://agave.humgen.upenn.edu/tess/index.html.
- the filtered string-based search option was used with the following user-defined search parameters:
- This parameter selection specifies that only mammalian TF binding sites (approximately 1,400 of the 4,401 entries in the database) that are at least 5 bases long will be included in the search. It further specifies that only TF binding sites that have a perfect match in the query sequence and a minimum log likelihood (LLH) score of 10 will be reported.
- the LLH scoring method assigns 2 to an unambiguous match, 1 to a partially ambiguous match (e.g., A or T match W) and 0 to a match against 'N'.
- a lower stringency test was performed at the end of the design process to re-evaluate the search parameters.
- the first search for TF binding sites using the parameters described above found about 100 franscription factor binding sites (hits) for each of the two synthetic genes (GRver2 and RDver2). AU sites were eliminated by changing one or more codons of the synthetic gene sequences in accordance with the codon optimization guidelines described in la above. However, it was expected that some these changes created new TF binding sites, other regulatory sites, and new restriction sites. Thus, steps 2 a-d were repeated as described, and 4 new restriction sites and 2 new splice sites were removed. The two output sequences from this third design step were named GRver3 and RDver3. Their DNA sequences are 66% identical (541 mismatches) (Figs. 2 and 3).
- This fourth step is an iteration of the process described in step 3.
- the search for newly introduced TF binding sites yielded about 50 hits for each of the two synthetic genes.
- AU sites were eliminated by changing one or more codons of the synthetic gene sequences in general accordance with the codon optimization guidelines described in la above. However, more high to medium usage codons were used to allow elimination of all TF binding sites. The lowest priority was placed on maintaining low sequence identity between the GR and RD genes.
- steps 2 a-d were repeated as described.
- the two output sequences from this fourth design step were named GRver4 and RDver4. Their DNA sequences are 68%> identical (506 mismatches) (Figs 2 and 3). 5.
- Remove new transcription factor (TF) binding sites then repeat steps 2 a-d
- the starting gene sequences for this design step were GRver4 and RDver4.
- This fifth step is another iteration of the process described in step 3 above.
- the 5 search for new TF binding sites introduced in step 4 yielded about 20 hits for each of the two synthetic genes.
- AU sites were eliminated by changing one or more codons of the synthetic gene sequences in general accordance with the codon optimization guidelines described in la above. However, more high to medium usage codons were used (these are all considered "preferred") to allow
- the Eukaryotic Promoter Database contains information about reliably mapped transcription start sites (1253 sequences) of eukaryotic genes. This database was searched using BLASTN 1.4.11 with default parameters (optimized to find nearly identical sequences rapidly; see Altschul et al, 1990) at the National Center for Biotechnology Information site
- Both genes are also completely devoid of eukaryotic TF binding sites consisting of more than four unambiguous bases, donor and acceptor splice sites
- GRver5 contains one splice acceptor site), poly(A) addition sites, specific prokaryotic (E. coli) regulatory sequences, and undesired restriction sites.
- the two synthetic genes were constructed by assembly from synthetic oligonucleotides in a thermocycler followed by PCR amplification of the full- length genes (similar to Stemmer et al. (1995) Gene. 164, pp. 49-53). Unintended mutations that interfered with the design goals of the synthetic genes were corrected.
- flanking regions of both genes matched the ends of the amplification primers (pRAMtailup: 5'-gtactgagacgacgccagcccaagcttaggcctgagtg SEQ ID NO:229, and pRAMtaildn: 5'-ggcatgagcgtgaactgactgaactagcggccgccgag SEQ ID NO:230) to allow cloning of the genes into our E. coli expression vector pRAM (WO99/14336).
- AU 183 oligonucleotides were ran through the hahpin analysis of the OLIGO software (OLIGO 4.0 Primer Analysis Software ⁇ 1989-1991 by Wojciech Rychlik) to identify potentially detrimental intra-molecular loop formation.
- the guidelines for evaluating the analysis results were set according to recommendations of Dr. Sims (Sigma-Genosys Custom Gene Synthesis
- oligos forming hai ⁇ ins with ⁇ G ⁇ -10 have to be avoided, those forming hai ⁇ ins with ⁇ G ⁇ -7 involving the 3' end of the oligonucleotide should also be avoided, while those with an overall ⁇ G ⁇ -5 should not pose a problem for this application.
- the analysis identified 23 oligonucleotides able to form hai ⁇ ins with a ⁇ G between -7.1 and -4.9. Of these, 5 had blocked or nearly blocked 3' ends (0-3 free bases) and were re-designed by removing 1-4 bases at their 3' end and adding it to the adjacent oligonucleotide.
- the 40mer oligonucleotide covering the sequence complementary to the poly(A) tail had a very low complexity 3' end (13 consecutive T bases).
- An additional 40mer was designed with a high complexity 3' end but a consequently reduced overlap with one of its complementary oligonucleotides (11 instead of 20 bases) on the opposite strand.
- the oligos were designed for use in a thermocycler-based assembly reaction, they could also be used in a ligation-based protocol for gene construction.
- the oligonucleotides are annealed in a pairwise fashion and the resulting short double-stranded fragments are ligated using the sticky overhangs.
- each of the two synthetic genes was assembled in a separate reaction from 98 oligonucleotides.
- the total volume for each reaction was 50 ⁇ l:
- the assembly reaction contained, in addition to the 98 GR or RD oligonucleotides, a small amount of DNA from the corresponding full-length clones with mutations described above. This allows the oligos to correct mutations present in the templates.
- Cycling conditions 94°C for 30 seconds, then (94°C for 20 seconds, 65°C for 60 seconds and 72°C for 3 minutes) for 30 cycles, then 72°C for 5 minutes.
- the genes obtained from the corrective assembly and amplification step were subcloned into the pRAM vector and expressed in E. coli, yielding 75% luminescent GR or RD clones. Forty-four GR and 44 RD clones were analyzed with our screening robot (WO99/14336). The six best GR and RD clones were manually analyzed and one best GR and RD clone was selected (GR6 and RD7). Sequence analysis of GR6 revealed two point mutations in the coding region, both of which resulted in an amino acid substitution (S49N and P230S). Sequence analysis of RD7 revealed three point mutations in the coding region, one of which resulted in an amino acid substitution (H36Y). It was confirmed that none of the silent point mutations introduced any regulatory or restriction sites conflicting with the overall design criteria for the synthetic genes.
- the unintended amino acid substitutions present in the GR6 and RD7 synthetic genes were reversed by site-directed mutagenesis to match the GRver5 and RDver5 designed sequences, thereby creating GRver5.1 and RDver5.1.
- the DNA sequences of the mutated regions were confirmed by sequence analysis.
- RDver5.1 gene was further modified to improve its spectral properties by introducing an amino change (R351G), thereby creating RDver5.2 pGL3 vectors with RD and GR genes
- - pGL3 -Enhancer SV40 enhancer (3' to luciferase coding sequences)
- the primers employed in the assembly of GR and RD synthetic genes facilitated the cloning ofthose genes into pRAM vectors.
- pRAM RDver5.1, pRAM GRver5.1, and pRAM RD156-1H9 were amplified to introduce an Neo I site at the 5' end and anXba I site at the 3' end of the gene.
- the primers for pRAM RDver5.1 and pRAM GRver5.1 were:
- the pGL3-control vectors containing each of the luciferase genes was digested with Neo I and Xba I, ligated with other pGL3 vectors that also were digested with Neo I and Xba I, and the ligated products introduced to E. coli.
- the polypeptide encoded by GRver5.1 and RDver5.1 (and RD156-1H9, see below) nucleic acid sequences in pGL3 vectors has an amino acid substitution at position 2 to valine as a result of the Neo I site at the initiation codon in the oligonucleotide.
- the native gene in YG #81- 6G01 was amplified from a Hind III site upstream to a Hpa I site downstream of the coding region and which included flanking sequences found in the GR and RD clones.
- the upstream primer (5'-CAA AAA GCT TGG CAT TCC GGT ACT GTT GGT AAA GCC ACC ATG GTG AAG CGA GAG- 3'; S ⁇ Q ID NO:234) and a downstream primer (5'- CAA TTG TTG TTG TTA ACT TGT TTA TT -3'; S ⁇ Q ID NO:235) were mixed with YG#81-6G01 and amplified using the PCR conditions above.
- the purified PCR product was digested with Neo I and Xba I, ligated with pGL3-control that was also digested with Hind III and Hpa I, and the ligated products introduced into E. coli.
- YG#81- 6G01 into the other pGL3 reporter vectors (basic, promoter and enhancer)
- the pGL3-control vectors containing YG#81-6G01 were digested with Neo I and Xba I, ligated with the other pGL3 vectors that also were digested with Neo I and Xba I, and the ligated products introduced to E. coli.
- the clone of YG#81-6G01 in the pGL3 vectors has a C instead of an A at base 786, which yields a change in the amino acid sequence at residue 262 from Phe to Leu ( Figure 2 shows the sequence of YG#81-6G01 prior to introduction into pGL3 vectors).
- Figure 2 shows the sequence of YG#81-6G01 prior to introduction into pGL3 vectors.
- Partially purified enzymes expressed from the synthetic genes and the parent gene were employed to determine Km for luciferin and ATP (see Table 3).
- the parent gene appeared to contain one or more internal transcriptional regulatory sequences that are activated by the enliancer in the vector, and thus is not suitable as a reporter gene while the synthetic GR and RD genes showed a clean reporter response (transfection efficiency normalized by comparison to native Renilla luciferase gene). See Table 9.
- RDver5.2 was mutated to increase its luminescence intensity, thereby creating RD156-1H9 which carries four additional amino acid changes (M2I, S349T, K488T, E538V) and three silent point mutations (SEQ ID NO:18).
- Site-directed mutagenesis The initial strategy was to use site-directed mutagenesis. There are four amino acid differences between the GR and RD synthetic genes with H348Q providing the greatest contribution to red color. Thus, this substitution may also cause structural changes in the protein that could lead to low light output. Optimization of positions near this area could increase light output. The following positions were selected for mutagenesis:
- Oligonucleotides designed to mutate the above positions were used in a site-directed mutagenesis experiment (WO99/14336) and the resulting mutants were screened for luminescence intensity. There was little variation in light intensity and only about 25%) were luminescent.
- clones were picked and analyzed with the screening robot (PCT/WO9914336). None of the clones had a luminescence intensity (LI) higher than RDver5.2, but four of the clones had slightly lower composite Km for luciferin and ATP (Km).
- Directed evolution Protocols and procedures used for the directed evolution are detailed in see PCT/WO9914336.
- the three clones with the highest LI values were selected for manual analysis to confirm that their luminescence intensity was higher than that of RDver5.2 and to ensure that their spectral properties were not compromised.
- One of the clones was slightly green-shifted, all others maintained the spectral properties of RDver5.2 (Table 5).
- the Km values for luciferin and the luminescence intensity relative to RDver5.2 were determined for all three clones in several independent experiments.
- AU cells samples were processed with CCLR lysis buffer (E1483, Promega Corp., Madison, WI) and diluted 1: 10 into buffer (25 mM HEPES pH 7.8, 5% glycerol, 1 mg/ml BSA, 150 mM NaCl).
- Table 7 summarizes the results (Luin: luminescence values were normalized to optical density; measurements for independent experiments are separated by forward slashes) from expression in bacterial cells.
- RD156-1H9 the clone with the highest luminescence intensity (5 to 10-fold increase) also has an about 2-fold higher Km for luciferin.
- Table 7 shows a comparison between the luminescence intensities of RD156-1H9, GRver5.1 and RDver5.2 normalized to GRver5.1 with and without correction for the spectral sensitivity of the luminometer photomultiplier tube. With correction, the luminescence intensity of clone RD156-1H9 was only about 2-fold lower than that of GRver5.1.
- the luciferin Km for clone RD 156- 1 H9 is approximately 40-fold higher than GRver5.1.
- RD156-1H9 is thermostable at 50°C for at least 2 hours.
- Tables 8 and 9 show a comparison of luciferase expression levels in CHO cells.
- Table 9 shows a comparison of the expression levels in all four pGL3 vectors calculated as a percent of the expression level in pGL3 -control.
- Control vector rlu YG#81-6G01 177 Control vector rlu
- the synthetic Renilla Luciferase genes prepared include 1) an introduced Kozak sequence, 2) codon usage optimized for mammalian (human) expression, 3) a reduction or elimination of unwanted restriction sites, 4) removal of prokaryotic regulatory sites (ribosome binding site and TATA box), 5) removal of splice sites and poly(A) addition sites, and 6) a reduction or elimination of mammalian transcriptional factor binding sequences.
- the process of computer-assisted design of synthetic Renilla luciferase genes by iterative rounds of codon optimization and removal of franscription factor binding sites and other regulatory sites as well as restriction sites can be described in three steps:
- Renilla luciferase gene codon usage was optimized, one amino acid was changed (T— »A) to generate a Kozak consensus sequence, and undesired restriction sites were eliminated thereby creating synthetic gene Rlucverl.
- the Kozak sequence 5' aaccATGGCT 3' (S ⁇ Q ID NO: 293) (the Neo I site is underlined, the coding region is shown in capital letters) was introduced to the synthetic Renilla luciferase gene.
- the introduction of the Kozak sequence changes the second amino acid from Thr to Ala (GCT). Removal of undesired restriction sites
- R ⁇ BAS ⁇ ver. 808 (updated August 1, 1998; Restriction Enzyme Database; www.neb.com rebase) was employed to identify undesirable restriction sites as described in Example 1.
- the following undesired restriction sites (in addition to those described in Example 1) were removed according to the process described in Example 1 : EcoICR I, Ndel, Nsil, Sphl, Spel, Xmal, Pstl.
- the version of Renilla luciferase (Rluc) which inco ⁇ orates all these changes is Rlucverl.
- Rlucver2 was obtained (SEQ ID Nos. 21 and 226).
- Example 1 lower stringency search parameters were specified for the TESS filtered string search to further evaluate the synthetic Renilla gene. With the LLH reduced from 10 to 9 and the minimum element length reduced from 5 to 4, the TESS filtered string search did not show any new hits. When, in addition to the parameter changes listed above, the organism classification was expanded from "mammalia" to "chordata”, the search yielded only four more TF binding sites. When the Min LLH was further reduced to between 8 and 0, the search showed two additional 5-base sites (MAMAG and CTKTK) which combined had four matches in Rlucver2, as well as several 4- base sites. Also as in Example 1, Rlucver2 was checked for hits to entries in the EPD (Eukaryotic Promoter Database, Release 45).
- EPD Eukaryotic Promoter Database, Release 45
- Rluc-final When introduced into pGL3, Rluc-final has a Kozak sequence (CACCATGGCT).
- CACCATGGCT The changes in Rluc-final relative to Rlucver2 were introduced during gene assembly.
- One change was at position 619, a C to an A, which eliminated a eukaryotic promoter sequence and reduced the stability of a hai ⁇ in stracture in the corresponding oligonucleotide employed to assemble the gene.
- Other changes included a change from CGC to AGA at positions 218-220 (resulted in a better oligonucleotide for PCR).
- the resulting synthetic gene fragment was cloned into a pRAM vector using Neo I and Xba I. Two clones having the correct size insert were sequenced. Four to six mutations were found in the synthetic gene from each clone. These mutations were fixed by site-directed mutagenesis (Gene Editor from Promega Co ⁇ ., Madison, WI) and swapping the correct regions between these two genes. The corrected gene was confirmed by sequencing.
- the desired vector backbone fragment was purified using Qiagen' s QIAquick gel extraction kit.
- Neo I-RL-F The native Renilla luciferase gene fragment was cloned into pGL3- control vector using two oligonucleotides, Neo 1-RL-F and Xba I-RL-R, to PCR amplify native Renilla luciferase gene using pRL-CMV as the template.
- the sequence for Neo I-RL-F is 5'-
- DNA template (Plasmid) 1.0 ⁇ l (1.0 ng/ ⁇ l final) 10 X Rec. Buffer 10.0 ⁇ l (Stratagene Co ⁇ .)
- Primer 1 (10 ⁇ M) 2.0 ⁇ l (0.2 ⁇ M final)
- Renilla luciferase gene fragment was introduced into pGL3-control vector.
- 5 ⁇ g of the PCR product of the native Renilla luciferase gene was digested with Neo I and Xba I.
- the desired Renilla luciferase gene fragment was purified and stored at -20°C.
- the gene was cloned into the mammalian expression vector pGL3-control vector under the confrol of SV40 promoter and SV40 early enhancer (Fig. 13 A).
- the native Renilla luciferase gene was also cloned into the pGL-3 control vector so that the expression from synthetic gene and the native gene could be compared.
- the expression vectors were then transfected into four common mammalian cell lines (CHO, NTH3T3, Hela and CV-1; Table 10), and the expression levels compared between the vectors with the synthetic gene versus the native gene.
- the amount of DNA used was at two different levels to ascertain that expression from the synthetic gene is consistently increased at different expression levels. The results show a 70-600 fold increase of expression for the synthetic Renilla luciferase gene in these cells (Table 10).
- luciferase reporter One important advantage of luciferase reporter is its short protem half- life. The enhanced expression could also result from extended protein half-life and, if so, this gives an undesired disadvantage of the new gene. This possibility is ruled out by a cycloheximide chase ("CHX Chase”) experiment ( Figure 14), which demonstrated that there was no increase of protein half-life resulted from the humanized Renilla luciferase gene.
- CHX Chase cycloheximide chase
- Renilla gene (Rluc-final) as well as native Renilla gene were cloned into different vector backbones and under different promoters (Figure 13B). The synthetic gene always exhibited increased expression compared to its wild-type counte ⁇ art (Table 11).
- Vector CHO cells NLH3T3 cells HeLa cells pRL-control native 100 100 100 pRL-control synthetic 100 100 100 pRL-basic native 4.1 5.6 0.2 pRL-basic synthetic 0.4 0.1 0.0 pRL-promoter native 5.9 7.8 0.6 pRL-promoter synthetic 15.0 9.9 1.1 Percent of control vector pRL-enhancer native 42.1 123.9 52.7 pRL-enhancer synthetic 2.6 1.5 5.4
- the synthetic gene should exhibit less basal level transcription in a promoterless vector.
- the synthetic and native Renilla luciferase genes were cloned into the pGL3 -basic vector to compare the basal level of transcription. Because the synthetic gene itself has increased expression efficiency, the activity from the promoterless vector cannot be compared directly to judge the difference in basal transcription, rather, this is taken into consideration by comparing the percentage of activity from the promoterless vector in reference to the control vector (expression from the basic vector divided by the expression in the fully functional expression vector with both promoter and enhancer elements).
- the synthetic Renilla gene (Rluc-final) was used in in vitro systems to compare translation efficiency with the native gene.
- pRL-null native plasmid (having the native Renilla luciferase gene under the control of the T7 promoter) or the same amount of pRL-null-synthetic plasmid (having the synthetic Renilla luciferase gene under the control of the T7 promoter) was added to the TNT reaction mixture and luciferase activity measured every 5 minutes up to 60 minutes.
- Dual Luciferase assay kit (Promega Co ⁇ .) was used to measure Renilla luciferase activity.
- RNA was prepared by an in vitro transcription system, then purified.
- pRL-null (native or synthetic) vectors were linearized with BamH I.
- the DNA was purified by multiple phenol-chloroform extraction followed by ethanol precipitation.
- An in vitro T7 transcription system was employed by prepare RNAs.
- the DNA template was removed by using RNase-free DNase, and RNA was purified by phenol-chloroform extraction followed by multiple isopropanol precipitations.
- RNA was then added to a rabbit reticulocyte lysate ( Figure 15 C, D) or wheat germ lysate ( Figure 15 E, F).
- Figure 15 C, D rabbit reticulocyte lysate
- Figure 15 E, F wheat germ lysate
- the synthetic Renilla luciferase gene RNA produced more luciferase than the native one.
- Reporter gene assays are widely used to study transcriptional regulation events. This is often carried out in co-transfection experiments, in which, along with the primary reporter constract containing the testing promoter, a second control reporter under a constitutive promoter is transfected into cells as an internal control to normalize experimental variations including transfection efficiencies between the samples.
- Control reporter signal, potential promoter cross talk between the control reporter and primary reporter, as well as potential regulation of the control reporter by experimental conditions, are important aspects to consider for selecting a reliable co-reporter vector. As described above, vector constructs were made by cloning synthetic
- Renilla luciferase gene into different vector backbones under different promoters. AU the constructs showed higher expression in the three mammalian cell lines tested (Table 11). Thus, with better expression efficiency, the synthetic Renilla luciferase gives out higher signal when transfected into mammalian cells. Because a higher signal is obtained, less promoter activity is required to achieve the same reporter signal, this reduced risk of promoter interference.
- CHO cells were transfected with 50 ng pGL3-control (firefly luc+) plus one of 5 different amounts of native pRL-TK plasmid (50, 100, 500, 1000, or 2000 ng) or synthetic pRL-TK (5, 10, 50, 100, or 200 ng).
- TPA induces expression of co-reporter vectors harboring the wild-type gene when transfecting MCF-7 cells.
- 500 ng pRL-TK (native), 5 ⁇ g native and synthetic pRG-B, 2.5 ⁇ g native and synthetic pRG-TK were transfected per well of MCF-7 cells.
- 100 ng/well pGL3-control (firefly luc+) was co-transfected with all RL plasmids.
- Carrier DNA, pUC19 was used to bring the total DNA transfected to 5.1 ⁇ g/well.
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Saccharide Compounds (AREA)
Abstract
Description
Claims
Priority Applications (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE60140898T DE60140898D1 (en) | 2000-08-24 | 2001-08-24 | SYNTHETIC NUCLEIC ACID MOLECULE COMPOSITIONS AND METHOD FOR THE PRODUCTION THEREOF |
EP01964425A EP1341808B1 (en) | 2000-08-24 | 2001-08-24 | Synthetic nucleic acid molecule compositions and methods of preparation |
AU8527801A AU8527801A (en) | 2000-08-24 | 2001-08-24 | Synthetic nucleic acid molecule compositions and methods of preparation |
DK01964425.1T DK1341808T3 (en) | 2000-08-24 | 2001-08-24 | Synthetic nucleic acid molecule compositions and methods of preparation |
AT01964425T ATE452905T1 (en) | 2000-08-24 | 2001-08-24 | SYNTHETIC NUCLEIC ACID MOLECULE COMPOSITIONS AND METHODS FOR THEIR PRODUCTION |
JP2002521985A JP2004520807A (en) | 2000-08-24 | 2001-08-24 | Synthetic nucleic acid molecule composition and preparation method |
CA002420328A CA2420328A1 (en) | 2000-08-24 | 2001-08-24 | Synthetic nucleic acid molecule compositions and methods of preparation |
AU2001285278A AU2001285278B2 (en) | 2000-08-24 | 2001-08-24 | Synthetic nucleic acid molecule compositions and methods of preparation |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/645,706 US7879540B1 (en) | 2000-08-24 | 2000-08-24 | Synthetic nucleic acid molecule compositions and methods of preparation |
US09/645,706 | 2000-08-24 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2002016944A2 true WO2002016944A2 (en) | 2002-02-28 |
WO2002016944A3 WO2002016944A3 (en) | 2003-06-26 |
Family
ID=24590123
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2001/026566 WO2002016944A2 (en) | 2000-08-24 | 2001-08-24 | Synthetic nucleic acid molecule compositions and methods of preparation |
Country Status (10)
Country | Link |
---|---|
US (3) | US7879540B1 (en) |
EP (1) | EP1341808B1 (en) |
JP (3) | JP2004520807A (en) |
AT (1) | ATE452905T1 (en) |
AU (2) | AU2001285278B2 (en) |
CA (1) | CA2420328A1 (en) |
DE (1) | DE60140898D1 (en) |
DK (1) | DK1341808T3 (en) |
ES (1) | ES2335268T3 (en) |
WO (1) | WO2002016944A2 (en) |
Cited By (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2005021752A1 (en) * | 2003-08-29 | 2005-03-10 | Takara Bio Inc. | Method of searching for functional nucleotide molecule |
WO2005067410A2 (en) * | 2002-12-09 | 2005-07-28 | Promega Corporation | Synthetic nucleic acids from aquatic species |
WO2006046132A3 (en) * | 2004-09-17 | 2006-08-17 | Pasteur Institut | Method for modulating the evolution of a polypeptide encoded by a nucleic acid sequence |
WO2007120522A2 (en) | 2006-04-03 | 2007-10-25 | Promega Corporation | Permuted and nonpermuted luciferase biosensors |
US7291711B2 (en) | 2002-12-09 | 2007-11-06 | University Of Miami | Fluorescent proteins from aquatic species |
JP2008513021A (en) * | 2004-09-17 | 2008-05-01 | プロメガ コーポレイション | Synthetic nucleic acid molecules and methods of preparation |
EP1956086A1 (en) * | 2005-11-16 | 2008-08-13 | Toyo Boseki Kabushiki Kasisha | Luciferase gene optimized for use in imaging of intracellular luminescence |
WO2009087967A1 (en) * | 2008-01-07 | 2009-07-16 | Probex Inc. | Method for detecting a protein-protein interaction |
WO2009142735A2 (en) | 2008-05-19 | 2009-11-26 | Promega Corporation | LUCIFERASE BIOSENSORS FOR cAMP |
US7879540B1 (en) | 2000-08-24 | 2011-02-01 | Promega Corporation | Synthetic nucleic acid molecule compositions and methods of preparation |
EP2298902A1 (en) | 2003-10-10 | 2011-03-23 | Promega Corporation | Luciferase biosensor |
US7939649B2 (en) * | 2005-09-06 | 2011-05-10 | Stanford University | Polynucleotide encoding luciferase |
WO2011143339A1 (en) | 2010-05-11 | 2011-11-17 | Promega Corporation | Mutant protease biosensors with enhanced detection characteristics |
WO2011155973A2 (en) | 2010-06-10 | 2011-12-15 | Switchgear Genomics | Modified renilla luciferase nucleic acids and methods of use |
EP2492342A1 (en) | 2006-10-30 | 2012-08-29 | Promega Corporation | Mutant hydrolase proteins with enhanced kinetics and functional expression |
US8314225B2 (en) | 2007-06-29 | 2012-11-20 | Hoffman-La Roche Inc. | Heavy chain mutant leading to improved immunoglobulin production |
WO2013071237A1 (en) | 2011-11-11 | 2013-05-16 | Promega Corporation | Mutant protease biosensors with enhanced detection characteristics |
WO2016040788A1 (en) * | 2014-09-11 | 2016-03-17 | Promega Corporation | Luciferase sequences utilizing infrared-emitting substrates to produce enhanced luminescence |
US9290794B2 (en) | 2010-05-11 | 2016-03-22 | Promega Corporation | Mutant protease biosensors with enhanced detection characteristics |
US9840730B2 (en) | 2010-11-02 | 2017-12-12 | Promega Corporation | Oplophorus-derived luciferases, novel coelenterazine substrates, and methods of use |
US10815277B2 (en) | 2006-07-13 | 2020-10-27 | Institute For Advanced Study | Viral inhibitory nucleotide sequences and vaccines |
WO2020260431A1 (en) | 2019-06-28 | 2020-12-30 | F. Hoffmann-La Roche Ag | Method for the production of an antibody |
WO2023215505A1 (en) | 2022-05-04 | 2023-11-09 | Promega Corporation | Modified dehalogenase with extended surface loop regions |
WO2023215452A2 (en) | 2022-05-04 | 2023-11-09 | Promega Corporation | Split modified dehalogenase variants |
WO2023215432A1 (en) | 2022-05-04 | 2023-11-09 | Promega Corporation | Circularly permuted dehalogenase variants |
Families Citing this family (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6602677B1 (en) * | 1997-09-19 | 2003-08-05 | Promega Corporation | Thermostable luciferases and methods of production |
US7572629B2 (en) * | 2003-05-06 | 2009-08-11 | National Institute Of Advanced Industrial Science And Technology | Multiple gene transcription activity assay system |
JP5409354B2 (en) * | 2006-05-25 | 2014-02-05 | インスティチュート フォー アドバンスド スタディ | Methods for identifying sequence motifs and their applications |
JP4849698B2 (en) | 2009-05-29 | 2012-01-11 | 国立大学法人 東京大学 | Sensitive detection method for protein-protein interaction |
CN103180324A (en) | 2010-11-02 | 2013-06-26 | 普罗美加公司 | Coelenterazine derivatives and methods of using same |
PL2721153T3 (en) | 2011-06-16 | 2020-03-31 | The Regents Of The University Of California | Synthetic gene clusters |
WO2013163628A2 (en) | 2012-04-27 | 2013-10-31 | Duke University | Genetic correction of mutated genes |
WO2014071182A1 (en) | 2012-11-01 | 2014-05-08 | Massachusetts Institute Of Technology | Directed evolution of synthetic gene cluster |
EP2969435B1 (en) | 2013-03-15 | 2021-11-03 | Promega Corporation | Substrates for covalent tethering of proteins to functional groups or solid surfaces |
WO2015116806A1 (en) | 2014-01-29 | 2015-08-06 | Promega Corporation | Pro-substrates for live cell applications |
WO2015116867A1 (en) | 2014-01-29 | 2015-08-06 | Promega Corporation | Quinone-masked probes as labeling reagents for cell uptake measurements |
EP3322679A4 (en) | 2015-07-13 | 2019-07-10 | Pivot Bio, Inc. | Methods and compositions for improving plant traits |
EP3359664A4 (en) | 2015-10-05 | 2019-03-20 | Massachusetts Institute Of Technology | Nitrogen fixation using refactored nif clusters |
WO2017066497A2 (en) | 2015-10-13 | 2017-04-20 | Duke University | Genome engineering with type i crispr systems in eukaryotic cells |
EP3510035B1 (en) | 2016-09-09 | 2021-04-07 | Promega Corporation | Dual protected pro-coelenterazine substrates |
CN110799474B (en) | 2017-01-12 | 2022-07-26 | 皮沃特生物公司 | Methods and compositions for improving plant traits |
CN111587287A (en) | 2017-10-25 | 2020-08-25 | 皮沃特生物股份有限公司 | Methods and compositions for improved nitrogen-fixing engineered microorganisms |
US11579149B2 (en) | 2017-11-01 | 2023-02-14 | Queen's University At Kingston | Hippo pathway bioluminescent biosensor |
CN112739668A (en) | 2018-06-27 | 2021-04-30 | 皮沃特生物股份有限公司 | Agricultural compositions comprising reconstituted nitrogen-fixing microorganisms |
JP2022518489A (en) | 2019-01-25 | 2022-03-15 | プレジデント・アンド・フェロウズ・オブ・ハーバード・カレッジ | Compositions and Methods for Synthesizing Nucleic Acids |
WO2021119402A1 (en) | 2019-12-12 | 2021-06-17 | President And Fellows Of Harvard College | Compositions and methods for light-directed biomolecular barcoding |
US20230203567A1 (en) | 2020-04-22 | 2023-06-29 | President And Fellows Of Harvard College | Isothermal methods, compositions, kits, and systems for detecting nucleic acids |
WO2023250511A2 (en) | 2022-06-24 | 2023-12-28 | Tune Therapeutics, Inc. | Compositions, systems, and methods for reducing low-density lipoprotein through targeted gene repression |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1995018853A1 (en) * | 1994-01-03 | 1995-07-13 | Promega Corporation | Mutant luciferases |
US5670356A (en) * | 1994-12-12 | 1997-09-23 | Promega Corporation | Modified luciferase |
WO1999014336A2 (en) * | 1997-09-19 | 1999-03-25 | Promega Corporation | Thermostable luciferases and methods of production |
Family Cites Families (69)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
SE428379B (en) | 1978-05-31 | 1983-06-27 | Lkb Produkter Ab | DETERMINATION OF ATOL AND REAGENTS OF BIOLUMINISM |
US4412001A (en) | 1981-01-30 | 1983-10-25 | Board Of Trustees Of The University Of Illinois | Isolation of bacterial luciferase |
US4503142A (en) | 1982-06-25 | 1985-03-05 | Litton Bionetics, Inc. | Open reading frame vectors |
US4581335A (en) | 1982-12-01 | 1986-04-08 | Texas A&M University System | Process for producing a cloned luciferase-synthesizing microorganism |
US5096825A (en) | 1983-01-12 | 1992-03-17 | Chiron Corporation | Gene for human epidermal growth factor and synthesis and expression thereof |
US5380831A (en) | 1986-04-04 | 1995-01-10 | Mycogen Plant Science, Inc. | Synthetic insecticidal crystal protein gene |
US5168062A (en) | 1985-01-30 | 1992-12-01 | University Of Iowa Research Foundation | Transfer vectors and microorganisms containing human cytomegalovirus immediate-early promoter-regulatory DNA sequence |
US5583024A (en) | 1985-12-02 | 1996-12-10 | The Regents Of The University Of California | Recombinant expression of Coleoptera luciferase |
US5221623A (en) | 1986-07-22 | 1993-06-22 | Boyce Thompson Institute For Plant Research, Inc. | Use of bacterial luciferase structural genes for cloning and monitoring gene expression in microorganisms and for tagging and identification of genetically engineered organisms |
US4968613A (en) | 1987-07-29 | 1990-11-06 | Kikkoman Corporation | Luciferase gene and novel recombinant DNA as well as a method of producing luciferase |
US5182202A (en) | 1987-11-30 | 1993-01-26 | Kikkoman Corporation | Purified luciferase from luciola cruciata |
JPH088864B2 (en) | 1988-04-12 | 1996-01-31 | キッコーマン株式会社 | Luciferase |
EP0353464B1 (en) | 1988-07-01 | 1993-10-20 | Kikkoman Corporation | Luciferase gene and novel recombinant DNA as well as a method for production of luciferase |
EP0387355B1 (en) | 1988-08-09 | 1996-11-06 | Toray Industries, Inc. | Process for preparing luciferase by recombinant expression of a luciferase-coding gene |
US5604123A (en) | 1988-08-09 | 1997-02-18 | Toray Industries, Inc. | Luciferase, gene encoding the same and production process of the same |
JPH0771485B2 (en) | 1988-09-01 | 1995-08-02 | キッコーマン株式会社 | Luciferase production method |
US5196524A (en) | 1989-01-06 | 1993-03-23 | Eli Lilly And Company | Fusion reporter gene for bacterial luciferase |
CA2009925C (en) | 1989-02-14 | 2002-10-01 | Koichi Kondo | Method for enhancement of chemiluminescence |
FI901681A0 (en) | 1989-04-10 | 1990-04-03 | Ela Technologies Inc | FOERFARANDE FOER OEKNING AV KAENSLIGHETEN HOS LUMINESCENSANALYSER. |
JPH03167288A (en) | 1989-11-27 | 1991-07-19 | Chisso Corp | Method for sensitized luminescence of aequorin by surface active agent |
US5292658A (en) | 1989-12-29 | 1994-03-08 | University Of Georgia Research Foundation, Inc. Boyd Graduate Studies Research Center | Cloning and expressions of Renilla luciferase |
US5219737A (en) | 1990-03-27 | 1993-06-15 | Kikkoman Corporation | Mutant luciferase of a firefly, mutant luciferase genes, recombinant dnas containing the genes and a method of producing mutant luciferase |
EP0528819A1 (en) | 1990-04-18 | 1993-03-03 | Plant Genetic Systems, N.V. | Modified bacillus thuringiensis insecticidal-crystal protein genes and their expression in plant cells |
US5283179A (en) | 1990-09-10 | 1994-02-01 | Promega Corporation | Luciferase assay method |
EP0575319B1 (en) | 1991-03-11 | 1999-11-10 | The University Of Georgia Research Foundation, Inc. | Cloning and expression of renilla luciferase |
US5229285A (en) | 1991-06-27 | 1993-07-20 | Kikkoman Corporation | Thermostable luciferase of firefly, thermostable luciferase gene of firefly, novel recombinant dna, and process for the preparation of thermostable luciferase of firefly |
CA2122261A1 (en) * | 1991-10-30 | 1993-05-13 | Marc Cornelissen | Modified genes and their expression in plant cells |
US5629168A (en) | 1992-02-10 | 1997-05-13 | British Technology Group Limited | Chemiluminescent enhancers |
AT401526B (en) | 1993-02-10 | 1996-09-25 | Scheirer Winfried | REAGENT SOLUTION TO STABILIZE LUMINESCENCE IN LUCIFERASE MEASUREMENT |
CA2104815A1 (en) | 1993-02-26 | 1994-08-27 | Naotaka Kuroda | Method for measuring adenyl group-containing substances |
US5610335A (en) | 1993-05-26 | 1997-03-11 | Cornell Research Foundation | Microelectromechanical lateral accelerometer |
US6118047A (en) | 1993-08-25 | 2000-09-12 | Dekalb Genetic Corporation | Anthranilate synthase gene and method of use thereof for conferring tryptophan overproduction |
JPH0767696A (en) | 1993-09-06 | 1995-03-14 | Tosoh Corp | Method for reducing back ground luminescence |
US5605793A (en) | 1994-02-17 | 1997-02-25 | Affymax Technologies N.V. | Methods for in vitro recombination |
GB9501170D0 (en) | 1994-03-23 | 1995-03-08 | Secr Defence | Luciferases |
US5795737A (en) | 1994-09-19 | 1998-08-18 | The General Hospital Corporation | High level expression of proteins |
US5786464C1 (en) | 1994-09-19 | 2012-04-24 | Gen Hospital Corp | Overexpression of mammalian and viral proteins |
DE69633171T2 (en) | 1995-01-20 | 2005-08-18 | The Secretary Of State For Defence, Salisbury | MUTATED LUCIFERASES |
US5744320A (en) | 1995-06-07 | 1998-04-28 | Promega Corporation | Quenching reagents and assays for enzyme-mediated luminescence |
CA2229043C (en) | 1995-08-18 | 2016-06-07 | Morphosys Gesellschaft Fur Proteinoptimierung Mbh | Protein/(poly)peptide libraries |
US5874304A (en) | 1996-01-18 | 1999-02-23 | University Of Florida Research Foundation, Inc. | Humanized green fluorescent protein genes and methods |
US6020192A (en) | 1996-01-18 | 2000-02-01 | University Of Florida | Humanized green fluorescent protein genes and methods |
JPH09294600A (en) | 1996-04-26 | 1997-11-18 | Kikkoman Corp | Determination of activity of a plurality of promoters |
EP1009763A4 (en) * | 1996-06-11 | 2002-08-07 | Merck & Co Inc | Synthetic hepatitis c genes |
JPH1087621A (en) | 1996-09-13 | 1998-04-07 | Sankyo Co Ltd | Enhancer for lucigenin chemiluminescence |
US6114148C1 (en) | 1996-09-20 | 2012-05-01 | Gen Hospital Corp | High level expression of proteins |
CA2266423A1 (en) | 1996-09-27 | 1998-04-02 | Maxygen, Inc. | Methods for optimization of gene therapy by recursive sequence shuffling and selection |
US5976796A (en) | 1996-10-04 | 1999-11-02 | Loma Linda University | Construction and expression of renilla luciferase and green fluorescent protein fusion genes |
JP3167288B2 (en) | 1997-03-17 | 2001-05-21 | 株式会社バンダイ | Portable electronic equipment |
GB9707486D0 (en) | 1997-04-11 | 1997-05-28 | Secr Defence | Enzyme assays |
US6074859A (en) | 1997-07-08 | 2000-06-13 | Kikkoman Corporation | Mutant-type bioluminescent protein, and process for producing the mutant-type bioluminescent protein |
WO1999004024A2 (en) * | 1997-07-15 | 1999-01-28 | Dow Agrosciences Llc | Nucleotide sequences of genes encoding sink proteins and uses thereof for improving the nutritional quality of feeds |
US6602677B1 (en) | 1997-09-19 | 2003-08-05 | Promega Corporation | Thermostable luciferases and methods of production |
US6306600B1 (en) | 1998-04-17 | 2001-10-23 | Clontech Laboratories, Inc. | Rapidly degrading GFP-fusion proteins and methods of use |
US6130313A (en) | 1997-10-02 | 2000-10-10 | Clontech Laboratories, Inc. | Rapidly degrading GFP-fusion proteins |
US7090976B2 (en) | 1999-11-10 | 2006-08-15 | Rigel Pharmaceuticals, Inc. | Methods and compositions comprising Renilla GFP |
US6700038B1 (en) | 1999-03-31 | 2004-03-02 | Wisconsin Alumni Research Foundation | Plant expression vectors based on the flock house virus genome |
MXPA02003232A (en) | 1999-09-30 | 2003-09-22 | Alexion Pharma Inc | Compositions and methods for altering gene expression. |
AU783767B2 (en) | 1999-10-14 | 2005-12-01 | Takara Bio Usa, Inc. | Anthozoa derived chromophores/fluorophores and methods for using the same |
FR2812883B1 (en) | 2000-08-11 | 2002-10-18 | Aventis Cropscience Sa | USE OF HPPD INHIBITORS AS SELECTING AGENTS IN PLANT TRANSFORMATION |
US7879540B1 (en) | 2000-08-24 | 2011-02-01 | Promega Corporation | Synthetic nucleic acid molecule compositions and methods of preparation |
US20030157643A1 (en) | 2000-08-24 | 2003-08-21 | Almond Brian D | Synthetic nucleic acids from aquatic species |
WO2002094992A2 (en) | 2001-05-18 | 2002-11-28 | Rigel Pharmaceuticals, Incorporated | Directed evolution of protein in mammalian cells |
JP2005509420A (en) | 2001-11-13 | 2005-04-14 | クローンテック ラボラトリーズ インク. | Novel chromophores / fluorescent chromophores and their use |
CA2499221A1 (en) | 2002-09-16 | 2004-03-25 | Promega Corporation | Rapidly degraded reporter fusion proteins |
WO2004042010A2 (en) | 2002-10-30 | 2004-05-21 | University Of Tennessee Research Foundation | Modified luciferase nucleic acids and methods of use |
JP4311003B2 (en) | 2002-12-02 | 2009-08-12 | アイシン精機株式会社 | Prokaryotic gene expression analysis method |
US6878531B1 (en) | 2003-11-10 | 2005-04-12 | Medical College Of Georgia Research Institute | Method for multiple site-directed mutagenesis |
US7728118B2 (en) | 2004-09-17 | 2010-06-01 | Promega Corporation | Synthetic nucleic acid molecule compositions and methods of preparation |
-
2000
- 2000-08-24 US US09/645,706 patent/US7879540B1/en not_active Expired - Fee Related
-
2001
- 2001-08-24 ES ES01964425T patent/ES2335268T3/en not_active Expired - Lifetime
- 2001-08-24 AU AU2001285278A patent/AU2001285278B2/en not_active Ceased
- 2001-08-24 CA CA002420328A patent/CA2420328A1/en not_active Abandoned
- 2001-08-24 WO PCT/US2001/026566 patent/WO2002016944A2/en active Application Filing
- 2001-08-24 JP JP2002521985A patent/JP2004520807A/en active Pending
- 2001-08-24 AU AU8527801A patent/AU8527801A/en active Pending
- 2001-08-24 AT AT01964425T patent/ATE452905T1/en not_active IP Right Cessation
- 2001-08-24 DK DK01964425.1T patent/DK1341808T3/en active
- 2001-08-24 EP EP01964425A patent/EP1341808B1/en not_active Expired - Lifetime
- 2001-08-24 DE DE60140898T patent/DE60140898D1/en not_active Expired - Lifetime
-
2005
- 2005-12-22 US US11/316,042 patent/US20060127988A1/en not_active Abandoned
-
2006
- 2006-10-23 JP JP2006288147A patent/JP2007006910A/en active Pending
-
2007
- 2007-04-12 US US11/786,785 patent/US7906282B2/en not_active Expired - Fee Related
-
2010
- 2010-01-18 JP JP2010008451A patent/JP2010081942A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1995018853A1 (en) * | 1994-01-03 | 1995-07-13 | Promega Corporation | Mutant luciferases |
US5670356A (en) * | 1994-12-12 | 1997-09-23 | Promega Corporation | Modified luciferase |
WO1999014336A2 (en) * | 1997-09-19 | 1999-03-25 | Promega Corporation | Thermostable luciferases and methods of production |
Non-Patent Citations (5)
Title |
---|
DATABASE EBI [Online] 8 August 2000 (2000-08-08) FERBITZ L. ET AL.: "A synthetic gene coding for Renilla luciferase is a versatile expression marker in green algae" Database accession no. AY004213 XP002230923 * |
KIM C.H. ET AL.: "Codon optimization for high-level expression of human erythropoietin (EPO) in mammalian cells" GENE, vol. 199, no. 1-2, 15 October 1997 (1997-10-15), pages 293-301, XP004126394 ISSN: 0378-1119 * |
PAN W. ET AL.: "Vaccine candidate MSP-1 from Plasmodium falciparum: a redesigned 4917 bp polynucleotide enables synthesis and isolation of full-length protein from Escherichia coli and mammalian cells" NUCLEIC ACIDS RESEARCH, vol. 27, no. 4, 15 February 1999 (1999-02-15), pages 1094-1103, XP000953100 ISSN: 0305-1048 * |
See also references of EP1341808A2 * |
WOOD K.V.: "The chemical mechanism and evolutionary development of beetle bioluminescence" PHOTOCHEMISTRY AND PHOTOBIOLOGY, vol. 62, no. 4, 1995, pages 662-673, XP000983576 ISSN: 0031-8655 * |
Cited By (61)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7906282B2 (en) | 2000-08-24 | 2011-03-15 | Promega Corporation | Synthetic nucleic acid molecule compositions and methods of preparation |
US7879540B1 (en) | 2000-08-24 | 2011-02-01 | Promega Corporation | Synthetic nucleic acid molecule compositions and methods of preparation |
US7413874B2 (en) | 2002-12-09 | 2008-08-19 | University Of Miami | Nucleic acid encoding fluorescent proteins from aquatic species |
WO2005067410A2 (en) * | 2002-12-09 | 2005-07-28 | Promega Corporation | Synthetic nucleic acids from aquatic species |
WO2005067410A3 (en) * | 2002-12-09 | 2005-12-08 | Promega Corp | Synthetic nucleic acids from aquatic species |
JP2006512098A (en) * | 2002-12-09 | 2006-04-13 | プロメガ コーポレイション | Synthetic nucleic acids of aqueous species |
AU2003297293B2 (en) * | 2002-12-09 | 2007-09-13 | Promega Corporation | Synthetic nucleic acids from aquatic species |
US7291711B2 (en) | 2002-12-09 | 2007-11-06 | University Of Miami | Fluorescent proteins from aquatic species |
WO2005021752A1 (en) * | 2003-08-29 | 2005-03-10 | Takara Bio Inc. | Method of searching for functional nucleotide molecule |
US9290745B2 (en) | 2003-10-10 | 2016-03-22 | Promega Corporation | Luciferase biosensor |
EP2308978A1 (en) | 2003-10-10 | 2011-04-13 | Promega Corporation | Luciferase biosensor |
EP2298902A1 (en) | 2003-10-10 | 2011-03-23 | Promega Corporation | Luciferase biosensor |
US8673558B2 (en) | 2003-10-10 | 2014-03-18 | Promega Corporation | Luciferase biosensor |
JP2008513021A (en) * | 2004-09-17 | 2008-05-01 | プロメガ コーポレイション | Synthetic nucleic acid molecules and methods of preparation |
WO2006046132A3 (en) * | 2004-09-17 | 2006-08-17 | Pasteur Institut | Method for modulating the evolution of a polypeptide encoded by a nucleic acid sequence |
US8008006B2 (en) | 2004-09-17 | 2011-08-30 | Promega Corporation | Synthetic nucleic acid molecule compositions and methods of preparation |
US7728118B2 (en) | 2004-09-17 | 2010-06-01 | Promega Corporation | Synthetic nucleic acid molecule compositions and methods of preparation |
US7939649B2 (en) * | 2005-09-06 | 2011-05-10 | Stanford University | Polynucleotide encoding luciferase |
US8383797B2 (en) | 2005-11-16 | 2013-02-26 | Toyo Boseki Kabushiki Kaisha | Luciferase gene optimized for use in imaging of intracellular luminescence |
EP1956086A4 (en) * | 2005-11-16 | 2009-04-01 | Toyo Boseki | Luciferase gene optimized for use in imaging of intracellular luminescence |
EP1956086A1 (en) * | 2005-11-16 | 2008-08-13 | Toyo Boseki Kabushiki Kasisha | Luciferase gene optimized for use in imaging of intracellular luminescence |
WO2007120522A2 (en) | 2006-04-03 | 2007-10-25 | Promega Corporation | Permuted and nonpermuted luciferase biosensors |
EP2327768A2 (en) | 2006-04-03 | 2011-06-01 | Promega Corporation | Permuted and nonpermuted luciferase biosensors |
US10077433B2 (en) | 2006-04-03 | 2018-09-18 | Promega Corporation | Permuted and nonpermuted luciferase biosensors |
US9359635B2 (en) | 2006-04-03 | 2016-06-07 | Promega Corporation | Permuted and nonpermuted luciferase biosensors |
US10815277B2 (en) | 2006-07-13 | 2020-10-27 | Institute For Advanced Study | Viral inhibitory nucleotide sequences and vaccines |
EP2502990A2 (en) | 2006-10-30 | 2012-09-26 | Promega Corporation | Mutant hydrolase proteins with enhanced kinetics and functional expression |
EP2492342A1 (en) | 2006-10-30 | 2012-08-29 | Promega Corporation | Mutant hydrolase proteins with enhanced kinetics and functional expression |
US8314225B2 (en) | 2007-06-29 | 2012-11-20 | Hoffman-La Roche Inc. | Heavy chain mutant leading to improved immunoglobulin production |
WO2009087967A1 (en) * | 2008-01-07 | 2009-07-16 | Probex Inc. | Method for detecting a protein-protein interaction |
US9045730B2 (en) | 2008-05-19 | 2015-06-02 | Promega Corporation | Luciferase biosensors for cAMP |
US9879306B2 (en) | 2008-05-19 | 2018-01-30 | Promega Corporation | Luciferase biosensors for cAMP |
WO2009142735A2 (en) | 2008-05-19 | 2009-11-26 | Promega Corporation | LUCIFERASE BIOSENSORS FOR cAMP |
US9757478B2 (en) | 2010-05-11 | 2017-09-12 | Promega Corporation | Mutant protease biosensors with enhanced detection characteristics |
US9339561B2 (en) | 2010-05-11 | 2016-05-17 | Promega Corporation | Mutant protease biosensors with enhanced detection characteristics |
US8735559B2 (en) | 2010-05-11 | 2014-05-27 | Promega Corporation | Mutant protease biosensors with enhanced detection characteristics |
US9248201B2 (en) | 2010-05-11 | 2016-02-02 | Promega Corporation | Mutant protease biosensors with enhanced detection characteristics |
EP2990479A1 (en) | 2010-05-11 | 2016-03-02 | Promega Corporation | Mutant protease biosensors with enhanced detection characteristics |
WO2011143339A1 (en) | 2010-05-11 | 2011-11-17 | Promega Corporation | Mutant protease biosensors with enhanced detection characteristics |
EP3508570A1 (en) | 2010-05-11 | 2019-07-10 | Promega Corporation | Mutant protease biosensors with enhanced detection characteristics |
US9290794B2 (en) | 2010-05-11 | 2016-03-22 | Promega Corporation | Mutant protease biosensors with enhanced detection characteristics |
US20120035077A1 (en) * | 2010-06-10 | 2012-02-09 | Switchgear Genomics | Modified renilla luciferase nucleic acids and methods of use |
EP2580329A4 (en) * | 2010-06-10 | 2013-11-27 | Switchgear Genomics | Modified renilla luciferase nucleic acids and methods of use |
EP2580329A2 (en) * | 2010-06-10 | 2013-04-17 | Switchgear Genomics | Modified renilla luciferase nucleic acids and methods of use |
US9006405B2 (en) * | 2010-06-10 | 2015-04-14 | SwitchGear Genomics, Inc. | Modified renilla luciferase nucleic acids and methods of use |
WO2011155973A2 (en) | 2010-06-10 | 2011-12-15 | Switchgear Genomics | Modified renilla luciferase nucleic acids and methods of use |
US11661623B2 (en) | 2010-11-02 | 2023-05-30 | Promega Corporation | Oplophorus-derived luciferases, novel coelenterazine substrates, and methods of use |
US9840730B2 (en) | 2010-11-02 | 2017-12-12 | Promega Corporation | Oplophorus-derived luciferases, novel coelenterazine substrates, and methods of use |
US9938564B2 (en) | 2010-11-02 | 2018-04-10 | Promega Corporation | Substituted imidazo[1,2-a]pyrazines for use in bioluminogenic methods |
US9951373B2 (en) | 2010-11-02 | 2018-04-24 | Promega Corporation | Oplophorus-derived luciferases, novel coelenterazine substrates, and methods of use |
US10774364B2 (en) | 2010-11-02 | 2020-09-15 | Promega Corporation | Oplophorus-derived luciferases, novel coelenterazine substrates, and methods of use |
EP3467119A1 (en) | 2011-11-11 | 2019-04-10 | Promega Corporation | Mutant protease biosensors with enhanced detection characteristics |
WO2013071237A1 (en) | 2011-11-11 | 2013-05-16 | Promega Corporation | Mutant protease biosensors with enhanced detection characteristics |
US10550420B2 (en) | 2014-09-11 | 2020-02-04 | Promega Corporation | Luciferase sequences utilizing infrared-emitting substrates to produce enhanced luminescence |
WO2016040788A1 (en) * | 2014-09-11 | 2016-03-17 | Promega Corporation | Luciferase sequences utilizing infrared-emitting substrates to produce enhanced luminescence |
US11293047B2 (en) | 2014-09-11 | 2022-04-05 | Promega Corporation | Luciferase sequences utilizing infrared-emitting substrates to produce enhanced luminescence |
US9732373B2 (en) | 2014-09-11 | 2017-08-15 | Promega Corporation | Luciferase sequences utilizing infrared-emitting substrates to produce enhanced luminescence |
WO2020260431A1 (en) | 2019-06-28 | 2020-12-30 | F. Hoffmann-La Roche Ag | Method for the production of an antibody |
WO2023215505A1 (en) | 2022-05-04 | 2023-11-09 | Promega Corporation | Modified dehalogenase with extended surface loop regions |
WO2023215452A2 (en) | 2022-05-04 | 2023-11-09 | Promega Corporation | Split modified dehalogenase variants |
WO2023215432A1 (en) | 2022-05-04 | 2023-11-09 | Promega Corporation | Circularly permuted dehalogenase variants |
Also Published As
Publication number | Publication date |
---|---|
DK1341808T3 (en) | 2010-04-12 |
AU8527801A (en) | 2002-03-04 |
AU2001285278B2 (en) | 2008-05-08 |
DE60140898D1 (en) | 2010-02-04 |
JP2010081942A (en) | 2010-04-15 |
ES2335268T3 (en) | 2010-03-24 |
US7879540B1 (en) | 2011-02-01 |
EP1341808A2 (en) | 2003-09-10 |
JP2004520807A (en) | 2004-07-15 |
US20060127988A1 (en) | 2006-06-15 |
EP1341808B1 (en) | 2009-12-23 |
CA2420328A1 (en) | 2002-02-28 |
US20080090291A1 (en) | 2008-04-17 |
ATE452905T1 (en) | 2010-01-15 |
US7906282B2 (en) | 2011-03-15 |
JP2007006910A (en) | 2007-01-18 |
WO2002016944A3 (en) | 2003-06-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7879540B1 (en) | Synthetic nucleic acid molecule compositions and methods of preparation | |
AU2001285278A1 (en) | Synthetic nucleic acid molecule compositions and methods of preparation | |
WO2006034061A2 (en) | Synthetic nucleic acid molecule and methods of preparation | |
US11667950B2 (en) | Synthetic Oplophorus luciferases with enhanced light output | |
US20090191622A1 (en) | Synthetic nucleic acids from aquatic species | |
US5670356A (en) | Modified luciferase | |
KR20030092013A (en) | Novel expression vectors | |
US20140087402A1 (en) | Synthetic luciferase gene and protein | |
US8206961B2 (en) | Modified Luciola cruciata luciferase protein | |
JP4528623B2 (en) | Rapidly degradable reporter fusion protein | |
JP2006508678A (en) | Fluorescent proteins from aqueous species | |
JP7356749B2 (en) | modified luciferase | |
US20120028257A1 (en) | Modified luciola cruciata luciferase gene and protein |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PH PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2002521985 Country of ref document: JP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2420328 Country of ref document: CA |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2001285278 Country of ref document: AU |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2001964425 Country of ref document: EP |
|
REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
WWP | Wipo information: published in national office |
Ref document number: 2001964425 Country of ref document: EP |