US20020160377A1 - Methods for producing nucleic acids lacking 3'-untranslated regions and optimizing cellular RNA -protein fusion formation - Google Patents
Methods for producing nucleic acids lacking 3'-untranslated regions and optimizing cellular RNA -protein fusion formation Download PDFInfo
- Publication number
- US20020160377A1 US20020160377A1 US09/910,518 US91051801A US2002160377A1 US 20020160377 A1 US20020160377 A1 US 20020160377A1 US 91051801 A US91051801 A US 91051801A US 2002160377 A1 US2002160377 A1 US 2002160377A1
- Authority
- US
- United States
- Prior art keywords
- rna
- library
- dna
- molecules
- products
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 230000004927 fusion Effects 0.000 title claims abstract description 64
- 108091036066 Three prime untranslated region Proteins 0.000 title claims abstract description 45
- 230000015572 biosynthetic process Effects 0.000 title claims abstract description 21
- 150000007523 nucleic acids Chemical class 0.000 title claims description 29
- 102000039446 nucleic acids Human genes 0.000 title claims description 28
- 108020004707 nucleic acids Proteins 0.000 title claims description 28
- 108091092328 cellular RNA Proteins 0.000 title claims description 10
- 238000000034 method Methods 0.000 title abstract description 42
- 108020004999 messenger RNA Proteins 0.000 claims abstract description 61
- 108020004705 Codon Proteins 0.000 claims description 76
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 65
- 108020004414 DNA Proteins 0.000 claims description 52
- 238000013519 translation Methods 0.000 claims description 45
- 239000000203 mixture Substances 0.000 claims description 27
- 230000000295 complement effect Effects 0.000 claims description 26
- 238000000338 in vitro Methods 0.000 claims description 22
- 239000000370 acceptor Substances 0.000 claims description 18
- 125000001151 peptidyl group Chemical group 0.000 claims description 17
- 102000053602 DNA Human genes 0.000 claims description 13
- 108700026244 Open Reading Frames Proteins 0.000 claims description 13
- 108091008146 restriction endonucleases Proteins 0.000 claims description 13
- 108091034117 Oligonucleotide Proteins 0.000 claims description 11
- 239000011541 reaction mixture Substances 0.000 claims description 9
- 238000003786 synthesis reaction Methods 0.000 claims description 9
- 230000000694 effects Effects 0.000 claims description 8
- 102100034343 Integrase Human genes 0.000 claims description 7
- 230000002194 synthesizing effect Effects 0.000 claims description 6
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 claims description 5
- 101710163270 Nuclease Proteins 0.000 claims description 4
- 210000003705 ribosome Anatomy 0.000 claims description 4
- 241000124008 Mammalia Species 0.000 claims description 3
- 108010010677 Phosphodiesterase I Proteins 0.000 claims description 3
- 239000013615 primer Substances 0.000 claims 9
- 239000003155 DNA primer Substances 0.000 claims 1
- 239000002299 complementary DNA Substances 0.000 abstract description 31
- 239000002773 nucleotide Substances 0.000 description 27
- 125000003729 nucleotide group Chemical group 0.000 description 27
- 102000004169 proteins and genes Human genes 0.000 description 27
- 108090000623 proteins and genes Proteins 0.000 description 27
- 238000013459 approach Methods 0.000 description 19
- 238000006243 chemical reaction Methods 0.000 description 16
- 108091026890 Coding region Proteins 0.000 description 15
- 108090000765 processed proteins & peptides Proteins 0.000 description 15
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N puromycin Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 description 13
- 239000003112 inhibitor Substances 0.000 description 11
- 238000013518 transcription Methods 0.000 description 11
- 230000035897 transcription Effects 0.000 description 11
- 230000037452 priming Effects 0.000 description 10
- 238000010804 cDNA synthesis Methods 0.000 description 9
- 238000010561 standard procedure Methods 0.000 description 9
- 238000004519 manufacturing process Methods 0.000 description 8
- 102000004196 processed proteins & peptides Human genes 0.000 description 8
- 102100030667 Eukaryotic peptide chain release factor subunit 1 Human genes 0.000 description 7
- 229950010131 puromycin Drugs 0.000 description 7
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 6
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 6
- 108091028043 Nucleic acid sequence Proteins 0.000 description 6
- 239000000499 gel Substances 0.000 description 6
- 108020003589 5' Untranslated Regions Proteins 0.000 description 5
- 101710137500 T7 RNA polymerase Proteins 0.000 description 5
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 5
- 229920001519 homopolymer Polymers 0.000 description 5
- 229920001184 polypeptide Polymers 0.000 description 5
- 239000000758 substrate Substances 0.000 description 5
- 108020005345 3' Untranslated Regions Proteins 0.000 description 4
- 102000012410 DNA Ligases Human genes 0.000 description 4
- 108010061982 DNA Ligases Proteins 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- 108010086093 Mung Bean Nuclease Proteins 0.000 description 4
- 150000001413 amino acids Chemical class 0.000 description 4
- 230000001413 cellular effect Effects 0.000 description 4
- 230000000875 corresponding effect Effects 0.000 description 4
- 108010052305 exodeoxyribonuclease III Proteins 0.000 description 4
- 229920002401 polyacrylamide Polymers 0.000 description 4
- 238000001542 size-exclusion chromatography Methods 0.000 description 4
- 108010008286 DNA nucleotidylexotransferase Proteins 0.000 description 3
- 102100033215 DNA nucleotidylexotransferase Human genes 0.000 description 3
- 101710175705 Eukaryotic peptide chain release factor subunit 1 Proteins 0.000 description 3
- 238000001261 affinity purification Methods 0.000 description 3
- 210000004027 cell Anatomy 0.000 description 3
- 229920002678 cellulose Polymers 0.000 description 3
- 239000001913 cellulose Substances 0.000 description 3
- 238000005520 cutting process Methods 0.000 description 3
- 239000012634 fragment Substances 0.000 description 3
- 239000006166 lysate Substances 0.000 description 3
- 108010086507 peptide-chain-release factor 3 Proteins 0.000 description 3
- 239000000700 radioactive tracer Substances 0.000 description 3
- 238000011144 upstream manufacturing Methods 0.000 description 3
- CKTSBUTUHBMZGZ-SHYZEUOFSA-N 2'‐deoxycytidine Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 CKTSBUTUHBMZGZ-SHYZEUOFSA-N 0.000 description 2
- 102000014914 Carrier Proteins Human genes 0.000 description 2
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 2
- 102000000634 Cytochrome c oxidase subunit IV Human genes 0.000 description 2
- 108050008072 Cytochrome c oxidase subunit IV Proteins 0.000 description 2
- QTANTQQOYSUMLC-UHFFFAOYSA-O Ethidium cation Chemical compound C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 QTANTQQOYSUMLC-UHFFFAOYSA-O 0.000 description 2
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 2
- 101710203526 Integrase Proteins 0.000 description 2
- 241000283973 Oryctolagus cuniculus Species 0.000 description 2
- 108010065868 RNA polymerase SP6 Proteins 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 239000011543 agarose gel Substances 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 239000000427 antigen Substances 0.000 description 2
- 230000000890 antigenic effect Effects 0.000 description 2
- 102000036639 antigens Human genes 0.000 description 2
- 108091007433 antigens Proteins 0.000 description 2
- 239000011324 bead Substances 0.000 description 2
- 210000001185 bone marrow Anatomy 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 210000004408 hybridoma Anatomy 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 238000006386 neutralization reaction Methods 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 2
- 230000012743 protein tagging Effects 0.000 description 2
- 238000003757 reverse transcription PCR Methods 0.000 description 2
- 125000002652 ribonucleotide group Chemical group 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- HRPVXLWXLXDGHG-UHFFFAOYSA-N Acrylamide Chemical compound NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 1
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 1
- 108010078791 Carrier Proteins Proteins 0.000 description 1
- 241000700198 Cavia Species 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 102000004594 DNA Polymerase I Human genes 0.000 description 1
- 108010017826 DNA Polymerase I Proteins 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 102100031780 Endonuclease Human genes 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 150000008575 L-amino acids Chemical class 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- GEYBMYRBIABFTA-VIFPVBQESA-N O-methyl-L-tyrosine Chemical compound COC1=CC=C(C[C@H](N)C(O)=O)C=C1 GEYBMYRBIABFTA-VIFPVBQESA-N 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 108090000279 Peptidyltransferases Proteins 0.000 description 1
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 1
- 101710086015 RNA ligase Proteins 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 108010090804 Streptavidin Proteins 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- 241000723873 Tobacco mosaic virus Species 0.000 description 1
- 108010028230 Trp-Ser- His-Pro-Gln-Phe-Glu-Lys Proteins 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- ARLKCWCREKRROD-POYBYMJQSA-N [[(2s,5r)-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl] phosphono hydrogen phosphate Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)CC1 ARLKCWCREKRROD-POYBYMJQSA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 239000012082 adaptor molecule Substances 0.000 description 1
- 229960005305 adenosine Drugs 0.000 description 1
- 150000003838 adenosines Chemical class 0.000 description 1
- 239000002671 adjuvant Substances 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 150000001408 amides Chemical class 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- RGWHQCVHVJXOKC-SHYZEUOFSA-J dCTP(4-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-J 0.000 description 1
- 238000003936 denaturing gel electrophoresis Methods 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- 239000003596 drug target Substances 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 150000002576 ketones Chemical class 0.000 description 1
- 230000017156 mRNA modification Effects 0.000 description 1
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 238000001471 micro-filtration Methods 0.000 description 1
- 238000001823 molecular biology technique Methods 0.000 description 1
- 230000003472 neutralizing effect Effects 0.000 description 1
- 239000012038 nucleophile Substances 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000003498 protein array Methods 0.000 description 1
- 230000004850 protein–protein interaction Effects 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 210000001995 reticulocyte Anatomy 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 238000012409 standard PCR amplification Methods 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 108010018381 streptavidin-binding peptide Proteins 0.000 description 1
- 125000003396 thiol group Chemical group [H]S* 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical group [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1034—Isolating an individual clone by screening libraries
- C12N15/1062—Isolating an individual clone by screening libraries mRNA-Display, e.g. polypeptide and encoding template are connected covalently
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1034—Isolating an individual clone by screening libraries
- C12N15/1075—Isolating an individual clone by screening libraries by coupling phenotype to genotype, not provided for in other groups of this subclass
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1096—Processes for the isolation, preparation or purification of DNA or RNA cDNA Synthesis; Subtracted cDNA library construction, e.g. RT, RT-PCR
Definitions
- the invention features methods for modifying nucleic acid substrates, for example, for the production of RNA-protein fusions.
- RNA-protein fusions may be used in methods for generating or isolating proteins with desired properties from pools of proteins.
- an RNA and the peptide or protein that it encodes may be joined during in vitro translation using synthetic RNA that carries a peptidyl acceptor, such as puromycin, at its 3′-end (Roberts & Szostak (1997) Proc. Natl. Acad. Sci. USA 94, 12297-12302).
- the synthetic RNA which is devoid of stop codons, is typically synthesized by in vitro transcription from a DNA template followed by 3′-ligation to a DNA linker carrying puromycin.
- the DNA sequence causes the ribosome to pause at the end of the open reading frame, providing additional time for the puromycin to accept the nascent peptide chain and resulting in the production of the RNA-protein fusion molecule.
- the present invention involves methods for optimizing the production of RNA-protein fusions beginning with cellular RNA or other nucleic acids having 3′-untranslated regions.
- such fusions may be generated by at least two general techniques.
- nucleic acids are produced which lack both 3′-untranslated regions and poly A tails. These nucleic acids, which may also lack a terminal stop codon, are then used for the production of RNA-protein fusions.
- the fusion is generated in an in vitro translation reaction mixture which lacks functional translation release factors.
- RNA-protein fusions The absence of these factors circumvents the problem of termination at terminal stop codons (or other stop codons inadvertently introduced into a protein coding sequence) and allows for the generation of RNA-protein fusions.
- the invention also encompasses methods in which these two general approaches are combined for the purpose of RNA-protein fusion formation and methods in which the approaches, singly or in combination, are used for other purposes in which nucleic acids lacking 3′-terminal sequences or translation through stop codons are useful or desirable.
- the invention features a method for removing the 3′-untranslated region of a DNA molecule including an open reading frame, the method involving: (a) providing a DNA molecule having an open reading frame and a 3′-untranslated region, the DNA molecule terminating at its 5′ end in an overhang and at its 3′ end in a blunt end; and (b) treating the DNA molecule first with a 3′ ⁇ 5′ exonuclease and then with a single-stranded nuclease under conditions that allow removal of the 3′-untranslated region.
- the 3′ ⁇ 5′ exonuclease is exonuclease III; the nuclease is Mung bean nuclease; step (b) further results in removal of the stop codon of the open reading frame; the DNA molecule is a cDNA produced by reverse transcription from an mRNA sequence; and the method is carried out on a population of DNA molecules.
- the invention features a method for removing the 3′-untranslated region of an mRNA molecule, the method involving: (a) translating an mRNA molecule in vitro in a translation reaction mixture lacking functional translation release factor activity, resulting in pausing of the translation reaction mixture ribosomes at the stop codon of the mRNA molecule; (b) adding, to the translation reaction mixture of step (a), reverse transcriptase and an oligonucleotide primer which is complementary to the 3′-untranslated region of the mRNA molecule at a site proximal to the stop codon, under conditions which allow the synthesis of a strand of DNA that is complementary to the 3′-untranslated region and terminates at a site proximal to the stop codon; and (c) removing the RNA portion of the RNA-DNA duplex formed in step (b), thereby removing the 3′-untranslated region of the mRNA molecule.
- the oligonucleotide primer comprises a poly T sequence; step (c) is carried out by treatment of the product of step (b) with RNaseH; the method is carried out on a population of mRNA molecules; and the method further involves the steps of: (d) ligating to the 3′ end of the product of step (c) a linker including a Type IIS restriction site; (e) extending the product of step (d) to produce a double-stranded DNA molecule; and (f) treating the double-stranded DNA molecule with the Type IIS restriction enzyme to cleave the DNA molecule and remove the stop codon.
- the invention features a method for removing the 3′-untranslated regions and stop codons of a population of mRNA molecules, the method involving: (a) providing a population of mRNA molecules; (b) synthesizing strands of DNA, each of which is complementary to one of said mRNA molecules, using a random primer mixture, the random primer mixture including primers, each having (i) a 3′ region including a stop codon flanked by a random oligonucleotide located 3′, 5′, or both to the stop codon; and (ii) a 5′ region including a Type IIS restriction site; (c) ligating to the 3′ ends of the DNA products of step (b) an oligonucleotide tail; (d) amplifying the products of step (c) using (i) a first primer which is complementary to the Type IIS restriction site-containing sequence; and (ii) a second primer which is complementary to the oligonucleotide tail
- the second primer of step (d) further includes a 5′ region including an RNA polymerase recognition site; and the method further comprises: (f) ligating a sequence which encodes an affinity tag to the cleaved ends of the products of step (e); (g) transcribing the products of step (f); (h) ligating peptidyl acceptors to the 3′ ends of the RNA products of step (g); (i) translating the products of step (h) to produce a population of RNA-protein fusions; and (j) substantially isolating RNA-protein fusions which comprise the affinity tag, thereby obtaining a population of mRNA molecules lacking 3′-untranslated regions and stop codons.
- the invention features a method for removing the 3′-untranslated regions and stop codons of a population of mRNA molecules, involving: (a) providing a population of mRNA molecules; (b) synthesizing strands of DNA, each of which is complementary to one of the mRNA molecules, using a random primer mixture, the random primer mixture including primers, each having (i) a 5′ region which lacks a stop codon in at least one reading frame and (ii) a random 3′ region; and (c) synthesizing strands of DNA complementary to the DNA strands of step (b), using a second random primer mixture.
- the second random primer mixture includes primers, each having (i) a 5′ region which includes a translation start site and (ii) a random 3′ region; and wherein said method further involves (d) amplifying the product of step (c) using a first amplification primer having (i) a 5′ sequence which includes an RNA polymerase recognition site and (ii) a 3′ region which is complementary to the translation start site.
- the RNA polymerase recognition site is a T7 or SP6 RNA polymerase recognition site;
- the affinity tag is a hexahistidine peptide, a streptavidin-binding peptide, or an epitope;
- the peptidyl acceptor is puromycin; and the method is carried out on a population of mRNA molecules.
- the invention features a method for producing an RNA-protein fusion from an mRNA having a 3′-untranslated region, the method involving: (a) covalently bonding the mRNA to a peptidyl acceptor, the peptidyl acceptor being positioned 3′ of the protein coding sequence of the mRNA; and (b) translating the mRNA molecule in vitro in a translation reaction mixture lacking functional translation release factor activity.
- the invention features a method for producing an RNA-protein fusion from a nucleic acid having a 3′-untranslated region, the method involving: (a) providing the DNA product obtained above lacking a 3′-untranslated region; (b) transcribing the DNA to produce RNA lacking a 3′-untranslated region; (c) covalently bonding to the RNA a peptidyl acceptor, the peptidyl acceptor being positioned 3′ of the protein coding sequence of the RNA; and (d) translating the product of step (c) to produce an RNA-protein fusion.
- the DNA product lacks a stop codon; and the translating step is carried out in vitro in a translation reaction mixture lacking functional translation release factor activity.
- the invention features a method for producing an RNA-protein fusion from a nucleic acid having a 3′-untranslated region, the method involving: (a) providing the RNA product obtained above lacking a 3′-untranslated region; (b) covalently bonding to the RNA a peptidyl acceptor, the peptidyl acceptor being positioned 3′ of the protein coding sequence of the RNA; and (c) translating the product of step (b) to produce an RNA-protein fusion.
- the invention features a library of nucleic acid molecules, each molecule including an open reading frame and lacking the 3′-untranslated region normally associated with the open reading frame.
- the nucleic acid is DNA or RNA (for example, messenger RNA or cellular RNA derived, for example, from a eukaryotic organism, such as a mammal, and, for example, a human); the library includes at least 10 5 members; and the nucleic acid molecules of the library also lack stop codons.
- DNA or RNA for example, messenger RNA or cellular RNA derived, for example, from a eukaryotic organism, such as a mammal, and, for example, a human
- the library includes at least 10 5 members; and the nucleic acid molecules of the library also lack stop codons.
- the invention features libraries of nucleic acid molecules and RNA-protein fusions produced by the methods of the invention.
- a “population” is meant more than one molecule.
- a population includes at least 10 molecules, more preferably, at least 10 2 or 10 3 molecules, and, most preferably, at least 10 4 , 10 5 , or 10 6 molecules.
- a “library” is also any group of molecules.
- a library includes at least 10, preferably, at least 10 2 or 10 3 , and, most preferably, at least 10 4 , 10 5 , or 10 6 molecules.
- a “protein” is meant any two or more naturally occurring or modified amino acids joined by one or more peptide bonds. “Protein” and “peptide” are used interchangeably herein.
- RNA is meant a sequence of two or more covalently bonded, naturally occurring or modified ribonucleotides.
- a modified RNA included within this term is phosphorothioate RNA.
- DNA is meant a sequence of two or more covalently bonded, naturally occurring or modified deoxyribonucleotides.
- covalently bonded to a peptidyl acceptor is meant that the peptidyl acceptor is joined either directly through a covalent bond or indirectly through another covalently bonded sequence (for example, DNA corresponding to a pause site).
- a “peptidyl acceptor” is meant any molecule capable of being added to the C-terminus of a growing protein chain by the catalytic activity of the ribosomal peptidyl transferase function.
- such molecules contain (i) a nucleotide or nucleotide-like moiety (for example, adenosine or an adenosine analog (di-methylation at the N-6 amino position is acceptable)), (ii) an amino acid or amino acid-like moiety (for example, any of the 20 D- or L-amino acids or any amino acid analog thereof (for example, O-methyl tyrosine or any of the analogs described by Ellman et al., Meth. Enzymol.
- Peptide acceptors may also possess a nucleophile, which may be, without limitation, an amino group, a hydroxyl group, or a sulfhydryl group.
- peptidyl acceptors may be composed of nucleotide mimetics, amino acid mimetics, or mimetics of the combined nucleotide-amino acid structure.
- FIG. 1 is a schematic illustration of one exemplary approach for removing the 3′-untranslated region and poly A tail from a nucleic acid molecule.
- FIG. 2 is a schematic illustration of a second exemplary approach for removing the 3′-untranslated region and poly A tail from a nucleic acid molecule.
- FIG. 3 is a schematic illustration of a third exemplary approach for removing the 3′-untranslated region and poly A tail from a nucleic acid molecule.
- FIG. 4 is a diagram illustrating a map of the human cytochrome oxidase IV subunit A mRNA. This mRNA contains a total of 19 stop codons: one authentic codon, one in the 5′ UTR, 14 in the open reading frame, and three in the 3′ UTR.
- FIG. 5 is a photograph illustrating the products of first strand cDNA synthesis of the mRNA of FIG. 4, run on a denaturing polyacrylamide gel. As expected, a series of bands were observed, likely due to priming at stop codons within the RNA.
- FIG. 6 is a photograph illustrating the products of second strand cDNA synthesis of the mRNA of FIG. 4. PCR amplification following second strand synthesis revealed a banding pattern similar to that observed after first strand synthesis.
- FIG. 7 is a photograph illustrating the products of an in vitro transcription reaction using the cDNA of FIG. 6 and “pull through” PCR following ligation of the affinity tag 3′ terminus. The image shown is color reversed from an ethidium stained agarose gel to enhance resolution.
- FIG. 8 is a photograph illustrating RNA-protein fusions produced from cellular mRNA using biased random priming to remove stop codons.
- FIG. 9 is a photograph showing the products of random primed cDNA synthesis from polyA+ mRNA from HL60 cells and normal human bone marrow (NBM) run on a denaturing acrylamide gel.
- FIG. 10 is a photograph illustrating PCR-amplified second strand cDNA generated from the product of FIG. 9. An aliquot of the second strand synthesis reaction was PCR amplified under standard conditions. Aliquots were removed after the specified number of cycles and run on a 2% agarose gel. The image shown is a negative of the ethidium stained gel to enhance resolution.
- FIG. 11 is a photograph illustrating radiolabeled RNA transcripts produced from the dsDNA template library of FIG. 10. These transcripts were produced using T7 RNA polymerase and run on a denaturing polyacrylamide gel.
- FIG. 12 is a photograph illustrating that ligation of a 32 P-labeled linker to the RNA library of FIG. 11 results in a shift in mobility of the linker.
- FIG. 13 is a photograph illustrating fusions formed between the RNA library of FIG. 11 and translated peptides. These fusions were purified by oligo-dT cellulose and analyzed by SDS-PAGE. Such fusions could only be formed in the absence of a stop codon.
- FIG. 14 is a diagram illustrating the sequence of clones selected from an RNA-protein fusion library derived from cellular RNA and which lack both stop codons and 3′ untranslated regions.
- the first line is the clone sequence from the fusion library
- the second line is the parent RNA sequence.
- the shaded regions correspond to the N 9 portion of the primers.
- the present invention provides two general approaches for the modification or use of nucleic acids having 3′-untranslated regions for the production of RNA-protein fusions, or any other technique where stop codons or untranslated regions are undesirable.
- mRNA or cDNA libraries are created that lack 3′ untranslated regions and poly A tails, and, if desired, also lack 3′-terminal stop codons.
- Such cDNAs are greatly improved compared to traditional cDNA libraries since they are enriched for coding sequence information.
- creation of these cDNA libraries enables the creation of libraries of cellular mRNA molecules covalently linked to the protein molecules the mRNAs encode.
- fusion libraries can be used for a variety of applications, including the identification of protein-protein interactions, identification of drug targets, and hybridization to solid supports to create, for example, protein chips (or beads); if desired, the RNA-protein molecules may be arranged in spatially defined arrays on such chips to carry out large scale screening, for example, for protein or compound identification.
- Exemplary uses for RNA-protein fusions are described, for example, in Roberts & Szostak (1997) Proc. Natl. Acad. Sci. USA 94, 12297-12302; Szostak et al., Selection of Proteins Using RNA-Protein Fusions, U.S. Ser. No. 09/007,005, Jan. 14, 1998 and U.S. Ser. No.
- the second approach of the invention focuses on overcoming the natural translational termination which is brought about by the interaction between the stop codon at the 3′ end of an mRNA coding sequence and the release factors present in a translation lysate.
- stop codons are removed from the mRNA molecule (as described above) or the release factor activity is removed from the in vitro translation system.
- translation results in mRNA-polypeptide-ribosome complexes which are suitable substrates for the formation of mRNA-protein fusions.
- this approach simplifies fusion formation beginning with natural mRNA messages which contain stop codons and also simplifies the use of such fusion technology for such applications as functional genomics.
- FIG. 1 shows a first mRNA modification technique in which the coding sequence is modified at the DNA level.
- the coding regions of a cDNA library are excised from host vectors in such a way that the sequence upstream of the coding sequence terminates in a single 3′ DNA chain overhang of at least four bases, whereas the sequence downstream of the coding sequence terminates in a blunt cut. This may be accomplished by the use of appropriate restriction enzymes (in combination, for example, with vectors containing useful restriction sites) and standard molecular biology techniques.
- Exonuclease III and Mung bean nuclease are then used sequentially (with exonuclease III being used first and Mung bean nuclease being used second) to remove nucleotides from the unprotected, downstream end of the cDNA clone.
- the length of incubation with exonuclease III is adjusted by standard techniques such that the cDNA polyadenosine tail, 3′ untranslated region, and (if desired) stop codon, but little of the coding sequence, are removed.
- S1 nuclease may be used in place of Mung bean nuclease, again adjusting the incubation time to allow removal of the 3′-untranslated region but little or none of the coding sequence.
- RNA-protein fusion formation For use in RNA-protein fusion formation, a defined DNA sequence may then be ligated to the newly created downstream end, creating the ideal substrate for in vitro transcription and translation.
- This DNA sequence is complementary to a splint sequence that is used to facilitate the ligation of a peptidyl acceptor to the mRNA product of the modified DNA upon transcription.
- Exemplary sequences and methods for in vitro transcription, in vitro translation, and fusion formation are described, for example, in Roberts & Szostak (1997) Proc. Natl. Acad. Sci. USA 94, 12297-12302; and Szostak et al., U.S. Ser. No. 09/007,005 and U.S. Ser. No. 09/247,190.
- RNA substrate may be used directly in in vitro transcription and in vitro translation steps or, as shown in FIG. 1, may be amplified (for example, by standard PCR amplification) to generate a library of cDNA molecules lacking 3′-untranslated regions.
- cDNA clones are transcribed in vitro into mRNA molecules which contain stop codons, untranslated 3′ regions, and polyadenosine tails.
- mRNA may be isolated from cells and used directly. The mRNA is then subjected to in vitro translation by any standard technique in the presence of inhibitors of translation release factors (see below). Under such reaction conditions, ribosomes do not release the polypeptide chain upon reaching the stop codon, but instead pause.
- a DNA oligonucleotide primer complementary to the polyA tail that is, a poly T sequence preferably of a length of between 10-30 nucleotides
- reverse transcriptase are then added to the mix, resulting in the synthesis of a strand of DNA complementary to the downstream region of the mRNA which terminates in the region proximal to the stop codon.
- RNaseH is then used to remove the RNA portion of the RNA-DNA region.
- the RNA product may then be used to generate cDNA libraries or for RNA-protein fusion formation.
- an adaptor molecule is preferably ligated to the RNA to create a defined sequence on the 3′ end using T4 RNA ligase.
- This adaptor is a short, double-stranded piece of DNA (preferably, between 10-50 base pairs in length) with a sequence designed to facilitate further processing of the cDNA library.
- the adaptor is used as the basis for complementary PCR primers for cDNA library construction, or as “splint” oligonucleotides to facilitate the ligation of RNA products to peptidyl acceptor-containing linkers, as described below.
- RNA-linker-puromycin construct may then be used directly for in vitro translation in a lysate depleted of release factors to generate RNA-protein fusion molecules.
- a linker with a defined sequence containing an offset cutting restriction enzyme site such as a Type IIS restriction site (for example, a BsgI, HphI, or AsuHPI restriction site) is ligated, as described above, to the region downstream of the stop codon.
- the RNA is then amplified, for example, by standard methods of RT-PCR, and treated with the restriction enzyme. This type of restriction enzyme cuts upstream from its recognition site, thus removing the stop codon.
- the DNA, which contains the coding sequence but not the stop codon may then be used in standard protocols for transcription and formation of RNA-protein fusions (see, for example, Roberts & Szostak (1997) Proc.
- biased random priming is used to remove both 3′ untranslated regions and the stop codons from the members of a cDNA library.
- This general approach is shown in FIG. 3.
- a cDNA library is made, by standard techniques, from purified cellular mRNA using a biased random primer mix.
- This mix includes primers with sequences complementary to each of the three stop codons (TGA, TAA, or TAG) (one stop codon per primer) in the 3′ region flanked on the 3′ side, 5′ side, or both by an additional 1-8 nucleotide long, completely random sequence.
- the 5′ region of the primer contains a fixed sequence corresponding to the recognition site for an offset cutting (Type IIS) restriction enzyme.
- Type IIS restriction enzymes include BsgI, HphI, and AsuHPI.
- RNA template is removed. This can be accomplished either enzymatically, for example, through the action of an RNase, or chemically, for example, by treatment at high pH (for example, a pH of at least 13).
- the cDNA strands are then tailed with a homopolymeric sequence using an enzyme such as terminal deoxynucleotidyl transferase (TdT).
- TdT terminal deoxynucleotidyl transferase
- a particularly suitable tail is poly-deoxycytidine.
- the resulting tailed cDNA is then amplified, for example, using PCR and appropriate primer sequences.
- One of these primers is complementary to the conserved region of the initial primer which contained the restriction site, and the second primer contains a 5′ region that includes an RNA polymerase recognition sequence (for example, a T7 or SP6 RNA polymerase recognition site) and a 3′ region that is complementary to the homopolymer tail plus 1-3 terminal nucleotides containing a mix of all nucleotides.
- the closest of these mixed nucleotides to the homopolymer region may contain any nucleotide except G.
- Such a tail ensures that the primer preferentially aligns with the first few nucleotides of the poly-deoxycytidine tail.
- the double-stranded PCR product is then digested with the off-set cutting Type IIS restriction enzyme. Because of the primer used in the random priming step, this restriction cut occurs upstream of the stop codon at which the initial priming event occurred. In certain situations, it may be desirable to only partially cut the PCR products, for example, if those products are known or suspected to contain one or more native internal restriction sites for the chosen enzyme. In these circumstances, the restriction conditions are adjusted such that the enzyme cuts each product, on average, only once.
- RNA polymerase that is, one which corresponds to the RNA polymerase recognition site chosen above
- the double-stranded DNA is transcribed to produce single-stranded RNA.
- Each of these RNA molecules has the same 3′ terminus, corresponding to the ligated affinity purification tag.
- Additional sequence is then ligated onto the 3′ ends of these RNA strands in a template-directed manner, using an enzyme such as T4 DNA ligase.
- This new 3′ sequence is preferably poly-deoxyadenosine with a 3′ terminal moiety suitable for producing nucleic acid/protein fusions, for example, a dCC-puromycin group.
- the ligated product is then purified and translated using any suitable in vitro translation system, for example, a rabbit reticulocyte lysate.
- a rabbit reticulocyte lysate In such a system, the ribosome pauses upon reaching the poly-deoxyadenosine region, and the dCC-puromycin group is fused to the nascent polypeptide strand. If a stop codon is encountered prior to the poly-deoxyadenosine, the ribosome is released, and no fusion occurs. This will be the case if the initial priming site occurred in the 3′ untranslated region.
- Nucleic acid/protein fusions are then purified using the translated affinity purification tag. If the initial site of priming was an out-of-frame stop codon, the affinity tag will be mis-translated. Therefore, by this selection, only fusions from in-frame stop codons will be present after purification.
- RNA from the purified fusions is then recovered and amplified using, for example, RT-PCR.
- the resulting cDNA library should have only full length, in-frame mRNAs with no in-frame stop codons and no 3′ untranslated regions.
- the RNA population may be used as described above to generate a cDNA library or directly for RNA-protein fusion formation.
- RNA encoded the human cytochrome oxidase IV subunit A.
- the particular RNA that was used (FIG. 4) was generated by transcription from a PCR fragment and contained a 42 nucleotide 5′ UTR, a 501 nucleotide open reading frame (ORF), and a 124 nucleotide 3′ UTR. There were a total of 19 stop codons contained within the RNA: one authentic, one in the 5′ UTR, 14 out of frame in the open reading frame, and three in the 3′ UTR.
- This RNA also contained an internal restriction site for the Type IIS restriction enzyme used in the method, thereby representing a realistic model for cellular mRNA populations.
- first strand cDNA synthesis was performed using a mix of primers that contained (5′ to 3′) the recognition sequence for the Type IIS restriction endonuclease, Bpm I, followed by six random nucleotides and, at the 3′ terminus, three nucleotides complementary to the human stop codons.
- N denotes a mix of all four nucleotides dG/dA/dC/dT: 5′-GCT TGC TGG AGT GCG AGT NNN NNN CTA 5′-GCT TGC TGG AGT GCG AGT NNN NNN TTA 5′-GCT TGC TGG AGT GCG AGT NNN NNN TCA.
- RNA was annealed to between 25-125 pmoles of primer mix, then extended with reverse transcriptase by standard techniques.
- ⁇ - 32 P-dATP was included as a trace label in the reaction.
- E. coli RNase H was added to remove the RNA strand, and an aliquot of the reaction was run on a denaturing polyacrylamide gel (FIG. 5).
- a homopolymer tail of dC was added to the first strand cDNA using the enzyme terminal deoxynucleotidyl transferase.
- the length of the tail was controlled by including ddCTP in the extension reaction at a ratio of 1:9 with dCTP.
- the tailed cDNA was then copied in a second strand synthesis reaction using a primer that contained a T7 promoter followed by a 9 nucleotide dG tail, a penultimate nucleotide mix of dC/dA/dT, and a terminal random nucleotide.
- This primer had the following sequence (SEQ ID NO: 4; H denotes a mix of the nucleotides dA/dC/dT and N denotes a mix of all four nucleotides dG/dA/dC/dT):
- the final two nucleotides conferred priming specificity by preferentially being extended from the extreme internal portion of the homopolymer tail.
- PCR (using primers complementary to the fixed regions of the primers from FIGS. 4) was used to generate a double-stranded template (FIG. 6).
- This template was then partially digested with Bpm I endonuclease. Cleavage from the Bpm I site in the second strand primer resulted in the removal of the third position nucleotide from all stop codons.
- a new double-stranded 3′ terminus encoding the affinity sequence Strep-Tag II (available from Genosys Biotechnologies, Inc., The Woodlands, Tex.) was then ligated onto the cleaved fragments. This new terminus was designed to be ligated in frame with the authentic stop codon, converting it to a tyrosine and thus eliminating the stop.
- RNA was then enzymatically ligated to a puromycin-containing DNA linker (by the method of Roberts & Szostak (1997) Proc. Natl. Acad. Sci.
- random priming is used to remove both 3′ untranslated regions and stop codons from cDNA molecules.
- the methods described above for producing fusions from cellular RNA are generally designed to produce protein moieties with essentially wild-type N-termini.
- libraries of fusions from cellular RNA that consist of various N- and C-terminal truncated species as well.
- such a domain library may contain functional units that are easier to produce and select than full-length proteins.
- random priming was utilized to generate cDNA molecules as follows.
- Poly A + mRNA was obtained by standard methods from two sources, human bone marrow and HL60 cells. A cDNA copy of this mRNA was then produced using the following primer (SEQ ID NO: 5):
- This first strand primer was in the minus sense relative to the RNA strand and in one reading frame encoded the FLAG epitope. Because this fixed sequence contained no stop codons in two of the three potential reading frames, RNA produced from this template would contain no stop codons in two reading frames.
- This primer contained a 5′ fixed sequence and nine random nucleotides at the 3′ terminus. 125 pmoles of the primer was annealed to 5 ⁇ g of mRNA and then extended using reverse transcriptase and standard techniques. A portion of the reaction was performed in the presence of ⁇ - 32 P-dATP as a tracer and assayed by denaturing gel electrophoresis (FIG. 9). After first strand synthesis, the RNA strand was removed by digestion with RNase H. Unextended primers were removed by size exclusion chromatography.
- Second strand cDNA synthesis was performed using the Klenow fragment of DNA polymerase and the following primer (SEQ ID NO: 6):
- This second strand primer was in the plus sense relative to the RNA strand, contained nine random nucleotides at the 3′ end, and included a 5′ fixed region having an ATG start codon and the 5′ UTR from tobacco mosaic virus as a ribosome binding site. Again, a portion of the reaction was performed in the presence of ⁇ - 32 P-dATP as a tracer (FIG. 9). The unextended primers were removed by size exclusion chromatography.
- the second strand cDNA containing both fixed regions was then amplified by PCR to create a double stranded template (FIG. 10).
- the forward PCR primer was complementary to the 5′ UTR region of the second strand primer and also encoded the promoter sequence for T7 RNA polymerase.
- the reverse PCR primer was complementary to the fixed region of the first strand primer and also encoded sequences required for subsequent ligation of RNA produced from the template. These primer sequences are shown below (SEQ ID NOS: 7, 8): 5′ TAA TAC GAC TCA CTA TAG GGA CAA TTA CTA TTT ACA ATT (forward) 5′ AGA AGA TGC GCG ATC GTC ATC GTC CTT GTA GTC (reverse).
- FIG. 10 The results of this amplification step are shown in FIG. 10.
- the intense PCR product of approximately 75 nucleotides (FIG. 10) was apparently due to primer-dimer formation and could be reduced with an additional size exclusion chromatography step.
- the double-stranded template from PCR was transcribed using T7 RNA polymerase (as described in Roberts & Szostak (1997) Proc. Natl. Acad. Sci. USA 94, 12297-12302; and Szostak et al., Selection of Proteins Using RNA-Protein Fusions, U.S. Ser. No. 09/007,005, Jan. 14, 1998, and U.S. Ser. No. 09/247,190, Feb. 9, 1999).
- RNA transcripts When ⁇ - 32 P-dATP was included in the transcription reaction a range of RNA transcripts was produced that reflected the variable size of the template library (FIG. 11). Because the specific activity of a given transcript was proportional to the length, longer RNA products appeared darker.
- RNA linker with a 5′ puromycin moiety was then ligated to the end of the RNA in a template directed reaction using T4 DNA ligase (as described in Roberts & Szostak (1997) Proc. Natl. Acad. Sci. USA 94, 12297-12302; and Szostak et al., Selection of Proteins Using RNA-Protein Fusions, U.S. Ser. No. 09/007,005, Jan. 14, 1998, and U.S. Ser. No. 09/247,190, Feb. 9, 1999).
- the DNA linker was 5′ radiolabeled with 32 P to allow the reaction to be followed on a denaturing polyacrylamide gel (FIG. 12). The shift in mobility of the linker was the result of ligation to the RNA library.
- RNA-Protein Fusions by the methods of Roberts & Szostak (1997) Proc. Natl. Acad. Sci. USA 94, 12297-12302; and Szostak et al., Selection of Proteins Using RNA-Protein Fusions, U.S. Ser. No. 09/007,005, Jan. 14, 1998, and U.S. Ser. No. 09/247,190, Feb. 9, 1999).
- the translation reaction contained 35 S-met so that the newly translated proteins were radiolabeled.
- RNA being translated contained a stop codon
- the ribosome complex would dissociate from the template, and no fusion would be formed. Accordingly, the formation of fusions correlated with the lack of stop codons.
- a fusion library constructed essentially as above was subsequently selected for a particular aspect of the protein portion of the protein-RNA fusion.
- a number of individual members of the resulting selected pool were isolated and sequenced (FIG. 14). Alignment with the parental RNA sequences obtained from a sequence database allowed the selected region to be identified. Comparison of the recovered clones with the parent RNA showed that, in general, each of these clones represented an in-frame region of a cellular RNA message devoid of both stop codons and a 3′ UTR.
- stop codons present in an RNA sequence are overcome by neutralization or removal of translation release factors from in vitro translation mixes.
- eRF1 and eRF3 must be neutralized.
- both RF1 and RF2 or, alternatively, RF3 alone must be neutralized to inhibit polypeptide chain release.
- a release factor is neutralized by the use of antibodies or by exploiting genetically engineered variants of the natural release factor binding partners.
- the release factor may be removed from the translation mix by using its affinity to specific components of the translation complex, such as stop codons.
- Neutralizing antibodies which can be either polyclonal or monoclonal, are raised against the entire release factor or to one of its constituent domains or peptides.
- One such antibody and an exemplary method of preparation is described in Zhouravleva et al. (EMBO J. 14:4065-72 (1995)).
- Such antibodies may be produced by any standard technique.
- the antigen is first expressed in a heterologous expression system or synthesized chemically and then purified to homogeneity.
- the antigenic peptide may be coupled to a carrier protein, such as KLH as described in Ausubel et al, Current Protocols in Molecular Biology, Wiley Interscience, New York, N.Y.
- the peptide may then be mixed with Freund's adjuvant and injected into guinea pigs, rats, or preferably rabbits to produce polyclonal antibodies.
- the antibodies may be purified by peptide antigen affinity chromatography.
- Monoclonal antibodies may be prepared using these same antigenic peptides and standard hybridoma technology (see, e.g., Kohler et al., Nature 256:495, 1975; Kohler et al., Eur. J. Immunol. 6:511, 1976; Kohler et al., Eur. J. Immunol. 6:292, 1976; Hammerling et al., In Monoclonal Antibodies and T Cell Hybridomas, Elsevier, N.Y., 1981; Ausubel et al., supra).
- eRF1 may be neutralized by an excess of an inactive mutant of eRF3.
- eRF3 may be neutralized by an inactive mutant of eRF1.
- RF1 and RF2 can both be inhibited by an excess of an inactive mutant of RF3, and RF3 can be inhibited by an excess of an inactive mutant of RF1 or RF2.
- Such mutants are created by standard techniques, for example, by random or site-directed mutagenesis, followed by an assay for loss of RF activity; in one particular example, residues in the GTP-binding motif of RF3 necessary for activity may be mutated.
- analogues of stop codons may be used as inhibitors to bind, for example, to RF1.
- Exemplary stop codon analogues are short oligonucleotides (composed of RNA, DNA, or chemically modified RNA) which contain the sequence of all possible stop codons.
- any of the above described release factor inhibitors may be used in at least three different ways.
- a soluble inhibitor may be added to an in vitro translation mixture.
- the inhibitor binds tightly to its target and prevents the release factor from interacting with the mRNA-protein-ribosome-GTP complex.
- the inhibitor (including a stop codon sequence) may be immobilized on a solid bead.
- the inhibitor binds to the release factor, and the complex of release factor and immobilized inhibitor are removed from solution, for example, by centrifugation or microfiltration.
- the inhibitor may be immobilized on a column, and the translation mixture passed through the column. The translation mixture that flows through the column is cleared of release factor and, when used as an in vitro translation mix, fails to release a nascent polypeptide chain from an mRNA-ribosome-GTP complex.
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Biomedical Technology (AREA)
- Wood Science & Technology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Crystallography & Structural Chemistry (AREA)
- Plant Pathology (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Bioinformatics & Computational Biology (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
Abstract
Described herein are methods for removing the 3′-untranslated regions from cDNA or mRNA molecules, as well as methods for the use of such products for RNA-protein fusion formation.
Description
- This application claims the benefit of the filing date of provisional application, U.S. Ser. No. 60/096,818, filed Aug. 17, 1998, now abandoned, and utility application, U.S. Ser. No. 09/374,962, filed Aug. 16, 1999.
- In general, the invention features methods for modifying nucleic acid substrates, for example, for the production of RNA-protein fusions.
- Covalently bonded RNA-protein fusions may be used in methods for generating or isolating proteins with desired properties from pools of proteins. To create such fusions, an RNA and the peptide or protein that it encodes may be joined during in vitro translation using synthetic RNA that carries a peptidyl acceptor, such as puromycin, at its 3′-end (Roberts & Szostak (1997) Proc. Natl. Acad. Sci. USA 94, 12297-12302). In this process, the synthetic RNA, which is devoid of stop codons, is typically synthesized by in vitro transcription from a DNA template followed by 3′-ligation to a DNA linker carrying puromycin. The DNA sequence causes the ribosome to pause at the end of the open reading frame, providing additional time for the puromycin to accept the nascent peptide chain and resulting in the production of the RNA-protein fusion molecule.
- The present invention involves methods for optimizing the production of RNA-protein fusions beginning with cellular RNA or other nucleic acids having 3′-untranslated regions. As described in more detail below, such fusions may be generated by at least two general techniques. According to one general approach, nucleic acids are produced which lack both 3′-untranslated regions and poly A tails. These nucleic acids, which may also lack a terminal stop codon, are then used for the production of RNA-protein fusions. According to the second technique, rather than modifying the nucleic acid substrate, the fusion is generated in an in vitro translation reaction mixture which lacks functional translation release factors. The absence of these factors circumvents the problem of termination at terminal stop codons (or other stop codons inadvertently introduced into a protein coding sequence) and allows for the generation of RNA-protein fusions. The invention also encompasses methods in which these two general approaches are combined for the purpose of RNA-protein fusion formation and methods in which the approaches, singly or in combination, are used for other purposes in which nucleic acids lacking 3′-terminal sequences or translation through stop codons are useful or desirable.
- Accordingly, in a first aspect, the invention features a method for removing the 3′-untranslated region of a DNA molecule including an open reading frame, the method involving: (a) providing a DNA molecule having an open reading frame and a 3′-untranslated region, the DNA molecule terminating at its 5′ end in an overhang and at its 3′ end in a blunt end; and (b) treating the DNA molecule first with a 3′→5′ exonuclease and then with a single-stranded nuclease under conditions that allow removal of the 3′-untranslated region.
- In preferred embodiments, the 3′→5′ exonuclease is exonuclease III; the nuclease is Mung bean nuclease; step (b) further results in removal of the stop codon of the open reading frame; the DNA molecule is a cDNA produced by reverse transcription from an mRNA sequence; and the method is carried out on a population of DNA molecules.
- In a related aspect, the invention features a method for removing the 3′-untranslated region of an mRNA molecule, the method involving: (a) translating an mRNA molecule in vitro in a translation reaction mixture lacking functional translation release factor activity, resulting in pausing of the translation reaction mixture ribosomes at the stop codon of the mRNA molecule; (b) adding, to the translation reaction mixture of step (a), reverse transcriptase and an oligonucleotide primer which is complementary to the 3′-untranslated region of the mRNA molecule at a site proximal to the stop codon, under conditions which allow the synthesis of a strand of DNA that is complementary to the 3′-untranslated region and terminates at a site proximal to the stop codon; and (c) removing the RNA portion of the RNA-DNA duplex formed in step (b), thereby removing the 3′-untranslated region of the mRNA molecule.
- In preferred embodiments, the oligonucleotide primer comprises a poly T sequence; step (c) is carried out by treatment of the product of step (b) with RNaseH; the method is carried out on a population of mRNA molecules; and the method further involves the steps of: (d) ligating to the 3′ end of the product of step (c) a linker including a Type IIS restriction site; (e) extending the product of step (d) to produce a double-stranded DNA molecule; and (f) treating the double-stranded DNA molecule with the Type IIS restriction enzyme to cleave the DNA molecule and remove the stop codon.
- In another related aspect, the invention features a method for removing the 3′-untranslated regions and stop codons of a population of mRNA molecules, the method involving: (a) providing a population of mRNA molecules; (b) synthesizing strands of DNA, each of which is complementary to one of said mRNA molecules, using a random primer mixture, the random primer mixture including primers, each having (i) a 3′ region including a stop codon flanked by a random oligonucleotide located 3′, 5′, or both to the stop codon; and (ii) a 5′ region including a Type IIS restriction site; (c) ligating to the 3′ ends of the DNA products of step (b) an oligonucleotide tail; (d) amplifying the products of step (c) using (i) a first primer which is complementary to the Type IIS restriction site-containing sequence; and (ii) a second primer which is complementary to the oligonucleotide tail; and (e) treating the products of step (d) with the Type IIS restriction enzyme to cleave the products, thereby removing the 3′-untranslated regions and stop codons.
- In preferred embodiments, the second primer of step (d) further includes a 5′ region including an RNA polymerase recognition site; and the method further comprises: (f) ligating a sequence which encodes an affinity tag to the cleaved ends of the products of step (e); (g) transcribing the products of step (f); (h) ligating peptidyl acceptors to the 3′ ends of the RNA products of step (g); (i) translating the products of step (h) to produce a population of RNA-protein fusions; and (j) substantially isolating RNA-protein fusions which comprise the affinity tag, thereby obtaining a population of mRNA molecules lacking 3′-untranslated regions and stop codons.
- In yet another related aspect, the invention features a method for removing the 3′-untranslated regions and stop codons of a population of mRNA molecules, involving: (a) providing a population of mRNA molecules; (b) synthesizing strands of DNA, each of which is complementary to one of the mRNA molecules, using a random primer mixture, the random primer mixture including primers, each having (i) a 5′ region which lacks a stop codon in at least one reading frame and (ii) a random 3′ region; and (c) synthesizing strands of DNA complementary to the DNA strands of step (b), using a second random primer mixture.
- In preferred embodiments, the second random primer mixture includes primers, each having (i) a 5′ region which includes a translation start site and (ii) a random 3′ region; and wherein said method further involves (d) amplifying the product of step (c) using a first amplification primer having (i) a 5′ sequence which includes an RNA polymerase recognition site and (ii) a 3′ region which is complementary to the translation start site.
- In other preferred embodiments of each of the above two aspects, the RNA polymerase recognition site is a T7 or SP6 RNA polymerase recognition site; the affinity tag is a hexahistidine peptide, a streptavidin-binding peptide, or an epitope; the peptidyl acceptor is puromycin; and the method is carried out on a population of mRNA molecules.
- In a second aspect, the invention features a method for producing an RNA-protein fusion from an mRNA having a 3′-untranslated region, the method involving: (a) covalently bonding the mRNA to a peptidyl acceptor, the peptidyl acceptor being positioned 3′ of the protein coding sequence of the mRNA; and (b) translating the mRNA molecule in vitro in a translation reaction mixture lacking functional translation release factor activity.
- In a related aspect, the invention features a method for producing an RNA-protein fusion from a nucleic acid having a 3′-untranslated region, the method involving: (a) providing the DNA product obtained above lacking a 3′-untranslated region; (b) transcribing the DNA to produce RNA lacking a 3′-untranslated region; (c) covalently bonding to the RNA a peptidyl acceptor, the peptidyl acceptor being positioned 3′ of the protein coding sequence of the RNA; and (d) translating the product of step (c) to produce an RNA-protein fusion.
- In preferred embodiments, the DNA product lacks a stop codon; and the translating step is carried out in vitro in a translation reaction mixture lacking functional translation release factor activity.
- In another related aspect, the invention features a method for producing an RNA-protein fusion from a nucleic acid having a 3′-untranslated region, the method involving: (a) providing the RNA product obtained above lacking a 3′-untranslated region; (b) covalently bonding to the RNA a peptidyl acceptor, the peptidyl acceptor being positioned 3′ of the protein coding sequence of the RNA; and (c) translating the product of step (b) to produce an RNA-protein fusion.
- In a third aspect, the invention features a library of nucleic acid molecules, each molecule including an open reading frame and lacking the 3′-untranslated region normally associated with the open reading frame.
- In preferred embodiments, the nucleic acid is DNA or RNA (for example, messenger RNA or cellular RNA derived, for example, from a eukaryotic organism, such as a mammal, and, for example, a human); the library includes at least 105 members; and the nucleic acid molecules of the library also lack stop codons.
- In final related aspects, the invention features libraries of nucleic acid molecules and RNA-protein fusions produced by the methods of the invention.
- As used herein, by a “population” is meant more than one molecule. Preferably, a population includes at least 10 molecules, more preferably, at least 102 or 103 molecules, and, most preferably, at least 104, 105, or 106 molecules.
- Similarly, a “library” is also any group of molecules. A library includes at least 10, preferably, at least 102 or 103, and, most preferably, at least 104, 105, or 106 molecules.
- By a “protein” is meant any two or more naturally occurring or modified amino acids joined by one or more peptide bonds. “Protein” and “peptide” are used interchangeably herein.
- By “RNA” is meant a sequence of two or more covalently bonded, naturally occurring or modified ribonucleotides. One example of a modified RNA included within this term is phosphorothioate RNA.
- By “DNA” is meant a sequence of two or more covalently bonded, naturally occurring or modified deoxyribonucleotides.
- By “covalently bonded” to a peptidyl acceptor is meant that the peptidyl acceptor is joined either directly through a covalent bond or indirectly through another covalently bonded sequence (for example, DNA corresponding to a pause site).
- By a “peptidyl acceptor” is meant any molecule capable of being added to the C-terminus of a growing protein chain by the catalytic activity of the ribosomal peptidyl transferase function. Typically, such molecules contain (i) a nucleotide or nucleotide-like moiety (for example, adenosine or an adenosine analog (di-methylation at the N-6 amino position is acceptable)), (ii) an amino acid or amino acid-like moiety (for example, any of the 20 D- or L-amino acids or any amino acid analog thereof (for example, O-methyl tyrosine or any of the analogs described by Ellman et al., Meth. Enzymol. 202:301, 1991), and (iii) a linkage between the two (for example, an ester, amide, or ketone linkage at the 3′ position or, less preferably, the 2′ position); preferably, this linkage does not significantly perturb the pucker of the ring from the natural ribonucleotide conformation. Peptide acceptors may also possess a nucleophile, which may be, without limitation, an amino group, a hydroxyl group, or a sulfhydryl group. In addition, peptidyl acceptors may be composed of nucleotide mimetics, amino acid mimetics, or mimetics of the combined nucleotide-amino acid structure.
- Other embodiments of the invention will be apparent from the detailed description thereof, and from the claims.
- FIG. 1 is a schematic illustration of one exemplary approach for removing the 3′-untranslated region and poly A tail from a nucleic acid molecule.
- FIG. 2 is a schematic illustration of a second exemplary approach for removing the 3′-untranslated region and poly A tail from a nucleic acid molecule.
- FIG. 3 is a schematic illustration of a third exemplary approach for removing the 3′-untranslated region and poly A tail from a nucleic acid molecule.
- FIG. 4 is a diagram illustrating a map of the human cytochrome oxidase IV subunit A mRNA. This mRNA contains a total of 19 stop codons: one authentic codon, one in the 5′ UTR, 14 in the open reading frame, and three in the 3′ UTR.
- FIG. 5 is a photograph illustrating the products of first strand cDNA synthesis of the mRNA of FIG. 4, run on a denaturing polyacrylamide gel. As expected, a series of bands were observed, likely due to priming at stop codons within the RNA.
- FIG. 6 is a photograph illustrating the products of second strand cDNA synthesis of the mRNA of FIG. 4. PCR amplification following second strand synthesis revealed a banding pattern similar to that observed after first strand synthesis.
- FIG. 7 is a photograph illustrating the products of an in vitro transcription reaction using the cDNA of FIG. 6 and “pull through” PCR following ligation of the
affinity tag 3′ terminus. The image shown is color reversed from an ethidium stained agarose gel to enhance resolution. - FIG. 8 is a photograph illustrating RNA-protein fusions produced from cellular mRNA using biased random priming to remove stop codons.
- FIG. 9 is a photograph showing the products of random primed cDNA synthesis from polyA+ mRNA from HL60 cells and normal human bone marrow (NBM) run on a denaturing acrylamide gel.
- FIG. 10 is a photograph illustrating PCR-amplified second strand cDNA generated from the product of FIG. 9. An aliquot of the second strand synthesis reaction was PCR amplified under standard conditions. Aliquots were removed after the specified number of cycles and run on a 2% agarose gel. The image shown is a negative of the ethidium stained gel to enhance resolution.
- FIG. 11 is a photograph illustrating radiolabeled RNA transcripts produced from the dsDNA template library of FIG. 10. These transcripts were produced using T7 RNA polymerase and run on a denaturing polyacrylamide gel.
- FIG. 12 is a photograph illustrating that ligation of a32P-labeled linker to the RNA library of FIG. 11 results in a shift in mobility of the linker.
- FIG. 13 is a photograph illustrating fusions formed between the RNA library of FIG. 11 and translated peptides. These fusions were purified by oligo-dT cellulose and analyzed by SDS-PAGE. Such fusions could only be formed in the absence of a stop codon.
- FIG. 14 is a diagram illustrating the sequence of clones selected from an RNA-protein fusion library derived from cellular RNA and which lack both stop codons and 3′ untranslated regions. In each pair of sequences, the first line is the clone sequence from the fusion library, and the second line is the parent RNA sequence. The shaded regions correspond to the N9 portion of the primers.
- As discussed above, the present invention provides two general approaches for the modification or use of nucleic acids having 3′-untranslated regions for the production of RNA-protein fusions, or any other technique where stop codons or untranslated regions are undesirable.
- In the first approach, mRNA or cDNA libraries are created that
lack 3′ untranslated regions and poly A tails, and, if desired, also lack 3′-terminal stop codons. Such cDNAs are greatly improved compared to traditional cDNA libraries since they are enriched for coding sequence information. In addition, creation of these cDNA libraries enables the creation of libraries of cellular mRNA molecules covalently linked to the protein molecules the mRNAs encode. Such “fusion libraries” can be used for a variety of applications, including the identification of protein-protein interactions, identification of drug targets, and hybridization to solid supports to create, for example, protein chips (or beads); if desired, the RNA-protein molecules may be arranged in spatially defined arrays on such chips to carry out large scale screening, for example, for protein or compound identification. Exemplary uses for RNA-protein fusions are described, for example, in Roberts & Szostak (1997) Proc. Natl. Acad. Sci. USA 94, 12297-12302; Szostak et al., Selection of Proteins Using RNA-Protein Fusions, U.S. Ser. No. 09/007,005, Jan. 14, 1998 and U.S. Ser. No. 09/247,190, Feb. 9, 1999; and Kuimelis et al., Addressable Protein Arrays, U.S. Ser. No. 60/080,686, Apr. 3, 1998, and U.S. Ser. No. 09/282,734, Mar. 31, 1999. - The second approach of the invention focuses on overcoming the natural translational termination which is brought about by the interaction between the stop codon at the 3′ end of an mRNA coding sequence and the release factors present in a translation lysate. To circumvent this obstacle, stop codons are removed from the mRNA molecule (as described above) or the release factor activity is removed from the in vitro translation system. By either of these strategies, translation results in mRNA-polypeptide-ribosome complexes which are suitable substrates for the formation of mRNA-protein fusions. Again, this approach simplifies fusion formation beginning with natural mRNA messages which contain stop codons and also simplifies the use of such fusion technology for such applications as functional genomics.
- Exemplary methods for carrying out the general approaches of the invention are now described below. These examples are provided for the purpose of illustrating, and not limiting, the invention.
- In a first approach, the termination of translation is avoided by removing the region of an mRNA which contains a stop codon, while preserving as much of the mRNA coding sequence as possible. Four alternative ways of modifying the mRNA coding sequence are presented below.
- FIG. 1 shows a first mRNA modification technique in which the coding sequence is modified at the DNA level. The coding regions of a cDNA library are excised from host vectors in such a way that the sequence upstream of the coding sequence terminates in a single 3′ DNA chain overhang of at least four bases, whereas the sequence downstream of the coding sequence terminates in a blunt cut. This may be accomplished by the use of appropriate restriction enzymes (in combination, for example, with vectors containing useful restriction sites) and standard molecular biology techniques. Exonuclease III and Mung bean nuclease are then used sequentially (with exonuclease III being used first and Mung bean nuclease being used second) to remove nucleotides from the unprotected, downstream end of the cDNA clone. The length of incubation with exonuclease III is adjusted by standard techniques such that the cDNA polyadenosine tail, 3′ untranslated region, and (if desired) stop codon, but little of the coding sequence, are removed. In an alternative technique, S1 nuclease may be used in place of Mung bean nuclease, again adjusting the incubation time to allow removal of the 3′-untranslated region but little or none of the coding sequence.
- For use in RNA-protein fusion formation, a defined DNA sequence may then be ligated to the newly created downstream end, creating the ideal substrate for in vitro transcription and translation. This DNA sequence is complementary to a splint sequence that is used to facilitate the ligation of a peptidyl acceptor to the mRNA product of the modified DNA upon transcription. Exemplary sequences and methods for in vitro transcription, in vitro translation, and fusion formation are described, for example, in Roberts & Szostak (1997) Proc. Natl. Acad. Sci. USA 94, 12297-12302; and Szostak et al., U.S. Ser. No. 09/007,005 and U.S. Ser. No. 09/247,190. These sequences may be joined to the RNA molecule using, for example, T4 DNA ligase. The resulting RNA substrate may be used directly in in vitro transcription and in vitro translation steps or, as shown in FIG. 1, may be amplified (for example, by standard PCR amplification) to generate a library of cDNA molecules lacking 3′-untranslated regions.
- In a second approach (shown in FIG. 2), cDNA clones are transcribed in vitro into mRNA molecules which contain stop codons, untranslated 3′ regions, and polyadenosine tails. Alternatively, mRNA may be isolated from cells and used directly. The mRNA is then subjected to in vitro translation by any standard technique in the presence of inhibitors of translation release factors (see below). Under such reaction conditions, ribosomes do not release the polypeptide chain upon reaching the stop codon, but instead pause. A DNA oligonucleotide primer complementary to the polyA tail (that is, a poly T sequence preferably of a length of between 10-30 nucleotides) and reverse transcriptase are then added to the mix, resulting in the synthesis of a strand of DNA complementary to the downstream region of the mRNA which terminates in the region proximal to the stop codon. RNaseH is then used to remove the RNA portion of the RNA-DNA region.
- The RNA product may then be used to generate cDNA libraries or for RNA-protein fusion formation. To create cDNA libraries (lacking 3′ untranslated regions), an adaptor molecule is preferably ligated to the RNA to create a defined sequence on the 3′ end using T4 RNA ligase. This adaptor is a short, double-stranded piece of DNA (preferably, between 10-50 base pairs in length) with a sequence designed to facilitate further processing of the cDNA library. The adaptor is used as the basis for complementary PCR primers for cDNA library construction, or as “splint” oligonucleotides to facilitate the ligation of RNA products to peptidyl acceptor-containing linkers, as described below.
- Primers are then used in combination with standard cDNA construction methodologies to create cDNA libraries. Alternatively, to generate RNA-protein fusions, a linker sequence may be ligated onto the 3′ end of the RNA with either T4 RNA or T4 DNA ligase, where the 3′ end of the linker contains a peptidyl acceptor, such as puromycin (see, for example, Roberts & Szostak (1997) Proc. Natl. Acad. Sci. USA 94, 12297-12302; and Szostak et al., Selection of Proteins Using RNA-Protein Fusions, U.S. Ser. No. 09/007,005, Jan. 14, 1998, and U.S. Ser. No. 09/247,190, Feb. 9, 1999). This RNA-linker-puromycin construct may then be used directly for in vitro translation in a lysate depleted of release factors to generate RNA-protein fusion molecules.
- Alternatively, to remove the stop codon from the mRNA, a linker with a defined sequence containing an offset cutting restriction enzyme site, such as a Type IIS restriction site (for example, a BsgI, HphI, or AsuHPI restriction site), is ligated, as described above, to the region downstream of the stop codon. The RNA is then amplified, for example, by standard methods of RT-PCR, and treated with the restriction enzyme. This type of restriction enzyme cuts upstream from its recognition site, thus removing the stop codon. The DNA, which contains the coding sequence but not the stop codon, may then be used in standard protocols for transcription and formation of RNA-protein fusions (see, for example, Roberts & Szostak (1997) Proc. Natl. Acad. Sci. USA 94, 12297-12302; and Szostak et al., Selection of Proteins Using RNA-Protein Fusions, U.S. Ser. No. 09/007,005, Jan. 14, 1998, and U.S. Ser. No. 09/247,190, Feb. 9, 1999).
- In a third general approach, biased random priming is used to remove both 3′ untranslated regions and the stop codons from the members of a cDNA library. This general approach is shown in FIG. 3. In the first step of this method, a cDNA library is made, by standard techniques, from purified cellular mRNA using a biased random primer mix. This mix includes primers with sequences complementary to each of the three stop codons (TGA, TAA, or TAG) (one stop codon per primer) in the 3′ region flanked on the 3′ side, 5′ side, or both by an additional 1-8 nucleotide long, completely random sequence. In addition, the 5′ region of the primer contains a fixed sequence corresponding to the recognition site for an offset cutting (Type IIS) restriction enzyme. Examples of Type IIS restriction enzymes include BsgI, HphI, and AsuHPI. By optimizing the stringency of annealing during cDNA synthesis, such primers will only significantly anneal to and be extended from sites corresponding to stop codons within the mRNA. These stop codon sequences are found in all three cDNA reading frames as well as in both the 3′ and 5′ untranslated regions.
- Following cDNA synthesis, the RNA template is removed. This can be accomplished either enzymatically, for example, through the action of an RNase, or chemically, for example, by treatment at high pH (for example, a pH of at least 13). The cDNA strands are then tailed with a homopolymeric sequence using an enzyme such as terminal deoxynucleotidyl transferase (TdT). A particularly suitable tail is poly-deoxycytidine. The resulting tailed cDNA is then amplified, for example, using PCR and appropriate primer sequences. One of these primers is complementary to the conserved region of the initial primer which contained the restriction site, and the second primer contains a 5′ region that includes an RNA polymerase recognition sequence (for example, a T7 or SP6 RNA polymerase recognition site) and a 3′ region that is complementary to the homopolymer tail plus 1-3 terminal nucleotides containing a mix of all nucleotides. In addition, the closest of these mixed nucleotides to the homopolymer region may contain any nucleotide except G. Such a tail ensures that the primer preferentially aligns with the first few nucleotides of the poly-deoxycytidine tail.
- The double-stranded PCR product is then digested with the off-set cutting Type IIS restriction enzyme. Because of the primer used in the random priming step, this restriction cut occurs upstream of the stop codon at which the initial priming event occurred. In certain situations, it may be desirable to only partially cut the PCR products, for example, if those products are known or suspected to contain one or more native internal restriction sites for the chosen enzyme. In these circumstances, the restriction conditions are adjusted such that the enzyme cuts each product, on average, only once.
- After removal of the short fragments cleaved from the ends of the DNAs, new ends are ligated on. These new ends encode an affinity purification tag, for example, a hexahistidine peptide, streptavidin-binding protein, or any suitable epitope, in-frame with the initial stop codon at which cDNA synthesis was primed. This double-stranded DNA with the newly ligated 3′ terminus may then be purified, if desired.
- Next, using a suitable RNA polymerase (that is, one which corresponds to the RNA polymerase recognition site chosen above), the double-stranded DNA is transcribed to produce single-stranded RNA. Each of these RNA molecules has the same 3′ terminus, corresponding to the ligated affinity purification tag. Additional sequence is then ligated onto the 3′ ends of these RNA strands in a template-directed manner, using an enzyme such as T4 DNA ligase. This new 3′ sequence is preferably poly-deoxyadenosine with a 3′ terminal moiety suitable for producing nucleic acid/protein fusions, for example, a dCC-puromycin group. The ligated product is then purified and translated using any suitable in vitro translation system, for example, a rabbit reticulocyte lysate. In such a system, the ribosome pauses upon reaching the poly-deoxyadenosine region, and the dCC-puromycin group is fused to the nascent polypeptide strand. If a stop codon is encountered prior to the poly-deoxyadenosine, the ribosome is released, and no fusion occurs. This will be the case if the initial priming site occurred in the 3′ untranslated region.
- Nucleic acid/protein fusions are then purified using the translated affinity purification tag. If the initial site of priming was an out-of-frame stop codon, the affinity tag will be mis-translated. Therefore, by this selection, only fusions from in-frame stop codons will be present after purification.
- RNA from the purified fusions is then recovered and amplified using, for example, RT-PCR. The resulting cDNA library should have only full length, in-frame mRNAs with no in-frame stop codons and no 3′ untranslated regions. The RNA population may be used as described above to generate a cDNA library or directly for RNA-protein fusion formation.
- To demonstrate the utility of this approach, an exemplary RNA was chosen as a model system. This mRNA encoded the human cytochrome oxidase IV subunit A. The particular RNA that was used (FIG. 4) was generated by transcription from a PCR fragment and contained a 42
nucleotide 5′ UTR, a 501 nucleotide open reading frame (ORF), and a 124nucleotide 3′ UTR. There were a total of 19 stop codons contained within the RNA: one authentic, one in the 5′ UTR, 14 out of frame in the open reading frame, and three in the 3′ UTR. This RNA also contained an internal restriction site for the Type IIS restriction enzyme used in the method, thereby representing a realistic model for cellular mRNA populations. - To carry out this technique, first strand cDNA synthesis was performed using a mix of primers that contained (5′ to 3′) the recognition sequence for the Type IIS restriction endonuclease, Bpm I, followed by six random nucleotides and, at the 3′ terminus, three nucleotides complementary to the human stop codons. These primers are shown below (SEQ ID NOS: 1-3; N denotes a mix of all four nucleotides dG/dA/dC/dT):
5′-GCT TGC TGG AGT GCG AGT NNN NNN CTA 5′-GCT TGC TGG AGT GCG AGT NNN NNN TTA 5′-GCT TGC TGG AGT GCG AGT NNN NNN TCA. - For the cDNA synthesis reaction, 100 ng of RNA was annealed to between 25-125 pmoles of primer mix, then extended with reverse transcriptase by standard techniques. α-32P-dATP was included as a trace label in the reaction. Subsequently, E. coli RNase H was added to remove the RNA strand, and an aliquot of the reaction was run on a denaturing polyacrylamide gel (FIG. 5).
- A homopolymer tail of dC was added to the first strand cDNA using the enzyme terminal deoxynucleotidyl transferase. The length of the tail was controlled by including ddCTP in the extension reaction at a ratio of 1:9 with dCTP. The tailed cDNA was then copied in a second strand synthesis reaction using a primer that contained a T7 promoter followed by a 9 nucleotide dG tail, a penultimate nucleotide mix of dC/dA/dT, and a terminal random nucleotide. This primer had the following sequence (SEQ ID NO: 4; H denotes a mix of the nucleotides dA/dC/dT and N denotes a mix of all four nucleotides dG/dA/dC/dT):
- 5′-TAA TAC GAC TCA CTA TAG GGG GGG GGH N.
- The final two nucleotides conferred priming specificity by preferentially being extended from the extreme internal portion of the homopolymer tail.
- After second strand synthesis, PCR (using primers complementary to the fixed regions of the primers from FIGS. 4) was used to generate a double-stranded template (FIG. 6). This template was then partially digested with Bpm I endonuclease. Cleavage from the Bpm I site in the second strand primer resulted in the removal of the third position nucleotide from all stop codons. A new double-stranded 3′ terminus encoding the affinity sequence Strep-Tag II (available from Genosys Biotechnologies, Inc., The Woodlands, Tex.) was then ligated onto the cleaved fragments. This new terminus was designed to be ligated in frame with the authentic stop codon, converting it to a tyrosine and thus eliminating the stop.
- After ligation, a PCR reaction was performed using a primer that annealed to the new 3′ terminus. Thus, only successfully ligated templates were amplified. As shown in FIG. 7, a number of products were amplified, resulting in a pattern similar to that observed in FIG. 6. One additional major product was observed at ˜250 nucleotides as was expected from partial cleavage at the internal BpmI site.
- The double-stranded template from FIG. 7 was used in a transcription reaction to produce RNA (as described in Roberts & Szostak (1997) Proc. Natl. Acad. Sci. USA 94, 12297-12302; and Szostak et al., Selection of Proteins Using RNA-Protein Fusions, U.S. Ser. No. 09/007,005, Jan. 14, 1998, and U.S. Ser. No. 09/247,190, Feb. 9, 1999). The RNA was then enzymatically ligated to a puromycin-containing DNA linker (by the method of Roberts & Szostak (1997) Proc. Natl. Acad. Sci. USA 94, 12297-12302; and Szostak et al., Selection of Proteins Using RNA-Protein Fusions, U.S. Ser. No. 09/007,005, Jan. 14, 1998, and U.S. Ser. No. 09/247,190, Feb. 9, 1999) and placed in a translation reaction containing35S-methionine. After translation and a subsequent high-salt fusion formation step (as described in Szostak et al., Selection of Proteins Using RNA-Protein Fusions, U.S. Ser. No. 09/247,190, Feb. 9, 1999), the RNA and fused protein were purified using oligo-dT cellulose (FIG. 8). The resulting library of RNA-protein fusion molecules indicated that the present method very efficiently generated such fusions beginning with an mRNA starting material.
- Finally, in a fourth general approach, random priming is used to remove both 3′ untranslated regions and stop codons from cDNA molecules. The methods described above for producing fusions from cellular RNA are generally designed to produce protein moieties with essentially wild-type N-termini. However, it is sometimes advantageous to create libraries of fusions from cellular RNA that consist of various N- and C-terminal truncated species as well. For example, such a domain library may contain functional units that are easier to produce and select than full-length proteins. To generate such a library, random priming was utilized to generate cDNA molecules as follows.
- Poly A+ mRNA was obtained by standard methods from two sources, human bone marrow and HL60 cells. A cDNA copy of this mRNA was then produced using the following primer (SEQ ID NO: 5):
- 5′ GC CTT ATC GTC ATC GTC CTT GTA GTC GAA ACT AGA NNN NNN NNN.
- This first strand primer was in the minus sense relative to the RNA strand and in one reading frame encoded the FLAG epitope. Because this fixed sequence contained no stop codons in two of the three potential reading frames, RNA produced from this template would contain no stop codons in two reading frames. This primer contained a 5′ fixed sequence and nine random nucleotides at the 3′ terminus. 125 pmoles of the primer was annealed to 5 μg of mRNA and then extended using reverse transcriptase and standard techniques. A portion of the reaction was performed in the presence of α-32P-dATP as a tracer and assayed by denaturing gel electrophoresis (FIG. 9). After first strand synthesis, the RNA strand was removed by digestion with RNase H. Unextended primers were removed by size exclusion chromatography.
- Second strand cDNA synthesis was performed using the Klenow fragment of DNA polymerase and the following primer (SEQ ID NO: 6):
- 5′ GGA CAA TTA CTA TTT ACA ATT ACA ATG NNN NNN NNN
- This second strand primer was in the plus sense relative to the RNA strand, contained nine random nucleotides at the 3′ end, and included a 5′ fixed region having an ATG start codon and the 5′ UTR from tobacco mosaic virus as a ribosome binding site. Again, a portion of the reaction was performed in the presence of α-32P-dATP as a tracer (FIG. 9). The unextended primers were removed by size exclusion chromatography.
- The second strand cDNA containing both fixed regions was then amplified by PCR to create a double stranded template (FIG. 10). The forward PCR primer was complementary to the 5′ UTR region of the second strand primer and also encoded the promoter sequence for T7 RNA polymerase. The reverse PCR primer was complementary to the fixed region of the first strand primer and also encoded sequences required for subsequent ligation of RNA produced from the template. These primer sequences are shown below (SEQ ID NOS: 7, 8):
5′ TAA TAC GAC TCA CTA TAG GGA CAA TTA CTA TTT ACA ATT (forward) 5′ AGA AGA TGC GCG ATC GTC ATC GTC CTT GTA GTC (reverse). - The results of this amplification step are shown in FIG. 10. The intense PCR product of approximately 75 nucleotides (FIG. 10) was apparently due to primer-dimer formation and could be reduced with an additional size exclusion chromatography step. The double-stranded template from PCR was transcribed using T7 RNA polymerase (as described in Roberts & Szostak (1997) Proc. Natl. Acad. Sci. USA 94, 12297-12302; and Szostak et al., Selection of Proteins Using RNA-Protein Fusions, U.S. Ser. No. 09/007,005, Jan. 14, 1998, and U.S. Ser. No. 09/247,190, Feb. 9, 1999). When α-32P-dATP was included in the transcription reaction a range of RNA transcripts was produced that reflected the variable size of the template library (FIG. 11). Because the specific activity of a given transcript was proportional to the length, longer RNA products appeared darker.
- A parallel transcription reaction was performed without a radioactive tracer and the resulting RNA was purified by phenol/chloroform extraction and size exclusion chromatography. A DNA linker with a 5′ puromycin moiety was then ligated to the end of the RNA in a template directed reaction using T4 DNA ligase (as described in Roberts & Szostak (1997) Proc. Natl. Acad. Sci. USA 94, 12297-12302; and Szostak et al., Selection of Proteins Using RNA-Protein Fusions, U.S. Ser. No. 09/007,005, Jan. 14, 1998, and U.S. Ser. No. 09/247,190, Feb. 9, 1999). The DNA linker was 5′ radiolabeled with32P to allow the reaction to be followed on a denaturing polyacrylamide gel (FIG. 12). The shift in mobility of the linker was the result of ligation to the RNA library.
- The ligated RNA was then purified from unligated RNA and linker, and incubated in an in vitro translation system to generate protein-RNA fusions (by the methods of Roberts & Szostak (1997) Proc. Natl. Acad. Sci. USA 94, 12297-12302; and Szostak et al., Selection of Proteins Using RNA-Protein Fusions, U.S. Ser. No. 09/007,005, Jan. 14, 1998, and U.S. Ser. No. 09/247,190, Feb. 9, 1999). The translation reaction contained35S-met so that the newly translated proteins were radiolabeled. After fusion formation, the resultant complexes were purified using oligo-dT cellulose, and an aliquot was analyzed by SDS-PAGE (FIG. 13). If an RNA being translated contained a stop codon, the ribosome complex would dissociate from the template, and no fusion would be formed. Accordingly, the formation of fusions correlated with the lack of stop codons.
- A fusion library constructed essentially as above was subsequently selected for a particular aspect of the protein portion of the protein-RNA fusion. A number of individual members of the resulting selected pool were isolated and sequenced (FIG. 14). Alignment with the parental RNA sequences obtained from a sequence database allowed the selected region to be identified. Comparison of the recovered clones with the parent RNA showed that, in general, each of these clones represented an in-frame region of a cellular RNA message devoid of both stop codons and a 3′ UTR.
- In a second general approach of the invention, stop codons present in an RNA sequence are overcome by neutralization or removal of translation release factors from in vitro translation mixes. To inhibit polypeptide chain release in a eukaryotic translation system, either or both of the two eukaryotic release factors, eRF1 and eRF3, must be neutralized. In prokaryotic translation systems, both RF1 and RF2 or, alternatively, RF3 alone must be neutralized to inhibit polypeptide chain release. In either case, a release factor is neutralized by the use of antibodies or by exploiting genetically engineered variants of the natural release factor binding partners. Alternatively, the release factor may be removed from the translation mix by using its affinity to specific components of the translation complex, such as stop codons.
- Neutralizing antibodies, which can be either polyclonal or monoclonal, are raised against the entire release factor or to one of its constituent domains or peptides. One such antibody and an exemplary method of preparation is described in Zhouravleva et al. (EMBO J. 14:4065-72 (1995)). Such antibodies may be produced by any standard technique. Preferably, the antigen is first expressed in a heterologous expression system or synthesized chemically and then purified to homogeneity. The antigenic peptide may be coupled to a carrier protein, such as KLH as described in Ausubel et al, Current Protocols in Molecular Biology, Wiley Interscience, New York, N.Y. The peptide may then be mixed with Freund's adjuvant and injected into guinea pigs, rats, or preferably rabbits to produce polyclonal antibodies. The antibodies may be purified by peptide antigen affinity chromatography. Monoclonal antibodies may be prepared using these same antigenic peptides and standard hybridoma technology (see, e.g., Kohler et al., Nature 256:495, 1975; Kohler et al., Eur. J. Immunol. 6:511, 1976; Kohler et al., Eur. J. Immunol. 6:292, 1976; Hammerling et al., In Monoclonal Antibodies and T Cell Hybridomas, Elsevier, N.Y., 1981; Ausubel et al., supra).
- Alternatively, natural release factor-binding partners may be exploited as inhibitors. Exemplary binding partners include other release factors and components of the translation termination complex. For example, eRF1 may be neutralized by an excess of an inactive mutant of eRF3. Conversely, eRF3 may be neutralized by an inactive mutant of eRF1. Similarly, RF1 and RF2 can both be inhibited by an excess of an inactive mutant of RF3, and RF3 can be inhibited by an excess of an inactive mutant of RF1 or RF2. Such mutants are created by standard techniques, for example, by random or site-directed mutagenesis, followed by an assay for loss of RF activity; in one particular example, residues in the GTP-binding motif of RF3 necessary for activity may be mutated. Alternatively, analogues of stop codons may be used as inhibitors to bind, for example, to RF1. Exemplary stop codon analogues are short oligonucleotides (composed of RNA, DNA, or chemically modified RNA) which contain the sequence of all possible stop codons.
- Any of the above described release factor inhibitors may be used in at least three different ways. First, as described above, a soluble inhibitor may be added to an in vitro translation mixture. Upon addition, the inhibitor binds tightly to its target and prevents the release factor from interacting with the mRNA-protein-ribosome-GTP complex. Alternatively, the inhibitor (including a stop codon sequence) may be immobilized on a solid bead. Following the addition of immobilized inhibitor to the translation mixture, the inhibitor binds to the release factor, and the complex of release factor and immobilized inhibitor are removed from solution, for example, by centrifugation or microfiltration. In yet another alternative, the inhibitor may be immobilized on a column, and the translation mixture passed through the column. The translation mixture that flows through the column is cleared of release factor and, when used as an in vitro translation mix, fails to release a nascent polypeptide chain from an mRNA-ribosome-GTP complex.
- All patents and publications mentioned herein are hereby incorporated by reference.
- Other embodiments are within the claims.
Claims (16)
1. A library of nucleic acid molecules, each molecule comprising an open reading frame and lacking the 3′-untranslated region normally associated with said open reading frame.
2. The library of claim 1 , wherein said nucleic acid is RNA.
3. The library of claim 2 , wherein said RNA is messenger RNA.
4. The library of claim 2 , wherein said RNA is cellular RNA.
5. The library of claim 4 , wherein said cellular RNA is derived from a eukaryotic organism.
6. The library of claim 5 , wherein said cellular RNA is derived from a mammal.
7. The library of claim 6 , wherein said mammal is a human.
8. The library of claim 1 , wherein said nucleic acid is DNA.
9. The library of claim 1 , wherein said library comprises at least 105 members.
10. The library of claim 1 , wherein said nucleic acid molecules of said library also lack stop codons.
11. A library of nucleic acid molecules produced by the steps of:
(a) providing a library of DNA molecules, each having an open reading frame and a 3′-untranslated region, each of said DNA molecules terminating at its 5′ end in an overhang and at its 3′ end in a blunt end; and
(b) treating said library of DNA molecules first with a 3′→5′ exonuclease and then with a single-stranded nuclease under conditions that allow removal of the 3′-untranslated regions of said DNA molecules.
12. A library of nucleic acid molecules produced by the steps of:
(a) translating a library of mRNA molecules in vitro in a translation reaction mixture lacking functional translation release factor activity, resulting in pausing of the translation reaction mixture ribosomes at the stop codons of said mRNA molecules;
(b) adding, to said translation reaction mixture of step (a), reverse transcriptase and oligonucleotide primers which are complementary to the 3′-untranslated regions of said mRNA molecules at a site proximal to said stop codons, under conditions which allow the synthesis of strands of DNA that are complementary to said 3′-untranslated regions and terminate at sites proximal to said stop codons; and
(c) removing the RNA portions of the RNA-DNA duplexes formed in step (b), thereby removing the 3′-untranslated regions of said mRNA molecules.
13. The library of claim 12 , produced by the further steps of:
(d) ligating to each of the 3′ ends of the products of step (c) a linker comprising a Type IIS restriction site;
(e) extending the products of step (d) to produce double-stranded DNA molecules; and
(f) treating said double-stranded DNA molecules with a Type IIS restriction enzyme that recognizes said Type II restriction site to cleave said DNA molecules and remove said stop codons.
14. A library of nucleic acid molecules produced by the steps of:
(a) providing a population of mRNA molecules;
(b) synthesizing strands of DNA, each of which is complementary to one of said mRNA molecules, using a random primer mixture, said random primer mixture comprising primers, each having
(i) a 3′ region comprising a stop codon flanked by a random oligonucleotide located 3′, 5′, or both to said stop codon; and
(ii) a 5′ region comprising a Type IIS restriction site;
(c) ligating to the 3′ ends of the DNA products of step (b) an oligonucleotide tail;
(d) amplifying the products of step (c) using
(i) a first primer which is complementary to said Type IIS restriction site-containing sequence; and
(ii) a second primer which is complementary to said oligonucleotide tail; and
(e) treating the products of step (d) with a Type IIS restriction enzyme that recognizes said Type IIS restriction site to cleave said products, thereby removing the 3′-untranslated regions and stop codons.
15. The library of nucleic acid molecules of claim 14 , produced by the further steps of:
(f) ligating a sequence which encodes an affinity tag to the cleaved ends of the products of step (e);
(g) transcribing the products of step (f);
(h) ligating peptidyl acceptors to the 3′ ends of the RNA products of step (g);
(i) translating said products of step (h) to produce a population of RNA-protein fusions; and
(j) substantially isolating RNA-protein fusions which comprise said affinity tag, thereby obtaining a population of mRNA molecules lacking 3′-untranslated regions and stop codons.
16. A library of nucleic acid molecules produced by the steps of:
(a) providing a population of mRNA molecules;
(b) synthesizing strands of DNA, each of which is complementary to one of said mRNA molecules, using a random primer mixture, said random primer mixture comprising primers, each having (i) a 5′ region which lacks a stop codon in at least one reading frame and (ii) a random 3′ region; and
(c) synthesizing strands of DNA complementary to said DNA strands of step (b), using a second random primer mixture.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/910,518 US20020160377A1 (en) | 1998-08-17 | 2001-07-20 | Methods for producing nucleic acids lacking 3'-untranslated regions and optimizing cellular RNA -protein fusion formation |
US10/646,985 US20040086980A1 (en) | 1998-08-17 | 2003-08-21 | Methods for producing nucleic acids lacking 3'-untranslated regions and optimizing cellular RNA-protein fusion formation |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US9681898P | 1998-08-17 | 1998-08-17 | |
US09/374,962 US6312927B1 (en) | 1998-08-17 | 1999-08-16 | Methods for producing nucleic acids lacking 3'-untranslated regions and optimizing cellular RNA-protein fusion formation |
US09/910,518 US20020160377A1 (en) | 1998-08-17 | 2001-07-20 | Methods for producing nucleic acids lacking 3'-untranslated regions and optimizing cellular RNA -protein fusion formation |
Related Parent Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/374,962 Division US6312927B1 (en) | 1998-08-17 | 1999-08-16 | Methods for producing nucleic acids lacking 3'-untranslated regions and optimizing cellular RNA-protein fusion formation |
US09/374,962 Continuation US6312927B1 (en) | 1998-08-17 | 1999-08-16 | Methods for producing nucleic acids lacking 3'-untranslated regions and optimizing cellular RNA-protein fusion formation |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/646,985 Continuation US20040086980A1 (en) | 1998-08-17 | 2003-08-21 | Methods for producing nucleic acids lacking 3'-untranslated regions and optimizing cellular RNA-protein fusion formation |
Publications (1)
Publication Number | Publication Date |
---|---|
US20020160377A1 true US20020160377A1 (en) | 2002-10-31 |
Family
ID=22259227
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/374,962 Expired - Lifetime US6312927B1 (en) | 1998-08-17 | 1999-08-16 | Methods for producing nucleic acids lacking 3'-untranslated regions and optimizing cellular RNA-protein fusion formation |
US09/910,518 Abandoned US20020160377A1 (en) | 1998-08-17 | 2001-07-20 | Methods for producing nucleic acids lacking 3'-untranslated regions and optimizing cellular RNA -protein fusion formation |
US10/646,985 Abandoned US20040086980A1 (en) | 1998-08-17 | 2003-08-21 | Methods for producing nucleic acids lacking 3'-untranslated regions and optimizing cellular RNA-protein fusion formation |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/374,962 Expired - Lifetime US6312927B1 (en) | 1998-08-17 | 1999-08-16 | Methods for producing nucleic acids lacking 3'-untranslated regions and optimizing cellular RNA-protein fusion formation |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/646,985 Abandoned US20040086980A1 (en) | 1998-08-17 | 2003-08-21 | Methods for producing nucleic acids lacking 3'-untranslated regions and optimizing cellular RNA-protein fusion formation |
Country Status (6)
Country | Link |
---|---|
US (3) | US6312927B1 (en) |
EP (1) | EP1105516A4 (en) |
JP (1) | JP2002522091A (en) |
AU (1) | AU5488399A (en) |
CA (1) | CA2334946A1 (en) |
WO (1) | WO2000009737A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2021072075A1 (en) * | 2019-10-09 | 2021-04-15 | Edward Fritsch | Multi-domain protein vaccine |
Families Citing this family (36)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030003489A1 (en) * | 1998-09-09 | 2003-01-02 | Hamilton Paul Theodore | Combinatorial peptide expression libraries using suppressor genes |
US6358713B1 (en) | 1999-04-12 | 2002-03-19 | Johns Hopkins University | In vitro ribosome evolution |
CA2372795A1 (en) | 1999-07-12 | 2001-01-18 | Robert G. Kuimelis | C-terminal protein tagging |
WO2002073180A1 (en) * | 2001-03-08 | 2002-09-19 | Ruo-Pan Huang | An antibody-based protein array system |
US20050003360A1 (en) * | 2001-11-13 | 2005-01-06 | Ruo-Pang Huang | Array systems and methods |
US20030166010A1 (en) * | 2002-02-25 | 2003-09-04 | Affholter Joseph A. | Custom ligand design for biomolecular filtration and purification for bioseperation |
US20030153013A1 (en) * | 2002-11-07 | 2003-08-14 | Ruo-Pan Huang | Antibody-based protein array system |
WO2005010159A2 (en) * | 2003-07-17 | 2005-02-03 | Children's Hospital Medical Center | Rolling circle amplification of micro-rna samples |
WO2005074417A2 (en) * | 2003-09-03 | 2005-08-18 | Salk Institute For Biological Studies | Multiple antigen detection assays and reagents |
US20060094025A1 (en) * | 2004-11-02 | 2006-05-04 | Getts Robert C | Methods for detection of microrna molecules |
WO2006069099A2 (en) * | 2004-12-21 | 2006-06-29 | Genecopoeia, Inc. | Method and compositions for rapidly modifying clones |
JP4913391B2 (en) * | 2004-12-24 | 2012-04-11 | 株式会社生体分子計測研究所 | Anti-denaturing protein antibody purification kit and protein detection method |
EP1848815B1 (en) * | 2005-02-02 | 2010-05-19 | Universität Bayreuth | Esterases for monitoring protein biosynthesis in vitro |
US7749957B2 (en) | 2006-04-06 | 2010-07-06 | E.I. Du Pont De Nemours And Company | Clay-binding peptides and methods of use |
US7951559B2 (en) * | 2007-07-25 | 2011-05-31 | E.I. Du Pont De Nemours And Company | Recombinant peptide production using a cross-linkable solubility tag |
US7829311B2 (en) | 2007-07-25 | 2010-11-09 | E.I. Du Pont De Nemours And Company | Ketosteroid isomerase inclusion body tag engineered to be acid-resistant by replacing aspartates with glutamate |
US7678883B2 (en) | 2007-07-25 | 2010-03-16 | E.I. Du Pont De Nemours And Company | Solubility tags for the expression and purification of bioactive peptides |
US7794963B2 (en) | 2007-11-02 | 2010-09-14 | E.I. Du Pont De Nemours And Company | Use of tetracysteine tags in fluorescence-activated cell sorting analysis of prokaryotic cells producing peptides or proteins |
US8883146B2 (en) | 2007-11-30 | 2014-11-11 | Abbvie Inc. | Protein formulations and methods of making same |
CA2707483A1 (en) | 2007-11-30 | 2009-06-11 | Wolfgang Fraunhofer | Protein formulations and methods of making same |
WO2010011944A2 (en) | 2008-07-25 | 2010-01-28 | Wagner Richard W | Protein screeing methods |
US20100158846A1 (en) * | 2008-12-18 | 2010-06-24 | E. I. Du Pont De Nemours And Company | Hair-binding peptides |
US20100158822A1 (en) | 2008-12-18 | 2010-06-24 | E .I. Du Pont De Nemours And Company | Peptides that bind to silica-coated particles |
US8287845B2 (en) * | 2008-12-18 | 2012-10-16 | E I Du Pont De Nemours And Company | Hair-binding peptides |
US20100158837A1 (en) | 2008-12-18 | 2010-06-24 | E. I. Du Pont De Nemours And Company | Iron oxide-binding peptides |
JP2012522057A (en) * | 2009-03-30 | 2012-09-20 | ジヨンソン・アンド・ジヨンソン・コンシユーマー・カンパニーズ・インコーポレーテツド | Peptide-based systems for the delivery of cosmetic agents |
WO2010114638A1 (en) | 2009-03-30 | 2010-10-07 | E. I. Du Pont De Nemours And Company | Peptide-based tooth whitening reagents |
CA2789125A1 (en) | 2010-02-10 | 2011-08-18 | Novartis Ag | Methods and compounds for muscle growth |
DE102010056289A1 (en) | 2010-12-24 | 2012-06-28 | Geneart Ag | Process for the preparation of reading frame correct fragment libraries |
EP2686349B1 (en) | 2011-03-15 | 2020-12-09 | X-Body, Inc. | Antibody screening methods |
CN103732738A (en) | 2011-04-28 | 2014-04-16 | 小利兰斯坦福大学托管委员会 | Identification of polynucleotides associated with a sample |
CN107353342B (en) | 2011-12-05 | 2021-08-10 | X 博迪生物科学公司 | PDGF receptor beta binding polypeptides |
EP3014271B1 (en) | 2013-06-28 | 2020-03-18 | X-Body, Inc. | Target antigen discovery, phenotypic screens and use thereof for identification of target cell specific target epitopes |
CA2924879C (en) | 2013-09-23 | 2023-01-10 | X-Body, Inc. | Methods and compositions for generation of binding agents against cell surface antigens |
WO2019163602A1 (en) * | 2018-02-21 | 2019-08-29 | 富士フイルム和光純薬株式会社 | Method for purifying target region, detection method, and method for determining cancer |
EP4384614A1 (en) * | 2021-08-12 | 2024-06-19 | The University of Hong Kong | Materials and methods to comprehensively define adaptive immune responses |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4843003A (en) * | 1984-02-17 | 1989-06-27 | Fred Hutchinson Cancer Research Center | Process for producing shortened target DNA fragments usable in sequencing large DNA segments |
DE69032483T2 (en) | 1989-10-05 | 1998-11-26 | Optein Inc | CELL-FREE SYNTHESIS AND ISOLATION OF GENES AND POLYPEPTIDES |
US6063630A (en) * | 1991-11-05 | 2000-05-16 | Transkaryotic Therapies, Inc. | Targeted introduction of DNA into primary or secondary cells and their use for gene therapy |
US5635602A (en) | 1993-08-13 | 1997-06-03 | The Regents Of The University Of California | Design and synthesis of bispecific DNA-antibody conjugates |
US5561043A (en) | 1994-01-31 | 1996-10-01 | Trustees Of Boston University | Self-assembling multimeric nucleic acid constructs |
DK0744958T3 (en) | 1994-01-31 | 2003-10-20 | Univ Boston | Polyclonal antibody libraries |
US5627024A (en) | 1994-08-05 | 1997-05-06 | The Scripps Research Institute | Lambdoid bacteriophage vectors for expression and display of foreign proteins |
PT971946E (en) * | 1997-01-21 | 2006-11-30 | Gen Hospital Corp | Selection of proteins using rna-protein fusions |
US6586180B1 (en) * | 1998-03-28 | 2003-07-01 | University Of Utah | Directed antisense libraries |
CA2323638A1 (en) | 1998-04-03 | 1999-10-14 | Phylos, Inc. | Addressable protein arrays |
US5985575A (en) | 1998-05-20 | 1999-11-16 | Wisconsin Alumni Research Foundation | Tethered function assay for protein function |
-
1999
- 1999-08-16 WO PCT/US1999/018603 patent/WO2000009737A1/en not_active Application Discontinuation
- 1999-08-16 AU AU54883/99A patent/AU5488399A/en not_active Abandoned
- 1999-08-16 EP EP99941179A patent/EP1105516A4/en not_active Withdrawn
- 1999-08-16 CA CA002334946A patent/CA2334946A1/en not_active Abandoned
- 1999-08-16 US US09/374,962 patent/US6312927B1/en not_active Expired - Lifetime
- 1999-08-16 JP JP2000565171A patent/JP2002522091A/en active Pending
-
2001
- 2001-07-20 US US09/910,518 patent/US20020160377A1/en not_active Abandoned
-
2003
- 2003-08-21 US US10/646,985 patent/US20040086980A1/en not_active Abandoned
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2021072075A1 (en) * | 2019-10-09 | 2021-04-15 | Edward Fritsch | Multi-domain protein vaccine |
Also Published As
Publication number | Publication date |
---|---|
EP1105516A4 (en) | 2002-01-09 |
US20040086980A1 (en) | 2004-05-06 |
WO2000009737A1 (en) | 2000-02-24 |
AU5488399A (en) | 2000-03-06 |
JP2002522091A (en) | 2002-07-23 |
EP1105516A1 (en) | 2001-06-13 |
CA2334946A1 (en) | 2000-02-24 |
US6312927B1 (en) | 2001-11-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6312927B1 (en) | Methods for producing nucleic acids lacking 3'-untranslated regions and optimizing cellular RNA-protein fusion formation | |
US5629179A (en) | Method and kit for making CDNA library | |
US6716973B2 (en) | Use of a ribozyme to join nucleic acids and peptides | |
DE60034566T2 (en) | PEPTIDE RECEPTOR LIGATION PROCEDURE | |
CA1198068A (en) | Bacterial polypeptide expression employing tryptophan promoter-operator | |
US6140086A (en) | Methods and compositions for cloning nucleic acid molecules | |
CN110637086B (en) | Method for producing complex of RNA molecule and peptide, and use thereof | |
US7846694B2 (en) | Process for producing template DNA and process for producing protein in cell-free protein synthesis system with the use of the same | |
EP1073730A1 (en) | Use of a ribozyme to join nucleic acids and peptides | |
KR920703839A (en) | Cell-free synthesis and isolation of novel genes and polypeptides | |
US20220243244A1 (en) | Compositions and methods for in vivo synthesis of unnatural polypeptides | |
US20030027194A1 (en) | Modular assembly of nucleic acid-protein fusion multimers | |
AU2004200998B2 (en) | Methods for producing nucleic acids lacking 3'-untranslated regions and optimizing cellular RNA-protein fusion formation | |
JP2612264B2 (en) | DNA sequence | |
WO2003062417A1 (en) | Rna-dna ligation product and utilization thereof | |
JPWO2005024018A1 (en) | Nucleic acid construct and method for producing the same | |
EP0090433A1 (en) | Creation of DNA sequences encoding modified proinsulin precursors | |
JP2002291491A (en) | Rna-dna conjugate | |
JP5858415B2 (en) | Linker for preparing mRNA / cDNA-protein conjugate and purification method of nucleotide-protein conjugate using the same | |
US9133452B2 (en) | High-speed maturation method for an oligonucleotide library for the purpose of preparing a protein library | |
JPWO2011125833A1 (en) | Structure of biomolecular interaction analysis tool and analysis method using it | |
KR0184771B1 (en) | Mass production of flat fish growth hormone | |
JP2003169672A (en) | Method for processing library by using ligation inhibition | |
JP2003070483A (en) | Method for connecting nucleic acid | |
JPS61149087A (en) | Plasmid |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |