US20210130914A1 - High Throughput Method for Identification and Sequencing of Unknown Microbial and Eukaryotic Genomes from Complex Mixtures - Google Patents
High Throughput Method for Identification and Sequencing of Unknown Microbial and Eukaryotic Genomes from Complex Mixtures Download PDFInfo
- Publication number
- US20210130914A1 US20210130914A1 US17/146,427 US202117146427A US2021130914A1 US 20210130914 A1 US20210130914 A1 US 20210130914A1 US 202117146427 A US202117146427 A US 202117146427A US 2021130914 A1 US2021130914 A1 US 2021130914A1
- Authority
- US
- United States
- Prior art keywords
- dna
- sequencing
- unknown
- usid
- samples
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 60
- 238000012163 sequencing technique Methods 0.000 title claims description 24
- 239000000203 mixture Substances 0.000 title description 6
- 230000000813 microbial effect Effects 0.000 title description 2
- 239000012472 biological sample Substances 0.000 claims abstract description 49
- 239000000523 sample Substances 0.000 claims abstract description 37
- 108700022487 rRNA Genes Proteins 0.000 claims abstract description 20
- 241000894006 Bacteria Species 0.000 claims abstract description 17
- 238000013412 genome amplification Methods 0.000 claims abstract description 14
- 108020004414 DNA Proteins 0.000 claims description 53
- 102000053602 DNA Human genes 0.000 claims description 53
- 108090000623 proteins and genes Proteins 0.000 claims description 31
- 238000003752 polymerase chain reaction Methods 0.000 claims description 27
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 18
- 238000001712 DNA sequencing Methods 0.000 claims description 16
- 241000282414 Homo sapiens Species 0.000 claims description 11
- 238000007481 next generation sequencing Methods 0.000 claims description 10
- 238000007865 diluting Methods 0.000 claims description 8
- 244000005700 microbiome Species 0.000 claims description 8
- 108020004465 16S ribosomal RNA Proteins 0.000 claims description 7
- 108700005443 Microbial Genes Proteins 0.000 claims description 6
- 210000000987 immune system Anatomy 0.000 claims description 6
- 239000013612 plasmid Substances 0.000 claims description 6
- 125000006850 spacer group Chemical group 0.000 claims description 6
- 101710098119 Chaperonin GroEL 2 Proteins 0.000 claims description 5
- 239000013592 cell lysate Substances 0.000 claims description 5
- 238000007480 sanger sequencing Methods 0.000 claims description 5
- 206010028980 Neoplasm Diseases 0.000 claims description 4
- 230000001580 bacterial effect Effects 0.000 claims description 4
- 230000029087 digestion Effects 0.000 claims description 4
- 239000000463 material Substances 0.000 claims description 4
- 206010062717 Increased upper airway secretion Diseases 0.000 claims description 3
- 241001465754 Metazoa Species 0.000 claims description 3
- 101100038261 Methanococcus vannielii (strain ATCC 35089 / DSM 1224 / JCM 13029 / OCM 148 / SB) rpo2C gene Proteins 0.000 claims description 3
- 231100000757 Microbial toxin Toxicity 0.000 claims description 3
- 239000013060 biological fluid Substances 0.000 claims description 3
- 239000012620 biological material Substances 0.000 claims description 3
- 210000004369 blood Anatomy 0.000 claims description 3
- 239000008280 blood Substances 0.000 claims description 3
- 210000003608 fece Anatomy 0.000 claims description 3
- 238000012268 genome sequencing Methods 0.000 claims description 3
- 230000007918 pathogenicity Effects 0.000 claims description 3
- 208000026435 phlegm Diseases 0.000 claims description 3
- 101150079601 recA gene Proteins 0.000 claims description 3
- 101150085857 rpo2 gene Proteins 0.000 claims description 3
- 101150090202 rpoB gene Proteins 0.000 claims description 3
- 210000001519 tissue Anatomy 0.000 claims description 3
- -1 tissue Substances 0.000 claims description 3
- 239000000427 antigen Substances 0.000 claims description 2
- 108091007433 antigens Proteins 0.000 claims description 2
- 102000036639 antigens Human genes 0.000 claims description 2
- 239000012530 fluid Substances 0.000 claims description 2
- 210000000265 leukocyte Anatomy 0.000 claims description 2
- 210000003705 ribosome Anatomy 0.000 claims description 2
- 229920002477 rna polymer Polymers 0.000 claims description 2
- 241000203069 Archaea Species 0.000 abstract description 11
- 241000206602 Eukaryota Species 0.000 abstract description 10
- 238000012216 screening Methods 0.000 abstract description 6
- 230000008569 process Effects 0.000 abstract description 4
- 230000002068 genetic effect Effects 0.000 abstract description 3
- 239000012470 diluted sample Substances 0.000 abstract description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 29
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 28
- 210000004027 cell Anatomy 0.000 description 26
- 230000003321 amplification Effects 0.000 description 16
- 238000003199 nucleic acid amplification method Methods 0.000 description 16
- 238000000746 purification Methods 0.000 description 12
- 108091093088 Amplicon Proteins 0.000 description 9
- KWYUFKZDYYNOTN-UHFFFAOYSA-M Potassium hydroxide Chemical compound [OH-].[K+] KWYUFKZDYYNOTN-UHFFFAOYSA-M 0.000 description 9
- 108060002716 Exonuclease Proteins 0.000 description 8
- 102000013165 exonuclease Human genes 0.000 description 8
- 239000012634 fragment Substances 0.000 description 8
- 108010017826 DNA Polymerase I Proteins 0.000 description 7
- 102000004594 DNA Polymerase I Human genes 0.000 description 7
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 6
- 238000006073 displacement reaction Methods 0.000 description 5
- 210000003527 eukaryotic cell Anatomy 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 4
- 210000000170 cell membrane Anatomy 0.000 description 4
- 238000002955 isolation Methods 0.000 description 4
- 241001624918 unidentified bacterium Species 0.000 description 4
- 108020004463 18S ribosomal RNA Proteins 0.000 description 3
- 241000322342 Bacillus phage M2 Species 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- 238000012408 PCR amplification Methods 0.000 description 3
- 101150054516 PRD1 gene Proteins 0.000 description 3
- 101100459905 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) NCP1 gene Proteins 0.000 description 3
- 241000701518 Salmonella virus PRD1 Species 0.000 description 3
- 101000865057 Thermococcus litoralis DNA polymerase Proteins 0.000 description 3
- 241000589500 Thermus aquaticus Species 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 238000012512 characterization method Methods 0.000 description 3
- 210000000349 chromosome Anatomy 0.000 description 3
- 230000009089 cytolysis Effects 0.000 description 3
- 239000003599 detergent Substances 0.000 description 3
- 238000003205 genotyping method Methods 0.000 description 3
- 230000004083 survival effect Effects 0.000 description 3
- 230000004544 DNA amplification Effects 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- 241000233866 Fungi Species 0.000 description 2
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 2
- 230000006037 cell lysis Effects 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 239000000356 contaminant Substances 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000013507 mapping Methods 0.000 description 2
- 238000011176 pooling Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 230000007928 solubilization Effects 0.000 description 2
- 238000005063 solubilization Methods 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 1
- 241000195493 Cryptophyta Species 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 241000701988 Escherichia virus T5 Species 0.000 description 1
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 1
- 108091027305 Heteroduplex Proteins 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 241000187479 Mycobacterium tuberculosis Species 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 241000205180 Thermococcus litoralis Species 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 239000012670 alkaline solution Substances 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 108091036078 conserved sequence Proteins 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000002559 cytogenic effect Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 239000005546 dideoxynucleotide Substances 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 229940088598 enzyme Drugs 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000002550 fecal effect Effects 0.000 description 1
- 238000002875 fluorescence polarization Methods 0.000 description 1
- 230000014509 gene expression Effects 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 238000012252 genetic analysis Methods 0.000 description 1
- 238000012165 high-throughput sequencing Methods 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 238000010208 microarray analysis Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 150000007523 nucleic acids Chemical class 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 238000010422 painting Methods 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000001915 proofreading effect Effects 0.000 description 1
- 102000004169 proteins and genes Human genes 0.000 description 1
- 230000017854 proteolysis Effects 0.000 description 1
- 238000012175 pyrosequencing Methods 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 108020004418 ribosomal RNA Proteins 0.000 description 1
- 238000005096 rolling process Methods 0.000 description 1
- 238000004513 sizing Methods 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 210000004881 tumor cell Anatomy 0.000 description 1
- 238000012070 whole genome sequencing analysis Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6888—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6888—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
- C12Q1/689—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms for bacteria
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6813—Hybridisation assays
- C12Q1/6816—Hybridisation assays characterised by the detection means
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/158—Expression markers
Definitions
- the screening process tracks the unknown microbe or eukaryote in a diluted sample where the DNA has been prepared using whole genome amplification. The whole genome of the unknown microbe or eukaryote is then sequenced and assembled.
- Unknown bacteria and archaea typically exist in diverse communities that may consist of hundreds or thousands of different strains, and each strain may comprise a small or large part of a given population. In addition, most bacteria and archaea cannot be cultured by any known means. Shotgun sequencing of all the bacteria and archaea that constitute a complex microbiome results in mostly wasted sequence data that comes primarily from the most frequent and best understood bacteria and archaea. For these reasons, identification, isolation and sequencing of unknown microbes from biological samples is impractical or impossible for most investigators. Disclosed herein are rapid and simple methods for identification, quantification, tracking, isolation, and sequencing of unknown microbes that can be applied to any biological sample containing known and unknown microbes.
- the disclosed methods may be used for isolation and sequencing of microbes in human samples, or for any sample of microbes from any source. Because the disclosed methods do not require microbes to be cultured to obtain sufficient material for sequencing, they can be used to sequence microbes that are considered unculturable.
- DNA amplification techniques are typically employed when DNA sources are limiting, with the goal of producing enough DNA for study.
- Some of these amplification techniques include I-PEP-PCR (Dietmaier, W., Hartmann, A., Wallinger, S., Heinmoller, E., Kerner, T., Endl, E., Jauch, K. W., Hofstadter, F., and Ruschoff, J. Multiple mutation analyses in single tumor cells with improved whole genome amplification, Am. J. Pathol . (1999) 154: 83-95), PEP-PCR (Zhang, L., Cui, X., Schmitt. K., Hubert, R., Navidi, W., and Arnheim, N.
- Thermostable Bst DNA polymerase exonuclease ( ⁇ ) large fragment Aliotta, J. M., J. J. Pelletier, J. L. Ware, L. S. Moran, J. S. Benner, and H. Kong.
- Thermostable Bst DNA polymerase I lacks a 3 ′ to 5 ′ proofreading exonuclease activity. Genet. Anal. 12:185-195, (1996)), Exonuclease ( ⁇ ) Bca DNA polymerase (Walker, G. T. and Linn, C. P.
- Samples of interest may contain mixtures of prokaryotes or eukaryotes, both known and unknown.
- An example is a microbiome sample that contains mixtures of known and unknown microbes.
- Microbes can be identified by their DNA sequence.
- An efficient method for scanning many microbial samples at once is to prepare barcoded DNA amplicons targeting the rRNA genes from each sample, and pool the barcoded samples for DNA sequencing in a high throughput format. Sequencing the rRNA genes from the microbes will reveal previously unsequenced genetic regions (Unique Sequence IDs, or USIDs) corresponding to novel bacteria.
- the USIDs can be used to detect individual unknown bacteria, which can be isolated from the original sample, and their genomic DNA can be amplified by Whole Genome Amplification for genome sequencing.
- identifying an unknown microbe in a biological sample including: (a) selecting one or more biological samples having an unknown microbe, wherein the selecting includes deoxyribonucleic acid (DNA) sequencing of one or more polymerase chain reaction (PCR) target sequences from whole genome DNA of the biological samples, so as to identify one or more Unique DNA sequence Identifiers (USID) of the unknown microbe; (b) diluting the biological sample having the unknown microbe selected in step (a) into a high throughput format so as to enable whole genome amplification and sequencing of single microbes in the biological sample; (c) amplifying whole genome DNA from the single microbes in step (b) in a high throughput format; (d) PCR amplifying the USID sequences identified in step (a) from the genomic DNA in step (c) and detecting the presence or absence of the USID of the unknown microbe in each sample from step (c), and (e) conducting DNA sequencing on the whole genome amplified DNA of the unknown microbe identified in step (d
- the one or more USID sequences are highly conserved gene sequences, or portions or fragments of such gene sequences, for example survival genes or other genes that are conserved in over 70% of prokaryotes.
- the USID sequence is selected from the group consisting of: a bacterial or eukaryotic ribosomal ribonucleic acid (rRNA) gene, human leukocyte antigen (HLA), microbial toxin producing genes, microbial pathogenicity genes, microbial plasmid genes, human immune system genes, immune system components, 16S rRNA gene sequence, 18S rRNA gene sequence, 26S rRNA sequence, Internally Transcribed Spacer 1 (ITS1), Internally Transcribed Spacer 2 (ITS2), heat shock protein 65 (HSP65), rpoB, recA or any combination thereof, including portions or fragments thereof.
- the USID sequence is a rRNA gene sequence and an ITS sequence, or portions or fragments thereof.
- the diluting in step (b) results in one microbe in at least one well of a multiwell plate.
- the step of detecting the presence or absence of the USID comprises DNA sequencing or restriction digestion or both.
- the high throughput format is selected from the group consisting of: at least six samples, at least twenty-four samples, at least forty-eight samples, or at least ninety-six samples. In some embodiments, the high throughput format contains ninety-six samples.
- the biological sample is any material or fluid obtained from a human, animal, plant, or the environment.
- the biological sample is a crude biological sample or a partially-purified biological sample.
- the biological sample is selected from the group consisting of: feces, cell lysate, tissue, blood, tumor, tongue, tooth, buccal swab, phlegm, mucous, wound swab, skin swab, vaginal swab, biological material or biological fluid.
- the biological sample is a microbiome sample.
- the DNA sequencing in step (c) is selected from the group consisting of: a next-generation sequencing (NGS) method, Sanger sequencing or other sequencing methods.
- NGS next-generation sequencing
- the assembling in step (e) employs computer implemented methods.
- the methods disclosed herein are performed to identify one or more unknown eukaryotes in a biological sample.
- the USID sequences are selected from highly conserved eukaryotic gene sequences, or portions or fragments of such gene sequences, for example the 18s rRNA gene and related eukaryotic rRNA genes.
- FIG. 1 depicts an example of an embodiment of the present invention as described herein.
- Microbes such as bacteria and archaea
- rRNA ribosomal RNA
- USID Unique Sequence IDentifiers
- the methods described herein enable comprehensive generation of USIDs for each of the microbes in a sample, at high resolution, using amplicons spanning sections of the rRNA gene, using, for example, the Shoreline Biome EXT-16S DNA Purification and PCR Amplification Kit, which generates USID sequences in the Internally Transcribed Spacer Region between the 16S and 23S rRNA genes in bacteria and archaea.
- a subset of samples (referred to as “sub-samples”) may contain one or more previously unknown microbes with novel USIDs that can be identified and tracked using their novel amplicon DNA sequence. These unknown microbes are of significant interest to investigators studying the microbiome.
- the general methods described herein can also be used to track rRNA gene sequences to identify novel eukaryotes, such as fungi, from biological samples.
- the term “unknown,” as used herein, means a cell or microbe or other target of interest that is not known to be present in a biological sample or has not been confirmed as present in a biological sample.
- the steps of the methods include identifying and selecting biological samples having unknown bacteria that contain novel USID DNA sequences; diluting the selected biological sample into a high throughput format so as to enable whole genome amplification of single cells (or microbes) in a sub-sample of the biological sample: amplifying whole genome DNA from the single cells or microbes in a high throughput format; performing PCR amplification on the whole genome amplified DNA, for example PCR of the rRNA sequences from the genomic DNA, wherein the target rRNA amplicon sequence contains a recognizable unique sequence element (USID) that is simultaneously tagged with a unique DNA code corresponding to the sample; either digesting the amplicon with restriction enzymes that cut the USID into discrete fragments so that the presence of the unknown's USID in that sample can be recognized by a pattern on a gel, or pooling and sequencing the rRNA amplicons to identify the samples containing the USID from the unknown of interest; and conducting DNA sequencing on the samples containing the whole genome of the unknown single cell (
- the step of selecting biological samples having unknown bacteria with novel USID DNA sequences includes (i) purifying genomic DNA from biological samples in a high throughput format; and (ii) for each biological sample, PCR amplifying target DNA sequences (e.g., rRNA gene) from the genomic DNA from step (i), wherein each target DNA sequence has a Unique Sequence Identifier (USID) associated with the cell (or microbe) of interest, and sequencing the PCR target DNA sequences so as to identify biological samples having an unknown target DNA sequence which can be selected for subsequent steps of the method.
- USID Unique Sequence Identifier
- a commercially available kit for example the Shoreline Biome EXT-16S DNA Purification and PCR Amplification Kit, can be used for this purpose.
- the step of diluting the selected biological sample results in a sub-sample of one cell (or microbe) in at least one well of a multiwell plate. In some embodiments, the diluting results in a sub-sample of one or more cells (or microbes) in at least one well of a multiwell plate. In some embodiments, the diluting results in a sub-sample of one, two, three, four, live, six, seven, eight, nine or ten cells (or microbes) in at least one well of a multiwell plate.
- the methods are used to identify and sequence an unknown organism or cell selected from the group consisting of: multicellular organisms, unicellular organisms, prokaryotes, eukaryotes, microbes, bacteria, archaea, protozoa, algae and fungi.
- the high throughput format is selected from the group consisting of: at least six samples, at least twenty-four samples, at least forty-eight samples, or at least ninety-six samples.
- Biological samples may be transferred to multi-well plates, for example 96, 384 or 1096 well plates, or to microreactors contained in microfluidics devices that integrate one or more laboratory functions.
- the biological sample is selected from the group consisting of: feces, cell lysate, tissue, blood, tumor, tongue, tooth, buccal swab, phlegm, mucous, wound swab, skin swab, vaginal swab, or any other biological material or biological fluid originally obtained from a human, animal, plant, or environmental sample, including raw samples, complex samples, mixtures, and microbiome samples.
- the biological sample is crude or partially-purified.
- a “crude biological sample” as used herein, means a sample that has not been processed, altered or treated relative to its natural state.
- a “partially-purified biological sample” as used herein means a sample that has been processed, altered or treated relative to its natural state but still contains contaminants or impurities.
- the method further comprises a step of cell lysis or cell membrane solubilization to open the cells to make the DNA accessible for amplification by a polymerase.
- Methods of cell lysis and cell membrane solubilization are known in the art, for example, alkaline lysis. (Bimboim, H. C. and J. Doly, J., A rapid alkaline extraction procedure for screening recombinant plasmid DNA. Nucl. Acids Res . (1979) 7 (6): 1513-1523).
- cells in the crude sample are lysed an alkaline solution consisting of 0.2M KOH. Either higher or lower concentrations of KOH may be used. Other bases, such as NaOH, may be substituted.
- a detergent such as sodium dodecyl sulfate
- the method comprises one or more reagents to lyse cells or solubilize cell membranes so as to release DNA from cells in the sample, including but not limited to, alkaline reagents or bases, for example potassium hydroxide or sodium hydroxide, or a detergent, such as sodium dodecyl sulfate, or an enzyme, such as lysozyme.
- purification of DNA involves cell lysate that is used directly in PCR.
- purification of DNA involves cell lysate that is subjected to further DNA purification methods know in the art, such as various DNA miniprep methods, alcohol precipitation, or commercially available methods such as column purification.
- Whole genome amplification steps may include thermocycling or isothermal protocols, or a combination thereof.
- Whole genome amplification primers may include random primers or target specific primers, or combinations thereof.
- purification of DNA involves the amplification of whole genome DNA using a DNA polymerase capable of producing high yields of purified DNA from a crude or partially purified biological sample.
- the polymerase is a strand displacement DNA polymerase.
- the polymerase is phi29 DNA polymerase (NCBI Accession No: ACE96023, U.S. Pat. Nos. 5,198,543 and 5,001,050, inventors Luis Blanco, Antonio Bernad, Margarita Salas).
- the polymerase is selected from the group consisting of phi29, Thermostable Bst DNA polymerase exonuclease ( ⁇ ) large fragment, Exonuclease ( ⁇ ) Bca DNA polymerase, Thermus aquaticus YT-1 polymerase, Phage M2 DNA polymerase, Phage PRD1 DNA polymerase, Exonuclease ( ⁇ )VENT DNA polymerase, Klenow fragment of DNA polymerase I, T5 DNA polymerase, and PRD1 DNA polymerase.
- the amplification of target DNA sequences employs a polymerase chain reaction (PCR).
- PCR primers may contain identifying ‘barcode’ sequences that can be used to determine sample identity after pooling with other samples and sequencing.
- DNA barcodes can be selected to be of sufficient length to generate the desired number of barcodes with sufficient variability to account for common sequencing errors, generally ranging in size from about 3 to about 20 bases, but may be longer or shorter.
- the barcode has a length of 2 bases, 3 bases, 4 bases, 5 bases, 6 bases, 7, bases, 8 bases, 9 bases 10 bases, 11 bases 12 based 13 bases, 14 bases, 15 bases, 16 bases, 17 bases, 18 bases, 19 bases, 20 bases, 25 bases, 30 bases or 35 bases.
- the target specific PCR sequences for the forward and reverse PCR primers can be specific for any DNA sequence, in coding or non-coding regions of a target genome, plasmid, or organelle.
- the target USID DNA sequence is an amplicon, or targeted gene sequence, such as bacterial or eukaryotic rRNA genes, human HLA, microbial toxin producing genes, microbial pathogenicity genes, microbial plasmid genes, human immune system genes, immune system components, a 16S rRNA gene sequence, 18S rRNA gene sequence, 26S rRNA sequence, other genes such as ITS1, ITS2, HSP65, rpoB, or recA, and other variable genetic regions of non-human organisms.
- the target USID sequence encompasses one or more gene sequences or portions thereof. In some embodiments, the target USID sequence encompasses the 16S rRNA gene, the adjacent Internally Transcribed Spacer (ITS) region, and part of the 23S rRNA gene. In some embodiments the target USID sequence encompasses the 16S rRNA gene. In some embodiments, the target USID sequence consists of all or part of each of the 16S rRNA gene, the ITS region, the 23S rRNA gene.
- the high throughput sequencing method can be any method, with high throughput generally meaning greater than 1000 reads per run.
- the high throughput DNA sequencing is a next-generation sequencing (NGS) method.
- the DNA sequencing method is Sanger sequencing or other low throughput sequencing methods.
- Sanger sequencing is a method of DNA sequencing based on the selective incorporation of chain-terminating dideoxynucleotides by DNA polymerase during in vitro DNA replication.
- assembling DNA sequences into a genome employs computer implemented methods.
- FIG. 1 depicts an example of an embodiment of the present invention used to identify, isolate, and sequence an unknown microbe in a biological sample without having to culture the organism (Steps 1-5, described below).
- Step 1 a number (for example 96) biological samples are screened for novel microbes using DNA sequencing methods.
- the screening process in this embodiment includes lysis of cells and purification of cellular genomic DNA (gDNA) from the sample, amplification of the rRNA gene target region(s) while adding DNA barcodes to each sample so that the 96 samples can be pooled for simultaneous sequencing.
- the process as described in this example can be carried out using, for example, Shoreline Biome DNA Preparation and Amplicon Kits.
- the amplified barcoded rRNA sequences in each sample can be screened for the presence of unknown sequences that are contained in unknown microbes.
- the rRNA sequence data will reveal the presence of unknown microbes in the samples.
- samples may be sequenced together, the reads that originated with each sample can be identified by reading the barcode attached during PCR of the target region.
- the PCR target region is selected such that all microbes will generate a PCR product, for example, the target may be in the 16S, 23S, ITS, or similar genespace that is required for survival, and therefore conserved. Part or all of the sequenced PCR target can become a USID for each organism in the sample.
- Previously unknown USIDs are identified by mapping the DNA sequencing results to genomic databases such as GenBank.
- a sample with an unknown rRNA gene unique sequence ID (USID) in the ITS region between the 16S and 23S rRNA genes corresponding to an unknown microbe is identified as the sample of interest.
- USID unknown rRNA gene unique sequence ID
- an aliquot of a sample of interest is diluted into wells such that only one microbe (or cell) is in each well.
- This is especially useful in the case of fecal microbiomes, where there is a very low percentage ( ⁇ 1%) of host eukaryotic cells. Since eukaryotic genomes are approximately 1000 ⁇ larger than most bacterial genomes, limiting the number of eukaryotic cells is important for downstream shotgun sequencing.
- the methods can also be used when the sample of interest is diluted to a limited number of microbes per well, for example, ten or fewer microbes per well.
- microbe(s) in each well are subjected to high throughput whole genome amplification (shown here in 8 wells of a multiwell plate) in parallel to produce high yields of purified DNA, for example using alkaline, detergent, or enzymatic lysis to open cells followed by phi29 or Bst DNA polymerase amplification methods.
- a small amount of each whole genome amplified sample is analyzed for the presence of the USID corresponding to the unknown microbe.
- the USID sequence is amplified by PCR using primers specific for the USID.
- the presence of a cell in each diluted well is determined by gel sizing of the PCR product. Wells with no PCR product can be ignored for subsequent steps.
- Samples with PCR product corresponding to the expected size of the USID must have contained a cell with a USID similar in size to the target organism, and are selected for testing in subsequent steps.
- the USID base sequence can be determined by DNA sequencing and/or restriction digestion using methods known in the art. DNA sequencing will determine the full sequence of the target region PCR USID product, but can be slow and expensive.
- Restriction digestion of the target region PCR USID product is performed with one or more restriction enzymes that cut the desired target at specific locations, resulting in specific banding patterns on a gel.
- the banding patterns can be compared to those expected based on the USID sequence to identify wells containing the microbe of interest.
- Whole genome amplified DNA from the well with the unknown organism/USID genetic sequence is subjected to shotgun sequencing to sequence the full genome of the unknown microbe.
- the dilution was performed to generate one microbe per well, only the unknown microbe of interest is expected to be present.
- reads mapping only to known microbe(s) can be discarded prior to assembly, or an assembly can be done de novo, on all DNA within the multi-microbe mixture, to reconstruct the separate genomes of each microbe in the mix.
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Analytical Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Immunology (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Description
- This application claims the benefit under 35 U.S.C. § 119(e) to U.S. Provisional Application No. 62/337,507, filed May 17, 2016, the entire contents of which are incorporated herein by reference in their entirety.
- Disclosed are methods for screening biological samples for the presence of unknown microbes, including prokaryotes (bacteria and archaea) and eukaryotes using ribosomal RNA gene regions, across multiple biological samples using a Unique Sequence IDentifier (USID) corresponding to the microbe. The screening process tracks the unknown microbe or eukaryote in a diluted sample where the DNA has been prepared using whole genome amplification. The whole genome of the unknown microbe or eukaryote is then sequenced and assembled.
- Unknown bacteria and archaea typically exist in diverse communities that may consist of hundreds or thousands of different strains, and each strain may comprise a small or large part of a given population. In addition, most bacteria and archaea cannot be cultured by any known means. Shotgun sequencing of all the bacteria and archaea that constitute a complex microbiome results in mostly wasted sequence data that comes primarily from the most frequent and best understood bacteria and archaea. For these reasons, identification, isolation and sequencing of unknown microbes from biological samples is impractical or impossible for most investigators. Disclosed herein are rapid and simple methods for identification, quantification, tracking, isolation, and sequencing of unknown microbes that can be applied to any biological sample containing known and unknown microbes. The disclosed methods may be used for isolation and sequencing of microbes in human samples, or for any sample of microbes from any source. Because the disclosed methods do not require microbes to be cultured to obtain sufficient material for sequencing, they can be used to sequence microbes that are considered unculturable.
- Methods of preparing DNA, amplifying rRNA gene sequences and sample barcode tagging are described in U.S. patent application Ser. No. 15/372,588, which is incorporated by reference herein in its entirety.
- DNA amplification techniques are typically employed when DNA sources are limiting, with the goal of producing enough DNA for study. Some of these amplification techniques include I-PEP-PCR (Dietmaier, W., Hartmann, A., Wallinger, S., Heinmoller, E., Kerner, T., Endl, E., Jauch, K. W., Hofstadter, F., and Ruschoff, J. Multiple mutation analyses in single tumor cells with improved whole genome amplification, Am. J. Pathol. (1999) 154: 83-95), PEP-PCR (Zhang, L., Cui, X., Schmitt. K., Hubert, R., Navidi, W., and Arnheim, N. Whole genome amplification from a single cell: Implications for genetic analysis. Proc. Natl. Acad. Sci. (1992) 89: 5847-5851), phi29 amplification (Lizardi. Paul M. Multiple displacement amplification, U.S. Pat. No. 6,280,949, issued Aug. 28, 2001), and Degenerate Oligonucleotide Primer (DOP) PCR (Telenius, H., Ponder, B., Tunnacliffe, A., Pelmear, A., Carter, N., Ferguson-Smith, M., Behmel A., Nordenskjöld, M., and Pfragner, R. Cytogenetic analysis by chromosome painting using dop-pcr amplified flow-sorted chromosomes, Genes, Chromosomes and Cancer 4(3) (1992): 257-263). Although it is typically used for whole genome amplification reactions, phi29 polymerase is also useful because it can function in the presence of contaminants that strongly inhibit other polymerases. Phi29 DNA polymerase is commercially available from vendors such as Thermo Fisher Scientific (Waltham, Mass.) and New England BioLabs (Ipswich, Mass.). U.S. Pat. No. 6,280,949 is incorporated by reference herein in its entirety.
- Barcoding individual samples with specific DNA tags allows many samples to be combined for sequencing, streamlining the workflow, decreasing time to result, and reducing costs. DNA barcoding for sample identification has been described for the very first of the next generation sequencing (NGS) platforms, the GS 20, in 2007 (Parameswaran P, Jalili R, Tao L, et al. A pyrosequencing-tailored nucleotide barcode design unveils opportunities for large-scale sample multiplexing. Nucleic Acids Research. 2007; 35(19): e130. doi:10.1093/nar/gkn760). Since that time, barcoding strategies have been made commercially available by Illumina (San Diego, Calif.), Pacific Biosciences (Menlo Park, Calif.), and Thermo Fisher Scientific (Waltham, Mass.) sequencing platforms.
- Rolling circle amplification protocols, materials, and methods using phi29 polymerase are described in Dean, et al. (Dean F. B., Nelson, J. R., Giesler, T. L., and Lasken, R. S. Rapid Amplification of Plasmid and Phage DNA Using Phi29 DNA Polymerase and Multiply-Primed Rolling Circle Amplification. Genome Research, (2001), 11(6), 1095-1099). Whole genome amplification using phi29 DNA polymerase is described in U.S. Pat. No. 6,124,120, Paul Lizardi, issued Sep. 26, 2000, titled “Multiple displacement amplification”. U.S. Pat. No. 6,124,120 is incorporated by reference herein in its entirety.
- Advantages of using phi29 as a DNA purification technique were outlined in R. Deadman, and K. Jones. DNA purification through amplification: Use of Phi29 DNA polymerase to prepare DNA for genomic analyses, Amersham Biosciences Lie Science News, (18), (2004): 14-15, which highlights the use of phi29 amplification for sequencing of single clones using Sanger sequencing, and mentions that “amplified DNA has been successfully used in many applications including PCR (simple, multiplex and real-time), SNP genotyping (Third Wave Invader™ assay (Third Wave Technologies, Inc., Madison, Wis.), MegaBACE™ SNuPe™ genotyping kit (GE Healthcare Life Sciences, Pittsburgh, Pa.), Affymetrix™ GeneChip™ HuSNP™ chip (Affymetrix, Santa Clara, Calif.), Pyrosequencing, STR and SSR genotyping, comparative genomic hybridization (CGI), cloning and library construction, heteroduplex analysis, slot and dot blots, yeast-2-hybrid systems, and microarray analysis”. This article was published by the commercial vendor Amersham before the launch of Next-Generation Sequencing Systems.
- Other polymerases are known in the art, including Thermostable Bst DNA polymerase exonuclease (−) large fragment (Aliotta, J. M., J. J. Pelletier, J. L. Ware, L. S. Moran, J. S. Benner, and H. Kong. Thermostable Bst DNA polymerase I lacks a 3′ to 5′ proofreading exonuclease activity. Genet. Anal. 12:185-195, (1996)), Exonuclease (−) Bca DNA polymerase (Walker, G. T. and Linn, C. P. Detection of Mycobacterium tuberculosis DNA with thermophilic strand displacement amplification and fluorescence polarization. Clinical Chemistry 42:1604-1608 (1996)), Thermus aquaticus YT-1 polymerase (Lawyer, F. C., Stoffel, S., Saikit, R. K., Myambo, K., Drummond, R., and Gelfand D. H. Isolation, Characterization, and Expression in Escherichia coli of the DNA Polymerase Gene from Thermus aquaticus. J. Biol. Chem., 264, 6427-6437 (1989)), Phage M2 DNA polymerase (Matsumoto, K., Takano, H., Kim, C. I., and Hirokawa H. Primary structure of bacteriophage M2 DNA polymerase: conserved segments within protein-priming DNA polymerases and DNA polymerase I of Escherichia coli. Gene 84:247-255 (1989)), Phage PRD1 DNA polymerase (Jung, G., Leavitt M. C., Hsieh J-C., Ito, J. Bacteriophage PRD1 DNA polymerase: evolution of DNA polymerases. Proc. Natl. Acad. Sci. U.S.A. 84:8287-8291 (1987)), Exonuclease (−)VENT DNA polymerase (Kong. H., Kucera, R. B., and Jack W. E. Characterization of a DNA polymerase from the hyperthermophile archaea Thermococcus litoralis. Vent DNA polymerase, steady state kinetics, thermal stability, processivity, strand displacement, and exonuclease activities. J. Biol. Chem. 268:1965-1975 (1993)), Klenow fragment of DNA polymerase I (Jacobsen, H., Klenow, H., and Overoaard-Hansen, H. The N-Terminal Amino-Acid Sequences of DNA Polymerase I from Escherichia coli and of the Large and the Small Fragments Obtained by a Limited Proteolysis. Eur J. Biochem. 45:623-627 (1974)), T5 DNA polymerase (Chatterjee D. K. Fujimura R. K., Campbell J. H., Gerard G. F. Cloning and overexpression of the gene encoding bacteriophage T5 DNA polymerase. Gene 97:13-19 (1991)) U.S. Pat. No. 5,270,179 and PRD1 DNA polymerase (Zhu, W; Ito, J.: Purification and characterization of PRD1 DNA polymerase. Biochimica et Biophysica Acta 1219:267-276 (1994)).
- A publication by Robert Pinard, et al. (Pinard, R., et al., “Assessment of whole genome amplification-induced bias through high-throughput, massively parallel whole genome sequencing” (BMC Genomics (7):216 (2006)) indicated that the bias across the genome for certain sequences was up to 100 fold, implying that there are severe limitations on the utility of the method for DNA purification.
- In a publication demonstrating amplification of DNA from a single cell for 16S rRNA amplicon sequencing, the inventor Mark Driscoll was an author because his group performed the work at Molecular Staging in 2004, prior to the introduction of Next Generation Sequencing technology. The publication contains useful discussion of bias as it relates to number of cells in starting material. (Ragunathan, et al., Genomic DNA Amplification from a Single Bacterium, Appl. Environ. Microbiol. June 2005, vol. 71 no. 6 3342-3347.) See also, Blainey, P. C., The future is now: single-cell genomics of bacteria and archaea. FEMS Microbiol Rev 37 (2013) 407-427, and Lasken, R., and McLean, J., Recent advances in genomic DNA sequencing of microbial species from single cells. Nat Rev Genet. 2014 September; 15(9): 577-584.
- Samples of interest may contain mixtures of prokaryotes or eukaryotes, both known and unknown. An example is a microbiome sample that contains mixtures of known and unknown microbes. Microbes can be identified by their DNA sequence. An efficient method for scanning many microbial samples at once is to prepare barcoded DNA amplicons targeting the rRNA genes from each sample, and pool the barcoded samples for DNA sequencing in a high throughput format. Sequencing the rRNA genes from the microbes will reveal previously unsequenced genetic regions (Unique Sequence IDs, or USIDs) corresponding to novel bacteria. The USIDs can be used to detect individual unknown bacteria, which can be isolated from the original sample, and their genomic DNA can be amplified by Whole Genome Amplification for genome sequencing.
- Disclosed herein are methods for identifying an unknown microbe in a biological sample, including: (a) selecting one or more biological samples having an unknown microbe, wherein the selecting includes deoxyribonucleic acid (DNA) sequencing of one or more polymerase chain reaction (PCR) target sequences from whole genome DNA of the biological samples, so as to identify one or more Unique DNA sequence Identifiers (USID) of the unknown microbe; (b) diluting the biological sample having the unknown microbe selected in step (a) into a high throughput format so as to enable whole genome amplification and sequencing of single microbes in the biological sample; (c) amplifying whole genome DNA from the single microbes in step (b) in a high throughput format; (d) PCR amplifying the USID sequences identified in step (a) from the genomic DNA in step (c) and detecting the presence or absence of the USID of the unknown microbe in each sample from step (c), and (e) conducting DNA sequencing on the whole genome amplified DNA of the unknown microbe identified in step (d), and assembling the DNA sequences into the genome of the unknown microbe. In some embodiments, the methods are conducted to identify more than one unknown microbe in a biological sample.
- In some embodiments, the one or more USID sequences are highly conserved gene sequences, or portions or fragments of such gene sequences, for example survival genes or other genes that are conserved in over 70% of prokaryotes. In some embodiments, the USID sequence is selected from the group consisting of: a bacterial or eukaryotic ribosomal ribonucleic acid (rRNA) gene, human leukocyte antigen (HLA), microbial toxin producing genes, microbial pathogenicity genes, microbial plasmid genes, human immune system genes, immune system components, 16S rRNA gene sequence, 18S rRNA gene sequence, 26S rRNA sequence, Internally Transcribed Spacer 1 (ITS1), Internally Transcribed Spacer 2 (ITS2), heat shock protein 65 (HSP65), rpoB, recA or any combination thereof, including portions or fragments thereof. In some embodiments, the USID sequence is a rRNA gene sequence and an ITS sequence, or portions or fragments thereof.
- In some embodiments, the diluting in step (b) results in one microbe in at least one well of a multiwell plate. In some embodiments, the step of detecting the presence or absence of the USID comprises DNA sequencing or restriction digestion or both.
- In some embodiments, the high throughput format is selected from the group consisting of: at least six samples, at least twenty-four samples, at least forty-eight samples, or at least ninety-six samples. In some embodiments, the high throughput format contains ninety-six samples.
- In some embodiments, the biological sample is any material or fluid obtained from a human, animal, plant, or the environment. In some embodiments, the biological sample is a crude biological sample or a partially-purified biological sample. In some embodiments, the biological sample is selected from the group consisting of: feces, cell lysate, tissue, blood, tumor, tongue, tooth, buccal swab, phlegm, mucous, wound swab, skin swab, vaginal swab, biological material or biological fluid. In some embodiments, the biological sample is a microbiome sample.
- In some embodiments, the DNA sequencing in step (c) is selected from the group consisting of: a next-generation sequencing (NGS) method, Sanger sequencing or other sequencing methods. In some embodiments, the assembling in step (e) employs computer implemented methods.
- In some embodiments, the methods disclosed herein are performed to identify one or more unknown eukaryotes in a biological sample. In such embodiments, the USID sequences are selected from highly conserved eukaryotic gene sequences, or portions or fragments of such gene sequences, for example the 18s rRNA gene and related eukaryotic rRNA genes.
- The subject matter that is regarded as the invention is particularly pointed out and distinctly claimed in the claims at the conclusion of the specification. The foregoing and other objects, features, and advantages of the invention will be apparent from the following detailed description taken in conjunction with the accompanying drawings.
-
FIG. 1 depicts an example of an embodiment of the present invention as described herein. - Microbes, such as bacteria and archaea, can be identified using their ribosomal RNA (rRNA) gene sequence, which is required for the survival of all prokaryotic microbes. Individual species of bacteria and archaea have characteristic DNA variations in the rRNA gene that serve as identifiers, fingerprints, or Unique Sequence IDentifiers (USID) for that species. The methods described herein enable comprehensive generation of USIDs for each of the microbes in a sample, at high resolution, using amplicons spanning sections of the rRNA gene, using, for example, the Shoreline Biome EXT-16S DNA Purification and PCR Amplification Kit, which generates USID sequences in the Internally Transcribed Spacer Region between the 16S and 23S rRNA genes in bacteria and archaea. A subset of samples (referred to as “sub-samples”) may contain one or more previously unknown microbes with novel USIDs that can be identified and tracked using their novel amplicon DNA sequence. These unknown microbes are of significant interest to investigators studying the microbiome. The general methods described herein can also be used to track rRNA gene sequences to identify novel eukaryotes, such as fungi, from biological samples.
- Disclosed herein are methods for identifying, isolating and sequencing unknown cells (or microbes) in a biological sample. The term “unknown,” as used herein, means a cell or microbe or other target of interest that is not known to be present in a biological sample or has not been confirmed as present in a biological sample. The steps of the methods include identifying and selecting biological samples having unknown bacteria that contain novel USID DNA sequences; diluting the selected biological sample into a high throughput format so as to enable whole genome amplification of single cells (or microbes) in a sub-sample of the biological sample: amplifying whole genome DNA from the single cells or microbes in a high throughput format; performing PCR amplification on the whole genome amplified DNA, for example PCR of the rRNA sequences from the genomic DNA, wherein the target rRNA amplicon sequence contains a recognizable unique sequence element (USID) that is simultaneously tagged with a unique DNA code corresponding to the sample; either digesting the amplicon with restriction enzymes that cut the USID into discrete fragments so that the presence of the unknown's USID in that sample can be recognized by a pattern on a gel, or pooling and sequencing the rRNA amplicons to identify the samples containing the USID from the unknown of interest; and conducting DNA sequencing on the samples containing the whole genome of the unknown single cell (or microbe) of interest from and assembling the DNA sequences into the genome of the single cell (or microbe).
- In some embodiments, the step of selecting biological samples having unknown bacteria with novel USID DNA sequences includes (i) purifying genomic DNA from biological samples in a high throughput format; and (ii) for each biological sample, PCR amplifying target DNA sequences (e.g., rRNA gene) from the genomic DNA from step (i), wherein each target DNA sequence has a Unique Sequence Identifier (USID) associated with the cell (or microbe) of interest, and sequencing the PCR target DNA sequences so as to identify biological samples having an unknown target DNA sequence which can be selected for subsequent steps of the method. A commercially available kit, for example the Shoreline Biome EXT-16S DNA Purification and PCR Amplification Kit, can be used for this purpose.
- In some embodiments, the step of diluting the selected biological sample results in a sub-sample of one cell (or microbe) in at least one well of a multiwell plate. In some embodiments, the diluting results in a sub-sample of one or more cells (or microbes) in at least one well of a multiwell plate. In some embodiments, the diluting results in a sub-sample of one, two, three, four, live, six, seven, eight, nine or ten cells (or microbes) in at least one well of a multiwell plate.
- In some embodiments, the methods are used to identify and sequence an unknown organism or cell selected from the group consisting of: multicellular organisms, unicellular organisms, prokaryotes, eukaryotes, microbes, bacteria, archaea, protozoa, algae and fungi.
- In some embodiments, the high throughput format is selected from the group consisting of: at least six samples, at least twenty-four samples, at least forty-eight samples, or at least ninety-six samples. Biological samples may be transferred to multi-well plates, for example 96, 384 or 1096 well plates, or to microreactors contained in microfluidics devices that integrate one or more laboratory functions.
- In some embodiments, the biological sample is selected from the group consisting of: feces, cell lysate, tissue, blood, tumor, tongue, tooth, buccal swab, phlegm, mucous, wound swab, skin swab, vaginal swab, or any other biological material or biological fluid originally obtained from a human, animal, plant, or environmental sample, including raw samples, complex samples, mixtures, and microbiome samples. In some embodiments, the biological sample is crude or partially-purified. A “crude biological sample” as used herein, means a sample that has not been processed, altered or treated relative to its natural state. A “partially-purified biological sample” as used herein, means a sample that has been processed, altered or treated relative to its natural state but still contains contaminants or impurities.
- In some embodiments, the method further comprises a step of cell lysis or cell membrane solubilization to open the cells to make the DNA accessible for amplification by a polymerase. Methods of cell lysis and cell membrane solubilization are known in the art, for example, alkaline lysis. (Bimboim, H. C. and J. Doly, J., A rapid alkaline extraction procedure for screening recombinant plasmid DNA. Nucl. Acids Res. (1979) 7 (6): 1513-1523). In some embodiments of the methods described herein, cells in the crude sample are lysed an alkaline solution consisting of 0.2M KOH. Either higher or lower concentrations of KOH may be used. Other bases, such as NaOH, may be substituted. In some embodiments, a detergent, such as sodium dodecyl sulfate, may be used to solubilize cell membranes and proteins. In some embodiments, the method comprises one or more reagents to lyse cells or solubilize cell membranes so as to release DNA from cells in the sample, including but not limited to, alkaline reagents or bases, for example potassium hydroxide or sodium hydroxide, or a detergent, such as sodium dodecyl sulfate, or an enzyme, such as lysozyme.
- In some embodiments, purification of DNA involves cell lysate that is used directly in PCR.
- In some embodiments, purification of DNA involves cell lysate that is subjected to further DNA purification methods know in the art, such as various DNA miniprep methods, alcohol precipitation, or commercially available methods such as column purification.
- Whole genome amplification steps may include thermocycling or isothermal protocols, or a combination thereof. Whole genome amplification primers may include random primers or target specific primers, or combinations thereof. In some embodiments, purification of DNA involves the amplification of whole genome DNA using a DNA polymerase capable of producing high yields of purified DNA from a crude or partially purified biological sample. In some embodiments, the polymerase is a strand displacement DNA polymerase. In some embodiments, the polymerase is phi29 DNA polymerase (NCBI Accession No: ACE96023, U.S. Pat. Nos. 5,198,543 and 5,001,050, inventors Luis Blanco, Antonio Bernad, Margarita Salas). In some embodiments, the polymerase is selected from the group consisting of phi29, Thermostable Bst DNA polymerase exonuclease (−) large fragment, Exonuclease (−) Bca DNA polymerase, Thermus aquaticus YT-1 polymerase, Phage M2 DNA polymerase, Phage PRD1 DNA polymerase, Exonuclease (−)VENT DNA polymerase, Klenow fragment of DNA polymerase I, T5 DNA polymerase, and PRD1 DNA polymerase.
- In some embodiments, the amplification of target DNA sequences employs a polymerase chain reaction (PCR). In some embodiments, PCR primers may contain identifying ‘barcode’ sequences that can be used to determine sample identity after pooling with other samples and sequencing. DNA barcodes can be selected to be of sufficient length to generate the desired number of barcodes with sufficient variability to account for common sequencing errors, generally ranging in size from about 3 to about 20 bases, but may be longer or shorter. In some embodiments, the barcode has a length of 2 bases, 3 bases, 4 bases, 5 bases, 6 bases, 7, bases, 8 bases, 9 bases 10 bases, 11 bases 12 based 13 bases, 14 bases, 15 bases, 16 bases, 17 bases, 18 bases, 19 bases, 20 bases, 25 bases, 30 bases or 35 bases. The target specific PCR sequences for the forward and reverse PCR primers can be specific for any DNA sequence, in coding or non-coding regions of a target genome, plasmid, or organelle. In some embodiments, the target USID DNA sequence is an amplicon, or targeted gene sequence, such as bacterial or eukaryotic rRNA genes, human HLA, microbial toxin producing genes, microbial pathogenicity genes, microbial plasmid genes, human immune system genes, immune system components, a 16S rRNA gene sequence, 18S rRNA gene sequence, 26S rRNA sequence, other genes such as ITS1, ITS2, HSP65, rpoB, or recA, and other variable genetic regions of non-human organisms. In some embodiments, the target USID sequence encompasses one or more gene sequences or portions thereof. In some embodiments, the target USID sequence encompasses the 16S rRNA gene, the adjacent Internally Transcribed Spacer (ITS) region, and part of the 23S rRNA gene. In some embodiments the target USID sequence encompasses the 16S rRNA gene. In some embodiments, the target USID sequence consists of all or part of each of the 16S rRNA gene, the ITS region, the 23S rRNA gene.
- The high throughput sequencing method can be any method, with high throughput generally meaning greater than 1000 reads per run. In some embodiments, the high throughput DNA sequencing is a next-generation sequencing (NGS) method. In some embodiments, the DNA sequencing method is Sanger sequencing or other low throughput sequencing methods. Sanger sequencing is a method of DNA sequencing based on the selective incorporation of chain-terminating dideoxynucleotides by DNA polymerase during in vitro DNA replication. In some embodiments, assembling DNA sequences into a genome employs computer implemented methods.
-
FIG. 1 depicts an example of an embodiment of the present invention used to identify, isolate, and sequence an unknown microbe in a biological sample without having to culture the organism (Steps 1-5, described below). In Step 1, a number (for example 96) biological samples are screened for novel microbes using DNA sequencing methods. The screening process in this embodiment includes lysis of cells and purification of cellular genomic DNA (gDNA) from the sample, amplification of the rRNA gene target region(s) while adding DNA barcodes to each sample so that the 96 samples can be pooled for simultaneous sequencing. The process as described in this example can be carried out using, for example, Shoreline Biome DNA Preparation and Amplicon Kits. The amplified barcoded rRNA sequences in each sample can be screened for the presence of unknown sequences that are contained in unknown microbes. - Step 1.
- The rRNA sequence data will reveal the presence of unknown microbes in the samples. Although samples may be sequenced together, the reads that originated with each sample can be identified by reading the barcode attached during PCR of the target region. The PCR target region is selected such that all microbes will generate a PCR product, for example, the target may be in the 16S, 23S, ITS, or similar genespace that is required for survival, and therefore conserved. Part or all of the sequenced PCR target can become a USID for each organism in the sample. Previously unknown USIDs are identified by mapping the DNA sequencing results to genomic databases such as GenBank. In this example, a sample with an unknown rRNA gene unique sequence ID (USID) in the ITS region between the 16S and 23S rRNA genes corresponding to an unknown microbe is identified as the sample of interest.
-
Step 2. - In one embodiment, an aliquot of a sample of interest (containing an unknown USID sequence) is diluted into wells such that only one microbe (or cell) is in each well. This is especially useful in the case of fecal microbiomes, where there is a very low percentage (˜1%) of host eukaryotic cells. Since eukaryotic genomes are approximately 1000× larger than most bacterial genomes, limiting the number of eukaryotic cells is important for downstream shotgun sequencing. The methods can also be used when the sample of interest is diluted to a limited number of microbes per well, for example, ten or fewer microbes per well. Depending upon the prevalence of the unknown microbes and contaminating eukaryotic cells in the sample, it may make more sense from a cost, speed, and efficiency standpoint to either target one cell per well (if the unknown microbes are common and eukaryotic cells are also common) or multiple cells per well (if the unknown microbes are uncommon, and eukaryotic cells are also uncommon).
- Step 3.
- The microbe(s) in each well are subjected to high throughput whole genome amplification (shown here in 8 wells of a multiwell plate) in parallel to produce high yields of purified DNA, for example using alkaline, detergent, or enzymatic lysis to open cells followed by phi29 or Bst DNA polymerase amplification methods.
- Step 4:
- A small amount of each whole genome amplified sample is analyzed for the presence of the USID corresponding to the unknown microbe. The USID sequence is amplified by PCR using primers specific for the USID. The presence of a cell in each diluted well is determined by gel sizing of the PCR product. Wells with no PCR product can be ignored for subsequent steps. Samples with PCR product corresponding to the expected size of the USID must have contained a cell with a USID similar in size to the target organism, and are selected for testing in subsequent steps. The USID base sequence can be determined by DNA sequencing and/or restriction digestion using methods known in the art. DNA sequencing will determine the full sequence of the target region PCR USID product, but can be slow and expensive. Restriction digestion of the target region PCR USID product is performed with one or more restriction enzymes that cut the desired target at specific locations, resulting in specific banding patterns on a gel. The banding patterns can be compared to those expected based on the USID sequence to identify wells containing the microbe of interest.
-
Step 5. - Whole genome amplified DNA from the well with the unknown organism/USID genetic sequence is subjected to shotgun sequencing to sequence the full genome of the unknown microbe. In the embodiment where the dilution was performed to generate one microbe per well, only the unknown microbe of interest is expected to be present. In an embodiment where more than one microbe is present, reads mapping only to known microbe(s) can be discarded prior to assembly, or an assembly can be done de novo, on all DNA within the multi-microbe mixture, to reconstruct the separate genomes of each microbe in the mix.
- All patent applications, patents and other publications cited herein are incorporated by reference in their entirety. One or more embodiments of the present invention have been described. Nevertheless, it will be understood that various modifications may be made without departing from the spirit and scope of the invention. Accordingly, other embodiments are within the scope of the following claims.
Claims (14)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/146,427 US20210130914A1 (en) | 2016-05-17 | 2021-01-11 | High Throughput Method for Identification and Sequencing of Unknown Microbial and Eukaryotic Genomes from Complex Mixtures |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201662337507P | 2016-05-17 | 2016-05-17 | |
US15/596,366 US10894990B2 (en) | 2016-05-17 | 2017-05-16 | High throughput method for identification and sequencing of unknown microbial and eukaryotic genomes from complex mixtures |
US17/146,427 US20210130914A1 (en) | 2016-05-17 | 2021-01-11 | High Throughput Method for Identification and Sequencing of Unknown Microbial and Eukaryotic Genomes from Complex Mixtures |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/596,366 Continuation US10894990B2 (en) | 2016-05-17 | 2017-05-16 | High throughput method for identification and sequencing of unknown microbial and eukaryotic genomes from complex mixtures |
Publications (1)
Publication Number | Publication Date |
---|---|
US20210130914A1 true US20210130914A1 (en) | 2021-05-06 |
Family
ID=60326113
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/596,366 Active US10894990B2 (en) | 2016-05-17 | 2017-05-16 | High throughput method for identification and sequencing of unknown microbial and eukaryotic genomes from complex mixtures |
US17/146,427 Pending US20210130914A1 (en) | 2016-05-17 | 2021-01-11 | High Throughput Method for Identification and Sequencing of Unknown Microbial and Eukaryotic Genomes from Complex Mixtures |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/596,366 Active US10894990B2 (en) | 2016-05-17 | 2017-05-16 | High throughput method for identification and sequencing of unknown microbial and eukaryotic genomes from complex mixtures |
Country Status (3)
Country | Link |
---|---|
US (2) | US10894990B2 (en) |
EP (1) | EP3452619A4 (en) |
WO (1) | WO2017201112A1 (en) |
Families Citing this family (39)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
ES2972835T3 (en) | 2015-04-10 | 2024-06-17 | 10X Genomics Sweden Ab | Multiplex analysis of biological specimens of spatially distinguished nucleic acids |
US10894990B2 (en) * | 2016-05-17 | 2021-01-19 | Shoreline Biome, Llc | High throughput method for identification and sequencing of unknown microbial and eukaryotic genomes from complex mixtures |
WO2019000254A1 (en) * | 2017-06-28 | 2019-01-03 | 中国医学科学院药用植物研究所 | Method for quality control of chinese patent medicine based on metagenome |
CN108018360B (en) * | 2017-12-15 | 2021-08-13 | 中国检验检疫科学研究院 | Edible sea cucumber species identification method based on DNA micro-bar code technology |
WO2020047010A2 (en) * | 2018-08-28 | 2020-03-05 | 10X Genomics, Inc. | Increasing spatial array resolution |
US11519033B2 (en) | 2018-08-28 | 2022-12-06 | 10X Genomics, Inc. | Method for transposase-mediated spatial tagging and analyzing genomic DNA in a biological sample |
WO2020096782A1 (en) * | 2018-11-06 | 2020-05-14 | Dowd Scot E | Universal or broad range assays and multi-tag sample specific diagnostic process using non-optical sequencing |
WO2020123311A2 (en) | 2018-12-10 | 2020-06-18 | 10X Genomics, Inc. | Resolving spatial arrays using deconvolution |
US11649485B2 (en) | 2019-01-06 | 2023-05-16 | 10X Genomics, Inc. | Generating capture probes for spatial analysis |
US11926867B2 (en) | 2019-01-06 | 2024-03-12 | 10X Genomics, Inc. | Generating capture probes for spatial analysis |
EP3976820A1 (en) | 2019-05-30 | 2022-04-06 | 10X Genomics, Inc. | Methods of detecting spatial heterogeneity of a biological sample |
EP4025711A2 (en) | 2019-11-08 | 2022-07-13 | 10X Genomics, Inc. | Enhancing specificity of analyte binding |
EP4081656A1 (en) | 2019-12-23 | 2022-11-02 | 10X Genomics, Inc. | Compositions and methods for using fixed biological samples in partition-based assays |
FI3891300T3 (en) | 2019-12-23 | 2023-05-10 | 10X Genomics Inc | Methods for spatial analysis using rna-templated ligation |
US11702693B2 (en) | 2020-01-21 | 2023-07-18 | 10X Genomics, Inc. | Methods for printing cells and generating arrays of barcoded cells |
US11732299B2 (en) | 2020-01-21 | 2023-08-22 | 10X Genomics, Inc. | Spatial assays with perturbed cells |
US12076701B2 (en) | 2020-01-31 | 2024-09-03 | 10X Genomics, Inc. | Capturing oligonucleotides in spatial transcriptomics |
US11898205B2 (en) | 2020-02-03 | 2024-02-13 | 10X Genomics, Inc. | Increasing capture efficiency of spatial assays |
US12110541B2 (en) | 2020-02-03 | 2024-10-08 | 10X Genomics, Inc. | Methods for preparing high-resolution spatial arrays |
US11732300B2 (en) | 2020-02-05 | 2023-08-22 | 10X Genomics, Inc. | Increasing efficiency of spatial analysis in a biological sample |
US12129516B2 (en) | 2020-02-07 | 2024-10-29 | 10X Genomics, Inc. | Quantitative and automated permeabilization performance evaluation for spatial transcriptomics |
US11891654B2 (en) | 2020-02-24 | 2024-02-06 | 10X Genomics, Inc. | Methods of making gene expression libraries |
ES2965354T3 (en) | 2020-04-22 | 2024-04-12 | 10X Genomics Inc | Methods for spatial analysis using targeted RNA deletion |
WO2021237087A1 (en) | 2020-05-22 | 2021-11-25 | 10X Genomics, Inc. | Spatial analysis to detect sequence variants |
EP4153775B1 (en) | 2020-05-22 | 2024-07-24 | 10X Genomics, Inc. | Simultaneous spatio-temporal measurement of gene expression and cellular activity |
WO2021242834A1 (en) | 2020-05-26 | 2021-12-02 | 10X Genomics, Inc. | Method for resetting an array |
US12031177B1 (en) | 2020-06-04 | 2024-07-09 | 10X Genomics, Inc. | Methods of enhancing spatial resolution of transcripts |
EP4421186A3 (en) | 2020-06-08 | 2024-09-18 | 10X Genomics, Inc. | Methods of determining a surgical margin and methods of use thereof |
WO2021252591A1 (en) | 2020-06-10 | 2021-12-16 | 10X Genomics, Inc. | Methods for determining a location of an analyte in a biological sample |
EP4172362B1 (en) | 2020-06-25 | 2024-09-18 | 10X Genomics, Inc. | Spatial analysis of dna methylation |
US11761038B1 (en) | 2020-07-06 | 2023-09-19 | 10X Genomics, Inc. | Methods for identifying a location of an RNA in a biological sample |
US11981960B1 (en) | 2020-07-06 | 2024-05-14 | 10X Genomics, Inc. | Spatial analysis utilizing degradable hydrogels |
US11981958B1 (en) | 2020-08-20 | 2024-05-14 | 10X Genomics, Inc. | Methods for spatial analysis using DNA capture |
US11926822B1 (en) | 2020-09-23 | 2024-03-12 | 10X Genomics, Inc. | Three-dimensional spatial analysis |
US11827935B1 (en) | 2020-11-19 | 2023-11-28 | 10X Genomics, Inc. | Methods for spatial analysis using rolling circle amplification and detection probes |
WO2022140028A1 (en) | 2020-12-21 | 2022-06-30 | 10X Genomics, Inc. | Methods, compositions, and systems for capturing probes and/or barcodes |
EP4347879A1 (en) | 2021-06-03 | 2024-04-10 | 10X Genomics, Inc. | Methods, compositions, kits, and systems for enhancing analyte capture for spatial analysis |
CN113755613A (en) * | 2021-08-20 | 2021-12-07 | 浙江省海洋水产养殖研究所 | Technology for high-throughput analysis of bacterial symbiosis, microbial pollution and strain identification |
EP4196605A1 (en) | 2021-09-01 | 2023-06-21 | 10X Genomics, Inc. | Methods, compositions, and kits for blocking a capture probe on a spatial array |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6054278A (en) * | 1997-05-05 | 2000-04-25 | The Perkin-Elmer Corporation | Ribosomal RNA gene polymorphism based microorganism identification |
US10894990B2 (en) * | 2016-05-17 | 2021-01-19 | Shoreline Biome, Llc | High throughput method for identification and sequencing of unknown microbial and eukaryotic genomes from complex mixtures |
Family Cites Families (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5001050A (en) | 1989-03-24 | 1991-03-19 | Consejo Superior Investigaciones Cientificas | PHφ29 DNA polymerase |
US5198543A (en) | 1989-03-24 | 1993-03-30 | Consejo Superior Investigaciones Cientificas | PHI29 DNA polymerase |
US5270179A (en) | 1989-08-10 | 1993-12-14 | Life Technologies, Inc. | Cloning and expression of T5 DNA polymerase reduced in 3'- to-5' exonuclease activity |
JP2526747B2 (en) | 1991-05-21 | 1996-08-21 | 信越化学工業株式会社 | Epoxy resin composition and semiconductor device |
US6124120A (en) | 1997-10-08 | 2000-09-26 | Yale University | Multiple displacement amplification |
EP2163652B2 (en) | 2002-01-28 | 2023-05-31 | Life Technologies Corporation | Preparing crude biological extracts using protease, suitable for preparing cDNA |
AU2002324326A1 (en) | 2002-07-25 | 2004-02-16 | Indian Oil Corporation Limited | Corrosion inhibitor/metal passivator additive composition from waste refinery streams |
WO2010021936A1 (en) | 2008-08-16 | 2010-02-25 | The Board Of Trustees Of The Leland Stanford Junior University | Digital pcr calibration for high throughput sequencing |
EP3133169B1 (en) | 2009-08-25 | 2019-10-16 | Illumina, Inc. | Methods for selecting and amplifying polynucleotides |
CN103890191B (en) | 2011-05-27 | 2018-12-04 | 哈佛大学校长及研究员协会 | Single cell whole genome amplification method |
WO2013033721A1 (en) | 2011-09-02 | 2013-03-07 | Atreca, Inc. | Dna barcodes for multiplexed sequencing |
US10167520B2 (en) | 2011-12-06 | 2019-01-01 | Scot E. Dowd | Universal or broad range assays and multi-tag sample specific diagnostic process using non-optical sequencing |
US20140378345A1 (en) * | 2012-08-14 | 2014-12-25 | 10X Technologies, Inc. | Compositions and methods for sample processing |
US20160040215A1 (en) | 2013-03-14 | 2016-02-11 | Seres Therapeutics, Inc. | Methods for Pathogen Detection and Enrichment from Materials and Compositions |
CN105407728A (en) | 2013-07-21 | 2016-03-16 | 霍勒拜欧姆公司 | Methods and systems for microbiome characterization, monitoring and treatment |
CA2948134C (en) | 2014-05-06 | 2023-03-14 | Is-Diagnostics Ltd. | Microbial population analysis |
RU2736728C2 (en) | 2014-10-17 | 2020-11-19 | Иллумина Кембридж Лимитед | Transposition with preservation of gene adhesion |
SG11201706504RA (en) | 2015-02-10 | 2017-09-28 | Illumina Inc | Methods and compositions for analyzing cellular components |
-
2017
- 2017-05-16 US US15/596,366 patent/US10894990B2/en active Active
- 2017-05-17 EP EP17800055.0A patent/EP3452619A4/en active Pending
- 2017-05-17 WO PCT/US2017/033008 patent/WO2017201112A1/en unknown
-
2021
- 2021-01-11 US US17/146,427 patent/US20210130914A1/en active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6054278A (en) * | 1997-05-05 | 2000-04-25 | The Perkin-Elmer Corporation | Ribosomal RNA gene polymorphism based microorganism identification |
US10894990B2 (en) * | 2016-05-17 | 2021-01-19 | Shoreline Biome, Llc | High throughput method for identification and sequencing of unknown microbial and eukaryotic genomes from complex mixtures |
Also Published As
Publication number | Publication date |
---|---|
US10894990B2 (en) | 2021-01-19 |
US20170335410A1 (en) | 2017-11-23 |
WO2017201112A1 (en) | 2017-11-23 |
EP3452619A1 (en) | 2019-03-13 |
EP3452619A4 (en) | 2019-12-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20210130914A1 (en) | High Throughput Method for Identification and Sequencing of Unknown Microbial and Eukaryotic Genomes from Complex Mixtures | |
EP3320110B1 (en) | 16s ribosomal rna universal primers and use thereof in microbiological analysis and diagnostics | |
Hawkins et al. | Whole genome amplification—applications and advances | |
US9850525B2 (en) | CAS9-based isothermal method of detection of specific DNA sequence | |
JP5806841B2 (en) | Bacterial strain detection and typing | |
JP2020522243A (en) | Multiplexed end-tagging amplification of nucleic acids | |
Milligan et al. | Evolution of a thermophilic strand-displacing polymerase using high-temperature isothermal compartmentalized self-replication | |
US20150354015A1 (en) | Method for direct amplification from crude nucleic acid samples | |
US20170166956A1 (en) | Methods for DNA Preparation for Multiplex High Throughput Targeted Sequencing | |
US20150159207A1 (en) | Use of markers including nucleotide sequence based codes to monitor methods of detection and identification of genetic material | |
JP7071341B2 (en) | How to identify a sample | |
US10011866B2 (en) | Nucleic acid ligation systems and methods | |
KR20110126925A (en) | Pcr primer for amplifying 5'end region of mitochondrial cytochrome oxidase subunit i gene used for dna barcoding of scale insect | |
Santosa | Rapid extraction and purification of environmental DNA for molecular cloning applications and molecular diversity studies | |
CA2987414A1 (en) | Molecular detection of rna | |
US11345959B2 (en) | Method for exploring useful genetic resources through bulk metagenome analysis and use thereof | |
US5393657A (en) | Detection of residual host cell DNA by PCR | |
CN103667252A (en) | Nucleic acid amplification method | |
Etebu | Potential panacea to the complexities of polymerase chain reaction (PCR) | |
Pepper et al. | Detection of specific DNA sequences in environmental samples via polymerase chain reaction | |
KR101811737B1 (en) | Method for Screening Useful Gene Products via Metagenomics-based Mega-throughput Screening System and Uses Thereof | |
Perumal et al. | A combined computational-experimental analyses of selected metabolic enzymes in Pseudomonas species | |
CA2662024C (en) | Thermostablization of dna polymerase by protein folding pathway from a hyperthermophile archaeon, pyrococcus furiosus | |
CN115803433A (en) | Thermostable ligases with reduced sequence bias | |
Do | Analysis Of Yeast DNA Sequences To Determine Yeast Strains Unique To Texas That Are Optimal For The Production Of Craft Distilled Spirits |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
AS | Assignment |
Owner name: VC23 INVESTORS LLC, CONNECTICUT Free format text: SECURITY INTEREST;ASSIGNOR:SHORELINE BIOME, LLC;REEL/FRAME:058859/0950 Effective date: 20220122 Owner name: CONNECTICUT INNOVATIONS, INCORPORATED, CONNECTICUT Free format text: SECURITY INTEREST;ASSIGNOR:SHORELINE BIOME, LLC;REEL/FRAME:058859/0950 Effective date: 20220122 Owner name: JARVIE, THOMAS P, CONNECTICUT Free format text: SECURITY INTEREST;ASSIGNOR:SHORELINE BIOME, LLC;REEL/FRAME:058859/0950 Effective date: 20220122 Owner name: DENSLOW, PAUL, CALIFORNIA Free format text: SECURITY INTEREST;ASSIGNOR:SHORELINE BIOME, LLC;REEL/FRAME:058859/0950 Effective date: 20220122 |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: FINAL REJECTION MAILED |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |