US20230313192A1 - Positional Delivery and Encoding by Oligonucleotides of Biological Cells for Single Cell Sequencing (POS SEQ) - Google Patents
Positional Delivery and Encoding by Oligonucleotides of Biological Cells for Single Cell Sequencing (POS SEQ) Download PDFInfo
- Publication number
- US20230313192A1 US20230313192A1 US18/207,585 US202318207585A US2023313192A1 US 20230313192 A1 US20230313192 A1 US 20230313192A1 US 202318207585 A US202318207585 A US 202318207585A US 2023313192 A1 US2023313192 A1 US 2023313192A1
- Authority
- US
- United States
- Prior art keywords
- cells
- biological sample
- molecular probes
- molecular
- single cell
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108091034117 Oligonucleotide Proteins 0.000 title claims abstract description 43
- 238000012163 sequencing technique Methods 0.000 title claims abstract description 30
- 238000012384 transportation and delivery Methods 0.000 title abstract description 24
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 title abstract description 5
- 239000012472 biological sample Substances 0.000 claims abstract description 71
- 239000003068 molecular probe Substances 0.000 claims abstract description 69
- 238000000034 method Methods 0.000 claims abstract description 44
- 239000000523 sample Substances 0.000 claims description 15
- 238000003752 polymerase chain reaction Methods 0.000 claims description 14
- 238000004113 cell culture Methods 0.000 claims description 6
- 238000005259 measurement Methods 0.000 claims description 6
- 238000012174 single-cell RNA sequencing Methods 0.000 abstract 1
- 210000004027 cell Anatomy 0.000 description 119
- 238000010586 diagram Methods 0.000 description 20
- 239000002299 complementary DNA Substances 0.000 description 19
- 241000713869 Moloney murine leukemia virus Species 0.000 description 18
- 102000020313 Cell-Penetrating Peptides Human genes 0.000 description 17
- 108010051109 Cell-Penetrating Peptides Proteins 0.000 description 17
- 239000011324 bead Substances 0.000 description 16
- 238000012545 processing Methods 0.000 description 15
- 108020004414 DNA Proteins 0.000 description 14
- 102000053602 DNA Human genes 0.000 description 14
- 230000006870 function Effects 0.000 description 14
- 239000007788 liquid Substances 0.000 description 10
- 239000011859 microparticle Substances 0.000 description 10
- 230000008569 process Effects 0.000 description 10
- 108020004999 messenger RNA Proteins 0.000 description 9
- 210000001519 tissue Anatomy 0.000 description 8
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 7
- 238000010276 construction Methods 0.000 description 7
- 239000013615 primer Substances 0.000 description 7
- 241001430294 unidentified retrovirus Species 0.000 description 7
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 6
- 229920002477 rna polymer Polymers 0.000 description 6
- 241000713666 Lentivirus Species 0.000 description 5
- 238000007726 management method Methods 0.000 description 5
- 108090000623 proteins and genes Proteins 0.000 description 5
- 102100034343 Integrase Human genes 0.000 description 4
- 230000005540 biological transmission Effects 0.000 description 4
- 238000004590 computer program Methods 0.000 description 4
- 230000003993 interaction Effects 0.000 description 4
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 3
- 239000003814 drug Substances 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 230000003287 optical effect Effects 0.000 description 3
- 230000008520 organization Effects 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 238000001890 transfection Methods 0.000 description 3
- 108091026890 Coding region Proteins 0.000 description 2
- 101000692455 Homo sapiens Platelet-derived growth factor receptor beta Proteins 0.000 description 2
- 102100028123 Macrophage colony-stimulating factor 1 Human genes 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- 102100026547 Platelet-derived growth factor receptor beta Human genes 0.000 description 2
- 239000007864 aqueous solution Substances 0.000 description 2
- 238000003491 array Methods 0.000 description 2
- 201000011510 cancer Diseases 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 239000000470 constituent Substances 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000001476 gene delivery Methods 0.000 description 2
- 239000011521 glass Substances 0.000 description 2
- 238000000126 in silico method Methods 0.000 description 2
- 239000003112 inhibitor Substances 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000006855 networking Effects 0.000 description 2
- 230000003472 neutralizing effect Effects 0.000 description 2
- 230000001902 propagating effect Effects 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 229940124597 therapeutic agent Drugs 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical group CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 238000010361 transduction Methods 0.000 description 2
- 230000026683 transduction Effects 0.000 description 2
- 210000004881 tumor cell Anatomy 0.000 description 2
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 108091011896 CSF1 Proteins 0.000 description 1
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 101710203526 Integrase Proteins 0.000 description 1
- 108010046938 Macrophage Colony-Stimulating Factor Proteins 0.000 description 1
- 206010061309 Neoplasm progression Diseases 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 108091008606 PDGF receptors Proteins 0.000 description 1
- 102000011653 Platelet-Derived Growth Factor Receptors Human genes 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000003466 anti-cipated effect Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 238000001574 biopsy Methods 0.000 description 1
- 230000009172 bursting Effects 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000008568 cell cell communication Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 229910052802 copper Inorganic materials 0.000 description 1
- 239000010949 copper Substances 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 238000012517 data analytics Methods 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 238000010195 expression analysis Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 210000002540 macrophage Anatomy 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000001000 micrograph Methods 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 230000003076 paracrine Effects 0.000 description 1
- 210000003668 pericyte Anatomy 0.000 description 1
- 239000012466 permeate Substances 0.000 description 1
- 238000013439 planning Methods 0.000 description 1
- 229920000447 polyanionic polymer Polymers 0.000 description 1
- 229920001690 polydopamine Polymers 0.000 description 1
- 238000011176 pooling Methods 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 102000004169 proteins and genes Human genes 0.000 description 1
- 238000001959 radiotherapy Methods 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 238000013468 resource allocation Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000005751 tumor progression Effects 0.000 description 1
- 239000013598 vector Substances 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1096—Processes for the isolation, preparation or purification of DNA or RNA cDNA Synthesis; Subtracted cDNA library construction, e.g. RT, RT-PCR
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6806—Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
-
- C—CHEMISTRY; METALLURGY
- C40—COMBINATORIAL TECHNOLOGY
- C40B—COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
- C40B20/00—Methods specially adapted for identifying library members
- C40B20/04—Identifying library members by means of a tag, label, or other readable or detectable entity associated with the library members, e.g. decoding processes
Definitions
- the present invention relates to single cell sequencing, and more particularly, to techniques for positional delivery and position encoding by oligonucleotides of biological cells for single cell ribonucleic acid (RNA) sequencing, i.e., positional sequencing (POS SEQ).
- RNA ribonucleic acid
- POS SEQ positional sequencing
- the successful functioning of multi-cellular organisms relies on the coordinated functions of a multitude of molecular constituents from individual cells and the interactions among functionally distinct cells. Further, these molecular constituents are constantly changing such as in response to cell-to-cell interactions which oftentimes result from local physical cell-to-cell contact and/or from short length-scale paracrine cell-to-cell communications. Thus, the state of a biological system is often defined by the relative position of the cells in the system and the highly dimensional molecular composition of each of those cells.
- PDGFR platelet-derived growth factor receptor
- CSF1 colony-stimulating factor 1
- spatial and molecular measurements are made using image-based analysis where molecular and positional information is obtained by taking microscopy images of samples treated with either enzymatically- or fluorescently-labeled antibodies that bind specifically to the molecular target of interest.
- the sensor pixel position reflects the spatial relationship of the cells, while the sensor pixel signal intensity reflects the local density of the labeled antibodies molecular target of interest.
- the present invention provides techniques for positional delivery and position encoding by oligonucleotides of biological cells for single cell ribonucleic acid (RNA) sequencing (POS SEQ).
- RNA ribonucleic acid
- POS SEQ single cell ribonucleic acid
- a method of positional delivery and encoding of cells in a biological sample includes: encoding the cells in the biological sample for single cell sequencing by delivering molecular probes inside the cells that encode a position of the cells in the biological sample.
- another method of positional delivery and encoding of cells in a biological sample includes: constructing a cDNA library of molecular probes that encode a position of cells in a biological sample; linking the molecular probes to a vessel; delivering the vessel with the molecular probes to specific locations of the biological sample where the vessel delivers the molecular probes inside the cells at the specific locations; extracting the cells containing the molecular probes from the sample; and performing single cell sequencing of the extracted cells.
- a system for positional delivery and encoding of cells in a biological sample includes: a processor device, connected to a memory, that is implemented to: analyze data from single cell sequencing of cells along with molecular probes, that have been delivered inside the cells, which uniquely encode a position of the cells in a biological sample.
- FIG. 1 is a diagram illustrating an exemplary methodology of positional delivery and encoding of cells in a biological sample for single cell sequencing according to an embodiment of the present invention
- FIG. 2 is a diagram illustrating an exemplary barcoded deoxyribonucleic acid (DNA) oligonucleotide primer molecule according to an embodiment of the present invention
- FIG. 3 is a diagram illustrating an exemplary methodology for constructing a cDNA library using Moloney murine leukemia virus reverse transcriptase (MMLV RT) technology according to an embodiment of the present invention
- FIG. 4 is a diagram illustrating an exemplary methodology for a ‘template switch’ by the MMLV RT using a template switch oligonucleotide (TSO) sequence according to an embodiment of the present invention
- FIG. 5 is a diagram illustrating use of a lentiviruses as the vessel for delivering a location-specific molecular probe inside the cells of a biological sample according to an embodiment of the present invention
- FIG. 6 is a diagram illustrating use of a disulfide-linked cell-penetrating peptide (CPP) as the vessel for delivering a location-specific molecular probe inside the cells of a biological sample according to an embodiment of the present invention
- CPP disulfide-linked cell-penetrating peptide
- FIG. 7 is a diagram illustrating use of bead micro-particles as the vessel for delivering a location-specific molecular probe inside the cells of a biological sample according to an embodiment of the present invention
- FIG. 8 is a diagram illustrating an exemplary methodology for using a liquid cargo delivery device to deliver the vessels with unique molecular probes to specific locations of the biological sample, where the vessel delivers the location-specific molecular probes inside the cells at those locations for single cell sequencing according to an embodiment of the present invention
- FIG. 9 is a diagram illustrating an exemplary apparatus for performing one or more of the methodologies presented herein according to an embodiment of the present invention.
- FIG. 10 depicts a cloud computing environment according to an embodiment of the present invention.
- FIG. 11 depicts abstraction model layers according to an embodiment of the present invention.
- a molecular probe having a unique oligonucleotide sequence that encodes a positioning of the cells within a biological sample such as a cell culture (e.g., a cell culture including living eukaryotic and/or prokaryotic cells) and/or a tissue sample (e.g., a biopsy, formalin-fixed paraffin-embedded (“FFPE”) and/or frozen tissue containing living cells).
- a biological sample such as a cell culture (e.g., a cell culture including living eukaryotic and/or prokaryotic cells) and/or a tissue sample (e.g., a biopsy, formalin-fixed paraffin-embedded (“FFPE”) and/or frozen tissue containing living cells).
- FFPE formalin-fixed paraffin-embedded
- the molecular probe encodes the position at which each of the cells being sequenced is located within the biological sample.
- the molecular probe is first linked to a vessel such as a retrovirus, disulfide-linked cell-penetrating peptide (CPP) and/or bead micro-particle.
- a liquid cargo delivery device such as a microfluidic probe (MFP) is then used to deliver the molecular probe/vessel to specific locations of the biological sample.
- MFP microfluidic probe
- the vessel By way of the vessel, the molecular probes with unique nucleotide sequences are delivered inside the cells at those specific locations of the biological sample.
- these oligonucleotide sequences are in effect a label of the position of a given cell in the biological sample.
- a unique oligonucleotide sequence is delivered inside the cells at that location in the biological sample.
- a complementary deoxyribonucleic acid (cDNA) library of molecular probes containing unique oligonucleotide sequences is constructed.
- cDNA complementary deoxyribonucleic acid
- the library construction leverages the template-switching activity of Moloney murine leukemia virus reverse transcriptase (“MMLV RT”).
- the molecular probes with unique oligonucleotide sequences are then linked to a particular vessel such as a retrovirus, coupled to a disulfide-linked cell-penetrating peptide (CPP) or a bead micro-particle.
- a particular vessel such as a retrovirus, coupled to a disulfide-linked cell-penetrating peptide (CPP) or a bead micro-particle.
- CPP cell-penetrating peptide
- a bead micro-particle This vessel will enable the molecular probes to be delivered inside the cells of a biological sample.
- the cells can be uniquely identified—even when disassociated from the biological sample—due to the unique oligonucleotide sequences carried by the molecular probes.
- the vessels with the molecular probes are delivered to specific locations of the biological sample (e.g., a living cell culture and/or tissue sample with living cells), where the vessels deliver the molecular probes inside the cells at those specific locations.
- this location-specific delivery is accomplished using a liquid cargo delivery device such a microfluidic probe or MFP.
- a microfluidic probe is a non-contact, scanning platform that can hydrodynamically localize as little as 100 picoliters of a liquid cargo with micrometer precision.
- the molecular probes can be dispersed in a processing solution (e.g., an aqueous solution) that is then delivered via the liquid cargo delivery device to specific locations of the biological sample.
- the molecular probes delivered to a given specific location of the biological sample contain a unique oligonucleotide sequence that is associated with that given specific location.
- the oligonucleotide sequence encodes the original position of the cells in the sample (i.e., positional encoding).
- the vessels deliver the molecular probes inside the cells at those specific locations.
- the cells take in the vessels with the molecular probes through an active transfection/transduction process using living cell machinery.
- the present techniques are preferably performed with a living biological system.
- the biological sample preferably contains living cells, whether as a living cell culture or as a tissue sample containing living cells. The living cells permit transfection/transduction to occur. Following the present positional encoding process, the cells/tissue can be fixed if so desired.
- step 108 single cell sequencing is performed on the cells extracted from the biological sample. Even though the cells are disassociated from the biological sample for sequencing, the cells now contain the molecular probe with oligonucleotide sequence encoding the position of the cells in the biological sample. Thus, this positional data can be retained through the sequencing process.
- step 110 the data from the single cell sequencing is stored and analyzed (e.g., in silico) along with the data from the molecular probes which uniquely encodes the positions of the cells in the biological sample.
- An exemplary apparatus for storing and analyzing this data is provided in FIG. 9 , described below.
- Being able to analyze transcriptomic data (i.e., RNA transcripts produced by a genome) from the cells along with the position of those cells in the biological sample is extremely beneficial. For instance, as highlighted above, the state of a biological system is often defined by the relative position of the cells in the system and the highly dimensional molecular composition of each of those cells. Take, for example, the development of therapeutic agents for cancer treatment that leverage cell positioning to target specific tumor cell subpopulations. See above.
- the process begins with the construction of a cDNA library of molecular probes containing unique oligonucleotide sequences for positional encoding.
- the cDNA library construction begins with many barcoded DNA oligonucleotide primer molecules 204 conjugated to a microparticle 202 (e.g., a bead) such as a glass bead.
- each DNA oligonucleotide primer molecule 204 contains a universal polymerase chain reaction (PCR) handle 204 a , a cell barcode 204 b , a unique molecular identifier (UMI) 204 c , a position code 204 d , and a poly T sequence 204 e (i.e., Tn—a sequence of n thymine repeats).
- PCR handle 204 a enables PCR amplification.
- PCR handle 204 a is a DNA oligonucleotide sequence for PCR primers in the amplification step (see, e.g., FIG. 4 —described below).
- Cell barcode 204 b is a DNA oligonucleotide sequence that is unique to bead 202 /cell into which the molecular probe is delivered.
- UMI 204 c is a DNA oligonucleotide sequence that is unique to this particular DNA oligonucleotide primer molecule 204 .
- the DNA oligonucleotide primer molecules attached to the same bead 202 can share the same cell barcode, but different UMIs.
- the UMIs of each DNA oligonucleotide primer molecules has a different, unique oligonucleotide sequence.
- the UMIs can be used for normalizing gene counts during computational data processing.
- the UMIs can be used to identify PCR duplicates during the single cell sequencing (see below).
- the position code 204 d provides the (location-specific) oligonucleotide sequence for positional encoding. Namely, as described above, the position code 204 d uses a unique oligonucleotide sequence to encode the location (x,y) of cells in a biological sample into which the present molecular probes will be delivered. A length of the position code 204 d can depend on the total number of locations (x,y) to be encoded. For example, according to an exemplary embodiment, the length of the position code 204 d is determined as follows,
- L represents the length of the position code 204 d (i.e., the number of nucleotides that make up the position code 204 d ), and wherein N represents the total number of locations (x,y) to be encoded.
- the library and/or library construction (such as the generation of the location-specific oligonucleotide sequence for positional encoding) can optionally be provided as a service in a cloud environment.
- the cDNA library is constructed using MMLV RT. See, for example, FIG. 3 where an endogenous messenger RNA (mRNA) template 302 hybridizes with DNA oligonucleotide 204 , and MMLV RT 304 synthesizes a DNA complement (cDNA) to the mRNA template 302 , and then appends a poly cytosine (C) sequence to the newly synthesized cDNA sequence.
- mRNA messenger RNA
- cDNA DNA complement
- C poly cytosine
- mRNA template 302 of the i th gene includes a generic 5′ CAP 302 a , a gene-specific coding region g i 302 b , and a poly A tail 302 c (i.e., a sequence of m adenine (A) repeats).
- the mRNA template 302 hybridizes with the 3′ poly T sequence 204 e of DNA oligonucleotide 204 , and the MMLV RT 304 synthesizes a DNA complement (see for example gene-specific coding region f i 204 f ) to the mRNA template 302 .
- This new cDNA sequence is now given reference numeral 204 ′.
- MMLV RT 304 then appends cDNA sequence 204 ′ with poly C sequence 204 g.
- a template switch is performed where a template switch oligonucleotide (TSO) sequence 402 is hybridized with the cDNA sequence 204 ′, after which the MMLV RT 304 performs the ‘template switch’ in which MMLV RT 304 uses the TSO sequence 402 as a template for replication.
- TSO sequence 402 includes two groups, an oligonucleotide code 402 a that one wants to append to the mRNA template 302 , and a poly ribosomal Guanine (rG) repeat sequence 402 b .
- the oligonucleotide code 402 a is a PCR handle. As provided above, a PCR handle enables PCR amplification.
- the poly rG sequence 402 b of TSO 402 hybridizes with the poly C sequence 204 g (that was appended to the cDNA sequence 204 ′ by MMLV RT 304 — see above). Doing so enables the MMLV RT 304 to then use TSO 402 as a template for replication. For instance, as shown in step 410 MMLV RT 304 appends a PCR handle 204 h to the poly C sequence 204 g at the 3′ end of cDNA sequence 204 ′.
- the cDNA sequence 204 ′ is then separated from the mRNA template 302 /TSO 402 .
- the cDNA sequence 204 ′ can be separated from the mRNA template 302 /TSO 402 by ribonuclease H activity of the MMLV RT technology and/or through the use of RNA degradation by sodium hydroxide (NaOH) and heat.
- NaOH sodium hydroxide
- the result is a molecular probe with a unique oligonucleotide sequence (i.e., position code 204 d ) that encodes positional data.
- each molecular probe is location-specific, meaning that it contains an oligonucleotide sequence position code 204 d that is unique to a specific location of a biological sample.
- the molecular probes are then delivered inside the cells at specific locations of the biological sample corresponding to the oligonucleotide sequence position code 204 d each of molecular probe carries.
- the cDNA sequence 204 ′/molecular probes can be amplified by PCR.
- the molecular probes with unique oligonucleotide sequences are then linked to a vessel such as a retrovirus, coupled to disulfide-linked cell-penetrating peptide (CPP) or bead micro-particle which will permit transfer of molecular probes into the cells at specific locations of the biological sample.
- a vessel such as a retrovirus
- CPP cell-penetrating peptide
- bead micro-particle which will permit transfer of molecular probes into the cells at specific locations of the biological sample.
- retroviruses such as lentiviruses like the MMLV can be employed as the vessel. See FIG. 5 .
- Lentiviruses such as MMLV are advantageous as gene delivery vehicles because they are able to stably integrate into the genome of cells.
- lentiviruses have the distinguishing property of being able to insert genetic material into both dividing and non-dividing cells.
- the process for modifying retroviruses such as lentiviruses for use as vectors for gene delivery into cells is well known to those of ordinary skill in the art.
- a disulfide-linked cell-penetrating peptide (CPP) or activatable cell-penetrating peptide (ACCP) is also a suitable vessel for transferring the molecular probes into the cells of the biological sample when the sample is live cells or tissue containing live cells. See FIG. 6 .
- CPP are biocarriers that are able to penetrate biological membranes and thus translocate into cells, thereby permitting the cells to internalize different cargo molecules.
- the CPPs are short polycations attached via protease-cleavable linkers to neutralizing polyanions.
- the disulfide-linked CPP-to-oligonucleotides complexes are non-permanent in the reducing environment within the cells.
- the disulfide bond between the CPP biocarrier molecule and the molecular probe can be cleaved.
- CPP molecules as biocarriers see, for example, Gagat et al., “Cell-penetrating peptides and their utility in genome function modifications (Review),” International Journal of Molecular Medicine 40: 1615-1623 (October 2017), the contents of which are incorporated by refence as if fully set forth herein.
- bead micro-particles 702 are also a suitable vessel for transferring the molecular probes into the cells of the biological sample. See FIG. 7 .
- Beads micro-particles 702 are able to permeate the biological membranes of cells by a process called bead transfection.
- micro-particle beads such as glass beads can first be incubated in a solution containing the molecular probes.
- the micro-particle beads now conjugated with molecular probes can then be introduced into the cells using a process such as electroporation.
- the bead micro-particles 702 are the same as bead 202 described above (see FIG. 2 ).
- Other vessel delivery mechanisms (such as a retrovirus, coupled to a disulfide-linked CPP, etc.) are needed because some mechanisms are better than others, depending on if they are being used for tissues or cells.
- a liquid cargo delivery device such as a microfluidic probe is employed to deliver the vessels with unique molecular probes to specific locations of the biological sample, where the vessel delivers the location-specific molecular probes inside the cells at those locations. See, for example, FIG. 8 .
- a liquid cargo delivery device 802 (such as a microfluidic probe) scans the surface of a biological sample 804 and deposits the vessels with unique molecular probes at specific locations (x,y) in the biological sample 804 .
- a microfluidic probe is a non-contact, scanning platform that can hydrodynamically localize as little as 100 picoliters of a liquid cargo with micrometer precision.
- the liquid cargo delivery device 802 dispenses a controlled amount of a processing solution (e.g., an aqueous solution) containing the vessels/molecular probes at multiple locations (i.e., (x 1 ,y 1 ), (x 1 ,y 2 ), (x 1 ,y 3 ), etc.) in the biological sample 804 . See step 812 .
- a processing solution e.g., an aqueous solution
- the vessel/molecular probe is delivered to a specific location of the biological sample 804 , the vessel delivers the molecular probes inside the cell(s) 806 at that specific location.
- the cells 806 are extracted from the biological sample 804 . See step 814 . However, even after being disassociated from the biological sample 804 , the individual cells 806 retain the molecular probe with oligonucleotide sequence encoding the original position of the cells 806 in the biological sample 804 . Thus, this positional data can be retained through the subsequent sequencing process. See step 816 .
- one or more single cell sequencing techniques can be performed. Suitable single cell sequencing techniques include, but are not limited to, drop-seq, seq-well, cyto-seq, and combinations thereof.
- the single cell sequencing performed in step 816 can be used to identify the subject cell by the cell barcode (see above), the original position of the cells 806 within the biological sample 804 via the unique, location-specific oligonucleotide sequence of the molecular probes, and/or transcriptome information of the cells 806 .
- the combination of the present positional delivery and encoding process with extraction and single cell sequencing can collect concomitant spatial and molecular measurements (e.g., position coordinates and transcriptomes of one or more of the cells 806 in the biological sample 804 ) which, as described in conjunction with the description of step 110 of methodology 100 above, can be recorded and/or analyzed in silico.
- concomitant spatial and molecular measurements e.g., position coordinates and transcriptomes of one or more of the cells 806 in the biological sample 804 .
- the present invention may be a system, a method, and/or a computer program product.
- the computer program product may include a computer readable storage medium (or media) having computer readable program instructions thereon for causing a processor to carry out aspects of the present invention.
- the computer readable storage medium can be a tangible device that can retain and store instructions for use by an instruction execution device.
- the computer readable storage medium may be, for example, but is not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing.
- a non-exhaustive list of more specific examples of the computer readable storage medium includes the following: a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), a static random access memory (SRAM), a portable compact disc read-only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon, and any suitable combination of the foregoing.
- RAM random access memory
- ROM read-only memory
- EPROM or Flash memory erasable programmable read-only memory
- SRAM static random access memory
- CD-ROM compact disc read-only memory
- DVD digital versatile disk
- memory stick a floppy disk
- a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon
- a computer readable storage medium is not to be construed as being transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission media (e.g., light pulses passing through a fiber-optic cable), or electrical signals transmitted through a wire.
- Computer readable program instructions described herein can be downloaded to respective computing/processing devices from a computer readable storage medium or to an external computer or external storage device via a network, for example, the Internet, a local area network, a wide area network and/or a wireless network.
- the network may comprise copper transmission cables, optical transmission fibers, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers.
- a network adapter card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium within the respective computing/processing device.
- Computer readable program instructions for carrying out operations of the present invention may be assembler instructions, instruction-set-architecture (ISA) instructions, machine instructions, machine dependent instructions, microcode, firmware instructions, state-setting data, or either source code or object code written in any combination of one or more programming languages, including an object oriented programming language such as Smalltalk, C++ or the like, and conventional procedural programming languages, such as the “C” programming language or similar programming languages.
- the computer readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server.
- the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider).
- electronic circuitry including, for example, programmable logic circuitry, field-programmable gate arrays (FPGA), or programmable logic arrays (PLA) may execute the computer readable program instructions by utilizing state information of the computer readable program instructions to personalize the electronic circuitry, in order to perform aspects of the present invention.
- These computer readable program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
- These computer readable program instructions may also be stored in a computer readable storage medium that can direct a computer, a programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer readable storage medium having instructions stored therein comprises an article of manufacture including instructions which implement aspects of the function/act specified in the flowchart and/or block diagram block or blocks.
- the computer readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other device to cause a series of operational steps to be performed on the computer, other programmable apparatus or other device to produce a computer implemented process, such that the instructions which execute on the computer, other programmable apparatus, or other device implement the functions/acts specified in the flowchart and/or block diagram block or blocks.
- each block in the flowchart or block diagrams may represent a module, segment, or portion of instructions, which comprises one or more executable instructions for implementing the specified logical function(s).
- the functions noted in the block may occur out of the order noted in the figures.
- two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved.
- apparatus 900 can be configured to implement one or more of the steps of methodology 100 of FIG. 1 .
- apparatus 900 may be configured to store and/or analyze the transcriptomic data extracted from the single cell sequencing along with the unique positional data obtained from the molecular probes indicating the original positioning of the cells in the biological sample.
- Apparatus 900 includes a computer system 910 and removable media 950 .
- Computer system 910 includes a processor device 920 , a network interface 925 , a memory 930 , a media interface 935 and an optional display 940 .
- Network interface 925 allows computer system 910 to connect to a network
- media interface 935 allows computer system 910 to interact with media, such as a hard drive or removable media 950 .
- Processor device 920 can be configured to implement the methods, steps, and functions disclosed herein.
- the memory 930 could be distributed or local and the processor device 920 could be distributed or singular.
- the memory 930 could be implemented as an electrical, magnetic or optical memory, or any combination of these or other types of storage devices.
- the term “memory” should be construed broadly enough to encompass any information able to be read from, or written to, an address in the addressable space accessed by processor device 920 . With this definition, information on a network, accessible through network interface 925 , is still within memory 930 because the processor device 920 can retrieve the information from the network.
- each distributed processor that makes up processor device 920 generally contains its own addressable memory space.
- some or all of computer system 910 can be incorporated into an application-specific or general-use integrated circuit.
- Optional display 940 is any type of display suitable for interacting with a human user of apparatus 900 .
- display 940 is a computer monitor or other similar display.
- Cloud computing is a model of service delivery for enabling convenient, on-demand network access to a shared pool of configurable computing resources (e.g., networks, network bandwidth, servers, processing, memory, storage, applications, virtual machines, and services) that can be rapidly provisioned and released with minimal management effort or interaction with a provider of the service.
- This cloud model may include at least five characteristics, at least three service models, and at least four deployment models.
- On-demand self-service a cloud consumer can unilaterally provision computing capabilities, such as server time and network storage, as needed automatically without requiring human interaction with the service's provider.
- Resource pooling the provider's computing resources are pooled to serve multiple consumers using a multi-tenant model, with different physical and virtual resources dynamically assigned and reassigned according to demand. There is a sense of location independence in that the consumer generally has no control or knowledge over the exact location of the provided resources but may be able to specify location at a higher level of abstraction (e.g., country, state, or datacenter).
- Rapid elasticity capabilities can be rapidly and elastically provisioned, in some cases automatically, to quickly scale out and rapidly released to quickly scale in. To the consumer, the capabilities available for provisioning often appear to be unlimited and can be purchased in any quantity at any time.
- Measured service cloud systems automatically control and optimize resource use by leveraging a metering capability at some level of abstraction appropriate to the type of service (e.g., storage, processing, bandwidth, and active user accounts). Resource usage can be monitored, controlled, and reported, providing transparency for both the provider and consumer of the utilized service.
- level of abstraction appropriate to the type of service (e.g., storage, processing, bandwidth, and active user accounts).
- SaaS Software as a Service: the capability provided to the consumer is to use the provider's applications running on a cloud infrastructure.
- the applications are accessible from various client devices through a thin client interface such as a web browser (e.g., web-based e-mail).
- a web browser e.g., web-based e-mail
- the consumer does not manage or control the underlying cloud infrastructure including network, servers, operating systems, storage, or even individual application capabilities, with the possible exception of limited user-specific application configuration settings.
- PaaS Platform as a Service
- the consumer does not manage or control the underlying cloud infrastructure including networks, servers, operating systems, or storage, but has control over the deployed applications and possibly application hosting environment configurations.
- IaaS Infrastructure as a Service
- the consumer does not manage or control the underlying cloud infrastructure but has control over operating systems, storage, deployed applications, and possibly limited control of select networking components (e.g., host firewalls).
- Private cloud the cloud infrastructure is operated solely for an organization. It may be managed by the organization or a third party and may exist on-premises or off-premises.
- Public cloud the cloud infrastructure is made available to the general public or a large industry group and is owned by an organization selling cloud services.
- Hybrid cloud the cloud infrastructure is a composition of two or more clouds (private, community, or public) that remain unique entities but are bound together by standardized or proprietary technology that enables data and application portability (e.g., cloud bursting for load-balancing between clouds).
- a cloud computing environment is service oriented with a focus on statelessness, low coupling, modularity, and semantic interoperability.
- An infrastructure that includes a network of interconnected nodes.
- cloud computing environment 50 includes one or more cloud computing nodes 10 with which local computing devices used by cloud consumers, such as, for example, personal digital assistant (PDA) or cellular telephone 54 A, desktop computer 54 B, laptop computer 54 C, and/or automobile computer system 54 N may communicate.
- Nodes 10 may communicate with one another. They may be grouped (not shown) physically or virtually, in one or more networks, such as Private, Community, Public, or Hybrid clouds as described hereinabove, or a combination thereof.
- This allows cloud computing environment 50 to offer infrastructure, platforms and/or software as services for which a cloud consumer does not need to maintain resources on a local computing device.
- computing devices 54 A-N shown in FIG. 10 are intended to be illustrative only and that computing nodes 10 and cloud computing environment 50 can communicate with any type of computerized device over any type of network and/or network addressable connection (e.g., using a web browser).
- FIG. 11 a set of functional abstraction layers provided by cloud computing environment 50 ( FIG. 10 ) is shown. It should be understood in advance that the components, layers, and functions shown in FIG. 11 are intended to be illustrative only and embodiments of the invention are not limited thereto. As depicted, the following layers and corresponding functions are provided:
- Hardware and software layer 60 includes hardware and software components.
- hardware components include: mainframes 61 ; RISC (Reduced Instruction Set Computer) architecture based servers 62 ; servers 63 ; blade servers 64 ; storage devices 65 ; and networks and networking components 66 .
- software components include network application server software 67 and database software 68 .
- Virtualization layer 70 provides an abstraction layer from which the following examples of virtual entities may be provided: virtual servers 71 ; virtual storage 72 ; virtual networks 73 , including virtual private networks; virtual applications and operating systems 74 ; and virtual clients 75 .
- management layer 80 may provide the functions described below.
- Resource provisioning 81 provides dynamic procurement of computing resources and other resources that are utilized to perform tasks within the cloud computing environment.
- Metering and Pricing 82 provide cost tracking as resources are utilized within the cloud computing environment, and billing or invoicing for consumption of these resources. In one example, these resources may include application software licenses.
- Security provides identity verification for cloud consumers and tasks, as well as protection for data and other resources.
- User portal 83 provides access to the cloud computing environment for consumers and system administrators.
- Service level management 84 provides cloud computing resource allocation and management such that required service levels are met.
- Service Level Agreement (SLA) planning and fulfillment 85 provide pre-arrangement for, and procurement of, cloud computing resources for which a future requirement is anticipated in accordance with an SLA.
- SLA Service Level Agreement
- Workloads layer 90 provides examples of functionality for which the cloud computing environment may be utilized. Examples of workloads and functions which may be provided from this layer include: mapping and navigation 91 ; software development and lifecycle management 92 ; virtual classroom education delivery 93 ; data analytics processing 94 ; transaction processing 95 ; and cDNA library construction 96 .
Abstract
Techniques for positional delivery and position encoding by oligonucleotides of biological cells for single cell RNA sequencing are provided. In one aspect, a method of positional delivery and encoding of cells in a biological sample includes: encoding the cells in the biological sample for single cell sequencing by delivering molecular probes inside the cells that encode a position of the cells in the biological sample. A system for positional delivery and encoding of cells in a biological sample is also provided.
Description
- The present invention relates to single cell sequencing, and more particularly, to techniques for positional delivery and position encoding by oligonucleotides of biological cells for single cell ribonucleic acid (RNA) sequencing, i.e., positional sequencing (POS SEQ).
- The successful functioning of multi-cellular organisms relies on the coordinated functions of a multitude of molecular constituents from individual cells and the interactions among functionally distinct cells. Further, these molecular constituents are constantly changing such as in response to cell-to-cell interactions which oftentimes result from local physical cell-to-cell contact and/or from short length-scale paracrine cell-to-cell communications. Thus, the state of a biological system is often defined by the relative position of the cells in the system and the highly dimensional molecular composition of each of those cells.
- For example, with diseases such as cancer, specific tumor cell subpopulations can co-opt adjacent normal cells to support tumor progression. Thus, the relevance of cell positioning has motivated the development of therapeutic agents that target the co-opted cells, such as platelet-derived growth factor receptor (“PDGFR”) inhibitors to target PDGFR+pericytes, and small molecule inhibitors or neutralizing antibodies of colony-stimulating factor 1 (“CSF1”) receptors to target macrophages.
- Typically, spatial and molecular measurements are made using image-based analysis where molecular and positional information is obtained by taking microscopy images of samples treated with either enzymatically- or fluorescently-labeled antibodies that bind specifically to the molecular target of interest. When the images are digital, the sensor pixel position reflects the spatial relationship of the cells, while the sensor pixel signal intensity reflects the local density of the labeled antibodies molecular target of interest.
- Other techniques employed for concomitant spatial and molecular measurements involve first recording the positioning of the individual cells that are then measured. It is however impractical to implement such a technique with potentially millions of distinct cells that need to be stored and processed separately for molecular profiling.
- Thus, improved techniques for concomitant spatial and molecular measurements of biological cells would be desirable.
- The present invention provides techniques for positional delivery and position encoding by oligonucleotides of biological cells for single cell ribonucleic acid (RNA) sequencing (POS SEQ). In one aspect of the invention, a method of positional delivery and encoding of cells in a biological sample is provided. The method includes: encoding the cells in the biological sample for single cell sequencing by delivering molecular probes inside the cells that encode a position of the cells in the biological sample.
- In another aspect of the invention, another method of positional delivery and encoding of cells in a biological sample is provided. The method includes: constructing a cDNA library of molecular probes that encode a position of cells in a biological sample; linking the molecular probes to a vessel; delivering the vessel with the molecular probes to specific locations of the biological sample where the vessel delivers the molecular probes inside the cells at the specific locations; extracting the cells containing the molecular probes from the sample; and performing single cell sequencing of the extracted cells.
- In yet another aspect of the invention, a system for positional delivery and encoding of cells in a biological sample is provided. The system includes: a processor device, connected to a memory, that is implemented to: analyze data from single cell sequencing of cells along with molecular probes, that have been delivered inside the cells, which uniquely encode a position of the cells in a biological sample.
- A more complete understanding of the present invention, as well as further features and advantages of the present invention, will be obtained by reference to the following detailed description and drawings.
-
FIG. 1 is a diagram illustrating an exemplary methodology of positional delivery and encoding of cells in a biological sample for single cell sequencing according to an embodiment of the present invention; -
FIG. 2 is a diagram illustrating an exemplary barcoded deoxyribonucleic acid (DNA) oligonucleotide primer molecule according to an embodiment of the present invention; -
FIG. 3 is a diagram illustrating an exemplary methodology for constructing a cDNA library using Moloney murine leukemia virus reverse transcriptase (MMLV RT) technology according to an embodiment of the present invention; -
FIG. 4 is a diagram illustrating an exemplary methodology for a ‘template switch’ by the MMLV RT using a template switch oligonucleotide (TSO) sequence according to an embodiment of the present invention; -
FIG. 5 is a diagram illustrating use of a lentiviruses as the vessel for delivering a location-specific molecular probe inside the cells of a biological sample according to an embodiment of the present invention; -
FIG. 6 is a diagram illustrating use of a disulfide-linked cell-penetrating peptide (CPP) as the vessel for delivering a location-specific molecular probe inside the cells of a biological sample according to an embodiment of the present invention; -
FIG. 7 is a diagram illustrating use of bead micro-particles as the vessel for delivering a location-specific molecular probe inside the cells of a biological sample according to an embodiment of the present invention; -
FIG. 8 is a diagram illustrating an exemplary methodology for using a liquid cargo delivery device to deliver the vessels with unique molecular probes to specific locations of the biological sample, where the vessel delivers the location-specific molecular probes inside the cells at those locations for single cell sequencing according to an embodiment of the present invention; -
FIG. 9 is a diagram illustrating an exemplary apparatus for performing one or more of the methodologies presented herein according to an embodiment of the present invention; -
FIG. 10 depicts a cloud computing environment according to an embodiment of the present invention; and -
FIG. 11 depicts abstraction model layers according to an embodiment of the present invention. - Provided herein are techniques for concomitant positional and molecular measuring of cells using a molecular probe having a unique oligonucleotide sequence that encodes a positioning of the cells within a biological sample such as a cell culture (e.g., a cell culture including living eukaryotic and/or prokaryotic cells) and/or a tissue sample (e.g., a biopsy, formalin-fixed paraffin-embedded (“FFPE”) and/or frozen tissue containing living cells). Thus, when the cells are later dissociated from the biological sample and sequenced, the cells will take with them the positional information encoded in the molecular probe. Advantageously, the molecular probe encodes the position at which each of the cells being sequenced is located within the biological sample.
- As will be described in detail below, the molecular probe is first linked to a vessel such as a retrovirus, disulfide-linked cell-penetrating peptide (CPP) and/or bead micro-particle. A liquid cargo delivery device such as a microfluidic probe (MFP) is then used to deliver the molecular probe/vessel to specific locations of the biological sample. See, for example, Juncker et al., “Multipurpose microfluidic probe,” Nature Materials, Advanced Online Publication (July 2005) (8 total pages), the contents of which are incorporated by reference as if fully set forth herein. By way of the vessel, the molecular probes with unique nucleotide sequences are delivered inside the cells at those specific locations of the biological sample. As highlighted above, these oligonucleotide sequences are in effect a label of the position of a given cell in the biological sample. Thus, for each location (x,y) of a biological sample that the liquid cargo delivery device visits, a unique oligonucleotide sequence is delivered inside the cells at that location in the biological sample.
- An overview of the present techniques for positional delivery and encoding of cells in a biological sample for single cell sequencing is now provided by way of reference to
methodology 100 ofFIG. 1 . Instep 102, a complementary deoxyribonucleic acid (cDNA) library of molecular probes containing unique oligonucleotide sequences is constructed. As will be described in detail below, according to an exemplary embodiment, the library construction leverages the template-switching activity of Moloney murine leukemia virus reverse transcriptase (“MMLV RT”). For a general description of MMLV RT for library construction see, for example, Zhu et al., “Reverse Transcriptase Template Switching: A SMART™ Approach for Full-Length cDNA Library Construction,” BioTechniques 30:892-897 (April 2001) (hereinafter “Zhu”), the contents of which are incorporated by reference as if fully set forth herein. - In
step 104, the molecular probes with unique oligonucleotide sequences are then linked to a particular vessel such as a retrovirus, coupled to a disulfide-linked cell-penetrating peptide (CPP) or a bead micro-particle. This vessel will enable the molecular probes to be delivered inside the cells of a biological sample. By delivering the molecular probes into the cells, the cells can be uniquely identified—even when disassociated from the biological sample—due to the unique oligonucleotide sequences carried by the molecular probes. - In
step 106, the vessels with the molecular probes are delivered to specific locations of the biological sample (e.g., a living cell culture and/or tissue sample with living cells), where the vessels deliver the molecular probes inside the cells at those specific locations. According to an exemplary embodiment, this location-specific delivery is accomplished using a liquid cargo delivery device such a microfluidic probe or MFP. A microfluidic probe is a non-contact, scanning platform that can hydrodynamically localize as little as 100 picoliters of a liquid cargo with micrometer precision. For instance, the molecular probes can be dispersed in a processing solution (e.g., an aqueous solution) that is then delivered via the liquid cargo delivery device to specific locations of the biological sample. The molecular probes delivered to a given specific location of the biological sample contain a unique oligonucleotide sequence that is associated with that given specific location. Thus, as provided above, when the cells are later disassociated from the biological sample for sequencing, the oligonucleotide sequence encodes the original position of the cells in the sample (i.e., positional encoding). - Once the molecular probes are delivered to a given specific location of the biological sample, the vessels deliver the molecular probes inside the cells at those specific locations. According to an exemplary embodiment, the cells take in the vessels with the molecular probes through an active transfection/transduction process using living cell machinery. Thus, the present techniques are preferably performed with a living biological system. For instance, the biological sample preferably contains living cells, whether as a living cell culture or as a tissue sample containing living cells. The living cells permit transfection/transduction to occur. Following the present positional encoding process, the cells/tissue can be fixed if so desired.
- In
step 108, single cell sequencing is performed on the cells extracted from the biological sample. Even though the cells are disassociated from the biological sample for sequencing, the cells now contain the molecular probe with oligonucleotide sequence encoding the position of the cells in the biological sample. Thus, this positional data can be retained through the sequencing process. - For instance, in
step 110 the data from the single cell sequencing is stored and analyzed (e.g., in silico) along with the data from the molecular probes which uniquely encodes the positions of the cells in the biological sample. An exemplary apparatus for storing and analyzing this data is provided inFIG. 9 , described below. Being able to analyze transcriptomic data (i.e., RNA transcripts produced by a genome) from the cells along with the position of those cells in the biological sample is extremely beneficial. For instance, as highlighted above, the state of a biological system is often defined by the relative position of the cells in the system and the highly dimensional molecular composition of each of those cells. Take, for example, the development of therapeutic agents for cancer treatment that leverage cell positioning to target specific tumor cell subpopulations. See above. - As described in conjunction with the description of
step 102 ofmethodology 100 above, the process begins with the construction of a cDNA library of molecular probes containing unique oligonucleotide sequences for positional encoding. As shown inFIG. 2 , the cDNA library construction begins with many barcoded DNAoligonucleotide primer molecules 204 conjugated to a microparticle 202 (e.g., a bead) such as a glass bead. The techniques for preparing distinctly barcoded oligonucleotide primers are described generally in Macosko et al., “Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets,” Cell 161, 1202-1214 (May 2015), the contents of which are incorporated by reference as if fully set forth herein. - As shown in
FIG. 2 , beginning from its 5′ end, each DNAoligonucleotide primer molecule 204 contains a universal polymerase chain reaction (PCR) handle 204 a, acell barcode 204 b, a unique molecular identifier (UMI) 204 c, aposition code 204 d, and apoly T sequence 204 e (i.e., Tn—a sequence of n thymine repeats). PCR handle 204 a enables PCR amplification. For example, according to an exemplary embodiment, PCR handle 204 a is a DNA oligonucleotide sequence for PCR primers in the amplification step (see, e.g.,FIG. 4 —described below). -
Cell barcode 204 b is a DNA oligonucleotide sequence that is unique to bead 202/cell into which the molecular probe is delivered.UMI 204 c is a DNA oligonucleotide sequence that is unique to this particular DNAoligonucleotide primer molecule 204. For instance, the DNA oligonucleotide primer molecules attached to thesame bead 202 can share the same cell barcode, but different UMIs. In other words, the UMIs of each DNA oligonucleotide primer molecules has a different, unique oligonucleotide sequence. By way of example only, the UMIs can be used for normalizing gene counts during computational data processing. For example, the UMIs can be used to identify PCR duplicates during the single cell sequencing (see below). - The
position code 204 d provides the (location-specific) oligonucleotide sequence for positional encoding. Namely, as described above, theposition code 204 d uses a unique oligonucleotide sequence to encode the location (x,y) of cells in a biological sample into which the present molecular probes will be delivered. A length of theposition code 204 d can depend on the total number of locations (x,y) to be encoded. For example, according to an exemplary embodiment, the length of theposition code 204 d is determined as follows, -
L≥log 4(N), (1) - wherein L represents the length of the
position code 204 d (i.e., the number of nucleotides that make up theposition code 204 d), and wherein N represents the total number of locations (x,y) to be encoded. As will be described below, the library and/or library construction (such as the generation of the location-specific oligonucleotide sequence for positional encoding) can optionally be provided as a service in a cloud environment. - According to an exemplary embodiment, the cDNA library is constructed using MMLV RT. See, for example,
FIG. 3 where an endogenous messenger RNA (mRNA)template 302 hybridizes withDNA oligonucleotide 204, andMMLV RT 304 synthesizes a DNA complement (cDNA) to themRNA template 302, and then appends a poly cytosine (C) sequence to the newly synthesized cDNA sequence. For instance, as shown instep 310, in its simplestform mRNA template 302 of the ith gene includes a generic 5′CAP 302 a, a gene-specificcoding region g i 302 b, and apoly A tail 302 c (i.e., a sequence of m adenine (A) repeats). - As shown in
step 312, themRNA template 302 hybridizes with the 3′poly T sequence 204 e ofDNA oligonucleotide 204, and theMMLV RT 304 synthesizes a DNA complement (see for example gene-specificcoding region f i 204 f) to themRNA template 302. This new cDNA sequence is now givenreference numeral 204′.MMLV RT 304 then appendscDNA sequence 204′ withpoly C sequence 204 g. - According to an exemplary embodiment, a template switch is performed where a template switch oligonucleotide (TSO)
sequence 402 is hybridized with thecDNA sequence 204′, after which theMMLV RT 304 performs the ‘template switch’ in whichMMLV RT 304 uses theTSO sequence 402 as a template for replication. SeeFIG. 4 . As shown inFIG. 4 ,TSO sequence 402 includes two groups, anoligonucleotide code 402 a that one wants to append to themRNA template 302, and a poly ribosomal Guanine (rG)repeat sequence 402 b. According to an exemplary embodiment, theoligonucleotide code 402 a is a PCR handle. As provided above, a PCR handle enables PCR amplification. - As shown in
step 410, thepoly rG sequence 402 b ofTSO 402 hybridizes with thepoly C sequence 204 g (that was appended to thecDNA sequence 204′ byMMLV RT 304— see above). Doing so enables theMMLV RT 304 to then useTSO 402 as a template for replication. For instance, as shown instep 410MMLV RT 304 appends aPCR handle 204 h to thepoly C sequence 204 g at the 3′ end ofcDNA sequence 204′. - As shown in
step 412, thecDNA sequence 204′ is then separated from themRNA template 302/TSO 402. By way of example only, thecDNA sequence 204′ can be separated from themRNA template 302/TSO 402 by ribonuclease H activity of the MMLV RT technology and/or through the use of RNA degradation by sodium hydroxide (NaOH) and heat. The result is a molecular probe with a unique oligonucleotide sequence (i.e.,position code 204 d) that encodes positional data. For instance, as highlighted above, each molecular probe is location-specific, meaning that it contains an oligonucleotidesequence position code 204 d that is unique to a specific location of a biological sample. By way of the present techniques, the molecular probes are then delivered inside the cells at specific locations of the biological sample corresponding to the oligonucleotidesequence position code 204 d each of molecular probe carries. As shown instep 414, thecDNA sequence 204′/molecular probes can be amplified by PCR. - As described in conjunction with the description of
step 104 ofmethodology 100 above, the molecular probes with unique oligonucleotide sequences are then linked to a vessel such as a retrovirus, coupled to disulfide-linked cell-penetrating peptide (CPP) or bead micro-particle which will permit transfer of molecular probes into the cells at specific locations of the biological sample. For live cells, retroviruses such as lentiviruses like the MMLV can be employed as the vessel. SeeFIG. 5 . Lentiviruses such as MMLV are advantageous as gene delivery vehicles because they are able to stably integrate into the genome of cells. Further, among retroviruses, lentiviruses have the distinguishing property of being able to insert genetic material into both dividing and non-dividing cells. The process for modifying retroviruses such as lentiviruses for use as vectors for gene delivery into cells is well known to those of ordinary skill in the art. - A disulfide-linked cell-penetrating peptide (CPP) or activatable cell-penetrating peptide (ACCP) is also a suitable vessel for transferring the molecular probes into the cells of the biological sample when the sample is live cells or tissue containing live cells. See
FIG. 6 . CPP are biocarriers that are able to penetrate biological membranes and thus translocate into cells, thereby permitting the cells to internalize different cargo molecules. According to an exemplary embodiment, the CPPs are short polycations attached via protease-cleavable linkers to neutralizing polyanions. Thus, as shown inFIG. 6 , the disulfide-linked CPP-to-oligonucleotides complexes are non-permanent in the reducing environment within the cells. As such, once the CPP biocarriers deliver the molecular probes into the individual cells of the biological sample, the disulfide bond between the CPP biocarrier molecule and the molecular probe can be cleaved. For a general description of CPP molecules as biocarriers see, for example, Gagat et al., “Cell-penetrating peptides and their utility in genome function modifications (Review),” International Journal of Molecular Medicine 40: 1615-1623 (October 2017), the contents of which are incorporated by refence as if fully set forth herein. - For tissue with living cells,
bead micro-particles 702 are also a suitable vessel for transferring the molecular probes into the cells of the biological sample. SeeFIG. 7 .Beads micro-particles 702 are able to permeate the biological membranes of cells by a process called bead transfection. For instance, micro-particle beads such as glass beads can first be incubated in a solution containing the molecular probes. The micro-particle beads now conjugated with molecular probes can then be introduced into the cells using a process such as electroporation. According to an exemplary embodiment, thebead micro-particles 702 are the same asbead 202 described above (seeFIG. 2 ). Other vessel delivery mechanisms (such as a retrovirus, coupled to a disulfide-linked CPP, etc.) are needed because some mechanisms are better than others, depending on if they are being used for tissues or cells. - As described in conjunction with the description of
step 106 ofmethodology 100 above, a liquid cargo delivery device such as a microfluidic probe is employed to deliver the vessels with unique molecular probes to specific locations of the biological sample, where the vessel delivers the location-specific molecular probes inside the cells at those locations. See, for example,FIG. 8 . Instep 810, a liquid cargo delivery device 802 (such as a microfluidic probe) scans the surface of abiological sample 804 and deposits the vessels with unique molecular probes at specific locations (x,y) in thebiological sample 804. A microfluidic probe is a non-contact, scanning platform that can hydrodynamically localize as little as 100 picoliters of a liquid cargo with micrometer precision. - By way of example only, the liquid
cargo delivery device 802 dispenses a controlled amount of a processing solution (e.g., an aqueous solution) containing the vessels/molecular probes at multiple locations (i.e., (x1,y1), (x1,y2), (x1,y3), etc.) in thebiological sample 804. Seestep 812. As provided above, once the vessel/molecular probe is delivered to a specific location of thebiological sample 804, the vessel delivers the molecular probes inside the cell(s) 806 at that specific location. - After the location-specific molecular probes have been delivered/inserted into the
cells 806, thecells 806 are extracted from thebiological sample 804. Seestep 814. However, even after being disassociated from thebiological sample 804, theindividual cells 806 retain the molecular probe with oligonucleotide sequence encoding the original position of thecells 806 in thebiological sample 804. Thus, this positional data can be retained through the subsequent sequencing process. Seestep 816. - For example, one or more single cell sequencing techniques can be performed. Suitable single cell sequencing techniques include, but are not limited to, drop-seq, seq-well, cyto-seq, and combinations thereof. The single cell sequencing performed in
step 816 can be used to identify the subject cell by the cell barcode (see above), the original position of thecells 806 within thebiological sample 804 via the unique, location-specific oligonucleotide sequence of the molecular probes, and/or transcriptome information of thecells 806. Therefore, the combination of the present positional delivery and encoding process with extraction and single cell sequencing can collect concomitant spatial and molecular measurements (e.g., position coordinates and transcriptomes of one or more of thecells 806 in the biological sample 804) which, as described in conjunction with the description ofstep 110 ofmethodology 100 above, can be recorded and/or analyzed in silico. - The present invention may be a system, a method, and/or a computer program product. The computer program product may include a computer readable storage medium (or media) having computer readable program instructions thereon for causing a processor to carry out aspects of the present invention.
- The computer readable storage medium can be a tangible device that can retain and store instructions for use by an instruction execution device. The computer readable storage medium may be, for example, but is not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing. A non-exhaustive list of more specific examples of the computer readable storage medium includes the following: a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), a static random access memory (SRAM), a portable compact disc read-only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon, and any suitable combination of the foregoing. A computer readable storage medium, as used herein, is not to be construed as being transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission media (e.g., light pulses passing through a fiber-optic cable), or electrical signals transmitted through a wire.
- Computer readable program instructions described herein can be downloaded to respective computing/processing devices from a computer readable storage medium or to an external computer or external storage device via a network, for example, the Internet, a local area network, a wide area network and/or a wireless network. The network may comprise copper transmission cables, optical transmission fibers, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers. A network adapter card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium within the respective computing/processing device.
- Computer readable program instructions for carrying out operations of the present invention may be assembler instructions, instruction-set-architecture (ISA) instructions, machine instructions, machine dependent instructions, microcode, firmware instructions, state-setting data, or either source code or object code written in any combination of one or more programming languages, including an object oriented programming language such as Smalltalk, C++ or the like, and conventional procedural programming languages, such as the “C” programming language or similar programming languages. The computer readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider). In some embodiments, electronic circuitry including, for example, programmable logic circuitry, field-programmable gate arrays (FPGA), or programmable logic arrays (PLA) may execute the computer readable program instructions by utilizing state information of the computer readable program instructions to personalize the electronic circuitry, in order to perform aspects of the present invention.
- Aspects of the present invention are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer readable program instructions.
- These computer readable program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks. These computer readable program instructions may also be stored in a computer readable storage medium that can direct a computer, a programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer readable storage medium having instructions stored therein comprises an article of manufacture including instructions which implement aspects of the function/act specified in the flowchart and/or block diagram block or blocks.
- The computer readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other device to cause a series of operational steps to be performed on the computer, other programmable apparatus or other device to produce a computer implemented process, such that the instructions which execute on the computer, other programmable apparatus, or other device implement the functions/acts specified in the flowchart and/or block diagram block or blocks.
- The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods, and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of instructions, which comprises one or more executable instructions for implementing the specified logical function(s). In some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or acts or carry out combinations of special purpose hardware and computer instructions.
- Turning now to
FIG. 9 , a block diagram is shown of anapparatus 900 for implementing one or more of the methodologies presented herein. By way of example only,apparatus 900 can be configured to implement one or more of the steps ofmethodology 100 ofFIG. 1 . For instance, according to an exemplary embodiment,apparatus 900 may be configured to store and/or analyze the transcriptomic data extracted from the single cell sequencing along with the unique positional data obtained from the molecular probes indicating the original positioning of the cells in the biological sample. -
Apparatus 900 includes acomputer system 910 andremovable media 950.Computer system 910 includes aprocessor device 920, anetwork interface 925, amemory 930, amedia interface 935 and anoptional display 940.Network interface 925 allowscomputer system 910 to connect to a network, whilemedia interface 935 allowscomputer system 910 to interact with media, such as a hard drive orremovable media 950. -
Processor device 920 can be configured to implement the methods, steps, and functions disclosed herein. Thememory 930 could be distributed or local and theprocessor device 920 could be distributed or singular. Thememory 930 could be implemented as an electrical, magnetic or optical memory, or any combination of these or other types of storage devices. Moreover, the term “memory” should be construed broadly enough to encompass any information able to be read from, or written to, an address in the addressable space accessed byprocessor device 920. With this definition, information on a network, accessible throughnetwork interface 925, is still withinmemory 930 because theprocessor device 920 can retrieve the information from the network. It should be noted that each distributed processor that makes upprocessor device 920 generally contains its own addressable memory space. It should also be noted that some or all ofcomputer system 910 can be incorporated into an application-specific or general-use integrated circuit. -
Optional display 940 is any type of display suitable for interacting with a human user ofapparatus 900. Generally,display 940 is a computer monitor or other similar display. - Referring to
FIG. 10 andFIG. 11 , it is to be understood that although this disclosure includes a detailed description on cloud computing, implementation of the teachings recited herein are not limited to a cloud computing environment. Rather, embodiments of the present invention are capable of being implemented in conjunction with any other type of computing environment now known or later developed. - Cloud computing is a model of service delivery for enabling convenient, on-demand network access to a shared pool of configurable computing resources (e.g., networks, network bandwidth, servers, processing, memory, storage, applications, virtual machines, and services) that can be rapidly provisioned and released with minimal management effort or interaction with a provider of the service. This cloud model may include at least five characteristics, at least three service models, and at least four deployment models.
- Characteristics are as follows:
- On-demand self-service: a cloud consumer can unilaterally provision computing capabilities, such as server time and network storage, as needed automatically without requiring human interaction with the service's provider.
- Broad network access: capabilities are available over a network and accessed through standard mechanisms that promote use by heterogeneous thin or thick client platforms (e.g., mobile phones, laptops, and PDAs).
- Resource pooling: the provider's computing resources are pooled to serve multiple consumers using a multi-tenant model, with different physical and virtual resources dynamically assigned and reassigned according to demand. There is a sense of location independence in that the consumer generally has no control or knowledge over the exact location of the provided resources but may be able to specify location at a higher level of abstraction (e.g., country, state, or datacenter).
- Rapid elasticity: capabilities can be rapidly and elastically provisioned, in some cases automatically, to quickly scale out and rapidly released to quickly scale in. To the consumer, the capabilities available for provisioning often appear to be unlimited and can be purchased in any quantity at any time.
- Measured service: cloud systems automatically control and optimize resource use by leveraging a metering capability at some level of abstraction appropriate to the type of service (e.g., storage, processing, bandwidth, and active user accounts). Resource usage can be monitored, controlled, and reported, providing transparency for both the provider and consumer of the utilized service.
- Service Models are as follows:
- Software as a Service (SaaS): the capability provided to the consumer is to use the provider's applications running on a cloud infrastructure. The applications are accessible from various client devices through a thin client interface such as a web browser (e.g., web-based e-mail). The consumer does not manage or control the underlying cloud infrastructure including network, servers, operating systems, storage, or even individual application capabilities, with the possible exception of limited user-specific application configuration settings.
- Platform as a Service (PaaS): the capability provided to the consumer is to deploy onto the cloud infrastructure consumer-created or acquired applications created using programming languages and tools supported by the provider. The consumer does not manage or control the underlying cloud infrastructure including networks, servers, operating systems, or storage, but has control over the deployed applications and possibly application hosting environment configurations.
- Infrastructure as a Service (IaaS): the capability provided to the consumer is to provision processing, storage, networks, and other fundamental computing resources where the consumer is able to deploy and run arbitrary software, which can include operating systems and applications. The consumer does not manage or control the underlying cloud infrastructure but has control over operating systems, storage, deployed applications, and possibly limited control of select networking components (e.g., host firewalls).
- Deployment Models are as follows:
- Private cloud: the cloud infrastructure is operated solely for an organization. It may be managed by the organization or a third party and may exist on-premises or off-premises.
- Community cloud: the cloud infrastructure is shared by several organizations and supports a specific community that has shared concerns (e.g., mission, security requirements, policy, and compliance considerations). It may be managed by the organizations or a third party and may exist on-premises or off-premises.
- Public cloud: the cloud infrastructure is made available to the general public or a large industry group and is owned by an organization selling cloud services.
- Hybrid cloud: the cloud infrastructure is a composition of two or more clouds (private, community, or public) that remain unique entities but are bound together by standardized or proprietary technology that enables data and application portability (e.g., cloud bursting for load-balancing between clouds).
- A cloud computing environment is service oriented with a focus on statelessness, low coupling, modularity, and semantic interoperability. At the heart of cloud computing is an infrastructure that includes a network of interconnected nodes.
- Referring now to
FIG. 10 , illustrativecloud computing environment 50 is depicted. As shown,cloud computing environment 50 includes one or morecloud computing nodes 10 with which local computing devices used by cloud consumers, such as, for example, personal digital assistant (PDA) orcellular telephone 54A,desktop computer 54B, laptop computer 54C, and/or automobile computer system 54N may communicate.Nodes 10 may communicate with one another. They may be grouped (not shown) physically or virtually, in one or more networks, such as Private, Community, Public, or Hybrid clouds as described hereinabove, or a combination thereof. This allowscloud computing environment 50 to offer infrastructure, platforms and/or software as services for which a cloud consumer does not need to maintain resources on a local computing device. It is understood that the types ofcomputing devices 54A-N shown inFIG. 10 are intended to be illustrative only and thatcomputing nodes 10 andcloud computing environment 50 can communicate with any type of computerized device over any type of network and/or network addressable connection (e.g., using a web browser). - Referring now to
FIG. 11 , a set of functional abstraction layers provided by cloud computing environment 50 (FIG. 10 ) is shown. It should be understood in advance that the components, layers, and functions shown inFIG. 11 are intended to be illustrative only and embodiments of the invention are not limited thereto. As depicted, the following layers and corresponding functions are provided: - Hardware and
software layer 60 includes hardware and software components. Examples of hardware components include:mainframes 61; RISC (Reduced Instruction Set Computer) architecture based servers 62; servers 63; blade servers 64; storage devices 65; and networks and networking components 66. In some embodiments, software components include network application server software 67 and database software 68. -
Virtualization layer 70 provides an abstraction layer from which the following examples of virtual entities may be provided: virtual servers 71;virtual storage 72;virtual networks 73, including virtual private networks; virtual applications and operating systems 74; andvirtual clients 75. - In one example, management layer 80 may provide the functions described below. Resource provisioning 81 provides dynamic procurement of computing resources and other resources that are utilized to perform tasks within the cloud computing environment. Metering and
Pricing 82 provide cost tracking as resources are utilized within the cloud computing environment, and billing or invoicing for consumption of these resources. In one example, these resources may include application software licenses. Security provides identity verification for cloud consumers and tasks, as well as protection for data and other resources. User portal 83 provides access to the cloud computing environment for consumers and system administrators. Service level management 84 provides cloud computing resource allocation and management such that required service levels are met. Service Level Agreement (SLA) planning and fulfillment 85 provide pre-arrangement for, and procurement of, cloud computing resources for which a future requirement is anticipated in accordance with an SLA. -
Workloads layer 90 provides examples of functionality for which the cloud computing environment may be utilized. Examples of workloads and functions which may be provided from this layer include: mapping and navigation 91; software development and lifecycle management 92; virtual classroom education delivery 93; data analytics processing 94; transaction processing 95; andcDNA library construction 96. - Although illustrative embodiments of the present invention have been described herein, it is to be understood that the invention is not limited to those precise embodiments, and that various other changes and modifications may be made by one skilled in the art without departing from the scope of the invention.
Claims (20)
1. A system, comprising:
a processor device, connected to a memory, that is implemented to:
analyze data obtained by single cell sequencing of cells along with molecular probes, that have been delivered inside the cells, which uniquely encode a position of the cells in a biological sample.
2. The system of claim 1 , wherein the cells have been disassociated from the biological sample for the single cell sequencing.
3. The system of claim 2 , wherein the molecular probes encode positional data comprising the position of the cells in the biological sample, and wherein the molecular probes retain the positional data of the cells that have been disassociated from the biological sample.
4. The system of claim 1 , wherein the molecular probes comprise an oligonucleotide sequence that uniquely encodes the position of the cells in the biological sample.
5. The system of claim 4 , wherein the molecular probes further comprise:
a polymerase chain reaction (PCR) handle;
a cell barcode; and
a unique molecular identifier (UMI).
6. The system of claim 5 , wherein the processor device is further implemented to:
identify individual ones of the cells by the cell barcode.
7. The system of claim 5 , wherein the processor device is further implemented to:
identify an original position of the cells within the biological sample via the oligonucleotide sequence.
8. The system of claim 5 , wherein the processor device is further implemented to:
collect transcriptome information of the cells.
9. The system of claim 1 , wherein the biological sample is selected from the group consisting of: a cell culture, a tissue sample, and combinations thereof.
10. The system of claim 1 , wherein the single cell sequencing of the cells is performed using a technique selected from the group consisting of: drop-seq, seq-well, cyto-seq, and combinations thereof.
11. A system, comprising:
a processor device, connected to a memory, that is implemented to:
analyze data obtained by single cell sequencing of cells along with molecular probes, that have been delivered inside the cells, which uniquely encode a position of the cells in a biological sample; and
collect from the data concomitant spatial and molecular measurements of one or more of the cells in the biological sample.
12. The system of claim 11 , wherein the cells have been disassociated from the biological sample for the single cell sequencing.
13. The system of claim 12 , wherein the molecular probes encode positional data comprising the position of the cells in the biological sample, and wherein the molecular probes retain the positional data of the cells that have been disassociated from the biological sample.
14. The system of claim 11 , wherein the molecular probes comprise an oligonucleotide sequence that uniquely encodes the position of the cells in the biological sample.
15. The system of claim 14 , wherein the molecular probes further comprise:
a polymerase chain reaction (PCR) handle;
a cell barcode; and
a unique molecular identifier (UMI).
16. The system of claim 15 , wherein the processor device is further implemented to:
identify individual ones of the cells by the cell barcode.
17. The system of claim 15 , wherein the processor device is further implemented to:
identify an original position of the cells within the biological sample via the oligonucleotide sequence.
18. The system of claim 11 , wherein the molecular measurements comprise transcriptome information of the cells.
19. The system of claim 11 , wherein the biological sample is selected from the group consisting of: a cell culture, a tissue sample, and combinations thereof.
20. The system of claim 11 , wherein the single cell sequencing of the cells is performed using a technique selected from the group consisting of: drop-seq, seq-well, cyto-seq, and combinations thereof.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US18/207,585 US20230313192A1 (en) | 2020-04-11 | 2023-06-08 | Positional Delivery and Encoding by Oligonucleotides of Biological Cells for Single Cell Sequencing (POS SEQ) |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US16/846,297 US11739327B2 (en) | 2020-04-11 | 2020-04-11 | Positional delivery and encoding by oligonucleotides of biological cells for single cell sequencing (POS SEQ) |
US18/207,585 US20230313192A1 (en) | 2020-04-11 | 2023-06-08 | Positional Delivery and Encoding by Oligonucleotides of Biological Cells for Single Cell Sequencing (POS SEQ) |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US16/846,297 Division US11739327B2 (en) | 2020-04-11 | 2020-04-11 | Positional delivery and encoding by oligonucleotides of biological cells for single cell sequencing (POS SEQ) |
Publications (1)
Publication Number | Publication Date |
---|---|
US20230313192A1 true US20230313192A1 (en) | 2023-10-05 |
Family
ID=78006020
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US16/846,297 Active 2041-10-30 US11739327B2 (en) | 2020-04-11 | 2020-04-11 | Positional delivery and encoding by oligonucleotides of biological cells for single cell sequencing (POS SEQ) |
US18/207,585 Pending US20230313192A1 (en) | 2020-04-11 | 2023-06-08 | Positional Delivery and Encoding by Oligonucleotides of Biological Cells for Single Cell Sequencing (POS SEQ) |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US16/846,297 Active 2041-10-30 US11739327B2 (en) | 2020-04-11 | 2020-04-11 | Positional delivery and encoding by oligonucleotides of biological cells for single cell sequencing (POS SEQ) |
Country Status (1)
Country | Link |
---|---|
US (2) | US11739327B2 (en) |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2016149522A1 (en) | 2015-03-18 | 2016-09-22 | Bio-Rad Laboratories, Inc. | Sample analysis systems and methods |
WO2017075293A1 (en) | 2015-10-28 | 2017-05-04 | Silicon Valley Scientific, Inc. | Method and apparatus for encoding cellular spatial position information |
US10633648B2 (en) | 2016-02-12 | 2020-04-28 | University Of Washington | Combinatorial photo-controlled spatial sequencing and labeling |
EP3526348A4 (en) | 2016-10-17 | 2020-06-24 | Lociomics Corporation | High resolution spatial genomic analysis of tissues and cell aggregates |
CN117056774A (en) | 2016-11-08 | 2023-11-14 | 贝克顿迪金森公司 | Methods for cell marker classification |
WO2019051335A1 (en) | 2017-09-07 | 2019-03-14 | Juno Therapeutics, Inc. | Methods of identifying cellular attributes related to outcomes associated with cell therapy |
WO2019126209A1 (en) | 2017-12-19 | 2019-06-27 | Cellular Research, Inc. | Particles associated with oligonucleotides |
US20220229044A1 (en) * | 2018-05-14 | 2022-07-21 | The Broad Institute, Inc. | In situ cell screening methods and systems |
US11208648B2 (en) | 2018-11-16 | 2021-12-28 | International Business Machines Corporation | Determining position and transcriptomes of biological cells |
-
2020
- 2020-04-11 US US16/846,297 patent/US11739327B2/en active Active
-
2023
- 2023-06-08 US US18/207,585 patent/US20230313192A1/en active Pending
Also Published As
Publication number | Publication date |
---|---|
US11739327B2 (en) | 2023-08-29 |
US20210317449A1 (en) | 2021-10-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
David et al. | Nanocall: an open source basecaller for Oxford Nanopore sequencing data | |
Chapple et al. | Extreme multifunctional proteins identified from a human protein interaction network | |
Claverie | Computational methods for the identification of genes in vertebrate genomic sequences | |
Sheneman et al. | Clearcut: a fast implementation of relaxed neighbor joining | |
Glenn | Field guide to next‐generation DNA sequencers | |
Marioni et al. | BioHMM: a heterogeneous hidden Markov model for segmenting array CGH data | |
Solovyev et al. | PromH: promoters identification using orthologous genomic sequences | |
Kinsella et al. | Sensitive gene fusion detection using ambiguously mapping RNA-Seq read pairs | |
Leith et al. | Sequence-dependent sliding kinetics of p53 | |
US20140278461A1 (en) | System and method for integrating a medical sequencing apparatus and laboratory system into a medical facility | |
Boley et al. | Genome-guided transcript assembly by integrative analysis of RNA sequence data | |
CN105637098A (en) | Methods and systems for aligning sequences | |
McPherson et al. | Comrad: detection of expressed rearrangements by integrated analysis of RNA-Seq and low coverage genome sequence data | |
KR20150110767A (en) | Methods and systems for using a cloud computing environment to share biological related data | |
CN106462337A (en) | Integrated consumer genomic services | |
CN110178184B (en) | Oncogenic splice variant determination | |
Xia et al. | Accounting for pairwise distance restraints in FFT-based protein–protein docking | |
US11928603B2 (en) | Machine learning (ML) modeling by DNA computing | |
de Koning et al. | NanoGalaxy: Nanopore long-read sequencing data analysis in Galaxy | |
Baker | Genomes in three dimensions | |
Olivarius et al. | High-throughput verification of transcriptional starting sites by Deep-RACE | |
Graña et al. | Bicycle: a bioinformatics pipeline to analyze bisulfite sequencing data | |
Okoniewski et al. | High correspondence between Affymetrix exon and standard expression arrays | |
Celesti et al. | Are next-generation sequencing tools ready for the cloud? | |
Kumar et al. | Less is more in the hunt for driver mutations |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION, NEW YORK Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:MEYER ROJAS, PABLO;STOLOVITZKY, GUSTAVO ALEJANDRO;REEL/FRAME:063900/0295 Effective date: 20200410 |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |