WO2023205707A2 - Reiterative short read sequencing inside a cellular sample - Google Patents

Reiterative short read sequencing inside a cellular sample Download PDF

Info

Publication number
WO2023205707A2
WO2023205707A2 PCT/US2023/065972 US2023065972W WO2023205707A2 WO 2023205707 A2 WO2023205707 A2 WO 2023205707A2 US 2023065972 W US2023065972 W US 2023065972W WO 2023205707 A2 WO2023205707 A2 WO 2023205707A2
Authority
WO
WIPO (PCT)
Prior art keywords
nucleic acid
nucleotide
sequence
concatemer
group
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/US2023/065972
Other languages
English (en)
French (fr)
Other versions
WO2023205707A3 (en
Inventor
Tsungli LIU
Sinan ARSLAN
Michael Previte
Tuval Ben-Yehezkel
Connor THOMPSON
Matthew KELLINGER
Hui Zhen MAH
Molly He
Andrew Price
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Element Biosciences Inc
Original Assignee
Element Biosciences Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Element Biosciences Inc filed Critical Element Biosciences Inc
Priority to KR1020247038517A priority Critical patent/KR20250011725A/ko
Priority to EP23792763.7A priority patent/EP4511515A4/en
Priority to CA3249276A priority patent/CA3249276A1/en
Priority to JP2024562006A priority patent/JP2025514791A/ja
Priority to AU2023258027A priority patent/AU2023258027A1/en
Priority to CN202380048112.1A priority patent/CN119487210A/zh
Publication of WO2023205707A2 publication Critical patent/WO2023205707A2/en
Publication of WO2023205707A3 publication Critical patent/WO2023205707A3/en
Priority to US18/919,127 priority patent/US20250146068A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing
    • C12Q1/6874Methods for sequencing involving nucleic acid arrays, e.g. sequencing by hybridisation
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1096Processes for the isolation, preparation or purification of DNA or RNA cDNA Synthesis; Subtracted cDNA library construction, e.g. RT, RT-PCR
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6804Nucleic acid analysis using immunogens
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6813Hybridisation assays
    • C12Q1/6841In situ hybridisation
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6844Nucleic acid amplification reactions
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/58Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving labelled substances
    • G01N33/582Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving labelled substances with fluorescent label
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2531/00Reactions of nucleic acids characterised by
    • C12Q2531/10Reactions of nucleic acids characterised by the purpose being amplify/increase the copy number of target nucleic acid
    • C12Q2531/125Rolling circle
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2537/00Reactions characterised by the reaction format or use of a specific feature
    • C12Q2537/10Reactions characterised by the reaction format or use of a specific feature the purpose or use of
    • C12Q2537/143Multiplexing, i.e. use of multiple primers or probes in a single reaction, usually for simultaneously analyse of multiple analysis
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2563/00Nucleic acid detection characterized by the use of physical, structural and functional properties
    • C12Q2563/107Nucleic acid detection characterized by the use of physical, structural and functional properties fluorescence

Definitions

  • the present disclosure provides compositions, apparatus and methods for conducting reiterative short read sequencing inside a cellular sample.
  • the reiterative short read sequencing can be used to discover the RNA content of the cellular sample.
  • a method for detecting in situ at least two target nucleic acid sequences in a biological sample comprising: (a) providing the biological sample comprising: (i) a first nucleic acid molecule, comprising the first target nucleic acid sequence or a portion thereof, or the reverse complement thereof or a portion thereof; and (ii) a second nucleic acid molecule, comprising the second target nucleic acid sequence or a portion thereof, or the reverse complement thereof or a portion thereof; (b) determining in situ the sequence of: (i) the first nucleic acid molecule or a portion thereof in the biological sample to generate a first sequencing product nucleic acid molecule that is complementary and binds to the first nucleic acid molecule or a portion thereof; and (ii) the second nucleic acid molecule or a portion thereof in the
  • the biological sample is fixed and permeabilized.
  • the method further comprises generating in situ : (a) a first complementary DNA (cDNA) molecule through reverse transcription of a first messenger RNA (mRNA) molecule inside the biological sample, wherein the first cDNA molecule or the first mRNA molecule comprises the first target nucleic acid sequence, or the reverse complement of the first target nucleic acid sequence, and/or (b) a second cDNA molecule through reverse transcription of a second mRNA molecule inside the biological sample, wherein the second cDNA molecule or the second mRNA molecule comprises the second target nucleic acid sequence, or the reverse complement of the second target nucleic acid sequence.
  • cDNA complementary DNA
  • mRNA messenger RNA
  • the method further comprises contacting in situ: (a) the first target nucleic acid sequence or a reverse complement thereof with a first oligonucleotide comprising a first end portion and a second end portion, wherein the first end portion and second end portion of the first oligonucleotide are complementary and bind to two neighboring segments of the first target nucleic acid sequence or the reverse complement thereof so that the first oligonucleotide forms a circular structure with a gap between the first end portion and the second end portion; and/or (b) the second target nucleic acid sequence or a reverse complement thereof with a second oligonucleotide comprising a first end portion and a second end portion, wherein the first end portion and second end portion of the second oligonucleotide are complementary and bind to two neighboring segments of the second target nucleic acid sequence or a reverse complement thereof so that the second oligonucleotide forms a circular structure with a gap between the first end portion and the second end portion; and
  • the first target nucleic acid sequence comprises the first cDNA molecule or the first mRNA molecule; and/or (b) the second target nucleic acid sequence comprises the second cDNA molecule or the second mRNA molecule.
  • the gap of the first oligonucleotide or the second oligonucleotide has a size of one nucleotide. In some embodiments, wherein the gap of the first oligonucleotide or the second oligonucleotide has a size of at least two nucleotides.
  • the first oligonucleotide further comprises a first identification sequence that identifies the first target nucleic acid sequence and the second oligonucleotide further comprises a second identification sequence that identifies the second target nucleic acid sequence.
  • the first oligonucleotide and the second oligonucleotide further comprise a nucleic acid sequence that is complementary sequence to a nucleic acid sequence of a sequencing primer.
  • the first oligonucleotide and the second oligonucleotides further comprise a nucleic acid sequence that is complementary sequence to a nucleic acid sequence of a primer for nucleic acid amplification.
  • the primer for nucleic acid amplification is a primer for rolling circle amplification (RCA) that produces a concatemer, wherein the concatemer comprises at least two repeats of a target nucleic acid sequence of the at least two target nucleic acid sequences or a portion thereof, or a reverse complement thereof.
  • RCA rolling circle amplification
  • first oligonucleotide and the second oligonucleotide further comprise a reverse complement for a compaction oligonucleotide, wherein: (a) a first segment of the compaction oligonucleotide is complementary and binds to a first portion of the concatemer; and (b) a second segment of the compaction oligonucleotide is complementary andbindsto a second portion of the concatemer, to result in a reduction in the size or a change in the shape of the concatemer.
  • the method further comprises: (a) joining in situ the firstand second end portions of the first oligonucleotide to produce a first circular oligonucleotide inside the biological sample; and (b) joining in situ the firstand second end portions of the second oligonucleotide to produce a second circular oligonucleotide inside the biological sample.
  • the joining the firstand second end portions of the first oligonucleotide comprises joining the firstand second end portions of the first oligonucleotides through a first nucleic acid enzyme
  • the joiningthe first and second end portions of the second oligonucleotide comprises joining the first and second end portions of the second oligonucleotide through a second nucleic acid enzyme, wherein the first nucleic acid enzyme and the second nucleic acid enzyme are the same type of enzyme.
  • the method further comprises amplifying in situ: (a) the first circular oligonucleotide to produce the first nucleic acid molecule; and/or (b) the second circular oligonucleotide to produce the second nucleic acid molecule.
  • the amplifying comprises rolling circle amplification (RCA), wherein the first nucleic acid molecule comprises a first concatemer and the second nucleic acid molecule comprises a second concatemer.
  • the first concatemer comprises at least two repeats of a first unit nucleic acid sequence comprising the first target nucleic acid sequence or a portion thereof, or the reverse complement of the first target nucleic acid sequence or a portion thereof; and/or (b) the second concatemer comprises at least two repeats of a second unit nucleic acid sequence comprising the second target nucleic acid sequence or a portion thereof, or the reverse complement of the second target nucleic acid sequence or a portion thereof.
  • the first concatemer further comprises the first identification sequence that identifies the first target nucleic acid sequence, or a sequencing primer or a reverse complement thereof
  • the second concatemer further comprises the second identification sequence that identifies the second target nucleic acid sequence, or a sequencing primer or a reverse complement thereof.
  • the first concatemer further comprises a first compaction oligonucleotide, wherein: (a) a first segment of the first compaction oligonucleotide is complementary and binds to a first portion of the first concatemer; and (b) a second segment of the first compaction oligonucleotide is complementary and binds to a second portion of the first concatemer, to result in a reduction in the size or a change in the shape of the first concatemer.
  • the second concatemer further comprises a second compaction oligonucleotide, wherein: (a) a first segment of the second compaction oligonucleotide is complementary and binds to a first portion of the second concatemer; and (b) a second segment of the second compaction oligonucleotide is complementary and binds to a second portion of the second concatemer, to result in a reduction in the size or a change in the shape of the second concatemer.
  • the determining comprises: (a) determining the sequence of the first nucleic acid molecule or the portion thereof, wherein the first nucleic acid molecule or the portion thereof consists of 2 -30 nucleotides; and/or (b) determining the sequence of the second nucleic acid molecule or the portion thereof, wherein the second nucleic acid molecule or the portion thereof consists of 2-30 nucleotides.
  • the 2-30 nucleotides of the first nucleic acid molecule or the portion thereof comprises the first identification sequence
  • the 2-30 nucleotides of the second nucleic acid molecule or the portion thereof comprises the second identification sequence.
  • the 2-30 nucleotides of the first nucleic acid molecule or the portion thereof further comprises at least a portion of the sequence of the first cDNA molecule or the first mRNA molecule and the 2-30 nucleotides of the second nucleic acid molecule or the portion thereof further comprises at least a portion of the sequence of the second cDNA molecule or the second mRNA molecule.
  • the determining comprises: (a) contacting the first concatemer with a polymerizing enzyme, a plurality of nucleotides, and a primer sequence that is complementary to a portion of the first concatemer under conditions sufficient to form a binding complex comprising the polymerizing enzyme, a nucleotide of the plurality of nucleotides that is complementary and binds to a nucleotide unit of the first concatemer, and the first concatemer hybridized to the primer sequence, wherein the nucleotide comprises a fluorescent label and a removable blocking group at the 3 ’ carbon position of the sugar moiety; (b) incorporating the nucleotide into the 3 ’ end of the primer sequence; and (c) identifying the nucleobase of the incorporated nucleotide by imaging the fluorescent label of the incorporated nucleotide.
  • the determining comprises: (a) contacting the second concatemer with a polymerizing enzyme, a plurality of nucleotides, and a primer sequence that is complementary to a portion of the second concatemer under conditions sufficient to form a binding complex comprising the polymerizing enzyme, a nucleotide of the plurality of nucleotides that is complementary and binds to a nucleotide unit of the second concatemer, and the second concatemer hybridized to the primer sequence, wherein the nucleotide comprises a fluorescent label and a removable blocking group at the 3 ’ carbon position of the sugar moiety; (b) incorporating the nucleotide into the 3 ’ end of the primer sequence; and (c) identifying the nucleobase of the incorporated nucleotide by imaging the fluorescent label of the incorporated nucleotide.
  • the determining further comprises contacting the nucleotide with an agent to remove the blocking group from the nucleotide and generatea 3’ OH group on the sugar moiety.
  • the blocking group comprises an alkyl group, an alkenyl group, an alkynyl group, an allyl group, an aryl group, a benzyl group, an azide group, an azido group, an O-azidomethyl group, an amine group, an amide group, a keto group, an isocyanate group, a phosphate group, a thiol group, a disulfide group, a carbonate group, a urea group, or a silyl group.
  • the plurality of nucleotides consist of at least two of the same type of nucleotide, wherein the same type of nucleotide is selected from the group comprising dATP, dGTP, dCTP, dTTP, and dUTP, wherein the fluorescent label of one type of nucleotide of the group emits light at a wavelength that is different from the wavelength of light emitted from the fluorescent label of another type of nucleotide of the group.
  • the plurality of nucleotides comprise at least two types of nucleotides, wherein the at least two types of nucleotides are selected from the group comprising dATP, dGTP, dCTP, dTTP, and dUTP, wherein the fluorescent label of one type of nucleotide of the group emits light at a wavelength that is different from the wavelength of light emitted from the fluorescent label of another type of nucleotide of the group.
  • the determining comprises: (a) contacting two of the first concatemer with two of a polymerizing enzyme, a plurality of nucleotide conjugates, and two of a primer sequence that is complementary to a portion of the first concatemer under conditions sufficient to form a multivalent binding complex comprising each of the two of the polymerizing enzyme, a nucleotide conjugate of the plurality of nucleotide conjugates, and each of the two of the first concatemer hybridized to each of the two of the primer sequence, wherein the nucleotide conjugate comprises a label and at least two of a nucleotide moiety, wherein two of the at least two of the nucleotide moiety are each complementary and bind to a nucleotide of each of the two of the first concatemer; (b) detecting the multivalent binding complex through the label of the nucleotide conjugate; (c) identifying the nucleobases of the nucleotides of the two of the first concatemer
  • the determining comprises: (a) contacting two of the second concatemer with two of a polymerizing enzyme, a plurality of nucleotide conjugates, andtwo of a primer sequence that is complementary to a portion of the second concatemer under conditions sufficient to form a multivalent binding complex comprising each of the two of the polymerizing enzyme, a nucleotide conjugate of the plurality of nucleotide conjugates, and each of the two of the second concatemer hybridized to each of the two of the primer sequence, wherein the nucleotide conjugate comprises a label and at least two of a nucleotide moiety, wherein two of the at least two of the nucleotide moiety are each complementary and bind to a nucleotide of each of the two of the second concatemer; (b) detecting the multivalent binding complex through the label of the nucleotide conjugate; (c) identifying the nucleobases of the nucleotides of the two of the second concatemer
  • the conditions inhibit incorporation of the at least two of the nucleotide moiety of the nucleotide conjugate into the two of the first or second concatemer.
  • the nucleotide conjugate comprises a core coupled to a plurality of nucleotide arms, wherein each of the nucleotide moiety is attached to one nucleotide arm of the plurality of nucleotide arms.
  • the detectable label comprises a fluorescent label. In some embodiments, wherein the detecting comprises imaging the fluorescent label.
  • the determining further comprises: (a) removingthe two of the polymerizing enzyme and the nucleotide conjugate from the two of the first concatemer; (b) contacting each of the two of the first concatemer hybridized to each of the two of the primer sequence with two of a second polymerizing enzyme and a plurality of unlabeled nucleotides under conditions suitable for forming a binding complex comprising each of the two of the second polymerizing enzyme, each of the two of the first concatemer hybridized to each of the two of the primer sequence, and two of the plurality of the unlabeled nucleotides; and (c) incorporating each of the two of the plurality of the unlabeled nucleotides into each of the two of the primer sequence, wherein each of the two of the plurality of the unlabeled nucleotides is complementary and binds to a nucleotide of each of the two of the first concatemer, wherein an unlabeled nucleotide of
  • the determining further comprises: (a) removing the two of the polymerizing enzyme and the nucleotide conjugate from the two of the second concatemer; (b) contacting each of the two of the second concatemer hybridized to each of the two of the primer sequence with two of a second polymerizing enzyme and a plurality of unlabeled nucleotides under conditions sufficient for forming a binding complex comprising each of the two of the second polymerizing enzyme, each of the two of the second concatemer hybridized to each of the two of the primer sequence, and two of the plurality of the unlabeled nucleotides and incorporating each of the two of the plurality of the unlabeled nucleotides into each of the two of the primer sequence, wherein each of the two of the plurality of the unlabeled nucleotides is complementary and binds to a nucleotide of each of the two of the second concatemer, wherein an unlabeled nucleotide of the plurality of
  • the determining comprises: (a) contacting two of the first concatemer with two of a polymerizing enzyme, a plurality of nucleotide conjugates, a first primer sequence that is complementary to a first portion of the first concatemer, an d a second primer sequence that is complementary to a second portion of the first concatemer under conditions sufficient to form a multivalent binding complex comprising each of the two of the polymerizing enzyme, a nucleotide conjugate of the plurality of nucleotide conjugates, the first portion of the first concatemer hybridized to the first primer sequence and a second portion of the first concatemer hybridized to the second primer sequence, wherein the nucleotide conjugate comprises a label and at least two of a nucleotide moiety, wherein two of the at least two of the nucleotide moiety are each complementary and bind to a nucleotide of a first portion and a second portion of the first concatemer; (b)
  • the determining comprises: (a) contacting two of the second concatemer with two of a polymerizing enzyme, a plurality of nucleotide conjugates, a first primer sequence that is complementary to a first portion of the second concatemer, and a second primer sequence that is complementary to a second portion of the second concatemer under conditions sufficient to form a multivalent binding complex comprising each of the two of the polymerizing enzyme, a nucleotide conjugate of the plurality of nucleotide conjugates, the first portion of the second concatemer hybridized to the first primer sequence and a second portion of the second concatemer hybridized to the second primer sequence, wherein the nucleotide conjugate comprises a label and at least two of a nucleotide moiety, wherein two of the at least two of the nucleotide moiety are each complementary and bind to a nucleotide of a first portion and a second portion of the second concatemer; (b) detecting the multivalent binding complex through the
  • the conditions inhibit incorporation of the at least two of the nucleotide moiety of the nucleotide conjugate into the two of the first or second concatemer.
  • the nucleotide conjugate comprises a core coupled to a plurality of nucleotide arms, wherein each of the nucleotide moiety is attached to one nucleotide arm of the plurality of nucleotide arms.
  • the detectable label comprises a fluorescent label. In some embodiments, wherein the detecting comprises imagingthe fluorescent label.
  • the plurality of nucleotides consist of at least two of the same type of nucleotide, wherein the same type of nucleotide is selected from the group comprising dATP, dGTP, dCTP, dTTP, and dUTP, wherein the fluorescent label of one type of nucleotide of the group emits light at a wavelength that is different from the wavelength of light emitted from the fluorescent label of another type of nucleotide of the group.
  • the plurality of nucleotides comprise at least two types of nucleotides, wherein the at least two types of nucleotides are selected from the group comprising dATP, dGTP, dCTP, dTTP, and dUTP, wherein the fluorescent lab el of one type of nucleotide of the group emits light at a wavelength that is different from the wavelength of light emitted from the fluorescent label of another type of nucleotide of the group.
  • the determining further comprises contacting the nucleotide with an agent to remove the blocking group from the nucleotide and generate a 3’ OH group on the sugar moiety.
  • the blocking group comprises an alkyl group, an alkenyl group, an alkynyl group, an allyl group, an aryl group, a benzyl group, an azide group, an azido group, an O-azidomethyl group, an amine group, an amide group, a keto group, an isocyanate group, a phosphate group, a thiol group, a disulfide group, a carbonate group, a urea group, or a silyl group.
  • the first target nucleic acid sequence and the second target nucleic acid sequence correspond to two separate portions of the same mRNA or cDNA molecule.
  • the first target nucleic acid sequence and the second target nucleic acid sequence correspond to two different mRNA or cDNA molecules.
  • the repeating comprises repeating at least 2 times, 5 times, 10 times, 15 times, 20 times, 25 times, 30 times, 35 times, 40 times, 45 times, or at least 50 times.
  • the determining comprises detecting in situ the first or second sequencing product nucleic acid molecule inside the biological sample through imaging.
  • the determining comprises detecting in situ simultaneously the first and second sequencing product nucleic acid molecules inside the biological sample through imaging.
  • the imaging comprises fluorescent imaging.
  • the biological sample comprises a cellular organelle, a cell, a whole cell, a group of whole cells, a tissue, an intact tissue, a tumor, an intact tumor, an organ, an organism, a protozoa, an algae, a bacteria, a virus, a plant, a fungus, an insect, or an animal.
  • the biological sample comprises a fresh sample, a processed sample, a freshly -frozen sample, a sectioned sample, or a formalin-fixed and paraffin-embedded (FFPE) sample.
  • the biological sample comprises a fresh cellular sample, a freshly -frozen cellular sample, a sectioned cellular sample, or an FFPE cellular sample.
  • the at least two target nucleic acid sequences comprise a target DNA sequence. In some embodiments, the at least two target nucleic acid sequences comprise a target RNA sequence. In some embodiments, the at least two target nucleic acid sequences comprises a first target RNA sequence and a second target RNA sequence. In some embodiments, the first target RNA sequence comprises coding RNA, non-coding RNA, messenger RNA (mRNA), transfer RNA (tRNA), ribosomal RNA (rRNA), microRNA (miRNA), guide RNA (gRNA), small nuclear RNA (snRNA), small interference RNA (siRNA), anti-sense RNA, mature microRNA, or immature microRNA.
  • mRNA messenger RNA
  • tRNA transfer RNA
  • rRNA ribosomal RNA
  • miRNA microRNA
  • gRNA small nuclear RNA
  • siRNA small interference RNA
  • anti-sense RNA mature microRNA, or immature microRNA.
  • the second target RNA sequence comprises coding RNA, non-coding RNA, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti-sense RNA, mature microRNA, or immature microRNA.
  • the at least two target nucleic acid sequences comprise a first target DNA sequence and a second target DNA sequence.
  • the first target DNA sequence comprises complementary DNA (cDNA), genomic DNA (gDNA), non -coding DNA, or coding DNA.
  • the second target DNA sequence comprises cDNA, gDNA, non-coding DNA, or coding DNA.
  • the at least two target nucleic acid sequences comprise a target RNA sequence and a target DNA sequence.
  • the target RNA sequence comprises coding RNA, non-coding RNA, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti-sense RNA, mature microRNA, or immature microRNA.
  • the target DNA sequence comprises cDNA, gDNA, non-coding DNA, or coding DNA.
  • the biological sample comprises a human sample, a simian sample, an ape sample, a canine sample, a feline sample, a bovine sample, an equine sample, a murine sample, a porcine sample, a caprine sample, a lupine sample, a ranine sample, a piscine sample, a plant sample, an insect sample, a bacteria sample, an algae sample, a viral sample, a protozoa sample, or a fungus sample.
  • the biological sample comprises a cellular organelle, a cell, a whole cell, a group of whole cells, a tissue, an intact tissue, a tumor, an intact tumor, an organ, or an organism.
  • the biological sample is immobilized on a surface.
  • the surface comprises an interior surface of a flow cell.
  • a method for detecting in situ at least two target nucleic acid molecules and at least two target polypeptides in a biological sample wherein the at least two target nucleic acid sequences comprise a first target nucleic acid sequence and a second target nucleic acid sequence, wherein the at least two target polypeptides comprise a first target polypeptide encoded by the first target nucleic acid sequence or a reverse complement thereof and a second target polypeptide encoded by the second target nucleic acid sequence ora reverse complement thereof
  • the method comprising: (a) providing the biological sample comprising: (i) a first nucleic acid molecule, comprising the first target nucleic acid sequence or a portion thereof, or the reverse complement of the first target nucleic acid sequence or a portion thereof; (ii) a second nucleic acid molecule, comprising the second target nucleic acid sequence or a portion thereof, or the reverse complement of the second target nucleic acid sequence or a portion thereof; (i
  • a method for detecting in situ at least two target nucleic acid sequences and at least two target polypeptides in a biological sample wherein the at least two target nucleic acid sequences comprise a first target nucleic acid sequence and a second target nucleic acid sequence, wherein the at least two target polypeptides comprise a first target polypeptide encoded by the first target nucleic acid sequence or a reverse complement thereof and a second target polypeptide encoded by the second target nucleic acid sequence ora reverse complement thereof
  • the method comprising: (a) providing the biological sample comprising: (i) a first nucleic acid molecule, comprising the first target nucleic acid sequence or a portion thereof, or the reverse complement of the first target nucleic acid sequence or a portion thereof; (ii) a second nucleic acid molecule, comprising the second target nucleic acid sequence or a portion thereof, or the reverse complement of the second target nucleic acid sequence or a portion thereof; (i) a first nucleic acid molecule, compris
  • a method for detecting in situ at least two target RNA sequences and at least two target polypeptides in a biological sample comprising:(a) providing the biological sample that is immobilized on a surface, permeabilized and fixed, wherein the biological sample comprises: (i) a first target RNA sequence of at least two target RNA sequencesand a first target polypeptide of the at least two polypeptides, wherein the first target polypeptide is encoded by the first target RNA sequence or a reverse complement thereof; and (ii) a second target RNA sequence of at least two target RNA sequences and a second target polypeptide of the at least two polypeptides, wherein the second target polypeptide is encoded by the second target RNA sequence or a reverse complement thereof ; (b) producing a first target cDNA sequence through reverse transcription of the first target RNA sequence and a second target cDNA sequence through reverse transcription of the second target RNA sequence; (c) contacting:
  • the determining comprises imagingthe first, or third sequencing product nucleic acid molecule, wherein the identifying comprises imaging the second or fourth sequencing product nucleic acid molecule and the fourth sequencing product nucleic acid molecule.
  • the method further comprises imaging simultaneously the first sequencing product nucleic acid molecule and the second sequencing product nucleic acid molecule to analyze the spatial distribution of the first and second sequencing product nucleic acid molecules inside the biological sample.
  • the biological sample comprises a cellular organelle, a cell, a whole cell, a group of whole cells, a tissue, an intact tissue, a tumor, an intact tumor, an organ, an organism, a protozoa, an algae, a bacteria, a virus, a plant, a fungus, an insect, or an animal.
  • the biological sample comprises a fresh sample, a processed sample, a freshly-frozen sample, a sectioned sample, or a formalin- fixed and paraffin-embedded (FFPE) sample.
  • FFPE formalin- fixed and paraffin-embedded
  • the first target nucleic acid sequence comprises DNA, cDNA, RNA, codin RNA, non -coding RN A, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti-sense RNA, mature microRNA, or immature microRNA, and/orwherein the first target RNA sequence comprises codingRNA, non -coding RNA, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti-sense RNA, mature microRNA, or immature microRNA.
  • the second target nucleic acid sequence comprises DNA, cDNA, RNA, codingRNA, non-coding RNA, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti-sense RNA, mature microRNA, or immature microRNA, and/or wherein the second target RNA sequence comprises codin RNA, non -coding RNA, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti-sense RNA, mature microRNA, or immature microRNA.
  • the biological sample comprises a human sample, a simian sample, an ape sample, a canine sample, a feline sample, a bovine sample, an equine sample, a murine sample, a porcine sample, a caprine sample, a lupine sample, a ranine sample, a piscine sample, a plant sample, an insect sample, a bacteria sample, an algae sample, a viral sample, a protozoa sample, or a fungus sample.
  • the first, second, third or fourth concatemer further comprises a compaction oligonucleotide, wherein: (a) a first segment of the compaction oligonucleotide is complementary andbindsto a first portion of the first, second, third, or fourth concatemer; and (b) a second segment of the compaction oligonucleotide is complementary and binds to a second portion of the first, second, third, or fourth concatemer, to result in a reduction in the size or a change in the shape of the second concatemer.
  • the first sequencing product nucleic acid molecule comprises a first identification sequence that identifies the first target RNA sequence and the second sequencing product nucleic acid molecule comprises a second identification sequence that identifies the second target RNA sequence.
  • the first sequencing product nucleic acid molecule comprises a first identification sequence that identifies the first target RNA sequence and a portion of the first target RNA sequence
  • the second sequencing product nucleic acid molecule comprises a second identification sequence that identifies the second target RNA sequence and a portion of the second target RNA sequence.
  • the determining comprises: (a) contacting the first second, third, or fourth concatemer with a polymerizing enzyme, a plurality of nucleotides, and a primer sequence that is complementary to a portion of the first second, third, or fourth concatemer under conditions sufficient to form a binding complex comprising the polymerizing enzyme, a nucleotide of the plurality of nucleotides that is complementary and binds to a nucleotide unit of the first second, third, or fourth concatemer, and the first second, third, or fourth concatemer hybridized to the primer sequence, wherein the nucleotide comprises a fluorescent label and a removable blocking group at the 3 ’ carbon position of the sugar moiety; (b) incorporating the nucleotide into the 3 ’ end of the primer sequence; and (c) identifying the nucleobase of the incorporated nucleotide by imaging the fluorescent label of the incorporated nucleotide.
  • the determining further comprises contacting the nucleotide with an agent to remove a blocking group from the nucleotide and generate a 3’ OH group on the sugar moiety.
  • the blocking group comprises an alkyl group, an alkenyl group, an alkynyl group, an allyl group, an aryl group, a benzyl group, an azide group, an azido group, an O -azidomethyl group, an amine group, an amide group, a keto group, an isocyanate group, a phosphate group, a thiol group, a disulfide group, a carbonate group, a urea group, or a silyl group.
  • the plurality of nucleotides comprise one type of nucleotide selected from a group comprising dATP, dGTP, dCTP, dTTP and dUTP. In some embodiments, the plurality of nucleotides comprise a mixture of any combination of two or more types of nucleotides selected from a group consisting of dATP, dGTP, dCTP, dTTP and/or dUTP.
  • the determining comprises: (a) contacting two of the first, second, third, or fourth concatemer with two of a polymerizing enzyme, a plurality of nucleotide conjugates, and two of a primer sequence that is complementary to a portion of the first, second, third, or fourth concatemer under conditions sufficientto form a multivalent binding complex comprising each of the two of the polymerizing enzyme, a nucleotide conjugate of the plurality of nucleotide conjugates, and each of the two of the first, second, third, or fourth concatemer hybridized to each of the two of the primer sequence, wherein the nucleotide conjugate comprises a label and at least two of a nucleotide moiety, wherein two of the at least two of the nucleotide moiety are each complementary and bind to a nucleotide of each of the two of the first concatemer; (b) detecting the multivalent binding complex through the label of the nucleotide conjugate; (c) identifying
  • the determining further comprises: (a) removingthe two of the polymerizing enzyme and the nucleotide conjugate from the two of the first, second, third, or fourth concatemer; (b) contacting each of the two of the first, second, third, or fourth concatemer hybridized to each of the two of the primer sequence with two of a second polymerizing enzyme and a plurality of unlabeled nucleotides under conditions suitable for forming a binding complex comprising each of the two of the second polymerizing enzyme, each of the two of the first, second, third, or fourth concatemer hybridized to each of the two of the primer sequence, and two of the plurality of the unlabeled nucleotides and incorporating each of the two of the plurality of the unlabeled nucleotides into each of the two of the primer sequence, wherein each of the two of the plurality of the unlabeled nucleotides is complementary and binds to a nucleotide of each of the two of the primer sequence
  • the determining comprises: (a) contacting two of the first, second, third, or fourth concatemer with two of a polymerizing enzyme, a plurality of nucleotide conjugates, a first primer sequence that is complementary to a first portion of the first, second, third, or fourth concatemer, and a second primer sequence that is complementary to a second portion of the first, second, third, or fourth concatemer under conditions sufficient to form a multivalent binding complex comprising each of the two of the polymerizing enzyme, a nucleotide conjugate of the plurality of nucleotide conjugates, the first portion of the first, second, third, or fourth concatemer hybridized to the first primer sequence and a second portion of the first, second, third, or fourth concatemer hybridized to the second primer sequence, wherein the nucleotide conjugate comprises a label and at least two of a nucleotide moiety, wherein two of the at least two of the nucleotide moiety are each complementary and
  • the conditions inhibit incorporation of the at least two of the nucleotide moiety of the nucleotide conjugate into the two of the first, second, third, or fourth concatemer.
  • the nucleotide conjugate comprises a core coupled to a plurality of nucleotide arms, wherein each of the nucleotide moiety is attached to one nucleotide arm of the plurality of nucleotide arms.
  • the detectable label comprises a fluorescent label. In some embodiments, the detecting comprises imagingthe fluorescent label.
  • the plurality of nucleotides consist of at least two of the same type of nucleotide, wherein the same type of nucleotide is selected from the group comprising dATP, dGTP, dCTP, dTTP, and dUTP, wherein the fluorescent label of one type of nucleotide of the group emits light at a wavelength that is different from the wavelength of light emitted from the fluorescent label of another type of nucleotide of the group.
  • the plurality of nucleotides comprise at least two types of nucleotides, wherein the at least two types of nucleotides are selected from the group comprising dATP, dGTP, dCTP, dTTP, and dUTP, wherein the fluorescent label of one type of nucleotide of the group emits light at a wavelength that is different from the wavelength of light emitted from the fluorescent label of another type of nucleotide of the group.
  • the determining further comprises contacting the nucleotide with an agent to remove the blocking group from the nucleotide and generate a 3 ’ OH group on the sugar moiety.
  • the blocking group comprises an alkyl group, an alkenyl group, an alkynyl group, an allyl group, an aryl group, a benzyl group, an azide group, an azido group, an O-azidomethyl group, an amine group, an amide group, a keto group, an isocyanate group, a phosphate group, a thiol group, a disulfide group, a carbonate group, a urea group, or a silyl group.
  • the plurality of nucleotides consist of at least two of the same type of nucleotide, wherein the same type of nucleotide is selected from the group comprising dATP, dGTP, dCTP, dTTP, and dUTP, wherein the fluorescent label of one type of nucleotide of the group emits light at a wavelength that is different from the wavelength of light emitted from the fluorescent label of another type of nucleotide of the group.
  • the plurality of nucleotides comprise at least two types of nucleotides, wherein the at least two types of nucleotides are selected from the group comprising dATP, dGTP, dCTP, dTTP, and dUTP, wherein the fluorescent label of one type of nucleotide of the group emits light at a wavelength that is different from the wavelength of light emitted from the fluorescent label of another type of nucleotide of the group.
  • the first target RNA sequence comprises coding RNA, non-coding RNA, messenger RNA (mRNA), transfer RNA (tRNA), ribosomal RNA (rRNA), microRNA (miRNA), guide RNA (gRNA), small nuclear RNA (snRNA), small interference RNA (siRNA), anti-sense RNA, mature microRNA, or immature microRNA.
  • the second target RNA sequence comprises coding RNA, non-coding RNA, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti-sense RNA, mature microRNA, or immature microRNA.
  • the biological sample comprises a human sample, a simian sample, an ape sample, a canine sample, a feline sample, a bovine sample, an equine sample, a murine sample, a porcine sample, a caprine sample, a lupine sample, a ranine sample, a piscine sample, a plant sample, an insect sample, a bacteria sample, an algae sample, a viral sample, a protozoa sample, or a fungus sample.
  • the biological sample comprises a cellular organelle, a cell, a whole cell, a group of whole cells, a tissue, an intact tissue, a tumor, an intact tumor, an organ, or an organism.
  • the biological sample is immobilized on a surface.
  • the surface comprises an interior surface of a flow cell.
  • a system for detecting in situ at least two target nucleic acid sequences in a biological sample wherein the at least two target nucleic acid sequences comprise a first target nucleic acid sequence and a second target nucleic acid sequence
  • the system comprising: (a) a biological sample comprising a first nucleic acid molecule and a second nucleic acid molecule, wherein the first nucleic acid molecule comprises the first target nucleic acid sequence or a reverse complement thereof, or a portion thereof, and wherein the second nucleic acid molecule comprises the second target nucleic acid sequence or a reverse complement thereof, or a portion thereof; (b) a first oligonucleotide comprising a first end portion and a second end portion, wherein the first end portion and second end portion of the first oligonucleotide are complementary and bind to two neighboring segments of the first target nucleic acid sequence or the reverse complement thereof, or the portion thereof, so that the first oligonucleotide
  • the first circular oligonucleotide further comprises a first nucleic acid enzyme configured to join the first and second end portions of the first oligonucleotide
  • the second circular oligonucleotide further comprises a second nucleic acid enzyme configured to join the first and second end portions of the second oligonucleotide.
  • the first and second nucleic acid enzymes comprises a nucleic acid ligase, a nucleic acid ligation enzyme, a nucleic acid polymerase, a nucleic acid polymerization enzyme, or combinations thereof.
  • the system further comprises a first amplicon of the first circular oligonucleotide, and a second amplicon of the second circular oligonucleotide.
  • the first amplicon comprises a first concatemer and/or the second amplicon comprises a second concatemer.
  • the first concatemer comprises at least two repeats of a first unit nucleic acid sequence comprising the first target nucleic acid sequence or a portion thereof, or the reverse complement thereof or a portion thereof; and/or (b) the second concatemer comprises at least two repeats of a second unit nucleic acid sequence comprising the second target nucleic acid sequence or a portion thereof, or the reverse complement thereof or a portion thereof.
  • the first concatemer further comprises the first identification sequence that identifies the first target nucleic acid sequence, or a sequencing primer or a reverse complement thereof
  • the second concatemer further comprises the second identification sequence that identifies the second target nucleic acid sequence, or a sequencing primer or a reverse complement thereof.
  • the first concatemer further comprises a first compaction oligonucleotide, wherein: (a) a first segment of the first compaction oligonucleotide is complementary andbindsto a first portion of the first concatemer; and (b) a second segment of the first compaction oligonucleotide is complementary and binds to a second portion of the first concatemer, to result in a reduction in the size or a change in the shape of the first concatemer.
  • the second concatemer further comprises a second compaction oligonucleotide, wherein: (a) a first segment of the second compaction oligonucleotide is complementary and binds to a first portion of the second concatemer; and (b) a second segment of the second compaction oligonucleotide is complementary and binds to a second portion of the second concatemer, to result in a reduction in the size or a change in the shape of the second concatemer.
  • the system further comprises a polymerizing enzyme, a plurality of nucleotides, and a primer sequence that is complementary to at least a portion of the first concatemer or the second concatemer under conditions sufficient to form a binding complex.
  • the system further comprises an agent configured to remove a blocking group from a nucleotide of the plurality of nucleotides and generate a 3 ’ OH group on a sugar moiety of the nucleotide.
  • the blocking group comprises an alkyl group, an alkenyl group, an alkynyl group, an allyl group, an aryl group, a benzyl group, an azide group, an azido group, an O -azidomethyl group, an amine group, an amide group, a keto group, an isocyanate group, a phosphate group, a thiol group, a disulfide group, a carbonate group, a urea group, or a silyl group.
  • the plurality of nucleotides comprises a fluorescent label.
  • the plurality of nucleotides consists of at least two of the same type of nucleotide, wherein the same type of nucleotide is selected from the group comprising dATP, dGTP, dCTP, dTTP, and dUTP, wherein the fluorescent label of one type of nucleotide of the group emits light at a wavelength that is different from the wavelength of light emitted from the fluorescent label of another type of nucleotide of the group.
  • the plurality of nucleotides comprises at least two types of nucleotides, wherein the at least two types of nucleotides are selected from the group comprising dATP, dGTP, dCTP, dTTP, and dUTP, wherein the fluorescent label of one type of nucleotide of the group emits light at a wavelength that is different from the wavelength of light emitted from the fluorescent label of another type of nucleotide of the group.
  • the system further comprises a plurality of nucleotide conjugates, wherein a nucleotide conjugate of the plurality of nucleotide conjugates is configured to form a multivalent binding complex comprising two or more of the polymerizing enzyme, the nucleotide conjugate, and the at least two target nucleic acid sequences, wherein the nucleotide conjugate comprises a label and at least two nucleotide moieties that are each complementary and bind to a nucleotide of each of the at least two target nucleic acid sequences.
  • the system further comprises a first oligonucleotide conjugate comprising a first short nucleic acid and a first binding moiety that binds specifically to a first target polypeptide, wherein the first short nucleic acid comprises a first tag sequence and a second tag sequence, wherein the first oligonucleotide conjugate binds specifically to the first target polypeptide in the biological sample through the firstbinding moiety to form a firstbinding complex, wherein the firstand second tag sequences identify the first binding moiety in a nucleic acid sequence reaction.
  • the system further comprises a second oligonucleotide conjugate comprising a second short nucleic acid and a second binding moiety that binds specifically to a second target polypeptide, wherein the second short nucleic acid comprises a third tag sequence and a fourth tag sequence, wherein the second oligonucleotide conjugate binds specifically to the second target polypeptide in the biological sample through the second binding moiety to form a second binding complex, wherein the third and fourth tag sequences identify the second binding moiety in a nucleic acid sequence reaction.
  • the first or second binding moiety is an antibody or an antigen -binding fragment thereof.
  • the system further comprises a third oligonucleotide comprising a first end portion and a second end portion, wherein the first end portion and second end portion of the third oligonucleotide are complementary and bind to the first tag sequence and the second tag sequence of the first oligonucleotide conjugate so that the third oligo nucleotide forms a circular structure with a gap between the first end portion and the second end portion.
  • the system further comprises a fourth oligonucleotide comprising a first end portion and a second end portion, wherein the first end portion and second end portion of the fourth oligonucleotide are complementary and bind to the third tag sequence and the fourth tag sequence so that the fourth oligonucleotide forms a circular structure with a gap between the first end portion and the second end portion.
  • the system further comprises a third circular oligonucleotide that results from joining the first and second end portions of the third oligonucleotide and a fourth circular oligonucleotide that results from joining the firstand second end portions of the fourth oligonucleotide, wherein the first circular oligonucleotide and the third circular oligonucleotide comprise a first sequencing primer or the reverse complement thereof, wherein the second circular oligonucleotide and the fourth circular oligonucleotide comprise a second sequencing primer or the reverse complement thereof, wherein the sequences of the first and second sequencing primers have at least one nucleotide of difference, wherein the joining is carried out by the first or second nucleic acid enzyme.
  • the system further comprises a third amplicon of the third circular oligonucleotide, and a fourth amplicon of the fourth circular oligonucleotide.
  • the third amplicon comprises a third concatemer and/or the fourth amplicon comprises a fourth concatemer.
  • the third concatemer comprises at least two repeats of a third unit nucleic acid sequence comprising the third circular oligonucleotide or a portion thereof, or the reverse complement thereof or a portion thereof; and/or (b) the fourth concatemer comprises at least two repeats of a second unit nucleic acid sequence comprisingthe fourth circular oligonucleotide or a portion thereof, or the reveres complement thereof or a portion thereof.
  • the third concatemer further comprises the first or second tag sequence or a reverse complement thereof that identifies the first binding moiety, wherein the fourth concatemer further comprises the third or fourth tag sequence or a reverse complement thereof that identifies the second binding moiety.
  • the third concatemer further comprises a third compaction oligonucleotide, wherein: (a) a first segment of the third compaction oligonucleotide is complementary and binds to a first portion of the third concatemer; and (b) a second segment of the third compaction oligonucleotide is complementary and binds to a second portion of the third concatemer, to result in a reduction in the size or a change in the shape of the third concatemer.
  • the fourth concatemer further comprises a fourth compaction oligonucleotide, wherein: (a) a first segment of the fourth compaction oligonucleotide is complementary and binds to a first portion of the fourth concatemer; and (b) a second segment of the fourth compaction oligonucleotide is complementary and binds to a second portion of the fourth concatemer, to result in a reduction in the size or a change in the shape of the fourth concatemer.
  • the system further comprises a second polymerizing enzyme, a second plurality of nucleotides, and a second primer sequence that is complementary to at least a portion of the third concatemer or the fourth concatemer under conditions sufficient to form a binding complex comprising the third or fourth concatemer hybridized to the second primer sequence, the second polymerizing enzyme, and a second nucleotide of the second plurality of nucleotides that is complementary and binds to the a nucleotide of the third or fourth concatemer.
  • the system further comprises a second agent configured to remove a second blocking group from a second nucleotide of the second plurality of nucleotides, and generate a 3 ’ OH group on a sugar moiety of the second nucleotide.
  • the second blocking group comprises an alkyl group, an alkenyl group, an alkynyl group, an allyl group, an aryl group, a benzyl group, an azide group, an azido group, an O -azidomethyl group, an amine group, an amide group, a keto group, an isocyanate group, a phosphate group, a thiol group, a disulfide group, a carbonate group, a urea group, or a silyl group.
  • the second plurality of nucleotides comprises a second fluorescent label.
  • the second plurality of nucleotides consists of at least two of the same type of nucleotide, wherein the same type of nucleotide is selected from the group comprising dATP, dGTP, dCTP, dTTP, and dUTP, wherein the second fluorescent label of one type of nucleotide of the group emits light at a wavelength that is different from the wavelength of light emitted from the second fluorescent label of another type of nucleotide of the group.
  • the second plurality of nucleotides comprises at least two types of nucleotides, wherein the at least two types of nucleotides are selected from the group comprising dATP, dGTP, dCTP, dTTP, and dUTP, wherein the second fluorescent label of one type of nucleotide of the group emits light at a wavelength that is different from the wavelength of light emitted from the second fluorescent label of another type of nucleotide of the group.
  • the system further comprises a second plurality of nucleotide conjugates, wherein a second nucleotide conjugate of the second plurality of nucleotide conjugates is configured to form a multivalent binding complex comprising two or more of the second polymerizing enzyme, the second nucleotide conjugate of the a second plurality of nucleotide conjugates plurality of nucleotide conjugates, and the at least two of the first short nucleic acid or the short nucleic acid or a potion thereof, wherein the second nucleotide conjugate comprises a label and at least two nucleotide moieties that are each complementary and bind to a nucleotide of each of the at least two of the first short nucleic acid or the short nucleic acid or a potion thereof.
  • the system further comprises a solid surface comprising the biological sample immobilized to the solid surface.
  • the biological sample is permeabilized.
  • the solid surface further comprises a hydrophilic polymer coating layer coupled thereto.
  • the hydrophilic polymer coating layer has a water contact angle that is less than 50 degrees.
  • the system further comprises an optical imaging module configured to image the biological sample coupled to the solid surface to detecting in situ the at least two target nucleic acid sequences and/or the at least two target polypeptides in the biological sample.
  • the first target polypeptide is encoded by the first target nucleic acid molecule or a reverse complement thereof and the second target polypeptide is encoded by the second target nucleic acid molecule or a reverse complement thereof.
  • a computer-implemented system comprising a computing device comprising at least one processor, an operating system configured to perform executable instructions, a memory, and a computer program including instruction s executable by the computing device wherein the instructions comprise a method disclosed herein.
  • kits for detecting in situ at least two target nucleic acid sequences in a biological sample wherein the at least two target nucleic acid sequences comprise a first target nucleic acid sequence and a second target nucleic acid sequence
  • the kit comprising: (a) a first oligonucleotide comprising a first end portion and a second end portion, wherein the first end portion and second end portion of the first oligonucleotide are complementary and bind to two neighboring segments of the first target nucleic acid sequence or the reverse complement thereof, or a portion thereof, so that the first oligonucleotide forms a first circular oligonucleotide within the biological sample, wherein the first circular oligonucleot
  • the kit is further configured for detecting in situ at least two target RNA sequences and at least two target polypeptides comprising a first target polypeptide and a second target polypeptide in a biological sample, the kit comprising: the first target polypeptide encoded by the first target nucleic acid molecule or a reverse complement there of, wherein the second target polypeptide is encoded by the second target nucleic acid molecule or a reverse complement thereof, wherein the full sequence of the first target RNA sequence and the full sequence of the second target RNA sequence have at least one nucleotide of difference, wherein the full sequence of the first target polypeptide and the full sequence of the second target polypeptide have at least one amino acid of difference.
  • the first, second, third, or fourth nucleic acid enzyme comprises a nucleic acid ligase, a nucleic acid ligation enzyme, a nucleic acid polymerase, a nucleic acid polymerization enzyme, or combinations thereof.
  • the nucleotide of the plurality of nucleotides comprises a fluorescent label.
  • the nucleotide of the plurality of nucleotides comprises a removable blocking group at the 3’ carbon position of the sugar moiety.
  • the plurality of nucleotides consist of at least two of the same type of nucleotide comprising a fluorescent label, wherein the same type of nucleotide is selected from the group comprising dATP, dGTP, dCTP, dTTP, and dUTP, wherein the fluorescent label of one type of nucleotide of the group emits light at a wavelength that is different from the wavelength of light emitted from the fluorescent label of another type of nucleotide of the group.
  • the plurality of nucleotides comprise at least two types of nucleotides, wherein a type of the at least two types of nucleotides comprises a fluorescent label, wherein the at least two types of nucleotides are selected from the group comprising dATP, dGTP, dCTP, dTTP, and dUTP, wherein the fluorescent label of one type of nucleotide of the group emits light at a wavelength that is different from the wavelength of light emitted from the fluorescent label of another type of nucleotide of the group.
  • the nucleotide conjugate comprises a detectable label
  • the nucleotide conjugate comprises a fluorescent label.
  • the biological sample comprises a human sample, a simian sample, an ape sample, a canine sample, a feline sample, a bovine sample, an equine sample, a murine sample, a porcine sample, a caprine sample, a lupine sample, a ranine sample, a piscine sample, a plant sample, an insect sample, a bacteria sample, an algae sample, a viral sample, a protozoa sample, or a fungus sample.
  • the biological sample comprises a cellular organelle, a cell, a whole cell, a group of whole cells, a tissue, an intact tissue, a tumor, an intact tumor, an organ, or an organism.
  • the biological sample comprises a cellular organelle, a cell, a whole cell, a group of whole cells, a tis sue, an intact tissue, a tumor, an intact tumor, an organ, an organism, a protozoa, an algae, a bacteria, a virus, a plant, a fungus, an insect, or an animal.
  • the biological sample comprises a fresh sample, a processed sample, a freshly-frozen sample, a sectioned sample, or a formalin-fixed and paraffin-embedded (FFPE) sample.
  • FFPE formalin-fixed and paraffin-embedded
  • the first target nucleic acid sequence comprises DNA, cDNA, RNA, coding RNA, non -coding RNA, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti-sense RNA, mature microRNA, or immature microRNA, and/or, wherein the first target RNA sequence comprises coding RNA, non-coding RNA, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti -sense RNA, mature microRNA, or immature microRNA.
  • the second target nucleic acid sequence comprises DNA, cDNA, RNA, coding RNA, non -coding RNA, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti-sense RNA, mature microRNA, or immature microRNA, and/or wherein the second target RNA sequence comprises coding RNA, noncoding RNA, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti-sense RNA, mature microRNA, or immature microRNA.
  • the blocking group comprises an alkyl group, an alkenyl group, an alkynyl group, an allyl group, an aryl group, a benzyl group, an azide group, an azido group, an O-azidomethyl group, an amine group, an amide group, a keto group, an isocyanate group, a phosphate group, a thiol group, a disulfide group, a carbonate group, a urea group, or a silyl group.
  • the kit further comprises: (a) a first oligonucleotide conjugate comprising a first short nucleic acid and a first binding moiety that binds specifically to the first target polypeptide, wherein the first short nucleic acid comprises a first tag sequence, a second tag sequence, wherein the first oligonucleotide conjugate binds specifically to the first target polypeptide through the first binding moiety to form a first binding complex, wherein the firstand second tag sequences identify the first binding moiety; or (b) a second oligonucleotide conjugate comprising a second short nucleic acid and a second binding moiety that binds specifically to the second target polypeptide, wherein the second short nucleic acid comprises a third tag sequence, a fourth tag sequence, wherein the second oligonucleotide conjugate binds specifically to the second target polypeptide through the second binding moiety to form a second binding complex, wherein the third and fourth tag sequences identify the second binding mo
  • the kit further comprises: (a) a third oligonucleotide comprising a first end portion and a second end portion, wherein the first end portion and second end portion of the third oligonucleotide are complementary and bind to the first tag sequence and the second tag sequence of the first oligonucleotide conjugate so that the third oligonucleotide forms a circular structure with a gap between the first end portion and the second end portion; or (b) a fourth oligonucleotide comprising a first end portion and a second end portion, wherein the first end portion and second end portion of the fourth oligonucleotide are complementary and bind to the third tag sequence and the fourth tag sequence so thatthe fourth oligonucleotide forms a circular structure with a gap between the first end portion and the second end portion.
  • the third oligonucleotide comprises a third identification sequence that identifies the first binding moiety
  • the fourth oligonucleotide comprises a fourth identification sequence that identifies the second binding moiety.
  • the first oligonucleotide and third oligonucleotide comprise the same first sequencing primer or the reverse complement thereof
  • the second oligonucleotide and the fourth oligonucleotide comprise the same second sequencing primer or the reverse complement thereof.
  • the kit further comprises an agent that reacts with the reactive group at the 3 ’ carbon of the sugar moiety in the nucleotide moiety of the nucleotide conjugate.
  • the kit further comprises a reagent for use in the nucleotide binding reaction.
  • the reagent comprises a cation.
  • the kit further comprises a reverse transcriptase, a primer for reverse transcription, a sequencing primer, a reagent configured to permeabilize the biological sample, a reagent configured to fix the biological sample, an unblocked nucleotide, a blocked nucleotide, a reagent for use in the nucleotide incorporation reaction, a solution comprising a cation, one or more unlabeled nucleotides, one or more buffers for reverse transcription, one or more buffers for nucleic acid binding, one or more buffers for nucleic acid amplification, or one or more buffers for nucleic acid dissociation
  • the kit further comprises instructions for use of the kit to detect in situ the at least two target nucleic acid sequences in the biological sample.
  • the kit further comprises instructions for use of the kit to detect in situ the at least two target nucleic acid sequences and the at least two target polypeptides in the biological sample.
  • the instructions comprise performing a sequencing by synthesis reaction.
  • the instructions comprise performing a sequencing by binding reaction in which detection in situ is not contemporaneous with a nucleotide incorporation step.
  • the kit further comprises instructions for use of the kit using a method disclosed herein.
  • FIG. 1 is a schematic showing a non-limiting example workflow for generating concatemers inside a cellular sample.
  • Target RNA harbored by a cellular sample is hybridized to a reverse transcription primer (RT primer) and reverse transcription is conducted to synthesize a first strand cDNA.
  • the first strand cDNA is hybridized to a target-specific padlock probe to generate a circularized padlock probe having a nick (solid downward triangle) between the first and second ends of the hybridized padlock probe.
  • the padlock probe carries an adaptor for a universal compaction oligonucleotide binding site, an adaptor for a universal sequencing primer binding site, and a target barcode sequence that corresponds to a given target cDNA.
  • the nick in the circularized padlock probe is ligated to form covalently closed circular padlock probe which carries a cDNA sequence that corresponds to the target RNA.
  • the sequence of an non-limiting example padlock probe which binds selectively to GAPDH cDNA is shown at the bottom of FIG. 1 .
  • the covalently closed circular padlock probe can be subjected to rolling circle amplification inside the cellular sample to generate a concatemer molecule.
  • the concatemer molecule can be reiteratively sequenced inside the cellular sample.
  • FIG. 2 is a schematic showing a non-limiting example workflow for sequencing a concatemer that is generated inside the cell as shown in FIG. 1.
  • the concatemer depicted in FIG. 2 includes tandem repeat units where each unit comprises: (i) a universal sequencing primer binding site (Seq), (ii) universal compaction oligonucleotide binding site (CO), (iii) an insert sequence that corresponds to a given target cDNA, and (iv) a target barcode sequence that corresponds to the given target cDNA (BC).
  • universal sequencing primers hybridize to the universal sequencing primer binding sites and no more than 30 sequencing cycles are conducted to generate a plurality of first sequencing read products (dashed arrows), where the first sequencing read products include only the target barcode sequence.
  • the plurality of first sequencing read products are removed from the concatemer, and the sequencing is repeated where no more than 30 sequencing cycles are conducted to generate another plurality of first sequencing read products (dashed arrows), where the first sequencing read products include only the target barcode sequence.
  • the plurality of first sequencing read products are removed from the concatemer, and the sequencing is once again repeated where no more than 30 sequencing cycles are conducted to generate another plurality of first sequencing read products (dashed arrows), where the first sequencing read products include only the target barcode sequence.
  • the reiterative sequencing can be conducted up to 50 times.
  • the sequences of all of the first sequencing read products can be determined and aligned with a first reference sequence (e.g., reference barcode sequence) to confirm the presence of the first target RNA molecules inside the cellular sample.
  • a first reference sequence e.g., reference barcode sequence
  • FIG. 3 is a schematic showing a non-limiting example workflow for sequencing a concatemer that is generated inside the cell as shown in FIG. 1.
  • the concatemer depicted in FIG. 1 is a schematic showing a non-limiting example workflow for sequencing a concatemer that is generated inside the cell as shown in FIG. 1.
  • each unit comprises: (i) a universal sequencing primer binding site (Seq), (ii) universal compaction oligonucleotide binding site (CO), (iii) an insert sequence that corresponds to a given target cDNA, and (iv) a target barcode sequence that corresponds to the given target cDNA (BC)
  • universal sequencing primers (solid arrows) hybridize to the universal sequencing primer binding sites and no more than 30 sequencing cycles are conducted to generate a plurality of first sequencing read products (dashed arrows), where the first sequencing read products include the target barcode sequence and a portion of the insert sequence.
  • the plurality of first sequencing read products are removed from the concatemer, and the sequencing is repeated where no more than 30 sequencing cycles are conducted to generate another plurality of first sequencing read products (dashed arrows), where the first sequencing read products include the target barcode sequence and a portion of the insert sequence.
  • the plurality of first sequencing read products are removed from the concatemer, and the sequencing is once again repeated where no more than 30 sequencing cycles are conducted to generate another plurality of first sequencing read products (dashed arrows), where the first sequencing read products include the target barcode sequence and a portion of the insert sequence.
  • the reiterative sequencing can be conducted upto 50 times.
  • the sequences of all the first sequencing read products can be determined and aligned with a first reference sequence (e g., reference barcode sequence and the insert sequence that corresponds to the target RNA) to confirm the presence of the first target RNA molecules inside the cellular sample.
  • FIG. 4 is a schematic showing a non-limiting example workflow for sequencing a concatemer that is generated inside the cell as shown in FIG. 1.
  • the concatemer depicted in FIG. 4 is a schematic showing a non-limiting example workflow for sequencing a concatemer that is generated inside the cell as shown in FIG. 1. The concatemer depicted in FIG. 1.
  • each unit comprises: (i) a universal sequencing primer binding site (Seq), (ii) universal compaction oligonucleotide binding site (CO), and (iii) an insert sequence that corresponds to a given target cDNA.
  • universal sequencing primers (solid arrows) hybridize to the universal sequencing primer binding sites and no more than 30 sequencing cycles are conducted to generate a plurality of first sequencing read products (dashed arrows), where the first sequencing read products include a portion of the insert sequence.
  • the plurality of first sequencing read products are removed from the concatemer, and the sequencing is repeated where no more than 30 sequencing cycles are conducted to generate another plurality of first sequencing read products (dashed arrows), where the first sequencing read products include a portion of the insert sequence.
  • the plurality of first sequencing read products are removed from the concatemer, and the sequencing is once again repeated where no more than 30 sequencing cycles are conducted to generate another plurality of first sequencing read products (dashed arrows), where the first sequencing read products include a portion of the insert sequence.
  • the reiterative sequencing can be conducted up to 50 times.
  • the sequences of all of the first sequencing read products can be determined and aligned with a first reference sequence (e.g., the insert sequence that corresponds to the target RNA) to confirm the presence of the first target RNA molecules inside the cellular sample.
  • FIG. 5 is a schematic showing a non-limiting example workflow for sequencing a concatemer that is generated inside the cell as shown in FIG. 1.
  • the concatemer depicted in FIG. 1 is a schematic showing a non-limiting example workflow for sequencing a concatemer that is generated inside the cell as shown in FIG. 1.
  • each unit comprises: (i) a universal sequencing primer binding site (Seq) and (ii) an insert sequence that corresponds to a given target cDNA.
  • universal sequencing primers (solid arrows) hybridize to the universal sequencing primer binding sites and no more than 30 sequencing cycles are conducted to generate a plurality of first sequencing read products (dashed arrows), where the first sequencing read products include a portion of the insert sequence.
  • the plurality of first sequencing read products are removed from the concatemer, and the sequencing is repeated where no more than 30 sequencing cycles are conducted to generate another plurality of first sequencing read products (dashed arrows), where the first sequencing read products include a portion of the insert sequence.
  • the plurality of first sequencing read products are removed from the concatemer, and the sequencing is once again repeated where no more than 30 sequencing cycles are conducted to generate another plurality of first sequencing read products (dashed arrows), where the first sequencing read products include a portion of the insert sequence.
  • the reiterative sequencing can be conducted up to 50 times.
  • the sequences of all of the first sequencing read products can be determined and aligned with a first reference sequence (e g., the insert sequence that corresponds to the target RNA) to confirm the presence of the first target RNA molecules inside the cellular sample.
  • FIG. 6 shows images of fluorescent sequencing signals emitted from HEK 293 cells cultured on a poly -lysine coated flow cell.
  • the cultured cells were fixed, permeabilized, subjected to reverse transcription reactions, and subjected to rolling circle amplification on the flow cell, all under conditions suitable for retaining the cellular nucleic acids inside the cells.
  • the coated flow cell lacked surface capture primers.
  • the retained nucleic acids in the cells were sequenced using a two-stage sequencing method that employed detectably labeled multivalent molecules and unlabeled nucleotide analogs. Thirty cycles of the two-stage sequencing reactions were conducted using multivalent molecules lab eled with one of four fluoroph ores and unlabeled nucleotide analogs.
  • FIG. 6 shows results from the second in situ sequencing experiment (see Example 1).
  • FIG. 6 shows fluorescent signals from cycles 20-25 where the sequence reads CCTCCT. A total of 4620 fluorescent spots were detected with significant signals in all cycles. More than 90% of the fluorescent spots detected the sample index sequences.
  • FIG. 7 is a schematic of various non-limiting example configurations of multivalent molecules.
  • Left (Class I) schematics of multivalent molecules having a “starburst” or “helter- skelter” configuration.
  • Center (Class II) a schematic of a multivalent molecule having a dendrimer configuration.
  • Right (Class III) a schematic of multiple multivalent molecules formed by reacting streptavidin with 4-arm or 8-armPEG-NHS with biotin and dNTPs. Nucleotide units are designated 'N’, biotin is designated 13’, and streptavidin is designated ' SA’.
  • FIG. 8 is a schematic of an non -limiting example multivalent molecule comprising a generic core attached to a plurality of nucleotide-arms.
  • FIG. 9 is a schematic of an non -limiting example multivalent molecule comprising a dendrimer core attached to a plurality of nucleotide-arms.
  • FIG. 10 shows a schematic of a non-limiting example multivalent molecule comprising a core attached to a plurality of nucleotide-arms, where the nucleotide arms comprise biotin, spacer, linker and a nucleotide unit.
  • FIG. 11 is a schematic of a non-limiting example nucleotide-arm comprising a core attachment moiety, spacer, linker and nucleotide unit.
  • FIG. 12 shows the chemical structure of a non-limiting example spacer (top), and the chemical structures of various non-limiting example linkers, including an 11 -atom Linker, 16- atom Linker, 23 -atom Linker and an N3 Linker (bottom).
  • FIG. 13 shows the chemical structures of various non-limiting example linkers, including Linkers 1-9.
  • FIG. 14 shows the chemical structures of various non-limiting example linkers joined/attached to nucleotide units.
  • FIG. 15 shows the chemical structures of various non-limiting example linkers joined/attached to nucleotide units.
  • FIG. 16 shows the chemical structures of various non-limiting example linkers joined/attached to nucleotide units.
  • FIG. 17 shows the chemical structure of a non-limiting example biotinylated nucleotide- arm.
  • the nucleotide unit is connected to the linker via a propargyl amine attachment at the 5 position of a pyrimidine base or the 7 position of a purine base.
  • FIG. 18 is a schematic of a guanine tetrad (e g., G-tetrad).
  • FIG. 19 is a schematic of a non-limiting example intramolecular G-quadruplex structure.
  • FIG. 20 is a schematic showing a workflow for generating circularized padlock probes, comprising generating first and second cDNAs from first and second target RNA molecules (respectively), hybridizing first and second padlock probes to the first and second cDNA molecules (respectively) to generate first and second circularized padlock probes (respectively).
  • the first padlock probe comprises (i) a first target barcode sequence (target BC-l)thatuniquely identifies the first target RNA, (ii) a first batch-specific sequencing primer binding site (Batch Seq-1) (or a complementary sequence thereof), (iii) a universal binding site for an amplification primer (universal RCA) (or a complementary sequence thereof), and (iv) a universal binding site for a compaction oligonucleotide (or a complementary sequence thereof).
  • the second padlock probe comprises (i) a second target barcode sequence (target BC -2) that uniquely identifiesthe second target RNA, (ii) a second batch-specific sequencing primer binding site (Batch Seq-2) (or a complementary sequence thereof), (iii) a universal binding site for an amplification primer (universal RCA) (or a complementary sequence thereof), and (iv) a universal binding site for a compaction oligonucleotide (or a complementary sequence thereof).
  • target BC -2 a second target barcode sequence
  • Batch Seq-2 second batch-specific sequencing primer binding site
  • a universal binding site for an amplification primer universal RCA
  • a compaction oligonucleotide or a complementary sequence thereof
  • FIG. 21 is a schematic showing a rolling circle and sequencing workflow comprising generating first and second concatemers by conducting rolling circle amplification using first and second covalently closed circular molecules (respectively).
  • the firstand second concatemers are subjected to a first sequencing workflow using first batch -specific sequencing primers, sequencing polymerases, and a plurality of nucleotide reagents.
  • the first concatemers undergo reiterative sequencing but the second concatemers do not.
  • the first and second concatemers are subjected to a second sequencing workflow using second batch -specific sequencing primers, sequencing polymerases, and a plurality of nucleotide reagents.
  • the second concatemers undergo reiterative sequencing but the first concatemers do not.
  • FIG. 22 is a schematic showing first and second antibody -oligonucleotide conjugates.
  • the first antibody -oligonucleotide conjugate comprises a first antibody that selectively binds a first target polypeptide.
  • the first antibody is linked to a first oligonucleotide, where the first oligonucleotide carries a first and second tag sequence.
  • the second antibody -oligonucleotide conjugate comprises a second antibody that selectively binds a second target polypeptide.
  • the second antibody is linked to a second oligonucleotide, where the second oligonucleotide carries a third and fourth tag sequence.
  • the first, second, third and fourth tag sequences differ from each other.
  • FIG. 23 is a schematic showing first and second antibody -oligonucleotide conjugates each bound to their respective target polypeptides and their respective padlock probes.
  • the first padlock probe comprises (i) a sequence that can hybridize to the first tag sequence, (ii) a third target barcode sequence (target BC -3) that uniquely identifies the first antibody which selectively binds the first target polypeptide, (iii) a first batch -specific sequencing primer binding site (Batch Seq-1) (or a complementary sequence thereof), (iv) a universal binding site for an amplification primer (universal RCA) (or a complementary sequence thereof), (v) a universal binding site for a compaction oligonucleotide (or a complementary sequence thereof), and (vi) a sequence that can hybridize to the second tag sequence
  • the second padlock probe comprises (i) a sequence that can hybridize to the third tag sequence, (ii) a fourth target barcode sequence (target BC-4) that uniquely identifies the second antibody which selectively binds the second target polypeptide, (iii) a second batch-specific sequencing primer binding site (Batch Seq-2) (or a complementary
  • FIG. 24 is a schematic showing a rolling circle amplification and sequencing workflow for a first target RNA and first target polypeptide
  • a first target cDNA molecule hybridizes with target-specific padlock probe for a first cDNA carrying (i) first and second binding arms that hybridize to the first target cDNA, (ii) a first barcode sequence which uniquely identifies the first target cDNA (BC-1), (iii) a first batch-specific sequencing primer binding site (batch Seq-1), (iv) a universal binding site for an amplification primer, and (v) a universal binding site for a compaction oligonucleotide.
  • the hybridized first padlock probe includes a nick or gap, the nick or gap are enzymatically closed to generate a first covalently closed circular molecule carrying first cDNA sequences.
  • the covalently closed circular molecule undergoes rolling circle amplification to generate a first cDNA concatemer molecule (top concatemer molecule).
  • the first cDNA concatemer molecule undergoes sequencing using first batch-specific sequencing primers.
  • a first antib ody-oligonucleotide conjugate (right top schematic) binds with a first target polypeptide.
  • the oligonucleotide of the first antib ody- oligonucleotide conjugate binds a target- specific padlock probe for a first target polypeptideoligonucleotide conjugate carrying (i) first and second binding arms that hybridize to a first and second tag sequence, (ii) a third barcode sequence (BC -3) which uniquely identifies the first antibody -oligonucleotide conjugate which selectively binds a first target polypeptide, and (iii) a first batch-specific sequencing primer binding site (batch Seq-1), (iv) a universal binding site for an amplification primer, and (v) a universal binding site for a compaction oligonucleotide.
  • the hybridized first padlock probe includes a nick or gap, the nick or gap are enzymatically closed to generate a third covalently closed circular molecule carrying first and second tag sequences.
  • the covalently closed circular molecule undergoes rolling circle amplification to generate a third oligonucleotide-tagged concatemer molecule (bottom concatemer molecule).
  • the third oligonucleotide-tagged concatemer molecule undergoes sequencing using first batch -specific sequencing primers.
  • FIG. 25 is a schematic showing a rolling circle amplification and sequencing workflow for a second target RNA and second target polypeptide.
  • a second target cDNA molecule (left top schematic) hybridizes with target- specific padlock probe for a second cDNA carrying (i) first and second binding arms that hybridize to the second target cDNA, (ii) a second barcode sequence which uniquely identifies the second target cDNA (BC-2), (iii) a second batch-specific sequencing primer binding site (batch Seq-2), (iv) a universal binding site for an amplification primer, and (v) a universal binding site for a compaction oligonucleotide.
  • the hybridized second padlock probe includes a nick or gap, the nick or gap are enzymatically closed to generate a second covalently closed circular molecule carrying second cDNA sequences.
  • the covalently closed circular molecule undergoes rolling circle amplification to generate a second cDNA concatemer molecule (top concatemer molecule).
  • the second cDNA concatemer molecule undergoes sequencing using second batch-specific sequencing primers.
  • a second antibody- oligonucleotide conjugate (right top schematic) binds with a second target polypeptide.
  • the oligonucleotide of the second antib ody-oligonucleotide conjugate binds a target-specific padlock probe for a second target polypeptide-oligonucleotide conjugate carrying (i) first and second binding arms that hybridize to a third and fourth tag sequence, (ii) a fourth barcode sequence (BC-4) which uniquely identifies the second antibody-oligonucleotide conjugate which selectively binds a second target polypeptide, and (iii) a second batch-specific sequencing primer binding site (batch Seq-2), (iv) a universal binding site for an amplification primer, and (v) a universal binding site for a compaction oligonucleotide.
  • BC-4 fourth barcode sequence
  • the hybridized fourth padlock probe includes a nick or gap, the nick or gap are enzymatically closed to generate a fourth covalently closed circular molecule carrying third and fourth tag sequences.
  • the covalently closed circular molecule undergoes rolling circle amplification to generate a fourth oligonucleotide-tagged concatemer molecule (bottom concatemer molecule).
  • the fourth oligonucleotide-tagged concatemer molecule undergoes sequencing using second b atch-specific sequencing primers.
  • FIG. 26 shows a computer system that is programmed or otherwise configured to implement methods provided herein.
  • FIG. 27 depicts a non-limiting example method of an antibody conjugated to multiple oligonucleotides via multiple linkers. Detection efficiency may be increased by adding linker moieties directly orvia multivalent structures as described herein. This may reduce non detection due to rolling circle amplification.
  • FIG. 28 depicts a non-limiting example method as disclosed herein.
  • a first antibody- oligonucleotide conjugate comprising a first antibody and a first oligonucleotide tag and a second antibody-oligonucleotide conjugate comprising a second antibody and a second oligonucleotide tag can be used against the same target protein corresponding to a target RNA.
  • the first oligonucleotide tag and the second oligonucleotide tag can be used to circularize a padlock probe as disclosed herein. This method can increase binding specificity.
  • FIG. 29 depicts a non-limiting example method as disclosed herein.
  • a first antibody- oligonucleotide conjugate comprising a first antibody and a first oligonucleotide tag and a second antibody-oligonucleotide conjugate comprising a second antibody and a second oligonucleotide tag can be used against the same target protein corresponding to a targetRNA.
  • the first oligonucleotide tag and the second oligonucleotide tag can be used to circularize a padlock probe as disclosed herein. This method can increase binding specificity.
  • the term “and/or” as used in a phrase such as “A, B, and/or C” is intended to encompass each of the following aspects: “A, B, and C”; “A, B, or C”; “A or C”; “A orB”; “B or C”; “A and B”; “B and C”; “A and C”; “A” (A alone); “B” (B alone); or“C” (C alone).
  • the terms “about” and “approximately” refer to a value or composition that is within an acceptable error range for the particular value or composition as determined by one of ordinary skill in the art, which will depend in part on how the value or composition is measured or determined, i.e., the limitations of the measurement system.
  • “about” or “approximately” can mean within one or more than one standard deviation per the practice in the art.
  • “about” or “approximately” can mean a range of up to 10% (i.e., ⁇ 10%) or more depending on the limitations of the measurement system.
  • about 5 mg can include any number between 4.5 mgand 5.5 mg.
  • the terms can mean up to an order of magnitude or up to 5 -fold of a value.
  • the meaning of “about” or “approximately” should be assumed to be within an acceptable error range for that particular value or composition.
  • the ranges and/or subranges can include the endpoints of the ranges and/or subranges.
  • polymerase and its variants, as used herein, comprises an enzyme comprising a domain that binds a nucleotide (or nucleoside) where the polymerase can form a complex having a template nucleic acid and a complementary nucleotide.
  • the polymerase can have one or more activities including, but not limited to, base analog detection activities, DNA polymerization activity, reverse transcriptase activity, DNA binding, strand displacement activity, and nucleotide binding and recognition.
  • a polymerase canbe any enzyme that can catalyze polymerization of nucleotides (including analogs thereof) into a nucleic acid strand.
  • nucleotide polymerization can occur in a templatedependent fashion.
  • a polymerase comprises one or more active sites at which nucleotide binding and/or catalysis of nucleotide polymerization can occur.
  • a polymerase includes other enzymatic activities, such as for example, 3 ’ to 5 ’ exonuclease activity or 5’ to 3 ’ exonuclease activity.
  • a polymerase has strand displacing activity.
  • a polymerase can include without limitation naturally occurring polymerases and any subunits and truncations thereof, mutant polymerases, variant polymerases, recombinant, fusion or otherwise engineered polymerases, chemically modified polymerases, synthetic molecules or assemblies, and any analogs, derivatives or fragments thereof that retain the ability to catalyze nucleotide polymerization (e.g., catalytically active fragment).
  • the polymerase includes catalytically inactive polymerases, catalytically active polymerases, reverse transcriptases, and other enzymes comprising a nucleotide binding domain.
  • a polymerase canbe isolated from a cell, or generated using recombinant DNA technology or chemical synthesis methods.
  • a polymerase can be expressed in prokaryote, eukaryote, viral, or phage organisms. In some embodiments, a polymerase can be post-translationally modified proteins or fragments thereof. A polymerase can be derived from a prokaryote, eukaryote, virus or phage. A polymerase comprises DNA- directed DNA polymerase and RNA-directed DNA polymerase.
  • strand displacing refers to the ability of a polymerase to locally separate strands of double-stranded nucleic acids and synthesize a new strand in a template-based manner.
  • Strand displacing polymerases displace a complementary strand from a template strand and catalyze new strand synthesis.
  • Strand displacing polymerases include mesophilic and thermophilic polymerases.
  • Strand displacing polymerases include wild type enzymes, and variants including exonuclease minus mutants, mutant versions, chimeric enzymes and truncated enzymes.
  • strand displacing polymerases examples include phi29 DNA polymerase, large fragment of BstDNA polymerase, large fragment of Bsu DNA polymerase (exo-), Bea DNA polymerase (exo-), Klenow fragment of E. coli DNA polymerase, T5 polymerase, M-MuLV reverse transcriptase, HIV viral reverse transcriptase, Deep Vent DNA polymerase and KOD DNA polymerase.
  • the phi29 DNA polymerase can be wild type phi29 DNA polymerase (e.g., MagniPhi from Expedeon), or variant EquiPhi29 DNA polymerase (e.g., from Thermo Fisher Scientific), or chimeric QualiPhi DNA polymerase (e g., from 4basebio).
  • wild type phi29 DNA polymerase e.g., MagniPhi from Expedeon
  • variant EquiPhi29 DNA polymerase e.g., from Thermo Fisher Scientific
  • chimeric QualiPhi DNA polymerase e g., from 4basebio.
  • Nucleic acid include recombinant and chemically-synthesized forms. Nucleic acids can be isolated.
  • Nucleic acids include DNA molecules (e.g., cDNA or genomic DNA), RNA molecules (e.g., mRNA), analogs of the DNA or RNA generated using nucleotide analogs (e.g., peptide nucleic acids (PNA) and non -naturally occurring nucleotide analogs), and chimeric forms containing DNA and RNA.
  • Nucleic acids can be single -stranded or doublestranded.
  • Nucleic acids comprise polymers of nucleotides, where the nucleotides include natural or non-natural bases and/or sugars.
  • Nucleic acids comprise naturally -occurring internucleosidic linkages, for example phosphodiester linkages. Nucleic acids can lack a phosphate group.
  • Nucleic acids comprise non-natural intemucleoside linkages, including phosphorothioate, phosphorothiolate, or peptide nucleic acid (PNA) linkages.
  • nucleic acids comprise a one type of polynucleotides or a mixture of two or more different types of polynucleotides.
  • operably linked and “operably joined” or related terms as used herein refers to juxtaposition of components.
  • the juxtaposed components can be linked together covalently.
  • two nucleic acid components can be enzymatically ligated together where the linkage thatjoins together the two components comprises phosphodiester linkage.
  • a first and second nucleic acid component can be linked together, where the first nucleic acid component can confer a function on a second nucleic acid component.
  • linkage between a primer binding sequence and a sequence of interest forms a nucleic acid library molecule having a portion that can bind to a primer.
  • a transgene e.g., a nucleic acid encoding a polypeptide or a nucleic acid sequence of interest
  • a transgene can be ligated to a vector where the linkage permits expression or functioning of the transgene sequence contained in the vector.
  • a transgene is operably linked to a host cell regulatory sequence (e.g., a promoter sequence) that affects expression of the transgene.
  • the vector comprises at least one host cell regulatory sequence, including a promoter sequence, enhancer, transcription and/or translation initiation sequence, transcription and/or translation termination sequence, polypeptide secretion signal sequences, and the like.
  • the host cell regulatory sequence controls expression of the level, timing and/or location of the transgene.
  • the terms “linked”, “joined”, “attached”, “appended” and variants thereof comprise any type of fusion, bond, adherence or association between any combination of compounds or molecules that is of sufficient stability to withstand use in a particular procedure.
  • the procedure can include but is not limited to: nucleotide binding; nucleotide incorporation; de-blocking (e.g., removal of chain -terminating moiety); washing; removing, flowing; detecting; imaging and/or identifying; or any combination thereof.
  • Such linkage can comprise, for example, covalent, ionic, hydrogen, dipole-dipole, hydrophilic, hydrophobic, or affinity bonding, bonds or associations involving van der Waals forces, mechanical bonding, the like, or any combination thereof.
  • linkage occurs intramolecularly, for example linking together the ends of a single-stranded or double-stranded linear nucleic acid molecule to form a circular molecule.
  • linkage can occur between a combination of different molecules, or between a molecule and a non-molecule, includingbut not limited to: linkage between a nucleic acid molecule and a solid surface; linkage between a protein and a detectable reporter moiety, linkage between a nucleotide and detectable reporter moiety; and the like.
  • linkages can be found, for example, in Hermanson, G., “Bioconjugate Techniques”, SecondEdition (2008); Aslam, M., Dent, A., “Bioconjugation: Protein Coupling Techniques for the Biomedical Sciences”, London: Macmillan (1998); Aslam, M., Dent, A., “Bioconjugation: Protein Coupling Techniques for the Biomedical Sciences”, London: Macmillan (1998), which is incorporated herein by reference in its entirety.
  • primer refers to an oligonucleotide that is capable of hybridizing with aDNA and/or RNA polynucleotide template to form a duplex molecule.
  • Primers comprise natural nucleotides and/or nucleotide analogs.
  • Primers can be recombinant nucleic acid molecules.
  • Primers may have any length, but typically range from 4-50 nucleotides.
  • Atypical primer comprises a 5’ end and 3’ end.
  • the 3’ end of the primer can include a 3’ OH moiety which serves as a nucleotide polymerization initiation site in a polymerase-catalyzed primer extension reaction.
  • the 3 ’ end of the primer can lack a 3 ’ OH moiety, or can include a terminal 3 ’ blocking group that inhibits nucleotide polymerization in a polymerase-catalyzed reaction. Any one nucleotide, or more than one nucleotide, alongthe length of the primer canbe labeled with a detectable reporter moiety.
  • a primer can be in solution (e.g., a soluble primer) or can be immobilized to a support (e.g., a capture primer).
  • template nucleic acid refers to a nucleic acid strand that serves as the basis nucleic acid molecule for any of the reiterative sequencing methods described herein.
  • the template nucleic acid can be single-stranded or double-stranded, or the template nucleic acid can have single-stranded or double-stranded portions.
  • the template nucleic acid can be obtained from a naturally -occurring source, recombinant form, or chemically synthesized to include any type of nucleic acid analog.
  • the template nucleic acid can be linear, circular, or other forms.
  • the template nucleic acids can include an insert portion having an insert sequence.
  • the template nucleic acids can also include at least one adaptor sequence.
  • the insert portion can be isolated in any form, including chromosomal, genomic, organellar (e.g., mitochondrial, chloroplast or ribosomal), recombinant molecules, cloned, amplified, cDNA, RNA such as precursor mRNA or mRNA, oligonucleotides, whole genomic DNA, obtained from fresh frozen paraffin embedded tissue, needle biopsies, circulating tumor cells, cell free circulating DNA, or any type of nucleic acidlibrary.
  • the insertportion can beisolated from any source including from organisms such as prokaryotes, eukaryotes (e.g., humans, plants and animals), fungus, viruses cells, tissues, normal or diseased cells or tissues, body fluids including blood, urine, serum, lymph, tumor, saliva, anal and vaginal secretions, amniotic samples, perspiration, semen, environmental samples, culture samples, or synthesized nucleic acid molecules prepared using recombinant molecular biology or chemical synthesis methods.
  • organisms such as prokaryotes, eukaryotes (e.g., humans, plants and animals), fungus, viruses cells, tissues, normal or diseased cells or tissues, body fluids including blood, urine, serum, lymph, tumor, saliva, anal and vaginal secretions, amniotic samples, perspiration, semen, environmental samples, culture samples, or synthesized nucleic acid molecules prepared using recombinant molecular biology or chemical synthesis methods.
  • the insert portion canbe isolated from any organ, including, head, neck, brain, breast, ovary, cervix, colon, rectum, endometrium, gallbladder, intestines, bladder, prostate, testicles, liver, lung, kidney, esophagus, pancreas, thyroid, pituitary; thymus, skin, heart, larynx, or other organs.
  • the template nucleic acid can be subjected to nucleic acid analysis, including sequencing and composition analysis.
  • the term “adaptor” and related terms refers to oligonucleotides that can be operably linked to a target polynucleotide, where the adaptor confers a function to the co-joined adaptortarget molecule.
  • Adaptors may comprise DNA, RNA, chimeric DNA/RNA, or analogs thereof. Adaptors can include at least one ribonucleoside residue. Adaptors can be single -stranded, double-stranded, or have single-stranded and/or double-stranded portions. Adaptors can be configured to be linear, stem-looped, hairpin, or Y-shaped forms. Adaptors can be any length, including 4-100 nucleotides or longer. Adaptors can have blunt ends, overhang ends, or a combination of both. Overhang ends include s’ overhang and 3 ’ overhang ends.
  • the 5’ end of a single-stranded adaptor, or one strand of a double-stranded adaptor, can have a 5 ’ phosphate group or lack a 5’ phosphate group.
  • Adaptors can include a 5’ tail that does not hybridize to a target polynucleotide (e.g., tailed adaptor), or adaptors can be non-tailed.
  • An adaptor can include a sequence that is complementary to at least a portion of a primer, such as an amplification primer, a sequencing primer, or a capture primer (e.g., soluble or immobilized capture primers).
  • Adaptors can include a random sequence or degenerate sequence.
  • Adaptors can include at least one inosine residue.
  • Adaptors can include at least one phosphorothioate, phosphorothiolate and/or phosphoramidate linkage.
  • Adaptors can include a barcode sequence which can be used to distinguish polynucleotides (e.g., insert sequences) from different sample sources in a multiplex assay.
  • Adaptors can include a unique identification sequence (e.g., unique molecular index, UMI; or a unique molecular tag) that can be used to uniquely identify a nucleic acid molecule to which the adaptor is appended.
  • a unique identification sequence can be used to increase error correction and accuracy, reduce the rate of false -positive variant calls and/or increase sensitivity of variant detection.
  • Adaptors can include at least one restriction enzyme recognition sequence, including any one or any combination of two or more selected from a group consisting of type I, type II, type in, type IV, type Hs or type IIB .
  • any of the amplification primer sequences, sequencing primer sequences, capture primer sequences, target capture sequences, circularization anchor sequences, sample barcode sequences, spatial barcode sequences, or anchor region sequences can be about 3-50 nucleotides in length, or about 5-40 nucleotides in length, or about 5-25 nucleotides in length.
  • universal sequence refers to a sequence in a nucleic acid molecule that is common among two or more polynucleotide molecules.
  • an adaptor having a universal sequence can be operably joined to a plurality of polynucleotides so thatthe population of co-joined molecules carry the same universal adaptor sequence.
  • Examples of universal adaptor sequences include an amplification primer sequence, a sequencing primer sequence or a capture primer sequence (e g., soluble or immobilized capture primers).
  • hybridize or “hybridizing” or “hybridization” or other related terms refers to hydrogen bonding between two different nucleic acids to form a duplex nucleic acid.
  • Hybridization also includes hydrogen bonding between two different regions of a single nucleic acid molecule to form a self-hybridizing molecule having a duplex region.
  • Hybridization can comprise Watson-Crick or Hoogsteen bindingto form a duplex double-stranded nucleic acid, or a double-stranded region within a nucleic acid molecule.
  • the double-stranded nucleic acid may be wholly complementary, or partially complementary.
  • Complementary nucleic acid strands need not hybridize with each other across their entire length.
  • the complementary base pairing can be the standard A-T or C-Gbase pairing, or can be other forms of base-pairing interactions.
  • Duplex nucleic acids can include mismatched base-paired nucleotides.
  • nucleic acid incorporation comprises polymerization of one or more nucleotides into the terminal 3 ’ OH end of a nucleic acid strand, resulting in extension of the nucleic acid strand.
  • Nucleotide incorporation can be conducted with natural nucleotides and/or nucleotide analogs. Typically, but not necessarily, nucleotide incorporation occurs in a template-dependent fashion. Any suitable method of extending a nucleic acid molecule may be used, including primer extension catalyzed by a DNA polymerase or RNA polymerase.
  • nucleotides refers to a molecule comprising an aromatic base, a five carbon sugar (e g., ribose or deoxyribose), and at least one phosphate group. Canonical or non-canonical nucleotides are consistent with use of the term.
  • the phosphate in some embodiments comprises a monophosphate, diphosphate, or triphosphate, or corresponding phosphate analog.
  • nucleoside refers to a molecule comprising an aromatic base and a sugar. Nucleotides and nucleosides can be non-labeled or labeled with a detectable reporter moiety.
  • Nucleotides typically comprise a hetero cyclic base including sub stituted or unsub stituted nitrogen-containing parent heteroaromatic ring which are commonly found in nucleic acids, including naturally -occurring, substituted, modified, or engineered variants, or analogs of the same.
  • the base of a nucleotide (or nucleoside) is capable of forming Watson-Crick and/or Hoogsteen hydrogen bonds with an appropriate complementary base.
  • Nonlimiting example bases include, but are not limited to, purines and pyrimidines such as: 2- aminopurine, 2, 6-diamin opurine, adenine (A), ethenoadenine, N6-A2-isopentenyladenine (6iA), N6-A2-isopentenyl-2-methylthioadenine (2ms6iA), N6-methyladenine, guanine (G), isoguanine, N2-dimethylguanine (dmG), 7-methylguanine (7mG), 2-thiopyrimidine, 6- thioguanine (6sG), hypoxanthine andO6-methylguanine; 7-deaza -purines such as 7- deazaadenine (7-deaza-A) and 7 -deazaguanine (7-deaza-G); pyrimidines such as cytosine (C), 5- propynylcytosine, isocytosine, thymine (T), 4-thiothymine
  • Nucleotides typically comprise a sugar moiety, such as carbocyclic moiety (Ferraro and Gotor 2000 Chem. Rev. 100: 4319-48, which is incorporated herein by reference in its entirety), acyclic moieties (Martinez, et al., 1999 Nucleic AcidsResearch 27 : 1271-1274; Martinez, et al., 1997Bioorganic & Medicinal Chemistry Letters vol. 7: 3013-3016, which are both incorporated herein by reference in their entirety), and other sugar moieties (Joeng, et al., 1993 J. Med. Chem. 36: 2627-2638; Kim, etal., 1993 J. Med. Chem.
  • the sugar moiety comprises: ribosyl; 2’- deoxyribosyl; 3 ’-deoxyribosyl; 2’, 3 ’-dideoxyribosyl; 2 ’,3 ’-didehydrodideoxy ribosyl; 2’- alkoxyribosyl; 2’ -azidoribosyl; 2 ’-amino ribosyl; 2 ’-fluoro ribosyl; 2’-mercaptoriboxyl; 2’- alkylthioribosyl; 3 ’-alkoxyribosyl; 3 ’ -azidoribosyl; 3 ’-aminoribosyl; 3 ’-fluororibosyl; 2’-mercaptoriboxyl; 2’- alkylthioribosyl; 3 ’-alkoxyribosyl; 3 ’ -azidoribo
  • nucleotides comprise a chain of one, two or three phosphorus atoms where the chain is typically attached to the 5 ’ carbon of the sugar moiety via an ester or phosphoramide linkage.
  • the nucleotide is an analog having a phosphorus chain in which the phosphorus atoms are linked together with intervening 0, S, NH, methylene or ethylene.
  • the phosphorus atoms in the chain include substituted side groups including 0, S or BH3.
  • the chain includes ph osphate groups substituted with analogs including phosphoramidate, phosphorothioate, phosphorodithioate, and O-methyl phosphoramidite groups.
  • reporter moiety refers to a compound that generates, or causes to generate, a detectable signal.
  • a reporter moiety is sometimes called a “label”. Any suitable reporter moiety may be used, including luminescent, photoluminescent, electroluminescent, bioluminescent, chemiluminescent, fluorescent, phosphorescent, chromophore, radioisotope, electrochemical, mass spectrometry, Raman, hapten, affinity tag, atom, or an enzyme.
  • a reporter moiety generates a detectable signal resulting from a chemical or physical change (e.g., heat, light, electrical, pH, salt concentration, enzymatic activity, or proximity events).
  • a proximity event includes two reporter moieties approaching each other, or associating with each other, or binding each other.
  • reporter moieties may be selected so that each absorb sexcitation radiation and/or emits fluorescence at a wavelength distinguishable from the other reporter moieties to permit monitoring the presence of different reporter moi eties in the same reaction or in different reactions.
  • Two or more different reporter moieties can be selected having spectrally distinct emission profiles, or having minimal overlapping spectral emission profiles.
  • Reporter moieties can be linked (e.g., operably linked) to nucleotides, nucleosides, nucleic acids, enzymes (e.g., polymerases or reverse transcriptases), or support (e.g., surfaces).
  • a reporter moiety may comprise a fluorescent label or a fluorophore.
  • fluorescent moieties which may serve as fluorescent labels or fluorophores include, but are not limited to, fluorescein and fluorescein derivatives such as carboxyfluorescein, tetrachlorofluorescein, hexachlorofluorescein, carboxynapthofluorescein, fluorescein isothiocyanate, NHS-fluorescein, iodoacetamidofluorescein, fluorescein maleimide, SAMSA-fluorescein, fluorescein thiosemicarbazide, carbohydrazinomethylthioacetyl-amino fluorescein, rhodamine and rhodamine derivatives such as TRITC, TMR, lissamine rhodamine, Texas Red, rhodamine B, rhodamine 6G, rhodamine
  • cyanine fluorophores include, for example, Cy3, (which may comprise l-[6-(2,5-dioxopyrrolidin-l- yloxy)-6-oxohexyl]-2-(3 - ⁇ 1- [6-(2,5-dioxopyrrolidin-l-yloxy)-6-oxohexyl]-3,3 -dimethyl-1,3 - dihydro-2H-indol-2-ylidene ⁇ prop-l-en-l-yl)-3,3-dimethyl-3H-indolium or l-[6-(2,5- dioxopyrrolidin-l-yloxy)-6-oxohexyl]-2-(3- ⁇ 1- [6-(2,5-dioxopyrrolidin-l-yloxy)-6-oxohexyl]- 3,3-dimethyl-5-sulfo-l,3-dihydro-2
  • the reporter moiety canbe a FRET pair, such thatmultiple classifications can be performed under a single excitation and imaging step.
  • FRET may comprise excitation exchange (Forster) transfers, or electron -exchange (Dexter) transfers.
  • the terms “amplify”, “amplifying”, “amplification”, and other related terms include producing multiple copies of an original polynucleotide template molecule, where the copies comprise a sequence that is complementary to the template sequence, or the copies comprise a sequence that is the same as the template sequence. In some embodiments, the copies comprise a sequence that is substantially identical to a template sequence, or is substantially identical to a sequence that is complementary to the template sequence.
  • the term “support” as used herein refers to a substrate that is designed for deposition of biological molecules or biological samples for assays and/or analyses.
  • biological molecules to be deposited onto a support include nucleic acids (e.g., DNA, RNA), polypeptides, saccharides, lipids, a single cell or multiple cells.
  • biological samples include but are not limited to saliva, phlegm, mucus, blood, plasma, serum, urine, stool, sweat, tears and fluids from tissues or organs.
  • the support is solid, semi-solid, or a combination of both.
  • the support is porous, semi-porous, non-porous, or any combination of porosity.
  • the support can be substantially planar, concave, convex, or any combination thereof.
  • the support can be cylindrical, for example comprising a capillary or interior surface of a capillary.
  • the surface of the support can be substantially smooth.
  • the support can be regularly or irregularly textured, including bumps, etched, pores, three-dimensional scaffolds, or any combination thereof.
  • the support comprises a bead having any shape, including spherical, hemi-spherical, cylindrical, barrel-shaped, toroidal, disc-shaped, rod-like, conical, triangular, cubical, polygonal, tubular or wire-like.
  • the support can be fabricated from any material, including but not limited to glass, fused-silica, silicon, a polymer (e.g., polystyrene (PS), macroporous polystyrene (1VIPPS), polymethylmethacrylate (PMMA), polycarbonate (PC), polypropylene (PP), polyethylene (PE), high density polyethylene (HDPE), cyclic olefin polymers (COP), cyclic olefin copolymers (COC), polyethylene terephthalate (PET)), or any combination thereof.
  • a polymer e.g., polystyrene (PS), macroporous polystyrene (1VIPPS), polymethylmethacrylate (PMMA), polycarbonate (PC), polypropylene (PP), polyethylene (PE), high density polyethylene (HDPE), cyclic olefin polymers (COP), cyclic olefin copolymers (COC), polyethylene terephthalate (PET)
  • the support can have a plurality (e.g., two or more) of nucleic acid templates immobilized thereon.
  • the plurality of immobilized nucleic acid templates may havethe same sequence or have different sequences.
  • individual nucleic acid template molecules in the plurality of nucleic acid templates are immobilized to a different site on the support.
  • two or more individual nucleic acid template molecules in the plurality of nucleic acid templates are immobilized to a site on the support.
  • array refers to a support comprising a plurality of sites located at predetermined locations on the support to form an array of sites.
  • the sites can be discrete and separated by interstitial regions.
  • the pre -determined sites on the support can be arranged in one dimension in a row or a column, or arranged in two dimensions in rows and columns.
  • the plurality of pre -determined sites is arranged on the support in an organized fashion.
  • the plurality of pre -determined sites is arranged in any organized pattern, including rectilinear, hexagonal patterns, grid patterns, patterns having reflective symmetry, patterns having rotational symmetry, or the like. The pitch between different pairs of sites can be that same or can vary.
  • the support comprises at least 102 sites, at least 103 sites, at least 104 sites, at least 105 sites, at least 106 sites, at least 107 sites, at least 108 sites, at least 109 sites, at least 1010 sites, at least 1011 sites, at least 1012 sites, at least 1013 sites, at least 1014 sites, at least 1015 sites, or more, where the sites are located at pre-determined locations on the support.
  • a plurality of pre-determined sites on the support e.g., 102 - 1015 sites or more
  • the nucleic acid templates that are immobilized at a plurality of pre -determined sites by hybridization to immobilized surface capture primers, or the nucleic acid templates are covalently attached to the surface capture primer.
  • the nucleic acid templates that are immobilized at a plurality of pre-determined sites for example immobilized at 102 - 1015 sites or more.
  • the immobilized nucleic acid templates are clonally -amplified to generate immobilized nucleic acid clusters at the plurality of pre-determined sites.
  • individual immobilized nucleic acid clusters comprise linear clusters, or comprise singlestranded or double-stranded concatemers.
  • a support comprising a plurality of sites located at random locations on the support is referred to herein as a support having randomly located sites thereon.
  • the location of the randomly located sites on the support are not pre -determined.
  • the plurality of randomly -located sites is arranged on the support in a disordered and/or unpredictable fashion.
  • the support comprises at least 102 sites, at least 103 sites, at least 104 sites, atleast 105 sites, atleast 106 sites, atleast 107 sites, atleast 108 sites, atleast 109 sites, at least 1010 sites, atleast 1011 sites, atleast 1012 sites, atleast 1013 sites, atleast 1014 sites, at least 1015 sites, or more, where the sites are randomly located on the support.
  • a plurality of randomly located sites on the support e.g., 102 - 1015 sites or more
  • the nucleic acid templates that are immobilized at a plurality of randomly located sites by hybridization to immobilized surface capture primers, or the nucleic acid templates are covalently attached to the surface capture primer.
  • the nucleic acid templates that are immobilized at a plurality of randomly located sites for example immobilized at 102 - 1015 sites or more.
  • the immobilized nucleic acid templates are clonally-amplified to generate immobilized nucleic acid clusters at the plurality of randomly located sites.
  • individual immobilized nucleic acid clusters comprise linear clusters, or comprise single -stranded or double-stranded concatemers.
  • the plurality of immobilized surface capture primers on the support are in fluid communication with each other to permit flowing a solution of reagents (e.g., nucleic acid template molecules, soluble primers, enzymes, nucleotides, divalent cations, buffers, and the like) onto the support so that the plurality of immobilized surface capture primers on the support can be essentially simultaneously reacted with the reagents in a massively parallel manner.
  • the fluid communication of the plurality of immobilized surface capture primers can be used to conduct nucleic acid amplification reactions (e.g., RCA, MDA, PCR and bridge amplification) essentially simultaneously on the plurality of immobilized surface capture primers.
  • the plurality of immobilized nucleic acid clusters on the support are in fluid communication with each other to permit flowing a solution of reagents (e g., enzymes, nucleotides, divalent cations, and the like) onto the support so that the plurality of immobilized nucleic acid clusters on the support can be essentially simultaneously reacted with the reagents in a massively parallel manner.
  • reagents e g., enzymes, nucleotides, divalent cations, and the like
  • the fluid communication of the plurality of immobilized nucleic acid clusters can be used to conduct nucleotide binding assays and/or conduct nucleotide polymerization reactions (e.g., primer extension or sequencing) essentially simultaneously on the plurality of immobilized nucleic acid clusters, and optionally to conduct detection and imaging for massively parallel sequencing.
  • nucleotide binding assays e.g., primer extension or sequencing
  • nucleotide polymerization reactions e.g., primer extension or sequencing
  • immobilized When used in reference to immobilized enzymes, the term “immobilized” and related terms refer to enzymes (e g , polymerases) that are attached to a support through covalent bond or non-covalent interaction, or attached to a coating on the support, or buried within a matrix formed by a coating on the support.
  • enzymes e g , polymerases
  • immobilized When used in reference to immobilized nucleic acids, the term “immobilized” and related terms refer to nucleic acid molecules that are attached to a support through covalentbond or non-covalent interaction, or attached to a coating on the support, or buried within a matrix formed by a coating on the support, where the nucleic acid molecules include surface capture primers, nucleic acid template molecules and extension products of capture primers. Extension products of capture primers includes nucleic acid concatemers (e.g., nucleic acid clusters).
  • one or more nucleic acid templates are immobilized on the support, for example immobilized at the sites on the support.
  • the one or more nucleic acid templates are clonally-amplified. In some embodiments, the one or more nucleic acid templates are clonally-amplified off the support (e.g., in-solution) and then deposited onto the support and immobilized on the support. In some embodiments, the clonal amplification reaction of the one or more nucleic acid templates is conducted on the support resultingin immobilization on the support.
  • the one or more nucleic acid templates are clonally -amplified (e.g., in solution or on the support) using a nucleic acid amplification reaction, including any one or any combination of : polymerase chain reaction (PCR), multiple displacement amplification (MDA), transcription -mediated amplification (TMA), nucleic acid sequence-based amplification (NASBA), strand displacement amplification (SDA), real-time SDA, bridge amplification, isothermal bridge amplification, rolling circle amplification (RCA), circle-to-circle amplification, helicase-dependent amplification, recombinase-dependent amplification, and/or single-stranded binding (SSB) protein-dependent amplification.
  • PCR polymerase chain reaction
  • MDA multiple displacement amplification
  • TMA transcription -mediated amplification
  • NASBA nucleic acid sequence-based amplification
  • SDA strand displacement amplification
  • bridge amplification isothermal bridge amplification
  • Persistence time refers to the length of time that a binding complex, which is formed between the target nucleic acid, a polymerase, a conjugated or unconjugated nucleotide, remains stable without any binding component dissociates from the binding complex.
  • the persistence time is indicative of the stability of the binding complex and strength of the binding interactions.
  • Persistence time can be measured by observing the onset and/or duration of a binding complex, such as by observing a signal from a labeled component of the binding complex.
  • a labeled nucleotide or a labeled reagent comprising one or more nucleotides may be present in a binding complex, thus allowing the signal from the label to be detected duringthe persistence time of the binding complex.
  • a label is a fluorescent label.
  • the present disclosure provides methods, compositions, systemsand kits forusein processing or analyzing a biomolecule.
  • the biomolecule may comprise, e.g., a nucleic acid, a polypeptide or protein, a lipid, or a carbohydrate.
  • Methods, compositions, systems and kits as described herein may be used to obtain information about a location of a nucleic acid sequence, such as an RNA or a DNA, and a protein encoded by the nucleic acid sequence.
  • target nucleic acid sequences may be identified and sequenced as disclosed herein.
  • a target RNA or DNA molecule associated with a target protein or polypeptide may be identified by a padlock probe as described herein, and may be further amplified and/or sequenced. In this manner, information can be gathered about the location and/or sequence of a target protein, RNA sequence, and DNA sequence.
  • Methods, systems, kits for use in processing and/or analyzing a nucleic acid sequence.
  • Methods, systemskit and compositions as described herein maybe usedin in situ reiterative short read sequencing.
  • the sequencing may be performed in one or more cells, tissues, or tumors.
  • the nucleic acid sequence may be, e.g., a DNA sequence or an RNA sequence.
  • Methods, systems, kits and compositions as described herein maybe useful in leveraging massively parallel sequencing technologies. Such methods, composition, kits, and systems can also be found in US Publication No. US20210139884, which is incorporated herein by reference in its entirety.
  • the present disclosure provides methods for analyzing a nucleic acid sequence.
  • Methods as disclosed herein may comprise introducing one or more reagents to the biological sample under conditions sufficient to identify a DNA encoding an RNA, the RNA encoding a protein, or the protein in a single nucleic acid sequencing.
  • the identifying may comprise running any of the methods disclosed herein.
  • the methods disclosed herein do not require the use of a barcoded nucleic acid, and instead, can utilize spatially encoded reagents. For example, primers spatially encoded on a solid surface may be identified by its relative position on the solid surface.
  • the methods described herein may employ in situ reiterative short read sequencing within a cellular sample including a single cell, multiple cells, a tissue or a tumor.
  • the methods comprise repeatedly conducting a short number of sequencing cycles of the same region of the template molecules.
  • the RNA content of the cellular sample can be discovered.
  • Methods as described herein may be employed in conducting transcriptomics workflows which leverages massively parallel sequencing technologies. Compared to long read sequencing workflows, the reiterative short sequencing cycles described herein use a reduced amount of sequencing reagents which reduces cost and saves time. Methods for conducting reiterative short sequencing cycles has many uses, including, but not limited to, detecting specific RNAs of interest, mutant RNA sequences, splice variants, and their abundance levels thereof.
  • RNAs may be converted into first strand cDNAs, which may be selectively hybridized to target-specific padlock probes for generating circularized padlock probes that correspond to particular cDNAs.
  • the padlock probes may carry at least a universal adaptor for a sequencing primer binding site.
  • the padlock probes can also carry a target barcode sequence each corresponding to a given target cDNA.
  • the circularized padlockprobes maybe subjected to a ligation and/or fill-in reaction to form covalently closed circular padlock probes, which may be amplified inside the cellular sample via rolling circle amplification to generate single -stranded concatemers.
  • the rolling circle amplification may be conducted in the presence of one or more compaction oligonucleotides.
  • the concatemers may carry tandem repeat units of a cDNA-of-interest, the universal sequencing primer binding site, and the target barcode sequence.
  • the concatemers maybe sequencedinside the cellular sample where a short number of sequencing cycles are conducted for each round and multiple rounds of short read sequencing is conducted.
  • the full length of the target barcode and cDNA region may notbe sequenced. Instead, at least a portion of the target barcode region may be reiteratively sequenced. In some embodiments, it is not necessary to sequence the cDNA region. In some embodiments, the targetbarcode and a portion of the cDNA region are reiteratively sequenced. It may notbe necessary to sequence the entire length of the cDNA region. It may notbe necessary to assemble the sequencing reads or to obtain a full-length sequence of the cDNAs-of-interest. The redundant sequencing information obtained from the short sequencing reads may obviate the need to sequence the complementary strand of the concatemer. Thus, pairwise sequencing may notbe necessary.
  • the cellular sample may be placed on a solid support, for example a planar support comprising glass or plastic which can be fabricated into any shape and size. Assembly of a hybridization chamber on the support may notbe needed. Preparation of chemically-washed glass beadsfor cell adherence may also notbe needed.
  • the support canbe passivated with a coating that promotes cell adhesion to the support. The coating may not need to be formulated to include tethered capture primers.
  • the support, having a cell sample adhered thereon can be easily adapted to fit into an existing flow cell holder/cradle which is fluidically connected to an automated fluid dispensing system and configured on a fluorescent microscope.
  • any combination of the steps for conducting in situ reiterative short read sequencing can be performed in an automated mode using a fluid dispensing system, including cell seeding, cell fixation, cell permeabilization, reverse transcription reactions, padlock probe hybridization, padlock probe ligation reaction, rolling circle amplification, and sequencing.
  • the single-stranded concatemers may collapse into compact DNA nanoballs, where each nanoball carries numerous tandem copies of a polynucleotide unit along their lengths, where the polynucleotide unit includes a cDNA sequence -of-interest and at least a universal sequencing primer binding site.
  • Each polynucleotide unit can bind a sequencing primer, a sequencing polymerase and a detectably-labeled nucleotide reagent (e.g., detectably labeled multivalent molecules), to form a detectable sequencing complex (e.g., a detectable ternary complex).
  • Each nanoball carries numerous detectable sequencing complexes.
  • the compact nature of the nanoballs may increase the local concentration of detectably-labeled nucleotide reagents that are used during the sequencing workflow, which may thereby increase the signal intensity emitted from a nanoball.
  • This may give a discrete detectable signal, which can be imaged as a fluorescent spot inside the cellular sample.
  • Each spot correspondsto a concatemer and each concatemer corresponds to a target RNA molecule in the cellular sample. Multiple spots can be detected and imaged simultaneously in the cellular sample.
  • a shortportion of the cDNA region in the concatemer may be re-sequenced at least once (e.g., reiterative sequencing) from the same start position to generate overlapping sequencing reads that can be aligned to a reference sequence.
  • the same portion of the concatemer molecule can be sequenced at least two, three, four, five, or up to 50 times.
  • the start sequencing site can be any location of the concatemer and may be dictated by the sequencing primers which are designed to anneal to a selected position within the concatemer.
  • the reiterative short sequencing reads may increase the redundancy of sequencing information for individual bases in the cDNA region.
  • Reiteratively sequencing one strand of the concatemer template molecule may provide enough base coverage to reveal the presence of target RNAs in the cellular sample so that pairwise sequencing of the complementary strand is not necessary.
  • a concatemer template molecule may include multiple sequencing primer binding sites along the same concatemer molecule which can be used to generate multiple usable sequencing reads for increased sequencing depth. Together, reiteratively sequencing one strand of the concatemer templates may increase sequencing base coverage and sequencing depth compared to sequencing a one-copy template molecule.
  • the methods described herein can be conducted in uniplex or multiplex modes. Two or more different target RNAs can be detected and imaged simultaneously inside a cellular sample using different reverse transcription primers, different target-specific padlock probes, and universal sequencing primers. For example, the presence of a housekeeping RNA and at least one target RNA in a cellular sample can be simultaneously detected and imaged using any of the reiterative short read sequencing methods described herein.
  • the RNA may not be extracted from the cellular sample, and sequencing information may not needto be tracked and mapped back to an image of the cellular sample. Rather, RNA may be retained inside the cellular sample to permit direct imaging of the spatial location of target RNAs within the cells. Additionally, RNA within the cellular sample may not be fragmented and enrichment of target RNA may not be necessary.
  • target-specific and/or random-sequence reverse transcription primers enables detection of both poly -A and non-poly-A RNAs in either uniplex or multiplex modes.
  • the present disclosure provides methods for conducting in situ sequencing in a biological sample.
  • One or more nucleic acids may be analyzed in a biological sample, e.g., a cell.
  • the nucleic acid may comprise, e.g., deoxyribonucleic acid (DNA) or ribonucleic acid (RNA).
  • the sequencing may be conducted reiteratively, so that multiple sequencing cycles are conducted of a nucleic acid.
  • the present disclosure provides methods for detecting in situ at least two nucleic acid sequences in a biological sample.
  • the present disclosure provides methods for detecting in situ at least two target RNA molecules in a biological sample.
  • the at least two target nucleic acid sequences may comprise a first target nucleic acid sequence and a second target nucleic acid sequence.
  • the biological sample maybe a cellular sample.
  • the method may comprise conducting sequencing reactions insidethe cellular sample, where the cDNA amplicons canbe the concatemer molecules.
  • the biological sample may comprise a first nucleic acid molecule, a second nucleic acid molecule, or a combination thereof.
  • the first nucleic acid molecule may comprise a first target nucleic acid sequence or portion thereof, or the reverse complement thereof or a portion thereof.
  • the second nucleic acid molecule may comprise the second target nucleic acid sequence or a portion thereof, or the reverse complement thereof or a portion thereof.
  • the method may comprise determining in situ the sequence of the first nucleic acid molecule or a portion thereof in the biological sample to generate a first sequencing product nucleic acid molecule that is complementary and binds to the first nucleic acid molecule or a portion thereof.
  • the method may comprise determining in situ the sequence of the second nucleic acid molecule or a portion thereof in the biological sample to generate a second sequencing product nucleic acid molecule that is complementary and binds to the second nucleic acid molecule or a portion thereof.
  • the full sequence of the first target nucleic acid sequence and the full sequence of the second target nucleic acid sequence may have at least one nucleotide of difference.
  • the method may further comprise removing the first sequencing product nucleic acid molecule from the first nucleic acid molecule and the second sequencing product nucleic acid molecule from the second nucleic acid molecule.
  • the first nucleic acid molecule and the second nucleic acid molecule maybe positioned inside the biological sample after the removing.
  • the method may further comprise repeating the determining in situ the sequence of the first and second nucleic acids. In some embodiments, the repeating comprises repeating at least 2 times, 5 times, 10 times, 15 times, 20 times, 25 times, 30 times, 35 times, 40 times, 45 times, or at least 50 times.
  • the method may further comprise repeating the removing the first sequencing product nucleic acid molecule from the first nucleic acid molecule and the second sequencing product nucleic acid molecule from the second nucleic acid molecule.
  • the repeating comprises repeating at least 2 times, 5 times, 10 times, 15 times, 20 times, 25 times, 30 times, 35 times, 40 times, 45 times, or at least 50 times.
  • the present disclosure provides methods for detecting in situ at least two different target nucleic acid molecules in a biological sample.
  • the present disclosure provides methods for detecting in situ at least two different target RNA molecules in a cellular sample comprising step (a): providing a cellular sample deposited on a solid support, wherein the cellular sample harbors at least a first plurality of DNA amplicons that correspond to a first target RNA molecule and the cellular sample harbors a second plurality of DNA amplicons that correspond to a second target RNA molecule.
  • the cellular sample can comprise a first target RNA molecule
  • the cellular sample can comprise at least a first plurality of DNA amplicons that can correspond to a first target RNA molecule.
  • the cellular sample can comprise a second target RNA molecule. In some embodiments, the cellular sample can comprise at least a second plurality of DNA amplicons that can correspond to a second target RNA. In some embodiments, the cellular sample can be deposited on a solid support.
  • the cellular sample harbors 2-25 different target RNA molecules, or harbors 25-50 different target RNA molecules, or harbors 50-75 different target RNA molecules, or harbors 75-100 different target RNA molecules. In some embodiments, the cellular sample harbors more than 100 different target RNA molecules, or more than 250 different target RNA molecules, or more than 500 different target molecules, or more than 1000 different target RNA molecules, or more. In some embodiments, the cellular sample harbors more than 10,000 different target RNA molecules. In some embodiments, the cellular sample comprises a whole cell, a plurality of whole cells, an intact tissue or an intact tumor.
  • the cellular sample comprises a fresh cellular sample, a freshly -frozen cellular sample, a sectioned cellular sample, or an FFPE cellular sample.
  • the cellular sample is deposited onto a solid support.
  • the cellular sample is deposited onto a solid support which is passivated with a coating that promotes cell adhesion.
  • the cellular sample is deposited on a support that lacks immobilized capture oligonucleotides.
  • the cellular sample comprises an expanded cellular sample that has been cultured in a simple or complex cell culture media.
  • the first plurality of DNA amplicons comprises a first plurality of concatemers.
  • the second plurality of DNA amplicons comprises a second plurality of concatemers.
  • the first plurality of DNA amplicons comprises a first target DNA sequence that corresponds to a first target RNA molecule. In some embodiments, the first plurality of DNA amplicons comprises a first target DNA sequence that corresponds to a first target RNA molecule and a first target barcode sequence that corresponds to the first target RNA molecule. In some embodiments, the first plurality of DNA amplicons comprises a first target DNA sequence that corresponds to a first target RNA molecule and at least one universal adaptor sequence, such as for example a universal sequencing primer binding site (or a complementary sequence thereof).
  • the first plurality ofDNA amplicons comprises a first target DNA sequence that corresponds to a first target RNA molecule and a universal primer binding site for a rolling circle amplification primer (or a complementary sequence thereof). In some embodiments, the first plurality of DNA amplicons comprises a first target DNA sequence that corresponds to a first target RNA molecule and a universal compaction oligonucleotide binding site (or a complementary sequence thereof).
  • the second plurality ofDNA amplicons comprises a second target DNA sequence that corresponds to a second target RNA molecule. In some embodiments, the second plurality ofDNA amplicons comprises a second target DNA sequence that corresponds to a second target RNA molecule and a second target barcode sequence that corresponds to the second target RNA molecule. In some embodiments, the second plurality of DNA amplicons comprises a second target DNA sequence that corresponds to a second target RNA molecule and at least one universal adaptor sequence, such as for example a universal sequencing primer binding site (or a complementary sequence thereof).
  • the second plurality of DNA amplicons comprises a second target DNA sequence that corresponds to a second target RNA molecule and a universal primer binding site for a rolling circle amplification primer (or a complementary sequence thereof).
  • the second plurality ofDNA amplicons comprises a second target DNA sequence that corresponds to a second target RNA molecule and a universal compaction oligonucleotide binding site (or a complementary sequence thereof).
  • the methods for detecting in situ at least two different target RNA molecules in a cellular sample further comprise step (b): sequencing the first plurality of DNA amplicons inside the cellular sample.
  • the sequencing may comprise conducting no more than 2- 30 sequencing cycles to generate a plurality of first sequencing read products, and sequencing the second plurality of DNA amplicons inside the cellular sample which comprises conducting no more than 2-30 sequencing cycles to generate a plurality of second sequencing read products.
  • (b) can comprise sequencing the first plurality ofDNA amplicons inside the cellular sample which can comprise conducting no more than 2 -30 sequencing cycles.
  • the 2-30 sequencing cycles can generate a plurality of first sequencing read products.
  • (b) can comprise sequencing the second plurality ofDNA amplicons inside the cellular sample which can comprise conducting no more than 2 -30 sequencing cycles
  • the 2-30 sequencing cycles can generate a plurality of second sequencing read products.
  • the sequences of the first sequencing read products can be aligned with a first target reference sequence to confirm the presence of the first target RNA in the cellular product.
  • the sequences of the second sequencing read products can be aligned with a second target reference sequence to confirm the presence of the second target RNA in the cellular sample.
  • sequences of the first sequencing read products are aligned with a first target reference sequence to confirm the presence of the first target RNA in the cellular sample
  • sequences of the second sequencing read products are aligned with a second target reference sequence to confirm the presence of the second target RNA in the cellular sample.
  • the first target reference sequence comprises the first target barcode sequence. In some embodiments, the first target reference sequence comprises the first target barcode sequence and at least a portion of the first target RNA sequence. In some embodiments, the first target reference sequence comprises at least a portion of the first target RNA sequence.
  • the second target reference sequence comprises the second target barcode sequence. In some embodiments, the second target reference sequence comprises the second targetbarcode sequence and at least a portion of the second targetRNA sequence. In some embodiments, the second target reference sequence comprises at least a portion of the second targetRNA sequence.
  • the methods for detecting in situ at least two different target RNA molecules in a cellular sample can further comprise (c): removing the plurality of first sequencing read products from the first DNA amplicons and retaining the first DNA amplicons inside the cellular sample, and removing the plurality of second sequencing read products from the second DNA amplicons and retaining the second DNA amplicons inside the cellular sample.
  • (c) can comprise removing the plurality of first sequencing read products from the first DNA amplicons
  • (c) can comprise retaining the first DNA amplicons inside the cellular sample.
  • (c) can comprise removing the plurality of second sequence ready products from the second DNA amplicons.
  • (c) can comprise retaining the second DNA amplicons inside the cellular sample.
  • the methods for detecting in situ at least two different target RNA molecules in a cellular sample further comprise step (c): removing the plurality of first sequencing read products from the first DNA amplicons and retaining the first DNA amplicons inside the cellular sample, and removing the plurality of second sequencing read products from the first DNA amplicons and retaining the second DNA amplicons inside the cellular sample.
  • the methods for detecting in situ at least two different target RNA molecules in a cellular sample further can comprise (d): reiteratively sequencing the firstand second plurality of DNA amplicons by repeating (b) and (c) at least once.
  • the repeating comprises repeating at least 2 times, 5 times, 10 times, 15 times, 20 times, 25 times, 30 times, 35 times, 40 times, 45 times, or at least 50 times.
  • (d) can comprise reiteratively sequencing the first plurality ofDNA amplicons by repeating (b) and (c)at least once.
  • (d) can comprise reiteratively sequencing the second plurality of DNA amplicons by repeating (b) and (c) at least once.
  • (b) and (c) can be repeated at least 2 times, at least 3 times, at least 4 times, at least 5 times, at least 6 times, at least 7 times, at least 8 times, at least 9 times, or at least 10 times.
  • (b) and (c) can be repeated up to 10 times, up to 20 times, up to 30 time, up to 40 times, orup to 50 times.
  • the methods for detecting in situ at least two different target RNA molecules in a cellular sample further comprise step (d): reiteratively sequencing the first and second plurality ofDNA amplicons by repeating steps (b) and (c) at least once.
  • steps (b) and (c) can be repeated at least 2 times, at least 3 times, at least 4 times, at least 5 times, at least 6 times, at least 7 times, at least 8 times, at least 9 times, or at least 10 times.
  • steps (b) and (c) can berepeatedup to 10 times, up to 20 times, up to 30 time, up to 40 times, orup to 50 times.
  • the sequences of the first and second sequencing read products can be aligned after each round of generating the first and second sequencing read products which are no more than 30 bases in length (e.g., after step (b)), or after generating a set of reiterative sequencing read products wherein the first and second sequencing read products which are no more than 30 bases in length (e.g., after step (d)).
  • the present disclosure provides methods for detecting in situ at least two different target RNA molecules in a cellular sample comprising step (a): providing a cellular sample harboring a plurality of RNA which comprises at least a first target RNA molecule and a second target RNA molecule, wherein the cellular sample is fixed and permeabilized.
  • the cellular sample harbors 2-25 different target RNA molecules, or harbors 25 -50 different target RNA molecules, or harbors 50-75 different target RNA molecules, or harbors 75 _, 100 different target RNA molecules.
  • the cellular sample harbors more than 100 different target RNA molecules, or more than 250 different target RNA molecules, or more than 500 different target molecules, or more than 1000 different target RNA molecules, ormore. In some embodiments, the cellular sample harbors more than 10,000 different target RNA molecules. In some embodiments, the cellular sample comprises a whole cell, a plurality of whole cells, an intact tissue or an intact tumor. In some embodiments, the cellular sample comprises a fresh cellular sample, a freshly-frozen cellular sample, a sectioned cellular sample, or an FFPE cellular sample. In some embodiments, the cellular sample is deposited onto a solid support.
  • the cellular sample is deposited onto a solid support which is passivated with a coating that promotes cell adhesion. In some embodiments, the cellular sample is deposited on a support that lacks immobilized capture oligonucleotides.
  • the plurality ofRNA can comprise at least a first target RNA molecule. In some embodiments, the plurality ofRNA can compriseatleast a second target RNA molecule. In some embodiments, the cellular sample is cultured prior to conducting step (b) which is described below.
  • methods for detecting at least two different target RNA molecules in a cellular sample further comprising step (b): generating inside the cellular sample a plurality of cDNA molecules which include at least a first target cDNA molecule that corresponds to the first target RNA molecule, and the plurality of cDNA molecules includes a second target cDNA molecule that corresponds to the second target RNA molecule.
  • a cDNA molecule of the plurality of cDNA molecules maybe generated through reverse transcription of a messenger RNA (mRNA) molecule inside the biological sample, wherein the cDNA molecule or mRNA molecule comprise a target nucleic acid sequence, or the reverse complement of the target nucleic acid sequence.
  • mRNA messenger RNA
  • a plurality of cDNA molecules may be generated through reverse transcription of one or more messenger RNA (mRNA) molecule(s) inside the biological sample, wherein the plurality of cDNA molecules or the one or more mRNA molecules comprise a target nucleic acid sequence, or the reverse complement of the target nucleic acid sequence.
  • the first target cDNA molecule or the second target cDNA molecule may be generated through reverse transcription of one or more messenger RNA (mRNA) molecule(s) inside the biological sample, wherein the plurality of cDNA molecules or the one or more mRNA molecules comprise a target nucleic acid sequence, or the reverse complement of the target nucleic acid sequence.
  • the method comprises generating at least 2-10,000 different target cDNA molecules that correspondto 2-10,000 different target RNA molecules.
  • the generating of step (b) comprises contacting the plurality ofRNA inside the cellular sample with (i) a plurality of reverse transcription primers, (ii) a plurality of reverse transcriptase enzymes, and (iii) a plurality of nucleotides, under a condition suitable for conducting a reverse transcription reaction to generate a plurality of cDNA molecules (e.g., a plurality of first strand cDNA molecules) in the cellular sample (e.g., FIG. 1).
  • the plurality of reverse transcription primers comprises a first sub -population of target-specific reverse transcription primers that hybridize selectively to the first target RNA, and comprises a second sub -population of target-specific reverse transcription primers that hybridize selectively to the second targetRNA.
  • the plurality of reverse transcription primers comprises a first sub -population of random-sequence reverse transcription primers that hybridize to the first target RNA, and comprises a second sub-population of random-sequence reverse transcription primers that hybridize to the second target RNA.
  • methods for detecting at least two different target RNA molecules in a cellular sample can comprise step (b): generating inside the cellular sample a plurality of cDNA molecules which can include at least a first target cDNA molecule that can corresponds to a first target RNA molecule.
  • the plurality of cDNA molecules can include a second target cDNA molecule that can correspond to the second target RNA molecule.
  • the plurality of cDNA molecules can include at least a first target cDNA molecule.
  • the first target cDNA molecule can correspond to the first target RNA molecule.
  • the plurality of cDNA molecules can include at least a second target cDNA molecule.
  • the second target cDNA molecule can correspond to the second target RNA molecule.
  • the method can comprise generating at least 2-10,000 different target cDNA molecules that can correspond to 2-10,000 different target RNA molecules.
  • the generating of step (b) can comprises contacting the plurality of RNA inside the cellular sample with (i) a plurality of reverse transcription primers, (ii) a plurality of reverse transcriptase enzymes, and (iii) a plurality of nucleotides.
  • the generating of step (b) can be conducted under a condition suitable for conducting a reverse transcription reaction to generate a plurality of cDNA molecules (e g., a plurality of first strand cDNA molecules) in the cellular sample (e.g., FIG. 1).
  • the generating of step (b) can comprise contacting the plurality of RNA inside the cellular sample with a plurality of reverse transcription primers.
  • the generating of step (b) can comprise contacting the plurality of RNA inside the cellular sample with a plurality of reverse transcription enzymes.
  • the generating of step (b) can comprise contacting the plurality of RNA inside the cellular sample with a plurality of nucleotides.
  • the plurality of reverse transcription primers can comprise a first sub-population of target-specific reverse transcription primers that can hybridize selectively to the first target RNA. In some embodiments, the plurality of reverse transcription primers can comprise a second sub-population of target-specific reverse transcription primers that can hybridize selectively to the second target RNA. In some embodiments, the plurality of reverse transcription primers can comprise a first sub -population of target-specific reverse transcription primers. In some embodiments, the first sub -population of target-specific reverse transcription primers can hybridize selectively to the first target RNA. In some embodiments, the plurality of reverse transcription primers can comprise a second sub-population of target-specific reverse transcription primers.
  • the second sub-population of target-specific reverse transcription primers can hybridize selectively to the second target RNA.
  • the plurality of reverse transcription primers can comprise a first sub -population of random-sequence reverse transcription primers that can hybridize to the first target RNA, and can comprise a second sub -population of random-sequence reverse transcription primers that can hybridize to the second target RNA.
  • the plurality of reverse transcription primers can comprise a first sub -population of random-sequence reverse transcription primers.
  • the first sub -population of random-sequence reverse transcription primers can hybridize to the first target RNA.
  • the plurality of reverse transcription primers can comprise a second sub -population of random-sequence reverse transcription primers.
  • the second sub-population of random-sequence reverse transcription primers can hybridize to the second target RNA.
  • methods for detecting at least two different target RNA molecules in a cellular sample further comprise step (c): contacting the plurality of cDNA molecule in the cellular sample with a plurality of target-specific padlock probes which includes at least a first plurality of target-specific padlock probes and a second plurality of target-specific padlock probes.
  • the plurality of target-specific padlock probes can include at least a first plurality of target-specific padlock probes.
  • the plurality of targetspecific padlock probes can include at least a second plurality of target-specific padlock probes.
  • the method comprises contacting the plurality of cDNA molecule in the cellular sample with atleast2-10,000 different target-specific padlock probes.
  • individual padlockprobes in the plurality of first target -specific padlock probes comprise a firstand second end (e.g., first and second padlock binding arms), wherein the first end selectively hybridizes to a first region of the first target cDNA molecule and the second end selectively hybridizes to a second region of the first target cDNA molecule.
  • the contacting of step (c) comprises: hybridizing the first nd second ends of the first target-specific padlock probes to proximal positions on the first target cDNA molecule to form a circularized first target-specific padlock probe having a nick or gap between the hybridized firstand second ends (e.g., FIG. 1).
  • the first target-specific padlock probe comprises a first target barcode sequence that corresponds to the first target cDNA sequence. In some embodiments, the first target- specific padlock probe comprises a first target barcode sequence that is located adjacent to one of the regions of the first target-specific padlock probethat selectively hybridizes to the first target cDNA molecule. In some embodiments, the first target-specific padlock probe comprises at least one universal adaptor sequence, such as for example a universal sequencing primer binding site (or a complementary sequence thereof). In some embodiments, the first target-specific padlock probe comprises a universal primer binding site for a rolling circle amplification primer (or a complementary sequence thereof).
  • the first target-specific padlock probe comprises a universal compaction oligonucleotide binding site (or a complementary sequence thereof).
  • the method comprises contacting in situ : the first target nucleic acid sequence or a reverse complement thereof with a first oligonucleotide comprising a first end portion and a second end portion.
  • the first end portion and second end portion of the first oligonucleotide may be complementary.
  • the first end portion and second end portion of the first oligonucleotide may bind to two neighboring segments of the first target nucleic acid sequence or the reverse complement thereof so that the first oligonucleotide forms a circular structure with a gap between the first end portion and the second end portion .
  • the method further comprises contacting in situ: the second target nucleic acid sequence or a reverse complement thereof with a second oligonucleotide comprising a first end portion and a second end portion.
  • the first end portion and second end portion of the second oligonucleotide are complementary.
  • the first end portion and second end portion bind to two neighboring segments of the second target nucleic acid sequence or a reverse complement thereof so that the second oligonucleotide forms a circular structure with a gap between the first end portion and the second end portion .
  • the first target nucleic acid sequence comprises the first cDNA molecule or the first mRNA molecule .
  • the second target nucleic acid sequence comprises the second cDNAmolecule or the second mRNA molecule.
  • the gap of the first oligonucleotide or the second oligonucleotide has a size of one nucleotide.
  • the gap of the first oligonucleotide and the second oligonucleotide has a size of one nucleotide.
  • the gap of the first oligonucleotide has a size of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or more nucleotides.
  • the gap of the second oligonucleotide has a size of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, ormore nucleotides. In some embodiments, the gap of the first oligonucleotide or the second oligonucleotide has a size of at least two nucleotides. In some embodiments, the gap of the first oligonucleotide and the second oligonucleotide has a size of at least two nucleotides. In some embodiments, the first oligonucleotide further comprises a first identification sequence that identifies the first target nucleic acid sequence.
  • the second oligonucleotide further comprises a second identification sequence that identifies the second target nucleic acid sequence.
  • the first oligonucleotide further comprises a nucleic acid sequence that is complementary sequence to a nucleic acid sequence of a sequencing primer.
  • the second oligonucleotide further comprises a nucleic acid sequence that is complementary sequence to a nucleic acid sequence of a sequencing primer.
  • the first oligonucleotide further comprises a nucleic acid sequence that is complementary sequence to a nucleic acid sequence of a primer for nucleic acid amplification.
  • the second oligonucleotide further comprises a nucleic acid sequence that is complementary sequence to a nucleic acid sequence of a primer for nucleic acid amplification.
  • the primer for nucleic acid amplification is a primer for rolling circle amplification (RCA).
  • the primerforRCA produces a concatemer.
  • the concatemer comprises at least two repeats of a target nucleic acid sequence of the at least two target nucleic acid sequences or a portion thereof, or a reverse complement thereof.
  • the first oligonucleotide further comprises a reverse complement for a compaction oligonucleotide.
  • the second oligonucleotide further comprises a reverse complement for a compaction oligonucleotide.
  • a first segment of the compaction oligonucleotide is complementary and binds to a first portion of the concatemer.
  • a second segment of the compaction oligonucleotide is complementary and binds to a second portion of the concatemer.
  • the binding to a first and/or a second portion of the concatemer results in a reduction in the size or a change in the shape of the concatemer.
  • the method further comprises joining in situ the first and second end portions of the first oligonucleotide to produce a first circular oligonucleotide inside the biological sample. In some embodiments, the method further comprises joining in situ the first and second end portions of the second oligonucleotide to produce a second circular oligonucleotide inside the biological sample. In some embodiments, thejoiningthe first and second end portions of the first oligonucleotide comprises joiningthe first and second endportions of the first oligonucleotides through a first nucleic acid enzyme.
  • thejoiningthe first and second end portions of the second oligonucleotide comprises joining the firstand second end portions of the second oligonucleotide through a second nucleic acid enzyme.
  • the first nucleic acid enzyme and the second nucleic acid enzyme are the same type of enzyme.
  • the joining the first and second end portions of the second oligonucleotide comprises joining the first and second end portions of the second oligonucleotide through a second nucleic acid enzyme, wherein the first nucleic acid enzyme and the second nucleic acid enzyme are a different type of enzyme.
  • the first nucleic acid enzyme comprises a nucleic acid ligase, a nucleic acid ligation enzyme, a nucleic acid polymerase, a nucleic acid polymerization enzyme, or combinations thereof.
  • the second nucleic acid enzyme comprises a nucleic acid ligase, a nucleic acid ligation enzyme, a nucleic acid polymerase, a nucleic acid polymerization enzyme, or combinations thereof.
  • the method further comprises amplifying in situ the first circular oligonucleotide to produce the first nucleic acid molecule.
  • the method further comprises amplifying in situ the second circular oligonucleotide to produce the second nucleic acid molecule.
  • the amplifying comprises rolling circle amplification (RCA), wherein the first nucleic acid molecule comprises a first concatemer and the second nucleic acid molecule comprises a second concatemer.
  • the first concatemer comprises at least two repeats of a first unit nucleic acid sequence comprising the first target nucleic acid sequence or a portion thereof, or the reverse complement of the first target nucleic acid sequence or a portion thereof.
  • the second concatemer comprises at least two repeats of a second unit nucleic acid sequence comprising the second target nucleic acid sequence or a portion thereof, or the reverse complement of the second target nucleic acid sequence or a portion thereof.
  • the first concatemer further comprises the first identification sequence that identifies the first target nucleic acid sequence, or a sequencing primer or a reverse complement thereof
  • the second concatemer further comprises the second identification sequence that identifies the second target nucleic acid sequence, or a sequencing primer or a reverse complement thereof.
  • the first concatemer further comprises a first compaction oligonucleotide.
  • a first segment of the first compaction oligonucleotide is complementary and binds to a first portion of the first concatemer.
  • a second segment of the first compaction oligonucleotide is complementary and binds to a second portion of the first concatemer, to result in a reduction in the size or a change in the shape of the first concatemer.
  • the second concatemer further comprises a second compaction oligonucleotide.
  • a first segment of the second compaction oligonucleotide is complementary and binds to a first portion of the second concatemer.
  • a second segment of the second compaction oligonucleotide is complementary and binds to a second portion of the second concatemer, to result in a reduction in the size or a change in the shape of the second concatemer.
  • individual padlock probes in the plurality of second targetspecific padlock probes comprise a first and second end, wherein the first end selectively hybridizes to a first region of the second target cDNA molecule and the second end selectively hybridizes to a second region of the second target cDNA molecule.
  • the contacting of step (c) comprises: hybridizing the first and second ends of the secondtarget- specific padlock probes to proximal positions on the second target cDNA molecule to form a circularized second target-specific padlock probe having a nick or gap between the hybridized first and second ends.
  • the second target-specific padlock probe comprises a second target barcode sequence that corresponds to the second target cDNA sequence.
  • the second target-specific padlock probe comprises a second target barcode sequence that is located adjacent to one of the regions of the second target-specific padlock probe that selectively hybridizes to the second target cDNA molecule.
  • the second target-specific padlock probe at least one universal adaptor sequence, such as for example a comprises at least one universal sequencing primer binding site (or a complementary sequence thereof).
  • the second target-specific padlock probe comprises a universal primer binding site for a rolling circle amplification primer (or a complementary sequence thereof).
  • the second target-specific padlock probe comprises a universal compaction oligonucleotide binding site (or a complementary sequence thereof).
  • the target-specific padlock probes comprise a universal sequencing primer binding site and a target barcode sequence that are adjacent to each other so thatthe target barcode region of the concatemer is sequenced first.
  • the target barcode sequence can be any length, for example 3-15 bases, or 15-25 bases, or 25-40bases, orlonger.
  • methods for detecting at least two different target RNA molecules in a cellular sample further can comprise step (d): closing the nick or gap in the at least first and second circularized target-specific padlock probes by conducting an enzymatic reaction, thereby generating at least a first covalently closed circular padlock probe and a second covalently closed circular padlock probe inside the cellular sample.
  • step (d) can comprise closing the nick or gap in the at least first circularized target-specific padlock probe by conducting an enzymatic reaction, thereby generating at least a first covalently closed circular padlock probe.
  • step (d) can comprise closing the nick or gap in the at least second circularized target-specific padlock probe by conducting an enzymatic reaction, thereby generating at least a second covalently closed circular padlock probe.
  • the closing the nick can comprise conducting an enzymatic ligation reaction.
  • closing the gap can comprise conducting a polymerase-catalyzed fill-in reaction using the first or second target cDNA molecule as a template, and conducting an enzymatic ligation reaction.
  • closing the gap can comprise conducting a polymerase-catalyzed fill-in reaction using the first target cDNA molecule as a template.
  • closing the gap can comprise conducting a polymerase-catalyzed fill-in reaction using the second target cDNA molecule as a template.
  • the method can comprise closing the nick or gap in atleast 2-10,000 circularized target-specific padlock probes by conducting an enzymatic reaction, thereby generating at least 2 -10,000 covalently closed circular padlockprobes inside the cellular sample.
  • methods for detecting at least two different target RNA molecules in a cellular sample can further comprise step (e): conducting a rolling circle amplification reaction inside the cellular sample using the first and second covalently closed circular padlock probes as template molecules, thereby generating a plurality of concatemer molecules including at least a first concatemer molecule that corresponds to a first target RNA molecule, and the plurality of concatemer molecules can include at least a second concatemer molecule that correspondsto a second target RNA molecule.
  • step (e) can comprise conducting a rolling circle amplification reaction inside the cellular sample using the first covalently closed circular padlock probe as a template molecule.
  • the rolling circle amplification of the first covalently closed circular padlock probe can generate a plurality of concatemer molecule.
  • the plurality of concatemer molecule can include a first concatemer molecule.
  • the first concatemer molecule can correspondto a first target RNA molecule.
  • step (e) can comprise conducting a rolling circle amplification reaction inside the cellular sample using the second covalently closed circular padlock probe as a template molecule.
  • the rolling circle amplification of the second covalently closed circular padlock probe can generate a plurality of concatemer molecules.
  • the plurality of concatemer molecule can include a second concatemer molecule.
  • the second concatemer molecule can correspond to a second target RNA molecule.
  • the first concatemer molecule can comprise tandem repeat units, wherein a unit can comprise the sequence of the first target cDNA and the universal sequencing primer binding site (or a complementary sequence thereof).
  • the second concatemer molecule can comprises tandem repeat units, wherein a unit can comprise the sequence of the second target cDNA and the universal sequencing primer binding site (or a complementary sequence thereof).
  • the method can comprise conducting a rolling circle amplification reaction inside the cellular sample using the at least 2-10,000 covalently closed circular padlock probes as template molecules, thereby generating at least 2-10,000 concatemer molecules that correspond to atleast2-10,000 target RNA molecules.
  • the rolling circle amplification can be conducted in the presence of a plurality of compaction oligonucleotides.
  • each compaction oligonucleotide can comprise a single stranded oligonucleotide.
  • the single stranded oligonucleotide can have a first region at one end and a second region at the other end.
  • the first region can hybridize to a portion of the concatemer molecule
  • the second region can hybridize to another portion of the concatemer molecule.
  • the hybridization of the first region and the second region of the compaction oligonucleotide can compact the concatemer molecule.
  • the compaction oligonucleotide can compact the size of the concatemer molecule. In some embodiments, the compaction oligonucleotide can compact the shape of the concatemer molecule. In some embodiments, the compaction oligonucleotide can compact the size and shape of the concatemer molecule. In some embodiments, the compaction oligonucleotide can compact the concatemer molecule to form a compact nanoball.
  • methods for detecting at least two different target RNA molecules in a cellular sample can further comprise step (f): sequencing the plurality of concatemer molecules inside the cellular sample, which can comprise sequencing the first concatemer molecule by conducting no more than 2-30 sequencing cycles to generate a plurality of first sequencing read products, and sequencing the second concatemer molecule by conducting no more than 2-30 sequencing cycles to generate a plurality of second sequencing read products.
  • step (f) can comprise sequencing the plurality of concatemer molecules inside the cellular sample.
  • sequencing the plurality of concatemer molecules inside the cellular sample can comprise sequencing the first concatemer molecule by conducting no more than 2-30 sequencing cycles to generate a plurality of first sequencing read products.
  • step (f) can comprise sequencing the plurality of concatemer molecules inside the cellular sample.
  • sequencing the plurality of concatemer molecules inside the cellular sample can comprise sequencing the second concatemer molecule by conducting no more than 2-30 sequencing cycles to generate a plurality of second sequencing read products.
  • the sequencing of step (f) can comprise sequencing no more than 2-30 bases of the first concatemer molecules to generate a plurality of first sequencing read products, and which can comprise sequencing no more than 2 - 30 bases of the second concatemer molecules to generate a plurality of second sequencing read products.
  • the sequencing of step (f) can comprise sequencing no more than 2-30 bases of the first concatemer molecules to generate a plurality of first sequencing read products.
  • the sequencing of step (f) can comprise sequencing no more than 2-30 bases of the second concatemer moleculesto generate a plurality of second sequencing read products.
  • the method can comprise sequencing the at least 2-10,000 concatemer molecules inside the cellular sample, which can comprise conducting no more than 2-30 sequencing cycles on the 2-10,000 concatemer molecules to generate a plurality of sequencing read products.
  • only the first target barcode region of the first concatemer molecules are sequenced (e.g., FIG. 2).
  • at least a portion or the full length of the first target barcode of the first concatemer molecules are sequenced (e.g., FIG. 2).
  • the first target barcode is sequenced and a portion of the first cDNA region of the first concatemer molecules are sequenced (e.g., FIG. 3). In some embodiments, at least a portion of the first cDNA region of the first concatemer molecules are sequenced (e.g., FIG. 4 or 5). In some embodiments, only the second target barcode region of the second concatemer molecules are sequenced (e.g., FIG. 2). In some embodiments, at least a portion or the full length of the second target barcode of the second concatemer molecules are sequenced (e.g., FIG. 2).
  • the second target barcode is sequenced and a portion of the second cDNA region of the second concatemer molecules are sequenced (e.g., FIG. 3). In some embodiments, at least a portion of the second cDNA region of the second concatemer molecules are sequenced (e.g., FIG. 4 or 5).
  • the sequencing of step (f) comprises sequencing at least a portion of the firstand second nucleic acid concatemers using an optical imaging system comprising a field-of-view (FOV) greater than 1.0mm 2 .
  • FOV field-of-view
  • the plurality of first and second sequencing read products can be detectable by imaging, and wherein the sequencing can comprise decoding the plurality of first and second sequencing read products from the images obtained during the no more than 2-30 sequencing cycles.
  • the plurality first sequencing read products can be detectable by imaging.
  • the plurality of first sequencing read products can be detectable by imaging, and wherein the sequencing can comprise decoding the plurality of first sequencing read products from the images obtained during the no more than 2 - 30 sequencing cycles.
  • the plurality of second sequencing read products can be detectable by imaging.
  • the plurality of second sequencing read products can be detectable by imaging, and wherein the sequencing can comprise decoding the plurality of second sequencing read products from the images obtained during the no more than 2-30 sequencing cycles.
  • the plurality of first and second sequencing read products can be detectable by imaging, and wherein the sequencing can comprise decoding the plurality of first and second sequencing read products from the images obtained during the no more than 2-30 sequencing cycles.
  • the plurality first sequencing read products can be detectable by imaging.
  • the plurality of first sequencing read products can be detectable by imaging, and wherein the sequencing can comprise decoding the plurality of first sequencing read products from the images obtained during the no more than 2- 30 sequencing cycles.
  • the plurality of second sequencing read products can be detectable by imaging.
  • the plurality of second sequencing read products can be detectable by imaging, and wherein the sequencing can comprise decoding the plurality of second sequencing read products from the images obtained during the no more than 2-30 sequencing cycles.
  • the sequences of the first sequencing read products can be aligned with a first target reference sequence to confirm the presence of the first target RN A in the cellular product.
  • the sequences of the second sequencing read products can be aligned with a second target reference sequence to confirm the presence of the second target RNA in the cellular sample.
  • the sequencing of step (f) can comprise (1) contacting the plurality of concatemer molecules inside the cellular sample with (i) a plurality of universal sequencing primers, (ii) a plurality of sequencing polymerases, and (iii) a plurality of nucleotide reagents, under a condition suitable for hybridizing the plurality of universal sequencing primers to their respective universal sequencing primer binding sites on the concatemers.
  • the sequencing of step (f) can comprise (1) contacting the plurality of concatemer molecules inside the cellular sample with a plurality of universal sequencing primers.
  • the sequencing of step (f) can comprise (1) contacting the plurality of concatemer molecules inside the cellular sample with a plurality of sequencing polymerases. In some embodiments, the sequencing of step (f) can comprise (1) contacting the plurality of concatemer molecules inside the cellular sample with a plurality of nucleotide reagents. In some embodiments, the sequencing can further comprise (2) conducting no more than 2-30 sequencing cycles to generate at least a first plurality of sequencing read products. In some embodiments, the sequencing can further comprise (2) conducting no more than 2-30 sequencing cycles to generate at least a first plurality of sequ encing read products and a second plurality of sequencing read products.
  • the sequencing can further comprise (3) removing the first plurality of sequencing read products from the concatemers and retaining the plurality of concatemers inside the cellular sample. In some embodiments, the sequencing can further comprise (3) removing the first plurality of sequencing read products from the first concatemer molecules. In some embodiments, the sequencing of (3) can further comprise retaining the first concatemer molecules inside the cellular sample. In some embodiments, the sequencing further can comprise (3) removing the first plurality of sequencing read products from the concatemers and retaining the first concatemer molecules inside the cellular sample, and removing the second plurality of sequencing read products from the second concatemer molecules and retaining the second concatemer molecules inside the cellular sample.
  • the sequencing can further comprise (3) removing the second plurality of sequencing read products from the second concatemer molecules. In some embodiments, the sequencing of (3) can further comprise retaining the second concatemer molecules inside the cellular sample. In some embodiments, the sequencing can further comprise (4) repeating (1) - (3) at least once. In some embodiments, (4) can comprise repeating (1) - (3) atleast2 times, at least 3 times, at least 4 times, at least 5 times, at least 6 times, at least 7 times, at least 8 times, at least 9 times, or at least 10 times. In some embodiments, (4) can comprise repeating (1) - (3) up to 10 times, up to 20 times, up to 30 time, up to 40 times, or up to 50 times.
  • the reiterative sequencing can be conducting using a sequencing- by-binding procedure, labeled and/or non-labeled chain-terminating nucleotides, or multivalent molecules. Descriptions of these three sequencing methodsis described below.
  • the plurality of universal sequencing primers can be hybridized to concatemer template molecules with a hybridization reagent comprising an SSC buffer(e.g., 2X saline-sodium citrate) buffer with formamide (e.g., 10-20% formamide).
  • SSC buffer e.g., 2X saline-sodium citrate
  • formamide e.g., 10-20% formamide.
  • the hybridization conditions comprise a temperature of about20-30°C, for about 10-60 minutes.
  • the plurality of sequencing read products can be removed from the concatemers and the plurality of concatemers can be retained inside the cellular sample using a de-hybridization reagent comprising an SSC buffer (e.g., saline-sodium citrate) buffer, with or without formamide, at a temperature that promotes nucleic acid denaturation such as for example 30 - 90°C.
  • SSC buffer e.g., saline-sodium citrate
  • the plurality of nucleotide reagents of step (f) comprises a plurality of nucleotides that are detectably labeled or non-labeled.
  • individual nucleotides are linked to a detectable reporter moiety.
  • the detectable reporter moiety comprises a fluorophore.
  • the plurality of detectably labeled nucleotide analogs comprise a plurality of chain terminating nucleotides, where the chain terminating moiety is linked to the 3 ’ nucleotide sugar position to form a 3 ’ blocked nucleotide analog.
  • the chain terminating moiety can be removed to convert the 3 ’ blocked nucleotide analog to an extendible nucleotide having a 3 ’ OH group on the sugar.
  • the labeled nucleotide analogs are linked to a different fluorophore that corresponds to the nucleobases adenine, cytosine, guanine, thymine or uracil, where the different fluorophores emit a fluorescent signal during the sequencing of step (f).
  • a sequencing cycle comprises (1) contactingthe concatemer/sequencing primer duplex with a sequencing polymerase and a detectably labeled chain terminating nucleotide under a condition suitable for polymerase -catalyzed incorporation of the detectably labeled chain terminating nucleotide into the terminal end of the sequencing primer, (2) detecting and imaging the fluorescent signal and color emitted by the incorporated chain terminating nucleotide, and (3) removing the chain terminating moiety (e g., unblocking) and retainin the concatemer/sequencing primer duplex.
  • a sequencing cycle comprises (1) contactingthe concatemer/sequencing primer duplex with a sequencing polymerase and a detectably labeled chain terminating nucleotide under a condition suitable for polymerase -catalyzed incorporation of the detectably labeled chain terminating nucleotide into the terminal end of the sequencing primer, (2) detecting and imaging the fluorescent signal and color emitted by the incorporated chain terminating
  • no more than 2-30 sequencing cycles canbe conducted on the plurality of concatemers inside the cellular sample to generate a plurality of first sequencing read products and a plurality of second sequencing read products. In some embodiments, no more than 2 -30 sequencing cycles can be conducted on the plurality of concatemers inside the cellular sample to generate a plurality of first sequencing read. In some embodiments, no more than 2-30 sequencing cycles can be conducted on the plurality of concatemers inside the cellular sample to generate a plurality of second sequencing read products. In some embodiments, the sequence of the first sequencing read product can be determined and aligned with a first reference sequence to confirm the presence of the first target RNA molecules inside the cellular sample.
  • the sequence of the second sequencing read product can be determined and aligned with a second reference sequence to confirm the presence of the first target polypeptides inside the cellular sample.
  • the sequences of the first and second sequencing read products can be aligned after each round of generating the firstand second sequencing read products.
  • the firstand second sequencing read products canbe no more than 30 bases in length.
  • the sequences of the first and second sequencing read products can be aligned after generating a set of reiterative sequencing read products. In some embodiments, wherein the firstand second sequencing read products canbe no more than 30 bases in length.
  • the sequences of the first sequencing read products can be aligned after each round of generating the first sequencing read products. In some embodiments, the first sequencing read products canbe no more than 30 bases in length. In some embodiments, the sequences of the first sequencing read products can be aligned after generating a set of reiterative sequencing read products. In some embodiments, wherein the first sequencing read products can be no more than 30 bases in length. In some embodiments, the sequences of the second sequencing read products canbe aligned after each round of generating the second sequencing read products. In some embodiments, the second sequencing read products can be no more than 30 bases in length.
  • the sequences of the second sequencing read products canbe aligned after generating a set of reiterative sequencing read products. In some embodiments, wherein the second sequencing read products canbe no more than 30 bases in length.
  • the sequencing reactions can be conducted on a sequencing apparatus having a detectorthat captures fluorescent signals from the sequencing reactions inside the cellular sample. The sequencing apparatus can be configured to relay the fluorescent signal data captured by the detector to a computer system that is programmed to display images of different fluorescent spots which are co -located in the cellular sample, where individual fluorescent spots correspond to different target RNA molecules or different target polypeptides.
  • the images can have different color fluorescent spots co-located in the same cellular sample at different sequencing cycles.
  • out-of-sync phasing and/or pre-phasing events can occur during synchronized sequencing reactions on clonally amplified template amplicons, where the sequencing reactions comprise polymerase-catalyzed sequencing reactions employing detectably labeled chain terminator nucleotides.
  • a sequencing reaction on one template molecule in the clonally-amplified template molecules moves ahead (e.g., pre-phasing) or fall behind (e.g., phasing) of the sequencing of the other template molecules within the clonally-amplified template molecules.
  • a fluorescent signal is typically detected which corresponds to incorporation of a labeled chain terminator nucleotide.
  • phasing and pre-phasing events can be detected and monitored using incorporation of a labeled chain terminator nucleotide.
  • the plurality of nucleotide reagents of step (f) comprises a plurality of multivalent molecules each comprising a core attached to a plurality of nucleotide - arms, wherein the nucleotide-armsare attached to a nucleotide unit.
  • individual multivalent molecules are labeled with a detectably reporter moiety.
  • the detectable reporter moiety comprises a fluorophore.
  • the core of the multivalent molecule is labeled with a fluorophore, and wherein the fluorophore which is attached to a given core of the multivalent molecule corresponds to the nucleotide base (e.g., adenine, guanine, cytosine, thymine or uracil) of the nucleotide arm.
  • a fluorophore which is attached to a given core of the multivalent molecule corresponds to the nucleotide base (e.g., adenine, guanine, cytosine, thymine or uracil) of the nucleotide arm.
  • At least one of the nucleotide arms of the multivalent molecule comprises a linker and/or nucleotide base that is attached to a fluorophore, and wherein the fluorophore which is attached to a given nucleotide base corresponds to the nucleotide base (e.g., adenine, guanine, cytosine, thymine or uracil) of the nucleotide arm.
  • the nucleotide base e.g., adenine, guanine, cytosine, thymine or uracil
  • a sequencing cycle comprises (1) contacting the concatemer/sequencing primer duplex with a first sequencing polymerase to form a complexed polymerase, (2) contacting the complexed polymerase with a detectably labeled multivalent molecule under a condition suitable for binding a complementary nucleotide unit of the multivalent molecule to the complexed polymerase thereby forming a multivalent-binding complex, and the condition is suitable for inhibiting incorporation of the complementary nucleotide unit into the terminal end of the sequencing primer, (3 ) detecting and imaging the fluorescent signal and color emitted by the bound detectably labeled multivalent molecule, (4) removing the first sequencing polymerase and the bound detectably labeled multivalent molecule, and retaining the concatemer/sequencing primer duplex, (5) contacting the retained concatemer/sequencing primer duplex with a second sequencing polymerase and anon- labeled chain terminating nucleotide under a condition suitable for polymerase-cat
  • a sequencing cycle can comprise contacting the concatemer/sequencing primer duplex with a first sequencing polymerase to form a complexed polymerase.
  • a sequencing cycle can comprise contacting the complexed polymerase with a detectably labeled multivalent molecule under a condition suitable for binding a complementary nucleotide unit of the multivalent molecule to the complexed polymerase thereby forming a multivalent -binding complex, and the condition is suitable for inhibiting incorporation of the complementary nucleotide unit into the terminal end of the sequencing primer.
  • a sequencing cycle can comprise detecting and imaging the fluorescent signal and color emitted by the bound detectably labeled multivalent molecule.
  • a sequencing cycle can comprise removing the first sequencing polymerase and the bound detectably labeled multivalent molecule, and retaining the concatemer/sequencing primer duplex.
  • a sequencing cycle can comprise contacting the retained concatemer/sequencing primer duplex with a second sequencing polymerase and a non-labeled chain terminating nucleotide under a condition suitable for polymerase-catalyzed incorporation of the non-labeled chain terminating nucleotide into the terminal end of the sequencing primer.
  • a sequencing cycle can comprise removing the chain terminating moiety (e.g., unblocking) and retaining the concatemer/sequencing primer duplex.
  • individual cycle times can be achieved in less than 30 minutes.
  • the field of view (FOV) can exceed 1mm 2 and the cycle time for scanning large area (> 10mm 2 ) can be less than 5 minutes.
  • no more than 2-30 sequencing cycles can be conducted on the plurality of concatemers inside the cellular sample to generate a plurality of first sequencing read products and a plurality of second sequencing read products.
  • no more than 2-30 sequencing cycles can be conducted on the plurality of concatemers inside the cellular sample to generate a plurality of first sequencing read products.
  • no more than 2-30 sequencing cycles canbe conducted on the plurality of concatemers insidethe cellular sample to generate a plurality of second sequencing read products.
  • the sequence of the first sequencing read product can be determined and aligned with a first reference sequence to confirm the presence of the first target RNA molecules inside the cellular sample.
  • the sequence of the second sequencing read product can be determined and aligned with a second reference sequence to confirm the presence of the second target RNA molecules inside the cellular sample.
  • the sequences of the first and second sequencing read products can be aligned after each round of generating the first and second sequencing read products which are no more than 30 bases in length, or after generating a set of reiterative sequencing read products wherein the first and second sequencing read products which are no more than 30 bases in length.
  • the sequences of the first and second sequencing read products can be aligned after each round of generating the first and second sequencing read products. In some embodiments, the first and second sequencing read products can be no more than 30 bases in length. In some embodiments, the sequences of the first and second sequencing read products can be aligned after generating a set of reiterative sequencing read products. In some embodiments, wherein the firstand second sequencing read products can be no more than 30 bases in length. In some embodiments, the sequences of the first sequencing read products can be aligned after each round of generating the first sequencing read products. In some embodiments, the first sequencing read products can be no more than 30 bases in length.
  • the sequences of the first sequencing read products can be aligned after generating a set of reiterative sequencing read products. In some embodiments, wherein the first sequencing read products can be no more than 30 bases in length. In some embodiments, the sequences of the second sequencing read products can be aligned after each round of generating the second sequencing read products. In some embodiments, the second sequencing read products can be no more than 30 bases in length. In some embodiments, the sequences of the second sequencing read products can be aligned after generating a set of reiterative sequencing read products. In some embodiments, wherein the second sequencing read products can be no more than 30 bases in length.
  • the sequencing reactions can be conducted on a sequencing apparatus having a detector that can capture fluorescent signals from the sequencing reactions inside the cellular sample.
  • the sequencing apparatus can be configured to relay the fluorescent signal data captured by the detector to a computer system that can be programmed to display images of different fluorescent spots which can be co-located in the cellular sample, where individual fluorescent spots can correspond to different target RNA molecules (e.g., the first target RNA molecule, the second target RNA molecule, or a combination of the first RNA molecule and the second RNA molecule.
  • steps (2) and (3) can be conducted at a gentle temperature of about 35 - 45°C, or about 39 - 42°C.
  • steps (2) and (3) can be conducted at a gentle temperature which can help retain the compact size and shape of a DNA nanoball during multiple sequencing cycles (e.g., up to 30 cycles) which can improveFWHM (full width half maximum) of a spot image of the DNA nanoball inside a cellular sample.
  • the DNA nanoball does not unravel during multiple sequencing cycles.
  • the spot image of the DNA nanoball does not enlarge during multiple sequencing cycles.
  • the spot image of the DNA nanoball remains a discrete spot during multiple sequencing cycles.
  • the spot image can be represented as a Gaussian spot and the size can be measured as a FWHM.
  • a smaller spot size as indicated by a smaller FWHM typically correlates with an improved image of the spot.
  • the FWHM of a nanoball spot can be about 10 um or smaller.
  • out-of-sync phasing and/or pre-phasing events can occur during synchronized polymerase -catalyzed sequencing reactions employing detectably labeled multivalent molecules.
  • a fluorescent signal can be detected which corresponds to binding of complementary nucleotide unit of a multivalent molecule to the complexed polymerase thereby forming a multivalent-binding complex.
  • phasing and prephasing events can be detected and monitored using binding of labeled multivalent molecules.
  • the phasing and/or pre-phasing rate when conducting up to 30 sequencing cycles with detectably labeled multivalent molecules, can be less than about 5%, or less than about 1%, or less than about 0.01%, or less than about 0.001%.
  • the phasing and/or pre-phasing rates for conducting up to 30 sequencing cycles using labeled chain terminator nucleotides can be about 5%.
  • the sequencing of (f) can comprise determining the sequence of the first nucleic acid molecule or the portion thereof, wherein the first nucleic acid molecule or the portion thereof consists of 2 -30 nucleotides. In some embodiments, the sequence of (f) can comprise determining the sequence of the second nucleic acid molecule or the portion thereof, wherein the second nucleic acid molecule or the portion thereof consists of 2-30 nucleotides. In some embodiments, the 2-30 nucleotides of the first nucleic acid molecule or the portion thereof comprises the first identification sequence and the 2-30 nucleotides of the second nucleic acid molecule or the portion thereof comprises the second identification sequence.
  • the 2-30 nucleotides of the first nucleic acid molecule or the portion thereof further comprises at least a portion of the sequence of the first cDNA molecule or the first mRNA molecule and the 2-30 nucleotides of the second nucleic acid molecule or the portion thereof further comprises at least a portion of the sequence of the second cDNA molecule or the second mRNA molecule.
  • the determining comprises contacting the first concatemer with a polymerizing enzyme, a plurality of nucleotides, and a primer sequence that is complementary to a portion of the first concatemer under conditions sufficient to form a binding complex comprising the polymerizing enzyme, a nucleotide of the plurality of nucleotides that is complementary and binds to a nucleotide unit of the first concatemer, and the first concatemer hybridized to the primer sequence, wherein the nucleotide comprises a fluorescent label and a removable blocking group at the 3 ’ carbon position of the sugar moiety .
  • the method further comprises incorporating the nucleotide into the 3 ’ end of the primer sequence.
  • the method further comprises identifying the nucleobase of the incorporated nucleotide by imaging the fluorescent label of the incorporated nucleotide.
  • the determining comprises contacting the second concatemer with a polymerizing enzyme, a plurality of nucleotides, and a primer sequence that is complementary to a portion of the second concatemer under conditions sufficient to form a binding complex comprising the polymerizing enzyme, a nucleotide of the plurality of nucleotides that is complementary and binds to a nucleotide unit of the second concatemer, and the second concatemer hybridized to the primer sequence, wherein the nucleotide comprises a fluorescent label and a removable blocking group at the 3 ’ carbon position of the sugar moiety .
  • the method further comprises incorporating the nucleotide into the 3 ’ end of the primer sequence.
  • the method further comprises identifying the nucleobase of the incorporated nucleotide by imaging the fluorescent label of the incorporated nucleotide. In some embodiments, the determining further comprises contacting the nucleotide with an agent to remove the blocking group from the nucleotide and generate a 3 ’ OH group on the sugar moiety.
  • the blocking group comprises an alkyl group, an alkenyl group, an alkynyl group, an allyl group, an aryl group, a benzyl group, an azide group, an azido group, an O-azidomethyl group, an amine group, an amide group, a keto group, an isocyanate group, a phosphate group, a thiol group, a disulfide group, a carbonate group, a urea group, a silyl group, or any combination thereof.
  • the plurality of nucleotides consists of at least two of the same type of nucleotide, wherein the same type of nucleotide is selected from the group comprising dATP, dGTP, dCTP, dTTP, and dUTP.
  • the fluorescent label of one type of nucleotide of the group emits light at a wavelength that is different from the wavelength of light emitted from the fluorescent label of another type of nucleotide of the group.
  • the plurality of nucleotides comprises at least two types of nucleotides, wherein the at least two types of nucleotides are selected from the group comprising dATP, dGTP, dCTP, dTTP, and dUTP.
  • the fluorescent label of one type of nucleotide of the group emits light at a wavelength that is different from the wavelength of light emitted from the fluorescent label of another type of nucleotide of the group.
  • the determining comprises: contacting two of the first concatemer with two of a polymerizing enzyme, a plurality of nucleotide conjugates, and two of a primer sequence that is complementary to a portion of the first concatemer under conditions sufficient to form a multivalent binding complex comprising each of the two of the polymerizing enzyme, a nucleotide conjugate of the plurality of nucleotide conjugates, and each of the two of the first concatemer hybridized to each of the two of the primer sequence, wherein the nucleotide conjugate comprises a label and at least two of a nucleotide moiety, wherein two of the at least two of the nucleotide moiety are each complementary and bind to a nucleotide of each of the two of the first concatemer.
  • the determining comprises detecting the multivalent binding complex through the label of the nucleotide conjugate; and identifying the nucleobases of the nucleotides of the two of the first concatemer that are each complementary and bind to each of the two of the at least two of the nucleotide moiety of the nucleotide conjugate.
  • the conditions inhibit incorporation of the at least two of the nucleotide moiety of the nucleotide conjugate into the two of the first or second concatemer.
  • the nucleotide conjugate comprises a core coupled to a plurality of nucleotide arms, wherein each of the nucleotide moiety is attached to one nucleotide arm of the plurality of nucleotide arms.
  • the detectable label comprises a fluorescent label. In some embodiments, the detecting comprises imaging the fluorescent label.
  • the determining further comprises: removing the two of the polymerizing enzyme and the nucleotide conjugate from the two of the first concatemer; contacting each of the two of the first concatemer hybridized to each of the two of the primer sequence with two of a second polymerizing enzyme and a plurality of unlabeled nucleotides under conditions suitable for forming a binding complex comprising each of the two of the second polymerizing enzyme, each of the two of the first concatemer hybridized to each of the two of the primer sequence, and two of the plurality of the unlabeled nucleotides; and incorporating each of the two of the plurality of the unlabeled nucleotides into each of the two of the primer sequence, wherein each of the two of the plurality of the unlabeled nucleotides is complementary and binds to a nucleotide of each of the two of the first concatemer, wherein an unlabeled nucleotide of the plurality of unlabeled nucleo
  • the determining comprises: contacting two of the second concatemer with two of a polymerizing enzyme, a plurality of nucleotide conjugates, and two of a primer sequence that is complementary to a portion of the second concatemer under conditions sufficient to form a multivalent binding complex comprising each of the two of the polymerizing enzyme, a nucleotide conjugate of the plurality of nucleotide conjugates, and each of the two of the second concatemer hybridized to each of the two of the primer sequence, wherein the nucleotide conjugate comprises a label and at least two of a nucleotide moiety, wherein two of the at least two of the nucleotide moiety are each complementary and bind to a nucleotide of each of the two of the second concatemer; detecting the multivalent binding complex through the label of the nucleotide conjugate; and identifying the nucleobases of the nucleotides of the two of the second concatemer that are each
  • the nucleotide conjugate comprises a core coupled to a plurality of nucleotide arms, wherein each of the nucleotide moiety is attachedto one nucleotide arm of the plurality of nucleotide arms.
  • the detectable label comprises a fluorescent label. In some embodiments, the detecting comprises imaging the fluorescent label.
  • the determining further comprises: removing the two of the polymerizing enzyme and the nucleotide conjugate from the two of the first concatemer; contacting each of the two of the first concatemer hybridized to each of the two of the primer sequence with two of a second polymerizing enzyme and a plurality of unlabeled nucleotides under conditions suitable for forming a binding complex comprising each of the two of the second polymerizing enzyme, each of the two of the first concatemer hybridized to each of the two of the primer sequence, and two of the plurality of the unlabeled nucleotides; and incorporating each of the two of the plurality of the unlabeled nucleotides into each of the two of the primer sequence, wherein each of the two of the plurality of the unlabeled nucleotides is complementary and binds to a nucleotide of each of the two of the first concatemer, wherein an unlabeled nucleotide of the plurality of unlabeled nucleo
  • the determining further comprises: removing the two of the polymerizing enzyme and the nucleotide conjugate from the two of the second concatemer; contacting each of the two of the second concatemer hybridized to each of the two of the primer sequence with two of a second polymerizing enzyme and a plurality of unlabeled nucleotides under conditions sufficient for forming a binding complex comprising each of the two of the second polymerizing enzyme, each of the two of the second concatemer hybridized to each of the two of the primer sequence, and two of the plurality of the unlabeled nucleotides and incorporating each of the two of the plurality of the unlabeled nucleotides into each of the two of the primer sequence, wherein each of the two of the plurality of the unlabeled nucleotides is complementary and binds to a nucleotide of each of the two of the second concatemer, wherein an unlabeled nucleotide of the plurality of unlabeled
  • the nucleotide conjugate comprises a core coupled to a plurality of nucleotide arms, wherein each of the nucleotide moiety is attachedto one nucleotide arm of the plurality of nucleotide arms.
  • the detectable label comprises a fluorescent label. In some embodiments, the detecting comprises imaging the fluorescent label.
  • the determining further comprises: removing the two of the polymerizing enzyme and the nucleotide conjugate from the two of the first concatemer; contacting each of the two of the first concatemer hybridized to each of the two of the primer sequence with two of a second polymerizing enzyme and a plurality of unlabeled nucleotides under conditions suitable for forming a binding complex comprising each of the two of the second polymerizing enzyme, each of the two of the first concatemer hybridized to each of the two of the primer sequence, and two of the plurality of the unlabeled nucleotides; and incorporating each of the two of the plurality of the unlabeled nucleotides into each of the two of the primer sequence, wherein each of the two of the plurality of the unlabeled nucleotides is complementary and binds to a nucleotide of each of the two of the first concatemer, wherein an unlabeled nucleotide of the plurality of unlabeled nucleo
  • the determining comprises: contacting two of the first concatemer with two of a polymerizing enzyme, a plurality of nucleotide conjugates, a first primer sequence that is complementary to a first portion of the first concatemer, and a second primer sequence that is complementary to a second portion of the first concatemer under conditions sufficient to form a multivalent binding complex comprising each of the two of the polymerizing enzyme, a nucleotide conjugate of the plurality of nucleotide conjugates, the first portion of the first concatemer hybridized to the first primer sequence and a second portion of the first concatemer hybridized to the second primer sequence, wherein the nucleotide conjugate comprises a label and at least two of a nucleotide moiety, wherein two of the at least two of the nucleotide moiety are each complementary and bind to a nucleotide of a first portion and a second portion of the first concatemer; detecting the multivalent binding complex through the label of
  • the nucleotide conjugate comprises a core coupled to a plurality of nucleotide arms, wherein each of the nucleotide moiety is attachedto one nucleotide arm of the plurality of nucleotide arms.
  • the detectable label comprises a fluorescent label. In some embodiments, the detecting comprises imaging the fluorescent label.
  • the determining further comprises: removing the two of the polymerizing enzyme and the nucleotide conjugate from the two of the first concatemer; contacting each of the two of the first concatemer hybridized to each of the two of the primer sequence with two of a second polymerizing enzyme and a plurality of unlabeled nucleotides under conditions suitable for forming a binding complex comprising each of the two of the second polymerizing enzyme, each of the two of the first concatemer hybridized to each of the two of the primer sequence, and two of the plurality of the unlabeled nucleotides; and incorporating each of the two of the plurality of the unlabeled nucleotides into each of the two of the primer sequence, wherein each of the two of the plurality of the unlabeled nucleotides is complementary and binds to a nucleotide of each of the two of the first concatemer, wherein an unlabeled nucleotide of the plurality of unlabeled nucleo
  • the determining comprises: contacting two of the second concatemer with two of a polymerizing enzyme, a plurality of nucleotide conjugates, a first primer sequence that is complementary to a first portion of the second concatemer, and a second primer sequence that is complementary to a second portion of the second concatemer under conditions sufficient to form a multivalent binding complex comprising each of the two of the polymerizing enzyme, a nucleotide conjugate of the plurality of nucleotide conjugates, the first portion of the second concatemer hybridized to the first primer sequence and a second portion of the second concatemer hybridized to the second primer sequence, wherein the nucleotide conjugate comprises a label and at least two of a nucleotide moiety, wherein two of the at least two of the nucleotide moiety are each complementary and bind to a nucleotide of a first portion and a second portion of the second concatemer; detecting the multivalent binding complex through the label of
  • the conditions inhibit incorporation of the at least two of the nucleotide moiety of the nucleotide conjugate into the two of the first or second concatemer.
  • the nucleotide conjugate comprises a core coupled to a plurality of nucleotide arms, wherein each of the nucleotide moiety is attached to one nucleotide arm of the plurality of nucleotide arms.
  • the detectable label comprises a fluorescent label. In some embodiments, the detecting comprises imaging the fluorescent label.
  • the plurality of nucleotides consists of at least two of the same type of nucleotide, wherein the same type of nucleotide is selected from the group comprising dATP, dGTP, dCTP, dTTP, and dUTP, wherein the fluorescent label of one type of nucleotide of the group emits light at a wavelength that is different from the wavelength of light emitted from the fluorescent label of another type of nucleotide of the group.
  • the plurality of nucleotides comprises at least two types of nucleotides, wherein the at least two types of nucleotides are selected from the group comprising dATP, dGTP, dCTP, dTTP, and dUTP, wherein the fluorescent label of one type of nucleotide of the group emits light at a wavelength that is different from the wavelength of light emitted from the fluorescent label of another type of nucleotide of the group.
  • the determining further comprises contacting the nucleotide with an agent to remove the blocking group from the nucleotide and generate a 3 ’ OH group on the sugar moiety.
  • the blocking group comprises an alkyl group, an alkenyl group, an alkynyl group, an allyl group, an aryl group, a benzyl group, an azide group, an azido group, an O-azidomethyl group, an amine group, an amide group, a keto group, an isocyanate group, a phosphate group, a thiol group, a disulfide group, a carbonate group, a urea group, or a silyl group.
  • the first target nucleic acid sequence or the second target nucleic acid sequence may correspond to at least a portion of a messenger RNA (mRNA) molecule. In methods as disclosed herein, the first target nucleic acid sequence and the second target nucleic acid sequence may correspond to at least a portion of a messenger RNA (mRNA) molecule. In methods as disclosed herein, the first target nucleic acid sequence or the second target nucleic acid sequence may correspond to at least a portion of a complementary DNA (cDNA) molecule. In methods as disclosed herein, the first target nucleic acid sequence and the second target nucleic acid sequence may correspond to at least a portion of a complementary DNA (cDNA) molecule.
  • mRNA messenger RNA
  • mRNA messenger RNA
  • cDNA complementary DNA
  • the first target nucleic acid sequence and the second target nucleic acid sequence may correspond to at least a portion of a complementary DNA (cDNA) molecule.
  • the first target nucleic acid sequence and/or the second target nucleic acid sequence may correspond to two separate portions of the same mRNA or cDNA molecule. In some embodiments, the first target nucleic acid sequence and the second target nucleic acid sequence may correspond to two different mRNA or cDNA molecules. In some embodiments, the determining comprises detecting in situ the first or second sequencing product nucleic acid molecule inside the biological sample through imaging. In some embodiments, the determining comprises detecting in situ simultaneously the first and second sequencing product nucleic acid molecules inside the biological sample through imaging. In some embodiments, the imaging comprises fluorescent imaging.
  • the biological sample comprises a cellular organelle, a cell, a whole cell, a group of whole cells, a tissue, an intact tissue, a tumor, an intact tumor, an organ, an organism, a protozoa, an algae, a bacteria, a virus, a plant, a fungus, an insect, or an animal.
  • the biological sample comprises a fresh sample, a processed sample, a freshly-frozen sample, a sectioned sample, or a formalin-fixed and paraffin-embedded (FFPE) sample.
  • the biological sample comprises a fresh cellular sample, a freshly -frozen cellular sample, a sectioned cellular sample, or an FFPE cellular sample.
  • the at least two target nucleic acid sequences comprise a target DNA sequence. In some embodiments, the at least two target nucleic acid sequences comprise a target RNA sequence. In some embodiments, the at least two target nucleic acid sequences comprise a first target RNA sequence and a second target RNA sequence. In some embodiments, the first target RNA sequence comprises coding RNA, non-coding RNA, messenger RNA (mRNA), transfer RNA (tRNA), ribosomal RNA (rRNA), microRNA (miRNA), guide RNA (gRNA), small nuclear RNA (snRNA), small interference RNA (siRNA), anti-sense RNA, mature microRNA, or immature microRNA.
  • mRNA messenger RNA
  • tRNA transfer RNA
  • rRNA ribosomal RNA
  • miRNA microRNA
  • gRNA small nuclear RNA
  • siRNA small interference RNA
  • anti-sense RNA mature microRNA, or immature microRNA.
  • the second target RNA sequence comprises codingRNA, non-coding RNA, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti-sense RNA, mature microRNA, or immature microRNA.
  • the at least two target nucleic acid sequences comprise a first target DNA sequence and a second target DNA sequence.
  • the first target DNA sequence comprises complementary DNA (cDNA), genomic DNA (gDNA), non-coding DNA, or coding DNA.
  • the second target DNA sequence comprises cDNA, gDNA, non -coding DNA, or codingDNA.
  • the at least two target nucleic acid sequences comprise a target RNA sequence and a target DNA sequence.
  • the target RNA sequence comprises coding RNA, non-coding RNA, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti -sense RNA, mature microRNA, or immature microRNA.
  • the targetDNA sequence comprises cDNA, gDNA, non-codin DNA, or codingDNA.
  • methods for detecting at least two different target RNA molecules in a cellular sample can further comprise step (g): removing the plurality of first sequencing read products from the first concatemer molecules and retaining the first concatemer molecules in the cellular sample, and removing the plurality of second sequencing read products from the second concatemer molecules and retaining the second concatemer molecules in the cellular sample.
  • (g) can comprise removing the plurality of first sequence read products from the first concatemer molecule.
  • step (g) can further comprise retaining the first concatemer molecule in the cellular sample.
  • step (g) can comprise removing the plurality of second sequence read products from the second concatemer molecule.
  • step (g) can further comprise retaining the second concatemer molecule in the cellular sample.
  • methods for detecting at least two different target RNA molecules in a cellular sample further comprising step (h): reiteratively sequencing the plurality of concatemers by repeating steps (f) and (g) at least once, wherein the sequences of the plurality of first sequencing read products confirms the presence of the first target RNA molecules in the cellular sample, and wherein the sequences of the plurality of second sequencing read products confirms the presence of the second target RNA molecules in the cellular sample.
  • the repeating comprises repeating at least 2 times, 5 times, 10 times, 15 times, 20 times, 25 times, 30 times, 35 times, 40 times, 45 times, or at least 50 times.
  • step (h) can comprise reiteratively sequencing the plurality of concatemers by repeating steps (f) and (g) at least once.
  • the repeating comprises repeating at least 2 times, 5 times, 10 times, 15 times, 20 times, 25 times, 30 times, 35 times, 40 times, 45 times, or at least 50 times.
  • the sequences of the plurality of first sequencing read products can confirm the presence of the first target RNA molecules in the cellular sample.
  • the sequences of the plurality of second sequencing read products can confirm the presence of the second target RNA molecules in the cellular sample.
  • the sequences of the plurality of first sequencing read products can confirm the presence of the first target RNA molecules in the cellular sample and the sequence of the plurality of the second sequencing read products can confirm the presence of the second target RNA molecules in the sample.
  • the present disclosure provides a method for detecting in situ at least two target nucleic acid molecules and at least two polypeptides in a biological sample, wherein the at least two target nucleic acid sequences comprise a first target nucleic acid sequence and a second target nucleic acid sequence, wherein the at least two target polypeptides comprise a first target polypeptide encoded by the first target nucleic acid sequence or a reverse complement thereof and a second target polypeptide encoded by the second target nucleic acid sequence or a reverse complement thereof.
  • the method comprises step (a) providing the biological sample.
  • the biological sample comprises a first nucleic acid molecule, comprising the first target nucleic acid sequence or a portion thereof, or the reverse complement of the first target nucleic acid sequence or a portion thereof. In some embodiments, the biological sample comprises a second nucleic acid molecule, comprising the second target nucleic acid sequence or a portion thereof, or the reverse complement of the second target nucleic acid sequence or a portion thereof.
  • the biological sample comprises a third nucleic acid molecule, comprising a third target nucleic acid sequence or a portion thereof, or the reverse complement of the third target nucleic acid sequence or a portion thereof, wherein the presence of third target nucleic acid sequence or the reverse complement thereof identifies the presence of the first target polypeptide in the biological sample.
  • the biological sample comprises a fourth nucleic acid molecule, comprising a fourth target nucleic acid sequence or a portion thereof, or the reverse complement of the fourth target nucleic acid sequence or a portion thereof, wherein the presence of the fourth target nucleic acid sequence or the reverse complement thereof identifies the presence of the second target polypeptide in the biological sample.
  • the method further comprises step (b) determining in situ the sequence of the first nucleic acid molecule or a portion thereof in the biological sample to generate a first sequencing product nucleic acid molecule that is complementary and binds to the first nucleic acid molecule or a portion thereof.
  • step (b) further comprises determining in situ the sequence of the third nucleic acid molecule or a portion thereof in the biological sample to generate a third sequencing product nucleic acid moleculethat is complementary and binds to the third nucleic acid molecule or a portion.
  • the method further comprises step (c) identifying in situ the sequence of: the second nucleic acid molecule or a portion thereof in the biological sample to generate a second sequencing product nucleic acid molecule that is complementary and binds to the second nucleic acid molecule or a portion thereof.
  • step (c) further comprises identifying in situ the sequence of the fourth nucleic acid molecule or a portion thereof in the biological sample to generate a fourth sequencing product nucleic acid molecule that is complementary andbinds to the fourth nucleic acid molecule or a portion .
  • the full sequence of the first target nucleic acid sequence and the full sequence of the second target nucleic acid sequence have at least one nucleotide of difference.
  • the full sequence of the first target polypeptide and the full sequence of the second target polypeptide have at least one amino acid of difference.
  • the performance of step (b) is under a condition that prevents the performance of the step (c).
  • the present disclosure provides a method for method for detecting in situ at least two target nucleic acid sequencesand at least two polypeptides in a biological sample.
  • at least two target nucleic acid sequences comprise a first target nucleic acid sequence and a second target nucleic acid sequence.
  • the at least two target polypeptides comprise a first target polypeptide encoded by the first target nucleic acid sequence or a reverse complement thereof .
  • the at least two target polypeptides comprise a second target polypeptide encoded by the second target nucleic acid sequence or a reverse complement thereof.
  • the method comprises (a) providing the biological sample comprising: (i) a first nucleic acid molecule, comprising the first target nucleic acid sequence or a portion thereof, or the reverse complement of the first target nucleic acid sequence or a portion thereof.
  • the biological sample comprises (ii) a second nucleic acid molecule, comprising the second target nucleic acid sequence or a portion thereof, or the reverse complement of the second target nucleic acid sequence or a portion thereof.
  • the biological sample comprises (iii)a third nucleic acid molecule, comprising a third target nucleic acid sequence or a portion thereof, or the reverse complement of the third target nucleic acid sequence or a portion thereof.
  • the presence of third target nucleic acid sequence or the reverse complement thereof identifies the presence of the first target polypeptide in the biological sample.
  • the biological sample further comprises (iv) a fourth nucleic acid molecule, comprising a fourth target nucleic acid sequence or a portion thereof, or the reverse complement of the fourth target nucleic acid sequence or a portion thereof, wherein the presence of the fourth target nucleic acid sequence or the reverse complement thereof identifies the presence of the second target polypeptide in the biological sample.
  • the method further comprises the step of (b) determining in situ the sequence of: (i) the first nucleic acid molecule or a portion thereof in the biological sample to generate a first sequencing product nucleic acid molecule that is complementary and binds to the first nucleic acid molecule or a portion thereof, wherein the firstnucleic acid molecule or the portion thereof consists of 2-30 nucleotides.
  • step (b) further comprises determining in situ the sequence of (ii) the third nucleic acid molecule or a portion thereof in the biological sample to generate a third sequencing product nucleic acid molecule that is complementary and binds to the third nucleic acid molecule or a portion, wherein the third nucleic acid molecule or the portion thereof consists of 2-30 nucleotides.
  • the methodfurther comprises (c) removing the first sequencing product nucleic acid molecule from the first nucleic acid molecule and the third sequencing product nucleic acid molecule from the third nucleic acid molecule.
  • the firstnucleic acid molecule and the third nucleic acid molecule are positioned in the biological sample after the removing.
  • the method further comprises step (d): repeating (b) and (c). In some embodiments, the method further comprises step (e): identifying in situ the sequence of : (i) the second nucleic acid molecule or a portion thereof in the biological sample to generate a second sequencing product nucleic acid molecule that is complementary and binds to the second nucleic acid molecule or a portion thereof, wherein the second nucleic acidmolecule orthe portion thereof consists of2-30 nucleotides.
  • step (e) further comprises identifying in situ the sequence of (ii) the fourth nucleic acid molecule or a portion thereof in the biological sample to generate a fourth sequencing product nucleic acid molecule that is complementary and binds to the fourth nucleic acid molecule or a portion.
  • the fourth nucleic acid molecule or the portion thereof consists of 2-30 nucleotides.
  • the method further comprises step (f): removing the second sequencing product nucleic acid molecule from the second nucleic acid molecule and the fourth sequencing product nucleic acid molecule from the fourth nucleic acid molecule, wherein the second nucleic acid molecule andthe fourth nucleic acid molecule are located in the biological sample after the removing.
  • the method further comprises step (g): repeating steps (e) and (f).
  • the full sequence of the first target nucleic acid sequence and the full sequence of the second target nucleic acid sequence have at least one nucleotide of difference.
  • the full sequence of the first target polypeptide and the full sequence of the second target polypeptide have at least one amino acid of difference.
  • the performance of step (b) is under a condition that prevents the performance of the step (e).
  • the present disclosure provides a method for detecting in situ at least two target RNA sequences and at least two polypeptides in a biological sample.
  • the method comprises a step (a) providing the biological sample that is immobilized on a surface, permeabilized and fixed.
  • the biological sample comprises (i) a first target RNA sequence of at least two target RNA sequences and a first target polypeptide of the at least two polypeptides, wherein the first target polypeptide is encoded by the first target RNA sequence or a reverse complement thereof.
  • the biological sample comprises (ii) a second target RNA sequence of at least two target RNA sequences and a second target polypeptide of the at least two polypeptides, wherein the second target polypeptide is encoded by the second target RNA sequence or a reverse complement thereof.
  • the method further comprises a step (b) producing a first target cDNA sequence through reverse transcription of the first target RNA sequence and a second target cDNA sequence through reverse transcription of the second target RNA sequence .
  • the method further comprises a step (c) contacting the first target cDNA sequence with a first oligonucleotide comprising a first end portion and a second end portion, wherein the first end portion and second end portion of the first oligonucleotide are complementary and bind to two neighboring segments of the first target cDNA sequence so that the first oligonucleotide forms a circular structure with a gap between the first end portion and the second end portion, wherein the first oligonucleotide comprises a first identification sequence that identifies the first target RNA sequence, a first sequencing primer, and a nucleic acid amplification primer.
  • step (c) further comprises contacting the second target cDNA sequence with a second oligonucleotide comprising a first end portion and a second end portion, wherein the first end portion and second end portion of the second oligonucleotide are complementary and bind to two neighboring segments of the second target cDNA sequence, so that the second oligonucleotide forms a circular structure with a gap between the first end portion andthe second end portion.
  • the second oligonucleotide comprises a second identification sequence that identifies the second target RNA sequence, a second sequencing primer, and a nucleic acid amplification primer.
  • step (c) comprises contacting the first target polypeptide with a first oligonucleotide conjugate comprising a first short nucleic acid and a first binding moiety that binds specifically to the first target polypeptide, wherein the first short nucleic acid comprises a first tag sequence, a second tag sequence, wherein the first oligonucleotide conjugate binds specifically to the firsttarget polypeptide through the first binding moiety to form a first binding complex, wherein the firstand second tag sequences identify the first binding moiety .
  • step (c) comprises contacting the second target polypeptide with a second oligonucleotide conjugate comprising a second short nucleic acid and a second binding moiety that binds specifically to the second target polypeptide, wherein the second short nucleic acid comprises a third tag sequence, a fourth tag sequence, wherein the second oligonucleotide conjugate binds specifically to the second target polypeptide through the second binding moiety to form a second binding complex, wherein the third and fourth tag sequences identify the second binding moiety .
  • the method further comprises step (d) contacting the first binding complex with a third oligonucleotide comprising a first end portion and a second end portion, wherein the first end portion and second end portion of the third oligonucleotide are complementary and bind to the first tag sequence and the second tag sequence of the first oligonucleotide conjugate so that the third oligonucleotide forms a circular structure with a gap between the first end portion andthe second end portion.
  • step (d) comprises contacting the second binding complex with a fourth oligonucleotide comprising a first end portion and a second end portion, wherein the first end portion and second end portion of the fourth oligonucleotide are complementary and bind to the third tag sequence and the fourth tag sequence so that the fourth oligonucleotide forms a circular structure with a gap between the first end portion andthe second end portion.
  • the method further comprises step (e) joining the firstand second end portions of the first oligonucleotide to produce a first circular oligonucleotide, the first and second end portions of the second oligonucleotides to produce a second circular oligonucleotide, the first and second end portions of the third oligonucleotide to produce a third circular oligonucleotide and the first and second end portions of the fourth oligonucleotide to produce a fourth circular oligonucleotide, wherein the first circular oligonucleotide and the third circular oligonucleotide comprise a first sequencing primer or the reverse complement thereof, wherein the second circular oligonucleotide andthe fourth circular oligonucleotide comprise a second sequencing primer or the reverse complement thereof, wherein the sequences of the first and second sequencing primers have at least one nucleotide of difference.
  • the method further comprises step (f) amplifying the first circular oligonucleotide through rolling circle amplification to produce a first concatemer comprising a plurality of the first circular oligonucleotides.
  • step (f) further comprises amplifying the second circular oligonucleotide through rolling circle amplification to produce a second concatemer comprising a plurality of the second circular oligonucleotides.
  • step (f) further comprises amplifying the third circular oligonucleotide through rolling circle amplification to produce a third concatemer comprising a plurality of the third circular oligonucleotides.
  • step (f) further comprises amplifying the fourth circular oligonucleotide through rolling circle amplification to produce a fourth concatemer comprising a plurality of the fourth circular oligonucleotides.
  • the method further comprises step (g) determining in situ the sequence of the first concatemer or a portion thereof to generate a first sequencing product nucleic acid molecule that is complementary and binds to the first concatemer, wherein the sequence of the first concatemer or the portion thereof consists of 2-30 nucleotides.
  • step (g) further comprises determining in situ the sequence of the third concatemer or a portion thereof to generate a third sequencing product nucleic acid molecule that is complementary and binds to the third concatemer, wherein the sequence of the third concatemer or the portion thereof consists of 2 -30 nucleotides.
  • the performance of step (g) is under a condition that prevents the performance of the step (j).
  • the method further comprises step (h) removing the first sequencing product nucleic acid molecule from the first concatemer and the third sequencing product nucleic acid molecule from the third concatemer, wherein the first concatemer and the third concatemer are positioned in the biological sample after the removing.
  • the method further comprises step (i) repeating (g) and (h) at least once.
  • the method further comprises step (j) determining in situ the sequence of the second concatemer or a portion thereof to generate a second sequencing product nucleic acid molecule that is complementary and binds to the second concatemer, wherein the sequence of the second concatemer or the portion thereof consists of 2 -30 nucleotides.
  • step (j) further comprises determining in situ the sequence of the fourth concatemer or a portion thereof to generate a fourth sequencing product nucleic acid molecule that is complementary and binds to the fourth concatemer, wherein the sequence of the fourth concatemer or the portion thereof consists of 2-30 nucleotides.
  • the performance of steps (j) is under a condition that prevents the performance of the step (g).
  • the full sequence of the first target RNA sequence and the full sequence of the second target RNA sequence have at least one nucleotide of difference.
  • the full sequence of the first target polypeptide and the full sequence of the second target polypeptide have at least one amino acid of difference.
  • the method further comprises step (k) removing the second sequencing product nucleic acid molecule from the second concatemer and the fourth sequencing product nucleic acid molecule from the fourth concatemer.
  • the second concatemer and the fourth concatemer are positioned in the biological sample after the removing.
  • the method further comprises step (i) repeating (j) and (k) at least once.
  • the determining comprises imagingthe first, or third sequencing product nucleic acid molecule, wherein the identifying comprises imaging the second or fourth sequencing product nucleic acid molecule and the fourth sequencing product nucleic acid molecule.
  • the method further comprises imaging simultaneously the first sequencing product nucleic acid molecule and the second sequencing product nucleic acid molecule to analyze the spatial distribution of the first and second sequencing product nucleic acid molecules inside the biological sample.
  • the biological sample comprises a cellular organelle, a cell, a whole cell, a group of whole cells, a tissue, an intact tissue, a tumor, an intact tumor, an organ, an organism, a protozoa, an algae, a bacteria, a virus, a plant, a fungus, an insect, or an animal.
  • the biological sample comprises a fresh sample, a processed sample, a freshly-frozen sample, a sectioned sample, or a formalin- fixed and paraffin-embedded (FFPE) sample.
  • FFPE formalin- fixed and paraffin-embedded
  • the first target nucleic acid sequence comprises DNA, cDNA, RNA, coding RNA, non-coding RNA, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti-sense RNA, mature microRNA, or immature microRNA, and/orwherein the first target RNA sequence comprises codingRNA, non -coding RNA, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti-sense RNA, mature microRNA, or immature microRNA.
  • the second target nucleic acid sequence comprises DNA, cDNA, RNA, codin RNA, non-coding RNA, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti-sense RNA, mature microRNA, or immature microRNA, and/or wherein the second target RNA sequence comprises codingRNA, non -coding RNA, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti-sense RNA, mature microRNA, or immature microRNA.
  • the biological sample comprises a human sample, a simian sample, an ape sample, a canine sample, a feline sample, a bovine sample, an equine sample, a murine sample, a porcine sample, a caprine sample, a lupine sample, a ranine sample, a piscine sample, a plant sample, an insect sample, a bacteria sample, an algae sample, a viral sample, a protozoa sample, or a fungus sample.
  • the first, second, third or fourth concatemer further comprises a compaction oligonucleotide, wherein: a first segment of the compaction oligonucleotide is complementary and binds to a first portion of the first, second, third, or fourth concatemer. In some embodiments, a second segment of the compaction oligonucleotide is complementary and binds to a second portion of the first, second, third, or fourth concatemer, to result in a reduction in the size or a change in the shape of the second concatemer.
  • the first sequencing product nucleic acid molecule comprises a first identification sequence that identifies the first target RNA sequence and the second sequencing product nucleic acid molecule comprises a second identification sequence that identifies the second target RNA sequence.
  • the first sequencing product nucleic acid molecule comprises a first identification sequence that identifies the first target RNA sequence and a portion of the first target RNA sequence
  • the second sequencing product nucleic acid molecule comprises a second identification sequence that identifies the second target RNA sequence and a portion of the second target RNA sequence.
  • the determining comprises contacting the first second, third, or fourth concatemer with a polymerizing enzyme, a plurality of nucleotides, and a primer sequence that is complementary to a portion of the first second, third, or fourth concatemer under conditions sufficient to form a binding complex comprising the polymerizing enzyme, a nucleotide of the plurality of nucleotides that is complementary and binds to a nucleotide unit of the first second, third, or fourth concatemer, and the first second, third, or fourth concatemer hybridized to the primer sequence.
  • the nucleotide comprises a fluorescent label and a removable blocking group at the 3 ’ carbon position of the sugar moiety.
  • the determining further comprises incorporating the nucleotide into the 3 ’ end of the primer sequence. In some embodiments, the determining further comprises identifying the nucleobase of the incorporated nucleotide by imaging the fluorescent label of the incorporated nucleotide. In some embodiments, the determining further comprises contacting the nucleotide with an agent to remove a blocking group from the nucleotide and generate a 3’ OH group on the sugar moiety.
  • the blocking group comprises an alkyl group, an alkenyl group, an alkynyl group, an allyl group, an aryl group, a benzyl group, an azide group, an azido group, an O-azidomethyl group, an amine group, an amide group, a keto group, an isocyanate group, a phosphate group, a thiol group, a disulfide group, a carbonate group, a urea group, or a silyl group.
  • the plurality of nucleotides comprise one type of nucleotide selected from a group comprising dATP, dGTP, dCTP, dTTP and dUTP.
  • the plurality of nucleotides comprises a mixture of any combination of two or more types of nucleotides selected from a group consisting of dATP, dGTP, dCTP, dTTP and/or dUTP.
  • the determining comprises contacting two of the first, second, third, or fourth concatemer with two of a polymerizing enzyme, a plurality of nucleotide conjugates, and two of a primer sequence that is complementary to a portion of the first, second, third, or fourth concatemer under conditions sufficient to form a multivalent binding complex comprising each of the two of the polymerizing enzyme, a nucleotide conjugate of the plurality of nucleotide conjugates, and each of the two of the first, second, third, or fourth concatemer hybridized to each of the two of the primer sequence.
  • the nucleotide conjugate comprises a label and at least two of a nucleotide moiety .
  • two of the at least two of the nucleotide moiety are each complementary and bind to a nucleotide of each of the two of the first concatemer.
  • the determining comprises detecting the multivalent binding complex through the label of the nucleotide conjugate.
  • the determining further comprises identifying the nucleobases of the nucleotides of the two of the first concatemer that are each complementary and bind to each of the two of the at least two of the nucleotide moiety of the nucleotide conjugate.
  • the determining further comprises: removing the two of the polymerizing enzyme and the nucleotide conjugate from the two of the first, second, third, or fourth concatemer. In some embodiments, the determining further comprises contacting each of the two of the first, second, third, or fourth concatemer hybridized to each of the two of the primer sequence with two of a second polymerizing enzyme and a plurality of unlabeled nucleotides under conditions suitable for forming a binding complex comprising each of the two of the second polymerizing enzyme, each of the two of the first, second, third, or fourth concatemer hybridized to each of the two of the primer sequence, and two of the plurality of the unlabeled nucleotides and incorporating each of the two of the plurality of the unlabeled nucleotides into each of the two of the primer sequence .
  • each of the two of the plurality of the unlabeled nucleotides is complementary and binds to a nucleotide of each of the two of the first, second, third, or fourth concatemer.
  • an unlabeled nucleotide of the plurality of unlabeled nucleotides comprises a removable blocking group at the 3 ’ carbon of the sugan moiety.
  • the determining comprises: contacting two of the first, second, third, or fourth concatemer with two of a polymerizing enzyme, a plurality of nucleotide conjugates, a first primer sequence that is complementary to a first portion of the first, second, third, or fourth concatemer, and a second primer sequence that is complementary to a second portion of the first, second, third, or fourth concatemer under conditions sufficient to form a multivalent binding complex comprising each of the two of the polymerizing enzyme, a nucleotide conjugate of the plurality of nucleotide conjugates, the first portion of the first, second, third, or fourth concatemer hybridized to the first primer sequence and a second portion of the first, second, third, or fourth concatemer hybridized to the second primer sequence.
  • the nucleotide conjugate comprises a label and at least two of a nucleotide moiety. In some embodiments, two of the at least two of the nucleotide moiety are each complementary and bind to a nucleotide of a first portion and a second portion of the first, second, third, or fourth concatemer. In some embodiments, the determining further comprises detecting the multivalent binding complex through the label of the nucleotide conjugate.
  • the determining further comprises identifying the nucleobases of the nucleotides of the firstand second portions ofthe first, second, third, or fourth concatemer that are each complementary and bind to each of the two of the at least two of the nucleotide moiety of the nucleotide conjugate.
  • the conditions inhibit incorporation of the at least two of the nucleotide moiety of the nucleotide conjugate into the two of the first, second, third, or fourth concatemer.
  • the nucleotide conjugate comprises a core coupled to a plurality of nucleotide arms, wherein each of the nucleotide moiety is attached to one nucleotide arm of the plurality of nucleotide arms.
  • the detectable label comprises a fluorescent label. In some embodiments, the detecting comprises imaging the fluorescent label.
  • the plurality of nucleotides consists of at least two of the same type of nucleotide, wherein the same type of nucleotide is selected from the group comprising dATP, dGTP, dCTP, dTTP, and dUTP, wherein the fluorescent label of one type of nucleotide of the group emits light at a wavelength that is different from the wavelength of light emitted from the fluorescent label of another type of nucleotide of the group.
  • the plurality of nucleotides comprises at least two types of nucleotides, wherein the at least two types of nucleotides are selected from the group comprising dATP, dGTP, dCTP, dTTP, and dUTP, wherein the fluorescent label of one type of nucleotide of the group emits light at a wavelength that is different from the wavelength of light emitted from the fluorescent label of another type of nucleotide of the group.
  • the determining further comprises contacting the nucleotide with an agent to remove the blocking group from the nucleotide and generate a 3’ OH group on the sugar moiety.
  • the blocking group comprises an alkyl group, an alkenyl group, an alkynyl group, an allyl group, an aryl group, a benzyl group, an azide group, an azido group, an O -azidomethyl group, an amine group, an amide group, a keto group, an isocyanate group, a phosphate group, a thiol group, a disulfide group, a carbonate group, a urea group, or a silyl group .
  • the plurality of nucleotides consists of at least two of the same type of nucleotide, wherein the same type of nucleotide is selected from the group comprising dATP, dGTP, dCTP, dTTP, and dUTP, wherein the fluorescent label of one type of nucleotide of the group emits light at a wavelength that is different from the wavelength of light emitted from the fluorescent label of another type of nucleotide of the group.
  • the plurality of nucleotides comprises at least two types of nucleotides, wherein the at least two types of nucleotides are selected from the group comprising dATP, dGTP, dCTP, dTTP, and dUTP, wherein the fluorescent label of one type of nucleotide of the group emits light at a wavelength that is different from the wavelength of light emitted from the fluorescent label of another type of nucleotide of the group.
  • the first target RN A sequence comprises coding RNA, non-codin RNA, messenger RNA (mRNA), transfer RNA (tRNA), ribosomal RNA (rRNA), microRNA (miRNA), guide RNA (gRNA), small nuclear RNA (snRNA), small interference RNA (siRNA), anti-sense RNA, mature microRNA, or immature microRNA.
  • the second target RNA sequence comprises coding RNA, non -coding RNA, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti-sense RNA, mature microRNA, or immature microRNA.
  • the biological sample comprises a human sample, a simian sample, an ape sample, a canine sample, a feline sample, a bovine sample, an equine sample, a murine sample, a porcine sample, a caprine sample, a lupine sample, a ranine sample, a piscine sample, a plant sample, an insect sample, a bacteria sample, an algae sample, a viral sample, a protozoa sample, or a fungus sample.
  • the biological sample comprises a cellular organelle, a cell, a whole cell, a group of whole cells, a tissue, an intact tissue, a tumor, an intact tumor, an organ, or an organism
  • the biological sample is immobilized on a surface.
  • the surface comprises an interior surface of a flow cell.
  • the present disclosure provides methods for conducting in situ multiplex and multi - omics detection and identification using coded padlocks probes.
  • the padlock probes are designed to selectively detect target RNA or polypeptides.
  • the padlock probe may comprise one or more batch-specific primer-binding sites, which are specific to a primer.
  • the primer may be spatially localized, and the padlock probe may therefore not require a barcode sequence. In some embodiments, however, the padlock probe may comprise a barcode sequence.
  • the RNA-specific padlock probes selectively hybridize to cDNA that corresponds to target RNA.
  • the RNA-specific probes carry barcodes thatuniquely identify the cDNA.
  • the RNA-specific padlock probes also carry batch -specific sequencing primer binding sites.
  • the padlock probe may comprise one or more batch-specific primer-binding sites, which are specific to a primer.
  • the primer may be spatially localized, and the padlock probe may therefore not require a barcode sequence. In some embodiments, however, the padlock probe may comprise a barcode sequence.
  • the target polypeptides are detected using antibody -oligonucleotide conjugates and polypeptide-specific padlock probes which selectively hybridize to the oligonucleotide which is conjugated to the antibody.
  • the polypeptide -specific padlock probes carry barcodes that uniquely identify the antibody that selectively binds a target polypeptides.
  • the polypeptide- specific padlock probes also carry a batch -specific sequencing primer which is the same batchspecific sequencing primer carried by a corresponding RNA-specific padlock probe to enable simultaneous sequencing and detection of a target RNA and the polypeptide encoded by that target RNA.
  • the padlock probes enable simultaneous detection and identification of RNA and their encoded polypeptides.
  • Both types of padlock probes are used to generate concatemers which having multiple copies of batch-specific sequencing binding sites and barcodes.
  • the concatemers can collapse into DNA nanoballs having compact shape and size that produce increased signal intensity and color differentiation during sequencing.
  • the limit of optical resolution impedes the ability to perform highly multiplex sequencing.
  • the batch-specific sequencing primer binding sites on the padlock probes enables sequencing a desired subset (e.g., a batch) of the concatemers using selected batch-specific sequencing primers to reduce over-crowding signals and images.
  • the use of batch-specific sequencing primers produces optical images that are intense and resolvable.
  • the batch-specific sequencing methods described herein have many uses. For example, the number of spots that are imaged and associated with sequencing can be counted. The counted spots can be used as a measure or RNA and polypeptide levels in a cellular sample.
  • a pairwise sequencing kit that can be used to conduct 150 forward sequencing cycles and 150 reverse sequencing cycles (e.g., a total 300 sequencing cycles per kit) can be used to reveal more than 300,000 molecules.
  • a non-limiting example cell has approximately 30 copies of 5,000 different transcripts for a total of approximately 150,000 transcripts, and this call has approximately 15,000 polypeptides encoded by the transcripts for a total of 300,000 total RNA and polypeptide molecules.
  • the pairwise kit and be used to conduct 300 sequencing cycles (notin pairwise mode) to reveal the 300,000 different RNA and polypeptide molecules in the cellular sample. See Table 1 .
  • Table 1 lists the estimated total sequencing cycles and total time to decode RNA and polypeptides in a cellular sample .
  • Table 1 Estimated total sequencing cycles and total time to decode RNA and polypeptides in a cellular sample
  • the present disclosure provides methods for detecting in situ at least two different target RNA molecules and two different polypeptides encoded by the at least two different target RNA molecules, comprising step (a): providing a cellular sample deposited on a solid support, wherein the cellular sample harbors (i) a first plurality of DNA amplicons that correspond to a first target RNA molecule, (ii) a second plurality of DNA amplicons that correspond to a second target RNA molecule, (iii) a third plurality of DNA amplicons that correspond to a first polypeptide which is encoded by the first target RNA molecule, and (iv) a fourth plurality of DNA amplicons that correspond to a second polypeptide which is encoded by the second target RNA molecule.
  • the method further comprises step (b): sequencing the first nd third plurality of DNA amplicons inside the cellular sample under a condition that inhibits sequencingthe second and fourth plurality of DNA amplicons, wherein sequencingthe first plurality of DNA amplicons inside the cellular sample comprises generating a plurality of first sequencing read products, wherein the sequences of the first sequencing read products are aligned with a first target reference sequence to confirm the presence of the first target RNA in the cellular sample, and wherein sequencing the third plurality of DNA amplicons inside the cellular sample comprises generating a plurality of second sequencing read products, wherein the sequences of the second sequencing read products are aligned with a second target reference sequence to confirm the presence of the first target polypeptides in the cellular sample.
  • the methodfurther comprises step (c): sequencingthe second and fourth plurality of DNA amplicons inside the cellular sample under a condition that inhibits sequencingthe first and third plurality of DNA amplicons, wherein sequencingthe second plurality of DNA amplicons inside the cellular sample comprise s generating a plurality of third sequencing read products, wherein the sequences of the third sequencing read products are aligned with a third target reference sequence to confirm the presence of the second target RNA in the cellular sample, and wherein sequencing the fourth plurality of DNA amplicons inside the cellular sample comprises generating a plurality of fourth sequencing read products, wherein the sequences of the fourth sequencing read products are aligned with a fourth target reference sequence to confirm the presence of the second target polypeptides in the cellular sample.
  • the present disclosure provides methods for detecting in situ at least two different target RNA molecules and two different polypeptides encoded by the at least two different target RNA molecules, comprising step (a): providing a cellular sample deposited on a solid support, wherein the cellular sample harbors (i) a first plurality of DNA amplicons that correspond to a first target RNA molecule, (ii) a second plurality of DNA amplicons that correspond to a second target RNA molecule, (iii) a third plurality of DNA amplicons that correspond to a first polypeptide which is encoded by the first target RNA molecule, and (iv) a fourth plurality of DNA amplicons that correspond to a second polypeptide which is encoded by the second target RNA molecule.
  • the methods further comprise step (b): sequencin the firstand third plurality of DNA amplicons inside the cellular sample under a condition that inhibits sequencingthe second and fourth plurality of DNA amplicons.
  • step (b) comprises sequencingthe first plurality of DNA amplicons inside the cellular sample comprises conducting no more than 2-30 sequencing cycles to generate a plurality of first sequencing read products, wherein the sequences of the first sequencing read products are aligned with a first target reference sequence to confirm the presence of the first target RNA in the cellular sample.
  • step (b) comprises sequencin the first plurality of DNA amplicons inside the cellular sample comprises conducting 1 -250 sequencing cycles to generate a plurality of first sequencing read products, wherein the sequences of the first sequencing read products are aligned with a first target reference sequence to confirm the presence of the first target RNA in the cellular sample.
  • step (b) comprises sequencing the third plurality of DNA amplicons inside the cellular sample comprises conducting no more than 2 -30 sequencing cycles to generate a plurality of second sequencing read products, wherein the sequences of the second sequencing read products are aligned with a second target reference sequence to confirm the presence of the first target polypeptide in the cellular sample.
  • step (b) comprises sequencing the third plurality of DNA amplicons inside the cellular sample comprises conducting 1-250 sequencing cycles to generate a plurality of second sequencing read products, wherein the sequences of the second sequencing read products are aligned with a second target reference sequence to confirm the presence of the first target polypeptide in the cellular sample.
  • the methods further comprise step (c): removingthe plurality of first sequencing read products from the first DNA amplicons and retaining the first DNA amplicons inside the cellular sample, and removing the plurality of second sequencing read products from the third DNA ampliconsand retaining the third DNA amplicons inside the cellular sample.
  • a 3’ blocking moiety can be added to the firstand second sequencing read products to inhibit further sequencing reactions.
  • a nucleotide analog can be incorporated where the nucleotide analog inhibits incorporation of a subsequent nucleotide.
  • Non-limiting example blocking nucleotide analogs include dideoxynucleotide or a nucleotide having a 2’ or 3 ’ chain terminating moiety.
  • the methods further comprise step (d): reiteratively sequencing the firstand third plurality of DNA amplicons by repeating steps (b) and (c) at least once.
  • the repeating comprises repeating at least 2 times, 5 times, 10 times, 15 times, 20 times, 25 times, 30 times, 35 times, 40 times, 45 times, or at least 50 times.
  • the methods further comprise step (e): sequencingthe second and fourth plurality of DNA amplicons inside the cellular sample under a condition that inhibits sequencingthe first and third plurality of DNA amplicons.
  • step (e) comprises sequencingthe second plurality of DNA amplicons inside the cellular sample comprises conducting no more than 2-30 sequencing cycles to generate a plurality of third sequencing read products, wherein the sequences of the third sequencing read products are aligned with a third target reference sequence to confirm the presence of the second target RNA in the cellular sample.
  • step (e) comprises sequencingthe second plurality of DNA amplicons inside the cellular sample comprises conducting 1 -250 sequencing cycles to generate a plurality of third sequencing read products, wherein the sequences of the third sequencing read products are aligned with a third target reference sequence to confirm the presence of the second target RNA in the cellular sample.
  • step (e) comprises sequencingthe fourth plurality ofDNA amplicons inside the cellular sample comprises conducting no more than 2-30 sequencing cycles to generate a plurality of fourth sequencing read products, wherein the sequences of the fourth sequencing read products are aligned with a fourth target reference sequence to confirm the presence of the second target polypeptide in the cellular sample.
  • step (e) comprises sequencin the fourth plurality ofDNA amplicons inside the cellular sample comprises conducting 1 -250 sequencing cycles to generate a plurality of fourth sequencing read products, wherein the sequences of the fourth sequencing read products are aligned with a fourth target reference sequence to confirm the presence of the second target polypeptide in the cellular sample.
  • the methods comprise step (f): removing the plurality of third sequencing read products from the second DNA ampliconsand retaining the second DNA amplicons inside the cellular sample, and removing the plurality of fourth sequencing read products from the fourth DNA amplicons and retainingthe fourth DNA amplicons inside the cellular sample.
  • a 3’ blocking moiety can be added to the third and fourth sequencing read products to inhibit further sequencing reactions.
  • a nucleotide analog can be incorporated where the nucleotide analog inhibits incorporation of a subsequent nucleotide.
  • Non-limiting example blocking nucleotide analogs include dideoxynucleotide or a nucleotide having a 2’ or 3 ’ chain terminating moiety.
  • the methods comprise step (g): reiteratively sequencin the second and fourth plurality ofDNA amplicons by repeating steps (e) and (f) at least once.
  • the repeating comprises repeating at least 2 times, 5 times, 10 times, 15 times, 20 times, 25 times, 30 times, 35 times, 40 times, 45 times, or at least 50 times.
  • the present disclosure provides methods for detecting in situ at least two different target RNA molecules and two different polypeptides encoded by the at least two different target RNA molecules, comprising step (a): providing a cellular sample deposited on a solid support, wherein the cellular sample is fixed and permeabilized, wherein the cellular sample harbors a first plurality of target RNA and a first plurality of target polypeptides which are encoded by the first plurality of target RNA, and the cellular sample harbors a second plurality of target RNA and a second plurality of target polypeptides which are encoded by the second plurality of target RNA.
  • the cellular sample harbors 2-25 different target RNA molecules, or harbors 25-50 different target RNA molecules, or harbors 50-75 different target RNA molecules, or harbors 75-100 different target RNA molecules. In some embodiments, the cellular sample harbors more than 100 different target RNA molecules, or more than 250 different target RNA molecules, or more than 500 different target molecules, or more than 1000 different target RNA molecules, or more. In some embodiments, the cellular sample harbors more than 10,000 different target RNA molecules. In some embodiments, the cellular sample comprises a whole cell, a plurality of whole cells, an intact tissue or an intact tumor.
  • the cellular sample comprises a fresh cellular sample, a freshly -frozen cellular sample, a sectioned cellular sample, or an FFPE cellular sample.
  • the cellular sample is deposited onto a solid support.
  • the cellular sample is deposited onto a solid support which is passivated with a coating that promotes cell adhesion.
  • the cellular sample is deposited on a support that lacks immobilized capture oligonucleotides.
  • the cellular sample is cultured prior to conducting step (b) which is described below.
  • the cellular sample harbors 2-25 different target polypeptide molecules, or harbors 25-50 different target polypeptide molecules, or harbors 50-75 different target polypeptide molecules, or harbors 75-100 different target polypeptide molecules. In some embodiments, the cellular sample harbors more than 100 different target polypeptide molecules, or more than 250 different target polypeptide molecules, or more than 500 different target molecules, or more than 1000 different target polypeptide molecules, or more. In some embodiments, the cellular sample harbors more than 10,000 different target polypeptide molecules.
  • the target polypeptide molecules are encoded by the target RNA molecules.
  • the methods comprise step (b): generating inside the cellular sample a plurality of cDNA by (i) generating at least a first plurality of target cDNA from the first plurality of target RNA, and (ii) generating at least a second plurality of target cDNA from the second plurality of target RNA (e.g., FIG. 20).
  • the first target cDNAs correspond to the first target RNA molecules.
  • the second target cDNAs correspond to the second target RNA molecules.
  • the method comprises generating at least 2-10,000 different target cDNA molecules that correspond to 2-10,000 different target RNA molecules.
  • the generating of step (b) comprises contacting the plurality of RNA inside the cellular sample with (i) a plurality of reverse transcription primers, (ii) a plurality of reverse transcriptase enzymes, and(iii) a plurality of nucleotides, under a condition suitable for conducting a reverse transcription reaction to generate a plurality of cDNA molecules (e.g., a plurality of first strand cDNA molecules) in the cellular sample.
  • a plurality of reverse transcription primers e.g., a plurality of reverse transcriptase enzymes
  • a plurality of nucleotides e.g., a plurality of nucleotides
  • the plurality of reverse transcription primers comprises a first sub -population of target-specific reverse transcription primers that hybridize selectively to the first target RNA, and comprises a second sub -population of target-specific reverse transcription primers that hybridize selectively to the second target RNA.
  • the plurality of reverse transcription primers comprises a first sub -population of random-sequence reverse transcription primers that hybridize to the first target RNA, and comprises a second sub - population of random -sequence reverse transcription primers that hybridize to the second target RNA.
  • the methods comprise step (c): generating inside the cellular sample a plurality of DNA concatemers which correspond to the first and second plurality of target RNA molecules, comprising: (1) generating a first plurality of covalently closed circular padlock probes by contacting the first plurality of target cDNA with a first plurality of padlock probes, wherein the contacting is conducted under a condition suitable for hybridizing the first and second binding arms of the first padlock probes to proximal positions on their respective first target cDNA molecules to form a first plurality of circularized padlock probes each having a nick or gap between the hybridized first and second binding arms, wherein the first padlock probes include a (i) a first target barcode sequence (target BC-1) that uniquely identifies the first target RNA, (ii) a first batch-specific sequencing primer binding site (Batch Seq-1) (or a complementary sequence thereof), and (iii) a universal binding site for an amplification primer (universal
  • the rolling circle amplification reaction can be conducted in the presence or absence of a plurality of compaction oligonucleotides.
  • the method comprises contacting the plurality of cDNA molecule in the cellular sample with at least 2-10,000 different target-specific padlock probes.
  • the first padlock probe further comprises a universal compaction oligonucleotide binding site (or a complementary sequence thereof).
  • the closing the nick in the first circularized padlock probes comprises conducting an enzymatic ligation reaction.
  • closing the gap in the first circularized padlock probes comprises conducting a polymerase-catalyzed fill-in reaction using the first target cDNA molecule as a template, and conducting an enzymatic ligation reaction.
  • each concatemer molecule in the first plurality comprises tandem repeat units, wherein a unit comprises the sequence of the first target cDNA and (i) the first target barcode sequence (target BC-1) that uniquely identifies the first target RNA, (ii) the first batch-specific sequencing primer binding site (Batch Seq-1) (or a complementary sequence thereof), and (iii) the universal binding site for an amplification primer (universal RCA) (or a complementary sequence thereof).
  • the unit further comprises the universal compaction oligonucleotide binding site (or a complementary sequence thereof).
  • step (c) further comprises: (1) generating a second plurality of covalently closed circular padlock probes by contacting the second plurality of target cDNA with a second plurality of padlock probes, wherein the contacting is conducted under a condition suitable for hybridizing the first and second binding arms of the second padlock probes to proximal positions on their respective second target cDNA moleculesto form a second plurality of circularized padlock probes each having a nick or gap between the hybridized first and second binding arms, wherein the second padlock probes include a (i) a second barcode sequence (target BC-2) that uniquely identifies the second target RNA, (ii) a second batch -specific sequencing primer binding site (Batch Seq-2) (or a complementary sequence thereof) wherein the sequence of the second batch-specific sequencing primer binding site differs from the sequence of the first batch-specific sequencing primer binding site, and (iii) the universal binding site for an amplification primer (universal RCA) (ora complementary
  • the rolling circle amplification reaction can be conducted in the presence or absence of a plurality of compaction oligonucleotides.
  • the method comprises contacting the plurality of cDNA molecule in the cellular sample with at least 2 - 10,000 different target-specific padlock probes.
  • the second padlock probe further comprises a universal compaction oligonucleotide binding site (or a complementary sequence thereof).
  • the closing the nick in the second circularized padlock probes comprises conducting an enzymatic ligation reaction.
  • closing the gap in the second circularized padlock probes comprises conducting a polymerase-catalyzed fill- in reaction using the second target cDNA molecule as a template, and conducting an enzymatic ligation reaction.
  • the method comprises closing the nick or gap in at least 2-10,000 circularized target-specific padlock probes by conducting an enzymatic reaction, thereby generating at least 2-10,000 covalently closed circular padlock probes inside the cellular sample.
  • each concatemer molecule in the second plurality comprises tandem repeat units, wherein a unit comprises the sequence of the second target cDNA and (i) the second target barcode sequence (target BC-2) that uniquely identifies the second target RNA, (ii) the second batch-specific sequencing primer binding site (Batch Seq-2) (or a complementary sequence thereof), and (iii) the universal binding site for an amplification primer (universal RCA) (or a complementary sequence thereof).
  • the unit further comprises the universal compaction oligonucleotide binding site (or a complementary sequence thereof).
  • the method further comprise step (d): generating inside the cellular sample a plurality of DNA concatemers which correspond to the first nd second plurality of target polypeptide molecules, comprising: (1) generating a third plurality of covalently closed circular padlock probes by contacting the first plurality of target polypeptides with a first plurality of antib ody-oligonucleotide conjugate molecules each comprising a first antibody linked to a first oligonucleotide carrying a first tag sequence (Tag-1) and a second tag sequence (Tag-2) wherein the first and second tag sequences uniquely identify the first antibodies that selective bind the first target polypeptides (e g., FIG.
  • the contacting is conducted under a condition suitable for selectively bin ding the first antibody - oligonucleotide conjugate molecules to the first target polypeptides; (2) contacting the first oligonucleotides (of the first antib ody-oligonucleotide conjugate molecules) with a third plurality of padlock probes, wherein the contacting is conducted under a condition suitable for hybridizing the first and second binding arms of the third padlock probes to proximal positions on their respective first oligonucleotides to form a third plurality of circularized padlock probes each having a nick or gap between the hybridized first and second binding arms, wherein the third padlock probes include a (i) a sequence that binds the first tag sequence of the first oligonucleotide, (ii) a third target barcode sequence (target BC-3) that uniquely identifies the first antibody-oligonucleotide conjugate which selectively binds the first target polypeptide,
  • the rolling circle amplification reaction can be conducted in the presence or absence of a plurality of compaction oligonucleotides.
  • the third padlock probe further comprises a universal compaction oligonucleotide binding site (or a complementary sequence thereof).
  • the closing the nick in the third circularized padlock probes comprises conducting an enzymatic ligation reaction. In some embodiments, closing the gap in the third circularized padlock probes comprises conducting a polymerase-catalyzed fill-in reaction using the first oligonucleotide as a template, and conducting an enzymatic ligation reaction. In some embodiments, the method comprises closing the nick or gap in at least 2-10,000 circularized padlock probes by conducting an enzymatic reaction, thereby generating at least 2- 10, 000 covalently closed circular padlock probes inside the cellular sample.
  • each concatemer molecule in the third plurality comprises tandem repeat units, wherein aunit comprises (i) the sequence that binds the first tag sequence of the first oligonucleotide, (ii) the third target barcode sequence (target BC -3) that uniquely identifies the first antibody -oligonucleotide conjugate which selectively binds the first target polypeptide, (iii) the first batch-specific sequencing primer binding site (Batch Seq-1) (or a complementary sequence thereof), (iv) the universal binding site for an amplification primer (universal RCA) (or a complementary sequence thereof), and (v) the sequence that bind the second tag sequence of the first oligonucleotide.
  • the unit further comprises the universal compaction oligonucleotide binding site (or a complementary sequence thereof).
  • step (d) further comprises: (1) generating a fourth plurality of covalently closed circular padlock probes by contacting the second plurality of target polypeptides with a second plurality of antibody -oligonucleotide conjugate molecules each comprising a second antibody linked to a second oligonucleotide carrying a third tag sequence (Tag-3) and a fourth tag sequence (Tag-4) wherein the third and fourth tag sequences uniquely identify the secondantibodies that selective bind the second target polypeptides (e g., FIG.
  • the contacting is con ducted under a condition suitable for selectively binding the second antibody -oligonucleotide conjugate molecules to the second target polypeptides; (2) contacting the second oligonucleotides (of the second antibody -oligonucleotide conjugate molecules) with a fourth plurality of padlock probes, wherein the contacting is conducted under a condition suitable for hybridizing the first and second binding arms of the fourth padlock probes to proximal positions on their respective second oligonucleotides to form a fourth plurality of circularized padlock probes each having a nick or gap between the hybridized first and second binding arms, wherein the fourth padlock probes include a (i) sequence that binds the third tag sequence of the second oligonucleotide, (ii) a fourth target barcode sequence (target BC-4) that uniquely identifies the second antib ody-oligonucleo tide conjugate which selectively binds the second target polypeptide, (iii
  • the rolling circle amplification reaction can be conducted in the presence or absence of a plurality of compaction oligonucleotides.
  • the fourth padlock probe further comprises a universal compaction oligonucleotide binding site (or a complementary sequence thereof).
  • the closing the nick in the fourth circularized padlock probes comprises conducting an enzymatic ligation reaction. In some embodiments, closing the gap in the fourth circularized padlock probes comprises conducting a polymerase-catalyzed fill-in reaction usingthe second oligonucleotide as a template, and conducting an enzymatic ligation reaction. In some embodiments, the method comprises closing the nick or gap in at least 2 -10,000 circularized padlock probes by conducting an enzymatic reaction, thereby generating at least 2-10,000 covalently closed circular padlock probes inside the cellular sample.
  • each concatemer molecule in the fourth plurality comprises tandem repeat units, wherein a unit comprises (i) the sequence that binds the third tag sequence of the second oligonucleotide, (ii) the fourth target barcode sequence (target BC-4) that uniquely identifies the second antib ody- oligonucleotide conjugate which selectively binds the second target polypeptide, (iii) the second batch-specific sequencing primer binding site (Batch Seq-2) (or a complementary sequence thereof), (iv) the universal binding site for an amplification primer (universal RCA) (or a complementary sequence thereof), and (v) the sequence that bind the fourth tag sequence of the second oligonucleotide.
  • the unit further comprises the universal compaction oligonucleotide binding site (or a complementary sequence thereof).
  • the methods further comprise step (e): sequencingthe first and third plurality of concatemer molecules inside the cellular sample under a condition that inhibits sequencingthe second and fourth plurality of concatemers (e.g., FIG. 24).
  • step (e) comprises sequencing the first plurality of concatemers inside the cellular sample comprises conducting no more than 2 -30 sequencing cycles to generate a plurality of first sequencing read products, wherein the sequences of the first sequencing read products are aligned with a first target reference sequence to confirm the presence of the first target RNA in the cellular sample.
  • step (e) comprises sequencing the first plurality of concatemers inside the cellular sample comprises conducting 1 -250 sequencing cycles to generate a plurality of first sequencing read products, wherein the sequences of the first sequencing read products are aligned with a first target reference sequence to confirm the presence of the first target RNA in the cellular sample.
  • step (e) comprises sequencing the third plurality of concatemers inside the cellular sample comprises conducting no more than 2-30 sequencing cycles to generate a plurality of second sequencing read products, wherein the sequences of the second sequencing read products are aligned with a second target reference sequence to confirm the presence of the first target polypeptides in the cellular sample.
  • step (e) comprises sequencing the third plurality of concatemers inside the cellular sample comprises conducting 1 -250 sequencing cycles to generate a plurality of second sequencing read products, wherein the sequences of the second sequencing read products are aligned with a second target reference sequence to confirm the presence of the first target polypeptides in the cellular sample.
  • step (e) in the first concatemer molecules, only the first target barcode region (target BC-1) is sequenced. In some embodiments, in the first concatemer molecules, at least a portion or the full length of the first target barcode (target BC-1) is sequenced. In some embodiments, in the first concatemer molecules, the first target barcode (target BC-1) is sequenced and a portion of the first cDNA region is sequenced.
  • step (e) in the third concatemer molecules, only the third target barcode region (target BC-3) is sequenced. In some embodiments, in the third concatemer molecules, at least a portion or the full length of the third target barcode (target BC-3) is sequenced. In some embodiments, in the third concatemer molecules, the third target barcode (target BC-3) is sequenced and a portion of the first oligonucleotide is sequenced.
  • the sequencing of step (e) comprises sequencing at least a portion of the firstand third nucleic acid concatemers using an optical imaging system comprising a field-of-view (FOV) greater than 1.0 mm 2 .
  • FOV field-of-view
  • the plurality of first and second sequencing read products are detectable by imaging, and wherein the sequencing comprises decoding the plurality of first and second sequencing read products from the images obtained during the no more than 2-30 sequencing cycles, or from the images obtained during the 1 -250 sequence cycles.
  • the plurality of the first and second sequencing read products are detectable by imaging, and wherein the sequencing comprises simultaneously imaging the plurality of first and second detectable sequencing read products in the cellular sample (co-localization of the first and second sequencing read products).
  • the methods further comprise step (f): removingthe plurality of first sequencing read products from the first concatemer molecules and retaining the first concatemer molecules inside the cellular sample, and removing the plurality of second sequencing read products from the third concatemer molecules and retaining the third concatemer molecules inside the cellular sample.
  • a 3 ’ blocking moiety can be added to the first and second sequencing read products to inhibit further sequencing reactions.
  • a nucleotide analog can be incorporated where the nucleotide analog inhibits incorporation of a subsequent nucleotide.
  • Non-limiting example blocking nucleotide analogs include dideoxynucleotide or a nucleotide having a 2’ or 3 ’ chain terminating moiety.
  • the methods further comprise step (g): reiteratively sequencing the plurality of first and third concatemers by repeating steps (e) and (f) at least once.
  • the repeating comprises repeating at least 2 times, 5 times, 10 times, 15 times, 20 times, 25 times, 30 times, 35 times, 40 times, 45 times, or at least 50 times.
  • the sequencing the first concatemers of step (g) comprises step (1) contacting the first plurality of concatemer molecules inside the cellular sample with (i) a plurality of first batch -specific sequencing primers, (ii) a plurality of sequencing polymerases, and (iii) a plurality of nucleotide reagents, under a condition suitable for hybridizing the plurality of first batch -specific sequencing primers to their respective first batch -specific sequencing primer binding sites on the first concatemers.
  • the sequencing further comprises step (2) conducting no more than 2-30 sequencing cycles to generate a first plurality of sequencing read products.
  • the sequencing further comprises step (3) removing the first plurality of sequencing read products from the concatemers and retaining the plurality of concatemers inside the cellular sample.
  • the sequencing further comprises step (4) repeating steps (1) - (3) atleast once (e.g., FIG. 21).
  • step (4) comprises repeating steps (1) - (3) at least 2 times, at least 3 times, at least 4 times, at least 5 times, at least 6 times, at least 7 times, at least 8 times, at least 9 times, or at least 10 times.
  • step (4) comprises repeating steps (1) - (3) up to 10 times, up to 20 times, up to 30 time, up to 40 times, or up to 50 times.
  • the reiterative sequencing of the first concatemers of step (g) can be conducting using a sequencing-by-binding procedure, labeled and/or non-labeled chainterminating nucleotides, or multivalent molecules. Descriptions of these three sequencing methods is described below.
  • the sequencing the third concatemers of step (g) comprises step (1) contacting the third plurality of concatemer molecules inside the cellular sample with (i) a plurality of third batch-specific sequencing primers, (ii) a plurality of sequencing polymerases, and (iii) a plurality of nucleotide reagents, under a condition suitable for hybridizing the plurality of third batch-specific sequencing primers to their respective third batch-specific sequencing primer binding sites on the third concatemers.
  • the sequencing further comprises step (2) conducting no more than 2-30 sequencing cycles to generate a second plurality of sequencing read products.
  • the sequencing further comprises step (3) removing the second plurality of sequencing read products from the concatemers and retaining the plurality of concatemers inside the cellular sample.
  • the sequencing further comprises step (4) repeating steps (1) - (3) at least once.
  • step (4) comprises repeating steps (1) - (3) atleast2 times, atleast 3 times, at least 4 times, at least 5 times, at least 6 times, at least 7 times, at least 8 times, at least 9 times, or at least 10 times.
  • step (4) comprises repeating steps (1) - (3) up to 10 times, up to 20 times, up to 30 time, up to 40 times, or up to 50 times.
  • the reiterative sequencing of the third concatemers of step (g) can be conducting using a sequencing-by-binding procedure, labeled and/or non-labeled chainterminating nucleotides, or multivalent molecules. Descriptions of these three sequencing methods is described below.
  • the plurality of universal sequencing primers can be hybridized to concatemer template molecules with a hybridization reagent comprising an SSC buffer (e.g., 2X saline-sodium citrate) buffer with formamide (e.g., 10-20% formamide).
  • SSC buffer e.g., 2X saline-sodium citrate
  • formamide e.g., 10-20% formamide.
  • the hybridization conditions comprise a temperature of about20-30°C, for about 10-60 minutes.
  • the plurality of sequencing read products can be removed from the concatemers and the plurality of concatemers can be retained inside the cellular sample using a de-hybridization reagent comprising an SSC buffer (e.g., saline-sodium citrate) buffer, with or without formamide, at a temperature that promotes nucleic acid denaturation such as for example 30 - 90°C.
  • SSC buffer e.g., saline-sodium citrate
  • the methods further comprise step (h): sequencing the second and fourth plurality of concatemer molecules inside the cellular sample under a condition that inhibits sequencing the first and third plurality of concatemers (e.g., FIG. 25).
  • step (h) comprises sequencing the second plurality of concatemers inside the cellular sample comprises conducting no more than 2-30 sequencing cycles to generate a plurality of third sequencing read products, wherein the sequences of the third sequencing read products are aligned with a third target reference sequence to confirm the presence of the second target RNA in the cellular sample.
  • step (h) comprises sequencing the second plurality of concatemers inside the cellular sample comprisesconducting 1 -250 sequencing cycles to generate a plurality of third sequencing read products, wherein the sequences of the third sequencing read products are aligned with a third target reference sequence to confirm the presence of the second target RNA in the cellular sample.
  • step (h) comprises sequencing the fourth plurality of concatemers inside the cellular sample comprises conducting no more than 2-30 sequencing cycles to generate a plurality of fourth sequencing read products, wherein the sequences of the fourth sequencing read products are aligned with a fourth target reference sequence to confirm the presence of the second target polypeptides in the cellular sample.
  • step (h) comprises sequencingthe fourth plurality of concatemers inside the cellular sample comprises conducting 1-250 sequencing cycles to generate a plurality of fourth sequencing read products, wherein the sequences of the fourth sequencing read products are aligned with a fourth target reference sequence to confirm the presence of the second target polypeptides in the cellular sample.
  • step (h) in the second concatemer molecules, only the second target barcode region (targetBC-2) is sequenced. In some embodiments, in the second concatemer molecules, at least a portion or the full length of the second target barcode (target BC-2) is sequenced. In some embodiments, in the second concatemer molecules, the second target barcode (targetBC-2) is sequenced and a portion of the second cDNA region is sequenced.
  • step (h) in the fourth concatemer molecules, only the fourth target barcode region (targetBC-4) is sequenced. In some embodiments, in the fourth concatemer molecules, at least a portion or the full length of the fourth target barcode (target BC-4) is sequenced. In some embodiments, in the fourth concatemer molecules, the fourth target barcode (targetBC-4) is sequenced and a portion of the second oligonucleotide is sequenced. [0205] In some embodiments, the sequencing of step (h) comprises sequencing at least a portion of the second and fourth nucleic acid concatemers using an optical imaging system comprising a field-of-view (FOV) greater than 1.0 mm 2 .
  • FOV field-of-view
  • the plurality of third and fourth sequencing read products are detectable by imaging, and wherein the sequencing comprises decoding the plurality of third and fourth sequencing read products from the images obtained during the no more than 2-30 sequencing cycles, or from the images obtained during the 1 -250 sequencing cycles
  • the plurality of the third and fourth sequencing read products are detectable by imaging, and wherein the sequencing comprises simultaneously imaging the plurality of third and fourth detectable sequencing read products in the cellular sample (co-localization of the third and fourth sequencing read products).
  • the methods further comprise step (i): removing the plurality of third sequencing read products from the second concatemer moleculesand retaining the second concatemer molecules inside the cellular sample, and removing the plurality of fourth sequencing read products from the fourth concatemer molecules and retaining the fourth concatemer molecules inside the cellular sample.
  • a 3’ blocking moiety can be added to the third and fourth sequencing read products to inhibit further sequencing reactions.
  • a nucleotide analog can be incorporated where the nucleotide analog inhibits incorporation of a subsequent nucleotide.
  • Non-limiting example blocking nucleotide analogs include dideoxynucleotide or a nucleotide having a 2’ or 3 ’ chain terminating moiety.
  • the methods further comprise step (j): reiteratively sequencing the plurality of second and fourth concatemers by repeating steps (h) and (i) at least once.
  • the repeating comprises repeating at least 2 times, 5 times, 10 times, 15 times, 20 times, 25 times, 30 times, 35 times, 40 times, 45 times, or at least 50 times.
  • the plurality of nucleotide reagents comprises a plurality of nucleotides that are detectably labeled or non -labeled.
  • individual nucleotides are linked to a detectable reporter moiety.
  • the detectable reporter moiety comprises a fluorop hore.
  • the plurality of detectably labeled nucleotide analogs comprise a plurality of chain terminating nucleotides, where the chain terminating moiety is linked to the 3 ’ nucleotide sugar position to form a 3’ blocked nucleotide analog.
  • the chain terminating moiety canbe removed to convert the 3 ’ blocked nucleotide analog to an extendible nucleotide having a 3 ’ OH group on the sugar.
  • the labeled nucleotide analogs are linked to a different fluorophore that corresponds to the nucleobases adenine, cytosine, guanine, thymine or uracil, where the different fluoroph ores emit a fluorescent signal during sequencing.
  • a sequencing cycle comprises (1) contactingthe concatemer/sequencing primer duplex with a sequencing polymerase and a detectably labeled chain terminating nucleotide under a condition suitable for polymerase-catalyzed incorporation of the detectably labeled chain terminating nucleotide into the terminal end of the sequencing primer, (2) detecting and imaging the fluorescent signal and color emitted by the incorporated chain terminating nucleotide, and (3) removing the chain terminating moiety (e.g., unblocking) and retaining the concatemer/sequencing primer duplex
  • the sequencing cycles are conducted on the plurality of concatemers inside the cellular sample to generate a plurality of sequencing read products.
  • the sequence of the sequencing read product can be determined and aligned with a reference sequence to confirm the presence of target RNA molecules inside the cellular sample. In some embodiments, the sequence of the sequencing read product can be determined and aligned with a reference sequence to confirm the presence of target polypeptides inside the cellular sample. In some embodiments, the sequences of the sequencing read products can be aligned after each round of generating the sequencing read products, or after generating a set of reiterative sequencing read products. In some embodiments, the sequencing reactions are conducted on a sequencing apparatus having a detector that captures fluorescent signals from the sequencing reactions inside the cellular sample.
  • the sequencing apparatus can be configured to relay the fluorescent signal data captured by the detector to a computer system that is programmed to display images of different fluorescent spots which are co-located in the cellular sample, where individual fluorescent spots correspond to different target RNA molecules or different target polypeptides.
  • a computer system that is programmed to display images of different fluorescent spots which are co-located in the cellular sample, where individual fluorescent spots correspond to different target RNA molecules or different target polypeptides.
  • the images can have different color fluorescent spots co-located in the same cellular sample at different sequencing cycles.
  • out-of-sync phasing and/or pre-phasing events can occur during synchronized sequencing reactions on clonally amplified template amplicons, where the sequencing reactions comprise polymerase-catalyzed sequencing reactions employing detectably labeled chain terminator nucleotides.
  • a sequencing reaction on one template molecule in the clonally-amplified template molecules moves ahead (e.g., pre-phasing) or fall behind (e.g., phasing) of the sequencing of the other template molecules within the clonally-amplified template molecules.
  • a fluorescent signal is typically detected which corresponds to incorporation of a labeled chain terminator nucleotide.
  • phasing and pre-phasing events can be detected and monitored using incorporation of a labeled chain terminator nucleotide.
  • the plurality of nucleotide reagents comprises a plurality of multivalent molecules each comprising a core attached to a plurality of nucleotide-arms, wherein the nucleotide-arms are attached to a nucleotide unit.
  • individual multivalent molecules are labeled with a detectably reporter moiety.
  • the detectable reporter moiety comprises a fluorophore.
  • the core of the multivalent molecule is labeled with a fluorophore, and wherein the fluorophore which is attached to a given core of the multivalent molecule corresponds to the nucleotide base (e.g., adenine, guanine, cytosine, thymine or uracil) of the nucleotide arm.
  • a fluorophore which is attached to a given core of the multivalent molecule corresponds to the nucleotide base (e.g., adenine, guanine, cytosine, thymine or uracil) of the nucleotide arm.
  • At least one of the nucleotide arms of the multivalent molecule comprises a linker and/or nucleotide base that is attached to a fluorophore, and wherein the fluorophore which is attached to a given nucleotide base corresponds to the nucleotide base (e.g., adenine, guanine, cytosine, thymine or uracil) of the nucleotide arm.
  • the nucleotide base e.g., adenine, guanine, cytosine, thymine or uracil
  • a sequencing cycle comprises (1) contacting the concatemer/sequencing primer duplex with a first sequencing polymerase to form a complexed polymerase, (2) contacting the complexed polymerase with a detectably labeled multivalent molecule under a condition suitable for binding a complementary nucleotide unit of the multivalent molecule to the complexed polymerase thereby forming a multivalent-binding complex, and the condition is suitable for inhibiting incorporation of the complementary nucleotide unit into the terminal end of the sequencing primer, (3) detecting and imaging the fluorescent signal and color emitted by the bound detectably labeled multivalent molecule, (4) removing the first sequencing polymerase and the bound detectably labeled multivalent molecule, and retaining the concatemer/sequencing primer duplex, (5) contacting the retained concatemer/sequencing primer duplex with a second sequencing polymerase and a non-labeled chain terminating nucleotide under a condition suitable for polymerase -cat
  • individual cycle times can be achieved in less than 30 minutes.
  • the field of view (FOV) can exceed 1 mm 2 and the cycle time for scanning large area (> 10 mm 2 ) can be less than 5 minutes.
  • no more than 2-30 sequencing cycles or 1-250 sequencing cycles are conducted on the plurality of concatemers inside the cellular sample to generate a plurality of sequencing read products.
  • the sequence of the sequencing read product can be determined and aligned with a reference sequence to confirm the presence of the target RNA molecules inside the cellular sample.
  • the sequence of the sequencing read product can be determined and aligned with a reference sequence to confirm the presence of the target polypeptides inside the cellular sample.
  • the sequences of the sequencing read products can be aligned after each round of generating the sequencing read products, or after generating a set of reiterative sequencing read products.
  • the sequencing reactions are conducted on a sequencing apparatus having a detectorthat captures fluorescent signals from the sequencing reactions inside the cellular sample.
  • the sequencing apparatus can be configured to relay the fluorescent signal data captured by the detector to a computer system that is programmed to display images of different fluorescent spots which are co-located in the cellular sample, where individual fluorescent spots correspond to different target RNAmolecules or different target polypeptides.
  • steps (2) and (3) can be conducted at a gentle temperature of about 35 - 45°C, or about 39 - 42°C.
  • steps (2) and (3) can be conducted at a gentle temperature which can help retain the compact size and shape of a DNA nanoball during multiple sequencing cycles (e g., up to 30 cycles) which can improveFWHM (full width half maximum) of a spot image of the DNA nanoball inside a cellular sample.
  • the DNA nanoball does not unravel during multiple sequencing cycles.
  • the spot image of the DNA nanoball does not enlarge during multiple sequencing cycles.
  • the spot image of the DNA nanoball remains a discrete spot during multiple sequencing cycles.
  • the spot image can be represented as a Gaussian spot and the size can be measured as a FWHM.
  • a smaller spot size as indicated by a smaller FWHM typically correlates with an improved image of the spot.
  • the FWHM of a nanoball spot can be about 10 um or smaller.
  • out-of-sync phasing and/or pre-phasing events can occur during synchronized polymerase -catalyzed sequencing reactions employing detectably labeled multivalent molecules.
  • a fluorescent signal can be detected which corresponds to binding of complementary nucleotide unit of a multivalent molecule to the complexed polymerase thereby forming a multivalent-binding complex.
  • phasing and prephasing events can be detected and monitored using binding of labeled multivalent molecules.
  • the phasing and/or pre-phasing rate when conducting up to 30 sequencing cycles with detectably labeled multivalent molecules, can be less than about 5%, or less than about 1%, or less than about 0.01%, or less than about 0.001%.
  • the phasing and/or pre-phasing rates for conducting up to 30 sequencing cycles using labeled chain terminator nucleotides can be about 5%.
  • the biological sample comprises a cellular organelle, a cell, a whole cell, a group of whole cells, a tissue, an intact tissue, a tumor, an intact tumor, an organ, an organism, or any combination thereof.
  • the biological sample may be immobilized on a surface.
  • the cellular sample comprises a whole cell, a plurality of whole cells, an intact tissue or an intact tumor.
  • the cellular sample comprises a fresh cellular sample, a freshly -frozen cellular sample, a sectioned cellular sample, or an FFPE cellular sample.
  • the cellular sample comprise one or more living cells or non-living cells.
  • the cellular sample can be obtained from a virus, fungus, prokaryote or eukaryote. In some embodiments, the cellular sample can be obtained from an
  • the cellular sample comprises one or more virally -infected cells.
  • the cellular sample can be obtained from any organism including human, simian, ape, canine, feline, bovine, equine, murine, porcine, caprine, lupine, ranine, piscine, plant, insect bacteria, algae sample, viral sample, protozoa, or fungal.
  • the cellular sample can be obtained from any organ including head, neck, brain, breast, ovary, cervix, colon, rectum, endometrium, gallbladder, intestines, bladder, prostate, testicles, liver, lung, kidney, esophagus, pancreas, thyroid, pituitary, thymus, skin, heart, larynx, or other organs.
  • the cellular sample harbors a plurality of RNA which include target RNA and non -target RNA.
  • cells typically produce RNA by gene expression which includes transcription of DNA (e.g., genomic DNA) into RNA molecules.
  • the transcribed RNA can undergo splicing or may not be spliced.
  • the transcribed RNA can be translated into a polypeptide (e.g., coding RNA), or do not undergo translation but can be processed into tRNA or rRNA (e g., non-coding RNA).
  • the plurality of RNA harbored by the cellular sample includes target and non-targetRNA.
  • the plurality of RNA harbored by the cellular sample comprises wild type RNA, mutant RNA or splice variant RNA.
  • the plurality of RNA harbored by the cellular sample comprises pre-spliced RNA, partially spliced RNA, or fully spliced RNA.
  • the plurality of RNA harbored by the cellular sample comprises coding RNA, non-coding RNA, mRNA, tRNA, rRNA, microRNA (miRNA), mature microRNA, or immature microRNA.
  • the plurality of RNA harbored by the cellular sample comprises housekeeping RNA, cell -specific RNA, tissuespecific RNA or disease-specific RNA. In some embodiments, the plurality of RNA harbored by the cellular sample comprises RNA expressed by one or more cells in response to a stimulus such as heat, light, a chemical or a drug. In some embodiments, the plurality of RNA harbored by the cellular sample comprises RNA found in healthy cells or diseased cells. In some embodiments, the plurality of RNA harbored by the cellular sample comprises RNA transcribed from transgenic DNA sequences that are introduced into the cellular sample using recombinant DNA procedures.
  • the RNA can be transcribed from a transgenic DNA sequence that is controlled by an inducible or constitutive promoter sequence.
  • the plurality of RNA harbored by the cellular sample comprises RNA that is transcribed from DNA sequences that are not transgenic.
  • the cellular sample can be cultured on the support.
  • the methods comprise culturing the cellular sample on the support under a condition suitable for expanding the cellular sample for 2-10 generations or more.
  • the cultured cellular sample can generate a colony of cells.
  • the methods comprise culturing the cellular sample to confluence or non -confluence.
  • the methods comprise culturing the cellular sample on the support in a simple or complex cell culture media.
  • the cell culture media comprises D -MEM high glucose (e.g., from Thermo Fisher Scientific, catalogNo.
  • fetal bovine serum e g., 10% FBS; for example from Thermo Fisher Scientific, catalogNo. A3160402
  • MEM non- essential amino acids e.g., 0.1 mMMEM, for example from ThermoFisher Scientific, catalog No. 11140050
  • L-glutamine e g., 6 mM L-glutamine, for example from Thermo Fisher Scientific, catalogNo. A2916801
  • MEM sodium pyruvate e.g., 1 mM sodium pyruvate, for example from Thermo Fisher Scientific, catalogNo.
  • the methods comprise culturing the cellular sample at a humidity and temperature that is suitable for culturing the cell(s) on the support.
  • suitable conditions comprise approximately 37 °C with a humidified atmosphere of approximately 5-10% carbon dioxide in air.
  • the cellular sample can be cultured with suitable aeration with oxygen and/or nitrogen.
  • simple cell media refers to a cell media that typically lacks ingredients to support cell growth and/or proliferation in culture.
  • Simple cell media can be used for example to wash, suspend, or dilute the cellular sample.
  • Simple cell media can be mixed with certain ingredients to prepare a cell media that can support cell growth and/or proliferation in culture.
  • a simple cell media comprises any one or any combination of two or more of a buffer, a phosphate compound, a sodium compound, a potassium compound, a calcium compound, a magnesium compound and/or glucose.
  • the simple cell media comprises PBS (phosphate buffered saline), DPBS (Dulbecco’s phosp hate-buffered saline), HBSS (Hank’s balanced salt solution), DMEM (Dulbecco’s Modified Eagle’s Medium), EMEM (Eagle’s Minimum Essential Medium), and/or EBSS.
  • the cellular sample can be placed in a simple cell media prior to or during the step of conducting any of the nucleic acid methods described herein.
  • complex cell media refers to a cell media that can be used to support cell growth and/or proliferation in culture without supplementation or additives.
  • Complex cell media can include any combination of two or more of a buffering system (e g , HEPES), inorganic salt(s), amino acid(s), protein(s), polypeptide(s), carbohydrate(s), fatty acid(s), lipid(s), purine(s) and their derivatives (e.g., hypoxanthine), pyrimidine(s) and their derivatives, and/or trace element(s).
  • Complex cell media includes fluids obtained from a fluid or tissue extract.
  • Complex cell media includes artificial cell media.
  • complex cell media canbe a serum -containing media, for example complex cell media includes fluids such as fetal bovine serum, blood plasma, blood serum, lymph fluid, human placental cord serum and amniotic fluid.
  • complex cell media can be a serum-free media, which are typically (but not necessarily) defined cell culture media.
  • complex cell media can be a chemically -defined media which typically (but not necessarily) include recombinant polypeptides, and ultra -pure inorganic and/or organic compounds.
  • complex cell media can be a protein-free media which include for example MEM (minimal essential media) and RPMI-1640 (Roswell Park Memorial Institute).
  • the complex cell media comprises IMDM (Iscove’s Modified Dulbecco’s Medium. In some embodiments, the complex cell media comprises DMEM (Dulbecco’s Modified Eagle’s Medium). In some embodiments, the cellular sample can be placed in a complex cell media prior to or during the step of conducting any of the nucleic acid methods described herein.
  • the cellular sample comprises a fixed cellular sample.
  • the cellular sample can be treated with a fixation reagent (e.g., a fixing reagent) that preserves the cell and its contents to inhibit degradation and can inhibit cell lysis.
  • a fixation reagent e.g., a fixing reagent
  • the fixation reagent can preserve RNA harbored by the cellular sample.
  • the fixation reagent inhibits loss of nucleic acids from the cellular sample.
  • the fixation reagent can cross-link the RNA to prevent the RNA from escaping the cellular sample.
  • a cross -linking fixation reagent comprises any combination of an aldehyde, formaldehyde, paraformaldehyde, formalin, glutaraldehyde, imidoesters, N-hydroxysuccinimide esters (NHS) and/or glyoxal (a bifunctional aldehyde).
  • the fixation reagent comprises at least one alcohol, including methanol or ethanol. In some embodiments, the fixation reagent comprises at least one ketone, including acetone. In some embodiments, the fixation reagent comprises acetic acid, glacial acetic acid and/or picric acid. In some embodiments, the fixation reagent comprises mercuric chloride. In some embodiments, the fixation reagent comprises a zinc salt comprising zinc sulphate or zinc chloride. In some embodiments, the fixation reagent can denature polypeptides. [0228] In some embodiments, the fixation reagent comprises 4% w/v of paraformaldehyde to water/PBS.
  • the fixation reagent comprises 10% of 35% formaldehyde at a neutral pH. In some embodiments, the fixation reagent comprises 2% v/v of glutaraldehyde to water/PBS. In some embodiments, the fixation reagent comprises 25% of 37% formaldehyde solution, 70% picric acid and 5% acetic acid. [0229] In some embodiments, the cellular sample can be fixed on the support with 4% paraformaldehyde for about 30-60 minutes and washed with PBS.
  • the cellular sample can be stained, de -stained or un-stained.
  • the cellular sample comprises a permeabilized cellular sample.
  • the methods comprise treating the cellular sample with a permeabilization reagent that alters the cell membrane to permit penetration of experimental reagents into the cells.
  • the permeabilization reagent removes membrane lipids from the cell membrane.
  • the cellular sample can be treated with a permeabilization reagent which comprises any combination of an organic solvent, detergent, chemical compound, cross-linking agent and/or enzyme.
  • the organic solvents comprise acetone, ethanol, and methanol.
  • the detergents comprise saponin, Triton X-100, Tween-20, sodium dodecyl sulfate (SDS), anN- lauroylsarcosine sodium salt solution, or a nonionic polyoxyethylene surfactant (e.g., NP40).
  • the cross-linking agent comprises paraformaldehyde.
  • the enzyme comprises trypsin, pepsin or protease (e g. proteinase K).
  • the cells can be permeabilized using an alkaline condition, or an acidic condition with a protease enzyme.
  • the permeabilization reagent comprises water and/or PBS.
  • the fixed cells can be permeabilized with 70% ethanol for about30-60 minutes, and the permeabilizing reagent can be exchanged with PBS-T (e.g., PBS with 0.05% Tween-20).
  • PBS-T e.g., PBS with 0.05% Tween-20
  • the cells can be post-fixed with 3% paraformaldehyde and 0.1% glutaraldehyde for about 30-60 minutes, and washed with PBS-T multiple times.
  • the cellular sample is infused with a swellable polyelectrolyte hydrogel (U.S. patentNo. 10,309,879 and Chen 2015 Science 347:543, the contents of these documents are incorporated by reference in their entireties).
  • a fixed and permeabilized cellular sample can be infused with sodium acrylate, acrylamide and a cross-linker N-N’-methylenebisacrylamide.
  • ammonium persulfate (APS) initiator and tetramethylethylenediamine (TEMED) accelerator were infused to achieve polymerization.
  • the cellular sample can be infused with proteinase K for proteolysis and incubated in a digestion buffer.
  • the gel inside the cellular sample can be swelled by addition of water.
  • the present disclosure provides methods for detecting in situ at least two different target RNA molecules in a cellular sample, which can include converting the plurality of RNAs inside cellular sample to cDNA.
  • the methods can comprise contacting the plurality of RNA inside the fixed and permeabilized cellular sample with (i) a plurality of reverse transcription primers, (ii) a plurality of reverse transcriptase enzymes, and (iii) a plurality of nucleotides, under a condition suitable for conducting a reverse transcription reaction to generate a plurality of cDNA molecules (e.g., a plurality of first strand cDNA molecules) in the cellular sample.
  • a plurality of reverse transcription primers e.g., a plurality of reverse transcriptase enzymes
  • a plurality of nucleotides e.g., a plurality of nucleotides
  • the method can comprise contacting the plurality of RNA inside the fixed and permeabilized cellular sampled with a plurality of reverse transcription primers. In some embodiments, the method can comprise contacting the plurality of RNA inside the fixed and permeabilized cellular sampled with a plurality of reverse transcriptase enzymes. In some embodiments, the method can comprise contacting the plurality of RNA inside the fixed and permeabilized cellular sampled with a plurality of nucleotides. In some embodiments, synthesis of second strand cDNA molecules may be omitted.
  • the present disclosure provides methods for conducting in situ analysis of a nucleic acid.
  • the analysis may comprise a sequencing of the nucleic acid.
  • the nucleic acid may be an RNA.
  • the analysis of the RNA may comprise reverse transcribing the RNA into a complementary DNA (cDNA).
  • cDNA complementary DNA
  • a reverse transcriptase maybe used to transcribe the RNA into a cDNA
  • Methods as described herein may comprise use of a reverse transcriptase.
  • the reverse transcriptase enzyme can exhibit RNA-dependent DNA polymerase activity.
  • the reverse transcriptase enzyme can comprise a reverse transcriptase enzyme from AMV (avian myeloblastosis virus), M-MMLV (moloney murine leukemia virus), or HIV (human immunodeficiency virus).
  • the reverse transcriptase enzyme can comprise a recombinant enzyme that exhibits reduced RNase H activity, for example REVERTAID (e.g., from Thermo Fisher Scientific, catalog No. EP0441).
  • the reverse transcriptase canbe a commercially -available enzyme, including MULTISCRIBE (e g., from Thermo Fisher Scientific, catalog# 4311235), THERMOSCRIPT (e g., from Thermo Fisher Scientific, catalog# 12236-014), or ARRAYSCRIPT (e.g., from Ambion, catalogNo. AM2048).
  • the reverse transcriptase enzyme can comprise SUPERSCRIPT II (e.g., catalogNo. 18064014), SUPERSCRIPT III (e.g., catalogNo. 18080044), or SUPERSCRIPT IV enzymes (e.g., catalog No. 18090010) (all SUPERSCRIPT enzymes from Invitrogen).
  • the reverse transcription reaction can include an RNase inhibitor. Further examples of reverse transcriptases can also be found in US Publication No. US20210139884, which is incorporated herein by reference in its entirety.
  • the reverse transcription primers comprise a single-stranded oligonucleotide comprising DNA, RNA, or chimeric DNA/RNA.
  • the reverse transcription primers Any combination of adenine (A), thymine (T), guanine (G), cytosine (C), uracil (U) and/or inosine (I).
  • the reverse transcription primers can be any length, for example 5 -25 bases, or 25-50 bases, or 50-75 bases, or 75-100 bases in length or longer.
  • the reverse transcription primers each comprise a 5’ end and 3’ end.
  • the 3’ end of the reverse transcription primerscan include a 3’ OH moiety which serves as a nucleotide polymerization initiation site in a polymerase -catalyzed primer extension reaction.
  • the 3 ’ end of the reverse transcription primers have a chain terminating moiety which blocks a polymerase -catalyzed primer extension reaction. The chain terminating moiety can be removed to convert the 3’ sugar position to an extendible 3 ’OH.
  • the reverse transcription primers are modified to confer resistance to nuclease degradation (e.g., ribonuclease degradation).
  • the reverse transcription primers comprise atleast one phosphorothioate diester bond at their 5’ ends which can render the reverse transcription primers resistant to nuclease degradation.
  • the reverse transcription primers comprise 2-5 or more consecutive phosphorothioate diester bonds at their 5’ ends.
  • the plurality of reverse transcription primers comprise at least one ribonucleotide and/or at least one 2’-O-methyl, 2’-O-methoxyethyl (MOE), 2’ fluorobase nucleotide.
  • the reverse transcription primers comprise phosphorylated 3 ’ ends. In some embodiments, the reverse transcription primers comprise locked nucleic acid (LNA) bases. In some embodiments, the reverse transcription primers comprise a phosphorylated 5’ end (e.g., using a polynucleotide kinase).
  • LNA locked nucleic acid
  • the entire length of a reverse transcription primer can hybridize to a portion of an RNA molecule.
  • individual reverse transcription primers comprise a 3’ region having a sequence that hybridizes to a portion of an RNA molecule and a 5’ region that carries a tail that does not hybridize to an RNA molecule.
  • the 5’ tail comprises a universal adaptor sequence including any one or any combination of two or more of a sample barcode sequence, an amplification primer binding site, a sequencing primer binding site, a compaction oligonucleotide binding site and/or a surface capture primer binding site.
  • the 5’ tail comprises a unique identification sequence (e.g., unique molecular index (UMI).
  • the 5 ’ tail comprises a restriction enzyme recognition sequence.
  • individual reverse transcription primers comprise at least a portion of the 3 ’ region having a homopolymer sequence, for example poly -A, poly-T, poly-C, poly-G or poly-U.
  • the reverse transcription primers can hybridize to any portion of an RNA molecule, including the 5’ or the 3’ end of the RNA molecule, or an internal portion of the RNA molecule.
  • the plurality of reverse transcription primers comprises a first sub -population of target-specific reverse transcription primers that hybridize selectively to the first target RNA (e.g., targeted transcriptomics). In some embodiments, the plurality of reverse transcription primers further comprise a second sub -population of target-specific reverse transcription primers that hybridize selectively to the second target RNA. In some embodiments, the target-specific reverse transcription primers comprise a pre-determined sequence at the 3’ region which hybridizes to a target RNA molecule. In some embodiments, the pre -determined sequence portion of the reverse transcription primers can be 4-20bases, or20-40bases, or40-50 bases in length.
  • the first sub -population of target-specific reverse transcription primers can selectively hybridize to an RNA transcribed in the cellular sample by a housekeeping gene.
  • selection of the housekeeping gene may be dependent upon the type of cellular sample to be used for the in situ methods described herein.
  • Non-limiting example housekeeping genes include glyceraldehyde-3 -phosphate dehydrogenase (GAPDH), beta-actins (ACTB), tubulins, PPIA (peptidyl-prolyl cis-trans isomerase), NME4 (NME/NM23 nucleoside diphosphate kinase 4), SMARCAL1 (SWI/SNF related matrix associated actin dependent regulator of chromatin, subfamily A like 1), and POMK (protein-O- mannose kinase).
  • GPDH glyceraldehyde-3 -phosphate dehydrogenase
  • ACTB beta-actins
  • tubulins tubulins
  • PPIA peptidyl-prolyl cis-trans isomerase
  • NME4 NME/NM23 nucleoside diphosphate kinase 4
  • SMARCAL1 SWI/SNF related matrix associated actin dependent regulator of chromatin, subfamily A like 1
  • the second sub-population of target-specific reverse transcription primers can selectively hybridize to an RNA transcribed from a gene that is expressed in the cellular sample being examined (e g., a cell-specific or tissue-specific RNA).
  • the plurality of reverse transcription primers comprises a first sub -population of random-sequence reverse transcription primers that hybridize to the first target RNA (e.g., whole transcriptomics). In some embodiments, the plurality of reverse transcription primers further comprises a second sub-population of random-sequence reverse transcription primers that hybridize to the second target RNA. In some embodiments, the reverse transcription primers comprise a random and/or degenerate sequence at the 3’ region which hybridizes to an RNA molecule. In some embodiments, the random -sequence or the degenerate-sequence portion of the reverse transcription primers can be 4 -20 bases, or 20-40 bases, or 40-50 bases in length.
  • a padlock probe for use in any of the methods as described herein.
  • a padlock probe may be designed to selectively detect a target molecule.
  • the padlock probe maybe designed to selectively detect a target nucleic acid or polypeptide.
  • the target nucleic acid may be a target DNA or a target RNA.
  • binding the padlock probe to the target nucleic acid orprotein/polypeptide one can learn information about one or more properties of the target nucleic acid or protein/poly peptide. For example, a location of the target nucleic acid or protein/polypeptide can be ascertained.
  • the padlock probe may comprise one or more batch-specific primer-binding sites, which are specific to a primer.
  • the primer may be spatially localized, and the padlock probe may therefore not require a barcode sequence.
  • the location of the target can give information aboutthe sample, e.g., cellular origin. In such instances, it may not be necessary for the padlock probe to carry a barcode. However, in other non-limiting instances, the padlock probe may carry a barcode.
  • the present disclosure provides a padlock probe that is specific to a target RNA.
  • the RNA-specific padlock probe may selectively hybridize to a cDNA that corresponds to the target RNA.
  • the RNA-specific probe may lack a barcode that uniquely identifiesthe cDNA. In such instances, a spatial information maybe obtained with respect to the RNA-specific probe.
  • the RNA-specific probe may carry a barcode that uniquely identifies the cDNA.
  • the RNA-specific padlock probes may also carry a sequencing primer binding site.
  • the sequencing primer binding site may be a batch-specific primer binding site — in other words, a primer may bind to the primer binding site that is unique to the probe.
  • One or more padlock probes may be used. Each probe may have a unique batch-specific primer binding site. The use of a batch-specific primer binding site may enable the spatial localization of the padlock probe, thereby obviating the need for a barcode sequence on the padlock probe.
  • the plurality of cDNA inside the cellular sample can be amplified to generate cDNA amplicons (e.g., concatemers).
  • the plurality of cDNA molecules can be amplified by conducting a padlock probe circularization and rolling circle amplification workflow.
  • the methods comprise contacting the plurality of cDNA molecules with a plurality of padlock probes, including a first plurality of target-specific padlock probes that hybridize with first target cDNA molecules, and a second plurality of target-specific padlock probes that hybridize with second target cDNA molecules.
  • the padlock probes comprise single -stranded oligonucleotides.
  • the padlock probes comprise DNA, RNA, DNA and RNA.
  • the padlock probes comprise canonical nucleotides and/or nucleotide analogs.
  • the padlock probes are modified to confer resistance to nuclease degradation (e g., ribonuclease degradation).
  • the padlock probes comprise at least one phosphorothioate diester bond at their 5’ ends which can render the padlock probes resistant to nuclease degradation.
  • the padlock probes comprise 2-5 or more consecutive phosphorothioate diester bonds at their 5’ ends.
  • the padlock probes comprise at least one ribonucleotide and/or at least one 2’ -O-m ethyl, 2’-O-methoxyethyl (MOE), 2’ fluoro-base nucleotide.
  • the padlock probes comprise phosphorylated 3 ’ ends.
  • the padlock probes comprise at least one locked nucleic acid (LNA) base.
  • the padlock probes comprise a phosphorylated 5 ’ end (e g., using a polynucleotide kinase).
  • the methods can comprise contacting the plurality of cDNA molecules with a plurality of padlock probes.
  • the plurality of padlock probes can include a first plurality of target-specific padlock probes.
  • the first plurality of target-specific padlock probes can hybridize with the first target cDNA molecules.
  • the plurality of padlock probes can include a second plurality of target-specific padlock probes.
  • the second plurality of target-specific padlock probes can hybridize with the second target cDNA molecules.
  • individual padlock probes comprise first and second terminal regions that hybridize to portions of cDNA molecules to form a plurality of cDNA -padlock probe complexes, wherein individual complexes have the firstand second terminal probe regions hybridized to proximal regions of a cDNA molecule to form a nick or gap between the first and second terminal probe ends.
  • the first terminal region of an individual padlock probe has a first target-specific sequence that selectively hybridizes to a first region of a target cDNA molecule
  • the second terminal region of the individual padlock probe has a second target-specific sequence that selectively hybridizes to a second region of the same target cDNA molecule, where a nick or gap is formed between the hybridized first and second terminal regions, thereby circularizing the padlock probe.
  • the first terminal region of an individual padlock probe can have a first target-specific sequence that can selectively hybridize to a first region of a target cDNA molecule.
  • the second terminal region of an individual padlock probe can have a second target-specific sequence that can selectively hybridize to a second region of a target cDNA molecule.
  • a nick or gap can be formed between the hybridized first and second terminal regions.
  • the hybridization of the first terminal region and the second terminal region can circularize the padlock probe.
  • individual padlock probes in a set of padlock probes comprise firstand second terminal regions that hybridize to the same target regions of the target cDNA molecules to form a plurality of cDNA-padlock probe complexes having the same cDNA sequence.
  • a set of padlock probes (e.g., a plurality of padlock probes) comprise at least two sub-sets of padlock probes.
  • individual padlock probes in a first sub-set of padlock probes comprise first and second terminal regions that hybridize to the same target regions (e.g., a first target region) of the target cDNA molecules to form a first plurality of cDNA-padlock probe complexes having the same cDNA sequence.
  • individual padlock probes in a second sub-set of padlock probes comprise first and second terminal regions that hybridizeto the same target regions (e.g., a second target region) of the target cDNA molecules to form a second plurality of cDNA-padlockprobe complexes having the same cDNA sequence.
  • the first and second subsets of padlock probes hybridize to different target regions of the same target cDNA molecules. In some embodiments, the first and second sub -sets of padlock probes hybridize to different target regions of different target cDNA molecules. In some embodiments, the set of padlock probes comprise 2-10 sub-sets of padlock probes, or 10-25 sub-sets of padlock probes, or 25-50 sub-sets of padlock probes, or up to 100 sub-sets of padlock probes.
  • the set of padlock probes comprise atleast 100 sub-sets of padlock probes, atleast 500 sub-sets of padlock probes, atleast 1000 sub-sets of padlock probes, atleast 10,000 sub-sets of padlock probes, or more sub-sets of padlock probes.
  • the nicks can be enzymatically ligated to generate covalently closed circular padlock probes.
  • the ligase enzyme can discriminate between matched and mis-matched hybridized ends to ensure target-specific hybridization.
  • the ligation reaction comprises use of a ligase enzyme, including a T3, T4, T7 or Taq DNA ligase enzyme.
  • the size of the gap between the hybridized first and second terminal regions is 1 -25 bases.
  • the 3 ’OH end of hybridized padlock probe can serve as an initiation site for a polymerase-catalyzed fill-in reaction (e.g., gap fill-in reaction) using the target cDNA molecule as a template. After the fill-in reaction, the remaining nick can be enzymatically ligated to generate covalently closed circular padlock probes.
  • the gap-filling reaction comprises contacting the circularized padlock probe with a DNA polymerase and a plurality of nucleotides.
  • the DNA polymerase comprises E. coll DNA polymerase I, Klenow fragment of E. coli DNA polymerase I, T7 DNA polymerase, or T4 DNA polymerase.
  • the ligase enzyme can discriminate between matched and mis-matched hybridized ends to ensure targetspecific hybridization.
  • the ligation reaction comprises use of a ligase enzyme, including a T3, T4, T7 or Taq DNA ligase enzyme.
  • the padlock probes comprise at least one universal adaptor sequence including a sample barcode sequence, an amplification primer binding site, a sequencing primer binding site, a compaction oligonucleotide binding site and/or a surface capture primer binding site.
  • the padlock probes comprise at least one unique identification sequence (e.g., unique molecular index (UMI).
  • UMI unique molecular index
  • the padlock probes comprise at least one restriction enzyme recognition sequence.
  • the plurality of covalently closed circular padlock probes can be subjected to a rolling circle amplification reaction to generate a plurality of concatemer molecules each having two or more tandem copies of a unit wherein the unit comprises a target sequence that corresponds to a target RNA molecules and any additional sequence(s) carried by the padlock probes including universal adaptor sequence(s), unique molecular index sequence(s) and/or restriction enzyme recognition sequence(s).
  • the rolling circle amplification reaction can generate a plurality of concatemer molecules.
  • each concatemer molecule of the plurality of concatemer molecule can have two or more tandem copies of a unit.
  • a unit can comprise a target sequence that can correspond to a target RNA molecule.
  • a unit can comprise additional sequence(s) carried by the padlock probes.
  • the additional sequences can include universal adaptor sequence(s), unique molecular index sequence(s), restriction enzyme recognition sequence(s), or any combination thereof.
  • the rolling circle amplification reaction comprises contacting the covalently closed circularized padlock probes with an amplification primer (e.g., a universal rolling circle amplification primer), a strand -displacing DNA polymerase, and a plurality of nucleotides, under a condition suitable for hybridizing individual amplification primers to a covalently closed padlock probe, and under a condition suitable for conducting primer extension using the covalently closed padlock probe as a template molecule to generate a nucleic acid concatemer.
  • the rolling circle amplification reaction can comprise contacting the covalently closed circularized padlock probes with an amplification primer.
  • the rolling circle amplification reaction can comprise contacting the covalently closed circularized padlock probes with a strand -displacing DNA polymerase. In some embodiments, the rolling circle amplification reaction can comprise contacting the covalently closed circularized padlock probes with a plurality of nucleotides. In some embodiments, the rolling circle amplification can occur under a condition suitable for hybridizing individual amplification primers to a covalently closed padlock probe. In some embodiments, the rolling circle amplification can occur under a condition suitable for conducting primer extension. In some embodiments, primer extension can occur using the covalently closed padlock probe as a template molecule. In some embodiments, the primer extension can generate a nucleic acid concatemer.
  • the plurality of nucleotides in the rolling circle amplification reaction comprise any mixture of two or more of dATP, dGTP, dCTP, dTTP and/or dUTP.
  • any of the rolling circle amplification reactions described herein can be conducted in the presence or in the absence of a plurality of compaction oligonucleotides.
  • the resulting concatemer when the rolling circle amplification reaction includes a plurality of nucleotide which includes dUTP, the resulting concatemer can be cross-linked to a crosslinking reactive group by treating the cellular sample with a succininuide ester (NHS), maleimide (Sulfo-SMCC), imidoester (DMP), carbodiimide (DCC, EDC) or phenyl azide, in some embodiments, polymerization of the cross-linking reactive group can be initiated with light or UV light, in some embodiments, the resulting concatemer can be cross-linked to a matrix by treating the cellular sample with a cross-linked agarose, cross-linked dextran or cross-linked polyethylene glycol (PEG), polyacrylamide, cellulose alginate or polyamide.
  • NHS succininuide ester
  • DMP imidoester
  • DCC carbodiimide
  • phenyl azide phenyl azide
  • the PEG comprises a sulfo-NHS ester moiety atone or both ends, for example a PEGylated bis(sulfosuccinimidyl)suberate) (e.g., BS(PEG)9 from Thermo Fisher Scientific, catalogNo. 21582).
  • a PEGylated bis(sulfosuccinimidyl)suberate e.g., BS(PEG)9 from Thermo Fisher Scientific, catalogNo. 21582
  • the rolling circle amplification reaction can be conducted ata constant temperature (e.g., isothermal) wherein the constant temperature is at room temperature to about 30°C, or about 30 - 40°C, or about 40 - 50°C, or about 50 - 65 °C.
  • a constant temperature e.g., isothermal
  • the DNA polymerase having a strand displacing activity can be selected from a group consisting of phi29 DNA polymerase, large fragment of Bst DNA polymerase, large fragment of Bsu DNA polymerase, and Bea (exo-) DNA polymerase, Klenow fragment of E. coli DNA polymerase, T5 polymerase, M-MuLV reverse transcriptase, HIV viral reverse transcriptase, or Deep Vent DNA polymerase.
  • the phi29DNA polymerase can be wild type phi29 DNA polymerase (e.g., MagniPhi from Expedeon), or variant EquiPhi29 DNA polymerase (e.g., from Thermo Fisher Scientific), and chimeric QualiPhi DNA polymerase (e.g., from 4basebio).
  • wild type phi29 DNA polymerase e.g., MagniPhi from Expedeon
  • EquiPhi29 DNA polymerase e.g., from Thermo Fisher Scientific
  • chimeric QualiPhi DNA polymerase e.g., from 4basebio
  • the rolling circle amplification primers can be modified to increase resistance to nuclease degradation.
  • the rolling circle amplification primers comprise at least one phosphorothioate diester bond attheir 5’ ends which can render the amplification primers resistant to exonuclease degradation.
  • the rolling circle amplification primers comprise 2-5 or more consecutive phosphorothioate diester bonds at their 5’ ends.
  • the rolling circle amplification primers comprise at least one ribonucleotide and/or at least one 2’ -O-m ethyl or 2’- O-methoxy ethyl (MOE) nucleotide.
  • the rolling circle amplification reaction can be conducted in the presence of a plurality of compaction oligonucleotides which, when hybridized to a concatemer molecule, can compact the size, the shape, or a combination of the size and the shape of the concatemer to form a compact nanoball.
  • the rolling circle amplification reaction can be conducted in the presence of a plurality of compaction oligonucleotides.
  • the compaction oligonucleotides can hybridize to a concatemer molecule.
  • the compaction oligonucleotide hybridized to the concatemer molecule can compact the size, the shape, or a combination of the size and the shape of the concatemer molecule.
  • the compacted concatemer molecule can form a compact nanoball.
  • the compaction oligonucleotides can comprise single stranded oligonucleotides having a first region at one end that can hybridize to a portion of a concatemer molecule and a second region at the other end that can hybridize to another portion of the same concatemer molecule, where hybridization of the compaction oligonucleotide to a given concatemer can compact the size, the shape, or a combination of the size orthe shape of the concatemer.
  • the compaction oligonucleotides can comprise single stranded oligonucleotides having a first region at one end that can hybridize to a portion of a concatemer molecule and a second region at the other end that can hybridize to another portion of the same concatemer molecule.
  • the hybridization of the compaction oligonucleotide to a concatemer molecule can compact the size, the shape, or a combination of the size and the shape of the concatemer molecule.
  • the compaction oligonucleotides include a 5’ region, an optional internal region (intervening region), a 3 ’ region, or any combination thereof.
  • a 5 ’ and 3 ’ regions of the compaction oligonucleotide can hybridize to different portions of the concatemer to pull together distal portions of the concatemer causing compaction of the concatemer to form a DNA nanoball.
  • the 5’ region of the compaction oligonucleotide is designed to hybridize to a first portion of the concatemer molecule (e.g., a universal compaction oligonucleotide binding site), and the 3’ region of the compaction oligonucleotide is designed to hybridized to a second portion of the concatemer molecule (e.g., a universal compaction oligonucleotide binding site).
  • Inclusion of compaction oligonucleotides during RCA can promote formation of DNA nanoballs havingtighter size and shape compared to concatemers generated in the absence of the compaction oligonucleotides.
  • the compact and stable characteristics of the DNA nanoballs improves in situ sequencing accuracy by increasing signal intensity and the nanoballs retain their shape and size during multiple sequencing cycles.
  • the compaction oligonucleotides comprise single stranded oligonucleotides comprising DNA, RNA, or a combination ofDNA and RNA.
  • the compaction oligonucleotides can be any length, including 20-150 nucleotides, or 30-100 nucleotides, or 40 80 nucleotides in length.
  • the compaction oligonucleotides comprises a 5’ region and a 3’ region, and optionally an intervening region between the 5’ and 3’ regions.
  • the intervening region can be any length, for example about 2 -20 nucleotides in length.
  • the intervening region comprises a homopolymer having consecutive identical bases (e.g., AAA, GGG, CCC, TTT or UUU).
  • the intervening region comprises a non-homopolymer sequence.
  • the 5’ region of the compaction oligonucleotides can be wholly complementary or partially complementary along its length to a first portion of a concatemer molecule.
  • the 5’ region of the compaction oligonucleotides can be wholly complementary along its length to a first portion of a concatemer molecule.
  • the 5 ’ region of the compaction oligonucleotides can be partially complementary along its length to a first portion of a concatemer molecule.
  • the 3 ’ region of the compaction oligonucleotides can be wholly complementary or partially complementary along its length to a second portion of a concatemer molecule.
  • the 3 ’ region of the compaction oligonucleotides can be wholly complementary along its length to a second portion of a concatemer molecule.
  • the 3’ region of the compaction oligonucleotides canbe partially complementary along its length to a second portion of a concatemer molecule.
  • the 5 ’ region of the compaction oligonucleotides can hybridize to a first universal sequence portion of a concatemer molecule.
  • the 3 ’ region of the compaction oligonucleotides can hybridize to a second universal sequence portion of a concatemer molecule. .
  • the 5’ region of the compaction oligonucleotide can have the same sequence as the 3 ’ region.
  • the 5’ region of the compaction oligonucleotide can have a sequence that is different from the 3 ’ region.
  • the 3 ’ region of the compaction oligonucleotide can have a sequence that is a reverse sequence of the 5 ’ region.
  • the 5’ region of the compaction oligonucleotide can have a sequence that is a reverse sequence of the 3 ’ region.
  • the 3’ region of any of the compaction oligonucleotides can include an additional three bases at the terminal 3’ end which comprises 2’ -O-m ethyl RNA bases (e g., designated mUmUmU) or the terminal 3’ end lacks additional 2’ -O-methyl RNA bases.
  • the compaction oligonucleotides comprise one ormore modified bases or linkages at their 5’ or 3’ ends to confer certain functionalities.
  • the compaction oligonucleotides comprise at least one phosph orothioate linkages at their 5 ’ and/or 3 ’ ends to confer exonuclease resistance.
  • At least one nucleotide at or near the 3’ end comprises a 2’ fluoro base which confers exonuclease resistance.
  • the 3’ end of the compaction oligonucleotides comprise at least one 2’ -O-methyl RNA base which blocks polymerase-catalyzed extension.
  • the 3 ’ end of the compaction oligonucleotide comprises threebases comprising 2 ’-O-methyl RNAbase (e.g., designated mUmUmU).
  • the compaction oligonucleotides comprise a 3’ inverted dT at their 3’ ends which blocks polymerase-catalyzed extension.
  • the compaction oligonucleotides comprise 3’ phosphorylation which blocks polymerase-catalyzed extension.
  • the internal region of the compaction oligonucleotides comprise at least one locked nucleic acid (LNA) which increases the thermal stability of duplexes formed by hybridizing a compaction oligonucleotide to a concatemer molecule.
  • the compaction oligonucleotides comprise a phosphorylated 5’ end (e.g., using a polynucleotide kinase).
  • the compaction oligonucleotide comprises the sequence s’ - CATGTAATGCACGTACTTTCAGGGTAAACATGTAATGCACGTACTTTCAGGGT-3’ (SEQ ID NO: 14).
  • the compaction oligonucleotides includes an additional three bases at the terminal 3’ end which comprises 2 ’-O-methyl RNA bases (e.g., designated mUmUmU) orthe terminal 3’ end lacks additional 2’-O-methyl RNAbases.
  • the compaction oligonucleotides can include at least one region having consecutive guanines.
  • the compaction oligonucleotides can include at least one region having 2, 3, 4, 5, 6 ormore consecutive guanines.
  • the compaction oligonucleotides comprise four consecutive guanines which can form a guanine tetrad structure (see FIG. 18).
  • the guanine tetrad structure can be stabilized via Hoogsteen hydrogen bonding.
  • the guanine tetrad structure can be stabilized by a central cation including potassium, sodium, lithium, rubidium or cesium.
  • At least one compaction oligonucleotide can form a guanine tetrad ( Figure 18) and hybridize to the universal binding sequences in a concatemer which can cause the concatemer to fold to form an intramolecular G-quadruplex structure (FIG. 19).
  • the concatemers can selfcollapse to form compact nanoballs. Formation of the guanine tetrads and G-quadruplexes in the nanoballs may increase the stability of the nanoballs to retain their compact size and shape which can withstand changes in pH, temperature and/or repeated flows of reagents during sequencing inside the cellular sample.
  • the plurality of compaction oligonucleotides in the rolling circle amplification reaction have the same sequence.
  • the plurality of compaction oligonucleotides in the rolling circle amplification reaction comprise a mixture of two or more different populations of compaction oligonucleotides having different sequences.
  • the immobilized concatemer template molecule can self-collapse into a compact nucleic acid nanoball.
  • the nanoballs can be imaged and a FWHM measurement can be obtained to give the shape/size of the nanoballs.
  • inclusion of compaction oligonucleotides in the rolling circle amplification reaction can promote collapsing of a concatemer into a DNA nanoball.
  • Conducting RCA with compaction oligonucleotides helps retain the compact size and shape of a DNA nanoball during multiple sequencing cycles which can improve FWHM (full width half maximum) of a spot image of the DNA nanoball inside a cellular sample.
  • the DNA nanoball does notunravel during multiple sequencing cycles.
  • the spot image of the DNA nanoball does not enlarge during multiple sequencing cycles.
  • the spot image of the DNA nanoball remains a discrete spot during multiple sequencing cycles.
  • the spot image can be represented as a Gaussian spot and the size canbe measured as a FWHM.
  • a smaller spot size as indicated by a smaller FWHM typically correlates with an improved image of the spot.
  • the FWHM of a nanoball spot can be about 10 um or smaller.
  • the present disclosure provides a protein -binding composition.
  • the protein-binding composition may be specific to a target polypeptide or protein corresponding to a targetRNA.
  • the target RNA may then be sequenced.
  • the protein -binding composition may increase the specificity of the methods of sequencing as disclosed herein.
  • the protein-binding composition can be used to selectively bind a target polypeptide or protein.
  • the protein-binding composition is linked to an oligonucleotide carrying at least one tag sequence which uniquely identifies the target polypeptide or protein to which the protein -binding composition selectively binds.
  • the protein -binding composition comprises an antibody.
  • the protein-binding composition comprises a protein-binding moiety.
  • the protein-binding moiety may comprise one or more antigen recognition domains.
  • the one or more antigen recognition domains of the protein-binding moiety may recognize the target polypeptide or protein.
  • the multivalent molecule may be or comprise an antibody or portion thereof, a chimeric antigen receptor (CAR) or portion thereof, or a T cell receptor (TCR) or portion thereof.
  • CAR/TCR chimeric antigen receptor
  • chimeric antigen receptors are engineered fusion proteins constructed from antigen recognition, signaling, and costimulatory domains that may be expressed in T cells to reprogram the T cells to specifically target tumor cells.
  • a CAR is a recombinant polypeptide construct comprising at least an extracellular antigen binding domain, a transmembrane domain, and a cytoplasmic signaling domain comprising a functional signaling domain derived from a stimulatory molecule.
  • Exogenous T cell receptors are similar to CARs in that they may be engineered to recognize an antigen (e.g., tumor antigen).
  • the TCR is a recombinant polypeptide.
  • Protein-binding compositions as disclosed herein may comprise a CAR or a TCR.
  • a antigen recognition domain of the CAR or TCR may be configured to recognize or bind to the target protein encoded by the target RNA.
  • the protein-binding composition may further comprise an oligonucleotide tag linked to the CAR or the TCR.
  • the oligonucleotide tag may hybridize to a unique padlock probe.
  • the binding of the oligonucleotide tag to the padlock probe may topologically lock the padlock probe.
  • the oligonucleotide tag may then be amplified. For example, it may be amplified via rolling circle amplification (RCA), bridge amplification, polymerase chain reaction (PCR), or any combination thereof.
  • RCA rolling circle amplification
  • PCR polymerase chain reaction
  • the extracellular antigen binding domain is an antigen binding fragment of an antibody, or a functional derivative thereof (e.g., an scFv).
  • the specificity of the antigen binding domain may be modified to treat a variety of different disorders, and may be mono-valent or multi-valent (e.g. di-valent, tri-valent).
  • the antigen binding domain comprises an scFv, and multivalent binding is provided by tandem addition of multiple scFvs bearing different antigen specificities.
  • the specificity and intended indication of the antigen binding matches that of any of the CAR-T constructs in contemporary clinical trials.
  • the specificity may include anti-CD19 (e.g.
  • axicabtagene ciloleucel for R/R diffuse large cell lymphoma Tisagenlecleucel, forR/RB cell ALL and non-Hodgkin lymphoma
  • anti-CD22 e.g. for R/RB-ALL
  • anti-CD19/CD22 dual targeted e.g. for R/R ALL
  • anti-CAIX carbonic anhydrase9
  • anti-PSMA a.k.a FOLHl, e.g. for renal cell carcinoma
  • anti-MUCl e.g. for seminal vesicle carcinoma
  • anti-CD33 e.g. for acute myeloid leukemia
  • anti-mesothelin mRNA e.g.
  • anti-FOLRl e.g. for metastatic ovarian cancer
  • anti-carcinoembryonic antigen a.k.a. CEA, e g. for CEA-expressing adenocarcinoma liver metastases
  • anti-IL13RA2 e g. for glioblastoma
  • anti-HER2 e g. for sarcoma
  • one or more of the following antigens may be bound by the CAR-T construct: 1-40-P-amyloid, 4-1BB, 5 AC, 5T4, 707-AP, A kinase anchor protein 4 (AKAP-4), activin receptor type-2B (ACVR2B), activin receptor-like kinase 1 (ALK1), adenocarcinoma antigen, adipophilin, adrenoceptor 0 3 (ADRB3), AGS-22M6, a folate receptor, a-fetoprotein (AFP), AIM-2, anaplastic lymphoma kinase (ALK), androgen receptor, angiopoietin 2, angiopoietin 3, angiopoietin-binding cell surface receptor 2 (Tie 2), anthrax toxin, A0C3 (VAP-1), B cell maturation antigen (BCMA), B7-H3 (CD276), Bacillus anthracis anthrax,
  • E. coli shiga toxin type-1 E. coli shiga toxin type-2, ecto-ADP- ribosyltransferase 4 (ART4), EGF-like module-containing mucin-like hormone receptor-like 2 (EMR2), EGF-like-domain multiple 7 (EGFL7), elongation factor 2 mutated (ELF2M), endotoxin, Ephrin A2, Ephrin B2, ephrin type-A receptor 2, epidermal growth factor receptor (EGFR), epidermal growth factor receptor variant III (EGFRvIII), episialin, epithelial cell adhesion molecule (EpCAM), epithelial glycoprotein 2 (EGP-2), epithelial glycoprotein 40 (EGP-40), ERBB2, ERBB3, ERBB4, ERG (transmembrane protease, serine 2 (TMPRSS2) ETS fusion gene), Escherichia coli, ETS translocation -variant gene 6, located on
  • the transmembrane domain of the CAR is a domain that localizes the CAR to the correct membrane location and stabilizes its structure .
  • the transmembrane domain comprises a transmembrane domain from CD28.
  • the CAR comprises a cytoplasmic signaling domain comprising a functional signaling domain derived from a stimulatory molecule.
  • the stimulatory molecule is a stimulatory receptor molecule.
  • the stimulatory receptor molecule is a stimulatory receptor molecule of an adaptive immune cell.
  • the stimulatory molecule is the zeta chain associated with the T cell receptor complex.
  • the stimulatory molecule is e.g.
  • the intracellular signaling domain comprises one or more functional signaling domains derived from at least one costimulatory molecule.
  • the costimulatory molecule comprises 4-1BB (i.e., CD137), CD27, CD28 CD30, CD40, PD-1, CD2, CD7, CD258, NKG2C, B7-H3, a ligand that binds to CD83, ICAM-1, LFA-1 (CD1 Ia/CD18), ICOS, or a combination thereof.
  • the CAR comprises a leader sequence at the amino-terminus (N-terminus) of the CAR fusion protein.
  • the CAR comprises a signal peptide sequence atthe N-terminus of the extracellular antigen recognition domain, wherein the signal peptide sequence is optionally cleaved from the antigen recognition domain (e g., a scFv) during cellular processing and localization of the CAR to the cellular membrane
  • the antigen recognition domain e g., a scFv
  • a CAR disclosed herein is a first-, second-, third-, or fourthgeneration CAR system, a functional variant thereof, or a combination thereof.
  • a first-generation CAR comprises an antigen binding domain with specificity for a particular antigen (e.g. an antibody or antigen-binding fragment thereof, such as an scFv, a Fab fragment, a VHH domain, or a VH domain of a heavy-chain only antibody), a transmembrane domain derived from an adaptive immune receptor (e.g. the transmembrane domain from the CD28 receptor), and a signaling domain derived from an adaptive immune receptor (e.g.
  • a second-generation CAR construct comprises the elements of the first- generation CAR and an addition of a co-stimulatory domain to the intracellular signaling domain portion of the CAR (e.g., derived from co-stimulatory receptors that act alongside T-cell receptors such as CD28, CD137/4-1BB, andCD134/OX40).
  • the co- stimulatory domain abrogates the need for administration of IL-2 alongside a first-generation CAR.
  • a third-generation CAR comprises the elements of a first generation CAR with the addition of multiple co-stimulatory domains to the intracellular signaling domain portion of the CAR (e.g. CD3( ⁇ -CD28-OX40, or CD3 -CD28-41BB).
  • fourth-generation CAR comprises the elements of a second- or third-generation CARs with the addition of an activating cytokine (e.g. IL- 12, IL-23, or IL-27) to the intracellular signaling portion of the CAR (typically between one or more of the costimulatory domains and the CD3(j ITAM domain) or under the control of a CAR -induced promoter (e.g. the NFAT/IL-2 minimal promoter).
  • an activating cytokine e.g. IL- 12, IL-23, or IL-27
  • Protein-binding compositions as disclosed herein may comprise an antibody or fragment thereof.
  • a “fragment thereof ’ refers to a moiety, other than a full-length intact antibody, that (1) comprises a portion of an antibody and (2) binds to the corresponding antigen (e g., target protein).
  • This includes, butis notlimited to, Fv, Fab, Fab', Fab'-SH, F(ab') 2 , diabodies, linear antibodies, single domain antibodies, single domain camelid antibodies, and single-chain antibody (scFv) molecules.
  • a Fab region of the antibody may be configured to recognize or bind to the target protein encoded by the target RNA.
  • the protein -binding composition may further comprise an oligonucleotide tag linked to the antibody or fragment thereof.
  • the oligonucleotide tag may hybridize to a unique padlock probe.
  • the binding of the oligonucleotide tag to the padlock probe may topologically lock the padlockprobe.
  • the oligonucleotide tag may then be amplified. For example, it may be amplified via rolling circle amplification (RCA), bridge amplification, polymerase chain reaction (PCR), or any combination thereof.
  • an antibody and oligonucleotide tag may be joined via a linker moiety.
  • the antibody may comprise a plurality of oligonucleotide tags linked via a plurality of linker moieties.
  • the antibody maybe specific to a target protein or polypeptide corresponding to a target RNA.
  • the oligonucleotide tag may correspond to a sequence of a padlock probe as disclosed herein. The successful binding of the oligonucleotide tag to a padlock probe as disclosed herein may lockthe topological configuration of the padlock probe.
  • the oligonucleotide tag sequence is designed to provide a sequencing read -out that is associated with binding between the antib ody-oligonucleotide conjugate and the target polypeptide.
  • the sequence of the oligonucleotide tag is designed to exhibit minimal hybridization to RNA (e.g., target RNA) in the cellular sample.
  • the present disclosure provides one or more sets of antibody-oligonucleotide conjugates, comprising at least a first and a second antibody-oligonucleotide conjugate.
  • a set of antibody- oligonucleotide conjugates can be used to conduct any of the detecting and/or sequencing methods described herein.
  • a set comprises at least a first antibody-oligonucleotide conjugate comprising a first antibody which selectively binds a first target polypeptide.
  • the first antibody is linked to a first oligonucleotide carrying an oligonucleotide tag sequence which uniquely identifies the first antibody which selectively binds the first target polypeptide.
  • the set comprises at least a second antibody-oligonucleotide conjugate comprising a second antibody which selectively binds a second target polypeptide.
  • the second antibody is linked to a second oligonucleotide carrying an oligonucleotide tag sequence which uniquely identifies the second antibody which selectively binds the second target polypeptide.
  • the first antibody-oligonucleotide conjugate and the second antibody-oligonucleotide conjugate are the same (e.g., FIG. 28). In some embodiments, the first antibody-oligonucleotide conjugate and the second antibody-oligonucleotide conjugate are different (e.g., FIG. 29).
  • the first and second oligonucleotide tag sequences are designed to selectively bind to left and right binding arms of a padlock probe. In some embodiments, the first and second oligonucleotide tag sequences differ from each other.
  • a set of antibody-oligonucleotide conjugates comprises 2-20 antibody-oligonucleotide conjugates, wherein each antibody-oligonucleotide conjugate binds its respective target polypeptide.
  • the set comprises 20-100 antibody- oligonucleotide conjugates, or 100-500 antibody-oligonucleotide conjugates, or 500-1000 antibody-oligonucleotide conjugate, or more antibody-oligonucleotide conjugates.
  • the present disclosure provides antibody-oligonucleotide conjugates each comprising an antibody linked to an oligonucleotide.
  • the antibody comprises an intact immunoglobulin, antibody fragment, an antigen binding portion of an antibody, or single -chain antibody.
  • the antibodies can be monoclonal or polyclonal antibodies.
  • the antibodies are capable of binding specifically to a target analyte.
  • the target analyte includes polypeptides, polynucleotides, carbohydrates, saccharides and lipids.
  • target analytes comprise intact polypeptides or peptide fragments.
  • the antibodies comprises an antigen -binding region (e.g., paratope) that binds specifically to a target analyte.
  • An immunoglobulin is typically a tetrameric molecule comprising two identical pairs of polypeptide chains where each pair includes a light chain and a heavy chain.
  • the amino portion of the heavy and light chains each comprise a variable region which associate with each other to form an antigen binding region (e.g., paratope).
  • an antigen binding region e.g., paratope
  • a typical immunoglobulin can bind two antigens or can bind two target analytes.
  • the carboxyl portion of the heavy chain comprise a constant region which associate with each other to form an Fc region for effector function.
  • the Fc portion of the heavy chains can define the class of antibody which includes IgG, IgM, IgD, IgA or IgE isotype.
  • the heavy and/or light chains can be prepared using recombinant techniques or by immunizing an animal with an antigen of interest.
  • the antibody fragment generally comprises a portion of an intact immunoglobulin that can bind an antigen. Examples of antibody fragments include but are not limited to Fv, Fab, Fab’, Fab’-SH, F(ab’)2, and Fd.
  • an Fv fragment comprises a variable light chain region (VL) and variable heavy chain region (VH).
  • an Fab fragment comprises a monovalent antibody fragment having a variable light chain region (VL), constant light chain region (CL), variable heavy chain region (VH), and first constant region (CHI).
  • VL variable light chain region
  • CL constant light chain region
  • VH variable heavy chain region
  • CHI first constant region
  • an Fab’ fragment comprises a monovalent antibody fragment having a variable light chain region (VL), constant light chain region (CL), variable heavy chain region (VH), first constant region (CHI), hinge region, and atleast a portion of a second constant region (CH2).
  • VL variable light chain region
  • CL constant light chain region
  • VH variable heavy chain region
  • CHI first constant region
  • hinge region atleast a portion of a second constant region (CH2).
  • an F(ab’)2 fragment comprises a bivalent antibody fragment having two Fab fragments linked via a disulfide bridge at the hinge region.
  • a single-chain antibody typically comprises a single polypeptide chain (e.g., a monovalent antibody molecule) having a variable light chain region (VL) and variable heavy chain region (VH) joined by a polypeptide linker (see, e.g., Bird etal., 1988, Science 242:423- 26 and Huston et al., 1988, Proc. Natl. Acad. Sci. USA 85 :5879-83).
  • the amino -terminal end of the single-chain antibody comprises either the variable light chain region (VL) or the variable heavy chain region (VH).
  • the single-chain antibody comprises an scFv- Fc antibody which further comprises an antibody hinge region, and at least a portion of the Fc region including the CH2 and/or the CH3 region.
  • the single-chain antibody comprises an scFv-CH antibody which further comprises an antibody hinge region, and at least a portion of the CH3 region
  • the antibody-oligonucleotide conjugates comprise an antibody linked to an oligonucleotide by a linker moiety.
  • the linker moiety comprises streptavidin (or an avidin -like moiety), biotin, an amine group, or a disulfide group.
  • the linker moiety is not cleavable or not removable. In some embodiments, the linker moiety is cleavable or removable.
  • the linker moiety can be cleavable with light (e.g., UV light), chemically -cleav able (e.g., dithiothreitol), heat- cleavable, or enzymatically cleavable.
  • light e.g., UV light
  • chemically -cleav able e.g., dithiothreitol
  • heat- cleavable e.g., heat- cleavable
  • enzymatically cleavable e.g., UV light
  • antibody-oligonucleotide conjugates canbe prepared by crosslinking amino groups on the antibody and oligonucleotide using glutaraldehyde.
  • the lysine side chain epsilon-amide is commonly targeted to conjugate to oligonucleotides.
  • maleimide-modified antibodies can be reacted with sulfhydryl-modified oligonucleotides.
  • heterobifunctional cross-linkers are introduced as bridges to link together the antibody and oligonucleotide.
  • the antibody-oligonucleotide conjugates comprise an antibody linked to an oligonucleotide.
  • the oligonucleotide comprises an nucleic acid comprising DNA, RNA, or chimeric DNA and RNA.
  • the oligonucleotide comprises canonical nucleotides or nucleotide analogs such as locked nucleic acids (LNA).
  • LNA locked nucleic acids
  • the length of the oligonucleotide can be 10 -50 nucleotides in length, or 50-100 nucleotides in length, or 100-200 nucleotides in length.
  • the oligonucleotide is designed to carry at least one tag sequence, where the tag sequence selectively binds to left and right binding arms of a padlock probe.
  • the oligonucleotide tag sequence also uniquely identifies the antibody to which it is conjugated, where the antibody selectively binds a target polypeptide.
  • the oligonucleotide tag sequence is designed to provide a sequencing read -out that is associated with binding between the antibody-oligonucleotide conjugate and the target polypeptide.
  • the sequence of the oligonucleotide tag is designed to exhibit minimal hybridization to RNA (e.g., target RNA) in the cellular sample.
  • a set of antibody-oligonucleotide conjugates comprises at least a first and a second antibody-oligonucleotide conjugate.
  • a set of oligonucleotide tags comprises at least a first and a second oligonucleotide tag.
  • the set of oligonucleotide tags comprises 2-20 oligonucleotide tags wherein each oligonucleotide tag in the set uniquely identifies its respective conjugated antibody where the antibody selectively binds a target polypeptide.
  • the set of oligonucleotide tags comprises 20- 100 oligonucleotide tags, or 100-500 oligonucleotide tags, or 500- 1000 oligonucleotide tags, or more oligonucleotide tags.
  • a first antibody-nucleotide conjugate and a second antibody-nucleotide conjugate may be used against a protein (in FIG. 28, “Protein-1 encodedby RNA-1”).
  • the first antibody- nucleotide conjugate, the second antibody- nucleotide conjugate, or both antibody-nucleotide conjugates may comprise a linker moiety.
  • a first oligonucleotide tag may be linked to the first antibody-oligonucleotide conjugate via a first linker moiety.
  • the first oligonucleotide tag may conjugate to a padlock probe.
  • the first oligonucleotide tag of the first antibody-oligonucleotide conjugate may, e.g., ligate, chemically conjugate, or have proximity crosslinking to the padlock probe.
  • a second oligonucleotide tag of the second antibody-oligonucleotide conjugate may be linked to the second antibody via a second linker moiety.
  • the second oligonucleotide tag may conjugate to a padlock probe.
  • the second oligonucleotide tag of the second antibody-oligonucleotide conjugate may, e.g., ligate, chemically conjugate, or have proximity crosslinking to the padlock probe.
  • a dehybridization step may be completed to eliminate circulate probes from a region of the padlock probe that is not ligated or not crosslinked to an oligonucleotide tag. Resultantly, successful ligation or chemical crosslinking of one or more oligonucleotide tag to the padlock probe may topologically lock the padlock probe in place.
  • the hybridization of both the first oligonucleotide tag and the second oligonucleotide tag to the padlock probe is required to topologically lock the padlock probe.
  • a rolling circle amplification (RCA) step may be completed.
  • the RCA step may comprise hybridizing an RCA primer to a sequence of the padlock probe, and then completing one or more cycles of RCA.
  • the RCA step may comprise cleaving a phosphate group of the first oligonucleotide tag or the second oligonucleotide tag (in FIG. 28, cleaving of a phosphate group of second oligonucleotide is denoted by a star), thereby generating a 3’ end, and then completing one or more cycles of RCA.
  • a first antibody-oligonucleotide conjugate and a second antibody-oligonucleotide conjugate may be used against a target protein (in FIG. 29, “Protein-1 encoded by RNA-1”).
  • the first antibody-oligonucleotide conjugate may comprise a first antibody and a first oligonucleotide tag.
  • the first oligonucleotide tag may be linked to the first antibody via a first linker moiety.
  • the second antibody-oligonucleotide tag may comprise a second antibody and a second oligonucleotide tag.
  • Thee second oligonucleotide tag may be linked to the second antibody via a second linker moiety.
  • the first antibody- oligonucleotide conjugate and the second antibody-oligonucleotide conjugate may be specific to the same target protein.
  • a first oligonucleotide tag (denoted in FIG. 29 by the dashed line) of the first antibody-oligonucleotide conjugate may conjugate to a padlock probe.
  • the first oligonucleotide tag of the first antibody-oligonucleotide conjugate may, e.g., ligate, chemically conjugate, or have proximity crosslinking to the padlock probe.
  • a second oligonucleotide tag (denoted in FIG.
  • the second antibody-oligonucleotide conjugate may conjugate to a padlock probe.
  • the second oligonucleotide tag of the second antibody - oligonucleotide conjugate may, e g., ligate, chemically conjugate, or have proximity crosslinking to the padlock probe.
  • a dehybridization step may be completed to eliminate the first oligonucleotide tag (dashed) or the second oligonucleotide tag (dotted) . Resultantly, successful ligation or chemical crosslinking of the first oligonucleotide tag or the second oligonucleotide tag to the padlock probe may topologically lock the padlock probe in place.
  • a rolling circle amplification (RCA) step may be completed.
  • the RCA step may comprise hybridizing an RCA primer to a sequence of the padlock probe, and then completing one or more cycles of RCA.
  • the RCA step may comprise cleaving a phosphate group of the first oligonucleotide tag or the second oligonucleotide tag (in FIG.
  • the RCA step may comprise ligation of the first (dashed) oligonucleotide tag, and no ligation of the second (dotted) oligonucleotide tag, as the second oligonucleotide tag is missing a 5’ phosphate group.
  • the 3’ end of the first oligonucleotide tag is hybridized to the padlock probe, as is used as an RCA primer.
  • One or more cycles of RCA may then be conducted.
  • the target analytes comprise polypeptides or peptide fragments.
  • a target polypeptide is encoded by a target RNA in the cellular sample.
  • Target polypeptides include polypeptides having post-translationally modified forms, including methylation, phosphorylation, glycosylation, hydroxylation, ubiquitination, nitrosylation, acetylation, lipidation, ADP -ribosylation, carbonylation, SUMOylation and/or disulfidebond formation.
  • the target polypeptides can be subjected to proteolysis for example by a protease or cleavage due to ribosomal skipping.
  • Target polypeptides also include precursor molecules that have notyetbeen subjected to post-translation modification.
  • Target polypeptides include muteins, variants, chimeric proteins and fusion proteins.
  • Target polypeptides can be labeled with a binding partner molecule having an affinity moiety, such as for example biotin (or its derivatives), digoxigenin, fluorescein, cholesterol, maltose, or any of the affinity molecules described below.
  • polysaccharides inside the cellular sample can be detected by contacting the cellular sample with lectin conjugated to an oligonucleotide carrying a tag sequence that is designed to bind the left and right arms of a padlock probe.
  • Lectinoligonucleotide conjugates can permit detection of polysaccharides using a sequencing readout.
  • lipids inside the cellular sample can be detected by contacting the cellular sample with a lipid-specific binding protein or an amphipathic polypeptide which are conjugated to an oligonucleotide carrying a tag sequence that is designed to bind the left and right arms of a padlock probe.
  • Lipid -specific binding proteins and amphipathic polypeptides that are conjugated to tagged oligonucleotides can permit detection of lipids using a sequencing readout.
  • sequencing polymerases can be used for conducting sequencing reactions.
  • the sequencing polymerase(s) is/are capable of binding and incorporating a complementary nucleotide opposite a nucleotide in a concatemer template molecule.
  • the sequencing polymerase(s) is/are capable of binding a complementary nucleotide unit of a multivalent molecule opposite a nucleotide in a concatemer template molecule.
  • the plurality of sequencing polymerases comprise recombinant mutant polymerases.
  • suitable polymerases for use in sequencing with nucleotides and/or multivalent molecules include but are not limited to: KlenowDNA polymerase; Thermus aquaticus DNA polymerase I (Taq polymerase); KlenTaq polymerase; Candidates altiarchaeales archaeon; Candidates Hadarchaeum Yellowstonense; Hadesarchaea archaeon; Euryarchaeota archaeon; Thermoplasmata archaeon; Thermococcus polymerases such as Thermococcus litoralis, bacteriophage T7 DNA polymerase; human alpha, delta and epsilon DNA polymerases; bacteriophage polymerases such as T4, RB69 and phi29 bacteriophage DNA polymerases; Pyrococcus furiosus DNA polymerase (Pfu polymerase); Bacillus subtilis DNA polymerase III; E.
  • KlenowDNA polymerase Thermus aquaticus DNA polymerase
  • coli DNA polymerase III alpha and epsilon 9 degree N polymerase
  • reverse transcriptases such as HIV type M or 0 reverse transcriptases
  • avian myeloblastosis virus reverse transcriptase Moloney Murine Leukemia Virus (MMLV) reverse transcriptase
  • MMLV Moloney Murine Leukemia Virus
  • DNA polymerases include those from various Archaea genera, such as, Aeropyrum, Archaeglobus, De sulfurococcus, Pyrobaculum, Pyrococcus, Pyrolobus, Pyrodictium, Staphylothermus, Stetteria, Sulfolobus, Thermococcus, and Vulcanisaeta and the like or variants thereof, including such polymerases as are known in the art such as 9 degrees N, VENT, DEEP VENT, THERMINATOR, Pfu, KOD, Pfx, Tgo and RB69 polymerases.
  • Archaea genera such as, Aeropyrum, Archaeglobus, De sulfurococcus, Pyrobaculum, Pyrococcus, Pyrolobus, Pyrodictium, Staphylothermus, Stetteria, Sulfolobus, Thermococcus, and Vulcanisaeta and the like or variants thereof, including such polymerases as are known
  • the sequencing comprises conducting sequencing reactions inside a cellular sample, where the cDNA amplicons are the concatemer molecules.
  • the sequencing employs non-labeled chain-terminating nucleotides.
  • a cycle of sequencing comprisesthe steps of (a) sequentially contacting a primed concatemer (e.g., a concatemer annealed to a plurality of sequencing primers) with at least two separate mixtures under ternary complex stabilizing conditions, wherein the at least two separate mixtures each include a polymerase and a nucleotide, whereby the sequentially contacting results in the primed concatemer being contacted, under the ternary complex stabilizing conditions, with nucleotide cognates for first, second and third base type base types in the template; (b) examining the at least two separate mixtures to determine whether a ternary complex formed; and (c) identifying the next correct nucleotide for the primed concatemer, wherein the next correct nucleotide is identified as a cognate of the first, second or third base type if ternary complex is detected in step (b), and wherein the next correct nucleotide is imputed to be a nucleotide cognate
  • the repeating comprises repeating at least 2 times, 5 times, 10 times, 15 times, 20 times, 25 times, 30 times, 35 times, 40 times, 45 times, or at least 50 times.
  • the sequencing is sequencing-by-binding (SBB).
  • a cycle of sequencing-by- binding can comprise sequentially contacting a primed concatemer (e.g., a concatemer annealed to a plurality of sequencing primers) with atleasttwo separate mixturesunder ternary complex stabilizing conditions, wherein the atleasttwo separate mixtures can each include a polymerase and a nucleotide, whereby the sequentially contacting can result in the primed concatemer being contacted, under the ternary complex stabilizing conditions, with nucleotide cognates for first, second and third base type base types in the template.
  • a primed concatemer e.g., a concatemer annealed to a plurality of sequencing primers
  • atleasttwo separate mixtures can each include a polymerase and a nucleotide
  • a cycle of sequencing-by-binding can comprise examining the at least two separate mixtures to determine whether a ternary complex formed.
  • a cycle of sequencing-by-binding can comprise identifying the next correct nucleotide for the primed concatemer, wherein the next correct nucleotide can be identified as a cognate of the first, second or third base type if ternary complex was detected and wherein the next correct nucleotide can be imputed to be a nucleotide cognate of a fourth base type based on the absence of a ternary complex.
  • a cycle of sequencing-by-binding can comprise adding a next correct nucleotide to the primer of the primed concatemer, thereby producing an extended primer. Sequencing-by-binding methods are described in U.S. patent Nos. 10,246,744 and 10,731,141 (where the contents ofboth patents are hereby incorporated by reference in their entireties). In some embodiments, no more than 2-30 sequencing-by-binding (SBB) cycles can be conducted
  • the present disclosure provides methods for detecting the sequence of a nucleic acid in a biological sample.
  • the nucleic acid maybe, e.g., DNA orRNA.
  • the biological sample may be a cellular sample.
  • the present disclosure provides methods for detecting in situ at least two different target RNA molecules in a cellular sample, which can include conducting sequencing reactions inside the cellular sample, where the cDNA amplicons canbe the concatemer molecules.
  • any of the sequencing methods described herein can employ at least one nucleotide.
  • the nucleotides comprise abase, sugar and at least one phosphate group.
  • at least one nucleotide in the plurality comprises an aromatic base, a five carbon sugar (e.g., ribose or deoxyribose), and one or more phosphate groups (e.g., 1 -10 phosphate groups).
  • the plurality of nucleotides can comprise at least one type of nucleotide selected from a group consisting of d ATP, dGTP, dCTP, dTTP and dUTP.
  • the plurality of nucleotides can comprise at a mixture of any combination of two or more types of nucleotides selected from a group consisting of dATP, dGTP, dCTP, dTTP and/or dUTP.
  • at least one nucleotide in the plurality is not a nucleotide analog.
  • at least one nucleotide in the plurality comprises a nucleotide analog.
  • Atleast one nucleotide in the plurality of nucleotides comprise a chain of one, two or three phosphorus atoms where the chain is typically attached to the 5’ carbon of the sugar moiety via an ester or phosphoramide linkage.
  • at least one nucleotide in the plurality is an analog having a phosphorus chain in which the phosphorus atoms are linked together with intervening O, S, NH, methylene or ethylene.
  • the phosphorus atoms in the chain include substituted side groups including O, S orBH3.
  • the chain includes phosphate groups substituted with analogs including phosphoramidate, phosphorothioate, phosphordithioate, and O-methyl phosphoramidite groups.
  • At least one nucleotide in the plurality of nucleotides comprises a terminator nucleotide analog having a chain terminating moiety (e.g., blocking moiety) at the sugar 2’ position, at the sugar 3’ position, or at the sugar 2’ and 3’ position.
  • the chain terminating moiety can inhibit polymerase-catalyzed incorporation of a subsequent nucleotide unit or free nucleotide in a nascent strand during a primer extension reaction.
  • the chain terminating moiety is attached to the 3 ’ sugar hydroxyl position where the sugar comprises a ribose or deoxyribose sugar moiety. In some embodiments, the chain terminating moiety is removable/cleavable from the 3 ’ sugar hydroxyl position to generate a nucleotide having a 3 ’ OH sugar group which is extendible with a sub sequent nucleotide in a polymerase-catalyzed nucleotide incorporation reaction.
  • the chain terminating moiety comprises an alkyl group, alkenyl group, alkynyl group, allyl group, aryl group, benzyl group, azide group, amine group, amide group, keto group, isocyanate group, phosphate group, thiol group, disulfide group, carbonate group, urea group, or silyl group.
  • the chain terminating moiety is cleavable/removable from the nucleotide, for example by reacting the chain terminating moiety with a chemical agent, pH change, light or heat.
  • the chain terminating moieties alkyl, alkenyl, alkynyl and allyl are cleavable with tetrakis(triphenylphosphine)palladium(0) (Pd(PPh3)4) with piperidine, or with 2,3 -Dichloro-5,6- dicyano-l,4-benzo-quinone (DDQ).
  • the chain terminating moieties aryl and benzyl are cleavable with H2 Pd/C.
  • the chain terminating moieties amine, amide, keto, isocyanate, phosphate, thiol, disulfide are cleavable with phosphine or with a thiol group including beta-mercaptoethanol or dithiothritol (DTT).
  • the chain terminating moiety carbonate is cleavable with potassium carbonate (K2CO3) in MeOH, with triethylamine in pyridine, or with Zn in acetic acid (AcOH).
  • the chain terminating moieties urea and silyl are cleavable with tetrabutylammonium fluoride, pyridine-HF, with ammonium fluoride, or with triethylamine trihydrofluoride.
  • Atleast one nucleotide in the plurality of nucleotides comprises a terminator nucleotide analog having a chain terminating moiety (e.g., blocking moiety) at the sugar 2’ position, atthe sugar 3’ position, or at the sugar 2’ and 3 ’ position.
  • the chain terminating moiety comprises an azide, azido or azidomethyl group.
  • the chain terminating moiety comprises a 3 ’-O-azido or 3 ’-O-azidomethyl group.
  • the chain terminating moieties azide, azido and azidomethyl group are cleavable/removable with a phosphine compound.
  • the phosphine compound comprises a derivatized tri -alkyl phosphine moiety or a derivatized tri -aryl phosphine moiety.
  • the phosphine compound comprises Tris(2-carboxyethyl)phosphine (TCEP) or bis-sulfo triphenyl phosphine (BS-TPP) or Tri(hydroxyproyl)phosphine (THPP).
  • the cleaving agent comprises 4 -dimethylaminopyridine (4-DMAP).
  • the nucleotide comprises a chain terminating moiety which is selected from a group consisting of 3 ’-deoxy nucleotides, 2’, 3 ’-dideoxy nucleotides, 3 ’-methyl, 3 ’-azido, 3 ’-azidomethyl, 3’-O- azidoalkyl, 3 ’-O-ethynyl, 3 ’-O-aminoalkyl, 3’-O-fluoroalkyl, 3 ’-fluoromethyl, 3’- difluoromethyl, 3 ’ -trifluoromethyl, 3 ’-sulfonyl, 3 ’-malonyl, 3 ’-amino, 3 ’-O-amino, 3’- sulfhydral, 3 ’-aminomethyl, 3 ’-ethyl, 3 ’butyl, 3 ‘ -tert
  • the plurality of nucleotides comprises a plurality of nucleotides labeled with detectable reporter moiety.
  • the detectable reporter moiety comprises a fluorophore.
  • the fluorophore is attached to the nucleotide base.
  • the fluorophore is attached to the nucleotide base with a linker which is cleavable/removable from the base.
  • at least one of the nucleotides in the plurality is not labeled with a detectable reporter moiety.
  • a particular detectable reporter moiety e g., fluorophore
  • the nucleotide base e.g., dATP, dGTP, dCTP, dTTP or dUTP
  • the nucleotide base e.g., dATP, dGTP, dCTP, dTTP or dUTP
  • the cleavable linker on the nucleotide base comprises a cleavable moiety comprising an alkyl group, alkenyl group, alkynyl group, allyl group, aryl group, benzyl group, azide group, amine group, amide group, keto group, isocyanate group, phosphate group, thio group, disulfide group, carbonate group, urea group, or silyl group.
  • the cleavable linker on the base is cleavable/removable from the base by reacting the cleavable moiety with a chemical agent, pH change, light or heat.
  • the cleavable moieties alkyl, alkenyl, alkynyl and allyl are cleavable with tetrakis(triphenylphosphine)palladium(0) (Pd(PPh3)4) with piperidine, or with 2,3 -Dichloro-5,6- dicyano-l,4-benzo-quinone (DDQ).
  • the cleavable moieties aryl and benzyl are cleavable with H2 Pd/C.
  • the cleavable moieties amine, amide, keto, isocyanate, phosphate, thiol, disulfide are cleavable with phosphine or with a thiol group including beta-mercaptoethanol or dithiothritol (DTT).
  • the cleavable moiety carbonate is cleavable with potassium carbonate (K2CO3) in MeOH, with triethylamine in pyridine, or with Zn in acetic acid (AcOH).
  • the cleavable moieties urea and silyl are cleavable with tetrabutylammonium fluoride, pyridine -HF, with ammonium fluoride, or with tri ethylamine trihydro fluoride.
  • the cleavable linker on the nucleotide base comprises cleavable moiety including an azide, azido or azidomethyl group.
  • the cleavable moieties azide, azido and azidomethyl group are cleavable/removable with a phosphine compound.
  • the phosphine compound comprises a derivatized tri-alkyl phosphine moiety or a derivatized tri-aryl phosphine moiety.
  • the phosphine compound comprises Tris(2- carboxyethyl)phosphine (TCEP) orbis-sulfo triphenyl phosphine (BS-TPP) or Tri(hydroxyproyl)phosphine (THPP).
  • the cleaving agent comprises 4- dimethylaminopyridine (4-DMAP).
  • the chain terminating moiety (e.g., at the sugar 2’ and/or sugar 3’ position) and the cleavable linker on the nucleotide base have the same or different cleavable moieties.
  • the chain terminating moiety (e.g., at the sugar 2’ and/or sugar 3 ’ position) and the detectable reporter moiety linked to the base are chemically cleavable/removable with the same chemical agent.
  • the chain terminating moiety e.g., at the sugar 2’ and/or sugar 3’ position
  • the detectable reporter moiety linked to the base are chemically cleavable/removable with different chemical agents.
  • Methods as described herein may employ at least one multivalent molecule.
  • a multivalent binding complex may be formed between one or more polymer-nucleotide conjugates, one or more polymerases, one or more plurality of primed target nucleic acid molecules, or any combination thereof.
  • the multivalent binding complex may be stable, and may be used in any sequencing method as described herein.
  • the multivalent binding complex may allow a detection or base-calling step in a sequencing cycle to be separated from the nucleotide incorporation step.
  • Multivalent molecules as disclosed herein may comprise a plurality of nucleotide arms attached to a core and having any configuration including a starburst, helter skelter, or bottle brush configuration (e g., FIG. 7).
  • the multivalent molecule comprises: (1) a core; and (2) a plurality of nucleotide arms which comprise ⁇ ) a core attachment moiety, (ii) a spacer comprising a PEG moiety, (iii) a linker, and (iv) a nucleotide unit, wherein the core is attached to the plurality of nucleotide arms, wherein the spacer is attached to the linker, wherein the linker is attached to the nucleotide unit.
  • the multivalent molecule can comprise a core. In some embodiments, the multivalent molecule can comprise one or more linker moieties (e g., FIG. 26). In some embodiments, the multivalent molecule can comprise a plurality of nucleotide arms. In some embodiments, a nucleotide arm from the plurality of nucleotide arms can comprise a core In some embodiments, a nucleotide arm can comprise a spacer. In some embodiments, the spacer can comprise a PEG moiety. In some embodiments, a nucleotide arm can comprise a linker. In some embodiments, a nucleotide arm can comprise a nucleotide unit.
  • the spacer can be attached to the linker.
  • the linker can be attached to the nucleotide unit.
  • the core can be attached to the plurality of nucleotide arms.
  • the nucleotide unit can comprise a base, sugar and at least one phosphate group, and the linker can be attached to the nucleotide unit through the base.
  • the linker can comprise an aliphatic chain or an oligo ethylene glycol chain where both linker chains can have 2-6 subunits.
  • the linker can also include an aromatic moiety.
  • a non-limiting example nucleotide arm is shown in FIG. 11.
  • Non-limiting example multivalent molecules are shown in FIGS.
  • a multivalent molecule comprises a core attached to multiple nucleotide arms, and wherein the multiple nucleotide arms have the sametype of nucleotide unit which is selected from a group consisting of dATP, dGTP, dCTP, dTTP and dUTP.
  • a multivalent molecule comprises a core attached to multiple nucleotide arms, where each arm includes a nucleotide unit.
  • the nucleotide unit comprises an aromatic base, a five carbon sugar (e.g., ribose or deoxyribose), and one or more phosphate groups (e.g., 1 -10 phosphate groups).
  • the plurality of multivalent molecules can comprise one type multivalent molecule having one type of nucleotide unit selected from a group consisting of dATP, dGTP, dCTP, dTTP and dUTP.
  • the plurality of multivalent molecules can comprise at a mixture of any combination of two or more types of multivalent molecules, where individual multivalent molecules in the mixture comprise nucleotide units selected from a group consisting of dATP, dGTP, dCTP, dTTP and/or dUTP.
  • the nucleotide unit comprises a chain of one, two or three phosphorus atoms where the chain is typically attached to the 5’ carbon of the sugar moiety via an ester or phosphoramide linkage.
  • at least one nucleotide unit is a nucleotide analog having a phosphorus chain in which the phosphorus atoms are linked together with intervening O, S, NH, methylene or ethylene.
  • the phosphorus atoms in the chain include substituted side groups including O, S or BH3.
  • the chain includes phosphate groups substituted with analogs including phosphoramidate, phosphorothioate, phosphordithioate, and O-methyl phosphoramidite groups.
  • the multivalent molecule comprises a core attached to multiple nucleotide arms, and wherein individual nucleotide arms comprise a nucleotide unit which is a nucleotide analog having a chain terminating moiety (e g., blocking moiety) at the sugar 2’ position, at the sugar 3’ position, or at the sugar 2’ and 3’ position.
  • the nucleotide unit comprises a chain terminating moiety (e.g., blocking moiety) atthe sugar 2’ position, atthe sugar 3’ position, or at the sugar 2’ and 3’ position.
  • the chain terminating moiety can inhibit polymerase-cataly zed incorporation of a subsequent nucleotide unit or free nucleotide in a nascent strand during a primer extension reaction.
  • the chain terminating moiety is attached to the 3 ’ sugar hydroxyl position where the sugar comprises a ribose or deoxyribose sugar moiety.
  • the chain terminating moiety is removable/cleavable from the 3 ’ sugar hydroxyl position to generate a nucleotide having a 3 ’ OH sugar group which is extendible with a sub sequent nucleotide in a polymerase-catalyzed nucleotide incorporation reaction.
  • the chain terminating moiety comprises an alkyl group, alkenyl group, alkynyl group, allyl group, aryl group, benzyl group, azide group, amine group, amide group, keto group, isocyanate group, phosphate group, thiol group, disulfide group, carbonate group, urea group, or silyl group.
  • the chain terminating moiety is cleavable/removable from the nucleotide unit, for example by reacting the chain terminating moiety with a chemical agent, pH change, light or heat.
  • the chain terminating moieties alkyl, alkenyl, alkynyl and allyl are cleavable with tetrakis(tfiphenylphosphine)palladium(0) (Pd(PPh 3 ) 4 ) with piperidine, or with 2,3-Dichloro-5,6-dicyano-l,4-benzo-quinone (DDQ).
  • the chain terminating moieties aryl and benzyl are cleavable with H2 Pd/C.
  • the chain terminating moieties amine, amide, keto, isocyanate, phosphate, thiol, disulfide are cleavable with phosphine or with a thiol group including beta -mercaptoethanol or dithiothritol (DTT).
  • the chain terminating moiety carbonate is cleavable with potassium carbonate (K2CO3) in MeOH, with triethylamine in pyridine, or with Zn in acetic acid (AcOH).
  • the chain terminating moieties urea and silyl are cleavable with tetrabutylammonium fluoride, pyridine-HF, with ammonium fluoride, or with triethylamine trihydrofluoride.
  • the nucleotide unit comprises a chain terminating moiety (e g., blocking moiety) at the sugar2’ position, atthe sugar3 ’ position, or at the sugar2’ and 3 ’ position.
  • the chain terminating moiety comprises an azide, azido or azidomethyl group.
  • the chain terminating moiety comprises a 3 ’ -O-azido or 3’-O-azidomethyl group.
  • the chain terminating moieties azide, azido and azidomethyl group are cleavable/removable with a phosphine compound.
  • the phosphine compound comprises a derivatized tri -alkyl phosphine moiety or a derivatized tri-aryl phosphine moiety.
  • the phosphine compound comprises Tris(2-carboxyethyl)phosphine (TCEP) or bis-sulfo triphenyl phosphine (B S-TPP) or Tri(hydroxyproyl)phosphine (THPP).
  • the cleaving agent comprises 4- dimethylaminopyridine (4-DMAP).
  • the nucleotide unit comprising a chain terminating moiety which is selected from a group consisting of 3 ’-deoxy nucleotides, 2 ’,3 ’-dideoxynucleotides, 3 ’- methyl, 3 ’-azido, 3 ’-azidomethyl, 3’-O-azidoalkyl, 3’-O-ethynyl, 3 ’-O-aminoalkyl, 3’-O- fluoroalkyl, 3 ’-fluoromethyl, 3 ’-difluoromethyl, 3 ’-trifluoromethyl, 3 ’-sulfonyl, 3 ’-malonyl, 3’- amino, 3’-O-amino, 3’-sulfhydral, 3 ’-aminomethyl, 3 ’-ethyl, 3 ’butyl, 3 '-tert butyl, 3’- Fluoren
  • the multivalent molecule comprises a core attached to multiple nucleotide arms, wherein the nucleotide arms comprise a spacer, linker and nucleotide unit, and wherein the core, linker and/or nucleotide unit is labeled with detectable reporter moiety.
  • the detectable reporter moiety comprises a fluorophore.
  • a particular detectable reporter moiety e.g., fluorophore
  • the multivalent molecule can correspond to the base (e.g., dATP, dGTP, dCTP, dTTP or dUTP) of the nucleotide unit to permit detection and identification of the nucleotide base.
  • At least one nucleotide arm of a multivalent molecule has a nucleotide unit that is attached to a detectable reporter moiety.
  • the detectable reporter moiety is attached to the nucleotide base.
  • the detectable reporter moiety comprises a fluorophore.
  • a particular detectable reporter moiety (e.g., fluorophore) that is attached to the multivalent molecule can correspond to the base (e.g., dATP, dGTP, dCTP, dTTP or dUTP) of the nucleotide unit to permit detection and identification of the nucleotide base.
  • the core of a multivalent molecule comprises an avidin-like or streptavidin-like moiety and the core attachment moiety comprises biotin.
  • the core of a multivalent molecule can comprise an avidin -like moiety.
  • the core of a multivalent molecule can comprise a streptavidin -like moiety.
  • the core of a multivalent molecule can comprise the core attachment moiety can comprise biotin.
  • the core comprises an streptavidin -type or avidin- type moiety which includes an avidin protein, as well as any derivatives, analogs and other nonnative forms of avidin that can bind to at least one biotin moiety.
  • avidin moieties include native and recombinant avidin and streptavidin as well as derivatized molecules, e.g. non-glycosylated avidin and truncated streptavidins .
  • avidin moiety includes de- glycosylated forms of avidin, bacterial streptavidin produced by Streptomyces (e.g., Streptomyces avidinii), as well as derivatized forms, for example, N-acyl avidins, e.g., N-acetyl, N-phthalyl and N-succinyl avidin, and the commercially -available products EXTRA VIDIN, CAPTA VIDIN, NEUTRA VIDIN and NEUTRALITE AVIDIN.
  • any of the methods for sequencing nucleic acid molecules described herein can include forming a binding complex, where the binding complex comprises (i) a polymerase, a nucleic acid concatemer molecule duplexed with a primer, and a nucleotide, or the binding complex comprises (ii) a polymerase, a nucleic acid concatemer molecule duplexed with a primer, and a nucleotide unit of a multivalent molecule.
  • the binding complex can comprise (i) a polymerase, a nucleic acid concatemer molecule duplexed with a primer, and a nucleotide.
  • the binding complex can comprise (ii) a polymerase, a nucleic acid concatemer molecule duplexed with a primer, and a nucleotide unit of a multivalent molecule.
  • the binding complex has a persistence time of greater than about 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9 or 1 second.
  • the binding complex has a persistence time of greater than about 0.1 -0.25 seconds, or about 0.25-0.5 seconds, or about0.5-0.75 seconds, or aboutO.75-1 second, or about 1-2 seconds, or about 2-3 seconds, or about 3 -4 second, or about 4-5 seconds, and/or wherein the methodis or may be carried out at a temperature of at or above 15 °C, at or above 20 °C, at or above 25 °C, at or above 35 °C, at or above 37 °C, at or above 42 °C at or above 55 °C at or above 60 °C, or at or above 72 °C, or at or above 80 °C, or within a range definedby any of the foregoing, or a combination of the binding complex can have a persistence time of greater that about 0.1 -0.25 seconds, or about 0.25-0.5 seconds, or aboutO.5-0.75 seconds, or about O.75-1 second, or about 1-2 seconds, or about 2-3 seconds, or about 3 -4 second, or about 4-5 seconds
  • the binding complex (e.g., ternary complex) remains stable until subjected to a condition that causes dissociation of interactions between any of the polymerase, template molecule, primer and/or the nucleotide unit or the nucleotide.
  • a dissociating condition comprises contacting the binding complex with any one or any combination of a detergent, EDTA and/or water.
  • the present disclosure provides said method wherein the binding complex is deposited on, attached to, or hybridized to, a surface showing a contrast to noise ratio in the detecting step of greaterthan 20.
  • the present disclosure provides said method wherein the contacting is performed under a condition that stabilizes the binding complex when the nucleotide or nucleotide unit is complementary to a next base of the template nucleic acid, and destab ilizesthe binding complex when the nucleotide or nucleotide unit is not complementary to the next base of the template nucleic acid.
  • Multivalent binding compositions as disclosed herein may comprise a particle-nucleotide conjugate having a plurality of copies of a nucleotide attached to the particle.
  • the multivalent binding composition may allow one to localize detectable signals, and can be used to identify sites of base incorporation in elongating nucleic acid chains during polymerase reactions .
  • Multivalent binding compositions may be used to provide improvedbase discrimination for sequencing and array based applications.
  • the binding of the plurality of first complexed polymerases with the plurality of multivalent molecules forms at least one avidity complex.
  • the method can comprise: (a) binding a first nucleic acid primer, a first sequencing polymerase, and a first multivalent molecule to a first portion of a concatemer template molecule thereby forming a first binding complex, wherein a first nucleotide unit of the first multivalent molecule can bind to the first sequencing polymerase; and (b) binding a second nucleic acid primer, a second sequencing polymerase, and the first multivalent molecule to a second portion of the same concatemer template molecule thereby forming a second binding complex, wherein a second nucleotide unit of the first multivalent molecule can bind to the second sequencing polymerase, wherein the first and second binding complexes which can include the same multivalent molecule forms an avidity complex.
  • the method can comprise binding a first nucleic acid primer, a first sequencing polymerase, and a first multivalent molecule to a first portion of a concatemer template molecule thereby forming a first binding complex, wherein a
  • the first multivalent molecule can be detectably -labeled multivalent molecule.
  • a first nucleotide unit of the first multivalent molecule can bind to the first sequencing polymerase.
  • the method can comprise binding a second nucleic acid primer, a second sequencing polymerase, and the first multivalent molecule to a second portion of the same concatemer template molecule thereby forming a second binding complex.
  • a second nucleotide unit of the first multivalent molecule can bind to the second sequencing polymerase, wherein the first and second binding complexes which can include the same multivalent molecule forms an avidity complex.
  • the first multivalent molecule can comprise a core attached to a plurality of nucleotide arms.
  • each nucleotide arm can be attached to a nucleotide unit.
  • the binding can be conducted under a condition suitable to inhibit polymerase-catalyzed incorporation of the first nucleotide unit and the second nucleotide unity in the first binding complex and the second binding complex.
  • the method can further comprise detecting the first bin ding complex and the second binding complex on the concatemer molecule.
  • the method can comprise detecting the first binding complex on the concatemer molecule.
  • the method can comprise detecting the second binding complex on the concatemer molecule.
  • the method can further comprise identifying the first nucleotide unit in the first binding complex thereby determining the sequence of the first portion of the concatemer molecule and identifying the second nucleotide unit in the second binding complex thereby determining the sequence of the second portion of the concatemer molecule. In some embodiments, the method can comprise identifying the first nucleotide unit in the first binding complex thereby determining the sequence of the first portion of the concatemer molecule. In some embodiments, the method can comprise identifying the second nucleotide unit in the second binding complex thereby determining the sequence of the second portion of the concatemer molecule. In some embodiments, the first sequencing polymerase can comprise any wild type or mutant polymerase described herein.
  • the second sequencing polymerase can comprise any wild type or mutant polymerase described herein.
  • Th e concatemer template molecule can comprise tandem repeat sequences of a sequence of interest and at least one universal sequencing primer binding site.
  • the first and second nucleic acid primers can bind to a sequencing primer binding site along the concatemer template molecule.
  • Example multivalent molecules are shown in FIGS. 7-10.
  • the method can include binding the plurality of first complexed polymerases with the plurality of multivalent molecules to form at least one avidity complex, the method can comprise: (a) contactingthe plurality of sequencing polymerases and the plurality of nucleic acid primers with different portions of a concatemer nucleic acid concatemer molecule to form at least first and second complexed polymerases on the same concatemer template molecule; (b) contacting a plurality of multivalent molecules to the at least first and second complexed polymerases on the same concatemer template molecule, under conditions suitable to bind a single multivalent molecule from the plurality to the first and second complexed polymerases, wherein at least a first nucleotide unit of the single multivalent molecule can be boundto the first complexed polymerase which can include a first primer hybridized to a first portion of the concatemer template molecule thereby forming a first
  • the method can comprise contacting the plurality of sequencing polymerases and the plurality of nucleic acid primers with different portions of a concatemer nucleic acid concatemer molecule to form at least first and second complexed polymerases on the same concatemer template molecule.
  • the method can comprise contacting a plurality of multivalent molecules to the at least first and second complexed polymerases on the same concatemer template molecule.
  • the contacting of the plurality of multivalent molecules to the at least first and second complexed polymerases can occur under conditions suitable to bind a single multivalent molecule from the plurality to the first and second complexed polymerases.
  • At least a first nucleotide unit of the single multivalent molecule can be bound to the first complexed p olymerase.
  • the first complexed polymerase can include a first primer hybridized to a first portion of the concatemer template molecule thereby forming a first binding complex (e.g., first ternary complex).
  • at least a second nucleotide unit of the single multivalent molecule can be bound to the second complexed polymerase.
  • the second complexed polymerase can include a second primer hybridized to a second portion of the concatemer template molecule thereby forming a second binding complex (e.g., second ternary complex)
  • the contacting can be conducted under a condition suitable to inhibit polymerase-catalyzed incorporation of the bound first and second nucleotide units in the first and second binding complexes.
  • the first and second binding complexes which can be bound to the same multivalent molecule forms an avidity complex.
  • the method can comprise detecting the first and second binding complexes on the same concatemer template molecule.
  • the method can comprise identifying the first nucleotide unit in the first binding complex thereby determining the sequence of the first portion of the concatemer template molecule, and identifying the second nucleotide unit in the second binding complex thereby determining the sequence of the second portion of the concatemer template molecule. In some embodiments, the method can comprise identifying the first nucleotide unit in the first binding complex thereby determining the sequence of the first portion of the concatemer template molecule. In some embodiments, the method can comprise identifying the second nucleotide unit in the second binding complex thereby determining the sequence of the second portion of the concatemer template molecule.
  • the method can comprise contacting the plurality of concatemer molecules with(i) a plurality of sequencing polymerases and (ii) a plurality of soluble sequencing primers. In some embodiments, the method can comprise contacting the plurality of concatemer molecules with a plurality of sequencing polymerases. In some embodiments, the plurality of sequencing polymerases can comprise a plurality of first sequencing polymerases. In some embodiments, the plurality of sequencing polymerases can comprise a plurality of second sequencing polymerases. In some embodiments, the method can comprise contacting the plurality of concatemer molecules with a plurality of soluble sequencing primers.
  • the contacting can be conducted under a condition suitable to form a plurality of complexed sequencing polymerases.
  • each complexed sequencing polymerase can comprise a sequencing polymerase bounds to a nucleic acid duplex.
  • the nucleic acid duplex can comprise a concatemer molecule hybridized to a soluble sequencing primer.
  • the method can further comprise contacting the plurality of complexed sequencing polymerases with a plurality of nucleotides.
  • the contacting can be conducted under a condition suitable for binding at least one nucleotide to a complexed sequencing polymerase.
  • the plurality of nucleotides can comprise at least one nucleotide analog.
  • the at least one nucleotide analog can be labeled with a fluorophore.
  • the at least one nucleotide analog can have a removable chain terminating moiety at the sugar 3 ’position.
  • the at least one nucleotide analog can be labeled with a fluorophore and can have a removable chain terminating moiety at the sugar 3 ’ position.
  • the method can further comprise incorporating the at least nucleotide analog into the 3 ’ end of the plurality of soluble sequencing primers, thereby generating a plurality of nascent extended sequencing primers.
  • the method can further comprise detecting the incorporated at least one nucleotide analog.
  • the method can further comprise identifying the nucleobase of the incorporated at least one nucleotide analog.
  • the method can comprise contacting the plurality of concatemer molecules with (i) a plurality of sequencing polymerases and (ii) a plurality of soluble sequencing primers. In some embodiments, the method can comprise contacting the plurality of concatemer molecules with a plurality of sequencing polymerases. In some embodiments, the plurality of sequencing polymerases can comprise a plurality of first sequencing polymerases. In some embodiments, the plurality of sequencing polymerases can comprise a plurality of second sequencing polymerases. In some embodiments, the method can comprise contacting the plurality of concatemer molecules with a plurality of soluble sequencing primers.
  • the contacting can be conducted under a condition suitable to form a plurality of complexed sequencing polymerases.
  • each complexed sequencing polymerase can comprise a sequencing polymerase bounds to a nucleic acid duplex.
  • the nucleic acid duplex can comprise a concatemer molecule hybridized to a soluble sequencing primer.
  • the method can further comprise contacting the plurality of first complexed sequencing polymerases with a plurality of detectably labeled multivalent molecules to form a plurality of multivalent-complexed polymerases.
  • the contacting can be conducted under a condition suitable for binding complementary nucleotide units of the plurality of detectably labeled multivalent molecules to at least two of the plurality of first complexed sequencing polymerases, thereby forming the plurality of multivalent-complexed polymerases.
  • the contacting can be conducted under the condition that can inhibit incorporation of the complementary nucleotide units into the plurality of soluble universal sequencing primers of the plurality of multivalent- complexed polymerases
  • individual multivalent molecules in the plurality of detectably labeled multivalent molecules can comprise a core attached to a nucleotide unit.
  • the method can further comprise detecting the plurality of multivalent- complexed polymerases. In some embodiments, the method can further comprise identifying the nucleobase of the complementary nucleotide units that are bound to the plurality of first complexed sequencing polymerases in the plurality of multivalent-complexed polymerases, thereby determining the sequence of a nucleic acid template. In some embodiments, the method can further comprise dissociating the plurality of multivalent-complexed polymerases. In some embodiments, the method can further comprise removing the plurality of first sequencing polymerases and the plurality of detectably labeled multivalent molecules. In some embodiments, the plurality of nucleic acid duplexes can be retained.
  • the method can further comprise contacting the plurality of nucleic acid duplexes with a plurality of second sequencing polymerases. In some embodiments, the contacting can be conducted under a condition suitable for binding the plurality of second sequencing polymerases to the plurality of nucleic acid duplexes, thereby forming a plurality of second complexed sequencing polymerases. In some embodiments, each second complexed sequencing polymerase can comprise a second sequencingpolymerase boundto the nucleic acidduplex. In some embodiments, the method can further comprise contacting the plurality of second complexed sequencing polymerases with a plurality of non-labeled nucleotides.
  • the plurality of non-labeled nucleotides can comprise at least one nucleotide analog that can have a removable chain terminating moiety at the sugar 3 ’ position.
  • the contacting can be conducted under a condition suitable for binding complementary nucleotides from the plurality of non-labeled nucleotides to at least two of the plurality of second complexed sequence polymerases, thereby forming a plurality of nucleotide -complexed polymerases.
  • the contacting can be conducted under the condition suitable for promoting incorporation of the complementary nucleotides into the plurality of soluble universal sequencing primers of the plurality of nucleotide-complexed polymerases.
  • the plurality of sequencing polymerases comprise any wild type or mutant sequencing polymerase described herein.
  • the concatemer template molecule comprises tandem repeat sequences of a sequence of interest and at least one universal sequencing primer binding site.
  • the plurality of nucleic acid primers can bind to a sequencing primer binding site along the concatemer template molecule.
  • Non-limiting example multivalent molecules are shown in FIGS. 7-10.
  • a biological sample maybe processed.
  • a cellular sample maybe extracted from a biological sample.
  • the cellular sample may be extracted from, e.g., blood or tissue.
  • the cellular sample may be separated from other components of the biological sample via immunomagnetic cell separations, fluorescence- activated cell sorting (FACS), density gradient centrifugation, immunodensity cell isolation, microfluidics, or any combination thereof.
  • FACS fluorescence- activated cell sorting
  • density gradient centrifugation immunodensity cell isolation
  • microfluidics or any combination thereof.
  • a cellular sample may be subjectedto any assay, e g., a viability assay, cell proliferation assay, reporter gene assay, cell signaling assays, functional assay, live cell analysis, protein assay, reactive oxygen species assay, or any combination thereof.
  • any assay e g., a viability assay, cell proliferation assay, reporter gene assay, cell signaling assays, functional assay, live cell analysis, protein assay, reactive oxygen species assay, or any combination thereof.
  • the cellular sample can be deposited onto a solid support (e.g., a flow cell).
  • the cellular sample is deposited onto a flow cell having walls (e.g., top or first wall, and bottom or secondwall) and a gap in-between, where the gap can be filled with a fluid, where the flow cell is positioned in a fluorescence optical imaging system.
  • the cellular sample has a thickness that may require using the imaging system to focus separately on the first and second surfaces of the flow cell, when using a traditional imaging system.
  • the flow cell can be positioned in a high performance fluorescence imaging system, which comprisestwo or more tube lenses which are designed to provide optimal imaging performance for the first and second surfaces of the flow cell at two or more fluorescence wavelengths.
  • the high-performance imaging system further comprises a focusing mechanism configured to refocus the optical system between acquiring images of the first and second surfaces of the flow cell.
  • the high performance imaging system is configured to image two or more fields-of-view on at least one of the first flow cell surface or the second flow cell surface.
  • the solid support comprises a flow cell having a coating that promotes cell adhesion.
  • the flow cell comprises a support which can be a planar or non -planar support.
  • the support can be solid or semi-solid.
  • the support can be porous, semi-porous or non-porous.
  • the support can be made of any material such as glass, plastic or a polymer material.
  • the surface of the support can be coated with one or more compounds to produce a passivated layer on the support.
  • the passivated layer forms a porous or semi-porous layer.
  • the support is coated with a lysine compound, poly -lysine compound, arginine compound or an amino-terminated compound.
  • the support can be coated with an unbranched compound, a branched compound, or a mixture of unbranched and branched compounds.
  • the support is coated with surface primers for capturing nucleic acids from the cellular sample. Alternatively, the support lacks surface primers.
  • any combination of the steps for conducting in situ reiterative short read sequencing can be performed in an automated mode using the fluid dispensing system, including cell seeding, cell fixation, cell permeabilization, reverse transcription reactions, padlock probe hybridization, padlock probe ligation reaction, rolling circle amplification, and sequencing
  • the present disclosure provides apparatus and methods for growing/ culturing a cellular sample on a flow cell and conducting nucleic acid workflows of the cultured cellular sample on the flow cell.
  • the cellular sample is deposited on the flow cell.
  • the flow cell can be coated with a reagent that promotes cell adhesion to the flow cell.
  • the flow cell, having a cell sample adhered thereon, can be placed onto a sequencing apparatus having a flow cell holder/cradle which is fluidically connected to an automated fluid dispensing system and configured on a fluorescent microscope.
  • the sequencing apparatus can be configured with at least one fluidic delivery device, at least one fluidics device (e g., microfluidics device), at least one imaging device and/or at least one sensor to detect signals from the sequencing reactions.
  • the automated fluid dispensing system can beused to deliver simple or complex cell culture media to the cellular sample on the flow cell.
  • the cellular sample can be cultured/expanded on the flow cell for 2-10 generations or more.
  • the cellular sample can be expanded to confluence or non - confluence.
  • the automated fluid dispensing system can beused to deliver fixation reagents to the expanded cellular sample on the flow cell, and the cellular sample can be incubated under conditions suitable for cell fixation.
  • the automated fluid dispensing system can beused to deliver permeabilization reagents to the fixed cellular sample on the flow cell, and the cellular sample can be incubated under conditions suitable for cell permeabilization.
  • the automated fluid dispensing system can beused to deliver reagents for conducting reverse transcription of RNA inside the fixed and permeabilized cellular sample under a condition suitable for generating a plurality of cDNA inside the cellular sample.
  • the automated fluid dispensing system can beused to deliver reagents for conducting padlock probe hybridization, circularization and ligation under a condition suitable for generating a plurality of covalently closed circular padlock probes inside the cellular sample.
  • the automated fluid dispensing system can beused to deliver reagents for conducting rolling circle amplification under a condition suitable for generating a plurality of concatemer molecules inside the cellular sample.
  • the automated fluid dispensing system can beused to deliver sequencing reagents for conducting sequencing cycles of the concatemer molecules under a condition suitable for generating a plurality of sequencing read products inside the cellular sample.
  • individual cycle times can be achieved in less than 30 minutes.
  • the field of view (FOV) can exceed 1 mm 2 and the cycle time for scanning large area (> 10 mm 2 ) can be less than 5 minutes.
  • the automated fluid dispensing system can beused to deliver reagents for removing the plurality of sequencing read products from the concatemer molecules and retaining the concatemer molecules inside the cellular sample.
  • the automated fluid dispensing system can beused to deliver sequencing reagents for conducting sequencing cycles of the concatemer molecules under a condition suitable for generating another plurality of sequencing read products inside the cellular sample.
  • individual cycle times can be achieved in less than 30 minutes.
  • the field of view (FOV) can exceed 1 mm 2 and the cycle time for scanning large area (> 10 mm 2 ) can be less than 5 minutes.
  • kits for carrying out the methods disclosed herein using the systems and compositions disclosed herein.
  • a kit may comprise a detectable polymernucleotide conjugate.
  • the polymer-nucleotide conjugate may comprise a polymer core.
  • the polymer-nucleotide conjugate may comprise two or more nucleotide moieties attached to the polymer core.
  • the kits described herein may have at least one, two, three, or four different types of detectable polymer-nucleotide conjugate, for example, in which each type of detectable polymer-nucleotide conjugate has a different nucleotide moiety.
  • the kit may have a substrate comprising a surface having coupled thereto a polymer layer suitable to immobilize a biological sample or derivative thereof to said surface.
  • the biological sample e.g., cell or tissue
  • the biological sample is not included in the kit.
  • the kit may comprise a hybridization buffer disclosed herein, for example, comprising (i) a first polar aprotic solvent having a dielectric constant that is no greater than 40 and having a polarity index of 4-9; and/or (ii) a second polar aprotic solvent having a dielectric constant that is less than or equal to 115.
  • capture oligonucleotides or components thereof, in situ amplification reagents e.g., buffers, primers, detectable labels
  • combinations thereof are included in the kit.
  • kits described herein including instructions for hybridizing at least a portion of said target nucleic acid sequence to at least a portion of a capture oligonucleotide coupled to said surface.
  • the kit may also comprise instructions for identifying at least a portion of the target nucleic acid sequence within the biological sample or derivative thereof by contacting said detectable polymer-nucleotide conjugate with said biological sample or derivative thereof (e.g., containing the target nucleic acid molecule) under conditions sufficient to form a multivalent binding complex between said two or more nucleotide moieties and said target nucleic acid sequence.
  • the kit may also comprise instructions for identifying at least a portion of a sub - cellular component within a cell or tissue in situ by contacting said detectable polymer- nucleotide conjugate with said sub -cellular component under conditions sufficient to form a multivalent binding complex between said two or more nucleotide moieties and said sub - cellular component.
  • the kit also contains other useful components, such as, diluents, buffers, pharmaceutically acceptable carriers, syringes, catheters, applicators, pipetting or measuring tools, bandaging materials or other useful paraphernalia.
  • the materials or components assembled in the kit can be provided to the practitioner stored in any convenient and suitable ways that preserve their operability and utility.
  • the components can be in dissolved, dehydrated, or lyophilized form; they can be provided at room, refrigerated or frozen temperatures.
  • the components are typically contained in suitable packaging material(s).
  • packaging material refers to one or more physical structures used to house the contents of the kit, such as compositions and the like.
  • the packaging material is constructed by well-known methods, preferably to provide a sterile, contaminant-free environment.
  • the packaging materials employed in the kit are those customarily utilized in gene expression assays and in the administration of treatments.
  • the term “package” refers to a suitable solid matrix or material such as glass, plastic, paper, foil, and the like, capable of holding the individual kit components.
  • a package can be a glass vial or prefilled syringes used to contain suitable quantities of the pharmaceutical composition.
  • the packaging material has an external label which indicates the contents and/or purpose of the kit and its components.
  • a kit comprises one or more containers comprising: a synthetic polypeptide disclosed herein.
  • the kit comprises a nucleotide unit.
  • the nucleotide unit is detectable.
  • the kit comprises instructions for introducing the synthetic polypeptide and the nucleotide unit to a primed nucleic acid sequence.
  • the primed nucleic acid sequence comprises a nucleotide complementary to the nucleotide unit under conditions sufficient to form a binding complex.
  • the binding complex comprises the synthetic polypeptide, the nucleotide unit, and the primed nucleic acid sequence.
  • the kit further comprises a composition, such as a nucleotide conjugate or a nucleotide conjugate disclosed herein.
  • the composition comprises a core and at least two nucleotide arms.
  • a nucleotide arm of the at least two nucleotide arms comprises: a core attachment moiety coupled to the core; a spacer coupled to the core attachment moiety; a linker coupled to the spacer; and the nucleotide unit coupled to the linker.
  • the kit comprises one or more padlock probes for batch-specific sequencing as disclosed herein.
  • Kits as disclosed herein may comprise one or more antibodies for use in any of the methods as described herein.
  • Kits as disclosed herein may comprise one or more antibody -nucleotide conjugates for use in any of the methods or compositions as described herein.
  • Kits as disclosed herein may comprise one or more sequencing polymerases .
  • Kits as disclosed herein may comprise one or more reverse transcriptase.
  • Kits as disclosed herein may comprise one or more reagents for performing rolling circle amplification (RCA).
  • RCA rolling circle amplification
  • kits for nucleic acid molecule processing comprises a composition disclosed herein, or a formulation disclosed herein.
  • the kit comprises an instruction for use of the composition in a nucleotide identification reaction.
  • the instructions for use of the composition orthe formulation comprise introducing the nucleotide conjugate and/orthe synthetic polypeptide to a nucleic acid sequence (e.g., primed nucleic acid sequence) under conditions sufficient to form a binding complex between a nucleotide of the nucleic acid sequence and a nucleotide unit of the nucleotide conjugate or the synthetic polypeptide or a combination thereof.
  • instructions further comprise use of the composition for performing a nucleotide binding, nucleotide incorporation, or a nucleotide identification reaction therewith.
  • the kit further comprises: an agent that reacts with the reactive group in the linker of the composition. In some embodiments, the kit further comprises: an agent that reacts with the reactive group at the 3 ’ carb on of the sugar moiety in the nucleotide unit of the composition. In some embodiments, the kit further comprises a reagent for use in the nucleotide binding reaction. In some embodiments, the reagent comprises a cation.
  • the kit further comprises: (i) a solution comprising a cation; (ii) one or more polymerizing enzymes; (iii) one or more primer sequences; (iv) one or more unlabeled nucleotides; or any combination of (i) to (iv).
  • the present disclosure provides a kit comprising any one or any combination of two or more of any of the nucleotide conjugates described herein.
  • the kit can comprise for example a plurality of one type of a nucleotide conjugate, a mixture of different types (sub -populations) of the nucleotide conjugates, or a combination of a plurality of one type of a nucleotide conjugate and a mixture of different types of the nucleotide conjugates.
  • the nucleotide conjugates in the kit can be labeled (e.g., fluorescently labeled), non-labeled, or a mixture of labeled and nonlabeled forms.
  • the nucleotide conjugates in the kit can include wild type or mutant forms of a streptavidin or avidin core.
  • the nucleotide conjugates in the kit can include core moietiesthat are labeled with the same type of detectable reporter moiety (e.g., fluorophore) or different types of detectable reporter moieties (e.g., different fluorophores).
  • the nucleotide conjugates in the kit can include nucleotide arms having the same type of spacer or different types of spacers.
  • the nucleotide conjugates in the kit can include nucleotide arms having the same type of linker or different types of linkers.
  • the nucleotide conjugates in the kit can include nucleotide arms comprising linkers with the same type of reactive group or different types of reactive groups.
  • the nucleotide conjugates in the kit can include nucleotide arms having the same type of nucleotide units or different types of nucleotide units.
  • the nucleotide conjugates in the kit can include nucleotide arms having nucleotide units having the same type of reactive groups at the sugar 3 ’ position or different types of reactive groups.
  • the kit can further include one or more chemical agents that react with a reactive group in the linker of the nucleotide conjugates.
  • the kit can further include one or more chemical agents that react with a reactive group at the sugar 3 ’ group in the nucleotide unit of the nucleotide conjugates.
  • the kit can further comprise at least one reagent suitable for use in conducting a nucleotide unit binding reaction, a nucleotide unit incorporation reaction, or a combination of a nucleotide unit binding reaction and a nucleotide unit incorporation reaction.
  • the reagent can comprise cations including any one or any combination of two or more of sodium, magnesium, strontium, barium, potassium, manganese, calcium, lithium, nickel, cobalt, or any combination thereof or other cations suitable for conducting a nucleotide unit binding reaction, a nucleotide incorporation reaction, or a combination of a nucleotide unit binding reaction and a nucleotide unit incorporation reaction.
  • the kit can comprise a reagent comprising a non -catalytic divalent cation including strontium, barium, calcium, or a combination thereof.
  • the kits comprise a reagent comprising a catalytic divalent cation including magnesium, manganese, or a combination of magnesium and manganese.
  • kits can comprise one or more containers that contain any one or any combination of two or more of any of the nucleotide conjugates described herein.
  • the kit can further comprise one or more containers that contain at least one cation, at least one polymerase, primers, a plurality of nucleotides, or a combination thereof.
  • the cation, polymerase and/or nucleotides can be combined in any combination and canbe contained in a single container, or can be contained in separate containers, or any combination thereof.
  • the kit can include instructions for use of the kit for conducting a nucleotide binding reaction, a nucleotide incorporation reaction, a nucleic acid sequencing reaction, or a combination thereof using nucleotide conjugates.
  • the kit is configuredfor detecting /// situ atleasttwo target RNA sequences and at least two polypeptides in a biological sample.
  • the kit comprises the first target polypeptide encoded by the first target nucleic acid molecule or a reverse complement thereof.
  • the second target polypeptide is encoded by the second target nucleic acid molecule or a reverse complement thereof.
  • the full sequence of the first target RNA sequence and the full sequence of the second target RNA sequence have at least one nucleotide of difference.
  • the full sequence of the first target polypeptide and the full sequence of the second target polypeptide have at least one amino acid of difference.
  • the first, second, or third nucleic acid enzyme comprises a nucleic acid ligase, a nucleic acid ligation enzyme, a nucleic acid polymerase, a nucleic acid polymerization enzyme, or combinations thereof.
  • the nucleotide of the plurality of nucleotides comprises a fluorescent label.
  • the nucleotide of the plurality of nucleotides comprises a removable blocking group at the 3 ’ carbon position of the sugar moiety.
  • the plurality of nucleotides consist of at least two of the same type of nucleotide comprising a fluorescent label, wherein the same type of nucleotide is selected from the group comprising dATP, dGTP, dCTP, dTTP, and dUTP, wherein the fluorescent label of one type of nucleotide of the group emits light at a wavelength that is different from the wavelength of light emitted from the fluorescent label of another type of nucleotide of the group.
  • the plurality of nucleotides comprise at least two types of nucleotides, wherein a type of the at least two types of nucleotides comprises a fluorescent label, wherein the at least two types of nucleotides are selected from the group comprising dATP, dGTP, dCTP, dTTP, and dUTP, wherein the fluorescent label of one type of nucleotide of the group emits light at a wavelength that is different from the wavelength of light emitted from the fluorescent label of another type of nucleotide of the group.
  • the nucleotide conjugate comprises a detectable label.
  • the nucleotide conjugate comprises a fluorescent label.
  • the biological sample comprises a human sample, a simian sample, an ape sample, a canine sample, a feline sample, a bovine sample, an equine sample, a murine sample, a porcine sample, a caprine sample, a lupine sample, a ranine sample, a piscine sample, a plant sample, an insect sample, a bacteria sample, an algae sample, a viral sample, a protozoa sample, or a fungus sample.
  • the biological sample comprises a cellular organelle, a cell, a whole cell, a group of whole cells, a tissue, an intact tissue, a tumor, an intact tumor, an organ, or an organism.
  • the biological sample comprises a cellular organelle, a cell, a whole cell, a group of whole cells, a tissue, an intact tissue, a tumor, an intact tumor, an organ, an organism, a protozoa, an algae, a bacteria, a virus, a plant, a fungus, an insect, or an animal.
  • the biological sample comprises a fresh sample, a processed sample, a freshly-frozen sample, a sectioned sample, or a formalin-fixed and paraffin- embedded (FFPE) sample.
  • FFPE formalin-fixed and paraffin- embedded
  • the first target nucleic acid sequence comprises DNA, cDNA, RNA, coding RNA, n on -co ding RN A, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti-sense RNA, mature microRNA, or immature microRNA, and/or, wherein the first target RNA sequence comprises coding RNA, non-coding RNA, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti-sense RNA, mature microRNA, or immature microRNA.
  • the second target nucleic acid sequence comprises DNA, cDNA, RNA, coding RNA, non-coding RNA, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti-sense RNA, mature microRNA, or immature microRNA, and/or wherein the second target RNA sequence comprises coding RNA, non-coding RNA, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti-sense RNA, mature microRNA, or immature microRNA.
  • the blocking group comprises an alkyl group, an alkenyl group, an alkynyl group, an allyl group, an aryl group, a benzyl group, an azide group, an azido group, an O-azidomethyl group, an amine group, an amide group, a keto group, an isocyanate group, a phosphate group, a thiol group, a disulfide group, a carbonate group, a urea group, or a silyl group.
  • the kit further comprises a first oligonucleotide conjugate comprising a first short nucleic acid and a first binding moiety that binds specifically to the first target polypeptide.
  • the first short nucleic acid comprises a first tag sequence, a second tag sequence.
  • the first oligonucleotide conjugate binds specifically to the first target polypeptide through the first binding moiety to form a first binding complex.
  • the first and second tag sequences identify the first binding moiety.
  • the kit comprises a second oligonucleotide conjugate comprising a second short nucleic acid and a second binding moiety that binds specifically to the second target polypeptide.
  • the second short nucleic acid comprises a third tag sequence, a fourth tag sequence, wherein the second oligonucleotide conjugate binds specifically to the second target polypeptide through the second binding moiety to form a second binding complex, wherein the third and fourth tag sequences identify the second binding moiety.
  • the kit further comprises a third oligonucleotide comprising a first end portion and a second end portion.
  • the first end portion and second end portion of the third oligonucleotide are complementary and bind to the first tag sequence and the second tag sequence of the first oligonucleotide conjugate so that the third oligonucleotide forms a circular structure with a gap between the first end portion and the second end portion.
  • the kit further comprises a fourth oligonucleotide comprising a first end portion and a second end portion.
  • the first end portion and second end portion of the fourth oligonucleotide are complementary andbind to the third tag sequence andthe fourth tag sequence so that the fourth oligonucleotide forms a circular structure with a gap between the first end portion and the second end portion.
  • the third oligonucleotide comprises a third identification sequence that identifies the first binding moiety
  • the fourth oligonucleotide comprises a fourth identification sequence that identifiesthe second binding moiety.
  • the first oligonucleotide and third oligonucleotide comprise the same first sequencing primer or the reverse complement thereof
  • the second oligonucleotide andthe fourth oligonucleotide comprise the same second sequencing primer or the reverse complement thereof.
  • the kit further comprises an agent that reacts with the reactive group at the 3 ’ carbon of the sugar moiety in the nucleotide moiety of the nucleotide conjugate.
  • the kit further comprises a reagent for use in the nucleotide binding reaction.
  • the reagent comprises a cation.
  • the kit further comprises a reverse transcriptase, a primer for reverse transcription, a sequencing primer, a reagent configured to permeabilize the biological sample, a reagent configured to fix the biological sample, an unblocked nucleotide, a blocked nucleotide, a reagent for use in the nucleotide incorporation reaction, a solution comprising a cation, one or more unlabeled nucleotides, one or more buffers for reverse transcription, one or more buffers for nucleic acid binding, one or more buffers for nucleic acid amplification, or one or more buffers for nucleic acid dissociation.
  • the kit further comprises instructions foruse of the kitto detect in situ the at least two targetnucleic acid sequences in the biological sample. In some embodiments, the kit further comprises instructions for use of the kit to detect in situ the at least two target RNA sequences and the at least two polypeptides in the biological sample. In some embodiments, the instructions comprise performing a sequencing by synthesis reaction. In some embodiments, the instructions comprise performing a sequencing by binding reaction in which detection in situ is not contemporaneous with a nucleotide incorporation step. In some embodiments, the kit further comprises instructions foruse of the kit using any of the methods provided in the present disclosure.
  • compositions for analyzing or processing a nucleic acid sequence may beused to employ in situ reiterative short read sequencing within a cellular sample including a single cell, multiple cells, a tissue or a tumor. By conducting reiterative short sequencing cycles, the RNA content of the cellular sample can be discovered.
  • Compositions as described herein may be employed in conducting transcriptomics workflows which leverages massively parallel sequencing technologies. Compared to long read sequencing workflows, the reiterative short sequencing cycles described herein use a reduced amount of sequencing reagents which reduces cost and saves time.
  • Compositions for conducting reiterative short sequencing cycles has many uses, including, but not limited to, detecting specific RNAs of interest, mutant RNA sequences, splice variants, and their abundance levels thereof.
  • compositions described herein are to detect and image the spatial localization of RNAs within a cellular sample using massively parallel sequencing. Any method as disclosed herein may be used for such a purpose.
  • compositions described herein offer several advantages over other in situ transcriptomics workflows, including a simpler workflow, fewer reagents, lower cost, less time, gentler conditions on the cellular sample, and no requirement for specialized equipment.
  • compositions for Conducting in situ Reiterative Short Read Sequencing are Compositions for Conducting in situ Reiterative Short Read Sequencing
  • compositions for conducting in situ sequencing in a biological sample One or more nucleic acids may be analyzed in a biological sample, e.g., a cell.
  • the nucleic acid may comprise, e.g., deoxyribonucleic acid (DNA) or ribonucleic acid (RNA)
  • compositions comprising at least two target nucleic acid molecules.
  • the at least two targetnucleic acid sequences may comprise a first target nucleic acid sequence and a second target nucleic acid sequence.
  • the at least two target polypeptides comprise a first target polypeptide encoded by the first target nucleic acid sequence or a reverse complement thereof.
  • the at least two target polypeptides comprise a second target polypeptide encoded by the second target nucleic acid sequence or a reverse complement thereof.
  • the composition further comprises a biological sample
  • the biological sample comprises a first nucleic acid molecule, comprising the first target nucleic acid sequence or a portion thereof, or the reverse complement of the first target nucleic acid sequence or a portion thereof.
  • the biological sample comprises a second nucleic acid molecule, comprising the second target nucleic acid sequence or a portion thereof, or the reverse complement of the second target nucleic acid sequence or a portion thereof.
  • the biological sample comprises a third nucleic acid molecule, comprising a third target nucleic acid sequence or a portion thereof, or the reverse complement of the third target nucleic acid sequence or a portion thereof.
  • the biological sample comprises a fourth nucleic acid molecule, comprising a fourth target nucleic acid sequence or a portion thereof, or the reverse complement of the fourth target nucleic acid sequence or a portion thereof.
  • the composition further comprises a first target complementary DNA (cDNA) sequence.
  • the first target cDNA sequence is generated through reverse transcription of the first target RNA sequence.
  • the composition further comprises a second target cDNA sequence.
  • the second target cDNA sequence is generated through reverse transcription of the second target RNA sequence.
  • the composition further comprises a first oligonucleotide comprising a first end portion and a second end portion.
  • the first end portion and second end portion of the first oligonucleotide are complementary and bind to two neighboring segments of the first target cDNA sequence so that the first oligonucleotide forms a circular structure with a gap between the first end portion and the second end portion .
  • the second oligonucleotide comprises a first end portion and a second end portion.
  • the first end portion and second end portion of the second oligonucleotide are complementary and bind to two neighboring segments of the second target cDNA sequence, so that the second oligonucleotide forms a circular structure with a gap between the first end portion and the second end portion.
  • the second oligonucleotide comprises a second identification sequence th at identifies the second target RNA sequence, a second sequencing primer, and a nucleic acid amplification primer.
  • the first sequencing primer and the second sequencing primer have at least one nucleotide of difference.
  • the composition further comprises a first oligonucleotide conjugate comprising a first short nucleic acid and a first binding moiety .
  • the first binding moiety binds specifically to the first target polypeptide, wherein the first short nucleic acid comprises a first tag sequence, a second tag sequence, wherein the first oligonucleotide conjugate binds specifically to the first target polypeptide through the first bindingmoiety to form a first binding complex, wherein the first and second tag sequences identify the first binding moiety .
  • the composition further comprises a second oligonucleotide conjugate comprising a second short nucleic acid and a second binding moiety that binds specifically to the second target polypeptide.
  • the second short nucleic acid comprises a third tag sequence, a fourth tag sequence, wherein the second oligonucleotide conjugate binds specifically to the second target polypeptide through the second bindingmoiety to form a second binding complex, wherein the third and fourth tag sequences identify the second binding moiety .
  • the composition further comprises a third oligonucleotide comprising a first end portion and a second end portion.
  • the first end portion and second end portion of the third oligonucleotide are complementary.
  • the first end portion and the second end portion bind to the first tag sequence and the second tag sequence of the first oligonucleotide conjugate so that the third oligonucleotide forms a circular structure with a gap between the first end portion and the second end portion .
  • the composition further comprises a fourth oligonucleotide comprising a first end portion and a second end portion.
  • the first end portion and second end portion of the fourth oligonucleotide are complementary and bind to the third tag sequence and the fourth tag sequence so thatthe fourth oligonucleotide forms a circular structure with a gap between the first end portion and the second end portion.
  • the composition further comprises a first circular oligonucleotide.
  • the first circular oligonucleotide is formed by joining the first and second end portions of the first oligonucleotide.
  • the first and second end portions of the second oligonucleotide are joined to produce a second circular oligonucleotide.
  • the first and second end portions of the third oligonucleotide are joinedto produce a third circular oligonucleotide.
  • the first and second end portions of the fourth oligonucleotide are joined to produce a fourth circular oligonucleotide.
  • the first circular oligonucleotide and the third circular oligonucleotide comprise a first sequencing primer or the reverse complement thereof.
  • the second circular oligonucleotide and the fourth circular oligonucleotide comprise a second sequencing primer or the reverse complement thereof.
  • the sequences of the first and second sequencing primers have at least one nucleotide of difference.
  • the first circular oligonucleotide is amplified through rolling circle amplification to produce a first concatemer comprising a plurality of the first circular oligonucleotides .
  • the second circular oligonucleotide is amplified through rolling circle amplification to produce a second concatemer comprising a plurality of the second circular oligonucleotides.
  • the third circular oligonucleotide is amplified through rolling circle amplification to produce a third concatemer comprising a plurality of the third circular oligonucleotides.
  • the fourth circular oligonucleotide is amplified through rolling circle amplification to produce a fourth concatemer comprising a plurality of the fourth circular oligonucleotides.
  • the composition further comprises a first sequencing product nucleic acid moleculethat is complementary and binds to the first concatemer. In some embodiments, the sequence of the first concatemer or the portion thereof consists of 2-30 nucleotides. In some embodiments, the composition comprises a third sequencing product nucleic acid molecule that is complementary and binds to the third concatemer. In some embodiments, the sequence of the third concatemer or the portion thereof consists of 2-30 nucleotides.
  • the composition comprises a second sequencing product nucleic acid molecule that is complementary and binds to the second concatemer. In some embodiments, the sequence of the second concatemer or the portion thereof consists of 2-30 nucleotides. In some embodiments, the composition further comprises a fourth sequencing product nucleic acid molecule that is complementary and binds to the fourth concatemer. In some embodiments, the sequence of the fourth concatemer or the portion thereof consists of 2- 30 nucleotides. In some embodiments, the full sequence of the first target RNA sequence and the full sequence of the second target RNA sequence have at least one nucleotide of difference . In some embodiments, the full sequence of the first target polypeptide and the full sequence of the second target polypeptide have at least one amino acid of difference.
  • the biological sample comprises a cellular organelle, a cell, a whole cell, a group of whole cells, a tissue, an intact tissue, a tumor, an intact tumor, an organ, an organism, a protozoa, an algae, a bacteria, a virus, a plant, a fungus, an insect, or an animal.
  • the biological sample comprises a fresh sample, a processed sample, a freshly -frozen sample, a sectioned sample, or a formalin -fixed and paraffin-embedded (FFPE) sample.
  • FFPE formalin -fixed and paraffin-embedded
  • the first target nucleic acid sequence comprises DNA, cDNA, RNA, coding RNA, non-coding RNA, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti-sense RNA, mature microRNA, or immature microRNA, and/or wherein the first target RNA sequence comprises coding RNA, non-coding RNA, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti-sense RNA, mature microRNA, or immature microRNA.
  • the second target nucleic acid sequence comprises DNA, cDNA, RNA, coding RNA, non-coding RNA, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti-sense RNA, mature microRNA, or immature microRNA, and/or wherein the second target RNA sequence comprises coding RNA, non-coding RNA, mRNA, tRNA, rRNA, miRNA, gRNA, snRNA, siRNA, anti-sense RNA, mature microRNA, or immature microRNA.
  • the biological sample comprises a human sample, a simian sample, an ape sample, a canine sample, a feline sample, a bovine sample, an equine sample, a murine sample, a porcine sample, a caprine sample, a lupine sample, a ranine sample, a piscine sample, a plant sample, an insect sample, a bacteria sample, an algae sample, a viral sample, a protozoa sample, or a fungus sample.
  • the first, second, third or fourth concatem er further comprises a compaction oligonucleotide, wherein: a first segment of the compaction oligonucleotide is complementary and binds to a first portion of the first, second, third, or fourth concatemer In some embodiments, a second segment of the compaction oligonucleotide is complementary and binds to a second portion of the first, second, third, or fourth concatemer, to result in a reduction in the size or a change in the shape of the second concatemer.
  • the first sequencing product nucleic acid molecule comprises a first identification sequence that identifies the first target RNA sequence and the second sequencing product nucleic acid molecule comprises a second identification sequence that identifies the second target RNA sequence.
  • the first sequencing product nucleic acid molecule comprises a first identification sequence that identifies the first target RNA sequence and a portion of the first target RNA sequence
  • the second sequencing product nucleic acid molecule comprises a second identification sequence that identifies the second target RNA sequence and a portion of the second target RNA sequence.
  • the nucleotide comprises a fluorescent label and a removable blocking group at the 3’ carbon position of the sugar moiety .
  • the determining further comprises incorporating the nucleotide into 1he 3’ end of the primer sequence. In some embodiments, the determining further comprises identifying the nucleobaseof the incorporated nucleotide by imaging the fluorescent label of the incorporated nucleotide.
  • the determining further comprises contacting the nucleotide with an agent to remove a blocking group from the nucleotide and generate a 3 ’ OH group on the sugar moiety
  • the blocking group comprises an alkyl group, an alkenyl group, an alkynyl group, an allyl group, an aryl group, a benzyl group, an azide group, an azido group, an O-azidomethyl group, an amine group, an amide group, a keto group, an isocyanate group, a phosphate group, a thiol group, a disulfide group, a carbonate group, a urea group, or a silyl group.
  • the plurality of nucleotides comprise one type of nucleotide selected from a group comprising dATP, dGTP, dCTP, dTTP and dUTP. In some embodiments, the plurality of nucleotides comprise a mixture of any combination of two or more types of nucleotides selected from a group consisting of dATP, dGTP, dCTP, dTTP and/or dUTP.
  • compositions as disclosed herein may comprise two of the first, second, third, or fourth concatemer; and two of a polymerizing enzyme, a plurality of nucleotide conjugates, and two of a primer sequence that is complementary to a portion of the first, second, third, or fourth concatemer.
  • two of the first, second, third, or fourth concatemer are contacted with two of a polymerizing enzyme, a plurality of nucleotide conjugates, and two of a primer sequence that is complementary to a portion of the first, second, third, or fourth concatemer to form a multivalent binding complex comprising each of the two of the polymerizing enzyme, a nucleotide conjugate of the plurality of nucleotide conjugates, and each of the two of the first, second, third, or fourth concatemer hybridized to each of the two of the primer sequence.
  • the nucleotide conjugate comprises a label and at least two of a nucleotide moiety.
  • two of the at least two of the nucleotide moiety are each complementary and bind to a nucleotide of each of the two of the first concatemer.
  • the composition comprises one or more of a second polymerizing enzyme.
  • the composition comprises a plurality of unlabeled nucleotides under conditions suitable for forming a binding complex comprising each of the two of the second polymerizing enzyme, each of the two of the first, second, third, or fourth concatemer hybridized to each of the two of the primer sequence, and two of the plurality of the unlabeled nucleotides and incorporating each of the two of the plurality of the unlabeled nucleotides into each of the two of the primer sequence .
  • each of the two of the plurality of the unlabeled nucleotides is complementary and binds to a nucleotide of each of the two of the first, second, third, or fourth concatemer.
  • an unlabeled nucleotide of the plurality of unlabeled nucleotides comprises a removable blocking group at the 3 ’ carbon of the sugan moiety.
  • the present disclosure provides compositions for detecting in situ at least two different target RNA molecules in a cellular sample.
  • the composition may be configured to interact with a cellular sample deposited on a solid support, wherein the cellular sample harbors at least a first plurality of DNA amplicons that correspond to a first target RNA molecule and the cellular sample harbors a second plurality of DNA amplicons that correspond to a second target RNA molecule.
  • the cellular sample can comprise a first target RNA molecule.
  • the cellular sample can comprise at least a first plurality of DNA amplicons that can correspond to a first target RNA molecule.
  • the cellular sample can comprise a second target RNA molecule.
  • the cellular sample can comprise at least a second plurality of DNA amplicons that can correspond to a second target RNA.
  • the cellular sample can be deposited on a solid support.
  • compositions as disclosed herein may comprise one or more of (i) a plurality of reverse transcription primers, (ii) a plurality of reverse transcriptase enzymes, and (iii) a plurality of nucleotides, under a condition suitable for conducting a reverse transcription reaction to generate a plurality of cDNA molecules (e.g., a plurality of first strand cDNA molecules) in the cellular sample (e.g., FIG. 1).
  • the plurality of reverse transcription primers comprises a first sub -population of target-specific reverse transcription primers that hybridize selectively to the first target RNA, and comprises a second sub -population of target-specific reverse transcription primers that hybridize selectively to the second target RNA.
  • the plurality of reverse transcription primers comprises a first sub -population of random-sequence reverse transcription primers that hybridize to the first target RNA, and comprises a second sub-population of random-sequence reverse transcription primers that hybridize to the second target RNA.
  • compositions as disclosed herein may comprise a plurality of cDNA molecules.
  • the cDNA molecules can include at least a first target cDNA molecule that can corresponds to the first target RNA molecule, and the plurality of cDNA molecules can include a second target cDNA molecule that can correspond to the second target RNA molecule.
  • the plurality of cDNA molecules can include at least a first target cDNA molecule.
  • the first target cDNA molecule can correspond to the first target RNA molecule.
  • the plurality of cDNA molecules can include at least a second target cDNA molecule.
  • the second target cDNA molecule can correspond to the second target RNA molecule.
  • the plurality of cDNA molecules can be made by contacting the plurality of RNA inside the cellular sample with (i) a plurality of reverse transcription primers, (ii) a plurality of reverse transcriptase enzymes, and (iii) a plurality of nucleotides, under a condition suitable for conducting a reverse transcription reaction to generate the plurality of cDNA molecules (e.g., a plurality of first strand cDNA molecules) in the cellular sample (e.g., FIG. 1).
  • the generating of (b) can comprise contacting the plurality of RNA inside the cellular sample with a plurality of reverse transcription primers.
  • the generating of (b) can comprise contacting the plurality of RNA inside the cellular sample with a plurality of reverse transcription enzymes. In some embodiments, the generating of (b) can comprise contacting the plurality of RNA inside the cellular sample with a plurality of nucleotides.
  • the plurality of reverse transcription primers can comprises a first sub -population of target-specific reverse transcription primers that can hybridize selectively to the first target RNA, and can comprises a second sub-population of target-specific reverse transcription primers that can hybridize selectively to the second target RNA. In some embodiments, the plurality of reverse transcription primers can comprise a first sub -population of target-specific reverse transcription primers.
  • the first sub -population of target-specific reverse transcription primers can hybridize selectively to the first target RNA.
  • the plurality of reverse transcription primers can comprise a second sub -population of targetspecific reverse transcription primers.
  • the second sub -population of target-specific reverse transcription primers can hybridize selectively to the second target RNA.
  • the plurality of reverse transcription primers can comprise a first subpopulation of random -sequence reverse transcription primers that can hybridize to the first target RNA, and can comprise a second sub -population of random-sequence reverse transcription primers that can hybridize to the second target RNA.
  • the plurality of reverse transcription primers can comprise a first sub -population of random-sequence reverse transcription primers. In some embodiments, the first sub -population of random-sequence reverse transcription primers can hybridize to the first target RNA. In some embodiments, the plurality of reverse transcription primers can comprise a second sub -population of randomsequence reverse transcription primers. In some embodiments, the second sub -population of random-sequence reverse transcription primers can hybridize to the second target RNA.
  • compositions as disclosed herein may further comprise a plurality of target-specific padlock probes
  • the plurality oftarget-specific padlock probes may comprise at least a first plurality of target-specific padlock probes and a second plurality of target-specific padlock probes.
  • the plurality of target-specific padlock probes can include at least a first plurality of target-specific padlock probes.
  • the plurality of targetspecific padlock probes can include at least a second plurality of target-specific padlock probes.
  • the method comprises contacting the plurality of cDNA molecule in the cellular sample with atleast2-10,000 different target-specific padlock probes.
  • individual padlockprobes in the plurality of first target -specific padlock probes comprise a firstand second end (e.g., first and second padlock binding arms), wherein the first end selectively hybridizes to a first region of the first target cDNA molecule and the second end selectively hybridizes to a second region of the first target cDNA molecule.
  • the first and second ends of the first target-specific padlock probes can be hybridized to proximal positions on the first target cDNA molecule to form a circularized first target-specific padlock probe having a nick or gap between the hybridized first and second ends (e.g., FIG. 1).
  • the first target-specific padlock probe comprises a first target barcode sequence that corresponds to the first target cDNA sequence. In some embodiments, the first target-specific padlock probe comprises a first target barcode sequence that is located adjacent to one of the regions of the first target-specific padlock probe that selectively hybridizes to the first target cDNA molecule. In some embodiments, the first targetspecific padlock probe comprises at least one universal adaptor sequence, such as for example a universal sequencing primer binding site (or a complementary sequence thereof). In some embodiments, the first target-specific padlock probe comprises a universal primer binding site for a rolling circle amplification primer (or a complementary sequence thereof). In some embodiments, the first target-specific padlock probe comprises a universal compaction oligonucleotide binding site (or a complementary sequence thereof).
  • individual padlock probes in the plurality of second targetspecific padlock probes comprise a first and second end, wherein the first end selectively hybridizes to a first region of the second target cDNA molecule and the second end selectively hybridizes to a second region of the second target cDNA molecule.
  • the first and second ends of the second target-specific padlock probes can be hybridized to proximal positions on the second target cDNA molecule to form a circularized second target-specific padlock probe having a nick or gap between the hybridized first and second ends.
  • the second target-specific padlock probe comprises a second target barcode sequence that corresponds to the second target cDNA sequence.
  • the second target-specific padlock probe comprises a second target barcode sequence that is located adjacent to one of the regions of the second target-specific padlock probethat selectively hybridizes to the secondtarget cDNA molecule.
  • the second targetspecific padlock probe at least one universal adaptor sequence, such as for example a comprises at least one universal sequencing primer binding site (or a complementary sequence thereof).
  • the second target-specific padlock probe comprises a universal primer binding site for a rolling circle amplification primer (or a complementary sequence thereof).
  • the second target-specific padlock probe comprises a universal compaction oligonucleotide binding site (or a complementary sequence thereof).
  • the target-specific padlock probes comprise a universal sequencing primer binding site and a target barcode sequence that are adjacent to each other so that the target barcode region of the concatemer is sequenced first.
  • the target barcode sequence can be any length, for example 3 -15 bases, or 15-25 bases, or 25-40 bases, or longer.
  • compositions a disclosed herein may further comprise one or more components or reagents configured to close a nick or gap in a circularized target-specific padlock probe.
  • the components or reagent may be configured to conduct an enzymatic reaction, thereby generating a covalently closed circular padlock probe.
  • the closing the nick can comprise conducting an enzymatic ligation reaction.
  • closing the gap can comprise conducting a polymerase-catalyzed fill-in reaction using the first or second target cDNA molecule as a template, and conducting an enzymatic ligation reaction. In some embodiments, closing the gap can comprise conducting a polymerase-catalyzed fill-in reaction using the first target cDNA molecule as a template. In some embodiments, closing the gap can comprise conducting a polymerase-catalyzed fill-in reaction using the second target cDNA molecule as a template. In some embodiments, the method can comprise closing the nick or gap in at least2-10,000 circularized target-specific padlock probes by conducting an enzymatic reaction, thereby generating at least 2-10,000 covalently closed circular padlock probes inside the cellular sample.
  • compositions as disclosed herein may further comprise one or more components or reagents for conducting a rolling circle amplification reaction inside a cellular sample.
  • the first and second covalently closed circular padlock probes can be used as template molecules, thereby generating a plurality of concatemer molecules including at least a first concatemer molecule that corresponds to a first target RNA molecule, and the plurality of concatemer molecules can include at least a second concatemer molecule that corresponds to a second target RNA molecule.
  • a rolling circle amplification reaction can be conducted inside the cellular sample using the first covalently closed circular padlock probe as a template molecule.
  • the rolling circle amplification of the first covalently closed circular padlock probe can generate a plurality of concatemer molecule.
  • the plurality of concatemer molecule can include a first concatemer molecule.
  • the first concatemer molecule can correspond to a first target RNA molecule.
  • a rolling circle amplification reaction can be conducted inside the cellular sample using the second covalently closed circular padlockprobe as a template molecule.
  • the rolling circle amplification of the second covalently closed circular padlock probe can generate a plurality of concatemer molecules.
  • the plurality of concatemer molecule can include a second concatemer molecule.
  • the second concatemer molecule can correspond to a second target RNA molecule.
  • the first concatemer molecule can comprise tandem repeat units, wherein a unit can comprise the sequence of the first target cDNA and the universal sequencing primer binding site (or a complementary sequence thereof).
  • the second concatemer molecule can comprises tandem repeat units, wherein a unit can comprise the sequence of the second target cDNA and the universal sequencing primer binding site (or a complementary sequence thereof).
  • a rolling circle amplification reaction canbe conducted inside the cellular sample using at least 2-10,000 covalently closed circular padlock probes as template molecules, thereby generating at least 2-10,000 concatemer molecules that correspond to at least 2-10,000 target RNA molecules.
  • each compaction oligonucleotide can comprise a single stranded oligonucleotide.
  • the single stranded oligonucleotide can have a first region at one end and a second region at the other end.
  • the first region can hybridize to a portion of the concatemer molecule and the second region can hybridize to another portion of the concatemer molecule.
  • the hybridization of the first region and the second region of the compaction oligonucleotide can compact the concatemer molecule.
  • the compaction oligonucleotide can compact the size of the concatemer molecule. In some embodiments, the compaction oligonucleotide can compact the shape of the concatemer molecule. In some embodiments, the compaction oligonucleotide can compact the size and shape of the concatemer molecule. In some embodiments, the compaction oligonucleotide can compact the concatemer molecule to form a compact nanoball.
  • compositions as disclosed herein may further comprise one or more reagents or components for sequencingthe plurality of concatemer molecules insidethe cellular sample.
  • the sequencing may comprise sequencin the first concatemer molecule by conducting no more than 2-30 sequencing cycles to generate a plurality of first sequencing read products, and sequencingthe second concatemer molecule by conducting no more than 2-30 sequencing cycles to generate a plurality of second sequencing read products.
  • sequencing the plurality of concatemer molecules may be conducted inside the cellular sample.
  • sequencing the plurality of concatemer molecules inside the cellular sample can comprise sequencing the first concatemer molecule by conducting no more than 2-30 sequencing cycles to generate a plurality of first sequencing read products.
  • sequencing can comprise sequencingthe plurality of concatemer molecules inside the cellular sample.
  • sequencingthe plurality of concatemer molecules inside the cellular sample can comprise sequencingthe second concatemer molecule by conducting no more than 2-30 sequencing cycles to generate a plurality of second sequencing read products.
  • the sequencing of can comprise sequencing no more than 2-30 bases of the first concatemer molecules to generate a plurality of first sequencing read products, and which can comprise sequencing no more than 2-30 bases of the second concatemer molecules to generate a plurality of second sequencing read products.
  • the sequencing of (f) can comprise sequencing no more than 2-30 bases of the first concatemer molecules to generate a plurality of first sequencing read products.
  • the sequencing of (f) can comprise sequencing no more than 2 -30 bases of the second concatemer molecules to generate a plurality of second sequencing read products.
  • the method can comprise sequencing the at least 2-10,000 concatemer molecules inside the cellular sample, which can comprise conducting no more than 2-30 sequencing cycles on the 2-10,000 concatemer moleculesto generate a plurality of sequencing read products.
  • only the first target barcode region of the first concatemer molecules are sequenced (e.g., FIG. 2).
  • at least a portion or the full length of the first target barcode of the first concatemer molecules are sequenced (e.g., FIG. 2).
  • the first target barcode is sequenced and a portion of the first cDNA region of the first concatemer molecules are sequenced (e g., FIG 3).
  • at least a portion of the first cDNA region of the first concatemer molecules are sequenced (e.g., FIG. 4 or 5).
  • only the second target barcode region of the second concatemer molecules are sequenced (e.g., FIG. 2).
  • at least a portion or the full length of the second target barcode of the second concatemer molecules are sequenced (e.g., FIG. 2).
  • the second target barcode is sequenced and a portion of the second cDNA region of the second concatemer molecules are sequenced (e.g., FIG. 3).
  • at least a portion of the second cDNA region of the second concatemer molecules are sequenced (e g., FIG. 4 or 5)
  • the reiterative sequencing can be conducting using a sequencing- by-binding procedure, labeled and/or non-labeled chain-terminating nucleotides, or multivalent molecules. Descriptions of these three sequencing compositions is described below.
  • the plurality ofuniversal sequencing primers can be hybridizedto concatemer template molecules with a hybridization reagent comprising an SSC buffer(e.g., 2X saline-sodium citrate) buffer with formamide (e.g., 10-20% formamide).
  • SSC buffer e.g., 2X saline-sodium citrate
  • formamide e.g., 10-20% formamide.
  • the hybridization conditions comprise a temperature of about20-30°C, for about 10-60 minutes.
  • the plurality of sequencing read products can be removed from the concatemers and the plurality of concatemers can be retained inside the cellular sample using a de-hybridization reagent comprising an SSC buffer (e.g., saline-sodium citrate) buffer, with or without formamide, at a temperature that promotes nucleic acid denaturation such as for example 30 - 90°C.
  • SSC buffer e.g., saline-sodium citrate
  • the plurality of nucleotide reagents comprise a plurality of nucleotides that are detectably labeled or non-labeled.
  • individual nucleotides are linked to a detectable reporter moiety.
  • the detectable reporter moiety comprises a fluorop hore.
  • the plurality of detectably labeled nucleotide analogs comprise a plurality of chain terminating nucleotides, where the chain terminating moiety is linked to the 3 ’ nucleotide sugar position to form a 3’ blocked nucleotide analog.
  • the chain terminating moiety can be removed to convert the 3 ’ blocked nucleotide analog to an extendible nucleotide having a 3 ’ OH group on the sugar.
  • the labeled nucleotide analogs are linked to a different fluorophore that corresponds to the nucleobases adenine, cytosine, guanine, thymine or uracil, where the different fluorophores emit a fluorescent signal during the sequencing of step (f).
  • a sequencing cycle comprises (1) contactingthe concatemer/sequencing primer duplex with a sequencing polymerase and a detectably labeled chain terminating nucleotide under a condition suitable for polymerase -catalyzed incorporation of the detectably labeled chain terminating nucleotide into the terminal end of the sequencing primer, (2) detecting and imaging the fluorescent signal and color emitted by the incorporated chain terminating nucleotide, and (3) removing the chain terminating moiety (e.g., unblocking) and retaining the concatemer/sequencing primer duplex.
  • no more than 2-30 sequencing cycles can be conducted on the plurality of concatemers inside the cellular sample to generate a plurality of first sequencing read products and a plurality of second sequencing read products.
  • out-of-sync phasing and/or pre-phasing events can occur during synchronized sequencing reactions on clonally amplified template amplicons, where the sequencing reactions comprise polymerase-catalyzed sequencing reactions employing detectably labeled chain terminator nucleotides.
  • a sequencing reaction on one template molecule in the clonally-amplified template molecules moves ahead (e.g., pre-phasing) or fall behind (e.g., phasing) of the sequencing of the other template molecules within the clonally-amplified template molecules.
  • a fluorescent signal is typically detected which corresponds to incorporation of a labeled chain terminator nucleotide.
  • phasing and pre-phasing events can be detected and monitored using incorporation of a labeled chain terminator nucleotide.
  • the plurality of nucleotide reagents of step (f) comprise a plurality of multivalent molecules each comprising a core attached to a plurality of nucleotide -arms, wherein the nucleotide-arms are attached to a nucleotide unit.
  • individual multivalent molecules are labeled with a detectab ly reporter moiety.
  • the detectable reporter moiety comprises a fluorophore.
  • the core of the multivalent molecule is labeled with a fluorophore, and wherein the fluorophore which is attached to a given core of the multivalent molecule corresponds to the nucleotide base (e.g., adenine, guanine, cytosine, thymine or uracil) of the nucleotide arm.
  • a fluorophore which is attached to a given core of the multivalent molecule corresponds to the nucleotide base (e.g., adenine, guanine, cytosine, thymine or uracil) of the nucleotide arm.
  • At least one of the nucleotide arms of the multivalent molecule comprises a linker and/or nucleotide base that is attached to a fluorophore, and wherein the fluorophore which is attached to a given nucleotide base corresponds to the nucleotide base (e.g., adenine, guanine, cytosine, thymine or uracil) of the nucleotide arm.
  • a sequencing cycle can comprise contacting the concatemer/sequencing primer duplex with a first sequencing polymerase to form a complexed polymerase.
  • a sequencing cycle can comprise contacting the complexed polymerase with a detectably labeled multivalent molecule under a condition suitable for binding a complementary nucleotide unit of the multivalent molecule to the complexed polymerase thereby forming a multivalent-binding complex, and the condition is suitable for inhibiting incorporation of the complementary nucleotide unit into the terminal end of the sequencing primer.
  • the present disclosure provides compositions for conducting in situ multiplex and multi- omics detection and identification using coded padlocks probes.
  • the padlock probes are designed to selectively detect target RNA or polypeptides.
  • the padlock probe may comprise one or more batch-specific primer-binding sites, which are specific to a primer.
  • the primer may be spatially localized, and the padlock probe may therefore not require a barcode sequence. In some embodiments, however, the padlock probe may comprise a barcode sequence.
  • the RNA-specific padlock probes selectively hybridize to cDNA that corresponds to target RNA.
  • the RNA-specific probes carry barcodes thatuniquely identify the cDNA.
  • the RNA-specific padlock probes also carry batch -specific sequencing primer binding sites.
  • the padlock probe may comprise one or more batch-specific primer-binding sites, which are specific to a primer.
  • the primer may be spatially localized, and the padlock probe may therefore not require a barcode sequence. In some embodiments, however, the padlock probe may comprise a barcode sequence.
  • the target polypeptides are detected using antibody -oligonucleotide conjugates and polypeptide-specificpadlock probes which selectively hybridize to the oligonucleotide which is conjugated to the antibody.
  • the polypeptide-specific padlock probes carry barcodes that uniquely identify the antibody that selectively binds a target polypeptides.
  • the polypeptide- specific padlock probes also carry a batch -specific sequencing primer which is the same batchspecific sequencing primer carried by a corresponding RNA-specific padlock probe to enable simultaneous sequencing and detection of a target RNA and the polypeptide encoded by that target RNA.
  • the padlock probes enable simultaneous detection and identification of RNA and their encoded polypeptides.
  • the present disclosure provides compositions for conducting in situ analysis of a nucleic acid.
  • the analysis may comprise a sequencing of the nucleic acid.
  • the nucleic acid may be an RNA.
  • the analysis of the RNA may comprise reverse transcribing the RNA into a complementary DNA (cDNA).
  • cDNA complementary DNA
  • a reverse transcriptase may be used to transcribe the RNA into a cDNA.
  • compositions as described herein may further comprise a reverse transcriptase.
  • the reverse transcriptase enzyme can exhibit RNA-depen dent DNA polymerase activity.
  • the reverse transcriptase enzyme can comprise a reverse transcriptase enzyme from AMV (avian myeloblastosis virus), M-MuLV (moloney murine leukemia virus), or HIV (human immunodeficiency virus).
  • the reverse transcriptase enzyme can comprise a recombinant enzyme that exhibits reduced RNase H activity, for example REVERTAID (e.g., from Thermo Fisher Scientific, catalog No. EP0441).
  • the reverse transcriptase canbe a commercially -available enzyme, including MULTISCRIBE (e.g., from Thermo Fisher Scientific, catalog# 4311235), THERMOSCRIPT (e.g., from Thermo Fisher Scientific, catalog# 12236-014), or ARRAYSCRIPT (e.g., from Ambion, catalogNo. AM2048).
  • the reverse transcriptase enzyme can comprise SUPERSCRIPT II (e.g., catalogNo. 18064014), SUPERSCRIPT III (e.g., catalogNo 18080044), or SUPERSCRIPT IV enzymes (e g., catalog No. 18090010) (all SUPERSCRIPT enzymes from Invitrogen).
  • the reverse transcription reaction can include an RNase inhibitor.
  • the reverse transcription primers comprise a single -stranded oligonucleotide comprising DNA, RNA, or chimeric DNA/RNA.
  • the reverse transcription primers Any combination of adenine (A), thymine (T), guanine (G), cytosine (C), uracil (L..1) and/or inosine (I).
  • the reverse transcription primers can be any length, for example 5 -25 bases, or 25-50 bases, or 50-75 bases, or 75-100 bases in length or longer.
  • the reverse transcription primers each comprise a 5’ end and 3’ end.
  • the 3’ end of the reverse transcription primerscan include a 3’ OH moiety which serves as a nucleotide polymerization initiation site in a polymerase -catalyzed primer extension reaction.
  • the 3 ’ end of the reverse transcription primers have a chain terminating moiety which blocks a polym erase-catalyzed primer extension reaction. The chain terminating moiety can be removed to convert the 3’ sugar position to an extendible 3 ’OH.
  • the reverse transcription primers are modified to confer resistance to nuclease degradation (e.g., ribonuclease degradation).
  • the reverse transcription primers comprise atleast one phosphorothioate diester bond at their 5’ ends which can render the reverse transcription primers resistant to nuclease degradation.
  • the reverse transcription primers comprise 2-5 or more consecutive phosphorothioate diester bonds at their 5’ ends.
  • the plurality of reverse transcription primers comprise at least one ribonucleotide and/or at least one 2’-O-methyl, 2 ’-O-m ethoxy ethyl (MOE), 2’ fluorobase nucleotide.
  • the reverse transcription primers comprise phosphorylated 3 ’ ends. In some embodiments, the reverse transcription primers comprise locked nucleic acid (LNA) bases. In some embodiments, the reverse transcription primers comprise a phosphorylated 5’ end (e.g., using a polynucleotide kinase).
  • LNA locked nucleic acid
  • the entire length of a reverse transcription primer can hybridize to a portion of an RNA molecule.
  • individual reverse transcription primers comprise a 3’ region having a sequence that hybridizes to a portion of an RNA molecule and a 5 ’ region that carries a tail that does not hybridize to an RNA molecule.
  • the 5’ tail comprises a universal adaptor sequence including any one or any combination of two or more of a sample barcode sequence, an amplification primer binding site, a sequencing primer binding site, a compaction oligonucleotide binding site and/or a surface capture primer binding site.
  • the 5’ tail comprises a unique identification sequence (e.g., unique molecular index (UMI).
  • the 5 ’ tail comprises a restriction enzyme recognition sequence
  • individual reverse transcription primers comprise at least a portion of the 3 ’ region having a homopolymer sequence, for example poly -A, poly-T, poly-C, poly-G or poly-U.
  • the reverse transcription primers can hybridize to any portion of an RNA molecule, including the 5 ’ or the 3 ’ end of the RNA molecule, or an internal portion of the RNA molecule.
  • the plurality of reverse transcription primers comprises a first sub -population of target-specific reverse transcription primers that hybridize selectively to the first target RNA (e.g., targeted transcriptomics). In some embodiments, the plurality of reverse transcription primers further comprise a second sub -population of target-specific reverse transcription primers that hybridize selectively to the second target RNA. In some embodiments, the target-specific reverse transcription primers comprise a pre-determined sequence at the 3’ region which hybridizes to a target RNA molecule. In some embodiments, the pre -determined sequence portion of the reverse transcription primers canbe 4-20bases, or20-40bases, or40-50 bases in length.
  • the first sub -population of target-specific reverse transcription primers can selectively hybridize to an RNA transcribed in the cellular sample by a housekeeping gene.
  • selection of the housekeeping gene may be dependent upon the type of cellular sample to be used for the in situ compositions described herein.
  • Non-limiting example housekeeping genes include glyceraldehyde-3 -phosphate dehydrogenase (GAPDH), beta-actins (ACTB), tubulins, PPIA (peptidyl-prolyl cis-trans isomerase), NME4 (NME/NM23 nucleoside diphosphate kinase 4), SMARCAL1 (SWI/SNF related matrix associated actin dependent regulator of chromatin, subfamily A like 1), and POMK (protein-O-mannose kinase).
  • GPDH glyceraldehyde-3 -phosphate dehydrogenase
  • ACTB beta-actins
  • tubulins tubulins
  • PPIA peptidyl-prolyl cis-trans isomerase
  • NME4 NME/NM23 nucleoside diphosphate kinase 4
  • SMARCAL1 SWI/SNF related matrix associated actin dependent regulator of chromatin, subfamily A like 1
  • the second sub-population of target-specific reverse transcription primers can selectively hybridize to an RNA transcribed from a gene that is expressed in the cellular sample being examined (e g., a cell-specific or tissue-specific RNA).
  • the plurality of reverse transcription primers comprises a first sub -population of random-sequence reverse transcription primers that hybridize to the first target RNA (e.g., whole transcriptomics). In some embodiments, the plurality of reverse transcription primers further comprises a second sub-population ofrandom-sequence reverse transcription primers that hybridize to the second target RNA. In some embodiments, the reverse transcription primers comprise a random and/or degenerate sequence at the 3 ’ region which hybridizes to an RNA molecule. In some embodiments, the random -sequence or the degenerate-sequence portion of the reverse transcription primers canbe 4 -20 bases, or 20-40 bases, or 40-50 bases in length. Rolling Circle Amplification (RCA)
  • compositions as disclosed herein may comprise a plurality of concatemer molecules.
  • the plurality of concatemer molecules maybe generated via RCA.
  • the plurality of concatemer molecules each having two or more tandem copies of a unit wherein the unit comprises a target sequence that corresponds to a target RNA molecules and any additional sequence(s) carried by the padlock probes including universal adaptor sequence(s), unique molecular index sequence(s) and/or restriction enzyme recognition sequence(s).
  • a unit can comprise a target sequence that can correspond to a target RNA molecule.
  • a unit can comprise additional sequence(s) carried by the padlock probes.
  • the additional sequences can include universal adaptor sequence(s), unique molecular index sequence(s), restriction enzyme recognition sequence(s), or any combination thereof.
  • the rolling circle amplification reaction includes a plurality of nucleotide which includes dUTP
  • the resulting concatemer can be cross-linked to a crosslinking reactive group by treating the cellular sample with a succininuide ester (NHS), maleimide (Sulfo-SMCC), imidoester (DMP), carbodiimide (DCC, EDC) or phenyl azide
  • polymerization of the cross-linking reactive group can be initiated with light or UV light
  • the resulting concatemer can be cross-linked to a matrix by treating the cellular sample with a cross-linked agarose, cross-linked dextran or cross-linked polyethylene glycol (PEG), polyacrylamide, cellulose alginate or polyamide.
  • the PEG comprises a sulfo-NHS ester moiety atone or both ends, for example a PEGylated bis(sulfosuccinimidyl)suberate) (e.g., BS(PEG)9 from Thermo Fisher Scientific, catalogNo. 21582).
  • a PEGylated bis(sulfosuccinimidyl)suberate e.g., BS(PEG)9 from Thermo Fisher Scientific, catalogNo. 21582
  • the rolling circle amplification reaction canbe conducted ata constant temperature (e.g., isothermal) wherein the constant temperature is at room temperature to about 30°C, or about 30 - 40°C, or about 40 - 50°C, or about 50 - 65 °C.
  • a constant temperature e.g., isothermal
  • the DNA polymerase having a strand displacing activity can be selected from a group consisting of phi29 DNA polymerase, large fragment of Bst DNA polymerase, large fragment of Bsu DNA polymerase, and Bea (exo-) DNA polymerase, Klenow fragment of E. coli DNA polymerase, T5 polymerase, M-MuLV reverse transcriptase, HIV viral reverse transcriptase, or Deep Vent DNA polymerase.
  • the phi29DNA polymerase can be wild type phi29 DNA polymerase (e.g., MagniPhi from Expedeon), or variant EquiPhi29 DNA polymerase (e.g., from Thermo Fisher Scientific), and chimeric QualiPhi DNA polymerase (e.g., from 4basebio).
  • wild type phi29 DNA polymerase e.g., MagniPhi from Expedeon
  • EquiPhi29 DNA polymerase e.g., from Thermo Fisher Scientific
  • chimeric QualiPhi DNA polymerase e.g., from 4basebio
  • the rolling circle amplification primers comprise at least one phosphorothioate diester bond at their 5’ ends which can render the amplification primers resistantto exonuclease degradation. In some embodiments, the rolling circle amplification primers comprise 2-5 or more consecutive phosphorothioate diester bonds at their 5’ ends. In some embodiments, the rolling circle amplification primers comprise at least one ribonucleotide and/or at least one 2’-O-methyl or 2’ -O-meth oxyethyl (MOE) nucleotide.
  • MOE 2’-O-methyl or 2’ -O-meth oxyethyl
  • the composition further comprises a plurality of compaction oligonucleotides which, when hybridized to a concatemer molecule, can compact the size, the shape, or a combination of the size and the shape of the concatemer to form a compact nanoball.
  • rolling circle amplification reaction canbe conducted in the presence of one or more compaction oligonucleotides.
  • the compaction oligonucleotide may include a 5’ region, an optional internal region (intervening region), a 3 ’ region, or any combination thereof.
  • a 5 ’ and 3 ’ region of the compaction oligonucleotide can hybridize to different portions of the concatemer to pull together distal portions of the concatemer causing compaction of the concatemer to form a DNA nanoball.
  • the 5’ region of the compaction oligonucleotide can be designed to hybridize to a first portion of the concatemer molecule (e.g., a universal compaction oligonucleotide binding site), and the 3 ’ region of the compaction oligonucleotide is designed to hybridized to a second portion of the concatemer molecule (e.g., a universal compaction oligonucleotide binding site).
  • Inclusion of compaction oligonucleotides during RCA can promote formation of DNA nanoballs having tighter size and shape compared to concatemers generated in the absence of the compaction oligonucleotides.
  • the compaction oligonucleotides comprise single stranded oligonucleotides comprising DNA, RNA, or a combination of DNA and RNA.
  • the compaction oligonucleotides can be any length, including 20-150 nucleotides, or 30-100 nucleotides, or 40 80 nucleotides in length.
  • the compaction oligonucleotide comprises a 5’ region and a 3’ region, and optionally an intervening region between the 5’ and 3’ regions.
  • the intervening region can be any length, for example about 2 -20 nucleotides in length.
  • the intervening region comprises a homopolymer having consecutive identical bases (e.g., AAA, GGG, CCC, TTT or UUU).
  • the intervening region comprises a non-homopolymer sequence.
  • the 5’ region of the compaction oligonucleotide can be wholly complementary or partially complementary along its length to a first portion of a concatemer molecule.
  • the 5’ region of the compaction oligonucleotides can be wholly complementary along its length to a first portion of a concatemer molecule.
  • the 5 ’ region of the compaction oligonucleotides can be partially complementary along its length to a first portion of a concatemer molecule.
  • the 3 ’ region of the compaction oligonucleotides can be wholly complementary or partially complementary along its length to a second portion of a concatemer molecule.
  • the 3’ region of the compaction oligonucleotides can be wholly complementary along its length to a second portion of a concatemer molecule.
  • the 3’ region of the compaction oligonucleotides canbe partially complementary along its length to a second portion of a concatemer molecule.
  • the 5 ’ region of the compaction oligonucleotides can hybridize to a first universal sequence portion of a concatemer molecule.
  • the 3 ’ region of the compaction oligonucleotides can hybridize to a second universal sequence portion of a concatemer molecule. .
  • the 5’ region of the compaction oligonucleotide canhavethe same sequence as the 3 ’ region.
  • the 5’ region of the compaction oligonucleotide can have a sequence that is different from the 3 ’ region.
  • the 3 ’ region of the compaction oligonucleotide can have a sequence that is a reverse sequence of the 5 ’ region.
  • the 5’ region of the compaction oligonucleotide can have a sequence that is a reverse sequence of the 3 ’ region.
  • the 3’ region of any of the compaction oligonucleotides can include an additional three bases at the terminal 3’ end which comprises 2’ -O-methyl RNA bases (e.g., designated mUmUmU) or the terminal 3’ endlacks additional 2’-O-methyl RNA bases.
  • 2’ -O-methyl RNA bases e.g., designated mUmUmU
  • the compaction oligonucleotides comprise one or more modified bases or linkages at their 5’ or 3’ ends to confer certain functionalities. In some embodiments, the compaction oligonucleotides comprise at least one phosphorothioate linkages at their 5’ and/or 3 ’ ends to confer exonuclease resistance. In some embodiments, at least one nucleotide at or near the 3’ end comprises a 2’ fluoro base which confers exonuclease resistance. In some embodiments, the 3’ end of the compaction oligonucleotides comprise at least one 2’ -O-methyl RNA base which blocks polymerase-catalyzed extension.
  • the 3 ’ end of the compaction oligonucleotide comprises threebases comprising 2 ’-O-methyl RNAbase (e.g., designated mUmUmU).
  • the compaction oligonucleotides comprise a 3 ’ inverted dT at their 3’ ends which blocks polymerase-catalyzed extension.
  • the compaction oligonucleotides comprise 3’ phosphorylation which blocks polymerase-catalyzed extension.
  • the internal region of the compaction oligonucleotides comprise at least one locked nucleic acid (LNA) which increases the thermal stability of duplexes formed by hybridizing a compaction oligonucleotide to a concatemer molecule.
  • LNA locked nucleic acid
  • the compaction oligonucleotides comprise a phosphorylated 5’ end (e g., using a polynucleotide kinase).
  • the compaction oligonucleotide comprises the sequence s’ - CATGTAATGCACGTACTTTCAGGGTAAACATGTAATGCACGTACTTTCAGGGT-3’ (SEQ ID NO: 14).
  • the compaction oligonucleotides includes an additional three bases at the terminal 3’ end which comprises 2’ -O-methyl RNA bases (e.g., designated mUmUmU) orthe terminal 3’ end lacks additional 2’-O-methyl RNAbases.
  • the compaction oligonucleotides can include at least one region having consecutive guanines.
  • the compaction oligonucleotides can include at least one region having 2, 3, 4, 5, 6 or more consecutive guanines.
  • the compaction oligonucleotides comprise four consecutive guanines which can form a guanine tetrad structure (see FIG. 18).
  • the guanine tetrad structure can be stabilized via Hoogsteen hydrogen bonding.
  • the guanine tetrad structure can be stabilized by a central cation including potassium, sodium, lithium, rubidium or cesium.
  • At least one compaction oligonucleotide can form a guanine tetrad ( Figure 18) and hybridize to the universal binding sequences in a concatemer which can cause the concatemer to fold to form an intramolecular G-quadruplex structure ( Figure 19).
  • the concatemers can selfcollapse to form compact nanoballs. Formation of the guanine tetrads and G-quadruplexes in the nanoballs may increase the stability of the nanoballs to retain their compact size and shape which can withstand changes in pH, temperature and/or repeated flows of reagents during sequencing inside the cellular sample.
  • the plurality of compaction oligonucleotides in the rolling circle amplification reaction have the same sequence.
  • the plurality of compaction oligonucleotides in the rolling circle amplification reaction comprise a mixture of two or more different populations of compaction oligonucleotides having different sequences.
  • the immobilized concatemer template molecule can self-collapse into a compact nucleic acid nanoball.
  • the nanoballs can be imaged and a FWHM measurement can be obtained to give the shape/size of the nanoballs.
  • inclusion of compaction oligonucleotides in the rolling circle amplification reaction can promote collapsing of a concatemer into a DNA nanoball.
  • Conducting RCA with compaction oligonucleotides helps retain the compact size and shape of a DNA nanoball during multiple sequencing cycles which can improve FWHM (full width half maximum) of a spot image of the DNA nanoball inside a cellular sample.
  • the DNA nanoball does notunravel during multiple sequencing cycles.
  • the spot image of the DNA nanoball does not enlarge during multiple sequencing cycles.
  • the spot image of the DNA nanoball remains a discrete spot during multiple sequencing cycles.
  • the spot image can be represented as a Gaussian sp ot and the size can be measured as a FWHM.
  • a smaller spot size as indicated by a smaller FWHM typically correlates with an improved image of the spot.
  • the FWHM of a nanoball spot can be about 10 um or smaller.
  • the present disclosure provides a protein -binding composition.
  • the protein-binding composition may be specific to a target polypeptide or protein corresponding to a target RNA.
  • the target RNA may then be sequenced.
  • the protein -binding composition may increase the specificity of the methods of sequencing as disclosed herein.
  • protein-binding compositions as disclosed herein may comprise an antibody and an oligonucleotide tag.
  • the antibody and oligonucleotide tag may be joined via a linker moiety .
  • the antibody may comprise a plurality of oligonucleotide tags linked via a plurality of linker moieties.
  • the antibody may be specific to a target protein or polypeptide corresponding to a target RNA.
  • the oligonucleotide tag may correspond to a sequence of a padlock probe as disclosed herein. The successful binding of the oligonucleotide tag to a padlock probe as disclosed herein may lock the topological configuration of the padlock probe.
  • the oligonucleotide tag sequence is designed to provide a sequencing read-out that is associated with binding between the antibody-oligonucleotide conjugate andthe target polypeptide.
  • the sequence of the oligonucleotide tag is designed to exhibit minimal hybridization to RNA (e.g., target RNA) in the cellular sample.
  • the present disclosure provides one or more sets of antibody-oligonucleotide conjugates, comprising at least a first and a second antibody-oligonucleotide conjugate.
  • a set of antibody- oligonucleotide conjugates can be used to conduct any of the detecting and/or sequencing methods described herein.
  • a set comprises at least a first antibody-oligonucleotide conjugate comprising a first antibody which selectively binds a first target polypeptide
  • the first antibody is linked to a first oligonucleotide carrying an oligonucleotide tag sequence which uniquely identifies the first antibody which selectively binds the first target polypeptide.
  • the set comprises at least a second antibody-oligonucleotide conjugate comprising a second antibody which selectively binds a second target polypeptide.
  • the second antibody is linked to a second oligonucleotide carrying an oligonucleotide tag sequence which uniquely identifies the second antibody which selectively binds the second target polypeptide.
  • the first antibody-oligonucleotide conjugate andthe second antibody-oligonucleotide conjugate are the same (e g., FIG. 28). In some embodiments, the first antibody-oligonucleotide conjugate andthe second antibody-oligonucleotide conjugate are different (e.g., FIG. 29).
  • the first and second oligonucleotide tag sequences are designed to selectively bind to left and right binding arms of a padlock probe. In some embodiments, the first and second oligonucleotide tag sequences differ from each other.
  • a set of antibody-oligonucleotide conjugates comprises 2-20 antibody-oligonucleotide conjugates, wherein each antibody-oligonucleotide conjugate binds its respective target polypeptide.
  • the set comprises 20-100 antibody- oligonucleotide conjugates, or 100-500 antibody-oligonucleotide conjugates, or 500-1000 antibody-oligonucleotide conjugate, or more antibody-oligonucleotide conjugates.
  • the present disclosure provides antibody-oligonucleotide conjugates each comprising an antibody linked to an oligonucleotide.
  • the antibody comprises an intact immunoglobulin, antibody fragment, an antigen binding portion of an antibody, or single -chain antibody.
  • the antibodies can be monoclonal or polyclonal antibodies.
  • the antibodies are capable of binding specifically to a target analyte.
  • the target analyte includes polypeptides, polynucleotides, carbohydrates, saccharides and lipids.
  • target analytes comprise intact polypeptides or peptide fragments.
  • the antibodies comprises an antigen -binding region (e.g., paratope) thatbinds specifically to a target analyte.
  • An immunoglobulin is typically a tetrameric molecule comprising two identical pairs of polypeptide chains where each pair includes a light chain and a heavy chain.
  • the amino portion of the heavy and light chains each comprise a variable region which associate with each other to form an antigen binding region (e.g., paratope).
  • an antigen binding region e.g., paratope
  • the carboxyl portion of the heavy chain comprise a constant region which associate with each other to form an Fc region for effector function.
  • the Fc portion of the heavy chains can define the class of antibody which includes IgG, IgM, IgD, IgA or IgE isotype.
  • the heavy and/or light chains can be prepared using recombinant techniques or by immunizing an animal with an antigen of interest.
  • the antibody fragment generally comprises a portion of an intact immunoglobulin that can bind an antigen.
  • antibody fragments include but are not limited to Fv, Fab, Fab’, Fab’-SH, F(ab’)2, and Fd.
  • an Fv fragment comprises a variable light chain region (VL) and variable heavy chain region (VH).
  • an Fab fragment comprises a monovalent antibody fragment having a variable light chain region (VL), constant light chain region (CL), variable heavy chain region (VH), and first constant region (CHI).
  • VL variable light chain region
  • CL constant light chain region
  • VH variable heavy chain region
  • CHI first constant region
  • an Fab’ fragment comprises a monovalent antibody fragment having a variable light chain region (VL), constant light chain region (CL), variable heavy chain region (VH), first constant region (CHI), hinge region, and atleast a portion of a second constant region (CH2).
  • VL variable light chain region
  • CL constant light chain region
  • VH variable heavy chain region
  • CHI first constant region
  • hinge region atleast a portion of a second constant region (CH2).
  • an F(ab’)2 fragment comprises a bivalent antibody fragment having two Fab fragments linked via a disulfide bridge at the hinge region.
  • a single-chain antibody typically comprises a single polypeptide chain (e.g., a monovalent antibody molecule) having a variable light chain region (VL) and variable heavy chain region (VH)joined by a polypeptide linker (see, e.g., Bird etal., 1988, Science 242:423- 26 and Huston et al., 1988, Proc. Natl. Acad. Sci. USA 85 :5879-83).
  • the amino -terminal end of the single-chain antibody comprises either the variable light chain region (VL) or the variable heavy chain region (VH).
  • the single-chain antibody comprises an scFv- Fc antibody which further comprises an antibody hinge region, and at least a portion of the Fc region includingthe CH2 and/orthe CH3 region. In some embodiments, the single-chain antibody comprises an scFv-CH antibody which further comprises an antibody hinge region, and at least a portion of the CH3 region
  • the antibody-oligonucleotide conjugates comprise an antibody linked to an oligonucleotide by a linker moiety.
  • the linker moiety comprises streptavidin (or an avidin -like moiety), biotin, an amine group, or a disulfide group.
  • the linker moiety is not cleavable or not removable. In some embodiments, the linker moiety is cleavable or removable.
  • the linker moiety can be cleavable with light (e.g., UV light), chemically -cleav able (e.g., dithiothreitol), heat- cleavable, or enzymatically cleavable.
  • light e.g., UV light
  • chemically -cleav able e.g., dithiothreitol
  • heat- cleavable e.g., heat- cleavable
  • enzymatically cleavable e.g., UV light
  • antibody-oligonucleotide conjugates can be prepared by crosslinking amino groups on the antibody and oligonucleotide using glutaraldehyde.
  • the lysine side chain epsilon-amide is commonly targeted to conjugate to oligonucleotides.
  • maleimide-modified antibodies can be reacted with sulfhydryl-modified oligonucleotides.
  • heterobifunctional cross-linkers are introduced as bridges to link together the antibody and oligonucleotide.
  • the antibody-oligonucleotide conjugates comprise an antibody linked to an oligonucleotide.
  • the oligonucleotide comprises a nucleic acid comprising DN A, RNA, or chimeric DNA and RNA.
  • the oligonucleotide comprises canonical nucleotides or nucleotide analogs such as locked nucleic acids (LNA).
  • LNA locked nucleic acids
  • the length of the oligonucleotide can be 10-50 nucleotides in length, or 50- 100 nucleotides in length, or 100-200 nucleotides in length.
  • the oligonucleotide is designed to carry at least one tag sequence, where the tag sequence selectively binds to left and right binding arms of a padlock probe.
  • the oligonucleotide tag sequence also uniquely identifies the antibody to which it is conjugated, where the antibody selectively binds a target polypeptide.
  • the oligonucleotide tag sequence is designed to provide a sequencing read -out that is associated with binding between the antibody-oligonucleotide conjugate and the target polypeptide.
  • the sequence of the oligonucleotide tag is designed to exhibit minimal hybridization to RNA (e.g., target RNA) in the cellular sample.
  • a set of antibody-oligonucleotide conjugates comprises at least a first and a second antibody-oligonucleotide conjugate.
  • a set of oligonucleotide tags comprises at least a first and a second oligonucleotide tag.
  • the set of oligonucleotide tags comprises 2-20 oligonucleotide tags wherein each oligonucleotide tag in the set uniquely identifies its respective conjugated antibody where the antibody selectively binds a target polypeptide.
  • the set of oligonucleotide tags comprises 20- 100 oligonucleotide tags, or 100-500 oligonucleotide tags, or 500- 1000 oligonucleotide tags, or more oligonucleotide tags.
  • a first antibody-nucleotide conjugate and a second antibody-nucleotide conjugate may be used against a target protein encoded by a target RNA (in FIG. 28, “Protein-1 encoded by RNA-1”).
  • the first antibody- nucleotide conjugate, the second antibody-nucleotide conjugate, or both antibody-nucleotide conjugates may comprise a linker moiety.
  • a first oligonucleotide tag may be linked to the first antibody -oligonucleotide conjugate via a first linker moiety.
  • the first oligonucleotide tag may conjugate to a padlock probe.
  • the first oligonucleotide tag of the first antibody -oligonucleotide conjugate may, e.g., ligate, chemically conjugate, or have proximity crosslinking to the padlock probe.
  • a second oligonucleotide tag of the second antibody -oligonucleotide conjugate may be linked to the second antibody via a second linker moiety.
  • a first antibody-oligonucleotide conjugate and a second antibody-oligonucleotide conjugate may be used against a target protein (in FIG. 29, “Protein-1 encoded by RNA-1”).
  • the first antibody-oligonucleotide conjugate may comprise a first antibody and a first oligonucleotide tag.
  • the first oligonucleotide tag may be linked to the first antibody via a first linker moiety.
  • the second antibody-oligonucleotidetag may comprise a second antibody and a second oligonucleotide tag.
  • the second oligonucleotide tag may be linked to the second antibody via a second linker moiety.
  • the first antibody- oligonucleotide conjugate and the second antibody-oligonucleotide conjugate maybe specific to the same target protein encoded by the same target RNA.
  • the target analytes comprise polypeptides or peptide fragments.
  • a target polypeptide is encoded by a target RNA in the cellular sample.
  • Target polypeptides include polypeptides having post-translationally modified forms, including methylation, phosphorylation, glycosylation, hydroxylation, ubiquitination, nitro sylation, acetylation, lipidation, ADP -ribosylation, carbonylation, SUMOylation and/or disulfidebond formation.
  • the target polypeptides can be subjected to proteolysis for example by a protease or cleavage due to ribosomal skipping.
  • Target polypeptides also include precursor molecules that have notyetbeen subjected to post-translation modification.
  • Target polypeptides include muteins, variants, chimeric proteins and fusion proteins.
  • Target polypeptides can be labeled with a binding partner molecule having an affinity moiety, such as for example biotin (or its derivatives), digoxigenin, fluorescein, cholesterol, maltose, or any of the affinity molecules described below.
  • polysaccharides inside the cellular sample can be detected by contacting the cellular sample with lectin conjugated to an oligonucleotide carrying a tag sequence that is designed to bind the left and right arms of a padlock probe.
  • Lectinoligonucleotide conjugates can permit detection of polysaccharides using a sequencing readout.
  • lipids inside the cellular sample can be detected by contacting the cellular sample with a lipid-specific binding protein or an amphipathic polypeptide which are conjugated to an oligonucleotide carrying a tag sequence that is designed to bind the left and right arms of a padlock probe.
  • Lipid -specific binding proteins and amphipathic polypeptides that are conjugated to tagged oligonucleotides can permit detection of lipids using a sequencing readout.
  • sequencing polymerases can be used for conducting sequencing reactions.
  • the sequencing polymerase(s) is/are capable of binding and incorporating a complementary nucleotide opposite a nucleotide in a concatemer template molecule.
  • the sequencing polymerase(s) is/are capable of binding a complementary nucleotide unit of a multivalent molecule opposite a nucleotide in a concatemer template molecule.
  • the plurality of sequencing polymerases comprise recombinant mutant polymerases.
  • suitable polymerases for use in sequencing with nucleotides and/or multivalent molecules include but are not limited to : Klenow DNA polymerase; Thermus aquaticus DNA polymerase I (Taq polymerase); KlenTaq polymerase; Candidates altiarchaeales archaeon; Candidates Hadarchaeum Yellowstonense; Hadesarchaea archaeon; Euryarchaeota archaeon; Thermoplasmata archaeon; Thermococcus polymerases such as Thermococcus litoralis, bacteriophage T7 DNA polymerase; human alpha, delta and epsilon DNA polymerases; bacteriophage polymerases such as T4, RB69 and phi29 bacteriophage DNA polymerases;
  • Pyrococcus furiosus DNA polymerase Pfu polymerase
  • Bacillus subtilis DNA polymerase III E. coll DNA polymerase III alpha and epsilon; 9 degree N polymerase; reverse transcriptases such as HIV type M or 0 reverse transcriptases; avian myeloblastosis virus reverse transcriptase; Moloney Murine Leukemia Virus (MMLV) reverse transcriptase; or telomerase.
  • DNA polymerases include those from various Archaea genera, such as, Aeropyrum, Archaeoglobus, Desulfurococcus, Pyrobaculum, Pyrococcus, Pyrolobus, Pyrodictium, Staphylothermus, Stetteria, Sulfolobus, Thermococcus, and Vulcanisaeta and the like or variants thereof, including such polymerases as are known in the art such as 9 degrees N, VENT, DEEP VENT, THERMINATOR, Pfu, KOD, Pfx, Tgo and RB69 polymerases.
  • the sequencing comprises conducting sequencing reactions inside a cellular sample, where the cDNA amplicons are the concatem er molecules.
  • the composition comprises a non-labeled chain -terminating nucleotide.
  • the composition comprises a primed concatemer (e.g., a concatem er annealed to a plurality of sequencing primers).
  • the composition comprises the primed concatemer contacted with at least two separate mixtures under ternary complex stabilizing conditions.
  • the at least two separate mixtures can each include a polymerase and a nucleotide.
  • the primed concatemer is contacted with nucleotide cognates for first, second and third base type base types in the template.
  • nucleotide cognates for first, second and third base type base types in the template are described in U.S. PatentNos. 10,246,744 and 10,731,141 (where the contents of both patents are hereby incorporated by reference in their entireties).
  • a padlock probe for use in any of the methods as described herein.
  • a padlock probe may be designed to selectively detect a target molecule.
  • the padlock probe maybe designed to selectively detect a target nucleic acid or polypeptide.
  • the target nucleic acid may be a target DNA or a target RNA.
  • binding the padlock probe to the target nucleic acid orprotein/polypeptide one can learn information about one or more properties of the target nucleic acid or protein/poly peptide. For example, a location of the target nucleic acid orprotein/polypeptide can be ascertained.
  • the padlock probe may comprise one or more batch-specific primer-binding sites, which are specific to a primer.
  • the primer may be spatially localized, andthe padlock probe may therefore not require abarcode sequence.
  • the location of the target can give information about the sample, e.g., cellular origin. In such instances, it may not be necessary for the padlock probe to carry a barcode. However, in other non-limiting instances, the padlock probe may carry a barcode.
  • the present disclosure provides a padlock probe that is specific to a target RNA.
  • the RNA-specific padlock probe may selectively hybridize to a cDNA that corresponds to the target RNA.
  • the RNA-specific probe may lack a barcode that uniquely identifiesthe cDNA. In such instances, a spatial information maybe obtained with respect to the RNA-specific probe.
  • the RNA-specific probe may carry a barcode that uniquely identifies the cDNA.
  • the RNA-specific padlock probes may also carry a sequencing primer binding site.
  • the sequencing primer binding site may be a batch-specific primer binding site — in other words, a primer may bind to the primer binding site that is unique to the probe.
  • One or more padlock probes may be used. Each probe may have a unique batch-specific primer binding site. The use of a batch-specific primer binding site may enable the spatial localization of the padlock probe, thereby obviating the need for a barcode sequence on the padlock probe.
  • a plurality of cDNA inside the cellular sample can be amplified to generate cDNA amplicons (e g., concatemers).
  • the plurality of cDNA molecules can be amplified by conducting a padlock probe circularization and rolling circle amplification workflow.
  • the compositions comprise contacting the plurality of cDNA molecules with a plurality of padlock probes, including a first plurality oftarget-specific padlock probes that hybridize with first target cDNA molecules, and a second plurality of target-specific padlock probes that hybridize with second target cDNA molecules.
  • the padlock probes comprise single -stranded oligonucleotides.
  • the padlock probes comprise DNA, RNA, DNA and RNA.
  • the padlock probes comprise canonical nucleotides and/or nucleotide analogs.
  • the padlock probes are modified to confer resistance to nuclease degradation (e g., ribonuclease degradation).
  • the padlock probes comprise at least one phosphorothioate diester bond at their 5’ ends which can render the padlock probes resistant to nuclease degradation.
  • the padlock probes comprise 2-5 or more consecutive phosphorothioate diester bonds at their 5’ ends.
  • the padlock probes comprise at least one ribonucleotideand/or at least one 2’ -O-m ethyl, 2’-O-methoxyethyl (MOE), 2’ fluoro-base nucleotide.
  • the padlock probes comprise phosphorylated 3 ’ ends.
  • the padlock probes comprise at least one locked nucleic acid (LNA) base.
  • the padlock probes comprise a phosphorylated 5 ’ end (e.g., using a polynucleotide kinase).
  • the plurality of padlockprobes can include a first plurality of target-specific padlock probes. In some embodiments, the first plurality oftarget-specific padlock probes can hybridize with the first target cDNA molecules. In some embodiments, the plurality of padlock probes can include a second plurality oftarget-specific padlock probes. In some embodiments, the second plurality of target-specific padlock probes can hybridize with the second target cDNA molecules.
  • individual padlock probes comprise first and second terminal regions that hybridize to portions of cDNA molecules to form a plurality of cDNA-padlock probe complexes, wherein individual complexes have the firstand second terminal probe regions hybridized to proximal regions of a cDNA molecule to form a nick or gap between the first and second terminal probe ends.
  • the first terminal region of an individual padlock probe has a first target-specific sequence that selectively hybridizes to a first region of a target cDNA molecule
  • the second terminal region of the individual padlock probe has a second target-specific sequence that selectively hybridizes to a second region of the same target cDNA molecule, where a nick or gap is formed between the hybridized first and second terminal regions, thereby circularizing the padlock probe.
  • the first terminal region of an individual padlock probe can have a first target-specific sequence that can selectively hybridize to a first region of a target cDNA molecule.
  • the second terminal region of an individual padlock probe can have a second target-specific sequence that can selectively hybridize to a second region of a target cDNA molecule.
  • a nick or gap can be formed between the hybridized first and second terminal regions.
  • the hybridization of the first terminal region and the second terminal region can circularize the padlock probe.
  • individual padlock probes in a set of padlock probes comprise firstand second terminal regions that hybridize to the same target regions of the target cDNA molecules to form a plurality of cDNA-padlock probe complexes having the same cDNA sequence.
  • a set of padlock probes (e.g., a plurality of padlock probes) comprise at least two sub-sets of padlock probes.
  • individual padlock probes in a first sub-set of padlock probes comprise first and second terminal regions that hybridize to the same target regions (e.g., a first target region) of the target cDNA molecules to form a first plurality of cDNA-padlock probe complexes having the same cDNA sequence.
  • individual padlock probes in a second sub-set of padlock probes comprise first and second terminal regions that hybridizeto the same target regions (e g., a second target region) of the target cDNA molecules to form a second plurality of cDNA-padlockprobe complexes having the same cDNA sequence.
  • the first and second subsets of padlock probes hybridize to different target regions of the same target cDNA molecules.
  • the first and second sub -sets of padlock probes hybridize to different target regions of different target cDNA molecules.
  • the set of padlock probes comprise 2-10 sub-sets of padlock probes, or 10-25 sub-sets of padlock probes, or 25-50 sub-sets of padlock probes, or up to 100 sub-sets of padlock probes. In some embodiments, the set of padlock probes comprise atleast 100 sub-sets of padlock probes, atleast 500 sub-sets of padlock probes, atleast 1000 sub-sets of padlock probes, atleast 10,000 sub-sets of padlock probes, or more sub-sets of padlock probes. [0471] In some embodiments, the nicks can be enzymatically ligated to generate covalently closed circular padlock probes.
  • the ligase enzyme can discriminate between matched and mis-matched hybridized ends to ensure target-specific hybridization.
  • the ligation reaction comprises use of a ligase enzyme, including a T3 , T4, T7 or Taq DNA ligase enzyme.
  • the size of the gap between the hybridized first and second terminal regions is 1 -25 bases.
  • the 3 ’OH end of hybridized padlock probe can serve as an initiation site for a polymerase-catalyzed fill-in reaction (e.g., gap fill-in reaction) using the target cDNA molecule as a template. After the fill-in reaction, the remaining nick can be enzymatically ligated to generate covalently closed circular padlock probes.
  • the gap-filling reaction comprises contacting the circularized padlock probe with a DNA polymerase and a plurality of nucleotides.
  • the DNA polymerase comprises E. colt DNA polymerase I, Klenow fragment of E. coli DNA polymerase I, T7 DNA polymerase, or T4 DNA polymerase.
  • the ligase enzyme can discriminate between matched and mis-matched hybridized ends to ensure targetspecific hybridization.
  • the ligation reaction comprises use of a ligase enzyme, including a T3, T4, T7 or Taq DNA ligase enzyme.
  • the padlock probes comprise at least one universal adaptor sequence including a sample barcode sequence, an amplification primer binding site, a sequencing primer binding site, a compaction oligonucleotide binding site and/or a surface capture primer binding site.
  • the padlock probes comprise at least one unique identification sequence (e.g., unique molecular index (UMI).
  • UMI unique molecular index
  • the padlock probes comprise at least one restriction enzyme recognition sequence.
  • the present disclosure provides methods for detecting the sequence of a nucleic acid in a biological sample.
  • the nucleic acid maybe, e.g., DNA orRNA.
  • the biological sample may be a cellular sample.
  • compositions for detecting in situ at least two different target RNA molecules in a cellular sample which can include conducting sequencing reactions inside the cellular sample, where the cDNA amplicons can be the concatemer molecules.
  • any of the compositions described herein can comprise at least one nucleotide.
  • the nucleotides comprise a base, sugar and at least one phosphate group.
  • at least one nucleotide in the plurality comprises an aromatic base, a five carbon sugar (e.g., ribose or deoxyribose), and one or more phosphate groups (e.g., 1 -10 phosphate groups).
  • the plurality of nucleotides can comprise at least one type of nucleotide selected from a group consisting of dATP, dGTP, dCTP, dTTP and dUTP.
  • the plurality of nucleotides can comprise at a mixture of any combination of two or more types of nucleotides selected from a group consisting of dATP, dGTP, dCTP, dTTP and/or dUTP.
  • at least one nucleotide in the plurality is not a nucleotide analog.
  • at least one nucleotide in the plurality comprises a nucleotide analog.
  • At least one nucleotide in the plurality of nucleotides comprise a chain of one, two or three phosphorus atoms where the chain is typically attached to the 5 ’ carbon of the sugar moiety via an ester or phosphoramide linkage.
  • at least one nucleotide in the plurality is an analog having a phosphorus chain in which the phosphorus atoms are linked together with intervening O, S, NH, methylene or ethylene.
  • the phosphorus atoms in the chain include substituted side groups including O, S or BH3.
  • the chain includes phosphate groups substituted with analogs including phosphoramidate, phosphorothioate, phosphordithioate, and O-methyl phosphoramidite groups.
  • At least one nucleotide in the plurality of nucleotides comprises a terminator nucleotide analog having a chain terminating moiety (e.g., blocking moiety) at the sugar 2’ position, atthe sugar 3’ position, or at the sugar 2’ and 3 ’ position.
  • the chain terminating moiety can inhibit polymerase-catalyzed incorporation of a subsequent nucleotide unit or free nucleotide in a nascent strand during a primer extension reaction.
  • the chain terminating moiety is attached to the 3 ’ sugar hydroxyl position where the sugar comprises a ribose or deoxyribose sugar moiety. In some embodiments, the chain terminating moiety is removable/cleavable from the 3’ sugar hydroxyl position to generate a nucleotide having a 3 ’OH sugar group which is extendible with a subsequent nucleotide in a polymerase-catalyzed nucleotide incorporation reaction.
  • the chain terminating moiety comprises an alkyl group, alkenyl group, alkynyl group, allyl group, aryl group, benzyl group, azide group, amine group, amide group, keto group, isocyanate group, phosphate group, thiol group, disulfide group, carbonate group, urea group, or silyl group.
  • the chain terminating moiety is cleavable/removable from the nucleotide, for example by reacting the chain terminating moiety with a chemical agent, pH change, lighter heat.
  • the chain terminating moieties alkyl, alkenyl, alkynyl and allyl are cleavable with tetrakis(triphenylphosphine)palladium(0) (Pd(PPh3)4) with piperidine, or with 2,3 -Dichloro-5,6- dicyano-l,4-benzo-quinone (DDQ)
  • the chain terminating moieties aryl and benzyl are cleavable with H2 Pd/C.
  • the chain terminating moieties amine, amide, keto, isocyanate, phosphate, thiol, disulfide are cleavable with phosphine or with a thiol group including beta-mercaptoethanol or dithiothritol (DTT).
  • the chain terminating moiety carbonate is cleavable with potassium carbonate (K2CO3) in MeOH, with triethylamine in pyridine, or with Zn in acetic acid (AcOH).
  • the chain terminating moieties urea and silyl are cleavable with tetrabutylammonium fluoride, pyridine-HF, with ammonium fluoride, or with triethylamine trihydrofluoride.
  • At least one nucleotide in the plurality of nucleotides comprises a terminator nucleotide analog having a chain terminating moiety (e.g., blocking moiety) at the sugar 2’ position, at the sugar 3’ position, or at the sugar 2’ and 3 ’ position.
  • the chain terminating moiety comprises an azide, azido or azidomethyl group.
  • the chain terminating moiety comprises a 3 ’-O-azido or 3 ’-O-azidomethyl group.
  • the chain terminating moieties azide, azido and azidomethyl group are cleavable/removable with a phosphine compound.
  • the phosphine compound comprises a derivatized tri -alkyl phosphine moiety or a derivatized tri -aryl phosphine moiety.
  • the phosphine compound comprises Tris(2-carboxyethyl)phosphine (TCEP) or bis-sulfo triphenyl phosphine (BS-TPP) or Tri(hydroxyproyl)phosphine (THPP).
  • the cleaving agent comprises 4 -dimethylaminopyridine (4-DMAP).
  • the nucleotide comprises a chain terminating moiety which is selected from a group consisting of 3 ’ -deoxy nucleotides, 2’,3’-dideoxynucleotides, 3 ’-methyl, 3’-azido, 3 ’-azidomethyl, 3’-O-azidoalkyl, 3’- O-ethynyl, 3’-O-aminoalkyl, 3’-O-fluoroalkyl, 3 ’-fluoromethyl, 3 ’-difluoromethyl, 3 ’- trifluoromethyl, 3 ’-sulfonyl, 3 ’-malonyl, 3 ’-amino, 3’-O-amino, 3 ’ -sulfhydral, 3 ’-aminomethyl, 3 ’-ethyl, 3 ’butyl, 3 ‘ -tertbut
  • the plurality of nucleotides comprises a plurality of nucleotides labeled with detectable reporter moiety.
  • the detectable reporter moiety comprises a fluoroph ore.
  • the fluorophore is attached to the nucleotide base.
  • the fluorophore is attached to the nucleotide base with a linker which is cleavable/removable from the base.
  • at least one of the nucleotides in the plurality is not labeled with a detectable reporter moiety.
  • a particular detectable reporter moiety e.g., fluorophore
  • the nucleotide base e.g., dATP, dGTP, dCTP, dTTP or dUTP
  • the nucleotide base e.g., dATP, dGTP, dCTP, dTTP or dUTP
  • the cleavable linker on the nucleotide base comprises a cleavable moiety comprising an alkyl group, alkenyl group, alkynyl group, allyl group, aryl group, benzyl group, azide group, amine group, amide group, keto group, isocyanate group, phosphate group, thiol group, disulfide group, carbonate group, urea group, or silyl group.
  • the cleavable linker on the base is cleavable/removable from the base by reacting the cleavable moiety with a chemical agent, pH change, light or heat.
  • the cleavable moieties alkyl, alkenyl, alkynyl and allyl are cleavable with tetrakis(triphenylphosphine)palladium(0) (Pd(PPh3)4) with piperidine, or with 2,3-Dichloro-5,6-dicyano-l,4-benzo-quinone (DDQ).
  • the cleavable moieties aryl and benzyl are cleavable with H2 Pd/C.
  • the cleavable moieties amine, amide, keto, isocyanate, phosphate, thiol, disulfide are cleavable with phosphine or with a thiol group including beta-mercaptoethanol or dithiothritol (DTT).
  • the cleavable moiety carbonate is cleavable with potassium carbonate (K2CO3) in MeOH, with tri ethylamine in pyridine, or with Zn in acetic acid (AcOH).
  • the cleavable moieties urea and silyl are cleavable with tetrabutylammonium fluoride, pyridine - HF, with ammonium fluoride, or with tri ethylamine trihydrofluoride.
  • the cleavable linker on the nucleotide base comprises cleavable moiety including an azide, azido or azidomethyl group.
  • the cleavable moieties azide, azido and azidomethyl group are cleavable/removable with a phosphine compound.
  • the phosphine compound comprises a derivatized tri-alkyl phosphine moiety or a derivatized tri-aryl phosphine moiety.
  • the phosphine compound comprises Tris(2- carboxyethyl)phosphine (TCEP) orbis-sulfo triphenyl phosphine (BS-TPP) or Tri(hydroxyproyl)phosphine (THPP).
  • the cleaving agent comprises 4- dimethylaminopyridine (4-DMAP).

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Organic Chemistry (AREA)
  • Engineering & Computer Science (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Genetics & Genomics (AREA)
  • Biotechnology (AREA)
  • Immunology (AREA)
  • Molecular Biology (AREA)
  • Microbiology (AREA)
  • Physics & Mathematics (AREA)
  • Analytical Chemistry (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Urology & Nephrology (AREA)
  • Hematology (AREA)
  • Pathology (AREA)
  • Food Science & Technology (AREA)
  • Medicinal Chemistry (AREA)
  • Cell Biology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • General Physics & Mathematics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Plant Pathology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Apparatus Associated With Microorganisms And Enzymes (AREA)
PCT/US2023/065972 2022-04-20 2023-04-19 Reiterative short read sequencing inside a cellular sample Ceased WO2023205707A2 (en)

Priority Applications (7)

Application Number Priority Date Filing Date Title
KR1020247038517A KR20250011725A (ko) 2022-04-20 2023-04-19 세포 샘플 내부에서의 반복적 짧은 리드 시퀀싱
EP23792763.7A EP4511515A4 (en) 2022-04-20 2023-04-19 Reiterative short read sequencing inside a cellular sample
CA3249276A CA3249276A1 (en) 2022-04-20 2023-04-19 ITERATIVE SHORT PLAY SEQUENCING WITHIN A CELL SAMPLE
JP2024562006A JP2025514791A (ja) 2022-04-20 2023-04-19 細胞試料内部での反復ショートリードシーケンシング
AU2023258027A AU2023258027A1 (en) 2022-04-20 2023-04-19 Reiterative short read sequencing inside a cellular sample
CN202380048112.1A CN119487210A (zh) 2022-04-20 2023-04-19 细胞样品内的反复短读测序
US18/919,127 US20250146068A1 (en) 2022-04-20 2024-10-17 Reiterative short read sequencing inside a cellular sample

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US202263332690P 2022-04-20 2022-04-20
US63/332,690 2022-04-20
US202263334023P 2022-04-22 2022-04-22
US63/334,023 2022-04-22

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US18/919,127 Continuation US20250146068A1 (en) 2022-04-20 2024-10-17 Reiterative short read sequencing inside a cellular sample

Publications (2)

Publication Number Publication Date
WO2023205707A2 true WO2023205707A2 (en) 2023-10-26
WO2023205707A3 WO2023205707A3 (en) 2023-11-30

Family

ID=88420782

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2023/065972 Ceased WO2023205707A2 (en) 2022-04-20 2023-04-19 Reiterative short read sequencing inside a cellular sample

Country Status (8)

Country Link
US (1) US20250146068A1 (https=)
EP (1) EP4511515A4 (https=)
JP (1) JP2025514791A (https=)
KR (1) KR20250011725A (https=)
CN (1) CN119487210A (https=)
AU (1) AU2023258027A1 (https=)
CA (1) CA3249276A1 (https=)
WO (1) WO2023205707A2 (https=)

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12091657B2 (en) 2018-06-12 2024-09-17 Element Biosciences, Inc. Reverse transcriptase for nucleic acid sequencing
US12117438B2 (en) 2019-09-06 2024-10-15 Element Biosciences, Inc. Multivalent binding composition for nucleic acid analysis
US12134766B2 (en) 2018-11-15 2024-11-05 Element Biosciences, Inc. Methods for generating circular nucleic acid molecules
US12146190B2 (en) 2021-07-21 2024-11-19 Element Biosciences, Inc. Optical systems for nucleic acid sequencing and methods thereof
US12241891B2 (en) 2019-09-23 2025-03-04 Element Biosciences, Inc. Multivalent binding composition for nucleic acid analysis
US12313627B2 (en) 2019-05-01 2025-05-27 Element Biosciences, Inc. Multivalent binding composition for nucleic acid analysis
US12331356B2 (en) 2018-11-14 2025-06-17 Element Biosciences, Inc. Multipart reagents having increased avidity for polymerase binding
WO2025147667A1 (en) * 2024-01-05 2025-07-10 Element Biosciences, Inc. Target probe complexes and analyte detection complexes for in situ detecting and identifying target analytes
US12391929B2 (en) 2019-05-24 2025-08-19 Element Biosciences, Inc. Polymerase-nucleotide conjugates for sequencing by trapping
WO2025235016A1 (en) 2024-05-07 2025-11-13 Element Biosciences, Inc. Sequencing method with spatial flexibility
US12540350B2 (en) 2022-08-15 2026-02-03 Element Biosciences, Inc. Spatially resolved surface capture of nucleic acids
WO2026080904A2 (en) 2024-10-11 2026-04-16 Element Biosciences, Inc. Systems and methods for performing dna sequencing
US12612660B2 (en) 2024-01-19 2026-04-28 Element Biosciences, Inc. Optical systems for nucleic acid sequencing and methods thereof

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10059990B2 (en) * 2015-04-14 2018-08-28 Massachusetts Institute Of Technology In situ nucleic acid sequencing of expanded biological samples
US11287422B2 (en) * 2019-09-23 2022-03-29 Element Biosciences, Inc. Multivalent binding composition for nucleic acid analysis

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12091657B2 (en) 2018-06-12 2024-09-17 Element Biosciences, Inc. Reverse transcriptase for nucleic acid sequencing
US12331356B2 (en) 2018-11-14 2025-06-17 Element Biosciences, Inc. Multipart reagents having increased avidity for polymerase binding
US12134766B2 (en) 2018-11-15 2024-11-05 Element Biosciences, Inc. Methods for generating circular nucleic acid molecules
US12492396B2 (en) 2018-11-15 2025-12-09 Element Biosciences, Inc. Methods for generating circular nucleic acid molecules
US12313627B2 (en) 2019-05-01 2025-05-27 Element Biosciences, Inc. Multivalent binding composition for nucleic acid analysis
US12391929B2 (en) 2019-05-24 2025-08-19 Element Biosciences, Inc. Polymerase-nucleotide conjugates for sequencing by trapping
US12117438B2 (en) 2019-09-06 2024-10-15 Element Biosciences, Inc. Multivalent binding composition for nucleic acid analysis
US12241891B2 (en) 2019-09-23 2025-03-04 Element Biosciences, Inc. Multivalent binding composition for nucleic acid analysis
US12146190B2 (en) 2021-07-21 2024-11-19 Element Biosciences, Inc. Optical systems for nucleic acid sequencing and methods thereof
US12540350B2 (en) 2022-08-15 2026-02-03 Element Biosciences, Inc. Spatially resolved surface capture of nucleic acids
WO2025147667A1 (en) * 2024-01-05 2025-07-10 Element Biosciences, Inc. Target probe complexes and analyte detection complexes for in situ detecting and identifying target analytes
US12612660B2 (en) 2024-01-19 2026-04-28 Element Biosciences, Inc. Optical systems for nucleic acid sequencing and methods thereof
WO2025235016A1 (en) 2024-05-07 2025-11-13 Element Biosciences, Inc. Sequencing method with spatial flexibility
WO2026080904A2 (en) 2024-10-11 2026-04-16 Element Biosciences, Inc. Systems and methods for performing dna sequencing

Also Published As

Publication number Publication date
EP4511515A2 (en) 2025-02-26
EP4511515A4 (en) 2026-04-29
WO2023205707A3 (en) 2023-11-30
KR20250011725A (ko) 2025-01-21
CA3249276A1 (en) 2023-10-26
CN119487210A (zh) 2025-02-18
AU2023258027A1 (en) 2024-12-05
US20250146068A1 (en) 2025-05-08
JP2025514791A (ja) 2025-05-09

Similar Documents

Publication Publication Date Title
US20250146068A1 (en) Reiterative short read sequencing inside a cellular sample
US20250002996A1 (en) Bi-directional in situ analysis
JP7846146B2 (ja) 連結型ライゲーション
US11891651B2 (en) Compositions and methods for pairwise sequencing
US11859241B2 (en) Compositions and methods for pairwise sequencing
CN103781918B (zh) 用于组织样本中核酸的局部或空间检测的方法和产品
JP7026248B2 (ja) 二本鎖dnaを増幅するための方法およびキット
CN110312800A (zh) 处理核酸样本的方法
KR20250086817A (ko) 범용 긴 스플린트 가닥을 갖는 이중가닥 스플린트 어댑터 및 사용 방법
JP2025528821A (ja) 核酸の空間的に分解された表面キャプチャ
JP2025500815A (ja) 次世代シーケンシングにおける一次分析
US20250349138A1 (en) Three-dimensional base calling in next generation sequencing analysis
KR20250028294A (ko) 차세대 시퀀싱에서 염기 호출의 페이징 및 사전 페이징 교정
KR20250028348A (ko) 1차 분석에서의 영상 등록
WO2025147667A1 (en) Target probe complexes and analyte detection complexes for in situ detecting and identifying target analytes
HK40124113A (zh) 细胞样品内的反复短读测序
WO2025170937A1 (en) Direct in situ sequencing
HK40120147A (zh) 双链夹板衔接子和使用方法

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 2024562006

Country of ref document: JP

WWE Wipo information: entry into national phase

Ref document number: AU2023258027

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: 2023792763

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2023792763

Country of ref document: EP

Effective date: 20241120

WWE Wipo information: entry into national phase

Ref document number: 11202407334T

Country of ref document: SG

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23792763

Country of ref document: EP

Kind code of ref document: A2

ENP Entry into the national phase

Ref document number: 2023258027

Country of ref document: AU

Date of ref document: 20230419

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 202380048112.1

Country of ref document: CN

WWP Wipo information: published in national office

Ref document number: 202380048112.1

Country of ref document: CN