WO2022187719A1 - Molecular barcodes and related methods and systems - Google Patents
Molecular barcodes and related methods and systems Download PDFInfo
- Publication number
- WO2022187719A1 WO2022187719A1 PCT/US2022/019045 US2022019045W WO2022187719A1 WO 2022187719 A1 WO2022187719 A1 WO 2022187719A1 US 2022019045 W US2022019045 W US 2022019045W WO 2022187719 A1 WO2022187719 A1 WO 2022187719A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- identimer
- identimers
- cleavage
- beads
- molecular
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1034—Isolating an individual clone by screening libraries
- C12N15/1065—Preparation or screening of tagged libraries, e.g. tagged microorganisms by STM-mutagenesis, tagged polynucleotides, gene tags
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6806—Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
-
- C—CHEMISTRY; METALLURGY
- C40—COMBINATORIAL TECHNOLOGY
- C40B—COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
- C40B70/00—Tags or labels specially adapted for combinatorial chemistry or libraries, e.g. fluorescent tags or bar codes
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B82—NANOTECHNOLOGY
- B82Y—SPECIFIC USES OR APPLICATIONS OF NANOSTRUCTURES; MEASUREMENT OR ANALYSIS OF NANOSTRUCTURES; MANUFACTURE OR TREATMENT OF NANOSTRUCTURES
- B82Y5/00—Nanobiotechnology or nanomedicine, e.g. protein engineering or drug delivery
Definitions
- This disclosure relates to biotechnology. More specifically, this disclosure relates to molecular barcodes and related methods and systems.
- FIG. 1 is a graphical representation of construction steps for non-DNA barcodes.
- FIG. 2 is a graphical representation of construction steps for barcodes linked to Next Generation Sequencing (NGS) barcodes.
- NGS Next Generation Sequencing
- FIG. 3 is a graphical representation of a non-nucleic acid barcode and potential imaging results during deconvolution of the barcode.
- FIG. 4 is a graphical representation of construction steps for unlabeled barcodes linked to NGS barcodes. These may be optionally labeled.
- FIG. 5 is a graphical representation of construction steps to encode chemical libraries without using DNA.
- FIG. 6 is a graphical representation of construction steps for cleavable barcodes linked to NGS barcodes.
- FIG. 7 is a graphical representation of decoding steps to identify non-DNA molecular barcodes.
- FIG. 8 is a graphical representation of decoding steps to identify barcodes linked to NGS barcodes.
- FIG. 9 is a graphical representation of a double stranded DNA NGS- compatible barcodes with endonuclease recognition sites and potential imaging results during deconvolution of the barcode.
- FIG. 10 is a graphical representation of a non-nucleic acid NGS- compatible barcode with orthogonal protease recognition sites and potential imaging results during deconvolution of the barcode.
- FIG. 11 is a graphical representation of deconvolution steps of a visual NGS barcode.
- FIG. 12 is a graphical representation of construction of a non-DNA barcode linked to chemical building blocks.
- FIG. 13 is a graphical representation of a non-DNA barcode encoded chemical library and potential imaging results during deconvolution of the barcode.
- FIG. 14 is a graphical representation of an exemplary dual labeled, double stranded DNA identimer.
- FIG. 15 is a graphical representation of a scaffolding agent connected to detectable labels.
- FIG. 16 is a graphical representation of a non-nucleic acid barcode with dual labeled cyclic peptide building blocks and how the barcode may be decoded.
- FIG. 17 is a graphical representation of a mini well with capping beads that may be a surface for attaching barcodes for single cell analysis.
- FIG. 18 is a textual representation of a set of five single-label FND identimers having scaffold portions configured with orthogonal sticky-ends, orthogonal recognition moieties and cleavage sites as well as modified nucleotides.
- FIG. 19A represents a streptavidin bead labeled with a first labeled dsDNA identimer segment (Id 1/Hind 111-AF750) attached through ligation to an unlabeled dsDNA that is attached to the bead through a biotin moiety.
- FIG. 19B represents a streptavidin bead labeled with a second labeled dsDNA identimer segment (Id2/Spel-AF647).
- FIG. 19C represents a streptavidin bead labeled with a third labeled dsDNA identimer segment (ld3/Xhol-ATTO550).
- FIG. 19D represents a streptavidin bead labeled with a fourth labeled dsDNA identimer segment (ld4/Notl-ATT0488).
- FIG. 20A represents un-cleaved beads imaged in four fluorescence channels; 647nm (upward diagonal stripes), 550nm (downward diagonal stripes), 488nm (horizontal stripes) and 750nm (dotted), and mean intensity values obtained for each fluorophore were plotted to the right.
- FIG. 20B represents mean intensity of beads following exposure to the Notl enzyme.
- FIG. 20C represents mean intensity of beads following exposure to the Xhol enzyme.
- FIG. 20D represents mean intensity of beads following exposure to the Spel enzyme.
- FIG. 21A presents a bar graph representing OCS results obtained for single-label dsDNA identimer chains where the order of the cycles was reversed for analysis.
- FIG. 21 B presents a bar graph representing OCS results obtained for mixed-label dsDNA identimer chains where the order of the cycles was reversed for analysis.
- FIG. 22A represents the first of six I beads bearing the same single-label identimer chain sequence used to generate data from individual beads in a FOV tracked over 3 cycles of an OCS experiment.
- FIG. 22B represents the second of six I beads bearing the same single label identimer chain sequence used to generate data from individual beads in a FOV tracked over 3 cycles of an OCS experiment.
- FIG. 22C represents the third bead bearing the same single-label identimer chain sequence used to generate data from individual beads in a FOV tracked over 3 cycles of an OCS experiment.
- FIG. 22D represents the fourth bead bearing the same single-label identimer chain sequence used to generate data from individual beads in a FOV tracked over 3 cycles of an OCS experiment.
- FIG. 22E represents the fifth bead bearing the same single-label identimer chain sequence used to generate data from individual beads in a FOV tracked over 3 cycles of an OCS experiment.
- FIG. 22F represents the sixth bead bearing the same single-label identimer chain sequence used to generate data from individual beads in a FOV tracked over 3 cycles of an OCS experiment.
- FIG. 23A graphs the bead library for a bead labeled with the sequence ld1/Hindlll-AF647-ld2/Spel-AF750 ⁇ ld3/Xhol-ATTO488 ⁇ ld4/Notl-ATTO488 across the OCS experiment.
- FIG. 23B graphs the bead library for a bead labeled with the sequence Id1/Hindlll-ATT0488-Id2/Spel-ATT0550-Id3/Xhol-AF647-Id4/Notl-ATT0488 across the OCS experiment.
- FIG. 23C graphs the bead library for a bead labeled with the sequence Id1/Hindlll-ATT0550-Id2/Spel-AF750-Id3/Xhol-ATT0488-Id4/Notl-AF750 across the OCS experiment.
- FIG. 24A represents Spel enzyme cleavage of an AF-750-labelled hairpin oligo containing two AF-750 labels.
- FIG. 24B presents a graph of signals from both the AF-647 and AF-750 labels before and following Spel enzyme cleavage.
- FIG. 25A represents ligation of an ATTO550 label from a streptavidin bead and confirmation by fluorescence imaging.
- FIG. 25B represents a second labeled DNA hairpin, containing the AF-647 label orthogonally attached through ligation to the second of the three acceptor oligos on the bead and confirmation by fluorescence imaging.
- FIG. 25C represents a third labeled DNA hairpin, containing the ATTO-488 label orthogonally attached through ligation to the last of three acceptor oligos on the bead and confirmation by fluorescence imaging.
- FIG. 26A represents a bead with three different ring identimers, Id1/Spel- ATTO-550, ld2/Xhol-AF-647, and ld3/Notl-ATTO-488 before and after Spel enzyme cleavage.
- FIG. 26B presents a bar graph demonstrating efficient Spel cleavage of the ATTO-550 label.
- FIG. 27A represents a first encoding cycle (ATTO-488 labeling) of a streptavidin bead containing a first labeled ssDNA that is hybridized to a first unlabeled ssDNA that is attached to the bead through a biotin moiety.
- FIG. 27B represents a second encoding cycle (AF-647 labeling) of the streptavidin bead.
- FIG. 28A represents a first cleavage with a first enzyme (Notl) on a bead containing three different hybridized identimers, ld1/Notl-ATTO-488, Id2/Spel-AF- 647, and Id3/Xhol-ATTO-550.
- FIG. 28B represents a second cleavage with a second enzyme (Spel) on the bead containing three different hybridized identimers.
- FIG. 28C provides a graph of fluorescence imaging confirmation of efficient Notl and Spel enzymatic cleavage.
- the molecular barcodes described herein comprise a set of one or more identimers, where each identimer includes one or more detectable labels operatively connected to a scaffold portion.
- the scaffold portion comprises a recognition moiety that is orthogonal to the recognition moiety of at least one identimer in the set of identimers and a cleavage site operatively connected to the one or more detectable labels, concatenated to each other in a three- dimensional arrangement to encode and form the molecular barcode.
- a molecular barcode includes an uncleavable identimer (FIG. 2).
- an uncleavable identimer token may be included in a molecular barcode to facilitate detection of its signal response after the detectable labels of cleavable identimer tokens have dissociated from the molecular barcode.
- an uncleavable identimer token may be operatively connected to a solid phase material as a firstly encoded identimer token.
- uncleavable identimers comprise scaffolding portions lacking a chemically reactive recognition moiety or cleavage site.
- phrases “operatively coupled to” and “coupled to” refer to any form of interaction between two or more entities, including electrostatic, enzymatic, covalent, ionic, or other chemical interaction. Two entities may interact with each other even though they are not in direct contact with each other. For example, two entities may interact with each other through an intermediate entity.
- protease cleavage site sequences and protease recognition sequences are peptide sequences at which an orthogonally reactive protease binds.
- a protease cleavage site sequences further includes a cleavage site within its peptide sequence.
- orthogonally reactive cleaving agents to the molecular barcode cleaves it at the cleavage sites of identimers having recognition moieties that are reactive to the cleaving agents to dissociate the detectable labels of the identimers from the three-dimensional arrangement according to the orthogonality of the identimers and thereby induce a detectable signal response to decode the molecular barcode.
- the scaffold portion of the molecular barcode may comprise polypeptides, amino acids, cyclic peptides, nucleic acids, or a combination thereof.
- the scaffold portion may comprise polyethylene glycol (PEG)n. “n” may equal 1-12, more preferably 1-6, most preferably 2-5. Examples of labeled cyclic peptides are shown in FIG. 15 and FIG. 16.
- Detectable labels may be fluorophores or chemically- protected quantum dots, for example. Spacing detectable labels away from each other can prevent quenching and/or spectral shifting.
- Cleavage sites may be a chemical linker, peptide, ssDNA, dsDNA, or RNA for cleavage by a small molecule, protease, endonuclease, or RNAse FI respectively.
- the scaffold portion comprises N-terminal lysine residues, the N-terminal lysine residues being modifiable with a chemical group (e.g., transcyclooctene (TCO), using NHS-PEG4-TCO) compatible for concatenating to the scaffold portions of other identimers containing a C-terminal methyltetrazine group.
- a chemical group e.g., transcyclooctene (TCO), using NHS-PEG4-TCO
- C-terminal modification of identimers may be achieved using commercially available heterobifunctional crosslinkers containing the maleimide reactive moiety for attachment to the installed cysteine sulfhydryl group (Methyltetrazine-PEG4- Maleimide).
- the scaffold portions may be modified through their C-terminal cysteine residues to contain the DBCO group (using DBCO-PEG4- Maleimide) while the scaffold portions of N-terminal lysate-bearing identimers may be modified through the N-terminal lysine residues to contain the azide group.
- a terminal identimer may be included in a molecular barcode to acts as a “capping” identimer, and therefore is not modified on its C-terminal end to contain a chemically reactive click moiety.
- an identimer scaffold portion comprises a peptide fragment covalently linked through its N- and C-terminal ends to linker reagents containing chemically reactive groups that enable the orthogonal chemical concatenation of identimer tokens using a split-pooling approach.
- the recognition moiety of the identimer is a protease cleavage site sequences. In some embodiments, the recognition moiety of the identimer is a protease recognition sequence.
- a identimer scaffold portion comprises a polypeptide fragment modified to contain an N-terminal lysine residue, a C-terminal cysteine residue, and an internal non-natural amino acid bearing the azide group, and may be ordered from a commercial supplier (e.g., Anaspec Inc., located at 34801 Campus Dr. Fremont, CA 94555).
- a commercial supplier e.g., Anaspec Inc., located at 34801 Campus Dr. Fremont, CA 94555.
- the scaffold portion comprises a N-terminal cysteine residue to facilitate concatenation of N-terminal cysteine bearing identimers to each other by native chemical ligation.
- the scaffold portion comprises enzymatic recognition tags to facilitate concatenation of enzymatic recognition tag-bearing identimers.
- a click chemistry may be used to concatenate identimers because click reactions are orthogonal may have desirable reaction kinetics compared to native chemical ligation, possibly requiring less identimers to form a molecular barcodes and thereby reduce costs.
- the scaffolding portion comprises a cyclic peptide, spacing linker, or other spacing molecule to facilitate concatenation of identimers through chemically linkage.
- the cyclic peptide, spacing linker, or spacing molecule acts to space identimer monomeric units from each other by the molecular distance of the cyclic peptide, spacing linker, or spacing molecule.
- a scaffolding portion is operatively connected to a set of one or more detectable labels.
- the scaffolding portion may be configured to space the detectable labels of an identimer apart from one another. For example, spacing detectable labels apart relative to each other may enhance combinatorial labeling performance of identimers.
- a scaffolding portion comprises a cyclic peptide and may be compatibly attached (or inserted) between two identimers.
- the orientation of each identimer would not be relevant (identimers can be concatenated by attachment of either their N- or C- terminal ends) because the detectable labels may be attached to the scaffolding portion between the identimers, and the orientation of the proteolytic site will not affect protease activity.
- a scaffold portions comprises a single-stranded DNA (ssDNA) molecule having a 5’ and 3’ end that is fluorescently labeled with a detectable label through a modified internal base, and is pre-hybridized with a complementary ssDNA to form a dsDNA duplex.
- the formed dsDNA duplex may encode one or more copies of a single endonuclease restriction sequence (or site type).
- site type a single endonuclease restriction sequence
- one strand of the dsDNA duplex containing the nuclease recognition sequence(s) comprises an internal-amino modification for attachment of a pre designated fluorophore.
- the detectable labels comprise one or more Q- dots because of their brightness (i.e., quantum yield) and multiplexing capabilities.
- the detectable labels comprise fluorophores having different emission wavelengths that may be covalently attached to the scaffold portion of an identimer.
- the combination of detectable signal responses of a formed and encoded molecular barcode may or may not comprise contiguous emission spectra, as the three-dimensional arrangement of the identimer tokens forming the molecular barcode may vary.
- a molecular barcode may be formed and encoded to include about one million (10 6 ) identimer class variations if Q-dots are used in parallel by split-pooling. Dual labeling of each identimer with combinatorial Q-dot labels may expand this number to about one billion (10 10 ) identimer class variations.
- the use of fluorophore-doped nanoparticles may provide further identimer class variations.
- the recognition moiety of at least one identimer in the set of identimers may comprise a chemical linker, peptide, or nucleic acid.
- the cleavage site may be within the recognition moiety.
- each recognition moiety in the set of identimers may comprise at least one moiety selected from a protease recognition moiety, an endonuclease recognition moiety, an epitope recognizable by an affinity reagent, a nucleic acid probe recognition moiety, a modified peptide side chain, and an unnatural peptide side chain.
- FIG. 14 An exemplary identimer having two detectable labels and two of the same recognition site is shown in FIG. 14.
- Molecular barcodes disclosed herein may be constructed by attaching an identimer to another identimer via specific compatible chemistries, such as click chemistry moieties, native chemical ligation linkers, or enzyme mediated ligation linkers, for example DNA ligation by T4 DNA ligase.
- the molecular barcode may further comprise an identimer linker between each identimer in the set of identimers.
- the identimer linker may be selected from a group of click chemistry moieties, native chemical ligation linkers, and enzyme-mediated ligation linkers.
- identimers may be sequentially added in a pre defined order to form barcodes over sequential rounds of splitting and pooling. Each round of identimer addition will enable concatenation of identimers through orthogonal chemical ligation.
- concatenation of identimers by ligation may be performed in the same reaction buffer (1X Phosphate Buffered Saline (PBS) or 1X T4 DNA ligase buffer) using a molar excess of the incoming identimer to be concatenated.
- PBS Phosphate Buffered Saline
- 1X T4 DNA ligase buffer 1X T4 DNA ligase buffer
- N-terminal lysine residues installed within scaffold portions may be modified with a chemical group (transcyclooctene (TCO), using NHS-PEG4-TCO) compatible for attachment to other identimers that contain a C- terminal methyltetrazine group.
- TCO transcyclooctene
- C-terminal modification of identimers may be achieved using commercially available heterobifunctional crosslinkers containing the maleimide reactive moiety for attachment to the installed cysteine sulfhydryl group (Methyltetrazine-PEG4-Maleimide).
- some identimers may be modified through their C-terminal cysteine residues to contain the DBCO group (using DBCO-PEG4-Maleimide), while some identimers may be modified through their N-terminal lysine residues to contain the azide group.
- a terminal identimer is included in the formed molecular barcode and acts as a “capping” identimer, and therefore is not modified on its C-terminal end to contain a chemically reactive click moiety.
- Molecular barcodes linked to Next Generation Sequencing barcodes may have the scaffolding portion comprising double stranded DNA and the recognition moiety comprising protein, DNA, RNA, or a chemically cleavable moiety.
- the double stranded DNA may have a 3’ overlap for capture such as Poly(T), a unique molecular identifier, and/or a 5’ PCR handle (FIG. 9 and FIG. 10).
- the recognition moiety may comprise a single endonuclease cleavage site type for each identimer class making up the barcode (FIG. 5).
- the cleavage agents for these molecular barcodes may be proteases (FIG. 8, FIG. 10), nucleases (FIG. 7, FIG. 9), or chemical cleavage agents.
- Many endonucleases are known in the art to possess orthogonal reactivity with respect to one another. For example, Notl, Xhol, Spel, and Hindi 11 possess specificity for dsDNA sequences containing GCGGCCGC, CTCGAG, ACTAGT, and AAGCTT, respectively.
- Notl should not specifically cleave sites recognized by Xhol, Spel or Hindi 11 with high efficiency.
- the same is true with respect to each of the nucleases listed, as each should be capable of cleaving (primarily), only their respective substrates in the presence of substrates recognized by the others.
- the molecular barcode may have the scaffold portion comprising double stranded DNA, the recognition moiety comprising a nuclease recognition moiety, and the one or more detectable labels comprising a fluorophore.
- the molecular barcode may have the scaffold portion comprising double stranded DNA, the recognition moiety comprising a protease recognition moiety, and the one or more detectable labels comprising a fluorophore.
- the molecular barcode of the first embodiment may have the scaffold portion comprising double stranded DNA, the recognition moiety comprising a chemical cleavage recognition moiety, and the one or more detectable labels comprising a fluorophore.
- Non-DNA molecular barcodes may have the scaffolding portion comprise amino acids, peptides, or protein.
- the recognition moiety may be made of a chemical cleavable moiety and may include more than one detectable label, for example, a combination of two or more different labels.
- the cleavage agents for these barcodes may be proteases or chemical cleavage agents, which may be added sequentially.
- the molecular barcode may have the scaffold portion comprising amino acids, the recognition moiety comprising a chemical cleavage recognition moiety, and the one or more detectable labels comprising a fluorophore.
- the molecular barcode may have the scaffold portion comprising amino acids, the recognition moiety comprising a protease recognition moiety, and the one or more detectable labels comprising a fluorophore.
- the recognition moiety may comprise a chemical linker, peptide, or nucleic acid. Identimers containing peptides of the above recognition motifs belong to different classes; a class is defined by the protease it is identified by.
- the one or more of the detectable labels may comprise a fluorophore.
- the molecular barcode may be associated with a material having mutual information with the three-dimensional arrangement of the molecular barcode. Accordingly, the material may be classified by decoding the molecular barcode.
- a set of one or more molecular barcodes may be introduced into a milieu (e.g., a chemical or biochemical system), wherein at least one molecular barcode in the set has mutual information with a material also present in the milieu.
- Orthogonally reactive cleaving agents to the set of molecular barcodes may be selectively applied to cleave the molecular barcodes at the cleavage sites that are reactive to the cleaving agents.
- the cleavage dissociates the detectable labels of identimers from the three-dimensional arrangements according to the orthogonality of the identimers.
- the dissociation of the detectable labels induces a detectable signal response.
- the signal response may be used to decode the set of one or more molecular barcodes (i.e., determine the identity and order of the specific identimers).
- Decoding the set of one or more molecular barcodes reveals the mutual information shared with the material (e.g., the identity and/or concentration of the material). This material may be also linked to the same bead as the molecular barcode used for encoding its identity.
- to encode refers to converting information, data, or classification instructions into a converted format.
- to decode refers to reversing an encoding process to extract information from a converted format.
- mutant information refers to a quantity information obtainable about a first variable through observing a second variable.
- orthogonal refers to a component in a multicomponent system that has chemical reactivity with a particular reagent under a specific set of reaction conditions while at least one other component in the multicomponent system has limited or no reactivity with the reagent, even though all components in the multicomponent system are present in the same milieu.
- orthogonal reactivity refers to a component in a multicomponent system that has chemical reactivity with a particular reagent under a specific set of reaction conditions while at least one or more components in the system does not, even though all the components in the system are present in the same milieu.
- orthogonal reactivity refers to a material having orthogonal reactivity.
- an “identimer” is a name given to a single “identifying molecule.”
- a single identimer molecule comprises a set of one or more detectable labels operatively connected to scaffold portion, the scaffold portion comprising a recognition moiety and a cleavage site, prior to their incorporation and encoding of a molecular barcode.
- an “identimer class” is a name given to a set of one or more identimers having an essentially identical configuration. Thus, identimers in a class should react essentially the same to a stimulus.
- an “identimer token” is an tangible instance of an identimer class that has been incorporated into a molecular barcode or has been physically associated with a material.
- a “recognition moiety” and “recognition portion” are used interchangeably and refer to chemical moieties that are reactive to chemical or enzymatic cleaving agents.
- cleavage site is the point of cleavage between two disassociated molecules. In some embodiments, a cleavage site is located within the recognition moiety. In some embodiments, a cleavage site is located outside of or distant from, the recognition moiety.
- token is a thing acting as a visible or tangible representation of information, such as a fact, quality, data, or other form of information.
- the material may comprise a test agent operatively coupled to a molecular barcode (collectively referred to as a “test construct”), wherein the three-dimensional arrangement of the molecular barcode has mutual information with the test agent.
- test constructs utilizing the barcodes may be used in screening.
- methods of screening may include introducing a plurality of test constructs into a milieu, wherein each test construct comprises a test agent operatively coupled to a unique molecular barcode, wherein the three-dimensional arrangement of the molecular barcode has mutual information with the test agent.
- the plurality of test constructs are screened against a set of one or more targets. The activity of one or more of the test constructs is determined.
- Orthogonally reactive cleaving agents are selectively applied to the plurality of test constructs to cleave coupled molecular barcodes at the cleavage sites that are reactive to the cleaving agents (i.e., primarily cleaves it at the cleavage sites of only the identimers having recognition moieties that are reactive to the cleaving agents).
- This dissociate the detectable labels of identimers from the three-dimensional arrangements only according to the orthogonality of the identimers. Therefore, the detectable signal response can be used to decode the barcodes (i.e., identify the barcode identimer sequence, in this example) and thereby identify the corresponding test agents with activity against the set of targets.
- Recognition and cleavage of detectable labels in some molecular barcodes may need to be performed in sequence from the outermost segment toward the bead (e.g., FIG. 7), or barcode information may be lost. Additionally, recognition and cleavage of detectable labels in some molecular barcodes may need to be performed in a known sequence to retain the barcode information (e.g., FIG. 8, FIG. 11).
- Recognition moiety can be made of protein or a chemically cleavable moiety. Optionally the recognition moiety contains more than one detectable label, for example, combination of two or more different labels.
- the detectable signal response may be induced without enrichment.
- the detectable signal response may be an increase or decrease in signal intensity, such as fluorescence intensity.
- Selectively applying orthogonally reactive cleavage agents to the plurality of test constructs may involve applying the cleavage reagents sequentially (e.g., applying a single cleaving agent iteratively or two or more cleaving agents individually and sequentially).
- Cleavage agents may be different proteases and/or different chemical cleavage agents.
- Methods for generating a molecular barcode include providing a set of one or more identimers, each identimer in the set of identimers comprising a set of one or more detectable labels operatively connected to a scaffold portion, the scaffold portion comprising a recognition moiety that is orthogonal to the recognition moiety of at least one identimer in the set of identimers and a cleavage site operatively connected to a set of one or more detectable labels.
- the identimers may be labeled in a combinatorial way.
- 10 different quantum dots may be encoded with the same protease site for each identimer in a 7-identimer barcode, which could possibly result encoding 10 million different barcodes or beads.
- the methods include selecting first and second identimers from the set of identimers and concatenating the first identimer to the second identimer in a three-dimensional arrangement to encode and form a molecular barcode.
- the method may continue with selecting a identimer from the set of identimers and concatenating it to the molecular barcode to modify the three dimensional arrangement and further encode and form the molecular barcode. This last step may be repeated one to nth times.
- Identimer chains may be built by a sequential split-pooling approach, and a different class is added in each round.
- a variety of identimer classes may be concurrently used to encode and form molecular barcodes by split-pool ligation to create a combinatorial library of molecular barcodes without reliance on DNA sequencing-based decoding.
- individual molecular barcodes may be generated in the presence of other molecular barcodes being formed and encoded simultaneously.
- Such other molecular barcodes may include Next Generation Sequencing barcodes attached to sites on the bead other than the site of identimer barcode attachment.
- identimers may be used to build combinatorial chains of identimer-based molecular barcodes by splitting and pooling of beads. Essentially, each round of addition adds a unique identimer token to each combinatorial chain. Essentially, each resulting molecular barcode will contain a combination of detectable labels organized according to the three-dimensional arrangement of the identimer tokens incorporated into the molecular barcode. [00100] Identimers may be attached by native chemical ligation, click chemistry, enzymatically by peptide ligases, sortase or SFP synthase for example. Only one class of identimer within a barcode is identified during each cycle of cleavage.
- the first identimer may be operatively coupled to a material, such as a surface of a material, such as the surface of a bead.
- the first identimer may be attached to the bead by using common chemistry.
- the first identimer may be an oligomer and attached to the bead by its 5’ end (FIG. 6).
- the bead may be operatively coupled to a test agent.
- the bead may be a capping bead (FIG. 17).
- the first identimer may be operatively coupled to the surface of the material by a high- affinity binding protein or a three-armed linker.
- Providing a set of one or more identimers may include configuring each identimer in the set of identimers to concatenate to a preceding identimer and to facilitate concatenation of the identimers in a linear three-dimensional arrangement and thereby form and encode the molecular barcode sequentially.
- Configuring the set of identimers to bind specifically to a preceding identimer may include concatenating by ligation to, extension from, or synthesizing onto, the preceding identimer.
- identimers comprising double stranded DNA may have overhanging ends for ligation.
- the methods may include providing a set of one or more chemical building blocks, each chemical building block in the set of chemical building blocks being configured to concatenate to a preceding chemical building block; selecting a first chemical building block from the set of chemical building blocks and concatenating it to the first identimer.
- the building block is encoded by a visual barcode segment (FIG. 12 and FIG. 13).
- the methods may continue with selecting a chemical building block from the set of chemical building blocks configured to concatenate to the preceding chemical building block and concatenating it to the preceding chemical building block and thereby form a series of chemical building blocks. This last step may be repeated one to nth times.
- concatenating the series of chemical building blocks may include the use of non-nucleic acid compatible reactions.
- reactions may be used to build the molecular barcodes that are not typically available for barcodes that are primarily nucleic acid based.
- each bead can display a unique compound whose identity is encoded within the combinatorial visual barcode.
- the library of compounds may then be selected upon using a variety of known selection schemes. Enrichment of selected compounds over unproductive compounds may not be required following a selection.
- This barcode type may enable construction and encoding of highly diverse chemical libraries without using DNA. Library members may be distinguished in massively parallel fashion as described previously, using orthogonal cleavage of barcode segments for example.
- molecular barcode libraries may be formed on beads, and the beads can be immobilized on a surface and imaged before and after contact with experimental solutions. Immobilized beads may first be imaged using a standard fluorescence microscope configured with appropriate excitation wavelengths and emission filters to record the combination of visual detectable labels making up each barcode in a given field of view or region of interest.
- any magnified apparatus such as a fluorescence scanner with the appropriate configurations built in, an apparatus that can image fluorescent beads may work.
- Automated imagers may be useful in a variety of applications. For example, a Molecular Dynamics scanner or Nexcelom scanner for a cell array application using standard 4 fluorophores may be used.
- a filter wheel capable of distinguishing all 11 possible Q-dots.
- one color constant color may need to be on all built identimers.
- barcodes may be built using 10 Q- dots and the 11th Q-dot may be used as a standard color to be used as a constant on all identimers.
- This may be beneficial as this fluorophore may act as a quantitative standard against which the other fluorophores in any given chain can be compared to in each image. This may allow a user to see how other signals compare relative to that one constant signal and to measure fractions of signals with more accuracy.
- Such systems include a barcode encoding system configured to introduce into a milieu a set of one or more encoded molecular barcodes, in which at least one molecular barcode in a set of encoded molecular barcodes has mutual information with a material.
- the system further includes a cleavage system configured to selectively apply orthogonally reactive cleaving agents to the milieu to cleave the set of molecular barcodes at the cleavage sites of identimers, in which there are recognition moieties that are reactive to the cleaving agents to dissociate the detectable labels from the three-dimensional arrangement of molecular barcodes in the molecular barcode set according to the orthogonality of the identimers, thereby induce a detectable signal response from one to decode the set of molecular barcodes.
- the system further includes a detection system for detecting the detectable signal response from the set of molecular barcodes, and may further include an illumination system configured to selectively convey light to the set of molecular barcodes to induce the detectable signal response.
- Flow cells were constructed using pre-fabricated plastic covers with adhesive purchased from Microfluidic Chip Shop — cut to enable the generation of 16 individual lanes once mounted on a slide. Flow cell lanes each had approximately a 15-20mI volume. Poly-L-lysine coated glass slides were used for constructing flow cells; in this way, free primary amines on lysine side chains could be used for modification with NHS-LC-Biotin (at a concentration of 500pMin NHS Conjugation Buffer at room temperature for at least 1 hour). After biotinylation, flow cell lanes were flushed with 200plof 2X Hybridization buffer to prepare them for introduction of oligonucleotide-coated streptavidin beads for immobilization and downstream decoding experiments.
- Imaging of streptavidin beads was performed on a Leica Thunder system with a HCX PL FLUOTAR L 40x (NA-0.6) CORR PH2 objective with a Lumencor Spectra X Light Engine (395, 440, 470, 550, 640, 748) and a Leica DFC9000 GTC camera.
- the following filter sets were utilized (Quad Cube- Ex: 375-407, 462-496, 542-566, 622-654, DC: 415, 500, 572, 660, Em: 420-450, 506- 532, 578-610, 666-724 and Y7 cube Ex: 672-748, DC: 760, Em: 765-855).
- An additional DFT5 fast filter wheel was downstream of the cubes and included the following LP filters: 440, 510, 590, 700 and 100%.
- Images were loaded into a Volocity/E database and analyzed in the following manner. First, the individual images were made into a time series in the reverse order in which they were acquired. Images were then movement corrected and cropped to the area from the middle of the field to minimize uneven illumination. For each image same size ROI's were drawn around 10 beads and one background area, for calculating background subtractions. Values obtained for each label were subtracted from the next image in the cycle, to clearly identify the fluorophore labels released during each cleavage cycle.
- a streptavidin bead is shown in FIG. 19A, containing a first labeled dsDNA identimer segment (Id 1/Hind 111-AF750) attached through ligation to an unlabeled dsDNA that is attached to the bead through a biotin moiety.
- the dsDNA identimer segment also shown to the right of the bead, contains the AF750 label (AF-750) (downward diagonal stripes) attached to one of the oligos of the DNA duplex through an internal amino-modified base using NHS-AF750.
- the attached labeled identimer also contains an enzyme-accessible restriction endonuclease site (Hindi 11), positioned between the attached label and the bead.
- the Id1/Hindlll-AF750 oligo duplex is attached to the bead through a ligation reaction to an unlabeled oligo (IdO) that does not contain a recognizable restriction endonuclease site (not cleavable), but does contain a 5'-biotin modification for attachment to the streptavidin bead.
- IdO unlabeled oligo
- Ligation of the dsDNA identimer was performed in-solution, beads were washed, and a sample of the beads were immobilized on a biotin-modified surface in a flow cell for imaging.
- a streptavidin bead is shown in FIG. 19B, containing a second labeled dsDNA identimer segment (Id2/Spel-AF647).
- the second dsDNA identimer segment also shown to the right of the bead, contains the AF647 label (AF-647) (dotted) attached to one of the oligos of the DNA duplex through an internal amino- modified base using NHS-AF647.
- the second attached labeled identimer in the chain also contains an enzyme-accessible restriction endonuclease site (Spel), positioned between the attached label and the bead.
- Spel enzyme-accessible restriction endonuclease site
- the Id2/Spel-AF647 oligo duplex is attached to the bead through a ligation reaction to the first labeled dsDNA identimer segment (Id 1/Hind 111-AF750) duplex. Ligation of the dsDNA identimer was performed in-solution, beads were washed, and a sample of the beads were immobilized on a biotin-modified surface in a flow cell for imaging.
- a streptavidin bead is shown in FIG. 19C, containing a third labeled dsDNA identimer segment (ld3/Xhol-ATTO550).
- the third dsDNA identimer segment also shown to the right of the bead, contains the ATTO550 label (ATTO- 550) (upward diagonal stripes) attached to one of the oligos of the DNA duplex through an internal amino-modified base using NHS-ATTO550.
- the third attached labeled identimer in the chain also contains an enzyme-accessible restriction endonuclease site (Xhol), positioned between the attached label and the bead.
- Xhol enzyme-accessible restriction endonuclease site
- the ld3/Xhol-ATTO550 oligo duplex is attached to the bead through a ligation reaction to the second labeled dsDNA identimer segment (ld2/Xhol-AF647) duplex. Ligation of the dsDNA identimer was performed in-solution, beads were washed, and a sample of the beads were immobilized on a biotin-modified surface in a flow cell for imaging.
- Images confirming efficient ligation of the ld3/Xhol-ATTO550 dsDNA identimer to the growing identimer chain are represented in a bar graph plotted using data acquired from all 4 emission wavelengths imaged, 488nm (horizontal stripes), 550nm (upward diagonal stripes), 647nm (dotted), and 750nm (downward diagonal stripes), and from many beads bearing the same oligos (mean intensity values).
- the height of the bars in the graph correspond to the relative magnitude of observed emission in all four channels imaged, represented in fluorescence intensity units (FU).
- a streptavidin bead is shown in FIG. 19D, containing a fourth labeled dsDNA identimer segment (ld4/Notl-ATT0488).
- the fourth dsDNA identimer segment also shown to the right of the bead, contains the ATT0488 label (ATTO- 488) (horizontal stripes) attached to one of the oligos of the DNA duplex through an internal amino-modified base using NHS-ATT0488.
- the fourth attached labeled identimer in the chain also contains an enzyme-accessible restriction endonuclease site (Notl), positioned between the attached label and the bead.
- the ld4/Notl- ATT0488 oligo duplex is attached to the bead through a ligation reaction to the third labeled dsDNA identimer segment (ld3/Xhol-AF647) duplex. Ligation of the dsDNA identimer was performed in-solution, beads were washed, and a sample of the beads were immobilized on a biotin-modified surface in a flow cell for imaging.
- Beads were encoded with a single combination of four different fluorophore labels, where each unique label was attached to a different identimer segment within the dsDNA identimer chain.
- the dsDNA identimer chain was formed by four rounds of ligation as described previously, and as shown in FIG. 19.
- the chain sequence was: Id1/Hindlll-ATT0550-Id2/Spel-AF750- ld3/Xhol-AF647-ld4/Notl-ATT0488.
- beads bearing such a dsDNA identimer chain sequence would lose fluorescence signal in the 488nm channel following exposure to only the Notl enzyme in a first cycle of orthogonal cleavage, then in the 647nm channel following exposure to only the Xhol enzyme in a second cycle of orthogonal cleavage, and in the 750nm channel following exposure to only the Spel enzyme in a third cycle of orthogonal cleavage.
- beads were immobilized on the surface of a flow cell and subjected to three cycles of orthogonal cleavage sequencing (OCS) using these enzymes in the aforementioned cycling order, with images acquired before and after each cycle.
- OCS orthogonal cleavage sequencing
- the flow cell was returned to the 37 C heat block and exposed to a solution containing 1X CutSmart buffer and the Xhol enzyme at 5U/ul concentration for 15 minutes.
- the flow cell lane was then flushed with 1X CutSmart buffer, and the flow cell was returned to the microscope for imaging. Images of beads following exposure to the Xhol enzyme (FIG. 20C) showed a significant reduction in signal emission from the AF647 label.
- the mean intensity of 647nm signal emission from beads prior to Xhol exposure was around 8,000 FU, and following exposure to Xhol this signal dropped to around 1,000 FU. Importantly, signal intensities observed for the two un-cleaved fluorescent labels remained substantially unchanged.
- the flow cell was returned to the 37 C heat block and exposed to a solution containing 1X CutSmart buffer and the Spel enzyme at 5U/ul concentration for 15 minutes.
- the flow cell lane was then flushed with 1X CutSmart buffer, and the flow cell was returned to the microscope for imaging. Images of beads following exposure to the Spel enzyme showed a significant reduction in signal emission from the AF750 label (FIG. 20D).
- the mean intensity of 750nm signal emission from beads prior to Spel exposure was around 5,500 FU, and following exposure to Spel this signal dropped to around 400 FU.
- signal intensities observed for the final, un-cleaved fluorescent label (ATTO550) remained substantially unchanged.
- Decodinq differentially-labeled dsDNA identimer chains sinqle and mixed seqments
- Single-label chains were constructed whereby segments of Id1/Hindlll, Id2/Spel, ld3/Xhol and ld4/Notl were each differentially labeled with the four fluorophores listed previously (ATT0488, ATTO550, AF647, and AF750) to make a total of 16 different labeled dsDNA identimer segments.
- One differentially-labeled segment of each type was selected for encoding a pre-defined chain sequence.
- the single-label dsDNA identimer chain sequence used here was: Id1/Hindlll-ATT0550-Id2/Spel-AF750--Id3/Xhol-AF647- ld4/Notl-ATT0488).
- Streptavidin beads containing the pre-defined single-label chain sequence were constructed by the split-pooling ligation strategy described previously, and immobilized on a biotin-modified surface in one lane of a flow cell for imaging. Many beads in a single FOV were imaged in all four fluorescent channels prior to contact with solutions containing restriction enzymes. Beads were then subjected to three cycles of OCS (Notl in cycle 1 , Xhol in cycle 2, and Spel) as described previously, followed by imaging after each cycle.
- mixed-segment chains were constructed whereby two differentially- labeled dsDNA identimers of a common type were equally mixed at a 1 :1 molar ratio in 10 different pre-defined, visually discernable combinations (for example, Id2/Spel- ATT0488 was mixed with Id2/Spel-AF647) prior to ligation of the segment. 10 fluorophore combinations can be made visually distinguishable with the 4 different labels used; 488/488, 488/550, 488/647, 488/750, 550/550, 550-/647, 550/750, 647/647, 647-/750, and 750/750.
- Streptavidin beads containing the pre-defined mixed-label chain sequence were constructed by the split-pooling ligation strategy described previously, and immobilized on a biotin-modified surface in one lane of a flow cell for imaging. Many beads in a single FOV were imaged in all four fluorescent channels prior to contact with solutions containing restriction enzymes. Beads were then subjected to three cycles of OCS (Notl in cycle 1 , Xhol in cycle 2, and Spel) as described previously, followed by imaging after each cycle. For each acquisition in the imaging series, data for many beads in a single FOV were averaged, and the same FOV was imaged across all cycles of the OCS series.
- Id1 is cleavable by Hindi 11, and was used as a 1:1 mixture labeled with both AF750 and ATT0488, this Id segment was not cleaved here, as signal from only two remaining fluorophore labels were easily visualized (following Spel cleavage) without needing to cleave this segment.
- Intensity data obtained by imaging of all four labels before and after each cycle of cleavage was entered into a table, with cycles listed in chronological order. It was not initially clear from the raw data which fluorophores were cleaved during each OCS cycle. To more clearly define the order of labels attached to each Id segment in the chain, data were plotted as sequences obtained from bead "reads", as performed with single-label chain data.
- OCS results obtained for mixed-label dsDNA identimer chains is displayed in the bar graphs shown in FIG. 21 B, where the order of the cycles was reversed for analysis as performed for single label chain data.
- images taken following the last cycle of orthogonal cleavage in this experiment (cleavage by Spel) became the "first" image in the series
- images acquired following Xhol cleavage became the "second” image in the series
- images acquired following Notl cleavage became the "third” image in the series
- images acquired of un-cleaved beads became the "last” image in the series.
- the intensity data for all four fluorophores was subtracted from the "previous" cycle.
- FIG. 22A through 22F show six individual beads bearing the same single-label identimer chain sequence used to generate data from individual beads in a FOV (Id1 /Hind 111-ATTO550 — Id2/Spel-AF750— ld3/Xhol-AF647— ld4/Notl-ATT0488; FIG. 21A), tracked over 3 cycles of an OCS experiment.
- ATT0488 horizontal stripes was cleaved from the Id4 segment by Notl, followed by removal of AF647 (dark dotted) from Id3 by Xhol, followed by removal of AF750 (light dotted) from the Id2 segment by Spel, and the Id1 segment remained as the only labeled segment on the bead following Spel cleavage, which contained ATTO550 (downward diagonal stripes). Based on these data, the experimental and analytical configuration was ready for simultaneous tracking of many individual beads in massively parallel fashion across at least 3 OCS cycles.
- Labeled dsDNA identimer chain structures decoding a library of 256 labeled beads [00122] To demonstrate combinatorial encoding of a labeled dsDNA identimer chain-bearing bead library, four different distinguishable labels, ATT0488 (horizontal stripes), ATTO550 (downward diagonal stripes), AF647 (dark dotted), and AF750 (light dotted) were attached to four dsDNA identimer chain segments (Id 1 -4) to create 4 differentially-labeled options at each dsDNA identimer segment position.
- a library of 256 different sequences (combinations) of fluorophore-labeled dsDNA identimer chains were constructed over 4 rounds of splitting and pooling by ligation.
- beads containing the IdO ligation acceptor dsDNA duplex (this duplex does not contain a visual label, but does contain a 5'-biotin modification for attachment to the bead, as well as a 5' overhang compatible for ligation with Id1 segments) were split into 4 wells for attachment of a first labeled Id segment (either Id1/Hindlll- ATT0488, Id1/Hindl I I-ATTO550, Id1/Hindl I I-AF647, or Id1/Hindl I I-AF750) by ligation as described in the methods section.
- a first labeled Id segment either Id1/Hindlll- ATT0488, Id1/Hindl I I-ATTO550, Id1/Hindl I I-AF647,
- Ligations were quenched, beads were then washed and pooled to mix, and beads were then split into 4 wells for attachment of a second labeled Id segment (either Id2/Spel-ATT0488, ld2/Spel-ATTO550, Id2/Spel- AF647, or Id2/Spel-AF750) by ligation.
- Ligations were quenched, beads were then washed and pooled to mix, and beads were then split into 4 wells for attachment of a third labeled Id segment (either ld3/Xhol-ATT0488, ld3/Xhol-ATTO550, ld3/Xhol- AF647, or Id3/Xhol-AF750) by ligation.
- beads were imaged prior to restriction enzyme exposure, then imaged following exposure to each individual enzyme (each orthogonal cleavage event) as described previously.
- cleavage cycle 1 a solution containing 5U/ul of the Notl enzyme was flowed into the flow cell lane and incubated for 15 minutes on a heat block set to 37 C.
- the flow cell was returned to the microscope, and the same FOV used for imaging un-cleaved beads was imaged following exposure to the Notl enzyme. This enabled tracking of the same individual beads across the first two images in the series. This process was repeated with exposure to the Xhol enzyme in cleavage cycle 2, and to the Spel enzyme in cleavage cycle 3.
- Identimers were constructed by ligation of a pre-hybridized dsDNA duplex (containing a biotin modification on the 5'-end of one oligo strand of the duplex, and a detectable label (AF-647, horizontal stripes in a and b) attached to the other strand of the duplex), and hairpin oligo containing two AF-750 labels (light dotted in a and b).
- the resulting ligated hairpin contains three detectable labels, a 5' biotin modification for attachment to a streptavidin bead, and two fully formed and orthogonal restriction sites that flank the AF-647 label.
- the first of the two restriction sites (RE1 site, specific for cleavage by Spel) is positioned in between the AF-647 label attached within the stem of the hairpin, and the AF-750 labels attached to the loop of the hairpin.
- This identimer structure enables building of short identimer chains for decoration of beads with many different combinations of fluorophores attached through orthogonally-cleavable linkers.
- This hairpin structure ensures that restriction sites remain as double-stranded regions prior to cleavage, because here the complementary strand is fused through the loop of the hairpin structure.
- this oligo was attached to streptavidin beads, and beads were immobilized on a biotin-modified surface in a flow cell for imaging before and after exposure to one cycle of OCS (FIG. 24A). Values were obtained from many beads in a single field of view, and average intensity values were plotted as percentages.
- the beads Prior to enzyme exposure, the beads displayed clear, measurable signal from both the AF-647 and AF-750 labels (FIG. 24B before), and these values were therefore 100% of the signal expected from the images after cleavage if no cleavage took place.
- the dual AF-750-labeled hairpin "cap” was liberated from hairpin identimer, as observed in images taken following cleavage (FIG. 24B after).
- beads contained slightly greater than 100% of the AF-647 signal observed in images acquired before cleavage, but contained less than 20% of the AF-750 signal following exposure to Spel.
- a streptavidin bead is shown in FIG. 25A, containing a first labeled DNA hairpin that is attached through ligation to a first unlabeled DNA hairpin that is attached to the bead through a biotin moiety.
- the first labeled DNA hairpin generates a labeled identimer segment upon templated ligation with the first ligation acceptor DNA hairpin oligo attached to the bead, as a ssDNA ring containing an encoded restriction site within the dsDNA region.
- Ligation of each labeled DNA hairpin is designed to be specific to only one acceptor hairpin oligo, enabling orthogonal, step wise ligation of one labeled hairpin oligo at a time.
- 25B shows a second labeled DNA hairpin, containing the AF-647 label (dotted) that was orthogonally attached through ligation to the second of the three acceptor oligos on the bead.
- the newly formed, labeled DNA ring identimer also contains an enzyme- accessible restriction endonuclease site (Xhol), positioned between the attached labels and the bead.
- Xhol enzyme- accessible restriction endonuclease site
- FIG. 25C shows a third labeled DNA hairpin, containing the ATTO-488 label (horizontal stripes) that was orthogonally attached through ligation to the last (third) of the three acceptor oligos on the bead.
- the newly formed, labeled DNA ring identimer also contains an enzyme-accessible restriction endonuclease site (Notl), positioned between the attached labels and the bead.
- Ligation of the ATTO-488 label to the bead was confirmed by fluorescence imaging in three fluorescence emission channels, 488nm, 550nm, and 647nm, where strong emission signal was now observed for all three labels (ATTO-550, AF-647, and ATTO-488). Images confirming efficient ligation are represented in a bar graph plotted using data acquired from all 3 emission wavelengths imaged; 488nm (horizontal stripes), 550nm (downward diagonal stripes), 647nm (dotted), and from many beads bearing the same oligos (mean intensity values). The height of the bars in the graph correspond to the relative magnitude of observed emission in all three channels imaged, represented in fluorescence intensity units (FU).
- FU fluorescence intensity units
- a streptavidin bead is shown in FIG. 27A, containing a first labeled ssDNA that is hybridized to a first unlabeled ssDNA that is attached to the bead through a biotin moiety.
- the first labeled DNA generates a labeled identimer segment upon templated hybridization with the first of three orthogonal complement strands attached to the bead, as a dsDNA duplex containing an encoded restriction site within the dsDNA region.
- One half of the required restriction enzyme recognition sequence is encoded in each complementary strand.
- Hybridization of each labeled ssDNA is designed to be specific to only one complementary oligo on the bead, enabling orthogonal, step-wise hybridization of one labeled ssDNA oligo at a time. This allows for a splitting and pooling approach to be taken when constructing libraries of beads containing identimers composed of labeled hybridized dsDNA.
- three labeled ssDNA strands were used in three rounds of encoding, to mimic a splitting and pooling workflow. The first ssDNA oligo, containing the ATTO-488 label (horizontal stripes) was hybridized to one of the three complementary oligos on the bead.
- the newly formed, labeled dsDNA hybridized identimer also contains an enzyme- accessible restriction endonuclease site (Notl), positioned between the attached label and the bead.
- Ligation of the ATTO-488 label to the bead was confirmed by fluorescence imaging in three fluorescence emission channels, 488nm, 550nm, and 647nm, where strong emission signal was observed in only the 488nm wavelength. Images confirming efficient hybridization are represented in a bar graph plotted using data acquired from all 3 emission wavelengths imaged; 488nm (horizontal stripes), 550nm (downward diagonal stripes), 647nm (dotted), and from many beads bearing the same oligos (mean intensity values).
- FIG. 27B shows a second labeled ssDNA oligo, containing the AF-647 label (dotted) that was orthogonally hybridized to the second of the three complementary oligos on the bead.
- the newly formed, labeled dsDNA hybridized identimer also contains an enzyme-accessible restriction endonuclease site (Spel), positioned between the attached label and the bead.
- Spel enzyme-accessible restriction endonuclease site
- Hybridization of the AF-647 label to the bead was confirmed by fluorescence imaging in three fluorescence emission channels, 488nm, 550nm, and 647nm, where strong emission signal was now observed for both the ATTO-488 label and for the AF-647 label.
- Images confirming efficient hybridization are represented in a bar graph plotted using data acquired from all 3 emission wavelengths imaged; 488nm (horizontal stripes), 550nm (downward diagonal stripes), 647nm (dotted), and from many beads bearing the same oligos (mean intensity values).
- the height of the bars in the graph correspond to the relative magnitude of observed emission in all three channels imaged, represented in fluorescence intensity units (FU).
- 27C shows a third labeled ssDNA oligo, containing the ATTO-550 label (downward diagonal stripes) that was orthogonally hybridized to the last (third) of the three complementary oligos on the bead.
- the newly formed, dsDNA identimer also contains an enzyme-accessible restriction endonuclease site (Xhol), positioned between the attached labels and the bead. Ligation of the ATTO-550 label to the bead was confirmed by fluorescence imaging in three fluorescence emission channels, 488nm, 550nm, and 647nm, where strong emission signal was now observed for all three labels (ATTO-550, AF-647, and ATTO-488).
- the height of the bars in the graph correspond to the relative magnitude of observed emission in all three channels imaged, represented in fluorescence intensity units (FU).
- FU fluorescence intensity units
- the beads were then exposed to a second cycle of orthogonal cleavage by the Spel enzyme, and imaged again.
- efficient cleavage of the AF-647 label from the bead was confirmed by fluorescence imaging in three fluorescence emission channels, 488nm, 550nm, and 647nm, where a significant loss of signal was observed for the AF-647 label relative to beads before Spel cleavage, and relative to the remaining signal on the bead (ATTO-550). Images confirming efficient Spel cleavage are represented in a bar graph (FIG.
- Example 1 illustrates the use of Fluorophore/PROTEASEsite/Non- DNA(FPND) identimers in molecular barcodes for in situ labeling of a material designated for visual decoding (FIGS. 1-3, 15, 16).
- FPDN identimers comprise fluorophore-based detectable labels operatively connected to polypeptide- based scaffold portions wherein the cleavage site is an orthogonally reactive protease cleavage site and the recognition moiety is a protease cleavage site sequence or a protease recognition sequence of the orthogonally reactive protease.
- FPND identimer-based molecular barcodes a four-cycle orthogonal protease cleavage experiment, may be performed (Experiment 1).
- FPND identimer classes may be used concurrently to encode and form molecular barcodes by split-pool ligation to create a molecular barcode combinatorial library without reliance on DNA sequencing-based decoding.
- Each molecular barcode in Experiment 1 comprises a set of five identimer tokens (Idi-s) derived from one or more FPDN identimer classes.
- Idi-s identimer tokens
- Each Idi token is derived from a non-cleavable FPDN identimer class and each of the Id2, Id3, Id4, and Ids tokens are derived from cleavable FPDN identimer classes.
- Each cleavable FPDN identimer is configured to react orthogonally to at least one protease selected from a Tobacco etch virus protease (TEVp), a tobacco vein mottling virus protease (TVMVp), a turnip mosaic virus protease (TUMVp), and a sunflower mild mosaic virus protease (SuMMVp).
- TEVp is known in the art to have no known off-target substrates in the human proteome and may be used as an orthogonally reactive cleaving agent.
- TEVp, TVMPpl, TUMVp2, and SuMMVp have been implemented in orthogonal protease regimes.
- TEVp should not cleave a molecular barcode at cleavage sites recognized by TVMVp, TUMVp or SuMMV with high efficiency.
- each protease should be capable of cleaving a molecular barcode at the cleavage sites of identimers having recognition moieties comprising the respective protease’s cleavage site sequence in the presence of substrates recognized by the other proteases.
- Factor Xa could also be used. Factor Xa cleaves after the arginine residue in its preferred cleavage site lle-(Glu or Asp)-Gly-Arg. It will not cleave a site followed by a proline or arginine.
- the FPND identimer scaffold portions comprise a peptide fragment covalently linked through its N- and C-terminal ends to linker reagents containing chemically reactive groups and the detectable labels comprise one or more Q-dots.
- Q-dot fluorophores are added, respectively, to Idi, Id2, Id3, Id4, and Ids to the modified azide-bearing amino acid in the peptide fragments containing the proteolytic sites.
- the combination of fluorophores displayed on any formed FPDN identimer barcodes in the library may or may not have contiguous emission spectra, as the three- dimensional arrangement of the FPND Identimer tokens within the molecular barcodes making up the library will vary.
- each Idi, Id3, and Ids identimer class comprises a TCO group at their N-terminal ends
- each Id2 and Id4 identimer class comprises a methyltetrazine group at their C-terminal ends
- C-termini of Idi and Id3 identimer classes comprise a DBCO group and the Id2 and Id4 identimer classes comprise an azide group at their N- terminal ends.
- the first identimer attachment is a 45-minute reaction, with incubation at 37°C. Beads are maintained at 1 mg/ml during all reactions, and identimer units are used at a lOpMconcentration for all subsequent additions. Following three wash steps in 1X PBS, the beads are subjected to free TCO in higher concentration (100uM), to ensure that all methyltetrazine sites are saturated. Following three wash steps in 1X PBS, the beads are ready for addition of the second identimer unit (Id2).
- reaction conditions are repeated for attachment of Id2, as well as for each subsequent identimer addition (i.e., Id3, Id4, and Ids) to a FPND molecular barcode. Due to the reaction kinetics of the listed click reactions, following each identimer addition, the reaction is “chased” by addition of the appropriate click reactive group to ensure all reactive sites used for identimer attachment are saturated prior to addition of the next identimer.
- the recognition portions of the ldi-s identimer classes comprise, respectively, the following peptide sequences: ldi is not cleavable and contains no cleavage site; ld 2 comprises KGGSGGGSACVYHQSGGAzGGSC (SEQ ID NO: 1) containing the TUMV cleavage site; Id3 comprises
- Id4 comprises KGGSGGGSETVRFQGGGAzGGSC (SEQ ID NO: 3) containing the TVMV cleavage site; and, Ids comprises
- KGGSGGGSENLYFQSGGAzGGSC (SEQ ID NO: 4) containing the TEV cleavage site.
- FPND Identimers may be generated by first modifying the peptide (through an azide bearing residue installed internally near the C-terminus of all peptides) using DBCO-modified fluorophores for visualization. To accomplish this, 40pMsamples of each peptide can be mixed with a designated DBCO-modified fluorophore at 200uM, and allowed to react in 1X PBS at 37°C for at least 3 hours; this reaction can be left overnight at room temperature as well.
- Fluorophore- modified peptides may then be purified using HPLC, dialysis or desalting columns to remove excess (and any unreacted) DBCO-fluorophore reagent, and to exchange the peptides into NHS-compatible reaction buffer (100mM Sodium Phosphate buffer, pH 8.5, supplemented with 80mM KCI). Labeled peptides may also be precipitated using a variety of methods known in the art, and resuspended in NHS-compatible reaction buffer.
- Labeled peptides may then be brought to a 20pMconcentration in NHS-compatible reaction buffer, and mixed with 100pMof their designated NHS reagent (NHS-PEG4-TCO for Idi, Id3, and Ids peptide fragments; NHS-PEG4-Azide for Id2 and Id4) to enable conjugation overnight at room temperature.
- NHS reagent NHS-PEG4-TCO for Idi, Id3, and Ids peptide fragments
- NHS-PEG4-Azide for Id2 and Id4
- Labeled and singly modified peptides may also be precipitated using a variety of methods known in the art, and resuspended in 1X PBS buffer. Labeled and singly modified peptides may then be brought to a 20pMconcentration in 1X PBS and mixed with 100pMof their designated maleimide reagent (Maleimide-PEG4-DBCO for Idi, Id3, and Ids peptide fragments; Maleimide-PEG4-Methyltetrazine for Id2 and Id4) to enable conjugation overnight at room temperature.
- maleimide reagent Maleimide-PEG4-DBCO for Idi, Id3, and Ids peptide fragments
- Maleimide-PEG4-Methyltetrazine for Id2 and Id4
- Labeled and dual-modified peptides may be purified using HPLC, dialysis or desalting columns to remove excess (and any unreacted) Maleimide- containing reagent, and to exchange the peptides into 1X PBS. Labeled and dual- modified peptides may also be precipitated using a variety of methods known in the art, and resuspended in 1X PBS buffer. FPND Identimers may then be used right away, stored for several days at 4°C or at -20°C for longer term storage.
- Validation of FPND Identimer-based molecular barcode performance may be accomplished using a four-cycle deconvolution/decoding by cleavage experiment to confirm the concatenation of the identimers to form and encode the FPND Identimer-based molecular barcodes and their detectable signal response upon decoding by orthogonally reactive cleavage.
- a series of imaging and cleaving steps is performed to decode FPND identimer-based molecular barcodes present on members of the combinatorial library (i.e., a molecular barcode fixed to a bead). This process can be repeated in cycles, whereby the next orthogonal protease can be introduced during each cleavage cycle, followed by an imaging step.
- TEV protease is introduced in the first cleaving step to decrease observed signal intensity of fluorophore attached only to TEVp-reactive identimers (i.e., Ids) followed by the first imaging step to visually identify the detectable labels cleaved from Ids tokens incorporated in the molecular barcodes comprising the combinatorial library.
- the cleavage cycles introduce the listed proteases in the following order: TEVp during cycle 1; TVMVp during cycle 2; SuMMVp during cycle 3; and TUMVp during cycle 4.
- Idi is the first identimer added to the chain, and therefore is directly attached to the solid support (bead); here Idi doesn’t include a cleavage site and its recognition portion can be an [uncleavable linker].
- Idi contains the N-terminal TCO modification which can be used for attachment to the solid support (bead); the surface of the solid support is be modified to contain the compatible methyltetrazine moiety for attachment of Idi.
- the recognition portions and cleavage sites of Id2, Id3, Id4, and Ids are the recognition sequences and cleavage sites of, respectively, TUMVp, SuMMVp, TVMVp, and TEV protease.
- ldi-s identimers are formed on beads as described herein.
- the beads are then be exposed in a first cleavage step to decode the molecular barcodes to a solution containing 2 units of TEVp for a 30-60 minute incubation at 30°C in protease reaction buffer (50mM Tris-HCI pH 7.5, 0.5mM EDTA, 1mM DTT).
- protease reaction buffer 50mM Tris-HCI pH 7.5, 0.5mM EDTA, 1mM DTT.
- the detectable signal response comprises a reduction of the intensity in the emission wavelength of the detectable labels of FPND identimer tokens dissociated from the molecular barcode during the first cleavage step.
- the beads are exposed in a second cleavage step to a solution containing 2 units of TVMVp for a 30-60 minute incubation at 30°C in protease reaction buffer.
- the beads are imaged in a second imaging step and fluorophores responsive to TVMVp are detected.
- the detectable signal response comprises a reduction of the intensity in the emission wavelength of the detectable labels of FPND identimer tokens dissociated from the molecular barcode during the second cleavage step.
- the beads are exposed in a third cleavage step to a solution containing 2 units of SuMMVp for a 30-60 minute incubation at 30°C in protease reaction buffer.
- the beads are imaged in a third imaging step and fluorophores responsive to SuMMVp are detected.
- the detectable signal response comprises a reduction of the intensity in the emission wavelength of the detectable labels of FPND identimer tokens dissociated from the molecular barcode during the third cleavage step.
- the beads are exposed to a solution containing 2 units of TUMVp for a 30-60 minute incubation at 30°C in protease reaction buffer.
- the beads are imaged in a fourth imaging step and fluorophores responsive to TUMVp are detected.
- the detectable signal response comprises a reduction of the intensity in the emission wavelength of the detectable labels of FPND identimer tokens dissociated from the molecular barcode during the cleavage step.
- the uncleavable label will remain as the strongest signal emitted from the bead, allowing for its identification and potential use for quantification.
- the sequence of fluorophores for each barcode that was formed during split-pooling and concatenation is determined during visual deconvolution and can be correlated with the bead it was attached to.
- Example 2 illustrates the use of a Fluorophore/NUCLEASEsite (FNS) Identimer-based molecular barcode for in situ labeling of material designed for visual decoding (FIGS. 5, 7, 9).
- FNS Fluorophore/NUCLEASEsite
- a set of one or more FNS Identimer classes may be used to encode and form a visual molecular barcode by split-pool ligation to create a combinatorial library of barcodes that do not require DNA sequencing-based decoding.
- a single barcode that may be generated in the same experiment, and in the presence of many different barcode chains being constructed simultaneously.
- Many endonucleases are known in the art to possess orthogonal reactivity with respect to one another. For example, Notl, Xhol, Spel, and Hindlll possess specificity for dsDNA sequences containing GCGGCCGC, CTCGAG, ACTAGT, and AAGCTT, respectively.
- identimer barcode library contains five classes (ldi-5) consisting of four cleavable Q- Fluorophore/NUCLEASEsite Identimer tokens (Id2, Id3, Id4, and Ids) as well as one uncleavable identimer token (Idi).
- Experiment 2 is a four-cycle orthogonal nuclease cleavage experiment, confirmed by visual imaging of reactive fluorophores performed to confirm the encoding and decoding of FNS identimer-based molecular barcodes.
- Experiment 2 comprises a first imaging step to record detectable signal conveyed by the detectable labels making up FNS Identimer-based molecular barcodes present on any member of the library (bead).
- Notl is introduced in a first cleavage step to decrease observed signal intensity of the fluorophore attached only to the Notl nuclease-reactive identimer, allowing for the visual identification of the label attached to Ids.
- each cycle of experiment 2 comprises a cleavage step and an imaging step.
- Nuclease cleavage agents are applied to the molecular barcode in the following order: Notl during cycle 1; Xhol during cycle 2; Spel during cycle 3; and Hindi 11 during cycle 4. Imaging after each cycle will enable deconvolution of the order that each visual label making up the FNS Identimer-based molecular barcode was added. Limit of detection, specificity, and precision experiments (in triplicate) of nucleases acting as cleaving agents may be performed to establish the reproducibility of nuclease orthogonal reactivity to the FNS Identimer-based molecular barcode.
- a FNS Identimer scaffold portion includes a ssDNA having a 5’ and 3’ end that is fluorescently labeled with a detectable label through a modified internal base, and is pre-hybridized with a complementary ssDNA to form a dsDNA duplex.
- the formed dsDNA duplex may encode one or more copies of a single endonuclease restriction sequence (or site type).
- one strand of the dsDNA duplex containing the nuclease recognition sequence(s) comprises an internal-amino modification for attachment of a pre-designated fluorophore.
- the scaffold portion of each FNS identimer class member comprises the same restriction endonuclease site type, but receives a different and distinguishing fluorophore.
- the amino-reactive heterobifunctional crosslinking reagent, NHS-PEG4-TCO may then be used to chemically modify the installed primary amine on the dsDNA to contain the click-compatible TCO group.
- the Methyltetrazine-modified fluorophores may then be conjugated with identimer dsDNA duplexes.
- the scaffold portions of FNS identimers comprise unpaired 3’-ends that are compatible for ligation with other FNS Identimer classes to facilitate building a combinatorial chain of identimers by split-pooling.
- Each round of addition to the growing FNS Identimer molecular barcode will add a unique identimer token to the forming molecular barcode.
- Idi is the first identimer added, and therefore is directly attached to the solid support (i.e., bead). Idi doesn’t include a cleavage site and its recognition portion is an uncleavable linker. Idi contains a 5’-amino modified base for attachment to the solid support.
- the recognition portions and cleavage sites of Id 2 , Id3, Id4, and Ids are the recognition sequences and cleavage sites of, respectively, Hindlll, Spel, Xhol, and Notl endonucleases.
- the detectable labels of the ldi-s classes comprise one or more quantum dots.
- Each FNS identimer class comprises quantum dots having different emission wavelengths that are covalently attached to the FNS identimers in different reaction mixtures (or wells of a plate) for each class-specific dsDNA bearing a specific nuclease site type.
- quantum- dot detectable labels are added, respectively, to Idi, Id2, Id3, Id4, and Ids through an internally-modified base located within one or both strands of the dsDNA sequence.
- the combination of fluorophores displayed on any formed and encoded FNS Identimer-based molecular barcode in the library may or may not have contiguous emission spectra, as the three dimensional configuration of identimer tokens comprising the molecular barcodes making up the library is determined by the cycling of orthogonal nucleases as described herein.
- the FNS Identimer recognition moieties comprise complementary ssDNA strands forming dsDNA duplexes.
- the FNS identimer cleavage sites comprise specific restriction endonuclease sites included in the dsDNA duplexes and are formed by hybridization, whereby the two strands share significant complementarity, but remain unpaired at their 3’-ends.
- the unpaired 3’- ends of each FNS Identimer class are designed to be complementary with acceptor 3’-ends on adjacent class members.
- FNS Identimers may be sequentially added in a pre-defined order to form barcodes over sequential rounds of splitting and pooling. Each round of addition will enable concatenation of identimer dsDNA through enzymatic ligation of the 5’- and 3’-ends of the ssDNA fragments of hybridized identimers.
- each identimer may be performed in 1X ligase buffer (50mM Tris-HCI pH 7.5, 10mM MgCI2, 1mM ATP, 10mM DTT) in the presence of 500-1 , 000U of T4 DNA ligase and 20-80U of Polynucleotide Kinase over a 25-minute incubation at 37°C.
- 1X ligase buffer 50mM Tris-HCI pH 7.5, 10mM MgCI2, 1mM ATP, 10mM DTT
- 500-1 , 000U of T4 DNA ligase and 20-80U of Polynucleotide Kinase over a 25-minute incubation at 37°C.
- the illustrated Fluorophore/NUCLEASEsite Identimer barcode contains 5 class members joined by enzymatic ligation.
- Idi is configured to be uncleavable and the recognition moieties of I D2-5 comprise, respectively: Id2 comprises ATTT ATTTAAG CTT ATT A/i Am M C6T/T ATTT (SEQ ID NO: 5) containing the Hindlll cleavage site; Id3 comprises ATTT ATTT AACTAGTATTA/i Am MC6T/T ATTT (SEQ ID NO: 6) containing the Spel cleavage site; Id4 comprises
- Ids comprises ATTT ATTTAGCGGCCGCATTA/iAmMC6T/T ATTT (SEQ ID NO: 8) containing the Notl cleavage site.
- the uncleavable dsDNA of Idi may consist of any DNA sequence that is not recognized by any of the nucleases used for recognition of other class members making up formed FSN Identimer-based molecular barcodes.
- modified nucleotides e.g., iAmMC6T
- iAmMC6T Int Amino Modifier C6 dT, available commercially from Integrated DNA Technologies, Inc. (1710 Commercial Park, Coralville, Iowa 52241, USA).
- modified nucleotides e.g., iAmMC6T
- the modified nucleotide is configured to be at least 6bp away from the recognition moiety.
- modified nucleotides placed internally within a looped dsDNA are placed far enough away from each other to avoid FRET-based quenching. In some embodiments, modified nucleotides placed internally within a looped dsDNA are placed far enough away from each other to avoid enzyme recognition sites and facilitate enzyme accessibility.
- each FNS Identimer class is generated by first annealing a complementary strand that together with the class-specific ssDNA strands listed above (Id2-s) to generate the viable dsDNA restriction sites. This is done in a 1:1 molar ratio in NHS-compatible annealing buffer (100mM Sodium Phosphate buffer, pH 8.5, supplemented with 80mM KCI) at 95°C for 5 minutes on a heat block, the heat block is then removed and allowed to cool to room temperature on the benchtop over what amounts to be about a 2-hour time period.
- NHS-compatible annealing buffer 100mM Sodium Phosphate buffer, pH 8.5, supplemented with 80mM KCI
- the identimer class-specific dsDNA bearing the single endonuclease site type may be modified (through the internal-amino modification shown in the sequences above) using NHS-PEG4-transcyclooctene, such that it contains a compatible chemistry for downstream conjugation.
- 100pMof the labeled dsDNA is mixed with 500pMNHS-PEG 4 - transcyclooctene (TCO) in NHS-compatible annealing buffer (100mM Sodium Phosphate buffer, pH 8.5, supplemented with 80mM KCI), and allowed to react at room temperature overnight.
- TCO-modified dsDNA duplexes may then be purified by desalting to remove excess (and any unreacted) NHS-PEG4-TCO reagent.
- Many methyltetrazine-modified fluorophores are commercially available and can be used for labeling of each class-specific identimer in a designated way. To do this, each class specific identimer could be split into different wells, and lOOpMTCO-modified dsDNA of each class could be conjugated to the different methyltetrazine-modified fluorophores used at 200pMin NHS-compatible annealing buffer. This reaction could proceed for 30 minutes at room temperature.
- the conjugated FNS Identimer maythen be buffer exchanged into storage buffer (50mM Tris-HCI pH 7.5 supplemented with 100mM NaCI), and could be used right away or stored for several days at 4°C or at -20°C for longer term storage.
- storage buffer 50mM Tris-HCI pH 7.5 supplemented with 100mM NaCI
- Identification of FNS identimer-bases molecular barcodes may be accomplished using a four-cycle deconvolution/decoding by cleavage experiment to confirm the detectable signal response of Idi-sand confirm the respective sequential concatenation and encoding of the FNS identimer-based molecular barcode.
- molecular barcode libraries may be formed on beads, and the beads may be immobilized on a surface and imaged before and after contact with experimental solutions. Immobilized beads may first be imaged using a standard fluorescence microscope configured with appropriate excitation wavelengths and emission filters to record the combination of detectable signal responses conveyed by the detectable labels making up each molecular barcode in a given field of view or region of interest. All “Hi-Fidelity” versions of the endonucleases listed can function with 100% efficiency in the same buffer (1X CUTSMART buffer available from New England Biolabs, Inc. at 428 Newburyport Turnpike, Rowley, MA 01969, U.S.A.).
- the beads are exposed in a first cleavage step in a solution containing 5 units of Notl-HF endonuclease for a 5-10 minute incubation at 37°C in CUTSMART buffer (50mM Potassium Acetate, 20mM Tris-acetate, 10mM Magnesium Acetate, 100ug/ml BSA; buffer pH is 7.9 at 25°C).
- CUTSMART buffer 50mM Potassium Acetate, 20mM Tris-acetate, 10mM Magnesium Acetate, 100ug/ml BSA; buffer pH is 7.9 at 25°C.
- the beads may be imaged in a second imaging step to detect detectable signal response conveyed by the fluorophore detectable labels responsive to Notl endonuclease activity.
- the detectable signal response comprises a reduction of the intensity in the emission wavelength of the detectable labels of FNS identimer tokens dissociated from the molecular barcode during the first cleavage step.
- a washing step may be performed prior to exposing the beads to the second cleavage step.
- the second cleavage step comprises exposing the beads to a solution containing 5 units of Xhol-HF for a 5-10 minute incubation at 37°C in CUTSMART buffer. Following incubation, the beads are imaged in a second imaging step and the detectable signal response conveyed by fluorophore detectable labels responsive to Xhol endonuclease activity is detected.
- the detectable signal response comprises a reduction of the intensity in the emission wavelength of the detectable labels of FNS identimer tokens dissociated from the molecular barcode during the second cleavage step.
- a washing step may be performed prior to exposing the beads to the third cleavage step.
- the third cleavage step comprises exposing the beads to a solution containing 5 units of Spel-HF for a 5-10 minute incubation at 37°C in OUTSMART buffer. Following incubation, the beads are imaged in a second imaging step and the detectable signal response conveyed by fluorophore detectable labels responsive to Spel endonuclease activity is detected.
- the detectable signal response comprises a reduction of the intensity in the emission wavelength of the detectable labels of FNS identimer tokens dissociated from the molecular barcode during the third cleavage step.
- a washing step may be performed prior to exposing the beads to the fourth cleavage step.
- the fourth cleavage step comprises exposing the beads a solution containing 5 units of Hindlll for a 5-10 minute incubation at 37°C in OUTSMART buffer. Following incubation, the beads are imaged in a fourth imaging step and the detectable signal response conveyed by fluorophore detectable labels responsive to Hindlll endonuclease activity is detected.
- the detectable signal response comprises a reduction of signal intensity for the emission wavelengths associated with wavelength of the detectable labels of Hindlll FNS identimer tokens dissociated from the molecular barcode during the fourth cleavage step.
- more than one restriction endonuclease may be introduced during the cleavage step of each cycle, but the endonucleases should be introduced under conditions whereby different endonucleases possess significantly different kinetic cleavage rates, and multiple images should be acquired during cleavage.
- the uncleavable detectable label After removal of the final cleavable identimer label, the uncleavable detectable label will convey the strongest detectable signal emitted from the bead, allowing for its identification.
- the sequence of fluorophores for each FSN Identimer barcode that was determined during visual deconvolution can be ascribed to each bead attached to it in this way.
- Example 3 Fluorophore/PROTEASEsite/dsDNA (FPD) Identimer-Based Molecular Barcodes
- FPD Fluorophore/PROTEASEsite/dsDNA
- Example 3 illustrates the use of a Fluorophore/PROTEASEsite/dsDNA (FPD) identimer-based molecular barcode for in situ labeling of material designed for decoding both visually as well as in a NGS reaction milieu (FIGS. 6, 8, 10, 11 ).
- a set of one or more of FPD identimer classes may be used to encode and form a NGS- compatible visual molecular barcode by split-pool ligation to create a combinatorial library of barcodes that do not require DNA sequencing-based decoding.
- each molecular barcode in the combinatorial library contains five FPD identimer classes (Idi-s) comprising four cleavable FPD identimer tokens (Id2, Id3, Id4, and Ids) as well as one uncleavable FPD identimer token (Idi).
- Experiment 3 is a four-cycle orthogonal protease cleavage experiment, confirmed by visual imaging of reactive fluorophores, performed to validate the encoding and decoding of FPD identimer-based molecular barcodes.
- Experiment 3 comprises a first imaging step to record detectable signal conveyed by the detectable labels of FNS Identimer-based molecular barcodes present on any member of the library (bead).
- TEV protease are introduced in the first cycle to decrease the signal intensity of the fluorophore detectable labels of TEV protease- reactive FPD identimer tokens incorporated into a molecular barcode, allowing for the visual identification of the label cleaved from Ids. This process is repeated in cycles, whereby the next orthogonal protease is introduced during each cleavage cycle, followed by an imaging step.
- the protease cleaving agents introducing curing the cleavage steps in the following order: TEVp during cycle 1; TVMVp during cycle 2; SuMMVp during cycle 3; and TUMVp during cycle 4. Imaging after each cycle will enable deconvolution of the three-dimensional arrangement of the detectable labels of FPD identimer tokens incorporated into FPD identimer-based molecular barcodes. Limit of detection, specificity, and precision experiments (in triplicate) of proteases acting as cleaving agents may be performed to establish the reproducibility of protease orthogonal reactivity to the Fluorophore/PROTEASEsite/dsDNA Identimer- based molecular barcode.
- a FPD identimer scaffold portion comprises a polypeptide fragment covalently linked to a single-stranded DNA (ssDNA) fragment having a 3’ and 5’ end through an internally amino-modified base in the ssDNA, using an amino-reactive heterobifunctional crosslinking reagent (IMHS-PEG4- methyltetrazine).
- ssDNA single-stranded DNA
- IMHS-PEG4- methyltetrazine amino-reactive heterobifunctional crosslinking reagent
- the polypeptide fragment containing the peptide recognition sequence may be modified to contain an N-terminal lysine residue, and can be subsequently modified with a chemical group compatible for attachment to the modified ssDNA using a different amino-reactive heterobifunctional crosslinking reagent (NHS-PEG4-transcyclooctene).
- a specific and complementary ssDNA oligonucleotide may be hybridized to the ssDNA of the scaffold portion to generate a dsDNA species with unpaired 3’-ends that will be compatible for ligation with other FPD identimer classes to build a combinatorial chain of identimers by split-pooling.
- Each round of addition to the FPD identimer-based molecular barcodes will add a unique identimer token to the forming molecular barcodes.
- Attached to the opposing end of the peptide fragment (C-terminus) is a detectable label (fluorophore) as described below.
- the emission wavelength (observed color) of the detectable label attached to each FPD identimer is correlated with the sequence of the attached dsDNA, thereby coupling information that may be obtained from the visual barcode with information that can be obtained from the formed NGS barcode after DNA sequencing.
- Idi is the first identimer added to the chain, and therefore is directly attached to the solid support (bead); here Idi doesn’t include a cleavage site and its recognition portion may be an uncleavable linker.
- Idi comprises a 5’-amino modified base for attachment to the solid support, and may encode a 5’- constant region for retrieval and downstream NGS library preparation.
- Ids comprises a 3’-capture sequence for the capture of macromolecules from cell lysate or a biological sample.
- the recognition portions and cleavage sites of Id2, Id3, Id4, and Ids are the recognition sequences and cleavage sites of, respectively, TUMVp,
- quantum dot fluorophore detectable labels comprising multiple distinguishable detectable labels are selected for combinatorial labeling of the FPD identimers in Example 3.
- Detectable labels conveying different emission wavelengths of detectable signal may be covalently attached to each FPD identimer class, in different reaction mixtures (or wells of a plate) for each class- specific peptide.
- detectable labels may be added respectively, to Idi, Id2, Id3, Id4, and Ids at the opposing end of their respective scaffold portions (C- terminal end) from that which the dsDNA portion of the identimer is conjugated.
- the combination of fluorophores displayed on any formed FPD identimer-based barcode in the combinatorial library may or may not convey detectable signal having a contiguous emission spectra, as the three- dimensional arrangement of the detectable labels of FPD identimer tokens incorporated into molecular barcodes making up the combinatorial library is determined by cycling of orthogonal proteases as described above.
- each ssDNA fragment is designed to record the emission wavelength of the attached detectable label.
- the complementary ssDNA strands making up each FPD Identimer scaffold portion are formed by hybridization, whereby the two strands share significant complementarity, but remain unpaired at their 3’- ends.
- the unpaired 3’-ends of each FPD identimer class are designed to be complementary with acceptor 3’-ends on adjacent FPD identimer tokens incorporated into a molecular barcode.
- FPD identimers can be sequentially added in a pre-defined order to form molecular barcodes over sequential rounds of splitting and pooling.
- Each round of FPD identimer addition will enable concatenation of FPD identimer dsDNA through enzymatic ligation of the 5’- and 3’-ends of the ssDNA fragments of hybridized identimers.
- the ligation of each identimer may be performed in 1X ligase buffer (50mM Tris-HCI pH 7.5, 10mM MgCI2, 1mM ATP, 10mM DTT) in the presence of 500-1, 000U of T4 DNA ligase and 20-80U of Polynucleotide Kinase over a 25-minute incubation at 37°C.
- the dsDNA of the final “capping” FPD identimer token may be designed to encode both the wavelength of the detectable label of a FPD identimer as well as a 3’ capture region designed to capture macromolecules from cell lysate.
- a unique molecular identifier UMI may also be included as a contiguous stretch of randomized or semi-randomized bases within a capping FPD identimer or may be added to the NGS barcode with each identimer class in smaller stretches of randomized or semi-randomized bases.
- Identimer class-specific oligopeptides designed to comprise a single protease recognition site may be purchased from a commercial vendor.
- Idi is configured to not be cleavable and the recognition portions of Id2-s comprise peptide sequences including, respectively: Id2 comprising KGGSGGGSACVYHQSGGSGGSC (SEQ ID NO: 9) containing the TUMV cleavage site; Ids comprising KGGSGGGSEEIHLQSGGSGGSC (SEQ ID NO: 10) containing the SuMMV cleavage site; Id4 comprising
- KGGSGGGSETVRFQGGGSGGSC (SEQ ID NO: 11) containing the TVMV cleavage site; and, Ids comprising KGGSGGGSENLYFQSGGSGGSC (SEQ ID NO: 12) containing the TEV cleavage site.
- each FPD identimer class is generated by first modifying the peptide (through a lysine residue installed at the N-terminus of the peptide) using NHS-PEG4-transcyclooctene, such that it contains a compatible chemistry for downstream conjugation.
- 100pMpeptide is mixed with 500pMNHS-PEG 4 -transcyclooctene (TOO) in NHS peptide reaction buffer (100mM Sodium Phosphate, pH 8.5, supplemented with 80mM KOI and 70mM NaCI), and allowed to react at room temperature overnight.
- TCO-modified peptides are purified using HPLC to remove excess (and any unreacted) NHS-PEG4-TCO reagent.
- lOOpMoligonucleotides containing an internal amino-modification and bearing a 5’-phosphate group are modified in NHS-compatible annealing buffer (100mM Sodium Phosphate buffer, pH 8.5, supplemented with 80mM KOI) by incubation with 500pMNHS-PEG4-methyltetrazine overnight at room temperature.
- Methyltetrazine-modified oligonucleotides are then buffer-exchanged into NHS- compatible annealing buffer twice, using 7K MWCO Zeba desalting columns to remove excess and any unreacted NHS-PEG4-methyltetrazine reagent.
- TCO- modified peptides of each FPD identimer class are conjugated to the different tetrazine-modified ssDNA oligonucleotides (each containing a different nucleotide sequence that corresponds to the detectable labels conjugated to the identimer) by mixing in a 1:1 ratio at 25pMeach. This reaction is allowed to proceed for 30 minutes at room temperature.
- Complementary ssDNA oligonucleotides are annealed to form dsDNA by incubation with conjugates in a 1:1 molar ratio in NHS-compatible annealing buffer (100mM Sodium Phosphate buffer, pH 8.5, supplemented with 80mM KCI) at 95°C for 5 minutes on a heat block. The heat block is then removed and allowed to cool to room temperature on the benchtop over what amounts to be about a 2-hour time period. Idi class members receive a complementary oligo strand containing a 5’-amino modification for chemical modification and subsequent attachment to the solid support.
- NHS-compatible annealing buffer 100mM Sodium Phosphate buffer, pH 8.5, supplemented with 80mM KCI
- All complementary ssDNA oligonucleotides contain a 5’-phosphate modification for competent ligation to adjacent identimers during barcode formation.
- the annealed identimers are then labeled with maleimide- modified fluorophores via the installed cysteine residue at their C-terminal ends, by addition of each fluorophore to each FPD identimer class (each FPD identimer class receives one designated fluorophore, that is recorded within the attached dsDNA).
- FPD identimers are then buffer exchanged into storage buffer (50mM Tris-HCI pH 7.5 supplemented with 100mM NaCI), and can be used right away, stored for several days at 4°C or at -20°C for longer term storage.
- Identification of FPD Identimer molecular barcodes may be accomplished using a four-cycle deconvolution/decoding by cleavage experiment to confirm the detectable signal response and confirm the respective forming and encoding of the FPD identimer molecular barcode.
- FDP identimer-based molecular barcode libraries may be formed on beads, and the beads may be immobilized on a surface and imaged before and after contact with experimental solutions. Immobilized beads would first be imaged using a standard fluorescence microscope configured with appropriate excitation wavelengths and emission filters to record the combination of visual detectable labels making up each barcode in a given field of view or region of interest.
- the first cleavage step comprises exposing the beads in a first cleaving step to a solution containing 2 units of TEVp for a 30-60 minute incubation at 30°C in protease reaction buffer (50mM Tris-HCI pH 7.5, 0.5mM EDTA, 1mM DTT).
- protease reaction buffer 50mM Tris-HCI pH 7.5, 0.5mM EDTA, 1mM DTT.
- the beads are imaged in a first imaging step and fluorophore detectable labels conveying detectable signal response to TEVp are detected.
- the detectable signal response comprises a reduction of the intensity in the emission wavelength of the detectable labels of FPD identimer tokens dissociated from the molecular barcode during the first cleavage step.
- the second cleavage step comprises exposing the beads to a solution containing 2 units of TVMVp for a 30-60 minute incubation at 30°C in protease reaction buffer. Following incubation, the beads are imaged in a second imaging step and fluorophore detectable labels responsive to TVMVp are detected.
- the detectable signal response comprises a reduction of the intensity in the emission wavelength of the detectable labels of FPD identimer tokens dissociated from the molecular barcode during the second cleavage step.
- a washing step may be performed prior to the third cleavage step.
- the third cleavage step comprises exposing the beads to a solution containing 2 units of SuMMVp for a 30-60 minute incubation at 30°C in protease reaction buffer. Following incubation, the beads are imaged in a second imaging step and fluorophore detectable labels conveying a detectable signal response to SuMMVp are detected.
- the detectable signal response comprises a reduction of the intensity in the emission wavelength of the detectable labels of FPD identimer tokens dissociated from the molecular barcode during the third cleavage step.
- a washing step may be performed prior to the third cleavage step.
- the fourth cleavage step comprises exposing the beads to a solution containing 2 units of TUMVp for a 30-60 minute incubation at 30°C in protease reaction buffer. Following incubation, the beads are imaged in a fourth imaging step and fluorophore detectable labels conveying a detectable signal response to TUMVp are detected.
- the detectable signal response comprises a reduction of the intensity in the emission wavelength of the detectable labels of FPD identimer tokens dissociated from the molecular barcode during the fourth cleavage step.
- the uncleavable label After removal by cleavage of the final detectable labels from cleavable identimers, the uncleavable label will remain as the strongest detectable signal convey from the bead, allowing for its identification.
- the sequence of fluorophores for each molecular barcode that was determined during visual deconvolution is represented in the sequence of the NGS barcode, which can be retrieved and correlated with the bead following a NGS experiment.
- the recognition portion of every FPD identimer class comprises essentially identical protease recognition sites.
- an identimer class size is based on how many unique detectable labels are available. With Q-dots, this number is about 10, while with standard fluorophores this number is about four to about six. Dual labeling makes the class size 100 because each class member may be labeled with two Q- dots. This can also be done using combinations of standard fluorophores.
- the structural portion of a set of one or more FPD identimers is constructed by first conjugating different ssDNA strands to the same peptide (it has the same protease recognition site, and the same conjugation chemistry in all cases) through the N-terminal end of the peptide and an internal nucleotide modification sitting in the middle of the DNA sequence. That can be done in different tubes or wells (for example wells 1-10).
- FPD identimer class construction may be performed in different reactions to generate further FPD identimer classes to include in the set of FPD identimers . This is designed so that we know which ssDNA goes with which Q-dot.
- the FPD identimer classes are washed or otherwise purified, and a complementary ssDNA is hybridized to each identimer in the set of FPD identimers (the complementary ssDNA being configured to be complementary to the ssDNA of the structural portions of identimers in the set of FPD identimers) to form a dsDNA with 3’ unpaired ends.
- the complementary ssDNA will not ligate to the wrong FPD identimer class, thus providing a user a degree of control over how to configure and construct the FPD identimer classes.
- a user may configure a FPD identimer class to have a reaction specificity in which, for example, a Class B will ligate essentially only with class A and C; Class 3 will essentially only ligate with Blass B and D and so forth.
- construction of a FPD identimer comprises first conjugating the peptide to the Q-dot prior to adding a pre-hybridized dsDNA segment to complete the structural portion and form the FPD identimer.
- a FPD identimer construction process may be repeated for each FPD identimer class (each peptide that contains a different protease recognition site) and the FPD identimer classes may then be arranged for storage in tubes, or in a plate or plates depending on the size of each class and the number of classes. For example, as shown in Example 3, five FPD identimer classes comprising ten identimers each may put in 50 different wells of a plate.
- the beads upon making a combinatorial library comprising beads and a set of one or more molecular barcodes, the beads will have a first FPD identimer class attached to them in 10 different wells.
- the beads may then be washed, pooled, and randomly split into the next 10 wells for addition of a second FPD identimer class providing coverage of the combinatorial diversity of the library (all possible combination of what is now 100 different members, represented across 2 classes).
- the above process is repeated; the beads may be washed, pooled, and randomly split into the next 10 wells for addition of a third FPD identimer class, resulting in 1 ,000 members of the library represented across 3 classes.
- the FPD identimer construction process repeats until a total of 5 FPD identimer classes have been added — constituting a 1M member combinatorial library.
- the final “capping” FPD identimer class in Example 3 comprises a 3’-capture region for the capture of macromolecules from a reaction mixture (cell lysate for example).
- This 3’-capture region may be a poly(T) tag for capture of polyadenylated RNAs, or a specific tag for capture of nucleotide- tagged macromolecules used as reporters in an assay.
- the three-dimensional arrangement allows for decoding the molecular barcode by cycling the proteases through the cleavage steps one protease at a time and in a known order because each cleaving agent protease will essentially cleave only the detectable labels of FPD identimer tokens derived from FPD identimer classes comprising a recognition portions having orthogonal reactivity to the cleaving agent.
- molecular barcode decoding comprises detecting which detectable signal response (in this embodiment colors) are conveyed during exposure to a corresponding protease.
- detectable signal response in this embodiment colors
- a rainbow-colored bead will be missing an entire color, or have significantly less intensity for a given color, when we image after exposure to the first protease, then we would wash and add the next protease and a second color will be missing or have significantly less intensity when we image.
- Example 4 illustrates the use of a Fluorophore/NUCLEASEsite/dsDNA (FND) identimer-based molecular barcode for in situ labeling of material designed for decoding both visually as well as in a next generation sequencing (NGS) reaction milieu (FIGS. 5, 7, 8, 9, 11 ).
- FND Fluorophore/NUCLEASEsite/dsDNA
- NGS next generation sequencing
- a series of FND identimer classes may be used to encode and form a NGS-compatible visual molecular barcode by split-pool ligation to create a combinatorial library of barcodes that do not require DNA sequencing-based decoding.
- a single molecular barcode may be generated in the same experiment and in the presence of many different molecular barcodes being formed and encoded simultaneously.
- each endonuclease should be capable of cleaving a molecular barcode at the cleavage sites of identimers having recognition moieties comprising the respective endonuclease’s cleavage site sequence in the presence of substrates recognized by the other endonucleases, thus facilitating the orthogonal reactivity of the endonuclease cleaving agent during FND identimer-based molecular barcode decoding.
- a combinatorial library comprising beads and a set of one or more FND identimer-based molecular barcode includes five FND identimer classes (Idi-s) with Idi being configured to be an uncleavable identimer class and Id2- 4 being configured to be cleavable identimer classes.
- Idi-s FND identimer classes
- Id2- 4 FND identimer classes
- Experiment 4 is a four-cycle orthogonal nuclease cleavage experiment, confirmed by visual imaging of reactive fluorophores, performed to validate the encoding and decoding of FND identimer- based molecular barcodes on the beads.
- Experiment 4 comprises four cycles, each cycle comprising one or more cleavage steps followed by one or more imaging steps.
- Orthogonal cleaving agent nucleases may be applied one at a time during a cleavage step to the beads of the combinatorial library to facilitate orthogonal cleavage of formed and encoded molecular barcodes on the beads.
- a preliminary imaging step may be performed to detect detectable signal response conveyed from the detectable labels of FND identimer tokens incorporated into a molecular barcode present on any of the beads.
- Notl is introduced in the first cycle during a first cleavage step to decrease the intensity of detectable signal response convey by the detectable labels of FND identimer tokens reactive to Notl nuclease (in this case Ids tokens) allowing for the decoding or visual identification of the detectable labels dissociated from the Ids tokens through cleavage.
- orthogonally reactive cleaving agent nucleases are introduced during one or more of the cleavage steps in the following order: Notl during cycle 1; Xhol during cycle 2; Spel during cycle 3; and Hindi 11 during cycle 4.
- Detection of detectable signal response conveyed from dissociated detectable labels during one or more of the imaging steps facilitates decoding the three-dimensional arrangement of FND identimer tokens incorporated into the molecular barcodes.
- Limit of detection, specificity, and precision experiments (in triplicate) of nucleases acting as cleaving agents may be performed to establish the reproducibility of nuclease orthogonal reactivity to the FND Identimer-based molecular barcodes.
- a FND identimer scaffold portion comprises a pre hybridized double-stranded DNA (dsDNA) having one or more copies of a single endonuclease restriction sequence.
- dsDNA pre hybridized double-stranded DNA
- the amino-reactive heterobifunctional crosslinking reagent, NHS-PEG4-TCO may then be used to chemically modify the primary amine on the pre-hybridized dsDNA to contain the click-compatible TOO group.
- the pre-hybridized dsDNA is covalently linked to a single-stranded DNA (ssDNA) fragment having a 3’ and 5’ end through an internally amino-modified base in the ssDNA, that can be converted to the click- compatible methyltetrazine group using an amino-reactive heterobifunctional crosslinking reagent (NHS-PEG4-methyltetrazine).
- ssDNA single-stranded DNA
- a specific and complementary ssDNA oligonucleotide (harboring the same information as the ssDNA oligonucleotide is attached to the dsDNA fragment but in reverse- complement orientation) and may be hybridized to the ssDNA of the identimer to generate a dsDNA species with unpaired 3’-ends that will be compatible for ligation with other FND identimer classes to facilitate building a combinatorial chain of identimers by split-pooling.
- a labeled ssDNA strand comprising at least one detectable label (in this case, a fluorophore) is annealed to the attached ssDNA strand containing the specific endonuclease site.
- the labeled ssDNA strand is dually labeled on its opposing 5'- and 3'- ends to include four detectable labels.
- the dually labeled ssDNA comprises four different detectable labels, each detectable label having a unique detectable signal response, to provide 16 FND identimer classes.
- the detectable labels are operatively connected to the scaffold portion at a spaced-apart distance relative to each other to reduce fluorescent quenching or other non-specific reactions between the detectable labels.
- the labeled ssDNA strand is dual-labeled on its opposing 5’- and 3’-ends, and the sequence of the associated dsDNA duplex contains two restriction sites of the same type.
- the emission wavelength (observed color) of the detectable labels of each FND identimer is correlated with the sequence of the attached dsDNA (that is competent for ligation), thereby coupling information that can be obtained from the molecular barcode with information that can be obtained from the formed NGS barcode after DNA sequencing.
- the designed sequences making up the formed NGS barcode should not contain sequences that may be cleaved by any of the restriction enzymes used.
- Idi is the first identimer added and is therefore directly attached to solid support (bead).
- Idi doesn’t include a cleavage site and its recognition portion is an uncleavable linker.
- Idi comprises a 5'-amino modified base for attachment to the solid support, and can encode a 5'-constant region for retrieval and downstream NGS library preparation.
- the structural portions of Ids contains a 3'-capture sequence for the capture of macromolecules from cell lysate or a biological sample.
- the recognition portions and cleavage sites of Id 2 , Id3, Id4, and Ids are the recognition sequences and cleavage sites of, respectively, Hindlll, Spel, Xhol, and Notl endonucleases.
- fluorophores comprising multiple distinguishable labels are selected for combinatorial labeling of the FND identimers in this example.
- detectable labels of the FND identimers comprise one or more Q-dots.
- fluorophores of different emission wavelengths may be covalently attached to each identimer class, in different reaction mixtures (or wells of a plate) for each class-specific dsDNA bearing a specific nuclease site type.
- a single molecular barcode made up of five of FND identimer class types is illustrated in Example 4. Fluorophores may be added respectively, to Idi, Id2, Id3,
- the combination of fluorophores displayed on any formed individual FND identimer-based molecular barcode the combinatorial library may or may not have contiguous emission spectra, as the three-dimensional arrangement of FND identimer tokens incorporated into the molecular barcodes is determined by the cycling of cleavage steps using orthogonally reactive nucleases described herein.
- each ssDNA fragment is designed to record the emission wavelength a detectable signal response conveyed from dissociated detectable labels of FND identimer tokens incorporated into a molecular barcode.
- the complementary ssDNA strands of each FND identimer are formed by hybridization, whereby the two strands share significant complementarity, but remain unpaired at their 3’-ends.
- the unpaired 3’-ends of concatenated FND identimer tokens are configured to be complementary with acceptor 3’-ends of adjacent FND identimer tokens.
- FND identimers may be sequentially added in a pre defined order to form molecular barcodes over sequential rounds of splitting and pooling.
- Each round of FND identimer class addition will enable concatenation of identimer dsDNA through enzymatic ligation of the 5'- and 3'-ends of the ssDNA fragments of hybridized identimers.
- the ligation of each identimer may be performed in 1X ligase buffer (50mM Tris-HCI pH 7.5, 10mM MgCI2, 1mM ATP, 10mM DTT) in the presence of 500-1, 000U of T4 DNA ligase and 20-80U of Polynucleotide Kinase over a 25-minute incubation at 37°C.
- a final “capping” FND identimer class comprises dsDNA (making up the NGS portion of the barcode) to encode both the wavelength of the label attached to the identimer polypeptide protease recognition portion, as well as a 3' capture region designed to capture macromolecules from cell lysate.
- a unique molecular identifier may also be included as a contiguous stretch of randomized or semi-randomized bases within the capping FND identimer class, or may be added to the NGS barcode with each identimer class in smaller stretches of randomized or semi-randomized bases.
- the configured sequences making up the formed NGS barcode essentially do not contain sequences that can be cleaved by any of the restriction enzymes used.
- computational filtering of configured molecular barcodes may be used to avoid the inclusion of one of the restriction sites used for the decoding by cleavage experiment.
- the FND identimer barcode contains 5 class members joined by enzymatic ligation.
- Each FND identimer class-specific ssDNA making up the labeled dsDNA containing one or more restriction endonuclease site types may be purchased from a commercial vendor.
- the recognition portions and cleavage sites of the FND identimer classes comprise nucleotide sequences configured, respectively: (ldi is not cleavable.); Icte comprising /5AmMC6/ATTTATTTAAGCTTATTATTATTT (SEQ
- amino modifications are only on the 5’ end of one of the oligonucleotides making up an FND Identimer scaffold portion that is attached to the bead.
- all other dsDNA FND Identimer scaffold portion contain internal amino modifications so that the ends are available for ligation of flanking FND Identimer scaffold portions.
- the amino modifications are on 5’ ends of all hybridizing FND Identimers, because for each hybridizing duplex, one strand’s 5’ end is biotinylated and the other strand of the duplex contains a label attached via NHS-modified fluorophores.
- the amino modifications are included within internal positions of the scaffold portions all identimers in a set of ringed or circular dsDNA-based identimers so that free ends will available for ligation.
- 5AmMC6 refers to Amino Modifier C6.
- amino modifiers are used to introduce a primary amino group into an oligonucleotide.
- an amino modifier may be used in conjunction with a NHS ester or isothiocyanate fluorescent detectable labels.
- Amino Modifier C6 may be incorporated during oligonucleotide synthesis and can be used to label the 5’ end of an oligonucleotide with a primary amino group at the end of a six-carbon spacer.
- N- Hydroxysuccinimide (NHS) refers to an organic compound with the formula (CH2CO)2NOH.
- N-hydroxysuccinimide esters or “NHS-esters” may be used to site-specifically modify primary amine groups installed within synthesized oligonucleotides that contain such groups at designed locations, similarly to the way proteins may be non-selectively on free amino groups by ester- mediated derivatization (see e.g., Nanda et al., Methods Enzymol., 536:87-94 (2014)).
- NHS-esters may be used to label to the primary amines (R-NH2) or proteins, amine-modified nucleotides, and other amine-containing molecules.
- NHS fluorophore refers to any fluorophore conjugated to NHS.
- FND identimer class may be generated by first annealing a complementary, labeled strand that together with the class-specific ssDNA strands listed above (Id2-s), generate the viable dsDNA restriction sites. This may be done in a 1:1 molar ratio in NHS-compatible annealing buffer (100mM Sodium Phosphate buffer, pH 8.5, supplemented with 80mM KCI) at 95°C for 5 minutes on a heat block, the heat block is then removed and allowed to cool to room temperature on the benchtop over what amounts to be about a 2-hour time period.
- NHS-compatible annealing buffer 100mM Sodium Phosphate buffer, pH 8.5, supplemented with 80mM KCI
- the class-specific dsDNA bearing the single endonuclease site type may be modified (through the 5’-amino modification shown in the sequences above) using NHS-PEG4-transcyclooctene, such that it contains a compatible chemistry for downstream conjugation.
- 100pMof the labeled dsDNA may be mixed with 500pMNHS-PEG 4 -transcyclooctene (TCO) in NHS-compatible annealing buffer (100mM Sodium Phosphate buffer, pH 8.5, supplemented with 80mM KCI), and allowed to react at room temperature overnight.
- TCO-modified dsDNA duplexes may be purified by desalting to remove excess (and any unreacted) NHS-PEG4-TCO reagent.
- lOOpMoligonucleotides containing an internal amino-modification are first annealed to complementary ssDNA to form dsDNA encoding the NGS portion of the barcode by incubation in a 1:1 molar ratio in NHS-compatible annealing buffer (100mM Sodium Phosphate buffer, pH 8.5, supplemented with 80mM KCI) at 95°C for 5 minutes on a heat block, the heat block is then removed and allowed to cool to room temperature on the benchtop over what amounts to be about a 2-hour time period.
- NHS-compatible annealing buffer 100mM Sodium Phosphate buffer, pH 8.5, supplemented with 80mM KCI
- Idi class NGS barcode portions receive a complementary oligo strand containing a 5’-amino modification for chemical modification and subsequent attachment to the solid support.
- All complementary ssDNA oligonucleotides comprise a 5’-phosphate modification for competent ligation to adjacent identimers during barcode formation.
- the internal amino-modified oligos also bearing 5’-phosphate groups are modified in NHS-compatible annealing buffer by incubation with 500pMNHS-PEG4-methyltetrazine overnight at room temperature.
- Methyltetrazine-modified oligonucleotides are then buffer-exchanged into NHS- compatible annealing buffer twice, using 7K MWCO Zeba desalting columns to remove excess and any unreacted NHS-PEG4-methyltetrazine reagent.
- TCO- modified dsDNA of each class are conjugated to the different methyltetrazine- modified ssDNA oligonucleotides (each containing a different nucleotide sequence that corresponds to the detectable label already conjugated to the identimer) by mixing in a 1:1 ratio at 25pMeach. This reaction may be allowed to proceed for 30 minutes at room temperature.
- the conjugated FND Identimers may then be buffer exchanged into storage buffer (50mM Tris-HCI pH 7.5 supplemented with 100mM NaCI), and may be used right away, stored for several days at 4°C or at -20°C for longer term storage.
- storage buffer 50mM Tris-HCI pH 7.5 supplemented with 100mM NaCI
- Identification of FND identimer-based molecular barcodes may be accomplished using a four-cycle deconvolution/decoding by cleavage experiment to confirm the detectable signal response conveyed from detectable labels dissociated by cleavage from a molecular barcode to decode its encoded three-dimensional arrangement.
- molecular barcode combinatorial libraries may be formed on beads, and the beads may be immobilized on a surface and imaged before and after contact with experimental solutions. Immobilized beads would first be imaged using a standard fluorescence microscope configured with appropriate excitation wavelengths and emission filters to record the combination of visual detectable labels making up each barcode in a given field of view or region of interest.
- Hi-Fidelity versions of the endonucleases listed herein can function with 100% efficiency in the same buffer (1X OUTSMART buffer available from New England Biolabs, Inc. at 428 Newburyport Turnpike, Rowley, MA 01969, U.S.A).
- the beads are exposed in a first cleavage step to a solution containing 5 units of Notl-HF endonuclease for a 5-10 minute incubation at 37°C in OUTSMART buffer (50mM Potassium Acetate, 20mM Tris-acetate, 10mM Magnesium Acetate, 100ug/ml BSA; buffer pH is 7.9 at 25°C).
- OUTSMART buffer 50mM Potassium Acetate, 20mM Tris-acetate, 10mM Magnesium Acetate, 100ug/ml BSA; buffer pH is 7.9 at 25°C.
- the detectable signal response comprises a reduction of the intensity in the emission wavelength of the detectable labels of FND identimer tokens dissociated from the molecular barcode during the first cleavage step.
- a washing step may be performed prior to the second cleavage step.
- the second cleavage step comprises exposing the beads a solution containing 5 units of Xhol-HF for a 5-10 minute incubation at 37°C in OUTSMART buffer. Following incubation, the beads are imaged in a second imaging step and fluorophore detectable labels responsive to Xhol endonuclease activity are detected.
- the detectable signal response comprises a reduction of the intensity in the emission wavelength of the detectable labels of FND identimer tokens dissociated from the molecular barcode during the second cleavage step.
- a washing step may be performed prior to the third cleavage step.
- the third cleavage step comprises exposing the beads to a solution containing 5 units of Spel-HF for a 5-10 minute incubation at 37°C in OUTSMART buffer. Following incubation, the beads are imaged in a third imaging step and fluorophores responsive to Spel endonuclease activity are detected.
- the detectable signal response comprises a reduction of the intensity in the emission wavelength of the detectable labels of FND identimer tokens dissociated from the molecular barcode during the third cleavage step.
- a washing step may be performed prior to the fourth cleavage step.
- the fourth cleavage step comprises exposing the beads to a solution containing 5 units of Hindi 11 for a 5-10 minute incubation at 37°C in OUTSMART buffer. Following incubation, the beads can be imaged in a fourth imaging step and fluorophores responsive to Hindi 11 endonuclease activity are detected.
- the detectable signal response comprises a reduction of the intensity in the emission wavelength of the detectable labels of FND identimer tokens dissociated from the molecular barcode during the fourth cleavage step.
- more than one restriction endonuclease may be introduced in each cycle of decoding by cleavage, but these must be introduced under conditions whereby different endonucleases possess significantly different kinetic cleavage rates, and multiple images should be acquired during cleavage.
- the uncleavable label After removal of the final cleavable identimer label, the uncleavable label will remain as the strongest signal emitted from the bead, allowing for its detection and identification.
- the sequence of fluorophores for each barcode that was determined during visual deconvolution is represented in the sequence of the NGS barcode, which can be retrieved and correlated with the bead following a NGS experiment.
- Single-label and dual-label FND identimer-based molecular barcode combinatorial libraries were generated from a set of five FND identimer classes (FND ldi- 5 ).
- a set of five NHS-modified fluorophores were conjugated directly to amino-modified bases as described herein.
- a single-label FND identimer class comprises a recognition moiety that correlates to a single label upon cleavage
- a dual-label FND identimer class comprises a recognition moiety that correlates to two labels upon cleavage.
- a combinatorial library built with an unmixed volume of AF-750 labeled, Hindi I l-based FND identimers will lose 750nm spectra upon cleavage by Hindi 11
- a combinatorial library built with a 50/50 ratio of, respectively, AF-750 labeled, Hindi I l-based FND identimer and ATTO- 550 Hindi I l-based FND identimer will lose both 550 nm and 750 nm spectra upon cleavage, effectively doubling the mutual information that may be collected during one cleavage cycle.
- FIG. 18 is a textual representation of a set of five FND identimers (FND ID 1-5 ) having scaffold portions configured with orthogonal sticky-ends, orthogonal recognition moieties and cleavage sites as well as modified nucleotides.
- each scaffold portion of FND ldi- 5 comprised an unlabeled single-stranded oligonucleotide configured to anneal with a complementary single-stranded, labeled oligonucleotide to form a duplex oligonucleotide with an unpaired, single-stranded overhang portion, or “sticky-end.”
- the unpaired sticky-end may comprise 1 to 5 nucleotides.
- the unpaired sticky-end may comprise more than 5 nucleotides. It was observed that more orthogonal sticky-end configurations became possible as the number of its nucleotides increased.
- the ligation is a templated ligation.
- template ligation or “templated ligation” collectively refer to ligation reactions that are specific to the correct complementary sticky ends. Skilled persons will understand that it would be possible to use as little as one base overhang, as that is the configuration used for NGS adapters commonly ligated onto DNA for sequencing in many existing kits known in the art. In some embodiments, an increased orthogonality is useful for allowing the simultaneous ligation of multiple identimer classes.
- all identimer classes making up a set of identimer classes may be ligated together in a single reaction because each sticky- end pair in the set was configured to be orthogonal to each other.
- an Idi identimer having a sticky-end configured to anneal to a sticky-end of an Id2 identimer and remain orthogonal to a sticky-end of an Id3 identimer allows for the selective ligation of Idi identimer to Id2 identimer only. Skilled persons will understand that the process of sticky-end ligation facilitates the selective ligation of a pair of duplex oligonucleotides that have complementary sticky-ends.
- FND ldi-s was produced to perform two orthogonal nuclease cleavage experiments producing combinatorial libraries from both single-label and dual-label FND identimer classes.
- the scaffold portion of FND Idi comprised an unlabeled SEQ ID NO: 17 configured to anneal to a labeled SEQ ID NO: 18.
- the nucleotides corresponding to positions 1 to 5 of SEQ ID NO: 18 comprised a sticky-end configured to anneal the sticky-end of SEQ ID NO: 19.
- Positions 1 and 25 were modified, respectively, by 5' phosphorylation and to be a 3' Amino Modifier C6 dT.
- a scaffold portion is configured to have a recognition moiety without a peptide or nucleotide sequence that reacts to the known cleaving agents to render it non-cleavable.
- a non-cleavable identimer may be useful as a last-visualized identimer since some cleaving agents become less effective as steric hinderance increases. For example, it was observed that Hindi 11 has reduced efficacy when cleaving dsDNA directly from a solid phase, such as a streptavidin bead. It was observed that, in certain instances, once a single label was observed following a cleavage step there was no need to perform the final cleavage step.
- a last-visualized identimer may be configured to be cleavable, since once it has been identified by imaging, there would be no need to cleave it.
- the scaffold portion of FND Id2 comprised an unlabeled SEQ ID NO: 19 configured to anneal to a labeled SEQ ID NO: 20.
- the nucleotides corresponding to positions 1 to 5 of SEQ ID NO: 19 comprised a sticky-end configured to anneal the sticky-end of SEQ ID NO: 18.
- Positions 13 to 18 of SEQ ID NO: 19 comprised a Hindi 11 recognition moiety and cleavage site.
- Position 1 of SEQ ID NO: 19 was modified by 5' phosphorylation.
- the nucleotides corresponding to positions 1 to 5 of SEQ ID NO: 20 comprised a sticky-end configured to anneal to the sticky-end of SEQ ID NO: 21.
- Positions 16-21 of SEQ ID NO: 20 comprised a Hindi 11 recognition moiety and cleavage site. Positions 1 and 9 of SEQ ID NO: 20 were modified, respectively, by 5' phosphorylation and to be a Int Amino Modifier C6 dT.
- the scaffold portion of FND Id3 comprised an unlabeled SEQ ID NO: 21 configured to anneal to a labeled SEQ ID NO: 22.
- the nucleotides corresponding to positions 1 to 5 of SEQ ID NO: 21 comprised a sticky-end configured to anneal the sticky-end of SEQ ID NO: 20.
- Positions 13 to 18 of SEQ ID NO: 21 comprised a Spel recognition moiety and cleavage site.
- Position 1 of SEQ ID NO: 21 was modified by 5' phosphorylation.
- the nucleotides corresponding to positions 1 to 5 of SEQ ID NO: 22 comprised a sticky-end configured to anneal to the sticky-end of SEQ ID NO: 23.
- Positions 16-21 of SEQ ID NO: 22 comprised a Spel recognition moiety and cleavage site. Positions 1 and 9 of SEQ ID NO: 22 were modified, respectively, by 5' phosphorylation and to be Int Amino Modifier C6 dT.
- the scaffold portion of FND Id4 comprised an unlabeled SEQ ID NO: 23 configured to anneal to a labeled SEQ ID NO: 24.
- the nucleotides corresponding to positions 1 to 5 of SEQ ID NO: 23 comprised a sticky-end configured to anneal the sticky-end of SEQ ID NO: 22.
- Positions 13 to 18 of SEQ ID NO: 21 comprised a Xhol recognition moiety and cleavage site.
- Position 1 of SEQ ID NO: 21 was modified by 5' phosphorylation.
- the nucleotides corresponding to positions 1 to 5 of SEQ ID NO: 24 comprised a sticky-end configured to anneal to the sticky-end of SEQ ID NO: 25.
- Positions 16 to 21 of SEQ ID NO: 22 comprised a Xhol recognition moiety and cleavage site. Positions 1 and 9 of SEQ ID NO: 24 were modified, respectively, by 5' phosphorylation and to be a Int Amino Modifier C6 dT.
- the scaffold portion of FND Ids comprised an unlabeled SEQ ID NO: 25 configured to anneal to a labeled SEQ ID NO: 26.
- the nucleotides corresponding to positions 1 to 5 of SEQ ID NO: 25 comprised a sticky-end configured to anneal the sticky-end of SEQ ID NO: 24.
- Positions 11 to 18 of SEQ ID NO: 25 comprised a Notl recognition moiety and cleavage site.
- Position 1 of SEQ ID NO: 25 was modified by 5' phosphorylation.
- the nucleotides corresponding to positions 11 to 18 of SEQ ID NO: 26 comprised a Notl recognition moiety and cleavage site.
- Position 1 of SEQ ID NO: 26 was modified to be a 5AmMC6T.
- 3AmMC6T refers to 3' Amino Modifier C6 dT, a modified nucleotide available commercially from Integrated DNA Technologies, Inc. (IDT) (1710 Commercial Park, Coralville, Iowa 52241, USA).
- 5' Biotin- TEG or “5BiotinTeg” collectively refer to a biotin molecule attached to a 15-atom, mixed polarity triethelyene glycol spacer, available commercially as a modification that can be installed during synthesis by IDT. Skilled persons will understand that 5' Biotin-TEG may be incorporated at either the 5' or 3' end of an oligonucleotide.
- 5Phos refers to 5' phosphorylation, such as, for example, the phosphorylation of an oligonucleotide at its 5' end. Skilled persons will understand that 5' Phosphorylation is needed if an oligonucleotide is used as a substrate for a DNA ligase enzyme.
- 5AmMC6T refers to 5' Amino Modifier C6 dT, a modified nucleotide available commercially from Integrated DNA Technologies,
- FND Idi-5 were configured accordingly: FND Idi was unlabeled; FND Id2 comprised a single AF-750; FND Id3 comprised a single AF-647; FND Id4 comprised a single ATTO-550; and, FND Ids comprised a single ATTO-488.
- each scaffold portion of FND ldi-5 was labeled by NHS ester- mediated derivatization.
- AF-750 was used to label the 3' Int Amino Modifier C6 dT of SEQ ID NO: 20 at position 9.
- AF-647 was used to label the Int Amino Modifier C6 dT of SEQ ID NO: 22 at position 9.
- ATTO-555 was used to label the Int Amino Modifier C6 dT of SEQ ID NO: 24 at position 9.
- ATTO-488 was used to label the 5' Amino Modifier C6 dT of SEQ ID NO: 26 at position 1.
- FND Idi- 5 For the dual-label orthogonal nuclease cleavage experiment, the detectable labels of FND ldi- 5 were configured accordingly: FND Idi was unlabeled; FND Id2 comprised approximately 50% AF-750 and approximately 50% ATTO-550; FND Id3 comprised approximately 50% AF-647 and approximately ATTO-488; FND Id4 comprised approximately 50% AF-647 and approximately 50% ATTO-550; and, FND Ids comprised approximately 50% ATTO-488 and approximately 50% AF-750. As disclosed herein, each scaffold portion of FND ldi- 5 was labeled by NHS ester- mediated derivatization.
- AF-750 and ATTO-550 were used to label the 3' Int Amino Modifier C6 dT of SEQ ID NO: 20 at position 9.
- AF-647 and ATTO-488 were used to label the Int Amino Modifier C6 dT of SEQ ID NO: 22 at position 9.
- AF-647 and ATTO-555 was used to label the Int Amino Modifier C6 dT of SEQ ID NO: 24 at position 9.
- ATTO-488 and AF-750 were used to label the 5' Amino Modifier C6 dT of SEQ ID NO: 26 at position 1.
- ATTO-488 is an ATTO fluorescent dye having an maximum absorption of 501 nm and a maximum fluorescence of 523 nm. Skilled persons will understand that ATTO-488 is excited more efficiently in a range of 480 nm to 515 nm.
- ATTO-550 is an ATTO fluorescent dye having an maximum absorption of 554 nm and a maximum fluorescence of 576 nm. Skilled persons will understand that ATTO-550 is excited more efficiently in a range of 540 nm to 565 nm.
- ATTO-488 and ATTO-550 are commercially available from ATTO- Tec GmbH (Martinshardt 7, 57074 Siegen; info@att-tec.com; Product No.: AD 488 and Product No.: AD550.)
- AF-405 is an Alexa Fluor 405 dye, a blue-emitting synthetic fluorophore having an excitation peak at 401 nm and an emission peak at 421 nm.
- AF-647 is an Alexa Fluor 647 dye, a far-red fluorescent dye having an excitation suited for 594 nm or 633 nm laser lines.
- AF-750 is an Alexa Fluor 750 dye, a bright, near-infrared fluorescent dye having an excitation suited for 633 nm laser line or dye-pumped excitation.
- AF-405, AF-647, and AF-750 are commercially available from ThermoFisher Scientific, Inc. (168 Third Avenue, Waltham, MA 02451, USA; Cat. No. A30000 for AF-405; Cat. No. A20006 for AF-647; Cat. No. A20011 for AF-750).
- AF-405, AF-647, AF-750, ATTO-488, and ATTO- 550 may be used interchangeably to produce different classes of Identimers.
- Skilled persons will understand that any NHS-conjugated fluorophore (NHS-fluorophore) may be used to label any free amino group by NHS ester-mediated derivatization.
- NHS-conjugated fluorophores are available commercially from the vendors provided herein.
- NH2 groups on the labeled oligonucleotides of FND ldi-5 (SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26) were labeled with fluorophores using 20mI of 200mM oligonucleotide and resuspended in 100mM NaP04 (sodium phosphate buffer) at pH 8.5 and then mixed with 2mI of 100mM NHS-Fluorophore (e.g., NHS-ATTO-488, NHS-ATTO-550, NHS- AF-647, or NHS-AF-750 resuspended in anhydrous dimethylformamide (DMF)), in an overnight reaction at room temperature.
- NHS-Fluorophore e.g., NHS-ATTO-488, NHS-ATTO-550, NHS- AF-647, or NHS-AF-750 resuspended in anhydrous dimethylformamide (DMF)
- the labeled oligonucleotides were then diluted with 80mI of 2X hybridization buffer (20mM Tris pH 7.5, 1M NaCI, 1mM EDTA) to a 100mI volume to quench any remaining unreacted NHS groups.
- 100mI of 40mM labeled oligonucleotide was added to and mixed with its corresponding unlabeled oligonucleotide as described herein (i.e., SEQ ID NO: 17 mixed with SEQ ID NO: 18; SEQ ID NO: 19 mixed with SEQ ID NO: 20; SEQ ID NO: 21 mixed with SEQ ID NO: 22; SEQ ID NO: 23 mixed with SEQ ID NO: 24; SEQ ID NO: 25 mixed with SEQ ID NO: 26), for subsequent annealing.
- Annealing was then performed by exposing all FND ldi-5 scaffold portion oligonucleotide duplexes to 90°C for approximately 5 minutes in a heat block and then placing the heat block on the lab bench until it reached room temperature (approximately 1.5 hours).
- 10mI of biotinylated FND Idi scaffold portion oligonucleotide duplex was diluted 10-fold in 90mI of hybridization buffer to 200nM, and 10OmI of each differentially-labeled duplex was mixed with 10OmI of 0.2mg/ml Dynabeads MyOne Streptavidin T1 beads (streptavidin beads) (available commercially from ThermoFisher Scientific, Inc.
- Ligation steps were performed at temperatures between room temperature or up to 37°C, in the presence of both T4 polynucleotide kinase (PNK) (commercially available from NEB; (5mI/250mI ligation reaction); Cat. No. M0201S or Mo201L) and T4 DNA ligase (commercially available from NEB; (5mI/250mI ligation reaction); Cat. No. M0202S, M0202T, M0202L, or M0202M) in 1X T4 DNA ligase buffer provided by NEB.
- PNK polynucleotide kinase
- On-bead ligations were performed using 0.1 mg/ml streptavidin beads coated with FND Idi scaffold portion oligonucleotide duplexes as described herein (washed 2X in 100mI of 1 X ligation buffer provided by NEB), for attachment of FND Id2 scaffold portion oligonucleotide duplexes. All subsequent ligations of FND Id3-s scaffold portion oligonucleotide duplexes were performed under the same conditions.
- FND indentimers may be ligated in solution prior to ligating to the streptavidin beads.
- the set of FND Id2 was pre-ligated with the set of FND Id3 prior to ligation to the streptavidin beads via FND Idi .
- Ligations were performed in the presence of both T4 polynucleotide kinase (PNK) and T4 DNA ligase. Between each round of ligation, extensive washing was performed using 0.5-1 mM EDTA to quench any remaining ligase activity, high salt (between 500mM-1M NaCI) to remove ligase from dsDNA duplexes, and detergent (0.01% Tween-20) to aid in ligase enzyme removal from duplexes and to prevent clumping of beads; the latter could lead to uneven coating of beads during subsequent ligations, so to further prevent this, beads were sonicated for at least 3 minutes prior to imaging, and prior to being exposed any enzyme catalyzed reactions. For further details, see methods section.
- streptavidin beads were washed 3X with 200 mI 2X hybridization buffer supplemented with 0.01% Tween-20, 2X with 200mI 1X hybridization buffer, resuspended in 20mI of 1X hybridization buffer, sonicated for 3 minutes, and loaded into a flow cell containing a biotin-modified surface for immobilization and subsequent cleavage experiments.
- FND identimer-based molecular barcodes were constructed by ligation of differentially-labeled, pre-hybridized oligonucleotide duplexes.
- Molecular barcodes having 4 different distinguishable labels were constructed from 5 segments (i.e., 5 identimer tokens per molecular barcode) using a combinatorial ligation strategy.
- a set of FND identimers was bound to streptavidin beads via a 5’-biotin modification on one of the oligonucleotide strands of the scaffold portion oligonucleotide duplexes in the set.
- the scaffold portions of FND Id2 were labeled with AF-750 and comprised both an enzyme-accessible Hindi 11 cleavage site and a 5’ overhang (phosphorylated) compatible for ligation to the scaffold portion oligonucleotide duplexes of FND Id3.
- a sample of these streptavidin beads were immobilized in a flow cell for imaging. Images of the streptavidin beads bound to molecular barcodes containing FND Idi and FND Id2 were taken in 4 fluorescence channels, and intensity values were plotted as shown in FIG. 19.
- 19A represents a streptavidin bead labeled with a first labeled dsDNA identimer segment (Id1 /Hind 111-AF750) attached through ligation to an unlabeled dsDNA that is attached to the bead through a biotin moiety.
- FIG. 19B represents a streptavidin bead labeled with a second labeled dsDNA identimer segment (Id2/Spel-AF647).
- FIG. 19C represents a streptavidin bead labeled with a third labeled dsDNA identimer segment (ld3/Xhol-ATTO550).
- FIG. 19D represents a streptavidin bead labeled with a fourth labeled dsDNA identimer segment (ld4/Notl- ATT0488).
- 20A represents un-cleaved beads imaged in four fluorescence channels; 647nm (upward diagonal stripes), 550nm (downward diagonal stripes), 488nm (horizontal stripes) and 750nm (dotted), and mean intensity values obtained for each fluorophore were plotted to the right.
- the scaffold portions of the FND Id3 were labeled with AF-647 and comprised both an enzyme-accessible Spel cleavage site and a 5’ overhang (phosphorylated) compatible for ligation to the scaffold portion oligonucleotide duplexes of FND Id4.
- a sample of the beads were immobilized in a flow cell for imaging. Images of streptavidin beads bound to molecular barcodes comprising ligated (i.e., encoded) FND Idi/FND Id2/FND Ids/ identimer tokens were taken in 4 fluorescence channels, and intensity values were plotted (FIG.
- the streptavidin beads bound to molecular barcodes comprising the ligated (i.e., encoded) FND Idi/FND Id2/FND Ids/ identimer tokens were then subjected to another round of ligation, whereby the FND Id4 were ligated (i.e., concatenated) to the growing molecular barcode.
- the scaffold portions of the FND Id4 were labeled with ATTO-550 and comprised both an enzyme-accessible Xhol restriction site, and a 5’ overhang (phosphorylated) compatible for ligation to the scaffold portion oligonucleotide duplexes of FND Ids.
- the scaffold portions of the FND Ids were labeled with ATTO-488 and comprised both an enzyme-accessible Notl restriction site (i.e., cleavage site).
- an enzyme-accessible Notl restriction site i.e., cleavage site.
- a sample of the streptavidin beads were immobilized in a flow cell for imaging. Images of streptavidin beads bound to molecular barcodes comprising FND Idi/FND Id2/FND Ids/FND Id4/FND Ids/ identimer tokens were taken in 4 fluorescence channels, and intensity values were plotted (FIG. 19).
- the three dimensional arrangement is linear, having one or two open ends. In some embodiments, the three dimensional arrangement is circular, having no open ends. In some embodiments, the three dimensional arrangement is a hairpin formation, having a looped end and an open end. Skilled persons will understand that sticky-end ligation of two or more dsDNA oligonucleotides will generally form a linear chain of segments, each segment ligated together and at least one end, to form a chain.
- FIG. 20B represents mean intensity of beads following exposure to the Notl enzyme.
- FIG. 20C represents mean intensity of beads following exposure to the Xhol enzyme.
- FIG. 20D represents mean intensity of beads following exposure to the Spel enzyme.
- FIGS. 20A-D show the decoding of a molecular barcode formed and encoded in a three dimensional arrangement by the concatenation of a set of one or more FND Identimers. As shown in FIGS. 20A-D, molecular barcodes, were formed and encoded from linear, segmented chains of ligated FND ldi-s identimer tokens and streptavidin beads. As shown in FIGS.
- the FND ldi identimer tokens were unlabeled; the FND Id2 identimer tokens were labeled with ATTO-550 and encoded an enzyme-accessible Hindi 11 recognition moiety; the FND Id3 identimer tokens were labeled with AF-647 and encoded an enzyme-accessible Spel recognition moiety; the FND Id4 identimer tokens were labeled with AF-750 and encoded an enzyme-accessible Xhol recognition moiety; and the FND Ids identimer tokens were labeled with ATTO-488 and encoded an enzyme-accessible Notl restriction site.
- streptavidin beads bearing the described identimer chain (and label combination) were immobilized on the surface of a flow cell, and were imaged within a single lane of the flow cell.
- all of the beads in any given field-of-view (FOV) were bearing the same identimer chain.
- Streptavidin beads in the flow cell lane were imaged prior to being exposed to any restriction enzymes, whereby signal for all four detectable labels was observed (FIG. 20A).
- the streptavidin beads in the flow cell lane were first contacted with a solution containing 200U of Notl enzyme (substantially removed of glycerol by desalting) in 1X Outsmart buffer provided by New England Biolabs (Cat. No. B7204), and incubated for 30 minutes at 37°C in a temperature-controlled heat-block chamber (off of the microscope).
- the flow cell was then removed from the microscope, and beads were contacted with a solution containing 400U of Xhol enzyme (substantially removed of glycerol by desalting) in 1X Cutsmart buffer provided by New England Biolabs (NEB), and incubated for 30 minutes at 37°C in a temperature-controlled heat-block chamber (off of the microscope). Following incubation with the Xhol enzyme, the flow cell lane was flushed with at least 10Oplof 1X Cutsmart buffer, and the flow cell was placed back on the microscope stage. Images of the same flow cell lane were then acquired in all 4 fluorescence channels.
- Xhol enzyme substantially removed of glycerol by desalting
- the flow cell was then removed from the microscope, and beads were contacted with a solution containing 300U of Spel enzyme (substantially removed of glycerol by desalting) in 1X Outsmart buffer provided by New England Biolabs (NEB), and incubated for 30 minutes at 37°C in a temperature-controlled heat-block chamber (off of the microscope). Following incubation with the Spel enzyme, the flow cell lane was flushed with at least 10OmI of 1X Outsmart buffer, and the flow cell was placed back on the microscope stage. Images of the same flow cell lane were then acquired in all 4 fluorescence channels.
- Spel enzyme substantially removed of glycerol by desalting
- FND Identimer Tokens having Single and Dual (Mixed) Detectable Labels [00249] As shown in FIG. 21 B, molecular barcodes comprising one unlabeled FND identimer token (FND Ido) and four dual-labeled FND Identimer tokens (FND Idi- 550/750, FND ld 2 -488/647, FND ld 3 -750/488, and FND ld 4 -488)
- IG. 21A presents a bar graph representing OCS results obtained for single-label dsDNA identimer chains where the order of the cycles was reversed for analysis.
- FIG. 21 B presents a bar graph representing OCS results obtained for mixed-label dsDNA identimer chains where the order of the cycles was reversed for analysis.
- FIGS. 21 A and 21 B are, respectively, bar graphs showing the decoding of molecular barcodes comprised of differentially labeled FND Identimer tokens having single and dual (mixed) detectable labels. As shown in FIG.
- molecular barcodes comprising four single-label FND identimer tokens (encoded using FND identimer classes: FND ldi-550, FND ld 2 -750, FND ld3-647, and FND Id 4 -488) were constructed and then decoded to demonstrate the combinatorial library theoretical capacity single-label FND Identimer-based molecular barcodes.
- a FND ld 2 -488 was mixed at an equimolar ratio with a FND ld 2 -550 to generate a total of 10 visually-discernable combinations of detectable labels per dual-label identimer (as opposed to 4 different possibilities per single-label identimer).
- the visually- discernable combinations for each dual-label FND Identimer that were mixed for constructing dual-labeled identimer chains were 488/488, 488/550, 488/647,
- pre-determined chain configurations i.e., pre-determined single labels or mixtures of labels at each identimer position in the chain
- streptavidin beads as described previously (i.e., by rounds of ligation with washes in between).
- Average bead intensity across greater than 100 beads in each field of view were acquired for strepatividin beads bearing single-label and dual-label identimer chains in all four fluorescence channels at each cycle. Images were acquired for uncut streptavidin beads, streptavidin beads subsequently cleaved with Notl (cycle 1), streptavidin beads subsequently cleaved with Xhol (cycle 2), and streptavidin beads subsequently cleaved with Spel (cycle 3). Images were then analyzed in reverse-chronological order.
- images acquired following cycle 3 were analyzed as the first images in the series; images acquired following cycle 2 (cleavage by Xhol) were analyzed as the second images in the series; images acquired following cycle 1 (cleavage by Notl) were analyzed as the third images in the series; and, uncleaved beads were analyzed as the fourth images in the series.
- intensity values for all four fluorescence channels obtained in the “previous” image were subtracted from the “next” image to enable cycle by cycle determination of labels associated with each identimer segment.
- intensity values obtained for all four fluorescent channels from images acquired following Spel cleavage were subtracted from intensity values obtained for all four fluorescent channels from images acquired following Xhol cleavage, and this analysis was performed for each cycle in the series. Following analysis, intensity values obtained in all four fluorescence channels were plotted for both single- and dual-labeled chains at each cycle.
- FIG. 21 A shows the sequence or order of labels released from beads bearing single label identimer chains.
- FIG. 21 B shows the order released from streptavidin beads bearing dual-labeled chains.
- the order (i.e., sequence) of labels released across all cleavage cycles in the single-label experiment was readily resolved; only labeled FND identimer tokens are numbered in the graph (Ido is unlabeled and attached to the bead).
- An imaging error prevented identification of Id3 in the dual-labeled experiment, but labels attached to flanking identimer segments in the chain(s) were easily resolved.
- the dsDNA identimer chain sequence that was decoded in the single label experiment was ldi-550, ld 2 -750, ld3-647, and ld 4 -488.
- the dsDNA identimer chain sequence that was decoded in the dual label experiment was Idi- 550/750, ld 2 -488/647, Id3 was not determined, and ld 4 -750/488.
- molecular barcodes comprising FND Idi- 550, FND ld 2 -750, FND ld3-647, and FND ld 4 -488 tokens (constructed by in-solution ligation of all identimers within a single ligation reaction and then captured onto beads) showed no obvious difference in performance (qualitative and quantitative assessments from various decoding experiments) when compared to FND Identimer-based molecular barcodes constructed step-wise by ligation of each individual identimer with washes in between. Therefore, whole single-label FND Identimer-based molecular barcodes built for the single bead tracking experiment (as shown in FIG.
- streptavidin beads conjugated to molecular barcodes comprising a single, pre determined sequence of labeled FND Identimers tokens were immobilized in a flow cell and imaged before and after exposure to each cleavage agent (Notl in cycle 1 , Xhol in cycle 2, and Spel in cycle 3). Following imaging of all cycles, 9 individual beads were selected for tracking, images were analyzed in reverse chronological order, values from the “previous” cycle were subtracted as described above, and the resulting intensity values were plotted for all 9 beads at each cycle.
- FIG. 22A represents the first of six I beads bearing the same single-label identimer chain sequence used to generate data from individual beads in a FOV tracked over 3 cycles of an OCS experiment.
- FIG. 22B represents the second of six I beads bearing the same single-label identimer chain sequence used to generate data from individual beads in a FOV tracked over 3 cycles of an OCS experiment.
- FIG. 22C represents the third bead bearing the same single-label identimer chain sequence used to generate data from individual beads in a FOV tracked over 3 cycles of an OCS experiment.
- FIG. 22D represents the fourth bead bearing the same single-label identimer chain sequence used to generate data from individual beads in a FOV tracked over 3 cycles of an OCS experiment.
- FIG. 22A represents the first of six I beads bearing the same single-label identimer chain sequence used to generate data from individual beads in a FOV tracked over 3 cycles of an OCS experiment.
- FIG. 22B represents the second of six I beads bearing the same single-
- FIG. 22E represents the fifth bead bearing the same single-label identimer chain sequence used to generate data from individual beads in a FOV tracked over 3 cycles of an OCS experiment.
- FIG. 22F represents the sixth bead bearing the same single-label identimer chain sequence used to generate data from individual beads in a FOV tracked over 3 cycles of an OCS experiment.
- FIG. 22 shows FND Identimer “orthogonal cleavage sequencing” (OCS) data obtained from the 9 individually tracked streptavidin beads. All streptavidin beads tracked in this experiment showed the expected (correct) order of fluorophore release upon exposure to the pre-determined order of orthogonal cleavage agents.
- FND(256) Ido was unlabeled and first attached to the streptavidin beads to act as an acceptor for ligation of FND(256) Idi (described previously herein when constructing single-label chains).
- FND(256) ldi-3 were configured to have, respectively, Hindll, Spel, Xhol recognition moieties. More specifically, FND(256) ldi-4 were first labeled with all 4 fluorophores (ATTO-488, ATTO-550, AF-647, and AF-750) by the labeling methods disclosed in Example 5. Each labeled oligonucleotide duplex was kept in separate wells of a standard 96-well plate.
- streptavidin beads were washed several times in the same wells (in which they were ligated) with a solution used for quenching the ligation reaction, and for washing away un ligated FND(256) Idi. Streptavidin beads conjugated to the four differentially-labeled FND(256) Idi were then pooled into the same tube for even mixing. Streptavidin beads were then split into 4 wells and exposed to ligation solution containing each of the 4 differentially-labeled options available for FND(256) Id2.
- FIG. 23 shows OCS sequencing reads for 3 individual streptavidin beads from the 256-member library.
- the order of fluorophore release from the 3 beads shown in FIG.23 was readily resolved in this experiment.
- these data suggest that a library of 10,000 beads could be encoded and decoded using the methods described here.
- additional individually detectable labels such as quantum dots for example
- Ring Identimer Composition and Construction [00260] As disclosed in Example 6, three identimer classes (Ring ldi-3) were constructed having scaffold portions comprising ligated rings of dsDNA (referred to herein as “Ring Identimers” or “Ring Id”) and were used to construct a set Ring Id- based molecular barcodes.
- the scaffold portions of each of Ring ldi-3 were configured to comprise one or more restriction sites and amino-modified bases for attachment to a streptavidin bead (via NHS-LC-biotin; commercially available from Sigma-Aldrich, Inx., PO Box 14508, St.
- the scaffold portion of Ring Idi comprised a Hp_Dual_NH2_Xhol oligonucleotide (SEQ ID NO: 27) and a Hp_iNH2_Xhol_bead oligonucleotide (SEQ ID NO: 28).
- the nucleotide corresponding to position 1 of SEQ ID NO: 27 was modified by 5' phosphorylation.
- Positions 14 and 20 of SEQ ID NO: 27 were Int Amino Modifier C6 dT modified nucleotides.
- Positions 34 to 37 of SEQ ID NO: 27 comprised a sticky-end configured to anneal to the sticky end of SEQ ID NO: 28.
- Positions 13 to 22 of SEQ ID NO: 27 comprised a stem-loop feature.
- the nucleotide corresponding to position 1 of SEQ ID NO: 28 was modified by 5' phosphorylation.
- Position 17 of SEQ ID NO: 28 was an Int Amino Modifier C6 dT modified nucleotide.
- Positions 34 to 37 of SEQ ID NO: 28 comprised a sticky-end configured to anneal to the sticky end of SEQ ID NO: 27.
- Positions 13 to 21 of SEQ ID NO: 27 comprised a stem loop feature.
- the scaffold portion of Ring Id2 comprised a Hp_Dual_NH2_Spel oligonucleotide (SEQ ID NO: 29) and a Hp_iNH2_Spel_bead oligonucleotide (SEQ ID NO: 30).
- the nucleotide corresponding to position 1 of SEQ ID NO: 29 was modified by 5' phosphorylation.
- Positions 14 and 20 of SEQ ID NO: 29 were Int Amino Modifier C6 dT modified nucleotides.
- Positions 34 to 37 of SEQ ID NO: 29 comprised a sticky-end configured to anneal to the sticky end of SEQ ID NO: 30.
- Positions 13 to 21 of SEQ ID NO: 29 comprised a stem loop feature.
- the nucleotide corresponding to position 1 of SEQ ID NO: 30 was modified by 5' phosphorylation.
- Position 17 of SEQ ID NO: 30 was an Int Amino Modifier C6 dT modified nucleotide.
- Positions 34 to 37 of SEQ ID NO: 30 comprised a sticky-end configured to anneal to the sticky end of SEQ ID NO: 29.
- Positions 13 to 21 of SEQ ID NO: 30 comprised a stem loop feature.
- the scaffold portion of Ring Id3 comprised a Hp_Dual_NH2_Notl oligonucleotide (SEQ ID NO: 31) and a Hp_iNH2_Notl_bead oligonucleotide (SEQ ID NO: 32).
- the nucleotide corresponding to position 1 of SEQ ID NO: 31 was modified by 5' phosphorylation.
- Positions 16 and 22 of SEQ ID NO: 31 were Int Amino Modifier C6 dT modified nucleotides.
- Positions 38 to 41 of SEQ ID NO: 31 comprised a sticky-end configured to anneal to the sticky end of SEQ ID NO: 32.
- Positions 13 to 21 of SEQ ID NO: 31 comprised a stem loop feature.
- the nucleotide corresponding to position 1 of SEQ ID NO: 32 was modified by 5' phosphorylation.
- Position 17 of SEQ ID NO: 32 was an Int Amino Modifier C6 dT modified nucleotide.
- Positions 34 to 37 of SEQ ID NO: 32 comprised a sticky-end configured to anneal to the sticky end of SEQ ID NO: 31.
- Positions 13 to 21 of SEQ ID NO: 32 comprised a stem loop feature.
- a set of Hp_iNH2_Xhol_bead oligonucleotides, a set of Hp_iNH2_Spel_bead oligonucleotides and, a set of Hp_iNH2_Notl_bead oligonucleotide were synthesized in preparation for conjugation to a set of streptavidin beads.
- Hp_iNH2 oligonucleotides were combined with amine-reactive biotin (NHS-LC-biotin) to conjugate the biotin to the Hp_iNH2 oligonucleotides at their free amines (i.e., at nucleotides corresponding to position 17 for all three HPJNH2 oligonucleotides.
- Each set of the biotinylated HPJNH2 oligonucleotides was suspended to final concentration of 200 nM.
- Hp_Dual_NH2_Xhol oligonucleotides a set of Hp_Dual_NH2_Spel oligonucleotides, and a set of Hp_Dual_NH2_Notl oligonucleotides (referred to collectively as “Hp_Dual_NH2 oligonucleotides”) were synthesized and were labeled with fluorophores in preparation for ligation to their respective Hp_Dual_NH2 oligonucleotides.
- Amine-reactive ATTO-550 (NHS-ATTO- 550) was combined with the set of Hp_Dual_NH2_Xhol oligonucleotides to dually label them at the nucleotides corresponding to positions 14 and 20.
- Amine-reactive ATTO-647 (NHS-ATTO-647) was combined with the set of Hp_Dual_NH2_Spel oligonucleotides to dually label them at the nucleotides corresponding to positions 14 and 20.
- Amine-reactive ATTO-488 (NHS-ATTO-488) was combined with the set of Hp_Dual_NH2_Notl oligonucleotides to dually label them at the nucleotides corresponding to positions 16 and 22.
- Each set of the labeled Hp_Dual_NH2 oligonucleotides was suspended to final concentration of 200 nM.
- the set of streptavidin beads was washed to remove unbound oligonucleotides (3X with 200mI2C hybridization buffer supplemented with 0.01% Tween-20, then washed twice with 200mI1C hybridization buffer), and then washed two more times with 100mIo ⁇ 1X ligation buffer provided by NEB, resuspended in 50mI of ligation buffer, and sonicated for 3 minutes to break up any clumps of beads prior to ligation reactions.
- Ligation reactions were carried out as described previously herein, using 0.2mg/ml of streptavidin beads coated with members from all three sets of HPJNH2 oligonucleotides and introducing each set of Hp_Dual_NH2 oligonucleotides in a separate round of ligation.
- the set of streptavidin beads was ligated: firstly to the set of Hp_Dual_NH2_Xhol oligonucleotides via stick-end ligation to bead-conjugated Hp_iNH2_Xhol_bead oligonucleotides in a first ligation cycle; secondly to the set of Hp_Dual_NH2_Spel oligonucleotides via sticky-end ligation to bead-conjugated Hp_iNH2_Spel_bead oligonucleotides in a second ligation cycle; and then thirdly to the set of Hp_Dual_NH2_Notl oligonucleotides via Hp_iNH2_Notl_bead oligonucleotides in a third ligation cycle to complete the construction of Ring ldi-3 and form the set of a set Ring Id-based molecular barcodes.
- the streptavidin beads were washed 3X with 200mI2C hybridization buffer supplemented with 0.01% Tween-20, 2X with 200mI1C hybridization buffer, resuspended in 20mI of 1X hybridization buffer, sonicated for 3 minutes, and loaded into a flow cell containing a biotin-modified surface for immobilization and subsequent cleavage experiments.
- OCS-compatible streptavidin bead libraries may be used for the co-encoding of other molecules whose components are coordinated with the OCS code.
- nucleic acid capture oligonucleotides used in NGS workflows (as described herein, or as a separate molecule attached to the same bead), and this can be accomplished by coordinate ligation.
- identimer configurations that can withstand synthetic chemical reactions are useful to enable encoding of chemical libraries.
- OCS- compatible libraries can be constructed from scaffolds portions comprised of polymers other than DNA to address this (as described previously herein). Skilled persons will understand DNA is largely susceptible to degradation when its linear 5’- and 3’-ends exposed and is therefore more protected from degradation by exposure to some chemical reactions if circularized.
- fluorescent ssDNA hairpin identimers comprising two differential labels separated by an enzyme-accessible Spel cleavage site were constructed (FIG. 24A) by ligation of a labeled and biotinylated dsDNA acceptor with a dual- labeled hairpin by methods previously disclosed herein. Images were acquired from streptavidin beads (immobilized in a flow cell as described previously) that were coated FSH Identimers in both fluorescence channels, before and after (FIG. 24B) exposure to 300U of Spel in 1X CutSmart Buffer (NEB).
- hairpin structure designed here can function as an identimer of two segments, allowing for combinations of different hairpins containing different combinations of restriction sites and visually-discernable labels to be used in the construction and generation (encoding) of OCS-compatible libraries.
- HairPin ldi- 3 three identimer classes (HairPin ldi- 3 ) were constructed having scaffold portions comprising a stem duplex portion and a loop portion (referred to herein as “HairPin Identimers”) and were used to construct a set HairPin Id-based molecular barcodes.
- the stem duplex portion of each of HairPin ldi- 3 comprised first and second oligonucleotides configured to hybridize and form a dsDNA duplex having a free sticky-end available for ligation.
- the first and second oligonucleotides comprised first and second restriction endonuclease recognition portions (referred to herein, respectively, as “RE1 Site” and “RE2 Site”).
- the first oligonucleotide of each of HairPin ldi- 3 was configured to have a 5AmMC6 modified nucleotide available for biotinylating the duplex.
- the second oligonucleotide of each of HairPin ldi- 3 was configured to have an internal iAmMC6T modified nucleotide available for fluorescent labeling by a NHS fluorophore.
- each loop portion of HairPin ldi- 3 comprised a ssDNA Hp_Dual_NH2 oligonucleotide (SEQ ID NO: 39) configured to form a dsDNA duplex with having a step-loop structure.
- the nucleotide corresponding to position 1 of SEQ ID NO: 39 was modified by 5' phosphorylation.
- Positions 14 and 20 of SEQ ID NO: 39 were Int Amino Modifier C6 dT modified oligonucleotides.
- Positions 13 to 21 comprise a stem-loop structure.
- Each loop portion was configured to have one or more iAmMC6T modified nucleotides at positions within the stem-loop structure available for fluorescent labeling by a NHS fluorophore.
- the scaffold portions of each of HairPin ldi- 3 were configured to comprise one or more recognition moieties and amino-modified bases for attachment to a streptavidin bead (via NHS-LC-biotin; commercially available from Sigma-Aldrich, Inx., PO Box 14508, St. Louis, MO 68178, USA; CAS No.: 35013-72-0), and NHS-fluorophores (available commercially from ThermoFisher Scientific, Inc. 168 Third Avenue, Waltham, MA 02451, USA;
- the first oligonucleotide of the a stem duplex portion of HairPin Idi comprised an Ab_stem1_Spel_Xhol oligonucleotide (SEQ ID NO: 33).
- the nucleotide corresponding to position 1 of SEQ ID NO: 33 was an Amino Modifier C6 modified nucleotide.
- Positions 17 to 22 and 31 to 36 of SEQ ID NO: 33 comprised, respectively, a Spel recognition moiety and a Xhol recognition moiety.
- Positions 41 to 44 of SEQ ID NO: 33 comprised a sticky-end configured to anneal to the sticky end of SEQ ID NO: 39.
- the second oligonucleotide of the a stem duplex portion of HairPin Idi comprised an Ab-stem1_ comp oligonucleotide (SEQ ID NO: 34).
- the nucleotide corresponding to position 1 of SEQ ID NO: 34 was modified by 5' phosphorylation.
- Position 14 of SEQ ID NO: 34 was an Int Amino Modifier C6 dT modified nucleotide.
- Positions 5 to 10 and 18 to 23 of SEQ ID NO: 28 comprised, respectively, a Xhol recognition moiety and a Spel recognition moiety.
- the first oligonucleotide of the a stem duplex portion of HairPin Id2 comprised an Ab_stem2_Hindlll_Spel oligonucleotide (SEQ ID NO: 35).
- the nucleotide corresponding to position 1 of SEQ ID NO: 35 was modified by Amino Modifier C6.
- Positions 16 to 21 and 30 to 35 of SEQ ID NO: 35 comprised, respectively, a Hindi 11 recognition moiety and a Spel recognition moiety.
- Positions 40 to 43 of SEQ ID NO: 35 comprised a sticky-end configured to anneal to the sticky end of SEQ ID NO: 39.
- the second oligonucleotide of the a stem duplex portion of HairPin Id2 comprised an Ab-stem2_comp oligonucleotide (SEQ ID NO: 36).
- the nucleotide corresponding to position 1 of SEQ ID NO: 36 was modified by 5' phosphorylation.
- Position 14 of SEQ ID NO: 36 was an Int Amino Modifier C6 dT modified nucleotide.
- Positions 5 to 10 and 18 to 23 of SEQ ID NO: 36 comprised, respectively, a Hindi 11 recognition moiety and a Spel recognition moiety.
- the first oligonucleotide of the a stem duplex portion of HairPin Id3 comprised an Ab_stem3_EcoRI_Hindlll oligonucleotide (SEQ ID NO: 37).
- the nucleotide corresponding to position 1 of SEQ ID NO: 37 was by modified by Amino Modifier C6.
- Positions 16 to 21 and 30 to 35 of SEQ ID NO: 37 comprised, respectively, a EcoRI recognition moiety and a Hindi 11 recognition moiety.
- Positions 40 to 43 of SEQ ID NO: 33 comprised a sticky-end configured to anneal to the sticky end of SEQ ID NO: 39.
- the second oligonucleotide of the a stem duplex portion of HairPin Id3 comprised an Ab-stem3_comp oligonucleotide (SEQ ID NO: 38).
- the nucleotide corresponding to position 1 of SEQ ID NO: 38 was modified by 5' phosphorylation.
- Position 14 of SEQ ID NO: 34 was an Int Amino Modifier C6 dT modified nucleotide.
- Positions 5 to 10 and 18 to 23 of SEQ ID NO: 28 comprised, respectively, a Hindi 11 recognition moiety and a EcoRI recognition moiety.
- oligonucleotides were ordered from IDT as either purified oligos (HPLC), or as standard desalted oligos. Standard desalted oligonucleotides containing 5’-amino modifications were first desalted or precipitated to remove any excess free amine carried over from synthesis. The 5’-amino group of the stem oligonucleotides (200uM) was modified with an appropriate chemistry for attachment to MyOne T1 streptavidin beads by adding NHS-LC-Biotin resuspended in anhydrous DMF (commercially available from Sigma-Aldrich, Inx., PO Box 14508, St.
- Ab_stem1_Spel_Xhol was biotinylated
- Ab_stem1_comp was labeled with NHS-AF-647
- the two oligonucleotides were annealed at a 1 :1.2 molar ratio (respectively; 10pMbiotinylated oligonucleotide: 12pMlabeled oligonucleotide) in 1X hybridaztion buffer by heating for 5 minutes at 90°C, and slowly cooling to RT over about 30 minutes in a heat block.
- the in solution ligation was carried out by mixing the annealed duplex at 200nM in 1X T4 DNA Ligase buffer (NEB) with AF750-labeled HP_Dual_NH2 oligonucleotide at a final concentration of 400nM, with 5mI T4 PNK (NEB) and 5mI of T4 DNA ligase (NEB) in a 250mI ligation reaction for 30 minutes at 37°C. Ligations were captured directly onto MyOne T1 streptavidin beads pre-washed as described above (2 washes with 2X Hybridization buffer) prior to binding with biotinylated oligonucleotides in the ligation reaction.
- Binding was initiated by mixing 250plof washed beads at 0.2mg/ml in 2X Hybridization buffer with the 250plligation reaction, and was allowed to proceed at 37°C for 30 minutes. Beads were then washed 3X with 200 mI 2X Hybridization buffer supplemented with 0.01% Tween-20, 2X with 200mI1C Hybridization buffer, resuspended in 20plof 1X Hybridization buffer, sonicated for 3 minutes, and loaded into a flow cell containing a biotin-modified surface for immobilization and subsequent cleavage experiments.
- Hyb Id-based molecular barcodes were formed and encoded from three identimer classes (Hyb ldi-3), in which each identimer class comprised scaffold portions having a first hybridizing oligonucleotide and a second hybridizing oligonucleotide, the hybridizing nucleotides configured to hybridize to each other and form a biotinylated, fluorescently labeled, dsDNA duplex having one or more orthogonal cleavage sites (referred to collectively as “Hybridization Identimers” or “Hyb Id”) that were used to construct a set of Hyb Id- based molecular barcodes.
- Hyb Identimers Hyb Id
- the scaffold portions of Hyb ldi-3 comprised, respectfully, a Xhol recognition moiety, a Spel recognition moiety, and a Notl recognition moiety.
- the scaffold portion of Hyb Idi comprised a Hyb_5NH2_Xhol_bead first hybridizing oligonucleotide (SEQ ID NO: 40) (also referred to herein as “Hyb_5NH2_Xhol_bead oligonucleotide”) and a Hyb_5NH2_Xhol_comp second hybridizing oligonucleotide (SEQ ID NO: 41) (also referred to herein as “Hyb_5NH2_Xhol_comp oligonucleotide”).
- the nucleotide corresponding to position 1 of SEQ ID NO: 40 was Amino Modifier C6 modified. Positions 22 to 27 of SEQ ID NO: 40 comprised a Xhol recognition moiety (also referred to in Example 9 as “RE3 site”). The nucleotide corresponding to position 1 of SEQ ID NO: 41 was Amino Modifier C6 modified. Positions 8 to 13 of SEQ ID NO: 41 comprised a Xhol recognition moiety.
- Hyb_5NH2_Spel_bead first hybridizing oligonucleotide SEQ ID NO: 42
- Hyb_Spel_Xhol_bead oligonucleotide also referred as “Hyb_Spel_Xhol_bead oligonucleotide”
- Hyb_5NH2_Spel_comp second hybridizing oligonucleotide SEQ ID NO: 43
- the nucleotide corresponding to position 1 of SEQ ID NO: 42 was Amino Modifier C6 modified.
- Positions 21 to 26 of SEQ ID NO: 42 comprised a Spel recognition moiety (also referred to in Example 9 as “RE2 site”). The.
- the nucleotide corresponding to position 1 of SEQ ID NO: 43 was Amino Modifier C6 modified.
- the scaffold portion of Hyb Id3 comprised a Hyb_5NH2_Notl_bead first hybridizing oligonucleotide (SEQ ID NO: 44) (also referred to herein as “Hyb_5NH2_Xhol_bead oligonucleotide”) and a Hyb_5NH2_Xhol_comp second hybridizing oligonucleotide (SEQ ID NO: 45) (also referred to herein as “Hyb_5NH2_Xhol_comp oligonucleotide”).
- the nucleotide corresponding to position 1 of SEQ ID NO: 44 was Amino Modifier C6 modified.
- Positions 22 to 27 of SEQ ID NO: 44 comprised a Notl recognition moiety (also referred to in Example 9 as “RE3 site”).
- the nucleotide corresponding to position 1 of SEQ ID NO: 41 was Amino Modifier C6 modified.
- Amine-reactive ATTO-550 (NHS-ATTO-550) was combined with a set of Hyb_5NH2_Xhol_comp oligonucleotides to label them at their 5’ free amines (i.e., the nucleotide corresponding to position 1 of SEQ ID NO: 41).
- Amine-reactive AF- 647 (NHS-AF-647) was combined with a set of Hyb_5NH2_Spel_comp oligonucleotides to label them at their 5’ free amines (i.e., the nucleotide corresponding to position 1 of SEQ ID NO: 43).
- Amine-reactive ATTO-488 (NHS- ATTO-488) was combined with a set of Hyb_5NH2_Notl_comp oligonucleotides to label them at their 5’ free amines (i.e., the nucleotide corresponding to position 1 of SEQ ID NO: 45).
- Hyb_5NH2_Xhol_comp oligonucleotides were biotinylated as described already herein, and all three were added to MyOne T1 streptavidin beads for coating.
- the comp oligonucleotides were labeled with various NHS-fluorophore reagents as described above.
- one labeled hybridizing oligonucleotide was introduced at a time during splitting and pooling cycles. These labeled hybridizing oligonucleotides were introduced at a concentration of 200nM during each encoding cycle, in 0.5X Hybridization buffer. Labeled strands were incubated with beads for 30 minutes at 37°C with occasional tapping of tubes to promote capture.
- beads were washed 3X with 200mI2C Hybridization buffer supplemented with 0.01% Tween-20, 2X with 200mI1C Hybridization buffer, resuspended in 20plof 1X Hybridization buffer, sonicated for 3 minutes, and loaded into a flow cell containing a biotin-modified surface for immobilization and subsequent cleavage experiments.
- Amine-reactive ATTO-550 (NHS-ATTO-550) was combined with a set of Hyb_5NH2_Xhol_comp oligonucleotides to label them at their 5’ free amines (i.e., the nucleotide corresponding to position 1 of SEQ ID NO: 41).
- Amine-reactive AF- 647 (NHS-AF-647) was combined with a set of Hyb_5NH2_Spel_comp oligonucleotides to label them at their 5’ free amines (i.e., the nucleotide corresponding to position 1 of SEQ ID NO: 43).
- Amine-reactive ATTO-488 (NHS- ATTO-488) was combined with a set of Hyb_5NH2_Notl_comp oligonucleotides to label them at their 5’ free amines (i.e., the nucleotide corresponding to position 1 of SEQ ID NO: 45). End of Example 9.
- methods for imaging may be limited by the dynamic range of the imaging system (see Weissleder et al., IEEE J. Sel. Top Quantum Electron; Jan-Feb; 25(1):6801507 (2019)).
- some streptavidin beads will have multiple copies of the same fluorophore, and any streptavidin beads found to saturate signal in any of the fluorescence channels may be difficult to resolve.
- use of high dynamic range imaging will improve upon the library diversity that one could construct using the compositions and methods disclosed approach. For example, three images are acquired for each experimental data point: one taken at low exposure, one taken at mid-exposure, and one taken at a higher exposure.
- streptavidin beads containing very few copies of a fluorophore can be imaged in the same field of view (experiment) as streptavidin beads that contain many copies of that fluorophore.
- Streptavidin beads with very few copies of a fluorophore need a higher exposure to get their values up into a range where they can be accurately quantified, and beads with many copies of a fluorophore need a lower exposure to get their values down below saturation, and into a range where they can be accurately quantified.
- use of high dynamic range imaging allows for streptavidin beads of greater deviation into the same experiment to increase the diversity of libraries constructed with the compositions and methods disclosed herein.
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Biochemistry (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Plant Pathology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Crystallography & Structural Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Medicinal Chemistry (AREA)
- General Chemical & Material Sciences (AREA)
- Analytical Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Immunology (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Saccharide Compounds (AREA)
- Addition Polymer Or Copolymer, Post-Treatments, Or Chemical Modifications (AREA)
- Materials For Medical Uses (AREA)
- Sampling And Sample Adjustment (AREA)
- Nitrogen And Oxygen Or Sulfur-Condensed Heterocyclic Ring Systems (AREA)
- Apparatus Associated With Microorganisms And Enzymes (AREA)
- Peptides Or Proteins (AREA)
Priority Applications (7)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP22764204.8A EP4301909A4 (en) | 2021-03-04 | 2022-03-04 | MOLECULAR BARCODES AND RELATED METHODS AND SYSTEMS |
| JP2023553204A JP2024509822A (ja) | 2021-03-04 | 2022-03-04 | 分子バーコードならびに関連する方法およびシステム |
| CN202280029417.3A CN118202097A (zh) | 2021-03-04 | 2022-03-04 | 分子条形码以及相关方法和系统 |
| US18/548,864 US20240158784A1 (en) | 2021-03-04 | 2022-03-04 | Molecular barcodes and related methods and systems |
| AU2022230449A AU2022230449A1 (en) | 2021-03-04 | 2022-03-04 | Molecular barcodes and related methods and systems |
| CA3209941A CA3209941A1 (en) | 2021-03-04 | 2022-03-04 | Molecular barcodes and related methods and systems |
| KR1020237033127A KR20230165231A (ko) | 2021-03-04 | 2022-03-04 | 분자 바코드 및 관련 방법 및 시스템 |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202163156858P | 2021-03-04 | 2021-03-04 | |
| US63/156,858 | 2021-03-04 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| WO2022187719A1 true WO2022187719A1 (en) | 2022-09-09 |
| WO2022187719A9 WO2022187719A9 (en) | 2024-04-11 |
Family
ID=83154643
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/US2022/019045 Ceased WO2022187719A1 (en) | 2021-03-04 | 2022-03-04 | Molecular barcodes and related methods and systems |
Country Status (8)
| Country | Link |
|---|---|
| US (1) | US20240158784A1 (https=) |
| EP (1) | EP4301909A4 (https=) |
| JP (1) | JP2024509822A (https=) |
| KR (1) | KR20230165231A (https=) |
| CN (1) | CN118202097A (https=) |
| AU (1) | AU2022230449A1 (https=) |
| CA (1) | CA3209941A1 (https=) |
| WO (1) | WO2022187719A1 (https=) |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2024206728A1 (en) * | 2023-03-28 | 2024-10-03 | Oregon Health & Science University | Orthogonal cleavage ligation sequencing (ocls) |
| WO2024182571A3 (en) * | 2023-02-28 | 2024-10-24 | Oregon Health & Science University | Compositions and methods for molecular barcoding |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20050064435A1 (en) * | 2003-09-24 | 2005-03-24 | Xing Su | Programmable molecular barcodes |
| US20170337459A1 (en) * | 2015-02-27 | 2017-11-23 | Cellular Research, Inc. | Spatially addressable molecular barcoding |
| WO2018087539A1 (en) * | 2016-11-08 | 2018-05-17 | Bactevo Ltd | Tagless encoded chemical library |
| US20200190583A1 (en) * | 2018-12-14 | 2020-06-18 | Ultivue, Inc. | Methods and Compositions for Sequentially Detecting Targets |
-
2022
- 2022-03-04 EP EP22764204.8A patent/EP4301909A4/en active Pending
- 2022-03-04 AU AU2022230449A patent/AU2022230449A1/en active Pending
- 2022-03-04 WO PCT/US2022/019045 patent/WO2022187719A1/en not_active Ceased
- 2022-03-04 CA CA3209941A patent/CA3209941A1/en active Pending
- 2022-03-04 KR KR1020237033127A patent/KR20230165231A/ko active Pending
- 2022-03-04 US US18/548,864 patent/US20240158784A1/en active Pending
- 2022-03-04 CN CN202280029417.3A patent/CN118202097A/zh active Pending
- 2022-03-04 JP JP2023553204A patent/JP2024509822A/ja active Pending
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20050064435A1 (en) * | 2003-09-24 | 2005-03-24 | Xing Su | Programmable molecular barcodes |
| US20170337459A1 (en) * | 2015-02-27 | 2017-11-23 | Cellular Research, Inc. | Spatially addressable molecular barcoding |
| WO2018087539A1 (en) * | 2016-11-08 | 2018-05-17 | Bactevo Ltd | Tagless encoded chemical library |
| US20200190583A1 (en) * | 2018-12-14 | 2020-06-18 | Ultivue, Inc. | Methods and Compositions for Sequentially Detecting Targets |
Non-Patent Citations (4)
| Title |
|---|
| ERIC J DEVOR, BEHLKE MARK A: "Strategies for Attaching Oligonucleotides to Solid Supports", INTEGRATED DNA TECHNOLOGIES, 1 January 2005 (2005-01-01), XP055294450, Retrieved from the Internet <URL:http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.93.1787&rep=rep1&type=pdf> [retrieved on 20160809] * |
| NANDA ET AL., METHODS ENZYMOL., vol. 536, 2014, pages 87 - 94 |
| See also references of EP4301909A4 |
| WEISSLEDER ET AL., IEEE J. SEL. TOP QUANTUM ELECTRON, vol. 25, no. 35013-72-0, 2019 |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2024182571A3 (en) * | 2023-02-28 | 2024-10-24 | Oregon Health & Science University | Compositions and methods for molecular barcoding |
| WO2024206728A1 (en) * | 2023-03-28 | 2024-10-03 | Oregon Health & Science University | Orthogonal cleavage ligation sequencing (ocls) |
Also Published As
| Publication number | Publication date |
|---|---|
| EP4301909A4 (en) | 2025-02-26 |
| JP2024509822A (ja) | 2024-03-05 |
| AU2022230449A2 (en) | 2023-09-28 |
| CN118202097A (zh) | 2024-06-14 |
| EP4301909A1 (en) | 2024-01-10 |
| KR20230165231A (ko) | 2023-12-05 |
| US20240158784A1 (en) | 2024-05-16 |
| WO2022187719A9 (en) | 2024-04-11 |
| AU2022230449A1 (en) | 2023-09-07 |
| CA3209941A1 (en) | 2022-09-09 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11713485B2 (en) | Hybridization chain reaction methods for in situ molecular detection | |
| KR102091207B1 (ko) | Dna―코딩된 라이브러리의 생성 및 스크리닝 방법 | |
| US20240158784A1 (en) | Molecular barcodes and related methods and systems | |
| JP2024509822A5 (https=) | ||
| HK1253711B (zh) | 产生和筛选dna编码的库的方法 | |
| HK40067204A (zh) | 检测目标核酸分子的方法 | |
| HK40067204B (zh) | 检测目标核酸分子的方法 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 22764204 Country of ref document: EP Kind code of ref document: A1 |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2022230449 Country of ref document: AU |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 3209941 Country of ref document: CA |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2023553204 Country of ref document: JP |
|
| ENP | Entry into the national phase |
Ref document number: 2022230449 Country of ref document: AU Date of ref document: 20220304 Kind code of ref document: A |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2022764204 Country of ref document: EP |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| ENP | Entry into the national phase |
Ref document number: 2022764204 Country of ref document: EP Effective date: 20231004 |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 202280029417.3 Country of ref document: CN |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 11202500745U Country of ref document: SG |
|
| WWP | Wipo information: published in national office |
Ref document number: 11202500745U Country of ref document: SG |