WO2018013723A1

WO2018013723A1 - Single-cell transcript sequencing

Info

Publication number: WO2018013723A1
Application number: PCT/US2017/041770
Authority: WO
Inventors: Kenneth J. Livak; Richard A. Fekete
Original assignee: Fluidigm Corporation
Priority date: 2016-07-14
Filing date: 2017-07-12
Publication date: 2018-01-18
Also published as: CN109661474A; SG11201811048UA; EP3485043A1; EP3485043A4; CA3027423A1; WO2018013723A9

Abstract

Described herein are methods for preparing DNA templates for single-cell transcript sequencing of RNA from a population of cells. The methods entail distributing cells from the population into separate reaction volumes so that a plurality of separate reaction volumes each contain a single, isolated cell, wherein the cells have been treated with a fixative prior to distribution. The isolated cells are then permeabilized or disrupted, and cDNA is prepared by reverse transcript, followed by amplification. Also provided is a novel chemistry for efficient production of DNA templates from T-cell receptors or immunoglobulins in single cells.

Description

SINGLE-CELL TRANSCRIPT SEQUENCING

CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] This application claims the benefit of and priority to U.S. Provisional Patent Application Serial Numbers 62/362,425, filed July 14, 2016, and 62/463,487, filed February 24, 2017, both of which are incorporated herein by reference in their entireties for all purposes.

STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT

[0002] Not applicable.

FIELD

[0003] The subject matter disclosed herein relates to generally to the area of analysis of single cells. In particular, the subject matter relates to methods and

compositions for performing single-cell transcript sequencing. BACKGROUND

[0004] The ability to sequence mRNA from single eukaryotic cells is providing new information about the scope of cellular heterogeneity and the nature of cell identity. Single cells can be isolated, e.g., by micromanipulation, laser microdissection, fluorescence-activated cell sorting, and in microfluidic devices. Methods for preparing sequencing libraries from single-cell mRNA have been developed. The two main classes of methods are whole transcript and end-tagging. In the whole transcript approach, each library contains fragments obtained from the full length of the transcript; in the end- tagging approach, each library contains fragments from just one end of the transcript. Whole transcript libraries provide more information, but end-tagging offers advantages in terms of workflow and quantification. Primary processing of the sequencing data is generally similar to the processing used in bulk mRNA sequencing.

[0005] Single-cell transcript of sequencing is of particular interest in the context of determining RNA sequences encoding the T-cell receptor (TCR) or immunoglobulin produced by a particular cell. For example, each T lymphocyte expresses TCR that binds antigen. Antigen binding is a key event in the cascade of the immune response. Each TCR is a heterodimer consisting of a- and β-chains encoded by the TRA and TRB genes, respectively. Diversity in antigen recognition is generated by somatic V(D)J

recombination occurring at the TRA and TRB loci. At single-cell resolution, determining the sequence of an appropriate portion of TRA and TRB serves as a unique identifier of a T cell's ancestry because it is likely that any two T cells expressing the same TCRaP pair arose from a common T-cell clone. Because immunoglobulin genes are organized similarly to TCR genes and also generate diversity by somatic V(D)J recombination, immunoglobulin RNA sequences can likewise be used to identify a B cell's ancestry.

SUMMARY

[0006] Various embodiments contemplated herein may include, but need not be limited to, one or more of the following:

[0007] Embodiment 1 : A method for preparing DNA templates for single-cell transcript sequencing of RNA from a population of cells, the method including:

distributing cells from the population into separate reaction volumes so that a plurality of separate reaction volumes each comprise a single cell, wherein the cells have been treated with a fixative prior to distribution, wherein the fixative includes a cell-permeant fixative that enables the production of DNA templates from transcripts derived from single cells with greater efficiency than in the absence of said cell-permeant fixative; permeabilizing or disrupting each cell in each separate reaction volume; reverse transcribing cDNA from RNA in each separate reaction volume; and amplifying the cDNA to produce DNA templates, wherein the amplification incorporates one or more nucleotide sequences that facilitate DNA sequencing of the DNA templates. [0008] Embodiment 2: The method of embodiment 1, wherein the fixative stabilizes the cell nucleus and/or stabilizes RNA.

[0009] Embodiment 3 : The method of embodiment 1, wherein the method includes the prior treatment of the cells with the fixative.

[0010] Embodiment 4: The method of embodiments 1 or 3, wherein the fixative includes biomarker and histology preservative (BHP). [0011] Embodiment 5: The method of embodiments 1 or 3, wherein the fixative includes dithiobis(succinimydal proprionate) (DSP).

[0012] Embodiment 6: The method of any of embodiments 1-5, wherein the DNA templates are recovered from the separate reaction volumes in one or more pools of DNA templates.

[0013] Embodiment 7: The method of any of embodiments 1-6, wherein the DNA templates are further amplified after recovery.

[0014] Embodiment 8: The method of any of embodiments 1-7, wherein the method additionally includes subjecting the DNA templates to DNA sequencing. [0015] Embodiment 9: The method of any of embodiments 1-8, wherein the method includes preparing DNA templates for single-cell transcript sequencing of T-cell receptor or immunoglobulin RNA from the population, wherein: the cells comprise T cells or B cells; and the DNA templates are generated from T-cell receptor or

immunoglobulin RNA, respectively. [0016] Embodiment 10: The method of embodiment 9, wherein the cells are T cells.

[0017] Embodiment 11 : The method of embodiment 9, wherein the cells are B cells.

[0018] Embodiment 12: The method of embodiments 10 or 11, wherein the cells are activated.

[0019] Embodiment 13 : The method of any of embodiments 1-9, wherein the separate reaction volumes comprise separate capture sites in a microfluidic device.

[0020] Embodiment 14: The method of embodiment 13, including providing reverse transcription reagents to each capture site, wherein the reverse transcription reagents comprise: a first reverse transcription (RT) primer having a portion specific for the Constant segment (CS) of an RNA encoding a first chain of a T-cell receptor or immunoglobulin, the T-cell receptor or immunoglobulin also including a second chain; a second reverse transcription (RT) primer having a portion specific for the Constant segment (CS) of an RNA encoding the second chain of the T-cell receptor or

immunoglobulin, respectively; wherein the first and second RT primers each additionally comprise a first nucleotide tag including a first primer binding site for a first DNA sequencing primer, the first nucleotide tag being 5' of the CS-specific portion; and a first barcode primer that includes, from 3' to 5', a portion specific for the first primer binding site, a first barcode nucleotide sequence, and a first sequencing adaptor.

[0021] Embodiment 15: The method of embodiment 14, wherein the first barcode primer additionally includes nucleotide sequences that form a stem-duplex, whereby the first barcode primer has a hairpin configuration during reverse transcription and a linear configuration during amplification.

[0022] Embodiment 16: The method of embodiment 15, wherein the first barcode primer is provided to each capture site before permeabilizing or disrupting each cell in each separate reaction volume.

[0023] Embodiment 17: The method of embodiments 15 or 16, wherein the reverse transcription reagents additionally comprise a second barcode primer that includes, from 3' to 5', a portion specific for a second primer binding site for a second DNA sequencing primer, a second barcode nucleotide sequence, and a second sequencing adaptor.

[0024] Embodiment 18: The method of embodiment 17, wherein the second barcode primer is provided to each capture site with one or more reagents that

permeabilize or disrupt each cell in each separate reaction volume.

[0025] Embodiment 19: The method of embodiments 17 or 18, wherein the second barcode primer additionally includes nucleotide sequences that form a stem- duplex, whereby the second barcode primer has a hairpin configuration during reverse transcription and a linear configuration during amplification.

[0026] Embodiement 20: The method of any of embodiments 15-19, including providing amplification reagents to each capture site, wherein the amplification reagents are provided separately from the reverse transcription reagents, and the amplification reagents comprise: a plurality of first amplification primers, each having a portion specific for a different Variable segment (VS) of the RNA encoding the first chain of the T-cell receptor or immunoglobulin; a plurality of second amplification primers, each having a portion specific for a different Variable segment (VS) of the RNA encoding the second chain of the T-cell receptor or immunoglobulin; wherein the first and second amplification primers each additionally comprise a second nucleotide tag including the second primer binding site, the second nucleotide tag being 5' of the VS-specific portion; wherein the first and second amplification primers each comprise nucleotide sequences that form a stem-duplex, and wherein the first and second amplification primers each has a linear configuration during amplification.

[0027] Embodiment 21 : The method of any of embodiments 1-20, wherein said amplifying the cDNA to produce DNA templates includes polymerase chain reaction

(PCR), which includes multiple cycles of denaturation, annealing, and elongation, wherein the PCR includes reducing the reaction temperature to rehydrate the reaction volume after two or more cycles of PCR.

[0028] Embodiment 22: The method of embodiment 21, wherein each of the first and second amplification primers has a hairpin configuration during rehydration and a linear configuration during denaturation, annealing, and extension.

[0029] Embodiment 23 : The method of any of embodiments 14-22, wherein the reverse transcription reagents additionally comprise a third reverse transcription (RT) primer having an oligo-dT sequence and a third nucleotide tag including the first primer binding site, the third nucleotide tag being 5' of the oligo-dT sequence.

[0030] Embodiment 24: The method of embodiment 23, wherein the third RT primer additionally includes a unique molecular identifier (UMI) between the oligo-dT sequence and the first primer binding site.

[0031] Embodiment 25: The method of any of embodiments 20-24, wherein the amplification reagents comprise a third amplification primer including a portion specific for a particular target RNA and the second nucleotide tag including the second primer binding site, the second nucleotide tag being 5' of the target-specific portion, wherein the third amplification primer includes nucleotide sequences that form a stem-duplex, wherein the third amplification primer has a linear configuration during amplification. [0032] Embodiment 26: The method of embodiment 14, including providing amplification reagents to each capture site, wherein the amplification reagents are provided separately from the reverse transcription reagents, and the amplification reagents comprise: a plurality of first amplification primers, each having a portion specific for a different Variable segment (VS) of the RNA encoding the first chain of the T-cell receptor or immunoglobulin; a plurality of second amplification primers, each having a portion specific for a different Variable segment (VS) of the RNA encoding the second chain of the T-cell receptor or immunoglobulin, respectively; wherein the first and second amplification primers each additionally comprise a second nucleotide tag including a second primer binding site for a second DNA sequencing primer, the second nucleotide tag being 5' of the VS-specific portion; and a second barcode primer that includes, from 3' to 5', a portion specific for the second primer binding site, a second barcode nucleotide sequence, and a second sequencing adaptor.

[0033] Embodiment 27: The method of embodiment 26, wherein the reverse transcription reagents additionally comprise a third reverse transcription (RT) primer having an oligo-dT sequence and a third nucleotide tag including the first primer binding site, the third nucleotide tag being 5' of the oligo-dT sequence. [0034] Embodiment 28: The method of embodiment 27, wherein the third RT primer additionally includes a unique molecular identifier (UMI) between the oligo-dT sequence and the first primer binding site.

[0035] Embodiment 29: The method of any of embodiments 26-28, wherein the amplification reagents comprise a third amplification primer including a portion specific for a particular target RNA and the second nucleotide tag including the second primer binding site, the second nucleotide tag being 5' of the target-specific portion.

[0036] Embodiment 30: The method of embodiment 29, wherein the target- specific portion of the third amplification primer includes an insert that does not anneal to the target RNA, wherein the insert is flanked by a 5' anchor sequence and a 3' bait sequence, both of which anneal to the target RNA.

[0037] Embodiment 31 : The method of embodiment 14, wherein the reverse transcription reagents additionally comprise a 5' oligonucleotide including, from 5' to 3', the second primer binding site, and an oligo-riboG sequence.

[0038] Embodiment 32: The method of embodiment 31, wherein the 5' oligonucleotide additionally includes a unique molecular identifier (UMI) between the second primer binding site and the oligo-riboG sequence.

[0039] Embodiment 33 : The method of embodiment 32, including providing amplification reagents to each capture site, wherein the amplification reagents are provided separately from the reverse transcription reagents, and the amplification reagents comprise a second barcode primer that includes, from 3' to 5', a portion specific for the second primer binding site, a second barcode nucleotide sequence, and a second sequencing adaptor. [0040] Embodiment 34: The method of any of embodiments 14-33, wherein the method produces DNA templates for T-cell receptor chains.

[0041] Embodiment 35: The method of any of embodiments 14-33, wherein the method produces DNA templates for immunoglobulin chains. [0042] Embodiment 36: The method of any of embodiments 13-26, wherein the microfluidic device is a matrix-type microfluidic device including: capture sites arranged in a matrix of R rows and C columns, wherein R and C are integers greater than 1, and wherein the capture sites can be fluidically isolated from one another after distribution of cells to the capture sites; a set of R first input lines configured to deliver the first reagent(s) to capture sites in a particular row; a set of C second input lines configured to deliver second reagent(s) to capture sites in a particular column, wherein said delivery is separate from the delivery first reagent(s), wherein, after a reaction, reaction products can be recovered from the microfluidic device in pools of reaction products from individual rows or columns. [0043] Embodiment 37: The method of embodiment 36, wherein either: the reverse transcription reagents are provided to each capture site via the first input lines, and the amplification reagents are provided to each capture site via the second input lines; or the reverse transcription reagents are provided to each capture site via the second input lines, and the amplification reagents are provided to each capture site via the first input lines.

[0044] Embodiment 38: The method of embodiment 37, wherein: each barcode primer provided to a first input line includes a barcode nucleotide sequence that is different from that in the other barcode primers provided to all other first input lines; each barcode primer provided to a second input line includes a barcode nucleotide sequence that is different from that in the other barcode primers provided to all other second input lines; and each DNA template produced at each capture site includes the structure: 5'- (second sequencing adaptor)-(second barcode nucleotide sequence)-(second primer binding site)-(VS)-(complementarity determining region)-(CS)-( reverse complement first primer binding site)-(reverse complement first barcode nucleotide sequence)-(reverse complement first sequencing adaptor)-3', wherein the first and second barcode nucleotide sequences, together, uniquely identify the capture site at which the DNA template was produced. [0045] Embodiment 39: The method of any of embodiments 14-38, wherein: the

CS-specific portion of the first RT primer includes an insert that does not anneal to the CS of the RNA encoding the first chain of the T-cell receptor or immunoglobulin; and/or the CS-specific portion of the second RT primer includes an insert that does not anneal to the CS of the RNA encoding the second chain of the T-cell receptor or immunoglobulin; wherein the insert is flanked by a 5' anchor sequence and a 3' bait sequence, both of which anneal to the CS.

[0046] Embodiment 40: The method of any of embodiments 26-39, wherein: the

VS-specific portion of the plurality of first amplification primers includes an insert that does not anneal to the VS of the RNA encoding the first chain of the T-cell receptor or immunoglobulin; and/or the VS-specific portion of the plurality of second amplification primers includes an insert that does not anneal to the VS of the RNA encoding the second chain of the T-cell receptor or immunoglobulin; wherein the insert is flanked by a 5' anchor sequence and a 3' bait sequence, both of which anneal to the VS. [0047] Embodiment 41 : The method of embodiment 40, wherein the first and second RT primers do not comprise an insert that does not anneal to the CS of the RNA.

[0048] Embodiment 42: The method of any of embodiments 1-40, wherein the method includes preparing DNA templates for single-cell transcriptome sequencing.

[0049] Embodiment 43 : The method of any of embodiments 1-42, wherein the method includes preparing DNA templates for single-cell transcript sequencing of more than one specific target RNA.

[0050] Embodiment 44: The method of embodiment 43, wherein the reverse transcription reagents additionally comprise a third reverse transcription (RT) primer having a portion specific for a particular target RNA and a third nucleotide tag including the first primer binding site for the first DNA sequencing primer, the third nucleotide tag being 5' of the target-specific portion.

[0051] Embodiment 45: The method of embodiment 44, wherein the target- specific portion of the third RT primer includes an insert that does not anneal to the particular target RNA, wherein the insert is flanked by a 5' anchor sequence and a 3' bait sequence, both of which anneal to the target RNA.

[0052] Embodiment 46: The method of any of embodiments 1-26 and 36-45, wherein the reverse transcription reagents additionally comprise a 5' oligonucleotide including, from 5' to 3', the second primer binding site for the second DNA sequencing primer and an oligo-riboG sequence.

[0053] Embodiment 47: The method of embodiment 46, wherein the 5' oligonucleotide additionally includes a unique molecular indentifier (UMI) between the second primer binding site and the oligo-riboG sequence.

[0054] Embodiment 48: The method of any of embodiments 9-47, wherein the method includes determining that a T-cell receptor or immunoglobulin sequence is present in a cell having a specific phenotype.

[0055] Embodiment 49: A primer combination for producing DNA templates from RNA encoding T-cell receptor or immunoglobulin chains, the primer combination including: a first reverse transcription (RT) primer having a portion specific for the Constant segment (CS) of an RNA encoding a first chain of a T-cell receptor or immunoglobulin, the T-cell receptor or immunoglobulin also including a second chain; a second reverse transcription (RT) primer having a portion specific for the Constant segment (CS) of an RNA encoding the second chain of the T-cell receptor or

immunoglobulin, respectively; wherein the first and second RT primers each additionally comprise a first nucleotide tag including a first primer binding site for a first DNA sequencing primer, the first nucleotide tag being 5' of the CS-specific portion; and a first barcode primer that includes, from 3' to 5', a portion specific for the first primer binding site, a first barcode nucleotide sequence, and a first sequencing adaptor; a plurality of first amplification primers, each having a portion specific for a different Variable segment (VS) of the RNA encoding the first chain of the T-cell receptor or immunoglobulin; a plurality of second amplification primers, each having a portion specific for a different Variable segment (VS) of the RNA encoding the second chain of the T-cell receptor or

immunoglobulin, respectively; wherein the first and second amplification primers each additionally comprise a second nucleotide tag including a second primer binding site for a second DNA sequencing primer, the second nucleotide tag being 5' of the VS-specific portion; and a second barcode primer that includes, from 3' to 5', a portion specific for the second primer binding site, a second barcode nucleotide sequence, and a second sequencing adaptor.

[0056] Embodiment 50: The primer combination of embodiment 49, wherein the first barcode primer includes nucleotide sequences that form a stem-duplex, whereby the first barcode primer has a hairpin configuration when used for reverse transcription and a linear configuration when used for amplification.

[0057] Embodiment 51 : The primer combination of embodiments 49 or 50, wherein the second barcode primer includes nucleotide sequences that form a stem- duplex, whereby the second barcode primer has a hairpin configuration when used for reverse transcription and a linear configuration when used for amplification.

[0058] Embodiment 52: The primer combination any of embodiments 49-51, wherein the first and second amplification primers each comprise nucleotide sequences that form a stem-duplex, whereby the first and second amplification primers each has a linear configuration when used for amplification.

[0059] Embodiment 53 : The primer combination of any of embodiments 49-52, additionally including a third reverse transcription (RT) primer having an oligo-dT sequence and a third nucleotide tag including the first primer binding site, the third nucleotide tag being 5' of the oligo-dT sequence. [0060] Embodiment 54: The primer combination of embodiment 53, wherein the third RT primer additionally includes a unique molecular identifier (UMI) between the oligo-dT sequence and the first binding site.

[0061] Embodiment 55: The primer combination of any of embodiments 49-54, additionally including a third amplification primer including a portion specific for a particular target RNA and the second nucleotide tag including the second primer binding site, the second nucleotide tag being 5' of the target-specific portion, wherein the third amplification primer includes nucleotide sequences that form a stem-duplex, whereby the third amplification primer has a linear configuration during amplification.

[0062] Embodiment 56: The primer combination of embodiment 49, wherein: the CS-specific portion of the first RT primer includes an insert that does not anneal to the CS of the RNA encoding the first chain of the T-cell receptor or immunoglobulin; and/or the CS-specific portion of the second RT primer includes an insert that does not anneal to the CS of the RNA encoding the second chain of the T-cell receptor or immunoglobulin; wherein the insert is flanked by a 5' anchor sequence and a 3' bait sequence, both of which anneal to the CS.

[0063] Embodiment 57: The primer combination of embodiments 49 or 56, wherein: the VS-specific portion of the plurality of first amplification primers includes an insert that does not anneal to the VS of the RNA encoding the first chain of the T-cell receptor or immunoglobulin; and/or the VS-specific portion of the plurality of second amplification primers includes an insert that does not anneal to the VS of the RNA encoding the second chain of the T-cell receptor or immunoglobulin; wherein the insert is flanked by a 5' anchor sequence and a 3' bait sequence, both of which anneal to the VS.

[0064] Embodiment 58: The primer combination of embodiment 57, wherein the first and second RT primers do not comprise an insert that does not anneal to the CS of the RNA.

[0065] Embodiment 59: The primer combination of any of embodiments 56-58, additionally including a third reverse transcription (RT) primer having an oligo-dT sequence and a third nucleotide tag including the first primer binding site, the third nucleotide tag being 5' of the oligo-dT sequence.

[0066] Embodiment 60: The primer combination of embodiment 59, wherein the third RT primer additionally includes a unique molecular identifier (UMI) between the oligo-dT sequence and the binding site for the first DNA sequencing primer.

[0067] Embodiment 61 : The primer combination of embodiments 59 or 60, additionally including a third amplification primer including a portion specific for a particular target RNA a second nucleotide tag including a second primer binding site, the second nucleotide tag being 5' of the target-specific portion. [0068] Embodiment 62: The method of embodiment 61, wherein the target- specific portion of the third amplification primer includes an insert that does not anneal to the target RNA, wherein the insert is flanked by a 5' anchor sequence and a 3' bait sequence, both of which anneal to the target RNA.

[0069] Embodiment 63 : The primer combination of any of embodiments 49-58, wherein the primer combination additionally includes a third reverse transcription (RT) primer having a portion specific for a particular target RNA and a third nucleotide tag including the first primer binding site, the third nucleotide tag being 5' of the target- specific portion.

[0070] Embodiment 64: The primer combination of embodiment 63, wherein the target-specific portion of the third RT primer includes an insert that does not anneal to the particular target RNA, wherein the insert is flanked by a 5' anchor sequence and a 3' bait sequence, both of which anneal to the target RNA. [0071] Embodiment 65: The primer combination of embodiments 63 or 64, wherein the primer combination additionally includes a 5' oligonucleotide including, from 5' to 3', the second primer binding site and an oligo-riboG sequence.

[0072] Embodiment 66: The primer combination of embodiment 65, wherein the 5' oligonucleotide additionally includes a unique molecular identifier (UMI) between the binding site for the second DNA sequencing primer and the oligo-riboG sequence.

[0073] Embodiment 67: A primer combination for producing DNA templates from RNA encoding T-cell receptor or immunoglobulin chains, the primer combination including: a first reverse transcription (RT) primer having a portion specific for the Constant segment (CS) of an RNA encoding a first chain of a T-cell receptor or immunoglobulin, the T-cell receptor or immunoglobulin also including a second chain; a second reverse transcription (RT) primer having a portion specific for the Constant segment (CS) of an RNA encoding the second chain of the T-cell receptor or

immunoglobulin, respectively; wherein the first and second RT primers each additionally comprise a first nucleotide tag including a first primer binding site for a first DNA sequencing primer, the first nucleotide tag being 5' of the CS-specific portion; and a first barcode primer that includes, from 3' to 5', a portion specific for the first primer binding site, a first barcode nucleotide sequence, and a first sequencing adaptor; a 5'

oligonucleotide including, from 5' to 3', the second primer binding site for the second DNA sequencing primer, a unique molecular identifier (UMI), and an oligo-riboG sequence; and a second barcode primer that includes, from 3' to 5', a portion specific for the second primer binding site, a second barcode nucleotide sequence, and a second sequencing adaptor.

[0074] Embodiment 68: The primer combination of embodiment 67, wherein: the CS-specific portion of the first RT primer includes an insert that does not anneal to the CS of the RNA encoding the first chain of the T-cell receptor or immunoglobulin; and/or the CS-specific portion of the second RT primer includes an insert that does not anneal to the CS of the RNA encoding the second chain of the T-cell receptor or immunoglobulin; wherein the insert is flanked by a 5' anchor sequence and a 3' bait sequence, both of which anneal to the CS.

[0075] Embodiment 69: The primer combination of embodiments 67 or 68, wherein the combination additionally includes a third reverse transcription (RT) primer having a portion specific for a particular target RNA and a third nucleotide tag including the first primer binding site, the third nucleotide tag being 5' of the target-specific portion.

[0076] Embodiment 70: The primer combination of embodiment 69, wherein the target-specific portion of the third RT primer includes an insert that does not anneal to the particular target RNA, wherein the insert is flanked by a 5' anchor sequence and a 3' bait sequence, both of which anneal to the target RNA.

[0077] Embodiment 71 : The primer combination of any of embodiments 49-70, wherein the primer combination is adapted to produce DNA templates for T-cell receptor chains. [0078] Embodiment 72: The primer combination of any of embodiments 49-70, wherein the primer combination is adapted to produce DNA templates for

immunoglobulin chains.

[0079] Embodiment 73 : A kit including the primer combination of any of embodiments 49-72 and a matrix-type microfluidic device including: capture sites arranged in a matrix of R rows and C columns, wherein R and C are integers greater than 1, and wherein the capture sites can be fluidically isolated from one another after distribution of cells to the capture sites; a set of R first input lines configured to deliver the first reagent(s) to capture sites in a particular row; a set of C second input lines configured to deliver second reagent(s) to capture sites in a particular column, wherein said delivery is separate from the delivery first reagent(s), wherein, after a reaction, reaction products can be recovered from the microfluidic device in pools of reaction products from individual rows or columns.

BRIEF DESCRIPTION OF THE DRAWINGS

[0080] The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.

[0081] Figure 1 A-D: An illustrative matrix-type microfluidic device is shown schematically in (1 A). (IB) illustrates the delivery of R different barcodes through the R different first input lines to the capture sites. (1C) illustrates the delivery of C different barcodes through C different input lines to the capture sites. (ID) illustrates that, after the reaction has been carried out, reaction products can be harvested for each column as a pool, for example, by applying a harvesting fluid to the C second input lines to push the reaction products out of outlets at one end of the input lines.

[0082] Figure 2: A photograph of the illustrative matrix-type microfluidic device shown schematically in Figure 1. [0083] Figure 3A-3D: Schematic representation of the structure of the genes encoding the a- and β-chains of the T-cell receptor (TRA and TRB, respectively). (3 A) Somatic recombination to produce a gene encoding a TCR a-chain. (3B) Somatic recombination to produce a gene encoding a TCR β-chain. (3C) Detail of the complementarity determining regions (CDRs) in the TCR a-chain gene after somatic recombination; N denotes non-templated nucleotides inserted into CDR3. (3D) Detail of the CDRs in the TCR β-chain gene after somatic recombination.

[0084] Figure 4A-4B: (4 A) Schematic representation of the primers added to a column (or row) of a matrix-type microfluidic device in an embodiment of a method of single-cell TCR transcript sequencing. The TRAC and TRBC sequences are

complementary to the TCR a and β mRNAs and serve as specific primers during the reverse transcriptase reaction. In the first barcode primer, illustrated in Figure 4A, Rd2 is the Illumina TruSeq Read 2 sequence, BCl-20 refers to 20 different barcodes, and P7 is one of the Illumina sequencing adaptors. The P7.BC.Rd2 primer does not participate in the RT reaction but is there so that barcodes are added to this end of the amplified segments during the amplification step. (4B) Schematic representation of the primers added to a row (or column) of a matrix-type microfluidic device in this embodiment of a method of single-cell TCR transcript sequencing.

[0085] Figure 5: A diagrammatic representation of a Super Selective primer. The

Anchor segment keeps the primer stably bound to the template. Because of the unpaired Insert/Loop region, the Bait segment (typically 5-10 nucleotides) hybridizes to the template only transiently.

[0086] Figure 6A-6B: Schematic representations of the use of Super Selective primers in the method illustrated in Figure 4A-4B, above. (6A) Reverse transcription. (6B) amplification. [0087] Figure 7: Schematic representation of the use of template switching for generating a DNA template for a particular target transcript (targeted transcript sequencing). In the template switching oligonucleotide, Rdl is the Illumina TruSeq Read 1 sequence, UMI is a unique molecular identifier, and GGG is an oligo-riboG sequence. In the RT primer, Rd2 is the Illumina TruSeq Read 2 sequence. The P7.BC.Rd2 primer discussed with respect to Figures 4 and 6 can prime subsequent amplification of the cDNA resulting from template switching. [0088] Figure 8 A-B: Schematic representation of the structure of genes encoding particular heavy and light chains that make up immunoglobulins. (8A) Somatic recombination to produce genes encoding μ or δ heavy chains. (8B) Somatic

recombination to produce a gene encoding a κ light chain.

[0089] Figure 9: Gel showing results of Example 1. Desired DNA fragments derived from a TCR transcript are in the size range 250 to 350 nucleotides. Undesired, nonspecific fragments are in the size range 100 to 220 nucleotides. The reaction products in lane 1 were generated with No Loop RT primers and No Loop V primers; lane 2, Loop RT primers and No Loop V primers; lane 3, No Loop RT primers and Loop V primers; lane 4, Loop RT primers and Loop V primers. The best ratio of desired fragments to undesired fragments is observed for combination 3, which is No Loop RT primers and Loop V primers.

[0090] Figure 10: Gel showing results of Example 2. Because of the addition of

P5, P7, and other sequences, the desired DNA fragments derived from a TCR transcript are in the size range 320 to 400 nucleotides. Results show that the fixed cells gave a much better yield of desired fragments than fresh cells.

[0091] Figure 11 A-l IB: Schematic representation of an illustrative method of single-cell T-cell receptor template production designed for tubes, which is described in detail in Example 5.

[0092] Figure 12A-12B: Schematic representation of a method of single-cell T- cell receptor template production similar to that shown in Figure 11 A-l IB, but modified for use with Fluidigm's Ci™ High-Throughput IFC (described in detail in Example 5).

[0093] Figure 13 : Gel showing results of Example 5. This method produces TCR libraries predominantly in the expected 350-450 bp size range.

DETAILED DESCRIPTION

[0094] Described herein are methods for preparing DNA templates for single-cell transcript sequencing of RNA from a population of cells. The methods entail distributing cells from the population into separate reaction volumes so that a plurality of separate reaction volumes each contain a single, isolated cell, wherein the cells have been treated with a fixative prior to distribution. In certain embodiments, the fixative can be a cell- permeant fixative. In particular embodiments, the method includes the step of pre-treating the cells with the fixative. The method is carried out by permeabilizing or disrupting (e.g., lysing) each cell in its separate reaction volume and reverse transcribing cDNA from RNA in each separate reaction volume (i.e., wherein each reverse transcription of RNA from an isolated cell is carried out separately from the reverse transcription of RNA from every other isolated cell). The cDNA is then amplified to produce DNA, with the amplification also being carried out to produce DNA templates from each isolated cell in separate reaction volumes. The amplification incorporates one or more nucleotide sequences that facilitate DNA sequencing of the DNA templates. After amplification, the DNA templates can be recovered, e.g., for further amplification and/or DNA sequencing. In some embodiments, the DNA templates are recovered from the separate reaction volumes in one or more pools of DNA templates, which can, optionally, be further pooled to produce a single pool of DNA templates for further amplification and/or DNA sequencing.

[0095] These methods described herein enable the production of DNA templates from transcripts derived from single cells with greater efficiency than was previously possible (e.g., without the fixative). In certain embodiments, aimed at single-cell sequencing of T-cell receptors or immunoglobulins, a particular chemistry is described herein which allow the production of DNA templates using a simpler protocol than any previously available, especially when implemented on a matrix-type microfiuidic device.

Definitions

[0096] Terms used in the claims and specification are defined as set forth below unless otherwise specified. These terms are defined specifically for clarity, but all of the definitions are consistent with how a skilled artisan would understand these terms.

[0097] As used herein, the term "microfiuidic device" refers to any device that includes chambers and/or fluid channels wherein at least one dimension is less than 1 millimeter. In certain embodiments, a microfiuidic device includes fluid flow channels (or lines) and separate control channels (or lines) that function to control or regulate flow through the fluid channels. [0098] The term nucleic acid includes any form of DNA or RNA, including, for example, genomic DNA; complementary DNA (cDNA), which is a DNA representation of mRNA, usually obtained by reverse transcription of messenger RNA (mRNA) or by amplification; DNA molecules produced synthetically or by amplification; and mRNA. [0099] The term nucleic acid encompasses double- or triple-stranded nucleic acids, as well as single-stranded molecules. In double- or triple-stranded nucleic acids, the nucleic acid strands need not be coextensive (i.e., a double-stranded nucleic acid need not be double-stranded along the entire length of both strands).

[0100] The term nucleic acid also encompasses any chemical modification thereof, such as by methylation and/or by capping. Nucleic acid modifications can include addition of chemical groups that incorporate additional charge, polarizability, hydrogen bonding, electrostatic interaction, and/or functionality to the individual nucleic acid bases or to the nucleic acid as a whole. Such modifications may include base modifications such as 2' -position sugar modifications, 5-position pyrimidine modifications, 8-position purine modifications, modifications at cytosine exocyclic amines, substitutions of 5-bromo-uracil, backbone modifications, unusual base pairing combinations such as the isobases isocytidine and isoguanidine, and the like.

[0101] More particularly, in certain embodiments, nucleic acids, can include polydeoxyribonucleotides (containing 2-deoxy-D-ribose), polyribonucleotides (containing D-ribose), and any other type of nucleic acid that is an N- or C-glycoside of a purine or pyrimidine base, as well as other polymers containing nonnucleotidic backbones, for example, polyamide (e.g., peptide nucleic acids (PNAs)) and polymorpholino

(commercially available from the Anti-Virals, Inc., Corvallis, Oregon, as Neugene) polymers, and other synthetic sequence-specific nucleic acid polymers providing that the polymers contain nucleobases in a configuration which allows for base pairing and base stacking, such as is found in DNA and RNA. The term nucleic acid also encompasses linked nucleic acids (LNAs), which are described in U.S. Patent Nos. 6,794,499,

6,670,461, 6,262,490, and 6,770,748, which are incorporated herein by reference in their entirety for their disclosure of LNAs. [0102] The nucleic acid(s) can be derived from a completely chemical synthesis process, such as a solid phase-mediated chemical synthesis, from a biological source, such as through isolation from any species that produces nucleic acid, or from processes that involve the manipulation of nucleic acids by molecular biology tools, such as DNA replication, amplification (e.g., PCR), reverse transcription, or from a combination of those processes.

[0103] The term "template" is used herein to refer to a nucleic acid molecule that serves as a template for a polymerase to synthesize a complementary nucleic acid molecule.

[0104] A "DNA template," for example, can be used in DNA sequencing, typically after adding one or more nucleotide sequences that facilitate DNA sequencing (e.g., barcodes, unique molecular identifiers, binding sites for DNA sequencing primers, and/or sequencing adaptors, such as, e.g., a flow cell sequence useful in cluster generation in bridge sequencing).

[0105] As used herein the terms "nucleotide barcode" and "barcode" refer to a specific nucleotide sequence that encodes information about cDNA produced when a barcoded primer or oligonucleotide is employed in reverse transcription or the amplicon produced when one or more barcoded primer(s) is/are employed in an amplification reaction.

[0106] In some embodiments, the barcode encodes an item of capture site information. For example, for reactions carried out on a matrix-type microfluidic device, a barcode can encode the row or column of a capture site. Two barcodes, one encoding the row in which the barcode is introduced and the other encoding the column in which that barcode is introduced can define the specific capture site residing at the intersection of the row and column identified by the barcodes.

[0107] As used herein, "UMI" is an acronym for "unique molecular identifier," also referred to as "molecular identifier." A UMI is one in a group of identifiers in which each identifier is distinguishable from any of the other identifiers in the group. One way to achieve this "uniqueness" is to use a string of nucleotides. For example, if the length of this string is 10 bases, there are more than 1 million unique sequences; if it is 20 bases long, there will be 10¹² unique sequences. See Hug and Schuler, "Measurement of the Number of Molecules of a Single mRNA Species in a Complex mRNA Preparation," J. Theor. Biol. (2003) 221, 615-624 and Hollas and Schuler, "A Stochastic Approach to Count RNA Molecules Using DNA Sequencing Methods" in Algorithms in

Bioinformatics (2003): Third International Workshop, WABI 2003, Budapest, Hungary, September 15-20, 2003, Series title: Lecture Notes in Computer Science Volume 2812, pp 55-62 (eds. Benson and Page), which is incorporated by reference herein for its description of the UMI concept.

[0108] The term "target nucleic acids" or "target RNA" is used herein to refer to particular nucleic acids to be detected in the methods described herein. Accordingly, amplification of a particular RNA transcript, for example, is an example of target-specific amplification, whereas "Whole Transcriptome Amplification" is an example of an amplification that aims to amplify all transcripts present in a given cell. When the amplification products are sequenced, the sequencing of a particular transcript is termed "target-specific" or "targeted" sequencing, whereas "transcriptome sequencing" aims to sequence all transcripts present in a cell.

[0109] As used herein, the term "target nucleotide sequence" refers to a molecule that includes the nucleotide sequence of a target nucleic acid, such as, for example, the amplification product obtained by amplifying a target nucleic acid or the cDNA produced upon reverse transcription of an RNA target nucleic acid.

[0110] As used herein, the term "complementary" refers to the capacity for precise pairing between two nucleotides. I.e, if a nucleotide at a given position of a nucleic acid is capable of hydrogen bonding with a nucleotide of another nucleic acid, then the two nucleic acids are considered to be complementary to one another at that position.

Complementarity between two single-stranded nucleic acid molecules may be "partial," in which only some of the nucleotides bind, or it may be complete when total

complementarity exists between the single-stranded molecules. The degree of

complementarity between nucleic acid strands has significant effects on the efficiency and strength of hybridization between nucleic acid strands. A first nucleotide sequence is said to be the "complement" of a second sequence if the first nucleotide sequence is complementary to the second nucleotide sequence. A first nucleotide sequence is said to be the "reverse complement" of a second sequence, if the first nucleotide sequence is complementary to a sequence that is the reverse (i.e., the order of the nucleotides is reversed) of the second sequence. [0111] "Specific hybridization" refers to the binding of a nucleic acid to a target nucleotide sequence in the absence of substantial binding to other nucleotide sequences present in the hybridization mixture under defined stringency conditions. Those of skill in the art recognize that relaxing the stringency of the hybridization conditions allows sequence mismatches to be tolerated.

[0112] In particular embodiments, hybridizations are carried out under stringent hybridization conditions. The phrase "stringent hybridization conditions" generally refers to a temperature in a range from about 5°C to about 20°C or 25°C below than the melting temperature (T_m) for a specific sequence at a defined ionic strength and pH. As used herein, the T_m is the temperature at which a population of double-stranded nucleic acid molecules becomes half-dissociated into single strands. Methods for calculating the T_m of nucleic acids are well known in the art (see, e.g., Berger and Kimmel (1987) METHODS IN ENZYMOLOGY, VOL.152: GUIDE TO MOLECULAR CLONING TECHNIQUES, San Diego: Academic Press, Inc. and Sambrook et al. (1989) MOLECULAR CLONING: A LABORATORY MANUAL, 2ND ED., VOLS. 1-3, Cold Spring Harbor Laboratory), both incorporated herein by reference). As indicated by standard references, a simple estimate of the T_m value may be calculated by the equation: T_m =81.5+0.41(% G+C), when a nucleic acid is in aqueous solution at 1 M NaCl (see, e.g., Anderson and Young, Quantitative Filter Hybridization in NUCLEIC ACID HYBRIDIZATION (1985)). The melting temperature of a hybrid (and thus the conditions for stringent hybridization) is affected by various factors such as the length and nature (DNA, RNA, base composition) of the primer or probe and nature of the target nucleic acid (DNA, RNA, base

composition, present in solution or immobilized, and the like), as well as the concentration of salts and other components (e.g., the presence or absence of formamide, dextran sulfate, polyethylene glycol). The effects of these factors are well known and are discussed in standard references in the art. Illustrative stringent conditions suitable for achieving specific hybridization of most sequences are: a temperature of at least about 60°C and a salt concentration of about 0.2 molar at pH7.

[0113] The term "oligonucleotide" is used to refer to a nucleic acid that is relatively short, generally shorter than 200 nucleotides, more particularly, shorter than 100 nucleotides, even more particularly, shorter than 50 nucleotides, and most particularly shorter than 40, 30, 20, 15, 10, or 5 nucleotides. Typically, oligonucleotides are single- stranded DNA mol ecul e s .

[0114] A stretch of nucleotides of the same type are referred to, e.g., as "oligo- riboG" (for a stretch of repeated guanines) or "poly-A" (for a stretch of repeated adenines). [0115] The term "primer" refers to an oligonucleotide that is capable of hybridizing (also termed "annealing") with a nucleic acid and serving as an initiation site for nucleotide (RNA or DNA) polymerization under appropriate conditions (i.e., in the presence of four different nucleoside triphosphates and an agent for polymerization, such as DNA or RNA polymerase or reverse transcriptase) in an appropriate buffer and at a suitable temperature. The term "primer site" or "primer binding site" refers to the segment of the target nucleic acid to which a primer hybridizes.

[0116] A primer is said to "anneal" to another nucleic acid if the primer, or a portion thereof, hybridizes to a nucleotide sequence within the nucleic acid. The statement that a primer hybridizes to a particular nucleotide sequence is not intended to imply that the primer hybridizes either completely or exclusively to that nucleotide sequence. For example, a portion of a primer may anneal to a particular nucleic acid, in which case, that portion is said to be "specific for" that nucleic acid.

[0117] The primer can be perfectly complementary to the target nucleic acid sequence or can be less than perfectly complementary. In certain embodiments, the primer has at least 65% identity to the complement of the target nucleic acid sequence over a sequence of at least 7 nucleotides, more typically over a sequence in the range of 10-30 nucleotides, and often over a sequence of at least 14-25 nucleotides, and more often has at least 75% identity, at least 85% identity, at least 90% identity, or at least 95%, 96%, 97%. 98%), or 99% identity. It will be understood that certain bases (e.g., the 3 ' base of a primer) are generally desirably perfectly complementary to corresponding bases of the target nucleic acid sequence. Primer and probes typically anneal to the target sequence under stringent hybridization conditions.

[0118] The term "nucleotide tag" is used herein to refer to a predetermined nucleotide sequence that is added to another nucleotide sequence, typically during reverse transcription or amplification. Nucleotide tags are conveniently added using a primer having a non-hybridizing portion (i.e., the "tag"). Nucleotide tags can include, e.g., primer binding sites for sequencing primers, barcodes, UMIs, sequencing adaptors, etc.

[0119] As used herein the term "barcode primer" refers to a primer that includes a specific barcode nucleotide sequence that encodes information about the amplicon produced when the barcode primer is employed in an amplification reaction. [0120] A "linker" can, but need not, be or include a nucleic acid. Nucleotide linkers can be added to either end of a nucleotide sequence to be amplified to facilitate unbiased amplification using primers specific for the nucleotide linkers, which can be the same or different. [0121] As used herein with reference to a portion of a primer, the term "target- specific" nucleotide sequence refers to a sequence that can specifically anneal to a target nucleic acid or a target nucleotide sequence under suitable annealing conditions.

[0122] As used herein, an "anchor sequence" refers to a sequence in an

oligonucleotide (e.g., a primer) that keeps the oligonucleotide stably bound to a

complementary nucleotide sequence.

[0123] As used herein, a "bait sequence" refers to a sequence in an oligonucleotide

(e.g., a primer) that binds only transiently to a complementary nucleotide sequence. When used in conjunction with an anchor sequence, the bait sequence will form a less stable hybrid than the anchor sequence. [0124] As used in conjunction with anchor and bait sequences, an "insert" refers to a non-hybridizing element residing between the anchor and bait sequences, the insert is typically a nucleotide sequence, but is not limited to such.

[0125] The term "flanked by" is used herein to describe a situation in which a central nucleotide sequence has other nucleotide sequences on either side. [0126] As used herein, a "stem-duplex" is formed when two self-complementary regions of nucleic acid molecule (typically an oligonucleotide) hybridize with one another.

[0127] The two self-complementary regions of a stem-duplex are linked, typically by a nucleotide loop, to form a "hairpin" configuration at temperatures below the T_m of the stem-duplex. [0128] A "hairpin primer" is an oligonucleotide that is capable of priming nucleotide polymerization (e.g., amplification) and that, at lower temperatures, assumes a hairpin configuration.

[0129] A primer is said to have "a hairpin configuration during reverse

transcription" if it has a stem-duplex with a T_m higher than the temperature used for reverse transcription (e.g., 37-42 °C). In this configuration, the primer cannot function to prime nucleotide polymerization. In effect, the primer is "turned off." [0130] A primer is said to have "a linear configuration" during amplification such that it can anneal to a sufficiently complementary sequence and prime nucleotide polymerization. In effect, the primer is "turned on." When amplification is carried out by polymerase chain reaction (PCR), the primer assumes a linear configuration on

denaturation, which is maintained during annealing (e.g., at 50-65°C) and extension.

[0131] Amplification according to the present teachings encompasses any means by which at least a part of at least one target nucleic acid is reproduced, typically in a template-dependent manner, including without limitation, a broad range of techniques for amplifying nucleic acid sequences, either linearly or exponentially. Illustrative means for performing an amplifying step include ligase chain reaction (LCR), ligase detection reaction (LDR), ligation followed by Q-replicase amplification, PCR, primer extension, strand displacement amplification (SDA), hyperbranched strand displacement

amplification, multiple displacement amplification (MDA), nucleic acid strand-based amplification (NASBA), two-step multiplexed amplifications, rolling circle amplification (RCA), and the like, including multiplex versions and combinations thereof, for example but not limited to, OLA/PCR, PCR/OLA, LDR/PCR, PCR/PCR/LDR, PCR/LDR, LCR/PCR, PCR/LCR (also known as combined chain reaction~CCR), and the like.

Descriptions of such techniques can be found in, among other sources, Ausbel et al.; PCR Primer: A Laboratory Manual, Diffenbach, Ed., Cold Spring Harbor Press (1995); The Electronic Protocol Book, Chang Bioscience (2002); Msuih et al., J. Clin. Micro. 34:501- 07 (1996); The Nucleic Acid Protocols Handbook, R. Rapley, ed., Humana Press, Totowa, N.J. (2002); Abramson et al., Curr Opin Biotechnol. 1993 Feb.;4(l):41-7, U.S. Pat. No. 6,027,998; U.S. Pat. No. 6,605,451, Barany et al., PCT Publication No. WO 97/31256; Wenz et al., PCT Publication No. WO 01/92579; Day et al., Genomics, 29(1): 152-162 (1995), Ehrlich et al., Science 252: 1643-50 (1991); Innis et al., PCR Protocols: A Guide to Methods and Applications, Academic Press (1990); Favis et al., Nature Biotechnology 18:561-64 (2000); and Rabenau et al., Infection 28:97-102 (2000); Belgrader, Barany, and Lubin, Development of a Multiplex Ligation Detection Reaction DNA Typing Assay, Sixth International Symposium on Human Identification, 1995 (available on the world wide web at: promega.com/geneticidproc/ussymp6proc/blegrad.html- ); LCR Kit

Instruction Manual, Cat. #200520, Rev. #050002, Stratagene, 2002; Barany, Proc. Natl. Acad. Sci. USA 88: 188-93 (1991); Bi and Sambrook, Nucl. Acids Res. 25:2924-2951 (1997); Zirvi et al., Nucl. Acid Res. 27:e40i-viii (1999); Dean et al., Proc Natl Acad Sci USA 99:5261-66 (2002); Barany and Gelfand, Gene 109: 1-11 (1991); Walker et al., Nucl. Acid Res. 20: 1691-96 (1992); Polstra et al., BMC Inf. Dis. 2: 18- (2002); Lage et al., Genome Res. 2003 Feb.; 13(2):294-307, and Landegren et al., Science 241 : 1077-80 (1988), Demidov, V., Expert Rev Mol Diagn. 2002 Nov.;2(6):542-8., Cook et al., J Microbiol Methods. 2003 May;53(2): 165-74, Schweitzer et al., Curr Opin Biotechnol. 2001 Feb.; 12(l):21-7, U.S. Pat. No. 5,830,711, U.S. Pat. No. 6,027,889, U.S. Pat. No. 5,686,243, PCT Publication No. WO0056927A3, and PCT Publication No.

WO9803673A1.

[0132] In some embodiments, amplification comprises at least one cycle of the sequential procedures of: annealing at least one primer with complementary or substantially complementary sequences in at least one target nucleic acid; synthesizing at least one strand of nucleotides in a template-dependent manner using a polymerase; and denaturing the newly-formed nucleic acid duplex to separate the strands. The cycle may or may not be repeated. Amplification can comprise thermocycling or can be performed isothermally.

[0133] "Whole transcriptome amplification" ("WTA") refers to any amplification method that aims to produce an amplification product that is representative of a population of RNA from the cell from which it was prepared. An illustrative WTA method entails production of cDNA bearing linkers on either end that facilitate unbiased amplification. In many implementations, WTA is carried out to analyze messenger (poly-A) RNA.

[0134] The term "substantially" as used herein with reference to a parameter means that the parameter is sufficient to provide a useful result. Thus, "substantially complementary," as applied to nucleic acid sequences generally means sufficiently complementary to work in the described context. Typically, substantially complementary means sufficiently complementary to hybridize under the conditions employed.

[0135] A "reagent" refers broadly to any agent used in a reaction, other than an analyte (e.g., nucleic acid being analyzed). Illustrative reagents for a nucleic acid amplification reaction include, but are not limited to, buffer, metal ions, polymerase, reverse transcriptase, primers, nucleotides, oligonucleotides, labels, dyes, nucleases, and the like. Reagents for enzyme reactions include, for example, substrates, cofactors, buffer, metal ions, inhibitors, and activators. The term reagent also encompasses any component that influences cell growth or behavior, such as, e.g., buffer, culture medium or components thereof, agonists or antagonists, etc.

[0136] The term "label," as used herein, refers to any atom or molecule that can be used to provide a detectable and/or quantifiable signal. In particular, the label can be attached, directly or indirectly, to a nucleic acid or protein. Suitable labels that can be attached to probes include, but are not limited to, radioisotopes, fluorophores,

chromophores, mass labels, electron dense particles, magnetic particles, spin labels, molecules that emit chemiluminescence, electrochemically active molecules, enzymes, cofactors, and enzyme substrates. [0137] As used herein with respect to reactions, reaction mixtures, reaction volumes, etc., the term "separate" refers to reactions, reaction mixtures, reaction volumes, etc., where reactions are carried out in isolation from other reactions. Separate reactions, reaction mixtures, reaction volumes, etc. include those carried out in droplets (See, e.g., U.S. Patent No., 7,294,503, issued November 13, 2007 to Quake et al., entitled

"Microfabricated crossflow devices and methods," which is incorporated herein by reference in its entirety and specifically for its description of devices and methods for forming and analyzing droplets; U.S. Patent Publication No. 20100022414, published January 28, 2010, by Link et al., entitled "Droplet libraries," which is incorporated herein by reference in its entirety and specifically for its description of devices and methods for forming and analyzing droplets; and U.S. Patent Publication No. 20110000560, published January 6, 2011, by Miller et al., entitled "Manipulation of Microfluidic Droplets," which is incorporated herein by reference in its entirety and specifically for its description of devices and methods for forming and analyzing droplets.), which may, but need not, be in an emulsion, as well as those wherein reactions, reaction mixtures, reaction volumes, etc. are separated by mechanical barriers, e.g., separate vessels, separate wells of a microtiter plate, or separate compartments of a matrix-type microfluidic device.

[0138] The term "fluidically isolated" is used herein to refer to state in which two or more elements of a microfluidic device are not in fluid communication with one another. [0139] The term "elastomer" has the general meaning used in the art. Thus, for example, Allcock et al. (Contemporary Polymer Chemistry, 2nd Ed.) describes elastomers in general as polymers existing at a temperature between their glass transition temperature and liquefaction temperature. Elastomeric materials exhibit elastic properties because the polymer chains readily undergo torsional motion to permit uncoiling of the backbone chains in response to a force, with the backbone chains recoiling to assume the prior shape in the absence of the force. In general, elastomers deform when force is applied, but then return to their original shape when the force is removed.

[0140] As used herein, "fixative" is compound or mixture of compounds that stabilizes cells, cellular structures, and/or macromolecules (e.g., RNA) for subsequent analysis.

[0141] When used with reference to a T cell or B cell, the term "activated" refers to the state of the cell after exposure to an antigen that stimulates an immune response in the cell.

Single-Cell Transcript Sequencing Using Fixed Cells - In General

[0142] The methods described herein can be employed to prepare DNA templates for sequencing from single, fixed cells. The templates can be prepared from any type of RNA, e.g., from micro-RNA to messenger RNA. Templates can be prepared from specific RNA targets in a single cell or the entire complement of a type of RNA, e.g., all messenger RNA as in whole transcriptome sequencing. While the present disclosure focuses on the preparation of DNA templates for determining RNA sequences, those of skill in the art understand that RNA expression levels can be determined in the course of sequencing because expression levels are correlated with the number of sequencing reads obtained. In addition, as those of skill in the art will readily appreciate from the discussion below, these methods for generating and sequencing DNA templates can be coupled with other types of analyses of nucleic acids, proteins, or other aspects of single cells. Because multiple types of analyses can be carried out in the separate reaction volumes employed in these methods, multiple characteristics of individual cells can be identified, facilitating the linkage of particular RNA sequences, optionally at particular levels, with phenotype. An example of this type of approach is described in Darmanis S, Gallant CJ, Marinescu VD, Niklasson M, Segerman A, Flamourakis G, Fredriksson S, Assarsson E, Lundberg M, Nelander S, et al: Simultaneous Multiplexed Measurement of RNA and Proteins in Single Cells. Cell Rep 2016, 14:380-389, which is incorporated by reference herein for this description. Cells Analyzed

[0143] The methods described herein can be used to analyze transcripts from any type of cells, e.g., any self-replicating, membrane-bounded biological entity or any non- replicating, membrane-bounded descendant thereof. Non-replicating descendants may be senescent cells, terminally differentiated cells, cell chimeras, serum-starved cells, infected cells, non-replicating mutants, anucleate cells, intact nuclei, and fixed, intact (dead) cells, etc. Cells used in the methods described herein may have any origin, genetic background, state of health, state of fixation, membrane permeability, pretreatment, and/or population purity, among other characteristics. Suitable cells may be eukaryotic, prokaryotic, archaeon, etc., and may be from animals, plants, fungi, protists, bacteria, and/or the like. In illustrative embodiments, human cells are analyzed. Cells may be from any stage of organismal development, e.g., in the case of mammalian cells (e.g., human cells), embryonic, fetal, or adult cells may be analyzed. In certain embodiments, the cells are stem cells. Cells may be wildtype; natural, chemical or viral mutants; engineered mutants (such as transgenics); and/or the like. In addition, cells may be growing, quiescent, senescent, transformed, and/or immortalized, among other states. Furthermore, cells may be a monoculture, generally derived as a clonal population from a single cell or a small set of very similar cells; may be preselected by any suitable mechanism, such as affinity binding, FACS, drug selection, etc.; and/or may be a mixed or heterogeneous population of distinct cell types.

[0144] In particular embodiments, the cells are preselected populations of T or B cells. Both (along with natural killer T or NKT cells and monocytes) are peripheral blood mononuclear cells (PMBCs), which can be isolated using standard methods. PMBCs can be enriched for desired populations also using standard methods. For example, T helper (T_H) cells can be enriched by selecting for CD4+ cells; cytotoxic T cells (Tc cells or CTLs) can be enriched by selecting for CD8+ cells; memory T cells can be enriched by selecting for CD45RO+ cells; etc.

[0145] One advantage of the methods described herein is that they can be used to analyze virtually any number of single cells, e.g., in a particular cell population. In various embodiments, the number of single cells analyzed can, e.g., be at least any of the following values: 10, 50, 60, 70, 80, 90, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 2000, 3000, 4000, 5000, 6000, 7,000, 8000, 9,000, or 10,000. When the cells to be analyzed are scarce, the methods described herein offer the advantage over cell sorting that fewer cells are needed for the analysis. In various embodiments, the number of single cells analyzed can be fewer than any of the following values: 10,000, 9,000, 8,000, 7000, 6,000, 5,000, 4,000, 3,000, 2,000, or 1,000. In specific embodiments, the number of cells analyzed can fall within a range bounded by any two values listed above, such as, for example, 300-1,300, 400-1,200, 500-1, 100, 600-1,000, or 700-900. An illustrative embodiment aims to analyze 800 cells.

Fixing Cells

[0146] Treatment of the cells with a cell-permeant fixative might be expected to hamper subsequent reactions, such as reverse transcription and/or amplification.

Surprisingly, based on the work described herein, it has now been found that fixing cells can improve the quality of the results observed upon sequencing DNA templates produced by reverse transcription, followed by amplification.

[0147] Accordingly, the methods described herein enable the production of DNA templates from transcripts derived from single cells with greater efficiency than was previously possible (e.g., without the fixative). In various embodiments, these methods enable sequencing and/or quantification of the level of any transcript of interest from more than: 50%, 60%, 70%, 80%, 85%, 90%, 95% of cells that have been isolated in separate reaction volumes, wherein the number of isolated cells ranges, e.g., from 90-10,000, 100- 5,000, 200-1,500, 300-1,300, 400-1,200, 500-1, 100, 600-1,000, or 700-900. For example, using the methods described herein, one can obtain the sequences (and optionally expression levels) of both TCR or Ig chains, and optionally one or more phenotypic markers, in more than 85%, more than 90%, or more than 95% of of 200-1,500, isolated cells using a simpler protocol than any previously available. In certain embodiments, suitable fixatives provide such results by stabilizing the cell nucleus, so that cell lysis is less likely to disrupt the nucleus (which increases the percentage of RNA in the released nucleic acids) and/or by stabilizing RNA.

[0148] In illustrative embodiments, the fixative is biomarker and histology preservative (BHP) and/or dithiobis(succinimydal proprionate) (DSP). BHP is described in Mueller et al., "One-Step Preservation of Phosphoproteins and Tissue Morphology at Room Temperature for Diagnostic and Research Specimens (August, 2011) PLos ONE, 6(8) e23780, and in Espina et al., US 2013/0137094, published May 30, 2013, both of which are incorporated by reference for this description. Table 1 of Mueller et al. provides the following regarding BHP composition:

Ρ$»$Ϊ5*8*8$» yites* sSs§<>¾ «#^' £S«^¾*¾SS«

US 2013/0137094 describes the production of an illustrative BHP as follows: 1. Protocol for making the blood cell fixative without lactic acid:

Prepare stock solutions of 10% polyethylene glycol (MW 8000) (Fisher),

100 mM sodium orthovanadate (Sigma), and 1.0M Beta Glycerophosphate (Calbiochem) in type 1 reagent grade water. Prepare a stock solution of

200 mM Genistein (Alexis Biochemicals) in DMSO.

Dissolve 0.05 mg DSP (Pierce) in 500 \JL DMSO. Dissolve 0.05 mg

DTBP in 500 μΐ_^ Type 1 reagent grade water.

Add 6.0 mL of 200 proof ethanol (Sigma) to 38 mL of Hanks Balanced

Salt Solution (Hyclone, Fisher).

Add 250 μΐ_^ 10% polyethylene glycol, 1000 μΐ_^ sodium orthovanadate

(Sigma), 3.75 mL Beta Glycerophosphate (Calbiochem), 50.0 μΐ_^ of 1.0

mM Staurosporine ready-to-use in DMSO (Sigma cat #S6942) and 2.5 μΐ_^

Genistein (Alexis Biochemicals), 500 μΙ_, DSP solution, and 500 μΙ_, DTBP solution to the alcohol/Hanks Balanced Salt Solution. Mix gently.

[0149] Cells can be fixed with BHP by adding the BHP to the cells at room temperature.

[0150] The fixative DSP is described in Xiang, et al., "Using DSP, a reversible cross-linker, to fix tissue sections for immunostaining, microdissection and expression profiling" (December 16, 2004) Nucleic Acids Res., 32(22) el85. DSP, which is also known as Lomant's reagent, is a cell permeant, homobifunctional, thiol-cleavable molecule, which is believed to link primary amino groups to one another in aqueous buffers at pHs between 6.5 and 8.5. Cells can be fixed, for example, as described in Xiang, et al. Briefly, fibroblast cells are grown to about 85% confluency, washed once with IX PBS buffer and then fixed with DSP (Pierce, Rockford, IL) at a final

concentration of 1 mg/ml for 5 min. Stock solutions of DSP in 100% anhydrous DMSO (Sigma-Aldrich, St Louis, MO) are prepared and stored at -80°C. The stock solution is diluted to working concentration with IX PBS immediately before use. To prevent the DSP from precipitating when the DMSO stock is added to the PBS, the latter is vortexed gently and the stock solution added dropwise.

Distributing Cells into Separate Reaction Volumes

[0151] The separate reaction volumes can be created and maintained by any available means. For example, separate reaction volumes can be made up of separate wells in a plate, separate droplets in an emulsion, separate segments of fluid in a channel, or separate reaction sites (e.g., chambers) in a microfluidic device.

[0152] At least four standard methods/sy stems are available for isolating cells for single-cell sequencing: micromanipulation, laser-capture microdissection (LCM), fluorescence-activated cell sorting (FACS), and microfluidic systems. The last two approaches can provide the high throughput needed for the large sample numbers required to reliably reflect a cell population's heterogeneity. See Livak, "Eukaryotic Single-Cell mPvNA Sequencing," Field Guidelines for Genetic Experimental Designs in High- Throughput Sequencing, Springer, ISBN 978-3-319-31348-1 (2016), which is

incorporated by reference herein for its description of single-cell mRNA sequencing. [0153] In some embodiments, the distributing step is carried so as to increase the number of separate reaction volumes that contain only one cell. Methods for achieving this are well known and include, for example, limiting dilution. In particular

embodiments, the method entails determining which reaction volumes contain a single, isolated cell. [0154] Conveniently, the method can be carried out using a microfluidic device that facilitates capture of individual cells at a plurality of capture sites. In some embodiments, the capture sites are capable of being fluidically isolated from one another, for example, after cell distribution throughout the device. In certain embodiments, the capture sites each have a capture feature that retains the cell or group of cells in the place. In some embodiments, the capture feature resides within a chamber that can be fluidically isolated from other chambers within the capture site. The device can be configured so that one or more first reagent(s) can be provided to each capture site, and one or more second reagent(s) can be provided to each capture site, wherein the second reagent(s) is/are different from the first reagent(s) and is/are provided separately from the first reagent(s). Each pair of reagents can, for example, be provided to a pair of fluidically isolatable chambers in the capture site that are distinct from one another and, optionally, distinct from the chamber used for cell capture.

[0155] In some embodiments, at least one surface of the microfluidic device is transparent to permit visualization of the cell and/or a signal from a label. In such embodiments, the method can optionally include imaging the cell-occupied capture sites before reverse transcription. Matrix- Type Microfluidic Devices

[0156] In some embodiments, methods described herein can be carried out on a matrix-type microfluidic device, which facilitates the introduction of a barcode that identifies a particular row in the device and a barcode that identifies a particular column, whereby the combination uniquely identifies a particular capture site and therefore a particular cell or group of cells from which the reaction products were derived. The method has been tested on such a device and demonstrated to work (see Example 3).

[0157] In certain embodiments, a matrix-type microfluidic device useful in the method described above includes capture sites arranged in a matrix of R rows and C columns, wherein R and C are integers greater than 1. Each capture site can include a capture feature that is capable of capturing just one cell or, where cells are to be analyzed in groups, not more than the desired number of cells for each group of cells. The capture sites can be fluidically isolated from one another after distribution of cells to the capture sites. The device also includes a set of R first input lines configured to deliver the first reagent(s) to capture sites in a particular row, and a set of C second input lines configured to deliver second reagent(s) to capture sites in a particular column, wherein this delivery is separate from the delivery first reagent(s). An illustrative device of this type is shown schematically in Figure 1 A. Figure IB illustrates the delivery of R different barcodes through the R different first input lines to the capture sites. Figure 1C illustrates the delivery of C different barcodes through C different input lines to the capture sites. In particular embodiments, all barcodes will be unique, i.e., different from every other barcode provided to the device. Figure ID illustrates that, after the reaction has been carried out, reaction products can be harvested from each column as a pool, for example, by applying a harvesting fluid to the C second input lines to push the reaction products out of outlets at one end of the input lines. (Those of skill in the art readily appreciate that the designation of "rows" and "columns" is arbitrary, since, if one rotates the device 90 degrees, the rows become columns and vice versa, in which case, reaction products would be harvested from each row as a pool.) Figure 2 shows a photograph of the device shown schematically in Figure 1.

[0158] In certain embodiments, the matrix-type microfluidic device permits analysis of individual cells or groups of cells, e.g., up to (and including) 1000. The cells can be intact or partially or fully disrupted (e.g., permeablized or lysed) after capture or isolation of one or more cells at each capture site. In the latter case, the device is configured to provide this functionality. In some embodiments, the device is transparent on at least one surface to permit imaging to visualize cell number or phenotype (e.g., where the cells or their contents have been reacted with an optically detectable label). In some embodiments, the device is configured to perform "X-Y" combinatorial barcoding, whereby reaction products may be exported in one or more pools (which may themselves be pooled) and further analyzed in multiplex (e.g., by amplification), followed by "demultiplexing" ("demux") to assign particular reaction products to particular capture sites. This type of barcoding is illustrated in Figure 1, which shows the same set of 3' barcodes ("3'BC" in Figure IB) being delivered to each column and the same set of 5' barcodes ("5'BC" in Figure 1C) being delivered to each row.

[0159] In various embodiments, a microfluidic device having from about 90 to about 10,000 separate capture sites is employed to carry out one or more of the methods described herein, particularly from about 90 to about 5,000 capture sites, more particularly from about 90 to about 2,500 capture sites, and even more particularly from about 90 to about 1,000 capture sites. In some embodiments the microfluidic device can have greater than 100, greater than 200, greater than 300, greater than 400, greater than 500, greater than 600, greater than 700, greater than 800, greater than 900, or greater than 1000 capture sites. In illustrative embodiments, the microfluidic device has a number of capture sites in the range of 300-1300, 400-1200, 500-1100, 600-1000, 700-900, or 800. [0160] In some embodiments, the capture sites have one or more reaction chambers ranging from about 0.2 nL to about 500 nL. The lower the reaction chamber volume, the higher the effective concentration of any target nucleic acid. In certain embodiments, the reaction chamber is from about 0.2 nL to about 50 nL, preferably 0.5 nL to about 5 nL, more preferably from about 1 nL to about 4 nL. In some embodiments, the reaction chamber volume is 1.5 nL, 2.0, nL, 2.5 nL, 3.0 nL, or 3.5 nL, or falls within any range bounded by any of these values.

[0161] Microfluidic devices meeting the specifications described herein, and systems employing them the carry out the disclosed method can be designed and fabricated based on the guidance herein and in prior co-owned patent publications, such as U. S. Patent Publication No. 2013/0323732, published May 12, 2013, Anderson et al. and U. S. Application No. 15/055,252, filed February 26, 2016, Conant et al. (both of which are incorporated by reference for their descriptions of single-cell analysis methods and systems). For example, the C ™ Single-Cell Auto Prep System available from Fluidigm Corporation (South San Francisco, CA) provides bench-top automation of the multiplexed isolation, lysis, and reactions on nucleic acids from single cells in an IFC™. In particular, the Ci Single-Cell Auto Prep Array IFC is a matrix-type microfluidic device that facilitates capture and highly parallel preparation of products from up to 96 individual cells. Fluidigm also sells a high-throughput version of this type of device, called the Ci mRNA Seq HT Array IFC, which facilitates capture and preparation of products from up to 800 individual cells. When used properly, each capture site within the chip captures one single cell. Sometimes, a site may capture zero, two, or more cells; however, the exact number of captured cells in each captured site of a Ci chip is easily verified at high confidence and easily documented in a microscopic picture. In certain embodiments, cells are captured and barcoding is carried out in each separate reaction volume to produce barcoded nucleic acid molecules, which are analyzed, most conveniently by DNA sequencing, be it Sanger sequencing, next-generation sequencing, or third-generation sequencing, optionally after preamplification. [0162] The Ci architecture enables the processing of discretely captured cells in combination with any multistep biochemical process that facilitates the analysis of intracellular macromolecules. Such process include, in addition to preparing templates for DNA sequencing, multiplexed protein proximity ligation assays or multiplexed protein proximity elongation assays to quantitate specific proteins, multiplexed microRNA preamplication, target-specific amplification of RNA transcripts or DNA sequences (e.g., for the purpose of genotyping polymorphic markers, such as SNPs, or otherwise analyzing genetic variations, such as copy number variations), for example, or any combination thereof. This architecture can also be exploited to culture discretely captured cells under any desired conditions, which can be modified on-chip by adding components such as, e.g., agonists or antagonists for particular receptors.

Producing Encoded DNA Templates

[0163] Once the cells are distributed into separate reaction volumes, the individual cells are permeabilized or disrupted to release RNA. Preferably, the cell are disrupted (e.g., lysed), and any conventional means of achieving this can be used, such as, for example, treatment with proteinase K. An illustrative lysis solution is 0.5% NP-40, 50 mM Tris-HCl, pH 8.4, 1 mM EDTA. This provides a pH that is optimal for reverse transcriptase, and the NP-40 does not inhibit reverse transcriptase. A brief incubation (1-2 minutes) at 65-70 °C is sufficient to lyse many mammalian cells. If harsher lysis is required, this solution can be supplemented with 30 μg/mL proteinase K, and incubation can be carried out at 50 °C for 30 minutes, followed by 70 °C for 1 minute. If proteinase K is included in the lysis solution, the proteinase K inhibitor AAPF (Cat. No. 539470, EMD Millipore) can be included in the reverse transcriptase reaction at a concentration of 0.33 mM. Commercially available lysis solutions include CelluLyser™ (TATAA

Biocenter), RealTime ready™ Cell Lysis (Roche Diagnostics), and Single Cell-to-CT™ (Life Technologies).

[0164] One advantage of working with the RNA from a single cell is that the RNA need not be purified. The volume of a single cell is on the order of picoliters. Even at the nanoliter scale typically used in microfluidics, the cell lysate can be diluted enough that enzymatic reactions can be performed directly in the lysate.

[0165] In certain embodiments, reverse transcription reagents are added to the separate reaction volumes and reverse transcription is carried out. In particular embodiments, amplification reagents are subsequently added to the separate reaction volumes and amplification is carried out.

[0166] In some embodiments, the reactions carried out in each reaction volume

(separately from every other reaction volume) produce DNA templates that encode one or more items of information regarding reaction volume identity and/or cell and/or target (e.g., in targeted transcript sequencing). The DNA templates can be recovered, optionally pooled, and subjected to DNA sequencing, which determines both the sequence of the

DNA template and the item(s) of information encoded therein. In particular embodiments, this allows the identification of particular DNA templates as having been derived from a single cell in a particular reaction volume.

[0167] One way that this identification can be achieved is by incorporating one or more barcode nucleotide sequences ("barcodes") into the DNA templates. Barcodes can be of virtually any length, although shorter barcodes (e.g., 4-6 nucleotides in length) may be preferred in some embodiments. In various embodiments, suitable barcodes are 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 15, 16, 17, 18, 19, or 20 nucleotides in length or can fall within a range bounded by any of these values, e.g., 2-10 or 3-8.

[0168] In some embodiments, each reaction can incorporate at least one unique molecular identifier ("UMI"), which is a nucleotide sequence that uniquely identifies the molecule into which it is incorporated. Kivioja et al. (2012) describe the process of counting the absolute number of molecules using UMIs. In some embodiments, a UMI is a random sequence label (generally N4 to N10) that is attached to a molecule to be sequenced prior to any amplification step. For conventional RNA Seq, quantification is inferred from the total number of reads that map to each particular transcript. With UMIs, quantification is done by counting the number of unique UMIs observed for each particular transcript regardless of how many times each UMI is read. For 5 '-end-tagging, the UMI can be incorporated into the template switch oligo. For 3 '-end-tagging, the UMI can be incorporated into, e.g, an oligo dT primer. In terms of the number of random nucleotides to use for the UMI, N5 should be adequate for most mammalian cells. For larger cells, which would be expected to have a greater number of total transcripts, the use of N6 or N7 might be advisable.

[0169] In variations of such embodiments, the reaction incorporates one or more barcodes in addition to one or more UMIs (e.g, two barcodes to encode row and column of a matrix-type microfluidic device plus a UMI to encode the molecular identity of the original RNA template). UMIs can be any length, and the length required for a given analysis will increase as the number of unique molecules to be identified increases. In various embodiments, suitable UMIs are 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 15, 16, 17, 18, 19, or 20 nucleotides in length or can fall within a range bounded by any of these values, e.g., 2-10, 3-8, 4-7, or 5-6. Single-Cell Sequencing of T-Cell Receptor or Immunoglobulin Transcripts

[0170] In certain embodiments the above-described methods can be carried out to sequence T-cell receptor (TCR) or immunoglobulin (Ig) transcripts in single T or B cells, respectively. In some embodiments, the T or B cells are activated in order to increase the levels of TCR or Ig transcripts available for analysis. T or B cells can be activated using any method known to those of skill in the art. For example, T helper (T_H) cells are activated when presented with peptide antigens by MHC class II molecules, which are expressed on the surface of antigen-presenting cells (APCs). Cytotoxic T cells (Tc cells or CTLs) are activated by binding to antigen associate with MHC class I molecules, which are present on the surface of all nucleated cells. B cells are activated by antigen binding to cell surface receptors.

TCR Transcript Production

[0171] Each T lymphocyte expresses TCR that binds antigen. Antigen binding is a key event in the cascade of the immune response. Most TCRs are a heterodimer consisting of a- and β-chains encoded by the TRA and TRB genes, respectively. A small percentage of TCRs are a heterodimer consisting of γ- and δ-chains encoded by the TRG and TRD genes, respectively.

[0172] Diversity in antigen recognition is generated by somatic V(D)J

recombination occurring at the TRA, TRB, TRG, and TRD loci. For ease of discussion, the methods described below focus on sequencing transcripts from the TRA and TRB loci, respectively, although these methods are equally applicable to transcripts from the TRG and TRD loci. At single-cell resolution, determining the sequence of an appropriate portion of TRA and TRB serves as a unique identifier of a T cell's ancestry because it is likely that any two T cells expressing the same TCRaP pair arose from a common T-cell clone. This disclosure describes improved methods for targeted single-cell transcript sequencing to determine T-cell clonal identity. The improved methods are conveniently carried out on microfluidic devices, such as those described herein, e.g., like the Ci Single- Cell Auto Prep Array™ IFC or the Ci mRNA Seq HT Array IFC, both sold by Fluidigm.

[0173] TRA and TRB have a complex structure diagrammed in Figure 3. In any given T cell, V(D)J recombination has joined one of many Variable (V), Joining (J), and Diversity (D) segments to a Constant (C) segment at both TRA and TRB to create the genes that encode the a- and β-chains expressed by that cell. In this process, non- templated nucleotides (N) are inserted in a segment denoted CDR3. The sequence of CDR3 is the best marker to use to determine clonal identity. By using primers specific for V and C, the segment containing CDR3 can be amplified and sequenced. Because there are many different V segments, many different TRAV and TRBV primers need to be included in the PCR mix. For any given cell, though, typically only one TRAV and one TRBV primer participate in the PCR.

Preparing DNA Templates from TRCs by Reverse Transcription and Amplification

[0174] The Summary above describes methods for preparing DNA templates for the two types of TCR transcripts from single T cells in general terms. These methods enable the use of just one amplification, which is much simpler than any available protocol for preparing TCR templates from single cells. In addition, these methods separate the reverse transcription step from the amplification step, which provides greater specificity. In some embodiments, specificity is further enhanced by primer design as described below. [0175] These methods can be implemented in a variety of ways that are within the level of skill in the art. In specific embodiments, the goal of performing single-cell TCR sequencing (scTCRseq) on the Ci mRNA Seq HT Array IFC is to generate 800 single-cell libraries with appropriate barcodes and sequencing adapters such that the material harvested from the IFC can be pooled, cleaned up with beads, and directly loaded onto an Illumina sequencer. The first step after capturing cells is to use the 20 column inlets of the Ci to add reverse transcriptase (RT) reagents to the captured cells. Referring to Figure 4 A, the RT reagents include two RT primers, a first RT having a portion specific for the Constant segment (CS) of an RNA encoding a first chain of a T-cell receptor, and a second RT primer having a portion specific for the Constant segment (CS) of an RNA encoding the second chain of the T-cell receptor. The first and second RT primers each additionally include a first nucleotide tag comprising a first primer binding site for a first DNA sequencing primer, the first nucleotide tag being 5' of the CS-specific portion.

[0176] The RT reagents include an additional type of primer, termed a "first barcode primer," which includes from 3' to 5', a portion specific for the first primer binding site, a first barcode nucleotide sequence, and a first sequencing adaptor. A different first barcode primer, bearing a different barcode is added in each of the columns, as shown in Figure 4A (so that each barcode uniquely identifies a particular column). The TRAC and TRBC sequences are complementary to the TCR a and β mRNAs and serve as specific primers during the reverse transcriptase reaction. In the first barcode primer illustrated in Figure 4A, Rd2 is the Illumina TruSeq Read 2 sequence, BCl-20 refers to 20 different barcodes, and P7 is one of the Illumina sequencing adaptors. The P7.BC.Rd2 primer does not participate in the reverse transcription reaction but is there so that barcodes are added to this end of the amplified segments during the amplification step.

[0177] Illustrative sequences designed for the two RT primers (TRAC and TRBC) and the first barcode primers (P7.BCl-20.Rd2, i.e., 20 different first barcode primers) are as follows:

AC.A1: gtgactggagttcagacgtgtgctcttccgatctTGAATAGGCAGACAGACTTGTCA

(SEQ ID NO:l)

BC.A1: gtgactggagttcagacgtgtgctcttccgatctCACACCAGTGTGGCCTTT

(SEQ ID NO:2)

P7.BC.Rd2x: CAAGCAGAAGACGGCATACGAGAT [barcode seq] GTGACTGGAGTTCAGACGT (SEQ ID NO:3)

[0178] After adding these RT reagents, the cells are lysed, e.g., by addition of a proteinase K/NP-40 mixture and incubation at elevated temperature (50°C). Then, additional components required for reverse transcription (e.g., buffer, dNTPs, proteinase K inhibitor, reverse transcriptase, and Ci™ Loading Reagent (Fluidigm 100-5170)) are transferred from a common reservoir to the captured cells and reverse transcription is carried out. [0179] The next step of this illustrative method is to use the 40 row inlets of the Ci to add the amplification reagents. Referring to Figure 4B, the amplification reagents include two sets amplification primers, a set of first amplification primers, each having a portion specific for a different Variable segment (VS) of the RNA encoding the a chain of the T-cell receptor, and a set of second amplification primers, each having a portion specific for a different Variable segment (VS) of the RNA encoding the β chain of the T- cell receptor. The first and second amplification primers each additionally include a second nucleotide tag comprising a second primer binding site for a second DNA sequencing primer, the second nucleotide tag being 5' of the VS-specific portion.

[0180] The amplification reagents include an additional type of primer, termed a "second barcode primer," which includes from 3' to 5', a portion specific for the second primer binding site, a second barcode nucleotide sequence, and a second sequencing adaptor. A different second barcode primer, bearing a different barcode is added in each of the rows, as shown in Figure 4B (so that each barcode uniquely identifies a particular row). The TRAV and TRBV sequences are specific for the Variable segments that may be in the TCR a and β mRNAs. Because there are many different Variable segments, multiple TRAV and TRBV primers must be used at each capture site. Rdl is the Illumina TruSeq Read 1 sequence, BCl-40 refers to 40 different barcodes, and P5 is one of the Illumina sequencing adaptors. Illustrative sequences designed for the two sets of amplification primers (38 TRAV and 31 TRBV) and the second barcode primers (P5.BC1- 40. Rdl, i.e., 40 different second barcode primers) are as follows:

P5.BC.Rdlx:AATGATACGGCGACCACCGAGATCT [barcode seq] ACACTCTTTCCCTACACGA

(SEQ ID NO:73)

[0181] The amplification reagents include, in addition to the above-described primers, other components for performing an amplification reaction (e.g., PreAmp Master Mix (Fluidigm 100-5581) and Ci™ Loading Reagent (Fluidigm 100-5170)).

[0182] The RT primers (TRAC and TRBC) and first barcode (P7.BC.Rd2) primers are still present from the RT step and are utilized during the amplification step. The combinatorial arrangement of 40 barcodes on this end of the amplified segment and 20 barcodes on the other end provides each of the 800 cells with a unique barcode identity.

[0183] The amplicons harvested in the 20 harvest ports should have the structure:

P5-BC-Rdl-V(a or p)-CDR3(a or p)-C(a or p)-Rd2-BC-P7 [0184] Because each cell has its own combinatorial barcode, the 20 harvests can be pooled, purified using beads or some other method, and loaded on an Illumina sequencer to determine CDR3 sequences for each of the cells. By using different sets of 20 barcodes in the P7.BC.Rd2 primers for additional IFCs, the libraries from multiple IFCs can be pooled together before sequencing.

[0185] In particular embodiments, the amplification reaction is PCR, preferably carried out under hot-start conditions to improve amplification specificity.

Super Selective Primers

[0186] In some embodiments of the methods described herein, amplification specificity can be further enhanced using Super Selective (also termed "Loop") primers (Marras S, Vargas-Gold D, Tyagi S, Kramer FR, PCT/US2014/015351, which is incorporated by reference for it description of Super Selective primers). An illustrative Super Selective primer is shown diagrammatically in Figure 5. The Anchor segment (typically 18-35 nucleotides) keeps the primer stably bound to the template. Because of the unpaired Insert/Loop region, the Bait segment (typically 5-10 nucleotides) hybridizes to the template only transiently. Occasionally, during this hybridization time, the primer is extended by polymerase. Any mismatch in the Bait segment drastically reduces the occupancy time for any hybridization event and thus drastically reduces the probability that the primer will be extended. Therefore, the only templates that should participate in primer extension are those that have the Anchor sequence and a close-by, perfectly- matched Bait sequence. An illustrative Super Selective primer can have a 24-nucleotide Anchor segment, a 14-nucleotide Insert, and a Bait sequence of 5-7 nucleotides. The Loop formed by unpaired template nucleotides need not be the same length as the Insert in the primer. For example, the Loop opposite a 14-nucleotide Insert might only contain 10 nucleotides.

[0187] For TCR sequencing, the same procedure outlined above can be used, with the modification that the TCR-specific primers include Inserts that do not hybridize to the TCR transcripts. Figure 6A illustrates the use of Super Selective primers in reverse transcription, whereas Figure 6B illustrates their use in amplification. [0188] Referring to Figure 6A, the CS-specific portion of the first RT primer includes an insert that does not anneal to the CS of the RNA encoding the a chain of the TCR, and/or the CS-specific portion of the second RT primer includes an insert that does not anneal to the CS of the RNA encoding the β chain of the TCR. The inserts are flanked by a 5' anchor sequence and a 3' bait sequence, both of which anneal to the CS.

Preferably, both first and second RT primers include inserts; in some embodiments, the inserts are similar or the same with respect to sequence and/or length. [0189] Referring to Figure 6B, the VS-specific portion of the plurality of first amplification primers includes an insert that does not anneal to the VS of the RNA encoding the a chain of the TCR, and/or the VS-specific portion of the plurality of second amplification primers includes an insert that does not anneal to the VS of the RNA encoding the β chain of the TCR. The inserts are flanked by a 5' anchor sequence and a 3' bait sequence, both of which anneal to the VS. Preferably, both first and second amplification primers include inserts; in some embodiments, the inserts are similar or the same with respect to sequence and/or length.

[0190] In various embodiments, the first and second RT primers can include inserts, with standard primers (i.e., without inserts) used for amplification; standard primers can be used for reverse transcription in conjunction with amplification primers that include inserts; or both sets of primer can include inserts. As an example of the latter embodiment, the sequences of illustrative Super Selective forms of RT primers (1 TRAC and 1 TRBC) and amplification primers (41 TRAV and 31 TRBV) are given below:

AC.LLgtgactggagttcagacgtgtgctcttccgatctCGGTGAATAGGCAGACAGACTTGTCAactctaa ctGAGTCT (SEQ ID NO: 147)

BC.LLgtgactggagttcagacgtgtgctcttccgatctGGCACACCAGTGTGGCCTTTatctccatctGGA GATC ( SEQ ID NO: 148)

[0191] Using these Super Selective primers, the resulting amplicons still have the same general structure (P5-BC-Rdl-V(a or p)-CDR3(a or p)-C(a or p)-Rd2-BC-P7) and can be pooled and sequenced as outlined above.

[0192] In certain, preferred embodiments of the above-described methods, Super

Selective primers are used for amplification, but not for reverse transcription, as this primer combination, rather surprisingly, enhances production of the desired products.

[0193] In some embodiments, the concentration of the "outer" barcode primers exceeds that of the "inner" target-specific primers. For example, the outer, barcode primers can be used in a 100-fold, 75-fold, 50-fold, 40-fold, 30-fold, 20-fold, 10-fold, or 5-fold excess over the inner, target-specific primers or in a degree of excess falling within any range bounded by any of these values, e.g., 30-fold to 50-fold. In implementations on the Ci chip, for example, good results are obtained using inner, target-specific primers at 50 nM and outer, barcode primers at 2 μΜ, which is a 40-fold excess. Preparing DNA Templates by Reverse Transcription, Template Switching, and Amplification

[0194] In some embodiments, DNA templates can be prepared from single cells using template switching. This well-known technique is outlined in Zhu YY, Machleder EM, Chenchik A et al (2001) Reverse transcriptase template switching: a SMART approach for full-length cDNA library construction. Biotechniques 30:892-897, which is incorporated herein by reference for this description. The Summary above discusses methods for preparing the DNA templates described herein using template switching in general terms. Figure 7 illustrates the application of an embodiment of this technique to generate a DNA template for a particular target transcript (targeted transcript sequencing). The number and identity of the target-specific primers is determined by the mRNAs to be sequenced. These target-specific primers are typically positioned close to the 5 'end of their respective mRNAs. In the Figure 7, the RT primer is shown with an insert, but primers without inserts could be used as well. Reverse transcriptase extends the target- specific primers then, taking advantage of the tempi ate- switch mechanism, copies the information in the tempi ate- switch oligonucleotide (Rdl-UMI-GGG). By incorporating the UMI, more accurate quantitative information for the targeted mRNAs can be obtained. "GGG" denotes an oligo-riboG sequence. Most typically, three guanines are used in template switch oligonucleotides, but any number that works can be used. One or more of the guanines can be a locked nucleic acid (LNA).

[0195] This approach can be adapted for whole transcriptome sequencing by using an RT primer in which the target-specific portion is replaced with a portion specific for the poly-A tail of mRNA molecules (e.g., poly-dT). In either case, templates can be amplified and barcodes and sequencing adaptors added using barcode primers such as those described above (e.g., P5-BC-Rdl and Rd2-BC-P7), followed by pooling and sequencing as described above. If whole transcriptome sequencing is performed, TCR and Ig sequences can be determined from the resulting data set using standard bioinformatic methods, thus avoiding the need for the specialized single-cell TCR or Ig transcript sequencing methods described above. Illustrative bioinformatics methods are found in Nat Methods. 2016 Apr; 13(4):329-32. doi: 10.1038/nmeth.3800. Epub 2016 Mar 7. T cell fate and clonality inference from single-cell transcriptomes. Stubbington MJ, Lonnberg T, Proserpio V, Clare S, Speak AO, Dougan G, Teichmann SA, which is incorporated by reference herein for this description. In an intermediate approach, template switching can be carried out to obtain templates corresponding to the whole transcriptome, followed by TCR- or Ig-transcript specific amplification.

[0196] Alternatively, either or both of targeted transcript sequencing or transcriptome sequencing can be combined with the single-cell TCR or Ig transcript sequencing methods described above.

[0197] In some embodiments, using the tempi ate- switch oligonucleotide provides another mechanism for obtaining TCR/Ig sequence information. In this case, the reverse transcription step would be performed with TRAC primer, TRBC primer, optionally target-or poly-A-specific primers for additional mRNAs, P7.BC.Rd2 primer, and a tempi ate- switch oligonucleotide. The TRAC and TRBC primers would be extended to the 5' ends of the TCR mRNAs, followed by template switching. For the amplification step, all that needs to be added is amplification reagents including the P5.BC.Rdl primer. There would be no need to add all the different TRAV and TRBV primers. If necessary, the number of sequencing cycles could be increased to be certain of obtaining CDR3 sequence.

[0198] In particular embodiments, the template switching method is carried out in a microfluidic device, as described above. In this case, the RT primer can be delivered to capture sites via one set of the input lines, and the 5' oligonucleotide can be delivered to the capture sites via the other set of input lines. Ig Transcript Production

[0199] Natural antibodies typically have two identical Ig heavy chains. Each of the Ig heavy chain genes is assembled from multiple possible V, D, J, and C segments by somatic recombination in a manner similar to that described above for TCRs. An example of heavy chain rearrangement is shown in Figure 8 A. There are multiple different C segments (encoding constant regions) that give rise to different isotypes. Natural antibodies also typically have two identical Ig light chains, which can either be κ light chains or λ light chains. Each of the Ig light chain genes is assembled from multiple possible V, J, and C segments by somatic recombination. An example of κ light chain rearrangement is shown in Figure 8B. As those of skill in the art readily appreciate, the strategies described herein with respect to sequencing TCR transcripts can also be carried out to sequence Ig transcripts simply by designing primers that will anneal to the V and C regions of Igs, rather than the V and C regions of TCRs. DNA Sequencing

[0200] In some embodiments, the methods described herein include subjecting the

DNA templates produced to DNA sequencing, e.g., Sanger sequencing, next-generation sequencing (e.g., bridge sequencing), or third-generation sequencing. In variations of such embodiments, the sequences obtained from DNA sequencing can be identified as having been derived from a particular capture site based on one or two barcodes.

[0201] As discussed above, reaction products from a particular row or column of a matrix-type microfluidic device can be exported as a pool. Any subsequent

characterization of reaction products, such as DNA sequencing, can be carried out on individual exported pools. However, it is also contemplated that the pools themselves can be pooled prior to further characterization. In this case, the DNA template(s) from each separate capture site in the microfluidic device is typically distinct, which is readily achieved, e.g., by using two barcode sequences to encode the row and column location of the capture site in the microfluidic device. [0202] In some embodiments, it is advantages to further amplify reaction products before carrying out DNA sequencing. Where all reaction products bear common end sequences, all reaction products to be sequenced can be pooled and amplified together, using primers specific for the end sequences (i.e., in the illustrative embodiment of preparing TCR templates described above, P5 and P7 primers would be used). Primer Combinations

[0203] Any of the primers or oligonucleotides described above may be combined to form primer combinations. Typically, primer combinations include 2, 3, 4, 5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100 or more primers or oligonucleotides that are used together in a method such as those described herein. Individual primers can be packaged individually or packaged together, when used as a set in a reaction. For example, when the primers are to be used for sequencing TCRs or Igs, the set of all possible Variable segment-specific primers are conveniently packaged together.

Kits

[0204] Kits according to the invention can include one or more reagents useful for practicing one or more methods described herein. A kit generally includes a package with one or more containers holding the reagent(s), as one or more separate compositions or, optionally, as admixture where the compatibility of the reagents will allow. The kit can also include other material(s) that may be desirable from a user standpoint, such as a buffer(s), a diluent(s), a standard(s), and/or any other material useful in sample processing, washing, or conducting any other step of the assay. In specific embodiments, the kit includes one or more matrix-type microfluidic devices and/or primers/oligonucleotides discussed above or combinations thereof.

[0205] It is understood that the examples and embodiments described herein are for illustrative purposes only and that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be included within the spirit and purview of this application and scope of the appended claims.

[0206] In addition, all other publications, patents, and patent applications cited herein are hereby incorporated by reference in their entirety for all purposes.

EXAMPLES

Example 1: Use of Super Selective Primers for T-Cell Receptor Template

Production

[0207] RT primer set 1 (RT-No Loop) is a mix of AC. Al and BC. Al . RT primer set 2 (RT-Loop) is a mix of AC.Ll and BC.Ll . V primer set 1 (V-No Loop) is a mix of AV01.A1, AV02.A1, AV03.A1, AV04.A1, AV05.A1, AV06.A1, AV07.A1, AV08.A11, AV08.A12, AV08.A13, AV09.A1, AV10.A1, AV12.A1, AV13.A11, AV13.A12,

AV14.A1, AV16.A1, AV17.A1, AV18.A1, AV19.A1, AV20.A1, AV21.A1, AV22.A1, AV23.A1, AV24.A1, AV25.A1, AV26.A11, AV26.A12, AV27.A1, AV29.A1, AV30.A1, AV34.A1, AV35.A1, AV36.A1, AV38.A1, AV39.A1, AV40.A1, AV41.A1, BV02.A1, BV03.A1, BV04.A1, BV05.A11, BV05.A12, BV06.A11, BV06.A12, BV07.A11,

BV07.A12, BV07.A13, BV07.A14, BV09.A1, BV10.A11, BV10.A12, BV10.A13, BV11.A1, BV12.A11, BV12.A12, BV13.A1, BV14.A1, BV15.A1, BV16.A1, BV18.A1, BV19.A1, BV20.A1, BV24.A1, BV25.A1, BV27.A1, BV28.A1, BV29.A1, and BV30.A1. V primer set 2 (V-Loop) is a mix of AVOl .Ll, AV02.L1, AV03.L1, AV04.L1, AV05.L1, AV06.L1, AV07.L2, AV08.L11, AV08.L12, AV08.L13, AV08.L14, AV9.L11, AV9.L12, AV10.L1, AV12.L11, AV12.L12, AV12.L13, AV13.L11, AV13.L12, AV14.L1,

AV16.L1, AV17.L1, AV18.L1, AV19.L1, AV20.L1, AV21.L1, AV22.L1, AV23.L1, AV24.L1, AV25.L1, AV26.L11, AV26.L12, AV27.L1, AV29.L1, AV30.L1, AV34.L1, AV35.L1, AV36.L1, AV38.L1, AV39.L1, AV40.L1, AV41.L1, BV02.L1, BV03.L1, BV04.L1, BV05.L1 1, BV05.L12, BV06.L1 1, BV06.L12, BV07.L1 1, BV07.L12, BV07.L13, BV07.L14, BV09.L1, BV10.L1 1, BV10.L12, BV10.L13, BV1 1.L1,

BV12.L1 1, BV12.L12, BV13.L1, BV14.L1, BV15.L1, BV16.L1, BV18.L1, BV19.L1, BV20.L1, BV24.L1, BV25.L1, BV27.L1, BV28.L1, BV29.L1, and BV30.L1. In the experiment described below, these primers were used in the following combinations:

1 : No Loop RT primers with No Loop V primers

2: Loop RT primers with No Loop V primers

3 : No Loop RT primers with Loop V primers 4: Loop RT primers with Loop V primers

[0208] Reverse transcriptase reactions contained 100 ng total RNA from human peripheral blood mononuclear cells (PBMCs), 220 nM each RT primer (RT-No Loop or RT-Loop), 15 mM Tris-HCl, pH 8.4, 75 mM KC1, 10 mM MgCl₂, 5% (v/v) glycerol, 10 mM dithiothreitol, 0.8 units^L RNaseOUT™ (Thermo Fisher Scientific), 500 μΜ each dNTP, and 8 units/ Superscript® II reverse transcriptase (Thermo Fisher Scientific) in a volume of 1 1.5 piL. These reactions were incubated at 25°C for 10 minutes, 42°C for 60 minutes, then placed on ice. To each reaction, 14 μΐ_^ of a preamp/primer mix was added so that the final concentrations were 1 x PreAmp Master Mix (Fluidigm) and 50 nM each V primer (V-No Loop or V-Loop). The conditions for PCR were 95°C for 5 minutes, 20 cycles of 96°C for 5 seconds and 60 °C for 6 minutes, followed by 4°C hold. Ten microliters of a mix consisting of 2 μΐ_^ Exonuclease I (New England BioLabs), 1 10x Exonuclease I reaction buffer, 7 water were added to each reaction. These reactions were incubated at 37°C for 30 minutes, 80°C for 15 minutes, then placed on ice. Ninety microliters 10 mM Tris-HCl, pH 8.0, 1 mM EDTA were added to each reaction. For purification, 50 of each sample was mixed with 50 μΐ_^ AMPure XP Beads (Beckman Coulter), mixed well, and kept at room temperature for 5 minutes. Each sample tube was put on a magnet plate for 5 minutes, then the supernatant was discarded. The beads were washed with newly made 70% ethanol twice. The beads were dried at room temperature for at least 7 minutes, then each was resuspended in 30 μΐ_^ 10 mM Tris-HCl, pH 8.0, 0.1 mM EDTA. After sitting at room temperature for 5 minutes, each sample tube was put on a magnet plate for 2 minutes and each supernatant was transferred to a fresh tube. A Ι-μΙ_^ aliquot of each sample was analyzed on the 2100 Bioanalyzer system using the High Sensitivity DNA kit (Agilent).

[0209] Results are shown in Figure 9. Desired DNA fragments derived from the

TCR transcript are in the size range 250 to 350 nucleotides. Undesired, nonspecific fragments are in the size range 100 to 220 nucleotides. The best ratio of desired fragments to undesired fragments is observed for combination 3, which is No Loop RT primers and Loop V primers.

Example 2: Fixation Enhances the Recovery of DNA Templates Produced from RNA Transcripts

[0210] Human peripheral blood mononuclear cells (PBMCs) were enriched for

CD4⁺ cells using magnetic bead purification. Pelleted cells, resuspended in Dulbecco's Modified Eagle Medium (DMEM) containing IL-2 and IL-7, were activated by addition of 2 μg/mL phytohemagglutinin (PHA). After five days of activation, cells were washed and enriched again for CD4⁺ cells using magnetic bead purification. Two aliquots of these cells were pelleted. The cells designated fresh were resuspended in phosphate-buffered saline (PBS). The cells designated fixed were resuspended in PBS containing 1 mg/mL 3,3'-dithiodipropionic acid di(N-hydroxysuccinimide ester) (DSP, Sigma- Aldrich). Eight hundred fresh or fixed cells were pelleted and resuspended in 5.5 μΐ_^ Lysis Mix consisting of 50 mM Tris-HCl, pH 8.4, 40 mM dithiothreitol, 0.5% NP-40 detergent, 8 units/mL proteinase K (New England BioLabs), 230 nM each RT-No Loop primer, 9.2 μΜ

P7.rcCBC001.Rd2x primer

(CAAGCAGAAGACGGCATACGAGATcgcgactgaaGTGACTGGAGTTCAGACGT) (SEQ ID NO: 149), and 1 χ CI Loading Reagent (Fluidigm). These reactions were incubated at 50°C for 30 minutes, 72°C for 1 minute, then placed on ice. To each reaction, 6 piL of a RT mix was added so that the final concentrations were 15 mM Tris- HCl, pH 8.4, 75 mM KC1, 10 mM MgCl₂, 5% (v/v) glycerol, 360 μΜ AAPF proteinase K inhibitor (EMD Millipore), 500 μΜ each dNTP, 8 units/piL Superscript^® II reverse transcriptase, and 1 χ CI Loading Reagent. These reactions were incubated at 42°C for 60 minutes, 85°C for 5 minutes, then placed on ice. To each reaction, 14 JL of a

preamp/primer mix was added so that the final concentrations were 1 ^χ PreAmp Master Mix (Fluidigm) and 50 nM each V-Loop primer, and 2 μΜ P5.Rdlx primer

(AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGA) (SEQ ID NO: 150), and 1 x CI Loading Reagent. The conditions for PCR were 95°C for 5 minutes, 20 cycles of 96°C for 20 seconds and 60°C for 6 minutes, followed by 4°C hold. Twenty- five microliters 10 mM Tris-HCl, pH 8.0, 1 mM EDTA were added to each reaction. For purification, each sample was mixed with 35 μΐ_^ AMPure XP Beads (Beckman Coulter), mixed well, and kept at room temperature for 5 minutes. Each sample tube was put on a magnet plate for 5 minutes, then the supernatant was discarded. The beads were washed with newly made 70% ethanol twice. The beads were dried at room temperature for at least 7 minutes, then each was resuspended in 20 nL 10 mM Tris-HCl, pH 8.0, 1 mM EDTA. After sitting at room temperature for 5 minutes, each sample tube was put on a magnet plate for 2 minutes and each supernatant was transferred to a fresh tube. The following was added to each sample: 2.5 μΐ_^ 10 μΜ P5 primer

(AATGATACGGCGACCACCGAGATCTACAC) (SEQ ID NO: 151), 2.5 μΐ. 10 μΜ P7 primer (CAAGCAGAAGACGGCATACGAGAT) (SEQ ID NO: 152), and 25 μΐ, 2x Q5^® Hot Start HiFi PCR Master Mix (New England BioLabs). The conditions for PCR were 98°C for 30 seconds, 18 cycles of 98°C for 10 seconds and 62 °C for 2 minutes, 75°C for 2 minutes, followed by 4°C hold. For purification, each sample was mixed with 35 AMPure XP Beads (Beckman Coulter), mixed well, and kept at room temperature for 5 minutes. Each sample tube was put on a magnet plate for 5 minutes, then the supernatant was discarded. The beads were washed with newly made 70% ethanol twice. The beads were dried at room temperature for at least 7 minutes, then each was resuspended in 20 10 mM Tris-HCl, pH 8.0, 1 mM EDTA. After sitting at room temperature for 5 minutes, each sample tube was put on a magnet plate for 2 minutes and each supernatant was transferred to a fresh tube.

[0211] A l-μΐ. aliquot of each sample was analyzed on the 2100 Bioanalyzer system using the High Sensitivity DNA kit. Results are shown in Figure 10. Because of the addition of P5, P7, and other sequences, the desired DNA fragments derived from the TCR transcript are in the size range 320 to 400 nucleotides. Results show that the fixed cells gave a much better yield of desired fragments than fresh cells.

Example 3: Single-Cell T-Cell Receptor Template Production Using CI mRNA Seq

HT Array IFC

[0212] In the procedure that follows, any reference to standard protocol refers to

"Using the CI High-Throughput IFC to Generate Single-Cell cDNA Libraries for mRNA Sequencing" (Fluidigm PN 100-9886). This document also provides the details for instrument operation.

Stocks

5x RT Buffer

75 mM Tris-HCI, pH 8.4

375 mM KCI

50 mM MgCI₂

25% glycerol (50%, Teknova G1796, G1797, G1798, G1 799)

18 mM AAPF Proteinase K inhibitor

Dissolve 10 mg AAPF Proteinase K inhibitor in 1 mL DMSO

Store at -20°C

C1 Mixes

2Qx Column Mixes

Mix:

187.5 μΙ_ Cell Rinsing Reagent (Fluidigm)

12.5 μΙ_ C1 Loading Reagent (Fluidigm 100-5170)

Dispense 8 μί to each of 20 wells

Add 2 iL 100 μΜ P7.rcCBC001 -020.Rd2x, with a different barcode added to each well

During the set up for the Prime step, transfer 8 of each to one of the 20 Inlets labeled as "Column inlets." Addition of the barcode mix will actually happen after cell staining.

Lysis Mix (163.5 μί)

15 μΐ 1 Μ Tris-HCI, ρΗ8.4

12 μΐ 1 Μ dithiothreitol

15 μΐ 1 0% ΝΡ-40 (0.5% in final rxn)

6.9 μΐ 10 μΜ each RT-No Loop primer

3 μΐ 800 units/mL (20 mg/mL) proteinase K (8 units/mL in final rxn)

8.2 μΐ C1 Loading Reagent (Fluidigm 100-5170)

103.4 μΐ H₂0

Transfer 130 μί Lysis Mix to Inlet L2 (big reservoir) during the set up for the Final Reaction.

RT Mix (26 μ|_)

10 μί 5χ RT Buffer

1 [iL 25 mM each dNTPs

1 μί 1 8 mM AAPF Proteinase K inhibitor (360 μΜ in final rxn)

1 [it 200 units^L Superscript® II Reverse Transcriptase

1 .3 μΐ C1 Loading Reagent (Fluidigm 100-5170)

1 1 .7 μί H₂0

Transfer 20 RT Mix to Inlet RT during the set up for the Final Reaction.

40 x Row Mixes

PCR Mix (225 μ|_)

130 μί 5χ PreAmp Master Mix (Fluidigm 1 00-5581 ) (1 χ in final rxn)

65 500 nM each V-Loop primer (50 nM each primer in final rxn)

18 μί C1 Loading Reagent (Fluidigm 100-5170)

12 μί H₂0

Dispense 5 PCR Mix to each of 40 wells

Add 2.9 μί 10 μΜ P5.rcRBC001 -040.Rd1 x, with a different barcode added to each well

Transfer 5 of each to Inlets labeled as "Row Barcode" during the set up for the Final Reaction .

Harvest Mix (200 μί)

194 μί C1 Harvest Reagent (Fluidigm 1 00-7081 )

6 μί 10 mg/mL yeast tRNA (Thermo Fisher Scientific AM71 1 9) 180 μΙ_ to each of the Harvest Inlet during the set up for the Final Reaction.

C1 Steps

1. Prime:

a. Pipette mixes following the standard prime loading map. For Valve Fluid, add Tween 20 to a final concentration of 0.05% and load 250 μΙ_ each into Acc1 and Acc2.

b. Pipette 8 μΙ_ each column mix to 20 column inlets

c. Run prime script: "mRNA Seq Prime R&C barcode". ~51 min

2. Cell Loading:

a. Replace liquid following the standard protocol

b. Keep the column mixes in the 20 column inlets

c. Using activated, DSP-fixed CD4+ T cells prepared as described in Example 2, dispense 20 cell suspension to each of the two cell inlets

d. If you want cell load and stain: Run script "mRNA Seq HT: Cell Load &Stain with Column barcode" ~39min

e. If you only want cell load and wash: Run script "mRNA Seq HT: Cell load with column barcode" ~9 min

3. Final Reaction:

a. Replace cell-related liquid following the standard protocol.

b. Pipette Lysis Mix into L2 zone (up right corner), ~130

c. Pipette RT Mix into RT inlet, -20 μί

d. Pipette PCR Mixes containing barcode primers into 40 row inlets, ~5 each

e. Check the entire reagent, such as harvest reagent, before inserting IFC into machine. f. Run script "scTCR Seq HT" ~6 hr. Set time delay as usual.

Final Reaction consists of:

1 . Lysis Mix to chambers 0+1

50°C, 30 min (1800 sec)

72°C, 1 min (60 sec)

10°C, 1 min (60 sec)

2. RT Mix to chambers 0+1 +2

42°C, 60 min (3600 sec)

85°C, 5 min (300 sec)

3. PCR Mix to chambers 0+1 +2+3

70°C, 30 min (1800 sec)

95°C, 5 min (300 sec)

20 cycles: 96°C, 20 sec

60°C, 6 min (360 sec)

4. Harvest

Transfer the 20 Harvest samples to a 96-well plate.

Pool 5 of each sample.

Clean up with 80 μί AMPure XP Beads. Elute with 50 μί 10 mM Tris-HCI, pH 8.0, 0.1 mM EDTA. Clean up with 40 μΐ AMPure XP Beads. Elute with 20 μΐ 10 mM Tris-HCI, pH 8.0, 0.1 mM EDTA. Add:

2.5 iL 10 μΜ P5 Primer

2.5 [iL 10 μΜ P7 Primer

25 [it Q5® Hot Start HiFi PCR Master Mix

98°C, 30 sec 20 cycles: 98°C, 10 sec

62°C, 2 min

75°C, 2 min

4°C, hold

Clean up with 35 μΙ_ AMPure XP Beads. Elute with 20 μΙ_ 10 mM Tris-HCI, pH 8.0, 0.1 mM EDTA.

[0213] The sequences of primers not previously given are below:

P7. rcCBCOOl i CAAG C AGAAGAC G G CAT AC GAGAT cgcgactgaaGT GAC T G GA ;(SEQ ID NO 149) ; .Rd2x i GTTCAGACGT

\ P7. rcCBC002 i CAAGC AGAAGAC G G CAT AC GAGAT agcatcgataGT GAC T G GA ;(SEQ ID NO: 153) ; i .Rd2x i GTTCAGACGT

P7. rcCBC003 i CAAG C AGAAGAC G G CAT AC GAGAT cgacacatggGT GAC T G GA ;(SEQ ID NO 154) ; .Rd2x i GTTCAGACGT

i P7. rcCBC004 i CAAGC AGAAGAC G G CAT AC GAGAT cgactacgcaGT GAC T G GA ( SEQ ID NO- 155) ; \ .Rd2x GTTCAGACGT

P7. rcCBC005 i CAAG C AGAAGAC G G CAT AC GAGAT cactgctgagGT GAC T G GA i(SEQ ID NO 156) ; .Rd2x GTTCAGACGT

i P7.rcCBC006 i CAAGC AGAAGACG G CAT AC GAGAT tcgctgtacaGTGACTGGA i(SEQ ID NO- 157) i I .Rd2x GTTCAGACGT

P7. rcCBC007 i CAAG C AGAAGAC G G CAT AC GAGAT cgctgcagtaGT GAC T G GA i(SEQ ID NO 158) i .Rd2x GTTCAGACGT

i P7.rcCBC008 i CAAGC AGAAGACG G CAT AC GAGAT agacttgcagGT GAC T G GA (SEQ ID NO: 159) i I .Rd2x GTTCAGACGT

P7. rcCBC009 I CAAG C AGAAGAC G G CAT AC GAGAT gcgaactaagGT GAC T G GA i(SEQ ID NO 160) i .Rd2x GTTCAGACGT

i P7. rcCBCOlO I CAAGC AGAAGACG G CAT AC GAGAT gctagaacagGT GAC T G GA i ( SEQ ID NO: 161) i I .Rd2x GTTCAGACGT

P7. rcCBCOll CAAG C AGAAGAC G G CAT AC GAGAT tacgctaacgGT GAC T G GA i ( SEQ ID NO 162) i .Rd2x I GTTCAGACGT

I P7. rcCBC012 ; CAAGC AGAAGACG G CAT AC GAGAT cagctatccgGT GAC T G GA (SEQ ID NO. 163) ; \ .Rd2x I GTTCAGACGT

P7. rcCBC013 CAAG C AGAAGAC G G CAT AC GAGAT gtgtcctacaGT GAC T G GA (SEQ ID NO 164) i .Rd2x I GTTCAGACGT

I P7.rcCBC014 i CAAGC AGAAGAC G G CAT AC GAGAT atgatgcatgGT GAC T G GA ;(SEQ ID NO. 165) ; \ .Rd2x I GTTCAGACGT

P7. rcCBC015 i CAAG C AGAAGAC G G CAT AC GAGAT gaatgcatcaGT GAC T G GA i ( SEQ ID NO 166) ; .Rd2x i GTTCAGACGT

i P7.rcCBC016 i CAAGC AGAAGAC G G CAT AC GAGAT gtctcgacaaGT GAC T G GA (SEQ ID NO: 167) ; \ .Rd2x i GTTCAGACGT

P7. rcCBC017 i CAAG C AGAAGAC G G CAT AC GAGAT agcacgctcaGT GAC T G GA i ( SEQ ID NO 168) ; .Rd2x i GTTCAGACGT

\ P7. rcCBC018 i CAAGC AGAAGAC G G CAT AC GAGAT ctatcgaacaGT GAC T G GA i ( SEQ ID NO: 169) ; i .Rd2x i GTTCAGACGT

P7. rcCBC019 i CAAG C AGAAGAC G G CAT AC GAGAT aagtcatcagGT GAC T G GA i ( SEQ ID NO 170) ; .Rd2x i GTTCAGACGT

i P7. rcCBC020 i CAAGC AGAAGAC G G CAT AC GAGAT ttcgtcgtgaGT GAC T G GA ( SEQ ID NO- 171) ; \ .Rd2x i GTTCAGACGT

P5. rcRBCOOl i AAT GAT AC G G C GAC C AC C GAGAT CTACACgcaacgtaagACAC \ ( SEQ ID NO 172) ; . Rdlx TCTTTCCCTACACGA

i P5.rcRBC002 i AAT GAT AC G G C GAC C AC C GAGAT C T AC AC gctgctcga a AC AC i(SEQ ID NO- 173) i I .Rdlx TCTTTCCCTACACGA

P5. rcRBC003 i AAT GAT AC G G C GAC C AC C GAGAT CTACACtagaacaacgACAC \ ( SEQ ID NO 174) ; .Rdlx TCTTTCCCTACACGA

i P5.rcRBC004 i AAT GAT AC G G C GAC C AC C GAGAT C T AC AC acgtcggat a AC AC (SEQ ID NO: 175) i I .Rdlx TCTTTCCCTACACGA

P5. rcRBC005 AAT GAT AC G G C GAC C AC C GAGAT CTACACtcagagaatgACAC i ( SEQ ID NO 176) i . Rdlx i TCTTTCCCTACACGA

I P5. rcRBC006 ; AATGA ACGGCGACCACCGAGATC ACACatagctcgaaACAC (SEQ ID NO 177) ; \ .Rdlx I TCTTTCCCTACACGA

P5. rcRBC007 AATGATACGGCGACCACCGAGATCTACACgacacggataACAC ( SEQ ID NO 178) i .Rdlx I TCTTTCCCTACACGA

I P5.rcRBC008 i AATGATACGGCGACCACCGAGATCTACACattgttgcgaACAC (SEQ ID NO 179) ; \ .Rdlx I TCTTTCCCTACACGA

P5. rcRBC009 i AATGATACGGCGACCACCGAGATCTACACcgatagaagaACAC ( SEQ ID NO 180) ; .Rdlx i TCTTTCCCTACACGA

i P5.rcRBC010 i AATGATACGGCGACCACCGAGATCTACACtaccgtatgaACAC ( SEQ ID NO 181) ; \ .Rdlx i TCTTTCCCTACACGA

P5. rcRBCOll i AATGATACGGCGACCACCGAGATCTACACatggagtctaACAC ( SEQ ID NO 182) ; .Rdlx i TCTTTCCCTACACGA

\ P5. rcRBC012 i AATGATACGGCGACCACCGAGATCTACACagctccgtagACAC ( SEQ ID NO 183) ; i .Rdlx i TCTTTCCCTACACGA

P5. rcRBC013 i AATGATACGGCGACCACCGAGATCTACACgaatctacagACAC ( SEQ ID NO 184) ; .Rdlx i TCTTTCCCTACACGA

i P5. rcRBC014 i AATGATACGGCGACCACCGAGATCTACACcaatacgctaACAC ( SEQ ID NO 185) ; \ .Rdlx i TCTTTCCCTACACGA

P5. rcRBC015 i AATGATACGGCGACCACCGAGATCTACACcaacatgagaACAC ( SEQ ID NO 186) ; .Rdlx TCTTTCCCTACACGA

i P5.rcRBC016 i AATGATACGGCGACCACCGAGATCTACACggcaacagaaACAC (SEQ ID NO 187) i I .Rdlx TCTTTCCCTACACGA

P5. rcRBC017 i AATGATACGGCGACCACCGAGATCTACACagagtgcataACAC (SEQ ID NO 188) ; .Rdlx TCTTTCCCTACACGA

i P5.rcRBC018 i AATGATACGGCGACCACCGAGATCTACACcactgtcggaACAC (SEQ ID NO 189) i I .Rdlx TCTTTCCCTACACGA

P5. rcRBC019 i AATGATACGGCGACCACCGAGATCTACACgatcgtaacaACAC (SEQ ID NO 190) i .Rdlx TCTTTCCCTACACGA

i P5. rcRBC020 I AATGATACGGCGACCACCGAGATCTACACacatgctccaACAC (SEQ ID NO 191) i I .Rdlx TCTTTCCCTACACGA

P5. rcRBC021 I AATGATACGGCGACCACCGAGATCTACACgcacattcgaACAC (SEQ ID NO 192) i .Rdlx TCTTTCCCTACACGA

I P5. rcRBC022 ; AATGATACGGCGACCACCGAGATCTACACggcgatactcACAC (SEQ ID NO 193) ; \ .Rdlx I TCTTTCCCTACACGA

P5. rcRBC023 AATGATACGGCGACCACCGAGATCTACACacgctacatcACAC (SEQ ID NO 194) i .Rdlx I TCTTTCCCTACACGA

I P5.rcRBC024 i AATGATACGGCGACCACCGAGATCTACACacaatgctcgACAC (SEQ ID NO 195) ; \ .Rdlx I TCTTTCCCTACACGA

P5. rcRBC025 i AATGATACGGCGACCACCGAGATCTACACtacacgacgcACAC (SEQ ID NO 196) ; .Rdlx I TCTTTCCCTACACGA

i P5.rcRBC026 i AATGATACGGCGACCACCGAGATCTACACagatgcactcACAC (SEQ ID NO 197) ; \ .Rdlx i TCTTTCCCTACACGA

P5. rcRBC027 i AATGATACGGCGACCACCGAGATCTACACtcatctaggaACAC (SEQ ID NO 198) ; .Rdlx i TCTTTCCCTACACGA

\ P5. rcRBC028 i AATGATACGGCGACCACCGAGATCTACACtggtatacagACAC (SEQ ID NO 199) ; i .Rdlx i TCTTTCCCTACACGA

P5. rcRBC029 i AATGATACGGCGACCACCGAGATCTACACgcactcaacgACAC (SEQ ID NO 200) ; .Rdlx i TCTTTCCCTACACGA

i P5. rcRBC030 i AATGATACGGCGACCACCGAGATCTACACactgtatgtgACAC (SEQ ID NO 201) ; \ .Rdlx i TCTTTCCCTACACGA

P5. rcRBC031 i AATGATACGGCGACCACCGAGATCTACACattcgtgcacACAC (SEQ ID NO 202) ; .Rdlx i TCTTTCCCTACACGA

i P5.rcRBC032 i AATGATACGGCGACCACCGAGATCTACACcataatgcgcACAC (SEQ ID NO 203) i I .Rdlx TCTTTCCCTACACGA

P5. rcRBC033 i AATGATACGGCGACCACCGAGATCTACACtagattgtcgACAC (SEQ ID NO 204) ; .Rdlx TCTTTCCCTACACGA

i P5.rcRBC034 i AATGATACGGCGACCACCGAGATCTACACtaagaagacgACAC (SEQ ID NO 205) i I .Rdlx TCTTTCCCTACACGA i P5. rcRBC035 i AAT GATACGG CGAC C AC C GAGAT CT AC AC acattgaga c AC AC ( SEQ ID NO: :206) ; \ .Rdlx TCTTTCCCTACACGA

P5.rcRBC036 i AAT GAT AC G G C GAC C AC C GAGAT C T AC AC agtcggact c AC AC i(SEQ ID NO: :207) i I .Rdlx TCTTTCCCTACACGA

P5.rcRBC037 i AAT GAT AC G G C GAC C AC C GAGAT C T AC AC gcaatcaat c AC AC i(SEQ ID NO: :208) i I .Rdlx TCTTTCCCTACACGA

P5.rcRBC038 i AAT GAT AC G G C GAC C AC C GAGAT C T AC AC tacgattga g AC AC (SEQ ID NO: :209) i I .Rdlx TCTTTCCCTACACGA

P5.rcRBC039 i AAT GAT AC G G C GAC C AC C GAGAT C T AC AC atatcgctg a AC AC (SEQ ID NO: :210) i I .Rdlx TCTTTCCCTACACGA

i P5. rcRBC040 I AAT GAT AC G G C GAC C AC C GAGAT C T AC AC accacgtga c AC AC i ( SEQ ID NO: :211) i I .Rdlx TCTTTCCCTACACGA

[0214] Below is an example of an alpha CDR3 sequence detected in a single stimulated CD4+ T cell analyzed using this protocol:

TGTGCTCCTACTAG C AAC AC AG G C AAAC T AAT C T T T (SEQ ID NO: 212)

C A P T S N T G K L I F (SEQ ID NO:213)

The top line is the DNA sequence determined by sequencing. The bottom line is the inferred amino acid sequence. The same CDR3 sequence was detected when single cells from the same population were analyzed by a method similar to that described in Nat Biotechnol.2014 Jul;32(7):684-92, "Linking T-cell receptor sequence to functional phenotype at the single-cell level," Han A, Glanville J, Hansmann L, Davis MM.

Example 4: Single-Cell T-Cell Receptor Plus Target Transcript Template

Production Using CI mRNA Seq HT Array IFC

[0215] The protocol is the same as in Example 3 except Target RT primers are added to the Lysis Mix and a template switch oligo (TSO) is added to the RT Mix. The modified recipes are shown below:

Lysis Mix (163.5 μΙ_)

15 μΙ_ 1 Μ Tris-HCI, ρΗ8.4

12 μΙ_ 1 Μ dithiothreitol

15 μΙ_ 10% ΝΡ-40 (0.5% in final rxn)

6.9 μΙ_ 10 μΜ each RT-No Loop primer

6.9 μΙ_ 10 μΜ each Target RT primer

3 μΙ_ 800 units/mL (20 mg/mL) proteinase K (8 units/mL in final rxn)

8.2 μΙ_ C1 Loading Reagent (Fluidigm 100-5170)

96.5 μΙ_ H₂0

RT Mix (26 μ|_)

10 iL 5x RT Buffer

5 [iL 20 μΜ TSO.Rd1.1

1 [it 25 mM each dNTPs 1 μΙ_ 18 mM AAPF Proteinase K inhibitor (360 μΜ in final rxn)

1 μΙ_ 200 units/pL Superscript II® Reverse Transcriptase

1.3 μΙ_ C1 Loading Reagent (Fluidigm 100-5170)

6.7 pl_ H₂0

Transfer 20 pL RT Mix to Inlet RT during the set up for the Final Reaction.

[0216] The sequence of the template switch oligo TSO.Rdl . l is /5'-

Biotin/rArCrArCrUrCrUrUrUrCrCrCrUrArCrArCrGrArCrGrCrUrCrUrUrCrCrGrArUrCr UrNrNrNrHrHrHrHrGrGrG/3 '-Amino Modifier/, r indicates a ribonucleotide, rN is rA, rC, rG, or rU, and rH is rA, rC, or rU (SEQ ID NO:214). The sequences for Target RT primers for 96 target transcripts are:

i AC B. RT1 i gtgactggagttcagacgtgtgctcttccgatctTCGTCGCC i(SEQ ID NO: :215) i

I CACATAGGAA

B2M.RT1 gtgactggagttcagacgtgtgctcttccgatctCAACTTCA i(SEQ ID NO: :216) ;

ATGTCGGATGGATGAA

I BCL6. RT1 i gtgactggagttcagacgtgtgctcttccgatctTCAACAGA (SEQ ID NO: :217) i

I GCTCAATTCTCGGAA

BTLA. RT1 gtgactggagttcagacgtgtgctcttccgatctCCAAGCAT i(SEQ ID NO: :218) i GGCAGGCAA

i CBLB. R 1 I gtgactggagttcagacgtgtgctcttccgatctGGAGTGGG i ( SEQ ID NO: :219) i

\ ATCGCTGAGAA

CCL3. RT1 gtgactggagttcagacgtgtgctcttccgatctCTCGTCTC i ( SEQ ID NO: :220) i

AAAGTAGTCAGCTATGAA

I CCL4. R 1 i gtgactggagttcagacgtgtgctcttccgatctGAGGCTGC (SEQ ID NO: :221) ;

I TGGTCTCATAGTAA

CCL5. RT1 gtgactggagttcagacgtgtgctcttccgatctCACTTGCC i ( SEQ ID NO: :222) i

ACTGGTGTAGAAA

I CCR1. R 1 i gtgactggagttcagacgtgtgctcttccgatctGACCACCA ;(SEQ ID NO: :223) ;

I GGATGTTTCCAA

CCR1C 1. RT1 gtgactggagttcagacgtgtgctcttccgatctAGAGTCAG (SEQ ID NO: :224) ;

GGCCAGCAA

i CCR2. RT1 i gtgactggagttcagacgtgtgctcttccgatctTTGGGTTG (SEQ ID NO: :225) ;

I AGGTCTCCAGAA

CCR3. RT1 gtgactggagttcagacgtgtgctcttccgatctTCTCCAAT i ( SEQ ID NO: :226) ;

ACAACTCAGCAGTGAA

\ CCR4. R 1 i gtgactggagttcagacgtgtgctcttccgatctCTTGATGC i ( SEQ ID NO: :227) ;

\ CTTCTTTGGTGCAA

CCR5. RT1 gtgactggagttcagacgtgtgctcttccgatctGCCAGATG i ( SEQ ID NO: :228) ;

AGCTGTGCAA

i CCR6. R l I gtgactggagttcagacgtgtgctcttccgatctGCAGCGGT ( SEQ ID NO: :229) ;

\ AGCAGGAAA

CCR7. RT1 gtgactggagttcagacgtgtgctcttccgatctACGTCCTT \ ( SEQ ID NO: :230) ;

CTTGGAGCACAA

i CCR8. RTl i gtgactggagttcagacgtgtgctcttccgatctGAGCAACT i(SEQ ID NO: :231) i

I TGCCATTTGTCTGAA

CCR9. RTl gtgactggagttcagacgtgtgctcttccgatctTGAAGTTA \ ( SEQ ID NO: :232) ;

ACGTAGTCTTCCATGGAA

i CD16C 1. RTl i gtgactggagttcagacgtgtgctcttccgatctCCTTGGCT (SEQ ID NO: :233) i

I TCAGTCTCCTGAA

CD27. RTl gtgactggagttcagacgtgtgctcttccgatctAGAACGCA i ( SEQ ID NO: :234) i CAGCCACCA

i CD274 . RTl i gtgactggagttcagacgtgtgctcttccgatctACAATTAG (SEQ ID NO: :235) i

I TGCAGCCAGGTCTAA CD276. RT1 gtgactggagttcagacgtgtgctcttccgatctCTGTGAGG i ( SEQ ID NO 236) : CGGCTGACA

i CD28. RT1 j gtgactggagttcagacgtgtgctcttccgatctTGAAGGGA (SEQ ID NO 237) ;

\ ATAAGTTGAGAGCCAA

CD3E. RT1 gtgactggagttcagacgtgtgctcttccgatctCCCGACTG i ( SEQ ID NO 238) ;

CATCTTTGTTTCAT CD4.RT1 j gtgactggagttcagacgtgtgctcttccgatctGGACCTGA !(SEQ ID NO 239) :

i GCCCACAGAA

CD40LG. RT1 gtgactggagttcagacgtgtgctcttccgatctCCAACCTT !(SEQ ID NO 240) :

CTATGAAGATACACAGCA

CD44. R 1 gtgactggagttcagacgtgtgctcttccgatctAGCGAAGG i ( SEQ ID NO 241) :

ACACACCCAA

CD5.RT1 gtgactggagttcagacgtgtgctcttccgatctCTGGCACT \ ( SEQ ID NO 242) :

TCGAGTTGGAA

i CD69. R 1 Ϊ gtgactggagttcagacgtgtgctcttccgatctTGGCCCAC (SEQ ID NO 243) ;

I TGATAAGGCAA

CD80. RT1 gtgactggagttcagacgtgtgctcttccgatctGGTGTTCC (SEQ ID NO 244) j

TGGGTCTCCAA

CD8A. R 1 ; gtgactggagttcagacgtgtgctcttccgatctGCGCTGCT (SEQ ID NO 245) !

i GACCTCATT

CSF2. RT1 gtgactggagttcagacgtgtgctcttccgatctCGGCTCCT (SEQ ID NO 246) :

GGAGGTCAA

i CTLA4 . RT1 gtgactggagttcagacgtgtgctcttccgatctAGGGCCAG (SEQ ID NO 247) :

i GTCCTGGTA

CXCL8 . RT1 gtgactggagttcagacgtgtgctcttccgatctGTTCTTTA (SEQ ID NO 248) ;

GCACTCCTTGGCAA

\ CXCR3 . RT1 : gtgactggagttcagacgtgtgctcttccgatctCGGTCGAA (SEQ ID NO 249) !

i GTTCAGGCTGAA

CXCR4 . RT1 gtgactggagttcagacgtgtgctcttccgatctGCCCACAA (SEQ ID NO 250) ;

TGCCAGTTAAGAA

CXCR5 . RT1 gtgactggagttcagacgtgtgctcttccgatctCCAATCTG (SEQ ID NO 251) :

TCCAGTTCCCAGAA

CXCR6 . RT1 gtgactggagttcagacgtgtgctcttccgatctAGGCTCTG (SEQ ID NO 252) :

CAACTTATGGTAGAA

i EBI3. R l i gtgactggagttcagacgtgtgctcttccgatctGAGCCTGT (SEQ ID NO 253) ;

\ ACGTGGCAA

EGR2. RT1 gtgactggagttcagacgtgtgctcttccgatctTCCTTTTG (SEQ ID NO 254) i

CCCTCCACACTTAA

^:; EGR3. RTl : gtgactggagttcagacgtgtgctcttccgatctGAAAAGCA (SEQ ID NO 255) :

i TGCGAGAGGGAAA

EOMES . RTl gtgactggagttcagacgtgtgctcttccgatctCGTCCTTT (SEQ ID NO 256) 1

CCGGAAGGAAA

FAS . RT1 gtgactggagttcagacgtgtgctcttccgatctAGCTTCCC (SEQ ID NO: 257) :

CAACTCCGTA

FOXP3 . RTl gtgactggagttcagacgtgtgctcttccgatctCTGAGGCT (SEQ ID NO 258) ;

TTGGGTGCA

i GATA3 . RTl i gtgactggagttcagacgtgtgctcttccgatctCTGCAATT (SEQ ID NO: 259) i

; CTGCGAGCCA

GGT1. RTl gtgactggagttcagacgtgtgctcttccgatctTGTTACCT (SEQ ID NO 260) j

CCCTCTGCCTCTA

GZMA. RTl ; gtgactggagttcagacgtgtgctcttccgatctAGTTACAG (SEQ ID NO. 261) ;

TGAGCTGCAGTCAA

GZMB. RTl gtgactggagttcagacgtgtgctcttccgatctGCAGCTGT (SEQ ID NO: 262) :

CAGCACGAA

: HAVCR2. RT1 i gtgactggagttcagacgtgtgctcttccgatctGGACACCT (SEQ ID NO: 263) :

\ CTGTTAGGCACA

HLA-DRA. RT1 gtgactggagttcagacgtgtgctcttccgatctCCTGATTG (SEQ ID NO: 264) ;

GTCAGGATTCAGATAGAA

: ICOS. RTl ; gtgactggagttcagacgtgtgctcttccgatctTCCTTTTG (SEQ ID NO: 265) !

! TCTTAGTGAGATCGCA

i RORC.RTl I gtgactggagttcagacgtgtgctcttccgatctGCACCCCT ( SEQ ID NO::296) ; I CACAGGTGA AA

i RPS3.RT1 i gtgactggagttcagacgtgtgctcttccgatctACATTCTG i(SEQ ID NO: :297) i

I TGTTCTGGTGGCTAA

i RUNX1.RT1 i gtgactggagttcagacgtgtgctcttccgatctGCATCGTG i(SEQ ID NO: :298) i

I GACGTCTCTAGAA

i RUNX3.RT1 i gtgactggagttcagacgtgtgctcttccgatctGAGAAGCG (SEQ ID NO: :299) i

I GGAAAGCAGAA

i SELL . RT1 i gtgactggagttcagacgtgtgctcttccgatctGTGCTCTG (SEQ ID NO: :300) i

I ACATTTCCATGGAAA

i SNCA.RT1 I gtgactggagttcagacgtgtgctcttccgatctCTCCCAGT i(SEQ ID NO: :301) i

\ TCTCCGCTCA

i TBX21.RT1 I gtgactggagttcagacgtgtgctcttccgatctGTCAGCAT i(SEQ ID NO: :302) i

\ GTCTCCGCAA

I TGFB1.RT1 i gtgactggagttcagacgtgtgctcttccgatctCGGCAACG (SEQ ID NO: :303) ;

I GAAAAGTCTCAA

I TIGIT.RT1 i gtgactggagttcagacgtgtgctcttccgatctGTGGAGGA (SEQ ID NO: :304) ;

I GAGGTGACATTGTAA

i TNF.RT1 i gtgactggagttcagacgtgtgctcttccgatctCTCCACGT ;(SEQ ID NO: :305) ;

I CCCGGA CAT

I TNFRSF18. RT1 i gtgactggagttcagacgtgtgctcttccgatctGTCCGTTC ;(SEQ ID NO: :306) ;

I CCGTCCCAA

I TNFRSF9. RT1 i gtgactggagttcagacgtgtgctcttccgatctTGGCACAG (SEQ ID NO: :307) ;

I GTATGATACTAGCAAA

i UBB. RT1 i gtgactggagttcagacgtgtgctcttccgatctGTCACTTA (SEQ ID NO: :308) ;

I TCACCCCTCACGTA

\ VEGFA. RT1 i gtgactggagttcagacgtgtgctcttccgatctGGAAGTAG i ( SEQ ID NO: :309) ;

\ AGCAATCTCCCCAA

\ VTCN1. RT1 i gtgactggagttcagacgtgtgctcttccgatctCCAGCTGA i ( SEQ ID NO: :310) ;

\ GGCGACAGTA

Example 5: Single-Cell T-Cell Receptor Template Production Using Hairpin Primers

[0217] This Example describes the adaptation of an illustrative method of single- cell T-cell receptor template production designed for tubes to Fluidigm' s C™ High- Throughput IFC ("HT-IFC").

[0218] The method, as designed for tubes, is shown schematically in Figure 1 1 A-

1 IB. During the reverse transcriptase (RT) step (Figure 1 1 A), the primers Rd2.TRAC and Rd2.TRBC hybridize to the TCR a and β mRNAs, respectively, and are extended to generate TCR-specific cDNA. The primer Rd2.N7.T20VN (Rd2-UMI-oligo-dT, in

Figure 1 1 A) generates cDNA from all the polyA+ mRNA in the sample. The primer

P7.rcCBC001.Rd2x (P7-CBCl-20-Rd2, in Figure 1 1A) is present in the reaction, but it is not used until the PCR step. During the initial cycle of PCR (Figure 1 IB), the primers Rdl .TRAV.Ll (42 different primers; Rdl-TRAV, in Figure 1 1B), Rdl .TRBV.Ll (31 different primers; Rdl-TRBV, in Figure 1 IB), and Rdl .Target.Ll (96 different primers; Rdl -Target, in Figure 1 1) generate primer extension products on TCR-a cDNA, TCR-β cDNA and 96 target cDNAs, respectively. In order to enhance specificity, these primers are designed as Super Selective primers (Marras S, Vargas-Gold D, Tyagi S, Kramer FR, PCT/US2014/015351; Vargas DY, Kramer FR, Tyagi S, Marras SA, "Multiplex Real- Time PCR Assays that Measure the Abundance of Extremely Rare Mutations Associated with Cancer," PLoS One 11 :e0156546, 2016). At this point, the DNA strands have either an Rdl or an Rd2 5' tail and can be amplified with the outer primers P5.rcRBC001.Rdlx (P5-RBC11-40-Rdl, in Figure 1 IB) and P7.rcCBC001.Rd2x (P7-CBC1 l-20-Rd2, in Figure 1 IB). This procedure produces libraries predominantly 350-450 bp in size. When sequenced, these libraries generate the desired data, namely TCR sequences and sequences from the 96 target transcripts.

[0219] When this procedure is used with HT IFCs, though, the libraries include a significant percentage of non-specific fragments generally smaller than 350 bp. This seems to be due to sub-optimal mixing of the PCR reagents with the RT reaction. In the PCR mix, the primer P5.rcRBC001.Rdlx (or one of its barcode counterparts) is present at high concentration with the DNA polymerase and all the other components required for PCR. When the PCR mix is loaded into the IFC, mixing with the RT reaction occurs largely by diffusion. Because the primers in the RT reaction are smaller than the cDNA, these primers diffuse faster and are the first to encounter P5.rcRBC001.Rdlx and the other primers in the PCR mix. Thus, spurious products involving the various primers have a chance to form before any specific products involving the cDNA are made. This problem can be partially overcome by moving the primer P5.rcRBC001.Rdlx to the RT reaction. When this is done, libraries generated in HT IFCs do have the desired products in the 350- 450 bp size range, but there still are a lot of non-specific products 250 bp and smaller.

[0220] The residual non-specific products being generated seem to be due to rehydration steps used during PCR. In the standard mRNA seq protocol developed for the HT IFC, rehydration is performed every 2nd cycle of PCR. In the TCR protocol, by changing the rehydration from every 2nd cycle to every 4th cycle, the ratio of specific products (350 to 450 bp) to non-specific products (smaller than 250 bp) improves.

Increasing to rehydration every 8th cycle improves the ratio more, but the effect is marginal. One explanation is that, when the reaction is lowered to 25°C, the primers have the opportunity to form non-specific products. What was needed is a way to turn the primers "off at lower temperatures. This was done by using hairpin primers that each consist of a blunt-ended, self-complementary stem (a "stem-duplex") connected by a loop (Lao KQ, Straus NA, Livak KJ, "Hot start reverse transcription by primer design," U. S. Patent 8,993,240). The use of hairpin primers addresses another issue. Having too many primers in the RT reaction can also lead to formation of non-specific products. The hairpins used here are designed to be closed at the 42°C of the RT reaction, but open at the 62°C annealing temperature used in the PCR. Thus, these primers are not functional during the RT reaction or during the rehydration steps in PCR when the temperature of the HT-IFC is lowered to 25°C.

[0221] The following primers were replaced with hairpin versions (denoted .HP):

P5.rcRBC001-040.Rdlx (40 primers), P7.rcCBC001-020.Rd2x (20 primers),

Rdl .TRAV.Ll (42 primers), Rdl .TRBV.Ll (31 primers), and Rdl .Target.Ll (96 primers). Using the .HP versions of these primers, specific TCR libraries in the size range 350-450 bp are generated in the HT-IFC from single cells captured in the HT IFC with only minimal amounts of non-specific fragments. This strategy is shown schematically in Figure 12A-12B. A TCR library was prepared from a mixed pool of stimulated CD4+ or CD8+ T cells. Approximately 400 DSP-fixed single cells were captured and processed on a HT IFC according to the protocol given below. As shown in Figure 13, this method produces libraries predominantly 350-450 bp in size.

Protocol:

Purchases

Thermo Fisher Scientific

Superscript^® II Reverse Transcriptase

18064-014 10% NP-40

PI-28324 dNTP Mix, 25mM each

FERR1 121

Yeast tRNA (10 mg/mL)

AM71 19 New England BioLabs

Proteinase K

P8107S NEBNext^® Q5^® Hot Start HiFi PCR Master Mix

M0543S

EMD Millipore

AAPF Proteinase K inhibitor

539470

Teknova

1 M Tris-HCI, pH8.4

T1084

1 M DTT (Dithiothreitol)

D9750

DNA Suspension Buffer

T0221

Beckman Coulter

AMPure XP Beads

A63880

Stocks

5* RT Buffer

75 mM Tris-HCI, pH 8.4

375 mM KCI

50 mM MgCI₂

25% glycerol (50%, Teknova G1796, G1797, G1798, G1799) 18 mM AAPF Proteinase K inhibitor

Dissolve 10 mg AAPF Proteinase K inhibitor in 1 mL DMSO Store at -20°C C1 Mixes

20x Column Mixes

Mix:

187.5 μΙ_ Cell Rinsing Reagent (Fluidigm)

12.5 μΙ_ C1™ Loading Reagent (Fluidigm 100-5170)

Dispense 8 μL· to each of 20 wells

Add 2 μΐ 50 μΜ P7.rcCBC001 -020.Rd2x.HP (1 μΜ in PCR), with a different barcode added to each well

Before Cell Loading and Staining, transfer 9 of each to one of the 20 Inlets labeled as "Column inlets." The barcode mix will happen after cell staining.

2x Lysis Mix (245.4 μΐ)

45 μΐ 1 Μ Tris-HCI, ρΗ8.4

36 μΐ 1 Μ DTT

45 μΐ 10% ΝΡ-40 (0.5% in final rxn)

20.7 μΐ 10 μΜ each AC.A1/BC.A1 primers (1 10 nM in RT rxn)

18.9 μΐ 10 μΜ Rd2.N7.T20VN (100 nM in RT rxn)

9 μΐ 800 units/mL (20 mg/mL) proteinase K (8 units/mL in final rxn)

33.8 μΐ C1™ Loading Reagent (Fluidigm 100-5170)

37 μΐ H₂0

40 x Row Mixes

Dispense 5 μL· 2χ Lysis Mix to each of 40 wells

Add 5 μΐ 17 μΜ P5.rcRBC001 -040.Rd1x.HP (1 μΜ in PCR), with a different barcode added to each well

Transfer 9 μL· of each to Inlets labeled as "Row Barcode"

RT Mix (26 μΐ)

10 μΐ 5^χ RT Buffer

1 μΐ 25 mM each dNTPs

1 μΐ 18 mM AAPF Proteinase K inhibitor (360 μΜ in final rxn)

1 μΐ 200 units^L Superscript II Reverse Transcriptase

1 .3 μΐ C1™ Loading Reagent (Fluidigm 100-5170)

1 1 .7 μΐ H₂0

Transfer 20 μΐ RT Mix to Inlet RT PCR Mix (150 μ|_)

54.6 μΐ 5x PreAmp Master Mix (Fluidigm 100-5581)

2.7 μΙ_ 5 μΜ each TRAV.L1.HP/TRBV.L1 .HP primers (50 nM each primer in final rxn)

2.7 μΙ_ 5 μΜ each Target. L1 .HP primers (50 nM each primer in final rxn)

7.5 μΙ_ C1™ Loading Reagent (Fluidigm 100-5170)

82.5 [it H₂0

Transfer 130 μί PCR Mix to Inlet L2 (big reservoir).

Harvest Mix (200 μί)

194 μΙ_ C1 Harvest Reagent (Fluidigm 100-7081)

6 μΙ_ 10 mg/mL yeast tRNA

180 μΐ to the Harvest Inlet

C1 Steps

1. Prime:

a. Pipette mixes following the scTCR small_v1 xx v5 prime loading map. For Valve Fluid, add Tween 20 to a final concentration of 0.05%, or use Valve Fluid v2.

b. Pipette 9 μΙ_ each column mix to 20 column inlets

c. Run prime script: "scTCR small: 26min Prime with Column Barcode (1911x)"

2. Cell loading:

a. Replace liquid following the scTCR small_v1 xx v5.

b. Keep the column mixes in the 20 column inlets

c. Dispense 20 μΙ_ cell suspension to each of the two cell inlets d. Dispense 20 μΙ_ cell rinse reagent to each of the two cell stain inlets e. Run script "scTCR small: 36min, Cell Load with column barcode (1911x)"

3. Final reaction:

a. Replace cell related liquid following the scTCR small_v1xx v5.

b. Pipette lysis mixes into 40 row inlets, ~9 each

c. Pipette RT mix into RT inlet, -20 μΐ

d. Pipette PCR mix into L2 zone (up right corner), ~130 μί e. Check the entire reagent, such as harvest reagent, before inserting IFC into machine.

f. Run script "scTCR small:7hr50min Chemistry v5D (1911x)". Set time delay as usual.

Final Reaction consists of:

Lysis Mix to chambers 0+1

50°C, 30 min (1800 sec)

72°C, 1 min (60 sec)

10°C, 1 min (60 sec) 2. RT Mix to chambers 0+1 +2

42°C, 90 min (5400 sec)

85°C, 5 min (300 sec)

3. PCR Mix to chambers 0+1 +2+3

95°C, 2 min (120 sec)

22 cycles: 95°C, 5 sec

62°C, 5 min (360 sec)

4. Harvest

Post-C1 Steps

Transfer the 20 Harvest samples to a 96-well plate.

Combine 2.5 μΙ_ of each sample.

Clean up with 40 μΙ_ AMPure XP Beads. Elute with 50 μΙ_ DNA Suspension Buffer. Clean up with 40 μΙ_ AMPure XP Beads. Elute with 20 μΙ_ DNA Suspension Buffer.

Add:

2.5 μΙ_ 10 μΜ P5 Primer

2.5 μΙ_ 10 μΜ P7 Primer

25 μΙ_ Q5 Hot Start HiFi PCR Master Mix

98°C, 30 sec

21 cycles: 98°C, 10 sec

62°C, 2 min

75°C, 2 min

4°C, hold Clean up with 35 μΙ_ AMPure XP Beads. Elute with 30 μΙ_ DNA Suspension Buffer. Clean up with 21 μΙ_ AMPure XP Beads. Elute with 20 μΙ_ DNA Suspension

Buffer

Check 1 μΙ_ on Bioanalyzer. Primers:

atttcaacattCAAGAGTC

i TRAV9.L11.HP GAACTGAGAGacactctttccctacacgacgctct (SEQ ID NO:325) tccgatctACCGTAAAGAAACCACTTC TTCCACT

TGctctttttctCTCAGTTC

i TRAV9. L12. HP GAACTGAGAAacactctttccctacacgacgctct (SEQ ID NO:326) tccgatctCCACATACCGTAAGGAAACCACTTCTT

TCCAacctttCTCAGTTC

i TRAV10.L1.HP GCTTTGCAAATacactctttccctacacgacgctc ( SEQ ID NO:327) ttccgatctCGAACGGAAGATA ACAGCAACTCTG

GATcatatttGCAAAGC

TRAV12. Lll . HP GCTGGCTGGGacactctttccctacacgacgctct (SEQ ID NO:328) tccgatctTGGTAATGAAGATGGAAGGTTTACAGC

ACAaacattcccAGCCAGC

i TRAV12. L12. HP GCTGGCTAAGacactctttccctacacgacgctct (SEQ ID NO:329) tccgatctRACAAAGAAGATGGAAGGTTTACAGCA

CAaacattcttAGCCAGC

i TRAV12. L13. HP TGCTGGATAAGacactctttccctacacgacgctc (SEQ ID NO:330) ttccgatctRACAAAGAAGATGGAAGGTTTACAGC

ACAaacattcttATCCAGCA

i TRAV13. Lll . HP CTGTCTTGGAAGTacactctttccctacacgacgc (SEQ ID NO:331) tcttccgatctGGCGAAAAGAAAGACCAACGAATT

GCTtcacttcCAAGACAG

i TRAV13. L12. HP CACTGTCGGGacactctttccctacacgacgctct (SEQ ID NO:332) tccgatctGCAAGGCCAAAGAGTCACCGTTTTAac

cttttcccGACAGTG

; TRAV14.L1.HP TCTTGCCAAacactctttccctacacgacgctctt (SEQ ID NO:333) ccgatctTGCAACAGAAGGTCGCTACTCATTGAAa

acaatcttGGCAAGA

I TRAV16.L1.HP GTGGAAAGGAacactctttccctacacgacgctct (SEQ ID NO:334) tccgatctCATCAAAGGCTTCACTGCTGACCTTAA

atatcCTTTCCAC

i TRAV17. LI . HP GAACTGCAGGAacactctttccctacacgacgctc (SEQ ID NO:335) ttccgatctTGGAAGATTAAGAGTCACGCTTGACA

CTcaattcctGCAGTTC

i TRAV18. LI . HP TGGAAGGTTGacactctttccctacacgacgctct (SEQ ID NO:336) tccgatctAGAGGTTTTCAGGCCAGTCCTATCAAc

ttccaaCCTTCCA

i TRAV19. LI . HP GTTGAAGGTGGacactctttccctacacgacgctc (SEQ ID NO:337) ttccgatctTGGTCGGTATTCTTGGAACTTCCAGA

AtcatccaCCTTCAAC i TRAV20. LI . HP CTTCAGGAGAacactctttccctacacgacgctct (SEQ ID NO:338) tccgatctAAGAAGGAAAGCTTTCTGCACATCACA

caaaactctCCTGAAG

i TRAV21.L1.HP GTACTACGAGAacactctttccctacacgacgctc (SEQ ID NO:339) ttccgatctTGGAAGACTTAATGCCTCGCTGGATA

AtcatttctCGTAGTAC

; TRAV22. LI . HP GAAATGTACGTAGacactctttccctacacgacgc (SEQ ID NO:340) tcttccgatctGCCACGACTGTCGCTACGGAAact

acGTACATTTC

; TRAV23.L1.HP TCCATGATTAacactctttccctacacgacgctct (SEQ ID NO:341) tccgatctACAATC CCT CAATAAAAGTGCCAAG

CAccataATCATGGA

i TRAV24. LI . HP GGATCCTAATacactctttccctacacgacgctct (SEQ ID NO:342) tccgatctCACTCTTAATACCAAGGAGGGTTACAG

CTAacaattAGGATCC

i TRAV25.L1.HP CATCTGTATAGacactctttccctacacgacgctc (SEQ ID NO:343) ttccgatctTCCCTGCACATCACAGCCACaatcta

aatctaTACAGATG

i TRAV26. LI1. HP TCAGCGTGTGacactctttccctacacgacgctct (SEQ ID NO:344) tccgatctACAGAAAGTCCAGCACCTTGATCCTca

atacacACGCTGA

i TRAV26. L12. HP TCAGCGAAacactctttccctacacgacgctcttc (SEQ ID NO:345) cgatctAGCAATGTGAACAACAGAATGGCCTacaa

tcattTCGCTGA

I TRAV27. LI . HP CTTCAGCAAGAAacactctttccctacacgacgct (SEQ ID NO:346) cttccgatctGGTGACAGTAGTTACGGGTGGAGAA

cacttttcttGCTGAAG

i TRAV29. LI . HP GGCACAAATAacactctttccctacacgacgctct (SEQ ID NO:347) tccgatctTCTTAAACAAAAGTGCCAAGCACCTCT

acacaatatTTGTGCC

i TRAV30. LI . HP GTTCTCCTTTacactctttccctacacgacgctct (SEQ ID NO:348) tccgatctTGAAGCACCCGTCTTCCTGATGATAcc

taaaGGAGAAC

i TRAV34. LI . HP TGGGCTGTTGacactctttccctacacgacgctct (SEQ ID NO:349) tccgatctAGAAAAAGCAGCAAAGTTCCCTGCATA

catacaaCAGCCCA

i TRAV35. LI . HP CTTTCTGGAAacactctttccctacacgacgctct (SEQ ID NO:350) tccgatctACCTCAAATGGAAGACTGACTGCTCAt

caccttCCAGAAAG

; TRAV36.L1.HP GAGTCTCGTTacactctttccctacacgacgctct (SEQ ID NO:351) tccgatctAGCATCCTGAACATCACAGCCACaa11

tataacGAGACTC

i TRAV38. LI . HP CGATTCTCGTAacactctttccctacacgacgctc (SEQ ID NO:352) ttccgatctCGTTATTCGCCAAGAAGCTTATAAGC

AACActtattacGAGAATCG

i TRAV39.L1.HP GGTATCAAGATGacactctttccctacacgacgct (SEQ ID NO:353) cttccgatctGCAGTGAAGCAGGAGGGACGATTAA

actaaaacatCTTGATACC

; TRAV40. LI . HP GAGTTTTTGTCacactctttccctacacgacgctc (SEQ ID NO:354) ttccgatctAGCAAAAACTTCGGAGGCGGAAtctc

acctttGACAAAAACTC

I TRAV41.L1.HP TCTGGGAATTacactctttccctacacgacgctct (SEQ ID NO:355) tccgatctCACAGCTCCCTGCACATCACAcaacaa

aaatTCCCAGA

i TRBV02. LI . HP GCTGAGTAAAGacactctttccctacacgacgctc (SEQ ID NO:356) ttccgatctTCACTCTGAAGATCCGGTCCACAAtc

aatctttACTCAGC

i TRBVO3. LI . HP GATGTGAAGTacactctttccctacacgacgctct (SEQ ID NO:357) tccgatctCCAAATCGMTTCTCACCTAAATCTCCA

GACAAtc11aCTTCACATC

i TRBVO4. LI . HP GAGAGCATAAacactctttccctacacgacgctct (SEQ ID NO:358) tccgatctGCCAAGTCGCTTCTCACCTGAAacaat

tatGCTCTC

i TRBV05. Lll . HP CCTGAGATAGTTacactctttccctacacgacgct (SEQ ID NO:359) cttccgatctTCAGTGAGACACAGAGAAACAAAGG

AAACTcaactaTCTCAGG

I TRBVO5. L12. HP CGAGTCTAGAAAGGacactctttccctacacgacg (SEQ ID NO:360) ctcttccgatctCTCTGAGCTGAATGTGAACGCCT

TcctttctaGACTCG

i TRBVO6. Lll. HP GGAGACAAATacactctttccctacacgacgctct (SEQ ID NO:361) tccgatctGTACCACTGRCAAAGGAGAAGTCCCat

tattTGTCTCC

i TRBVO6. L12. HP TCAGCCCacactctttccctacacgacgctcttcc (SEQ ID NO:362) gatetACATGTMCTGGTATCGACAAGACCCAtcat

actcatacGGGCTGA

i TRBVO7. LI1. HP TGTCGGTTAGacactctttccctacacgacgctct (SEQ ID NO:363) tccgatctTGATCCAATTTCAGGTCATACTGCCCT

catacacctaACCGACA

i TRBVO7. L12. HP CATGACCATGTGacactctttccctacacgacgct (SEQ ID NO:364) cttccgatctGACRGGATGTAGCTCTCAGGTGTGA TaattcacatGGTCATG

i TRBV07. L13. HP GATCCCTAGTTAGacactctttccctacacgacgc (SEQ ID NO:365) tcttccgatctGCYCAGTGATCGGTTCTCTGCAct

ctaactAGGGATC

i TRBV07. L14. HP GATCCCTAGTTAGacactctttccctacacgacgc (SEQ ID NO:366) tcttccgatctTGCTCAGTGATCGGATCTCTGCAc

tctaactAGGGATC

i TRBVO9. LI . HP GCAAGTCGacactctttccctacacgacgctcttc ( SEQ ID NO:367) cgatctACATTCTTGAACGATTC CCGCACAAatc

acaaacGACTTGC

TRBV10.L11.HP AACACCATGAacactctttccctacacgacgctct (SEQ ID NO:368) tccgatctCATGGGCTGAGGCTGATCCAcatacac

atcATGGTGTT

i TRBV10. L12. HP AGCAGCTTacactctttccctacacgacgctcttc (SEQ ID NO:369) cgatctGGACATGGGCTGAGGCTGATCTAactaca

tcaAGCTGCT

i TRBV10. L13. HP AACACCATGAacactctttccctacacgacgctct (SEQ ID NO:370) tccgatctGCATGGGCTGAGGCTAATCCAcataca

catcATGGTGTT

i TRBV11.L1.HP CTACTCCAAAacactctttccctacacgacgctct (SEQ ID NO:371) tccgatctCAGTTGCCTAAGGATCGATTTTCTGCA

tcttttGGAGTAG

i TRBV12. Lll . HP GAATCGAAGGAGTacactctttccctacacgacgc (SEQ ID NO:372) tcttccgatctACAACGTTCCGATAGATGATTCAG

GGATcaaactcctTCGATTC

; TRBV12. L12. HP AGTGGCTTGacactctttccctacacgacgctctt (SEQ ID NO:373) ccgatctTCGATTCTCAGCAGAGATGCCTGATcat

tataacaAGCCACT

I TRBV13.L1.HP GAGCTGATTGacactctttccctacacgacgctct (SEQ ID NO:374) tccgatctGCAGAGCGATAAAGGAAGCATCCCTtt

cactcaaTCAGCTC

i TRBV14. LI . HP CTCCAGTAGAacactctttccctacacgacgctct (SEQ ID NO:375) tccgatctTCCGGTATGCCCAACAATCGATTCTTA

caactttctACTGGAG

i TRBV15. LI . HP GTGTTCGTGacactctttccctacacgacgctctt (SEQ ID NO:376) ccgatctAGCAGACACCCCTGATAACTTCCAAaaa

ttccaCGAACAC

i TRBV16. LI . HP CACTTAGCATacactctttccctacacgacgctct (SEQ ID NO:377) tccgatctCTTTGATGAAACAGGTATGCCCAAGGA

AtctacacatGCTAAGTG i TRBV18. LI . HP GGGAAATTCTTGacactctttccctacacgacgct (SEQ ID NO:378) cttccgatctGAGTCAGGAA.TGCCAAAGGAA.CGAT

caaacaacaaGAATTTCCC

i TRBV19. LI . HP GATTCCTTCAAGacactctttccctacacgacgct (SEQ ID NO:379) cttccgatctGAGATATAGCTGAAGGGTACAGCGT

CTacactcttGAAGGAATC

; TRBV20. LI . HP CATGGTTGGAacactctttccctacacgacgctct (SEQ ID NO:380) tccgatctGCAAGGCGTCGAGAAGGACAAtacaac

atcCAACCATG

; TRBV24. LI . HP GAGAATTTAGacactctttccctacacgacgctct (SEQ ID NO:381) tccgatctTCTCTGATGGATACAGTGTCTCTCGAC

ActatatctaCTAAATTCTC

i TRBV25. LI . HP GAAAATGCAAacactctttccctacacgacgctct (SEQ ID NO:382) tccgatctCTTTCCTCTGAGTCAACAGTCTCCAGA

ActtcacttGCATTTTC

i TRBV27. LI . HP GGAAATTCAAGacactctttccctacacgacgctc (SEQ ID NO:383) ttccgatctTTCCTGAAGGGTACAAAGTCTCTCGA

AttcttcttGAATTTCC

i TRBV28. LI . HP TCCTTTTTCAacactctttccctacacgacgctct (SEQ ID NO:384) tccgatctGGGCTACGGCTGATCTATTTCTCATAT

GATtcatttGAAAAAGGA

i TRBV29. LI . HP TGATGGGTTGacactctttccctacacgacgctct (SEQ ID NO:385) tccgatctAGGCCACATATGAGAGTGGATTTGTCA

TcttatcaaCCCATCA

I TRBV30. LI . HP GTCCTGGTTAacactctttccctacacgacgctct (SEQ ID NO:386) tccgatctTGAGGTGCCCCAGAATCTCTCAtaact

aaCCAGGAC

i ACTB . LI . HP TTGGGAGTTTGGacactctttccctacacgacgct (SEQ ID NO:387) cttccgatctCCCCACTTCTCTCTAAGGAGAATGG

aaatccaaaCTCCCAA

i BCL6.L1.HP TTCCGTGTGTacactctttccctacacgacgctct (SEQ ID NO:388) tccgatctCGTACAACGTGTCCTCACGGTtaacca

acaCACGGAA

i BTLA.L1.HP TTTGAGAGTGacactctttccctacacgacgctct (SEQ ID NO:389) tccgatctCAGATGAACACACGAAATTGAACAGTT

CCacatcttcaCTCTCAAA

i CBLB.L1.HP TTCCCAGTGGacactctttccctacacgacgctct (SEQ ID NO:390) tccgatctTCAGGTACTACCTGTTGACCTGGAtaa

ccaCTGGGAA

; CCL22.L1.HP TTCTCACTTATacactctttccctacacgacgctc (SEQ ID NO:391) ttccgatctACTCCCTGAACCCAGCCTGtatttca

taAGTGAGAA

i CCL3.L1.HP TTAAGAAGATAAGAGacactctttccctacacgac (SEQ ID NO:392) gctcttccgatctGGTTGTTGCCAAACAGCCACAa

acctcttaTCTTCTTAA

i CCL4.L1.HP TGCGGAGacactctttccctacacgacgctcttcc (SEQ ID NO:393) gatctCCATGAGACACATCTCCTCCATACTCtctt

acaaaCTCCGCA

; CCL5.L1.HP TTGACAAAGTGacactctttccctacacgacgctc (SEQ ID NO:394) ttccgatctCCCAACTAAAGCCTAGAAGAGCTTCT

ttctacacaCTTTGTCAA

I CCR1.L1.HP TTTGGGTATAacactctttccctacacgacgctct (SEQ ID NO:395) tccgatctAGAAAAACCCCAAACACTGACATTACC

TAataccaataTACCCAAA

i CCR10.L1.HP TTGTCCCTTTacactctttccctacacgacgctct (SEQ ID NO:396) tccgatctCTCCCACGGAGACCCACAtcaaacaaa

GGGACAA

i CCR2.L1.HP TTGGGAGAGAacactctttccctacacgacgctct (SEQ ID NO:397) tccgatctGGCTTCTAGAACCAGGCAACTTGtctt

actctCTCCCAA

i CCR3.L1.HP GAAGCAAATGacactctttccctacacgacgctct (SEQ ID NO:398) tccgatctGCAGCGTACTCATCATCAACCCTtttt

caTTTGCTTC

i CCR4.L1.HP TTGCCAGTacactctttccctacacgacgctcttc (SEQ ID NO:399) cgatctGGCAGCTTTTTCTCTCCCACTAGACttcc

aatcaCTGGCAA

I CCR5.L1.HP TTGCTTCCATacactctttccctacacgacgctct (SEQ ID NO:400) tccgatctCTGAATATGAACGGTGAGCATTGTGGa

accatcatGGAAGCAA

i CCR7.L1.HP TTTCCCTTTAacactctttccctacacgacgctct (SEQ ID NO:401) tccgatctCTCTTGGCTCCACTGGGATGGttcttt

aAAGGGAAA

i CCR8.L1.HP TTTCACACAGTGacactctttccctacacgacgct (SEQ ID NO:402) cttccgatctGGCATGCTAGTAGCAGTGAGCAttc

tccactGTGTGAAA

i CCR9.L1.HP TAGAGTAAGTTGGacactctttccctacacgacgc (SEQ ID NO:403) tcttccgatctGCTGGAAGGCTATTTACTTCCATG

CTaacaaccaaCTTACTCTA

i CD160.L1.HP TTGTAGTCGTGTTacactctttccctacacgacgc (SEQ ID NO:404) tcttccgatctGGAAGAGGAAGATTTGTGCAGACC AAGtcataacacGACTACAA

i CD244.L1.HP TGGCCTGTGTacactctttccctacacgacgctct (SEQ ID NO:405) tccgatctTGGAATC AGACAC ATGCTGGG TCC

aacttaacaCAGGCCA

i CD27.L1.HP TGACAAGATTacactctttccctacacgacgctct (SEQ ID NO:406) tccgatctCTGGCAGCCACAACTGCAtcaaatcaa

TCTTGTCA

i CD274.L1.HP TTACAAATCAactctttccctacacgacgctcttc ( SEQ ID NO:407) cgatctAGTCTATTCCTAAGTCCTAACTCCTCCTT

GatcacacaatGATTTGTAA

CD276.L1.HP TTCCTGGAGACTacactctttccctacacgacgct (SEQ ID NO:408) cttccgatctGCAGAAAAGGCAGAGCCTGGtagtc

tCCAGGAA

i CD28.L1.HP TTGAGCAATTacactctttccctacacgacgctct (SEQ ID NO:409) tccgatctTGGCTTGCCTCGTCACCCacattccaa

TTGCTCAA

i CD3E.L1.HP TTACCCAGAAacactctttccctacacgacgctct (SEQ ID NO:410) tccgatctCCCCTTTTGCAGCCCTCTCTtccttac

ttCTGGGTAA

i CD4.L1.HP TTGTAACCAAacactctttccctacacgacgctct (SEQ ID NO:411) tccgatctCCTCCAGACCATTCAGGACACAGtctt

tcattGGTTACAA

i CD40LG. LI . HP TAGGGACATTacactctttccctacacgacgctct (SEQ ID NO:412) tccgatctAGGCCGTTGCTAGTCAGTTCTCaagaa

aTGTCCCTA

; CD44.L1.HP TACACAAAGactctttccctacacgacgctcttcc (SEQ ID NO:413) gatetAGAATAAGAA AACATGG CCA TCACCT

TATGTaccctaCTTTGTGTA

I CD5.L1.HP TTCCCAGATAacactctttccctacacgacgctct (SEQ ID NO:414) tccgatctAGCCCAGGTCACAGATCTTCCaagatt

atCTGGGAA

i CD69.L1.HP TATTTCCTAGATacactctttccctacacgacgct (SEQ ID NO:415) cttccgatctGGAAAATGTGCAATATGTGATGTGG

CAttcacaatcTAGGAAATA

i CD80.L1.HP TTACACAGTGacactctttccctacacgacgctct (SEQ ID NO:416) tccgatctGCTACCTCACTATGCTGCTTCACAAta

acacacaCTGTGTAA

i CD8A.L1.HP TAAGCAAGTacactctttccctacacgacgctctt (SEQ ID NO:417) ccgatctGCCCAAACTGCTGTCCCAAACtccatac

aaCTTGCTTA i CSF2.L1.HP TTGGTCCAAAGGacactctttccctacacgacgct (SEQ ID NO:418) cttccgatctACAAGAGCTAGAAACTCAGGATGGT

CtcaacctttGGACCAA

i CTLA4.L1.HP TTCCTAGAGTTAGacactctttccctacacgacgc (SEQ ID NO:419) tcttccgatctGCTTACTCCAGGAGACCCACAGtc

tactaacTCTAGGAA

; CXCL8.L1.HP TAGTTCTATGAGTacactctttccctacacgacgc (SEQ ID NO:420) tcttccgatctTTGGTAGTGCTGTGTTGAATTACG

GAtcttactcaTAGAACTA

; CXCR3.L1.HP TTGGCAGacactctttccctacacgacgctcttcc (SEQ ID NO:421) gatctTCTGAGGACTGCACCATTGCTtacaacatt

CTGCCAA

i CXCR4.L1.HP TTCTAGCTAAGacactctttccctacacgacgctc (SEQ ID NO:422) ttccgatctCTGAACATTCCAGAGCGTGTAGTGAt

cataccttAGCTAGAA

i CXCR5.L1.HP TTCCTCGAATacactctttccctacacgacgctct (SEQ ID NO:423) tccgatctGCTCACGTGAGAGTGTCTTCACGtatt

ctattCGAGGAA

i CXCR6.L1.HP TTGCACATAacactctttccctacacgacgctctt (SEQ ID NO:424) ccgatctGCTGATAATTCCAGTGGTCCATGGAtca

tcctATGTGCAA

i EBI3.L1.HP TTCTCAGCacactctttccctacacgacgctcttc (SEQ ID NO:425) cgatctCCGACTTTTCCCTTTGAGCCTCtcaacaa

tGCTGAGAA

I EGR2. LI . HP TAGGCAGTGacactctttccctacacgacgctctt (SEQ ID NO:426) ccgatctATCACTCCCTGAGTTTAGTATGGCTGat

ctccatccaCTGCCTA

i EGR3.L1.HP TTGGCATTGacactctttccctacacgacgctctt (SEQ ID NO:427) ccgatctTCACCAACATTTCATTTGCTCCTTTGTC

tctacccaatcAATGCCAA

i EO ES.L1.HP TAAGAAGTATTGacactctttccctacacgacgct (SEQ ID NO:428) cttccgatctGGAGAGTTTCATCATCCCCATGATA

TTTGGctccaaTACTTCTTA

i FAS. LI. HP TGCTTTCATAacactctttccctacacgacgctct (SEQ ID NO:429) tccgatctAGGCTATTTGCAGAAGGAGCTCAatct

cataTGAAAGCA

i FOXP3.L1.HP TGTGCGTTacactctttccctacacgacgctcttc (SEQ ID NO:430) cgatctCCTGTTAGAATTCACCTGTGTATCTCACG

atctccataACGCACA

; GATA3.L1.HP TACCCTCTTacactctttccctacacgacgctctt (SEQ ID NO:431) ccgatctCATCTGTCTTGTCCCTATTCCTGCAtaa

ctacaaGAGGGTA

i GGT1.L1.HP TTGCACCTGTacactctttccctacacgacgctct (SEQ ID NO:432) tccgatctCCAGATCGCGTCCACCTTCtcacacaa

caGGTGCAA

i GZMA.Ll.HP TTCCAGAAAGacactctttccctacacgacgctct (SEQ ID NO:433) tccgatctCCGAGGTGGAAGAGACTCGTGattctt

tctTTCTGGAA

; HAVCR2. LI . HP TTCTGTGATGGacactctttccctacacgacgctc (SEQ ID NO:434) ttccgatctACTTCTGTATTCGTGGACCAAACTGA

AGactataccaTCACAGAA

I HLA-DRA. LI . HP TTAGAGTACTTGTacactctttccctacacgacgc (SEQ ID NO:435) tcttccgatctCCCCAAGTGTGGATATGCCTCTTC

ttcatacaaGTACTCTAA

i ICOS.L1.HP TGCAAACATGacactctttccctacacgacgctct (SEQ ID NO:436) tccgatctGACTGGCTTTGCACAGGTGTcaaactt

caTGTTTGCA

i IFNG.Ll.HP TAAACACACacactctttccctacacgacgctctt (SEQ ID NO:437) ccgatctGCAGCCAACCTAAGCAAGATCCatatct

ccaaGTGTGTTTA

i IL10.L1.HP TTCCATCTGAATAAGacactctttccctacacgac (SEQ ID NO:438) gctcttccgatctAGGCTGAGGCAAGAGAATTGCT

ccttattcAGATGGAA

i IL12A.L1.HP TTAAAGTGAAactctttccctacacgacgctcttc (SEQ ID NO:439) cgatctGCTAAGAAGGGAAAATATCCATCCTGAAG

GatcaaccatTTCACTTTAA

I IL12B.L1.HP TTCTCACATTacactctttccctacacgacgctct (SEQ ID NO:440) tccgatctTCAGAATGGCAGGTGGCTTCTcttatc

aaaTGTGAGAA

i IL13.L1.HP TTCTGCCAGAacactctttccctacacgacgctct (SEQ ID NO:441) tccgatctCCTCATCCGAGGCAGGGTatcttctct

GGCAGAA

i IL15.L1.HP TTAGTTTTCTGacactctttccctacacgacgctc (SEQ ID NO:442) ttccgatctCCAGGGAAATCAAAAGATTGGATGCC

aactcatcaGAAAACTAA

i IL16.L1.HP TTAGGGATTGTAAGacactctttccctacacgacg (SEQ ID NO:443) ctcttccgatctGCGTGCTCAGTTCAGAATCACTT

CactcttacaATCCCTAA

i IL17A.L1.HP TTAATCTCCGacactctttccctacacgacgctct (SEQ ID NO:444) tccgatctGTCCAGTTTCTCCCCTAGACTCAGtac caacacGGAGATTAA

i IL2.L1.HP TAGAAGGCGacactctttccctacacgacgctctt (SEQ ID NO:445) ccgatctACTGACTTGATAATTAAGTGCTTCCCAC

TTctcgtcGCCTTCTA

i IL2RA. LI . HP TGAGATTCGAacactctttccctacacgacgctct (SEQ ID NO:446) tccgatctCTCCCCCTTCAGGTATATGTTTTCTGA

GcttttcGAATCTCA

i IL2RB.L1.HP TTCCATTTGTTTacactctttccctacacgacgct ( SEQ ID NO:447) cttccgatctCCTCTCTGCAAGTCGGTCTCCactc

aaaaaCAAATGGAA

IL4.L1.HP TTCAGGAAAGTTacactctttccctacacgacgct (SEQ ID NO:448) cttccgatctCAGTTCCACAGGCACAAGCAGaact

caactTTCCTGAA

i IL5.L1.HP TCTGGCATGAacactctttccctacacgacgctct (SEQ ID NO:449) tccgatctACTTCAGAGGGAAAGTAAATATTTCAG

GCActactcaTGCCAGA

i IL6.L1.HP TGGCTCTTGTacactctttccctacacgacgctct (SEQ ID NO:450) tccgatctTGGAAAGTGGCTATGCAGTTTGAATAT

CCcaaccacacaAGAGCCA

i IL7.L1.HP TGACAATGGTTcactctttccctacacgacgctct (SEQ ID NO:451) tccgatctTCACCTAGTCTAAGGATGCTAAACCTT

AGTtaccttaacCATTGTCA

i IL7R.L1.HP TGCATCATAAAGacactctttccctacacgacgct (SEQ ID NO:452) cttccgatctGCCAGGAAGAATATGTGGCAGAGCa

aatacctttATGATGCA

; IL9.L1.HP TTGCCTGacactctttccctacacgacgctcttcc (SEQ ID NO:453) gatctCCTGTGAACAGCCATGCAACCtttaatacc

CAGGCAA

I ITCH. Ll. HP TTTGGTCTTTTGTacactctttccctacacgacgc (SEQ ID NO:454) tcttccgatctACAGCCCATCTGGATTGTGGTtac

cacaaaAGACCAAA

i ITGA2.L1.HP TACAAACATTacactctttccctacacgacgctct (SEQ ID NO:455) tccgatctTGTTTTGCCTCTTTCCACAAAAACTGT

ttcctaATGTTTGTA

i ITGAE . Ll . HP TGATAGCCATTAGacactctttccctacacgacgc (SEQ ID NO:456) tcttccgatctCGAAGAAGAGAATTAGGACCTGCT

ATCCtcactaatGGCTATCA

i ITGAL . Ll . HP TTCCTTGATGacactctttccctacacgacgctct (SEQ ID NO:457) tccgatctTGGGCAACACAGCGAGACaaaccacaa

tcaTCAAGGAA i ITGAX.L1.HP TAGAGGGATGacactctttccctacacgacgctct (SEQ ID NO:458) tccgatctGGCAACATTGCTGGCTGGAAGtcttct

catCCCTCTA

i KLRG1.L1.HP TTGAGGCGacactctttccctacacgacgctcttc (SEQ ID NO:459) egatct GTGGTTGGATCACCAAATTATCT AGGT

tacttctacttcGCCTCAA

; LCK.L1.HP TTGTACAGATAGacactctttccctacacgacgct (SEQ ID NO:460) cttccgatctTGTACATGTGTAGCCTGTGCATGTA

TGaacatctatCTGTACAA

; EDD4. LI . HP TGTCCTTGGAacactctttccctacacgacgctct (SEQ ID NO:461) tccgatctGCGGGACAGTCAGTAGGATCAacactt

cCAAGGACA

i PECAM1.L1.HP TGACCTGAacactctttccctacacgacgctcttc (SEQ ID NO:462) cgatctGTGCATGCGACCCCTTCCttcacaactCA

GGTCA

i PPIA.Ll.HP TTGATTGCAacactctttccctacacgacgctctt (SEQ ID NO:463) ccgatctCCCACCTTAACAGACCTCTAGGGTTacc

ttaaatGCAATCAA

i PRDM1.L1.HP TTCACATGATacactctttccctacacgacgctct (SEQ ID NO:464) tccgatctTCACCATCCATCCTTCTCTTTTCTGCa

caactacacatCATGTGAA

i PRF1.L1.HP TTGGCCTAacactctttccctacacgacgctcttc (SEQ ID NO:465) cgatctAGATCACAGCTTCAGCCAGGAGacttcat

ctAGGCCAA

I PTPRC.L1.HP TAATGGTTGTacactctttccctacacgacgctct (SEQ ID NO:466) tccgatctACCAGGAATGGATGTCGCTAATCATtt

cttttcaCAACCATTA

i RNF128.L1.HP TAGCTCTAGacactctttccctacacgacgctctt (SEQ ID NO:467) ccgatctGGTGCCTTAATATAAAGTTTGAAGCTTC

ATCCAaaccttcTAGAGCTA

i RORC.L1.HP TGTCCTCAGAacactctttccctacacgacgctct (SEQ ID NO:468) tccgatctGTCATTGGGCACCGCTGACcaatcatc

tGAGGACA

i RPS3.L1.HP TTGACCCTacactctttccctacacgacgctcttc (SEQ ID NO:469) cgatctCCAAGGGATTTGCATCTGTGGATTCAtaa

ttaccaGGGTCAA

i RU X1.L1.HP TATCTAGAATGGacactctttccctacacgacgct (SEQ ID NO:470) cttccgatctTGAGAGAATATCCCAGAACCCTCTC

TaattctccaTTCTAGATA

; RU X3.L1.HP TATGTCACTGGacactctttccctacacgacgctc (SEQ ID NO:471) ttccgatctTGAGCCTGTCAGTAGTGGGTACattc

aaccaGTGACA A

i SELL. LI. HP TTCATTGTactctttccctacacgacgctcttccg (SEQ ID NO:472) atctTGGAAGAGTTAAAACAGGTGGAGAAATTCCa

cctcaaactcaCACAATGAA

i SNCA.L1.HP TTGGGAGGTATGacactctttccctacacgacgct (SEQ ID NO:473) cttccgatctGGATCAATCCAGTCCTAGGTTTATT

TTGCAttacatacCTCCCAA

; TBX21.L1.HP TTTGAGCATTTTTGacactctttccctacacgacg (SEQ ID NO:474) ctcttccgatctGACTTTGGACAGCTGGCCTGtca

caaaaaTGCTCAAA

I TGFB1.L1.HP TTCATTAATGacactctttccctacacgacgctct (SEQ ID NO:475) tccgatctGAGCACCTTGGGCACTGTTGttcaaaa

tCATTAATGAA

i TIGIT.L1.HP TGGTGGAAGTacactctttccctacacgacgctct (SEQ ID NO:476) tccgatctGACCATTTCATAGTTGGATTCCTGGAG

AatctacactTCCACCA

i TNF.Ll.HP TTCACGGTGTacactctttccctacacgacgctct (SEQ ID NO:477) tccgatctGGAGCTGCCTTGGCTCAGAataccaca

caCCGTGAA

i TNFRSF18. LI . HP TGGTCACATATGTacactctttccctacacgacgc (SEQ ID NO:478) tcttccgatctAGCAGAAGTGGGTGCAGGAAtcat

cacatatGTGACCA

i TNFRSF9. LI . HP TAGTCATCATGacactctttccctacacgacgctc (SEQ ID NO:479) ttccgatctGGAAGATATCACTCTGACGGAAAGTT

TTGActatcatGATGACTA

I UBB.L1.HP TTCCAGCATGacactctttccctacacgacgctct (SEQ ID NO:480) tccgatctCCAGCAGAGGCTCATCTTTGCtccatt

catGCTGGAA

i VEGFA.L1.HP TGTATATTGTGacactctttccctacacgacgctc (SEQ ID NO:481) ttccgatctTCCCCAGCACACATTCCTTTGAttct

tccaCAATATACA

i VTCN1. LI . HP TTCAAAAGTGacactctttccctacacgacgctct (SEQ ID NO:482) tccgatctGGCAACAAACATATACCTTCCATGAAG

CtatatatatcACTTTTGAA

i P5.rcRBC001.Rdlx. tcgtgtaggAATGATACGGCGACCACCGAGATCTA (SEQ ID NO:483) HP CACgcaacgtaagACACTCTTTCCCTACACGA

i P5.rcRBC002.Rdlx. tcgtgtaggAATGATACGGCGACCACCGAGATCTA (SEQ ID NO:484) HP CACgctgctcgaaACACTCTTTCCCTACACGA

i P5.rcRBC003.Rdlx. tcgtgtaggAATGATACGGCGACCACCGAGATCTA (SEQ ID NO:485) i HP CACtagaacaacgACACTCTTTCCCTACACGA

i P5 rcRBC004 . Rdlx . tcgtgtaggAATGATACGGCGACCACCGAGATCTA \ (SEQ ID NO 486) i HP CACacgtcggataACACTCTTTCCCTACACGA

i P5 rcRBC005 . Rdlx . tcgtgtaggAATGATACGGCGACCACCGAGATCTA i (SEQ ID NO 487) i HP CACtcagagaatgACACTCTTTCCCTACACGA

: P5 rcRBC006 . Rdlx . tcgtgtaggAATGATACGGCGACCACCGAGATCTA \ (SEQ ID NO 488) : HP CACatagctcgaaACACTCTTTCCCTACACGA

i P5 rcRBC007 . Rdlx . tcgtgtaggAATGATACGGCGACCACCGAGATCTA i (SEQ ID NO 489) : HP CACgacacggataACACTCTTTCCCTACACGA

i P5 rcRBC008 . Rdlx . tcgtgtaggAATGATACGGCGACCACCGAGATCTA i (SEQ ID NO 490) ; HP CACattgttgcgaACACTCTTTCCCTACACGA

i P5 rcRBC009 . Rdlx . tcgtgtaggAA GATACGGCGACCACCGAGA CTA i (SEQ ID NO 491) : : HP CACcgatagaagaACACTCTTTCCCTACACGA

P5 rcRBCOlO Rdlx. tcgtgtaggAATGATACGGCGACCACCGAGATCTA \ (SEQ ID NO 492) HP CACtaccgtatgaACACTCTTTCCCTACACGA

i P5 rcRBCOll . Rdlx . tcgtgtaggAATGATACGGCGACCACCGAGATCTA \ (SEQ ID NO 493) ; HP CACatggagtctaACACTCTTTCCCTACACGA

i ^p5 rcRBC012 Rdlx. tcgtgtaggAATGATACGGCGACCACCGAGATCTA i (SEQ ID NO 494) ; HP CACagctccgtagACACTCTTTCCCTACACGA

: P5 rcRBC013 Rdlx. tcgtgtaggAATGATACGGCGACCACCGAGATCTA i (SEQ ID NO 495) :

: HP CACgaatctacagACACTCTTTCCCTACACGA

: P5 rcRBC014 Rdlx. tcgtgtaggAATGATACGGCGACCACCGAGATCTA \ (SEQ ID NO 496) HP CACcaatacgctaACACTCTTTCCCTACACGA

i P5 rcRBCOlS Rdlx. tcgtgtaggAATGATACGGCGACCACCGAGATCTA i (SEQ ID NO 497) j HP CACcaacatgagaACACTCTTTCCCTACACGA

i P5 rcRBC016 Rdlx. tcgtgtaggAATGATACGGCGACCACCGAGATCTA i (SEQ ID NO 498) j i HP CACggcaacagaaACACTCTTTCCCTACACGA

i ^p5 rcRBC017 Rdlx. tcgtgtaggAATGATACGGCGACCACCGAGATCTA ^; (SEQ ID NO. 499) I HP CACagagtgcataACACTCTTTCCCTACACGA

; P5 rcRBC018 Rdlx. tcgtgtaggAATGATACGGCGACCACCGAGATCTA (SEQ ID NO: 500) i i HP CACcactgtcggaACACTCTTTCCCTACACGA

; P5 rcRBC019 Rdlx. tcgtgtaggAATGATACGGCGACCACCGAGATCTA (SEQ ID NO: 501) j HP CACgatcgtaacaACACTCTTTCCCTACACGA

p⁵ rcRBC020 Rdlx. tcgtgtaggAATGATACGGCGACCACCGAGATCTA (SEQ ID NO: 502) : j HP CACacatgctccaACACTCTTTCCCTACACGA

P5. rcRBC021 Rdlx. tcgtgtaggAATGATACGGCGACCACCGAGATCTA (SEQ ID NO: 503) HP CACgcacattcgaACACTCTTTCCCTACACGA

; P5. rcRBC022 Rdlx. tcgtgtaggAATGATACGGCGACCACCGAGATCTA (SEQ ID NO: 504) i HP CACggcgatactcACACTCTTTCCCTACACGA

; P5. rcRBC023 Rdlx. tcgtgtaggAATGATACGGCGACCACCGAGATCTA (SEQ ID NO: 505) i

Claims

CLAIMS What is claimed is:

1. A method for preparing DNA templates for single-cell transcript sequencing of RNA from a population of cells, the method comprising:

distributing cells from the population into separate reaction volumes so that a plurality of separate reaction volumes each comprise a single cell, wherein the cells have been treated with a fixative prior to distribution, wherein the fixative comprises a cell-permeant fixative that enables the production of DNA templates from transcripts derived from single cells with greater efficiency than in the absence of said cell-permeant fixative;

permeabilizing or disrupting each cell in each separate reaction volume;

reverse transcribing cDNA from RNA in each separate reaction volume; and

amplifying the cDNA to produce DNA templates, wherein the amplification incorporates one or more nucleotide sequences that facilitate DNA sequencing of the DNA templates.

2. The method of claim 1, wherein the fixative stabilizes the cell nucleus and/or stabilizes RNA.

3. The method of claim 1, wherein the method comprises the prior treatment of the cells with the fixative.

4. The method of claims 1 or 3, wherein the fixative comprises biomarker and histology preservative (BHP).

5. The method of claims 1 or 3, wherein the fixative comprises dithiobis(succinimydal proprionate) (DSP).

6. The method of any of claims 1-5, wherein the DNA templates are recovered from the separate reaction volumes in one or more pools of DNA templates.

7. The method of any of claims 1-6, wherein the DNA templates are further amplified after recovery.

8. The method of any of claims 1-7, wherein the method additionally comprises subjecting the DNA templates to DNA sequencing.

9. The method of any of claims 1-8, wherein the method comprises preparing DNA templates for single-cell transcript sequencing of T-cell receptor or immunoglobulin RNA from the population, wherein:

the cells comprise T cells or B cells; and

the DNA templates are generated from T-cell receptor or immunoglobulin RNA, respectively.

10. The method of claim 9, wherein the cells are T cells.

11. The method of claim 9, wherein the cells are B cells.

12. The method of claims 10 or 11, wherein the cells are activated.

13. A primer combination for producing DNA templates from RNA encoding T-cell receptor or immunoglobulin chains, the primer combination comprising:

a first reverse transcription (RT) primer having a portion specific for the Constant segment (CS) of an RNA encoding a first chain of a T-cell receptor or immunoglobulin, the T-cell receptor or immunoglobulin also comprising a second chain;

a second reverse transcription (RT) primer having a portion specific for the Constant segment (CS) of an RNA encoding the second chain of the T-cell receptor or immunoglobulin, respectively;

wherein the first and second RT primers each additionally comprise a first nucleotide tag comprising a first primer binding site for a first DNA sequencing primer, the first nucleotide tag being 5' of the CS-specific portion; and

a first barcode primer that comprises, from 3' to 5', a portion specific for the first primer binding site, a first barcode nucleotide sequence, and a first sequencing adaptor; a plurality of first amplification primers, each having a portion specific for a different Variable segment (VS) of the RNA encoding the first chain of the T-cell receptor or immunoglobulin;

a plurality of second amplification primers, each having a portion specific for a different Variable segment (VS) of the RNA encoding the second chain of the T-cell receptor or immunoglobulin, respectively;

wherein the first and second amplification primers each additionally comprise a second nucleotide tag comprising a second primer binding site for a second DNA sequencing primer, the second nucleotide tag being 5' of the VS-specific portion; and

a second barcode primer that comprises, from 3' to 5', a portion specific for the second primer binding site, a second barcode nucleotide sequence, and a second sequencing adaptor.

14. A primer combination for producing DNA templates from RNA encoding T-cell receptor or immunoglobulin chains, the primer combination comprising:

a first barcode primer that comprises, from 3' to 5', a portion specific for the first primer binding site, a first barcode nucleotide sequence, and a first sequencing adaptor;

a 5' oligonucleotide comprising, from 5' to 3', the second primer binding site for the second DNA sequencing primer, a unique molecular identifier (UMI), and an oligo-riboG sequence; and a second barcode primer that comprises, from 3' to 5', a portion specific for the second primer binding site, a second barcode nucleotide sequence, and a second sequencing adaptor.

15. A kit comprising the primer combination of claim 13 or claim 14 and a matrix-type microfluidic device comprising:

capture sites arranged in a matrix of R rows and C columns, wherein R and C are integers greater than 1, and wherein the capture sites can be fluidically isolated from one another after distribution of cells to the capture sites;

a set of R first input lines configured to deliver the first reagent(s) to capture sites in a particular row;

a set of C second input lines configured to deliver second reagent(s) to capture sites in a particular column, wherein said delivery is separate from the delivery first reagent(s),

wherein, after a reaction, reaction products can be recovered from the microfluidic device in pools of reaction products from individual rows or columns.