WO2013097744A1 - 用于扩增cdr3编码序列的引物组合物及其用途 - Google Patents
用于扩增cdr3编码序列的引物组合物及其用途 Download PDFInfo
- Publication number
- WO2013097744A1 WO2013097744A1 PCT/CN2012/087668 CN2012087668W WO2013097744A1 WO 2013097744 A1 WO2013097744 A1 WO 2013097744A1 CN 2012087668 W CN2012087668 W CN 2012087668W WO 2013097744 A1 WO2013097744 A1 WO 2013097744A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- primer
- coding sequence
- cdr3
- chain
- cell receptor
- Prior art date
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6881—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for tissue or cell typing, e.g. human leukocyte antigen [HLA] probes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/16—Primer sets for multiplex assays
Definitions
- the invention relates to the field of biotechnology.
- the invention relates to primer compositions for amplifying CDR3 coding sequences and their use. More specifically, the present invention relates to a primer composition for amplifying a CDR3 coding sequence, a method of enriching a CDR3 coding sequence, a method of constructing a sequencing library of a CDR3 coding sequence, a method of determining sequence information of a CDR3 coding sequence, and determining an individual immunity State methods, systems for determining individual immune status, and kits.
- the immune system is an important system for the body to resist the invasion and immune regulation of pathogens. It is of great significance to study it.
- Immunoglobulins, T cell receptors (TCRs) and HLAs human leukocyte antigens
- TCRs T cell receptors
- HLAs human leukocyte antigens
- the diversity of immune macromolecules allows the body to recognize numerous foreign substances and remove some of the metabolites produced in the body.
- the mechanisms underlying immune macromolecular diversity include gene rearrangements, somatic mutations, and insertion and deletion of non-template nucleotides.
- the immunoglobulin heavy chain rearrangement finger, D and J gene segments are rearranged to form a plurality of CDR3 coding sequences.
- the first is the rearrangement of the DJ gene fragment, and the rearranged gene fragment is combined with the V gene to obtain the CDR3 encoding.
- the ⁇ chain gene rearrangement is performed first. If the rearrangement fails, the ⁇ chain gene begins to rearrange, and various mechanisms such as rearrangement and insertion and deletion of non-template nucleotides are generated. Specific gene fragment. Thus, 10 11 specific molecules may be produced in an individual.
- the diversity of sputum cell receptors is mainly caused by the rearrangement of the ⁇ -chain and ⁇ -chain of the sputum cell receptor mainly expressed by human peripheral blood sputum cells, while the sputum cell receptor ⁇ chain and ⁇ chain rearrangement cause sputum cells.
- the diversity of the receptor alpha chain and beta chain CDR3 coding sequences can better represent the diversity of sputum cell receptors and even the immune status of individuals.
- the present invention aims to solve at least one of the technical problems existing in the prior art. To this end, it is an object of the present invention to provide a means for efficiently enriching the CDR3 coding sequences of the immunoglobulin heavy and light chain, sputum cell receptor alpha chain and beta chain.
- the invention provides a primer composition for amplifying a CDR3 coding sequence.
- the primer composition comprises a first primer set consisting of at least one V-region primer, each of the at least one V-region primer comprising at least one V gene fragment a complementary sequence; and a second primer set, the second primer set consisting of at least one J region primer, each of the at least one J region primer comprising a sequence complementary to at least one J gene fragment, wherein
- the CDR3 coding sequence is derived from at least one selected from the group consisting of an immunoglobulin heavy chain, an immunoglobulin light chain, a T cell receptor a chain, and a T cell receptor ⁇ chain.
- the CDR3 coding sequence of the immunoglobulin heavy chain, the immunoglobulin light chain, the sputum cell receptor a chain and the T cell receptor ⁇ chain can be efficiently enriched, thereby deepening the above CDR3 Research provides a convenient tool.
- the invention also provides a method of enriching a CDR3 coding sequence.
- the method comprises the steps of: providing a nucleic acid sample comprising a nucleic acid sequence encoding a CDR3; and using the primer composition of the invention as described above, using the nucleic acid sample as a template, PCR amplification is performed to obtain an amplification product enriched for the CDR3 coding sequence, wherein the CDR3 coding sequence is derived from an immunoglobulin heavy chain, an immunoglobulin light chain, a purine cell receptor a chain, and a T At least one of the cell receptor beta chains.
- the CDR3 coding sequence of the immunoglobulin heavy chain, the immunoglobulin light chain, the sputum cell receptor a chain and the T cell receptor ⁇ chain can be efficiently enriched.
- the invention provides a method of constructing a sequencing library of CDR3 coding sequences.
- the method comprises the steps of: obtaining an amplification product enriched for the CDR3 coding sequence according to the method described above; and constructing a DNA sequencing library for the amplification product, the DNA sequencing The library constitutes a sequencing library of the CDR3 coding sequence, wherein the CDR3 coding sequence is derived from at least one selected from the group consisting of an immunoglobulin heavy chain, an immunoglobulin light chain, a purine cell receptor alpha chain, and a purine cell receptor beta chain.
- the CDR3 coding sequence is derived from at least one selected from the group consisting of an immunoglobulin heavy chain, an immunoglobulin light chain, a purine cell receptor alpha chain, and a purine cell receptor beta chain.
- the auto-globulin heavy chain, the immunoglobulin light chain, the sputum cell receptor alpha chain or the sputum cell receptor beta can be Based on the enrichment of the CDR3 coding sequence of the strand, a sequencing library that can be used for sequencing is constructed.
- the invention proposes a method of determining sequence information of a CDR3 coding sequence.
- the method comprises the steps of: constructing a sequencing library of CDR3 coding sequences according to the methods described above; and sequencing the sequencing library of the CDR3 coding sequence to determine the sequence of the CDR3 coding sequence Information, wherein the CDR3 coding sequence is derived from at least one selected from the group consisting of an immunoglobulin heavy chain, an immunoglobulin light chain, a T cell receptor alpha chain, and a T cell receptor beta chain.
- the sequence information of the CDR3 coding sequence of the immunoglobulin heavy chain, the immunoglobulin light chain, the sputum cell receptor a chain and the T cell receptor ⁇ chain can be efficiently determined.
- the invention provides a method of determining an immune status of an individual.
- the method comprises the steps of: sequencing the CDR3 coding sequence of the individual according to the method described above to obtain a sequencing result consisting of a plurality of sequencing data; and based on the sequencing result, The immune state of the individual is determined, wherein the CDR3 coding sequence is derived from at least one selected from the group consisting of an immunoglobulin heavy chain, an immunoglobulin light chain, a purine cell receptor alpha chain, and a purine cell receptor beta chain.
- the sequence information of the CDR3 coding sequence of the individual immunoglobulin heavy chain, immunoglobulin light chain, sputum cell receptor a chain or T cell receptor ⁇ chain can be effectively obtained, thereby effectively determining the individual immunity status.
- the invention proposes a system for determining an immune status of an individual.
- the system comprises: a CDR3 coding sequence enrichment device, wherein the CDR3 coding sequence enrichment device is provided with the primer composition described above to enrich the CDR3 coding sequence of the nucleic acid sample of the individual
- the CDR3 coding sequence is derived from at least one selected from the group consisting of an immunoglobulin heavy chain, an immunoglobulin light chain, a purine cell receptor alpha chain, and a purine cell receptor beta chain
- a library construction device the library construction a device coupled to said CDR3 coding sequence enrichment device for constructing a sequencing library of CDR3 coding sequences for said enriched CDR3 coding sequence
- a sequencing device said sequencing device being coupled to said library construction device for use in a sequencing library of the CDR3 coding sequence is sequenced to obtain a sequencing result composed of a plurality of sequencing data
- an analysis device coupled to the sequencing
- the present invention also provides a kit.
- the kit is provided with the primer composition described above.
- the kit can be used to detect VJ rearrangements of immunoglobulin heavy chains, immunoglobulin light chains, purine cell receptor alpha chains, and purine cell receptor beta chains, or to detect immunoglobulin heavy chains, Coding sequences for the CDR3 of the immunoglobulin light chain, the sputum cell receptor a chain, and the T cell receptor beta chain.
- Figure 1 is a schematic view showing the structure of CDR3 composed of V H , D H and J H in an immunoglobulin heavy chain;
- FIG. 2 is a schematic flow diagram of enriching and sequencing a CDR3 coding sequence according to an embodiment of the present invention
- FIG. 3 is a schematic diagram showing the structure of a system for determining an immune state of an individual according to an embodiment of the present invention
- Figure 4 is a result of agarose gel electrophoresis of the multiplex PCR product in Example 1 of the present invention.
- Figure 5 is a diagram showing the results of paired distribution and abundance of the obtained V-J gene fragment after sequencing the immunoglobulin heavy chain CDR3 coding sequence in Example 1 of the present invention
- Figure 6 is a schematic diagram showing the CDR3 sequence consisting of the primer-amplified V-J rearrangement in the immunoglobulin light chain in Example 2 of the present invention
- Figure 7 is a result of the multiplex PCR product agarose gel electrophoresis in Example 2 of the present invention
- Figure 8 is a diagram showing the results of the distribution and abundance of the obtained V K -J K gene fragment after sequencing the immunoglobulin light chain CDR3 sequence in Example 2 of the present invention.
- Figure 9 is a diagram showing the results of the distribution and abundance of the V A -JA gene fragment obtained by sequencing the immunoglobulin light chain CDR3 sequence in Example 2 of the present invention.
- Figure 10 is a result of agarose gel electrophoresis of the multiplex PCR product in Example 3 of the present invention.
- FIG. 1 is a schematic diagram showing the results of paired distribution and abundance of the V-J gene fragment obtained by sequencing the T cell receptor a-chain CDR3 coding sequence in Example 3 of the present invention
- Figure 12 is a result of agarose gel electrophoresis of the multiplex PCR product in Example 4 of the present invention.
- Fig. 13 is a view showing the results of paired distribution and abundance of the obtained VJ gene fragment after sequencing the T cell receptor ⁇ -chain CDR3 coding sequence in Example 4 of the present invention. Detailed description of the invention
- the invention proposes a primer composition for amplifying a CDR3 coding sequence.
- the primer composition comprises a first primer set and a second primer set.
- the first primer set is composed of at least one V region primer
- the second primer set is composed of at least one J region primer derived from an immunoglobulin heavy chain, an immunoglobulin light chain, a T cell.
- V region primer refers to a primer that specifically recognizes an immunoglobulin heavy chain, an immunoglobulin light chain, a purine cell receptor a chain or a T cell receptor.
- the V gene fragment in the ⁇ chain family can thereby direct the polymerase chain reaction, whereby each of the V region primers contained in the first primer set contains a sequence complementary to at least one V gene fragment.
- J region primer refers to a primer that specifically recognizes an immunoglobulin heavy chain, an immunoglobulin light chain, a purine cell receptor a chain or a T.
- each of the J region primers contained in the first primer set contains a sequence complementary to at least one J gene fragment .
- the coding comprising the V gene fragment and the J gene fragment can be specifically amplified from the nucleic acid sample containing the nucleic acid sequence encoding the CDR3 by an amplification reaction such as PCR amplification. sequence.
- the immunoglobulin heavy chain, the immunoglobulin light chain, the T cell receptor alpha chain and the T cell receptor ⁇ chain are encoded by V, D, J gene segments or rearrangement products of V and J gene segments,
- an amplification product of the CDR3 coding sequence can be efficiently amplified from the nucleic acid sample containing the above CDR3 coding sequence.
- the primer composition according to the embodiment of the present invention the coding sequence for obtaining the CDR3 can be amplified and enriched from the sample with high specificity, and no other interference sequence exists.
- CDR3 coding sequences from different sources, such as CDR3 coding sequences derived from immunoglobulin heavy chain, immunoglobulin light chain, purine cell receptor alpha chain or purine cell receptor beta chain, respectively.
- a primer composition that amplifies the CDR3 coding sequence.
- the CDR3 coding sequence is derived from an immunoglobulin heavy chain.
- the primer composition of the present invention for amplifying a CDR3 coding sequence can have the following characteristics:
- the roles of the V region primer and the J region primer in the PCR amplification process are not particularly limited.
- the V region primer can serve as a sense strand primer and the J region primer can serve as an antisense strand primer.
- the inventors have found that by setting the V region primer and the J region primer in this way, the enrichment efficiency when the primer composition is used for enrichment of the CDR3 coding sequence can be further improved.
- the type of immunoglobulin to which the above primer composition can be applied is not particularly limited, and those skilled in the art can select an appropriate immunoglobulin as a research object according to research needs.
- the immunoglobulin is a human immunoglobulin.
- the primer composition can be effectively used for enriching the human immunoglobulin heavy chain CDR3 coding sequence, thereby being effectively used for studying human immune status.
- a sequence of the V region primer and the J region primer can be selected to realize that one primer can recognize a plurality of V gene segments, thereby further improving the amplification efficiency and reducing the number of primers. Fight for low cost.
- at least one primer of the first primer set comprises a sequence complementary to a conserved region of a plurality of V gene segments.
- At least one of the second primer set comprises a sequence complementary to a conserved region of a plurality of J gene segments.
- the efficiency of enrichment of the immunoglobulin heavy chain CDR3 coding sequence can be improved while reducing the number of primers, and the inventors have also found that such an operation can improve the uniformity of amplification of each CDR3 coding sequence, thereby enabling real The distribution ratio of the CDR3 coding sequence in the host is reflected.
- the second primer set may be composed of only one primer.
- the second primer set consists of a degenerate primer comprising a sequence complementary to a conserved region of a plurality of J gene segments.
- the present invention proposes a set of V region primers (SEQ ID NO: 1-20) and a J region primer (SEQ ID NO: 1-20) for the sequence characteristics of the human immunoglobulin heavy chain CDR3. : 21 ), its sequence and name are summarized as follows:
- V-zone or J-sub-family listed in the above table is not exhaustive, but only all the sub-families currently reported.
- the inventors have surprisingly found that the use of the above specific primer sequences can fully cover all subfamilies of human immunoglobulin CDR3, including the discovery of a new CDR3 subfamily, thereby enabling full enrichment of human immunoglobulin heavy chain CDR3
- the coding sequence in addition, the inventors have also found that by using the above primer sequences, multiplex PCR (also sometimes referred to as "multiplex PCR”) can be performed simultaneously in one PCR reaction system, and the CDR3 contained in the sample can be effectively performed.
- the coding sequence is amplified, and the uniformity of amplification of each CDR3 coding sequence can be ensured, thereby truly reflecting the distribution ratio of the CDR3 coding sequence in the host.
- the annealing temperature of the multiplex primer PCR may be 60 degrees Celsius. The inventors have surprisingly found that the efficiency of multiplex primer PCR amplification of CDR3 is significantly improved when the annealing temperature is 60 degrees Celsius.
- the present invention contains specific primers which can amplify all V regions and J regions of the IMGT database immunoglobulin CDR3 sequence, and can enrich the coding sequence of the human immunoglobulin heavy chain CDR3 in the most comprehensive manner.
- the amplified product is capable of distinguishing each subfamily of immunoglobulin heavy chains, and the same template is not specifically amplified by the two sets of primers.
- the primers of the present invention can better represent the collection and distribution of immune macromolecules, and have a more reliable effect on finding information related to diseases or changes in immune system.
- the CDR3 coding sequence is derived from the immunoglobulin light chain.
- the primer composition of the present invention for amplifying a CDR3 coding sequence can have the following characteristics:
- the immunoglobulin light chain has two types, ⁇ -rearrangement and VK-JK rearrangement, specifically, in the embodiment of the present invention, V is specifically recognized.
- V is specifically recognized. 1 into the gene fragment primers, ie, SEQ ID NO: 22-36 in the first primer set and SEQ ID NO: 41-43 in the second primer set, capable of selectively and efficiently from the nucleic acid sample containing the CDR3 coding sequence
- Amplification of the amplification product of the lambda light chain CDR3 sequence; amplification of the kappa light chain CDR3 amplification product is similar, using SEQ ID NO: 37-40 in the first primer set and SEQ in the second primer set ID NO: 44 and SEQ ID NO: 45 primer set; similarly, using the first primer set SEQ ID NO: 22-40 and the second primer set SEQ ID NO: 41-45, multiplex PCR can be combined
- the CDR3 sequences of the two types of light chains of ⁇ , ⁇ are obtained; in the light chain,
- the roles of the V region primer and the J region primer in the PCR amplification process are not particularly limited.
- the V region primer can serve as a sense strand primer and the J region primer can serve as an antisense strand primer.
- the inventors have found that by setting the V region primer and the J region primer in this way, it is possible to further enhance the enrichment when the primer composition is used to enrich the immunoglobulin light chain CDR3 sequence. Set efficiency.
- the type of immunoglobulin to which the above primer composition can be applied is not particularly limited, and those skilled in the art can select an appropriate immunoglobulin as a research object according to research needs.
- the immunoglobulin is a human immunoglobulin.
- a sequence of the V region primer and the J region primer can be selected to realize that one primer can recognize a plurality of V gene segments, thereby further improving the amplification efficiency and reducing the number of primers. Fight for low cost.
- at least one primer of the first primer set comprises a sequence complementary to a conserved region of a plurality of V gene segments.
- At least one of the reverse primer sets comprises a sequence complementary to a conserved region of a plurality of J gene segments.
- the present invention proposes a set of V region primers (SEQ ID NO: 22-40) and a set of J region primers (SEQ ID NO: 22-40) for a sequence characteristic of a human immunoglobulin light chain CDR3 (SEQ ID NO: 22-40) : 41-45 ), its sequence and name are summarized as follows:
- R A/G
- Y C/T
- M A/C
- H A/C/T
- S C/G
- W A/T
- sequences listed in the above table can identify at least one V region or J region subfamily sequence, and the representative subfamily identified by it
- V-zone or J-sub-family listed in the above table is not exhaustive, but only all the sub-families currently reported.
- the inventors have surprisingly discovered that the use of the above specific primer sequences can fully cover all subfamilies of the human immunoglobulin light chain CDR3, including the discovery of a new CDR3 subfamily, thereby enabling full enrichment of human immunoglobulin light.
- the annealing temperature of the multiplex primer PCR may be 60 degrees Celsius.
- the inventors have surprisingly found that the efficiency of multiplex PCR amplification of CDR3 is significantly improved when the annealing temperature is 60 degrees Celsius.
- the present invention contains specific primers for mutating all V regions and J regions of the IMGT database immunoglobulin CDR3 sequence, and can be most comprehensively enriched for human immunoglobulin light chain CDR3 sequences, and amplified.
- the resulting product is capable of distinguishing each subfamily of immunoglobulin light chains, and the same template is not amplified by the specific binding of the two primers.
- the primers of the present invention can better represent the collection and distribution of immune molecules in the body, and have a more reliable effect on finding information related to diseases or changes in the immune system.
- the CDR3 coding sequence is derived from the T cell receptor a chain.
- the primer composition of the present invention for amplifying the CDR3 coding sequence may have the following characteristics:
- the roles of the V region primer and the J region primer in the PCR amplification process are not particularly limited.
- the V region primer can serve as a sense strand primer and the J region primer can serve as an antisense strand primer.
- the inventors have found that by setting the V region primer and the J region primer as described above, the enrichment efficiency when the primer composition is used for enrichment of the T cell receptor a chain CDR3 coding sequence can be further improved.
- the type of T cell receptor to which the above primer composition can be applied is not particularly limited, and those skilled in the art can select an appropriate T cell receptor as a research object according to research needs.
- the T cell receptor is a human T cell receptor.
- the primer composition can be effectively used to enrich the human T cell receptor a-chain CDR3 coding sequence, and thus can be effectively used for studying human immune status.
- a sequence of the V region primer and the J region primer can be selected to realize that one primer can recognize a plurality of V gene segments, thereby further improving the amplification efficiency and reducing the number of primers. Fight for low cost.
- at least one primer of the first primer set comprises a sequence complementary to a conserved region of a plurality of V gene segments.
- At least one of the second primer set comprises a sequence complementary to a conserved region of a plurality of J gene segments.
- the present invention provides a set of V region primers (SEQ ID NO: 46-84) and a set of J region primers (SEQ ID) for the sequence characteristics of the human sputum cell receptor alpha chain CDR3. NO: 85-134), the sequence and name are summarized as follows: Primer type Primer No. Primer sequence (SEQ ID NO: )
- the V region primer (SEQ ID NO: 46-84) and the J region primer (SEQ ID NO: 85-134) of the present invention are capable of distinguishing each subfamily and capable of More intuitively showing the VJ pairing after gene rearrangement and the preference of the V gene.
- the primers are easy to synthesize, the mismatch rate is low, the specificity is high, and the annealing temperatures of all the primers are close, thereby reducing the amplification bias. Sex.
- the inventors have surprisingly discovered that the use of the above specific primer sequences can fully cover all subfamilies of the human T cell receptor CDR3, including the discovery of a novel CDR3 subfamily, thereby fully enriching the human T cell receptor CDR3 coding sequence,
- multiplex primer PCR (sometimes referred to as "multiplex PCR") can be simultaneously performed in one PCR reaction system, and the CDR3 coding sequence contained in the sample can be efficiently amplified. And the uniformity of amplification of each CDR3 coding sequence can be ensured, thereby being able to truly reflect the distribution ratio of the CDR3 coding sequence in the host.
- the annealing temperature of the multiplex primer PCR may be 60 degrees Celsius.
- the inventors have surprisingly found that when the annealing temperature is 60 degrees Celsius, the efficiency of PCR amplification of CDR3 by multiplex primers is significantly improved.
- the present invention contains specific primers for amplifying all V region genes and J region genes of the T cell receptor a-chain CDR3 sequence of the IMGT database, and is the most comprehensive enrichment of human T cell receptors.
- the coding sequence of the a-chain CDR3, the amplified product is able to distinguish each subfamily of the T cell receptor a chain, and the same template is not specifically amplified by the two sets of primers.
- the primers of the present invention can better represent the collection and distribution of immune macromolecules in the body, and have a more reliable effect on finding information related to diseases or changes in the immune system.
- the CDR3 coding sequence is derived from the T cell receptor ⁇ chain.
- the primer composition of the present invention for amplifying the CDR3 coding sequence can have the following characteristics:
- the roles of the V region primer and the J region primer in the PCR amplification process are not particularly limited.
- the V region primer can serve as a sense strand primer and the J region primer can serve as an antisense strand primer.
- the inventors have found that by setting the V region primer and the J region primer in this way, the enrichment efficiency when the primer composition is used for enrichment of the T cell receptor ⁇ chain CDR3 coding sequence can be further improved.
- the type of sputum cell receptor to which the above primer composition can be applied is not particularly limited, and those skilled in the art can select an appropriate sputum cell receptor as a research object according to research needs.
- the sputum cell receptor is a human sputum cell receptor.
- the primer composition can be effectively used to enrich the human ⁇ cell receptor ⁇ chain CDR3 coding sequence, and thus can be effectively used for research on human immune status.
- a sequence of the V region primer and the J region primer can be selected to realize that one primer can recognize a plurality of V gene segments, thereby further improving the amplification efficiency and reducing the number of primers. Fight for low cost.
- at least one primer of the first primer set comprises a sequence complementary to a conserved region of a plurality of V gene segments.
- At least one of the second primer set comprises a sequence complementary to a conserved region of a plurality of J gene segments.
- the efficiency of enrichment of the T cell receptor ⁇ -chain CDR3 coding sequence can be improved while reducing the number of primers, and the inventors have also found that such an operation can improve the uniformity of amplification of each CDR3 coding sequence, thereby enabling The proportion of distribution of CDR3 coding sequences in the host is truly reflected.
- the present invention provides a set (SEQ ID NO: ⁇ / RTI> for the sequence characteristics of the human ⁇ cell receptor ⁇ chain CDR3
- Primer type primer sequence primer sequence (SEQ ID NO: ) I3/: O 899/-8osl£ m/-6oiAV
- the V region primer and the J region primer of the present invention are capable of distinguishing each subfamily, and can more intuitively display the VJ pairing after gene rearrangement and the preference of the V gene, and in addition, the primer synthesis is easy.
- the mismatch ratio is low, the specificity is high, and the annealing temperatures of all the primers are close to each other, so that the bias of the amplification can be low.
- the inventors have surprisingly discovered that the use of the above specific primer sequences can fully cover all subfamilies of the human T cell receptor CDR3, including the discovery of a novel CDR3 subfamily, thereby fully enriching the human T cell receptor CDR3 coding sequence,
- multiplex primer PCR (sometimes referred to as "multiplex PCR") can be simultaneously performed in one PCR reaction system, and the CDR3 coding sequence contained in the sample can be efficiently amplified. And the uniformity of amplification of each CDR3 coding sequence can be ensured, thereby being able to truly reflect the distribution ratio of the CDR3 coding sequence in the host.
- the annealing temperature of the multiplex primer PCR may be 60 degrees Celsius.
- the inventors have surprisingly found that when the annealing temperature is 60 degrees Celsius, the efficiency of PCR amplification of CDR3 by multiplex primers is significantly improved.
- the present invention contains specific primers for amplifying all V region genes and J region genes of the T cell receptor ⁇ -chain CDR3 sequence of the IMGT database, and is the most comprehensive enrichment of human T cell receptors.
- the coding sequence of the ⁇ -chain CDR3, the amplified product is able to distinguish the sub-families of the ⁇ -chain of the sputum cell receptor, and the same template is not specifically amplified by the two sets of primers.
- the primers of the present invention can better represent the collection and distribution of immune macromolecules in the body, and have a more reliable effect on finding information related to diseases or changes in the immune system.
- the invention also provides a method of enriching a CDR3 coding sequence.
- the method comprises the steps of: firstly, providing a nucleic acid sample comprising a nucleic acid sequence encoding CDR3; and subsequently, using the primer composition described above, using the provided nucleic acid sample as a template PCR amplification is carried out.
- an amplification product can be obtained by PCR amplification, and the amplification product is enriched in the CDR3 coding sequence.
- the CDR3 coding sequence is derived from at least one selected from the group consisting of an immunoglobulin heavy chain, an immunoglobulin light chain, a purine cell receptor a chain, and a T cell receptor ⁇ chain.
- the immunoglobulin heavy chain CDR3 coding sequence can be efficiently amplified, thereby effectively enriching the immunoglobulin CDR3 coding sequence.
- the source of the nucleic acid sample is not particularly limited. One skilled in the art can select the source from which the nucleic acid sample can be obtained, depending on the research needs. For example, to study the specific immune status of a tissue, immune cells can be extracted from the tissue or its vicinity as a source of nucleic acid samples.
- a single nuclear cell containing the above nucleic acid sample can be isolated from human peripheral blood, and the nucleic acid sample can be obtained by isolating the nucleic acid.
- the step of providing a nucleic acid sample may further comprise: first, isolating peripheral blood mononuclear cells from human peripheral blood; next, extracting a nucleic acid sample from peripheral blood mononuclear cells.
- a nucleic acid sample containing the nucleic acid sequence encoding the CDR3 can be efficiently obtained, whereby the efficiency of enriching the CDR3 coding sequence can be further improved.
- Those skilled in the art can extract peripheral blood mononuclear cells from peripheral blood by any conventional means.
- the peripheral blood mononuclear cells can be isolated by density gradient centrifugation. And genomic DNA and total RA can be extracted from the isolated peripheral blood mononuclear cells as a nucleic acid sample for amplification by a conventional means. Thereby, the nucleic acid sample can be obtained conveniently, quickly and at low cost. It will be understood by those skilled in the art that when amplification is performed as a nucleic acid sample when total RA is employed, total RA can be first converted to cDNA by reverse transcription according to experimental needs.
- the type of the above PCR is not particularly limited, that is, multiple PCR reactions may be performed in sequence, or multiple rounds of PCR amplification may be performed in one PCR system.
- PCR amplification is multiplex primer PCR amplification.
- the amplification product obtained by at least one isolation purification selected from agarose gel electrophoresis, magnetic bead purification, and purification column purification may be further included.
- the purity of the amplified product can be increased, thereby increasing the efficiency of enriching the coding sequence of the immunoglobulin heavy chain CDR3.
- the V region in the first primer set The primer may have the nucleotide sequence shown in SEQ ID NOS: 1-20, and the J region primer in the second primer set has the nucleotide sequence shown in SEQ ID NO: 21, and the length of the obtained amplification product Is a 140-280 bp;
- the V-region primer in the first primer set can be Having the nucleotide sequence shown in SEQ ID NOS: 22-40, the J region primer in the second primer set may have the nucleotide sequence shown in SEQ ID NOS: 41-45, and the length of the obtained amplification product a
- the invention features a method of constructing a sequencing library of CDR3 coding sequences.
- the method may comprise the steps of: First, obtaining an amplification product enriched for the CDR3 coding sequence according to the method described above. Next, a DNA sequencing library is constructed for the obtained amplification product, and the DNA sequencing library can be used as a sequencing library of the CDR3 coding sequence.
- the CDR3 coding sequence is derived from at least one selected from the group consisting of an immunoglobulin heavy chain, an immunoglobulin light chain, a T cell receptor a chain, and a T cell receptor ⁇ chain.
- a sequencing library that can be used for sequencing can be constructed based on the enrichment of the CDR3 coding sequences of the immunoglobulin heavy chain, the immunoglobulin light chain, the sputum cell receptor a chain or the T cell receptor ⁇ chain. .
- a method of constructing a DNA sequencing library for an amplification product is not particularly limited. According to an embodiment of the present invention, constructing the DNA sequencing library for the amplification product may further comprise:
- the amplified product is subjected to end repair to obtain an end-repaired amplification product.
- the terminal repair is carried out using a Klenow fragment, T4 DNA polymerase and T4 polynucleotide kinase, the Klenow fragment having 5' ⁇ 3' polymerase activity and 3' ⁇ 5' Digestive activity, but lacking 5' ⁇ 3' exonuclease activity.
- the efficiency of the end repair can be further improved, so that the efficiency of constructing the sequencing library can be further improved.
- the terminal-repaired amplification product was subjected to addition of the base A at the 3' end to obtain an amplification product of the base A added at the 3' end.
- the end-repaired amplification product is subjected to 3, terminal addition of base A using Klenow (3 '-5' exo-).
- Klenow 3 '-5' exo-
- the obtained amplification product of the base A is added to the linker to obtain a ligation product; the resulting ligation product is subjected to PCR amplification to obtain a second amplification product.
- the amplification product having the cohesive terminus A is ligated to a linker using T4 DNA ligase.
- the obtained second amplification product is purified and recovered to obtain a recovered product, and the resulting recovered product constitutes the sequencing library.
- the method for purifying and recovering the second amplification product is not particularly limited, and according to a specific example of the present invention, at least a column selected from agarose gel electrophoresis, magnetic bead purification, and purification may be purified. A second amplification product is isolated and purified. Thereby, the efficiency of constructing the sequencing library can be further improved.
- the sequencing library can be efficiently constructed, thereby facilitating subsequent sequencing and further analysis.
- the invention proposes a method of determining sequence information of a CDR3 coding sequence.
- the method may comprise the following steps:
- the CDR3 coding sequence is derived from at least one selected from the group consisting of an immunoglobulin heavy chain, an immunoglobulin light chain, a T cell receptor a chain, and a T cell receptor ⁇ chain.
- the sequencing library of the CDR3 coding sequence is sequenced to determine the sequence information of the CDR3 coding sequence.
- the sequencing is performed using at least one selected from the group consisting of Hiseq2000, SOLiD, 454, and single molecule sequencing devices.
- the CDR3 coding sequence of the obtained immunoglobulin heavy chain, immunoglobulin light chain, T cell receptor a chain or T cell receptor ⁇ chain can be encoded with high throughput with high precision, thereby further improving the determination of CDR3.
- the efficiency of the method of encoding sequence information Method and system for determining individual immune status, kit
- the immune pool is the sum of the diverse immune cells at a certain time in an individual, which reflects the individual genetic factors, the history of antigen exposure, and the immune regulation of individual moments.
- the immune library can be used for disease-related research, and the mechanism of disease can be explored as an effective means to find biomarkers.
- the results of the immune library can promote early diagnosis, treatment and even prevention of more diseases.
- relevant studies have shown that IgH and TCR have a certain relationship with the occurrence of immune system diseases.
- the increase or decrease of a certain clone directly affects the occurrence and progress of the disease.
- studies have shown that the rearrangement of specific IgH clones is non-Hodgkin's lymph.
- Tumor biomarkers; specific B cell monoclonal increase is the first biomarker of chronic lymphocytic leukemia.
- another aspect of the present invention provides a method of determining an immune status of an individual.
- the method for determining an immune state of an individual of the present invention may comprise the steps of: first, sequencing the immunoglobulin heavy chain CDR3 coding sequence of the individual according to the method described above, in order to obtain Sequencing data constitutes a sequencing result; and then, based on the obtained sequencing result, the immune status of the individual is determined.
- the CDR3 coding sequence is derived from at least one selected from the group consisting of an immunoglobulin heavy chain, an immunoglobulin light chain, a T cell receptor alpha chain, and a T cell receptor ⁇ chain.
- the sequence information of the CDR3 coding sequence of the individual immunoglobulin heavy chain, immunoglobulin light chain, sputum cell receptor a chain or T cell receptor ⁇ chain can be effectively obtained, thereby effectively determining the individual immunity status.
- the term "immune state” as used herein shall be understood broadly to mean any CDR3 encoding that may be through an immunoglobulin heavy chain, an immunoglobulin light chain, a purine cell receptor alpha chain or a purine cell receptor beta chain.
- the sequence information of the sequence reflects the immune information.
- determining the immune status of the individual based on the obtained sequencing result may further comprise: comparing the obtained sequencing result with a control sequence to determine a subfamily of CDR3 included in the individual Type, and the relative proportion of each subfamily type.
- a control sequence to determine a subfamily of CDR3 included in the individual Type
- the relative proportion of each subfamily type may be effectively judged, thereby effectively determining the immune status of the individual.
- an individual may be monitored multiple times to determine the subfamily type of CDR3, and the relative proportion of each subfamily type as a function of time.
- samples are taken from the same individual at a plurality of different time points, and a plurality of sequencing results are obtained for the sample according to the method described above;
- the sequencing results are aligned to determine changes in the CDR3 subfamily type and relative proportions in the individual. Therefore, the CDR3 subfamily type and the relative proportion change in the individual can be effectively determined based on the alignment of the sequencing results of the samples at different time points, thereby more effectively determining the individual.
- the immune status can be sampled at different times to analyze, for example, changes in the individual's immune pool before and after the disease or a particular event, period, and to understand the individual's immune system changes for a particular event, at a particular time. For example, it is possible to know the detailed changes of the current sample from a single cloning level to find information related to the history of the disease.
- the system 1000 for determining an individual immune state includes: a CDR3 coding sequence enrichment device 100, a library construction device 200, a sequencing device 300, and an analysis device 400.
- the CDR3 coding sequence enrichment device 100 is provided with the primer composition described above to enrich the CDR3 coding sequence of the nucleic acid sample of the individual, wherein the CDR3 coding sequence is derived from an immunoglobulin heavy chain, an immunoglobulin At least one of a protein light chain, a T cell receptor a chain, and a T cell receptor ⁇ chain.
- Library construction device 200 is ligated to CDR3 coding sequence enrichment device 100 to construct a sequencing library of CDR3 coding sequences for the enriched CDR3 coding sequences.
- a sequencing library for an amplification product those skilled in the art can appropriately select according to different sequencing technologies.
- a manufacturer of a sequencing instrument such as Illumina.
- the procedures provided by the company are described, for example, in the Illumina Corporation Multiplexing Sample Preparation Guide (Part # 1005361; Feb 2010) or the Paired-End SamplePrep Guide (Part # 1005063; Feb 2010), which is incorporated herein by reference.
- the term "connected” as used herein shall be understood broadly and may be either directly connected or indirectly connected as long as the above functional connections are achieved.
- the sequencing device 300 is coupled to the library construction device 200 for sequencing the CDR3 coding sequence sequencing library to obtain sequencing results consisting of multiple sequencing data.
- the analysis device 400 is coupled to the sequencing device 300 for determining the immune status of the individual based on the sequencing results.
- the analyzing device 400 may further comprise a comparison unit, wherein the comparison unit stores a comparison sequence for comparing the sequencing result with the control sequence to determine a subfamily of the CDR3 included in the individual.
- related sequence information may be pre-stored in the analysis device 300, or the analysis device 300 may be connected to a remote database (not shown) for networking operation.
- a control sequence such as the known immune genome database IMGT
- the subfamily type distribution of CDR3 and the distribution of each subfamily type can be determined, thereby further improving the efficiency of determining the immune state of the individual.
- the present invention also provides a kit.
- the kit is provided with the primer composition described above.
- the kit can be effectively used for detecting the coding sequences of the CDR3 of the immunoglobulin heavy chain, the immunoglobulin light chain, the T cell receptor alpha chain and the sputum cell receptor ⁇ chain, thereby effectively determining the immune status of the individual. .
- the kit may comprise a primer composition according to the foregoing.
- the kit can be used to detect V-J rearrangements of immunoglobulin heavy chains, immunoglobulin light chains, purine cell receptor a chains, and T cell receptor beta chains.
- the kit of the present invention can be used for detecting the coding sequence of the CDR3 of the immunoglobulin heavy chain, the immunoglobulin light chain, the sputum cell receptor alpha chain and the sputum cell receptor ⁇ chain.
- the first primer set and the second primer set may be disposed in different containers, or may be disposed in the same container as a composition.
- the technical means used in the examples are conventional means well known to those skilled in the art, and can be referred to the third edition of the Guide to Molecular Cloning, or related products, and the reagents and products used are also available. Commercially obtained.
- the various processes and methods not described in detail are conventional methods in the field of public service.
- the source of the reagents used, the trade name, and the components necessary to list them are indicated on the first occurrence, and the same reagents used thereafter are not special. The descriptions are the same as the first ones.
- the method for enriching CDR3 coding sequences, constructing a sequencing library and sequencing of the present invention may generally comprise the following steps:
- PBMC normal human peripheral blood is drawn, specifically, blood is collected using a sterile blood collection tube containing an anticoagulant, and then PBMC is separated by density gradient centrifugation using a Ficoll-Paque PLUS or Percoll lymphocyte separation solution.
- genomic DNA and total RA are extracted, specifically, genomic DNA is extracted by proteinase K digestion or phenol chloroform extraction; total RA is extracted by Trizol method.
- S300 Amplification with primer composition
- multiplex PCR amplification was carried out using the primer composition of the present invention to obtain an amplification product enriched in the CDR3 coding sequence.
- the purified amplification product was recovered by agarose gel electrophoresis and MiniElute PCR Purification Kit (Qiagen). Among them, when the total R A is used as a template, it is necessary to reverse-transcribe R A into cDNA.
- the purified amplified product is subjected to terminal repair by dNTP as a substrate for the action of an enzyme such as T4 DNA polymerase, Klenow fragment and T4 polynucleotide kinase to obtain an end-repaired amplification product.
- an enzyme such as T4 DNA polymerase, Klenow fragment and T4 polynucleotide kinase to obtain an end-repaired amplification product.
- the Klenow fragment (3 '-5'exo-) polymerase and dATP were then used to add base A to the 3' end of the end-repaired amplification product to obtain an amplification product of base A added at the 3' end.
- the amplification product of the base A added at the 3' end was recovered by using the MiniElute PCR Purification Kit (Qiagen).
- the resulting amplification product of the 3, terminal addition base A was ligated to the linker by the action of T4 DNA ligase to obtain a ligated product. Then, the purified ligation product was recovered by agarose gel electrophoresis and MiniElute PCR Purification Kit (Qiagen).
- a universal PCR primer and a sequencing primer were added, and PCR amplification was carried out using Phusion enzyme to obtain a second amplification product. Then, the second amplification product is recovered by agarose gel electrophoresis to obtain a recovered product, and the recovered product constitutes a DNA sequencing library.
- the DNA sequencing library obtained above was quantified by Agilent 2100 detection and Q-PCR, and sequenced using a Hiseq2000 sequencing platform to obtain sequencing results composed of multiple sequencing data, and then immunoglobulin heavy chain and immunoglobulin on IMGT.
- the sequence of the CDR3 of the protein light chain, the T cell receptor a chain or the T cell receptor ⁇ chain was used as a reference sequence, and the sequencing results were analyzed.
- the CDR3 coding sequence of the immunoglobulin heavy chain is enriched, sequenced, sequenced and analyzed according to the following steps:
- the DNA extraction kit extracts PBMC DNA. ( Qiagen )
- the DNA obtained in the previous step was subjected to multiplex PCR amplification as a template DNA.
- the reaction system was LX500, LX800, ZXJ500, ZXJ800, and 4 parallel experiments.
- the PCR reaction conditions are:
- the PCR product was electrophoresed on a 2% agarose gel at a voltage of 4 V/cm for an electrophoresis time of 2 h.
- the gel was then cut, and a 140-280 bp fragment was taken and purified using QIAquick Gel Purification Kit (Qiagen), and finally the sample was dissolved in 30 ⁇ of elution buffer.
- the results are shown in Fig. 4. There are multiple amplification bands. Before the formal establishment of the library, the experiment was performed. The Sanger sequencing results showed that the 140-280 bp band is the target product, ie the heavy chain CDR3 coding sequence. -280 bp strip.
- the DNA obtained in the previous step was prepared in a 1.5 mL centrifuge tube according to the following table. Four end-repair reaction systems were prepared.
- the tube was then placed on a Thermomixer (Eppendorf) adjusted to 20 ° C for 30 min. After completion of the reaction, purification was carried out using QIAquick PCR Purification Kit (Qiagen), and finally the sample was dissolved in 32 ⁇ L of elution buffer.
- Thermomixer Eppendorf
- QIAquick PCR Purification Kit Qiagen
- the DNA obtained in the previous step was prepared in a 200 ⁇ PCR tube according to the following table. Four 3' end-added bases were used.
- the linker sequence is:
- Linker 1 5, GATCGGAAGAGCGGTTCAGCAGGAATGCCGAG (SEQ ID NO: 178);
- Linker 2 5'TACACTCTTTCCCTACACGACGCTCTTCCGATCT (SEQ ID NO: 179).
- the tube was then placed on a Thermomixer (Eppendorf) adjusted to 20 ° C for 15 min. After the reaction was completed, it was separated by electrophoresis on a 2% agarose gel at a voltage of 120 V and an electrophoresis time of 120 minutes. After the end of the electrophoresis, the band of interest was excised, purified using QIAquick PCR Purification Kit (Qiagen), and finally the sample was dissolved in 30 ⁇ l of elution buffer.
- the DNA obtained in the previous step was prepared according to the following system: 4 PCR reaction systems
- T SEQ ID NO: 181
- the base ⁇ is any combination of four bases of eight, T, C, G as a distinguishing identifier.
- the PCR reaction conditions are:
- the Agilent 2100 Bioanalyzer analysis system detects library insert size and content; Q-PCR accurately quantifies library concentration.
- sequence analysis was performed on a Hiseq2000 sequencer according to the read length of 151 bases at both ends.
- the basic analysis process of the raw data obtained by sequencing mainly includes the following steps: First, data processing is performed, and the library data of different samples are distinguished by the sequence tag on the linker or the PCR primer, and the raw data obtained by sequencing is decontaminated, de-joined and de-lowered. Mass filtration; then read and compare the reference sequence of the IMGT database, D, J gene, and analyze.
- the sequenced raw data after filtration was subjected to the results of the alignment analysis. See Table 1 below.
- the average total sequence obtained was 41.46 megabytes, and the average clones were 2.27 megabytes.
- IGHV4-34 2417 4052 17750 62345 26275 39808
- Figure 5 is a VJ gene pairing distribution map of four parallel libraries based on the data in Table 2, and Figure 5 shows all V gene subfamilies and J gene subfamily rearrangements. After the pairing and the abundance of CDR3 in various VJ pairs, the coordinates on the right side of the figure are the classification of all subfamilies of the J gene. In the figure, the abscissa is the classification of each subfamily of the V gene, and the intersection of the two coordinates is the pairing of the VJ gene. One CDR3 sequence, the left coordinate is the abundance of each VJ pair.
- the primers provided by the present invention can fully enrich the coding sequence of the human immunoglobulin heavy chain CDR3, and the amplified product can distinguish the subfamilies of the immunoglobulin heavy chain. Furthermore, the relative proportions of the various heavy chain CDR3 coding sequences can be determined by the method of the present invention described above, so that the immune status of the individual can be determined based on the relative proportion of the coding sequences or the change in the ratio.
- the DNA extraction kit extracts PBMC DNA. ( Qiagen )
- the DNA obtained in the previous step was subjected to multiplex PCR amplification as a template DNA.
- the PCR reaction system was prepared in a 200 ⁇ PCR tube according to the following table, which was ZXJ.
- the PCR reaction conditions are:
- Fig. 6 is a schematic diagram showing the CDR3 sequence consisting of the V-J rearrangement in the primer-amplified immunoglobulin light chain.
- the PCR product was electrophoresed on a 2% agarose gel at a voltage of 4 V/cm for an electrophoresis time of 2 h.
- the gel was then cut, and a 110-250 bp fragment was taken and purified using QIAquick Gel Purification Kit (Qiagen), and finally the sample was dissolved in 30 ⁇ of elution buffer.
- the results are shown in Fig. 7. There are multiple amplification bands. Before the formal establishment of the library, the experiment was performed. The Sanger sequencing results showed that the 110-250 bp band is the target product, ie the light chain CDR3 sequence. 250bp strip.
- the DNA obtained in the previous step was prepared in a 1.5 mL centrifuge tube according to the following table to prepare four end-repair reaction systems.
- Component volume ( ) ( )
- the tube was then placed on a Thermomixer (Eppendorf) adjusted to 20 ° C for 30 min. After completion of the reaction, purification was carried out using QIAquick PCR Purification Kit (Qiagen), and finally the sample was dissolved in 32 ⁇ L of elution buffer.
- Thermomixer Eppendorf
- QIAquick PCR Purification Kit Qiagen
- the DNA obtained in the previous step was separately prepared in a 200 ⁇ PCR tube as shown in the following table. 4 3' end-added bases ⁇ reaction system
- the tube was then placed on a Thermomixer (Eppendrof) adjusted to 37 ° C for 30 min. After the reaction, purification was carried out using a MiniElute PCR Purification Kit (Qiagen), and finally the sample was dissolved in ⁇ elution buffer.
- the DNA obtained in the previous step was prepared in a 1.5 mL centrifuge tube according to the following table. Four end-repair reaction systems were prepared.
- the linker sequence is:
- Linker 1 5, GATCGGAAGAGCGGTTCAGCAGGAATGCCGAG (SEQ ID NO: 178);
- Linker 2 5'TACACTCTTTCCCTACACGACGCTCTTCCGATCT (SEQ ID NO: 179).
- the tube was then placed on a Thermomixer (Eppendorf) adjusted to 20 ° C for 15 min. After the reaction was completed, it was separated by electrophoresis on a 2% agarose gel at a voltage of 120 V and an electrophoresis time of 120 minutes. After the end of the electrophoresis, the band of interest was excised, purified using QIAquick PCR Purification Kit (Qiagen), and finally the sample was dissolved in 30 ⁇ l of elution buffer.
- the DNA obtained in the previous step was prepared according to the following system: 4 PCR reaction systems
- T SEQ ID NO: 181
- the base ⁇ is any combination of four bases of eight, T, C, G as a distinguishing identifier.
- the PCR reaction conditions are:
- the Agilent 2100 Bioanalyzer analysis system detects library insert size and content; Q-PCR accurately quantifies library concentration.
- sequence analysis was performed on a Hiseq2000 sequencer according to the read length of 151 bases at both ends.
- the basic analysis process of the raw data obtained by sequencing mainly includes the following steps: First, data processing is performed, and the library data of different samples are distinguished by the sequence tag on the linker or the PCR primer, and the raw data obtained by sequencing is decontaminated, de-joined and de-lowered. Mass filtering; then the reads and the reference sequence of the IMGT database, J gene alignment, analysis.
- Table 4 is the number of sequences paired with the ⁇ -type IGL VJ
- Table 5 is the sequence of the paired ⁇ -type IGL V-J. Quantity.
- Figure 8 is a ZXJ V K -J K gene pairing distribution map based on the data in Table 4
- Figure 9 is a ⁇ ⁇ - ⁇ ⁇ gene pairing distribution map based on the data in Table 5
- 8-9 shows the rearrangement of all V gene subfamilies and J gene subfamilies and the abundance of various CDR3s.
- the ordinate is the classification of all subfamilies of the J gene
- the abscissa is each of the V genes.
- Family classification the intersection of two coordinates is a CDR3 sequence generated by pairing of VJ genes.
- each square of the intersection point represents the abundance of each VJ pair, and the right side of the figure corresponds to the abundance color indicator band. From Tables 4-5 and 8-9, it can be concluded that the primers provided by the present invention can fully enrich the human immunoglobulin light chain CDR3 sequence, and the amplified product can distinguish each sub-globulin of the immunoglobulin light chain. Family, and by the methods of the invention described above, the relative proportions of the various light chain CDR3 sequences can be determined such that the immune status of the individual can be determined based further on the relative proportions of these sequences or changes in the ratio. Example 3
- the CDR3 coding sequence of the T cell receptor ⁇ chain was enriched, sequenced, sequenced and analyzed according to the following procedure:
- the human peripheral blood mononuclear cell genomic DNA is extracted by using a DNA extraction kit (Qiagen), specifically, including:
- the DNA obtained in the above step is used as a template, a first primer set consisting of a V region primer having the nucleotide sequence shown by SEQ ID NOS: 46-84, and a nucleotide sequence having SEQ ID NO: 85-134
- a second primer set consisting of the J region primers was subjected to multiplex PCR amplification.
- V-region primers each having a concentration of ⁇ are mixed, and then diluted and mixed with water of ⁇ to obtain a first primer set, and each V-region primer in the first primer set is obtained.
- the concentration is 2 ⁇ .
- 50 kinds of J-region primers each having a concentration of ⁇ were mixed and mixed to obtain a second primer set, and the concentration of each J-region primer in the second primer set was 2 ⁇ , and then, according to the following table Ratio, prepare a reaction system for multiplex PCR amplification:
- the prepared reaction system is subjected to PCR amplification under the following reaction conditions:
- the amplified product was subjected to 2% agarose gel electrophoresis, and a 10-200 bp gel (as shown in Fig. 10) was cut out, and purified by QIAquick Gel Purification Kit (Qiagen), and then the recovered product was dissolved. In 34 ⁇ 1 of elution buffer, spare.
- the amplification product obtained in the previous step was prepared into a terminal repair reaction system according to the ratio in the following table in a 1.5 ml centrifuge tube:
- the tube was then placed on a Thermomixer (Eppendorf) adjusted to 20 ° C for 30 min to obtain a terminal-repaired amplification product, and then the end-repaired amplification product was subjected to QIAquick PCR Purification Kit (Qiagen). Purify and finally dissolve in 34 ⁇ l elution buffer and set aside.
- Thermomixer Eppendorf
- QIAquick PCR Purification Kit Qiagen
- the end-repaired amplification product was prepared by adding a base A to the 3' end in a 1.5 ml centrifuge tube as follows:
- the tube was placed on a Thermomixer (Eppendorf) adjusted to 37 ° C for 30 min to obtain an amplification product of 3, terminal A was added, and then the 3' end was amplified using a MiniElute PCR purification kit (Qiagen).
- the amplification product of base A was added for purification, and finally it was dissolved in 20 ⁇ l elution buffer and used.
- Linker 1 TACACTCTTTCCCTACACGACGCTCTTCCGATCT (SEQ ID NO: 179);
- Linker 2 5-Phos/GATCGGAAGAGCACACGTCTGAACTCCAGTC AC (SEQ ID NO: 182).
- the tube was placed on a Thermomixer (Eppendorf) adjusted to 20 ° C for 15 min to obtain a ligation product, and then the ligation product was purified using a MiniElute PCR Purification Kit (Qiagen), and finally dissolved in 28 ⁇ l. In the flush, spare.
- Thermomixer Eppendorf
- the ligation product was purified using a MiniElute PCR Purification Kit (Qiagen), and finally dissolved in 28 ⁇ l. In the flush, spare.
- the ligation product obtained in the previous step was prepared according to the ratio in the following table:
- the prepared reaction system is subjected to a PCR reaction on a PCR machine to obtain a second amplification product, wherein the PCR reaction conditions are:
- a second amplification product of 200-300 bp was purified by QIAquick PCR Purification Kit (Qiagen) to obtain a recovered product, which constitutes a DNA sequencing library. Then, the obtained sequencing library sample was dissolved in 20 ⁇ l of the elution buffer.
- the size and content of the insert in the obtained sequencing library were detected using an Agilent 2100 Bioanalyzer analysis system; the concentration of the library was accurately quantified by Q-PCR.
- sequence analysis were performed on a Hiseq2000 sequencer according to the read length of 151 bases at both ends. Sequence.
- a bp or 8 bp tag sequence was introduced into one side of the fragment by a linker or a PCR primer, so that different libraries were directly mixed and then sequenced on the machine.
- one end of the sample DNA was first sequenced using the SP1 primer, and the other end of the sample DNA was sequenced using the SP2 primer.
- the obtained sequencing result is subjected to data analysis, wherein the sequence referred to when analyzing the data is the reference sequence of TCRA on the IMGT.
- the basic analysis of the raw data obtained by sequencing is first performed, which mainly comprises the following steps: performing data processing on the raw data, and distinguishing the library data of different samples by the sequence tag on the linker or the PCR primer, and the raw data obtained by sequencing Decontamination, de-joining and low-quality filtration are performed to determine the reads; then, the readings of the reads and IMGT databases are compared with the V and J genes, and the results are shown in Table 6 below and Figure 11: Table 6: reads analysis Comparison result In addition, Figure 11 shows the rearrangement of all V gene subfamilies and J gene subfamilies after rearrangement and various VJ pairing distributions as well as the abundance of CDR3.
- the left coordinate is the classification of all subfamilies of the J gene
- the abscissa is each subfamily classification of the V gene
- the intersection of the two coordinates is a CDR3 sequence generated by the pairing of the VJ genes
- the right color band Indicates the shade of each square, representing the abundance of each VJ pair.
- the primer composition provided by the present invention is capable of fully enriching the sequence of the human T cell receptor CDR3
- the amplified product is capable of distinguishing various subfamilies of the T cell receptor alpha chain.
- Example 4 Following the above general procedure, the CDR3 coding sequence of the T cell receptor ⁇ chain was enriched, sequenced, sequenced and analyzed according to the following procedure:
- the human peripheral blood mononuclear cell genomic DNA is extracted by using a DNA extraction kit (Qiagen), specifically, including:
- total R A was also possible to use the total R A as a nucleic acid sample.
- total RNA was extracted using the Trizol method.
- the total R A obtained is then reverse transcribed into cDNA according to the following procedure:
- the C region primer (TRBC-R1) is:
- the obtained mixture was denatured at 65 ° C for 5 min on a PCR machine, immediately placed on ice, and the reagents in the following table were continuously added to the mixture:
- the DNA obtained in the above step is used as a template, and the first primer set consisting of the V region primer having the nucleotide sequence shown by SEQ ID NO: 135-164 and the nucleotide sequence shown by SEQ ID NO: 165-177
- a second primer set consisting of the J region primers was subjected to multiplex PCR amplification.
- V-region primers each having a concentration of ⁇ are mixed, and then diluted and mixed with 20 L of water to obtain a first primer set, and then each V-region primer in the first primer set is obtained.
- the concentration is 2 ⁇ .
- 13 kinds of J-region primers each having a concentration of ⁇ were also mixed, and then diluted with 37 L of water to obtain a second primer set, and the concentration of each J-region primer in the second primer set was obtained. Both are 2 ⁇ .
- the prepared reaction system is subjected to PCR amplification under the following reaction conditions:
- the amplified product was subjected to 2% agarose gel electrophoresis, wherein the voltage was 100 V, and the electrophoresis time was 2 hours and 20 minutes. Then, a 100-200 bp gel was removed and purified by QIAquick Gel Purification Kit (Qiagen), and the recovered product was dissolved in 30 ⁇ of elution buffer for use. Among them, the electrophoresis results are shown in Figure 12. As shown in Figure 12, where Mark is a 50bp DNA Ladder.
- the amplification product obtained in the previous step was prepared into a terminal repair reaction system according to the ratio in the following table in a 1.5 mL centrifuge tube:
- the tube was then placed on a Thermomixer (Eppendorf) adjusted to 20 ° C for 30 min to obtain a terminal-repaired amplification product, and then the end-repaired amplification product was subjected to QIAquick PCR Purification Kit (Qiagen). Purify and finally dissolve in 34 ⁇ L of elution buffer and set aside.
- Thermomixer Eppendorf
- QIAquick PCR Purification Kit Qiagen
- the end-repaired amplification product was prepared by adding a base A at the 3' end according to the ratio in a 1.5 mL centrifuge tube as follows:
- the tube was placed on a Thermomixer (Eppendorf) adjusted to 37 ° C for 30 min to obtain an amplification product of 3, terminal A was added, and then the 3' end was amplified using a MiniElute PCR purification kit (Qiagen).
- the amplification product of base A was added for purification, and finally it was dissolved in 2C L elution buffer and used.
- the amplification product of base A was added to the 3' end, and the reaction system of the linker was prepared according to the ratio in the following table in a 1.5 mL centrifuge tube:
- Linker 1 TACACTCTTTCCCTACACGACGCTCTTCCGATCT ( SEQ ID NO: 179 );
- Linker 2 5-Phos/GATCGGAAGAGCACACGTCTGAACTCCAGTC AC (SEQ ID NO: 182).
- the tube was then placed on a Thermomixer (Eppendorf) adjusted to 20 ° C for 15 min to obtain the ligation product, and then the ligation product was purified using MiniElute PCR Purification Kit (Qiagen), and finally dissolved in 32 ⁇ . In the flush, spare.
- Eppendorf Thermomixer
- MiniElute PCR Purification Kit Qiagen
- the ligation product obtained in the previous step was prepared according to the ratio in the following table:
- the prepared reaction system is subjected to a PCR reaction on a PCR machine to obtain a second amplification product, wherein the PCR reaction conditions are:
- the second amplification product is purified and recovered by QIAquick PCR Purification Kit (Qiagen) to obtain a recovered product, which constitutes a DNA sequencing library. Then, the obtained sequencing library sample was dissolved in 20 ⁇ l of elution buffer.
- sequence analysis were performed on a Hiseq2000 sequencer according to the read length of 151 bases at both ends. Sequence.
- a bp or 8 bp tag sequence was introduced into one side of the fragment by a linker or a PCR primer, so that different libraries were directly mixed and then sequenced on the machine.
- one end of the sample DNA was first sequenced using the SP1 primer, and the other end of the sample DNA was sequenced using the SP2 primer.
- the obtained sequencing result is subjected to data analysis, wherein the sequence referred to when analyzing the data is the reference sequence of TCRB on the IMGT.
- the basic analysis of the raw data obtained by sequencing is first performed, which mainly comprises the following steps: performing data processing on the raw data, and distinguishing the library data of different samples by the sequence tag on the linker or the PCR primer, and the raw data obtained by sequencing Decontamination, de-joining, and low-quality filtration are performed to determine the reads; then, the readings of the reads and IMGT databases are compared with the V, D, and J genes, and the results are shown in Table 7, Table 8, and Figure 13:
- Figure 13 shows the rearrangement of all V gene subfamilies and J gene subfamilies after rearrangement and various VJ pairing distributions as well as the abundance of CDR3.
- the left coordinate is the classification of all subfamilies of the J gene
- the abscissa is each subfamily classification of the V gene
- the intersection of the two coordinates is a CDR3 sequence generated by the pairing of the VJ genes
- the right color band Indicates the shade of each square, representing the abundance of each VJ pair.
- the primer composition provided by the present invention is capable of fully enriching the sequence of the human T cell receptor ⁇ chain CDR3
- the amplified product is capable of distinguishing various subfamilies of the ⁇ cell receptor ⁇ chain.
- a primer composition for amplifying a CDR3 coding sequence of the present invention a method for enriching a CDR3 coding sequence, a method for constructing a sequencing library of a CDR3 coding sequence, a method for determining sequence information of a CDR3 coding sequence, a method for determining an immune state of an individual,
- a system and kit for determining an individual's immune status can be effectively applied to enrich and sequence CDR3 coding sequences of immunoglobulin heavy and light chain, ⁇ cell receptor alpha and beta chains of an individual, and based on sequencing results , can accurately and efficiently determine the immune status of an individual.
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Analytical Chemistry (AREA)
- Wood Science & Technology (AREA)
- Immunology (AREA)
- Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Description
Claims
Applications Claiming Priority (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201110444740.0A CN103184216B (zh) | 2011-12-27 | 2011-12-27 | 用于扩增免疫球蛋白重链cdr3编码序列的引物组合物及其用途 |
CN201110444740.0 | 2011-12-27 | ||
CN201210011443.1 | 2012-01-13 | ||
CN201210011443.1A CN103205420B (zh) | 2012-01-13 | 2012-01-13 | 用于扩增T细胞受体β链CDR3编码序列的引物组合物及其用途 |
CN201210017173.5 | 2012-01-19 | ||
CN201210017173.5A CN103215255B (zh) | 2012-01-19 | 2012-01-19 | 用于扩增免疫球蛋白轻链cdr3序列的引物集及其用途 |
CN201210055230.9A CN103289994B (zh) | 2012-03-05 | 2012-03-05 | 用于扩增T细胞受体α链CDR3编码序列的引物组合物及其用途 |
CN201210055230.9 | 2012-03-05 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2013097744A1 true WO2013097744A1 (zh) | 2013-07-04 |
Family
ID=48696344
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2012/087668 WO2013097744A1 (zh) | 2011-12-27 | 2012-12-27 | 用于扩增cdr3编码序列的引物组合物及其用途 |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2013097744A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10202640B2 (en) * | 2014-05-07 | 2019-02-12 | The Board Of Trustees Of The Leland Stanford Junior University | Single cell analysis of T cells using high-throughput multiplex amplification and deep sequencing |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2010151416A1 (en) * | 2009-06-25 | 2010-12-29 | Fred Hutchinson Cancer Research Center | Method of measuring adaptive immunity |
-
2012
- 2012-12-27 WO PCT/CN2012/087668 patent/WO2013097744A1/zh active Application Filing
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2010151416A1 (en) * | 2009-06-25 | 2010-12-29 | Fred Hutchinson Cancer Research Center | Method of measuring adaptive immunity |
Non-Patent Citations (1)
Title |
---|
TEER J. K. ET AL.: "Systematic comparison of three genomic enrichment methods for massively parallel DNA sequencing", GENOME RESEARCH, vol. 20, 1 September 2010 (2010-09-01), pages 1 - 12, XP055074121 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10202640B2 (en) * | 2014-05-07 | 2019-02-12 | The Board Of Trustees Of The Leland Stanford Junior University | Single cell analysis of T cells using high-throughput multiplex amplification and deep sequencing |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2016242967B2 (en) | Method of identifying human compatible T cell receptors specific for an antigenic target | |
CN103205420B (zh) | 用于扩增T细胞受体β链CDR3编码序列的引物组合物及其用途 | |
EP2364368B1 (en) | Methods of monitoring conditions by sequence analysis | |
KR101709826B1 (ko) | 신규 pcr 서열화 방법 및 hla 제노타이핑에서의 상기 방법의 사용 | |
WO2019114146A1 (zh) | 基因目标区域富集方法及建库试剂盒 | |
WO2018149091A1 (zh) | 一种环状rna高通量测序文库的构建方法及其试剂盒 | |
CN105506746A (zh) | 构建可变区测序文库的方法及确定可变区核酸序列的方法 | |
CN103184216B (zh) | 用于扩增免疫球蛋白重链cdr3编码序列的引物组合物及其用途 | |
TW201321518A (zh) | 微量核酸樣本的庫製備方法及其應用 | |
WO2014116729A2 (en) | Haplotying of hla loci with ultra-deep shotgun sequencing | |
TR201807917T4 (tr) | Maternal numunelerde fetal nükleik asitlerin fraksiyonunu belirleme yöntemleri. | |
TWI467020B (zh) | 檢測dmd基因外顯子缺失和/或重複的方法 | |
WO2018095108A1 (zh) | 引物组合物、其用途、构建文库和确定核酸序列的方法 | |
CN103298955A (zh) | 用于构建血浆dna测序文库的方法和试剂盒 | |
WO2017202389A1 (zh) | 一种适用于超微量dna测序的接头及其应用 | |
EP3607065A1 (en) | Method and kit for constructing nucleic acid library | |
CN107699957A (zh) | 基于dna的融合基因定量测序建库、检测方法及其应用 | |
CN102839168A (zh) | 核酸探针及其制备方法和应用 | |
CN103571822B (zh) | 一种用于新一代测序分析的多重目的dna片段富集方法 | |
CN103215255B (zh) | 用于扩增免疫球蛋白轻链cdr3序列的引物集及其用途 | |
CN112384608A (zh) | 细菌捕获测序平台及其设计、构建和使用方法 | |
WO2012037879A1 (zh) | 核酸标签及其应用 | |
WO2013097744A1 (zh) | 用于扩增cdr3编码序列的引物组合物及其用途 | |
US20210115435A1 (en) | Error-proof nucleic acid library construction method | |
CN103289994B (zh) | 用于扩增T细胞受体α链CDR3编码序列的引物组合物及其用途 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 12863395 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 12863395 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC, FORM 1205A DATED 25-06-2015 |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 12863395 Country of ref document: EP Kind code of ref document: A1 |