WO2018133546A1 - CONSTRUCTION METHOD, DETECTION METHOD AND KIT FOR NON-INVASIVE PRENATAL FETAL α-THALASSEMIA GENE MUTATION DETECTION LIBRARY - Google Patents

CONSTRUCTION METHOD, DETECTION METHOD AND KIT FOR NON-INVASIVE PRENATAL FETAL α-THALASSEMIA GENE MUTATION DETECTION LIBRARY Download PDF

Info

Publication number
WO2018133546A1
WO2018133546A1 PCT/CN2017/113231 CN2017113231W WO2018133546A1 WO 2018133546 A1 WO2018133546 A1 WO 2018133546A1 CN 2017113231 W CN2017113231 W CN 2017113231W WO 2018133546 A1 WO2018133546 A1 WO 2018133546A1
Authority
WO
WIPO (PCT)
Prior art keywords
primer set
downstream
upstream
specific
library
Prior art date
Application number
PCT/CN2017/113231
Other languages
French (fr)
Chinese (zh)
Inventor
王晓锋
徐湘民
曾华萍
宋卓
Original Assignee
人和未来生物科技(长沙)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 人和未来生物科技(长沙)有限公司 filed Critical 人和未来生物科技(长沙)有限公司
Publication of WO2018133546A1 publication Critical patent/WO2018133546A1/en

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034Isolating an individual clone by screening libraries
    • C12N15/1093General methods of preparing gene libraries, not provided for in other subgroups
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6806Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing
    • CCHEMISTRY; METALLURGY
    • C40COMBINATORIAL TECHNOLOGY
    • C40BCOMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
    • C40B50/00Methods of creating libraries, e.g. combinatorial synthesis
    • C40B50/06Biochemical methods, e.g. using enzymes or whole viable microorganisms

Definitions

  • the invention relates to the field of gene detection technology, in particular to a method, a detection method and a kit for constructing a non-invasive prenatal fetal alpha thalassemia gene mutation detection library.
  • Thalassemia (referred to as thalassemia) is one of the high incidence genetic diseases in the world. It is due to the mutation or deletion of human genes, resulting in an imbalance of the synthesis rate of ⁇ , ⁇ -globin peptide chains, resulting in hemolytic anemia.
  • Two common types of thalassaemia are ⁇ -thalassemia and ⁇ -thalassemia, ⁇ -thalassemia-related genes are HBA2 and HBA1, and ⁇ -thalassaemia-related genes are HBB.
  • the thalassemia is concentrated in the tropical and subtropical regions, mostly in the Mediterranean countries, followed by the Middle East, India, Pakistan, Southeast Asia, South China and North Africa. The United States is an immigrant country with a high incidence.
  • ⁇ -thalassemia genotypes are commonly found as - ⁇ SEA , - ⁇ 3.7, - ⁇ 4.2, ⁇ CS , ⁇ QS and ⁇ WS , and 26 gene mutations are common in ⁇ -thalassemia. do not. At present, there is no cure for thalassaemia. It can only rely on traditional methods of treatment, that is, blood transfusion therapy is the main method, and the price is expensive. Therefore, prenatal screening and diagnosis are of great significance for preventing such birth defects.
  • Chiu RW achieves efficient detection of the father-derived codons 41/42 4bp deletion by using real-time fluorescent PCR technology, and researchers have developed detection methods for other ⁇ -thalassaemia mutations, but since the detection mutations are mostly point mutations, The specificity of the method needs to be improved. Studies on alpha-thalassemia mutations have also been reported. Warunee Tungwiwat et al. established a method for detecting ⁇ 0 deletions based on real-time PCR, but the sensitivity of this method needs to be improved. In addition, a variety of high-sensitivity techniques have also been tried for non-invasive depletion detection, such as mass spectrometry, digital PCR, COLD-PCR, and the like.
  • the detection methods based on single-site PCR technology have certain limitations, such as low plasma DNA content and high fragmentation characteristics, which may bring false negative PCR.
  • This method can only detect the presence of a specific paternal mutation different from the mother, and cannot further determine the presence or absence of the maternal mutation.
  • Whole-genome sequencing or target region capture sequencing of maternal plasma free DNA based on high-throughput sequencing technology can more accurately detect fetal thalassemia genes, but the cost of genome-wide sequencing and the complexity of analytical methods and the need for parental monomers Types and the like limit the clinical application; target region capture sequencing can only achieve the differentiation of the fetal ⁇ SEA deletion pure heterozygous type due to the low capture efficiency of the target region, and there is also the possibility of typing errors. In view of this, it is necessary to provide a high-precision non-invasive prenatal fetal thalassemia gene mutation detection method.
  • the invention provides a method, a detection method and a kit for constructing a non-invasive prenatal fetal alpha thalassemia gene mutation detection library, which can accurately and efficiently detect the alpha thalassemia gene mutation, and the result is consistent with the amniocentesis detection type, but Safety, non-invasiveness and high efficiency are significantly better than amniocentesis.
  • the present invention provides a method for constructing a non-invasive prenatal fetal alpha-type thalassemia gene mutation detection library for detecting non-invasive prenatal fetal alpha-type thalassemia gene mutation by high-throughput sequencing
  • the above methods include:
  • the upstream specific primer set 1 includes an upstream primer set 1 for amplifying the ⁇ -thalassemia gene and an upstream primer set 1 for calculating a plurality of SNP sites of a fetal free DNA ratio; the above-mentioned downstream specific primer set 1 comprising a downstream primer set 1 for amplifying a thalassemia gene and a downstream primer set 1 for calculating a plurality of SNP sites of a fetal free DNA ratio;
  • the upstream specific primer set 2 includes an upstream primer set 2 for amplifying the ⁇ -thalassemia gene and an upstream primer set 2 for calculating a plurality of SNP sites of a fetal free DNA ratio;
  • the above-mentioned downstream specific primer set 2 comprising a downstream primer set 2 for amplifying the alpha thalassemia gene and a downstream primer set 2 for calculating a plurality of SNP sites of fetal free DNA ratio;
  • the upstream specific primer set 2 is closer to the corresponding amplification target position than the upstream specific primer set 1 described above. Point, the downstream specific primer set 2 is closer to the corresponding amplification target site than the downstream specific primer set 1; the upstream specific primer set 2 and the 5' end of the downstream specific primer set 2 are contained for a linker sequence of a high throughput sequencing library;
  • the upstream specific amplification product 2 and the downstream specific amplification product 2 are mixed, and then amplified using universal primers at both ends to obtain a library which can be sequenced by the machine.
  • the 5' end of the upstream specific primer set 1 and the downstream specific primer set 1 described above are provided with a modification for isolating the amplification product, preferably with a biotin modification.
  • the 3' end of the primer for the specific target site in the upstream specific primer set 1 and the 5' end of the primer corresponding to the target site in the upstream specific primer set 2 are 10-15 bases Coincident; the 3' end of the primer for the specific target site in the downstream specific primer set 1 and 10-15 bases from the 5' end of the primer for the target site in the downstream specific primer set 2 described above Coincident.
  • upstream primer set 1 sequence for amplifying the ⁇ -thalassemia gene is as shown in SEQ ID NOs: 1-7;
  • the upstream primer set 1 sequence of the above plurality of SNP sites for calculating the ratio of fetal free DNA is as shown in SEQ ID NOS: 8-17;
  • the sequence of the downstream primer set 1 for amplifying the alpha thalassemia gene is shown in SEQ ID NOs: 18-23;
  • the above upstream primer set 2 sequence for amplifying the alpha thalassemia gene is shown in SEQ ID NOs: 34-40;
  • the upstream primer set 2 sequence of the above plurality of SNP sites for calculating the ratio of fetal free DNA is shown in SEQ ID NOs: 41-50;
  • the sequence of the downstream primer set 2 for amplifying the alpha thalassemia gene is shown in SEQ ID NOs: 51-56;
  • the sequence of the downstream primer set 2 of the above plurality of SNP sites for calculating the ratio of fetal free DNA is shown in SEQ ID NOs: 57-66.
  • the present invention provides a non-invasive prenatal fetal alpha thalassemia gene mutation detecting method comprising performing high-throughput sequencing of a library constructed by the method of the first aspect to obtain a sequencing read read; Based on the total depth of the unique specific tag sequence of the site and the depth of the allele of the mutation, the fetal DNA content was calculated and the thalassemia-related mutation was typed to analyze the alpha-thalassemia gene mutation.
  • the present invention provides a non-invasive prenatal fetal alpha thalassemia gene mutation detection library construction kit for performing non-invasive prenatal fetal alpha thalassemia gene mutation by high-throughput sequencing
  • the above kits include:
  • upstream specific primer set 1 and downstream specific primer set 1 for upstream pre-library and downstream pre-library, respectively Specific amplification is performed to obtain upstream specific amplification product 1 and downstream specific amplification product 1, respectively.
  • the upstream specific primer set 1 includes an upstream primer set 1 for amplifying the ⁇ -thalassemia gene and an upstream primer set 1 for calculating a plurality of SNP sites of a fetal free DNA ratio; the above-mentioned downstream specific primer set 1 comprising a downstream primer set 1 for amplifying a thalassemia gene and a downstream primer set 1 for calculating a plurality of SNP sites of a fetal free DNA ratio;
  • upstream specific primer set 2 and downstream specific primer set 2 respectively, for specifically amplifying the above upstream specific amplification product 1 and downstream specific amplification product 1 respectively to obtain upstream specific expansion Product 2 and downstream specific amplification product 2,
  • the upstream specific primer set 2 includes an upstream primer set 2 for amplifying the ⁇ -thalassemia gene and an upstream primer set 2 for calculating a plurality of SNP sites of a fetal free DNA ratio;
  • the above-mentioned downstream specific primer set 2 comprising a downstream primer set 2 for amplifying the alpha thalassemia gene and a downstream primer set 2 for calculating a plurality of SNP sites of fetal free DNA ratio;
  • the upstream specific primer set 2 is closer to the corresponding amplification target site than the upstream specific primer set 1, and the downstream specific primer set 2 is closer to the corresponding amplification than the downstream specific primer set 1 Target site; the upstream specific primer set 2 and the 5' end of the downstream specific primer set 2 described above contain a linker sequence for a high throughput sequencing library.
  • the 5' end of the upstream specific primer set 1 and the downstream specific primer set 1 described above are provided with a modification for isolating the amplification product, preferably with a biotin modification.
  • the 3' end of the primer for the specific target site in the upstream specific primer set 1 and the 5' end of the primer corresponding to the target site in the upstream specific primer set 2 are 10-15 bases Coincident; the 3' end of the primer for the specific target site in the downstream specific primer set 1 and 10-15 bases from the 5' end of the primer for the target site in the downstream specific primer set 2 described above Coincident.
  • upstream primer set 1 sequence for amplifying the ⁇ -thalassemia gene is as shown in SEQ ID NOs: 1-7;
  • the upstream primer set 1 sequence of the above plurality of SNP sites for calculating the ratio of fetal free DNA is as shown in SEQ ID NOS: 8-17;
  • the sequence of the downstream primer set 1 for amplifying the alpha thalassemia gene is shown in SEQ ID NOs: 18-23;
  • the above upstream primer set 2 sequence for amplifying the alpha thalassemia gene is shown in SEQ ID NOs: 34-40;
  • the upstream primer set 2 sequence of the above plurality of SNP sites for calculating the ratio of fetal free DNA is shown in SEQ ID NOs: 41-50;
  • the sequence of the downstream primer set 2 for amplifying the alpha thalassemia gene is shown in SEQ ID NOs: 51-56;
  • the sequence of the downstream primer set 2 of the above plurality of SNP sites for calculating the ratio of fetal free DNA is shown in SEQ ID NOs: 57-66.
  • kit further includes:
  • the method and kit of the invention can construct a library for detection of alpha thalassemia gene mutation by using maternal free plasma DNA as a raw material, and realize detection of alpha thalassemia gene mutation by high-throughput sequencing of the library (including - ⁇ SEA , - ⁇ 3.7 , - ⁇ 4.2 , ⁇ CS , ⁇ QS , ⁇ WS mutation detection).
  • the key is that in the library construction, a specific linker is ligated to the free DNA fragment of the maternal peripheral blood, and then the pre-amplification product of the linker ligation product is creatively divided into two parts, which are used independently for the target site (such as ⁇ -type thalassemia).
  • the upstream and downstream primers of the gene are subjected to two rounds of specific amplification, which can highly enrich the target site and significantly increase the specificity of primer amplification.
  • the upstream primer set and the downstream primer set for calculating a plurality of SNP sites of the fetal free DNA ratio are respectively used for amplification.
  • different primers are also used for the SNP site for two rounds of specificity.
  • Sexual amplification can efficiently and accurately calculate the fetal DNA content.
  • the fetal DNA content can be calculated based on the sequencing read length and the thalassemia-related mutations can be classified to effectively distinguish the mutations of the embryonic gene.
  • the inventors have designed highly efficient primers specifically for thalassemia genes, as well as primers for calculating multiple SNP sites of fetal DNA ratio.
  • Using the library construction method of the present invention it is preferred to accurately and efficiently detect alpha-thalassemia gene mutations (including - ⁇ SEA , - ⁇ 3.7 , - ⁇ 4.2 , ⁇ ) using the primer sequence set provided by the present invention.
  • CS , ⁇ QS , ⁇ WS mutation detection the results are consistent with the amniocentesis test classification, but the safety, non-invasive and high efficiency is significantly better than amniocentesis.
  • FIG. 1 is a method for constructing a non-invasive prenatal fetal alpha thalassemia gene mutation detection library and a mutation detecting method according to an embodiment of the present invention; Schematic diagram of the process;
  • FIG. 2 is a schematic diagram showing the relative positional relationship between the upstream and downstream specific primers, the universal primers and the template in the embodiment of the present invention
  • M represents Marker
  • A1-A6 represents 6 An exemplary sample.
  • a method for constructing a non-invasive prenatal fetal alpha thalassemia gene mutation detection library includes the steps of:
  • S1 A linker sequence with a specific tag sequence and an optional sample tag sequence is ligated to the parental peripheral blood free DNA.
  • the parental peripheral blood free DNA that is, the peripheral blood free DNA of pregnant women, including maternal blood free DNA and fetal-derived free DNA.
  • the linker sequence carries a specific tag sequence, which can be a random sequence of n (eg, 8 or 10) bases in length, such that each linker sequence is different from the other linker sequences, that is, the linker sequence is
  • n eg, 8 or 10
  • the free DNA fragments of each parental blood bank have a specific linker sequence, which facilitates the differentiation of different maternal peripheral blood free DNA fragments in the sequence information analysis after subsequent sequencing.
  • sample pooling of different sources eg, peripheral blood free DNA from different maternal sources
  • linker sequence can also With a sample tag sequence (index), it is used to distinguish samples from different sources.
  • S2 Pre-library amplification of the ligation product of the previous step using a pre-library amplification primer sequence, wherein the pre-library amplification primer sequence is complementary to the linker sequence.
  • Pre-library amplification is performed to increase the amount of linker ligation product, as some of the molecules in the linker ligation product may have a lower copy number and are specifically amplified in subsequent partitions. At these times, these molecules with low copy number may not be efficiently amplified, and pre-amplification increases the amount of linker ligation products, so that molecules of various copy numbers can be efficiently amplified.
  • the pre-library obtained in the previous step is divided into an upstream pre-library and a downstream pre-library, and the upstream pre-library and the downstream pre-library are specifically amplified by using the upstream specific primer set 1 and the downstream specific primer set 1, respectively, respectively.
  • the pre-library is divided into an upstream pre-library and a downstream pre-library for two rounds of specific amplification, which can significantly increase the specificity of primer amplification.
  • the beneficial effects of separate independent amplification on the upstream and downstream include: on the one hand, upstream specific primers and downstream specific primers need to be closer to the amplification target site due to the second generation high-throughput sequencing read shorts (or Target region), it is difficult to effectively amplify the target fragment without separation, and independent amplification of the upstream and downstream can effectively amplify the target fragment; on the other hand, independent amplification of the upstream and downstream can retain the linker sequence at one end of the sequence. Easy for subsequent high-throughput measurements The progress of the order.
  • the amplification target site includes two types of thalassemia gene and a plurality of SNP sites for calculating fetal DNA ratio, and correspondingly, upstream specific primer set 1 and downstream specific
  • the primer set 1 also includes two primers respectively, that is, the upstream specific primer set 1 includes: an upstream primer set 1 for amplifying the ⁇ -thalassemia gene and a plurality of SNP sites for calculating a fetal free DNA ratio.
  • the upstream primer set 1; the downstream specific primer set 1 includes a downstream primer set 1 for amplifying the ⁇ -thalassemia gene and a downstream primer set 1 for calculating a plurality of SNP sites of the fetal free DNA ratio.
  • USP1 upstream primer set 1
  • DSP1 downstream primer set 1
  • Relative positional relationship It can be seen that USP1 and DSP1 are located at the left and right sides of the amplification target site, or the detection point (ie, the ⁇ -type thalassemia gene mutation). Generally, the 3' end of USP1 and DSP1 are closer to the detection point.
  • the second-generation high-throughput sequencing reads shorter, such as about 100bp, if the USP1 and DSP1 3' end distance detection point In the far distance, it may be that the second generation high-throughput sequencing cannot effectively detect the detection points.
  • the relative positional relationship of the upstream and downstream specific primers to the template binding of the plurality of SNP sites for calculating the fetal free DNA ratio is the same as that of the schematic diagram of FIG.
  • the 5' end of the upstream specific primer set 1 and the downstream specific primer set 1 may be provided with a modification for isolating the amplification product, preferably with Biotin modification, biotin can bind to avidin, so upstream specific amplification product 1 and downstream specific amplification product 1 can be separated from the reaction system by biotin-avidin affinity binding.
  • the specific primers used for the second round of specific primer amplification also include upstream specific primers and downstream specific primers.
  • the upstream specific primer set 2 used for the second round of specific primer amplification comprises: an upstream primer set 2 for amplifying the alpha thalassemia gene 2 And an upstream primer set 2 for calculating a plurality of SNP sites of the fetal free DNA ratio;
  • the downstream specific primer set 2 includes: a downstream primer set 2 for amplifying the ⁇ -thalassemia gene and for calculating a fetal free DNA ratio The downstream primer set 2 of multiple SNP sites.
  • the upstream primer set 2 (USP2) for amplifying the ⁇ -thalassemia gene and the downstream primer set 2 (DSP2) for amplifying the ⁇ -thalassemia gene in combination with the template in the embodiment of the present invention.
  • USP2 is closer to the alpha-type thalassemia gene mutation site than USP1
  • DSP2 is closer to the alpha-type thalassemia gene mutation site than DSP1.
  • the relative positional relationship between the upstream and downstream primer set 2 and the template binding of the plurality of SNP sites for calculating the fetal free DNA ratio is the same as that of the schematic diagram of FIG.
  • the upstream specific primer set 2 is closer to the corresponding amplification target site than the upstream specific primer set 1
  • the downstream specific primer set 2 is closer to the corresponding amplification target site than the downstream specific primer set 1.
  • the primer for a specific target site in the upstream specific primer set 1 for example, USP1 for the alpha-type thalassemia gene
  • the upstream specific primer set 2 for the target site Primers (eg, USP2 for the alpha-type thalassemia gene) have a 10-15 base overlap at the 5' end; similarly, primers for a specific target site in the downstream specific primer set 1 (eg, for alpha-type thalassemia)
  • the 3' end of the DSP1) of the gene has a 10-15 base overlap with the 5' end of the downstream specific primer set 2 for the target site (for example, DSP2 for the alpha-type thalassemia gene).
  • 3' end and “5' end” refer to a sequence near the 3' end and the 5' end, respectively. This coincidence is mainly considering that USP1 and DSP1 are close to the original detection point (for example, ⁇ -type thalassemia gene mutation site), and the superposition of bases can fully utilize the binding space while further improving the amplification specificity.
  • upstream specific primer set 2 and downstream specific primer set 2 may contain a linker sequence for a high throughput sequencing library. Specifically, which high-throughput sequencing platform is used, the corresponding linker sequence of the platform is used.
  • the universal primer is capable of binding to the linker sequence and is amplified by universal primers to obtain a library for sequencing on the machine. Sequencing on the machine is achieved by high-throughput sequencing.
  • the high throughput sequencing technology can be selected from the group consisting of Illumina/Genome Analyzer/Hiseq/Miseq/NEXTseq CN500, Applied Biosystems SOLID, and life Technologies/Ion Torrent. Specifically, which high-throughput sequencing technology is used, universal primers also use primers corresponding to the high-throughput sequencing technology.
  • the upstream primer set 1 sequence for amplifying the alpha-type thalassemia gene is set forth in SEQ ID NOs: 1-7; upstream of a plurality of SNP sites for calculating fetal free DNA ratio Primer set 1 sequences are set forth in SEQ ID NOS: 8-17; downstream primer set 1 sequences for amplifying alpha-type thalassemia gene are set forth in SEQ ID NOS: 18-23; used to calculate the ratio of fetal free DNA
  • the sequence of the downstream primer set 1 of one SNP site is shown in SEQ ID NOs: 24-33; the sequence of the upstream primer set 2 used to amplify the alpha thalassemia gene is shown in SEQ ID NOs: 34-40;
  • the upstream primer set 2 sequence of the plurality of SNP sites of the fetal free DNA ratio is shown in SEQ ID NOs: 41-50;
  • the downstream primer set 2 sequence for amplifying the alpha thalassemia gene is SEQ ID NO: 51-56
  • the embodiment of the invention further provides a non-invasive prenatal fetal alpha thalassemia gene mutation detecting method, comprising performing high-throughput sequencing of the library constructed in the embodiment of the invention to obtain a sequencing read read; then, according to the unique position of the site Specific tag sequence The total depth and depth of the allele of the mutation, the fetal DNA content was calculated and the thalassemia-related mutations were typed to analyze the alpha-thalassemia gene mutation.
  • the embodiment of the invention further provides a non-invasive prenatal fetal alpha thalassemia gene mutation detection library construction kit, wherein the above library is used for detecting non-invasive prenatal fetal alpha thalassemia gene mutation by high-throughput sequencing, the above kit include:
  • upstream specific primer set 2 comprises an upstream primer set 2 for amplifying the alpha-thalassemia gene and an upstream of a plurality of SNP sites for calculating the fetal free DNA ratio Primer set 2; downstream specific primer set 2 includes a downstream primer set 2 for amplifying the alpha-thalassemia gene and a downstream primer set 2 for calculating a plurality of SNP sites of the fetal free DNA ratio.
  • upstream specific primer set 2 is closer to the corresponding amplification target site than the upstream specific primer set 1, and the downstream specific primer set 2 is closer to the corresponding amplification target site than the downstream specific primer set 1;
  • the 5' end of upstream specific primer set 2 and downstream specific primer set 2 contains a linker sequence for a high throughput sequencing library.
  • the upstream specific primer set 1 and the downstream specific primer set 1 have a modification for the isolation of the amplified product, preferably with a biotin modification, to separate upstream specific expansion from the system.
  • Product 1 and downstream specific amplification product 1 are added.
  • the 3' end of the primer for the specific target site in the upstream specific primer set 1 and the 10' end of the primer for the target site in the upstream specific primer set 2 are 10-15 Coincident of bases; the 3' end of the primer for the specific target site in the downstream specific primer set 1 and the 10' end of the primer of the downstream specific primer set 2 for the target site are 10-15 bases Coincident.
  • the upstream primer set 1 sequence for amplifying the alpha-type thalassemia gene is set forth in SEQ ID NOs: 1-7; upstream of a plurality of SNP sites for calculating fetal free DNA ratio Primer set 1 sequences are set forth in SEQ ID NOS: 8-17; downstream primer set 1 sequences for amplifying alpha-type thalassemia gene are set forth in SEQ ID NOS: 18-23; used to calculate the ratio of fetal free DNA
  • the sequence of the downstream primer set 1 of one SNP site is shown in SEQ ID NOs: 24-33; the sequence of the upstream primer set 2 used to amplify the alpha thalassemia gene is shown in SEQ ID NOs: 34-40;
  • the upstream primer set 2 sequence of multiple SNP sites of fetal free DNA ratio is shown in SEQ ID NOs: 41-50; used to amplify alpha-type thalassemia
  • the sequence of the downstream primer set 2 is shown in SEQ ID NOS: 51-56; the
  • the kit further comprises:
  • R3 linker sequence carrying a specific tag sequence and an optional sample tag sequence for ligation of maternal peripheral blood free DNA to obtain a ligation product
  • R4 a pre-library amplification primer sequence which is complementary to the above-described linker sequence for pre-library amplification of the above-described ligation product
  • R5 a universal primer for amplifying the mixed product after mixing the upstream specific amplification product 2 and the downstream specific amplification product 2 described above to obtain a library which can be sequenced on the machine.
  • the linker sequence is set forth in SEQ ID NOs: 67-68; the pre-library amplification primer sequences are set forth in SEQ ID NOs: 69-70; and the universal primers are set forth in SEQ ID NO: 71-72. Show.
  • the upstream primer and the downstream primer are located on the left and right sides of the detection point, respectively.
  • the 3' end of the ipsilateral specific primer 1 and the 5' end of the specific primer 2 have a 10-15 base overlap.
  • Specific primer 1 has a biotin modification at the 5' end.
  • the 5' end of specific primer 2 contains a linker sequence for a high throughput sequencing library. Specifically, the primer sequences are shown in Table 1.
  • ADT-R pGATCGGAAGAGC (SEQ ID NO: 68).
  • NNNNNNNN is a specific tag sequence
  • ATTAAGG is a sample tag sequence (index)
  • the above linker sequence needs to be annealed into a double strand.
  • pre-lib-primer-F CAAGCAGAAGACGGCATACGA (SEQ ID NO: 69);
  • pre-lib-primer-R GCTCTTCCGATCT (SEQ ID NO: 70).
  • Seq-lib-primer-F CAAGCAGAAGACGGCATACGA (SEQ ID NO: 71);
  • reaction 1 Free DNA end repair, and the reaction system (Reaction 1) was set as shown in Table 2.
  • reaction 2 The DNA fragment is ligated to the linker sequence carrying the specific tag sequence and the sample tag sequence, and the reaction system (Reaction 2) is shown in Table 3.
  • thermocycler 20 ° C, 15 min; 65 ° C, 10 min; 4 ° C preservation.
  • the final reaction product was purified by using 50 ⁇ L of XP beads, and quantification by Qubit, and 5 ⁇ L of the reaction product was detected by 2% gel electrophoresis. The results are shown in Fig. 3a, indicating that the pre-library was successfully constructed. Two aliquots of the pre-library were taken and named as the upstream pre-library and the downstream pre-library, respectively, and each 100 ng was subjected to the next amplification.
  • Each sample pre-library is an upstream pre-library and a downstream pre-library, respectively using the corresponding upstream and downstream specific primer 1 and specific primer 2, respectively, until the amplification of the upstream and downstream primers of each sample is combined when using universal primer amplification. And then use universal primers for amplification.
  • the PCR reaction procedure was as follows: 95 ° C, 10 min, 1 cycle; (95 ° C, 30 s, 62 ° C, 30 s, 72 ° C, 1 min) 20 cycles; 72 ° C, 7 min, 1 cycle; 4 ° C storage.
  • reaction product 5 ⁇ L was subjected to 2% agarose gel electrophoresis, and the remaining product was recovered by 1.2X XP, eluted with 24 ⁇ L of ultrapure water or an eluent, and the eluate was subjected to further amplification.
  • the PCR reaction procedure was as follows: 95 ° C, 10 min, 1 cycle; (95 ° C, 30 s, 62 ° C, 30 s, 72 ° C, 1 min) 15 cycles; 72 ° C, 7 min, 1 cycle; 4 ° C preservation.
  • the PCR reaction procedure was as follows: 98 ° C, 45 s, 1 cycle; (98 ° C, 15 s, 60 ° C, 30 s, 72 ° C, 30 s) 10 cycles; 72 ° C, 1 min, 1 cycle; 4 ° C preservation.
  • Illumina NEXTseq CN500 was subjected to 75PE sequencing.
  • the molecular tag set sequences were re-aligned to the reference genome (BWA MEM), the sequences surrounding Indel were re-aligned using GATK software, SNP and small indels were detected using samtools software, and alpha1 ( ⁇ 1) and alpha2 were performed using CNVPanelizer R package. ( ⁇ 2) deletion detection. Calculate the embryonic DNA content based on the total depth of the unique molecular tag set sequence of the locus and the depth of the associated SNP locus allele, and genotype the fetal thalassemia-related point mutation, INDEL, for alpha1 And alpha2 type deletion, mixed Gaussian model for fetal genotyping.
  • non-invasive prenatal thalassemia gene detection was performed on 8 pregnant women with thalassemia carriers, and the results are shown in Table 8 below.
  • results show that the non-invasive prenatal detection and the amniocentesis detection of the ⁇ -thalassemia gene mutation are consistent in the embodiment of the present invention, indicating that the method of the present invention can accurately and efficiently detect the alpha-thalassemia gene mutation, and the result and the amniocentesis test score.
  • the type is consistent, but it is significantly better than amniocentesis in terms of safety, non-invasiveness and high efficiency.

Abstract

Disclosed are a construction method, a detection method and a kit for a non-invasive prenatal fetal α-thalassemia gene mutation detection library. In the construction method for the library, a specific linker is ligated to a free DNA fragment in the maternal peripheral blood, and the pre-amplification product of the linker ligation product is then divided into two parts, and two rounds of specific amplification are respectively carried out independently using upstream and downstream primers for a target site, and the target site can be enriched with a high specificity, and the amplification specificity of the primers can be significantly improved. In addition, two rounds of specific amplification are respectively carried out using upstream and downstream primer sets for multiple SNP sites for calculating the proportion of free fetal DNA, and the fetal DNA proportion can be efficiently and accurately calculated. Sequencing the library can accurately and efficiently detect the α-thalassemia gene mutation, and the results thereof are consistent with those of the amniocentesis detection genotyping; however, in terms of safety, non-invasiveness and high-efficiency, the method is significantly better than the amniocentesis detection.

Description

无创产前胎儿α型地贫基因突变检测文库构建方法、检测方法和试剂盒Non-invasive prenatal fetal alpha thalassemia gene mutation detection library construction method, detection method and kit 技术领域Technical field
本发明涉及基因检测技术领域,尤其涉及一种无创产前胎儿α型地贫基因突变检测文库构建方法、检测方法和试剂盒。The invention relates to the field of gene detection technology, in particular to a method, a detection method and a kit for constructing a non-invasive prenatal fetal alpha thalassemia gene mutation detection library.
背景技术Background technique
地中海贫血(简称地贫)是全世界的高发病率遗传病之一,是由于人体基因突变或者缺失而导致α,β-珠蛋白肽链合成速率的不平衡,从而引起的溶血性贫血。地贫常见的两种类型是α-地贫和β-地贫,α-地贫相关基因为HBA2和HBA1,β-地贫相关基因为HBB。地贫集中在热带和亚热带地区,多发生于地中海沿岸国家,其次为中东、印度、巴基斯坦、东南亚、中国南方和北非,美国是移民国家,其发生率也较高。我国长江以南各省是地贫高发区,广东、广西、海南、台湾和香港等地受累人口超过2亿。中国人群中,α-地贫的基因型别常见的有-αSEA、-α3.7、-α4.2、αCS、αQS和αWS,β-地贫常见的有26种基因突变型别。目前地贫尚无根治的方法,只能依靠传统方法治疗,即以输血治疗为主,价格昂贵,因此产前筛查和诊断对预防该种出生缺陷疾病具有重大意义。Thalassemia (referred to as thalassemia) is one of the high incidence genetic diseases in the world. It is due to the mutation or deletion of human genes, resulting in an imbalance of the synthesis rate of α,β-globin peptide chains, resulting in hemolytic anemia. Two common types of thalassaemia are α-thalassemia and β-thalassemia, α-thalassemia-related genes are HBA2 and HBA1, and β-thalassaemia-related genes are HBB. The thalassemia is concentrated in the tropical and subtropical regions, mostly in the Mediterranean countries, followed by the Middle East, India, Pakistan, Southeast Asia, South China and North Africa. The United States is an immigrant country with a high incidence. The provinces south of the Yangtze River are high-risk areas for thalassaemia, with more than 200 million people affected by Guangdong, Guangxi, Hainan, Taiwan and Hong Kong. Among the Chinese population, α-thalassemia genotypes are commonly found as -α SEA , -α3.7, -α4.2, α CS , α QS and α WS , and 26 gene mutations are common in β-thalassemia. do not. At present, there is no cure for thalassaemia. It can only rely on traditional methods of treatment, that is, blood transfusion therapy is the main method, and the price is expensive. Therefore, prenatal screening and diagnosis are of great significance for preventing such birth defects.
1997年,卢煜明发现孕妇外周血浆中存在来自于胎儿的游离DNA,这一发现为通过孕妇外周血无创产前诊断和检测胚胎基因型提供了新的可能,随着高通量测序技术的应用,无创产前检测胚胎染色体异常(如21、18、13染色体非整倍体变异)以及部分大的拷贝数变异已经成功应用于临床产前筛查和诊断,但是孕妇体内胎儿游离DNA仅占孕妇血浆中总游离DNA的3%-6%,对于胎儿无创式的遗传性疾病的检测带来挑战。In 1997, Lu Yuming discovered that there is free DNA from the fetus in peripheral blood of pregnant women. This finding provides a new possibility for non-invasive prenatal diagnosis and detection of embryonic genotypes in pregnant women's peripheral blood. With the application of high-throughput sequencing technology, Non-invasive prenatal detection of embryonic chromosomal abnormalities (such as 21, 18, and 13 chromosome aneuploidy variants) and partial large copy number variation have been successfully applied to clinical prenatal screening and diagnosis, but fetal free DNA in pregnant women only accounts for maternal plasma. 3%-6% of the total free DNA poses a challenge for the detection of fetal non-invasive hereditary diseases.
不断有研究人员尝试通过孕妇外周血游离DNA对胎儿地贫基因做出检测,由于孕妇外周血中存在大量母源游离DNA,所以如何有效区分胚胎基因的突变,对检测方法的灵敏度提出了很高的要求。目前临床报道产前检测地贫基因突变的方法有几种:传统的羊膜穿刺、腹绒毛活检等,由于是有创伤性的操作,可能伴有胎儿损伤、流产或宫内感染等风险。利用母体血浆中游离胎儿DNA的无创产前诊断也有些研究。Chiu RW通过使用实时荧光PCR技术实现了对父源codons 41/42 4bp缺失的有效检测,还有科研工作者发展了针对其他β地贫点突变的检测方法,但由于检测突变多为点突变,方法的特异性有待提高。对于α地贫突变的研究也有报道,Warunee Tungwiwat等人,建立了基于荧光定量PCR检测α0缺失的方法,然而该方法的灵敏度有待提高。此外,多种高灵敏度的技术方法也被尝试用于无创地贫检测,如质谱、数字PCR、COLD-PCR等。但这些基于单一位点的PCR技术的检测方法存在一定的局限性,如血浆DNA低含量及高度片段化的特点,很可能会带来PCR的假阴性。这种方法 只能够检测与母亲不同的特异性父源突变是否存在,而无法进一步确定母源突变的存在与否。基于高通量测序技术的孕妇血浆游离DNA的全基因组测序或者目标区域捕获测序可以更加准确的实现对胎儿地贫基因的检测,但全基因组测序的成本以及分析方法的复杂性和需要父母单体型等限制了在临床上的应用;目标区域捕获测序由于其目标区域捕获效率低,仅能实现对胎儿αSEA缺失纯杂合型的区分,并且还存在分型错误的可能。鉴于此,需要提供一种高精确度的无创产前胎儿地贫基因突变检测方法。Researchers have been trying to detect fetal thalassemia genes through the free DNA of pregnant women's peripheral blood. Because there are a large number of maternal free DNA in the peripheral blood of pregnant women, how to effectively distinguish the mutations of embryonic genes, the sensitivity of the detection method is very high. Requirements. At present, there are several methods for prenatal detection of thalassemia gene mutations: traditional amniocentesis, abdominal villus biopsy, etc., due to traumatic operation, may be associated with fetal injury, miscarriage or intrauterine infection. Non-invasive prenatal diagnosis using free fetal DNA in maternal plasma has also been studied. Chiu RW achieves efficient detection of the father-derived codons 41/42 4bp deletion by using real-time fluorescent PCR technology, and researchers have developed detection methods for other β-thalassaemia mutations, but since the detection mutations are mostly point mutations, The specificity of the method needs to be improved. Studies on alpha-thalassemia mutations have also been reported. Warunee Tungwiwat et al. established a method for detecting α 0 deletions based on real-time PCR, but the sensitivity of this method needs to be improved. In addition, a variety of high-sensitivity techniques have also been tried for non-invasive depletion detection, such as mass spectrometry, digital PCR, COLD-PCR, and the like. However, the detection methods based on single-site PCR technology have certain limitations, such as low plasma DNA content and high fragmentation characteristics, which may bring false negative PCR. This method can only detect the presence of a specific paternal mutation different from the mother, and cannot further determine the presence or absence of the maternal mutation. Whole-genome sequencing or target region capture sequencing of maternal plasma free DNA based on high-throughput sequencing technology can more accurately detect fetal thalassemia genes, but the cost of genome-wide sequencing and the complexity of analytical methods and the need for parental monomers Types and the like limit the clinical application; target region capture sequencing can only achieve the differentiation of the fetal α SEA deletion pure heterozygous type due to the low capture efficiency of the target region, and there is also the possibility of typing errors. In view of this, it is necessary to provide a high-precision non-invasive prenatal fetal thalassemia gene mutation detection method.
发明内容Summary of the invention
本发明提供一种无创产前胎儿α型地贫基因突变检测文库构建方法、检测方法和试剂盒,能够准确、高效地检测α型地贫基因突变,其结果与羊水穿刺检测分型一致,但安全性、无创性和高效性方面明显优于羊水穿刺检测。The invention provides a method, a detection method and a kit for constructing a non-invasive prenatal fetal alpha thalassemia gene mutation detection library, which can accurately and efficiently detect the alpha thalassemia gene mutation, and the result is consistent with the amniocentesis detection type, but Safety, non-invasiveness and high efficiency are significantly better than amniocentesis.
根据本发明的第一方面,本发明提供一种无创产前胎儿α型地贫基因突变检测文库构建方法,上述文库用于通过高通量测序进行无创产前胎儿α型地贫基因突变的检测,上述方法包括:According to a first aspect of the present invention, the present invention provides a method for constructing a non-invasive prenatal fetal alpha-type thalassemia gene mutation detection library for detecting non-invasive prenatal fetal alpha-type thalassemia gene mutation by high-throughput sequencing The above methods include:
(a)对母体外周血游离DNA连接带有特异标签序列和任选的样本标签序列的接头序列;(a) a linker sequence having a specific tag sequence and an optional sample tag sequence for maternal peripheral blood free DNA ligation;
(b)使用预文库扩增引物序列对上一步的连接产物进行预文库扩增,其中上述预文库扩增引物序列与上述接头序列互补结合;(b) pre-librarian amplification of the ligation product of the previous step using a pre-library amplification primer sequence, wherein the pre-library amplification primer sequence is complementary to the linker sequence;
(c)将上一步得到的预文库分为上游预文库和下游预文库,分别使用上游特异性引物组1和下游特异性引物组1对上述上游预文库和下游预文库进行特异性扩增,分别得到上游特异性扩增产物1和下游特异性扩增产物1,(c) dividing the pre-library obtained in the previous step into an upstream pre-library and a downstream pre-library, and specifically amplifying the upstream pre-library and the downstream pre-library using the upstream specific primer set 1 and the downstream specific primer set 1, respectively. The upstream specific amplification product 1 and the downstream specific amplification product 1 are respectively obtained.
其中,上述上游特异性引物组1包括用于扩增α型地贫基因的上游引物组1以及用于计算胎儿游离DNA比例的多个SNP位点的上游引物组1;上述下游特异性引物组1包括用于扩增α型地贫基因的下游引物组1以及用于计算胎儿游离DNA比例的多个SNP位点的下游引物组1;Wherein, the upstream specific primer set 1 includes an upstream primer set 1 for amplifying the α-thalassemia gene and an upstream primer set 1 for calculating a plurality of SNP sites of a fetal free DNA ratio; the above-mentioned downstream specific primer set 1 comprising a downstream primer set 1 for amplifying a thalassemia gene and a downstream primer set 1 for calculating a plurality of SNP sites of a fetal free DNA ratio;
(d)对上述上游特异性扩增产物1和下游特异性扩增产物1,分别使用上游特异性引物组2和下游特异性引物组2进行特异性扩增,分别得到上游特异性扩增产物2和下游特异性扩增产物2,(d) specifically amplifying the upstream specific amplification product 1 and the downstream specific amplification product 1 using the upstream specific primer set 2 and the downstream specific primer set 2, respectively, to obtain upstream specific amplification products, respectively. 2 and downstream specific amplification product 2,
其中,上述上游特异性引物组2包括用于扩增α型地贫基因的上游引物组2以及用于计算胎儿游离DNA比例的多个SNP位点的上游引物组2;上述下游特异性引物组2包括用于扩增α型地贫基因的下游引物组2以及用于计算胎儿游离DNA比例的多个SNP位点的下游引物组2;Wherein the upstream specific primer set 2 includes an upstream primer set 2 for amplifying the α-thalassemia gene and an upstream primer set 2 for calculating a plurality of SNP sites of a fetal free DNA ratio; the above-mentioned downstream specific primer set 2 comprising a downstream primer set 2 for amplifying the alpha thalassemia gene and a downstream primer set 2 for calculating a plurality of SNP sites of fetal free DNA ratio;
并且,上述上游特异性引物组2相比上述上游特异性引物组1更靠近对应的扩增目标位 点,上述下游特异性引物组2相比上述下游特异性引物组1更靠近对应的扩增目标位点;上述上游特异性引物组2和上述下游特异性引物组2的5’端含有用于高通量测序文库的接头序列;Furthermore, the upstream specific primer set 2 is closer to the corresponding amplification target position than the upstream specific primer set 1 described above. Point, the downstream specific primer set 2 is closer to the corresponding amplification target site than the downstream specific primer set 1; the upstream specific primer set 2 and the 5' end of the downstream specific primer set 2 are contained for a linker sequence of a high throughput sequencing library;
(e)将上述上游特异性扩增产物2和下游特异性扩增产物2混合后,使用两端的通用引物进行扩增,得到可上机测序的文库。(e) The upstream specific amplification product 2 and the downstream specific amplification product 2 are mixed, and then amplified using universal primers at both ends to obtain a library which can be sequenced by the machine.
进一步地,上述上游特异性引物组1和下游特异性引物组1的5'端带有用于分离扩增产物的修饰,优选带有生物素修饰。Further, the 5' end of the upstream specific primer set 1 and the downstream specific primer set 1 described above are provided with a modification for isolating the amplification product, preferably with a biotin modification.
进一步地,上述上游特异性引物组1中针对特定目标位点的引物的3’端,与上述上游特异性引物组2中针对该目标位点的引物的5’端有10-15个碱基的重合;上述下游特异性引物组1中针对特定目标位点的引物的3’端,与上述下游特异性引物组2中针对该目标位点的引物的5’端有10-15个碱基的重合。Further, the 3' end of the primer for the specific target site in the upstream specific primer set 1 and the 5' end of the primer corresponding to the target site in the upstream specific primer set 2 are 10-15 bases Coincident; the 3' end of the primer for the specific target site in the downstream specific primer set 1 and 10-15 bases from the 5' end of the primer for the target site in the downstream specific primer set 2 described above Coincident.
进一步地,上述用于扩增α型地贫基因的上游引物组1序列如SEQ ID NO:1-7所示;Further, the above upstream primer set 1 sequence for amplifying the α-thalassemia gene is as shown in SEQ ID NOs: 1-7;
上述用于计算胎儿游离DNA比例的多个SNP位点的上游引物组1序列如SEQ ID NO:8-17所示;The upstream primer set 1 sequence of the above plurality of SNP sites for calculating the ratio of fetal free DNA is as shown in SEQ ID NOS: 8-17;
上述用于扩增α型地贫基因的下游引物组1序列如SEQ ID NO:18-23所示;The sequence of the downstream primer set 1 for amplifying the alpha thalassemia gene is shown in SEQ ID NOs: 18-23;
上述用于计算胎儿游离DNA比例的多个SNP位点的下游引物组1序列如SEQ ID NO:24-33所示;The sequence of the downstream primer set 1 of the above plurality of SNP sites for calculating the ratio of fetal free DNA is shown in SEQ ID NO: 24-33;
上述用于扩增α型地贫基因的上游引物组2序列如SEQ ID NO:34-40所示;The above upstream primer set 2 sequence for amplifying the alpha thalassemia gene is shown in SEQ ID NOs: 34-40;
上述用于计算胎儿游离DNA比例的多个SNP位点的上游引物组2序列如SEQ ID NO:41-50所示;The upstream primer set 2 sequence of the above plurality of SNP sites for calculating the ratio of fetal free DNA is shown in SEQ ID NOs: 41-50;
上述用于扩增α型地贫基因的下游引物组2序列如SEQ ID NO:51-56所示;The sequence of the downstream primer set 2 for amplifying the alpha thalassemia gene is shown in SEQ ID NOs: 51-56;
上述用于计算胎儿游离DNA比例的多个SNP位点的下游引物组2序列如SEQ ID NO:57-66所示。The sequence of the downstream primer set 2 of the above plurality of SNP sites for calculating the ratio of fetal free DNA is shown in SEQ ID NOs: 57-66.
根据本发明的第二方面,本发明提供一种无创产前胎儿α型地贫基因突变检测方法,包括对第一方面的方法构建的文库进行高通量测序得到测序读长(reads);然后,根据位点的唯一的特异标签序列的总深度和突变等位基因(allele)的深度,计算胎源DNA含量并对地贫相关的突变进行分型以分析α型地贫基因突变情况。According to a second aspect of the present invention, the present invention provides a non-invasive prenatal fetal alpha thalassemia gene mutation detecting method comprising performing high-throughput sequencing of a library constructed by the method of the first aspect to obtain a sequencing read read; Based on the total depth of the unique specific tag sequence of the site and the depth of the allele of the mutation, the fetal DNA content was calculated and the thalassemia-related mutation was typed to analyze the alpha-thalassemia gene mutation.
根据本发明的第三方面,本发明提供一种无创产前胎儿α型地贫基因突变检测文库构建试剂盒,上述文库用于通过高通量测序进行无创产前胎儿α型地贫基因突变的检测,上述试剂盒包括:According to a third aspect of the present invention, the present invention provides a non-invasive prenatal fetal alpha thalassemia gene mutation detection library construction kit for performing non-invasive prenatal fetal alpha thalassemia gene mutation by high-throughput sequencing For testing, the above kits include:
(a)上游特异性引物组1和下游特异性引物组1,分别用于对上游预文库和下游预文库 进行特异性扩增,以分别得到上游特异性扩增产物1和下游特异性扩增产物1,(a) upstream specific primer set 1 and downstream specific primer set 1 for upstream pre-library and downstream pre-library, respectively Specific amplification is performed to obtain upstream specific amplification product 1 and downstream specific amplification product 1, respectively.
其中,上述上游特异性引物组1包括用于扩增α型地贫基因的上游引物组1以及用于计算胎儿游离DNA比例的多个SNP位点的上游引物组1;上述下游特异性引物组1包括用于扩增α型地贫基因的下游引物组1以及用于计算胎儿游离DNA比例的多个SNP位点的下游引物组1;Wherein, the upstream specific primer set 1 includes an upstream primer set 1 for amplifying the α-thalassemia gene and an upstream primer set 1 for calculating a plurality of SNP sites of a fetal free DNA ratio; the above-mentioned downstream specific primer set 1 comprising a downstream primer set 1 for amplifying a thalassemia gene and a downstream primer set 1 for calculating a plurality of SNP sites of a fetal free DNA ratio;
(b)上游特异性引物组2和下游特异性引物组2,分别用于对上述上游特异性扩增产物1和下游特异性扩增产物1进行特异性扩增,以分别得到上游特异性扩增产物2和下游特异性扩增产物2,(b) upstream specific primer set 2 and downstream specific primer set 2, respectively, for specifically amplifying the above upstream specific amplification product 1 and downstream specific amplification product 1 respectively to obtain upstream specific expansion Product 2 and downstream specific amplification product 2,
其中,上述上游特异性引物组2包括用于扩增α型地贫基因的上游引物组2以及用于计算胎儿游离DNA比例的多个SNP位点的上游引物组2;上述下游特异性引物组2包括用于扩增α型地贫基因的下游引物组2以及用于计算胎儿游离DNA比例的多个SNP位点的下游引物组2;Wherein the upstream specific primer set 2 includes an upstream primer set 2 for amplifying the α-thalassemia gene and an upstream primer set 2 for calculating a plurality of SNP sites of a fetal free DNA ratio; the above-mentioned downstream specific primer set 2 comprising a downstream primer set 2 for amplifying the alpha thalassemia gene and a downstream primer set 2 for calculating a plurality of SNP sites of fetal free DNA ratio;
并且,上述上游特异性引物组2相比上述上游特异性引物组1更靠近对应的扩增目标位点,上述下游特异性引物组2相比上述下游特异性引物组1更靠近对应的扩增目标位点;上述上游特异性引物组2和上述下游特异性引物组2的5’端含有用于高通量测序文库的接头序列。Further, the upstream specific primer set 2 is closer to the corresponding amplification target site than the upstream specific primer set 1, and the downstream specific primer set 2 is closer to the corresponding amplification than the downstream specific primer set 1 Target site; the upstream specific primer set 2 and the 5' end of the downstream specific primer set 2 described above contain a linker sequence for a high throughput sequencing library.
进一步地,上述上游特异性引物组1和下游特异性引物组1的5'端带有用于分离扩增产物的修饰,优选带有生物素修饰。Further, the 5' end of the upstream specific primer set 1 and the downstream specific primer set 1 described above are provided with a modification for isolating the amplification product, preferably with a biotin modification.
进一步地,上述上游特异性引物组1中针对特定目标位点的引物的3’端,与上述上游特异性引物组2中针对该目标位点的引物的5’端有10-15个碱基的重合;上述下游特异性引物组1中针对特定目标位点的引物的3’端,与上述下游特异性引物组2中针对该目标位点的引物的5’端有10-15个碱基的重合。Further, the 3' end of the primer for the specific target site in the upstream specific primer set 1 and the 5' end of the primer corresponding to the target site in the upstream specific primer set 2 are 10-15 bases Coincident; the 3' end of the primer for the specific target site in the downstream specific primer set 1 and 10-15 bases from the 5' end of the primer for the target site in the downstream specific primer set 2 described above Coincident.
进一步地,上述用于扩增α型地贫基因的上游引物组1序列如SEQ ID NO:1-7所示;Further, the above upstream primer set 1 sequence for amplifying the α-thalassemia gene is as shown in SEQ ID NOs: 1-7;
上述用于计算胎儿游离DNA比例的多个SNP位点的上游引物组1序列如SEQ ID NO:8-17所示;The upstream primer set 1 sequence of the above plurality of SNP sites for calculating the ratio of fetal free DNA is as shown in SEQ ID NOS: 8-17;
上述用于扩增α型地贫基因的下游引物组1序列如SEQ ID NO:18-23所示;The sequence of the downstream primer set 1 for amplifying the alpha thalassemia gene is shown in SEQ ID NOs: 18-23;
上述用于计算胎儿游离DNA比例的多个SNP位点的下游引物组1序列如SEQ ID NO:24-33所示;The sequence of the downstream primer set 1 of the above plurality of SNP sites for calculating the ratio of fetal free DNA is shown in SEQ ID NO: 24-33;
上述用于扩增α型地贫基因的上游引物组2序列如SEQ ID NO:34-40所示;The above upstream primer set 2 sequence for amplifying the alpha thalassemia gene is shown in SEQ ID NOs: 34-40;
上述用于计算胎儿游离DNA比例的多个SNP位点的上游引物组2序列如SEQ ID NO:41-50所示; The upstream primer set 2 sequence of the above plurality of SNP sites for calculating the ratio of fetal free DNA is shown in SEQ ID NOs: 41-50;
上述用于扩增α型地贫基因的下游引物组2序列如SEQ ID NO:51-56所示;The sequence of the downstream primer set 2 for amplifying the alpha thalassemia gene is shown in SEQ ID NOs: 51-56;
上述用于计算胎儿游离DNA比例的多个SNP位点的下游引物组2序列如SEQ ID NO:57-66所示。The sequence of the downstream primer set 2 of the above plurality of SNP sites for calculating the ratio of fetal free DNA is shown in SEQ ID NOs: 57-66.
进一步地,上述试剂盒还包括:Further, the kit further includes:
(c)接头序列,其带有特异标签序列和任选的样本标签序列,用于连接母体外周血游离DNA,以得到连接产物;(c) a linker sequence with a specific tag sequence and an optional sample tag sequence for ligation of maternal peripheral blood free DNA to obtain a ligation product;
(d)预文库扩增引物序列,其与上述接头序列互补结合,用于对上述连接产物进行预文库扩增;(d) a pre-library amplification primer sequence which is complementary to the above-described linker sequence for pre-library amplification of the above-described ligation product;
(e)通用引物,用于在上述上游特异性扩增产物2和下游特异性扩增产物2混合后,对混合产物进行扩增,以得到可上机测序的文库;(e) a universal primer for amplifying the mixed product after mixing the upstream specific amplification product 2 and the downstream specific amplification product 2 to obtain a library which can be sequenced on the machine;
优选地,Preferably,
上述接头序列如SEQ ID NO:67-68所示;The above linker sequences are shown in SEQ ID NOs: 67-68;
上述预文库扩增引物序列如SEQ ID NO:69-70所示;The above pre-library amplification primer sequences are shown in SEQ ID NOs: 69-70;
上述通用引物如SEQ ID NO:71-72所示。The above universal primers are shown in SEQ ID NOs: 71-72.
本发明的方法和试剂盒,能够通过以母体游离血浆DNA为原始材料,构建α型地贫基因突变检测的文库,并通过对文库进行高通量测序而实现α型地贫基因突变检测(包括-αSEA、-α3.7、-α4.2、αCS、αQS、αWS突变检测)。关键在于,在文库构建中,对母体外周血游离DNA片段连接特异性的接头,然后创造性地将接头连接产物预扩增产物分成两部分,分别独立地使用针对目标位点(如α型地贫基因)的上、下游引物进行两轮特异性扩增,能够高特异性地富集目标位点,显著提高引物扩增特异性。并且在扩增过程中,分别使用用于计算胎儿游离DNA比例的多个SNP位点的上游引物组和下游引物组进行扩增,具体地,针对SNP位点也使用不同的引物进行两轮特异性扩增,能够高效、准确的计算胎源DNA含量。从而,在高通量测序之后,能够依据测序读长计算胎源DNA含量并对地贫相关的突变进行分型,有效区分胚胎基因的突变。The method and kit of the invention can construct a library for detection of alpha thalassemia gene mutation by using maternal free plasma DNA as a raw material, and realize detection of alpha thalassemia gene mutation by high-throughput sequencing of the library (including -α SEA , -α 3.7 , -α 4.2 , α CS , α QS , α WS mutation detection). The key is that in the library construction, a specific linker is ligated to the free DNA fragment of the maternal peripheral blood, and then the pre-amplification product of the linker ligation product is creatively divided into two parts, which are used independently for the target site (such as α-type thalassemia). The upstream and downstream primers of the gene are subjected to two rounds of specific amplification, which can highly enrich the target site and significantly increase the specificity of primer amplification. And during the amplification process, the upstream primer set and the downstream primer set for calculating a plurality of SNP sites of the fetal free DNA ratio are respectively used for amplification. Specifically, different primers are also used for the SNP site for two rounds of specificity. Sexual amplification can efficiently and accurately calculate the fetal DNA content. Thus, after high-throughput sequencing, the fetal DNA content can be calculated based on the sequencing read length and the thalassemia-related mutations can be classified to effectively distinguish the mutations of the embryonic gene.
进一步地,发明人特别针对地贫基因设计了高效的引物,以及计算胎儿DNA比例的多个SNP位点的引物。使用本发明的文库构建方法,优选地在使用本发明提供的引物序列组的情况下,能够准确、高效地检测α型地贫基因突变(包括-αSEA、-α3.7、-α4.2、αCS、αQS、αWS突变检测),其结果与羊水穿刺检测分型一致,但安全性、无创性和高效性方面明显优于羊水穿刺检测。Further, the inventors have designed highly efficient primers specifically for thalassemia genes, as well as primers for calculating multiple SNP sites of fetal DNA ratio. Using the library construction method of the present invention, it is preferred to accurately and efficiently detect alpha-thalassemia gene mutations (including -α SEA , -α 3.7 , -α 4.2 , α) using the primer sequence set provided by the present invention. CS , α QS , α WS mutation detection), the results are consistent with the amniocentesis test classification, but the safety, non-invasive and high efficiency is significantly better than amniocentesis.
附图说明DRAWINGS
图1为本发明实施例的无创产前胎儿α型地贫基因突变检测文库构建方法和突变检测方 法的流程示意图;1 is a method for constructing a non-invasive prenatal fetal alpha thalassemia gene mutation detection library and a mutation detecting method according to an embodiment of the present invention; Schematic diagram of the process;
图2为本发明实施例的上下游特异性引物、通用引物与模板结合的相对位置关系示意图;2 is a schematic diagram showing the relative positional relationship between the upstream and downstream specific primers, the universal primers and the template in the embodiment of the present invention;
图3为本发明实施例的无创产前胎儿α型地贫基因突变检测的预文库(a)和终文库(b)的2%凝胶电泳检测图谱,M表示Marker,A1-A6表示6个示例性的样本。3 is a 2% gel electrophoresis detection map of a pre-library (a) and a final library (b) for detection of non-invasive prenatal fetal alpha thalassemia gene mutation according to an embodiment of the present invention, M represents Marker, and A1-A6 represents 6 An exemplary sample.
具体实施方式detailed description
下面通过具体实施方式结合附图对本发明作进一步详细说明。The present invention will be further described in detail below with reference to the accompanying drawings.
如图1所示,本发明实施例的无创产前胎儿α型地贫基因突变检测文库构建方法包括步骤:As shown in FIG. 1 , a method for constructing a non-invasive prenatal fetal alpha thalassemia gene mutation detection library according to an embodiment of the present invention includes the steps of:
S1:对母体外周血游离DNA连接带有特异标签序列和任选的样本标签序列的接头序列。S1: A linker sequence with a specific tag sequence and an optional sample tag sequence is ligated to the parental peripheral blood free DNA.
母体外周血游离DNA,即孕妇的外周血游离DNA,其中包括母源血游离DNA和胎源游离DNA。接头序列带有特异标签序,该特异标签序可以是一段n个(例如8或10个)碱基长度的随机序列,使得每一接头序列均不同于其它接头序列,也就是说,接头序列用于区分其连接的母体外周血游离DNA,使得每一母体外周血游离DNA片段都带有特异性的接头序列,便于后续测序之后的序列信息分析中能够区分不同母体外周血游离DNA片段。由于在高通量测序中,每次都能产生海量的测序数据,因此往往将不同来源(例如来源于不同母体的外周血游离DNA)的样本混库(pooling)进行测序,因此接头序列还可以带有一段样本标签序列(index),其用于区分不同来源的样本。The parental peripheral blood free DNA, that is, the peripheral blood free DNA of pregnant women, including maternal blood free DNA and fetal-derived free DNA. The linker sequence carries a specific tag sequence, which can be a random sequence of n (eg, 8 or 10) bases in length, such that each linker sequence is different from the other linker sequences, that is, the linker sequence is In order to distinguish the free DNA of the maternal peripheral blood, the free DNA fragments of each parental blood bank have a specific linker sequence, which facilitates the differentiation of different maternal peripheral blood free DNA fragments in the sequence information analysis after subsequent sequencing. Because high-throughput sequencing produces massive amounts of sequencing data each time, sample pooling of different sources (eg, peripheral blood free DNA from different maternal sources) is often sequenced, so the linker sequence can also With a sample tag sequence (index), it is used to distinguish samples from different sources.
S2:使用预文库扩增引物序列对上一步的连接产物进行预文库扩增,其中预文库扩增引物序列与接头序列互补结合。S2: Pre-library amplification of the ligation product of the previous step using a pre-library amplification primer sequence, wherein the pre-library amplification primer sequence is complementary to the linker sequence.
进行预文库扩增(或称“预扩增”)是为了增加接头连接产物的量,因为接头连接产物中有些分子的拷贝数可能偏低,在后续分成上、下游两部分进行特异性扩增时,这些拷贝数偏低的分子可能不能得到有效扩增,而预扩增增加接头连接产物的量,使得各种拷贝数的分子都能得到有效扩增。Pre-library amplification (or "pre-amplification") is performed to increase the amount of linker ligation product, as some of the molecules in the linker ligation product may have a lower copy number and are specifically amplified in subsequent partitions. At these times, these molecules with low copy number may not be efficiently amplified, and pre-amplification increases the amount of linker ligation products, so that molecules of various copy numbers can be efficiently amplified.
S3:将上一步得到的预文库分为上游预文库和下游预文库,分别使用上游特异性引物组1和下游特异性引物组1对上游预文库和下游预文库进行特异性扩增,分别得到上游特异性扩增产物1和下游特异性扩增产物1。S3: The pre-library obtained in the previous step is divided into an upstream pre-library and a downstream pre-library, and the upstream pre-library and the downstream pre-library are specifically amplified by using the upstream specific primer set 1 and the downstream specific primer set 1, respectively, respectively. Upstream specific amplification product 1 and downstream specific amplification product 1.
本发明将预文库分为上游预文库和下游预文库分别进行两轮特异性扩增,能够显著提高引物扩增特异性。上、下游分开独立扩增的有益效果包括:一方面,由于二代高通量测序读长(reads)短的原因,上游特异性引物与下游特异性引物需要比较靠近扩增目标位点(或目标区域),不分开难以有效扩增到目标片段,而上、下游分开独立扩增能够有效扩增到目标片段;另一方面,上、下游分开独立扩增能够在序列的一端保留接头序列,便于后续高通量测 序的进行。In the present invention, the pre-library is divided into an upstream pre-library and a downstream pre-library for two rounds of specific amplification, which can significantly increase the specificity of primer amplification. The beneficial effects of separate independent amplification on the upstream and downstream include: on the one hand, upstream specific primers and downstream specific primers need to be closer to the amplification target site due to the second generation high-throughput sequencing read shorts (or Target region), it is difficult to effectively amplify the target fragment without separation, and independent amplification of the upstream and downstream can effectively amplify the target fragment; on the other hand, independent amplification of the upstream and downstream can retain the linker sequence at one end of the sequence. Easy for subsequent high-throughput measurements The progress of the order.
本发明实施例中,扩增目标位点(或目标区域)包括α型地贫基因和用于计算胎儿DNA比例的多个SNP位点两种,相应的,上游特异性引物组1和下游特异性引物组1也均分别包括两种引物,即上游特异性引物组1包括:用于扩增α型地贫基因的上游引物组1以及用于计算胎儿游离DNA比例的多个SNP位点的上游引物组1;下游特异性引物组1包括:用于扩增α型地贫基因的下游引物组1以及用于计算胎儿游离DNA比例的多个SNP位点的下游引物组1。In the embodiment of the present invention, the amplification target site (or target region) includes two types of thalassemia gene and a plurality of SNP sites for calculating fetal DNA ratio, and correspondingly, upstream specific primer set 1 and downstream specific The primer set 1 also includes two primers respectively, that is, the upstream specific primer set 1 includes: an upstream primer set 1 for amplifying the α-thalassemia gene and a plurality of SNP sites for calculating a fetal free DNA ratio. The upstream primer set 1; the downstream specific primer set 1 includes a downstream primer set 1 for amplifying the α-thalassemia gene and a downstream primer set 1 for calculating a plurality of SNP sites of the fetal free DNA ratio.
如图2示出了本发明实施例中用于扩增α型地贫基因的上游引物组1(USP1)和用于扩增α型地贫基因的下游引物组1(DSP1)与模板结合的相对位置关系。可以看出,USP1和DSP1分别位于扩增目标位点——或称检测点(即α型地贫基因突变)的左右两侧,一般而言USP1和DSP1的3’端距离检测点较近,例如几个碱基、十几个碱基的距离,其中一个原因是,二代高通量测序的读长序列(reads)较短,例如100bp左右,如果USP1和DSP1的3’端距离检测点较远的话,可能二代高通量测序不能有效地测到检测点。用于计算胎儿游离DNA比例的多个SNP位点的上、下游特异性引物与模板结合的相对位置关系与图2的示意图相同。2 shows the upstream primer set 1 (USP1) for amplifying the α-thalassemia gene and the downstream primer set 1 (DSP1) for amplifying the α-thalassemia gene in combination with the template in the embodiment of the present invention. Relative positional relationship. It can be seen that USP1 and DSP1 are located at the left and right sides of the amplification target site, or the detection point (ie, the α-type thalassemia gene mutation). Generally, the 3' end of USP1 and DSP1 are closer to the detection point. For example, a few bases, a distance of a dozen bases, one of the reasons is that the second-generation high-throughput sequencing reads shorter, such as about 100bp, if the USP1 and DSP1 3' end distance detection point In the far distance, it may be that the second generation high-throughput sequencing cannot effectively detect the detection points. The relative positional relationship of the upstream and downstream specific primers to the template binding of the plurality of SNP sites for calculating the fetal free DNA ratio is the same as that of the schematic diagram of FIG.
为了能够分离上游特异性扩增产物1和下游特异性扩增产物1,上游特异性引物组1和下游特异性引物组1的5'端可以带有用于分离扩增产物的修饰,优选带有生物素修饰,生物素能够与亲和素结合,因此上游特异性扩增产物1和下游特异性扩增产物1可以通过生物素-亲和素亲和结合从反应体系中分离出来。In order to be able to separate the upstream specific amplification product 1 and the downstream specific amplification product 1, the 5' end of the upstream specific primer set 1 and the downstream specific primer set 1 may be provided with a modification for isolating the amplification product, preferably with Biotin modification, biotin can bind to avidin, so upstream specific amplification product 1 and downstream specific amplification product 1 can be separated from the reaction system by biotin-avidin affinity binding.
S4:对上游特异性扩增产物1和下游特异性扩增产物1,分别使用上游特异性引物组2和下游特异性引物组2进行特异性扩增,分别得到上游特异性扩增产物2和下游特异性扩增产物2。S4: specific amplification of upstream specific amplification product 1 and downstream specific amplification product 1 using upstream specific primer set 2 and downstream specific primer set 2, respectively, to obtain upstream specific amplification products 2 and Downstream specific amplification product 2.
只有一轮特异性引物扩增的情况下,可能会有非特异性扩增产物的存在。因此,需要进行第二轮特异性引物扩增,该第二轮的特异性引物扩增使用的特异性引物也包括上游特异性引物和下游特异性引物。In the case of only one round of specific primer amplification, there may be a presence of non-specific amplification products. Therefore, a second round of specific primer amplification is required, and the specific primers used for the second round of specific primer amplification also include upstream specific primers and downstream specific primers.
在本发明实施例中,类似于第一轮特异性引物扩增,第二轮特异性引物扩增使用的上游特异性引物组2包括:用于扩增α型地贫基因的上游引物组2以及用于计算胎儿游离DNA比例的多个SNP位点的上游引物组2;下游特异性引物组2包括:用于扩增α型地贫基因的下游引物组2以及用于计算胎儿游离DNA比例的多个SNP位点的下游引物组2。In an embodiment of the invention, similar to the first round of specific primer amplification, the upstream specific primer set 2 used for the second round of specific primer amplification comprises: an upstream primer set 2 for amplifying the alpha thalassemia gene 2 And an upstream primer set 2 for calculating a plurality of SNP sites of the fetal free DNA ratio; the downstream specific primer set 2 includes: a downstream primer set 2 for amplifying the α-thalassemia gene and for calculating a fetal free DNA ratio The downstream primer set 2 of multiple SNP sites.
如图2示出了本发明实施例中用于扩增α型地贫基因的上游引物组2(USP2)和用于扩增α型地贫基因的下游引物组2(DSP2)与模板结合的相对位置关系。可以看出,USP2相比USP1更靠近α型地贫基因突变位点,DSP2相比DSP1更靠近α型地贫基因突变位点。用 于计算胎儿游离DNA比例的多个SNP位点的上、下游引物组2与模板结合的相对位置关系与图2的示意图相同。总之,上游特异性引物组2相比上游特异性引物组1更靠近对应的扩增目标位点,下游特异性引物组2相比下游特异性引物组1更靠近对应的扩增目标位点。2 shows the upstream primer set 2 (USP2) for amplifying the α-thalassemia gene and the downstream primer set 2 (DSP2) for amplifying the α-thalassemia gene in combination with the template in the embodiment of the present invention. Relative positional relationship. It can be seen that USP2 is closer to the alpha-type thalassemia gene mutation site than USP1, and DSP2 is closer to the alpha-type thalassemia gene mutation site than DSP1. use The relative positional relationship between the upstream and downstream primer set 2 and the template binding of the plurality of SNP sites for calculating the fetal free DNA ratio is the same as that of the schematic diagram of FIG. In summary, the upstream specific primer set 2 is closer to the corresponding amplification target site than the upstream specific primer set 1, and the downstream specific primer set 2 is closer to the corresponding amplification target site than the downstream specific primer set 1.
在具体实施例中,上游特异性引物组1中针对特定目标位点的引物(例如针对α型地贫基因的USP1)的3’端,与上游特异性引物组2中针对该目标位点的引物(例如针对α型地贫基因的USP2)的5’端有10-15个碱基的重合;类似地,下游特异性引物组1中针对特定目标位点的引物(例如针对α型地贫基因的DSP1)的3’端,与下游特异性引物组2中针对该目标位点的引物(例如针对α型地贫基因的DSP2)的5’端有10-15个碱基的重合。此处“3’端”和“5’端”,分别是指靠近3’末端和5’末端的一段序列。这种重合,主要是考虑USP1和DSP1原本距离检测点(例如α型地贫基因突变位点)较近,碱基的重合能够在进一步提高扩增特异性的同时,充分有效利用结合空间。In a specific embodiment, the primer for a specific target site in the upstream specific primer set 1 (for example, USP1 for the alpha-type thalassemia gene), and the upstream specific primer set 2 for the target site Primers (eg, USP2 for the alpha-type thalassemia gene) have a 10-15 base overlap at the 5' end; similarly, primers for a specific target site in the downstream specific primer set 1 (eg, for alpha-type thalassemia) The 3' end of the DSP1) of the gene has a 10-15 base overlap with the 5' end of the downstream specific primer set 2 for the target site (for example, DSP2 for the alpha-type thalassemia gene). Here, "3' end" and "5' end" refer to a sequence near the 3' end and the 5' end, respectively. This coincidence is mainly considering that USP1 and DSP1 are close to the original detection point (for example, α-type thalassemia gene mutation site), and the superposition of bases can fully utilize the binding space while further improving the amplification specificity.
为了便于高通量测序的进行,上游特异性引物组2和下游特异性引物组2的5’端可以含有用于高通量测序文库的接头序列。具体的,使用何种高通量测序平台,则采用该平台对应的接头序列。To facilitate high-throughput sequencing, the 5' end of upstream specific primer set 2 and downstream specific primer set 2 may contain a linker sequence for a high throughput sequencing library. Specifically, which high-throughput sequencing platform is used, the corresponding linker sequence of the platform is used.
S5:将上游特异性扩增产物2和下游特异性扩增产物2混合后,使用两端的通用引物进行扩增,得到可上机测序的文库。S5: After the upstream specific amplification product 2 and the downstream specific amplification product 2 are mixed, amplification is performed using universal primers at both ends to obtain a library which can be sequenced on the machine.
如图2所示,通用引物(UNP)能够结合接头序列,通过通用引物扩增即可得到上机测序的文库。上机测序通过高通量测序实现。高通量测序技术可以选自Illumina/Genome Analyzer/Hiseq/Miseq/NEXTseq CN500、Applied Biosystems SOLID和life Technologies/Ion Torrent。具体的,使用何种高通量测序技术,则通用引物也采用该高通量测序技术对应的引物。As shown in Figure 2, the universal primer (UNP) is capable of binding to the linker sequence and is amplified by universal primers to obtain a library for sequencing on the machine. Sequencing on the machine is achieved by high-throughput sequencing. The high throughput sequencing technology can be selected from the group consisting of Illumina/Genome Analyzer/Hiseq/Miseq/NEXTseq CN500, Applied Biosystems SOLID, and life Technologies/Ion Torrent. Specifically, which high-throughput sequencing technology is used, universal primers also use primers corresponding to the high-throughput sequencing technology.
在本发明的一个实施例中,用于扩增α型地贫基因的上游引物组1序列如SEQ ID NO:1-7所示;用于计算胎儿游离DNA比例的多个SNP位点的上游引物组1序列如SEQ ID NO:8-17所示;用于扩增α型地贫基因的下游引物组1序列如SEQ ID NO:18-23所示;用于计算胎儿游离DNA比例的多个SNP位点的下游引物组1序列如SEQ ID NO:24-33所示;用于扩增α型地贫基因的上游引物组2序列如SEQ ID NO:34-40所示;用于计算胎儿游离DNA比例的多个SNP位点的上游引物组2序列如SEQ ID NO:41-50所示;用于扩增α型地贫基因的下游引物组2序列如SEQ ID NO:51-56所示;用于计算胎儿游离DNA比例的多个SNP位点的下游引物组2序列如SEQ ID NO:57-66所示。In one embodiment of the invention, the upstream primer set 1 sequence for amplifying the alpha-type thalassemia gene is set forth in SEQ ID NOs: 1-7; upstream of a plurality of SNP sites for calculating fetal free DNA ratio Primer set 1 sequences are set forth in SEQ ID NOS: 8-17; downstream primer set 1 sequences for amplifying alpha-type thalassemia gene are set forth in SEQ ID NOS: 18-23; used to calculate the ratio of fetal free DNA The sequence of the downstream primer set 1 of one SNP site is shown in SEQ ID NOs: 24-33; the sequence of the upstream primer set 2 used to amplify the alpha thalassemia gene is shown in SEQ ID NOs: 34-40; The upstream primer set 2 sequence of the plurality of SNP sites of the fetal free DNA ratio is shown in SEQ ID NOs: 41-50; the downstream primer set 2 sequence for amplifying the alpha thalassemia gene is SEQ ID NO: 51-56 The sequence of the downstream primer set 2 for the multiple SNP sites used to calculate the fetal free DNA ratio is shown in SEQ ID NOs: 57-66.
本发明实施例还提供一种无创产前胎儿α型地贫基因突变检测方法,包括对本发明实施例构建的文库进行高通量测序得到测序读长(reads);然后,根据位点的唯一的特异标签序列 的总深度和突变等位基因(allele)的深度,计算胎源DNA含量并对地贫相关的突变进行分型以分析α型地贫基因突变情况。The embodiment of the invention further provides a non-invasive prenatal fetal alpha thalassemia gene mutation detecting method, comprising performing high-throughput sequencing of the library constructed in the embodiment of the invention to obtain a sequencing read read; then, according to the unique position of the site Specific tag sequence The total depth and depth of the allele of the mutation, the fetal DNA content was calculated and the thalassemia-related mutations were typed to analyze the alpha-thalassemia gene mutation.
本发明实施例还提供一种无创产前胎儿α型地贫基因突变检测文库构建试剂盒,上述文库用于通过高通量测序进行无创产前胎儿α型地贫基因突变的检测,上述试剂盒包括:The embodiment of the invention further provides a non-invasive prenatal fetal alpha thalassemia gene mutation detection library construction kit, wherein the above library is used for detecting non-invasive prenatal fetal alpha thalassemia gene mutation by high-throughput sequencing, the above kit include:
(R1)上游特异性引物组1和下游特异性引物组1,分别用于对上游预文库和下游预文库进行特异性扩增,以分别得到上游特异性扩增产物1和下游特异性扩增产物1,其中,上游特异性引物组1包括用于扩增α型地贫基因的上游引物组1以及用于计算胎儿游离DNA比例的多个SNP位点的上游引物组1;下游特异性引物组1包括用于扩增α型地贫基因的下游引物组1以及用于计算胎儿游离DNA比例的多个SNP位点的下游引物组1;(R1) upstream specific primer set 1 and downstream specific primer set 1 for specific amplification of upstream pre-library and downstream pre-library, respectively, to obtain upstream specific amplification product 1 and downstream specific amplification, respectively Product 1, wherein the upstream specific primer set 1 comprises an upstream primer set 1 for amplifying a type alpha thalassemia gene and an upstream primer set 1 for calculating a plurality of SNP sites of a fetal free DNA ratio; downstream specific primers Group 1 includes a downstream primer set 1 for amplifying a thalassemia gene and a downstream primer set 1 for calculating a plurality of SNP sites of a fetal free DNA ratio;
(R2)上游特异性引物组2和下游特异性引物组2,分别用于对上游特异性扩增产物1和下游特异性扩增产物1进行特异性扩增,以分别得到上游特异性扩增产物2和下游特异性扩增产物2,其中,上游特异性引物组2包括用于扩增α型地贫基因的上游引物组2以及用于计算胎儿游离DNA比例的多个SNP位点的上游引物组2;下游特异性引物组2包括用于扩增α型地贫基因的下游引物组2以及用于计算胎儿游离DNA比例的多个SNP位点的下游引物组2。(R2) upstream specific primer set 2 and downstream specific primer set 2, respectively, for specific amplification of upstream specific amplification product 1 and downstream specific amplification product 1 respectively, to obtain upstream specific amplification, respectively Product 2 and downstream specific amplification product 2, wherein upstream specific primer set 2 comprises an upstream primer set 2 for amplifying the alpha-thalassemia gene and an upstream of a plurality of SNP sites for calculating the fetal free DNA ratio Primer set 2; downstream specific primer set 2 includes a downstream primer set 2 for amplifying the alpha-thalassemia gene and a downstream primer set 2 for calculating a plurality of SNP sites of the fetal free DNA ratio.
其中,上游特异性引物组2相比上游特异性引物组1更靠近对应的扩增目标位点,下游特异性引物组2相比下游特异性引物组1更靠近对应的扩增目标位点;上游特异性引物组2和下游特异性引物组2的5’端含有用于高通量测序文库的接头序列。Wherein, the upstream specific primer set 2 is closer to the corresponding amplification target site than the upstream specific primer set 1, and the downstream specific primer set 2 is closer to the corresponding amplification target site than the downstream specific primer set 1; The 5' end of upstream specific primer set 2 and downstream specific primer set 2 contains a linker sequence for a high throughput sequencing library.
在优选的实施例中,上游特异性引物组1和下游特异性引物组1的5'端带有用于分离扩增产物的修饰,优选带有生物素修饰,以便从体系中分离上游特异性扩增产物1和下游特异性扩增产物1。In a preferred embodiment, the upstream specific primer set 1 and the downstream specific primer set 1 have a modification for the isolation of the amplified product, preferably with a biotin modification, to separate upstream specific expansion from the system. Product 1 and downstream specific amplification product 1 are added.
在优选的实施例中,上游特异性引物组1中针对特定目标位点的引物的3’端,与上游特异性引物组2中针对该目标位点的引物的5’端有10-15个碱基的重合;下游特异性引物组1中针对特定目标位点的引物的3’端,与下游特异性引物组2中针对该目标位点的引物的5’端有10-15个碱基的重合。In a preferred embodiment, the 3' end of the primer for the specific target site in the upstream specific primer set 1 and the 10' end of the primer for the target site in the upstream specific primer set 2 are 10-15 Coincident of bases; the 3' end of the primer for the specific target site in the downstream specific primer set 1 and the 10' end of the primer of the downstream specific primer set 2 for the target site are 10-15 bases Coincident.
在本发明的一个实施例中,用于扩增α型地贫基因的上游引物组1序列如SEQ ID NO:1-7所示;用于计算胎儿游离DNA比例的多个SNP位点的上游引物组1序列如SEQ ID NO:8-17所示;用于扩增α型地贫基因的下游引物组1序列如SEQ ID NO:18-23所示;用于计算胎儿游离DNA比例的多个SNP位点的下游引物组1序列如SEQ ID NO:24-33所示;用于扩增α型地贫基因的上游引物组2序列如SEQ ID NO:34-40所示;用于计算胎儿游离DNA比例的多个SNP位点的上游引物组2序列如SEQ ID NO:41-50所示;用于扩增α型地贫基 因的下游引物组2序列如SEQ ID NO:51-56所示;用于计算胎儿游离DNA比例的多个SNP位点的下游引物组2序列如SEQ ID NO:57-66所示。In one embodiment of the invention, the upstream primer set 1 sequence for amplifying the alpha-type thalassemia gene is set forth in SEQ ID NOs: 1-7; upstream of a plurality of SNP sites for calculating fetal free DNA ratio Primer set 1 sequences are set forth in SEQ ID NOS: 8-17; downstream primer set 1 sequences for amplifying alpha-type thalassemia gene are set forth in SEQ ID NOS: 18-23; used to calculate the ratio of fetal free DNA The sequence of the downstream primer set 1 of one SNP site is shown in SEQ ID NOs: 24-33; the sequence of the upstream primer set 2 used to amplify the alpha thalassemia gene is shown in SEQ ID NOs: 34-40; The upstream primer set 2 sequence of multiple SNP sites of fetal free DNA ratio is shown in SEQ ID NOs: 41-50; used to amplify alpha-type thalassemia The sequence of the downstream primer set 2 is shown in SEQ ID NOS: 51-56; the sequence of the downstream primer set 2 for calculating a plurality of SNP sites of fetal free DNA ratio is shown in SEQ ID NOs: 57-66.
在优选的实施例中,试剂盒还包括:In a preferred embodiment, the kit further comprises:
(R3)接头序列,其带有特异标签序列和任选的样本标签序列,用于连接母体外周血游离DNA,以得到连接产物;a (R3) linker sequence carrying a specific tag sequence and an optional sample tag sequence for ligation of maternal peripheral blood free DNA to obtain a ligation product;
(R4)预文库扩增引物序列,其与上述接头序列互补结合,用于对上述连接产物进行预文库扩增;(R4) a pre-library amplification primer sequence which is complementary to the above-described linker sequence for pre-library amplification of the above-described ligation product;
(R5)通用引物,用于在上述上游特异性扩增产物2和下游特异性扩增产物2混合后,对混合产物进行扩增,以得到可上机测序的文库。(R5) a universal primer for amplifying the mixed product after mixing the upstream specific amplification product 2 and the downstream specific amplification product 2 described above to obtain a library which can be sequenced on the machine.
在本发明的一个实施例中,接头序列如SEQ ID NO:67-68所示;预文库扩增引物序列如SEQ ID NO:69-70所示;通用引物如SEQ ID NO:71-72所示。In one embodiment of the invention, the linker sequence is set forth in SEQ ID NOs: 67-68; the pre-library amplification primer sequences are set forth in SEQ ID NOs: 69-70; and the universal primers are set forth in SEQ ID NO: 71-72. Show.
以下通过实施例详细说明本发明的技术方案和技术效果,应当理解,实施例仅是示例性的,不能理解为对本发明保护范围的限制。The technical solutions and the technical effects of the present invention are described in detail below by way of examples. It is to be understood that the embodiments are only illustrative and not restrictive.
实施例Example
1、针对α地中海贫血基因-αSEA、-α3.7、-α4.2、αCS、αQS、αWS突变位点设计合成引物:1. Design synthetic primers for α thalassemia gene-α SEA , -α 3.7 , -α 4.2 , α CS , α QS , α WS mutation sites:
上游引物和下游引物分别位于检测点左侧和右侧。同侧的特异性引物1的3’端和特异性引物2的5’端有10-15个碱基的重合。特异性引物1的5'端有生物素修饰。特异性引物2的5’端含有用于高通量测序文库的接头序列。具体的,引物序列如表1所示。The upstream primer and the downstream primer are located on the left and right sides of the detection point, respectively. The 3' end of the ipsilateral specific primer 1 and the 5' end of the specific primer 2 have a 10-15 base overlap. Specific primer 1 has a biotin modification at the 5' end. The 5' end of specific primer 2 contains a linker sequence for a high throughput sequencing library. Specifically, the primer sequences are shown in Table 1.
表1Table 1
Figure PCTCN2017113231-appb-000001
Figure PCTCN2017113231-appb-000001
Figure PCTCN2017113231-appb-000002
Figure PCTCN2017113231-appb-000002
Figure PCTCN2017113231-appb-000003
Figure PCTCN2017113231-appb-000003
Figure PCTCN2017113231-appb-000004
Figure PCTCN2017113231-appb-000004
Figure PCTCN2017113231-appb-000005
Figure PCTCN2017113231-appb-000005
2、带有特异标签序列和样本标签序列(index)的接头序列:2. A linker sequence with a specific tag sequence and a sample tag sequence:
ADT-F:ADT-F:
CAAGCAGAAGACGGCATACGAGATNNNNNNNNATTAAGGGTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT(SEQ ID NO:67);CAAGCAGAAGACGGCATACGAGATNNNNNNNNATTAAGGGTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT (SEQ ID NO: 67);
ADT-R:pGATCGGAAGAGC(SEQ ID NO:68)。ADT-R: pGATCGGAAGAGC (SEQ ID NO: 68).
其中,NNNNNNNN是特异标签序列,ATTAAGG是样本标签序列(index),以上接头序列需要退火成双链。Among them, NNNNNNNN is a specific tag sequence, ATTAAGG is a sample tag sequence (index), and the above linker sequence needs to be annealed into a double strand.
3、预文库扩增引物序列:3. Pre-library amplification primer sequence:
pre-lib-primer-F:CAAGCAGAAGACGGCATACGA(SEQ ID NO:69);pre-lib-primer-F: CAAGCAGAAGACGGCATACGA (SEQ ID NO: 69);
pre-lib-primer-R:GCTCTTCCGATCT(SEQ ID NO:70)。pre-lib-primer-R: GCTCTTCCGATCT (SEQ ID NO: 70).
4、终文库扩增引物序列(通用引物):4. Final library amplification primer sequence (general primer):
Seq-lib-primer-F:CAAGCAGAAGACGGCATACGA(SEQ ID NO:71);Seq-lib-primer-F: CAAGCAGAAGACGGCATACGA (SEQ ID NO: 71);
Seq-lib-primer-R:Seq-lib-primer-R:
ATGATACGGCGACCACCGAGATCTACACTCGTCGGCAGCGTC(SEQ ID NO:72)。ATGATACGGCGACCACCGAGATCTACACTCGTCGGCAGCGTC (SEQ ID NO: 72).
5、本发明实施例的检测方法和检测试剂盒,具体建库步骤为:5. The detection method and the detection kit of the embodiment of the present invention, the specific construction steps are as follows:
1)提取2ml孕妇血浆游离DNA。1) Extract 2 ml of pregnant women's plasma free DNA.
2)游离DNA末端修复,配置反应体系(反应一)如表2所示。2) Free DNA end repair, and the reaction system (Reaction 1) was set as shown in Table 2.
表2 Table 2
试剂Reagent 体积(μL)Volume (μL)
DNADNA 40.540.5
T4 DNA连接酶缓冲液+10mM ATPT4 DNA ligase buffer + 10 mM ATP 55
10mM dNTP混合液10mM dNTP mixture 22
T4 DNA聚合酶(Enzymatics公司)T4 DNA polymerase (Enzymatics) 11
T4 DNA磷酸化酶(Enzymatics公司)T4 DNA phosphorylase (Enzymatics) 0.50.5
rTaq(Takara公司)rTaq (Takara company) 11
总体积total capacity 5050
充分混匀,瞬时离心,在热循环仪上进行如下反应:37℃,20min;72℃,20min;4℃保存。Mix well, centrifuge instantaneously, and carry out the following reaction on a thermal cycler: 37 ° C, 20 min; 72 ° C, 20 min; 4 ° C storage.
3)DNA片段连接带有特异标签序列和样本标签序列的接头序列,配置反应体系(反应二)如表3所示。3) The DNA fragment is ligated to the linker sequence carrying the specific tag sequence and the sample tag sequence, and the reaction system (Reaction 2) is shown in Table 3.
表3table 3
试剂Reagent 体积(μL)Volume (μL)
反应一Reaction one 5050
DNA连接酶缓冲液DNA ligase buffer 2727
DNA连接酶(Enzymatics公司)DNA ligase (Enzymatics) 11
接头序列(SEQ ID NO:67-68)Linker sequence (SEQ ID NO: 67-68) 22
总体积total capacity 8080
充分混匀,瞬时离心,在热循环仪上进行如下反应:20℃,15min;65℃,10min;4℃保存。Mix well, centrifuge instantaneously, and carry out the following reaction on a thermocycler: 20 ° C, 15 min; 65 ° C, 10 min; 4 ° C preservation.
4)反应结束后,立即使用30μL XP beads纯化,26μL超纯水或者洗脱液中洗脱。4) Immediately after completion of the reaction, use 30 μL of XP beads to purify, and elute with 26 μL of ultrapure water or an eluent.
5)PCR预扩增5) PCR preamplification
在冰上配置如下反应体系,如表4所示。The following reaction system was placed on ice as shown in Table 4.
表4Table 4
Figure PCTCN2017113231-appb-000006
Figure PCTCN2017113231-appb-000006
Figure PCTCN2017113231-appb-000007
Figure PCTCN2017113231-appb-000007
充分混匀,瞬时离心,在热循环仪上进行如下反应:95℃,3min,1循环;(95℃,15s,62℃,30s,72℃,30s)10循环;72℃,5min,1循环;4℃保存。Mix thoroughly, centrifuge instantaneously, and perform the following reactions on a thermal cycler: 95 ° C, 3 min, 1 cycle; (95 ° C, 15 s, 62 ° C, 30 s, 72 ° C, 30 s) 10 cycles; 72 ° C, 5 min, 1 cycle ; 4 ° C preservation.
6)反应终产物使用50μL XP beads回收纯化,使用Qubit定量,5μL反应产物2%凝胶电泳检测,结果如图3a所示,表明预文库构建成功。取两等份预文库,分别命名为上游预文库和下游预文库,每份100ng,进行下一步扩增。6) The final reaction product was purified by using 50 μL of XP beads, and quantification by Qubit, and 5 μL of the reaction product was detected by 2% gel electrophoresis. The results are shown in Fig. 3a, indicating that the pre-library was successfully constructed. Two aliquots of the pre-library were taken and named as the upstream pre-library and the downstream pre-library, respectively, and each 100 ng was subjected to the next amplification.
7)特异性扩增17) Specific amplification 1
每份样品预文库为上游预文库和下游预文库,分别使用对应的上、下游特异性引物1和特异性引物2,直到使用通用引物扩增时将每份样品上、下游引物扩增产物合并,再使用通用引物扩增。Each sample pre-library is an upstream pre-library and a downstream pre-library, respectively using the corresponding upstream and downstream specific primer 1 and specific primer 2, respectively, until the amplification of the upstream and downstream primers of each sample is combined when using universal primer amplification. And then use universal primers for amplification.
在冰上配置如下反应体系,如表5所示。The following reaction system was placed on ice as shown in Table 5.
表5table 5
Figure PCTCN2017113231-appb-000008
Figure PCTCN2017113231-appb-000008
PCR反应程序如下:95℃,10min,1循环;(95℃,30s,62℃,30s,72℃,1min)20循环;72℃,7min,1循环;4℃保存。The PCR reaction procedure was as follows: 95 ° C, 10 min, 1 cycle; (95 ° C, 30 s, 62 ° C, 30 s, 72 ° C, 1 min) 20 cycles; 72 ° C, 7 min, 1 cycle; 4 ° C storage.
8)5μL反应产物2%琼脂糖凝胶电泳,剩余产物1.2X XP回收,24μL超纯水或者洗脱液洗脱,洗脱液进行下一步扩增。8) 5 μL of the reaction product was subjected to 2% agarose gel electrophoresis, and the remaining product was recovered by 1.2X XP, eluted with 24 μL of ultrapure water or an eluent, and the eluate was subjected to further amplification.
9)特异性扩增29) Specific amplification 2
在冰上配置如下反应体系,如表6所示。The following reaction system was placed on ice as shown in Table 6.
表6Table 6
试剂Reagent 用量Dosage
上游或下游特异性扩增1产物Upstream or downstream specific amplification of 1 product 22μL22μL
360 Master mix(Applied Biosystems公司)360 Master mix (Applied Biosystems) 25μL25μL
U-SP2(SEQ ID NO:34-50)或U-SP2 (SEQ ID NO: 34-50) or 2μL2μL
D-SP2(SEQ ID NO:51-66)D-SP2 (SEQ ID NO: 51-66)  
GC-增强剂GC-enhancer 1μL1μL
PCR反应程序如下:95℃,10min,1循环;(95℃,30s,62℃,30s,72℃,1min)15循环;72℃,7min,1循环;4℃保存。The PCR reaction procedure was as follows: 95 ° C, 10 min, 1 cycle; (95 ° C, 30 s, 62 ° C, 30 s, 72 ° C, 1 min) 15 cycles; 72 ° C, 7 min, 1 cycle; 4 ° C preservation.
10)5μL反应产物2%琼脂糖凝胶电泳,剩余产物1.2X XP回收,每一个样品的上下游扩增产物可以合并回收,最终26μL超纯水或者洗脱液洗脱,洗脱液进行通用引物扩增。10) 5 μL of the reaction product was electrophoresed on 2% agarose gel, and the remaining product was recovered by 1.2X XP. The upstream and downstream amplification products of each sample were combined and recovered, and finally 26 μL of ultrapure water or eluate was eluted, and the eluate was used for general purpose. Primer amplification.
11)通用引物扩增11) Universal primer amplification
在冰上配置如下反应体系,如表7所示。The following reaction system was placed on ice as shown in Table 7.
表7Table 7
Figure PCTCN2017113231-appb-000009
Figure PCTCN2017113231-appb-000009
PCR反应程序如下:98℃,45s,1循环;(98℃,15s,60℃,30s,72℃,30s)10循环;72℃,1min,1循环;4℃保存。The PCR reaction procedure was as follows: 98 ° C, 45 s, 1 cycle; (98 ° C, 15 s, 60 ° C, 30 s, 72 ° C, 30 s) 10 cycles; 72 ° C, 1 min, 1 cycle; 4 ° C preservation.
12)5μL反应产物2%凝胶电泳检测,结果如图3b所示,表明终文库构建成功。剩余产物1.0X XP回收,最后使用30ul洗脱液洗脱,即为最终上机文库。12) 5 μL of the reaction product was detected by 2% gel electrophoresis, and the results are shown in Fig. 3b, indicating that the final library was successfully constructed. The remaining product was recovered by 1.0X XP and finally eluted with 30 ul of eluent, which was the final library.
13)文库质控后,Illumina NEXTseq CN500进行75PE测序。13) After library control, Illumina NEXTseq CN500 was subjected to 75PE sequencing.
14)测序数据信息分析14) Analysis of sequencing data information
过滤掉低质量测序读长(reads)后,对于每一对pair-end reads根据所连接的分子标签序列和扩增引物序列归类,当reads起始部分序列与实验加入的引物序列和分子标签序列不一致时,reads将被过滤掉。保留下来的pair-end reads用BWA MEM人类参考基因组(hg19)比对,按同起始、同终止位置且分子标签一致的序列聚类到同一组,得到唯一的分子标签组序列,将唯一的分子标签组序列重新比对到参考基因组(BWA MEM),用GATK软件对Indel周围的序列重新比对,用samtools软件进行SNP和小的插入缺失检测,用CNVPanelizer R package进行alpha1(α1)和alpha2(α2)缺失检测。根据位点的唯一的分子标签组序列的总深度和相关SNP位点等位基因(allele)的深度,计算胚胎DNA含量,并对胎儿地贫相关的点突变、INDEL进行基因分型,对于alpha1和alpha2型缺失,采用混合高斯模型进行胎儿基因分型(fetal genotyping)。 After filtering out low-quality sequencing read reads, for each pair of pair reads based on the linked molecular tag sequence and amplification primer sequence, when the reads start sequence and the experimentally added primer sequences and molecular tags When the sequence is inconsistent, the reads will be filtered out. The retained pair-end reads are aligned with the BWA MEM human reference genome (hg19), clustered into the same group by the same starting, same termination position and molecular tag sequence, resulting in a unique molecular tag set sequence, which will be unique. The molecular tag set sequences were re-aligned to the reference genome (BWA MEM), the sequences surrounding Indel were re-aligned using GATK software, SNP and small indels were detected using samtools software, and alpha1 (α1) and alpha2 were performed using CNVPanelizer R package. (α2) deletion detection. Calculate the embryonic DNA content based on the total depth of the unique molecular tag set sequence of the locus and the depth of the associated SNP locus allele, and genotype the fetal thalassemia-related point mutation, INDEL, for alpha1 And alpha2 type deletion, mixed Gaussian model for fetal genotyping.
根据上述实施例,对8例地中海贫血携带者的孕妇进行无创产前地中海贫血基因检测,结果如下表8所示。According to the above examples, non-invasive prenatal thalassemia gene detection was performed on 8 pregnant women with thalassemia carriers, and the results are shown in Table 8 below.
表8Table 8
Figure PCTCN2017113231-appb-000010
Figure PCTCN2017113231-appb-000010
结果显示,本发明实施例对α地中海贫血基因突变无创产前检测和羊水穿刺检测分型一致,表明本发明的方法能够准确、高效地检测α型地贫基因突变,其结果与羊水穿刺检测分型一致,但安全性、无创性和高效性方面明显优于羊水穿刺检测。The results show that the non-invasive prenatal detection and the amniocentesis detection of the α-thalassemia gene mutation are consistent in the embodiment of the present invention, indicating that the method of the present invention can accurately and efficiently detect the alpha-thalassemia gene mutation, and the result and the amniocentesis test score. The type is consistent, but it is significantly better than amniocentesis in terms of safety, non-invasiveness and high efficiency.
以上内容是结合具体的实施方式对本发明所作的进一步详细说明,不能认定本发明的具体实施只局限于这些说明。对于本发明所属技术领域的普通技术人员来说,在不脱离本发明构思的前提下,还可以做出若干简单推演或替换,都应当视为属于本发明的保护范围。 The above is a further detailed description of the present invention in connection with the specific embodiments, and the specific embodiments of the present invention are not limited to the description. It will be apparent to those skilled in the art that the present invention may be made without departing from the spirit and scope of the invention.

Claims (10)

  1. 一种无创产前胎儿α型地贫基因突变检测文库构建方法,所述文库用于通过高通量测序进行无创产前胎儿α型地贫基因突变的检测,其特征在于,所述方法包括:A method for constructing a non-invasive prenatal fetal alpha-type thalassemia gene mutation detection library for detecting non-invasive prenatal fetal alpha-type thalassemia gene mutation by high-throughput sequencing, characterized in that the method comprises:
    (a)对母体外周血游离DNA连接带有特异标签序列和任选的样本标签序列的接头序列;(a) a linker sequence having a specific tag sequence and an optional sample tag sequence for maternal peripheral blood free DNA ligation;
    (b)使用预文库扩增引物序列对上一步的连接产物进行预文库扩增,其中所述预文库扩增引物序列与所述接头序列互补结合;(b) pre-librarian amplification of the ligation product of the previous step using a pre-library amplification primer sequence, wherein the pre-library amplification primer sequence is complementary to the linker sequence;
    (c)将上一步得到的预文库分为上游预文库和下游预文库,分别使用上游特异性引物组1和下游特异性引物组1对所述上游预文库和下游预文库进行特异性扩增,分别得到上游特异性扩增产物1和下游特异性扩增产物1,(c) dividing the pre-library obtained in the previous step into an upstream pre-library and a downstream pre-library, and specifically amplifying the upstream pre-library and the downstream pre-library using the upstream specific primer set 1 and the downstream specific primer set 1, respectively. , obtaining upstream specific amplification product 1 and downstream specific amplification product 1, respectively.
    其中,所述上游特异性引物组1包括用于扩增α型地贫基因的上游引物组1以及用于计算胎儿游离DNA比例的多个SNP位点的上游引物组1;所述下游特异性引物组1包括用于扩增α型地贫基因的下游引物组1以及用于计算胎儿游离DNA比例的多个SNP位点的下游引物组1;Wherein the upstream specific primer set 1 comprises an upstream primer set 1 for amplifying a type alpha thalassemia gene and an upstream primer set 1 for calculating a plurality of SNP sites of a fetal free DNA ratio; the downstream specificity Primer set 1 includes a downstream primer set 1 for amplifying a type alpha thalassemia gene and a downstream primer set 1 for calculating a plurality of SNP sites of a fetal free DNA ratio;
    (d)对所述上游特异性扩增产物1和下游特异性扩增产物1,分别使用上游特异性引物组2和下游特异性引物组2进行特异性扩增,分别得到上游特异性扩增产物2和下游特异性扩增产物2,(d) specifically amplifying the upstream specific amplification product 1 and the downstream specific amplification product 1 using the upstream specific primer set 2 and the downstream specific primer set 2, respectively, to obtain upstream specific amplification, respectively. Product 2 and downstream specific amplification product 2,
    其中,所述上游特异性引物组2包括用于扩增α型地贫基因的上游引物组2以及用于计算胎儿游离DNA比例的多个SNP位点的上游引物组2;所述下游特异性引物组2包括用于扩增α型地贫基因的下游引物组2以及用于计算胎儿游离DNA比例的多个SNP位点的下游引物组2;Wherein the upstream specific primer set 2 comprises an upstream primer set 2 for amplifying the α-thalassemia gene and an upstream primer set 2 for calculating a plurality of SNP sites of a fetal free DNA ratio; the downstream specificity Primer set 2 includes a downstream primer set 2 for amplifying a type alpha thalassemia gene and a downstream primer set 2 for calculating a plurality of SNP sites of a fetal free DNA ratio;
    并且,所述上游特异性引物组2相比所述上游特异性引物组1更靠近对应的扩增目标位点,所述下游特异性引物组2相比所述下游特异性引物组1更靠近对应的扩增目标位点;所述上游特异性引物组2和所述下游特异性引物组2的5’端含有用于高通量测序文库的接头序列;And, the upstream specific primer set 2 is closer to the corresponding amplification target site than the upstream specific primer set 1, and the downstream specific primer set 2 is closer to the downstream specific primer set 1 Corresponding amplification target sites; the upstream specific primer set 2 and the downstream specific primer set 2 have a linker sequence for the high-throughput sequencing library at the 5' end;
    (e)将所述上游特异性扩增产物2和下游特异性扩增产物2混合后,使用两端的通用引物进行扩增,得到可上机测序的文库。(e) After mixing the upstream specific amplification product 2 and the downstream specific amplification product 2, amplification was carried out using universal primers at both ends to obtain a library which can be sequenced by the machine.
  2. 根据权利要求1所述的文库构建方法,其特征在于,所述上游特异性引物组1和下游特异性引物组1的5'端带有用于分离扩增产物的修饰,优选带有生物素修饰。The library construction method according to claim 1, wherein the upstream specific primer set 1 and the downstream specific primer set 1 have a modification at the 5' end thereof for isolating the amplification product, preferably with biotin modification. .
  3. 根据权利要求1所述的文库构建方法,其特征在于,所述上游特异性引物组1中针对特定目标位点的引物的3’端,与所述上游特异性引物组2中针对该目标位点的引物的5’端有10-15个碱基的重合;所述下游特异性引物组1中针对特定目标位点的引物的3’端,与所述下游特异性引物组2中针对该目标位点的引物的5’端有10-15个碱基的重合。 The library construction method according to claim 1, wherein a 3' end of the primer specific to the specific target site in the upstream specific primer set 1 and the target specific position in the upstream specific primer set 2 Point 5's primer has a 10-15 base overlap; the downstream specific primer set 1 has a 3' end for a specific target site, and the downstream specific primer set 2 The 5' end of the primer at the target site has a 10-15 base overlap.
  4. 根据权利要求1-3任一项所述的文库构建方法,其特征在于,所述用于扩增α型地贫基因的上游引物组1序列如SEQ ID NO:1-7所示;The library construction method according to any one of claims 1 to 3, wherein the upstream primer set 1 sequence for amplifying the α-thalassemia gene is as shown in SEQ ID NOS: 1-7;
    所述用于计算胎儿游离DNA比例的多个SNP位点的上游引物组1序列如SEQ ID NO:8-17所示;The upstream primer set 1 sequence of the plurality of SNP sites for calculating the ratio of fetal free DNA is as shown in SEQ ID NOs: 8-17;
    所述用于扩增α型地贫基因的下游引物组1序列如SEQ ID NO:18-23所示;The downstream primer set 1 sequence for amplifying the alpha thalassemia gene is set forth in SEQ ID NOs: 18-23;
    所述用于计算胎儿游离DNA比例的多个SNP位点的下游引物组1序列如SEQ ID NO:24-33所示;The downstream primer set 1 sequence of the plurality of SNP sites for calculating the ratio of fetal free DNA is as shown in SEQ ID NOs: 24-33;
    所述用于扩增α型地贫基因的上游引物组2序列如SEQ ID NO:34-40所示;The upstream primer set 2 sequence for amplifying the alpha thalassemia gene is set forth in SEQ ID NOs: 34-40;
    所述用于计算胎儿游离DNA比例的多个SNP位点的上游引物组2序列如SEQ ID NO:41-50所示;The upstream primer set 2 sequence of the plurality of SNP sites for calculating the ratio of fetal free DNA is as shown in SEQ ID NOs: 41-50;
    所述用于扩增α型地贫基因的下游引物组2序列如SEQ ID NO:51-56所示;The downstream primer set 2 sequence for amplifying the alpha thalassemia gene is set forth in SEQ ID NOs: 51-56;
    所述用于计算胎儿游离DNA比例的多个SNP位点的下游引物组2序列如SEQ ID NO:57-66所示。The downstream primer set 2 sequence of the plurality of SNP sites for calculating the ratio of fetal free DNA is shown in SEQ ID NOs: 57-66.
  5. 一种无创产前胎儿α型地贫基因突变检测方法,其特征在于,所述方法包括对权利要求1-4任一项构建的文库进行高通量测序得到测序读长;然后,根据位点的唯一的特异标签序列的总深度和突变等位基因的深度,计算胎源DNA含量并对地贫相关的突变进行分型以分析α型地贫基因突变情况。A non-invasive prenatal fetal alpha thalassemia gene mutation detecting method, characterized in that the method comprises performing high-throughput sequencing of the library constructed according to any one of claims 1 to 4 to obtain a sequencing read length; and then, according to the site The total depth of the unique specific tag sequence and the depth of the mutant allele, the fetal DNA content was calculated and the thalassemia-related mutation was typed to analyze the alpha-thalassemia gene mutation.
  6. 一种无创产前胎儿α型地贫基因突变检测文库构建试剂盒,所述文库用于通过高通量测序进行无创产前胎儿α型地贫基因突变的检测,其特征在于,所述试剂盒包括:A non-invasive prenatal fetal alpha thalassemia gene mutation detection library construction kit for detecting non-invasive prenatal fetal alpha-type thalassemia gene mutation by high-throughput sequencing, characterized in that the kit include:
    (a)上游特异性引物组1和下游特异性引物组1,分别用于对上游预文库和下游预文库进行特异性扩增,以分别得到上游特异性扩增产物1和下游特异性扩增产物1,(a) upstream specific primer set 1 and downstream specific primer set 1 for specific amplification of upstream pre-library and downstream pre-library, respectively, to obtain upstream specific amplification product 1 and downstream specific amplification, respectively Product 1,
    其中,所述上游特异性引物组1包括用于扩增α型地贫基因的上游引物组1以及用于计算胎儿游离DNA比例的多个SNP位点的上游引物组1;所述下游特异性引物组1包括用于扩增α型地贫基因的下游引物组1以及用于计算胎儿游离DNA比例的多个SNP位点的下游引物组1;Wherein the upstream specific primer set 1 comprises an upstream primer set 1 for amplifying a type alpha thalassemia gene and an upstream primer set 1 for calculating a plurality of SNP sites of a fetal free DNA ratio; the downstream specificity Primer set 1 includes a downstream primer set 1 for amplifying a type alpha thalassemia gene and a downstream primer set 1 for calculating a plurality of SNP sites of a fetal free DNA ratio;
    (b)上游特异性引物组2和下游特异性引物组2,分别用于对所述上游特异性扩增产物1和下游特异性扩增产物1进行特异性扩增,以分别得到上游特异性扩增产物2和下游特异性扩增产物2,(b) upstream specific primer set 2 and downstream specific primer set 2 for specifically amplifying the upstream specific amplification product 1 and downstream specific amplification product 1 respectively to obtain upstream specificity, respectively Amplification product 2 and downstream specific amplification product 2,
    其中,所述上游特异性引物组2包括用于扩增α型地贫基因的上游引物组2以及用于计算胎儿游离DNA比例的多个SNP位点的上游引物组2;所述下游特异性引物组2包括用于扩增α型地贫基因的下游引物组2以及用于计算胎儿游离DNA比例的多个SNP位点的下游 引物组2;Wherein the upstream specific primer set 2 comprises an upstream primer set 2 for amplifying the α-thalassemia gene and an upstream primer set 2 for calculating a plurality of SNP sites of a fetal free DNA ratio; the downstream specificity Primer set 2 includes a downstream primer set 2 for amplifying the alpha thalassemia gene and a downstream of a plurality of SNP sites for calculating the ratio of fetal free DNA. Primer set 2;
    并且,所述上游特异性引物组2相比所述上游特异性引物组1更靠近对应的扩增目标位点,所述下游特异性引物组2相比所述下游特异性引物组1更靠近对应的扩增目标位点;所述上游特异性引物组2和所述下游特异性引物组2的5’端含有用于高通量测序文库的接头序列。And, the upstream specific primer set 2 is closer to the corresponding amplification target site than the upstream specific primer set 1, and the downstream specific primer set 2 is closer to the downstream specific primer set 1 Corresponding amplification target sites; the upstream specific primer set 2 and the 5' end of the downstream specific primer set 2 contain a linker sequence for a high throughput sequencing library.
  7. 根据权利要求6所述的文库构建试剂盒,其特征在于,所述上游特异性引物组1和下游特异性引物组1的5'端带有用于分离扩增产物的修饰,优选带有生物素修饰。The library construction kit according to claim 6, wherein the upstream specific primer set 1 and the downstream specific primer set 1 have a modification at the 5' end thereof for isolating the amplification product, preferably with biotin Modification.
  8. 根据权利要求6所述的文库构建试剂盒,其特征在于,所述上游特异性引物组1中针对特定目标位点的引物的3’端,与所述上游特异性引物组2中针对该目标位点的引物的5’端有10-15个碱基的重合;所述下游特异性引物组1中针对特定目标位点的引物的3’端,与所述下游特异性引物组2中针对该目标位点的引物的5’端有10-15个碱基的重合。The library construction kit according to claim 6, wherein the upstream specific primer set 1 has a 3' end for a specific target site, and the upstream specific primer set 2 targets the target The 5' end of the primer of the site has a 10-15 base overlap; the 3' end of the primer of the downstream specific primer set 1 for a specific target site, and the downstream specific primer set 2 The 5' end of the primer at the target site has a 10-15 base overlap.
  9. 根据权利要求6-8任一项所述的文库构建试剂盒,其特征在于,所述用于扩增α型地贫基因的上游引物组1序列如SEQ ID NO:1-7所示;The library construction kit according to any one of claims 6 to 8, wherein the upstream primer set 1 sequence for amplifying the alpha thalassemia gene is as shown in SEQ ID NOS: 1-7;
    所述用于计算胎儿游离DNA比例的多个SNP位点的上游引物组1序列如SEQ ID NO:8-17所示;The upstream primer set 1 sequence of the plurality of SNP sites for calculating the ratio of fetal free DNA is as shown in SEQ ID NOs: 8-17;
    所述用于扩增α型地贫基因的下游引物组1序列如SEQ ID NO:18-23所示;The downstream primer set 1 sequence for amplifying the alpha thalassemia gene is set forth in SEQ ID NOs: 18-23;
    所述用于计算胎儿游离DNA比例的多个SNP位点的下游引物组1序列如SEQ ID NO:24-33所示;The downstream primer set 1 sequence of the plurality of SNP sites for calculating the ratio of fetal free DNA is as shown in SEQ ID NOs: 24-33;
    所述用于扩增α型地贫基因的上游引物组2序列如SEQ ID NO:34-40所示;The upstream primer set 2 sequence for amplifying the alpha thalassemia gene is set forth in SEQ ID NOs: 34-40;
    所述用于计算胎儿游离DNA比例的多个SNP位点的上游引物组2序列如SEQ ID NO:41-50所示;The upstream primer set 2 sequence of the plurality of SNP sites for calculating the ratio of fetal free DNA is as shown in SEQ ID NOs: 41-50;
    所述用于扩增α型地贫基因的下游引物组2序列如SEQ ID NO:51-56所示;The downstream primer set 2 sequence for amplifying the alpha thalassemia gene is set forth in SEQ ID NOs: 51-56;
    所述用于计算胎儿游离DNA比例的多个SNP位点的下游引物组2序列如SEQ ID NO:57-66所示。The downstream primer set 2 sequence of the plurality of SNP sites for calculating the ratio of fetal free DNA is shown in SEQ ID NOs: 57-66.
  10. 根据权利要求6-8任一项所述的文库构建试剂盒,其特征在于,所述试剂盒还包括:The library construction kit according to any one of claims 6 to 8, wherein the kit further comprises:
    (c)接头序列,其带有特异标签序列和任选的样本标签序列,用于连接母体外周血游离DNA,以得到连接产物;(c) a linker sequence with a specific tag sequence and an optional sample tag sequence for ligation of maternal peripheral blood free DNA to obtain a ligation product;
    (d)预文库扩增引物序列,其与所述接头序列互补结合,用于对所述连接产物进行预文库扩增;(d) a pre-library amplification primer sequence that binds complementary to the linker sequence for pre-library amplification of the ligation product;
    (e)通用引物,用于在所述上游特异性扩增产物2和下游特异性扩增产物2混合后,对混合产物进行扩增,以得到可上机测序的文库; (e) a universal primer for amplifying the mixed product after mixing the upstream specific amplification product 2 and the downstream specific amplification product 2 to obtain a library which can be sequenced on the machine;
    优选地,Preferably,
    所述接头序列如SEQ ID NO:67-68所示;The linker sequence is set forth in SEQ ID NOs: 67-68;
    所述预文库扩增引物序列如SEQ ID NO:69-70所示;The pre-library amplification primer sequences are set forth in SEQ ID NOs: 69-70;
    所述通用引物如SEQ ID NO:71-72所示。 The universal primers are set forth in SEQ ID NOs: 71-72.
PCT/CN2017/113231 2017-01-19 2017-11-28 CONSTRUCTION METHOD, DETECTION METHOD AND KIT FOR NON-INVASIVE PRENATAL FETAL α-THALASSEMIA GENE MUTATION DETECTION LIBRARY WO2018133546A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710044343.1A CN106755485A (en) 2017-01-19 2017-01-19 The poor detection in Gene Mutation library constructing method in noninvasive prenatal foetal α types ground, detection method and kit
CN201710044343.1 2017-01-19

Publications (1)

Publication Number Publication Date
WO2018133546A1 true WO2018133546A1 (en) 2018-07-26

Family

ID=58945126

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/113231 WO2018133546A1 (en) 2017-01-19 2017-11-28 CONSTRUCTION METHOD, DETECTION METHOD AND KIT FOR NON-INVASIVE PRENATAL FETAL α-THALASSEMIA GENE MUTATION DETECTION LIBRARY

Country Status (2)

Country Link
CN (1) CN106755485A (en)
WO (1) WO2018133546A1 (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106755485A (en) * 2017-01-19 2017-05-31 人和未来生物科技(长沙)有限公司 The poor detection in Gene Mutation library constructing method in noninvasive prenatal foetal α types ground, detection method and kit
CN107099609A (en) * 2017-06-19 2017-08-29 东莞市妇幼保健院 A kind of droplet type digital pcr detects the genetic analysis method of low ratio trace target gene
CN108796061A (en) * 2017-11-07 2018-11-13 深圳华大临床检验中心有限公司 For the primer sets of thalassaemia mutations type genetic test, kit, its application and library constructing method
WO2020113577A1 (en) * 2018-12-07 2020-06-11 深圳华大生命科学研究院 Method for constructing target gene library, detection device and application thereof
CN113012759B (en) * 2020-12-09 2022-08-12 人和未来生物科技(长沙)有限公司 Method for calculating cffDNA content of male fetus based on X chromosome
CN112831555B (en) * 2021-02-01 2022-12-20 人和未来生物科技(长沙)有限公司 Reference substance for detecting thalassemia gene and preparation method and application thereof

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014029093A1 (en) * 2012-08-23 2014-02-27 深圳华大基因科技有限公司 Method and system for determining whether individual is in abnormal state
CN104232777A (en) * 2014-09-19 2014-12-24 天津华大基因科技有限公司 Method and device for simultaneously determining fetal nucleic acid content and aneuploidy of chromosome
CN104846089A (en) * 2015-05-06 2015-08-19 厦门万基生物科技有限公司 Quantitative method for free fetal DNA (deoxyribonucleic acid) proportion in maternal peripheral blood
CN104894279A (en) * 2015-06-25 2015-09-09 北京嘉宝仁和医疗科技有限公司 Test kit for alpha-thalassemia gene mutations
CN106755485A (en) * 2017-01-19 2017-05-31 人和未来生物科技(长沙)有限公司 The poor detection in Gene Mutation library constructing method in noninvasive prenatal foetal α types ground, detection method and kit

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103421903B (en) * 2013-08-14 2015-02-18 邯郸市康业生物科技有限公司 Alpha-thalassemia screening kit and application thereof in prenatal screening
CN105861724B (en) * 2016-06-03 2019-07-16 人和未来生物科技(长沙)有限公司 A kind of KRAS gene ultralow frequency mutation detection kit

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014029093A1 (en) * 2012-08-23 2014-02-27 深圳华大基因科技有限公司 Method and system for determining whether individual is in abnormal state
CN104232777A (en) * 2014-09-19 2014-12-24 天津华大基因科技有限公司 Method and device for simultaneously determining fetal nucleic acid content and aneuploidy of chromosome
CN104846089A (en) * 2015-05-06 2015-08-19 厦门万基生物科技有限公司 Quantitative method for free fetal DNA (deoxyribonucleic acid) proportion in maternal peripheral blood
CN104894279A (en) * 2015-06-25 2015-09-09 北京嘉宝仁和医疗科技有限公司 Test kit for alpha-thalassemia gene mutations
CN106755485A (en) * 2017-01-19 2017-05-31 人和未来生物科技(长沙)有限公司 The poor detection in Gene Mutation library constructing method in noninvasive prenatal foetal α types ground, detection method and kit

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"Non-official translation: Applied research of non-invasive prenatal diagnostic test for beta-thalassaemia using second generation sequencing technology on maternal plasma to track fetal free DNA", CHINA OUTSTANDING DOCTORAL DISSERTATIONS FULL-TEXT DATABASE, MEDICINE AND HEALTH TECHNOLOGY SERIES, 15 January 2015 (2015-01-15) *

Also Published As

Publication number Publication date
CN106755485A (en) 2017-05-31

Similar Documents

Publication Publication Date Title
WO2018133546A1 (en) CONSTRUCTION METHOD, DETECTION METHOD AND KIT FOR NON-INVASIVE PRENATAL FETAL α-THALASSEMIA GENE MUTATION DETECTION LIBRARY
JP6585117B2 (en) Diagnosis of fetal chromosomal aneuploidy
EP3329010B1 (en) Nucleic acids and methods for detecting chromosomal abnormalities
Varley et al. Dynamic DNA methylation across diverse human cell lines and tissues
Morgan et al. Genetic diagnosis of familial breast cancer using clonal sequencing
AU2020216438A1 (en) Compositions and methods for isolating cell-free DNA
JP2020504606A (en) Methods and systems for analyzing nucleic acid molecules
CN107475375A (en) A kind of DNA probe storehouse, detection method and kit hybridized for microsatellite locus related to microsatellite instability
KR102605749B1 (en) Circulating rna signatures specific to preeclamsia
CN108998508B (en) Construction method of amplicon sequencing library, primer group and kit
WO2018133547A1 (en) METHOD FOR CONSTRUCTING LIBRARY FOR NON-INVASIVE PRENATAL FETAL β-THALASSEMIA GENE MUTATION DETECTION, DETECTION METHOD AND KIT
TWI467020B (en) Method of detecting dmd gene exon deletion and/or repeated
WO2017202389A1 (en) Adapter suitable for ultra-trace dna sequencing, and application thereof
CN103571822B (en) A kind of multipurpose DNA fragmentation enriching method analyzed for new-generation sequencing
WO2023284768A1 (en) Fusion primer direct amplification method-based human mitochondrial whole genome high-throughput sequencing kit
CN105734679A (en) Preparation method of nucleic acid target sequence capture sequencing library
CN103998625B (en) For the method and system of Viral diagnosis
CN105803054A (en) Kit and use thereof in detection of orofacial clefts related genes
CN106755484A (en) The poor detection in Gene Mutation library constructing method in noninvasive prenatal foetal α SEA types ground, detection method and kit
CN109735611A (en) A kind of assortment of genes for detecting bone marrow failure syndrome, primer library, the method and its application for constructing high-throughput sequencing library
CN112592972B (en) Early screening method and kit for diffuse toxic goiter susceptibility genes
TWI579722B (en) Method for assessing the risk of adverse drug reaction and device thereof
CN101871002B (en) Kit for antenatally screening trisomy 21 syndrome noninvasively
CN109609694B (en) Kit and method for detecting hepatitis B typing and multiple drug-resistant sites based on Illumina sequencing technology
US20220145368A1 (en) Methods for noninvasive prenatal testing of fetal abnormalities

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17892638

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205N DATED 07/11/2019)

122 Ep: pct application non-entry in european phase

Ref document number: 17892638

Country of ref document: EP

Kind code of ref document: A1