WO2020113577A1 - Method for constructing target gene library, detection device and application thereof - Google Patents

Method for constructing target gene library, detection device and application thereof Download PDF

Info

Publication number
WO2020113577A1
WO2020113577A1 PCT/CN2018/119906 CN2018119906W WO2020113577A1 WO 2020113577 A1 WO2020113577 A1 WO 2020113577A1 CN 2018119906 W CN2018119906 W CN 2018119906W WO 2020113577 A1 WO2020113577 A1 WO 2020113577A1
Authority
WO
WIPO (PCT)
Prior art keywords
seq
primer
upstream
primer pair
target gene
Prior art date
Application number
PCT/CN2018/119906
Other languages
French (fr)
Chinese (zh)
Inventor
杨林
张海萍
高雅
陈芳
Original Assignee
深圳华大生命科学研究院
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳华大生命科学研究院 filed Critical 深圳华大生命科学研究院
Priority to PCT/CN2018/119906 priority Critical patent/WO2020113577A1/en
Priority to CN201880098877.5A priority patent/CN112996926A/en
Publication of WO2020113577A1 publication Critical patent/WO2020113577A1/en

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids

Definitions

  • the present application relates to the field of gene detection, in particular to a method for constructing a target gene library, a detection device and its application.
  • Prenatal diagnosis is an important means to prevent birth defects caused by genetic diseases.
  • invasive sampling methods such as chorionic sampling, amniotic fluid puncture, and umbilical cord blood puncture to obtain fetal genetic information. Therefore, developing a non-invasive prenatal detection method has always been an important research direction.
  • the research direction of scientific researchers focused on how to obtain fetal nucleated red blood cells from the peripheral blood of pregnant women for genetic analysis.
  • how to efficiently separate and enrich fetal nucleated red blood cells has been a major bottleneck in the development of this technology.
  • Lu Yuming published a scientific paper in "lancet”, which confirmed the presence of free DNA from the fetus in the plasma of pregnant women by PCR amplification of Y chromosome-specific genes; Blood testing for fetal genotypes provides new possibilities.
  • the widely used detection technology based on whole genome sequencing now provides an important reference for the screening of children with Down's disease, and also avoids the birth of countless children with Down's disease.
  • the existing technology can only detect chromosomal abnormalities, and cannot detect other single gene mutations.
  • the proportion of fetal defects caused by chromosomal abnormalities is less than 4% of the total number of congenital irritating and disabled children.
  • a considerable number of defective fetuses are not caused by abnormal chromosome numbers, but by single gene mutations.
  • Diseases caused by single gene mutations are important diseases that affect children's health and quality of life, and the vast majority cannot be detected in early pregnancy.
  • B-imaging can find abnormalities, it can only be found in the middle or late pregnancy, which brings a heavy blow to pregnant women and their families. After birth, most of the defective fetuses have physiological abnormalities, and it is difficult to take care of themselves. It also brings great pain to the family.
  • the purpose of the present application is to provide a new method for constructing a target gene library, a fetal gene detection method, a fetal gene detection device, a fetal gene detection reagent and their applications based on the library construction method of the present application.
  • the first aspect of the present application discloses a method for constructing a target gene library, including the following steps: extracting free RNA from a biological sample derived from a pregnant woman, the free RNA including fetal free RNA; reverse transcription of the extracted free RNA to generate a first Strand cDNA; the first strand cDNA is amplified with specific primers to obtain specific amplification products; wherein, the specific primers can specifically amplify the target gene, the target gene is the differentially expressed gene of the fetus and the mother, and the specific primer
  • the sequence of the 5′ end of the upstream primer and/or the downstream primer is the same as at least a part of the sequence of the universal primer; using the universal primer to perform a second round of PCR amplification on the specific amplification product to obtain the target gene library.
  • the differentially expressed genes are genes that are specifically expressed by the fetus, but are hardly expressed by the mother.
  • the differentially expressed genes are at least one of the genes shown in Table 1.
  • the differentially expressed genes refer to genes corresponding to RNAs derived from maternal and fetal and released into the peripheral blood of pregnant women with significantly different expression levels.
  • the differentially expressed gene may be a gene that is specifically expressed by the fetus, but is rarely expressed by the maternal or expressed in a very low amount.
  • the differentially expressed genes may be any one or more of the genes shown in Table 1. This application is to use the differential expression of the fetus and the mother to construct the fetal target gene library, and then realize the subsequent detection of the fetal gene information.
  • sequence of the 5'end of the upstream primer and/or the downstream primer of the specific primer is the same as at least a part of the sequence of the universal primer, so that the subsequent universal primer can bind to the end of the specific amplification product, thereby achieving specific amplification
  • the amplified products were subjected to the second round of PCR amplification.
  • fetal free RNA especially free mRNA (abbreviated cfmRNA). Therefore, the present application creatively constructs a fetal target gene library for free RNA extracted from pregnant women samples.
  • the fetal target gene library constructed by this application can directly detect fetal specific cfmRNA expressed in the plasma or urine of pregnant women, and then infer the genotype of the fetus, which solves the problem that a large amount of maternal background is carried in free DNA causess problems that mutation detection cannot judge.
  • cfmRNA is a complete transcript sequence obtained after DNA translation. It removes a lot of intron redundant information, which can reflect the structural variation of DNA in exons. These variations are difficult to detect at the DNA level.
  • the target gene library construction of the present application uses cfmRNA instead of cfDNA for subsequent noninvasive detection of genetic disease-related mutations, which can detect certain recessive genetic disease-related mutations of the fetus, and can also detect the structure of certain genes Mutations, which are difficult to detect at the cfDNA level.
  • the biological sample is peripheral blood or urine.
  • target gene library construction method of the present application is in principle applicable to all pregnant women samples containing fetal free RNA, including but not limited to urine and peripheral blood plasma.
  • reverse transcription of the extracted free RNA to generate first-strand cDNA specifically includes reverse transcription of the free RNA using random primers to obtain first-strand cDNA.
  • the random primer is an N6 random primer.
  • N6 random primers to generate the first-strand cDNA is only a specific scheme adopted in an implementation manner of this application, and it is not excluded that other methods for generating the first-strand cDNA can also be used, which is not specifically limited herein.
  • the specific primer pair is at least one of the first primer pair to the sixth primer pair; the upstream and downstream primers of the first primer pair are SEQ ID NO. 1 and SEQ respectively
  • the upstream and downstream primers of the second primer pair are the sequences shown in SEQ ID NO. 3 and SEQ ID NO. 4, respectively, and the upstream and downstream primers of the third primer pair are SEQ ID NO.
  • the six primer pairs of the sequences shown in SEQ ID NO. 1 to SEQ ID NO. 12 are only the specific primers for amplifying common mutation sites of the FGA gene in one implementation of the present application.
  • the primer can be adjusted appropriately, such as adding or subtracting a few bases at the 5'end or 3'end, or adding a special purpose at the 5'end Nucleic acid sequence, for example, the six primer pairs of the present application add a part of the linker sequence at the 5'end, so as to facilitate subsequent amplification using universal primers; on the other hand, in the specific use process, according to the specific detected mutation
  • One or more primer pairs are used for the site selection. Even for some special mutation sites that are not covered by the six primer pairs in this application, primer pairs can be designed by themselves, which is not specifically limited here.
  • the specific primers include at least one of primer pair 1 to primer pair 7, and the upstream and downstream primers of primer pair 1 are SEQ ID NO. 17 and SEQ ID NO. 18 respectively.
  • the sequence of primer pair two is the sequence shown in SEQ ID NO.19 and SEQ ID NO.20
  • the primer pair three is the sequence shown in SEQ ID NO.21 and SEQ ID NO.22
  • the upstream and downstream primers of primer pair 4 are the sequences shown in SEQ ID NO.23 and SEQ ID NO. 24, respectively.
  • the upstream and downstream primers of primer pair 5 are the sequences shown in SEQ ID NO. 25 and SEQ ID NO. 26, respectively.
  • the upstream and downstream primers for pair six are the sequences shown in SEQ ID NO.27 and SEQ ID NO.28, respectively, and the upstream and downstream primers for primer pair seven are the sequences shown in SEQ ID NO.29 and SEQ ID NO.30.
  • the seven primer pairs of the sequences shown in SEQ ID NO. 17 to SEQ ID NO. 30 are only the specific primers for amplifying common mutation sites of the HESX1 gene in one implementation of this application; , Without affecting the amplification effect or covering the mutation site, you can adjust the primer appropriately; on the other hand, in the specific use process, you can choose to use one or more of the mutation sites according to the specific detection One primer pair, even at some special mutation sites that are not covered by the seven primer pairs of this application, you can also design your own primer pair, which is not specifically limited here.
  • reference primers are used to amplify the first-strand cDNA, and the reference primers can specifically amplify the housekeeper gene.
  • the target region of the housekeeping gene is amplified, on the one hand, it can be used as a reference for the success of the experiment; on the other hand, it can be used as a reference for sample quantification.
  • the housekeeping gene is the GAPDH gene.
  • the upstream and downstream primers of the primers that amplify the housekeeping gene GAPDH are the sequences shown in SEQ ID NO. 13 and SEQ ID NO. 14, respectively.
  • primers of the sequences shown in SEQ ID NO. 13 and SEQ ID NO. 14 are only the primers used to amplify the housekeeping gene GAPDH used in one implementation of this application, and it is not excluded that other primers can also be designed or used , As long as it does not affect the target gene amplification.
  • the universal primer includes a first primer and a second primer, the first primer is the sequence shown in SEQ ID NO.15, and the second primer is the sequence shown in SEQ ID NO.16. Also, preferably, the 5'end of the first primer has a phosphorylation modification.
  • the purpose of the universal primer is to further amplify and enrich the PCR amplification product, so as to obtain a sequencing library that can be used for sequencing; wherein, the 5'end of the first primer has a phosphorylation modification, and its purpose is to It is convenient for the subsequent circularization of single-stranded DNA. It can be understood that the specific sequence of the universal primer can be designed according to different experiments or sequencing platforms.
  • the double-stranded nucleic acid amplification product is denatured into single-stranded nucleic acid, and a nucleic acid-mediated fragment is used
  • the single-stranded nucleic acid is connected into a circular single-stranded nucleic acid, thereby obtaining a circular single-stranded target gene library, in which the nucleic acid-mediated fragment can be combined with the two ends of the single-stranded nucleic acid through the base complementary pairing principle.
  • the target gene library can be adaptively processed.
  • sequencing using the Illumina sequencing platform requires that the library on the computer is a double-stranded nucleic acid library, that is, the nucleic acid double-stranded amplification product obtained after the second round of PCR amplification is the library used on the computer;
  • use BGI or MGI sequencing platform for sequencing requires that the library on the machine is a circular single-stranded nucleic acid library, then the double-stranded nucleic acid amplification product obtained after the second round of PCR amplification needs to be denatured into single-stranded nucleic acid, and through the mediated fragment
  • the single-stranded nucleic acid is connected into a circular single-stranded nucleic acid, and the sequencing can only be performed after the circular single-stranded nucleic acid library is finally obtained.
  • the second aspect of the present application discloses a method for detecting fetal genes, including the following steps:
  • the method of the first aspect of the present application is used to construct a target gene library to obtain a target gene library; sequence the target gene library to obtain sequencing results composed of multiple sequencing data; analyze the sequencing results to obtain fetal gene information.
  • the fetal gene detection method of the present application is actually based on the method of constructing the target gene library of the present application, and further sequencing the constructed target gene library to realize fetal gene detection.
  • a high-throughput sequencing method is used to sequence the constructed target gene library to obtain mutation information, fetal genotype information, and gene structure variation information including recessive genetic single gene disease. Fetal genetic information.
  • the fetal gene detection method of the present application may be a detection method for non-diagnostic purposes, for example, the fetal genotype information is obtained by the detection method, so as to predict the phenotype that the fetus may appear, when the genotype information does not directly correspond to the disease In the relationship, the predicted phenotype information is not directly related to the disease.
  • the analysis of the sequencing results specifically includes the steps of filtering the sequencing data; comparing the filtered sequencing data to the reference genome, retaining the sequencing data of the unique reference genome; sequencing based on the unique comparison of the reference genome Data, statistics of the base distribution of the target gene, to obtain mutation information of the target gene, and then obtain the genetic information of the fetus.
  • the analysis of the sequencing results further includes comparing the mutation information of the target gene with the disease database to obtain mutation information of the recessive genetic single gene disease of the fetus.
  • fetal gene detection method of the present application such as the type of biological sample, the first strand cDNA generation, etc., reference may be made to the method of constructing a target gene library in the first aspect of the present application, which is not described here.
  • all or part of the steps can be realized by a unique mechanical device, so as to realize automatic operation.
  • the third aspect of the present application discloses a fetal gene detection device, which includes a free RNA extraction module, a reverse transcription module, a target gene amplification module, a target gene library generation module, a sequencing module, and an analysis module; a free RNA extraction module, It is used to extract free RNA from biological samples derived from pregnant women.
  • the free RNA contains fetal free RNA;
  • the reverse transcription module is used to reverse transcribe the extracted free RNA to generate first-strand cDNA;
  • the target gene amplification module is used to adopt specific Sexual primers amplify the first-strand cDNA to obtain specific amplification products.
  • the specific primers can specifically amplify the target gene.
  • the target gene is the differentially expressed gene of the fetus and the mother.
  • the upstream primer and/or downstream of the specific primer is the same as at least a part of the sequence of the universal primer;
  • the target gene library generation module is used to perform a second round of PCR amplification on the specific amplification product using the universal primer to obtain the target gene library;
  • the sequencing module is used to Sequencing the target gene library to obtain sequencing results composed of multiple sequencing data; analysis module for analyzing the sequencing results to obtain fetal genetic information.
  • the biological sample involved in the free RNA extraction module is peripheral blood or urine.
  • the reverse transcription module reverse transcribes the extracted free RNA to generate first-strand cDNA, which specifically includes performing reverse transcription of the free RNA using random primers to obtain the first-strand cDNA; preferably, random
  • the primers are N6 random primers.
  • the differentially expressed genes involved in the target gene amplification module are genes that are specifically expressed by the fetus but hardly expressed by the mother.
  • the differentially expressed genes are at least one of the genes shown in Table 1.
  • the genetic information of the fetus includes at least one of mutation information of recessive genetic single gene disease, fetal genotype information, and genetic structure variation information.
  • the analysis module further includes a filtering unit, a reference genome comparison unit and a statistical unit; a filtering unit is used to filter sequencing data; a reference genome comparison unit is used to filter The sequencing data is compared to the reference genome, and the unique comparison of the sequencing data of the reference genome is retained; the statistical unit is used to calculate the base distribution of the target gene based on the sequencing data of the unique comparison of the reference genome to obtain mutation information of the target gene, and then Get the genetic information of the fetus.
  • a filtering unit is used to filter sequencing data
  • a reference genome comparison unit is used to filter The sequencing data is compared to the reference genome, and the unique comparison of the sequencing data of the reference genome is retained
  • the statistical unit is used to calculate the base distribution of the target gene based on the sequencing data of the unique comparison of the reference genome to obtain mutation information of the target gene, and then Get the genetic information of the fetus.
  • the analysis module further includes a disease database comparison unit, and the disease database comparison unit is used to compare target gene mutation information with the disease database to obtain mutation information of the recessive genetic single gene disease of the fetus .
  • the fourth aspect of the present application discloses the method of constructing the target gene library of the first aspect of the present application, the fetal gene detection method of the second aspect of the present application or the fetal gene detection device of the third aspect of the present application in the detection of recessively inherited single gene diseases , Or the application of structural variation detection.
  • a fifth aspect of the present application discloses a reagent for detecting fetal FGA gene
  • the reagent includes a specific primer for amplifying FGA gene
  • the specific primer includes at least one of the first primer pair to the sixth primer pair
  • the upstream and downstream primers of the first primer pair are the sequences shown in SEQ ID NO. 1 and SEQ ID NO. 2, respectively
  • the upstream and downstream primers of the second primer pair are SEQ ID NO. 3 and SEQ ID NO. 4.
  • Sequence, the upstream and downstream primers of the third primer pair are the sequences shown in SEQ ID NO.5 and SEQ ID NO.6, and the upstream and downstream primers of the fourth primer pair are SEQ ID NO.7 and SEQ ID NO.8, respectively.
  • the upstream and downstream primers of the fifth primer pair are the sequences shown in SEQ ID NO. 9 and SEQ ID NO. 10, respectively, and the upstream and downstream primers of the sixth primer pair are SEQ ID NO. 11 and SEQ ID NO. 12 respectively. ⁇ SEQ.
  • the reagent of the present application its six primer pairs can amplify and cover the common mutation sites of the FGA gene; therefore, the reagent of the present application can be directly used for FGA gene mutation detection on the one hand, on the other hand, The corresponding target fragments can be amplified and enriched. It can be understood that the reagents of the present application can be used for the detection method of the fetal FGA gene of the present application, and other FGA gene detection methods based on PCR amplification of target fragments can also use the reagents of the present application, which are not specifically limited herein.
  • the reagent of the present application further includes a reference primer for amplifying the housekeeping gene GAPDH, and the upstream and downstream primers of the reference primer are the sequences shown in SEQ ID NO. 13 and SEQ ID NO. 14, respectively.
  • each primer pair can be packaged separately, and the primer mixture can be configured according to the specific use situation, which is not specifically limited here. It can be understood that if it is not necessary to amplify the housekeeping gene, the corresponding reference primer pair may not be used.
  • the reagent of the present application further includes a universal primer for constructing a sequencing library.
  • the universal primer includes a first primer and a second primer.
  • the first primer is the sequence shown in SEQ ID NO.15
  • the second primer is SEQ ID NO.
  • the sequence shown in 16; and, preferably, the 5'end of the first primer has a phosphorylation modification.
  • a sixth aspect of the present application discloses a kit for fetal FGA gene detection, which includes the reagent for fetal FGA gene detection in the fifth aspect of the present application.
  • the kit further includes at least one of free RNA extraction reagents, reverse transcription reagents, PCR amplification reagents and nucleic acid purification reagents.
  • the specific primer pairs of the FGA gene of the present application can be used to construct universal primers, or even involved in the fetal FGA gene detection method of the present application
  • the various reagents are assembled into a kit to facilitate the detection of fetal FGA genes; of course, free RNA extraction reagents, reverse transcription reagents, PCR amplification reagents, nucleic acid purification reagents, etc. can also use conventional laboratory reagents, here No specific restrictions.
  • the seventh aspect of the present application discloses a reagent for detecting fetal HESX1 gene, including a specific primer for amplifying the HESX1 gene, the specific primer includes at least one of primer pair 1 to primer pair 7, primer pair 1
  • the upstream and downstream primers are the sequences shown in SEQ ID NO.17 and SEQ ID NO.18, respectively.
  • the upstream and downstream primers of primer pair 2 are the sequences shown in SEQ ID NO.19 and SEQ ID NO.20, and the primer pair three is The downstream primers are the sequences shown in SEQ ID NO.21 and SEQ ID NO.22 respectively, the upstream and downstream primers of primer pair 4 are the sequences shown in SEQ ID NO.23 and SEQ ID NO.24, and the upstream and downstream primers of primer pair V are respectively They are the sequences shown in SEQ ID NO.25 and SEQ ID NO. 26 respectively, the upstream and downstream primers of primer pair 6 are the sequences shown in SEQ ID NO.27 and SEQ ID NO. 28, and the upstream and downstream primers of primer pair 7 are respectively The sequence shown in SEQ ID NO.29 and SEQ ID NO.30.
  • the reagent of the present application its seven primer pairs can amplify and cover the common mutation sites of the HESX1 gene; therefore, the reagent of the present application can be directly used for HESX1 gene mutation detection on the one hand, and can also be used for the corresponding The target fragments are amplified and enriched. It can be understood that the reagents of the present application can be used for the HESX1 gene detection method of the present application, and other HESX1 gene detection methods based on PCR amplification of target fragments can also use the reagents of the present application, which are not specifically limited herein.
  • the reagent of the present application further includes a reference primer for amplifying the housekeeping gene GAPDH, and the upstream and downstream primers of the reference primer are the sequences shown in SEQ ID NO. 13 and SEQ ID NO. 14, respectively.
  • the reagent of the present application further includes a universal primer for constructing a sequencing library.
  • the universal primer includes a first primer and a second primer.
  • the first primer is the sequence shown in SEQ ID NO.15
  • the second primer is SEQ ID NO.
  • the sequence shown in 16; and, preferably, the 5'end of the first primer has a phosphorylation modification.
  • the packaging format of the primer and whether the reference primer is used are the reagents for detecting the FGA gene of the fetus according to the fifth aspect of the present application, which are not repeated here.
  • specific primer pairs of the HESX1 gene of the present application housekeeping gene GAPDH amplification primer pairs and/or libraries can also be used to construct universal primers, and even each of the methods involved in the fetal HESX1 gene detection method of the present application
  • Reagents, assembled into a kit to facilitate the detection of fetal HESX1 gene can also use conventional laboratory reagents, do not do here Specific restrictions.
  • the eighth aspect of the present application discloses a kit for fetal HESX1 gene detection, which includes the reagent for fetal HESX1 gene detection in the seventh aspect of the present application.
  • the kit further includes at least one of free RNA extraction reagents, reverse transcription reagents, PCR amplification reagents and nucleic acid purification reagents.
  • kits the packaging form of various reagents in the kit is the same as the kit for detecting fetal FGA gene in the sixth aspect of the present application, which is not repeated here.
  • the target gene library construction method of the present application uses the difference in gene expression between the fetus and the mother to construct a target gene library for genes expressed by the fetus and hardly expressed by the mother based on free RNA extracted from the pregnant woman.
  • the target gene library construction method of the present application and the fetal gene detection method based on the constructed target gene library first, the present application detects the differential genes expressed by the fetus and the mother, especially those that are specifically expressed by the fetus and almost not expressed by the mother Genes can eliminate the interference of maternal background, and solve the problem that the mutation detection cannot be judged due to the large amount of maternal background in the detection based on free DNA; second, this application is specific for the target gene in free RNA of maternal plasma Amplification can detect fetal genes with low expression in plasma, and solve the problem of losing low-expression mRNA based on full transcriptome detection.
  • cfmRNA is a complete transcript obtained after DNA translation.
  • the sequence which removes a lot of redundant information of introns, can reflect the structural variation of DNA occurring on exons. Therefore, the present application can detect genetic structural variation, which solves the difficulty of detecting genetic structural variation in the existing free DNA-based detection Question;
  • the target gene library constructed in this application is used for fetal gene detection. When detecting recessive single gene disease, there is no need to know the information of parents and probands, no need to construct fetal haploid information, and the operation is simpler. The cost is lower.
  • the construction method of the target gene library and the fetal gene detection method based on the constructed target gene library of this application are simple in operation and low in cost, and can meet the clinical use requirements, laying a foundation for the further promotion and use of non-invasive detection for pregnant women.
  • FIG. 1 is a schematic diagram of a technical process of a method for constructing a target gene library in an embodiment of the present application
  • Example 2 is a distribution diagram of the coverage depth of each amplicon in Example 1 of the present application.
  • Example 3 is a graph of repetitive statistical results of two repeated sequencing samples of plasma samples of pregnant women with normal fetuses in Example 1 of the present application;
  • Example 4 is a graph of repetitive statistical results of two repeated sequencing samples of a pregnant woman plasma sample with a fetus with congenital afibrinogenemia in Example 1 of the present application;
  • FIG. 5 is a statistical result diagram of the base coverage of four sequencing samples in the first embodiment of the present application to the target pathogenic site;
  • Example 6 is a graph showing the results of base coverage of four sequencing samples in Example 2 of the present application on target pathogenic sites.
  • the existing fetal gene detection method is mainly for free fetal DNA in the blood of pregnant women.
  • the detection method based on free fetal DNA provided by Dennis Lo et al. can not only determine whether the fetus obtains the pathogenic mutation from the father, but also Judgment of the status of alleles inherited by the fetus from the mother.
  • the detection methods of Dennis Lo and others are complicated and costly, and it is difficult to meet the needs of clinical use; and, more importantly, the detection method based on free fetal DNA is difficult to detect genetic structure variation.
  • the transcript profile of the pregnant woman and the transcript profile of the fetus are different throughout the pregnancy period; the expression of different genes in time and space varies during the development of the fetus from the fertilized egg into an individual, for example, for certain Genes are not expressed by the mother, while the fetus is expressed during a certain period of pregnancy or has been expressed throughout pregnancy.
  • the inventors of the present application found that diseases caused by mutations in genes that differ in fetal and maternal expression can be detected by directly detecting cfmRNA expressed specifically in fetuses in plasma. Fetal-specifically expressed RNA is released into the blood circulation through the placenta. By detecting cfmRNA in the blood circulation, mutations in the fetal DNA level can be deduced.
  • the present application creatively proposes a new method for constructing a target gene library, including extracting free RNA from biological samples derived from pregnant women, the free RNA includes fetal free RNA; and reverse transcription of the extracted free RNA is generated First-strand cDNA; the first-strand cDNA is amplified with specific primers to obtain specific amplification products, wherein the specific primers can specifically amplify the target gene, which is a differentially expressed gene of the fetus and the mother, specific The sequence of the 5'end of the upstream primer and/or the downstream primer of the sexual primer is the same as at least a part of the sequence of the universal primer; the universal primer is used to perform a second round of PCR amplification on the specific amplification product to obtain the target gene library of the present application.
  • the present application further proposes a fetal gene detection method based on the target gene library construction method of the present application, and a fetal gene detection device and the like. Further, based on a specific application of the target gene library construction method and fetal gene detection method of the present application; and the present application proposes a reagent and kit for fetal FGA gene detection; a fetal HESX1 gene detection method Reagents and kits.
  • the fetal gene detection method of the present application is based on the detection of free plasma RNA from pregnant women.
  • the cfmRNA specifically expressed by fetus in plasma can easily detect the type of mutation of the fetus, including the genetic structure, so as to analyze whether the fetus carries the mutation of the parent. Specifically, during pregnancy, some genes of the fetus are specifically expressed and released into the mother's blood circulation system through the placenta. If these genes are related to certain diseases, the plasma cfRNA can be directly detected to detect the fetus. DNA mutation, as shown in Figure 1.
  • the fetal gene detection method of the present application for example, for recessive sick leave, the maternal genotype is AA or Aa, and the decision logic is:
  • the fetal genotype is AA
  • Aa 1:1, then the genotype of the fetus is Aa; 2) A ⁇ a, that is, the detected A is much smaller than a, then The genotype of the fetus is aa;
  • the fetal genotype must be aa.
  • the target gene library construction method and fetal gene detection method of the present application use genes with different expressions in the fetus and the mother to detect, and are particularly suitable for genes that are specifically expressed by the fetus, but the mother does not express or the expression of the mother is very low, such genes As shown in Table 1.
  • Table 1 shows the genes that are specifically expressed by the fetus, but the mother does not express or has very little expression. These genes can be used for the detection of the fetal gene of the present application for recessive genetic single gene disease mutations, fetal genotypes, genetic structure variations, etc. Detection.
  • the target gene library construction method of the present application and the fetal gene detection method based on the constructed target gene library can perform fetal genotype detection, and can easily realize genetic structure variation detection; and the operation is simple, the detection cost is relatively low, and It satisfactorily satisfies the use of clinical testing and lays the foundation for the further promotion and application of noninvasive production testing.
  • Congenital afibrinogenemia is a bleeding symptom caused by the loss of the coagulation process. Most of them are related to mutations in the FGA gene and are autosomal recessive genetic diseases. During pregnancy, the gene is rarely or not expressed in the mother, while the fetus is specifically expressed. Therefore, in this example, the fetal FGA gene was tested to verify the target gene library construction method and fetal gene detection method of the present application.
  • primer pairs designed in this example were designed in Table 2, and the common FGA mutation sites covered are shown in Table 3.
  • this example also designed a pair of primers to amplify GAPDH as a sample reference for reference for successful experiment and sample quantification.
  • the specific primer sequences are shown in Table 2.
  • FGA-F01 and FGA-R01 are the first primer pair
  • FGA-F02 and FGA-R02 are the second primer pair
  • FGA-F03 and FGA-R03 are the third primer pair
  • FGA-F04 and FGA-R04 It is the fourth primer pair
  • FGA-F05 and FGA-R05 are the fifth primer pair
  • FGA-F06 and FGA-R06 are the sixth primer pair
  • GAPDH-F and GAPDH-R are the primer pairs for amplifying GAPDH.
  • the underlined sequence that is, the 20 bp sequence at the 5'end of the upstream primer and the 21 bp sequence at the 5'end of the downstream primer is the linker sequence.
  • the linker sequence when the first round of PCR is performed on the cDNA, a part of the adapter sequence is introduced through the primer; in the second round of PCR, that is, when the specific primer is used to amplify the specific PCR amplification product, the complete linker containing the tag sequence is introduced through the universal primer Sequence, these linker sequences can be adjusted according to different sequencing platforms.
  • the free RNA in the blood of pregnant women is extracted, and the cfRNA is reverse transcribed into cDNA, which is combined with multiple PCR to complete the preparation of the target library, and then the mutation information is obtained through high-throughput sequencing.
  • the details are as follows:
  • Experimental sample 1 mL of a plasma sample of a pregnant woman carrying a normal fetus. The sample was divided into 2 equal parts for repeated testing. A pregnant woman with a fetus known to have a congenital afibrinogenemia. The fetus was clinically diagnosed with a gene mutation: c.103C>T(p.Arg35Cys). The pregnant woman’s plasma sample was 1 mL, and the sample was also averaged. Divided into 2 parts, as a repeated test.
  • the plasma samples were extracted from Qigen using Qigen’s
  • the Circulating Nucleic Acid Kit was extracted according to standard operating procedures, and the final total RNA was dissolved in 20 ⁇ L of water.
  • Thermofisher's first-strand cDNA synthesis kit SuperScript TM IV First-Strand Synthesis System 18091200 was used, as follows:
  • the random primer is an N 6 random primer.
  • Agencourt AMPure XP magnetic beads were added to obtain a 50 ⁇ L magnetic bead purification system. Purification was performed according to the instructions. Finally, 20 ⁇ L of distilled water was used to dissolve the DNA. Among them, Agencourt AMPure XP magnetic beads were purchased from Beckman Coulter Co., Ltd.
  • the PCR amplification enzyme adopts KAPA2G Fast Multiplex PCR Kit product number KK5801 of American kapa company.
  • the PCR reaction system includes: 2 ⁇ kapa polymerase mixed solution 25 ⁇ L, 10 ⁇ M primer pool 5 ⁇ L, purified DNA 20 ⁇ L, a total of 50 ⁇ L.
  • the primer pool includes the specific primer pairs shown in Table 1 and the primer pairs for amplifying GAPDH.
  • the concentration of the upstream and downstream primers of each primer pair in the primer pool is 10 ⁇ M.
  • the PCR reaction conditions were: pre-denaturation at 98°C for 2 min, and then entered into 15 cycles: 98°C for 10 s, 62°C for 2 min, and 72°C for 30 s. After the cycle was extended at 72°C for 5 min, PCR amplification was completed.
  • the PCR amplification enzyme adopts KAPA2G Fast Multiplex PCR Kit product of American kapa company, article number KK5801.
  • the reaction system includes: 25 ⁇ L of 2 ⁇ kapa polymerase mixture, 2.5 ⁇ L of 10 ⁇ M first primer, 2.5 ⁇ L of 10 ⁇ M second primer, and 20 ⁇ L of DNA obtained from the purified product of specific PCR amplification, totaling 50 ⁇ L.
  • the first primer is the sequence shown in SEQ ID NO.15
  • the second primer is the sequence shown in SEQ ID NO.16
  • the 5'end of the first primer has a phosphorylation modification.
  • reaction conditions were: pre-denaturation at 98°C for 2min, then 15 cycles: 98°C for 10s, 62°C for 2min, 72°C for 30s, and 72°C extension for 5min after the end of the cycle.
  • DNA nanospheres are prepared according to the requirements of BGISEQ-500 platform of BGI, and then the DNA nanospheres are sequenced on the computer.
  • the sequencing type is 50bp double-ended.
  • the preparation of DNA nanospheres includes: denaturing double-stranded nucleic acid amplification products into single-stranded nucleic acids through high temperature, using nucleic acid-mediated fragments to connect single-stranded nucleic acids into circular single-stranded nucleic acids, and performing rolling circle amplification on the circular single-stranded nucleic acids To obtain DNA nanospheres.
  • BGISEQ-500 platform of BGI For detailed steps, please refer to the instruction of BGISEQ-500 platform of BGI.
  • the statistical results of the number of original reads, the number of de-joined reads, the number of aligned reads, and the number of reads in the target area for the four sequencing samples are shown in Table 4.
  • four sequencing samples that is, two repeated sequencing samples of plasma samples of pregnant women with normal fetuses, and two repeated sequencing samples of plasma samples of pregnant women with fetal congenital afibrinogenemia.
  • Table 4 show that in the fetal FGA gene detection method of this example, the data utilization rate of the off-machine data can reach 97%, the comparison rate can reach 98%, and the target area ratio can reach 98%.
  • each specific primer pair and GAPDH amplification primers in the four sequencing samples is statistically shown in Figure 2; in Figure 2, the abscissas FGA01, FGA02, FGA03, FGA04, FGA05, FGA06 are in order Coverage of one pair of primers to the sixth pair of primers in four sequencing samples, 1-repeat 1, 1-repeat 2 are two replicate sequencing samples of pregnant fetal plasma samples from normal fetuses, 2-repeat 1, 2-repeat 2 Two repeated sequencing samples of plasma samples from pregnant women with fetuses with congenital afibrinogenemia. The results in Figure 2 show that the depth distribution obtained after sequencing each amplicon (ie, primer pair) is within 5 times the depth difference between different amplicons in the four samples.
  • the repeatability of two repeated sequencing samples of the plasma samples of pregnant women with normal fetuses was counted.
  • the results are shown in Figure 3.
  • the vertical coordinate of Figure 3 is the coverage of duplicate 1, and the horizontal coordinate is the coverage of duplicate 2.
  • the repeatability of two repeated sequencing samples of the plasma samples of pregnant women with congenital non-fibrinogenemia fetuses was counted.
  • the results are shown in Figure 4.
  • the vertical coordinate of Figure 4 is the coverage of Repeat 1, and the horizontal Coverage for repeat 2.
  • the results of Figure 3 and Figure 4 show that the depth stability of each amplicon obtained from different experiments of the same sample is very good.
  • the base coverage of the target pathogenic site was counted by sequencing data.
  • the results are shown in Fig. 5.
  • the abscissa of Fig. 5 is four sequencing samples, the ordinate is the coverage depth ratio, ⁇ indicates the pathogenic base, ⁇ indicates Normal bases.
  • Figure 5 shows the coverage of the four sequencing samples for the target site c.103C>T.
  • the results in Figure 5 show that the plasma samples of pregnant women with two normal fetuses can cover the pathogenic bases well, and the plasma samples of two pregnant women with congenital afibrinogenemia can also cover the normal. Base, indicating that the sequencing data of this example has a good coverage of the target pathogenic site.
  • Sample number Clinical test results Test results in this case Sample 1 repeats 1 No mutation No mutation detected Sample 1 repeats 2 No mutation No mutation detected Sample 2 repeats 1 c.103C>T c.103C>T Sample 2 repeats 2 c.103C>T c.103C>T
  • Optic septal dysplasia is a rare anterior malformation of the midline structure, also known as De-Morsier syndrome, including the absence of transparent septa and dysplasia of the visual transmission pathway, which can be combined with other developmental abnormalities in the brain. Part of the cause is caused by mutations in the HESX1 gene. During pregnancy, the gene is rarely or not expressed in the mother, while the fetus is specifically expressed. Therefore, in this example, the fetal HESX1 gene was tested to verify the target gene library construction method and fetal gene detection method of the present application.
  • this example 7 pairs of specific primers were designed to cover the common HESX1 mutation sites; the primer pairs designed in this example are shown in Table 6, and the covered common HESX1 mutation sites are shown in Table 7.
  • this example also uses a pair of primers to amplify GAPDH as a sample reference for reference for successful experiment and sample quantification.
  • the specific primer sequences are shown in Table 1.
  • HESX-F01 and HESX-R01 are primer pair one
  • HESX F02 and HESX-R02 are primer pair two
  • HESX-F03 and HESX-R03 are primer pair three
  • HESX-F04 and HESX-R04 are primer pair four
  • HESX F05 and HESX-R05 are primer pair five
  • HESX-F06 and HESX-R06 are primer pair six
  • HESX-F07 and HESX-R07 are primer pair seven.
  • HESX1-07F GACCGCTTGGCCTCCGACTTCTTCTAATTGCAGAGCATGAAGA 29 HESX1-07R GACATGGCTACGATCCGACTTTCTTTACTATAACTAAAAGTGCCC 30
  • the experimental sample in this case: 1 mL of a plasma sample of a pregnant woman carrying a normal fetus, sample No. 3, which was divided into 2 equal parts as a repeated test. After MRI, the pregnant woman was found to have a dysplasia of the septum and a positive biopsy. The fetus was clinically diagnosed with a gene mutation: ESX1, c.509C>T, p.Ser170Leu; take a plasma sample of the pregnant woman 1mL, No. 4 sample, equally divide the sample into 2 parts, as a repeated test.
  • the primers in Table 6 were used to prepare the sequencing library using the same library construction method as in Example 1.
  • the general primers used in the library preparation process were the same as in Example 1. Then use the same sequencing platform as in Example 1 for on-machine sequencing, and use the same method as in Example 1 for data analysis.
  • the sequencing data was used to count the base coverage of the target disease site.
  • the results are shown in Figure 6.
  • the abscissa of Figure 6 is four sequencing samples, sample 3 repeats 1 and sample 3 repeats 2, which is normal. 2 repeated sequencing samples of fetal maternal plasma samples; sample 4 repeats 1 and sample 4 repeats 2, that is, 2 repeat sequencing samples of maternal plasma samples with fetal dysplasia; the ordinate is the depth of coverage ratio , ⁇ indicates a pathogenic base, ⁇ indicates a normal base, and Figure 6 shows the coverage of the four sequencing samples for the target site c.509C>T.
  • the target gene library construction method of the present application and the fetal gene detection method based on the constructed target gene library can not only detect the single base mutation of the FGA gene in Example 1 and the single base mutation of the HESX1 gene in Example 2, Similarly, genes similar to those of the FGA gene and the HESX1 gene, which are specifically expressed by the fetus but not expressed by the mother or expressed in very small amounts, such as the genes shown in Table 1, can also be tested as long as the invention of the present application
  • the idea is to design the corresponding specific primer pairs covering the mutation sites of each gene, and then perform the first round of PCR amplification on the cDNA; the extraction of cfRNA before the specific amplification, the synthesis of the first strand of cDNA, and the subsequent general PCR amplification
  • FGA gene detection method in Example 1 of the present application For augmentation, computer sequencing, and data analysis, refer to the FGA gene detection method in Example 1 of the present application.

Abstract

Provided are a method for constructing a target gene library, a detection device and an application thereof. The method for constructing a target gene library comprises: extracting free RNA from a biological sample obtained from a pregnant woman, which comprises fetal free RNA; performing reverse transcription on the free RNA to form first-strand cDNA; amplifying the first-strand cDNA by using specific primers to obtain an amplification product having specificity, which is to say differentially expressed genes of the fetus and the mother, i.e., the target genes; performing a second round of PCR amplification on the amplification product by using universal primers to obtain a target gene library.

Description

一种靶基因文库的构建方法、检测装置及其应用Construction method, detection device and application of target gene library 技术领域Technical field
本申请涉及基因检测领域,特别是涉及一种靶基因文库的构建方法、检测装置及其应用。The present application relates to the field of gene detection, in particular to a method for constructing a target gene library, a detection device and its application.
背景技术Background technique
产前诊断是预防由遗传病导致的出生缺陷的重要手段。然而,像绒毛膜取样、羊水穿刺、脐带血穿刺这些为获得胎儿遗传信息的有创取样方式还是存在一定的风险。因此,发展一种无创的产前检测方式一直是一个重要的研究方向。早年间,科研工作者的研究方向集中在如何从孕妇外周血中获得胎儿有核红细胞用于遗传学分析。然而,如何高效的分离富集胎儿有核红细胞一直是该项技术发展的一大瓶颈。1997年,卢煜明在《lancet》上发表了一篇科学论文,该论文通过PCR扩增Y染色体特异性基因的方法证实了孕妇外周血浆中存在来自于胎儿的游离DNA;这一发现为通过孕妇外周血检测胎儿基因型提供了新的可能。Prenatal diagnosis is an important means to prevent birth defects caused by genetic diseases. However, there are still risks associated with invasive sampling methods such as chorionic sampling, amniotic fluid puncture, and umbilical cord blood puncture to obtain fetal genetic information. Therefore, developing a non-invasive prenatal detection method has always been an important research direction. In the early years, the research direction of scientific researchers focused on how to obtain fetal nucleated red blood cells from the peripheral blood of pregnant women for genetic analysis. However, how to efficiently separate and enrich fetal nucleated red blood cells has been a major bottleneck in the development of this technology. In 1997, Lu Yuming published a scientific paper in "lancet", which confirmed the presence of free DNA from the fetus in the plasma of pregnant women by PCR amplification of Y chromosome-specific genes; Blood testing for fetal genotypes provides new possibilities.
现在广泛应用的基于全基因组测序的检测技术,为唐氏患儿的筛选提供了重要参考依据,也避免了无数唐氏患儿的出生。但是,现有的技术只能针对染色体异常进行检测,对于其他单基因突变无法进行检测。然而染色体异常导致的胎儿缺陷的比例不到先天性致愚致残缺陷儿总数的4%,有相当多的缺陷胎儿不是由于染色体数目异常导致,而是由于单基因发生突变导致。单基因突变导致的疾病是影响儿童健康状况和生存质量的重要疾病,且绝大多数在孕早期无法发现。B超影像学虽然可以发现异常,但是只能在怀孕中期或者晚期发现异常,这给孕妇及其家人带来沉重的打击。缺陷胎儿出生后大部分出现生理异常,生活难以自理,也给家庭带来巨大的痛苦。The widely used detection technology based on whole genome sequencing now provides an important reference for the screening of children with Down's disease, and also avoids the birth of countless children with Down's disease. However, the existing technology can only detect chromosomal abnormalities, and cannot detect other single gene mutations. However, the proportion of fetal defects caused by chromosomal abnormalities is less than 4% of the total number of congenital foolish and disabled children. A considerable number of defective fetuses are not caused by abnormal chromosome numbers, but by single gene mutations. Diseases caused by single gene mutations are important diseases that affect children's health and quality of life, and the vast majority cannot be detected in early pregnancy. Although B-imaging can find abnormalities, it can only be found in the middle or late pregnancy, which brings a heavy blow to pregnant women and their families. After birth, most of the defective fetuses have physiological abnormalities, and it is difficult to take care of themselves. It also brings great pain to the family.
近年来,高通量测序技术的发展为无创产前检测单基因病提供了新的技术平台。可以通过直接检测血浆中可能存在的de novo突变进行显性单基因的检测;但是对于隐性单基因,由于母亲一般携带有突变信息,很难从有母亲背景的血浆中检测到胎儿是否携带有父母双方的突变信息。Dennis Lo等人2010在《science translational medicine》杂志上发表了一篇基于孕妇血浆游离DNA全基因组测序,并结合父母单体型分析推测胎儿基因型的文章,文章以β地贫为示例,证实了该方法在遗传病检测中的应用前景。随后,其又发表了基于目标区域捕获进行无创胎儿β地贫突变携带情况检测的报道,该报道与之前文章所采 用的分析方法类似,不同之处在于通过使用目标区域捕获技术,极大的缩减了测序成本,使其进一步接近临床应用。这种基于单体型的分析方法不但可以明确胎儿是否从父亲处获得致病突变,也可以实现对胎儿从母亲处所继承的等位基因情况的判断。但该方法需要先通过实验获得父母和先证者的基因信息,并采用复杂的算法来推测胎儿是否患病,操作复杂,成本高,无法满足临床实际使用需求。并且,通过短片段血浆游离DNA很难对一些基因结构发生变异的情况进行检测。In recent years, the development of high-throughput sequencing technology has provided a new technical platform for noninvasive prenatal detection of single gene diseases. The dominant single gene can be detected by directly detecting the denovo mutation that may exist in the plasma; however, for the recessive single gene, because the mother generally carries mutation information, it is difficult to detect whether the fetus carries the plasma from the mother's background Mutation information for both parents. Dennis Lo et al. published an article based on whole-genome sequencing of free plasma DNA of pregnant women in the journal Science Translational Medicine and combined with haplotype analysis of parents to speculate the genotype of the fetus. The article uses β-thalassaemia as an example to confirm The application prospect of this method in the detection of genetic diseases. Subsequently, it published a report on the detection of noninvasive fetal β-thalassemia mutations based on target area capture. This report is similar to the analysis method used in the previous article, except that the use of target area capture technology greatly reduces the The cost of sequencing has brought it closer to clinical applications. This haplotype-based analysis method can not only determine whether the fetus obtains the disease-causing mutation from the father, but also realize the determination of the alleles inherited by the fetus from the mother. However, this method needs to first obtain the genetic information of parents and probands through experiments, and use complex algorithms to speculate whether the fetus is sick. The operation is complicated and the cost is high, which cannot meet the actual clinical needs. Moreover, it is difficult to detect some gene structure mutations with short fragments of plasma free DNA.
发明内容Summary of the invention
本申请的目的是提供一种新的靶基因文库的构建方法,以及基于本申请文库构建方法的胎儿基因检测方法、胎儿基因检测装置、胎儿基因检测试剂及其应用。The purpose of the present application is to provide a new method for constructing a target gene library, a fetal gene detection method, a fetal gene detection device, a fetal gene detection reagent and their applications based on the library construction method of the present application.
本申请具体采用了以下技术方案:This application specifically adopts the following technical solutions:
本申请的第一方面公开了一种靶基因文库的构建方法,包括以下步骤:从来源于孕妇的生物样本中提取游离RNA,游离RNA包含胎儿游离RNA;将提取的游离RNA逆转录生成第一链cDNA;采用特异性引物对第一链cDNA进行扩增,获得特异性扩增产物;其中,特异性引物能够特异性扩增靶基因,靶基因为胎儿与母体的差异表达基因,特异性引物的上游引物和/或下游引物的5’端序列与通用引物的至少一部分序列相同;采用通用引物对特异性扩增产物进行第二轮PCR扩增,获得靶基因文库。The first aspect of the present application discloses a method for constructing a target gene library, including the following steps: extracting free RNA from a biological sample derived from a pregnant woman, the free RNA including fetal free RNA; reverse transcription of the extracted free RNA to generate a first Strand cDNA; the first strand cDNA is amplified with specific primers to obtain specific amplification products; wherein, the specific primers can specifically amplify the target gene, the target gene is the differentially expressed gene of the fetus and the mother, and the specific primer The sequence of the 5′ end of the upstream primer and/or the downstream primer is the same as at least a part of the sequence of the universal primer; using the universal primer to perform a second round of PCR amplification on the specific amplification product to obtain the target gene library.
优选地,差异表达基因为胎儿特异性表达,而母体几乎不表达的基因。Preferably, the differentially expressed genes are genes that are specifically expressed by the fetus, but are hardly expressed by the mother.
优选地,差异表达基因为表1所示基因中的至少一个。Preferably, the differentially expressed genes are at least one of the genes shown in Table 1.
在本申请中,差异表达基因是指来源于母体和胎儿并释放到孕妇外周血中表达量有显著差异的RNA所对应的基因。例如,差异表达基因可以是胎儿特异性表达,而母体几乎不表达或者表达量很低的基因。具体地,差异表达基因可以是表1所示基因中的任意一个或者多个。本申请正是利用胎儿和母体的差异表达进行胎儿靶基因文库构建,进而实现后续的胎儿基因信息的检测。此外,特异性引物的上游引物和/或下游引物的5’端序列与通用引物的至少一部分序列相同,是为了便于后续的通用引物能够结合于特异性扩增产物末端,从而实现对特异性扩增产物进行第二轮PCR扩增。可以理解,在实际的应用过程中,特异性引物可以有多组,分别对多个靶基因进行扩增,从而获得多个特异性扩增产物;而通用引物就是采用一组引物就可以对多个不同的特异性扩增产物进行扩增;因此,需要在各个特异性引物的上游引物和/或下游引物的5’端设计与通 用引物的至少一部分序列相同的序列。In the present application, the differentially expressed genes refer to genes corresponding to RNAs derived from maternal and fetal and released into the peripheral blood of pregnant women with significantly different expression levels. For example, the differentially expressed gene may be a gene that is specifically expressed by the fetus, but is rarely expressed by the maternal or expressed in a very low amount. Specifically, the differentially expressed genes may be any one or more of the genes shown in Table 1. This application is to use the differential expression of the fetus and the mother to construct the fetal target gene library, and then realize the subsequent detection of the fetal gene information. In addition, the sequence of the 5'end of the upstream primer and/or the downstream primer of the specific primer is the same as at least a part of the sequence of the universal primer, so that the subsequent universal primer can bind to the end of the specific amplification product, thereby achieving specific amplification The amplified products were subjected to the second round of PCR amplification. It can be understood that in the actual application process, there can be multiple sets of specific primers, which respectively amplify multiple target genes, so as to obtain multiple specific amplification products; while universal primers use one set of primers to Different specific amplification products are amplified; therefore, it is necessary to design a sequence identical to at least a part of the sequence of the universal primer at the 5′ end of the upstream primer and/or downstream primer of each specific primer.
需要说明的是,孕妇的生物样本中含有胎儿的游离RNA,特别是游离mRNA(缩写cfmRNA),因此,本申请创造性地直接对提取自孕妇样本的游离RNA进行胎儿靶基因文库构建。采用本申请构建的胎儿靶基因文库,能够直接对孕妇的血浆或尿液中胎儿特异性表达的cfmRNA进行检测,然后推断出胎儿的基因型,解决了因游离DNA中携带有大量的母体背景而导致突变检测无法判断的问题。cfmRNA是经过DNA翻译后得到的完整的转录本序列,除去了大量的内含子冗余信息,可以反映DNA发生在外显子上的结构变异,在DNA水平这些变异很难被检测到,而在mRNA水平,这些变异则相对容易被检测到。因此,本申请的靶基因文库构建采用cfmRNA替代cfDNA,用于后续的遗传疾病相关突变的无创性检测,可以对胎儿的某些隐性遗传病相关突变进行检测,也可以检测某些基因的结构变异,而这些在cfDNA水平上是很难检测到的。It should be noted that pregnant women’s biological samples contain fetal free RNA, especially free mRNA (abbreviated cfmRNA). Therefore, the present application creatively constructs a fetal target gene library for free RNA extracted from pregnant women samples. The fetal target gene library constructed by this application can directly detect fetal specific cfmRNA expressed in the plasma or urine of pregnant women, and then infer the genotype of the fetus, which solves the problem that a large amount of maternal background is carried in free DNA Causes problems that mutation detection cannot judge. cfmRNA is a complete transcript sequence obtained after DNA translation. It removes a lot of intron redundant information, which can reflect the structural variation of DNA in exons. These variations are difficult to detect at the DNA level. At the mRNA level, these variations are relatively easy to detect. Therefore, the target gene library construction of the present application uses cfmRNA instead of cfDNA for subsequent noninvasive detection of genetic disease-related mutations, which can detect certain recessive genetic disease-related mutations of the fetus, and can also detect the structure of certain genes Mutations, which are difficult to detect at the cfDNA level.
优选地,本申请的靶基因文库构建方法中,生物样本为外周血或尿液。Preferably, in the target gene library construction method of the present application, the biological sample is peripheral blood or urine.
需要说明的是,本申请的靶基因文库构建方法,原则上适用于所有含有胎儿游离RNA的孕妇样本,包括但不仅限于尿液和外周血的血浆。It should be noted that the target gene library construction method of the present application is in principle applicable to all pregnant women samples containing fetal free RNA, including but not limited to urine and peripheral blood plasma.
优选地,本申请的靶基因文库构建方法中,将提取的游离RNA逆转录生成第一链cDNA,具体包括采用随机引物对游离RNA进行逆转录,获得第一链cDNA。Preferably, in the method for constructing a target gene library of the present application, reverse transcription of the extracted free RNA to generate first-strand cDNA specifically includes reverse transcription of the free RNA using random primers to obtain first-strand cDNA.
更优选地,随机引物为N6随机引物。More preferably, the random primer is an N6 random primer.
需要说明的是,采用N6随机引物生成第一链cDNA只是本申请的一种实现方式中采用的具体方案,不排除还可以采用其它生成第一链cDNA的方法,在此不作具体限定。It should be noted that the use of N6 random primers to generate the first-strand cDNA is only a specific scheme adopted in an implementation manner of this application, and it is not excluded that other methods for generating the first-strand cDNA can also be used, which is not specifically limited herein.
优选地,本申请的靶基因文库构建方法中,特异性引物对为第一引物对至第六引物对中的至少一组;第一引物对的上下游引物分别为SEQ ID NO.1和SEQ ID NO.2所示序列,第二引物对的上下游引物分别为SEQ ID NO.3和SEQ ID NO.4所示序列,第三引物对的上下游引物分别为SEQ ID NO.5和SEQ ID NO.6所示序列,第四引物对的上下游引物分别为SEQ ID NO.7和SEQ ID NO.8所示序列,第五引物对的上下游引物分别为SEQ ID NO.9和SEQ ID NO.10所示序列,第六引物对的上下游引物分别为SEQ ID NO.11和SEQ ID NO.12所示序列。Preferably, in the method for constructing a target gene library of the present application, the specific primer pair is at least one of the first primer pair to the sixth primer pair; the upstream and downstream primers of the first primer pair are SEQ ID NO. 1 and SEQ respectively The sequence shown in ID NO.2, the upstream and downstream primers of the second primer pair are the sequences shown in SEQ ID NO. 3 and SEQ ID NO. 4, respectively, and the upstream and downstream primers of the third primer pair are SEQ ID NO. 5 and SEQ respectively The sequence shown in ID NO.6, the upstream and downstream primers of the fourth primer pair are the sequences shown in SEQ ID NO.7 and SEQ ID NO.8, and the upstream and downstream primers of the fifth primer pair are SEQ ID NO.9 and SEQ, respectively The sequence shown in ID NO.10, the upstream and downstream primers of the sixth primer pair are the sequences shown in SEQ ID NO.11 and SEQ ID NO.12, respectively.
需要说明的是,SEQ ID NO.1至SEQ ID NO.12所示序列的六个引物对只是本申请的一种实现方式中,具体的对FGA基因的常见突变位点进行扩增的引物,一方面,在不影响扩增效果或覆盖突变位点的情况下,可以对引物进行适当的调整,例如在5’端或3’端增减若干碱基,或者在5’端增加特殊目的的核酸序列, 例如本申请的这六个引物对就在5’端增加了部分接头序列,以方便后续采用通用引物进行扩增;另一方面,在具体的使用过程中,可以根据具体检测的突变位点选择使用其中的一个或多个引物对,甚至,在本申请的六个引物对没有覆盖的一些特殊突变位点,还可以另外自行设计引物对,在此不做具体限定。It should be noted that the six primer pairs of the sequences shown in SEQ ID NO. 1 to SEQ ID NO. 12 are only the specific primers for amplifying common mutation sites of the FGA gene in one implementation of the present application. On the one hand, without affecting the amplification effect or covering the mutation site, the primer can be adjusted appropriately, such as adding or subtracting a few bases at the 5'end or 3'end, or adding a special purpose at the 5'end Nucleic acid sequence, for example, the six primer pairs of the present application add a part of the linker sequence at the 5'end, so as to facilitate subsequent amplification using universal primers; on the other hand, in the specific use process, according to the specific detected mutation One or more primer pairs are used for the site selection. Even for some special mutation sites that are not covered by the six primer pairs in this application, primer pairs can be designed by themselves, which is not specifically limited here.
优选地,本申请的靶基因文库构建方法中,特异性引物包括引物对一至引物对七中的至少一组,引物对一的上下游引物分别为SEQ ID NO.17和SEQ ID NO.18所示序列,引物对二的上下游引物分别为SEQ ID NO.19和SEQ ID NO.20所示序列,引物对三的上下游引物分别为SEQ ID NO.21和SEQ ID NO.22所示序列,引物对四的上下游引物分别为SEQ ID NO.23和SEQ ID NO.24所示序列,引物对五的上下游引物分别为SEQ ID NO.25和SEQ ID NO.26所示序列,引物对六的上下游引物分别为SEQ ID NO.27和SEQ ID NO.28所示序列,引物对七的上下游引物分别为SEQ ID NO.29和SEQ ID NO.30所示序列。Preferably, in the method for constructing a target gene library of the present application, the specific primers include at least one of primer pair 1 to primer pair 7, and the upstream and downstream primers of primer pair 1 are SEQ ID NO. 17 and SEQ ID NO. 18 respectively. The sequence of primer pair two is the sequence shown in SEQ ID NO.19 and SEQ ID NO.20, and the primer pair three is the sequence shown in SEQ ID NO.21 and SEQ ID NO.22 The upstream and downstream primers of primer pair 4 are the sequences shown in SEQ ID NO.23 and SEQ ID NO. 24, respectively. The upstream and downstream primers of primer pair 5 are the sequences shown in SEQ ID NO. 25 and SEQ ID NO. 26, respectively. The upstream and downstream primers for pair six are the sequences shown in SEQ ID NO.27 and SEQ ID NO.28, respectively, and the upstream and downstream primers for primer pair seven are the sequences shown in SEQ ID NO.29 and SEQ ID NO.30.
同样的,SEQ ID NO.17至SEQ ID NO.30所示序列的七个引物对只是本申请的一种实现方式中,具体的对HESX1基因的常见突变位点进行扩增的引物;一方面,在不影响扩增效果或覆盖突变位点的情况下,可以对引物进行适当的调整;另一方面,在具体的使用过程中,可以根据具体检测的突变位点选择使用其中的一个或多个引物对,甚至,在本申请的七个引物对没有覆盖的一些特殊突变位点,还可以另外自行设计引物对,在此不做具体限定。Similarly, the seven primer pairs of the sequences shown in SEQ ID NO. 17 to SEQ ID NO. 30 are only the specific primers for amplifying common mutation sites of the HESX1 gene in one implementation of this application; , Without affecting the amplification effect or covering the mutation site, you can adjust the primer appropriately; on the other hand, in the specific use process, you can choose to use one or more of the mutation sites according to the specific detection One primer pair, even at some special mutation sites that are not covered by the seven primer pairs of this application, you can also design your own primer pair, which is not specifically limited here.
优选地,本申请的靶基因文库构建方法中,在采用特异性引物对第一链cDNA进行扩增的同时,采用参照引物对第一链cDNA进行扩增,参照引物能够特异性扩增看家基因。Preferably, in the method for constructing a target gene library of the present application, while using specific primers to amplify the first-strand cDNA, reference primers are used to amplify the first-strand cDNA, and the reference primers can specifically amplify the housekeeper gene.
需要说明的是,扩增看家基因的靶标区域,其目的是一方面可以作为实验成功与否的参照;另一方面,可以作为样本定量的参照。It should be noted that the target region of the housekeeping gene is amplified, on the one hand, it can be used as a reference for the success of the experiment; on the other hand, it can be used as a reference for sample quantification.
优选地,看家基因为GAPDH基因。Preferably, the housekeeping gene is the GAPDH gene.
优选地,扩增看家基因GAPDH的引物的上下游引物分别为SEQ ID NO.13和SEQ ID NO.14所示序列。Preferably, the upstream and downstream primers of the primers that amplify the housekeeping gene GAPDH are the sequences shown in SEQ ID NO. 13 and SEQ ID NO. 14, respectively.
需要说明的是,SEQ ID NO.13和SEQ ID NO.14所示序列的引物只是本申请的一种实现方式中采用的扩增看家基因GAPDH的引物,不排除还可以设计或采用其它引物,只要不影响靶基因扩增即可。It should be noted that the primers of the sequences shown in SEQ ID NO. 13 and SEQ ID NO. 14 are only the primers used to amplify the housekeeping gene GAPDH used in one implementation of this application, and it is not excluded that other primers can also be designed or used , As long as it does not affect the target gene amplification.
优选地,本申请的靶基因文库构建方法中,通用引物包括第一引物和第二引物,第一引物为SEQ ID NO.15所示序列,第二引物为SEQ ID NO.16所示序列。并且,优选地,第一引物的5’端具有磷酸化修饰。Preferably, in the method for constructing a target gene library of the present application, the universal primer includes a first primer and a second primer, the first primer is the sequence shown in SEQ ID NO.15, and the second primer is the sequence shown in SEQ ID NO.16. Also, preferably, the 5'end of the first primer has a phosphorylation modification.
需要说明的是,通用引物的作用是对PCR扩增产物进一步进行扩增富集, 从而获得可以用于测序的测序文库;其中,第一引物的5’端具有磷酸化修饰,其作用是为了方便后续对单链DNA进行环化。可以理解,通用引物的具体序列可以根据不同的试验或测序平台进行设计。It should be noted that the purpose of the universal primer is to further amplify and enrich the PCR amplification product, so as to obtain a sequencing library that can be used for sequencing; wherein, the 5'end of the first primer has a phosphorylation modification, and its purpose is to It is convenient for the subsequent circularization of single-stranded DNA. It can be understood that the specific sequence of the universal primer can be designed according to different experiments or sequencing platforms.
优选地,本申请的靶基因文库构建方法中,进一步包括在第二轮PCR扩增获得双链核酸扩增产物之后,将双链核酸扩增产物变性为单链核酸,并采用核酸介导片段将单链核酸连接成环状单链核酸,进而获得环状单链靶基因文库,其中,核酸介导片段能够与单链核酸的两个末端通过碱基互补配对原则进行结合。Preferably, in the method for constructing a target gene library of the present application, after obtaining the double-stranded nucleic acid amplification product in the second round of PCR amplification, the double-stranded nucleic acid amplification product is denatured into single-stranded nucleic acid, and a nucleic acid-mediated fragment is used The single-stranded nucleic acid is connected into a circular single-stranded nucleic acid, thereby obtaining a circular single-stranded target gene library, in which the nucleic acid-mediated fragment can be combined with the two ends of the single-stranded nucleic acid through the base complementary pairing principle.
需要说明的是,根据选用的测序平台的不同,可以对靶基因文库进行适应性处理。例如,使用Illumina测序平台进行测序,要求上机的文库是双链核酸文库,即在进行第二轮PCR扩增后获得的核酸双链扩增产物则是上机使用的文库;又如,使用BGI或MGI测序平台进行测序,要求上机的文库是环状单链核酸文库,则需要将第二轮PCR扩增后获得的核酸双链扩增产物变性为单链核酸,并通过介导片段将单链核酸连接成环状单链核酸,最后获得环状单链核酸文库之后才能上机测序。It should be noted that, according to different sequencing platforms selected, the target gene library can be adaptively processed. For example, sequencing using the Illumina sequencing platform requires that the library on the computer is a double-stranded nucleic acid library, that is, the nucleic acid double-stranded amplification product obtained after the second round of PCR amplification is the library used on the computer; another example, use BGI or MGI sequencing platform for sequencing requires that the library on the machine is a circular single-stranded nucleic acid library, then the double-stranded nucleic acid amplification product obtained after the second round of PCR amplification needs to be denatured into single-stranded nucleic acid, and through the mediated fragment The single-stranded nucleic acid is connected into a circular single-stranded nucleic acid, and the sequencing can only be performed after the circular single-stranded nucleic acid library is finally obtained.
本申请的第二方面公开了一种胎儿基因的检测方法,包括以下步骤:The second aspect of the present application discloses a method for detecting fetal genes, including the following steps:
采用本申请第一方面的方法进行靶基因文库构建,获得靶基因文库;对靶基因文库进行测序,获得由多个测序数据构成的测序结果;对测序结果进行分析,获得胎儿的基因信息。The method of the first aspect of the present application is used to construct a target gene library to obtain a target gene library; sequence the target gene library to obtain sequencing results composed of multiple sequencing data; analyze the sequencing results to obtain fetal gene information.
需要说明的是,本申请的胎儿基因检测方法,实际上就是在本申请的靶基因文库构建方法基础上,进一步对所构建的靶基因文库进行测序,从而实现胎儿基因检测。本申请的一种实现方式中,采用高通量测序方法,对所构建的靶基因文库进行测序,能够获得包括隐性遗传单基因病的突变信息、胎儿基因型信息、基因结构变异信息等在内的胎儿基因信息。It should be noted that the fetal gene detection method of the present application is actually based on the method of constructing the target gene library of the present application, and further sequencing the constructed target gene library to realize fetal gene detection. In one implementation of the present application, a high-throughput sequencing method is used to sequence the constructed target gene library to obtain mutation information, fetal genotype information, and gene structure variation information including recessive genetic single gene disease. Fetal genetic information.
可以理解,本申请的胎儿基因检测方法可以是非诊断目的的检测方法,例如通过该检测方法获得胎儿的基因型信息,从而预测胎儿可能出现的表型,当该基因型信息与疾病并没有直接对应关系时,其预测的表型信息也与疾病没有直接关系。It can be understood that the fetal gene detection method of the present application may be a detection method for non-diagnostic purposes, for example, the fetal genotype information is obtained by the detection method, so as to predict the phenotype that the fetus may appear, when the genotype information does not directly correspond to the disease In the relationship, the predicted phenotype information is not directly related to the disease.
优选地,对测序结果进行分析,具体包括以下步骤,对测序数据进行过滤;将过滤后的测序数据比对到参考基因组,保留唯一对比上参考基因组的测序数据;基于唯一对比上参考基因组的测序数据,统计靶基因的碱基分布情况,获得靶基因突变信息,进而获得胎儿的基因信息。Preferably, the analysis of the sequencing results specifically includes the steps of filtering the sequencing data; comparing the filtered sequencing data to the reference genome, retaining the sequencing data of the unique reference genome; sequencing based on the unique comparison of the reference genome Data, statistics of the base distribution of the target gene, to obtain mutation information of the target gene, and then obtain the genetic information of the fetus.
优选地,对测序结果进行分析,进一步包括,将靶基因突变信息与疾病数 据库进行对比,获得胎儿的隐性遗传单基因病的突变信息。Preferably, the analysis of the sequencing results further includes comparing the mutation information of the target gene with the disease database to obtain mutation information of the recessive genetic single gene disease of the fetus.
本申请的胎儿基因检测方法的其它部分,例如生物样本类型、第一链cDNA生成等都可以参考本申请第一方面的靶基因文库构建方法,在此不累述。For other parts of the fetal gene detection method of the present application, such as the type of biological sample, the first strand cDNA generation, etc., reference may be made to the method of constructing a target gene library in the first aspect of the present application, which is not described here.
本申请的靶基因文库构建方法或胎儿基因检测方法,其全部或部分步骤可以通过特有的机械装置实现,从而实现自动化操作。In the method for constructing a target gene library or the method for detecting fetal genes in the present application, all or part of the steps can be realized by a unique mechanical device, so as to realize automatic operation.
本申请的第三方面公开了一种胎儿基因的检测装置,其包括游离RNA提取模块、逆转录模块、靶基因扩增模块、靶基因文库生成模块、测序模块和分析模块;游离RNA提取模块,用于从来源于孕妇的生物样本中提取游离RNA,游离RNA包含胎儿游离RNA;逆转录模块,用于将提取的游离RNA逆转录生成第一链cDNA;靶基因扩增模块,用于采用特异性引物对第一链cDNA进行扩增,获得特异性扩增产物,特异性引物能够特异性扩增靶基因,靶基因为胎儿与母体的差异表达基因,特异性引物的上游引物和/或下游引物的5’端序列与通用引物的至少一部分序列相同;靶基因文库生成模块,用于采用通用引物对特异性扩增产物进行第二轮PCR扩增,获得靶基因文库;测序模块,用于对靶基因文库进行测序,获得由多个测序数据构成的测序结果;分析模块,用于对测序结果进行分析,获得胎儿的基因信息。The third aspect of the present application discloses a fetal gene detection device, which includes a free RNA extraction module, a reverse transcription module, a target gene amplification module, a target gene library generation module, a sequencing module, and an analysis module; a free RNA extraction module, It is used to extract free RNA from biological samples derived from pregnant women. The free RNA contains fetal free RNA; the reverse transcription module is used to reverse transcribe the extracted free RNA to generate first-strand cDNA; the target gene amplification module is used to adopt specific Sexual primers amplify the first-strand cDNA to obtain specific amplification products. The specific primers can specifically amplify the target gene. The target gene is the differentially expressed gene of the fetus and the mother. The upstream primer and/or downstream of the specific primer The sequence at the 5'end of the primer is the same as at least a part of the sequence of the universal primer; the target gene library generation module is used to perform a second round of PCR amplification on the specific amplification product using the universal primer to obtain the target gene library; the sequencing module is used to Sequencing the target gene library to obtain sequencing results composed of multiple sequencing data; analysis module for analyzing the sequencing results to obtain fetal genetic information.
优选地,本申请的胎儿基因检测装置中,游离RNA提取模块涉及的生物样本为外周血或尿液。Preferably, in the fetal gene detection device of the present application, the biological sample involved in the free RNA extraction module is peripheral blood or urine.
优选地,本申请的胎儿基因检测装置中,逆转录模块将提取的游离RNA逆转录生成第一链cDNA,具体包括采用随机引物对游离RNA进行逆转录,获得第一链cDNA;优选地,随机引物为N6随机引物。Preferably, in the fetal gene detection device of the present application, the reverse transcription module reverse transcribes the extracted free RNA to generate first-strand cDNA, which specifically includes performing reverse transcription of the free RNA using random primers to obtain the first-strand cDNA; preferably, random The primers are N6 random primers.
优选地,本申请的胎儿基因检测装置中,靶基因扩增模块涉及的差异表达基因为胎儿特异性表达,而母体几乎不表达的基因。优选地,差异表达基因为表1所示基因中的至少一个。Preferably, in the fetal gene detection device of the present application, the differentially expressed genes involved in the target gene amplification module are genes that are specifically expressed by the fetus but hardly expressed by the mother. Preferably, the differentially expressed genes are at least one of the genes shown in Table 1.
优选地,本申请的胎儿基因检测装置中,分析模块中,胎儿的基因信息包括隐性遗传单基因病的突变信息、胎儿基因型信息、基因结构变异信息的至少一种。Preferably, in the fetal gene detection device of the present application, in the analysis module, the genetic information of the fetus includes at least one of mutation information of recessive genetic single gene disease, fetal genotype information, and genetic structure variation information.
优选地,本申请的胎儿基因检测装置中,分析模块进一步包括过滤单元、参考基因组比对单元和统计单元;过滤单元,用于对测序数据进行过滤;参考基因组比对单元,用于将过滤后的测序数据比对到参考基因组,保留唯一对比上参考基因组的测序数据;统计单元,用于基于唯一对比上参考基因组的测序数据,统计靶基因的碱基分布情况,获得靶基因突变信息,进而获得胎儿的基因信息。Preferably, in the fetal gene detection device of the present application, the analysis module further includes a filtering unit, a reference genome comparison unit and a statistical unit; a filtering unit is used to filter sequencing data; a reference genome comparison unit is used to filter The sequencing data is compared to the reference genome, and the unique comparison of the sequencing data of the reference genome is retained; the statistical unit is used to calculate the base distribution of the target gene based on the sequencing data of the unique comparison of the reference genome to obtain mutation information of the target gene, and then Get the genetic information of the fetus.
优选地,本申请的胎儿基因检测装置中,分析模块进一步包括疾病数据库对比单元,疾病数据库对比单元用于将靶基因突变信息与疾病数据库进行对比,获得胎儿的隐性遗传单基因病的突变信息。Preferably, in the fetal gene detection device of the present application, the analysis module further includes a disease database comparison unit, and the disease database comparison unit is used to compare target gene mutation information with the disease database to obtain mutation information of the recessive genetic single gene disease of the fetus .
本申请的第四方面公开了本申请第一方面的靶基因文库构建方法、本申请第二方面的胎儿基因检测方法或本申请第三方面的胎儿基因检测装置在隐性遗传的单基因病检测、或结构变异检测中的应用。The fourth aspect of the present application discloses the method of constructing the target gene library of the first aspect of the present application, the fetal gene detection method of the second aspect of the present application or the fetal gene detection device of the third aspect of the present application in the detection of recessively inherited single gene diseases , Or the application of structural variation detection.
本申请的第五方面公开了一种用于胎儿FGA基因检测的试剂,该试剂包括用于扩增FGA基因的特异性引物,该特异性引物包括第一引物对至第六引物对中的至少一组,第一引物对的上下游引物分别为SEQ ID NO.1和SEQ ID NO.2所示序列,第二引物对的上下游引物分别为SEQ ID NO.3和SEQ ID NO.4所示序列,第三引物对的上下游引物分别为SEQ ID NO.5和SEQ ID NO.6所示序列,第四引物对的上下游引物分别为SEQ ID NO.7和SEQ ID NO.8所示序列,第五引物对的上下游引物分别为SEQ ID NO.9和SEQ ID NO.10所示序列,第六引物对的上下游引物分别为SEQ ID NO.11和SEQ ID NO.12所示序列。A fifth aspect of the present application discloses a reagent for detecting fetal FGA gene, the reagent includes a specific primer for amplifying FGA gene, the specific primer includes at least one of the first primer pair to the sixth primer pair In one set, the upstream and downstream primers of the first primer pair are the sequences shown in SEQ ID NO. 1 and SEQ ID NO. 2, respectively, and the upstream and downstream primers of the second primer pair are SEQ ID NO. 3 and SEQ ID NO. 4. Sequence, the upstream and downstream primers of the third primer pair are the sequences shown in SEQ ID NO.5 and SEQ ID NO.6, and the upstream and downstream primers of the fourth primer pair are SEQ ID NO.7 and SEQ ID NO.8, respectively. Sequence, the upstream and downstream primers of the fifth primer pair are the sequences shown in SEQ ID NO. 9 and SEQ ID NO. 10, respectively, and the upstream and downstream primers of the sixth primer pair are SEQ ID NO. 11 and SEQ ID NO. 12 respectively.示SEQ.
需要说明的是,本申请的试剂,其六个引物对能够扩增并覆盖FGA基因的常见突变位点;因此,本申请的试剂一方面可以直接用于FGA基因突变检测,另一方面,也可以对相应的靶标片段进行扩增富集。可以理解,本申请的试剂除了可以用于本申请的胎儿FGA基因检测方法以外,其它基于PCR扩增靶标片段的FGA基因检测方法也可以采用本申请的试剂,在此不做具体限定。It should be noted that the reagent of the present application, its six primer pairs can amplify and cover the common mutation sites of the FGA gene; therefore, the reagent of the present application can be directly used for FGA gene mutation detection on the one hand, on the other hand, The corresponding target fragments can be amplified and enriched. It can be understood that the reagents of the present application can be used for the detection method of the fetal FGA gene of the present application, and other FGA gene detection methods based on PCR amplification of target fragments can also use the reagents of the present application, which are not specifically limited herein.
优选地,本申请的试剂还包括用于扩增看家基因GAPDH的参照引物,该参照引物的上下游引物分别为SEQ ID NO.13和SEQ ID NO.14所示序列。Preferably, the reagent of the present application further includes a reference primer for amplifying the housekeeping gene GAPDH, and the upstream and downstream primers of the reference primer are the sequences shown in SEQ ID NO. 13 and SEQ ID NO. 14, respectively.
需要说明的是,本申请的试剂中,六个特异性引物对和扩增看家基因GAPDH的引物对,这些引物对可以根据使用需求和优化的多重PCR扩增方案,预先按比例混合,也可以各引物对单独包装,根据具体使用情况自行配置引物混合液,在此不做具体限定。可以理解,如果不需要对看家基因进行扩增,也可以不使用相应的参照引物对。It should be noted that in the reagents of the present application, six specific primer pairs and primer pairs for amplifying the housekeeping gene GAPDH, these primer pairs can be pre-mixed in proportion according to the use requirements and the optimized multiplex PCR amplification scheme. Each primer pair can be packaged separately, and the primer mixture can be configured according to the specific use situation, which is not specifically limited here. It can be understood that if it is not necessary to amplify the housekeeping gene, the corresponding reference primer pair may not be used.
优选地,本申请的试剂还包括用于构建测序文库的通用引物,通用引物包括第一引物和第二引物,第一引物为SEQ ID NO.15所示序列,第二引物为SEQ ID NO.16所示序列;并且,优选地,第一引物的5’端具有磷酸化修饰。Preferably, the reagent of the present application further includes a universal primer for constructing a sequencing library. The universal primer includes a first primer and a second primer. The first primer is the sequence shown in SEQ ID NO.15, and the second primer is SEQ ID NO. The sequence shown in 16; and, preferably, the 5'end of the first primer has a phosphorylation modification.
本申请的第六方面公开了一种用于胎儿FGA基因检测的试剂盒,该试剂盒中包括本申请第五方面的用于胎儿FGA基因检测的试剂。A sixth aspect of the present application discloses a kit for fetal FGA gene detection, which includes the reagent for fetal FGA gene detection in the fifth aspect of the present application.
优选地,试剂盒中还包括游离RNA提取试剂、逆转录试剂、PCR扩增试剂和核酸纯化试剂中的至少一种。Preferably, the kit further includes at least one of free RNA extraction reagents, reverse transcription reagents, PCR amplification reagents and nucleic acid purification reagents.
需要说明的是,为了使用方便,完全可以将本申请的FGA基因的特异性引物对、看家基因GAPDH扩增引物对和/或文库构建通用引物,甚至本申请胎儿FGA基因检测方法中所涉及的各种试剂,组装成试剂盒,以方便对胎儿FGA基因进行检测;当然,游离RNA提取试剂、逆转录试剂、PCR扩增试剂、核酸纯化试剂这些也可以使用实验室常规的试剂,在此不做具体限定。It should be noted that, for ease of use, the specific primer pairs of the FGA gene of the present application, the housekeeping gene GAPDH amplification primer pairs and/or the library can be used to construct universal primers, or even involved in the fetal FGA gene detection method of the present application The various reagents are assembled into a kit to facilitate the detection of fetal FGA genes; of course, free RNA extraction reagents, reverse transcription reagents, PCR amplification reagents, nucleic acid purification reagents, etc. can also use conventional laboratory reagents, here No specific restrictions.
本申请的第七方面公开了一种用于胎儿HESX1基因检测的试剂,包括用于扩增HESX1基因的特异性引物,特异性引物包括引物对一至引物对七中的至少一组,引物对一的上下游引物分别为SEQ ID NO.17和SEQ ID NO.18所示序列,引物对二的上下游引物分别为SEQ ID NO.19和SEQ ID NO.20所示序列,引物对三的上下游引物分别为SEQ ID NO.21和SEQ ID NO.22所示序列,引物对四的上下游引物分别为SEQ ID NO.23和SEQ ID NO.24所示序列,引物对五的上下游引物分别为SEQ ID NO.25和SEQ ID NO.26所示序列,引物对六的上下游引物分别为SEQ ID NO.27和SEQ ID NO.28所示序列,引物对七的上下游引物分别为SEQ ID NO.29和SEQ ID NO.30所示序列。The seventh aspect of the present application discloses a reagent for detecting fetal HESX1 gene, including a specific primer for amplifying the HESX1 gene, the specific primer includes at least one of primer pair 1 to primer pair 7, primer pair 1 The upstream and downstream primers are the sequences shown in SEQ ID NO.17 and SEQ ID NO.18, respectively. The upstream and downstream primers of primer pair 2 are the sequences shown in SEQ ID NO.19 and SEQ ID NO.20, and the primer pair three is The downstream primers are the sequences shown in SEQ ID NO.21 and SEQ ID NO.22 respectively, the upstream and downstream primers of primer pair 4 are the sequences shown in SEQ ID NO.23 and SEQ ID NO.24, and the upstream and downstream primers of primer pair V are respectively They are the sequences shown in SEQ ID NO.25 and SEQ ID NO. 26 respectively, the upstream and downstream primers of primer pair 6 are the sequences shown in SEQ ID NO.27 and SEQ ID NO. 28, and the upstream and downstream primers of primer pair 7 are respectively The sequence shown in SEQ ID NO.29 and SEQ ID NO.30.
同样,本申请的试剂,其七个引物对能够扩增并覆盖HESX1基因的常见突变位点;因此,本申请的试剂一方面可以直接用于HESX1基因突变检测,另一方面,也可以对相应的靶标片段进行扩增富集。可以理解,本申请的试剂除了可以用于本申请的胎儿HESX1基因检测方法以外,其它基于PCR扩增靶标片段的HESX1基因检测方法也可以采用本申请的试剂,在此不做具体限定。Similarly, the reagent of the present application, its seven primer pairs can amplify and cover the common mutation sites of the HESX1 gene; therefore, the reagent of the present application can be directly used for HESX1 gene mutation detection on the one hand, and can also be used for the corresponding The target fragments are amplified and enriched. It can be understood that the reagents of the present application can be used for the HESX1 gene detection method of the present application, and other HESX1 gene detection methods based on PCR amplification of target fragments can also use the reagents of the present application, which are not specifically limited herein.
优选地,本申请的试剂还包括用于扩增看家基因GAPDH的参照引物,该参照引物的上下游引物分别为SEQ ID NO.13和SEQ ID NO.14所示序列。Preferably, the reagent of the present application further includes a reference primer for amplifying the housekeeping gene GAPDH, and the upstream and downstream primers of the reference primer are the sequences shown in SEQ ID NO. 13 and SEQ ID NO. 14, respectively.
优选地,本申请的试剂还包括用于构建测序文库的通用引物,通用引物包括第一引物和第二引物,第一引物为SEQ ID NO.15所示序列,第二引物为SEQ ID NO.16所示序列;并且,优选地,第一引物的5’端具有磷酸化修饰。Preferably, the reagent of the present application further includes a universal primer for constructing a sequencing library. The universal primer includes a first primer and a second primer. The first primer is the sequence shown in SEQ ID NO.15, and the second primer is SEQ ID NO. The sequence shown in 16; and, preferably, the 5'end of the first primer has a phosphorylation modification.
同样的,引物的包装形式以及参照引物使用与否如本申请的第五方面的用于胎儿FGA基因检测的试剂,在此不累述。Similarly, the packaging format of the primer and whether the reference primer is used are the reagents for detecting the FGA gene of the fetus according to the fifth aspect of the present application, which are not repeated here.
同样的,为了使用方便,也可以将本申请的HESX1基因的特异性引物对、看家基因GAPDH扩增引物对和/或文库构建通用引物,甚至本申请胎儿HESX1基因检测方法中所涉及的各种试剂,组装成试剂盒,以方便对胎儿HESX1基因进行检测;当然,游离RNA提取试剂、逆转录试剂、PCR扩增试剂、核酸纯化试剂这些也可以使用实验室常规的试剂,在此不做具体限定。Similarly, for convenience of use, specific primer pairs of the HESX1 gene of the present application, housekeeping gene GAPDH amplification primer pairs and/or libraries can also be used to construct universal primers, and even each of the methods involved in the fetal HESX1 gene detection method of the present application Reagents, assembled into a kit to facilitate the detection of fetal HESX1 gene; of course, free RNA extraction reagents, reverse transcription reagents, PCR amplification reagents, nucleic acid purification reagents can also use conventional laboratory reagents, do not do here Specific restrictions.
本申请的第八方面公开了一种用于胎儿HESX1基因检测的试剂盒,该试剂盒中包括本申请第七方面的用于胎儿HESX1基因检测的试剂。The eighth aspect of the present application discloses a kit for fetal HESX1 gene detection, which includes the reagent for fetal HESX1 gene detection in the seventh aspect of the present application.
优选地,试剂盒中还包括游离RNA提取试剂、逆转录试剂、PCR扩增试剂和核酸纯化试剂中的至少一种。Preferably, the kit further includes at least one of free RNA extraction reagents, reverse transcription reagents, PCR amplification reagents and nucleic acid purification reagents.
同样的,试剂盒中各种试剂的包装形式如本申请第六方面的用于胎儿FGA基因检测的试剂盒,在此不累述。Similarly, the packaging form of various reagents in the kit is the same as the kit for detecting fetal FGA gene in the sixth aspect of the present application, which is not repeated here.
本申请的有益效果在于:The beneficial effects of this application are:
本申请的靶基因文库构建方法,利用胎儿和母体的基因表达差异,以提取自孕妇的游离RNA为基础对胎儿表达、母体几乎不表达的基因进行靶基因文库构建。本申请的靶基因文库构建方法以及基于所构建的靶基因文库的胎儿基因检测方法,第一,本申请通过检测胎儿与母体表达差异基因,特别是胎儿特异性表达的、而且母体几乎不表达的基因,可以排除母体背景的干扰,解决了基于游离DNA的检测中因携带有大量母体背景而导致突变检测无法判断的问题;第二,本申请通过对孕妇血浆游离RNA中的靶基因进行特异性扩增,能够很好地检测到血浆中低表达量的胎儿基因,解决了基于全转录组检测会丢掉表达量低的mRNA的问题;第三,cfmRNA是经过DNA翻译后得到的完整的转录本序列,除去了大量的内含子冗余信息,可以反映DNA发生在外显子上的结构变异,因此,本申请能够检测基因结构变异,解决了现有基于游离DNA的检测难以检测基因结构变异的问题;第四,采用本申请构建的靶基因文库进行胎儿基因检测,在检测隐性单基因病时不需要获知父母和先证者的信息,不需要构建胎儿单倍体信息,操作更简单,成本更低。本申请的靶基因文库构建方法以及基于所构建的靶基因文库的胎儿基因检测方法,操作简单、成本低,能够满足临床使用需求,为孕妇无创检测的进一步推广使用奠定了基础。The target gene library construction method of the present application uses the difference in gene expression between the fetus and the mother to construct a target gene library for genes expressed by the fetus and hardly expressed by the mother based on free RNA extracted from the pregnant woman. The target gene library construction method of the present application and the fetal gene detection method based on the constructed target gene library, first, the present application detects the differential genes expressed by the fetus and the mother, especially those that are specifically expressed by the fetus and almost not expressed by the mother Genes can eliminate the interference of maternal background, and solve the problem that the mutation detection cannot be judged due to the large amount of maternal background in the detection based on free DNA; second, this application is specific for the target gene in free RNA of maternal plasma Amplification can detect fetal genes with low expression in plasma, and solve the problem of losing low-expression mRNA based on full transcriptome detection. Third, cfmRNA is a complete transcript obtained after DNA translation. The sequence, which removes a lot of redundant information of introns, can reflect the structural variation of DNA occurring on exons. Therefore, the present application can detect genetic structural variation, which solves the difficulty of detecting genetic structural variation in the existing free DNA-based detection Question; Fourth, the target gene library constructed in this application is used for fetal gene detection. When detecting recessive single gene disease, there is no need to know the information of parents and probands, no need to construct fetal haploid information, and the operation is simpler. The cost is lower. The construction method of the target gene library and the fetal gene detection method based on the constructed target gene library of this application are simple in operation and low in cost, and can meet the clinical use requirements, laying a foundation for the further promotion and use of non-invasive detection for pregnant women.
附图说明BRIEF DESCRIPTION
图1是本申请实施例中靶基因文库构建方法的技术流程示意图;FIG. 1 is a schematic diagram of a technical process of a method for constructing a target gene library in an embodiment of the present application;
图2是本申请实施例一中各扩增子的覆盖深度分布图;2 is a distribution diagram of the coverage depth of each amplicon in Example 1 of the present application;
图3是本申请实施例一中正常胎儿的孕妇血浆样本的2个重复测序样本的重复性统计结果图;3 is a graph of repetitive statistical results of two repeated sequencing samples of plasma samples of pregnant women with normal fetuses in Example 1 of the present application;
图4是本申请实施例一中患有先天性无纤维蛋白原血症的胎儿的孕妇血浆样本的2个重复测序样本的重复性统计结果图;4 is a graph of repetitive statistical results of two repeated sequencing samples of a pregnant woman plasma sample with a fetus with congenital afibrinogenemia in Example 1 of the present application;
图5是本申请实施例一中四个测序样本对目标致病位点的碱基覆盖度统计结果图;FIG. 5 is a statistical result diagram of the base coverage of four sequencing samples in the first embodiment of the present application to the target pathogenic site;
图6是本申请实施例二中四个测序样本对目标致病位点的碱基覆盖度统计 结果图。6 is a graph showing the results of base coverage of four sequencing samples in Example 2 of the present application on target pathogenic sites.
具体实施方式detailed description
现有的胎儿基因检测方法主要是针对孕妇血液中的胎儿游离DNA进行,Dennis Lo等人提供的基于胎儿游离DNA的检测方法,不但可以明确胎儿是否从父亲处获得致病突变,也可以实现对胎儿从母亲处所继承的等位基因情况的判断。但是,Dennis Lo等人的检测方法操作复杂、成本高,难以满足临床使用需求;并且,更为重要的是,基于胎儿游离DNA的检测方法难以进行基因结构变异检测。The existing fetal gene detection method is mainly for free fetal DNA in the blood of pregnant women. The detection method based on free fetal DNA provided by Dennis Lo et al. can not only determine whether the fetus obtains the pathogenic mutation from the father, but also Judgment of the status of alleles inherited by the fetus from the mother. However, the detection methods of Dennis Lo and others are complicated and costly, and it is difficult to meet the needs of clinical use; and, more importantly, the detection method based on free fetal DNA is difficult to detect genetic structure variation.
本申请发明人的研究显示在整个孕期,孕妇的转录谱和胎儿的转录谱是不一样的;胎儿从受精卵发育成个体的过程中不同基因在时间和空间的表达存在差异,例如对于某些基因,母体不表达,而胎儿在孕期某一时期表达或者在孕期一直表达。本申请发明人发现,对于这些胎儿和母体表达存在差异的基因的突变导致的疾病,可以通过直接检测血浆中胎儿特异性表达的cfmRNA进行检测。胎儿特异表达的RNA通过胎盘释放到血液循环中,通过对血液循环中cfmRNA的检测就可以推导出胎儿DNA水平的突变。Research by the inventors of the present application shows that the transcript profile of the pregnant woman and the transcript profile of the fetus are different throughout the pregnancy period; the expression of different genes in time and space varies during the development of the fetus from the fertilized egg into an individual, for example, for certain Genes are not expressed by the mother, while the fetus is expressed during a certain period of pregnancy or has been expressed throughout pregnancy. The inventors of the present application found that diseases caused by mutations in genes that differ in fetal and maternal expression can be detected by directly detecting cfmRNA expressed specifically in fetuses in plasma. Fetal-specifically expressed RNA is released into the blood circulation through the placenta. By detecting cfmRNA in the blood circulation, mutations in the fetal DNA level can be deduced.
基于以上研究和认识,本申请创造性的提出了一种新的靶基因文库构建方法,包括从来源于孕妇的生物样本中提取游离RNA,游离RNA包含胎儿游离RNA;将提取的游离RNA逆转录生成第一链cDNA;采用特异性引物对第一链cDNA进行扩增,获得特异性扩增产物,其中,特异性引物能够特异性扩增靶基因,靶基因为胎儿与母体的差异表达基因,特异性引物的上游引物和/或下游引物的5’端序列与通用引物的至少一部分序列相同;采用通用引物对特异性扩增产物进行第二轮PCR扩增,获得本申请的靶基因文库。Based on the above research and knowledge, the present application creatively proposes a new method for constructing a target gene library, including extracting free RNA from biological samples derived from pregnant women, the free RNA includes fetal free RNA; and reverse transcription of the extracted free RNA is generated First-strand cDNA; the first-strand cDNA is amplified with specific primers to obtain specific amplification products, wherein the specific primers can specifically amplify the target gene, which is a differentially expressed gene of the fetus and the mother, specific The sequence of the 5'end of the upstream primer and/or the downstream primer of the sexual primer is the same as at least a part of the sequence of the universal primer; the universal primer is used to perform a second round of PCR amplification on the specific amplification product to obtain the target gene library of the present application.
在本申请的靶基因文库构建方法的基础上,本申请进一步提出了基于本申请靶基因文库构建方法的胎儿基因检测方法,以及胎儿基因的检测装置等。进一步的,基于本申请靶基因文库构建方法和胎儿基因检测方法的一种具体应用;以及本申请提出了一种用于胎儿FGA基因检测的试剂和试剂盒;一种用于胎儿HESX1基因检测的试剂和试剂盒。On the basis of the target gene library construction method of the present application, the present application further proposes a fetal gene detection method based on the target gene library construction method of the present application, and a fetal gene detection device and the like. Further, based on a specific application of the target gene library construction method and fetal gene detection method of the present application; and the present application proposes a reagent and kit for fetal FGA gene detection; a fetal HESX1 gene detection method Reagents and kits.
本申请的胎儿基因检测方法,基于孕妇的血浆游离RNA进行检测,通过血浆中胎儿特异性表达的cfmRNA,可以很容易检测胎儿的变异类型,包括基因结构变异,从而分析胎儿是否携带父母的突变。具体来说,在孕妇怀孕过程中,胎儿的一些基因会特异性表达,通过胎盘释放到母亲的血液循环系统中,如果这些基因和某些疾病相关,就可以直接对血浆cfRNA进行检测达到检测胎儿 DNA突变,如图1所示。The fetal gene detection method of the present application is based on the detection of free plasma RNA from pregnant women. The cfmRNA specifically expressed by fetus in plasma can easily detect the type of mutation of the fetus, including the genetic structure, so as to analyze whether the fetus carries the mutation of the parent. Specifically, during pregnancy, some genes of the fetus are specifically expressed and released into the mother's blood circulation system through the placenta. If these genes are related to certain diseases, the plasma cfRNA can be directly detected to detect the fetus. DNA mutation, as shown in Figure 1.
本申请的胎儿基因检测方法,例如,对于隐性病假,其母体的基因型为AA或者Aa,其判定逻辑是:The fetal gene detection method of the present application, for example, for recessive sick leave, the maternal genotype is AA or Aa, and the decision logic is:
如果检测到血浆cfRNA中只有AA,那么胎儿的基因型为AA;If only AA is detected in plasma cfRNA, then the fetal genotype is AA;
如果检测到血浆cfRNA中只有Aa,那只有两种情况:1)A:a=1:1,那么胎儿的基因型为Aa;2)A<<a,即检测到的A远小于a,那么胎儿的基因型为aa;If only Aa is detected in the plasma cfRNA, there are only two cases: 1) A: a = 1:1, then the genotype of the fetus is Aa; 2) A<<a, that is, the detected A is much smaller than a, then The genotype of the fetus is aa;
如果检测到血浆cfRNA中只有aa,那么胎儿的基因型必定是aa。If only aa is detected in plasma cfRNA, then the fetal genotype must be aa.
本申请的靶基因文库构建方法和胎儿基因检测方法,利用胎儿和母体表达有差异的基因进行检测,尤其适用于胎儿特异性表达,而母体不表达或者母体表达量很低的基因,这类基因如表1所示。The target gene library construction method and fetal gene detection method of the present application use genes with different expressions in the fetus and the mother to detect, and are particularly suitable for genes that are specifically expressed by the fetus, but the mother does not express or the expression of the mother is very low, such genes As shown in Table 1.
表1可检测的基因列表Table 1 List of detectable genes
KRT7KRT7 CSH1CSH1 PSG8PSG8 GBP1P1GBP1P1 SEMA3BSEMA3B
TMEM54TMEM54 KISS1KISS1 ENDOUENDOU CLDN4CLDN4 GPC3GPC3
PRKCZPRKCZ STAT1STAT1 EGFREGFR C2orf72C2orf72 PLAC2PLAC2
SDC1SDC1 CGACGA DUSP4DUSP4 PAPPA2PAPPA2 PSG9PSG9
LGALS13LGALS13 CSH2CSH2 PHYHIPLPHYHIPL HIST2H3AHIST2H3A FN1FN1
EFHD1EFHD1 TFPI2TFPI2 CTSFCTSF HIST2H3CHIST2H3C NOS3NOS3
CAPN6CAPN6 GBP1GBP1 TRIM29TRIM29 TCL6TCL6 LOC100506655LOC100506655
XAGE3XAGE3 PLAC4PLAC4 RCN3RCN3 MFSD2AMFSD2A PSG11PSG11
EBI3EBI3 HSD17B1HSD17B1 SPIRE2SPIRE2 ZFAT-AS1ZFAT-AS1 SPTLC3SPTLC3
GH2GH2 CSHL1CSHL1 LOC100216001LOC100216001 INSL4INSL4 EXPH5EXPH5
PAGE4PAGE4 KRT8KRT8 FAM176AFAM176A DHRS2DHRS2 HSPA2HSPA2
ALPPALPP KRT18KRT18 SCINSCIN HES2HES2 PSG6PSG6
INHBAINHBA HPGDHPGD ZNF500ZNF500 WLSWLS PLAC1PLAC1
LOC100505659LOC100505659 GADD45GGADD45G PRR16PRR16 PLCXD3PLCXD3 TACC2TACC2
FBLN1FBLN1 LGALS14LGALS14 LOC100128054LOC100128054 LOC100505483LOC100505483 PRPF40BPRPF40B
LOC388948LOC388948 ADAM12ADAM12 GRHL2GRHL2 SVEP1SVEP1 CRHCRH
SERPINB2SERPINB2 PSG2PSG2 PPP1R32PPP1R32 CRYABCRYAB EFSEFS
HSD3B1HSD3B1 OLR1OLR1 C1orf130C1orf130 HSPB8HSPB8 TIMD4TIMD4
KRT19KRT19 SLC30A2SLC30A2 FOSBFOSB PKIBPKIB ALDH3B2ALDH3B2
SERPINE1SERPINE1 LOC285972LOC285972 GCM1GCM1 PGFPGF KRT81KRT81
GDF15GDF15 HESX1HESX1 TRPV6TRPV6 CYP11A1CYP11A1 MUC15MUC15
CYP19A1CYP19A1 TMEM139TMEM139 TFAP2ATFAP2A PSG5PSG5 PRSS8PRSS8
PSG4PSG4 ZNF727ZNF727 MMP11MMP11 PSG1PSG1 SH2D5SH2D5
PSG3PSG3 TM4SF19TM4SF19 TUSC3TUSC3 PAPPAPAPPA LOC728175LOC728175
HMGCRHMGCR GLDNGLDN C8orf39C8orf39 VGLL3VGLL3 CORO6CORO6
CCKCCK PGAP3PGAP3 LOC100129935LOC100129935 MSX2P1MSX2P1 GH1GH1
PABPN1LPABPN1L FGAFGA  A  A  A
表1所示为胎儿特异性表达,而母亲不表达或表达量极少的基因,这些基 因可以采用本申请的胎儿基因检测方法进行隐性遗传单基因病突变、胎儿基因型、基因结构变异等检测。Table 1 shows the genes that are specifically expressed by the fetus, but the mother does not express or has very little expression. These genes can be used for the detection of the fetal gene of the present application for recessive genetic single gene disease mutations, fetal genotypes, genetic structure variations, etc. Detection.
本申请的靶基因文库构建方法,以及基于构建的靶基因文库的胎儿基因检测方法,可以进行胎儿基因型检测,能够很容易的实现基因结构变异检测;并且操作简单、检测成本相对较低,能够很好地满足临床检测使用,为无创产检的进一步推广应用奠定了基础。The target gene library construction method of the present application and the fetal gene detection method based on the constructed target gene library can perform fetal genotype detection, and can easily realize genetic structure variation detection; and the operation is simple, the detection cost is relatively low, and It satisfactorily satisfies the use of clinical testing and lays the foundation for the further promotion and application of noninvasive production testing.
下面通过具体实施例对本申请作进一步详细说明。以下实施例仅对本申请进行进一步说明,不应理解为对本申请的限制。The present application will be further described in detail below through specific embodiments. The following examples only further illustrate the present application, and should not be construed as limiting the present application.
实施例一Example one
先天性无纤维蛋白原血症是一种凝血过程损失而导致的出血症状,大多和FGA基因上的突变相关,为常染色体隐性遗传病。在孕妇怀孕过程中,该基因在母体中很少或者不表达,而胎儿特异性表达。因此,本例对胎儿FGA基因进行检测,以验证本申请的靶基因文库构建方法和胎儿基因检测方法。Congenital afibrinogenemia is a bleeding symptom caused by the loss of the coagulation process. Most of them are related to mutations in the FGA gene and are autosomal recessive genetic diseases. During pregnancy, the gene is rarely or not expressed in the mother, while the fetus is specifically expressed. Therefore, in this example, the fetal FGA gene was tested to verify the target gene library construction method and fetal gene detection method of the present application.
本例设计了六个引物对,即特异性引物对,覆盖常见的FGA突变位点;本例设计的引物对如表2所示,所覆盖的常见FGA突变位点如表3所示。另外,本例还设计了一对引物扩增GAPDH作为样本参照,用于实验成功以及样本定量的参照,具体引物序列如表2所示。表2中,FGA-F01和FGA-R01为第一引物对,FGA-F02和FGA-R02为第二引物对,FGA-F03和FGA-R03为第三引物对,FGA-F04和FGA-R04为第四引物对,FGA-F05和FGA-R05为第五引物对,FGA-F06和FGA-R06为第六引物对,GAPDH-F和GAPDH-R为扩增GAPDH的引物对。In this example, six primer pairs were designed, namely specific primer pairs, covering common FGA mutation sites; the primer pairs designed in this example are shown in Table 2, and the common FGA mutation sites covered are shown in Table 3. In addition, this example also designed a pair of primers to amplify GAPDH as a sample reference for reference for successful experiment and sample quantification. The specific primer sequences are shown in Table 2. In Table 2, FGA-F01 and FGA-R01 are the first primer pair, FGA-F02 and FGA-R02 are the second primer pair, FGA-F03 and FGA-R03 are the third primer pair, FGA-F04 and FGA-R04 It is the fourth primer pair, FGA-F05 and FGA-R05 are the fifth primer pair, FGA-F06 and FGA-R06 are the sixth primer pair, and GAPDH-F and GAPDH-R are the primer pairs for amplifying GAPDH.
表2特异性引物对和GAPDH扩增引物Table 2 Specific primer pairs and GAPDH amplification primers
引物编号Primer number 序列(5’→3’)Sequence (5’→3’) SEQ ID NO.SEQ ID NO.
FGA-F01FGA-F01 GACCGCTTGGCCTCCGACTTTGTTTGCTGTAACTTGAAGATTTACC GACCGCTTGGCCTCCGACTT TGTTTGCTGTAACTTGAAGATTTACC 11
FGA-R01FGA-R01 GACATGGCTACGATCCGACTTCTTCTCACCTATGTTAGGAGAG GACATGGCTACGATCCGACTT CTTCTCACCTATGTTAGGAGAG 22
FGA-F02FGA-F02 GACCGCTTGGCCTCCGACTTATTGCCTCGGGACAGTCAGAACCA GACCGCTTGGCCTCCGACTT ATTGCCTCGGGACAGTCAGAACCA 33
FGA-R02FGA-R02 GACATGGCTACGATCCGACTTCAGGACTGGTAAAGAGAAGGTCACC GACATGGCTACGATCCGACTT CAGGACTGGTAAAGAGAAGGTCACC 44
FGA-F03FGA-F03 GACCGCTTGGCCTCCGACTTGCCAGGATTCCAGGTTCCGGTAC GACCGCTTGGCCTCCGACTT GCCAGGATTCCAGGTTCCGGTAC 55
FGA-R03FGA-R03 GACATGGCTACGATCCGACTTGGACTGGAGGGACTGCAACCTGG GACATGGCTACGATCCGACTT GGACTGGAGGGACTGCAACCTGG 66
FGA-F04FGA-F04 GACCGCTTGGCCTCCGACTTCGAGTAATCTCATTTCCACCAGGTCTC GACCGCTTGGCCTCCGACTT CGAGTAATCTCATTTCCACCAGGTCTC 77
FGA-R04FGA-R04 GACATGGCTACGATCCGACTTTGAACAGGTCATTGCCAAAGACTT GACATGGCTACGATCCGACTT TGAACAGGTCATTGCCAAAGACTT 88
FGA-F05FGA-F05 GACCGCTTGGCCTCCGACTTCACTTTTCTCTGCATATAGTAGAC GACCGCTTGGCCTCCGACTT CACTTTTCTCTGCATATAGTAGAC 99
FGA-R05FGA-R05 GACATGGCTACGATCCGAC TTAATTGAAGTCCTGAAGCGCAAAG GACATGGCTACGATCCGACTT AATTGAAGTCCTGAAGCGCAAAG 1010
FGA-F06FGA-F06 GACCGCTTGGCCTCCGACTTTGACTGCTTACCCAGTCTTCAT GACCGCTTGGCCTCCGACTT TGACTGCTTACCCAGTCTTCAT 1111
FGA-R06FGA-R06 GACATGGCTACGATCCGACTTTATGTGAATGAATCTTTAAAGACTGC GACATGGCTACGATCCGACTT TATGTGAATGAATCTTTAAAGACTGC 1212
GAPDH-FGAPDH-F GACCGCTTGGCCTCCGACTTTCAACGACCACTTTGTCAAGC GACCGCTTGGCCTCCGACTT TCAACGACCACTTTGTCAAGC 1313
GAPDH-RGAPDH-R GACATGGCTACGATCCGACTTGCCAGACCCTGCACTTTTTAAG GACATGGCTACGATCCGACTT GCCAGACCCTGCACTTTTTAAG 1414
表2中,下划线部分的序列,即上游引物的5’端的20bp序列和下游引物的5’端的21bp序列,为接头序列。本例在对cDNA进行第一轮PCR时通过引物引入部分接头序列;在第二轮PCR,即通用引物对特异性PCR扩增产物进行扩增时,通过通用引物引入完整的含有标签序列的接头序列,这些接头序列可以根据不同测序平台进行调整。In Table 2, the underlined sequence, that is, the 20 bp sequence at the 5'end of the upstream primer and the 21 bp sequence at the 5'end of the downstream primer is the linker sequence. In this example, when the first round of PCR is performed on the cDNA, a part of the adapter sequence is introduced through the primer; in the second round of PCR, that is, when the specific primer is used to amplify the specific PCR amplification product, the complete linker containing the tag sequence is introduced through the universal primer Sequence, these linker sequences can be adjusted according to different sequencing platforms.
表3特异性引物对覆盖的FGA突变位点Table 3 FGA mutation sites covered by specific primer pairs
编号Numbering 突变mutation HG38上的位置Location on HG38
11 c.510+1G>Tc.510+1G>T chr4:154587511chr4:154587511
22 c.1622delT(p.Val541Alafs)c.1622delT(p.Val541Alafs) chr4:154585807chr4:154585807
33 c.1634A>T(p.Glu545Val)c.1634A>T(p.Glu545Val) chr4:154585795chr4:154585795
44 c.104G>A(p.Arg35His)c.104G>A(p.Arg35His) chr4:154589513chr4:154589513
55 c.103C>T(p.Arg35Cys)c.103C>T(p.Arg35Cys) chr4:154589514chr4:154589514
66 c.571G>A(p.Gly191Arg)c.571G>A(p.Gly191Arg) chr4:154609725chr4:154609725
77 c.1718G>T(p.Arg573Leu)c.1718G>T(p.Arg573Leu) chr4:154585711chr4:154585711
88 c.1629delG(p.Thr544Leufs)c.1629delG(p.Thr544Leufs) chr4:154585800chr4:154585800
99 c.711dupT(p.Lys238Terfs)c.711dupT(p.Lys238Terfs) chr4:154586718chr4:154586718
1010 c.502C>T(p.Arg168Ter)c.502C>T(p.Arg168Ter) chr4:154587520chr4:154587520
本例通过提取孕妇血液中的游离RNA,并将cfRNA逆转录为cDNA,再结合多重PCR完成靶向文库的制备,再经过高通量测序获得突变信息。详细如下:In this example, the free RNA in the blood of pregnant women is extracted, and the cfRNA is reverse transcribed into cDNA, which is combined with multiple PCR to complete the preparation of the target library, and then the mutation information is obtained through high-throughput sequencing. The details are as follows:
实验样本:怀有正常胎儿的孕妇血浆样本1mL,将该样本平均分为2份,作为重复试验。已知患有先天性无纤维蛋白原血症的胎儿的孕妇,该胎儿经临床诊断其基因突变情况为:c.103C>T(p.Arg35Cys),其该孕妇血浆样本1mL,同样将样本平均分为2份,作为重复试验。Experimental sample: 1 mL of a plasma sample of a pregnant woman carrying a normal fetus. The sample was divided into 2 equal parts for repeated testing. A pregnant woman with a fetus known to have a congenital afibrinogenemia. The fetus was clinically diagnosed with a gene mutation: c.103C>T(p.Arg35Cys). The pregnant woman’s plasma sample was 1 mL, and the sample was also averaged. Divided into 2 parts, as a repeated test.
1.cfRNA提取1.cfRNA extraction
血浆样本的cfRNA提取采用Qigen公司的
Figure PCTCN2018119906-appb-000001
Circulating Nucleic Acid Kit,按照标准操作流程进行提取,最终得到的总RNA溶解到20μL的水中。
The plasma samples were extracted from Qigen using Qigen’s
Figure PCTCN2018119906-appb-000001
The Circulating Nucleic Acid Kit was extracted according to standard operating procedures, and the final total RNA was dissolved in 20 μL of water.
2.cDNA第一链合成2. cDNA first strand synthesis
采用thermofisher公司的第一链cDNA合成试剂盒SuperScript TMIV First-Strand Synthesis System 18091200,具体如下: Thermofisher's first-strand cDNA synthesis kit SuperScript IV First-Strand Synthesis System 18091200 was used, as follows:
先将10mM dNTPs 1μL、50ng/μL随机引物1μL、提取的RNA 1μL加入10μL的RNase free H 2O中,混匀后65℃5分钟,冰上放置1分钟,获得RNA混合物。其中,随机引物为N 6随机引物。 First add 1 μL of 10 mM dNTPs, 1 μL of 50 ng/μL random primers, and 1 μL of extracted RNA to 10 μL of RNase free H 2 O, mix for 5 minutes at 65°C, and place on ice for 1 minute to obtain an RNA mixture. Among them, the random primer is an N 6 random primer.
然后配制20μL的逆转录体系,包括:5×SSIV BUFFER 4μL、100mM的DTT 1μL、Ribonuclease Inhibitor 1μL、200U/μL的SuperScript TM IV Reverse Transcriptase 1μL,以及前一步的RNA混合物13μL。Then prepare a 20μL reverse transcription system, including: 5×SSIV BUFFER 4μL, 100mM DTT 1μL, Ribonuclease Inhibitor 1μL, 200U/μL SuperScript TM IV Reverse Transcriptase 1μL, and the previous step RNA mixture 13μL.
逆转录体系迅速混匀后,23℃10分钟,55℃10分钟。反应完成后加入30μL Agencourt AMPure XP磁珠,获得50μL的磁珠纯化体系,按照说明书进行纯化,最后用20μL蒸馏水溶解DNA。其中,Agencourt AMPure XP磁珠购自美国贝克曼库尔特有限公司。After the reverse transcription system was mixed quickly, 10 minutes at 23°C and 10 minutes at 55°C. After the reaction was completed, 30 μL of Agencourt AMPure XP magnetic beads were added to obtain a 50 μL magnetic bead purification system. Purification was performed according to the instructions. Finally, 20 μL of distilled water was used to dissolve the DNA. Among them, Agencourt AMPure XP magnetic beads were purchased from Beckman Coulter Co., Ltd.
3.特异性PCR扩增3. Specific PCR amplification
PCR扩增酶采用美国kapa公司的KAPA2G Fast Multiplex PCR Kit产品货号KK5801。The PCR amplification enzyme adopts KAPA2G Fast Multiplex PCR Kit product number KK5801 of American kapa company.
PCR反应体系包括:2×kapa聚合酶混合液25μL、10μM引物池5μL、纯化获得的DNA 20μL,总计50μL。The PCR reaction system includes: 2×kapa polymerase mixed solution 25 μL, 10 μM primer pool 5 μL, purified DNA 20 μL, a total of 50 μL.
其中,引物池包括表1所示的特异性引物对和扩增GAPDH的引物对,各引物对的上下游引物在引物池中的浓度都是10μM。The primer pool includes the specific primer pairs shown in Table 1 and the primer pairs for amplifying GAPDH. The concentration of the upstream and downstream primers of each primer pair in the primer pool is 10 μM.
PCR反应条件为:98℃预变性2min,然后进入15个循环:98℃10s、62℃2min、72℃30s,循环结束后72℃延伸5min,即完成PCR扩增。The PCR reaction conditions were: pre-denaturation at 98°C for 2 min, and then entered into 15 cycles: 98°C for 10 s, 62°C for 2 min, and 72°C for 30 s. After the cycle was extended at 72°C for 5 min, PCR amplification was completed.
PCR结束后,向PCR扩增产物中加入1倍体积的Agencourt AMPure XP磁珠,即加入磁珠50μL,按照说明书进行纯化,纯化后用20μL蒸馏水溶解DNA。After the PCR is completed, add 1 volume of Agencourt AMPure XP magnetic beads to the PCR amplification product, that is, add 50 μL of magnetic beads, and purify according to the instructions. After purification, dissolve the DNA in 20 μL of distilled water.
4.通用PCR扩增4. Universal PCR amplification
PCR扩增酶采用美国kapa公司的KAPA2G Fast Multiplex PCR Kit产品,货号KK5801。The PCR amplification enzyme adopts KAPA2G Fast Multiplex PCR Kit product of American kapa company, article number KK5801.
反应体系包括:2×kapa聚合酶混合液25μL、10μM第一引物2.5μL、10μM第二引物2.5μL,特异性PCR扩增的纯化产物获得的DNA 20μL,总计50μL。The reaction system includes: 25×L of 2×kapa polymerase mixture, 2.5 μL of 10 μM first primer, 2.5 μL of 10 μM second primer, and 20 μL of DNA obtained from the purified product of specific PCR amplification, totaling 50 μL.
第一引物为SEQ ID NO.15所示序列,第二引物为SEQ ID NO.16所示序列;并且,第一引物的5’端具有磷酸化修饰。The first primer is the sequence shown in SEQ ID NO.15, the second primer is the sequence shown in SEQ ID NO.16; and, the 5'end of the first primer has a phosphorylation modification.
SEQ ID NO.15:5’-GAACGACATGGCTACGATCCGACTT-3’SEQ ID NO.15: 5’-GAACGACATGGCTACGATCCGACTT-3’
SEQ ID NO.16:SEQ ID NO.16:
5’-TGTGAGCCAAGGAGTTGCTGCGTACATTTGTCTTCCTAAGACCGCTTGGCCTCCGACTT-3’。5'-TGTGAGCCAAGGAGTTGCTGCGTACATTTGTCTTCCTAAGACCGCTTGGCCTCCGACTT-3'.
反应条件为:98℃预变性2min,然后进入15个循环:98℃10s、62℃2min、72℃30s,循环结束后72℃延伸5min。The reaction conditions were: pre-denaturation at 98°C for 2min, then 15 cycles: 98°C for 10s, 62°C for 2min, 72°C for 30s, and 72°C extension for 5min after the end of the cycle.
反应结束后,加入1倍体积的Agencourt AMPure XP磁珠,加入磁珠50μL,按照说明书进行纯化,纯化后用20μL蒸馏水溶解DNA。After the reaction, add 1 volume of Agencourt AMPure XP magnetic beads, add 50 μL of magnetic beads, and purify according to the instructions. After purification, dissolve the DNA in 20 μL of distilled water.
5.上机测序5. Computer sequencing
文库质检合格后,根据华大基因BGISEQ-500平台要求制备DNA纳米球,然后将DNA纳米球上机测序,测序类型双端50bp。DNA纳米球的制备包括:将双链核酸扩增产物经过高温变性为单链核酸,采用核酸介导片段将单链核酸连接成环状单链核酸,将环状单链核酸进行滚环扩增,从而获得DNA纳米球。详细步骤参考华大基因BGISEQ-500平台使用说明。After the library quality test is passed, DNA nanospheres are prepared according to the requirements of BGISEQ-500 platform of BGI, and then the DNA nanospheres are sequenced on the computer. The sequencing type is 50bp double-ended. The preparation of DNA nanospheres includes: denaturing double-stranded nucleic acid amplification products into single-stranded nucleic acids through high temperature, using nucleic acid-mediated fragments to connect single-stranded nucleic acids into circular single-stranded nucleic acids, and performing rolling circle amplification on the circular single-stranded nucleic acids To obtain DNA nanospheres. For detailed steps, please refer to the instruction of BGISEQ-500 platform of BGI.
6.数据分析6. Data analysis
数据下机后去接头,采用cutadapt 1.18软件对fastq文件进行去接头和低质量过滤(具体参见https://github.com/marcelm/cutadapt/)得到过滤后的读段(clean reads);用BWA或其他比对软件将fastq数据比对参考基因组,统计比对率,比对率=比上参考基因组的reads/clean reads;去掉多重比对的reads,对留下唯一比对的reads进行统计,统计在所有目标扩增位置上的reads数,统计各扩增子区域的reads总数,扩增子的覆盖深度,统计目标位点的碱基分布,根据碱基分布得到位点突变信息与疾病数据库进行对比,得到突变检测结果。After the data is off the machine, remove the connector, use cutadapt 1.18 software to remove the fastq file and low-quality filtering (see https://github.com/marcelm/cutadapt/) to obtain filtered reads (clean reads); use BWA Or other comparison software compares fastq data to the reference genome, statistical comparison rate, comparison rate = reads/clean reads of the reference genome; remove reads from multiple comparisons, and count the reads that leave the only comparison, Count the number of reads at all target amplification positions, count the total number of reads in each amplicon region, the coverage depth of the amplicon, count the base distribution of the target site, and obtain site mutation information and disease database according to the base distribution Make a comparison to get the mutation detection result.
四个测序样本的下机数据的原始reads数、去接头reads数、比对reads数和目标区域reads数的统计结果如表4所示。其中,四个测序样本,即正常胎儿的孕妇血浆样本的2个重复测序样本,和患有先天性无纤维蛋白原血症的胎儿的孕妇血浆样本的2个重复测序样本。The statistical results of the number of original reads, the number of de-joined reads, the number of aligned reads, and the number of reads in the target area for the four sequencing samples are shown in Table 4. Among them, four sequencing samples, that is, two repeated sequencing samples of plasma samples of pregnant women with normal fetuses, and two repeated sequencing samples of plasma samples of pregnant women with fetal congenital afibrinogenemia.
表4下机数据统计结果Table 4 Statistical results of offline data
样本编号Sample number 原始readsRaw reads 去接头readsGo to the joint reads 比对readsCompare reads 目标区域readsTarget area reads
1号样本重复1Sample 1 repeats 1 6739467394 6598765987 6499164991 6398163981
1号样本重复2Sample 1 repeats 2 7654276542 7454174541 7392373923 7301273012
2号样本重复1Sample 2 repeats 1 6349563495 6259662596 6146361463 6098260982
2号样本重复2Sample 2 repeats 2 7948279482 7821078210 7712977129 7636576365
表4中,1号样本重复1和1号样本重复2,即正常胎儿的孕妇血浆样本的 2个重复测序样本;2号样本重复1和2号样本重复2,即患有先天性无纤维蛋白原血症的胎儿的孕妇血浆样本的2个重复测序样本。表4的结果显示,本例的胎儿FGA基因检测方法,下机数据的数据利用率可以达到97%,比对率可达到98%,目标区域比例可达98%。In Table 4, sample 1 repeats sample 1 and sample 1 repeats 2, that is, 2 repeated sequencing samples of the plasma samples of pregnant women with normal fetus; sample 2 repeats sample 1 and sample 2 repeats 2, ie suffers from congenital afibrin Two repeated sequencing samples of maternal plasma samples of protozoan fetuses. The results in Table 4 show that in the fetal FGA gene detection method of this example, the data utilization rate of the off-machine data can reach 97%, the comparison rate can reach 98%, and the target area ratio can reach 98%.
对四个测序样本中各个特异性引物对以及GAPDH扩增引物的覆盖率进行统计,结果如图2所示;图2中,横坐标FGA01、FGA02、FGA03、FGA04、FGA05、FGA06依序为第一引物对至第六引物对在四个测序样本中的覆盖率,1-重复1、1-重复2为正常胎儿的孕妇血浆样本的2个重复测序样本,2-重复1、2-重复2为患有先天性无纤维蛋白原血症的胎儿的孕妇血浆样本的2个重复测序样本。图2的结果显示,各扩增子(即引物对)测序后得到的深度分布,在四个样本中不同扩增子之间的深度差异在5倍以内。The coverage of each specific primer pair and GAPDH amplification primers in the four sequencing samples is statistically shown in Figure 2; in Figure 2, the abscissas FGA01, FGA02, FGA03, FGA04, FGA05, FGA06 are in order Coverage of one pair of primers to the sixth pair of primers in four sequencing samples, 1-repeat 1, 1-repeat 2 are two replicate sequencing samples of pregnant fetal plasma samples from normal fetuses, 2-repeat 1, 2-repeat 2 Two repeated sequencing samples of plasma samples from pregnant women with fetuses with congenital afibrinogenemia. The results in Figure 2 show that the depth distribution obtained after sequencing each amplicon (ie, primer pair) is within 5 times the depth difference between different amplicons in the four samples.
对正常胎儿的孕妇血浆样本的2个重复测序样本的重复性进行统计,结果如图3所示,图3的纵坐标为重复1的覆盖率,横坐标为重复2的覆盖率。对患有先天性无纤维蛋白原血症的胎儿的孕妇血浆样本的2个重复测序样本的重复性进行统计,结果如图4所示,图4的纵坐标为重复1的覆盖率,横坐标为重复2的覆盖率。图3和图4的结果显示,同一样本不同实验得到的各个扩增子的深度稳定性很好,如图3所示两个重复的R 2=0.9666,如图4所示两个重复的R 2=0.9797,说明各样本的重复性很好。 The repeatability of two repeated sequencing samples of the plasma samples of pregnant women with normal fetuses was counted. The results are shown in Figure 3. The vertical coordinate of Figure 3 is the coverage of duplicate 1, and the horizontal coordinate is the coverage of duplicate 2. The repeatability of two repeated sequencing samples of the plasma samples of pregnant women with congenital non-fibrinogenemia fetuses was counted. The results are shown in Figure 4. The vertical coordinate of Figure 4 is the coverage of Repeat 1, and the horizontal Coverage for repeat 2. The results of Figure 3 and Figure 4 show that the depth stability of each amplicon obtained from different experiments of the same sample is very good. As shown in Figure 3, the two repeated R 2 = 0.9666, as shown in Figure 4, the two repeated R 2 = 0.9797, indicating that the repeatability of each sample is very good.
通过测序数据统计目标致病位点的碱基覆盖情况,结果如图5所示,图5的横坐标分别为四个测序样本,纵坐标为覆盖深度比例,█表示致病碱基,□表示正常碱基,图5显示了四个测序样本对目标位点c.103C>T的覆盖度。图5的结果显示,两个正常胎儿的孕妇血浆样本能够很好的覆盖致病碱基,而两个患有先天性无纤维蛋白原血症的胎儿的孕妇血浆样本也能够很好的覆盖正常碱基,说明本例的测序数据对目标致病位点有很好的覆盖度。The base coverage of the target pathogenic site was counted by sequencing data. The results are shown in Fig. 5. The abscissa of Fig. 5 is four sequencing samples, the ordinate is the coverage depth ratio, █ indicates the pathogenic base, □ indicates Normal bases. Figure 5 shows the coverage of the four sequencing samples for the target site c.103C>T. The results in Figure 5 show that the plasma samples of pregnant women with two normal fetuses can cover the pathogenic bases well, and the plasma samples of two pregnant women with congenital afibrinogenemia can also cover the normal. Base, indicating that the sequencing data of this example has a good coverage of the target pathogenic site.
对四个测序样本的突变分析结果如表5所示。The results of mutation analysis on four sequencing samples are shown in Table 5.
表5样本检测结果Table 5 Sample test results
样本编号Sample number 临床检测结果Clinical test results 本例检测结果Test results in this case
1号样本重复1Sample 1 repeats 1 无突变No mutation 未检测到突变No mutation detected
1号样本重复2Sample 1 repeats 2 无突变No mutation 未检测到突变No mutation detected
2号样本重复1Sample 2 repeats 1 c.103C>Tc.103C>T c.103C>Tc.103C>T
2号样本重复2Sample 2 repeats 2 c.103C>Tc.103C>T c.103C>Tc.103C>T
表5的结果显示,对于正常胎儿的孕妇血浆样本的2个重复测序样本,都没有检测出突变;而对患有先天性无纤维蛋白原血症的胎儿的孕妇血浆样本的2个重复测序样本都有检出C.103C>T,结果与预期相符,说明本例的胎儿FGA基因检测方法能够准确的检出FGA基因的单碱基突变。The results in Table 5 show that no mutations were detected in two repeated sequencing samples of plasma samples of pregnant women with normal fetuses; and two repeated sequencing samples of plasma samples of pregnant women with fetal congenital afibrinogenemia C.103C>T was detected, and the results were in line with expectations, indicating that the fetal FGA gene detection method in this case can accurately detect single base mutations of the FGA gene.
实施例二Example 2
视隔发育不良是罕见的中线结构前部畸形,也称De-Morsier综合征,包括透明隔缺如及视觉传到通路发育不良,可合并其他脑内发育异常。有部分致病原因是HESX1基因上发生突变导致。在孕妇怀孕过程中,该基因在母体中很少或者不表达,而胎儿特异性表达。因此,本例对胎儿HESX1基因进行检测,以验证本申请的靶基因文库构建方法和胎儿基因检测方法。Optic septal dysplasia is a rare anterior malformation of the midline structure, also known as De-Morsier syndrome, including the absence of transparent septa and dysplasia of the visual transmission pathway, which can be combined with other developmental abnormalities in the brain. Part of the cause is caused by mutations in the HESX1 gene. During pregnancy, the gene is rarely or not expressed in the mother, while the fetus is specifically expressed. Therefore, in this example, the fetal HESX1 gene was tested to verify the target gene library construction method and fetal gene detection method of the present application.
本例设计了7对特异性引物,覆盖常见的HESX1突变位点;本例设计的引物对如表6所示,所覆盖的常见HESX1突变位点如表7所示。另外,本例还采用了一对引物扩增GAPDH作为样本参照,用于实验成功以及样本定量的参照,具体引物序列如表1所示。表6中,HESX-F01和HESX-R01为引物对一,HESX F02和HESX-R02为引物对二,HESX-F03和HESX-R03为引物对三,HESX-F04和HESX-R04为引物对四,HESX F05和HESX-R05为引物对五,HESX-F06和HESX-R06为引物对六,HESX-F07和HESX-R07为引物对七。In this example, 7 pairs of specific primers were designed to cover the common HESX1 mutation sites; the primer pairs designed in this example are shown in Table 6, and the covered common HESX1 mutation sites are shown in Table 7. In addition, this example also uses a pair of primers to amplify GAPDH as a sample reference for reference for successful experiment and sample quantification. The specific primer sequences are shown in Table 1. In Table 6, HESX-F01 and HESX-R01 are primer pair one, HESX F02 and HESX-R02 are primer pair two, HESX-F03 and HESX-R03 are primer pair three, and HESX-F04 and HESX-R04 are primer pair four , HESX F05 and HESX-R05 are primer pair five, HESX-F06 and HESX-R06 are primer pair six, HESX-F07 and HESX-R07 are primer pair seven.
表6特异性引物对Table 6 Specific primer pairs
引物编号Primer number 序列(5’→3’)Sequence (5’→3’) SEQ ID NO.SEQ ID NO.
HESX1-01FHESX1-01F GACCGCTTGGCCTCCGACTTCACTTCTTTAGAGAAAGTTAAGTCGACCGCTTGGCCTCCGACTTCACTTCTTTAGAGAAAGTTAAGTC 1717
HESX1-01RHESX1-01R GACATGGCTACGATCCGACTTTCCTGTCTTAGAAAGTTTTAGCGACATGGCTACGATCCGACTTTCCTGTCTTAGAAAGTTTTAGC 1818
HESX1-02FHESX1-02F GACCGCTTGGCCTCCGACTTGCTGGGCAAGTGTTCATTGACGACCGCTTGGCCTCCGACTTGCTGGGCAAGTGTTCATTGAC 1919
HESX1-02RHESX1-02R GACATGGCTACGATC CGACTTGGAGACATCCTCTCGTGGTCTGCAGACATGGCTACGATCCGACTTGGAGACATCCTCTCGTGGTCTGCA 2020
HESX1-03FHESX1-03F GACCGCTTGGCCTCCGACTTCAGAGGCCAGAGCTGTTGCTCGACCGCTTGGCCTCCGACTTCAGAGGCCAGAGCTGTTGCTC 21twenty one
HESX1-03RHESX1-03R GACATGGCTACGATCCGACTTGACATAAGTTACCATCTTTCCGACATGGCTACGATCCGACTTGACATAAGTTACCATCTTTCC 22twenty two
HESX1-04FHESX1-04F GACCGCTTGGCCTCCGACTTGGGCAGACACCTGCAGCTCATCGACCGCTTGGCCTCCGACTTGGGCAGACACCTGCAGCTCATC 23twenty three
HESX1-04RHESX1-04R GACATGGCTACGATCCGACTTTCTTCGGCCTCTATACCAACTCGACATGGCTACGATCCGACTTTCTTCGGCCTCTATACCAACTC 24twenty four
HESX1-05FHESX1-05F GACCGCTTGGCCTCCGACTTAAGACTGTCTTTGAAAAGAGAGGACCGCTTGGCCTCCGACTTAAGACTGTCTTTGAAAAGAGAG 2525
HESX1-05RHESX1-05R GACATGGCTACGATCCGACTTCTTTTCAGTTTTGCACGCCGAGACATGGCTACGATCCGACTTCTTTTCAGTTTTGCACGCCGA 2626
HESX1-06FHESX1-06F GACCGCTTGGCCTCCGACTTACAGAATCCAGATTTGGTTTCAAGACCGCTTGGCCTCCGACTTACAGAATCCAGATTTGGTTTCAA 2727
HESX1-06RHESX1-06R GACATGGCTACGATCCGACTTTTTAACACTTAATATTTCCACTGGACATGGCTACGATCCGACTTTTTAACACTTAATATTTCCACTG 2828
HESX1-07FHESX1-07F GACCGCTTGGCCTCCGACTTCTTCTAATTGCAGAGCATGAAGAGACCGCTTGGCCTCCGACTTCTTCTAATTGCAGAGCATGAAGA 2929
HESX1-07RHESX1-07R GACATGGCTACGATC CGACTTTCTTTACTATAACTAAAAGTGCCCGACATGGCTACGATCCGACTTTCTTTACTATAACTAAAAGTGCCC 3030
表7特异性引物对覆盖的HESX1基因变位点Table 7 HESX1 gene mutation sites covered by specific primer pairs
突变编号Mutation number 突变mutation HG38上的位置Location on HG38
1818 c.18G>C(p.Gln6His)c.18G>C(p.Gln6His) chr3:57199901chr3:57199901
1919 c.313T>G(p.Trp105Gly)c.313T>G(p.Trp105Gly) chr3:57198797chr3:57198797
2020 c.357+2T>Cc.357+2T>C chr3:57198751chr3:57198751
21twenty one c.445G>A(p.Glu149Lys)c.445G>A(p.Glu149Lys) chr3:57198405chr3:57198405
22twenty two c.450_451delCA(p.Asp150Glufs)c.450_451delCA(p.Asp150Glufs) chr3:57198399-57198400chr3:57198399-57198400
23twenty three c.478C>T(p.Arg160Cys)c.478C>T(p.Arg160Cys) chr3:57198277chr3:57198277
24twenty four c.509C>T(p.Ser170Leu)c.509C>T(p.Ser170Leu) chr3:57198246chr3:57198246
2525 c.511_512delCA(p.Gln171Valfs)c.511_512delCA(p.Gln171Valfs) chr3:57198243-57198244chr3:57198243-57198244
2626 c.77T>C(p.Ile26Thr)c.77T>C(p.Ile26Thr) chr3:57199842chr3:57199842
本例的实验样本:怀有正常胎儿的孕妇血浆样本1mL,编号为3号样本,将该样本平均分为2份,作为重复试验。经过MRI检测发现所怀胎儿患有视隔发育不良,且穿刺活检为阳性的孕妇,该胎儿经临床诊断其基因突变情况为:ESX1,c.509C>T,p.Ser170Leu;取该孕妇血浆样本1mL,编号为4号样本,同样将样本平均分为2份,作为重复试验。The experimental sample in this case: 1 mL of a plasma sample of a pregnant woman carrying a normal fetus, sample No. 3, which was divided into 2 equal parts as a repeated test. After MRI, the pregnant woman was found to have a dysplasia of the septum and a positive biopsy. The fetus was clinically diagnosed with a gene mutation: ESX1, c.509C>T, p.Ser170Leu; take a plasma sample of the pregnant woman 1mL, No. 4 sample, equally divide the sample into 2 parts, as a repeated test.
本例使用表6的引物,采用实施例一相同的文库构建方法进行测序文库制备,文库制备过程中通用引物与实施例一相同。然后采用与实施例一相同的测序平台进行上机测序,采用与实施例一相同的方法进行数据分析。In this example, the primers in Table 6 were used to prepare the sequencing library using the same library construction method as in Example 1. The general primers used in the library preparation process were the same as in Example 1. Then use the same sequencing platform as in Example 1 for on-machine sequencing, and use the same method as in Example 1 for data analysis.
本例通过测序数据统计目标致病位点的碱基覆盖情况,结果如图6所示,图6的横坐标分别为四个测序样本,3号样本重复1和3号样本重复2,即正常胎儿的孕妇血浆样本的2个重复测序样本;4号样本重复1和4号样本重复2,即患有视隔发育不良的胎儿的孕妇血浆样本的2个重复测序样本;纵坐标为覆盖深度比例,█表示致病碱基,□表示正常碱基,图6显示了四个测序样本对目标位点c.509C>T的覆盖度。图6的结果显示,两个正常胎儿的孕妇血浆样本能够很好的覆盖致病碱基,而两个患有视隔发育不良的胎儿的孕妇血浆样本也能够很好的覆盖正常碱基,说明本例的测序数据对目标致病位点有很好的覆盖度。In this example, the sequencing data was used to count the base coverage of the target disease site. The results are shown in Figure 6. The abscissa of Figure 6 is four sequencing samples, sample 3 repeats 1 and sample 3 repeats 2, which is normal. 2 repeated sequencing samples of fetal maternal plasma samples; sample 4 repeats 1 and sample 4 repeats 2, that is, 2 repeat sequencing samples of maternal plasma samples with fetal dysplasia; the ordinate is the depth of coverage ratio , █ indicates a pathogenic base, □ indicates a normal base, and Figure 6 shows the coverage of the four sequencing samples for the target site c.509C>T. The results in Figure 6 show that the plasma samples of pregnant women with two normal fetuses can well cover the pathogenic bases, and the plasma samples of two pregnant women with fetal dysplasia can also cover the normal bases well. The sequencing data in this case has a good coverage of the target pathogenic site.
对四个测序样本的突变分析结果如表8所示。The results of mutation analysis on four sequencing samples are shown in Table 8.
表8样本检测结果Table 8 Sample test results
样本编号Sample number 临床检测结果Clinical test results 本例检测结果Test results in this case
3号样本重复1Sample 3 repeats 1 无突变No mutation 未检测到突变No mutation detected
3号样本重复2Sample 3 repeats 2 无突变No mutation 未检测到突变No mutation detected
4号样本重复1Sample 4 repeats 1 c.509C>Tc.509C>T c.509C>Tc.509C>T
4号样本重复2Sample 4 repeats 2 c.509C>Tc.509C>T c.509C>Tc.509C>T
表8的结果显示,对于正常胎儿的孕妇血浆样本的2个重复测序样本,都没有检测出突变;而对视隔发育不良胎儿的孕妇血浆样本的2个重复测序样本都有检出c.509C>T,结果与预期相符,说明本例的胎儿HESX1基因检测方法能够准确的检出HESX1基因的单碱基突变。The results in Table 8 show that no mutations were detected in two repeated sequencing samples of plasma samples of pregnant women with normal fetuses; and two. >T, the result is as expected, indicating that the fetal HESX1 gene detection method in this example can accurately detect single-base mutations of the HESX1 gene.
本申请的靶基因文库构建方法以及基于构建的靶基因文库的胎儿基因检测方法,不仅能够对实施例一的FGA基因的单碱基突变和实施例二的HESX1基因的单碱基突变进行检测,同样的,对于与FGA基因和HESX1基因情况类似的,胎儿特异性表达,而母亲不表达或表达量极少的基因,如表1所示的基因,也能进行检测,只要按照本申请的发明思路设计相应的覆盖各基因突变位点的特异性引物对,对cDNA进行第一轮PCR扩增即可;特异性扩增之前的cfRNA的提取、cDNA第一链合成,以及之后的通用PCR扩增、上机测序和数据分析都可以参考本申请实施例一的FGA基因检测方法。The target gene library construction method of the present application and the fetal gene detection method based on the constructed target gene library can not only detect the single base mutation of the FGA gene in Example 1 and the single base mutation of the HESX1 gene in Example 2, Similarly, genes similar to those of the FGA gene and the HESX1 gene, which are specifically expressed by the fetus but not expressed by the mother or expressed in very small amounts, such as the genes shown in Table 1, can also be tested as long as the invention of the present application The idea is to design the corresponding specific primer pairs covering the mutation sites of each gene, and then perform the first round of PCR amplification on the cDNA; the extraction of cfRNA before the specific amplification, the synthesis of the first strand of cDNA, and the subsequent general PCR amplification For augmentation, computer sequencing, and data analysis, refer to the FGA gene detection method in Example 1 of the present application.
以上内容是结合具体的实施方式对本申请所作的进一步详细说明,不能认定本申请的具体实施只局限于这些说明。对于本申请所属技术领域的普通技术人员来说,在不脱离本申请构思的前提下,还可以做出若干简单推演或替换,都应当视为属于本申请的保护范围。The above content is a further detailed description of this application in conjunction with specific embodiments, and it cannot be assumed that the specific implementation of this application is limited to these descriptions. For a person of ordinary skill in the technical field to which this application belongs, without deviating from the concept of this application, several simple deductions or replacements can be made, which should be regarded as falling within the protection scope of this application.

Claims (35)

  1. 一种靶基因文库的构建方法,其特征在于:包括以下步骤,A method for constructing a target gene library, characterized in that it includes the following steps,
    从来源于孕妇的生物样本中提取游离RNA,所述游离RNA包含胎儿游离RNA;Extract free RNA from biological samples derived from pregnant women, the free RNA including fetal free RNA;
    将提取的游离RNA逆转录生成第一链cDNA;Reverse transcription of the extracted free RNA to generate the first strand cDNA;
    采用特异性引物对第一链cDNA进行扩增,获得特异性扩增产物,其中,所述特异性引物能够特异性扩增所述靶基因,所述靶基因为胎儿与母体的差异表达基因,所述特异性引物的上游引物和/或下游引物的5’端序列与通用引物的至少一部分序列相同;Amplify the first-strand cDNA with specific primers to obtain specific amplification products, wherein the specific primers can specifically amplify the target gene, which is a differentially expressed gene of the fetus and the mother, The sequence of the 5'end of the upstream primer and/or the downstream primer of the specific primer is the same as at least a part of the sequence of the universal primer;
    采用通用引物对特异性扩增产物进行第二轮PCR扩增,获得所述靶基因文库。The specific amplification product is subjected to a second round of PCR amplification using a universal primer to obtain the target gene library.
  2. 根据权利要求1所述的方法,其特征在于:所述差异表达基因为胎儿特异性表达,而母体几乎不表达的基因。The method according to claim 1, characterized in that the differentially expressed genes are genes that are specifically expressed by the fetus and hardly expressed by the mother.
  3. 根据权利要求2所述的方法,其特征在于:所述差异表达基因为表1所示基因中的至少一个。The method according to claim 2, wherein the differentially expressed genes are at least one of the genes shown in Table 1.
  4. 根据权利要求1所述的方法,其特征在于:所述生物样本为外周血或尿液。The method according to claim 1, wherein the biological sample is peripheral blood or urine.
  5. 根据权利要求1所述的方法,其特征在于:所述将提取的游离RNA逆转录生成第一链cDNA,具体包括采用随机引物对游离RNA进行逆转录,获得第一链cDNA;优选地,所述随机引物为N6随机引物。The method according to claim 1, characterized in that: the reverse transcription of the extracted free RNA to generate the first strand cDNA, which specifically comprises reverse transcription of the free RNA using random primers to obtain the first strand cDNA; preferably, The random primer is N6 random primer.
  6. 根据权利要求1所述的方法,其特征在于:所述特异性引物为第一引物对至第六引物对中的至少一组;The method according to claim 1, wherein the specific primer is at least one of a first primer pair to a sixth primer pair;
    所述第一引物对的上下游引物分别为SEQ ID NO.1和SEQ ID NO.2所示序列,第二引物对的上下游引物分别为SEQ ID NO.3和SEQ ID NO.4所示序列,第三引物对的上下游引物分别为SEQ ID NO.5和SEQ ID NO.6所示序列,第四引物对的上下游引物分别为SEQ ID NO.7和SEQ ID NO.8所示序列,第五引物对的上下游引物分别为SEQ ID NO.9和SEQ ID NO.10所示序列,第六引物对的上下游引物分别为SEQ ID NO.11和SEQ ID NO.12所示序列。The upstream and downstream primers of the first primer pair are the sequences shown in SEQ ID NO. 1 and SEQ ID NO. 2, respectively, and the upstream and downstream primers of the second primer pair are shown in SEQ ID NO. 3 and SEQ ID NO. 4. Sequence, the upstream and downstream primers of the third primer pair are the sequences shown in SEQ ID NO.5 and SEQ ID NO.6, and the upstream and downstream primers of the fourth primer pair are shown in SEQ ID NO.7 and SEQ ID NO.8, respectively Sequence, the upstream and downstream primers of the fifth primer pair are the sequences shown in SEQ ID NO.9 and SEQ ID NO.10, and the upstream and downstream primers of the sixth primer pair are shown in SEQ ID NO.11 and SEQ ID NO.12, respectively sequence.
  7. 根据权利要求1所述的方法,其特征在于:特异性引物为引物对一至引物对七中的至少一组;The method according to claim 1, wherein the specific primer is at least one of primer pair one to primer pair seven;
    所述引物对一的上下游引物分别为SEQ ID NO.17和SEQ ID NO.18所示序列,引物对二的上下游引物分别为SEQ ID NO.19和SEQ ID NO.20所示序列,引物对三的上下游引物分别为SEQ ID NO.21和SEQ ID NO.22所示序列,引物对四的上下游引物分别为SEQ ID NO.23和SEQ ID NO.24所示序列,引物对五的上下游引物分别为SEQ ID NO.25和SEQ ID NO.26所示序列,引物对六的上 下游引物分别为SEQ ID NO.27和SEQ ID NO.28所示序列,引物对七的上下游引物分别为SEQ ID NO.29和SEQ ID NO.30所示序列。The upstream and downstream primers of primer pair 1 are the sequences shown in SEQ ID NO. 17 and SEQ ID NO. 18, respectively, and the upstream and downstream primers of primer pair 2 are the sequences shown in SEQ ID NO. 19 and SEQ ID NO. 20, respectively. The upstream and downstream primers of primer pair 3 are the sequences shown in SEQ ID NO.21 and SEQ ID NO.22, respectively. The upstream and downstream primers of primer pair 4 are the sequences shown in SEQ ID NO.23 and SEQ ID NO.24, respectively. The five upstream and downstream primers are the sequences shown in SEQ ID NO. 25 and SEQ ID NO. 26, respectively. The upstream and downstream primers of the primer pair 6 are the sequences shown in SEQ ID NO. 27 and SEQ ID NO. 28, respectively, and the primer pair 7 is The upstream and downstream primers are SEQ ID NO.29 and SEQ ID NO.30.
  8. 根据权利要求1-7任一项所述的方法,其特征在于:还包括在采用特异性引物对第一链cDNA进行扩增的同时,采用参照引物对第一链cDNA进行扩增,所述参照引物能够特异性扩增看家基因。The method according to any one of claims 1-7, further comprising: using specific primers to amplify the first strand cDNA while using reference primers to amplify the first strand cDNA, said The reference primer can specifically amplify the housekeeping gene.
  9. 根据权利要求8所述的方法,其特征在于:所述看家基因为GAPDH基因。The method according to claim 8, wherein the housekeeping gene is the GAPDH gene.
  10. 根据权利要求9所述的方法,其特征在于:所述参照引物的上下游引物分别为SEQ ID NO.13和SEQ ID NO.14所示序列。The method according to claim 9, wherein the upstream and downstream primers of the reference primer are the sequences shown in SEQ ID NO. 13 and SEQ ID NO. 14, respectively.
  11. 根据权利要求1-7任一项所述的方法,其特征在于:所述通用引物包括第一引物和第二引物,所述第一引物为SEQ ID NO.15所示序列,所述第二引物为SEQ ID NO.16所示序列;并且,优选地,第一引物的5’端具有磷酸化修饰。The method according to any one of claims 1-7, wherein the universal primer includes a first primer and a second primer, the first primer is the sequence shown in SEQ ID NO.15, and the second The primer is the sequence shown in SEQ ID NO. 16; and, preferably, the 5'end of the first primer has a phosphorylation modification.
  12. 根据权利要求1-7任一项所述的方法,其特征在于:还包括在第二轮PCR扩增后,将第二轮PCR扩增产物变性为单链核酸,并采用核酸介导片段将所述单链核酸连接成环状核酸,进而获得环状靶基因文库,其中,所述核酸介导片段能够与所述单链核酸的两个末端通过碱基互补配对原则进行结合。The method according to any one of claims 1-7, further comprising: after the second round of PCR amplification, denaturing the second round of PCR amplification products into single-stranded nucleic acids, and using nucleic acid-mediated fragments to The single-stranded nucleic acid is connected into a circular nucleic acid to obtain a circular target gene library, wherein the nucleic acid-mediated fragment can be combined with the two ends of the single-stranded nucleic acid through the base complementary pairing principle.
  13. 一种胎儿基因的检测方法,其特征在于:包括以下步骤,A method for detecting fetal genes, which is characterized by the following steps:
    采用权利要求1-12任一项所述的方法进行靶基因文库构建,获得所述靶基因文库;Using the method of any one of claims 1-12 to construct a target gene library to obtain the target gene library;
    对所述靶基因文库进行测序,获得由多个测序数据构成的测序结果;Sequencing the target gene library to obtain sequencing results composed of multiple sequencing data;
    对所述测序结果进行分析,获得胎儿的基因信息。The sequencing results are analyzed to obtain fetal genetic information.
  14. 根据权利要求13所述的检测方法,其特征在于:所述胎儿的基因信息包括隐性遗传单基因病的突变信息、胎儿基因型信息、基因结构变异信息的至少一种。The detection method according to claim 13, wherein the genetic information of the fetus includes at least one of mutation information of recessive genetic single gene disease, fetal genotype information, and genetic structure variation information.
  15. 根据权利要求13或14所述的检测方法,其特征在于:对所述测序结果进行分析,具体包括以下步骤,The detection method according to claim 13 or 14, wherein the analysis of the sequencing result specifically includes the following steps,
    对所述测序数据进行过滤;Filtering the sequencing data;
    将过滤后的测序数据比对到参考基因组,保留唯一对比上参考基因组的测序数据;Compare the filtered sequencing data to the reference genome, and retain the unique sequencing data of the reference genome;
    基于所述唯一对比上参考基因组的测序数据,统计靶基因的碱基分布情况,获得靶基因突变信息,进而获得胎儿的基因信息。Based on the unique comparison of the sequencing data of the reference genome, the base distribution of the target gene is counted to obtain mutation information of the target gene, and then obtain the genetic information of the fetus.
  16. 根据权利要求15所述的检测方法,其特征在于:对所述测序结果进行分析,进一步包括,将所述靶基因突变信息与疾病数据库进行对比,获得胎儿的隐性遗传单基因病的突变信息。The detection method according to claim 15, characterized in that: analyzing the sequencing result, further comprising comparing the mutation information of the target gene with a disease database to obtain mutation information of the recessive genetic single gene disease of the fetus .
  17. 一种胎儿基因的检测装置,其特征在于:包括游离RNA提取模块、逆转录模块、靶基因扩增模块、靶基因文库生成模块、测序模块和分析模块;A fetal gene detection device, characterized in that it includes a free RNA extraction module, a reverse transcription module, a target gene amplification module, a target gene library generation module, a sequencing module and an analysis module;
    所述游离RNA提取模块,用于从来源于孕妇的生物样本中提取游离RNA,所述游离RNA包含胎儿游离RNA;The free RNA extraction module is used to extract free RNA from biological samples derived from pregnant women, the free RNA includes fetal free RNA;
    所述逆转录模块,用于将提取的游离RNA逆转录生成第一链cDNA;The reverse transcription module is used for reverse transcription of the extracted free RNA to generate the first strand cDNA;
    所述靶基因扩增模块,用于采用特异性引物对第一链cDNA进行扩增,获得特异性扩增产物,特异性引物能够特异性扩增所述靶基因,所述靶基因为胎儿与母体的差异表达基因,所述特异性引物的上游引物和/或下游引物的5’端序列与通用引物的至少一部分序列相同;The target gene amplification module is used to amplify the first-strand cDNA with specific primers to obtain specific amplification products. The specific primers can specifically amplify the target gene, and the target gene is the fetus and The differentially expressed genes of the maternal, the sequence of the 5'end of the upstream primer and/or the downstream primer of the specific primer is the same as at least a part of the sequence of the universal primer;
    所述靶基因文库生成模块,用于采用通用引物对特异性扩增产物进行第二轮PCR扩增,获得所述靶基因文库;The target gene library generation module is used to perform a second round of PCR amplification on the specific amplification product using universal primers to obtain the target gene library;
    所述测序模块,用于对所述靶基因文库进行测序,获得由多个测序数据构成的测序结果;The sequencing module is used to sequence the target gene library to obtain sequencing results composed of multiple sequencing data;
    所述分析模块,用于对所述测序结果进行分析,获得胎儿的基因信息。The analysis module is used to analyze the sequencing results to obtain fetal genetic information.
  18. 根据权利要求17所述的检测装置,其特征在于:所述游离RNA提取模块中,所述生物样本为外周血或尿液。The detection device according to claim 17, wherein in the free RNA extraction module, the biological sample is peripheral blood or urine.
  19. 根据权利要求17所述的检测装置,其特征在于:所述逆转录模块中,将提取的游离RNA逆转录生成第一链cDNA,具体包括采用随机引物对游离RNA进行逆转录,获得第一链cDNA;优选地,所述随机引物为N6随机引物。The detection device according to claim 17, wherein the reverse transcription module reversely transcribes the extracted free RNA to generate a first strand cDNA, which specifically comprises performing reverse transcription of the free RNA using random primers to obtain the first strand cDNA; preferably, the random primer is an N6 random primer.
  20. 根据权利要求17所述的检测装置,其特征在于:所述靶基因扩增模块中,所述差异表达基因为胎儿特异性表达,而母体几乎不表达的基因。The detection device according to claim 17, wherein in the target gene amplification module, the differentially expressed genes are genes that are specifically expressed by the fetus and hardly expressed by the mother.
  21. 根据权利要求20所述的检测装置,其特征在于:所述差异表达基因为表1所示基因中的至少一个。The detection device according to claim 20, wherein the differentially expressed genes are at least one of the genes shown in Table 1.
  22. 根据权利要求17所述的检测装置,其特征在于:所述分析模块中,胎儿的基因信息包括隐性遗传单基因病的突变信息、胎儿基因型信息、基因结构变异信息的至少一种。The detection device according to claim 17, wherein in the analysis module, the genetic information of the fetus includes at least one of mutation information of the recessive single gene disease, fetal genotype information, and genetic structure variation information.
  23. 根据权利要求17-22任一项所述的检测装置,其特征在于:所述分析模块进一步包括过滤单元、参考基因组比对单元和统计单元;The detection device according to any one of claims 17-22, wherein the analysis module further includes a filtering unit, a reference genome comparison unit, and a statistical unit;
    所述过滤单元,用于对所述测序数据进行过滤;The filtering unit is used to filter the sequencing data;
    所述参考基因组比对单元,用于将过滤后的测序数据比对到参考基因组,保留唯一对比上参考基因组的测序数据;The reference genome comparison unit is used to compare the filtered sequencing data to the reference genome, and retain the unique comparison of the sequencing data of the reference genome;
    所述统计单元,用于基于所述唯一对比上参考基因组的测序数据,统计靶基因的碱基分布情况,获得靶基因突变信息,进而获得胎儿的基因信息。The statistical unit is used to count the base distribution of the target gene based on the sequencing data of the unique reference genome, to obtain mutation information of the target gene, and then obtain genetic information of the fetus.
  24. 根据权利要求23所述的检测装置,其特征在于:所述分析模块进一步包括疾病数据库对比单元,所述疾病数据库对比单元用于将所述靶基因突变信息与疾病数据库进行对比,获得胎儿的隐性遗传单基因病的突变信息。The detection device according to claim 23, wherein the analysis module further comprises a disease database comparison unit, the disease database comparison unit is used to compare the target gene mutation information with the disease database to obtain the hidden fetus Information on mutations in sexually inherited single gene diseases.
  25. 根据权利要求1-12任一项所述的方法、权利要求13-16任一项所述的检测方法、或权利要求17-24任一项所述的检测装置在隐性遗传的单基因病检测、或结构变异检测中的应用。The method according to any one of claims 1-12, the detection method according to any one of claims 13-16, or the detection device according to any one of claims 17-24 in recessively inherited monogenic diseases Detection, or application of structural variation detection.
  26. 一种用于胎儿FGA基因检测的试剂,其特征在于:包括用于扩增FGA基因的特异性引物,所述特异性引物包括第一引物对至第六引物对中的至少一组,第一引物对的上下游引物分别为SEQ ID NO.1和SEQ ID NO.2所示序列,第二引物对的上下游引物分别为SEQ ID NO.3和SEQ ID NO.4所示序列,第三引物对的上下游引物分别为SEQ ID NO.5和SEQ ID NO.6所示序列,第四引物对的上下游引物分别为SEQ ID NO.7和SEQ ID NO.8所示序列,第五引物对的上下游引物分别为SEQ ID NO.9和SEQ ID NO.10所示序列,第六引物对的上下游引物分别为SEQ ID NO.11和SEQ ID NO.12所示序列。A reagent for fetal FGA gene detection, characterized in that it includes a specific primer for amplifying the FGA gene, the specific primer includes at least one of the first primer pair to the sixth primer pair, the first The upstream and downstream primers of the primer pair are the sequences shown in SEQ ID NO.1 and SEQ ID NO. 2, respectively. The upstream and downstream primers of the second primer pair are the sequences shown in SEQ ID NO. 3 and SEQ ID NO. 4, respectively. The upstream and downstream primers of the primer pair are the sequences shown in SEQ ID NO. 5 and SEQ ID NO. 6, respectively. The upstream and downstream primers of the fourth primer pair are the sequences shown in SEQ ID NO. 7 and SEQ ID NO. 8, respectively. The upstream and downstream primers of the primer pair are the sequences shown in SEQ ID NO. 9 and SEQ ID NO. 10 respectively, and the upstream and downstream primers of the sixth primer pair are the sequences shown in SEQ ID NO. 11 and SEQ ID NO. 12 respectively.
  27. 根据权利要求26所述的试剂,其特征在于:还包括用于扩增看家基因GAPDH的参照引物,所述参照引物的上下游引物分别为SEQ ID NO.13和SEQ ID NO.14所示序列。The reagent according to claim 26, further comprising a reference primer for amplifying the housekeeping gene GAPDH, the upstream and downstream primers of the reference primer are shown in SEQ ID NO. 13 and SEQ ID NO. 14 respectively sequence.
  28. 根据权利要求26或27所述的试剂,其特征在于:还包括用于构建测序文库的通用引物,所述通用引物包括第一引物和第二引物,所述第一引物为SEQ ID NO.15所示序列,所述第二引物为SEQ ID NO.16所示序列;并且,优选地,第一引物的5’端具有磷酸化修饰。The reagent according to claim 26 or 27, further comprising a universal primer for constructing a sequencing library, the universal primer includes a first primer and a second primer, and the first primer is SEQ ID NO. 15 In the sequence shown, the second primer is the sequence shown in SEQ ID NO. 16; and, preferably, the 5'end of the first primer has a phosphorylation modification.
  29. 一种用于胎儿FGA基因检测的试剂盒,其特征在于:所述试剂盒中包括权利要求26-28任一项所述的试剂。A kit for detecting fetal FGA genes, characterized in that the kit includes the reagent according to any one of claims 26-28.
  30. 根据权利要求29所述的试剂盒,其特征在于:所述试剂盒中还包括游离RNA提取试剂、逆转录试剂、PCR扩增试剂和核酸纯化试剂中的至少一种。The kit according to claim 29, wherein the kit further comprises at least one of free RNA extraction reagents, reverse transcription reagents, PCR amplification reagents, and nucleic acid purification reagents.
  31. 一种用于胎儿HESX1基因检测的试剂,其特征在于:包括用于扩增HESX1基因的特异性引物,所述特异性引物包括引物对一至引物对七中的至少一组,引物对一的上下游引物分别为SEQ ID NO.17和SEQ ID NO.18所示序列,引物对二的上下游引物分别为SEQ ID NO.19和SEQ ID NO.20所示序列,引物对三的上下游引物分别为SEQ ID NO.21和SEQ ID NO.22所示序列,引物对四的上下游引物分别为SEQ ID NO.23和SEQ ID NO.24所示序列,引物对五的上下游引物分别为SEQ ID NO.25和SEQ ID NO.26所示序列,引物对六的上下游引物分别为SEQ ID NO.27和SEQ ID NO.28所示序列,引物对七的上下游引物 分别为SEQ ID NO.29和SEQ ID NO.30所示序列。A reagent for fetal HESX1 gene detection, characterized in that it includes a specific primer for amplifying the HESX1 gene, the specific primer includes at least one of primer pair one to primer pair seven, and the primer pair one The downstream primers are the sequences shown in SEQ ID NO.17 and SEQ ID NO.18, the upstream and downstream primers of primer pair two are the sequences shown in SEQ ID NO.19 and SEQ ID NO.20, and the upstream and downstream primers of primer pair three are The sequences shown in SEQ ID NO.21 and SEQ ID NO.22 are respectively, the upstream and downstream primers of primer pair 4 are the sequences shown in SEQ ID NO.23 and SEQ ID NO.24, and the upstream and downstream primers of primer pair 5 are respectively SEQ ID NO.25 and SEQ ID NO.26, the upstream and downstream primers of primer pair 6 are SEQ ID NO.27 and SEQ ID NO.28, respectively, and the upstream and downstream primers of primer pair 7 are SEQ ID NO.29 and SEQ ID NO.30 shown in the sequence.
  32. 根据权利要求31所述的试剂,其特征在于:还包括用于扩增看家基因GAPDH的参照引物,所述参照引物的上下游引物分别为SEQ ID NO.13和SEQ ID NO.14所示序列。The reagent according to claim 31, further comprising a reference primer for amplifying the housekeeping gene GAPDH, the upstream and downstream primers of the reference primer are shown in SEQ ID NO. 13 and SEQ ID NO. 14 respectively sequence.
  33. 根据权利要求31或32所述的试剂,其特征在于:还包括用于构建测序文库的通用引物,所述通用引物包括第一引物和第二引物,所述第一引物为SEQ ID NO.15所示序列,所述第二引物为SEQ ID NO.16所示序列;并且,优选地,第一引物的5’端具有磷酸化修饰。The reagent according to claim 31 or 32, further comprising a universal primer for constructing a sequencing library, the universal primer includes a first primer and a second primer, and the first primer is SEQ ID NO. 15 In the sequence shown, the second primer is the sequence shown in SEQ ID NO. 16; and, preferably, the 5'end of the first primer has a phosphorylation modification.
  34. 一种用于胎儿HESX1基因检测的试剂盒,其特征在于:所述试剂盒中包括权利要求31-33任一项所述的试剂。A kit for detecting fetal HESX1 gene, characterized in that the kit includes the reagent according to any one of claims 31-33.
  35. 根据权利要求34所述的试剂盒,其特征在于:所述试剂盒中还包括游离RNA提取试剂、逆转录试剂、PCR扩增试剂和核酸纯化试剂中的至少一种。The kit according to claim 34, wherein the kit further comprises at least one of free RNA extraction reagents, reverse transcription reagents, PCR amplification reagents, and nucleic acid purification reagents.
PCT/CN2018/119906 2018-12-07 2018-12-07 Method for constructing target gene library, detection device and application thereof WO2020113577A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2018/119906 WO2020113577A1 (en) 2018-12-07 2018-12-07 Method for constructing target gene library, detection device and application thereof
CN201880098877.5A CN112996926A (en) 2018-12-07 2018-12-07 Construction method, detection device and application of target gene library

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2018/119906 WO2020113577A1 (en) 2018-12-07 2018-12-07 Method for constructing target gene library, detection device and application thereof

Publications (1)

Publication Number Publication Date
WO2020113577A1 true WO2020113577A1 (en) 2020-06-11

Family

ID=70973391

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/119906 WO2020113577A1 (en) 2018-12-07 2018-12-07 Method for constructing target gene library, detection device and application thereof

Country Status (2)

Country Link
CN (1) CN112996926A (en)
WO (1) WO2020113577A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113897427A (en) * 2021-11-16 2022-01-07 深圳知因细胞生物科技有限公司 Reagent and kit for detecting fetal FGA gene mutation

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016181128A1 (en) * 2015-05-11 2016-11-17 Genefirst Ltd Methods, compositions, and kits for preparing sequencing library
CN106755485A (en) * 2017-01-19 2017-05-31 人和未来生物科技(长沙)有限公司 The poor detection in Gene Mutation library constructing method in noninvasive prenatal foetal α types ground, detection method and kit
CN107513570A (en) * 2017-09-28 2017-12-26 上海思路迪生物医学科技有限公司 The method and kit of more target library constructions are uniformed based on high-flux sequence

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6664056B2 (en) * 2000-10-17 2003-12-16 The Chinese University Of Hong Kong Non-invasive prenatal monitoring
AU2015203579B2 (en) * 2010-01-19 2017-12-21 Verinata Health, Inc. Sequencing methods and compositions for prenatal diagnoses
ES2704701T3 (en) * 2010-01-19 2019-03-19 Verinata Health Inc New protocol for preparing sequencing libraries
CN102140497A (en) * 2010-01-29 2011-08-03 中山大学达安基因股份有限公司 Method for detecting RNA-SNP by reverse transcription-multiplex ligation probe amplification technology
CN102127818A (en) * 2010-12-15 2011-07-20 张康 Method for creating fetus DNA library by utilizing peripheral blood of pregnant woman
CN102329876B (en) * 2011-10-14 2014-04-02 深圳华大基因科技有限公司 Method for measuring nucleotide sequence of disease associated nucleic acid molecules in sample to be detected
CN105087756A (en) * 2014-04-23 2015-11-25 北京贝瑞和康生物技术有限公司 Method and kit for non-invasive measurement on fetus deaf pathogenic gene mutation
CN104232777B (en) * 2014-09-19 2016-08-24 天津华大基因科技有限公司 Determine the method and device of fetal nucleic acid content and chromosomal aneuploidy simultaneously
WO2016059601A1 (en) * 2014-10-16 2016-04-21 Group Ovo Inc. Non-invasive methods for detection of genetic abnormalities in an unborn fetus, and primers, probes and kits for uses thereof
US20170321270A1 (en) * 2016-05-06 2017-11-09 Counsyl, Inc. Noninvasive prenatal diagnostic methods
CN106755484A (en) * 2017-01-19 2017-05-31 人和未来生物科技(长沙)有限公司 The poor detection in Gene Mutation library constructing method in noninvasive prenatal foetal α SEA types ground, detection method and kit
CN107012225B (en) * 2017-04-20 2020-10-09 司法鉴定科学研究院 STR locus detection kit and detection method based on high-throughput sequencing
CN108642160B (en) * 2018-05-16 2022-03-11 广州市达瑞生物技术股份有限公司 Method and kit for detecting fetal thalassemia pathogenic gene

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016181128A1 (en) * 2015-05-11 2016-11-17 Genefirst Ltd Methods, compositions, and kits for preparing sequencing library
CN106755485A (en) * 2017-01-19 2017-05-31 人和未来生物科技(长沙)有限公司 The poor detection in Gene Mutation library constructing method in noninvasive prenatal foetal α types ground, detection method and kit
CN107513570A (en) * 2017-09-28 2017-12-26 上海思路迪生物医学科技有限公司 The method and kit of more target library constructions are uniformed based on high-flux sequence

Also Published As

Publication number Publication date
CN112996926A (en) 2021-06-18

Similar Documents

Publication Publication Date Title
JP7397498B2 (en) Transcriptome analysis of maternal plasma by massively parallel RNA sequencing
JP6585117B2 (en) Diagnosis of fetal chromosomal aneuploidy
EP2772549B1 (en) Method for detecting genetic variation
CN102625854B (en) method for detecting chromosomal aneuploidy
Beulen et al. Clinical utility of non‐invasive prenatal testing in pregnancies with ultrasound anomalies
KR102605749B1 (en) Circulating rna signatures specific to preeclamsia
WO2009105531A1 (en) Methods for cell genotyping
KR20200055151A (en) Diagnosing fetal chromosomal aneuploidy using massively parallel genomic sequencing
CN105555970B (en) Method and system for simultaneous haplotyping and chromosomal aneuploidy detection
Sato et al. Analysis of chromosome microstructures in products of conception associated with recurrent miscarriage
CN113249458A (en) Method and kit for evaluating and predicting placenta-derived diseases
WO2020113577A1 (en) Method for constructing target gene library, detection device and application thereof
CN105256059A (en) Application of TUBA3D gene in preparing keratoconus diagnosis product
Yang et al. Simultaneous detection of fetal aneuploidy, de novo FGFR3 mutations and paternally derived β‐thalassemia by a novel method of noninvasive prenatal testing
JP2016518811A (en) Determination of the fetal genome in multiple pregnancy
Cai et al. Classifying and evaluating fetuses with ventriculomegaly in genetic etiologic studies
CN111560424A (en) Detectable target nucleic acid, probe, method for determining fetal F8 gene haplotype and application
Du et al. Unique dual indexing PCR reduces chimeric contamination and improves mutation detection in cell-free DNA of pregnant women
Deng et al. Placental transcriptome sequencing combined with bioinformatics predicts potential genes and circular RNAs associated with hemoglobin Bart's hydrops fetalis syndrome
WO2024076469A1 (en) Non-invasive methods of assessing transplant rejection in pregnant transplant recipients
Wambach et al. Basic Genetics and Epigenetics of Childhood Lung Disease
Qin et al. The chromosomal characteristics of spontaneous abortion and its potential associated copy number variants and genes
Santoro et al. Preimplantation Genetic Diagnosis Assisted Reproductive Technology
WO2020119626A1 (en) Method for non-invasive prenatal testing of fetus for genetic disease
Wu et al. Simultaneous detection of fetal aneuploidies and single gene diseases by a novel method of noninvasive prenatal testing: Targeted And Genome-wide simultaneous sequencing (TAGs-seq)

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18942502

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18942502

Country of ref document: EP

Kind code of ref document: A1