WO2014068075A1 - Non-invasive method for detecting a fetal chromosomal aneuploidy - Google Patents
Non-invasive method for detecting a fetal chromosomal aneuploidy Download PDFInfo
- Publication number
- WO2014068075A1 WO2014068075A1 PCT/EP2013/072848 EP2013072848W WO2014068075A1 WO 2014068075 A1 WO2014068075 A1 WO 2014068075A1 EP 2013072848 W EP2013072848 W EP 2013072848W WO 2014068075 A1 WO2014068075 A1 WO 2014068075A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- samples
- sample
- chromosome
- dna
- cell
- Prior art date
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
- C12Q1/6874—Methods for sequencing involving nucleic acid arrays, e.g. sequencing by hybridisation
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/156—Polymorphic or mutational markers
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/166—Oligonucleotides used as internal standards, controls or normalisation probes
Definitions
- the present invention relates to non-invasive prenatal diagnosis of fetal aneuploidy using cell-free DNA, particularly size-selected cell-free DNA. More particularly, the invention relates to methods of diagnosis of fetal aneuploidy characterized by the use of a set of external reference samples providing highly improved sensitivity and specificity. The invention also relates to methods for obtaining the reference samples and kits comprising the reference samples and / or a set of reference parameters for use in diagnosis of fetal aneuploidy.
- fetal chromosomal aneuploidies The detection of fetal chromosomal aneuploidies is an important procedure in prenatal diagnosis.
- chromosomal aneuploidies such as Down syndrome (also referred to as trisomy 21 ), trisomy 18, trisomy 13, and it is of utmost importance to predict as soon as possible whether a fetus will be affected by one of these anomalies.
- the risk that a fetus will be afflicted by an aneuploidy generally increases with the mother's age. Therefore, the increase in the average age of pregnant women in most developed countries further raises the need for powerful and safe diagnostic methods for detecting fetal chromosomal aneuploidies.
- fetal chromosomal aneuploidies are commonly performed through invasive procedures such as chorionic villus sampling, amniocentesis or cord blood sampling. These methods have in common that they rely on the collection of a fetal biological material (amniotic fluid, chorionic villi, cord blood) in order to obtain fetal cells, necessary for a karyotype analysis. These methods have been routinely practised for a long time. However, due to their invasiveness, they are not free of risk for the fetus and for the mother. The most frequent risk is the chance of miscarriage, close to 1 % in the case of amniocentesis. Other risks are associated with these invasive procedures, such as risks of infection, transmission of a disease from the mother to the fetus (for example AIDS or hepatitis B), amniotic fluid leakage, or premature birth.
- invasive procedures such as chorionic villus sampling, amniocentesis
- Non-invasive methods based on ultrasound scanning or on the detection of maternal serum biochemical markers have also been developed, but these methods are mainly restricted to the detection of epiphenomena, and have a limited clinical usefulness for detecting the core pathologies of chromosomal abnormalities.
- the discovery of cell-free fetal nucleic acids in maternal plasma in 1997 opened up new possibilities.
- the first strategies using these nucleic acids for assessing the fetal chromosomal dosage were based on the analysis of the allelic ratio of SNPs in target nucleic acids (placental mRNA and DNA molecules bearing a placental-specific DNA methylation signature) based on the assessment of the fetal chromosomal dosage by allelic ratio analysis of SNPs.
- the technique consists in measuring the total amount of a specific locus on a potentially aneuploid chromosome (for example chromosome 21 ) in maternal plasma and comparing this amount to that on a reference chromosome.
- Chiu ef al successfully implemented massively parallel sequencing in a method for diagnosing fetal trisomy 21 in maternal plasma (Chiu et al., 2008).
- Their method consists in performing a massively parallel sequencing on DNA extracted from the plasma samples.
- the sequences obtained from the MPGS step are then aligned to a reference sequence of the human genome, and the number of sequences which have been uniquely mapped to a location on the human genome, without mismatch, is counted for each chromosome, and compared to the total number of sequences obtained during the MPGS. This ratio provides an indication of the "chromosomal representation" of the DNA molecules found in a maternal plasma sample.
- the overrepresentation of chromosome 21 in a given sample, by comparison to a set of reference samples already known as euploid, is indicative of a fetal trisomy 21.
- Fan ef al successfully developed another method for the diagnosis of fetal trisomy 21 , using shotgun sequencing of cell-free plasma (Fan et al., 2008). After massively sequencing the cell-free DNA extracted from maternal plasma samples, Fan et al. mapped each sequence to the human genome. Each chromosome of the human genome was then divided into 50 kb bins, and, for each bin the number of sequence tags uniquely mapped to the human genome with at most one mismatch was counted. Fan et al. then calculated the median value of this count of sequence tag over each chromosome. Finally, Fan et al.
- the sensitivity of non-invasive prenatal diagnosis to detect fetal aneuploidy with whole genome next generation sequencing depends on the fetal DNA fraction in the maternal plasma, and on the sequencing depth. While the fetal DNA fraction depends on a series of largely inherent biological variables, the technical variables subject to experimental modification include i), the efficiency of the DNA extraction procedure, ii), the accuracy and throughput of NGS, namely the fraction of sequence tags with unique exact matches that can be aligned to the sequenced genome (termed “unique exact sequences without mismatches" or "UES”) and the total number of molecules sequenced iii), the nature of the bioinformatic algorithms, and iv), the control group of samples from pregnant women with normal fetal caryotypes that provides the reference set. The latter is of utmost importance, since individual molecules counting for each single chromosome is normalized with the median sequence tag density of all autosomes (Fan ef al 2008).
- the present invention implements a DNA extraction method not previously used for noninvasive prenatal diagnosis and having a fivefold greater yield than standard methods, together with a rigorously quality-controlled NGS work-flow with overall 25-30% more UESs than the published references, and average total count of UESs of more than 15- 10 6 , which is three times higher than the current standard.
- the final readout of the test fits the requirements of a robust clinical test, i.e. a 100% sensitivity and 100% specificity for the major fetal aneuploidies. This procedure for instance discriminates trisomy 21 or Down syndrome from normal male and female caryotypes with ⁇ 1.1 - 10 "5 prior probability of generating false results by chance.
- a first aspect of the present invention thus relates to a method for obtaining a set of reference samples and/or a set of reference parameters for the diagnosis of fetal aneuploidy from a maternal biological sample, preferably a blood sample, comprising: a step of extracting cell-free DNA from a set of biological samples, preferably blood samples, obtained from euploid pregnant women carrying a euploid fetus;
- each reference parameter is indicative of the number of unique exact sequences mapped to a chromosome or chromosomal region of interest for each sample;
- the extraction of cell-free DNA from each biological sample comprises:
- pre-sequencing DNA of each sample After the extraction step or after the selection step based on the size distribution of the DNA molecules, pre-sequencing DNA of each sample, mapping the obtained sequences to the human genome, and selecting a set of samples based on the amount of unique exact sequences mapped to the human genome;
- the method can comprise any one of these additional steps or features, any combination of two or three of these additional steps or features or the four additional steps and features.
- the method of the invention includes a step of size selection of the cell-free DNA, particularly immediately after the extraction step and prior to massive parallel sequencing.
- the invention relates to a method for obtaining a set of reference samples and/or a set of reference parameters for the diagnosis of fetal aneuploidy from a maternal biological sample, containing cell-free DNA, said method comprising:
- each reference parameter is indicative of the number of unique exact sequences mapped to a chromosome or chromosomal region of interest for each sample;
- a preferred example of such a method for obtaining a set of reference samples, including a size-selection step, comprises :
- step (b) processing the size-selected extracted DNA samples obtained in step (b) for the preparation of a sequencing library, for example by end repair of the DNA molecules and ligation of sequencing adaptors, optionally followed by amplification of the adaptor-ligated fragments;
- each reference parameter is indicative of the number of unique exact sequences mapped to a chromosome or chromosomal region of interest for each sample;
- the set of biological samples from which cell-free DNA is extracted further includes samples obtained from euploid pregnant women carrying an aneuploid fetus, In this way, the reference set provides reference values for both euploid and aneuploid samples.
- the method for obtaining a set of reference samples for the diagnosis of fetal aneuploidy from a maternal biological sample containing cell-free DNA comprises steps of pre-sequencing and mapping on a size-selected sub-set of samples prior to massive parallel sequencing.
- the method comprises:
- step (vi) selecting a second set of samples based on the amount of unique exact sequences mapped to the human genome in step (v);
- step (viii) mapping the sequences obtained in step (vii) to the human genome
- step (ix) selecting a set of reference samples based on the number of unique exact sequences mapped to the human genome in step (viii).
- step (iii) comprises selecting samples in which at least 90 wt%, preferably more than 95wt% of the DNA molecules have a size from 156 bp to 176 bp.
- step(iii) comprises selecting samples with at least 0.88 ng/ ⁇ DNA molecules with a size from 156 bp to 176 bp.
- step (iv) comprises sequencing from 1000 to 100000 sequences within each sample.
- step (vi) comprises selecting samples having at least 70 % of unique exact sequences with respect to the total number of sequences obtained in step (iv).
- step (vii) comprises sequencing at least 25 million sequences for each sample. In another embodiment, step (vii) comprises obtaining at least 25 million filter passing reads for each sample.
- step (ix) comprises selecting samples having more than 15 millions unique exact sequence reads.
- the present invention also relates to a method for diagnosing fetal aneuploidy from a maternal biological test sample, preferably a blood sample, comprising:
- step (c) mapping the sequences obtained in step (b) to the human genome
- each reference parameter is indicative of the number of unique exact sequences mapped to a chromosome or chromosomal region of interest for a sample of a set of reference samples, such as a set of euploid reference samples, for example as obtained according to the present invention
- a preferred method of diagnosis of fetal aneuploidy comprises the above method in which, after the extraction step, a step of size selection based on the size of the DNA molecules within said sample is carried out.
- the step of size selection substantially eliminates DNA molecules having a size greater than 200 bp from the test sample. This step is preferably conducted prior to the preparation of a sequencing library.
- This method of diagnosis is particularly preferred in conjunction with the use of reference samples which have also undergone a step of cell-free DNA size selection as described above. Indeed, according to the invention, it is preferred that the test sample be subject to the same methodology as the reference samples.
- the method for diagnosing fetal aneuploidy from a maternal biological test sample preferably a blood sample, comprises:
- step (d) massively parallel sequencing the cell-free DNA obtained in step (c);
- step (e) mapping the sequences obtained in step (d) to the human genome
- each reference parameter is indicative of the number of unique exact sequences mapped to a chromosome or chromosomal region of interest for a sample of a set of reference samples, such as a set of euploid reference samples, obtained according to the size-selection method of the present invention
- the extraction of cell-free DNA from the maternal biological test sample comprises:
- said test parameter is the unique sequence tag density of the chromosome or chromosomal region of interest normalized to the median unique exact sequence tag density of all autosomes.
- said test parameter is the percentage of unique exact sequences mapped to said chromosome or chromosomal region, with respect to the total number of unique exact sequences mapped to all chromosomes, or to the total number of unique exact sequences mapped to all autosomes.
- step (f) is made through calculation of the z- score of said test parameter with respect to the set of reference parameters.
- test parameter is the absolute exact sequence count for the chromosome or chromosomal region of interest or the average exact sequence count for the chromosome or chromosomal region of interest.
- step (f) is made through calculation of the probability that the unique exact sequence count for the chromosome or chromosomal region of interest, or the average exact sequence count for the chromosome or chromosomal region of interest, belongs to the normal distribution of the unique exact sequence counts for the chromosome of interest of the reference set.
- the chromosome of interest is chromosome 21 , chromosome 18, chromosome 16, chromosome 1 1 or chromosome 13.
- the chromosome of interest is chromosome 21
- the z-score of a trisomy 21 sample is at least 4.4 while the absolute value of the z-score of a sample euploid for chromosome 21 is less than 4.4.
- the present invention also relates to a method for extracting cell-free DNA from a maternal biological sample containing fetal and maternal cell-free DNA, comprising:
- precipitating DNA from said aqueous phase; optionally collecting precipitated DNA.
- the present invention also relates to the use of chloroform and phenol, preferably of a composition comprising chloroform and phenol for extracting cell-free DNA from a maternal biological sample containing fetal and maternal cell-free DNA.
- said use is in a method for obtaining a set of reference samples for the diagnosis of fetal aneuploidy from a maternal biological sample.
- said use is in a method for diagnosing fetal aneuploidy from a maternal biological test sample
- the present invention also relates to a set of reference samples obtainable according to the method of the present invention.
- the present invention also relates to a computer program product for implementing one or more steps of the method for obtaining a set of reference samples for the diagnosis of fetal aneuploidy from a maternal biological sample.
- the present invention also relates to a computer program product for implementing one or more steps of the method for diagnosing fetal aneuploidy from a maternal biological test sample, for example one or more of step (d) to (g).
- the present invention also relates to a kit comprising one or more of:
- compositions and/or a kit for extracting cell-free DNA for example including a composition comprising phenol and chloroform;
- a set of reference parameters obtainable according to the method according to the present invention, optionally included in a physical support, such as a computer readable media;
- a computer program product for implementing one or more steps of the method for obtaining a set of reference samples for the diagnosis of fetal aneuploidy from a maternal biological sample
- the kit for the diagnosis of fetal aneuploidy comprises :
- a set of reference samples obtainable according to the method of the invention, for example a set of samples having undergone size selection to enrich the sample for cell-free DNA having a size of ⁇ 200bp, and eliminating DNA molecules greater than 200 bp, and comprising not only samples from euploid pregnant women carrying a euploid fetus but also samples from euploid pregnant women carrying an aneuploid fetus
- each reference parameter is indicative of the number of unique exact sequences mapped to a chromosome or chromosomal region of interest for a sample of a reference set obtainable according to the method of the invention, optionally included in a physical support,
- kit may further comprise at least one of :
- compositions and/or a kit for extracting cell-free DNA including a composition comprising phenol and chloroform;
- a computer program product for implementing one or more steps of the method for obtaining a set of reference samples for the diagnosis of fetal aneuploidy from a maternal biological sample
- a computer program product for implementing one or more steps of the method for diagnosing fetal aneuploidy from a maternal biological test sample.
- Figure 1 size distribution of 3 maternal plasma samples as obtained by capillary electrophoresis.
- the DNA molecules in these samples are ligated to a 132 bp sequencing adaptor/barcode.
- Figure 2 total number of filter passing sequence reads obtained by NGS sequencing for 91 samples (euploid and aneuploid).
- the axis legend in ordinate reads "Cnt +1 e6", namely the sequence count in million.
- Figure 3 number of unique exact sequences for the same samples shown in Fig. 2.
- the axis legend in ordinate reads "Cnt +1 e6", namely the sequence count in million.
- the horizontal middle dotted line corresponds to the mean percentage of the reference sample.
- the horizontal full lines above and below the dotted line correspond to the discrimination threshold (mean ⁇ 4.4* SD). The trisomy 21 samples are positively discriminated.
- the horizontal middle dotted line corresponds to the mean percentage of the reference sample.
- the horizontal full lines above and below the dotted line correspond to the discrimination threshold (mean ⁇ 4.4* SD).
- the trisomy 18 samples are posititively discriminated.
- Figure 6 Scores of chromosome 1 using a second scoring algorithm.
- the discrimination thresholds correspond to a 1/100,000,000,000 confidence interval with respect to known healthy individuals (reference samples selected according to the method of the present invention).
- Figure 7 Scores of chromosome 19 score using a second scoring algorithm.
- the discrimination thresholds correspond to a 1/100,000,000,000 confidence interval with respect to known healthy individuals (reference samples selected according to the method of the present invention).
- Figure 8 Scores of chromosome 13 score using a second scoring algorithm.
- the discrimination thresholds correspond to a 1/100,000,000,000 confidence interval with respect to known healthy individuals (reference samples selected according to the method of the present invention).
- the trisomy 13 sample is positively discriminated.
- Figure 9 Scores of chromosome 18 using a second scoring algorithm.
- the discrimination thresholds correspond to a 1/100,000,000,000 confidence interval with respect to known healthy individuals (reference samples selected according to the method of the present invention).
- the trisomy 18 samples are positively discriminated.
- Figure 10 Scores of chromosome 21 using a second scoring algorithm.
- the discrimination thresholds correspond to a 1/100,000,000,000 confidence interval with respect to known healthy individuals (reference samples selected according to the method of the present invention).
- the trisomy 21 samples are positively discriminated.
- Figure 11 Scores of chromosome 22 using a second scoring algorithm.
- the discrimination thresholds correspond to a 1/100,000,000,000 confidence interval with respect to known healthy individuals (reference samples selected according to the method of the present invention).
- the trisomy 22 sample is positively discriminated.
- Figure 12 Scores of chromosome 4 using a second scoring algorithm.
- the discrimination thresholds correspond to a 1/100,000,000,000 confidence interval with respect to known healthy individuals (reference samples selected according to the method of the present invention).
- the 4p microdeletion (Wolf-Hirschhorn syndrome) sample is negatively discriminated.
- Figure 13 Scores of chromosome 5 using a second scoring algorithm.
- the discrimination thresholds correspond to a 1/100,000,000,000 confidence interval with respect to known healthy individuals (reference samples selected according to the method of the present invention).
- the 5p microdeletion/duplication (cri du chat syndrome) sample is positively discriminated.
- Figure 14 Sequence tag densities over chromosome 4 of a 4p microdeletion syndrome sample. A negative deviation from the mean density of the reference samples is apparent at the location of the 4p deletion.
- Figure 15 Sequence tag densities over chromosome 5 of a 5p microdeletion/duplication syndrome sample. Positive and negative deviations from the mean density of the reference samples are apparent at the location of the 5p microdeletion and duplication, respectively.
- the data shown on Figures 2 to 13 were all obtained with the same set of 91 samples, and are shown in the same order on each Figure. The ID of every 10 samples is indicated below the bars.
- the karyotype of specific samples is indicated inside or above the corresponding bar. These karyotypes are also listed in Table 5 (text identical to that of the Figures).
- Figure 16 Size selection : Bioanalyzer results before (panel A, left hand side) and after (panel B, right hand side) size selection of extracted cell-free DNA using AMPure beads for three test samples GWX-351 , -352 and -353. Peaks at 1 13.00 and 43.00 are size markers ([s] signifies time of migration in seconds, and can be translated directly to base pairs). In the size-selected samples (panel B), the large molecular weight peak at > 1000bp is eliminated by the process of purification, and the lower molecular weight peak corresponding to fetal cell-free DNA at 150-200 bp is retained.
- Figures 17-38 comparison of results of aneuploidy detection test for all autosomes using the size selection procedure of the invention (TPR, y axis) and the same procedure without size selection (GW, x-axis).
- 48 test samples were evaluated according to the protocol described in Example 3, and compared to six reference samples A1 , A2, N1 , N2, B1 , B2, with and without size selection, for all autosomes. Fetal enrichment by size selection clearly results in stronger signals for the detection of trisomies 13, 16, 18 and 21.
- FIG. 17 chromosome 1
- FIG. 20 chromosome 4
- FIG. 26 chromosome 10
- FIG. 28 chromosome 12
- FIG. 30 chromosome 14
- FIG. 32 chromosome 16
- FIG. 33 chromosome 17
- FIG. 34 chromosome 18
- Figure 36 chromosome 20
- FIG. 37 chromosome 21
- FIG. 38 chromosome 22
- Figure 39 results obtained for euploid sample designated GWX-1 137 compared to reference set A1.
- the inner, fine dotted lines represent a probability threshold of 1/1000 and the outer, thicker dotted lines represent a probability threshold of 1/10000 i.e. a value lying outside these thresholds has less than one chance in 1000, or less than one chance in 10000, respectively, of being normal :
- Figure 39a value derived from UEM of chromosome 13 of test sample GWX-1 137 (circled black spot) compared to values derived from UEMs of each sample of reference set A1 for chromosome 13 (grey spots), including validated aneuploid T13 samples.
- the test sample is within the interval of values representing normal karyotype.
- Figure 39b value derived from UEM of chromosome 16 of test sample GWX-1 137 (circled black spot) compared to values derived from UEMs of each sample of reference set A1 for chromosome 16 (grey spots), including validated T16 aneuploid samples.
- the test sample is within the interval of values representing normal karyotype.
- Figure 39c value derived from UEM of chromosome 18 of test sample GWX-1 137 (circled black spot) compared to values derived from UEMs of each sample of reference set A1 for chromosome 18 (grey spots), including validated T18 aneuploid samples.
- the test sample is within the interval of values representing normal karyotype.
- Figure 39d value derived from UEM of chromosome 21 of test sample GWX-1 137 (circled black spot) compared to values derived from UEMs of each sample of reference set A1 for chromosome 21 (grey spots), including validated T21 aneuploid samples.
- the test sample is within the interval of values representing normal karyotype.
- Figure 40 results obtained for aneuploid samples compared to reference set N1.
- the inner, fine dotted lines represent a probability threshold of 1/1000 and the outer, thicker dotted lines represent a probability threshold of 1/10000 i.e. a value outside these thresholds has less than one chance in 1000, or less than one chance in 10000, respectively, of being normal :
- Figure 40a value derived from UEM of chromosome 13 of test sample GWX-1 196 FDT8b (circled black spot) compared to values derived from UEMs of each sample of reference set N1 for chromosome 13 (grey spots), including validated aneuploid T13 samples.
- the test sample is outside the interval of values representing normal karyotype and has less than one chance in 10000 of being normal i.e. there is a probability of ⁇ 1 - 10 "5 that such an abnormal result be generated by chance. Trisomy 13 is suspected.
- Figure 40b value derived from UEM of chromosome 16 of test sample GWX-1420 FDT6b (circled black spot) compared to values derived from UEMs of each sample of reference set N1 for chromosome 16 (grey spots), including validated aneuploid T16 samples.
- the test sample is outside the interval of values representing normal karyotype and has less than one chance in 10000 of being normal, i.e. there is a probability of ⁇ 1 - 10 "5 that such an abnormal result be generated by chance. Trisomy 16 is suspected.
- Figure 40c value derived from UEM of chromosome 18 of test sample GWX-1421 FDT5b (circled black spot) compared to values derived from UEMs of each sample of reference set N1 for chromosome 18 (grey spots), including validated aneuploid T18 samples.
- the test sample is outside the interval of values representing normal karyotype and has less than one chance in 10000 of being normal i.e. there is a probability of ⁇ 1 - 10 "5 that such an abnormal result be generated by chance. Trisomy 18 is suspected.
- Figure 40d value derived from UEM of chromosome 21 of test sample GWX-1470 FDT4b (circled black spot) compared to values derived from UEMs of each sample of reference set N1 for chromosome 21 (grey spots), including validated aneuploid T21 samples.
- the test sample is outside the interval of values representing normal karyotype and has less than one chance in 10000 of being normal i.e. there is a probability of ⁇ 1 - 10 "5 that such an abnormal result be generated by chance. Trisomy 21 is suspected.
- Figure 41 Results of aneuploidy detection test of the invention on three trisomic samples using a semiconductor-based NGS platform for massive parallel sequencing as described in Example 5.
- the thick dark boxes represent the probabilities that the sample in question belongs to six different normal reference sets using semiconductor technology, wherein the six reference sets were generated also using semiconductor technology and an experimental protocol identical to that used for handling the test samples.
- a comparison is shown (thin bars) of results obtained with the same test samples but four reference sets generated by use of a sequencing by synthesis platform.
- NGS next-generation sequencing
- MGS massively parallel sequencing
- Single-molecule real-time sequencing Ion semiconductor sequencing
- pyrosequencing sequencing by synthesis
- sequencing by ligation sequencing by ligation.
- Cell-free DNA refers to a DNA molecule or a set of DNA molecules freely circulating in a biological sample, for example in blood.
- a synonym is "circulating DNA”.
- Cell-free DNA is extracellular, and this term is used as opposed to the intracellular DNA which can be found, for example, in the cell nucleus or mitochondria.
- aneuploidy refers to the variation of a quantitative amount of one chromosome from that of a diploid genome.
- the variation may be a gain, or a loss. It may involve a whole chromosome or a part thereof, for example only a chromosomal region.
- Aneuploidy can include monosomy (lack of one chromosome), partial monosomy (translocation or deletion of a portion of a chromosome), trisomy (gain of one extra chromosome), partial trisomy (gain and/or duplication of a portion of a chromosome).
- Euploidy is herein used to mean the contrary of aneuploidy, i.e. a euploid sample refers to a diploid genome, chromosome or chromosomal portion. For instance, an individual euploid for chromosome 21 has two copies of the chromosome 21.
- monosomy or partial monosomy examples include Wolf-Hirschhorn syndrome, cri du chat syndrome, 5q deletion syndrome, Williams syndrome, Jacobsen syndrome, Angelman syndrome, Prader-Willi syndrome, Miller-Dieker syndrome, Smith-Magenis syndrome, 18q deletion syndrome, DiGeorge syndrome.
- trisomy examples include trisomy 1 , trisomy 2, trisomy 3, trisomy 4, trisomy 5, trisomy 6, trisomy 7, trisomy 8 (Warkany syndrome), trisomy 9, trisomy 10, trisomy 1 1 , trisomy 12, trisomy 13 (Patau syndrome), trisomy 14, trisomy 15, trisomy 16, trisomy 17, trisomy 18 (Edwards syndrome), trisomy 19, trisomy 20, trisomy 21 (Down syndrome), trisomy 22.
- disorders involving a loss (deletion) of one or several chromosomal regions include 1 p36 deletion syndrome, TAR deletion, 1q21.1 deletion, 2q1 1.2 deletion, 2q 1 1.2q 13 deletion, 2q13 deletion, 2q37 deletion, 3q29 deletion, Wolf-Hirschhorn deletion, Sotos syndrome deletion, 6q16 deletion, Williams syndrome deletion , WBS-distal deletion, 8p23.1 deletion, 9q34 deletion, 10q23 deletion, Potocki-Shaffer syndrome, SHANK2 FGFs deletion, 12q14 deletion syndrome, 13q12 deletion, 15q1 1.2 deletion, Prader- Willi/Angelman syndrome, 15q13.3 deletion, 15q24 BP0-BP1 deletion, 15q24 BP0-BP1 deletion, 15q24 BP2-BP3 deletion, 15q25.2 deletion, Rubinstein-Taybi syndrome, 16p13.1 1 deletion, 16p1 1 .2p12.1 deletion, 16p12.1 deletion, 16p1 1.2 distal deletion
- disorders involving a gain (duplication) of one or several chromosomal regions include 1 p36 duplication, 1q21.1 duplication, 2q1 1.2 duplication, 2q1 1.2q13 duplication, 2q13 duplication, 2q37 duplication, 3q29 duplication, Wolf-Hirschhorn region duplication, 5q35 duplication, 6q 16 duplication, Williams syndrome duplication, WBS-distal duplication, 8p23.1 duplication, 9q34 duplication, 10q23 duplication, 1 1 p1 1.2 duplication, SHANK2 FGFs duplication, 12q14 duplication, 13q12 duplication, 15q1 1.2 duplication, Prader-Willi/Angelman region duplication, 15q13.3 duplication, 15q24 BP0-BP1 duplication, 15q24 BP2-BP3 duplication, 15q25.2 duplication, Rubinstein-Taybi region duplication, 16p13.1 1 duplication, 16p1 1 .2p12.1 duplication, 16
- the term "euploid sample” refers to a sample obtained from a euploid mother carrying a euploid fetus.
- the term "euploid” can be used with a relative sense, i.e. relating to a specific chromosome or chromosomal region of interest.
- the term "euploid” can be used with an absolute sense, i.e. relating to the whole genome. In this case, a euploid sample is not afflicted by any aneuploidy over its whole genome.
- aneuploid sample refers to a sample obtained from a euploid mother carrying an aneuploid fetus.
- aneuploid can be used with reference to a specific chromosome or chromosomal region of interest, or with reference to the whole genome.
- the term "unique exact sequence” refers to a sequence uniquely mapped to the human genome without any mismatch. In other words, the sequence has been aligned with a single location in the human genome, and has exactly the same sequence as said location, i.e. without any deletion, addition or mutation with respect to the sequence found at said location in the human genome.
- the unique exact sequence generally has a length of 20 to 100 bp, preferably 40 to 70 bp, still preferably 50 bp.
- the term “unique exact sequence” (UES) is used herein synonymously with the term “unique exact match” (UEM).
- a “maternal sample” such as in “maternal biological sample” is a sample obtained from a pregnant woman.
- a biological sample preferably refers to a biological sample containing cell-free DNA, still preferably refers to a whole blood, plasma, serum, urine or breast milk sample.
- a first aspect of the invention refers to the constitution of a set of euploid reference biological samples, or a set of both euploid and aneuploid reference samples, wherein each reference sample is carefully chosen so as to increase the statistical confidence of a fetal aneuploidy diagnosis method.
- the workflow of this selection process comprises several important selection steps:
- the method according to the present invention can comprise any of the three above- mentioned selection steps. However, in a preferred embodiment, all three selection steps are performed, thus increasing the quality of the final set of reference samples.
- the methods according to the present invention can generally be performed on any biological sample in which cell-free DNA, in particular fetal and maternal cell-free DNA can be found.
- the biological sample can especially be a bodily fluid such as blood, urine, breast milk.
- a blood sample is preferred.
- a blood sample refers to a whole-blood sample, a plasma sample or a serum sample.
- the biological samples can be collected at any time during the pregnancy, but are preferably collected from 7 weeks of pregnancy, for example between 7 weeks and 20 weeks of pregnancy, preferably from 7 to 14 weeks of pregnancy, still preferably from 7 to 10 weeks of pregnancy.
- a diagnosis performed as early as 7 weeks of pregnancy provides the advantage of keeping more medical options opened in cases where a decision to interrupt the pregnancy is taken (for example, an interruption through the use of a drug or a combination of drugs may be allowed depending on the national laws).
- the biological samples can be collected following an invasive prenatal procedure, such as chorionic villus sampling, amniocentesis, or cord blood sampling. They can be collected at any time following the invasive procedure, for example at least 10 min, 20 minutes or 30 minutes following the invasive procedure.
- the biological samples can also be collected at least one or more days following the invasive procedure, for example from two to five days following the invasive procedure.
- the biological samples can be collected from women not yet having experienced an invasive prenatal procedure. This situation is preferable for the biological samples to be diagnosed, as an advantage of the method is precisely to avoid any invasive procedure.
- the aneuploidy status of the fetus in samples intended to form the reference set can be diagnosed independently from the method according to the present invention. This may be useful for ascertaining that the samples used for forming the reference set of samples are indeed euploid samples, or in other words, samples obtained from euploid mothers carrying a euploid fetus.
- the euploid samples used for obtaining the reference set of samples are preferably euploid with reference to the "absolute" definition of the term, as given above, i.e. they are euploid over the whole genome, and not only for a specific chromosome of interest.
- the samples destined to constitute the reference samples may further include samples from euploid mothers carrying an aneuploid fetus, for example a fetus having trisomy 21 , 18 or 13.
- the aneuploidy status of the fetus in such samples can be diagnosed independently from the method according to the present invention.
- a method for assessing the aneuploidy status of the fetus can comprise collecting fetal cell material from the mother by an invasive prenatal diagnosis procedure, such as amniocentesis, chorionic villus sampling or cord blood sampling.
- the aneuploidy status of the fetus can then be assessed by any of following techniques: karyotyping, Fluorescence In Situ Hybridization (FISH), Quantitative Polymerase Chain Reaction (PCR) of Short Tandem Repeats, Quantitative Fluorescence PCR (QF-PCR), Quantitative Real-time PCR (RT-PCR) dosage analysis, Quantitative Mass Spectrometry of Single Nucleotide Polymorphisms, and Comparative Genomic Hybridization (CGH).
- FISH Fluorescence In Situ Hybridization
- PCR Quantitative Polymerase Chain Reaction
- QF-PCR Quantitative Fluorescence PCR
- RT-PCR Quantitative Real-time PCR dosage analysis
- CGH Comparative Genomic Hybridization
- the aneuploidy status of the mother is already known, because most aneuploidy-related diseases are symptomatic. However, if needed, the aneuploidy status of the mother can also be assessed by using cell material obtained from the mother. Any of the aforementioned techniques can be employed.
- An important parameter of the method according to the invention is an efficient DNA extraction from the maternal biological samples.
- Cell-free DNA extraction is preferably performed via a protocol of phenol-chloroform extraction.
- the extraction protocol typically comprises:
- the present invention encompasses the use of phenol/chloroform for extracting cell-free DNA from a biological sample, preferably from a blood sample such as a plasma sample.
- the method is particularly appreciable for extracting mixed fetal and maternal cell-free DNA from a maternal biological sample, as it yields a more robust fetal DNA signal than the existing methods.
- phenol/chloroform refers to a mixture of phenol and chloroform, i.e. to a composition comprising phenol and chloroform.
- Said composition is preferably an aqueous solution and preferably also comprises isoamyl alcohol.
- the pH of the composition is preferably from 7 to 9, still preferably from 7.8 to 8.2.
- a preferred composition is a 25:24: 1 mixture of phenol:chloroform:isoamyl alcohol at a pH from 7.8 to 8.2.
- the composition may comprise one or more additives, such as one or more antioxidants and/or stabilizers.
- the extraction method comprises a step of pre-treating the biological sample with one or more proteases, such as proteinase K.
- the extraction of the aqueous phase may comprise centrifuging the biological sample mixed with chloroform and phenol, and collecting the aqueous phase.
- the centrifugation provides a separation of the mixed biological sample into a lower organic phase, comprising mainly phenol, proteins or protein debris, and an upper aqueous phase comprising nucleic acids.
- the precipitation of cell-free DNA from the aqueous phase comprises the steps of:
- the precipitation agent is preferably selected from glycogen, a lower alcohol such as isopropanol or ethanol, or mixtures thereof.
- the centrifugation pellet containing DNA can then be washed one or more time, for example with ethanol and/or ether. Finally, DNA can be resuspended in a suspension buffer, for example a Tris buffer.
- the phenol-choloroform extraction protocol yields a fivefold higher amount of DNA than the column methods classically employed in the context of fetal aneuploidy detection using massively parallel sequencing (Chiu et al., 2008, Fan et al., 2008). It also yields a higher fraction of DNA at a size of 156-176 bp, i.e. maternal and fetal cell-free DNA. This protocol is thus an important tool for increasing the number of sequence reads originating from fetal DNA.
- the samples containing extracted DNA are optionally processed for preparing the sequencing library. Such processing can take place immediately after the extraction of cell-free DNA or preferably, it can take place after a step of size-selection of the extracted cell-free DNA.
- the library preparation can include one or more amplification steps, a ligation with one or more sequencing adaptors, and/or barcoding the DNA molecules.
- a typical workflow of the sequencing library preparation includes a step of ligation of one or more adaptor sequences, optionally linked to one or more barcode sequences, to the DNA molecules inside the sample, followed by an amplification of the adaptor/barcode-ligated DNA molecules.
- Sequencing adaptors are short nucleotide sequences which are commonly used in modern sequencing technologies.
- the adaptors are used for anchoring the DNA molecules to be sequenced to a solid surface, for example in a flow cell. These adaptors are thus designed so as to hybridize to target oligonucleotides tethered to the solid surface.
- the ligation of adaptors is preferably performed by repairing the ends of the DNA molecules, i.e. suppressing or filling out the overhangs of the extracted DNA molecules, for example through the action of one or more exonucleases and/or polymerases, thus yielding blunt- ended DNA molecules.
- An overhang of one or more 'A' bases may then be optionally added at the 3' end of the blunt-ended DNA molecules.
- the adaptors containing an overhang of one or more T bases at their 3' end are then added and are ligated to the overhang of one or more 'A' bases at the 3'end of the DNA molecules.
- Adaptors can also be blunt ligated.
- the DNA fragments within the sample can also be barcoded.
- Barcoding refers to the ligation of a sample-specific tag to the DNA molecules of a sample. Barcoding allows the sequencing of several samples in a single sequencing run, which saves time and resources.
- the DNA fragments inside the sample can also be subjected to one or more amplification cycles, for example by PCR. From 10 to 25 amplification cycles, for example 18 amplification cycles may be run.
- the amplification is preferably carried out after the ligation of an adaptor sequence to the DNA molecules.
- the PCR amplification preferably uses primers against the adaptor sequence, thus enriching the library into adaptor-ligated fragments.
- the size distribution of the DNA molecules within each sample can be analyzed. This analysis is preferably performed by capillary electrophoresis. It is for example carried out by using a commercial lab-on-a-chip capillary electrophoresis system.
- the size distribution analysis can be conducted before or after the preparation of the sequencing library. However, it is preferably performed before the preparation of the sequencing library.
- the present inventors have established that for equal total quantities of input DNA there was an unexpected variability in the number of total raw reads after NGS.
- Capillary electrophoresis of raw extracts revealed that one possible explanation for this could be the presence of a high molecular weight (MW) DNA species (> 1000 bp) that decreased the relative amount of the small MW fraction containing the fetal DNA of interest available for NGS.
- MW molecular weight
- Experiments carried out to remove the high molecular weight species immediately after cell-free DNA extraction and before library preparation have confirmed that size selection of the small MW species ( ⁇ 200 bp, particularly 150-200bp) and exclusion of the high MW species largely removes the variability in the number of raw reads obtained after NGS (see Fig. 16).
- This technical step also improves the robustness and resolution of the assay, in addition to its economic interest arising from the fact that only size selected molecules are processed for sequencing library preparation and massively sequenced.
- this procedure of size selection increases the fetal fraction, i.e. the proportion of cell-free circulating fetal DNA among the total amount of circulating cell-free DNA, making its use critical for the robustness of the assay in cases with low fetal fraction.
- the increase in fetal fraction brought about by size selection prior to library preparation has the effect of decreasing the number of reads required to reliably detect trisomies.
- the step of removal of cell-free DNA molecules having a size of more than 200 bp can be carried out by any technique known in the art.
- the use of magnetic beads is particularly preferred, for example AMPure XP® beads as described in the examples below. Gel electrophoresis may also be used.
- the present inventors have demonstrated that the beneficial effects of the size selection according to the invention is achieved irrespective of the specific technology used for the massive parallel sequencing step. For example, it is achieved using sequencing-by-synthesis methods as well as semiconductor-based next generation sequence technology. It has also been demonstrated that whilst it is optimal to use the same massive parallel sequencing platform for the test samples and for the reference sets, reliable results are nevertheless achieved when different platforms are applied for the samples and for the reference sets.
- the inventors of the present application have found that the size distribution of cell-free DNA processed for preparation of the sequencing library i.e. adaptor-ligated cell- free DNA had a size peak at about 298 bp ( Figure 1 ). After subtraction of the adaptor/barcode sequence size of 132 bp, the peak size corresponds to 166 bp. This value is in agreement with the data previously provided by Fan et al., 2008 and also with the hypothesis of a mainly mononucleosomal origin of cell-free DNA.
- the size distribution of DNA within the samples can be used as a criterion in the process of composing an appropriate set of reference samples for the diagnosis of fetal aneuploidy.
- This criterion allows the selection of samples with a high level of cell-free DNA and the elimination of the samples with a low level of cell-free DNA.
- a selection criterion may consist in the occurrence of a size peak at about 166 bp.
- the term “about 166 bp” can have the meaning of “from 151 to 181 bp”, or “from 156 to 176 bp”, or “from 161 to 171 bp” or “from 163 to 169 bp” or “from 165 to 167 bp”.
- this term can have the meaning of "at exactly 166 bp".
- step (iii) comprises selecting the samples wherein at least 80 wt%, still preferably at least 90 wt%, preferably at least 95 wt%, still preferably at least 97wt% of the DNA molecules inside the sample have a size of about 166 bp, preferably from 156 to 176 bp.
- step (iii) comprises selecting samples wherein the concentration of DNA molecules with a size of about 166 bp, preferably from 156 to 176 bp, is of at least 0.88 ng/ ⁇ , preferably at least 0.90 ng/ ⁇ , still preferably at least 0.95 ng/ ⁇ or at least 1 .00 ng/ ⁇ or at least 1 .05 ng/ ⁇ or at least 1.10 ng/ ⁇ .
- step (iii) comprises selecting samples wherein the quantity of DNA molecules with a size of about 166 bp, preferably from 156 to 176 bp, is of at least 13 ng, preferably at least 13.5 ng, still preferably at least 14.25 ng or at least 15 ng or at least 15.75 ng or at least 16.5 ng.
- the mean concentration of extracted DNA molecules with a size of about 166 bp, preferably from 156 to 176 bp, among the set of samples selected at step (iii) is of at least 0.88 ng/ ⁇ , preferably at least 0.90 ng/ ⁇ , still preferably at least 0.95 ng/ ⁇ or at least 1.00 ng/ ⁇ or at least 1 .05 ng/ ⁇ or at least 1.10 ng/ ⁇ .
- the mean quantity of DNA molecules with a size of about 166 bp, preferably from 156 to 176 bp, among the set of samples selected at step (iii) is of at least 13 ng, preferably at least 13.5 ng, still preferably at least 14.25 ng or at least 15 ng or at least 15.75 ng or at least 16.5 ng.
- the concentration and/or quantity can be measured on DNA libraries prepared for the sequencing step, for example it can be measured on adaptor/barcode-ligated DNA molecules, for instance on DNA molecules ligated with a 132 bp adaptor/barcode.
- the DNA molecules have been submitted to 18 amplification cycles after the ligation of the adaptor/barcode.
- the concentration and/or quantity is measured on DNA libraries prepared using the lllumina's ChIP sequencing protocol by using 20 ng DNA as input material. The concentration and / or quantity can also be measured prior to preparation of DNA libraries.
- step (iii) may also comprise selecting samples whose DNA size distribution reveals a peak or shoulder between 133 and 143 bp.
- the size values indicated above correspond to non-adaptor or barcode ligated DNA molecules, i.e. to the DNA molecules as found in maternal blood. If needed, these values may be adapted for taking into account the presence of an adaptor, barcode, or of any sequence tag at one or both ends of the DNA molecules.
- a peak refers to a local maximum in the curve representing the size distribution of DNA molecules inside a sample.
- a shoulder refers to an inflection point in this curve.
- pre-sequencing refers to a small-scale sequencing which can be optionally performed prior to a larger scale next-generation sequencing. Therefore, contrary to the methods of the prior art, this variant of the invention is characterized by two sequencing steps successively performed on each sample of the reference set. Accordingly, “pre-sequencing” can also be referred as “first sequencing”. In a similar way, “massively parallel sequencing” can be referred as “second sequencing”. The inventors have postulated that the proportion of unique exact sequences within a small library of sequences would be representative of the proportion of unique exact sequences in the full scale library obtained by next-generation sequencing.
- the present invention enables time and resources to be saved while eliminating samples with an insufficient quality, thereby yielding a reference set of increased quality.
- the pre-sequencing step comprises sequencing from 1000 to 100,000 sequences per sample, still preferably from 5000 to 50000 sequences per sample.
- the size of each sequence read is preferably from 20 bp to 100 bp, still preferably from 40 to 70 bp, for example of 50 bp. These sizes, in particular 50 bp, are a good compromise between too short reads that are more likely to map to more than one location in the human genome, and too long reads which raise the chance to have SNPs inside the sequence. If a step of size selection as described above is carried out after cell-free DNA extraction and prior to library preparation, a step of pre-sequencing is not normally necessary.
- the alignment of the sequences over the human genome can be carried out using any standard alignment software, for example as described in Chiu et al., 2008 or Fan et al., 2008.
- the human genome sequence used for the mapping is preferably a reference sequence, such as the sequences established by the NBCI (http://www.ncbi.nlm.nih.gov/assembly/2758/) or the UCSC
- the reference sequence is preferably February 2009 (hg19, GRCh37), also referred as hg19.
- the method according the invention comprises two sequencing steps (as an optional variant), it also comprises two mapping steps: the mapping of the sequences obtained at the pre-sequencing step and the mapping of the sequences obtained at the massively parallel sequencing step.
- the two mapping steps are preferably performed in the same way, i.e. by using the same human genome sequence and/or the same alignment software.
- Both mapping steps can be done over the whole sequence of the human genome, for example over the whole hg 19 reference sequence.
- the alignment can be done over only a portion of the human genome, or in other words over a partial sequence of the human genome.
- the partial sequence of the human genome used in score calculation is obtained by masking predefined regions of the human genome.
- the regions to be masked can be chosen on the basis of a number of different parameters, including: a lower quality of sequencing of a region (these regions are also known as "non-well annotated regions"); the occurrence of a high number of repeats within a region; the duplication of a region within the human genome; a region with a complex architecture.
- the masked regions are thus preferably selected among the non-well-annotated regions of the human genome, the high copy repeat regions of the human genome, the duplicated regions of the human genome, or the regions with a complex architecture.
- a region with a lower quality of sequencing or a "non-well annotated" region is for instance a region with scaffold N50 of less than 46,395,641 and/or a contig N50 of less than 38,508,932, and/or with total assembly gap length of more than 239,845,127/3, 137, 144,693, and/or with a genome coverage of at least 90%, preferably at least 95% (Yandell et al., 2012).
- Examples of non-well annotated regions are subtelomeric regions and pericentromeric regions.
- Genome assemblies are composed of scaffolds and contigs.
- Contigs are contiguous consensus sequences that are derived from collections of overlapping reads. Scaffolds are ordered and orientated sets of contigs that are linked to one another by mate pairs of sequencing reads.
- a contig N50 is calculated by first ordering every contig by length from longest to shortest. Next, starting from the longest contig, the lengths of each contig are summed, until this running sum equals one-half of the total length of all contigs in the assembly.
- the contig N50 of the assembly is the length of the shortest contig in this list.
- the scaffold N50 is calculated in the same fashion but uses scaffolds rather than contigs. Scaffolds and contigs that comprise only a single read or read pair— often termed 'singletons'— may be excluded from these calculations, as may be contigs and scaffolds that are shorter than -800 bp.
- Genome coverage refers to the percentage of the genome that is contained in the assembly based on size estimates; these are usually based on cytological techniques.
- a region with a complex architecture is for instance a highly variant region, for example a region with a high number of CNVs (copy number variants), and/or SNVs (single nucleotide variants) (Frazer et al., 2009).
- An estimate of 5% of the human genome is for instance copy number variable.
- Optional step (vi) of the method according to the invention consists in selecting a set of samples based on the quantity of unique exact sequences obtained for said samples.
- Step (vi) can thus consist in selecting samples having more than a minimal quantity of unique exact sequences, or, in other terms, in eliminating samples having less than a minimal quantity of unique exact sequences.
- the term "quantity" may refer to the absolute number of unique exact sequences or to a ratio. The ratio can be calculated with respect to the total number of sequence reads obtained at the presequencing step. However, the ratio is preferably calculated with respect to the number of filter-passing reads.
- Filtering may consist in eliminating the sequences mapped at least partially to an adaptor sequence.
- the number of filter passing reads is the total number of sequence reads minus the number of sequence reads mapped at least partially to an adaptor sequence.
- step (v) comprises selecting samples with at least 70% unique exact sequences, preferably at least 72% unique exact sequences, still preferably at least 75% or still preferably at least 77% or still preferably at least 80% of unique exact sequences with respect to the total number of sequence reads obtained at the presequencing step for said sample.
- a step of size selection as described above is carried out after cell-free DNA extraction and prior to library preparation, a step of pre-sequencing followed by selecting a set of samples based on the quantity of unique exact sequences obtained for said samples is not normally necessary.
- the massively parallel sequencing platform may for instance consist in a "sequencing-by- synthesis” system, such as the lllumina's HiSeq2000 platform. This platform uses a reversible terminator-based method that detects single bases as they are incorporated into growing DNA strands.
- the sequencing workflow in a "sequencing-by-synthesis" system can be summarized in 3 phases:
- this step has already been described and, as mentioned above, it can be carried out at an early phase of the whole process of selecting euploid appropriate reference samples, or of the diagnosis process. It is for example performed immediately after DNA extraction, or immediately after size selection of the extracted cell-free DNA. During this phase, DNA molecules are ligated with adaptors at both ends. In addition, they contain primer sites that are used to amplify the library by PCR and to sequence it.
- the cluster generation during this phase, DNA molecules are hybridized to oligonucleotide probes tethered on a solid surface inside a flow cell. Each DNA molecule is amplified by solid-phase bridge-amplification, forming a cluster of molecules with identical sequences.
- the "sequencing-by-synthesis" phase A mixture of the four nucleotides, each containing a fluorescently-labeled terminator, is introduced into the flow-cell.
- the fluorescently-labeled terminator is imaged as each dNTP is incorporated into the growing DNA strand, and then cleaved to allow incorporation of the next base. Since all four reversible terminator-bound dNTPs are present during each sequencing cycle, natural competition minimizes incorporation bias. Base calls are made directly from intensity signal measurements during each cycle.
- the massively parallel sequencing platform may for instance consist in a semiconductor-based next generation sequence technology.
- the massively parallel sequencing step consists in sequencing at least 10 millions, preferably at least 20 millions still preferably at least 30 million sequences per sample.
- mapping step for example step (viii)
- a mean number of at least 12 million, preferably at least 15 million, still preferably at least 20 million unique exact sequences per sample is obtained in the mapping step (for example step (viii)).
- the total number of sequences and/or the number of unique exact sequences obtained in the massively parallel sequencing step can also be used as a quality control criterion, in the process of selecting the samples forming the set of reference samples.
- the method for obtaining a set of euploid reference samples according to the invention, or a set of euploid and aneuploid reference samples comprises selecting samples with a total number of at least 10 million, preferably at least 20 million, still preferably at least 30 million sequences per sample.
- the method for obtaining a set of euploid reference samples according to the invention, or a set of euploid and aneuploid reference samples comprises selecting samples with at least 6 million, preferably at least 8 million, still preferably at least 10 million, or at least 12 million or at least 14 million or at least 15 million unique exact sequences. 10 million to 12.5 million unique exact sequences in the euploid and aneuploid reference samples is particularly preferred.
- the set of reference samples has a mean total number of sequences obtained in the massively parallel sequencing step of at least 20 million, preferably at least 25 million, still preferably at least 27 million.
- total number of sequences may refer to the total number of non-filtered reads obtained at the sequencing step, or to the total number of filter-passing reads, in cases where the sequencing platform includes a filtering. In such cases, the term “total number of sequences” preferably refers to the total number of filter-passing reads.
- the set of reference samples has a mean number of unique exact sequences of at least 12 million, preferably at least 15 million, still preferably at least 20 million.
- a second major aspect of the present invention consists in a method for diagnosing fetal aneuploidy from a maternal biological sample, characterized in that the sample to be diagnosed is compared to the reference set of samples obtained with the method for obtaining a set of reference samples as described above.
- the workflow of the diagnosis method does not necessarily comprise steps (ii), (iii), (iv), (v) and (vi), namely the selection based on the size distribution and the selection based on the pre-sequencing results.
- steps (ii), (iii), (iv), (v) and (vi) namely the selection based on the size distribution and the selection based on the pre-sequencing results.
- this does not mean that a size distribution analysis / selection or a pre-sequencing may not be performed on a sample to be diagnosed.,.
- a step of size selection eliminating DNA molecules having a size of more than 200 bp be performed after extraction of the cell-free DNA from the test sample and before massive parallel sequencing, more particularly before library preparation.
- the score calculated for a given chromosome or chromosomal region is a parameter indicative of the count of unigue exact seguences (UES or UEM) mapped to said chromosome or chromosomal region, for a given sample.
- the score can be calculated over the whole human genome seguence, or over a partial seguence of the human genome or, in other terms a seguence from which some regions have been masked.
- the partial seguence of the human genome used in score calculation is obtained by masking predefined regions of the human genome.
- a number of parameters can be considered for defining the regions to be masked, including a lower guality of seguencing of a region (also defined, in other terms as a non-well annotated region), the occurrence of a high number of repeat within a region, the duplication of a region within the human genome, a region with a complex architecture.
- the masked regions are thus preferably selected among the non-well-annotated regions of the human genome, the high copy repeat regions of the human genome, the duplicated regions of the human genome or the regions with a complex architecture.
- the score for each chromosome can be calculated by dividing each chromosome into bins of a predefined length, for example 50 kb bins. The division can be carried out on a whole human genome sequence or on a partial human genome sequence, i.e. on a human genome sequence in which some regions have been masked, as explained above.
- the number of unique exact sequences (UES) mapped to a given bin is then counted, thus yielding a UES count for each bin.
- the count of UES for each bin is bias-corrected, i.e. it is corrected to take into account the bias related to the sequencing process.
- a known bias is caused by the variation in GC distribution across the genome. As noted by Fan et al., 2010, the distribution of sequence tags across the genome is not uniform. In fact, there exists a positive correlation between the GC content of a chromosomal region, and the number of sequences mapped to said region, which explains why sequences originating from GC-rich regions are more represented within the sequence library than sequences originating from GC-poor regions.
- This bias can be compensated by weighting the count of UESs in each bin, for example with a weight inversely proportional to the GC content in said bin.
- the median UES count value for all bins over a chromosome or chromosomal region of interest is then calculated. This value is representative of the count of UESs across the chromosome or chromosomal region, and is referred as the sequence tag density of a chromosome or chromosomal region. This median value can be calculated by using non- weighted UES counts, or by weighting each UES count with a bias-correction factor, as indicated above. In another embodiment, other values than the median value are selected for representing the UES count across a chromosome: for instance the sum of the UES counts for all bins within a chromosome.
- sequence tag density of the chromosome or chromosomal region of interest can be normalized to the median sequence tag density for all chromosomes. Alternatively, it can be normalized to the median sequence tag density for all autosomes. Still alternatively, it can be normalized to the median sequence tag density for a predefined set of chromosomes. As used herein "set of chromosomes" refers to any combination of chromosomes selected from chromosome 1 to chromosome 22 and chromosome X and Y. Still alternatively, it can be normalized to the median sequence tag density for a predefined set of chromosomal regions. Still alternatively, it can be normalized to the sum of sequence tag densities for all chromosomes, or for all autosomes, or for a predefined set of chromosomes, or for a predefined set of chromosomal regions.
- the normalized sequence tag density of a chromosome or chromosomal region can be used as a parameter indicative of the number of unique exact sequences mapped to a chromosome or chromosomal region of interest for a given sample.
- This parameter can however be represented by other values:
- sequence tag density of a chromosome or chromosomal region of interest the sequence tag density of a chromosome or chromosomal region of interest; the number of UESs mapped to said chromosome or chromosomal region of interest;
- the chromosome of interest is chromosome 21 and/or the fetal aneuploidy is trisomy 21.
- the chromosome of interest is chromosome 18 and/or the fetal aneuploidy is trisomy 18.
- the chromosome of interest is chromosome 13 and/or the fetal aneuploidy is trisomy 13.
- the chromosome of interest is chromosome 22 and/or the fetal aneuploidy is trisomy 22.
- the chromosome of interest is chromosome 4 and/or the fetal aneuploidy is Wolf-Hirschhorn syndrome.
- the chromosomal region of interest is a portion of chromosome 4 comprising the deleted region in Wolf-Hirschhorn syndrome.
- the chromosome of interest is chromosome 5 and/or the fetal aneuploidy is cri du chat syndrome.
- the chromosomal region of interest is a portion of chromosome 5 comprising the deleted and/or duplicated region in cri du chat syndrome and/or the fetal aneuploidy is cri du chat syndrome.
- the chromosome of interest is chromosome 19.
- the chromosome of interest is chromosome 1. Any combination of the aforementioned chromosomes or chromosomal region can also be chosen as a specific embodiment.
- the chromosome of interest is chromosome 21 , chromosome 18, or chromosome 13, still preferably, the chromosome of interest is chromosome 21 or chromosome 18.
- test parameter selected as indicative of the number of unique exact sequences mapped to the chromosome or chromosomal region of interest for the test sample
- same parameter is calculated for each sample of the reference set of samples, thus yielding the set of reference parameters
- standard parameter means that the parameter is calculated by using the same method as that used for the test sample, but applied to the sequencing data obtained on the reference sample, instead of those obtained on the test sample).
- test parameter obtained for the test sample is then compared to the set of reference parameters obtained for the reference samples.
- Pt es t is the test parameter indicative of the number of unique exact sequences mapped to the chromosome or chromosomal region of interest, calculated from the test sample.
- Mean (P ref ) and SD(P ref ) are respectively the mean and the standard deviation of the set of reference parameters indicative of the number of unique exact sequences mapped to the chromosome or chromosomal region of interest, calculated from the set of reference samples.
- the absolute value of the z-score of a sample aneuploid for the chromosome or chromosomal region of interest is above 4, still preferably above 4.4.
- the absolute value of the z-score of a sample euploid for the chromosome or chromosomal region of interest is below 4.4, still preferably below 4.
- the absolute value of the z-score of each sample of the reference set of samples is below 4.4, still preferably below 4.
- the selection of an appropriate set of reference samples allows discrimination of trisomy 21 and trisomy 18 samples from euploid samples, with a z-score of 4.4 as cutoff value.
- This z- score corresponds to a prior probability of ⁇ 1.1 - 10 "5 of generating false results by chance, which is much lower than the corresponding data in prior art.
- the comparison can be done using a probability-based calculation, preferably using a reference set which includes both euploid and aneuploid (trisomic) samples.
- the process again comprises two steps. The first involves the alignment of the sequences obtained from the test sample on a reference human genome, and the second involves comparing the results obtained for each chromosome of the test sample with the results obtained for the corresponding chromosome of samples of a reference set:
- the values obtained from the UES count for a given chromosome in a set of samples having validated trisomy are represented on a graph together with the values obtained from the UES count for the same given chromosome in a set of normal reference samples ;
- the value obtained from the UES count for a given chromosome of the test sample is also indicated on the corresponding reference graph which serves as the basis for the clinical evaluation.
- a plurality of reference sets for example at least four and preferably six reference sets (such as reference sets N 1 , N, B1 , B2, A1 and A2 illustrated in Figures 17 to 38) each comprising at least 50 and preferably at least 75 reference samples, are consistently used to establish the diagnosis, thereby providing confirmation of the diagnosis. Examples
- Blood samples were collected from 100 pregnant women in the context of a prospective clinical study with pending approval by the local ethical committee.
- the gestational age of the mothers was 14.63 ⁇ 4.00 weeks.
- Plasma samples Two 7.5ml tubes (BD Vacutainer blood collection tubes, Beckton Dickinson, NJ USA 07417, or BCT-tubes, Streck, Inc., Omaha, NE 68128) were collected 30 minutes after invasive prenatal diagnosis. Plasma was purified as described (Chiu ef al 2008; Fan ef al 2008), and frozen immediately at -20°C. 2ml plasma aliquots were used for cell-free DNA extraction with the nucleospin plasma Kit (Macherely Nagel, according to the manufacturer's instructions as described below), or with a phenol-chloroform method, which was as follows.
- the columns were then washed a first time with 500 ⁇ Buffer WB and centrifugated at 1 1000g (9600 rpm) during 30 seconds, and a second time with 250 ⁇ Buffer WB and centrifugated at 1 1000g (9600 rpm) during 3 minutes. Finally, 20 ⁇ elution buffer were added to the columns, which were then centrifugated at 1 1000g (9600 rpm) during 30 seconds. The resulting DNA extracts were pooled in a single 2ml_ tube.
- the supernatant was decanted, and the remaining volume added, and the tube centrifuged under the same conditions.
- the DNA pellet was first washed with 600 ⁇ of ethanol 70%, followed by 600 ⁇ of ether, and suspended in 20 ⁇ of 0.5 mM Tris pH 8.2.
- DNA concentration was measured with PicoGreen, and qPCR assays for TH01 and SRY were performed on samples corresponding to a male fetus.
- the principle of these assays is to quantify:
- Male DNA i.e. fetal DNA, by amplifying a 137 bp sequence of the SRY gene, present on human chromosome Y;
- Total human DNA i.e. fetal + maternal DNA, by amplifying a 162 bp sequence comprising the TH01 STR (short tandem repeat), present on human chromosome 1 1.
- the mouse gene GALT was used as an internal control. Briefly, for each sample a master mix was prepared containing 12.5 ⁇ Absolute QPCR Mix (AB-1 133/A, ABGene), 2.5 ⁇ of a mixture of primers/probes SRY/TH01/GALT and 0.4 ⁇ of AmpliTag Gold 5 ⁇ / ⁇ (N8080249, Applied Biosystems). 25 ⁇ PCR mix were prepared, each containing: 5 ⁇ of DNA sample to be amplified in H 2 0, 5 ⁇ Std Gait 10 copies/ ⁇ (standard sequence of GALT), 15 ⁇ master mix.
- Each series included a standard (10 ⁇ standard, 200 cell/10 ⁇ ). 50 RT-PCR cycles (95°C/15";60°C/60") were run on a RotorGene qPCR apparatus (Qiagen), with an acquisition at 60°C on the channels SRY (green), TH01 (Yellow), GALT (Red).
- the value in "cells/ ⁇ " was calculated with reference to the standard, and refers to an equivalency of the quantity of genomic DNA in terms of cell number, based on the assumption of 6 pg genomic DNA/cell.
- the ChIP seguencing protocol (lllumina) was performed according to instructions. 20 ng of cell-free DNA was used for library construction. 1 ⁇ of each library, corresponding to 1/15 of the total library volume, was run on a 2100 Bioanalyzer (Agilent) for size distribution analysis and determination of peak concentration. Every fifth library was pre-seguenced on a MiSeg (lllumina). The libraries were seguenced on a HiSeg 2000 (lllumina), with single reads of 50 bp, and 50+7 cycles, thus resulting in 30- 10 6 reads per sample, using the TruSeg SBS v3 kit according to instructions (lllumina).
- the size determination of cell-free DNA shows that after subtraction of the adaptor/barcode seguence size, the peak size is almost perfectly within the predicted size of 166 bp (Fig. 1 ; Lo ei al 2010).
- the peak size distribution was uniform for all 91 samples analyzed, with 1-2 bp variations.
- the smaller sized shoulder visible on the right hand panel likely reflects fetal DNA, which has a peak size of 133-143 bp.
- the phenol/chloroform extraction protocol yielded a much higher concentration of DNA molecules having a size around the peak of 166 bp, with a statistically significant difference between the column library and the phenol/chloroform library (p ⁇ 10 ⁇ 25 ; Table 2, showing the concentration of the fraction of DNA molecules with a size ranging from 156 bp to 176 bp, as measured on 50 libraries for each extraction method).
- Each chromosome was divided into 50 kb bins and, for each bin, the number of UESs mapped to said bin was counted. The median value of the UESs counts per bin was calculated for each chromosome, thus yielding a sequence tag density value for all autosomes.
- sequence tag density of chromosome 21 was normalized to the median value of sequence tag densities for all autosomes, thus yielding the normalized sequence tag density for chromosome 21 , as shown in Fig. 4 for all 91 euploid and aneuploid samples. This value is indicative of the fraction of fetal and maternal DNA fragments issued from chromosome 21.
- Samples with normal karyotypes were used to constitute a reference set that provides the basis to normalize single chromosome counts.
- the diagnosis method according to the present invention is capable of perfectly discriminating trisomy 21 cases from non-trisomy 21 cases using a z-score of 4.4 (Fig. 3).
- sequence tag density of chromosome 18 was normalized to the median value of sequence tag densities for all autosomes, thus yielding the normalized sequence tag density, as shown in Figure 5 for all 91 euploid and aneuploid samples analyzed in this study.
- the diagnosis method according to the present invention is also capable of discriminating trisomy 18 cases from non-trisomy 18 cases using a z-score of 4.4, using the same reference set of 66 euploid samples.
- the method according to the invention allows a more stringent discrimination of about two orders of magnitude over first generations assays (Chiu ei al 2008, Fan ei al 2008, Stumm et al 2012) with a prior probability of ⁇ 1.1 - 10 "5 to generate false results by chance.
- the diagnosis method allows discriminating trisomy 21 samples, trisomy 13 samples, trisomy 18 samples, trisomy 22 samples, 4p microdeletion samples, 5p microdeletion-duplication samples from euploid samples, with a prior probability of ⁇ 1.1 ⁇ 10 "11 to generate false results by chance.
- Example 3 Size-selection of cell-free DNA :
- the amount of DNA extracted from a defined amount of blood can be variable, from a few nanograms to more than a microgram (on average between 10-50 ng/2ml of plasma). Analysis of the DNA has shown that this variability is caused mostly by the presence or absence of large DNA fragments (> 1 kb) which are likely the result of cell lysis, thus of maternal origin.
- a protocol was devised by the present inventors to eliminate large DNA fragments from the extracted cell-free DNA samples and thus "enrich" for the small DNA fragments (less than or equal to 200 bp) which contain the fetal DNA, thereby improving the quality of noninvasive prenatal diagnostic tests.
- the size selection procedure is carried out on the crude DNA extracts, prior to any further processing such as sequencing library preparation.
- Magnetic beads (AMPure® Beckman Coulter) were used for the size selection. According to this technology, DNA fragments bind to the magnetic beads, and are then separated from contaminants by application of a magnetic field. The bound DNA is washed with ethanol and is then eluted from the magnetic particles.
- Figure 16B shows the results obtained on analysis by Bioalayzer for samples GWX-351 , -352 and -353 after successive rounds of purification with AMPure beads.
- the large molecular weight peak is eliminated by the process of purification, and the lower molecular weight peak from 150-200 bp is retained. Comparable results were obtained with other samples. The results confirm that the high molecular weight fraction can be removed using the beads, producing a fraction having a size of approximately 200 bp and smaller.
- Example 4 Detection of aneuploidy on size-selected cell-free DNA samples (1) a) DNA extraction
- This process converts the overhangs resulting from fragmentation of the dsDNA into blunt ends using an End Repair Mix.
- the 3' to 5' exonuclease activity of this mix removes the 3' overhangs and the polymerase activity fills in the 5' overhangs.
- ERP End Repair Mix
- the samples were removed from the thermal cycler and subjected to a step of purification.
- a single 'A' nucleotide was added to the 3' ends of the blunt dsDNA fragments to prevent them from ligating to one another during the adapter ligation reaction, and to provide a complementary overhang for subseguently ligating an adapter to the fragment which has a corresponding single nucleotide on its 3' end .
- This strategy ensures a low rate of chimera (concatenated template) formation.
- ATL A-Tailing Mix
- paired-end adaptors such as those commercialised by lllumina, which allow PCR amplification, are ligated to the ends of the dsDNA.
- This step of the process uses PCR to selectively enrich those DNA fragments that have adapter molecules on both ends while adding a specific VINCI index to each sample and completing the adapter sequences to allow subsequent hybridization on a flow cell. Fragments devoid of adapters cannot hybridize to surface-bound primers in the flow cell, and fragments with an adapter on only one end can hybridize to surface bound primers but cannot form clusters.
- 34 ⁇ _ of PCR pre-mix was added to each well of the PCR plate, followed by 1 ⁇ _ of a thawed PCR P7-lndex Primer (25 ⁇ ). 15 ⁇ _ of sample was transferred to each well of the PCR plate, and 15 uL of water was added as negative control in an empty well of the sample plate.
- the plate was incubated on a thermal cycler using the following PCR program:
- UEM Unique Exact Sequence
- the values obtained from the UES count for a given chromosome in a first set of reference samples (e.g. reference set N1 ) having validated trisomy and validated euploidy were plotted on a graph.
- the normal (euploid) samples of the reference set were used to determine an interval of values which, in terms of probability, only one in one thousand normal samples should exceed. This interval was shown on the graph.
- Figures 39a to 39d show that the sample designated GWX-1 137 is normal for chromosomes 13, 16, 18 and 21.
- Figures 40a to 40d show that the samples designated GWX-1 196, GWX-1420, GWX-1421 and GWX-1470 have less than one chance in 10000 of being normal for chromosomes 13, 16, 18 and 21 respectively.
- the size selection procedure also decreased potentially false positive results.
- 9 were initially suspected of being pathological : 7 were finally validated by karyotyping, and two borderline cases turned out to have normal results after size selection.
- Example 4 The protocol described in Example 4 was adapted for use with a semiconductor-based NGS platform instead of a sequencing-by-synthesis platform, again using 48 test samples.
- Six new reference sets were generated using methodology identical to that used for analysis of the test samples, including size selection and use of a semiconductor-based NGS platform.
- the library preparation for this platform uses blunt-end adaptor ligation and does not involve dA-tailing. Moreover, a lower number of PCR cycles was used (8 instead of 15).
- the size selection step was identical to that described in Example 4.
- Table 1 comparison of the DNA quantity obtained by column extraction and by phenol/chloroform extraction sample 304784 307020 313999
- DNA concentration column cone (ng/ ⁇ ) 0.40 0.33 0.40 measured by Picogreen P/C cone, (ng/ ⁇ ) 1.53 1.19 1.82 column cells/ ⁇ 12.00 2.50 8.50
- Table 2 comparison of the DNA fraction at the peak between libraries obtained by column extraction and libraries obtained by phenol/chloroform extraction. DNA concentration at the peak (156-176 bp), ng/ ⁇
- Table 3 Number of unique exact sequences mapped from a total number of 20000 sequences obtained by pre-sequencing 30 libraries.
- Table 5 karyotypes of specific samples shown in Fig. 2 to 13
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Analytical Chemistry (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- Biotechnology (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Immunology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Pathology (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Investigating Or Analysing Biological Materials (AREA)
- Apparatus Associated With Microorganisms And Enzymes (AREA)
Abstract
Description
Claims
Priority Applications (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2015538513A JP2015534807A (en) | 2012-10-31 | 2013-10-31 | Non-invasive method for detecting fetal chromosomal aneuploidy |
CN201380068714.XA CN105074004A (en) | 2012-10-31 | 2013-10-31 | Non-invasive method for detecting a fetal chromosomal aneuploidy |
AU2013340795A AU2013340795A1 (en) | 2012-10-31 | 2013-10-31 | Non-invasive method for detecting a fetal chromosomal aneuploidy |
EP13786650.5A EP2914738A1 (en) | 2012-10-31 | 2013-10-31 | Non-invasive method for detecting a fetal chromosomal aneuploidy |
US14/439,579 US20150275290A1 (en) | 2012-10-31 | 2013-10-31 | Non-invasive method for detecting a fetal chromosomal aneuploidy |
CA2888906A CA2888906A1 (en) | 2012-10-31 | 2013-10-31 | Non-invasive method for detecting a fetal chromosomal aneuploidy |
IL238426A IL238426A0 (en) | 2012-10-31 | 2015-04-22 | Non-invasive method for detecting a fetal chromosomal aneuploidy |
HK15109158.2A HK1208708A1 (en) | 2012-10-31 | 2015-09-18 | Non-invasive method for detecting a fetal chromosomal aneuploidy |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP12190844.6A EP2728014B1 (en) | 2012-10-31 | 2012-10-31 | Non-invasive method for detecting a fetal chromosomal aneuploidy |
EP12190844.6 | 2012-10-31 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2014068075A1 true WO2014068075A1 (en) | 2014-05-08 |
Family
ID=47172444
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/EP2013/072848 WO2014068075A1 (en) | 2012-10-31 | 2013-10-31 | Non-invasive method for detecting a fetal chromosomal aneuploidy |
Country Status (10)
Country | Link |
---|---|
US (1) | US20150275290A1 (en) |
EP (3) | EP3026124A1 (en) |
JP (1) | JP2015534807A (en) |
CN (1) | CN105074004A (en) |
AU (1) | AU2013340795A1 (en) |
CA (1) | CA2888906A1 (en) |
DK (1) | DK2728014T3 (en) |
HK (1) | HK1208708A1 (en) |
IL (1) | IL238426A0 (en) |
WO (1) | WO2014068075A1 (en) |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105296606A (en) * | 2014-07-25 | 2016-02-03 | 深圳华大基因股份有限公司 | Method and device for determining proportion of free nucleic acids in biological sample and application of method and device for determining proportion of free nucleic acids in biological sample |
EP3018213A1 (en) | 2014-11-04 | 2016-05-11 | Genesupport SA | Method for determining the presence of a biological condition by determining total and relative amounts of two different nucleic acids |
WO2017093561A1 (en) | 2015-12-04 | 2017-06-08 | Genesupport Sa | Method for non-invasive prenatal testing |
US9976181B2 (en) | 2016-03-25 | 2018-05-22 | Karius, Inc. | Synthetic nucleic acid spike-ins |
US10450620B2 (en) | 2013-11-07 | 2019-10-22 | The Board Of Trustees Of The Leland Stanford Junior University | Cell-free nucleic acids for the analysis of the human microbiome and components thereof |
US10697008B2 (en) | 2017-04-12 | 2020-06-30 | Karius, Inc. | Sample preparation methods, systems and compositions |
US11111520B2 (en) | 2015-05-18 | 2021-09-07 | Karius, Inc. | Compositions and methods for enriching populations of nucleic acids |
US11674167B2 (en) | 2018-03-16 | 2023-06-13 | Karius, Inc. | Sample series to differentiate target nucleic acids from contaminant nucleic acids |
Families Citing this family (33)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9424392B2 (en) | 2005-11-26 | 2016-08-23 | Natera, Inc. | System and method for cleaning noisy genetic data from target individuals using genetic data from genetically related individuals |
US11111543B2 (en) | 2005-07-29 | 2021-09-07 | Natera, Inc. | System and method for cleaning noisy genetic data and determining chromosome copy number |
US11111544B2 (en) | 2005-07-29 | 2021-09-07 | Natera, Inc. | System and method for cleaning noisy genetic data and determining chromosome copy number |
US11322224B2 (en) | 2010-05-18 | 2022-05-03 | Natera, Inc. | Methods for non-invasive prenatal ploidy calling |
US9677118B2 (en) | 2014-04-21 | 2017-06-13 | Natera, Inc. | Methods for simultaneous amplification of target loci |
US10316362B2 (en) | 2010-05-18 | 2019-06-11 | Natera, Inc. | Methods for simultaneous amplification of target loci |
US11408031B2 (en) | 2010-05-18 | 2022-08-09 | Natera, Inc. | Methods for non-invasive prenatal paternity testing |
US11332793B2 (en) | 2010-05-18 | 2022-05-17 | Natera, Inc. | Methods for simultaneous amplification of target loci |
US11939634B2 (en) | 2010-05-18 | 2024-03-26 | Natera, Inc. | Methods for simultaneous amplification of target loci |
US20190010543A1 (en) | 2010-05-18 | 2019-01-10 | Natera, Inc. | Methods for simultaneous amplification of target loci |
EP2854058A3 (en) | 2010-05-18 | 2015-10-28 | Natera, Inc. | Methods for non-invasive pre-natal ploidy calling |
US11326208B2 (en) | 2010-05-18 | 2022-05-10 | Natera, Inc. | Methods for nested PCR amplification of cell-free DNA |
US11332785B2 (en) | 2010-05-18 | 2022-05-17 | Natera, Inc. | Methods for non-invasive prenatal ploidy calling |
US11339429B2 (en) | 2010-05-18 | 2022-05-24 | Natera, Inc. | Methods for non-invasive prenatal ploidy calling |
JP6153874B2 (en) | 2011-02-09 | 2017-06-28 | ナテラ, インコーポレイテッド | Method for non-invasive prenatal ploidy calls |
EP3561075A1 (en) | 2014-04-21 | 2019-10-30 | Natera, Inc. | Detecting mutations in tumour biopsies and cell-free samples |
EP3294906B1 (en) | 2015-05-11 | 2024-07-10 | Natera, Inc. | Methods for determining ploidy |
SG11201804651XA (en) * | 2015-12-04 | 2018-07-30 | Green Cross Genome Corp | Method for determining copy-number variation in sample comprising mixture of nucleic acids |
GB201522665D0 (en) * | 2015-12-22 | 2016-02-03 | Premaitha Ltd | Detection of chromosome abnormalities |
WO2018067517A1 (en) | 2016-10-04 | 2018-04-12 | Natera, Inc. | Methods for characterizing copy number variation using proximity-litigation sequencing |
US10011870B2 (en) | 2016-12-07 | 2018-07-03 | Natera, Inc. | Compositions and methods for identifying nucleic acid molecules |
HRP20240709T1 (en) * | 2017-01-24 | 2024-08-16 | Bgi Genomics Co., Ltd. | Method and device for determining proportion of free nucleotide from predetermined source in biological sample |
CN108342455B (en) * | 2017-06-25 | 2021-11-30 | 北京新羿生物科技有限公司 | Method for detecting fetal aneuploid chromosome from maternal peripheral blood and kit thereof |
US11851650B2 (en) | 2017-09-28 | 2023-12-26 | Grail, Llc | Enrichment of short nucleic acid fragments in sequencing library preparation |
JP2021506342A (en) | 2017-12-14 | 2021-02-22 | ティーエーアイ ダイアグノスティックス インコーポレイテッドTai Diagnostics,Inc. | Evaluation of Graft Conformity for Transplantation |
CA3090426A1 (en) | 2018-04-14 | 2019-10-17 | Natera, Inc. | Methods for cancer detection and monitoring by means of personalized detection of circulating tumor dna |
CA3105349A1 (en) * | 2018-05-03 | 2019-11-07 | The Chinese University Of Hong Kong | Size-tagged preferred ends and orientation-aware analysis for measuring properties of cell-free mixtures |
CN111373054B (en) * | 2018-05-31 | 2024-06-25 | 深圳华大临床检验中心 | Method, system and computer readable medium for determining whether triploid exists in male test sample |
US11525159B2 (en) | 2018-07-03 | 2022-12-13 | Natera, Inc. | Methods for detection of donor-derived cell-free DNA |
CN109584963A (en) * | 2018-09-30 | 2019-04-05 | 南京派森诺基因科技有限公司 | A kind of diversified abstracting method of high-flux sequence data |
EP3956466A1 (en) * | 2019-04-15 | 2022-02-23 | Natera, Inc. | Improved liquid biopsy using size selection |
RU2717023C1 (en) * | 2019-04-24 | 2020-03-17 | Общество с ограниченной ответственностью "ГЕНОТЕК ИТ" | Method for determining foetal karyotype of pregnant woman based on sequencing hybrid readings consisting of short fragments of extracellular dna |
WO2022035670A1 (en) * | 2020-08-09 | 2022-02-17 | Myriad Women's Health, Inc. | Bayesian sex caller |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20120270739A1 (en) * | 2010-01-19 | 2012-10-25 | Verinata Health, Inc. | Method for sample analysis of aneuploidies in maternal samples |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2456229A1 (en) * | 2001-08-03 | 2003-02-13 | Diversa Corporation | Epoxide hydrolases, nucleic acids encoding them and methods for making and using them |
CA2541706C (en) * | 2003-10-08 | 2014-02-18 | The Trustees Of Boston University | Methods for prenatal diagnosis of chromosomal abnormalities |
CA2544178A1 (en) * | 2003-10-30 | 2005-05-19 | Tufts-New England Medical Center | Prenatal diagnosis using cell-free fetal dna in amniotic fluid |
EA035451B9 (en) * | 2007-07-23 | 2020-09-09 | Те Чайниз Юниверсити Ов Гонконг | Method to diagnose cancer using genomic sequencing |
US20120010085A1 (en) * | 2010-01-19 | 2012-01-12 | Rava Richard P | Methods for determining fraction of fetal nucleic acids in maternal samples |
-
2012
- 2012-10-31 DK DK12190844.6T patent/DK2728014T3/en active
- 2012-10-31 EP EP15188289.1A patent/EP3026124A1/en not_active Withdrawn
- 2012-10-31 EP EP12190844.6A patent/EP2728014B1/en active Active
-
2013
- 2013-10-31 CA CA2888906A patent/CA2888906A1/en not_active Abandoned
- 2013-10-31 US US14/439,579 patent/US20150275290A1/en not_active Abandoned
- 2013-10-31 AU AU2013340795A patent/AU2013340795A1/en not_active Abandoned
- 2013-10-31 CN CN201380068714.XA patent/CN105074004A/en active Pending
- 2013-10-31 EP EP13786650.5A patent/EP2914738A1/en not_active Withdrawn
- 2013-10-31 WO PCT/EP2013/072848 patent/WO2014068075A1/en active Application Filing
- 2013-10-31 JP JP2015538513A patent/JP2015534807A/en active Pending
-
2015
- 2015-04-22 IL IL238426A patent/IL238426A0/en unknown
- 2015-09-18 HK HK15109158.2A patent/HK1208708A1/en unknown
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20120270739A1 (en) * | 2010-01-19 | 2012-10-25 | Verinata Health, Inc. | Method for sample analysis of aneuploidies in maternal samples |
Non-Patent Citations (8)
Title |
---|
ARASH DAVOUDI ET AL: "The application of amplified TSPY and amelogenin genes from maternal plasma as a non-invasive bovine fetal DNA diagnosis", EURASIAN JOURNAL OF BIOSCIENCES, vol. 5, 1 November 2011 (2011-11-01), pages 119 - 126, XP055099848, DOI: 10.5053/ejobios.2011.5.0.14 * |
CHIU ROSSA W K ET AL: "Noninvasive prenatal diagnosis of fetal chromosomal aneuploidy by massively parallel genomic sequencing of DNA in maternal plasma", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES, NATIONAL ACADEMY OF SCIENCES, US, vol. 105, no. 51, 23 December 2008 (2008-12-23), pages 20458 - 20463, XP002620454, ISSN: 0027-8424, [retrieved on 20081210], DOI: 10.1073/PNAS.0810641105 * |
DEEPIKA MISRA ET AL: "A simple and reliable method of obtaining fetal DNA from maternal circulation; its accuracy and sensitivity", ANNALS OF BIOLOGICAL RESEARCH, vol. 2, no. 6, 1 January 2011 (2011-01-01), pages 155 - 164, XP055099849, Retrieved from the Internet <URL:http://scholarsresearchlibrary.com/ABR-vol2-iss6/ABR-2011-2-6-155-164.pdf> [retrieved on 20140203] * |
ERIC Z. CHEN ET AL: "Noninvasive Prenatal Diagnosis of Fetal Trisomy 18 and Trisomy 13 by Maternal Plasma DNA Sequencing", PLOS ONE, vol. 6, no. 7, 6 July 2011 (2011-07-06), pages e21791, XP055024137, DOI: 10.1371/journal.pone.0021791 * |
FAN H C ET AL: "Noninvasive diagnosis of fetal aneuploidy by shotgun sequencing DNA from maternal blood", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES, NATIONAL ACADEMY OF SCIENCES, US, vol. 105, no. 42, 21 October 2008 (2008-10-21), pages 16266 - 16271, XP002613056, ISSN: 0027-8424, [retrieved on 20081006], DOI: 10.1073/PNAS.0808319105 * |
LO Y M DENNIS ET AL: "Maternal plasma DNA sequencing reveals the genome-wide genetic and mutational profile of the fetus", SCIENCE / SCIENCE TRANSLATIONAL MEDICINE, WASHINGTON, DC : AAAS, US, vol. 2, no. 61, 8 December 2010 (2010-12-08), pages 61ra91 - 1, XP008132703, ISSN: 1946-6242, DOI: 10.1126/SCITRANSLMED.3001720 * |
MATHIAS EHRICH ET AL: "Noninvasive detection of fetal trisomy 21 by sequencing of DNA in maternal blood: a study in a clinical setting", AMERICAN JOURNAL OF OBSTETRICS & GYNECOLOGY, MOSBY, ST LOUIS, MO, US, vol. 204, no. 3, 28 December 2010 (2010-12-28), pages 205.e1 - 205.e11, XP028184664, ISSN: 0002-9378, [retrieved on 20110107], DOI: 10.1016/J.AJOG.2010.12.060 * |
R. W. K. CHIU ET AL: "Non-invasive prenatal assessment of trisomy 21 by multiplexed maternal plasma DNA sequencing: large scale validity study", BMJ, vol. 342, no. jan11 1, 11 January 2011 (2011-01-11), pages c7401 - c7401, XP055024134, ISSN: 0959-8138, DOI: 10.1136/bmj.c7401 * |
Cited By (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11427876B2 (en) | 2013-11-07 | 2022-08-30 | The Board Of Trustees Of The Leland Stanford Junior University | Cell-free nucleic acids for the analysis of the human microbiome and components thereof |
US11401562B2 (en) | 2013-11-07 | 2022-08-02 | The Board Of Trustees Of The Leland Stanford Junior University | Cell-free nucleic acids for the analysis of the human microbiome and components thereof |
US11365453B2 (en) | 2013-11-07 | 2022-06-21 | The Board Of Trustees Of The Leland Stanford Junior University | Cell-free nucleic acids for the analysis of the human microbiome associated with respiratory infection |
US10450620B2 (en) | 2013-11-07 | 2019-10-22 | The Board Of Trustees Of The Leland Stanford Junior University | Cell-free nucleic acids for the analysis of the human microbiome and components thereof |
KR102018444B1 (en) | 2014-07-25 | 2019-09-04 | 비지아이 제노믹스 코포레이션 리미티드 | Method and device for determining fraction of cell-free nucleic acids in biological sample and use thereof |
KR20170036734A (en) * | 2014-07-25 | 2017-04-03 | 비지아이 제노믹스 코포레이션 리미티드 | Method and device for determining a ratio of free nucleic acids in a biological sample and use thereof |
CN105296606A (en) * | 2014-07-25 | 2016-02-03 | 深圳华大基因股份有限公司 | Method and device for determining proportion of free nucleic acids in biological sample and application of method and device for determining proportion of free nucleic acids in biological sample |
EP3178941B1 (en) * | 2014-07-25 | 2021-10-13 | BGI Genomics Co., Limited | Method for determining the fraction of cell-free fetal nucleic acids in a peripheral blood sample from a pregnant woman and use thereof |
WO2016071369A1 (en) | 2014-11-04 | 2016-05-12 | Genesupport Sa | Method for determining the presence of a biological condition by determining total and relative amounts of two different nucleic acids |
EP3018213A1 (en) | 2014-11-04 | 2016-05-11 | Genesupport SA | Method for determining the presence of a biological condition by determining total and relative amounts of two different nucleic acids |
US11111520B2 (en) | 2015-05-18 | 2021-09-07 | Karius, Inc. | Compositions and methods for enriching populations of nucleic acids |
WO2017093561A1 (en) | 2015-12-04 | 2017-06-08 | Genesupport Sa | Method for non-invasive prenatal testing |
US9976181B2 (en) | 2016-03-25 | 2018-05-22 | Karius, Inc. | Synthetic nucleic acid spike-ins |
US11078532B2 (en) | 2016-03-25 | 2021-08-03 | Karius, Inc. | Synthetic nucleic acid spike-ins |
US11692224B2 (en) | 2016-03-25 | 2023-07-04 | Karius, Inc. | Synthetic nucleic acid spike-ins |
US10697008B2 (en) | 2017-04-12 | 2020-06-30 | Karius, Inc. | Sample preparation methods, systems and compositions |
US11180800B2 (en) | 2017-04-12 | 2021-11-23 | Karius, Inc. | Sample preparation methods, systems and compositions |
US11834711B2 (en) | 2017-04-12 | 2023-12-05 | Karius, Inc. | Sample preparation methods, systems and compositions |
US11674167B2 (en) | 2018-03-16 | 2023-06-13 | Karius, Inc. | Sample series to differentiate target nucleic acids from contaminant nucleic acids |
Also Published As
Publication number | Publication date |
---|---|
EP2914738A1 (en) | 2015-09-09 |
EP2728014B1 (en) | 2015-10-07 |
IL238426A0 (en) | 2015-06-30 |
HK1208708A1 (en) | 2016-04-15 |
CA2888906A1 (en) | 2014-05-08 |
EP3026124A1 (en) | 2016-06-01 |
AU2013340795A1 (en) | 2015-05-14 |
EP2728014A1 (en) | 2014-05-07 |
US20150275290A1 (en) | 2015-10-01 |
DK2728014T3 (en) | 2016-01-25 |
CN105074004A (en) | 2015-11-18 |
JP2015534807A (en) | 2015-12-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20150275290A1 (en) | Non-invasive method for detecting a fetal chromosomal aneuploidy | |
JP7490219B2 (en) | Diagnosis of fetal chromosomal aneuploidies using genomic sequencing | |
JP6513622B2 (en) | Process and composition for methylation based enrichment of fetal nucleic acid from maternal sample useful for non-invasive prenatal diagnosis | |
EP3608420B1 (en) | Nucleic acids and methods for detecting chromosomal abnormalities | |
CA2807594C (en) | Assay systems for genetic analysis | |
EP3018213A1 (en) | Method for determining the presence of a biological condition by determining total and relative amounts of two different nucleic acids | |
WO2011051283A1 (en) | Means and methods for non-invasive diagnosis of chromosomal aneuploidy | |
GB2485635A (en) | Chromosomal aneuploidy detection by mass sequencing and analysis against whole or segment of normalising chromosome. | |
JP2017506908A (en) | Method for detecting a decrease or increase in the amount of nucleic acid having a sequence of interest | |
EP2994539A1 (en) | Non-invasive early detection of solid organ transplant rejection by quantitative analysis of mixtures by deep sequencing of hla gene amplicons using next generation systems | |
WO2017093561A1 (en) | Method for non-invasive prenatal testing | |
JP2020512000A (en) | How to detect fetal chromosomal abnormalities | |
CN112888783A (en) | Improvement of free DNA quality | |
CN111321210B (en) | Method for non-invasive prenatal detection of whether fetus suffers from genetic disease | |
JP2023527761A (en) | Nucleic acid sample enrichment and screening methods | |
CN109280697B (en) | Method for identifying fetal genotype by using plasma free DNA of pregnant woman | |
WO2015042649A1 (en) | A quantitative assay for target dna in a mixed sample comprising target and non-target dna | |
US11667955B2 (en) | Methods for isolation of cell-free DNA using an anti-double-stranded DNA antibody | |
US20230183672A1 (en) | Methods for isolating circulating nucleic acids from urine samples | |
EP3149202A1 (en) | Method of prenatal diagnosis |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 201380068714.X Country of ref document: CN |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 13786650 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2888906 Country of ref document: CA |
|
WWE | Wipo information: entry into national phase |
Ref document number: 238426 Country of ref document: IL |
|
ENP | Entry into the national phase |
Ref document number: 2015538513 Country of ref document: JP Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: 14439579 Country of ref document: US |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
ENP | Entry into the national phase |
Ref document number: 2013340795 Country of ref document: AU Date of ref document: 20131031 Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2013786650 Country of ref document: EP |