US20230079748A1 - Preparation method, product, and application of circulating tumor dna reference samples - Google Patents

Preparation method, product, and application of circulating tumor dna reference samples Download PDF

Info

Publication number
US20230079748A1
US20230079748A1 US17/901,431 US202217901431A US2023079748A1 US 20230079748 A1 US20230079748 A1 US 20230079748A1 US 202217901431 A US202217901431 A US 202217901431A US 2023079748 A1 US2023079748 A1 US 2023079748A1
Authority
US
United States
Prior art keywords
dna
reference sample
circulating tumor
cancer
cfdna
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
US17/901,431
Other languages
English (en)
Inventor
Guolin ZHONG
Mao Mao
Shiyong Li
Dandan Zhu
Yan Chen
Yumin Feng
Wei Wu
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Seekin Inc
Original Assignee
Seekin Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Seekin Inc filed Critical Seekin Inc
Assigned to SEEKIN, INC. reassignment SEEKIN, INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: SEEKIN INC. (ZHENGZHOU), SEEKIN, INC.
Assigned to SEEKIN, INC. reassignment SEEKIN, INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: CHEN, YAN, ZHONG, Guolin, FENG, YUMIN, LI, SHIYONG, MAO, MAO, WU, WEI
Assigned to SEEKIN INC. (ZHENGZHOU) reassignment SEEKIN INC. (ZHENGZHOU) ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: ZHU, DANDAN
Publication of US20230079748A1 publication Critical patent/US20230079748A1/en
Pending legal-status Critical Current

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1003Extracting or separating nucleic acids from biological samples, e.g. pure separation or isolation methods; Conditions, buffers or apparatuses therefor
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • C12Q1/6886Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6844Nucleic acid amplification reactions
    • C12Q1/6858Allele-specific amplification
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6806Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6844Nucleic acid amplification reactions
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/156Polymorphic or mutational markers
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/158Expression markers
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/166Oligonucleotides used as internal standards, controls or normalisation probes

Definitions

  • the present disclosure belongs to the biotechnology field, and in particular, the present disclosure relates to methods of preparing circulating tumor DNA as reference samples, methods of production and application.
  • Circulating cell-free DNA (cfDNA) is continuously released into blood following apoptosis or necrosis, and is usually about 150 to 200 base pairs with a half-life of about 0.5-1 hours. In healthy subjects, cfDNA concentration is minimal, usually 10-15 ng per 1 mL of plasma, whereas cfDNA concentration is significantly increased in cancer patients.
  • the DNA released from tumor cells measured in peripheral blood is also known as circulating tumor DNA (ctDNA), and ctDNA is a subgroup of cfDNA. The proportion of ctDNA in cfDNA varies greatly among cancer patients, from as little as 0.1% to more than 90%. With the emergence of the Next-Generation Sequencing (NGS) technology, massively parallel sequencing has become possible.
  • NGS Next-Generation Sequencing
  • NGS technology has many advantages, e.g., fast speed, high accuracy, low cost and wide coverage.
  • the technology of cfDNA detection using NGS has been widely used in cancer early screening and diagnosis, medication guidance, prognostic stratification, recurrence monitoring and treatment response monitoring.
  • Previous studies on cfDNA have focused on the analysis of different types of mutations (point mutations, insertions, deletions, etc.), copy number alterations as well as genetic correlations and polymorphisms.
  • a growing number of studies now show that the cfDNA fragmentation pattern differs significantly between normal individuals and cancer patients, and that the pattern differs among different tissue of origins. By taking cfDNA fragmentation patterns into account, it is possible to assess whether the subject has malignant lesion and to trace its tissue of origin.
  • reference samples are required for validating the performance of detection somatic mutations, including single nucleotide variant (SNV), insertions and deletions (indels), structural variation (SV), copy number variation (CNV) and fragment size of cfDNA.
  • SNV single nucleotide variant
  • Indels insertions and deletions
  • SV structural variation
  • CNV copy number variation
  • fragment size of cfDNA fragment size of cfDNA.
  • the ideal reference samples are derived from clinical samples, but clinical samples are valuable research materials limited in availability, making it difficult to serve as a long-term source of reference samples.
  • the reference samples should have high consistency with clinical samples and are simple to prepare.
  • NGS next-generation sequencing
  • cfDNA as quality control materials by extracting DNA from apoptosis cells.
  • the reference samples described herein can simulate the real plasma cfDNA fragmentation pattern to the maximum extent and do not affect the detection of point mutation, copy number variation and structural variation.
  • the reference samples described herein are valuable in assay development, test validation, internal quality control, and efficacy tests.
  • the disclosure is related to a method of preparing a circulating tumor DNA reference sample, the method comprising: (1) inducing apoptosis in tumor cells; and (2) extracting DNA from the tumor cells to obtain the circulating tumor DNA reference sample.
  • the tumor cells are incubated with an apoptosis inducer (e.g., in the culture medium) to induce apoptosis.
  • the apoptosis inducer can bind to the topoisomerase-DNA complex during DNA replication to prevent DNA strand reassembly and/or cause DNA double strand break.
  • the tumor cells are incubated with the apoptosis inducer in the culture medium for 2-8 hours.
  • the tumor cells are incubated with the apoptosis inducer in the culture medium for about 5 hours.
  • the apoptosis inducer is selected from the group consisting of camptothecin (CPT), As203, notopterol and gracillin.
  • the apoptosis inducer is CPT.
  • the concentration of the apoptosis inducer is about 5-15 ⁇ M. In some embodiments, the concentration of the apoptosis inducer is about 10 ⁇ M.
  • the disclosure is related to a method of preparing a circulating tumor DNA reference sample, the method comprising: (1) inducing apoptosis in tumor cells by treating tumor cells with CPT at a concentration of about 10 ⁇ M for 2-8 hours (e.g., about 5 hours); and (2) extracting DNA from the tumor cells to obtain the circulating tumor DNA reference sample.
  • the disclosure is related to a circulating tumor DNA reference sample obtained using the method described herein.
  • the disclosure is related to a method for determining the quality of the circulating tumor DNA reference sample described herein, the method comprising: (1) providing a first DNA library of DNA extracted from tumor cells that are not treated with an apoptosis inducer; (2) providing a second DNA library by sequencing the circulating tumor DNA reference sample; (3) identifying one or more genetic variations in the first DNA library and one or more genetic variations in the second DNA library; and (4) comparing the one or more genetic variations in the first and second DNA libraries.
  • consistency of the genetic variations in the first and second DNA libraries indicates a good quality of the circulating tumor DNA reference sample.
  • the one or more genetic variations are selected from the group consisting of single nucleotide variations (e.g., point mutations), structural variations, copy number variations, and/or fragmentation pattern variations.
  • the methods described herein further comprises, prior to step (1): determining and comparing the size distribution of the circulating tumor DNA reference sample and the size distribution of the cell-free DNA (cfDNA) from plasma of a subject.
  • the size distributions of the circulating tumor DNA reference sample and the cfDNA share a fragmentation pattern having one or more of the following features: (1) the fragmentation pattern comprises a main peak representing nucleosome monomers with a length of about 166 bp; (2) the fragmentation pattern comprises one or more sub-peaks representing complexes of nucleosome monomers (e.g., dimers and trimers); and (3) the fragmentation pattern comprises one or more minor sub-peaks with a length of less than 150 bp.
  • the disclosure is related to a method of predicting cancer using the circulating tumor DNA reference sample as described herein, or the circulating tumor DNA reference sample prepared by the method described herein.
  • the disclosure is related to a method of predicting cancer, comprising: (1) determining the size distribution of the cell-free DNA (cfDNA) from plasma of a subject; (2) determining the size distribution of the circulating tumor DNA reference sample as described herein; and (3) comparing the size distribution of the cfDNA in (1) and the size distribution of the circulating tumor DNA reference sample in (2).
  • cfDNA cell-free DNA
  • a matching fragmentation pattern of the size distribution of the cfDNA in step (1) and the size distribution of the circulating tumor DNA reference sample in step (2) indicates existence of cancer in the subject,
  • the fragmentation pattern is matched when the Pearson correlation coefficient is at least 0.5 for fragments between 50-166 bp between the size distribution of the cfDNA in step (1) and the size distribution of the circulating tumor DNA reference sample in step (2).
  • the subject is a human patient diagnosed with cancer, suspected to have cancer, or having a risk to have cancer.
  • the disclosure is related to a method of validating an assay, comprising: (1) providing a first DNA library of DNA prepared from a first type of cells (e.g., tumor cells) treated with an apoptosis inducer, in some embodiments, the first type of cells have one or more mutations at a chromosomal site; (2) providing a second DNA library of DNA prepared from a second type of cells (e.g., normal cells) treated with the apoptosis inducer, in some embodiments, the second type of cells have no mutation at the chromosomal site; (3) constructing a third DNA library of DNA prepared from a test sample; and (4) detecting the one or more mutations from the constructed DNA libraries.
  • a first type of cells e.g., tumor cells
  • apoptosis inducer in some embodiments, the first type of cells have one or more mutations at a chromosomal site
  • second type of cells e.g., normal cells
  • the second type of cells have no mutation at the
  • detection of the one or more mutations from the first DNA library and no detection of the one or more mutations from the second DNA library indicate the assay is validated. In some embodiments, no detection of the one or more mutations from the first DNA library or detection of the one or more mutations from the second DNA library indicates the assay is not validated.
  • the disclosure is related to a method of determining the limit of detection (LOD) of mutation frequency of an assay, comprising: (1) providing DNA extracted from a first type of cells treated with an apoptosis inducer, in some embodiments, the first type of tumor cells have one or more mutations at a chromosomal site; (2) providing DNA extracted from a second type of cells treated with the apoptosis inducer, in some embodiments, the second type of tumor cells have no mutation at the chromosomal site; (3) mixing the DNA from step (1) and step (2) at different ratios to obtain a series of DNA samples; (4) constructing one or more DNA libraries from the series of DNA samples; and (5) determining the frequency of the one or more mutations from the constructed DNA libraries.
  • LOD limit of detection
  • the LOD of mutation frequency of the assay can be determined by the frequency of the one or more mutations from the constructed DNA libraries.
  • the series of DNA samples have different mutation frequencies.
  • the first type of cells are tumor cells and/or the second type of cells are normal cells.
  • the test sample is from a human patient diagnosed with cancer, suspected to have cancer, or having a risk to have cancer.
  • the test sample is plasma.
  • the test sample contains circulating cell-free DNA (cfDNA), e.g., circulating tumor DNA (ctDNA).
  • the disclosure is related to a method of validating an assay, comprising: (1) providing a first DNA library of DNA extracted from tumor cells treated with an apoptosis inducer, in some embodiments, the tumor cells have one or more mutations at a chromosomal site; (2) constructing a second DNA library of DNA prepared from a test sample; and (3) detecting the one or more mutations from the constructed DNA libraries.
  • detection of the one or more mutations from the first DNA library indicates the assay is validated.
  • no detection of the one or more mutations from the first DNA library indicates the assay is not validated.
  • the method described herein further comprises: providing a third DNA library of DNA extracted from normal cells treated with the apoptosis inducer, in some embodiments, the normal cells have no mutation at the chromosomal site. In some embodiments, no detection of the one or more mutations from the third DNA library indicates the assay is validated. In some embodiments, detection of the one or more mutations from the third DNA library indicates the assay is not validated.
  • the disclosure is related to a method for mimicking human plasma with different DNA mutation frequency, comprising: adding the circulating tumor DNA reference sample as described herein, or the circulating tumor DNA reference sample prepared by the method as described herein into artificial plasma.
  • the disclosure is related to a cancer prediction kit, comprising the circulating tumor DNA reference sample as described herein, or the circulating tumor DNA reference sample prepared by the method as described herein.
  • CPT refers to camptothecin, a cancer treatment drug.
  • single-nucleotide variant also known as single-nucleotide polymorphism (SNP) is the variant of a single nucleotide that occurs at a specific genomic position.
  • SV structural variation
  • CNVs copy number variations
  • CNV copy number variation
  • cancer refers to cells having the capacity for autonomous growth, i.e., an abnormal state or condition characterized by rapidly proliferating cell growth.
  • the term is meant to include all types of cancerous growths or oncogenic processes, metastatic tissues or malignantly transformed cells, tissues, or organs, irrespective of histopathologic type or stage of invasiveness.
  • tumor refers to cancerous cells, e.g., a mass of cancerous cells.
  • Cancers that can be predicted using the methods described herein include malignancies of the various organ systems, such as affecting lung, breast, thyroid, lymphoid, gastrointestinal, and genito-urinary tract, as well as adenocarcinomas which include malignancies such as most colon cancers, renal-cell carcinoma, prostate cancer and/or testicular tumors, non-small cell carcinoma of the lung, cancer of the small intestine and cancer of the esophagus.
  • the tumor or cancer described herein is lymphoma, non-small cell lung cancer, cervical cancer, leukemia, ovarian cancer, nasopharyngeal cancer, breast cancer, endometrial cancer, colon cancer, rectal cancer, gastric cancer, bladder cancer, glioma, lung cancer, bronchial cancer, bone cancer, prostate cancer, pancreatic cancer, liver and bile duct cancer, esophageal cancer, kidney cancer, thyroid cancer, head and neck cancer, testicular cancer, glioblastoma, astrocytoma, melanoma, myeloproliferation abnormal syndromes, and sarcomas.
  • the leukemia is selected from acute lymphocytic (lymphoblastic) leukemia, acute myeloid leukemia, myeloid leukemia, chronic lymphocytic leukemia, multiple myeloma, plasma cell leukemia, and chronic myelogenous leukemia.
  • the lymphoma is selected from Hodgkin's lymphoma and non-Hodgkin's lymphoma, including B-cell lymphoma, diffuse large B-cell lymphoma, follicular lymphoma, mantle cell lymphoma, marginal zone B-cell lymphoma, T-cell lymphoma, and Waldenstrom macroglobulinemia.
  • the sarcoma is selected from the group consisting of osteosarcoma, Ewing sarcoma, leiomyosarcoma, synovial sarcoma, soft tissue sarcoma, angiosarcoma, liposarcoma, fibrosarcoma, rhabdomyosarcoma , and chondrosarcoma.
  • FIGS. 1 A- 1 C show the results of different DNA fragment size comparisons.
  • FIG. 1 A shows DNA fragmentation by ultrasonication.
  • FIG. 1 B shows DNA obtained from artificially-induced apoptotic cells.
  • FIG. 1 C shows plasma cfDNA from a cancer patient. The dashed lines stand for fragments of about 80 bp, 91 bp, 102 bp, 111 bp, 122 bp, and 134 bp.
  • FIGS. 2 A- 2 G show fragmented DNA derived from different cells using different apoptosis-inducing methods.
  • FIGS. 2 A- 2 D show the distribution of DNA fragments obtained from HL-60 resistance cells after CPT treatment for 5 hours, ATRA treatment for 3 days, high-density culture for 3 days, and CPT treatment for 24 hours, respectively.
  • FIGS. 2 E- 2 G show the distribution of DNA fragments obtained from NB-4 cells after CPT treatment for 5 hours, ATRA treatment for 3 days, and high-density culture for 3 days, respectively.
  • the dashed lines stand for fragments of about 80 bp, 91 bp, 102 bp, 111 bp, 122 bp, and 134 bp.
  • FIG. 3 shows the consistency analysis of copy number variation between the cfDNA reference samples and paired NB4 cell line genomic DNA; x-axis show the information of copy number variation from cfDNA reference samples, y-axis show the information of copy number variation from paired NB4 cell line genomic DNA.
  • cfDNA reference samples are usually prepared by sonication or enzymatic fragmentation to shear human genomic DNA into 100-200 bp fragments. But the sequence information, fragment size and random distribution characteristics of the DNA fragments yielded by these methods are different from those obtained from the plasma cfDNA.
  • the most commonly used enzyme digestion tool at present is micrococcal nuclease (MNase), which degrades DNA in the nucleosome junction region and releases individual nucleosomes.
  • MNase micrococcal nuclease
  • the DNA fragments obtained by this method are shorter than the cfDNA, where the main peak of length distribution of the former is about 146 bp while that of the latter is about 166 bp.
  • the MNase enzymatic digestion method has some limitations. Firstly, MNase has a digestion bias for A-T-rich regions, resulting in a decreased presentation of nucleosomes in the A-T-rich regions; secondly, MNase cannot break precisely at the nucleosome boundary, which leads to differences in determining the open position of chromatin from the real situation; thirdly, MNase is biased to digest fragile nucleosomes. Therefore, MNase is not a good digestion tool to prepare cfDNA reference samples. However, it is particularly important to prepare a reference sample that can be used as a quality control for cfDNA variant information detection.
  • nucleic acid fragments in a mixture of nucleic acid fragments are analyzed.
  • a mixture of nucleic acids can comprise two or more nucleic acid fragment species having different nucleotide sequences, different fragment lengths, different origins (e.g., genomic origins, cell or tissue origins, tumor origins, cancer origins, sample origins, subject origins, fetal origins, maternal origins), or combinations thereof.
  • Nucleic acid or a nucleic acid mixture described herein can be isolated from a sample obtained from a subject.
  • a subject can be any living or non-living organism, including but not limited to a human, a non-human animal, a mammal, a plant, a bacterium, a fungus or a virus.
  • Any human or non-human animal can be selected, including but not limited to mammal, reptile, avian, amphibian, fish, ungulate, ruminant, bovine (e.g., cattle), equine (e.g., horse), caprine and ovine (e.g., sheep, goat), swine (e.g., pig), camelid (e.g., camel, llama, alpaca), monkey, ape (e.g., gorilla, chimpanzee), ursid (e.g., bear), poultry, dog, cat, mouse, rat, fish, dolphin, whale and shark.
  • a subject can be a male or female.
  • Nucleic acid can be isolated from any type of suitable biological specimen or sample (e.g., a test sample).
  • a sample or test sample can be any specimen that is isolated or obtained from a subject (e.g., a human subject).
  • specimens include fluid or tissue from a subject, including, without limitation, blood, serum, umbilical cord blood, chorionic amniotic fluid, cerebrospinal fluid, spinal fluid, lavage fluid (e.g., bronchoalveolar, gastric, peritoneal, ductal, ear, arthroscopic), biopsy sample, celocentesis sample, fetal cellular remnants, urine, feces, sputum, saliva, nasal mucous, prostate fluid, lavage, semen, lymphatic fluid, bile, tears, sweat, breast milk, breast fluid, embryonic cells and fetal cells (e.g. placental cells).
  • a biological sample can be blood, plasma or serum.
  • blood encompasses whole blood or any fractions of blood, such as serum and plasma. Blood or fractions thereof can comprise cell-free or intracellular nucleic acids.
  • Blood can comprise buffy coats. Buffy coats are sometimes isolated by utilizing a ficoll gradient. Buffy coats can comprise white blood cells (e.g., leukocytes, T-cells, B-cells, platelets).
  • Blood plasma refers to the fraction of whole blood resulting from centrifugation of blood treated with anticoagulants.
  • Blood serum refers to the watery portion of fluid remaining after a blood sample has coagulated. Fluid or tissue samples often are collected in accordance with standard protocols hospitals or clinics generally follow.
  • a fluid or tissue sample from which nucleic acid is extracted can be acellular (e.g., cell-free).
  • a fluid or tissue sample can contain cellular elements or cellular remnants.
  • cancer cells or tumor cells can be included in the sample.
  • a sample often is heterogeneous. In many cases, more than one type of nucleic acid species is present in the sample.
  • heterogeneous nucleic acid can include, but is not limited to, cancer and non-cancer nucleic acid, pathogen and host nucleic acid, and/or mutated and wild-type nucleic acid.
  • a sample may be heterogeneous because more than one cell type is present, such as a cancer and non-cancer cell, or a pathogenic and host cell.
  • the sample comprise cell free DNA (cfDNA) or circulating tumor DNA (ctDNA).
  • cfDNA cell free DNA
  • ctDNA circulating tumor DNA
  • the term “cell-free DNA” or “cfDNA” refers to DNA that is freely circulating in the bloodstream.
  • these cfDNA can be isolated from a source having substantially no cells.
  • these extracellular nucleic acids can be present in and obtained from blood.
  • Extracellular nucleic acid often includes no detectable cells and may contain cellular elements or cellular remnants.
  • Non-limiting examples of acellular sources for extracellular nucleic acid are blood, blood plasma, blood serum and urine.
  • extracellular nucleic acid includes obtaining a sample directly (e.g., collecting a sample, e.g., a test sample) or obtaining a sample from another who has collected a sample.
  • extracellular nucleic acid may be a product of cell apoptosis and cell breakdown, which provides basis for extracellular nucleic acid often having a series of lengths across a spectrum (e.g., a “ladder”).
  • Extracellular nucleic acid can include different nucleic acid species.
  • blood serum or plasma from a person having cancer can include nucleic acid from cancer cells and nucleic acid from non-cancer cells.
  • circulating tumor DNA or “ctDNA” refers to tumor-derived fragmented DNA in the bloodstream that is not associated with cells.
  • ctDNA usually originates directly from the tumor or from circulating tumor cells (CTCs).
  • CTCs circulating tumor cells
  • the circulating tumor cells are viable, intact tumor cells that shed from primary tumors and enter the bloodstream or lymphatic system.
  • the ctDNA can be released from tumor cells by apoptosis and necrosis (e.g., from dying cells), or active release from viable tumor cells (e.g., secretion).
  • the length of ctDNA or cfDNA can be at least or about 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, or 200 bp.
  • the length of ctDNA or cfDNA can be less than about 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, or 200 bp.
  • the cell-free nucleic acid is of a length of about 500, 250, or 200 base pairs or less.
  • the present disclosure provides methods of separating, enriching and analyzing cell free DNA or circulating tumor DNA found in blood as a non-invasive means to detect the presence and/or to monitor the progress of a cancer.
  • the first steps of practicing the methods described herein are to obtain a blood sample from a subject and extract DNA from the subject.
  • a blood sample can be obtained from a subject (e.g., a subject who is suspected to have cancer). The procedure can be performed in hospitals or clinics. An appropriate amount of peripheral blood, e.g., typically between 1 and 50 ml (e.g., between 1 and 10 ml), can be collected. Blood samples can be collected, stored or transported in a manner known to the person of ordinary skill in the art to minimize degradation or the quality of nucleic acid present in the sample. In some embodiments, the blood can be placed in a tube containing EDTA to prevent blood clotting, and plasma can then be obtained from whole blood through centrifugation. Serum can be obtained with or without centrifugation-following blood clotting. If centrifugation is used then it is typically, though not exclusively, conducted at an appropriate speed, e.g., 1,500-3,000 ⁇ g. Plasma or serum can be subjected to additional centrifugation steps before being transferred to a fresh tube for DNA extraction.
  • An appropriate amount of peripheral blood e.g., typically between
  • DNA can also be recovered from the cellular fraction, enriched in the buffy coat portion, which can be obtained following centrifugation of a whole blood sample.
  • cell-free DNA can be about or at least 50% of the overall nucleic acid (e.g., about or at least 50%, 60%, 70%, 80%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% of the total nucleic acid is cell-free DNA).
  • the nucleic acid that can be analyzed by the methods described herein include, but are not limited to, DNA (e.g., complementary DNA (cDNA), genomic DNA (gDNA), cfDNA, or ctDNA), ribonucleic acid (RNA) (e.g., message RNA (mRNA), short inhibitory RNA (siRNA), ribosomal RNA (rRNA), transfer RNA (tRNA), or microRNA), and/or DNA or RNA analogs (e.g., containing base analogs, sugar analogs and/or a non-native backbone and the like), RNA/DNA hybrids and polyamide nucleic acids (PNAs), all of which can be in single- or double-stranded form.
  • DNA e.g., complementary DNA (cDNA), genomic DNA (gDNA), cfDNA, or ctDNA
  • RNA e.g., message RNA (mRNA), short inhibitory RNA (siRNA), ribosomal RNA (rRNA), transfer RNA
  • nucleic acid can comprise known analogs of natural nucleotides, some of which can function in a similar manner as naturally occurring nucleotides.
  • a nucleic acid can be in any form useful for conducting processes herein (e.g., linear, circular, supercoiled, single-stranded, or double-stranded).
  • a nucleic acid in some embodiments can be from a single chromosome or fragment thereof (e.g., a nucleic acid sample may be from one chromosome of a sample obtained from a diploid organism).
  • nucleic acids comprise nucleosomes, fragments or parts of nucleosomes or nucleosome-like structures.
  • Nucleic acid provided for processing described herein can contain nucleic acid from one sample or from two or more samples (e.g., from 1 or more, 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, 10 or more, 11 or more, 12 or more, 13 or more, 14 or more, 15 or more, 16 or more, 17 or more, 18 or more, 19 or more, or 20 or more samples).
  • the nucleic acids are from reference samples.
  • the nucleic acids are from test samples.
  • the nucleic acids are from tumor cells.
  • the nucleic acids are from normal cells.
  • nucleic acid can be extracted, isolated, purified, partially purified or amplified from the samples before sequencing.
  • nucleic acid can be processed by subjecting nucleic acid to a method that generates nucleic acid fragments. Fragments can be generated by a suitable method known in the art, and the average, mean or nominal length of nucleic acid fragments can be controlled by selecting an appropriate fragment-generating procedure.
  • nucleic acid of a relatively shorter length can be utilized to analyze sequences that contain little sequence variation and/or contain relatively large amounts of known nucleotide sequence information.
  • nucleic acid of a relatively longer length can be utilized to analyze sequences that contain greater sequence variation and/or contain relatively small amounts of nucleotide sequence information.
  • Nucleic acids e.g., nucleic acid fragments, sample nucleic acid, cell-free nucleic acid, circulating tumor nucleic acids
  • Nucleic acids are sequenced before the analysis.
  • reads are short nucleotide sequences produced by any sequencing process described herein or known in the art. Reads can be generated from one end of nucleic acid fragments (“single-end reads”), and sometimes are generated from both ends of nucleic acids (e.g., paired-end reads, double-end reads).
  • Sequence reads obtained from cell-free DNA can be reads from nucleic acids derived from normal cells and/or tumor cells.
  • the nucleic acids include nucleic acids from reference samples and/or test samples as described herein.
  • the nucleic acids are labeled (e.g., to identify the source of the cells).
  • a mixture of relatively short reads can be transformed by processes described herein into a representation of a genomic nucleic acid present in a subject.
  • “obtaining” nucleic acid sequence reads of a sample can involve directly sequencing nucleic acid to obtain the sequence information.
  • Sequence reads can be mapped and the number of reads or sequence tags mapping to a specified nucleic acid region (e.g., a chromosome, a bin, a genomic section) are referred to as counts.
  • counts can be manipulated or transformed (e.g., normalized, combined, added, filtered, selected, averaged, derived as a mean, the like, or a combination thereof).
  • a group of nucleic acid samples from one individual are sequenced.
  • nucleic acid samples from two or more samples, wherein each sample is from one individual or two or more individuals are pooled and the pool is sequenced together.
  • a nucleic acid sample from each biological sample often is identified by one or more unique identification tags.
  • the nucleic acids can also be sequenced with redundancy.
  • a given region of the genome or a region of the cell-free DNA can be covered by two or more reads or overlapping reads (e.g., “fold” coverage greater than 1).
  • Coverage (or depth) in DNA sequencing refers to the number of unique reads that include a given nucleotide in the reconstructed sequence.
  • a fraction of the genome is sequenced, which sometimes is expressed in the amount of the genome covered by the determined nucleotide sequences (e.g., “fold” coverage less than 1).
  • the fold is calculated based on the entire genome.
  • cell free DNAs are sequenced and the fold is calculated based on the entire genome. Thus, it is easier to compare the amount of sequencing and the amount of sequencing reads that are generated for different projects.
  • the fold can also be calculated based on the length of the reconstructed sequence (e.g., cfDNA).
  • the reconstructed sequence e.g., cfDNA
  • the number of nucleotides in all unique reads would be roughly the same as the entire nucleotide sequence of the cfDNA in the sample.
  • the nucleic acid is sequenced with about 0.1-fold to about 100-fold coverage, about 0.2-fold to 20-fold coverage, or about 0.2-fold to about 1-fold coverage. In some embodiments, sequencing is performed by about or at least 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 30, 40, 50, 60, 70, 80, 90, 100, 200, 300, 400, 500, or 1000 fold coverage.
  • sequencing is performed by no more than 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 30, 40, 50, 60, 70, 80, 90, 100, 200, 300, 400, 500, or 1000 coverage. In some embodiments, sequencing is performed by no more than 15, 20, 30, 40, 50, 60, 70, 80, 90 or 100 fold coverage.
  • the sequence coverage is performed by about or at least 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1, 2, 3, 4, or 5 fold (e.g., as determined by the entire genome). In some embodiments, the sequence coverage is performed by no more than 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1, 2, 3, 4, or 5 fold (e.g., as determined by the entire genome).
  • the sequence coverage is performed by about or at least 100, 150, 200, 250, 300, 350, 400, 450, or 500 fold (e.g., as determined by reconstructed sequence). In some embodiments, the sequence coverage is performed by no more than 100, 150, 200, 250, 300, 350, 400, 450, or 500 fold (e.g., as determined by reconstructed sequence).
  • a sequencing library can be prepared prior to or during a sequencing process.
  • Methods for preparing the sequencing library are known in the art and commercially available platforms may be used for certain applications.
  • Certain commercially available library platforms may be compatible with sequencing processes described herein.
  • one or more commercially available library platforms may be compatible with a sequencing by synthesis process.
  • a ligation-based library preparation method is used (e.g., ILLUMINA TRUSEQ, Illumina, San Diego Calif.).
  • Ligation-based library preparation methods typically use a methylated adaptor design which can incorporate an index sequence at the initial ligation step and often can be used to prepare samples for single-read sequencing, paired-end sequencing and multiplexed sequencing.
  • a transposon-based library preparation method is used (e.g., EPICENTRE NEXTERA, Epicentre, Madison Wis.).
  • Transposon-based methods typically use in vitro transposition to simultaneously fragment and tag DNA in a single-tube reaction (often allowing incorporation of platform-specific tags and optional barcodes), and prepare sequencer-ready libraries.
  • a high-throughput sequencing method is used.
  • High-throughput sequencing methods generally involve clonally amplified DNA templates or single DNA molecules that are sequenced in a massively parallel fashion within a flow cell.
  • Such sequencing methods also can provide digital quantitative information, where each sequence read is a countable “sequence tag” or “count” representing an individual clonal DNA template, a single DNA molecule, bin or chromosome.
  • MPS massively parallel sequencing
  • High-throughput sequencing technologies include, for example, sequencing-by-synthesis with reversible dye terminators, sequencing by oligonucleotide probe ligation, pyrosequencing and real time sequencing.
  • MPS include Massively Parallel Signature Sequencing (MPSS), Polony sequencing, Pyrosequencing, Illumina (Solexa) sequencing, SOLiD sequencing, Ion semiconductor sequencing, DNA nanoball sequencing, Helioscope single molecule sequencing, single molecule real time (SMRT) sequencing, nanopore sequencing, ION Torrent and RNA polymerase (RNAP) sequencing.
  • sequence reads are often associated with the particular sequencing technology.
  • High-throughput methods for example, provide sequence reads that can vary in size from tens to hundreds of base pairs (bp).
  • Nanopore sequencing for example, can provide sequence reads that can vary in size from tens to hundreds to thousands of base pairs.
  • the sequence reads are of a mean, median or average length of about 15 bp to 900 bp long (e.g., about or at least 20 bp, 25 bp, 30 bp, 35 bp, 40 bp, 45 bp, 50 bp, 55 bp, 60 bp, 65 bp, 70 bp, 75 bp, 80 bp, 85 bp, 90 bp, 95 bp, 100 bp, 110 bp, 120 bp, 130, 140 bp, 150 bp, 200 bp, 250 bp, 300 bp, 350 bp, 400 bp, 450 bp, or 500 bp).
  • the sequence reads are of a mean, median or average length of about 1000 bp or more. In some embodiments, the sequence reads are of less than 60 bp, 65 bp, 70 bp, 75 bp, 80 bp, 85 bp, 90 bp, 95 bp, 100 bp, 110 bp, 120 bp, 130, 140 bp, 150 bp, 200 bp, 250 bp, 300 bp, 350 bp, 400 bp, 450 bp, or 500 bp are removed because of poor quality.
  • Mapping nucleotide sequence reads can be performed in a number of ways, and often comprises alignment of the obtained sequence reads with a matching sequence in a reference genome (e.g., Li et al., “Mapping short DNA sequencing reads and calling variants using mapping quality score,” Genome Res., 2008 Aug. 19.)
  • sequence reads generally are aligned to a reference sequence and those that align are designated as being “mapped” or a “sequence tag.”
  • a mapped sequence read is referred to as a “hit” or a “count”.
  • the terms “aligned”, “alignment”, or “aligning” refer to two or more nucleic acid sequences that can be identified as a match (e.g., 100% identity) or partial match. Alignments can be done manually or by a computer algorithm, examples including the Efficient Local Alignment of Nucleotide Data (ELAND) computer program distributed as part of the Illumina Genomics Analysis pipeline.
  • the alignment of a sequence read can be a 100% sequence match. In some cases, an alignment is less than a 100% sequence match (i.e., non-perfect match, partial match, partial alignment).
  • an alignment is about a 99%, 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91%, 90%, 89%, 88%, 87%, 86%, 85%, 84%, 83%, 82%, 81%, 80%, 79%, 78%, 77%, 76% or 75% match.
  • an alignment comprises a mismatch.
  • an alignment comprises 1, 2, 3, 4 or 5 mismatches. Two or more sequences can be aligned using either strand.
  • a nucleic acid sequence is aligned with the reverse complement of another nucleic acid sequence.
  • sequence reads can be aligned with sequences in a reference genome.
  • sequence reads can be found and/or aligned with sequences in nucleic acid databases known in the art including, for example, GenBank, dbEST, dbSTS, EMBL (European Molecular Biology Laboratory) and DDBJ (DNA Databank of Japan).
  • BLAST or similar tools can be used to search the identified sequences against a sequence database. Search hits can then be used to sort the identified sequences into appropriate genomic sections, for example.
  • the disclosure provides methods to prepare circulating tumor DNA reference samples.
  • the methods comprise: inducing apoptosis in cells (e.g., tumor cells) to obtain DNA fragments; and extracting DNA from tumor cells after apoptosis induction to obtain the circulating tumor DNA reference sample.
  • DNA prepared by this method can be used as a reference sample for cfDNA.
  • the methods described herein have many advantages, e.g., simple to prepare, short production cycle, low-cost, suitable for many tumor cell types and for large-scale production.
  • the reference samples prepared by the methods described herein can be widely used for methodological validation, internal quality control and external quality evaluation with good reproducibility and consistency.
  • cell apoptosis treatment comprises: adding an apoptosis inducer into the tumor cell culture medium, wherein the apoptosis inducer comprises one type that binds to topoisomerase-DNA complex during DNA replication to prevent DNA strand reassembly and cause DNA double strand break.
  • the incubation time of the apoptosis inducer is from 2 to 8 hours.
  • the induction treatment time has an effect on the quality of the reference sample. Either too long or too short of the induction treatment time can lead to deviations in the mutational information of the reference sample, resulting in reduced consistency with the simulated cfDNA and failure to be a good reference.
  • the incubation time of the apoptosis inducer is about 2-8 hours, about 2-7 hours, about 2-6 hours, about 2-5 hours, about 2-4 hours, about 2-3 hours, about 3-8 hours, about 3-7 hours, about 3-6 hours, about 3-5 hours, about 3-4 hours, about 4-8 hours, about 4-7 hours, about 4-6 hours, about 4-5 hours, about 5-8 hours, about 5-7 hours, about 5-6 hours, about 6-8 hours, about 6-7 hours, or about 7-8 hours.
  • the incubation time of the apoptosis inducer is about 2 hours, about 2.5 hours, about 3 hours, about 3.5 hours, about 4 hours, about 4.5 hours, about 5 hours, about 5.5 hours, about 6 hours, about 6.5 hours, about 7 hours, about 7.5 hours, or about 8 hours. In some embodiments, the incubation time of the apoptosis inducer is about 2-20 hours, about 2-18 hours, about 2-16 hours, about 2-14 hours, about 2-12 hours, or about 2-10 hours.
  • the incubation time of the apoptosis inducer is less than 20 hours, less than 18 hours, less than 16 hours, less than 14 hours, less than 12 hours, less than 10 hours, or less than 8 hours, In some embodiments, the incubation time of the apoptosis inducer is at least 30 minutes, at least 1 hour, at least 1.5 hours, at least 2 hours, at least 2.5 hours, at least 3 hours, at least 3.5 hours, at least 4 hours, at least 4.5 hours, or at least 5 hours. In some embodiments, the incubation time of the apoptosis inducer is about 5 hours.
  • the concentration of the apoptosis inducer used for treating tumor cells is about 1-100 ⁇ M, e.g., about 1 ⁇ M, about 2 ⁇ M, about 3 ⁇ M, about 4 ⁇ M, about 5 ⁇ M, about 6 ⁇ M, about 7 ⁇ M, about 8 ⁇ M, about 9 ⁇ M, about 10 ⁇ M, about 11 ⁇ M, about 12 ⁇ M, about 13 ⁇ M, about 14 ⁇ M, about 15 ⁇ M, about 16 ⁇ M, about 17 ⁇ M, about 18 ⁇ M, about 19 ⁇ M, or about 20 ⁇ M.
  • the concentration of the apoptosis inducer is about 1-50 ⁇ M, about 1-40 ⁇ M, about 1-30 ⁇ M, about 1-20 ⁇ M, about 1-10 ⁇ M, about 5-50 ⁇ M, about 5-40 ⁇ M, about 5-30 ⁇ M, about 5-20 ⁇ M, about 5-10 ⁇ M, about 10-50 ⁇ M, about 10-40 ⁇ M, about 10-30 ⁇ M, about 10-20 ⁇ M, about 15-50 ⁇ M, about 15-40 ⁇ M, about 15-30 ⁇ M, about 15-20 ⁇ M, about 20-50 ⁇ M, about 20-40 ⁇ M, about 20-30 ⁇ M, about 30-50 ⁇ M, about 30-40 ⁇ M, or about 40-50 ⁇ M.
  • the concentration of the apoptosis inducer is about 5-15 ⁇ M, about 5-14 ⁇ M, about 5-13 ⁇ M, about 5-12 ⁇ M, about 5-11 ⁇ M, about 5-10 ⁇ M, about 5-9 ⁇ M, about 5-8 ⁇ M, about 5-7 ⁇ M, about 5-6 ⁇ M, about 6-15 ⁇ M, about 6-14 ⁇ M, about 6-13 ⁇ M, about 6-12 ⁇ M, about 6-11 ⁇ M, about 6-10 ⁇ M, about 6-9 ⁇ M, about 6-8 ⁇ M, about 6-7 ⁇ M, about 7-15 ⁇ M, about 7-14 ⁇ M, about 7-13 ⁇ M, about 7-12 ⁇ M, about 7-11 ⁇ M, about 7-10 ⁇ M, about 7-9 ⁇ M, about 7-8 ⁇ M, about 8-15 ⁇ M, about 8-14 ⁇ M, about 8-13 ⁇ M, about 8-12 ⁇ M, about 8-11 ⁇ M, about 8-10 ⁇ M, about 8-9 ⁇ M
  • the apoptosis inducer described herein is selected from CPT (Camptothecin), As 2 O 3 , Notopterol and Gracillin. In some embodiments, the apoptosis inducer described herein can bind to the topoisomerase-DNA complex during DNA replication, e.g., to prevent DNA strand reassembly and/or cause DNA double strand break.
  • the apoptosis inducer described herein is selected from a raltitrexed or equivalent, or TOMUDEXTM; a doxorubicin or equivalent, or ADRIAMYCINTM; a fluorouracil or 5-fluorouracil or equivalent; a docetaxel or equivalent, or TAXOTERETM; a larotaxel, tesetaxel or ortataxel or equivalent; an epothilone or an epothilone A, B, C, D, E or F or equivalent; an ixabepilone (also known as azaepothilone B) or equivalent, or BMS-247550TM; a vincristine (also known as leurocristine) or equivalent, or ONCOVINTM; a vinblastin, vinblastine, vindesine, vinflunine, vinorelbine or NAVELBINETM or equivalent; or, any combination thereof.
  • a raltitrexed or equivalent or TOMUDE
  • the apoptosis inducer or apoptosis-inducing agent described herein is selected from ABBV-621/APG880, APG350, RG7386/RO6874813, TAS266, MEDI3039, HexaBody®-DR5/DR5 (GEN1029), CPT, and ONC201. Additional apoptosis inducers can be found, e.g., in Lim, B., et al. “Novel apoptosis-inducing agents for the treatment of cancer, a new arsenal in the toolbox.” Cancers 11.8 (2019): 1087; and Fischer, U., et al. “Apoptosis-based therapies and drug targets.” Cell Death & Differentiation 12.1 (2005): 942-961; US20170196901A1; each of which is incorporated herein by reference in its entirety.
  • the apoptosis inducer is CPT.
  • CPT has a better effect and can be adapted to a wide range of tumor cells with broad spectrum.
  • the concentration of apoptosis inducer is about 5 to about 15 ⁇ M.
  • the apoptosis inducer does not change the mutation information (e.g., genetic variations) of intracellular DNA, e.g., point mutations, copy number variations, structural variations, fragmentation pattern variations, and other chromosomal variations known in the art, and it can be used to simulate cfDNA.
  • the DNA breaks induced by the methods described herein can form products similar to nucleosome monomers and their complexes (e.g., double or triple nucleosomal packaging).
  • the fragmented DNA can well mimic the pattern of cfDNA fragmentation compared to methods such as ultrasound treatment, all-trans retinoic acid induction, serum starvation treatment, or intensive culture (e.g., high-density culture) of tumor cells.
  • the concentration of apoptosis inducer is about 8-12 ⁇ M (e.g., about 10 ⁇ M).
  • concentration of the apoptosis inducer has an effect on the quality of the reference sample. A too high or too low concentration of the apoptosis inducer can lead to deviations in the mutational information of the reference sample, resulting in the DNA fragments being too small or too large and making them different from those of the simulated cfDNA.
  • the present disclosure relates to methods of making circulating tumor DNA reference samples.
  • the methods include: treating tumor cells with CPT to induce cells apoptosis, and extracting DNA from the tumor cells to obtain the circulating tumor DNA reference sample.
  • the treatment time is about 5 hours and the concentration of CPT is about 10 ⁇ M.
  • DNA prepared by the methods described herein can simulate the real plasma cfDNA fragmentation pattern a great extent (e.g., the percentage of each peak representing the nucleosome monomer and their complexes; the DNA fragment size of each peak; the cfDNA unique minor sub-peaks (peaks corresponding to cfDNAs of less than 150 bp); and the ratio between the DNA fragment peaks) and does not affect the detection of point mutations, copy number variations, structural variations, and/or fragmentation pattern variations.
  • DNA prepared by the methods described herein can be used as reference samples for point mutations, copy number variations, structural variations, and DNA fragmentation size detection.
  • the methods describe herein are easy to operate, have a short synthesis cycle, and can be produced in large-scale (e.g., the percentage of each peak representing the nucleosome monomer and their complexes; the DNA fragment size of each peak; the cfDNA unique minor sub-peaks (peaks corresponding to cfDNAs of less than 150 bp); and the ratio between the DNA fragment peaks).
  • the disclosure relates to a circulating tumor DNA reference sample, which is prepared by the methods described herein.
  • the products can be used as reference samples for detecting point mutations, copy number variations, structural variations, and DNA fragmentation size detection.
  • the products can simulate the real plasma cfDNA fragmentation pattern to a great extent.
  • the disclosure relates to methods for assessing whether the quality of a circulating tumor DNA reference sample is up-to-standard.
  • the method comprises: (1) extracting DNA from tumor cells that are not treated with an apoptosis inducer, and constructing a DNA library to obtain sequencing reads; (2) constructing a DNA library by sequencing the circulating tumor DNA reference sample described herein; (3) identifying one or more genetic variations of the untreated tumor cells in (1) and one or more genetic variations of the circulating tumor DNA reference sample in (2); and (4) comparing the genetic variations of the untreated tumor cells in (1) and the genetic variations of the circulating tumor DNA reference sample in (2).
  • a high consistency (e.g., the correlation coefficient R 2 above 0.5, above 0.55, above 0.6, above 0.65, above 0.7, above 0.75, above 0.8, above 0.85, above 0.9, above 0.91, above 0.92, above 0.93, above 0.94, above 0.95, above 0.96, above 0.97, above 0.98, or above 0.99) of the genetic variations of the untreated tumor cells in (1) and the genetic variations of the circulating tumor DNA reference sample in (2) indicates a good quality of the circulating tumor DNA reference sample.
  • the one or more genetic variations described herein includes at least one of the following: single nucleotide variations, structural variations, copy number variations and fragmentation pattern variations.
  • the tumor cells used in (1) and the tumor cells used for making the circulating tumor DNA reference sample described herein are from the same cell line or subject (e.g., human patient). In some embodiments, the tumor cells used in (1) and the tumor cells used for making the circulating tumor DNA reference sample described herein are different.
  • the methods described herein further include determining and comparing the size distribution of the circulating tumor DNA reference sample and the size distribution of the cell-free DNA(cfDNA) from plasma of a subject (e.g., a human patient described herein), wherein the fragmentation pattern of the size distribution of the circulating tumor DNA reference sample and the size distribution of the cfDNA are similar.
  • the two size distributions can share a fragmentation pattern having one or more of the following patterns: (1) the fragmentation pattern comprises a main peak representing nucleosome monomers (e.g., having a length of about 166 bp); (2) the fragmentation pattern comprises one or more sub-peaks representing complexes of nucleosome monomers (e.g., dimers and trimers); and (3) the fragmentation pattern comprises one or more minor sub-peaks with a length of less than 150 bp.
  • the fragmentation pattern comprises a main peak representing nucleosome monomers (e.g., having a length of about 166 bp); (2) the fragmentation pattern comprises one or more sub-peaks representing complexes of nucleosome monomers (e.g., dimers and trimers); and (3) the fragmentation pattern comprises one or more minor sub-peaks with a length of less than 150 bp.
  • the nucleosome monomer described herein has a length of about 100-200 bp, about 120-200 bp, about 140-200 bp, about 100-180 bp, about 120-180 bp, about 140-180 bp, about 150-180 bp, about 160-180 bp, or about 160-170 bp.
  • the nucleosome monomer described herein has a length of about 155 bp, about 158 bp, about 160 bp, about 162 bp, about 164 bp, about 165 bp, about 166 bp, about 167 bp, about 168 bp, about 170 bp, about 172 bp, about 175 bp, or about 180 bp.
  • the complex of nucleosome monomers has a length of about 320-350 bp (dimer), about 480-510 bp (trimer).
  • the complex of nucleosome monomers has a length that is about 1-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, or 8-fold of the length of a nucleosome monomer described herein.
  • the minor sub-peaks described herein have a length of about 50-166 bp, about 50-160 bp, about 50-150 bp, about 50-140 bp, about 60-166 bp, about 60-160 bp, about 60-150 bp, or about 60-140 bp.
  • the size distribution described herein includes at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, or at least 10 minor sub-peaks with a length less than 150 bp.
  • the disclosure described herein provides methods of determining the limit of detection (LOD) of mutation frequency of an assay.
  • the methods involve mixing different samples' DNA at different ratios to obtain a series of DNA samples.
  • the methods involve preparing a standard curve to detect LOD, which is generally known in the art.
  • the disclosure described herein provides methods of determining other dimensions of an assay, e.g., repeatability, reproducibility, positive and negative percentage agreement.
  • the disclosure relates to the use of the above methods and circulating tumor DNA reference samples in constructing a tumor prediction model.
  • the circulating tumor DNA reference prepared by the methods described herein can be used for testing the performance of the cancer prediction model. For example, construction of a pan-cancer early screening method or model.
  • the cfDNA fragment profile of normal individuals is more stable, while the cfDNA fragment profile of cancer patients is relatively heterogeneous, and there is a significant difference between the two.
  • 20220136062A1 which is incorporated herein by reference in its entirety, is based on comparing the distribution of copy number variation (CNV), fragment size (FS) and protein markers between normal and cancer patients, and standardizing all the quantified dimensions. Finally, the cancer contribution of each standardized dimension is weighted to obtain the overall cancer risk score (CRS).
  • CNV cancer risk score
  • the cancer described herein is a blood cancer (e.g., leukemia or lymphoma). In some embodiments, the cancer described herein is a solid tumor. In some embodiments, the cancer described herein is any cancer type that can be potentially diagnosed from cfDNA.
  • the disclosure relates to methods of predicting cancer, including (1) determining the size distribution of the cell-free DNA (cfDNA) from plasma of a subject; (2) determining the size distribution of the circulating tumor DNA reference sample of claims 11 ; and (3) comparing the size distribution of the cfDNA in (1) and the size distribution of the circulating tumor DNA reference sample in (2).
  • a matching fragmentation pattern of the size distribution of the cfDNA in step (1) and the size distribution of the circulating tumor DNA reference sample in step (2) indicates existence of cancer in the subject.
  • the fragmentation pattern is matched when the Pearson correlation coefficient is at least 0.3, a least 0.35, at least 0.4, at least 0.45, at least 0.5, at least 0.55, at least 0.6, at least 0.65, or at least 0.7 for fragments less than 166 bp (e.g., between 50-166 bp, between 60-166 bp, between 70-166 bp, between 80-166 bp, between 90-166 bp, or between 100-166 bp) between the size distribution of the cfDNA in step (1) and the size distribution of the circulating tumor DNA reference sample in step (2).
  • the Pearson correlation coefficient is at least 0.3, a least 0.35, at least 0.4, at least 0.45, at least 0.5, at least 0.55, at least 0.6, at least 0.65, or at least 0.7 for fragments less than 166 bp (e.g., between 50-166 bp, between 60-166 bp, between 70-166 bp, between 80-166 bp,
  • the disclosure relates to the application of the above reference sample in a kit for cancer prediction.
  • the reference products prepared using the methods described herein are used in kits for cancer prediction.
  • Laboratories performing internal quality control and third-party organizations performing external quality evaluation and proficiency testing need reference samples to ensure the reliability of test results.
  • the reference samples described herein can realize standardized high-throughput sequencing assays, perform performance confirmation or performance validation, internal quality control, and external quality evaluation.
  • the samples are derived from tissues, blood, or tumor cells of animals (e.g., common experimental animals including mice, rats, guinea pigs, hamsters, rabbits, dogs, monkeys, pigs, fish and so on).
  • animals e.g., common experimental animals including mice, rats, guinea pigs, hamsters, rabbits, dogs, monkeys, pigs, fish and so on.
  • the disclosure relates to a kit for cancer screening, which contains a circulating tumor DNA reference sample described herein.
  • numeric values in the disclosure are herein assumed to be modified by the term “about”, whether or not explicitly indicated.
  • the term “about” generally refers to a range of numbers that one of skill in the art would consider equivalent to the recited value (i.e., having the same function or result).
  • the terms “about” may include numbers that are rounded to the nearest significant figure.
  • the terms “about” may include numbers that are ⁇ 10%, ⁇ 20%, or ⁇ 30% of the value.
  • references to the terms “one embodiment”, “some embodiments”, “examples”, “concrete examples”, or “some examples”, etc. mean that the specific features, structures, materials, or features described in combination with such embodiments or examples are contained in at least one embodiment or example of the invention. In this specification, indicative representations of the above terms do not need refer to the same embodiments or examples. Furthermore, the specific features, structures, materials or features described may be combined in an appropriate manner in any one or more embodiments or examples. In addition, in the case of non-conflict, technicians in the field may combine together the different embodiments or examples described in this specification or the characteristics of the different embodiments or examples.
  • Cell culture medium was prepared by mixing 20% fetal bovine serum (FBS) and 80% Iscove's Modified Dulbecco's Medium (IMDM). The culture medium was supplemented with Gentamicin (Gentamicin dosage: 40,000 units/500 mL cell culture medium).
  • FBS fetal bovine serum
  • IMDM Iscove's Modified Dulbecco's Medium
  • Camptothecin was dissolved in DMSO to 4 mg/mL (maximal solubility: 5 mg/mL) and diluted to 1 mg/mL when used.
  • the CPT solution was filter sterilized with a 0.22 ⁇ m needle filter, aliquoted into 1.5 mL centrifuge tubes, and then stored in a refrigerator at 4° C. As DMSO solidifies at 4° C., the tubes containing CPT were thawed prior to subsequent experiments.
  • HL-60 resistance cells or NB-4 cells were cultured overnight in 75 cm 2 culture flasks supplemented with 20 mL complete medium and placed in a 37° C./5% CO 2 incubator to ensure cells entered logarithmic growth phase.
  • the cells were treated with apoptosis inducers (e.g.,
  • CPT and ARTA cultured with high-density separately as follows.
  • the cells suspended in the culture medium were harvested, transferred into tubes, and centrifuged at 800 rpm for 7 minutes. Cells were pelleted at tube bottom and collected for subsequent experiments.
  • Apoptotic DNA Ladder Extraction Kit (Beyotime, Cat#: C0008) was used. Specifically, 200 ⁇ L of PBS was added to the collected cell pellets and gently pipetted to resuspend the cells in PBS. 4 ⁇ L RNase A was added and mixed thoroughly by vortexing. The cell suspension was kept at room temperature (15-25° C.) for 3-5 minutes. Afterwards, 20 ⁇ L Proteinase K was added and mixed thoroughly by vortexing. 200 ⁇ L Lysis Buffer B was then added and mixed thoroughly by vortexing. The cell lysate was incubated at 70° C. for 10 minutes. After the incubation, 200 ⁇ L ethanol (96-100%) was added and mixed thoroughly by vortexing.
  • the mixture from the previous step was added to a DNA purification column, which was centrifuged at 6000 ⁇ g (about 8000 rpm) for 1 minute. Liquid waste was discarded. 500 ⁇ L of Washing Solution I was then added and the column was centrifuged at 6000 ⁇ g (about 8000 rpm) for 1 minute. Liquid waste was discarded. Next, 600 ⁇ L of Washing Solution II was added and the column was centrifuged at 18,000 ⁇ g (about 12,000 rpm) for 1 minute. Liquid waste was discarded. Next, the column was centrifuged at 18,000 ⁇ g (about 12,000 rpm) for 1 minute to remove residual ethanol. The column was then placed in a clean 1.5 mL elution tube and 50 ⁇ L of elution buffer was carefully applied. The tube was centrifuged at 12,000 rpm for 1 minute to elute the total DNA.
  • Each end repair and A-tailing reaction was prepared in a tube or a well of a PCR plate as shown in the table below.
  • the reaction system was mixed gently by vortexing, spun down briefly, and then kept on ice. Immediately afterwards, the tube/plate was placed in a thermocycler programmed as shown in the table below.
  • the reaction system was mixed thoroughly, centrifuged briefly, and then incubated at 20° C. for 15 minutes.
  • 80% ethanol (e.g., 50 mL of 80% ethanol can be prepared by mixing 40 mL of absolute ethanol and 10 mL of nuclease-free water) was prepared before use. 1.5 mL centrifuge tubes were prepared and labeled with the corresponding number. Magnetic beads that had been pre-equilibrated at room temperature were fully vortexed and mixed. Each tube was filled with 88 ⁇ L of the magnetic beads.
  • the above DNA mixture was mixed with the magnetic beads, and incubated at room temperature for 10 minutes. After the incubation, the 1.5 mL tubes were placed on the magnet to capture the magnetic beads until the liquid became clear. Supernatant was carefully removed and discarded. 200 ⁇ L of 80% ethanol was added into each tube. The tubes were rotated 360 degrees horizontally and incubated on the magnet at room temperature for 30 seconds. Afterwards, supernatant was discarded while the tubes were kept on the magnet.
  • the library amplification reaction system was prepared as follow:
  • the library was purified as described below.
  • 1.5 mL sample tubes were prepared and labeled with the corresponding numbers. Magnetic beads that had been pre-equilibrated at room temperature were fully vortexed and mixed. Each tube was filled with 50 ⁇ L of the magnetic beads. The above DNA mixture was mixed with the magnetic beads, and incubated at room temperature for 10 minutes. After the incubation, the 1.5 mL tubes were placed on the magnet to capture the magnetic beads until the liquid became clear. Supernatant was carefully removed and discarded. 200 ⁇ L of 80% ethanol was added into each tube. The tubes were rotated 360 degrees horizontally and incubated on the magnet at room temperature for 30 seconds. Afterwards, supernatant was discarded while the tubes were kept on the magnet.
  • DNA fragmentation was performed using the Covaris® M220 non-contact ultrasonic fragmentation instrument. Specifically, a power-up check was performed as follows: (1) the computer fixed on the top of the instrument was properly wired to the machine; and (2) the Drip Tray was placed under the machine; and (3) the operating tube holder was inserted. The power of instrument and computer were turned on, and the controlling software was clicked open to make the system in operational mode. The sliding weight on the top of the tube holder was pulled up and rotated by 90 degrees. Approximately 15 mL of distilled or deionized water was added into the center of the holder. The water level should reach the green “ ⁇ ” status or exceed the “RUN” marker, and the water level just touched the operating tube holder completely.
  • genomic DNA 1 ⁇ g genomic DNA was pipetted into a 1.5 mL tube and 1 ⁇ Low TE Buffer was added to make the volume 50 ⁇ L.
  • the diluted genomic DNA sample was mixed gently and transferred carefully to the ultrasonication tube to avoid bubbles.
  • the sliding weight on the top of the tube holder was pulled up and rotated by 90 degrees.
  • the ultrasonication tube with sample was placed into the instrument.
  • the sliding weight was rotated and pushed down so that it pressed against the sample tube.
  • the safety gate was then closed.
  • the program used is shown in the table below.
  • the following procedures were also performed with caution. (1) The room temperature of the laboratory was kept at 15-30° C., and not too cold. (2) The program was run with a water bath to avoid damaging the sensor. (3) Only double-distilled water or deionized water was used for the water bath. (4) At the end of daily use, the Drip Tray was emptied and dried to prevent growth of microorganisms. (5) The safety door was closed when operating the system. (6) The DNA fragments were temporarily stored at ⁇ 20° C.
  • the QIAvac 24 Plus system was connected to a vacuum source.
  • a VacValve was inserted into each luer slot of the QIAvac 24 Plus.
  • a VacConnector was inserted into each VacValve.
  • the QIAamp Mini columns were inserted into the VacConnectors on the manifold.
  • a tube extender (20 mL) was inserted into each QIAamp Mini column.
  • the tube extender was firmly inserted into the QIAamp Mini column to avoid leakage of sample. More specifically, the 2 mL collection tube was remained for the subsequent operation.
  • the sample number was marked on the QIAamp Mini silica membrane column.
  • VacValve ensured a steady flow rate.
  • VacConnectors prevented direct contact between the spin column and VacValve during purification, thereby avoiding any cross-contamination between samples.
  • the QIAamp Mini silica membrane column can adsorb DNA, and the tube extender can hold large volumes of plasma.
  • the lysate-Buffer ACB mixture was carefully applied to the tube extender of the QIAamp Mini column.
  • the vacuum pump was switched on. When all lysates had been drawn through the columns completely, the vacuum pump was switched off and the exhaust valve was opened to release the pressure to 0 mbar. The tube extender was carefully removed and discarded. 600 ⁇ L Buffer ACW1 was applied to the QIAamp Mini column. The exhaust valve was closed and the vacuum pump was switched on. After all of Buffer ACW1 had been drawn through the QIAamp Mini column, the vacuum pump was switched off and the exhaust valve was opened to release the pressure to 0 mbar. 750 ⁇ L Buffer ACW2 was applied to the QIAamp Mini column.
  • the exhaust valve was closed and the vacuum pump was switched on. After all of Buffer ACW2 had been drawn through the QIAamp Mini column, the vacuum pump was switched off and the exhaust valve was opened to release the pressure to 0 mbar. 750 ⁇ L ethanol (96-100%) was applied to the QIAamp Mini column. The exhaust valve was closed and the vacuum pump was switched on. After all of the ethanol had been drawn through the QIAamp Mini column, the vacuum pump was switched off and the exhaust valve was opened to release the pressure to 0 mbar. The lid of the QIAamp Mini column was closed and removed from the vacuum manifold. The VacConnector was discarded.
  • the QIAamp Mini column was placed in a clean 2 mL collection tube, and centrifuged at full speed (20,000 ⁇ g; 14,000 rpm) for 3 minutes.
  • the QIAamp Mini Column was placed into a new 2 mL collection tube.
  • the lid was opened, and the assembly was incubated at 56° C. for 10 minutes to dry the membrane completely.
  • the QIAamp Mini column was placed in a clean 1.5 mL elution tube (included in the kit), and the 2 mL collection tube was discarded. 55 ⁇ L of nuclease-free water was carefully applied to the center of the QIAamp Mini membrane.
  • the lid was closed and incubated at room temperature for 3 minutes. After the incubation, the tube was centrifuged in a microcentrifuge at full speed (20,000 ⁇ g; 14,000 rpm) for 1 minute to elute the nucleic acids.
  • the DNA obtained by inducing cell apoptosis was similar to the real cfDNA fragmentation pattern.
  • the DNA fragments were obtained by shearing the genomic DNA of NB4 cells to 200-300 bp by ultrasonication, and the DNA fragment size distribution is shown in FIG. 1 A .
  • NB4 cells were treated with CPT for 5 hours before DNA extraction, and the distribution of DNA fragment size is shown in FIG. 1 B .
  • the distribution of the fragment size of cfDNA obtained from a cancer patient's plasma is shown in FIG. 1 C .
  • Some minor sub-peaks (marked by dashed lines) can also be detected in fragments of cfDNA less than 150 bp. The results showed that the minor sub-peaks can also be detected in the reference sample in FIG. 1 B .
  • the DNA fragments were prepared using different cells and by different apoptosis-inducing methods to find the optimal experimental conditions.
  • the NB4 cells were separated into two groups. Cells from one group (experimental) were treated with CPT for 5 hours to induce apoptosis before DNA extraction, library construction and sequencing analysis. Cells from the other group (control) without drug treatment were used as controls for DNA extraction, DNA fragmentation by ultrasonication, library construction and sequencing analysis. The libraries of both samples were subjected to whole-genome sequencing at a depth of 50 ⁇ , and the sequencing data were then compared and analyzed.
  • the DNA fragments obtained by the methods of drug-induced apoptosis described herein can simulate the fragmentation pattern of real plasma cfDNA to a great extent, and do not affect the detection of point mutations, copy number variations, structural variations, etc.
  • the DNA fragments prepared by this method can be used as reference samples for point mutation, copy number variation, structural variation, and DNA fragmentation size detection.
  • the methods are easy to operate, have a short synthesis cycle, and are suitable for mass production.
  • the methods described herein can be widely used in methodological validation, internal quality control and inter-laboratory quality evaluation, with good repeatability and consistency.
  • the reference samples described herein can be used as an internal quality control in assays.
  • cancer cell lines with targeted mutation sites can be selected as positive controls, and a normal cell line (e.g., GM12878) can be selected as a negative control.
  • Apoptosis can be induced in cells, and DNA can be extracted from these cells to produce reference samples.
  • the DNA from cancer cells with specific mutations can be used as a positive control, and the DNA from normal cells without mutations can be used as a negative control.
  • These reference samples can be subjected to library preparations, sequencing, and data analyses in the same batch with the experimental samples to monitor the whole process.
  • experiments can be proceeded normally with a satisfactory quality control result. However, when the mutations can't be detected in the positive reference sample, experiments should be terminated because of the failure of quality control. Alternatively, when the mutations can be detected in the negative reference sample, experiments should also be terminated because of failure of quality control, or an indication of contamination in the assays.
  • the reference samples described herein can be used in performance validation. Through sequential dilution, the positive samples and negative samples can be blended to obtain a series of reference samples with a gradient of tumor DNA fractions. These DNA reference samples can be subjected to library preparations, sequencing, and data analyses with experimental samples. The LOD of the assay can be determined by testing reference samples with different mutation frequencies. The repeatability and reproducibility can be determined by repeated testing of these reference samples. The sensitivity and specificity of the assay can be determined by testing a variety of positive and negative references.
  • reference samples described herein have the fragmentation characteristic of real plasma-derived cfDNA, they can be used as a control to study the fragmentation pattern in the assays.
  • the reference samples described herein can also play an important role in assay development, efficacy tests and other applications.
  • the DNA reference samples described herein can also be used as a control for the DNA extraction step.
  • the DNA reference samples can be added into artificial plasma to produce plasma reference samples to mimic real human plasma with different DNA mutation frequency.
  • the reference samples can be extracted at the same time, to monitor whether there is contamination in the experiment process and whether there is any problem in the experimental procedure.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Organic Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Genetics & Genomics (AREA)
  • Analytical Chemistry (AREA)
  • Immunology (AREA)
  • General Engineering & Computer Science (AREA)
  • Biotechnology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Biochemistry (AREA)
  • Microbiology (AREA)
  • General Health & Medical Sciences (AREA)
  • Oncology (AREA)
  • Hospice & Palliative Care (AREA)
  • Biomedical Technology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Plant Pathology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
US17/901,431 2021-09-01 2022-09-01 Preparation method, product, and application of circulating tumor dna reference samples Pending US20230079748A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111029341.8A CN113817717A (zh) 2021-09-01 2021-09-01 循环肿瘤dna参考品的制备方法、产品及用途
CN202111029341.8 2021-09-01

Publications (1)

Publication Number Publication Date
US20230079748A1 true US20230079748A1 (en) 2023-03-16

Family

ID=78923726

Family Applications (1)

Application Number Title Priority Date Filing Date
US17/901,431 Pending US20230079748A1 (en) 2021-09-01 2022-09-01 Preparation method, product, and application of circulating tumor dna reference samples

Country Status (2)

Country Link
US (1) US20230079748A1 (zh)
CN (1) CN113817717A (zh)

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106845153A (zh) * 2016-12-29 2017-06-13 安诺优达基因科技(北京)有限公司 一种用于利用循环肿瘤dna样本检测体细胞突变的装置
CN108517360A (zh) * 2017-02-27 2018-09-11 北京医院 一种循环肿瘤游离dna突变检测质控品及其制备方法
CN109055487A (zh) * 2018-08-09 2018-12-21 苏州安可济生物科技有限公司 模拟血浆基质的循环肿瘤游离dna标准品的制备
CN109628595A (zh) * 2019-01-18 2019-04-16 臻悦生物科技江苏有限公司 用于基因检测的ffpe参考品、其制备方法及应用
CN112176061B (zh) * 2020-10-12 2024-04-12 菁良科技(深圳)有限公司 一种血液肿瘤突变负荷参考品及其制备方法

Also Published As

Publication number Publication date
CN113817717A (zh) 2021-12-21

Similar Documents

Publication Publication Date Title
US11519032B1 (en) Transposition of native chromatin for personal epigenomics
JP6772216B2 (ja) 臓器移植患者における移植片拒絶反応の非侵襲的診断方法
US20180068058A1 (en) Methods and compositions for sample identification
JP2024028828A (ja) 尿および他のサンプルにおける無細胞dnaの分析
CN107893116B (zh) 用于检测基因突变的引物对组合、试剂盒以及构建文库的方法
CN104745679A (zh) 一种无创检测egfr基因突变的方法及试剂盒
WO2019237425A1 (zh) 一种用于嗜铬细胞瘤/副神经节瘤早期诊断和术前评估的生物标志物及其应用
CN115418401A (zh) 用于膀胱癌的尿监测的诊断测定
CN109679957B (zh) IncRNALNC_004208及其检测试剂在制备脑胶质瘤预后试剂中的应用
EP3464640A1 (en) Methods of mast cell tumor prognosis and uses thereof
CN105705655A (zh) 用于对nras和braf核酸进行多重分析的组合物及方法
Kirkegaard et al. Comprehensive analysis of soluble RNAs in human embryo culture media and blastocoel fluid
US20230079748A1 (en) Preparation method, product, and application of circulating tumor dna reference samples
CN110709522A (zh) 生物样本核酸质量的测定方法
CN110607370B (zh) 一种用于人体肿瘤分子分型的基因组合及其应用
CN114514327A (zh) 使用同步标志物检测评估弥漫性神经胶质瘤和对治疗的应答性
Oberle et al. Evaluation of Nanopore Sequencing on Polar Bodies for Routine Pre-Implantation Genetic Testing for Aneuploidy
KR102695246B1 (ko) 유전체와 후성 유전체 동시 분석 방법 및 분석 시스템
CN104593496A (zh) 基于杂交连接法检测非编码rna转录水平的方法
Mahony et al. Improving bovine respiratory disease control through the characterisation of pathogen-host interactions
Ganderton A Feasibility Study Evaluating the use of Circulating Tumour DNA in Genetic tests, as a Proxy for Brain Cancer Biopsy Samples
WO2024097217A1 (en) Detection of non-cancer somatic mutations
CN115772564A (zh) 用于辅助检测肺癌体细胞atm基因融合突变的甲基化生物标记物及其应用
WO2024118500A2 (en) Methods for detecting and treating ovarian cancer
CN117512116A (zh) 一种用于胆管癌检测的生物标志物及其应用

Legal Events

Date Code Title Description
AS Assignment

Owner name: SEEKIN, INC., CHINA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:MAO, MAO;ZHONG, GUOLIN;LI, SHIYONG;AND OTHERS;SIGNING DATES FROM 20220907 TO 20220913;REEL/FRAME:061163/0935

Owner name: SEEKIN INC. (ZHENGZHOU), CHINA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:ZHU, DANDAN;REEL/FRAME:061163/0810

Effective date: 20220908

Owner name: SEEKIN, INC., CHINA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:SEEKIN, INC.;SEEKIN INC. (ZHENGZHOU);REEL/FRAME:061164/0140

Effective date: 20220914

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION