CN117551767A - Application of cfDNA fragment characteristic combination in prediction of cancer - Google Patents
Application of cfDNA fragment characteristic combination in prediction of cancer Download PDFInfo
- Publication number
- CN117551767A CN117551767A CN202311603742.9A CN202311603742A CN117551767A CN 117551767 A CN117551767 A CN 117551767A CN 202311603742 A CN202311603742 A CN 202311603742A CN 117551767 A CN117551767 A CN 117551767A
- Authority
- CN
- China
- Prior art keywords
- cfdna fragment
- fragment
- cfdna
- combination
- cancer
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 239000012634 fragment Substances 0.000 title claims abstract description 204
- 206010028980 Neoplasm Diseases 0.000 title claims abstract description 130
- 201000011510 cancer Diseases 0.000 title claims abstract description 50
- 238000012163 sequencing technique Methods 0.000 claims abstract description 17
- 238000000034 method Methods 0.000 claims abstract description 11
- 230000003247 decreasing effect Effects 0.000 claims description 12
- 239000003153 chemical reaction reagent Substances 0.000 claims description 9
- 238000001514 detection method Methods 0.000 claims description 4
- 238000010183 spectrum analysis Methods 0.000 claims description 4
- 238000002360 preparation method Methods 0.000 claims description 2
- 238000004458 analytical method Methods 0.000 abstract description 4
- 238000011144 upstream manufacturing Methods 0.000 abstract description 2
- 239000000523 sample Substances 0.000 description 45
- 238000012549 training Methods 0.000 description 15
- 238000012360 testing method Methods 0.000 description 13
- 238000010200 validation analysis Methods 0.000 description 6
- 239000013068 control sample Substances 0.000 description 5
- 230000000694 effects Effects 0.000 description 5
- 239000003550 marker Substances 0.000 description 4
- 238000012070 whole genome sequencing analysis Methods 0.000 description 4
- 239000008280 blood Substances 0.000 description 3
- 210000004369 blood Anatomy 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 208000032839 leukemia Diseases 0.000 description 3
- 206010009944 Colon cancer Diseases 0.000 description 2
- 208000032612 Glial tumor Diseases 0.000 description 2
- 206010018338 Glioma Diseases 0.000 description 2
- 208000017604 Hodgkin disease Diseases 0.000 description 2
- 206010061902 Pancreatic neoplasm Diseases 0.000 description 2
- 208000009956 adenocarcinoma Diseases 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 208000005017 glioblastoma Diseases 0.000 description 2
- 208000015486 malignant pancreatic neoplasm Diseases 0.000 description 2
- 201000002528 pancreatic cancer Diseases 0.000 description 2
- 208000008443 pancreatic carcinoma Diseases 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 208000024893 Acute lymphoblastic leukemia Diseases 0.000 description 1
- 208000014697 Acute lymphocytic leukaemia Diseases 0.000 description 1
- 208000031261 Acute myeloid leukaemia Diseases 0.000 description 1
- 201000003076 Angiosarcoma Diseases 0.000 description 1
- 206010003571 Astrocytoma Diseases 0.000 description 1
- 208000010839 B-cell chronic lymphocytic leukemia Diseases 0.000 description 1
- 206010004146 Basal cell carcinoma Diseases 0.000 description 1
- 201000009030 Carcinoma Diseases 0.000 description 1
- 206010008342 Cervix carcinoma Diseases 0.000 description 1
- 208000005243 Chondrosarcoma Diseases 0.000 description 1
- 201000009047 Chordoma Diseases 0.000 description 1
- 208000006332 Choriocarcinoma Diseases 0.000 description 1
- 206010008805 Chromosomal abnormalities Diseases 0.000 description 1
- 208000031404 Chromosome Aberrations Diseases 0.000 description 1
- 208000001333 Colorectal Neoplasms Diseases 0.000 description 1
- 230000004568 DNA-binding Effects 0.000 description 1
- 201000009051 Embryonal Carcinoma Diseases 0.000 description 1
- 206010014967 Ependymoma Diseases 0.000 description 1
- 208000000461 Esophageal Neoplasms Diseases 0.000 description 1
- 208000006168 Ewing Sarcoma Diseases 0.000 description 1
- 201000008808 Fibrosarcoma Diseases 0.000 description 1
- 208000001258 Hemangiosarcoma Diseases 0.000 description 1
- 208000018142 Leiomyosarcoma Diseases 0.000 description 1
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 1
- 206010025219 Lymphangioma Diseases 0.000 description 1
- 208000031422 Lymphocytic Chronic B-Cell Leukemia Diseases 0.000 description 1
- 206010025323 Lymphomas Diseases 0.000 description 1
- 208000007054 Medullary Carcinoma Diseases 0.000 description 1
- 208000000172 Medulloblastoma Diseases 0.000 description 1
- 206010027406 Mesothelioma Diseases 0.000 description 1
- 208000034578 Multiple myelomas Diseases 0.000 description 1
- 208000033776 Myeloid Acute Leukemia Diseases 0.000 description 1
- 208000005927 Myosarcoma Diseases 0.000 description 1
- 206010029260 Neuroblastoma Diseases 0.000 description 1
- 206010030155 Oesophageal carcinoma Diseases 0.000 description 1
- 201000010133 Oligodendroglioma Diseases 0.000 description 1
- 208000007641 Pinealoma Diseases 0.000 description 1
- 206010035226 Plasma cell myeloma Diseases 0.000 description 1
- 208000006664 Precursor Cell Lymphoblastic Leukemia-Lymphoma Diseases 0.000 description 1
- 206010060862 Prostate cancer Diseases 0.000 description 1
- 208000000236 Prostatic Neoplasms Diseases 0.000 description 1
- 206010039491 Sarcoma Diseases 0.000 description 1
- 201000010208 Seminoma Diseases 0.000 description 1
- 206010041067 Small cell lung cancer Diseases 0.000 description 1
- 208000005718 Stomach Neoplasms Diseases 0.000 description 1
- 208000024313 Testicular Neoplasms Diseases 0.000 description 1
- 208000006105 Uterine Cervical Neoplasms Diseases 0.000 description 1
- 208000033559 Waldenström macroglobulinemia Diseases 0.000 description 1
- 208000008383 Wilms tumor Diseases 0.000 description 1
- 208000017733 acquired polycythemia vera Diseases 0.000 description 1
- 208000003362 bronchogenic carcinoma Diseases 0.000 description 1
- 210000004027 cell Anatomy 0.000 description 1
- 201000010881 cervical cancer Diseases 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 208000006990 cholangiocarcinoma Diseases 0.000 description 1
- 230000001684 chronic effect Effects 0.000 description 1
- 208000024207 chronic leukemia Diseases 0.000 description 1
- 208000032852 chronic lymphocytic leukemia Diseases 0.000 description 1
- 238000004140 cleaning Methods 0.000 description 1
- 208000029742 colonic neoplasm Diseases 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 238000013399 early diagnosis Methods 0.000 description 1
- 230000003511 endothelial effect Effects 0.000 description 1
- 208000037828 epithelial carcinoma Diseases 0.000 description 1
- 201000004101 esophageal cancer Diseases 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 206010017758 gastric cancer Diseases 0.000 description 1
- 208000025750 heavy chain disease Diseases 0.000 description 1
- 201000002222 hemangioblastoma Diseases 0.000 description 1
- 201000005787 hematologic cancer Diseases 0.000 description 1
- 208000024200 hematopoietic and lymphoid system neoplasm Diseases 0.000 description 1
- 206010073071 hepatocellular carcinoma Diseases 0.000 description 1
- 230000002962 histologic effect Effects 0.000 description 1
- 230000028709 inflammatory response Effects 0.000 description 1
- 208000022013 kidney Wilms tumor Diseases 0.000 description 1
- 206010024627 liposarcoma Diseases 0.000 description 1
- 238000011528 liquid biopsy Methods 0.000 description 1
- 201000007270 liver cancer Diseases 0.000 description 1
- 208000014018 liver neoplasm Diseases 0.000 description 1
- 201000005202 lung cancer Diseases 0.000 description 1
- 208000020816 lung neoplasm Diseases 0.000 description 1
- 208000015534 lymphangioendothelioma Diseases 0.000 description 1
- 208000012804 lymphangiosarcoma Diseases 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 208000023356 medullary thyroid gland carcinoma Diseases 0.000 description 1
- 201000001441 melanoma Diseases 0.000 description 1
- 206010027191 meningioma Diseases 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 201000002077 muscle cancer Diseases 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 208000025189 neoplasm of testis Diseases 0.000 description 1
- 201000008026 nephroblastoma Diseases 0.000 description 1
- 201000008968 osteosarcoma Diseases 0.000 description 1
- 208000004019 papillary adenocarcinoma Diseases 0.000 description 1
- 208000024724 pineal body neoplasm Diseases 0.000 description 1
- 208000037244 polycythemia vera Diseases 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000004393 prognosis Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 201000009410 rhabdomyosarcoma Diseases 0.000 description 1
- 201000008407 sebaceous adenocarcinoma Diseases 0.000 description 1
- 208000000587 small cell lung carcinoma Diseases 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 206010041823 squamous cell carcinoma Diseases 0.000 description 1
- 201000011549 stomach cancer Diseases 0.000 description 1
- 201000010965 sweat gland carcinoma Diseases 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
- 206010042863 synovial sarcoma Diseases 0.000 description 1
- 238000010998 test method Methods 0.000 description 1
- 201000003120 testicular cancer Diseases 0.000 description 1
- 230000000451 tissue damage Effects 0.000 description 1
- 231100000827 tissue damage Toxicity 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
- C12Q1/6886—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/30—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for calculating health indices; for individual health risk assessment
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/70—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients
Landscapes
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Organic Chemistry (AREA)
- Pathology (AREA)
- Medical Informatics (AREA)
- General Health & Medical Sciences (AREA)
- Zoology (AREA)
- Public Health (AREA)
- Wood Science & Technology (AREA)
- Genetics & Genomics (AREA)
- Analytical Chemistry (AREA)
- Data Mining & Analysis (AREA)
- Immunology (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- Primary Health Care (AREA)
- Epidemiology (AREA)
- Hospice & Palliative Care (AREA)
- Oncology (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The invention discloses an application of cfDNA fragment characteristics in predicting cancers, and belongs to the technical field of cancer genomics. The cfDNA fragment characteristic combination comprises a first cfDNA fragment characteristic sub-combination and a second cfDNA fragment characteristic sub-combination, wherein the first cfDNA fragment characteristic sub-combination comprises cfDNA fragment characteristics falling between 130bp and 175bp, and the second cfDNA fragment characteristic sub-combination comprises cfDNA fragment characteristics falling between 330bp and 360bp. The cfDNA fragment characteristic combination is utilized to predict the cancer, so that the requirement and the dependence of a cfDNA fragment analysis-based cancer prediction method on an upstream experimental end are reduced, the interpretability and the utilization rate of other histology sequencing data are remarkably widened, the experimental cost of diagnosing tumors based on cfDNA is greatly reduced, and the accuracy of cfDNA-based cancer prediction is improved.
Description
Related patent
The application is a divisional application of Chinese patent application with the application number of 2022112033941 and the application date of 2022, 09 and 29, and the invention name of cfDNA fragment characteristic combination, system and application for predicting cancer.
Technical Field
The invention belongs to the technical field of cancer genomics, and particularly relates to application of cfDNA fragment characteristic combination in prediction of cancers.
Background
Free DNA (cfDNA, circulating free DNA or Cell free DNA) in blood can change in concentration with tissue damage, cancer, inflammatory response and the like, and has important potential value in early diagnosis, prognosis, monitoring and the like of diseases. In recent years, cfDNA has been widely used in research fields such as early screening of cancer. Studies have shown that tumor tissue sources can be classified using specific cfDNA fragment characteristics, and that cfDNA fragment lengths can also reveal tissue origin or tumor origin.
However, most liquid biopsy methods are currently focused on detecting genetic mutations or chromosomal abnormalities in blood, and existing methods of fragment histology rely on Whole Genome Sequencing (WGS) methods, which cannot fully exploit other histologic sequencing data information.
Disclosure of Invention
In order to solve at least one of the above technical problems, the present invention develops a system for analyzing fragment histology based on multiple histology data to identify cfDNA fragments to distribute tumor markers, thereby identifying whether the sample is a tumor sample. Specifically, the technical scheme adopted by the invention is as follows:
the first aspect of the invention provides a cfDNA fragment characteristic combination comprising a first cfDNA fragment characteristic sub-combination comprising cfDNA fragment characteristics falling between 60bp and 200bp and having an increased fragment number ratio in a population cancer sample, and/or a second cfDNA fragment characteristic sub-combination comprising cfDNA fragment characteristics falling between 300 and 400bp and having a reduced fragment number ratio in a population cancer sample, said increase or decrease being relative to a representative value of the fragment number ratio of the corresponding fragment characteristics of a normal population sample.
In the present invention, the definition of the relevant terms is as follows:
fragment characteristics: the cfDNA fragments are divided into different fragment intervals according to different lengths, and all cfDNA fragments in each fragment interval are one fragment characteristic. For example, the fragment features are: 61-65bp, including cfDNA fragments with fragment lengths of 61bp, 62bp, 63bp, 64bp and 65 bp. For example, the fragment features are: 74-75bp, including cfDNA fragments with fragment lengths of 74bp and 75 bp.
Ratio of number of fragments: refers to the ratio of cfDNA fragments in a fragment signature to total fragments.
In the present invention, the cfDNA fragment length and number data refers to data obtained using a sequencing method selected from any one of the group consisting of WGS sequencing, WES sequencing, meDIP and MBD-Seq. In fact, one skilled in the art may use any method, either sequenced or not, provided that the length and number of cfDNA fragments can be obtained.
In the present invention, the cfDNA fragments contained in each fragment characteristic in the first cfDNA fragment characteristic sub-combination are relatively short, and the inventors of the present invention unexpectedly found that fragment characteristics falling between 60bp and 200bp can be used to identify cancer, and have higher accuracy. More surprisingly, the inventors found that cfDNA fragments falling between 130bp and 175bp are characterized with higher cancer recognition accuracy.
Further, if a subject having a cancer is selected to have a risk of at least one of 163-164bp, 163-165bp, 161-164bp, 165-166bp, 159-165bp, 157-164bp, 155-156bp, 163-168bp, 160-168bp, 157-158bp, 154-156bp, 161-170bp, 156-160bp, 161-162bp, 157-159bp, 165-168bp, 157-162bp, 157-160bp, 151-160bp, 152-158bp, 160-162bp, 153-156bp, 151-159bp, 166-168bp, 148-150bp, 149-156bp, 159-160bp, 151-156bp, 167-168bp, 147-148bp, 146-150bp, 165-172bp, 166-170bp, 151-155bp, 153-154bp, 149-152bp, 145-150bp, 145-151bp, 166-172bp, 160bp, 151-152bp, 151-172 bp, 151-152bp, 151-150 bp, and 169 bp, or 169, if any of the characteristics fall between 130-175 bp and 175 bp.
The inventors have further found that the selection of the above fragment characteristics is not as good as possible, and that the selection of 163-164bp, 163-165bp, 161-164bp and 165-166bp as markers has a very good cancer recognition effect.
In the present invention, the cfDNA fragments contained in each fragment characteristic in the second cfDNA fragment characteristic sub-combination are relatively long, and the inventors of the present invention unexpectedly found that fragment characteristics falling between 300bp and 400bp can be used to identify tumors, and have higher accuracy. More surprisingly, the inventors found that cfDNA fragments falling between 330bp and 360bp feature higher tumor recognition accuracy.
Further, if among the cfDNA fragment characteristics falling between 330bp and 360bp, 339-340bp, 341-342bp, 343-344bp, 337-339bp, 340-342bp, 337-340bp, 341-344bp, 336-340bp, 341-345bp, 337-342bp, 337-338bp, 343-345bp, 341-347bp, 340-348bp, 334-340bp, 341-348bp, 345-346bp, 343-348bp, 341-350bp, 345-348bp, 346-348bp, 333-340bp, 347-348bp, 346-350bp, 335-336bp, 331-340bp, 334-336bp, 349-350bp, 349-351bp, 349-339 bp, 349-352bp, 333-336bp, 348-354bp, 351-352bp, 349-356bp, 352-354bp, 351-355bp, 333-334bp, 331-336bp, 349-357bp, 355-357bp, 355-355 bp, 355-357bp, 360bp, and 357bp are selected, whether a subject has cancer or is at risk of having cancer can be accurately predicted.
Likewise, the inventors have further found that the selection of the above fragment characteristics is not as much as possible, and has a very good cancer recognition effect when 339-340bp, 341-342bp, 343-344bp, 337-339bp and 340-342bp are selected as markers.
A second aspect of the invention provides a system for predicting whether a subject has or is at risk of having cancer, comprising the following modules:
the data input module is used for inputting the cfDNA fragment length and quantity data of the subject;
the distribution spectrum analysis module is connected with the data input module and is used for obtaining the fragment quantity proportion of each cfDNA fragment characteristic in the cfDNA fragment characteristic combination;
and the cancer prediction module is connected with the distribution spectrum analysis module and is used for judging whether the subject has cancer or is at risk of having cancer according to the fragment quantity proportion of the cfDNA fragment characteristics.
In some embodiments of the invention, the subject is determined to have or be at risk of having cancer if the proportion of the number of fragments of at least one cfDNA fragment feature in the first subset of cfDNA fragment features is increased and/or the proportion of the number of fragments of at least one cfDNA fragment feature in the second subset of cfDNA fragment features is decreased.
In other embodiments of the present invention, the cfDNA fragment signature combination includes a first cfDNA fragment signature sub-combination and a second cfDNA fragment signature sub-combination, and the cancer prediction module obtains the judgment value using the following formula:
wherein,
the Score is used as a judgment value and,
m is the number of cfDNA fragment features in the first cfDNA fragment feature sub-combination, and n is the number of cfDNA fragment features in the second cfDNA fragment feature sub-combination;
ti is the proportion of the number of fragments of the ith cfDNA fragment characteristic in the first cfDNA fragment characteristic sub-combination;
N j the ratio of the number of fragments characteristic of the jth cfDNA fragment in the first cfDNA fragment characteristic sub-combination,
if Score is greater than a preset threshold, the subject is judged to have cancer or to be at risk of having cancer.
In some embodiments of the invention, the predictive threshold is determined from a population cancer sample Score value and/or a population normal sample Score value.
Optionally, the predictive threshold is determined from a representative value of Score values of a population of cancer samples.
Optionally, the predictive threshold is determined from a representative value of a Score value of a population normal sample.
Optionally, the predictive threshold is determined from a representative value of an increased value of the Score value of the cancer sample of the population relative to the Score value of the normal sample of the population. The cancer sample and the normal sample are paired samples, so that the added value has clinical significance.
In some embodiments of the invention, the population of cancer samples refers to more than 10 cancer samples, such as 10, 20, 50, 100, 200, 500 or more.
In some embodiments of the invention, the representative value refers to one of an average, a mode, a median, a 1/4 fraction, and a 3/4 fraction.
In the present invention, the cancers include, but are not limited to, solid tumors and blood cancers such as fibrosarcoma, myosarcoma, liposarcoma, chondrosarcoma, osteogenic sarcoma, chordoma, angiosarcoma, endothelial sarcoma, lymphangiosarcoma, lymphangioendothelioma, synovioma, mesothelioma, ewing's tumor, leiomyosarcoma, rhabdomyosarcoma, colon cancer, pancreatic cancer, prostate cancer, squamous cell carcinoma, basal cell carcinoma, adenocarcinoma, sweat gland carcinoma, sebaceous gland carcinoma, papillary adenocarcinoma, cystic adenocarcinoma, medullary carcinoma, bronchogenic carcinoma, hepatoma, cholangiocarcinoma, choriocarcinoma, seminoma, embryonal carcinoma, nephroblastoma, cervical cancer, testicular tumor, lung cancer, small cell lung cancer, epithelial carcinoma, glioma, astrocytoma, medulloblastoma, craniopharyoma, ependymoma, pineal tumor, angioblastoma, auditory glioma, oligodendroglioma, meningioma, melanoma, neuroblastoma, glioblastoma; leukemias such as acute lymphoblastic leukemia and acute myeloblastic leukemia (myeloblastic, promyelocytic, myelomonocytic, monocytic and erythrocytic leukemia); chronic leukemia (chronic myelogenous (granulocytic) leukemia and chronic lymphocytic leukemia); and polycythemia vera, lymphomas (hodgkin's and non-hodgkin's), multiple myeloma, waldenstrom's macroglobulinemia and heavy chain disease.
In a third aspect, the present invention provides the use of a detection reagent of a cfDNA fragment feature combination according to the first aspect of the present invention for the preparation of a kit for predicting whether a subject has cancer or is at risk of having cancer.
In some embodiments of the invention, the detection reagent comprises a capture reagent and/or a sequencing reagent.
In some embodiments of the invention, the kit further comprises cfDNA extraction reagents.
The beneficial effects of the invention are that
Compared with the prior art, the invention has the following effective effects:
the cfDNA fragment feature combination and system of the present invention can be used for cancer prediction, not only using data of any one of the sequencing methods selected from the group consisting of WGS sequencing, WES sequencing, meDIP and MBD-Seq, but also using data obtained by any sequencing or non-sequencing method, as long as the length and number of cfDNA fragments can be obtained.
The cfDNA fragment characteristic combination and the cfDNA fragment characteristic system are utilized to predict cancers, comprehensive characteristic analysis of cfDNA fragments can be utilized, and the prediction performance on cancers is better.
The cfDNA fragment characteristic combination and the cfDNA fragment characteristic system are utilized for carrying out cancer prediction, so that the requirement and the dependence of a cfDNA fragment analysis-based cancer prediction method on an upstream experimental end are reduced, the interpretability and the utilization rate of other histology sequencing data are remarkably widened, the experimental cost of cfDNA-based tumor diagnosis is greatly reduced, and the accuracy of cfDNA-based cancer prediction is improved.
Drawings
Fig. 1 shows the results of tumor identification in training and validation sets using 10 cfDNA fragment features.
Fig. 2 shows the results of tumor identification in training and validation sets using 20 cfDNA fragment features.
Fig. 3 shows the results of tumor identification in training and validation sets using 30 cfDNA fragment features.
Fig. 4 shows the results of tumor identification in training and validation sets using 40 cfDNA fragment features.
Fig. 5 shows the results of tumor identification in training and validation sets using 50 cfDNA fragment features.
Fig. 6 shows the results of tumor identification in training and validation sets using 60 cfDNA fragment features.
Fig. 7 shows the results of tumor identification in an external test set using 10 cfDNA fragment features.
Detailed Description
Unless otherwise indicated, implied from the context, or common denominator in the art, all parts and percentages in the present application are based on weight and the test and characterization methods used are synchronized with the filing date of the present application. Where applicable, the disclosure of any patent, patent application, or publication referred to in this application is incorporated by reference in its entirety, and the equivalent patents to those cited are incorporated by reference, particularly as they relate to the definitions of terms of art and the like. If the definition of a particular term disclosed in the prior art does not conform to any definition provided in this application, the definition of that term provided in this application controls.
Numerical ranges in this application are approximations, so that it may include the numerical values outside of the range unless otherwise indicated. The numerical range includes all values from the lower value to the upper value that increase by 1 unit, provided that there is a spacing of at least 2 units between any lower value and any higher value. These are merely specific examples of what is intended to be provided, and all possible combinations of numerical values between the lowest value and the highest value enumerated are to be considered to be expressly stated in this application.
In order to make the technical problems, technical schemes and beneficial effects solved by the invention more clear, the invention is further described in detail below with reference to the embodiments.
Examples
The following examples are presented herein to demonstrate preferred embodiments of the present invention. It will be appreciated by those skilled in the art that the techniques disclosed in the examples which follow represent techniques discovered by the inventor to function in the practice of the invention, and thus can be considered to constitute preferred modes for its practice. Those of skill in the art should, in light of the present disclosure, appreciate that many changes can be made in the specific embodiments which are disclosed and still obtain a like or similar result without departing from the spirit or scope of the invention.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs, the disclosure of which is incorporated herein by reference as is commonly understood by reference.
Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the invention described herein. Such equivalents are intended to be encompassed by the claims.
The molecular biology experiments described in the following examples, which are not specifically described, were performed according to the specific methods listed in the "guidelines for molecular cloning experiments" (fourth edition) (j. Sambrook, m.r. Green, 2017) or according to the kit and product specifications. Other experimental methods, unless otherwise specified, are all conventional. The instruments used in the following examples are laboratory conventional instruments unless otherwise specified; the test materials used in the examples described below, unless otherwise specified, were purchased from conventional biochemical reagent stores.
Example 1 identification of cfDNA fragment distribution tumor markers
Cfdna sequencing
To obtain cfDNA fragment distribution tumor markers, the inventors obtained blood samples of 417 tumor patients (183 colorectal cancer, 40 liver cancer, 92 gastric cancer, 68 pancreatic cancer, 9 esophageal cancer and 25 glioblastomas) and 813 normal persons. cfDNA was extracted and sequenced using methylation DNA enrichment Sequencing technique (MBD-Seq, methylated DNA Binding Domain-Sequencing).
2. Data preprocessing
a) Data cleaning: the adaptor sequence introduced during library construction was removed using fastp-0.20.0 software and low quality base fragments (more than 40% of the bases had a mass value below Q15 and more than 5N for the whole fragment, with a sliding window based cut end average mass < 4 bases of Q20).
b) Data comparison: the base sequence of fastq file was aligned to human reference genome hg19 (GRCH 37) using bowtie2-2.3.4.2 software to generate bam file, and the bam file was ranked according to genome coordinates, the ranked bam was deduplicated using picard MarkDuplicates-2.18.25-snappshot, and paired reads were screened for reads aligned to the reference genome and MAPQ > 20.
c) cfDNA screening: to delete cfDNA fragments of MBD proteins that are non-specifically captured, fragments of the bam file that do not contain CG base pairs are filtered out. cfDNA with fragment length (60, 400) was further retained for subsequent analysis.
Cfdna fragment distribution profile
The final processed bam file was analyzed using R package Rsamtools to calculate the fragment length of each cfDNA. Then, dividing the cfDNA fragment length into different fragment intervals according to the lengths of 2bp, 3bp, 4bp and 5bp and … … bp respectively (if the step length is 2bp, the divided fragment intervals are 61-62bp, 63-64bp … … and 398-400bp, if the step length is 3bp, the divided fragment intervals are 61-63bp, 64-66bp … … and 396-399bp, if the step length is 10bp, the divided fragment intervals are 61-70bp, 71-80bp … … 391-400 bp), defining all cfDNA fragments included in each fragment interval as fragment characteristics, and calculating the proportion of the cfDNA fragments in each fragment characteristic to the total fragment number so as to generate the fragment distribution spectrum of the cfDNA.
4. Tumor markers for recognizing cfDNA fragments
In both tumor and healthy samples, wilcox rank sum test was performed on each cfDNA fragment feature and BH correction was used to obtain corrected p-values, and the area under ROC curve (AUC) values for each fragment feature to distinguish tumor from healthy samples were further calculated. Fragment characteristics with corrected p-value <0.05 and AUC >0.6 were identified as differentially distributed in tumor and healthy samples.
Dividing tumor samples in a training set into two parts randomly and averagely, randomly generating two parts of samples with the same number as the tumor samples in a healthy sample, respectively mixing the two parts of tumor samples and the healthy sample, sequentially sequencing the two parts of samples according to each fragment characteristic, and calculating the dominance ratio OR value of distinguishing the tumor from the healthy sample by the fragment characteristics in the two parts of samples. The above process is repeated 100 times, then the average OR value is calculated 100 times for each segment feature, and segment features with average OR values >1.5 are retained.
Thus, 100 fragment features were obtained, wherein the proportion of the number of fragments of 50 fragment features to the total number of fragments was increased in the tumor sample, and the proportion of the number of fragments of 50 fragment features was decreased in the tumor sample, as shown in table 1:
TABLE 1 100 fragment characteristics
As can be seen from Table 1, the size is concentrated in 131-172bp in the fragment features increased in the tumor samples, and in 331-360bp in the fragment features decreased in the tumor samples.
Example 2 determination of tumor efficacy by different segment characteristics
And calculating the increasing or decreasing proportion of each feature in the tumor sample relative to the normal control sample in the training set by using the 50 fragment features increased in the tumor sample and the 50 fragment features decreased in the tumor sample, judging whether the sample belongs to the tumor or normal according to the standard, and verifying in the test set. Their efficacy in distinguishing tumor samples from normal samples, respectively, are shown in tables 2 and 3 below:
TABLE 2 determination of 50 increased fragment characteristics in tumor samples
TABLE 3 determination of 50 increased fragment characteristics in tumor samples
/>
It follows that the 100 fragment features described above can be used as markers for identifying tumors. Judging whether the sample belongs to a tumor sample or not by judging the proportion of the sample in the sample.
Example 3 identification of tumors by different segment feature combinations
1.10 markers
The increased fragment features (T5) in the first 5 tumor samples and the decreased fragment features (N5) in the first 5 tumor samples were combined, respectively, in order of AUC values for the individual features.
Wherein,
t5 comprises: 163-164bp, 163-165bp, 161-164bp and 165-166bp
N5 comprises: 339-340bp, 341-342bp, 343-344bp, 337-339bp and 340-342bp
For each sample, a score 10=sum (T5)/sum (N5) is calculated, and then the proportion of the score increase in the tumor sample relative to the normal control sample is calculated in the training set, based on which criteria the sample is judged to be tumor or normal, and validated in the test set.
2.20 markers
The increased fragment features (T10) in the first 10 tumor samples and the decreased fragment features (N10) in the first 10 tumor samples were combined, respectively, in order of AUC values for the individual features.
Wherein,
t10 comprises: 163-164bp, 163-165bp, 161-164bp, 165-166bp, 159-165bp, 157-164bp, 155-156bp, 163-168bp, 160-168bp
N10 includes: 339-340bp, 341-342bp, 343-344bp, 337-339bp, 340-342bp, 337-340bp, 341-344bp, 336-340bp, 341-345bp, 337-342bp
For each sample, a score of score20 = sum (T10)/sum (N10) is calculated, and then the proportion of the score increase in the tumor sample relative to the normal control sample is calculated in the training set, based on which criteria the sample is judged to be tumor or normal, and validated in the test set.
3.30 markers
The increased fragment features (T15) in the first 15 tumor samples and the decreased fragment features (N15) in the first 15 tumor samples were combined, respectively, in order of AUC values of the individual features.
Wherein,
t15 comprises: 163-164bp, 163-165bp, 161-164bp, 165-166bp, 159-165bp, 157-164bp, 155-156bp, 163-168bp, 160-168bp, 157-158bp, 154-156bp, 161-170bp, 156-160bp, 161-162bp
N15 includes: 339-340bp, 341-342bp, 343-344bp, 337-339bp, 340-342bp, 337-340bp, 341-344bp, 336-340bp, 341-345bp, 337-342bp, 337-338bp, 343-345bp, 341-347bp, 340-348bp, 334-340bp
For each sample, a score of 30=sum (T15)/sum (N15) was calculated, and then the proportion of the score increase in tumor samples relative to normal control samples was calculated in the training set, based on which criteria the samples were judged to be tumor or normal, and verified in the test set.
4.40 markers
The increased fragment features (T20) in the first 20 tumor samples and the decreased fragment features (N20) in the first 20 tumor samples were combined, respectively, in order of AUC values for the individual features.
Wherein,
t20 comprises: 163-164bp, 163-165bp, 161-164bp, 165-166bp, 159-165bp, 157-164bp, 155-156bp, 163-168bp, 160-168bp, 157-158bp, 154-156bp, 161-170bp, 156-160bp, 161-162bp, 157-159bp, 165-168bp, 157-162bp, 157-160bp, 151-160bp
N20 comprises: 339-340bp, 341-342bp, 343-344bp, 337-339bp, 340-342bp, 337-340bp, 341-344bp, 336-340bp, 341-345bp, 337-342bp, 337-338bp, 343-345bp, 341-347bp, 340-348bp, 334-340bp, 341-348bp, 345-346bp, 343-348bp, 341-350bp, 345-348bp
For each sample, a score 40=sum (T20)/sum (N20) is calculated, and then the proportion of the score increase in the tumor sample relative to the normal control sample is calculated in the training set, based on which criteria the sample is judged to be tumor or normal, and validated in the test set.
5.50 markers
The increased fragment features (T25) in the first 25 tumor samples and the decreased fragment features (N25) in the first 25 tumor samples were combined, respectively, in order of AUC values for the individual features.
Wherein,
t25 comprises: 163-164bp, 163-165bp, 161-164bp, 165-166bp, 159-165bp, 157-164bp, 155-156bp, 163-168bp, 160-168bp, 157-158bp, 154-156bp, 161-170bp, 156-160bp, 161-162bp, 157-159bp, 165-168bp, 157-162bp, 157-160bp, 151-160bp, 152-158bp, 160-162bp, 153-156bp, 151-159bp, 166-168bp
N25 includes: 339-340bp, 341-342bp, 343-344bp, 337-339bp, 340-342bp, 337-340bp, 341-344bp, 336-340bp, 341-345bp, 337-342bp, 337-338bp, 343-345bp, 341-347bp, 340-348bp, 334-340bp, 341-348bp, 345-346bp, 343-348bp, 341-350bp, 345-348bp, 346-348bp, 333-340bp, 347-348bp, 346-350bp, 335-336bp
For each sample, a score 50=sum (T25)/sum (N25) was calculated, and then the proportion of the score increase in the tumor sample relative to the normal control sample was calculated in the training set, based on which criteria the sample was judged to be tumor or normal, and verified in the test set.
6.60 markers
The increased fragment features (T30) in the first 30 tumor samples and the decreased fragment features (N30) in the first 30 tumor samples were combined, respectively, in order of AUC values of the individual features.
Wherein,
t30 includes: 163-164bp, 163-165bp, 161-164bp, 165-166bp, 159-165bp, 157-164bp, 155-156bp, 163-168bp, 160-168bp, 157-158bp, 154-156bp, 161-170bp, 156-160bp, 161-162bp, 157-159bp, 165-168bp, 157-162bp, 157-160bp, 151-160bp, 152-158bp, 160-162bp, 153-156bp, 151-159bp, 166-168bp, 148-150bp, 149-156bp, 159-160bp, 151-156bp
N30 includes: 339-340bp, 341-342bp, 343-344bp, 337-339bp, 340-342bp, 337-340bp, 341-344bp, 336-340bp, 341-345bp, 337-342bp, 337-338bp, 343-345bp, 341-347bp, 340-348bp, 334-340bp, 341-348bp, 345-346bp, 343-348bp, 341-350bp, 345-348bp, 346-348bp, 333-340bp, 347-348bp, 346-350bp, 335-336bp, 331-340bp, 334-336bp, 349-350bp, 349-351bp, 331-331 bp
For each sample, a score of 60=sum (T30)/sum (N30) was calculated, and then the proportion of the score increase in tumor samples relative to normal control samples was calculated in the training set, based on which criteria the samples were judged to be tumor or normal, and verified in the test set.
7. Judgment result of different marker combinations
The results of the determination in the training set and the test set according to the scores of the above 6 marker combinations are shown in fig. 1 to 6 and table 4:
TABLE 4 tumor recognition results for different marker combinations
From the above table, it can be seen that the tumor sample can be well identified by using 10 fragment features, further increasing fragment features does not make the identification effect better, and the reverse side has a certain degree of reduction, which indicates that the tumor sample has a better tumor identification effect by using 10 fragment features, and it is possible to predict whether the subject has tumor or has a risk of having tumor by calculating the score.
Example 4 verification of marker combinations of 10 fragment signature in external test set
To further verify the performance of the above 10 fragment features as markers for predicting tumors, the inventors performed further verification using an external test set (external data), the results of which are shown in fig. 7.
As can be seen from fig. 7, the score obtained by using the 10 fragment characteristics can significantly distinguish between the tumor sample and the normal sample, and in particular, the score in the tumor sample is significantly higher than that in the normal sample, and the ROC curve AUC reaches 0.827.
All documents mentioned in this application are incorporated by reference as if each were individually incorporated by reference. Further, it will be appreciated that various changes and modifications may be made by those skilled in the art after reading the above teachings, and such equivalents are intended to fall within the scope of the claims appended hereto.
Claims (10)
- Use of a detection reagent of a cfDNA fragment signature combination in the preparation of a kit for predicting whether a subject has cancer, characterized in that the cfDNA fragment signature combination comprises a first cfDNA fragment signature sub-combination comprising cfDNA fragment signatures falling between 130bp and 175bp and a second cfDNA fragment signature sub-combination comprising cfDNA fragment signatures falling between 330bp and 360bp.
- 2. The use of claim 1, wherein the cfDNA fragment falling between 130-175 bp is characterized by at least one member selected from the group consisting of 163-164bp, 163-165bp, 161-164bp, 165-166bp, 159-165bp, 157-164bp, 155-156bp, 163-168bp, 160-168bp, 157-158bp, 154-156bp, 161-170bp, 156-160bp, 161-162bp, 157-159bp, 165-168bp, 157-162bp, 157-160bp, 151-160bp, 152-158bp, 160-162bp, 153-156bp, 151-159bp, 166-168bp, 148-150bp, 149-156bp, 159-160bp, 151-156bp, 167-168bp, 147-148bp, 146-150bp, 165-172bp, 166-170bp, 151-155, 153-154bp, 149-152bp, 145-150bp, 166-172bp, 151-172 bp, and 169-150 bp.
- 3. The use of claim 2, wherein the first cfDNA fragment signature sub-combination comprises cfDNA fragment signatures falling between 160bp and 170bp, the cfDNA fragment signatures falling between 160bp and 170bp being selected from at least one of the group comprising 163-164bp, 163-165bp, 161-164bp and 165-166 bp.
- 4. The use according to claim 1, wherein the cfDNA fragment falling between 330bp and 360bp is characterized by at least one group selected from the group consisting of 339-340bp, 341-342bp, 343-344bp, 337-339bp, 340-342bp, 337-340bp, 341-344bp, 336-340bp, 341-345bp, 337-342bp, 337-338bp, 343-345bp, 341-347bp, 340-348bp, 334-340bp, 341-348bp, 345-346bp, 343-348bp, 341-350bp, 345-348bp, 346-348bp, 333-340bp, 347-348bp, 346-350bp, 335-336bp, 331-340bp, 334-336bp, 349-350bp, 349-351bp, 331-339bp, 349-352bp, 333-336bp, 348bp, 351-354 bp, 351-355bp, 333-334bp, 354bp, 355-348 bp, 355bp, 331-348 bp, 355bp, 357-356 bp, 355bp, 357-357 bp, 360bp, 357bp, 360-357 bp, and 357 bp.
- 5. The use of claim 4, wherein the second subset of cfDNA fragment features comprises cfDNA fragment features falling between 335bp and 345bp, the cfDNA fragment features falling between 335bp and 345bp being selected from at least one of the group consisting of 339-340bp, 341-342bp, 343-344bp, 337-339bp, and 340-342 bp.
- 6. The use according to any one of claims 1 to 5, wherein the detection reagent is a reagent that detects the combination of cfDNA fragment features based on a non-sequencing based method.
- 7. Use according to any one of claims 1 to 5, wherein a ratio of the number of fragments of each cfDNA fragment signature in the combination of cfDNA fragment signatures is obtained; judging whether the subject has cancer or is at risk of having cancer according to the fragment quantity proportion of the cfDNA fragment characteristics.
- 8. The use according to claim 7, wherein the subject is judged to have or to be at risk of cancer if the proportion of the number of fragments of at least one cfDNA fragment feature in the first cfDNA fragment feature sub-combination is increased and/or the proportion of the number of fragments of at least one cfDNA fragment feature in the second cfDNA fragment feature sub-combination is decreased.
- 9. The use of claim 7, wherein the cfDNA fragment signature combination comprises a first cfDNA fragment signature sub-combination and a second cfDNA fragment signature sub-combination, and wherein the cancer prediction module obtains the determination using the following formula:wherein,the Score is used as a judgment value and,m is the number of cfDNA fragment features in the first cfDNA fragment feature sub-combination, and n is the number of cfDNA fragment features in the second cfDNA fragment feature sub-combination;ti is the proportion of the number of fragments of the ith cfDNA fragment characteristic in the first cfDNA fragment characteristic sub-combination;N j the ratio of the number of fragments characteristic of the jth cfDNA fragment in the first cfDNA fragment characteristic sub-combination,if Score is greater than a preset threshold, the subject is judged to have cancer or to be at risk of having cancer.
- 10. A system for predicting whether a subject has or is at risk of having cancer, comprising the following modules:the data input module is used for inputting the cfDNA fragment length and quantity data of the subject;the distribution spectrum analysis module is connected with the data input module and is used for obtaining the fragment quantity proportion of each cfDNA fragment characteristic in the cfDNA fragment characteristic combination of claim 1;and the cancer prediction module is connected with the distribution spectrum analysis module and is used for judging whether the subject has cancer or is at risk of having cancer according to the fragment quantity proportion of the cfDNA fragment characteristics.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311603742.9A CN117551767A (en) | 2022-09-29 | 2022-09-29 | Application of cfDNA fragment characteristic combination in prediction of cancer |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211203394.1A CN115558716B (en) | 2022-09-29 | 2022-09-29 | cfDNA fragment characteristic combination, system and application for predicting cancer |
CN202311603742.9A CN117551767A (en) | 2022-09-29 | 2022-09-29 | Application of cfDNA fragment characteristic combination in prediction of cancer |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202211203394.1A Division CN115558716B (en) | 2022-09-29 | 2022-09-29 | cfDNA fragment characteristic combination, system and application for predicting cancer |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117551767A true CN117551767A (en) | 2024-02-13 |
Family
ID=84743057
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202211203394.1A Active CN115558716B (en) | 2022-09-29 | 2022-09-29 | cfDNA fragment characteristic combination, system and application for predicting cancer |
CN202311603742.9A Pending CN117551767A (en) | 2022-09-29 | 2022-09-29 | Application of cfDNA fragment characteristic combination in prediction of cancer |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202211203394.1A Active CN115558716B (en) | 2022-09-29 | 2022-09-29 | cfDNA fragment characteristic combination, system and application for predicting cancer |
Country Status (1)
Country | Link |
---|---|
CN (2) | CN115558716B (en) |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2019110750A1 (en) * | 2017-12-07 | 2019-06-13 | INSERM (Institut National de la Santé et de la Recherche Médicale) | Method for screening a subject for a cancer |
CA3092343A1 (en) * | 2018-02-27 | 2019-09-06 | Cornell University | Ultra-sensitive detection of circulating tumor dna through genome-wide integration |
US20210198747A1 (en) * | 2018-05-18 | 2021-07-01 | The Johns Hopkins University | Cell-free dna for assessing and/or treating cancer |
JP7332695B2 (en) * | 2018-12-21 | 2023-08-23 | エフ. ホフマン-ラ ロシュ アーゲー | Identification of global sequence features in whole-genome sequence data from circulating nucleic acids |
EP3927838A4 (en) * | 2019-02-22 | 2022-11-16 | AccuraGen Holdings Limited | Methods and compositions for early cancer detection |
IL302015A (en) * | 2020-08-18 | 2023-06-01 | Delfi Diagnostics Inc | Methods and systems for cell-free dna fragment size densities to assess cancer |
CN112410422B (en) * | 2020-10-30 | 2022-06-03 | 深圳思勤医疗科技有限公司 | Method for predicting tumor risk value based on fragmentation pattern |
CN114974430A (en) * | 2021-02-25 | 2022-08-30 | 博尔诚(北京)科技有限公司 | System for cancer screening and method thereof |
CN113421608B (en) * | 2021-07-03 | 2023-12-01 | 南京世和基因生物技术股份有限公司 | Construction method of liver cancer early screening model, detection device and computer readable medium |
-
2022
- 2022-09-29 CN CN202211203394.1A patent/CN115558716B/en active Active
- 2022-09-29 CN CN202311603742.9A patent/CN117551767A/en active Pending
Also Published As
Publication number | Publication date |
---|---|
CN115558716B (en) | 2023-11-03 |
CN115558716A (en) | 2023-01-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109767810B (en) | High-throughput sequencing data analysis method and device | |
CN109880910A (en) | A kind of detection site combination, detection method, detection kit and the system of Tumor mutations load | |
US8972202B2 (en) | Diagnosing fetal chromosomal aneuploidy using massively parallel genomic sequencing | |
CN112805563A (en) | Cell-free DNA for assessing and/or treating cancer | |
CN107586852B (en) | Gastric cancer peritoneal metastasis prediction model based on 22 genes and application thereof | |
CN111276252B (en) | Construction method and device of tumor benign and malignant identification model | |
JP6066924B2 (en) | DNA sequence data analysis method | |
TW201814290A (en) | Method and system for identifying tumor load in sample | |
WO2020237184A1 (en) | Systems and methods for determining whether a subject has a cancer condition using transfer learning | |
US20190287646A1 (en) | Identifying copy number aberrations | |
CN111304308A (en) | Method for auditing detection result of high-throughput sequencing gene variation | |
CN108304694B (en) | Method for analyzing gene mutation based on second-generation sequencing data | |
CN112289376A (en) | Method and device for detecting somatic cell mutation | |
CN107760783B (en) | Gastric cancer peritoneal metastasis prediction model based on 108 genes and application thereof | |
CN111508561B (en) | Method for detecting homologous sequences and tandem repeats in homologous sequences, computer readable medium and use | |
CN112037863B (en) | Early NSCLC prognosis prediction system | |
CN115558716B (en) | cfDNA fragment characteristic combination, system and application for predicting cancer | |
CN107760688A (en) | A kind of BRCA2 gene mutation bodies and its application | |
CN114530200B (en) | Mixed sample identification method based on calculation of SNP entropy | |
CN115662519B (en) | cfDNA fragment characteristic combination and system for predicting cancer based on machine learning | |
CN114067908B (en) | Method, device and storage medium for evaluating single-sample homologous recombination defects | |
US11535896B2 (en) | Method for analysing cell-free nucleic acids | |
CN113257349B (en) | Method for selecting design interval for analyzing tumor mutation load and application | |
Noguchi et al. | About sequence quality: impact on clinical applications | |
KR20220133516A (en) | Method for detecting tumor derived mutation from cell-free DNA based on artificial intelligence and Method for early diagnosis of cancer using the same |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |