CN112779334B - Methylation marker combination for early screening of prostate cancer and screening method - Google Patents

Methylation marker combination for early screening of prostate cancer and screening method Download PDF

Info

Publication number
CN112779334B
CN112779334B CN202110137326.9A CN202110137326A CN112779334B CN 112779334 B CN112779334 B CN 112779334B CN 202110137326 A CN202110137326 A CN 202110137326A CN 112779334 B CN112779334 B CN 112779334B
Authority
CN
China
Prior art keywords
prostate cancer
cpg
screening
cancer
methylation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110137326.9A
Other languages
Chinese (zh)
Other versions
CN112779334A (en
Inventor
蒲依依
王孝举
李超
袁海宁
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Medical College
Original Assignee
Hangzhou Medical College
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Medical College filed Critical Hangzhou Medical College
Priority to CN202110137326.9A priority Critical patent/CN112779334B/en
Publication of CN112779334A publication Critical patent/CN112779334A/en
Application granted granted Critical
Publication of CN112779334B publication Critical patent/CN112779334B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • C12Q1/6886Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • G16B50/30Data warehousing; Computing architectures
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/154Methylation markers

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Chemical & Material Sciences (AREA)
  • Biophysics (AREA)
  • Theoretical Computer Science (AREA)
  • Medical Informatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Genetics & Genomics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Evolutionary Biology (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Bioethics (AREA)
  • Databases & Information Systems (AREA)
  • Pathology (AREA)
  • Analytical Chemistry (AREA)
  • Immunology (AREA)
  • Wood Science & Technology (AREA)
  • Molecular Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Microbiology (AREA)
  • Hospice & Palliative Care (AREA)
  • General Engineering & Computer Science (AREA)
  • Biochemistry (AREA)
  • Artificial Intelligence (AREA)
  • Oncology (AREA)
  • Data Mining & Analysis (AREA)
  • Epidemiology (AREA)
  • Evolutionary Computation (AREA)
  • Public Health (AREA)
  • Software Systems (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The invention discloses a methylation marker combination for early screening of prostate cancer and a screening method, relates to the field of bioinformatics, and mainly provides a specificity methylation marker (cg26140475, cg24891312, cg 242422654, cg21359747, cg03254336, cg12697139 and cg19034132) related to early diagnosis of prostate cancer, which is obtained by screening through a database, and a model established based on the marker. The method is based on public databases (TCGA, GEO), and through accurate cancer sample classification, early diagnosis markers are identified in a targeted manner, and then through comparison of markers of different cancers, a series of cancer-specific markers are finally identified.

Description

Methylation marker combination for early screening of prostate cancer and screening method
Technical Field
The invention relates to the field of bioinformatics, in particular to a specific methylation marker for early diagnosis of prostate cancer, which is screened based on a database, a model established by the marker and a screening method of the marker.
Background
Prostate cancer continues to pose a threat to male health worldwide as a cancer of second-degree of morbidity and mortality in men worldwide. According to the latest published data of the national cancer center in China, the incidence rate of prostate cancer is 7.2 thousands and the mortality rate is 3.1 thousands in 2015, and the data are respectively positioned at the 6 th and the 10 th of the rank of the incidence rate and the death rate of malignant tumors of men in China. Different from the western countries, China is not a region with high incidence of prostate cancer, but the incidence rate of prostate cancer is obviously increased in recent years, and the prostate cancer becomes the fastest male malignant tumor in nearly ten years.
Prostate cancer screening based on Prostate Specific Antigen (PSA) has been widely used, but PSA methods have a high false positive rate and are prone to over-diagnosis and over-treatment. In the declaration issued by the U.S. preventive medicine working group in 2018, the PSA method screening is not recommended for people over 70 years old, and for people 55-69 years old, whether the PSA method screening is carried out regularly or not needs to be determined according to personal conditions. In this context, more specific screening tools are available, such as prostate cancer antigen 3(PCA3) and Prostate Health Index (PHI), and a range of marker screening tools including ConfirmMDx, SelectMDx, TMPRSS2-ERG, and the like. However, clinical tests of the methods are mostly based on western people, and the western people have differences in genome and epigenome, so that an effective prostate cancer screening means for the people in China is still lacked.
Cancer has been considered to be not just a genetic disease, but the role of epigenetic modifications in cancer has also been greatly appreciated. The relationship between DNA methylation and cancer is widely studied as one of the most important epigenetic modifications. In recent years, a number of studies have emerged on Cancer markers based on DNA methylation, wherein the studies using databases are not rare, but most of the studies have not subdivided Cancer samples and only performed a general comparison between normal tissues and Cancer tissues [1.De almeido BP, Apolonio JD, Binnie a, cast-branch p.roadmap of DNA methylation in branched biochemical markers. bmc 19(1), (219 2019), 2.Tu Y, Fan G, Xi H et al identification of candidate antibody and differentiated expressed genes in biochemical markers j biological cameras 119(11), (8797) (2018), 3. large sample Y, 3. gene and differentiated expressed genes in biochemical markers in biochemical cells, c.88018), (c.8808, Cell). Cancer progression is a dynamic process, with different stages of cancer having different methylation patterns, and thus, a general sample comparison may make it difficult to screen out methylation markers that are of clinical value.
Disclosure of Invention
The invention aims to screen out a specific DNA methylation marker suitable for early diagnosis of prostate cancer by analyzing methylation chip data in a database and combining with accurate clinical sample classification, and provides a methylation marker combination for early screening of prostate cancer and a screening method.
In a first aspect of the present invention, there is provided a methylation marker combination useful for early screening for prostate cancer, comprising:
cg26140475 located in 8 # chromosome intergenic region (the CpG locus and the DNA base sequence around 60bp are shown in SEQ ID NO: 1);
cg24891312 located in chromosome 12 PRB2 gene (the CpG locus and the DNA base sequence around the CpG locus of 60bp are shown in SEQ ID NO: 2);
cg 24245854 located in the intergenic region of chromosome 12 (the CpG site and the DNA base sequence around the CpG site of 60bp are shown in SEQ ID NO: 3);
cg21359747 located in the ALDH1A3 gene of chromosome 15 (the CpG locus and the DNA base sequence around the CpG locus of 60bp are shown in SEQ ID NO: 4);
cg03254336 located in 10 # chromosome intergenic region (the CpG locus and the DNA base sequence around the CpG locus of 60bp are shown in SEQ ID NO: 5);
cg12697139 located in the intergenic region of chromosome 1 (the CpG locus and the DNA base sequence around the CpG locus of 60bp are shown in SEQ ID NO: 6);
cg19034132 located in the intergenic region of chromosome 10 (the CpG site and the DNA base sequence of 60bp before and after the CpG site are shown in SEQ ID NO: 7).
CpG is an abbreviation for cytosine (C) -phosphate (p) -guanine (G), wherein cytosine can be methylated to 5' -methylcytosine.
In a second aspect of the present invention, there is provided a method for screening a methylation marker specific to prostate cancer related to early diagnosis using a database, comprising the steps of:
1) downloading 450k methylated chip data of a plurality of types of cancers from TCGA and GEO databases;
2) dividing the cancer sample into carcinoma in situ and metastatic carcinoma by using the clinical staging data of the TCGA sample;
3) comparing and analyzing in-situ cancer samples and normal tissue samples of various cancers in TCGA by using limma package (software package for processing microarray data by using a linear model) in R language to obtain differential methylation sites, and verifying by using GEO data;
4) comparing the differential methylation sites of various cancers, and screening out the specific methylation sites of various cancers;
5) combining with the Gleason grading information of the prostate cancer sample and a multivariate modeling method, finally screening out the optimal site combination from the prostate cancer specific methylation sites, namely the methylation marker combination for early screening of the prostate cancer.
The method combining the Gleason grading information of the prostate cancer sample and the multivariate modeling specifically comprises the following steps:
for a data set containing n samples, the data of n-1 samples are adopted each time to establish n models with the minimum AIC value, and sites contained in all the models are selected as key sites.
The present invention identified 7 sites and models thereof for differentiating normal tissues from in situ prostate cancer (12.234 cg26140475+4.247 cg24891312+14.924 cg 24245354-5.902 cg21359747-13.520 cg03254336-3.001 cg12697139-4.841 cg 19034132-0.425). Sites cg26140475 and its model (21.3216 × cg26140475-5.2742) were also identified to distinguish prostate and other urinary cancers.
Compared with the prior art, the invention has the following advantages:
the invention accurately classifies the cancer samples by utilizing clinical information, and then intensively contrasts and analyzes the in-situ cancer samples and the normal samples, thereby more pointedly finding a series of effective early diagnosis related markers. And the invention compares the data of multiple types of cancers in multiple databases, and finally screens out the prostate cancer specific marker combination with clinical application potential which is not reported in documents by combining a modeling means.
Drawings
FIG. 1 is a flow chart of an assay for screening methylation markers specific to prostate cancer using public databases.
Figure 2 is a summary heatmap of 173 cancer-specific differential methylation sites, with data for 16 types of cancer in TCGA summarized on the left and 11 datasets for 7 types of cancer in GEO summarized on the right. The cancer code in the middle of the two figures marks the location of each cancer specific marker. The colors in the heatmap correspond to the mean difference in methylation levels (. beta.) for carcinoma in situ and normal tissues.
Fig. 3 is a graph of methylation level (β) distribution of marker "cg 26140475" in prostate cancer and other urological cancers (bladder cancer, kidney cancer) in TCGA and GEO databases.
Figure 4 is the receiver operating characteristic curve (ROC) validated in the GEO dataset for the predictive model established for the 7 marker loci, with an area under the curve (AUC) of 1 for dataset GSE47915, 0.92 for dataset GSE76938, and 0.99 for dataset GSE 112047.
Detailed Description
The invention establishes a database-based method for screening methylation markers related to early diagnosis of cancers and discloses a methylation marker combination which can be effectively used for early diagnosis of prostate cancer.
The terms:
1) TCGA: the Cancer Genome Atlas, which is The Genome mapping program for human cancers initiated by The National Cancer Institute (NCI) and The national human Genome institute (NHGRI) in combination, currently contains data for 33 types of cancers.
2) GEO: is a Gene Expression database created and maintained by Gene Expression Omnibus, National Center for Biotechnology Information (NCBI).
3) FDR: the False Discovery rate is obtained by correcting the p value by adopting a Benjamini and Hochberg method in multiple comparisons.
4) Gleason score: a scoring system for grading prostate cancer gives 1-5 total histological scores of prostate cells according to the approximate degree of cancer, and the histological scores of the main cells and the secondary cells in the section are added at each time of scoring, so that the prostate cancer is graded into 2-10 points.
5) And (3) AIC: akaike information criterion is a standard for measuring the fitting superiority of a statistical model, and a model with the minimum AIC value is an optimal model.
Example 1, cancer specific methylation marker screening method based on TCGA and GEO databases.
An assay scheme for screening methylation markers specific for prostate cancer using public databases is shown in figure 1.
The method comprises the following steps: downloading and processing of methylated data
Methylated 450k chip Data (Illumina human methylation450 beamchip) for all 33 types of cancer in the TCGA database was downloaded via NIH GDC Data port (https:// port.gdc.cancer. gov. /). The corresponding clinical data was downloaded from the cBioPortal for Cancer Genomics database (http:// www.cbioportal.org /). Subsequent analysis required that each cancer type contained no less than 5 normal samples and had lymphatic and distant metastasis information, so the present invention ultimately selected 16 major cancer subtypes of the 14 cancer major classes: urothelial carcinoma of the Bladder (BLCA), breast invasive ductal carcinoma (D _ BRCA), breast invasive lobular carcinoma (L _ BRCA), colon adenocarcinoma (COAD), esophageal adenocarcinoma (ESCA), head and neck squamous cell carcinoma (HNSC), renal clear cell carcinoma (KIRC), renal papillary cell carcinoma (KIRP), hepatocellular carcinoma (LIHC), lung adenocarcinoma (LUAD), lung squamous cell carcinoma (lucc), pancreatic carcinoma (PAAD), prostate carcinoma (PRAD), rectal adenocarcinoma (READ), follicular thyroid carcinoma (F _ THCA), and papillary thyroid carcinoma (P _ THCA).
Each sample contained methylation data for 485,577 sites, presented as β values (β values from 0 to 1, indicating complete demethylation to complete methylation). The pretreatment comprises three steps: first removing probe data that hybridizes to multiple sites on the genome and removing probe data on the sex chromosome; secondly, each type of cancer removes probe sites missing more than half of the data; finally, for a small amount of missing data, the approximation was used for completion (impute package from Bioconductor). After pretreatment, data for approximately 370,000 probes remained for each type of cancer.
A total of 11 450k methylation chip-based datasets for 7 types of cancer were downloaded from the GEO database: breast cancer (GSE60185, GSE69914), colorectal cancer (GSE42752, GSE48684), renal clear cell carcinoma (GSE61441), hepatocellular carcinoma (GSE56588, GSE77269), lung adenocarcinoma (GSE66836), pancreatic cancer (GSE49149), prostate cancer (GSE76938, GSE 112047). Data preprocessing is consistent with TCGA data.
Step two: analysis of methylation data
The internationally recognized handbook for staging tumors AJCC CANCER STAGING MANUAL staged tumors according to the TNM system (size and range of T-primary tumors, extent of involvement of N-regional lymph nodes, presence or absence of distant metastasis of M). The present invention classifies cancer samples in TCGA into carcinoma in situ (no lymphatic metastasis and no distant metastasis) and metastatic carcinoma (lymphatic or distant metastasis) according to N, M, and defines the finding of carcinoma in situ as a target for early diagnosis. As a log transformed form of β, the M value is more efficient in statistical tests, so β is transformed into M for subsequent significance analysis using the following formula:
Figure BDA0002927205580000051
the M values of the normal sample and the two types of cancer samples were analyzed using the R limma package:
Figure BDA0002927205580000052
comparing methylation levels at all sites in normal and carcinoma in situ samples
Figure BDA0002927205580000061
Comparing methylation levels at all sites in normal and metastatic cancer samples
Figure BDA0002927205580000062
By correlation analysis, sites with methylation levels correlated with gender, race, age were excluded
Figure BDA0002927205580000063
Samples of the GEO database to compare methylation levels at all sites in normal and cancer samples
Step three: screening for cancer specific markers
The differential methylation sites of each type of cancer need to satisfy the following conditions:
Figure BDA0002927205580000064
significant differences in methylation levels (FDR) between normal and carcinoma in situ samples<0.05) and the absolute value of the difference in beta values is greater than 0.1
Figure BDA0002927205580000065
Significant differences in methylation levels (FDR) between normal and metastatic cancer samples<0.05)
Figure BDA0002927205580000066
Site methylation level and gender, ethnicity, age-independent (FDR)>0.05)
Figure BDA0002927205580000067
Cancer types in the GEO database, with significant differences in methylation levels (FDR) between normal and cancer samples<0.05)
On the basis of different cancer methylation sites, the specific markers need to meet the following conditions:
Figure BDA0002927205580000068
sites were more than 9 significantly differentially methylated in 16 datasets of TCGA
Figure BDA0002927205580000069
Sites were more than 6 significantly differentially methylated in 11 data sets of GEO
Figure BDA00029272055800000610
The differential methylation trends of sites in specific cancers and other cancers are opposite
Through the above screening steps, the present invention screened 173 cancer specific markers (including 125 specific for prostate cancer, 20 specific for pancreatic cancer, 10 specific for breast cancer, 9 specific for clear cell renal carcinoma, 7 specific for hepatocellular carcinoma and 2 specific for colorectal cancer) (fig. 2).
Example 2, screening and validation of prostate cancer specific markers.
The method comprises the following steps: screening for prostate cancer specific markers
From all 125 prostate cancer specific markers, based on Gleason scores, markers that can differentiate prostate cancer grade were screened. The study showed that the prognosis for the two patterns of Gleason 7-score prostate cancer 3+4 and 4+3 were significantly different, while Gleason 6&3+4 was defined as low-score prostate cancer and 4+3&8-10 as high-score prostate cancer based on the sample composition of this project. Thereafter, methylation levels of the normal sample and the low-grade and high-grade prostate cancer samples were compared and analyzed by using the R limma package, and finally, 26 prostate cancer specific markers with significant difference (FDR <0.05) in the methylation levels of the three types of samples were selected.
Step two: building predictive models using key markers
To build a linear model to predict the state of a sample using the methylation levels of key sites (β values, between 0-1): normal tissue vs. prostate cancer in situ, prostate cancer vs. other cancers of the urinary system (bladder urothelial cancer, renal clear cell cancer, renal papillary cell cancer). The invention refers to the concept of leave-one-out cross validation, and for a data set containing n samples, the data of n-1 samples are adopted each time to establish n models with the minimum AIC value, and sites contained in all the models are selected as key sites. The present invention identified 7 sites and models thereof for differentiating normal tissues from in situ prostate cancer (12.234 cg26140475+4.247 cg24891312+14.924 cg 24245354-5.902 cg21359747-13.520 cg03254336-3.001 cg12697139-4.841 cg 19034132-0.425). Wherein site cg26140475 was used to model (21.3216 × cg26140475-5.2742) to differentiate prostate cancer from other urinary cancers based on its differential methylation in prostate cancer and other urinary cancers (fig. 3).
Step three: verification of predictive models
The above models were verified using a data set of GEO. GSE47915 included 4 cases each of Gleason 6 grade prostate cancer tissue and normal tissue; GSE76938 contained 73 prostate cancer tissues and 63 normal tissues; GSE112047 contained 31 cases of prostate cancer tissue and 16 cases of normal tissue. The model for differentiating normal tissue from orthotopic prostate cancer was validated in the three data sets above, and the area under the curve (AUC) of the model subject operating characteristic curve (ROC) was as follows: GSE47915, AUC ═ 1; GSE76938, AUC 0.92; GSE112047, AUC 0.99 (fig. 4). The model for distinguishing prostate cancer from other urinary system cancers was verified using the data set GSE52955 (including kidney, bladder and prostate cancer tissues and corresponding normal tissues) to yield AUC 1.
Sequence listing
<110> Hangzhou college of medicine
<120> methylation marker combination for early screening of prostate cancer and screening method
<130> 2020.12.22
<160> 7
<170> SIPOSequenceListing 1.0
<210> 1
<211> 122
<212> DNA
<213> Homo sapiens
<400> 1
agggctattt ttgaggagat tgcagacgat ctgatgaatc atagcaaaca cacatggctc 60
cggcggggga cagcccttcc tcatgttgct gtggaaagtg gagcttctca aaatgaatca 120
gc 122
<210> 2
<211> 122
<212> DNA
<213> Homo sapiens
<400> 2
ttctcagatt cttcttccac gagtattttt ctacaatatt ctgatatgca ttcctcccca 60
cgttctttct agtacttctt ttacatattt tattttctga gtcaagctca ttatgtgtat 120
ct 122
<210> 3
<211> 122
<212> DNA
<213> Homo sapiens
<400> 3
gatagaaccc ccagctatgt cccggcgctc agtgactcac cagcttactg gaaagtccct 60
cgagtcctct attaactaga ttcagaaaag cctacccaat ctctttgagt gaaagctcac 120
tc 122
<210> 4
<211> 122
<212> DNA
<213> Homo sapiens
<400> 4
tccctcccca aaaacccctg cgatgatttc cagggtggac gagtctggct ttgagagggt 60
cggagggtcc gagagcgcca agagagagtc cgggaggcag atggcggagc tgcagttgga 120
ag 122
<210> 5
<211> 122
<212> DNA
<213> Homo sapiens
<400> 5
gtcactgtgt ttctcagagc cgggatttcc ctccctctta tgaaatccat catggtttca 60
cggccccaca gctgtctcta ggagcctcag aaggatttag ccagtaacac tttgagctgc 120
tg 122
<210> 6
<211> 122
<212> DNA
<213> Homo sapiens
<400> 6
gctccgtcct gtgatcttgg gtgtgggggt ggcaatgaaa agagaggaga atgcccacaa 60
cggtctaccc tcagagtttt gactcaccat gcaatcaaga gacttcaact tgctttcaaa 120
ga 122
<210> 7
<211> 122
<212> DNA
<213> Homo sapiens
<400> 7
agtggctaac ctccttcctg aacaggcaga tgtgtcatgc tgggcagcac ctctccctcc 60
cgacacctgc atggtgactc aaactcccag gtaaagccca acccgatggg aaatgactca 120
gc 122

Claims (3)

1. A methylation marker combination for early screening of prostate cancer, which consists of the following markers:
the cg26140475 is located in 8 # chromosome intergenic region, the CpG locus and the DNA base sequence around the CpG locus of 60bp are shown as SEQ ID NO: 1 is shown in the specification;
cg24891312 located in chromosome 12 PRB2 gene, wherein the CpG locus and the DNA base sequence around the CpG locus of 60bp are shown as SEQ ID NO: 2 is shown in the specification;
cg 242422654 located in the intergenic region of chromosome 12, wherein the CpG site and the DNA base sequence around the CpG site of 60bp are shown as SEQ ID NO: 3 is shown in the specification;
the cg21359747 of the ALDH1A3 gene located on chromosome 15, the CpG locus and the DNA base sequence around the CpG locus of 60bp are shown as SEQ ID NO: 4 is shown in the specification;
the CpG 03254336 is positioned in the 10 # chromosome intergenic region, and the CpG locus and the DNA base sequences of the front and back 60bp of the CpG locus are shown as SEQ ID NO: 5 is shown in the specification;
cg12697139 located in the intergenic region of chromosome 1, the CpG locus and the 60bp DNA base sequence before and after the CpG locus are shown as SEQ ID NO: 6 is shown in the specification;
and cg19034132 located in the intergenic region of chromosome 10, wherein the CpG site and the 60bp DNA base sequences before and after the CpG site are shown as SEQ ID NO: shown at 7.
2. The methylation marker combination for early screening of prostate cancer according to claim 1, wherein CpG is an abbreviation for cytosine-phosphate-guanine, wherein cytosine can be methylated to 5' -methylcytosine.
3. The screening method for a methylation marker combination for early screening of prostate cancer according to claim 1, comprising the steps of:
1) downloading the data of the methylation chips of various cancers from TCGA and GEO databases;
2) dividing the cancer sample into carcinoma in situ and metastatic carcinoma by using the clinical staging data of the TCGA sample;
3) comparing and analyzing in-situ cancer samples and normal tissue samples of various cancers in TCGA by using a limma package in an R language to obtain differential methylation sites, and verifying by using GEO data;
4) comparing the differential methylation sites of various cancers, and screening out the specific methylation sites of various cancers;
5) combining the Gleason grading information of the prostate cancer sample with a multivariate modeling method, and finally screening a methylation marker combination for early screening of the prostate cancer from the prostate cancer specific methylation sites.
CN202110137326.9A 2021-02-01 2021-02-01 Methylation marker combination for early screening of prostate cancer and screening method Active CN112779334B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110137326.9A CN112779334B (en) 2021-02-01 2021-02-01 Methylation marker combination for early screening of prostate cancer and screening method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110137326.9A CN112779334B (en) 2021-02-01 2021-02-01 Methylation marker combination for early screening of prostate cancer and screening method

Publications (2)

Publication Number Publication Date
CN112779334A CN112779334A (en) 2021-05-11
CN112779334B true CN112779334B (en) 2022-05-27

Family

ID=75760276

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110137326.9A Active CN112779334B (en) 2021-02-01 2021-02-01 Methylation marker combination for early screening of prostate cancer and screening method

Country Status (1)

Country Link
CN (1) CN112779334B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113667757B (en) * 2021-10-20 2022-02-22 求臻医学科技(北京)有限公司 Biomarker combination for early screening of prostate cancer, kit and application
CN114743593B (en) * 2022-06-13 2023-02-24 北京橡鑫生物科技有限公司 Construction method of prostate cancer early screening model based on urine, screening model and kit
TW202403054A (en) * 2022-07-04 2024-01-16 大陸商江蘇鵾遠生物科技股份有限公司 Cancer-specific methylation markers and their use
CN116083588B (en) * 2023-03-09 2023-09-12 嘉兴允英医学检验有限公司 DNA methylation site combination as prostate cancer marker and application thereof

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2861764A1 (en) * 2012-06-13 2015-04-22 King Abdullah University Of Science And Technology Methylation biomarkers for prostate cancer
CN104593500A (en) * 2015-01-20 2015-05-06 中国科学院北京基因组研究所 Application of specific methylation sites in Y chromosome as cancer diagnosis marker
CN109680060A (en) * 2017-10-17 2019-04-26 华东师范大学 Methylate marker and its application in diagnosing tumor, classification
CN110431241A (en) * 2017-02-28 2019-11-08 梅约医学教育与研究基金会 Detect prostate cancer
CN110923328A (en) * 2020-01-04 2020-03-27 广州中鑫基因医学科技有限公司 Kit and method for screening prostate cancer
CN112236534A (en) * 2018-07-09 2021-01-15 香港精准医学技术有限公司 DNA methylation markers for non-invasive detection of cancer and uses thereof

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2861764A1 (en) * 2012-06-13 2015-04-22 King Abdullah University Of Science And Technology Methylation biomarkers for prostate cancer
CN104593500A (en) * 2015-01-20 2015-05-06 中国科学院北京基因组研究所 Application of specific methylation sites in Y chromosome as cancer diagnosis marker
CN110431241A (en) * 2017-02-28 2019-11-08 梅约医学教育与研究基金会 Detect prostate cancer
CN109680060A (en) * 2017-10-17 2019-04-26 华东师范大学 Methylate marker and its application in diagnosing tumor, classification
CN112236534A (en) * 2018-07-09 2021-01-15 香港精准医学技术有限公司 DNA methylation markers for non-invasive detection of cancer and uses thereof
CN110923328A (en) * 2020-01-04 2020-03-27 广州中鑫基因医学科技有限公司 Kit and method for screening prostate cancer

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Identification of Methylation Biomarkers for Early Detection of Prostate Cancer;Yiyi Pu et al.;《Research Square》;20210306;第1~13页 *
Identification of prostate cancer specific methylation biomarkers from a multi‑cancer analysis;Yiyi Pu et al.;《BMC Bioinformatics》;20211012;第22卷;第1~12页 *

Also Published As

Publication number Publication date
CN112779334A (en) 2021-05-11

Similar Documents

Publication Publication Date Title
CN112779334B (en) Methylation marker combination for early screening of prostate cancer and screening method
CA2633339C (en) Methods for assessing probabilistic measures of clinical outcome using genomic profiling
US20060265138A1 (en) Expression profiling of tumours
EP1996729A2 (en) Molecular assay to predict recurrence of dukes&#39; b colon cancer
CN105154542B (en) One group of gene for being used for lung cancer molecule parting and its application
CN113462776B (en) m 6 Application of A modification-related combined genome in prediction of immunotherapy efficacy of renal clear cell carcinoma patient
EP4265739A1 (en) Gene combination for human tumor grading, and use thereof
CN109859796B (en) Dimension reduction analysis method for DNA methylation spectrum of gastric cancer
Stokowy et al. A two miRNA classifier differentiates follicular thyroid carcinomas from follicular thyroid adenomas
CN111564177B (en) Construction method of early non-small cell lung cancer recurrence model based on DNA methylation
CN109830264A (en) The method that tumor patient is classified based on methylation sites
CN116631508B (en) Detection method for tumor specific mutation state and application thereof
CN112779338B (en) Gene marker for esophageal cancer prognosis evaluation
Wang et al. Clinicopathological and molecular characterization of biphasic hyalinizing psammomatous renal cell carcinoma: further support for the newly proposed entity
CN110408706A (en) It is a kind of assess recurrent nasopharyngeal carcinoma biomarker and its application
CN116153387A (en) Overall survival rate prognosis model for lung squamous carcinoma patient and application
US20170183738A1 (en) Process, Apparatus or System and Kit for Classification of Tumor Samples of Unknown and/or Uncertain Origin and Use of Genes of the Group of Biomarkers
Guirguis et al. Use of gene expression profiles to stage concurrent endometrioid tumors of the endometrium and ovary
US20210103642A9 (en) Determining a probabilistic diagnosis of cancer by analysis of genomic copy number variations
CN115976209A (en) Training method of lung cancer prediction model, prediction device and application
CN104845992B (en) The biological markers of prostate cancer, therapy target and application thereof
CN107151708B (en) Application of DNA binding site CTCF-13 of multifunctional transcription regulatory factor CTCF
CN107227366B (en) Application of DNA binding site CTCF-113 of multifunctional transcription regulatory factor CTCF
JP2005261201A (en) Method for discriminating extent of gene expression in lung cancer tissue
CN116790755A (en) Glioma detection marker and application thereof

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant