乳腺肿瘤特异性甲基化检测的试剂盒Breast tumor-specific methylation detection kit
技术领域technical field
本发明属于生物技术领域,具体涉及乳腺肿瘤特异性甲基化检测的试剂盒。The invention belongs to the field of biotechnology, and in particular relates to a kit for breast tumor-specific methylation detection.
背景技术Background technique
据中国肿瘤登记年报显示:女性乳腺癌发病率在0~24岁年龄段处较低水平,25岁后逐渐上升,50~54岁年龄段达到高峰,55岁以后逐渐下降。针对乳腺癌的易感基因,欧、美国家做了大量研究,现已知的有BRCA-1、BRCA-2,还有p53、PTEN等,与这些基因突变相关的乳腺癌称为遗传性乳腺癌,占全部乳腺癌的5%~10%。乳腺癌21基因检测不仅提供1-5年及5年后复发风险预测,还能作为唯一多基因检测来预测雌激素受体阳性浸润性乳腺癌患者的化疗及内分泌治疗的获益程度。乳腺癌70基因检测也在早期乳腺癌诊断中发挥了重要作用。最新的研究成果发现72个新的常见基因变异将导致女性患乳腺癌的风险上升,这使目前已知与乳腺癌相关的基因突变数量增至177个。许多人都可能携带常见的变异基因,单个基因突变致癌的风险相对较小,但女性体内的相关变异基因越多,患乳腺癌的风险越大。目前乳腺癌基因检测手段属于一种非侵入性检测,但需要基于原来的手术(乳房肿瘤切除手术、乳房切除手术或核心穿刺活组织检查)过程中取出的组织检测基础上。According to the China Cancer Registry Annual Report, the incidence of female breast cancer is at a low level in the age group of 0-24 years old, gradually increases after the age of 25, reaches a peak in the age group of 50-54 years old, and gradually decreases after the age of 55. European and American countries have done a lot of research on breast cancer susceptibility genes. Now known are BRCA-1, BRCA-2, p53, PTEN, etc. Breast cancer related to these gene mutations is called hereditary breast cancer Cancer, accounting for 5% to 10% of all breast cancers. The breast cancer 21-gene test not only provides prediction of recurrence risk after 1-5 years and after 5 years, but also serves as the only multi-gene test to predict the benefit of chemotherapy and endocrine therapy in patients with estrogen receptor-positive invasive breast cancer. Breast cancer 70 gene testing also plays an important role in early breast cancer diagnosis. The latest research finds 72 new common genetic variants that increase the risk of breast cancer in women, bringing the number of gene mutations known to be associated with breast cancer to 177. Many people may carry common genetic variants, and a single gene mutation has a relatively small risk of cancer, but the more related variants in a woman's body, the greater the risk of breast cancer. The current breast cancer genetic testing method is a non-invasive test, but it needs to be based on the test of the tissue removed during the original surgery (tumorectomy, mastectomy or core needle biopsy).
对肿瘤进行早筛和早诊,可以更大概率地发现早期癌症,从而提高其五年生存率,降低死亡率。中国乳腺癌筛查现状:乳腺癌5年生存率:原位癌100%,I期84-100%,II期76-87%,III期38-77%。中国多中心研究显示,初诊乳腺癌时,15.7%I期,44.9%II期,18.7%III期,2.4%IV期。乳腺癌影像筛查手段:1.钼靶X射线:中国女性乳腺发病年龄较轻(45-55岁),乳腺腺体较西方女性致密,MG易遗漏,且MG对致密型乳腺病灶诊断敏感性仅为30%。2.B超:单纯MG阳性预测值为55.5%,MG联合B超阳性预测值为43.3%。3.核磁共振(MRI):MRI对微小钙化灶不敏感,且有明确禁忌症;所以,肿瘤的早筛早诊早治疗对提高五年生存率具有非常大的意义,肿瘤早筛早诊的关键是建立有效的检测模型,寻找能够作为筛查与诊断的分子标记。Early screening and diagnosis of tumors can detect early-stage cancer with a higher probability, thereby improving its five-year survival rate and reducing mortality. Current status of breast cancer screening in China: 5-year survival rate of breast cancer: 100% for carcinoma in situ, 84-100% for stage I, 76-87% for stage II, and 38-77% for stage III. According to a Chinese multicenter study, at the time of initial diagnosis of breast cancer, 15.7% stage I, 44.9% stage II, 18.7% stage III, and 2.4% stage IV. Breast cancer imaging screening methods: 1. Mammographic target X-ray: Chinese women are younger (45-55 years old), breast glands are denser than Western women, MG is easy to miss, and MG is sensitive to the diagnosis of dense breast lesions Only 30%. 2. B-ultrasound: the positive predictive value of MG alone was 55.5%, and the positive predictive value of MG combined with B-ultrasound was 43.3%. 3. Magnetic resonance imaging (MRI): MRI is not sensitive to small calcifications and has clear contraindications; therefore, early screening and early treatment of tumors is of great significance to improve the five-year survival rate. The key is to establish an effective detection model and find molecular markers that can be used as screening and diagnosis.
cfDNA(cell-free DNA)是外周血中游离的核酸小片段DNA,源自正常细胞或肿瘤细胞代谢与凋亡,包含体细胞突变和DNA甲基化等遗传信息。通过检测疾病特异性cfDNA片段,掌握疾病的发生、发展的技术,称为液体活检(Liquid Biopsy),与传统的组织活检相比,其有着迅速、便捷、损伤性小等众多优点。2015年卢煜明教授通过cfDNA全基因组甲基化测序证明了液体活检替代组织活检在技术上和理论上的可行性;2017年张鹍教授团队利用ctDNA的甲基化定量描述了肿瘤负荷以及肿瘤来源的ctDNA图谱。临床越来越多的研究表明血浆ctDNA可作为生物标志物应用在肿瘤早期诊断筛选、预测、治疗的反应,监测肿瘤大小和复发等。目前,国际上的研究方向是整合多组学、 多种分子标志物、多基因、多位点来提高检测技术的灵敏度和特异性,以满足临床对检测产品的需求。cfDNA (cell-free DNA) is a small fragment of free nucleic acid DNA in peripheral blood, derived from the metabolism and apoptosis of normal cells or tumor cells, and contains genetic information such as somatic mutation and DNA methylation. By detecting disease-specific cfDNA fragments, the technology of mastering the occurrence and development of diseases is called liquid biopsy (Liquid Biopsy). Compared with traditional tissue biopsy, it has many advantages such as rapidity, convenience and less damage. In 2015, Professor Lu Yuming proved the technical and theoretical feasibility of liquid biopsy to replace tissue biopsy through cfDNA whole-genome methylation sequencing; in 2017, Professor Zhang Kun's team used ctDNA methylation to quantitatively describe tumor burden and tumor origin. ctDNA profile. More and more clinical studies have shown that plasma ctDNA can be used as a biomarker for early diagnosis and screening of tumors, prediction, response to treatment, and monitoring tumor size and recurrence. At present, the international research direction is to integrate multiple omics, multiple molecular markers, multiple genes, and multiple loci to improve the sensitivity and specificity of detection technology to meet clinical needs for detection products.
DNA甲基化是一种基因的表观遗传学修饰方式,不同的阶段的癌症患者和健康个体比较,DNA甲基化模式与甲基化水平均存在显著差异,并且在癌症发生早期,体液中的DNA甲基化水平会出现微量变化。因此准确且定量地检测DNA甲基化标志物对于癌症的早期诊断具有十分重要的意义。DNA methylation is an epigenetic modification of genes. Compared with cancer patients at different stages and healthy individuals, there are significant differences in DNA methylation patterns and methylation levels. There will be slight changes in the level of DNA methylation. Therefore, accurate and quantitative detection of DNA methylation markers is of great significance for the early diagnosis of cancer.
本发明的发明人在早先的研究中,筛选到可用于乳腺癌检测的甲基化生物标记物或其组合(参见CN2020104264964),通过研究这些血浆cfDNA甲基化修饰模式的基因组片段在各期乳腺癌患者及健康人群中的甲基化修饰差异,发现以上血浆基因组片段组合都能够作为乳腺癌早期诊断标记物。而且,发明人发现,其中一个甲基化标记物(cg23035715)或与前3个甲基化标记物(cg26371731,cg04541368,cg13973436)组合均具备了与两种算法LASSO和Random Forest筛选出来重叠的26个甲基化标记物组合相近的诊断力。In an earlier study, the inventors of the present invention screened out methylation biomarkers or their combination that can be used for breast cancer detection (see CN2020104264964), by studying the genomic fragments of these plasma cfDNA methylation modification patterns in breast cancer at various stages Differences in methylation modification between cancer patients and healthy people, and found that the combination of the above plasma genome fragments can be used as early diagnosis markers for breast cancer. Moreover, the inventors found that one of the methylation markers (cg23035715) or in combination with the first three methylation markers (cg26371731, cg04541368, cg13973436) all possessed 26 overlapped with those screened by the two algorithms LASSO and Random Forest. The diagnostic power of the combination of the methylation markers was similar.
MethyLight荧光定量利用TaqMan探针和PCR引物来区分甲基化和未甲基化的DNA。首先用亚硫酸盐处理DNA片段,针对甲基化和未甲基化各设计一个探针,用不同的荧光标记,随后开展实时定量PCR。如果探针与DNA杂交则释放出荧光信号,根据荧光信号的比值计算甲基化水平。该方法适用于样本量大但是位点少的研究。MethyLight fluorescence quantification utilizes TaqMan probes and PCR primers to differentiate between methylated and unmethylated DNA. The DNA fragments are first treated with sulfite, and a probe is designed for methylation and unmethylation, labeled with different fluorophores, followed by real-time quantitative PCR. A fluorescent signal is released if the probe hybridizes to DNA, and the methylation level is calculated from the ratio of the fluorescent signals. This method is suitable for studies with large sample size but few sites.
而目前对乳腺肿瘤患者血浆cfDNA甲基化检测多基因,存在由于cfDNA量较少,对于建库的低投入量要求高,而低投入量和损伤影响最终生信分析,对于整个assay的灵敏度要求高等缺陷。However, at present, the detection of polygenes in plasma cfDNA methylation of breast tumor patients has high requirements for low investment in library construction due to the small amount of cfDNA, and low investment and damage affect the final bioinformatics analysis, and the sensitivity requirements for the entire assay high defect.
发明内容SUMMARY OF THE INVENTION
本发明的目的之一在于提供一种高灵敏度检测乳腺癌相关的甲基化生物标记物的试剂盒,该试剂盒将多重PCR技术靶向富集多基因和MethLight技术结合,先通过优化多重PCR,较高特异性靶向富集,再通过特异的筛选引物和探针的方法,使单基因检测具备了很高的灵敏度和特异性。One of the objectives of the present invention is to provide a high-sensitivity test kit for breast cancer-related methylation biomarkers, which combines multiplex PCR technology to target enrichment of multiple genes and MethLight technology, and first optimizes the multiplex PCR , high specificity targeted enrichment, and then through specific screening of primers and probes, single-gene detection has high sensitivity and specificity.
实现上述目的的技术方案如下。The technical solution to achieve the above object is as follows.
一种乳腺癌相关的甲基化生物标记物的检测试剂盒,包括有:针对生物标记物cg23035715的选自以下任一组的引物和探针:SEQ ID NO.1-SEQ ID NO.3,SEQ ID NO.70-SEQ ID NO.72,SEQ ID NO.139-SEQ ID NO.141。A detection kit for breast cancer-related methylation biomarkers, comprising: primers and probes selected from any of the following groups for biomarker cg23035715: SEQ ID NO.1-SEQ ID NO.3, SEQ ID NO.70-SEQ ID NO.72, SEQ ID NO.139-SEQ ID NO.141.
在其中一个实施例中,还包括选自针对cg26371731,cg04541368和cg13973436至少一种生物标记物的引物和探针;所述针对cg26371731生物标记物的引物和探针选自:SEQ ID NO.4-SEQ ID NO.6,SEQ ID NO.73-SEQ ID NO.75,SEQ ID NO.142-SEQ ID NO.144;In one embodiment, it also includes primers and probes for at least one biomarker of cg26371731, cg04541368 and cg13973436; the primers and probes for cg26371731 biomarkers are selected from: SEQ ID NO.4- SEQ ID NO.6, SEQ ID NO.73-SEQ ID NO.75, SEQ ID NO.142-SEQ ID NO.144;
所述针对cg04541368生物标记物的引物和探针选自:SEQ ID NO.7-SEQ ID NO.9,SEQ ID NO.76-SEQ ID NO.78,SEQ ID NO.145-SEQ ID NO.147;The primers and probes for the cg04541368 biomarker are selected from: SEQ ID NO.7-SEQ ID NO.9, SEQ ID NO.76-SEQ ID NO.78, SEQ ID NO.145-SEQ ID NO.147 ;
所述针对cg13973436生物标记物的引物和探针选自:SEQ ID NO.10-SEQ ID NO.12,SEQ ID NO.79-SEQ ID NO.81,SEQ ID NO.148-SEQ ID NO.150。The primers and probes for the cg13973436 biomarker are selected from: SEQ ID NO.10-SEQ ID NO.12, SEQ ID NO.79-SEQ ID NO.81, SEQ ID NO.148-SEQ ID NO.150 .
在其中一个实施例中,还包括选自针对cg26371731,cg04541368和cg13973436生物标记物的引物和探针;所述针对cg26371731生物标记物的引物和探针为:SEQ ID NO.4-SEQ ID NO.6;所述针对cg04541368生物标记物的引物和探针为:SEQ ID NO.7-SEQ ID NO.9;In one embodiment, it also includes primers and probes selected from cg26371731, cg04541368 and cg13973436 biomarkers; the primers and probes for cg26371731 biomarkers are: SEQ ID NO.4-SEQ ID NO. 6; the primers and probes for the cg04541368 biomarker are: SEQ ID NO.7-SEQ ID NO.9;
所述针对cg13973436生物标记物的引物和探针为:SEQ ID NO.10-SEQ ID NO.12。The primers and probes for the cg13973436 biomarker are: SEQ ID NO.10-SEQ ID NO.12.
在其中一个实施例中,还包括选自cg16304215,cg20072171,cg21501525,cg22778178,cg08599259,cg25566568,cg15634980,cg07458308,cg01348584,cg14140881,cg25756435,cg00594560,cg18087672,cg14868703,cg17632299,cg18786873,cg20631750,cg25924096,和cg15321298中至少一种生物标记物的引物和探针;在其中一个实施例中,还包括选自cg16304215,cg20072171,cg21501525,cg22778178,cg08599259,cg25566568,cg15634980,cg07458308,cg01348584,cg14140881,cg25756435,cg00594560,cg18087672,cg14868703,cg17632299,cg18786873,cg20631750,cg25924096,和cg15321298中Primers and probes for at least one biomarker;
所述针对cg16304215生物标记物的引物和探针选自:SEQ ID NO.13-SEQ ID NO.15,SEQ ID NO.82-SEQ ID NO.84,SEQ ID NO.151-SEQ ID NO.153;The primers and probes for the cg16304215 biomarker are selected from: SEQ ID NO.13-SEQ ID NO.15, SEQ ID NO.82-SEQ ID NO.84, SEQ ID NO.151-SEQ ID NO.153 ;
所述针对cg20072171生物标记物的引物和探针选自:SEQ ID NO.16-SEQ ID NO.18,SEQ ID NO.85-SEQ ID NO.87,SEQ ID NO.154-SEQ ID NO.156;The primers and probes for the cg20072171 biomarker are selected from: SEQ ID NO.16-SEQ ID NO.18, SEQ ID NO.85-SEQ ID NO.87, SEQ ID NO.154-SEQ ID NO.156 ;
所述针对cg21501525生物标记物的引物和探针选自:SEQ ID NO.19-SEQ ID NO.21,SEQ ID NO.88-SEQ ID NO.90,SEQ ID NO.157-SEQ ID NO.159;The primers and probes for the cg21501525 biomarker are selected from: SEQ ID NO.19-SEQ ID NO.21, SEQ ID NO.88-SEQ ID NO.90, SEQ ID NO.157-SEQ ID NO.159 ;
所述针对cg22778178生物标记物的引物和探针选自:SEQ ID NO.22-SEQ ID NO.24,SEQ ID NO.91-SEQ ID NO.93,SEQ ID NO.160-SEQ ID NO.162;The primers and probes for the cg22778178 biomarker are selected from: SEQ ID NO.22-SEQ ID NO.24, SEQ ID NO.91-SEQ ID NO.93, SEQ ID NO.160-SEQ ID NO.162 ;
所述针对cg08599259生物标记物的引物和探针选自:SEQ ID NO.25-SEQ ID NO.27,SEQ ID NO.94-SEQ ID NO.96,SEQ ID NO.163-SEQ ID NO.165;The primers and probes for the cg08599259 biomarker are selected from: SEQ ID NO.25-SEQ ID NO.27, SEQ ID NO.94-SEQ ID NO.96, SEQ ID NO.163-SEQ ID NO.165 ;
所述针对cg25566568生物标记物的引物和探针选自:SEQ ID NO.28-SEQ ID NO.30,SEQ ID NO.97-SEQ ID NO.99,SEQ ID NO.166-SEQ ID NO.168;The primers and probes for the cg25566568 biomarker are selected from: SEQ ID NO.28-SEQ ID NO.30, SEQ ID NO.97-SEQ ID NO.99, SEQ ID NO.166-SEQ ID NO.168 ;
所述针对cg15634980生物标记物的引物和探针选自:SEQ ID NO.31-SEQ ID NO.33,SEQ ID NO.100-SEQ ID NO.102,SEQ ID NO.169-SEQ ID NO.171;The primers and probes for the cg15634980 biomarker are selected from: SEQ ID NO.31-SEQ ID NO.33, SEQ ID NO.100-SEQ ID NO.102, SEQ ID NO.169-SEQ ID NO.171 ;
所述针对cg07458308生物标记物的引物和探针选自:SEQ ID NO.34-SEQ ID NO.36,SEQ ID NO.103-SEQ ID NO.105,SEQ ID NO.172-SEQ ID NO.174;The primers and probes for the cg07458308 biomarker are selected from: SEQ ID NO.34-SEQ ID NO.36, SEQ ID NO.103-SEQ ID NO.105, SEQ ID NO.172-SEQ ID NO.174 ;
所述针对cg01348584生物标记物的引物和探针选自:SEQ ID NO.37-SEQ ID NO.39,SEQ ID NO.106-SEQ ID NO.108,SEQ ID NO.175-SEQ ID NO.177;The primers and probes for the cg01348584 biomarker are selected from: SEQ ID NO.37-SEQ ID NO.39, SEQ ID NO.106-SEQ ID NO.108, SEQ ID NO.175-SEQ ID NO.177 ;
所述针对cg14140881生物标记物的引物和探针选自:SEQ ID NO.40-SEQ ID NO.42,SEQ ID NO.109-SEQ ID NO.111,SEQ ID NO.178-SEQ ID NO.180;The primers and probes for the cg14140881 biomarker are selected from: SEQ ID NO.40-SEQ ID NO.42, SEQ ID NO.109-SEQ ID NO.111, SEQ ID NO.178-SEQ ID NO.180 ;
所述针对cg25756435生物标记物的引物和探针选自:SEQ ID NO.43-SEQ ID NO.45,SEQ ID NO.112-SEQ ID NO.114,SEQ ID NO.181-SEQ ID NO.183;The primers and probes for the cg25756435 biomarker are selected from: SEQ ID NO.43-SEQ ID NO.45, SEQ ID NO.112-SEQ ID NO.114, SEQ ID NO.181-SEQ ID NO.183 ;
所述针对cg00594560生物标记物的引物和探针选自:SEQ ID NO.46-SEQ ID NO.48,SEQ ID NO.115-SEQ ID NO.117,SEQ ID NO.184-SEQ ID NO.186;The primers and probes for the cg00594560 biomarker are selected from: SEQ ID NO.46-SEQ ID NO.48, SEQ ID NO.115-SEQ ID NO.117, SEQ ID NO.184-SEQ ID NO.186 ;
所述针对cg18087672生物标记物的引物和探针选自:SEQ ID NO.49-SEQ ID NO.51,SEQ ID NO.118-SEQ ID NO.120,SEQ ID NO.187-SEQ ID NO.189;The primers and probes for the cg18087672 biomarker are selected from: SEQ ID NO.49-SEQ ID NO.51, SEQ ID NO.118-SEQ ID NO.120, SEQ ID NO.187-SEQ ID NO.189 ;
所述针对cg14868703生物标记物的引物和探针选自:SEQ ID NO.52-SEQ ID NO.54,SEQ ID NO.121-SEQ ID NO.123,SEQ ID NO.190-SEQ ID NO.192;The primers and probes for the cg14868703 biomarker are selected from: SEQ ID NO.52-SEQ ID NO.54, SEQ ID NO.121-SEQ ID NO.123, SEQ ID NO.190-SEQ ID NO.192 ;
所述针对cg17632299生物标记物的引物和探针选自:SEQ ID NO.55-SEQ ID NO.57,SEQ ID NO.124-SEQ ID NO.126,SEQ ID NO.193-SEQ ID NO.195;The primers and probes for the cg17632299 biomarker are selected from: SEQ ID NO.55-SEQ ID NO.57, SEQ ID NO.124-SEQ ID NO.126, SEQ ID NO.193-SEQ ID NO.195 ;
所述针对cg18786873生物标记物的引物和探针选自:SEQ ID NO.58-SEQ ID NO.60,SEQ ID NO.127-SEQ ID NO.129,SEQ ID NO.196-SEQ ID NO.198;The primers and probes for the cg18786873 biomarker are selected from: SEQ ID NO.58-SEQ ID NO.60, SEQ ID NO.127-SEQ ID NO.129, SEQ ID NO.196-SEQ ID NO.198 ;
所述针对cg20631750生物标记物的引物和探针选自:SEQ ID NO.61-SEQ ID NO.63,SEQ ID NO.130-SEQ ID NO.132,SEQ ID NO.199-SEQ ID NO.201;The primers and probes for the cg20631750 biomarker are selected from: SEQ ID NO.61-SEQ ID NO.63, SEQ ID NO.130-SEQ ID NO.132, SEQ ID NO.199-SEQ ID NO.201 ;
所述针对cg25924096生物标记物的引物和探针选自:SEQ ID NO.64-SEQ ID NO.66,SEQ ID NO.133-SEQ ID NO.135,SEQ ID NO.202-SEQ ID NO.204;The primers and probes for the cg25924096 biomarker are selected from: SEQ ID NO.64-SEQ ID NO.66, SEQ ID NO.133-SEQ ID NO.135, SEQ ID NO.202-SEQ ID NO.204 ;
所述针对cg15321298生物标记物的引物和探针选自:SEQ ID NO.67-SEQ ID NO.69,SEQ ID NO.136-SEQ ID NO.138,SEQ ID NO.205-SEQ ID NO.207。The primers and probes for the cg15321298 biomarker are selected from: SEQ ID NO.67-SEQ ID NO.69, SEQ ID NO.136-SEQ ID NO.138, SEQ ID NO.205-SEQ ID NO.207 .
在其中一个实施例中,还包括针对内参基因的引物和探针,所述针对内参基因的引物和探针为:SEQ ID NO.208-SEQ ID NO.210。In one embodiment, primers and probes for the internal reference gene are also included, and the primers and probes for the internal reference gene are: SEQ ID NO.208-SEQ ID NO.210.
在其中一个实施例中,还包括选自cg16304215,cg20072171,cg21501525,cg22778178,cg08599259,cg25566568,cg15634980,cg07458308,cg01348584,cg14140881,cg25756435,cg00594560,cg18087672,cg14868703,cg17632299,cg18786873,cg20631750,cg25924096,和cg15321298的引物和探针。在其中一个实施例中,还包括选自cg16304215,cg20072171,cg21501525,cg22778178,cg08599259,cg25566568,cg15634980,cg07458308,cg01348584,cg14140881,cg25756435,cg00594560,cg18087672,cg14868703,cg17632299,cg18786873,cg20631750,cg25924096,和cg15321298的Primers and Probes.
在其中一个实施例中,所述针对cg16304215生物标记物的引物和探针为SEQ ID NO.13-SEQ ID NO.15;所述针对cg20072171生物标记物的引物和探针为SEQ ID NO.16-SEQ ID NO.18;In one embodiment, the primers and probes for cg16304215 biomarkers are SEQ ID NO.13-SEQ ID NO.15; the primers and probes for cg20072171 biomarkers are SEQ ID NO.16 -SEQ ID NO.18;
所述针对cg21501525生物标记物的引物和探针为SEQ ID NO.19-SEQ ID NO.21;The primers and probes for the cg21501525 biomarker are SEQ ID NO.19-SEQ ID NO.21;
所述针对cg22778178生物标记物的引物和探针为SEQ ID NO.22-SEQ ID NO.24;The primers and probes for the cg22778178 biomarker are SEQ ID NO.22-SEQ ID NO.24;
所述针对cg08599259生物标记物的引物和探针为SEQ ID NO.25-SEQ ID NO.27;The primers and probes for the cg08599259 biomarker are SEQ ID NO.25-SEQ ID NO.27;
所述针对cg25566568生物标记物的引物和探针为SEQ ID NO.28-SEQ ID NO.30;The primers and probes for the cg25566568 biomarker are SEQ ID NO.28-SEQ ID NO.30;
所述针对cg15634980生物标记物的引物和探针为SEQ ID NO.31-SEQ ID NO.33;The primers and probes for the cg15634980 biomarker are SEQ ID NO.31-SEQ ID NO.33;
所述针对cg07458308生物标记物的引物和探针为SEQ ID NO.34-SEQ ID NO.36;The primers and probes for the cg07458308 biomarker are SEQ ID NO.34-SEQ ID NO.36;
所述针对cg01348584生物标记物的引物和探针为SEQ ID NO.37-SEQ ID NO.39;The primers and probes for the cg01348584 biomarker are SEQ ID NO.37-SEQ ID NO.39;
所述针对cg14140881生物标记物的引物和探针为SEQ ID NO.40-SEQ ID NO.42;The primers and probes for the cg14140881 biomarker are SEQ ID NO.40-SEQ ID NO.42;
所述针对cg25756435生物标记物的引物和探针为SEQ ID NO.43-SEQ ID NO.45;The primers and probes for the cg25756435 biomarker are SEQ ID NO.43-SEQ ID NO.45;
所述针对cg00594560生物标记物的引物和探针为SEQ ID NO.46-SEQ ID NO.48;The primers and probes for the cg00594560 biomarker are SEQ ID NO.46-SEQ ID NO.48;
所述针对cg18087672生物标记物的引物和探针为SEQ ID NO.49-SEQ ID NO.51;The primers and probes for the cg18087672 biomarker are SEQ ID NO.49-SEQ ID NO.51;
所述针对cg14868703生物标记物的引物和探针为SEQ ID NO.52-SEQ ID NO.54;The primers and probes for the cg14868703 biomarker are SEQ ID NO.52-SEQ ID NO.54;
所述针对cg17632299生物标记物的引物和探针为SEQ ID NO.55-SEQ ID NO.57;The primers and probes for the cg17632299 biomarker are SEQ ID NO.55-SEQ ID NO.57;
所述针对cg18786873生物标记物的引物和探针为SEQ ID NO.58-SEQ ID NO.60;The primers and probes for the cg18786873 biomarker are SEQ ID NO.58-SEQ ID NO.60;
所述针对cg20631750生物标记物的引物和探针为SEQ ID NO.61-SEQ ID NO.63;The primers and probes for the cg20631750 biomarker are SEQ ID NO.61-SEQ ID NO.63;
所述针对cg25924096生物标记物的引物和探针为SEQ ID NO.64-SEQ ID NO.66;The primers and probes for the cg25924096 biomarker are SEQ ID NO.64-SEQ ID NO.66;
所述针对cg15321298生物标记物的引物和探针为SEQ ID NO.67-SEQ ID NO.69。The primers and probes for the cg15321298 biomarker are SEQ ID NO.67-SEQ ID NO.69.
在其中一个实施例中,还包括PCR反应液,所述PCR反应液包括DNATaq聚合酶、dNTPs、Mg
2+和10×DNA聚合酶buffer。
In one embodiment, a PCR reaction solution is also included, and the PCR reaction solution includes DNATaq polymerase, dNTPs, Mg 2+ and 10×DNA polymerase buffer.
在其中一个实施例中,所述检测试剂盒检测的待测样本是血液、血清或血浆或者组织样本。In one embodiment, the sample to be detected by the detection kit is blood, serum or plasma or a tissue sample.
在其中一个实施例中,还包括引物混合液,在扩增体系中,其中每条引物浓度在300±10nM;镁离子的浓度为1.5±0.1mM,dNTP混合液浓度为400±10uM;反应酶为Phusion U,引物探针混合液,在多重荧光定量PCR反应中:其中各引物浓度在900±10nM;探针浓度在200±10nM。In one embodiment, a primer mixture is also included. In the amplification system, the concentration of each primer is 300±10nM; the concentration of magnesium ions is 1.5±0.1mM, and the concentration of dNTP mixture is 400±10uM; the reaction enzyme Phusion U, primer-probe mixture, in multiplex fluorescence quantitative PCR reaction: each primer concentration is 900±10nM; probe concentration is 200±10nM.
本发明首先通过独特的多重PCR技术靶向富集多基因靶向区域,避免多基因检测原始cfDNA量较低的问题,通过结合MethyLight PCR检测的低成本、快捷性,提高了检测通量,同时也提高了模型检测的灵敏度和特异性。单独的分子标记物cg16304215的引物探针的检测结果具有很好的灵敏度,其与其他分子标记物中的任何1个的引物探针组合具有更强、更稳定的诊断能力。通过本发明的所述检测试剂盒,可以很好地提高对乳腺癌检测的灵敏度和特异性,有效提高早期乳腺癌的检出率,及早进行治疗和干预,提高患者生存率。The invention firstly uses the unique multiplex PCR technology to target and enrich the multi-gene targeted region, avoiding the problem of low original cfDNA in multi-gene detection, and by combining the low cost and rapidity of MethyLight PCR detection, the detection throughput is improved, and at the same time The sensitivity and specificity of model detection are also improved. The detection result of the primer-probe of the single molecular marker cg16304215 has good sensitivity, and its combination with the primer-probe of any one of the other molecular markers has stronger and more stable diagnostic ability. The detection kit of the present invention can well improve the sensitivity and specificity of breast cancer detection, effectively improve the detection rate of early breast cancer, perform early treatment and intervention, and improve the survival rate of patients.
本发明试剂盒提供的针对所述DNA甲基化分子标记物的荧光定量PCR检测引物和探针设计时克服了多个引物和探针在进行多重PCR扩增和检测时存在相互干扰的缺点,利用所述引物对经过亚硫酸氢盐处理的DNA进行多重PCR时,每个DNA甲基化分子标记物均能得到有效扩增和富集;后续对多重PCR产物进行多重荧光定量PCR检测时,相应DNA甲基化分子标记物在多重荧光定量PCR检测中获得的C
T值与该DNA甲基化分子标志物单独进行荧光定量PCR反应的C
T值没有显著差异,定量性能与单个区域定量等同。所述试剂盒经过优化,针对不同DNA甲基化分子标记物的引物和探针相互之间没有干扰,可成功实现多重PCR扩增和多重荧光定量PCR检测,有效提高检测效率。
The fluorescent quantitative PCR detection primers and probes for the DNA methylation molecular markers provided by the kit of the present invention overcome the disadvantage of mutual interference when multiple primers and probes perform multiple PCR amplification and detection. When using the primers to perform multiplex PCR on bisulfite-treated DNA, each DNA methylation molecular marker can be effectively amplified and enriched; when multiple PCR products are subsequently detected by multiplex fluorescence quantitative PCR, The CT value obtained by the corresponding DNA methylation molecular marker in the multiplex fluorescence quantitative PCR detection is not significantly different from the CT value of the DNA methylation molecular marker alone in the fluorescent quantitative PCR reaction, and the quantitative performance is equivalent to that of a single region. . The kit is optimized so that the primers and probes for different DNA methylation molecular markers do not interfere with each other, and can successfully realize multiple PCR amplification and multiple fluorescence quantitative PCR detection, effectively improving the detection efficiency.
本发明提供的针对所述DNA甲基化分子标志物的检测方法通过引入多重PCR扩增,能有效对目标分子进行富集,克服了检测样本获取量低的限制,在放大检测信号的同时也能进行多个分子标记物的联合检测,提高检测灵敏度和检测效率,能增强对乳腺癌的检出率。The detection method for the DNA methylation molecular marker provided by the present invention can effectively enrich the target molecule by introducing multiple PCR amplification, overcome the limitation of low sample acquisition, and also amplify the detection signal while also amplifying the detection signal. The combined detection of multiple molecular markers can improve the detection sensitivity and detection efficiency, and can enhance the detection rate of breast cancer.
附图说明Description of drawings
图1是测序法独立验证集中1个标记物(cg23035715)、4个标记物(cg23035715,cg26371731,cg04541368和cg13973436)甲基化标记物组合ROC曲线。Figure 1 shows the combined ROC curve of methylation markers for 1 marker (cg23035715) and 4 markers (cg23035715, cg26371731, cg04541368 and cg13973436) in the independent validation set of sequencing methods.
图2是测序法独立验证集中cg23035715,cg26371731,cg04541368和cg13973436甲基化标记物随机组合的不同乳腺癌分期的灵敏度和特异性。Figure 2 shows the sensitivity and specificity of different breast cancer stages in random combinations of cg23035715, cg26371731, cg04541368 and cg13973436 methylation markers in the sequencing-independent validation set.
图3是测序法独立验证集中cg23035715标记物与随机挑选1个标记物cg25924096的ROC曲线(a)和不同乳腺癌分期的灵敏度、特异性(b)。Figure 3 is the ROC curve (a) of the cg23035715 marker in the independent validation set of the sequencing method and the randomly selected one marker cg25924096 (a) and the sensitivity and specificity (b) of different breast cancer stages.
图4是实施例3中其中一个分子标记物(Marke9)荧光定量PCR反应检测扩增曲线。FIG. 4 is the amplification curve of one of the molecular markers (Marke9) in Example 3 detected by fluorescence quantitative PCR reaction.
图5是实施例4中标志物组合A(a)、B(b)和K(c)的ROC曲线。5 is the ROC curve of the marker combinations A(a), B(b) and K(c) in Example 4. FIG.
图6是实施例4中标志物组合A/B(a)、A/C/J(b)、A/B/C/J(c)和A-L(d)的ROC曲线。6 is the ROC curve of the marker combinations A/B(a), A/C/J(b), A/B/C/J(c) and A-L(d) in Example 4. FIG.
具体实施方式Detailed ways
本发明下列实施例中未注明具体条件的实验方法,通常按照常规条件,例如Sambrook等人,分子克隆:实验室手册(New York:Cold Spring Harbor Laboratory Press,1989)中所述的条件,或按照制造厂商所建议的条件。实施例中所用到的各种常用化学试剂,均为市售产品。The experimental methods of unreceipted specific conditions in the following examples of the present invention are usually in accordance with conventional conditions, such as those described in Sambrook et al., Molecular Cloning: Laboratory Manual (New York:Cold Spring Harbor Laboratory Press, 1989), or Follow the conditions recommended by the manufacturer. Various common chemical reagents used in the examples are all commercially available products.
除非另有定义,本发明所使用的所有的技术和科学术语与属于本发明的技术领域的技术人员通常理解的含义相同。本发明的说明书中所使用的术语只是为了描述具体的实施例的目的,不用于限制本发明。Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. The terms used in the description of the present invention are only for the purpose of describing specific embodiments, and are not used to limit the present invention.
本发明的术语“包括”和“具有”以及它们任何变形,意图在于覆盖不排他的包含。例如包含了一系列步骤的过程、方法、装置、产品或设备没有限定于已列出的步骤或模块,而是可选地还包括没有列出的步骤,或可选地还包括对于这些过程、方法、产品或设备固有的其它步骤。The terms "comprising" and "having" and any variations thereof of the present invention are intended to cover a non-exclusive inclusion. For example, a process, method, apparatus, product or device comprising a series of steps is not limited to the listed steps or modules, but optionally also includes unlisted steps, or optionally also includes steps for these processes, other steps inherent in the method, product or device.
在本发明中提及的“多个”是指两个或两个以上。“和/或”,描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。字符“/”一般表示前后关联对象是一种“或”的关系。The "plurality" mentioned in the present invention means two or more. "And/or", which describes the association relationship of the associated objects, means that there can be three kinds of relationships, for example, A and/or B, which can mean that A exists alone, A and B exist at the same time, and B exists alone. The character "/" generally indicates that the associated objects are an "or" relationship.
除非特别说明或另有定义,本文所使用的“第一、第二…”仅仅是用于对名称的区分,不代表具体的数量或顺序。Unless specifically stated or otherwise defined, "first, second..." used herein is only used to distinguish names, and does not represent a specific number or order.
为了便于理解本发明,下面将对本发明进行更全面的描述。本发明可以以许多不同的形式来实现,并不限于本文所描述的实施例。相反地,提供这些实施例的目的是使对本发明公开内容的理解更加透彻全面。In order to facilitate understanding of the present invention, the present invention will be described more fully below. The present invention may be implemented in many different forms and is not limited to the embodiments described herein. Rather, these embodiments are provided so that a thorough and complete understanding of the present disclosure will be provided.
表1 26个甲基化标记物Table 1 26 methylation markers
生物标记物biomarker
|
chromsomechromsome
|
hg19_starthg19_start
|
hg19_endhg19_end
|
GeneGene
|
strandstrand
|
SEQ ID NO.SEQ ID NO.
|
cg23035715cg23035715
|
chr1chr1
|
223302410223302410
|
223303011223303011
|
BMP7BMP7
|
++
|
211211
|
cg26371731cg26371731
|
chr1chr1
|
156389940156389940
|
156390541156390541
|
C1orf61C1orf61
|
++
|
212212
|
cg04541368cg04541368
|
chr5chr5
|
9290912992909129
|
9290973092909730
|
NR2F1-AS1NR2F1-AS1
|
++
|
213213
|
cg13973436cg13973436
|
chr16chr16
|
6742859867428598
|
6742919967429199
|
ZDHHC1ZDHHC1
|
++
|
214214
|
cg16304215cg16304215
|
chr5chr5
|
7692851076928510
|
7692911176929111
|
OTPOTP
|
++
|
215215
|
cg20072171cg20072171
|
chr3chr3
|
6235666162356661
|
6235726262357262
|
FEZF2FEZF2
|
--
|
216216
|
cg08402365cg08402365
|
chr2chr2
|
172944973172944973
|
172945574172945574
|
METAP1DMETAP1D
|
--
|
217217
|
cg21501525cg21501525
|
chr19chr19
|
3184364731843647
|
3184424831844248
|
TSHZ3TSHZ3
|
--
|
218218
|
cg22778178cg22778178
|
chr17chr17
|
4804165548041655
|
4804225648042256
|
FLJ45513FLJ45513
|
--
|
219219
|
cg08599259cg08599259
|
chr1chr1
|
3851063238510632
|
3851123338511233
|
POU3F1POU3F1
|
--
|
220220
|
cg25566568cg25566568
|
chr14chr14
|
3699470936994709
|
3699531036995310
|
NKX2-1-AS1NKX2-1-AS1
|
++
|
221221
|
cg15634980cg15634980
|
chr6chr6
|
3625272636252726
|
3625332736253327
|
PNPLA1PNPLA1
|
++
|
222222
|
cg07458308cg07458308
|
chr5chr5
|
134827211134827211
|
134827812134827812
|
TIFABTIFAB
|
--
|
223223
|
cg01348584cg01348584
|
chr7chr7
|
49107224910722
|
49113234911323
|
RADILRADIL
|
++
|
224224
|
cg14140881cg14140881
|
chr7chr7
|
105221719105221719
|
105222320105222320
|
EFCAB10EFCAB10
|
++
|
225225
|
cg25756435cg25756435
|
chr19chr19
|
1295182812951828
|
1295242912952429
|
MAST1MAST1
|
--
|
226226
|
cg00594560cg00594560
|
chr8chr8
|
7759294177592941
|
7759354277593542
|
ZFHX4-AS1ZFHX4-AS1
|
--
|
227227
|
cg08279008cg08279008
|
chr2chr2
|
8626292486262924
|
8626352586263525
|
POLR1APOLR1A
|
++
|
228228
|
cg09760908cg09760908
|
chr2chr2
|
4502841245028412
|
4502901345029013
|
CAMKMTCAMKMT
|
++
|
229229
|
cg18087672cg18087672
|
chr17chr17
|
4682461546824615
|
4682521646825216
|
HOXB13HOXB13
|
++
|
230230
|
cg14868703cg14868703
|
chr12chr12
|
113909647113909647
|
113910248113910248
|
LHX5-AS1LHX5-AS1
|
--
|
231231
|
cg17632299cg17632299
|
chr13chr13
|
5331266653312666
|
5331326753313267
|
LECT1LECT1
|
++
|
232232
|
cg18786873cg18786873
|
chr1chr1
|
110610598110610598
|
110611199110611199
|
ALX3ALX3
|
--
|
233233
|
cg20631750cg20631750
|
chr20chr20
|
5583926355839263
|
5583986455839864
|
BMP7BMP7
|
--
|
234234
|
cg25924096cg25924096
|
chr6chr6
|
100054285100054285
|
100054886100054886
|
PRDM13PRDM13
|
++
|
235235
|
cg15321298cg15321298
|
chr6chr6
|
150311425150311425
|
150312026150312026
|
ULBP1ULBP1
|
--
|
236236
|
发明人在前期的研究中,利用其中26个甲基化标记物(SEQ ID NO.1-26)的信息,对22例钼靶和彩超无异常的正常人以及47例乳腺癌病人血浆样本(其中,0期3例,I期12例,II期22例,III期8例,IV期2例)进行随机组合检验,利用LASSO和随机森进方法(Random Forest)分别在训练集和测试集进行标记物筛选和模型建立(可参照之前的研究CN2020104264964),利用逻辑回归模型进行建模分析,进一步筛选4个标记物(cg23035715,cg26371731,cg04541368和cg13973436),具体的数据分析方法与实施例2一致,Top1(cg23035715)和Top1/2/3/4(cg23035715,cg26371731,cg04541368和cg13973436)在98%的特异性下(22/22),验证集的乳腺癌0期的检测灵敏度分别为100%(3/3)和100%(3/3),乳腺癌I期的检测灵敏度分别为91.67%(11/12)和91.67%(11/12),乳腺癌II期的检测灵敏度分别为90.90%(20/22)和90.91%(20/22),III期的检测灵敏度分别为87.5%(7/8)和87.5%(7/8),乳腺癌IV期的检测灵敏度分别为50%(1/2)和0%(0/2),对于所有的乳腺癌样本,检测的整体灵敏度分别为91.49%(43/47)87.23%(41/47),整体的AUC分别为0.9395和0.9796,表明了这些位点组合与乳腺癌早期的诊断高度相关。结果请见图1和图2a-2b。In the previous research, the inventors used the information of 26 methylation markers (SEQ ID NO. Among them, 3 cases in stage 0, 12 cases in stage I, 22 cases in stage II, 8 cases in stage III, and 2 cases in stage IV) were subjected to random combination test. Carry out marker screening and model establishment (refer to previous research CN2020104264964), use logistic regression model for modeling analysis, and further screen 4 markers (cg23035715, cg26371731, cg04541368 and cg13973436), specific data analysis methods and Example 2 Consistently, Top1 (cg23035715) and Top1/2/3/4 (cg23035715, cg26371731, cg04541368, and cg13973436) had 98% specificity (22/22), and the detection sensitivity of breast cancer stage 0 in the validation set was 100%, respectively (3/3) and 100% (3/3), the detection sensitivity of breast cancer stage I was 91.67% (11/12) and 91.67% (11/12), respectively, and the detection sensitivity of breast cancer stage II was 90.90% (20/22) and 90.91% (20/22), the detection sensitivity of stage III was 87.5% (7/8) and 87.5% (7/8), respectively, and the detection sensitivity of stage IV breast cancer was 50% (1 /2) and 0% (0/2), for all breast cancer samples, the overall sensitivity of the detection was 91.49% (43/47), 87.23% (41/47), and the overall AUC was 0.9395 and 0.9796, respectively, indicating that These loci combinations are highly correlated with the early diagnosis of breast cancer. See Figure 1 and Figure 2a-2b for the results.
所述4个生物标记物:cg23035715,cg26371731,cg04541368和cg13973436,其中,单个标记物cg23035715的AUC 0.9395,见图1,且图1为26个共甲基化标记物中4个标记物(cg23035715,cg26371731,cg04541368和cg13973436)组合一起,与marker1(cg23035715)标记物在鉴别乳腺癌能力方面的比较,通过图1可看出,4个甲基化标记物组合的AUC已经很接近于26个标记物的组合的AUC,marker1的标记物AUC也有0.9395,说明这4个甲基化物标记物的重要性,特别是marker1的标记物已经具有很强的诊断早期乳腺癌的能力。The 4 biomarkers: cg23035715, cg26371731, cg04541368 and cg13973436, wherein the AUC of the single marker cg23035715 is 0.9395, see Figure 1, and Figure 1 shows 4 markers (cg23035715, The combination of cg26371731, cg04541368 and cg13973436) compared with marker1 (cg23035715) markers in the ability to identify breast cancer, as can be seen from Figure 1, the AUC of the combination of 4 methylation markers is already very close to 26 markers The combined AUC of the marker1 marker was also 0.9395, indicating the importance of these four methylation markers, especially the marker1 marker has a strong ability to diagnose early breast cancer.
图2c-2d所示,在98%的特异性下,Top1/2/3和Top1/2/4甲基化标记物组合在验证集的0期的检测灵敏度分别为100%(3/3)和33.33%(1/3),I期的检测灵敏度分别为100%(12/12)和66.67%(8/12),II期的检测灵敏度分别为81.82%(18/22)和63.63%(14/22),III期的 检测灵敏度分别为87.5%(7/8)和75%(6/8),IV期检测灵敏度分别为50%(1/2)和0%(0/2),整体检测的整体灵敏度为分别85.11%(40/47)和61.7%(29/47)。其中,图2c中的top1/2/3代表的是cg23035715,cg26371731和cg04541368组合,而top1/2/4代表的是3个标记物(cg23035715,cg26371731和cg13973436)的组合,以此类推。As shown in Figure 2c-2d, at 98% specificity, the detection sensitivity of Top1/2/3 and Top1/2/4 methylation marker combinations in phase 0 of the validation set was 100% (3/3), respectively and 33.33% (1/3), the detection sensitivity of phase I was 100% (12/12) and 66.67% (8/12), and the detection sensitivity of phase II was 81.82% (18/22) and 63.63% ( 14/22), the detection sensitivity of stage III was 87.5% (7/8) and 75% (6/8), and the detection sensitivity of stage IV was 50% (1/2) and 0% (0/2), respectively, The overall sensitivity of the overall detection was 85.11% (40/47) and 61.7% (29/47), respectively. Among them, top1/2/3 in Figure 2c represents the combination of cg23035715, cg26371731 and cg04541368, while top1/2/4 represents the combination of the three markers (cg23035715, cg26371731 and cg13973436), and so on.
进一步验证top1单独的重要性,在98%的特异性,top1(SEQ ID NO.1)与随机SEQ ID NO.5-26标记物中的cg23035715,cg16304215和cg08599259组合以及与其它剩余40个标记物(SEQ ID NO.27-66)的随机挑先的cg01167274,cg07790615,cg23134869和cg01832036组合,验证集的0期的检测灵敏度分别为100%(3/3)和100%(3/3),I期的检测灵敏度分别为100%(12/12)和83.33%(10/12),II期的检测灵敏度分别为95.45%(21/22)和95.45%(21/22),III期的检测灵敏度分别为100%(8/8)和87.5%(7/8),IV期检测灵敏度分别为0%(0/2)和0%(0/2),整体检测的整体灵敏度分别为93.61%(40/47)和61.7%(29/47),见图3a-3b和图3a-3b。Further validation of the importance of top1 alone, at 98% specificity, top1 (SEQ ID NO. 1) in combination with cg23035715, cg16304215 and cg08599259 in random SEQ ID NO. 5-26 markers and with the remaining 40 markers (SEQ ID NO. 27-66) randomly selected combinations of cg01167274, cg07790615, cg23134869 and cg01832036, the detection sensitivities of phase 0 of the validation set were 100% (3/3) and 100% (3/3), respectively. The detection sensitivity of phase II was 100% (12/12) and 83.33% (10/12), respectively, the detection sensitivity of phase II was 95.45% (21/22) and 95.45% (21/22), and the detection sensitivity of phase III were 100% (8/8) and 87.5% (7/8), respectively, the stage IV detection sensitivity was 0% (0/2) and 0% (0/2), and the overall sensitivity of the overall detection was 93.61% ( 40/47) and 61.7% (29/47), see Figures 3a-3b and 3a-3b.
针对上述的22例钼靶和彩超无异常的正常人以及47例乳腺癌病人血浆样本进行检测的结果,cg23035715对于早期乳腺癌的诊断有很好的表现,与cg26371731,cg04541368和cg13973436的组合也展现出与top1-26的组合相似的诊断能力,而且cg23035715与随机SEQ ID NO.5-26标记物中的cg23035715,cg16304215和cg08599259组合,top1/2/3/4标记物的AUC也在0.75以上,如cg2303571、cg26371731,cg04541368和cg13973436的AUC为0.9395,0.8020,0.7701和0.7501,见表2,说明了利用该方法筛选出来的位点与乳腺癌早期的诊断有非常高的相关性。According to the results of the above-mentioned 22 normal people with no abnormalities in mammography and color Doppler ultrasound and 47 patients with breast cancer plasma samples, cg23035715 has a good performance in the diagnosis of early breast cancer, and the combination with cg26371731, cg04541368 and cg13973436 also showed The diagnostic ability was similar to the combination of top1-26, and cg23035715 was combined with cg23035715, cg16304215 and cg08599259 in random SEQ ID NO.5-26 markers, and the AUC of top1/2/3/4 markers was also above 0.75, For example, the AUCs of cg2303571, cg26371731, cg04541368 and cg13973436 were 0.9395, 0.8020, 0.7701 and 0.7501, as shown in Table 2, indicating that the loci screened by this method had a very high correlation with the early diagnosis of breast cancer.
表.2 26个甲基化标记物片段在乳腺癌诊断中的表现-AUC。Table 2. Performance of 26 methylation marker fragments in breast cancer diagnosis - AUC.
本发明公开了用于乳腺癌诊断的甲基化标记物(23种)检测的引物和探针和组合的PCR 反应体系。根据TCGA数据库严格筛选乳腺癌特异性的甲基化markers,通过靶向甲基化高通量筛选出来原26个生物标记中的23个乳腺癌血浆中标志物,结合甲基化特异性聚合酶链式反应富集与MethyLight PCR荧光来检测多基因的甲基化标志物。The invention discloses primers and probes and a combined PCR reaction system for detecting methylation markers (23 types) for breast cancer diagnosis. The breast cancer-specific methylation markers were strictly screened according to the TCGA database, and 23 breast cancer plasma markers were screened out of the original 26 biomarkers through targeted methylation high-throughput screening, combined with methylation-specific polymerase. Chain reaction enrichment and MethyLight PCR fluorescence were used to detect methylation markers of multiple genes.
基于MethyLight PCR检测ctDNA甲基化来诊断乳腺癌的方法,包括如下步骤:A method for diagnosing breast cancer based on the detection of ctDNA methylation by MethyLight PCR, including the following steps:
步骤一、采用cfDNA提取试剂盒,提取待测生物样本的游离DNA; Step 1. Use a cfDNA extraction kit to extract the cell-free DNA of the biological sample to be tested;
步骤二、对所述DNA进行亚硫酸氢盐转化; Step 2, carrying out bisulfite conversion to the DNA;
步骤三、上述DNA甲基化分子标记物的扩增引物对转化后的DNA进行多重PCR扩增,得到多重PCR扩增产物; Step 3, the amplification primer of the above-mentioned DNA methylation molecular marker performs multiple PCR amplification on the transformed DNA to obtain a multiple PCR amplification product;
步骤四、获得的多重PCR产物用针对上述DNA甲基化分子标记物的探针进行多重荧光定量PCR检测; Step 4. The obtained multiplex PCR product is detected by multiplex fluorescence quantitative PCR with the probe for the above-mentioned DNA methylation molecular marker;
步骤五、通过内参基因C
T值判断样本是否有效,然后用内参基因C
T值对有效样本中检测的每个分子标记物的C
T值进行校正;
Step 5. Determine whether the sample is valid by the CT value of the internal reference gene, and then use the CT value of the internal reference gene to correct the CT value of each molecular marker detected in the valid sample;
步骤六、将校正后的数据进行模型分析,最后进行肺结节良恶性的判断。Step 6: Perform model analysis on the corrected data, and finally judge the benign and malignant pulmonary nodules.
所述步骤五中若内参基因的C
T值在8-18之间,则该样本判断为有效样本;反之,则为无效样本;然后用内参基因C
T值对有效样本中的每个DNA甲基化分子标记物的C
T值进行校正;若目标DNA甲基化分子标记物C
T值<35判断该DNA甲基化分子标记物被检出,得到该目标DNA甲基化分子标记物的相对循环数ΔC
T:ΔC
T=目标DNA甲基化分子标记物C
T值-内参基因C
T值;若目标DNA甲基化分子标记物C
T值≥35则判断该DNA甲基化分子标记物未被检出,则赋予其ΔC
T=27。
In the step 5, if the CT value of the internal reference gene is between 8 and 18, the sample is judged as a valid sample; otherwise, it is an invalid sample; The CT value of the target DNA methylation molecular marker is corrected; if the CT value of the target DNA methylation molecular marker is less than 35, it is judged that the DNA methylation molecular marker has been detected, and the target DNA methylation molecular marker is obtained. Relative cycle number ΔC T : ΔC T = target DNA methylation molecular marker CT value - internal reference gene CT value; if the target DNA methylation molecular marker CT value ≥ 35, the DNA methylation molecular marker is judged If the substance was not detected, it was assigned ΔC T =27.
所述步骤六中根据校正后的ΔC
T值进行数据分析,采用朴素贝叶斯算法建立乳腺癌诊断模型。根据校正后的ΔC
T值进行数据分析,将数据集按照6:4随机切分为训练集和测试集,然后针对训练集里含有不同DNA甲基化分子标记物的组合采用朴素贝叶斯算法建立恶性预测模型,最后在对应的含有特定的DNA甲基化分子标记物组合的测试集中评估模型的分类能力。
In the step 6, data analysis is performed according to the corrected ΔC T value, and a breast cancer diagnosis model is established by using the Naive Bayes algorithm. Data analysis was carried out according to the corrected ΔC T value, and the data set was randomly divided into training set and test set according to 6:4, and then the Naive Bayes algorithm was used for the combination of different DNA methylation molecular markers in the training set. The malignant prediction model was established, and finally the classification ability of the model was evaluated in the corresponding test set containing a combination of specific DNA methylation molecular markers.
以下具体实施例,是对本发明做进一步的详细阐述,但不用于限制本发明的保护范围。The following specific examples further illustrate the present invention in detail, but are not intended to limit the protection scope of the present invention.
实施例1本实施例提供了一种用于血浆样本检测乳腺癌的试剂盒,其包含了23个检测基因TLR5、C1orf61、NR2F1-AS1、ZDHHC1、OTP、FEZF2、TSHZ3、FLJ45513、POU3F1、NKX2-1-AS1、PNPLA1、TIFAB、RADIL、EFCAB10、MAST1、ZFHX4-AS1、HOXB13、LHX5-AS1、LECT1、ALX3、BMP7、PRDM13和ULBP1中的23个甲基化区域的DNA甲基化分子标记物Marker1至Marker23的检测引物和探针。Example 1 This example provides a kit for detecting breast cancer in plasma samples, which contains 23 detection genes TLR5, C1orf61, NR2F1-AS1, ZDHHC1, OTP, FEZF2, TSHZ3, FLJ45513, POU3F1, NKX2- DNA methylation molecular marker Marker1 of 23 methylated regions in 1-AS1, PNPLA1, TIFAB, RADIL, EFCAB10, MAST1, ZFHX4-AS1, HOXB13, LHX5-AS1, LECT1, ALX3, BMP7, PRDM13 and ULBP1 Detection primers and probes to Marker23.
所述试剂盒针对23个用于血浆样本检测乳腺癌的分子标志物Marker1至Marker23中的特异性甲基化位点各设计了三对引物和三条探针(探针荧光标记可采用FAM、VIC及NED等荧光基团标记),并分别标记为组合1、2、3。其中,每个分子标志物中被选择的引物和探针组合,可任意选择与其它的甲基化标志物中的引物和探针的组合1、2、3进行组合并在同一平台进行检测。具体的各分子标记物对应的引物和探针序列如表1所示:The kit designed three pairs of primers and three probes for each of the specific methylation sites in the 23 molecular markers Marker1 to Marker23 used for the detection of breast cancer in plasma samples (the probes can be fluorescently labeled with FAM, VIC, etc.). and NED and other fluorophore labels), and labeled as combination 1, 2, 3, respectively. Wherein, the selected primer and probe combination in each molecular marker can be arbitrarily selected and combined with the combination 1, 2 and 3 of primers and probes in other methylation markers and detected on the same platform. The specific primer and probe sequences corresponding to each molecular marker are shown in Table 1:
表1.1.相关分子标记物的引物和探针序列Table 1.1. Primer and probe sequences for related molecular markers
本实施例和以下实施例中,优选所用的引物和探针组合均为Marker1-Marker23的引物和探针组合1。在实际应用中,将根据甲基化分子标记物的不同组合进行选择相应的引物和探针。In this embodiment and the following embodiments, the primer and probe combination used is preferably the primer and probe combination 1 of Marker1-Marker23. In practical applications, the corresponding primers and probes will be selected according to different combinations of methylated molecular markers.
实施例2Example 2
本实施例利用实施例1所述试剂盒对血浆样本中23个生物标记物的甲基化水平进行检测。This example uses the kit described in Example 1 to detect the methylation levels of 23 biomarkers in plasma samples.
一种DNA甲基化分子标记物的甲基化水平检测方法,包括以下步骤:A method for detecting methylation levels of DNA methylation molecular markers, comprising the following steps:
1、样本中cfDNA的提取:1. Extraction of cfDNA from the sample:
血浆DNA提取具体操作步骤按照Life公司的MagMAX TM Cell-Free DNA Isolation Kit操作说明书进行。The specific operation steps of plasma DNA extraction were carried out according to the operation instructions of Life Company's MagMAX TM Cell-Free DNA Isolation Kit.
2、对提取的cfDNA进行亚硫酸盐转化2. Sulfite conversion of the extracted cfDNA
将提取的cfDNA(10ng)进行亚硫酸氢盐转化,使DNA中未发生甲基化的胞嘧啶脱氨基转变成尿嘧啶,而甲基化的胞嘧啶保持不变,得到亚硫酸氢盐转化后的DNA,转化具体操作按照Zymo Research的EZ DNA Methylation-Lightning Kit说明书进行。使DNA中未发生甲基化的胞嘧啶脱氨基转变成尿嘧啶,而甲基化的胞嘧啶保持不变。经过亚硫酸氢盐转化后的DNA产物全部用于进行多重PCR扩增。The extracted cfDNA (10ng) was subjected to bisulfite conversion to deaminate the unmethylated cytosine into uracil, while the methylated cytosine remained unchanged to obtain a bisulfite converted The specific operation of transformation was carried out according to the instructions of Zymo Research's EZ DNA Methylation-Lightning Kit. Deamination of unmethylated cytosines in DNA to uracils, while methylated cytosines remain unchanged. All bisulfite-converted DNA products were used for multiplex PCR amplification.
3、对转化后的DNA进行多重PCR扩增3. Multiplex PCR amplification of transformed DNA
转化后的DNA产物全部进行多重PCR扩增,其中的反应组分为:特定组合的分子标记物及1个内参基因的引物混合液,其中每条引物浓度在200nM-300nM,本实施例优选300nM;镁离子的浓度在1-3mM,本实施例优选在1.5mM;dNTP混合液浓度在200-600uM,本实施例优选在400uM;反应酶为Phusion U(Thermo Fisher,Cat#F555L),一个反应的单位数为1-3U,本实施例优选1.5U。多重PCR反应体系配制如表2.1:All the converted DNA products are amplified by multiplex PCR, and the reaction components are: a specific combination of molecular markers and a primer mixture of an internal reference gene, wherein the concentration of each primer is 200nM-300nM, and this embodiment is preferably 300nM The concentration of magnesium ions is 1-3mM, preferably 1.5mM in this embodiment; the concentration of dNTP mixture is 200-600uM, and this embodiment is preferably 400uM; the reaction enzyme is Phusion U (Thermo Fisher, Cat#F555L), a reaction The number of units is 1-3U, preferably 1.5U in this embodiment. The preparation of the multiplex PCR reaction system is shown in Table 2.1:
表2.1.多重PCR反应体系Table 2.1. Multiplex PCR reaction system
组分component
|
体积(ul)Volume (ul)
|
终浓度Final concentration
|
5X GC buffer 5X GC buffer
|
1010
|
1X1X
|
dNTP mix(10mM/dNTP)dNTP mix(10mM/dNTP)
|
22
|
400uM/dNTP400uM/dNTP
|
引物混合液(75uM/引物)Primer Mix (75uM/primer)
|
8.48.4
|
300nM/引物300nM/primer
|
MgCl2(25mM)MgCl2 (25mM)
|
33
|
1.5mM1.5mM
|
Phusion U(2U/ul)Phusion U(2U/ul)
|
0.750.75
|
1.5U1.5U
|
DNA模板DNA template
|
18-2218-22
|
|
DEPC H2ODEPC H2O
|
up to 50up to 50
|
|
具体反应条件为:预变性,98℃,30s;15-35个循环反应,本实施例优选22个循环:变性,98℃,15s;退火,58-66℃,15-30s,本实施例优选63℃,15s;延伸72℃,15-30s,本实施例优选15s。The specific reaction conditions are: pre-denaturation, 98°C, 30s; 15-35 cycles of reaction, preferably 22 cycles in this embodiment: denaturation, 98°C, 15s; annealing, 58-66°C, 15-30s, this embodiment preferably 63°C, 15s; extension 72°C, 15-30s, preferably 15s in this embodiment.
4、对多重PCR扩增产物进行多重荧光定量PCR测定4. Multiplex fluorescence quantitative PCR assay for multiplex PCR amplification products
对多重PCR产物进行1-5倍稀释,本实施例优选3倍稀释。多重荧光定量PCR反应组分为:引物探针混合液,其中各引物浓度在200-900nM,本实施例优选900nM;探针浓度在100-200nM,本实施例优选200nM。采用的反应酶混合液为1倍的ChamQ Geno-SNP probe Master Mix(Vazyme,Cat#Q811-02),一个反应为10ul体系。The multiplex PCR products are diluted 1-5 times, preferably 3 times in this embodiment. The multiplex fluorescence quantitative PCR reaction components are: primer-probe mixture, wherein the concentration of each primer is 200-900nM, preferably 900nM in this embodiment; the probe concentration is 100-200nM, preferably 200nM in this embodiment. The reaction enzyme mixture used is 1 times the ChamQ Geno-SNP probe Master Mix (Vazyme, Cat#Q811-02), and one reaction is a 10ul system.
该体系中的引物探针混合液包含了2-3个分子标记物的引物和对应的不同荧光基团标记的探针,表2.2中列举了2个分子标记物的引物和探针进行混合的混合液方案:The primer-probe mixture in this system contains 2-3 molecular marker primers and corresponding probes labeled with different fluorophores. Table 2.2 lists the mixture of two molecular marker primers and probes. Mixed solution:
表2.2.多重荧光定量PCR反应体系中的引物探针混合液组合方案Table 2.2. The primer-probe mixture scheme in the multiplex fluorescence quantitative PCR reaction system
具体反应条件为:预变性,95℃,5min;40-50个循环,本实施例优选45个循环:变性,95℃15s;退火,60-64℃,本实施例优选62℃,1min,信号收集。qPCR荧光定量反应体系配制如表2.3:The specific reaction conditions are: pre-denaturation, 95°C, 5min; 40-50 cycles, preferably 45 cycles in this embodiment: denaturation, 95°C for 15s; annealing, 60-64°C, preferably 62°C, 1min in this embodiment, signal collect. The configuration of the qPCR fluorescence quantitative reaction system is shown in Table 2.3:
表2.3.qPCR荧光定量反应体系Table 2.3. qPCR fluorescence quantitative reaction system
用完全甲基化(阳性对照)及非甲基化(阴性对照)标准品对23个分子标记物按照表格5所示的混合液组合(混合液A-M)进行多重荧光定量PCR测定的C
T值,与对其进行单独荧光定量PCR反应的C
T值比较,其中阴性对照在所有组合及单个定量中均无检出,结果显示各分子标记物在按照表2.2所示的混合液组合方案进行的荧光定量PCR反应的C
T值与其单独进行荧光定量PCR反应的C
T值相近,没有显著差异,由此判断多重荧光定量的引物探针混合液组合方案中各分子标记物的扩增效率没有相互干扰,定量性能与单个区域定量等同,可实现2-3个分子标记物的同时定量检测(表2.4)。
CT values of 23 molecular markers determined by multiplex fluorescence quantitative PCR using the fully methylated (positive control) and unmethylated (negative control) standards according to the mixed solution combination (mixed solution AM) shown in Table 5 , compared with the CT value of the single fluorescence quantitative PCR reaction, the negative control was not detected in all combinations and single quantification. The CT value of the fluorescence quantitative PCR reaction is similar to the CT value of the fluorescence quantitative PCR reaction alone, and there is no significant difference. Therefore, it can be judged that the amplification efficiency of each molecular marker in the primer-probe mixture combination scheme of multiplex fluorescence quantitative PCR is not mutually exclusive. Interference, the quantitative performance is equivalent to that of a single area, and the simultaneous quantitative detection of 2-3 molecular markers can be achieved (Table 2.4).
表2.4.各分子标记物在多重和单独荧光定量PCR反应下的C
T值
Table 2.4. CT values of each molecular marker under multiplex and single fluorescence quantitative PCR reactions
本实施例进一步提供一种检测乳腺癌的方法,还包括以下步骤:The present embodiment further provides a method for detecting breast cancer, further comprising the steps of:
5、根据荧光定量PCR反应测定的内参基因在样本中的C
T值判断该检测样本是否为有效样品,若检测样本内参基因的C
T值在8-18之间,则该样本判断为有效样品;若检测样本内参基因的C
T值<8,则样本初始投入量过剩;若检测样本内参基因的C
T值>18,则初始样本投入量不足。对于初始投入量过剩或不足的样本,均判定为无效样本,不纳入检测和分析。
5. Determine whether the test sample is a valid sample according to the CT value of the internal reference gene in the sample determined by the fluorescence quantitative PCR reaction. If the CT value of the internal reference gene in the test sample is between 8 and 18, the sample is judged as a valid sample. ; If the CT value of the internal reference gene in the test sample is less than 8, the initial input amount of the sample is excessive; if the CT value of the internal reference gene in the test sample is > 18, the initial sample input amount is insufficient. Samples with excess or insufficient initial input are judged as invalid samples and will not be included in testing and analysis.
6、在样品判断为有效样品的前提下,若目标DNA甲基化分子标记物C
T值<35判断该DNA甲基化分子标记物被检出,得到该目标DNA甲基化分子标记物的相对循环数ΔC
T:ΔC
T=目标DNA甲基化分子标记物C
T值-内参基因C
T值;若目标DNA甲基化分子标记物C
T值≥35则判断该DNA甲基化分子标记物未被检出,则赋予其ΔC
T=27。
6. On the premise that the sample is judged as a valid sample, if the C T value of the target DNA methylation molecular marker is less than 35, it is judged that the DNA methylation molecular marker has been detected, and the target DNA methylation molecular marker is obtained. Relative cycle number ΔC T : ΔC T = target DNA methylation molecular marker CT value - internal reference gene CT value; if the target DNA methylation molecular marker CT value ≥ 35, the DNA methylation molecular marker is judged If the substance was not detected, it was assigned ΔC T =27.
7、根据各样本中的目标分子标记物校正后的ΔCT进行数据分析,将数据集按照6:4分为训练集和测试集,并进行100次随机切分为,然后针对训练集以含有不同分子标记物的组合采用朴素贝叶斯算法建立良恶性预测模型,最后在对应的含有特定的分子标记物组合的测试集中评估模型的分类能力。7. Perform data analysis according to the corrected ΔCT of the target molecular markers in each sample, divide the data set into training set and test set according to 6:4, and perform 100 random segmentations. The combination of molecular markers uses the Naive Bayes algorithm to establish a benign and malignant prediction model, and finally evaluates the classification ability of the model in the corresponding test set containing a specific combination of molecular markers.
实施例3Example 3
本实施例提供分子标记物在标准品中的检测,其检测步骤如下:This embodiment provides the detection of molecular markers in standard products, and the detection steps are as follows:
1、标准品制备1. Standard preparation
1)0%甲基化标准品制备:利用
Single Cell Kit(Qiagen,Cat#150343)和Mung Bean Nuclease(NEB,Cat#M0250L)处理NA12878DNA以制备0%甲基化标准品;
1) 0% methylation standard preparation: use Single Cell Kit (Qiagen, Cat#150343) and Mung Bean Nuclease (NEB, Cat#M0250L) treated NA12878 DNA to prepare 0% methylation standard;
2)100%甲基化标准品制备:2) 100% methylation standard preparation:
利用CpG Methyltransferase(M.SssI)处理所制备的0%的甲基化标准品,得到100%的甲基化标准品。The prepared 0% methylation standard was treated with CpG Methyltransferase (M.SssI) to give a 100% methylation standard.
2、不同甲基化比例的标准品制备:2. Preparation of standards with different methylation ratios:
按照所需的甲基化比例梯度混合0%与100%甲基化标准品,得到0.2%,0.4%,1%甲基化比例的标准品。Mix the 0% and 100% methylation standards in a gradient of the desired methylation ratio to obtain 0.2%, 0.4%, and 1% methylation ratio standards.
3、对不同甲基化比例的标准品DNA进行亚硫酸氢盐转化:步骤如实施例2,转化投入量为10-50ng,本实施例优选为50ng。3. Perform bisulfite transformation on standard DNAs with different methylation ratios: the steps are as in Example 2, and the transformation input amount is 10-50 ng, preferably 50 ng in this embodiment.
4、对转化后的标准品DNA进行多重PCR扩增,步骤如实施例2,多重PCR引物混合液为23个分子标记物以及内参基因的引物。4. Amplify the transformed standard DNA by multiplex PCR, as in Example 2. The multiplex PCR primer mixture is primers for 23 molecular markers and an internal reference gene.
5、对上述多重PCR扩增产物进行荧光定量PCR测定,步骤如实施例2。5. Perform fluorescence quantitative PCR assay on the above-mentioned multiplex PCR amplification products, the steps are as in Example 2.
6、根据荧光定量PCR反应测定的内参基因ACTB在样本中的C
T值判断该检测样本是否为有效样品,若检测样本内参基因的C
T值在8-18之间,则该样本判断为有效样品;
6. Determine whether the test sample is a valid sample according to the CT value of the internal reference gene ACTB in the sample determined by the fluorescence quantitative PCR reaction. If the CT value of the internal reference gene in the test sample is between 8 and 18, the sample is judged to be valid. sample;
7、在样品判断为有效样品的前提下,若目标DNA甲基化分子标记物C
T值<35判断该DNA甲基化分子标记物被检出,若目标DNA甲基化分子标记物C
T值≥35则判断该DNA甲基化分子标记物未被检出。
7. On the premise that the sample is judged as a valid sample, if the C T value of the target DNA methylation molecular marker is less than 35, it is judged that the DNA methylation molecular marker has been detected. If the target DNA methylation molecular marker C T If the value is ≥35, it is judged that the DNA methylation molecular marker has not been detected.
本实施例及以下实施例中,各分子标记物的引物探针组合如实施例1中优选的组合1。In this example and the following examples, the primer-probe combination of each molecular marker is the preferred combination 1 in Example 1.
本实施例及以下实施例中,每一次试验都会设置阴性对照,该阴性对照以水作为模板进行多重PCR,得到的阴性对照多重PCR产物再进行各特定分子标记物的荧光定量PCR测定。如阴性对照无检测信号,则判定整个实验操作无外源污染。In this example and the following examples, a negative control is set for each test. The negative control uses water as a template to perform multiplex PCR, and the obtained negative control multiplex PCR products are then subjected to fluorescence quantitative PCR for each specific molecular marker. If there is no detection signal in the negative control, it is determined that there is no exogenous contamination in the entire experimental operation.
本实施例进行了3次完全独立重复试验,23个分子标记物在100%甲基化标准品中均有检测信号,而阴性对照和非甲基化检测标准品中都无检测信号,以其中一个分子标记物(Marker1)扩增曲线为例,如图1所示,在甲基化比例≥0.2%的各标准品中,Marker1,Marker2,Marker3,Marker4,Marker8,Marker10,Marker11,Marker15,Marker17,Marker19和Marker21三次试验全有检测信号,说明这些分子标记物的引物和探针对甲基化比例≥0.2% 的样品检出率达到了100%;在甲基化比例≥0.4%的各标准品中,Marker5,Marker6,Marker7,Marker9,Marker12,Marker13,Marker14,Marker15,Marker18,Marker20,Marker22和Marker23三次试验全有检测信号,说明这些探针和引物对甲基化比例≥0.4%的样品的检出率达到了100%;22个分子标记物在甲基化比例为1%的标准品中,三次试验全有检测信号,说明这些分子标记物的探针和引物都能检出甲基化比例为1%的信号,灵敏度较高。图4是分子标记物(cg08599259)荧光定量PCR反应检测扩增曲线。In this example, three completely independent repeated experiments were carried out, and 23 molecular markers all had detection signals in the 100% methylated standard, while there was no detection signal in the negative control and non-methylated detection standard. Take the amplification curve of a molecular marker (Marker1) as an example, as shown in Figure 1, among the standards with methylation ratio ≥ 0.2%, Marker1, Marker2, Marker3, Marker4, Marker8, Marker10, Marker11, Marker15, Marker17 , Marker19 and Marker21 all three tests have detection signals, indicating that the primers and probes of these molecular markers have a detection rate of 100% for samples with a methylation ratio of ≥0.2%; in each standard with a methylation ratio of ≥0.4% Among the samples, Marker5, Marker6, Marker7, Marker9, Marker12, Marker13, Marker14, Marker15, Marker18, Marker20, Marker22 and Marker23 all have detection signals, indicating that these probes and primers have a methylation ratio of ≥0.4%. The detection rate reached 100%; 22 molecular markers in the standard with a methylation ratio of 1% had detection signals in all three tests, indicating that the probes and primers of these molecular markers can detect methylation The ratio is 1% of the signal, the sensitivity is higher. Figure 4 is the amplification curve of the molecular marker (cg08599259) fluorescence quantitative PCR reaction detection.
实施例4不同分子标记物组合对血浆样本进行乳腺癌检测的表现Example 4 Performance of different molecular marker combinations for breast cancer detection in plasma samples
本实施例利用23个分子标记物包含Marker1至Marker23检测引物和探针的任意组合,通过实施例2的实验方法,具体的检测试剂盒、试验方法以及数据判断处理如实施例2所述,引物和探针组合如实施例1所优选的组合。This example uses 23 molecular markers including Marker1 to Marker23 to detect any combination of primers and probes. Through the experimental method of Example 2, the specific detection kit, test method and data judgment processing are as described in Example 2. The primers The combination with the probe is the preferred combination in Example 1.
本实施例对200例血浆本的分子标记物Marker1-Marker23进行了标志物的检测,该200例血浆样本包括了确诊的100例健康人对照和100例乳腺癌患者血浆样本,利用逻辑回归模型进行建模分析,将数据集按照6:4随机切分为训练集和测试集,采用朴素贝叶斯算法建立良恶性预测模型。In this example, the molecular markers Marker1-Marker23 of 200 plasma samples were detected. The 200 plasma samples included 100 healthy controls and 100 breast cancer patients. The logistic regression model was used to detect the markers. For modeling analysis, the data set was randomly divided into training set and test set according to 6:4, and the benign and malignant prediction model was established by the naive Bayes algorithm.
选取表2.2中分子标志物组合A(Marker 1+内参)进行建模分析,其在测试集中AUC为0.885(特异性:92.5%;灵敏性:67.5%),ROC如图5a所示。可见Marker 1的引物和探针的PCR检测结果的灵敏性和特异性都非常好。由于样本中有73%的恶性样本为乳腺癌早期样本,因此通过该引物和探针组合检测标志物组合A,对早期乳腺癌检测也有较高的灵敏性和特异性。The molecular marker combination A (Marker 1+internal reference) in Table 2.2 was selected for modeling analysis, and its AUC in the test set was 0.885 (specificity: 92.5%; sensitivity: 67.5%), and the ROC was shown in Figure 5a. It can be seen that the sensitivity and specificity of the PCR detection results of the primers and probes of Marker 1 are very good. Since 73% of malignant samples in the samples are early breast cancer samples, the combination of primers and probes to detect marker combination A also has high sensitivity and specificity for early breast cancer detection.
选取表2.2中分子标记物组合B进行建模分析,其在测试集中AUC为0.8316(特异性:92.5%;灵敏性:60%),ROC如图5b所示,可见所述所选分子标记物的引物和探针的PCR检测结果,同样有较好的灵敏性和特异性。由于样本中有73%的恶性样本为乳腺癌早期样本,因此该分子标记物组合对早期乳腺癌检测也有较高的灵敏性和特异性。The molecular marker combination B in Table 2.2 was selected for modeling analysis, and its AUC in the test set was 0.8316 (specificity: 92.5%; sensitivity: 60%). The ROC is shown in Figure 5b, and the selected molecular markers can be seen The PCR detection results of the primers and probes also have good sensitivity and specificity. Since 73% of malignant samples are early breast cancer samples, the molecular marker combination also has high sensitivity and specificity for early breast cancer detection.
选取表2.2中分子标记物组合K进行建模分析,其在测试集中AUC为0.7766(特异性:95%;灵敏性:32.5%),ROC如图5c所示。可见所述所选分子标记物的引物和探针的PCR检测结果,具有较好的灵敏性和特异性,只是灵敏性比D组合稍差。The molecular marker combination K in Table 2.2 was selected for modeling analysis, and its AUC in the test set was 0.7766 (specificity: 95%; sensitivity: 32.5%), and the ROC was shown in Figure 5c. It can be seen that the PCR detection results of the primers and probes of the selected molecular markers have better sensitivity and specificity, but the sensitivity is slightly lower than that of the D combination.
选取表2.2中分子标记物组合A和B进行建模分析,其在测试集中AUC为0.9144(特异性:92.5%;灵敏性:72.5%),ROC如图6a所示。可见所述所选分子标记物的引物和探针组合检测结果,具有较好的灵敏度,特异性稍弱于其他组合。由于样本中有73%的恶性样本为乳腺癌早期样本,因此该分子标记物组合对早期乳腺癌检测也有较高的灵敏性和特异性。Molecular marker combinations A and B in Table 2.2 were selected for modeling analysis, and their AUC in the test set was 0.9144 (specificity: 92.5%; sensitivity: 72.5%), and the ROC was shown in Figure 6a. It can be seen that the detection result of the primer and probe combination of the selected molecular marker has better sensitivity and slightly weaker specificity than other combinations. Since 73% of malignant samples are early breast cancer samples, the molecular marker combination also has high sensitivity and specificity for early breast cancer detection.
选取表2.2中分子标记物组合A与随机C和J进行建模分析,其在测试集中AUC为0.8969(特异性:95%;灵敏性:65%),ROC如图6b所示。可见所述所选分子标记物的引物和探针检测结果,具有较好的灵敏度,特异性稍弱于其他组合。由于样本中有73%的恶性样 本为乳腺癌早期样本,因此该分子标记物的引物和探针的PCR检测结果对早期乳腺癌检测也有较高的灵敏性和特异性。The molecular marker combination A and random C and J in Table 2.2 were selected for modeling analysis, and the AUC in the test set was 0.8969 (specificity: 95%; sensitivity: 65%), and the ROC was shown in Figure 6b. It can be seen that the detection results of the primers and probes of the selected molecular markers have better sensitivity and slightly weaker specificity than other combinations. Since 73% of malignant samples in the samples are early breast cancer samples, the PCR detection results of the primers and probes of this molecular marker also have high sensitivity and specificity for early breast cancer detection.
选取表2.2中分子标记物组合A、B组全与随机C和J进行建模分析,其在测试集中AUC为0.9163(特异性:95%;灵敏性:75%),ROC如图6c所示。可见所选分子标记物的引物和探针的PCR检测结果具有较好的灵敏度,特异性稍弱于其他组合。由于样本中有73%的恶性样本为乳腺癌早期样本,因此该分子标记物的引物和探针的PCR检测结果对早期乳腺癌检测也有较高的灵敏性和特异性。The molecular marker combinations A and B in Table 2.2 were selected for modeling analysis and random C and J were used for modeling analysis. The AUC in the test set was 0.9163 (specificity: 95%; sensitivity: 75%), and the ROC was shown in Figure 6c . It can be seen that the PCR detection results of the primers and probes of the selected molecular markers have better sensitivity, and the specificity is slightly weaker than other combinations. Since 73% of the malignant samples are early breast cancer samples, the PCR detection results of the primers and probes of this molecular marker also have high sensitivity and specificity for early breast cancer detection.
选取全部23个分子标记物组合进行建模分析,其在测试集中AUC为0.9481(特异性:95%;灵敏性:85%),ROC如图6d所示。可见所述所选分子标记物的引物和探针的PCR检测结果,具有较好的灵敏度,特异性稍弱于其他组合。由于样本中有73%的恶性样本为乳腺癌早期样本,因此该分子标记物的引物和探针的PCR检测结果对早期乳腺癌检测也有较高的灵敏性和特异性。All 23 molecular marker combinations were selected for modeling analysis, and their AUC in the test set was 0.9481 (specificity: 95%; sensitivity: 85%), and the ROC was shown in Figure 6d. It can be seen that the PCR detection results of the primers and probes of the selected molecular markers have better sensitivity and weaker specificity than other combinations. Since 73% of the malignant samples are early breast cancer samples, the PCR detection results of the primers and probes of this molecular marker also have high sensitivity and specificity for early breast cancer detection.
以上所述实施例的各技术特征可以进行任意的组合,为使描述简洁,未对上述实施例中的各个技术特征所有可能的组合都进行描述,然而,只要这些技术特征的组合不存在矛盾,都应当认为是本说明书记载的范围。The technical features of the above-described embodiments can be combined arbitrarily. For the sake of brevity, all possible combinations of the technical features in the above-described embodiments are not described. However, as long as there is no contradiction between the combinations of these technical features, All should be regarded as the scope described in this specification.
以上所述实施例仅表达了本发明的几种实施方式,其描述较为具体和详细,但并不能因此而理解为对发明专利范围的限制。应当指出的是,对于本领域的普通技术人员来说,在不脱离本发明构思的前提下,还可以做出若干变形和改进,这些都属于本发明的保护范围。因此,本发明专利的保护范围应以所附权利要求为准。The above-mentioned embodiments only represent several embodiments of the present invention, and the descriptions thereof are specific and detailed, but should not be construed as a limitation on the scope of the invention patent. It should be pointed out that for those of ordinary skill in the art, without departing from the concept of the present invention, several modifications and improvements can also be made, which all belong to the protection scope of the present invention. Therefore, the protection scope of the patent of the present invention should be subject to the appended claims.