TW201829781A - Method for screening for gene marker of intestinal cancer and/or stomach cancer, gene marker for screening via method, and application of gene marker - Google Patents

Method for screening for gene marker of intestinal cancer and/or stomach cancer, gene marker for screening via method, and application of gene marker Download PDF

Info

Publication number
TW201829781A
TW201829781A TW106146503A TW106146503A TW201829781A TW 201829781 A TW201829781 A TW 201829781A TW 106146503 A TW106146503 A TW 106146503A TW 106146503 A TW106146503 A TW 106146503A TW 201829781 A TW201829781 A TW 201829781A
Authority
TW
Taiwan
Prior art keywords
gene marker
hmc
gene
marker
cancer
Prior art date
Application number
TW106146503A
Other languages
Chinese (zh)
Other versions
TWI647312B (en
Inventor
陸星宇
宋艷群
Original Assignee
大陸商上海易華恩基因科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 大陸商上海易華恩基因科技有限公司 filed Critical 大陸商上海易華恩基因科技有限公司
Publication of TW201829781A publication Critical patent/TW201829781A/en
Application granted granted Critical
Publication of TWI647312B publication Critical patent/TWI647312B/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • C12Q1/6886Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/158Expression markers

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Organic Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Engineering & Computer Science (AREA)
  • Immunology (AREA)
  • Pathology (AREA)
  • Analytical Chemistry (AREA)
  • Zoology (AREA)
  • Genetics & Genomics (AREA)
  • Wood Science & Technology (AREA)
  • Physics & Mathematics (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Hospice & Palliative Care (AREA)
  • Biophysics (AREA)
  • Oncology (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

Provided are a gene marker for detecting stomach cancer, an application of the gene marker, and a method for using the gene marker to detect stomach cancer.

Description

用於篩選腸癌的基因標誌物的方法、用該方法篩選的基因標誌物及其用途Method for screening gene markers of intestinal cancer, gene markers screened by the method, and uses thereof

本發明係涉及通過高通量定序篩選基因標誌物的領域。具體地,本發明涉及通過高通量定序對腸癌的基因標誌物進行篩選的方法,以及利用該方法篩選出的基因標誌物及其用途。The present invention relates to the field of screening for genetic markers by high throughput sequencing. In particular, the present invention relates to a method for screening a genetic marker for intestinal cancer by high-throughput sequencing, and a genetic marker screened by the method and use thereof.

隨著生活環境和習慣的改變,近些年來癌症的發病率在世界範圍內逐年升高,人們對其關注也隨之增加。胃癌和腸癌分別是我國第二大和第三大高發癌,屬於發病率和病死率都非常高的惡性腫瘤,它們的發生和發展是一個多因素參與和多階段累積致癌的複雜過程。本著現有通行的癌症早發現早治療方案,越早期發現,越有可能早期控制甚至治癒癌症。如何對胃癌和腸癌進行早期篩檢,及時治療,正確地判斷預後,逐漸受到人們重視。With the changes in living environment and habits, the incidence of cancer has increased year by year in the world, and people's attention has also increased. Gastric cancer and intestinal cancer are the second and third most common cancers in China, respectively. They are malignant tumors with very high morbidity and mortality. Their occurrence and development is a complex process involving multi-factor participation and multi-stage cumulative carcinogenesis. With the existing early treatment of early detection of cancer, the earlier it is discovered, the more likely it is to control or even cure cancer early. How to conduct early screening and timely treatment of gastric cancer and intestinal cancer, and correctly judge the prognosis, has gradually received attention.

傳統篩查方法有胃腸鏡檢查和大便隱血試驗,但都有自身不可克服的缺點。胃腸鏡加病理活檢被認為是胃腸癌症篩查和診斷的金標準,但是因為鏡檢的侵入性和腸道準備的不適,很多病人不願意接受腸胃鏡檢查。最近有研究表明腸鏡對右側結腸內常見的扁平息肉的診斷效果不好,腸鏡篩查並不能降低右側結腸癌的發病率。大便隱血試驗是另一種臨床上常用的大腸癌篩查方法,該法有完全無創和廉價等優點,但是其檢測腫瘤的準確性低,通過免疫化學的改進方法也僅能檢測到50-60%的大腸癌和30%左右的癌前腺瘤。並且,該檢測方法假陽性率較高,故很難得到推廣和普及。Traditional screening methods include gastrointestinal endoscopy and fecal occult blood tests, but all have their own insurmountable shortcomings. Gastrointestinal plus pathological biopsy is considered the gold standard for screening and diagnosis of gastrointestinal cancer, but many patients are reluctant to undergo gastroscopic examination because of the invasiveness of microscopic examination and discomfort in bowel preparation. Recently, studies have shown that colonoscopy is not good for the diagnosis of flat polyps commonly found in the right colon. Colonoscopy does not reduce the incidence of right colon cancer. Fecal occult blood test is another clinically used colorectal cancer screening method. It has the advantages of being completely non-invasive and inexpensive, but its accuracy in detecting tumors is low. Only 50-60% can be detected by immunochemical improvement. Colorectal cancer and about 30% of precancerous adenomas. Moreover, the false positive rate of the detection method is high, so it is difficult to get popularization and popularization.

除以上傳統篩查方法以外,人們還在不斷探索其他方法,以提高早期篩檢率,腫瘤標誌物是其中之一。例如,通過DHPLC分析發現T1151A作為錯配修復基因hMLH1上的一個多態位點,可作為胃及大腸腫瘤,尤其是低齡胃及大腸腫瘤高危人群篩選的候選指標(參見張曉梅等,《腫瘤》,2005,25(1):62-65)。採用CA19-9(糖類胃腸癌相關抗原)和SA(唾液酸)的腫瘤標誌物聯合檢測對結/直腸癌和胃/賁門癌的檢測陽性率分別可以達到76.7%和82.5% (參見蔡發成等,《實用醫技雜誌》,2005,12(3B):722-723)。此外,我國還開發了C12蛋白晶片檢測系統,通過分析血清中CA19-9、神經元特異性烯醇化酶(NSE)、癌胚抗原(CEA)、糖原242(CA242)、糖原125(CA125)、糖原153 (CA15-3)、甲胎蛋白(AFP)、鐵蛋白、人絨毛膜促性腺激素(HCG)、人生長激素(HGH)中這十種腫瘤標誌物的表達水準,以實現腫瘤的早期篩檢和監測。然而,大多數發現的腫瘤標誌物還停留在研究階段,並沒有發展到臨床篩檢階段;或者已經應用在臨床篩檢上,但檢測準確度或靈敏度不高。例如,研究發現,上述C12系統對胃腸癌患者的總體篩檢率為39.21%,I、II、III、IV期患者的篩檢率分別為13.73%、33.33%、38.30%、58.03%,表明C12檢測系統對晚期胃腸癌的篩檢有一定的價值,但對早期的敏感性不高(參見楊雪琴等,《南京醫科大學學報(自然科學版)》,2008(10):1285-1289)。In addition to the above traditional screening methods, people continue to explore other methods to improve early screening rate, tumor markers are one of them. For example, by THPLC analysis, T1151A was identified as a polymorphic site on the mismatch repair gene hMLH1, which can be used as a candidate for screening gastric and colorectal tumors, especially for high-risk populations of younger stomach and large intestine tumors (see Zhang Xiaomei et al., Oncology). 2005, 25(1): 62-65). The combined detection of tumor markers using CA19-9 (glucose gastrointestinal cancer-associated antigen) and SA (sialic acid) combined detection of colorectal cancer and gastric/cardiac cancer can reach 76.7% and 82.5%, respectively (see Cai Facheng et al. Journal of Practical Medical Technology, 2005, 12 (3B): 722-723). In addition, China has also developed a C12 protein wafer detection system by analyzing serum CA19-9, neuron-specific enolase (NSE), carcinoembryonic antigen (CEA), glycogen 242 (CA242), glycogen 125 (CA125). ), the expression level of these ten tumor markers in glycogen 153 (CA15-3), alpha-fetoprotein (AFP), ferritin, human chorionic gonadotropin (HCG), and human growth hormone (HGH) Early screening and monitoring of tumors. However, most of the tumor markers found remain at the research stage and have not progressed to the clinical screening stage; or have been applied to clinical screening, but the detection accuracy or sensitivity is not high. For example, the study found that the overall screening rate of the above C12 system for patients with gastrointestinal cancer was 39.21%, and the screening rates for patients with stage I, II, III, and IV were 13.73%, 33.33%, 38.30%, and 58.03%, respectively, indicating C12. The detection system has certain value for the screening of advanced gastrointestinal cancer, but it is not sensitive to the early stage (see Yang Xueqin et al., Journal of Nanjing Medical University (Natural Science Edition), 2008 (10): 1285-1289).

近期,在現有常規基因標記物的基礎上又發展了另外兩種腸癌的體外篩檢方法:來自美國Exact Sciences公司的Cologuard技術和來自Epigenomics公司的Epi proColon技術。前者主要檢測糞便中的異常DNA變化及人類糞便中存在的潛血紅蛋白;後者檢測血液游離DNA中的SEPT9 甲基化標記物。大規模臨床實驗表明,Cologuard的檢出率效果較好,但需要操作糞便,使用者體驗較差;而SEPT9甲基化屬於液體活檢,雖然用戶體驗較好,但是特異性和靈敏度都較差(分別為80%和72%)。Recently, two other in vitro screening methods for intestinal cancer have been developed based on existing conventional genetic markers: Cologuard technology from Exact Sciences, Inc. and Epi proColon technology from Epigenomics. The former mainly detects abnormal DNA changes in feces and occult hemoglobin present in human feces; the latter detects SEPT9 methylation markers in blood free DNA. Large-scale clinical trials have shown that Cologuard's detection rate is better, but it requires manipulation of feces, and the user experience is poor. SEPT9 methylation is a liquid biopsy. Although the user experience is better, the specificity and sensitivity are poor (respectively 80% and 72%).

因此迫切需要具有高特異性和高靈敏度並可供臨床檢測使用的腸癌的基因標誌物,使能夠以無創或微創、快捷的方法有效篩檢腸癌,以改善病人接受長期監測的意願。Therefore, there is an urgent need for genetic markers of intestinal cancer with high specificity and high sensitivity for clinical testing, which enables effective screening of intestinal cancer in a non-invasive or minimally invasive and rapid manner to improve the patient's willingness to undergo long-term monitoring.

發明人通過對正常樣品和腸癌或胃癌樣品進行高通量定序,並對其中各基因上的5-羥甲基胞嘧啶(5-hmC)含量進行分析,出乎意料地發現了多個極具資訊的可用於檢測腸癌或胃癌的基因標誌物。The inventors unexpectedly discovered multiple high-throughput sequencing of normal samples and intestinal or gastric cancer samples, and analysis of 5-hydroxymethylcytosine (5-hmC) content in each gene. A highly informative genetic marker for detecting intestinal or gastric cancer.

因此,本發明的第一個方面涉及用於檢測腸癌的基因標誌物,包括一個或多個選自以下的基因:ADAM金屬肽酶域20 (ADAM20 )、F盒和富亮氨酸重複蛋白7(FBXL7 )、卵泡抑素(FST )、TP53凋亡效應器(PERP )、普列克底物蛋白同源相似域家族A成員3 (PHLDA3 )、Runt相關轉錄因子1移動伴侶1(RUNX1T1 )、互養蛋白γ2 (SNTG2 )、精子相關抗原4 (SPAG4 )、硫酸酯酶1 (SULF1 )和NME/NM23核苷二磷酸激酶(NME3 )。優選的,所述基因標誌物包括至少兩個、至少三個、至少四個、至少五個、至少六個、至少七個、至少八個、至少九個或十個選自以下的基因:ADAM20 FBXL7 FST PERP PHLDA3 RUNX1T1 SNTG2 SPAG4 SULF1NME3 。更優選的,所述基因標誌物包括ADAM20 FBXL7 FST PERP PHLDA3 RUNX1T1 SNTG2 SPAG4 SULF1NME3Accordingly, a first aspect of the invention relates to a genetic marker for detecting intestinal cancer comprising one or more genes selected from the group consisting of ADAM metal peptidase domain 20 ( ADAM20 ), F-box and leucine-rich repeat protein 7 ( FBXL7 ), follistatin ( FST ), TP53 apoptotic effector ( PERP ) , Pleek substrate homologous domain family A member 3 ( PHLDA3 ) , Runt - related transcription factor 1 mobile partner 1 ( RUNX1T1 ) Mutulin γ2 ( SNTG2 ), sperm-associated antigen 4 ( SPAG4 ), sulfatase 1 ( SULF1 ) and NME/NM23 nucleoside diphosphate kinase ( NME3 ). Preferably, the genetic marker comprises at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine or ten genes selected from the group consisting of: ADAM20 , FBXL7 , FST , PERP , PHLDA3 , RUNX1T1 , SNTG2 , SPAG4 , SULF1, and NME3 . More preferably, the genetic markers include ADAM20 , FBXL7 , FST , PERP , PHLDA3 , RUNX1T1 , SNTG2 , SPAG4 , SULF1 and NME3 .

本發明的第二個方面涉及用於檢測胃癌的基因標誌物,包括一個或多個選自以下的基因:Rho GTP酶啟動蛋白28 (ARHGAP28 )、BMP結合內皮調控者(BMPER )、染色體9開放讀碼框92 (C9orf92 )、鈣依賴分泌啟動者2 (CADPS2 )、鈣黏蛋白11 (CDH11 )、F盒和富亮氨酸重複蛋白7(FBXL7 )、間質同源框2 (MEOX2 )、氧化一氮合成酶1 (NOS1 )、抑瘤素M受體(OSMR )、細胞膜調控蛋白paralemmin 2 (PALM2 )、磷酸二酯酶10A (PDE10A )、RNA結合模體單鏈相互作用蛋白3 (RBMS3)、硫酸酯酶1 (SULF1 )、wntless Wnt配體分泌調節者(WLS )、Wilms瘤1作用蛋白(WTIP )、鋅指蛋白518B (ZNF518B )、鋅指蛋白714 (ZNF714 )和Rad52模體包含1 (RDM1 )。優選的,所述基因標誌物包括至少兩個、至少三個、至少四個、至少五個、至少六個、至少七個、至少八個、至少九個、至少十個、至少11個、至少12個、至少13個、至少14個、至少15個、至少16個、至少17個或18個選自以下的基因:ARHGAP28 BMPER C9orf92 CADPS2 CDH11 FBXL7 MEOX2 NOS1 OSMR PALM2 PDE10A RBMS3 SULF1 WLS WTIP ANF518B ZNF714RDM1 。更優選的,所述基因標誌物包括ARHGAP28 BMPER C9orf92 CADPS2 CDH11 FBXL7 MEOX2 NOS1 OSMR PALM2 PDE10A RBMS3 SULF1 WLS WTIP ANF518B ZNF714RDM1A second aspect of the invention relates to a genetic marker for detecting gastric cancer comprising one or more genes selected from the group consisting of Rho GTPase promoter protein 28 ( ARHGAP28 ), BMP binding endothelial regulator ( BMPER ), chromosome 9 open Reading frame 92 ( C9orf92 ), calcium-dependent secretion promoter 2 ( CADPS2 ), cadherin 11 ( CDH11 ), F-box and leucine-rich repeat protein 7 ( FBXL7 ), mesenchymal homeobox 2 ( MEOX2 ), Nitric oxide synthase 1 ( NOS1 ), oncostatin M receptor ( OSMR ), cell membrane regulatory protein paralemmin 2 ( PALM2 ), phosphodiesterase 10A ( PDE10A ), RNA-binding motif single-chain interacting protein 3 (RBMS3) ), sulfatase 1 ( SULF1 ), wntless Wnt ligand secretion regulator ( WLS ), Wilms tumor 1 acting protein ( WTIP ), zinc finger protein 518B ( ZNF518B ), zinc finger protein 714 ( ZNF714 ), and Rad52 motif 1 ( RDM1 ). Preferably, the genetic marker comprises at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17 or 18 genes selected from: ARHGAP28, BMPER, C9orf92, CADPS2 , CDH11, FBXL7, MEOX2, NOS1, OSMR, PALM2 , PDE10A , RBMS3 , SULF1 , WLS , WTIP , ANF518B , ZNF714, and RDM1 . More preferably, the marker gene comprising ARHGAP28, BMPER, C9orf92, CADPS2, CDH11, FBXL7, MEOX2, NOS1, OSMR, PALM2, PDE10A, RBMS3, SULF1, WLS, WTIP, ANF518B, ZNF714 and RDM1.

本發明還涉及上述基因標誌物在檢測腸癌或胃癌中的用途。本發明還涉及利用上述基因標誌物進行腸癌或胃癌檢測的試劑盒,其包括用於測定上述基因標誌物的5-hmC含量的試劑和說明書。The invention also relates to the use of the above genetic markers for detecting intestinal or gastric cancer. The present invention also relates to a kit for detecting intestinal cancer or gastric cancer using the above gene marker, which comprises reagents and instructions for determining the 5-hmC content of the above gene marker.

本發明的第三個方面涉及用於檢測胃癌或腸癌的方法,包括以下步驟: (a)測定正常樣品和受試者樣品中本發明所述的基因標誌物的5-hmC的含量; (b)用正常樣品中所述基因標誌物的5-hmC含量作為參照,將受試者樣品中對應的基因標誌物的5-hmC含量標準化; (c)對經標準化的所述基因標誌物的5-hmC含量進行數學關聯,並獲得評分;和 (d)根據所述評分獲得檢測結果。A third aspect of the invention relates to a method for detecting gastric cancer or intestinal cancer, comprising the steps of: (a) determining the content of 5-hmC of the genetic marker of the present invention in a normal sample and a sample of a subject; b) normalizing the 5-hmC content of the corresponding gene marker in the subject sample using the 5-hmC content of the gene marker in the normal sample as reference; (c) on the standardized gene marker The 5-hmC content was mathematically correlated and scored; and (d) the test results were obtained based on the score.

在一個實施方案中,所述樣品是受試者或正常人體液中游離的DNA片段,或來源於胞器、細胞以及組織中的完整基因組DNA。其中,體液是血液、尿液、汗液、痰液、糞便、腦脊液、腹水、胸水、膽汁、胰腺液等。In one embodiment, the sample is a free DNA fragment in a subject or normal human body fluid, or is derived from intact genomic DNA in cells, cells, and tissues. Among them, body fluids are blood, urine, sweat, sputum, feces, cerebrospinal fluid, ascites, pleural effusion, bile, pancreatic juice, and the like.

在一個實施方案中,本發明所述的基因標誌物的5-hmC含量可通過本領域技術人員已知的任何方法進行測定,例如包括但不限於,葡糖基化法、限制性內切酶法、化學標記法、與高通量定序方法聯用的沉澱法、單分子即時定序法(SMRT)、氧化重亞硫酸鹽定序法(OxBS-Seq)等。葡糖基化法的原理是採用T4噬菌體β -葡萄糖轉移酶(β -GT),在葡萄糖供體受質(substrate)尿核苷二磷酸葡萄糖(UDP-Glu)存在下,將葡萄糖轉移至羥基位置,從而生成β -葡萄糖基-5-羥甲基胞嘧啶(5-ghmC)。同時可採用同位素標記受質進行定量。在葡糖基化法基礎上進一步發展出限制性內切酶法和化學標記法。限制性內切酶法的原理是:葡糖基化反應改變了一些限制性內切酶的酶切特性。甲基化依賴的限制性內切酶MspI和HpaII 可識別同樣的序列(CCGG),但它們對甲基化狀態的敏感性是不同: MspI 識別並切割 5-甲基胞嘧啶(5-mC)和 5-hmC,但不能切割5-ghmC; HpaII 只切割完全未修飾的位點,胞嘧啶上的任何修飾(5-mC、5-hmC、5-ghmC)均阻礙切割。若CpG 位點含有5-hmC,那麼糖基化、酶解之後能檢測到條帶,未糖基化對照反應中沒有條帶; 同時可採用qPCR 進行定量分析。另外,其他限制性內切酶也同樣存在阻礙5-ghmC 酶切的情況,可應用於5-hmC 檢測(如: GmrSD,MspJI,PvuRts1I,Taq I 等)。化學標記法的原理是:將酶反應受質上的葡萄糖進行化學修飾轉變成UDP-6-N3-glucose,將6-N3-glucose 轉移到羥甲基位置,生成N3-5ghmC。隨後,通過點擊化學方法在每個5-hmC上添加一分子生物素,結合下一代高通量DNA定序技術或單分子定序技術,可分析5-hmC在基因組DNA中的分佈情況。沉澱法是將5-hmC用特殊方式修飾後再將其特異性地從基因組DNA中捕獲下來,並進行定序分析。氧化重亞硫酸鹽定序法是首個以單鹼基解析度對5-hmC 進行定量定序的方法。首先將5-hmC 進行KRuO4 氧化處理,生成5-甲醯胞嘧啶(5fC),然後採用重亞硫酸鹽定序。在此過程中,5-hmC先氧化為5fC,而後脫氨形成U。通常,同時採用多種檢測方法對5-hmC進行定量檢測。In one embodiment, the 5-hmC content of the genetic markers of the invention can be determined by any method known to those skilled in the art, including, for example, but not limited to, glucosylation, restriction endonucleases Method, chemical labeling method, precipitation method combined with high-throughput sequencing method, single molecule immediate sequencing method (SMRT), oxidized bisulfite sequencing method (OxBS-Seq), and the like. The principle of the glucosylation method is to transfer glucose to a hydroxyl group in the presence of a glucose donor substrate, urinary nucleoside diphosphate glucose (UDP-Glu), using T4 bacteriophage β -glucose transferase ( β- GT). Position, thereby producing β -glucosyl-5-hydroxymethylcytosine (5-ghmC). At the same time, isotopically labeled receptors can be used for quantification The restriction endonuclease method and the chemical labeling method were further developed on the basis of the glucosylation method. The principle of the restriction endonuclease method is that the glucosylation reaction changes the enzymatic cleavage properties of some restriction enzymes. The methylation-dependent restriction enzymes MspI and HpaII recognize the same sequence (CCGG), but their sensitivity to methylation status is different: MspI recognizes and cleaves 5-methylcytosine (5-mC) And 5-hmC, but not 5-ghmC; HpaII only cleaves completely unmodified sites, and any modification on cytosine (5-mC, 5-hmC, 5-ghmC) blocks cleavage. If the CpG locus contains 5-hmC, the band can be detected after glycosylation and enzymatic hydrolysis, and there is no band in the unglycosylated control reaction; qPCR can be used for quantitative analysis. In addition, other restriction enzymes also have a hindrance to 5-ghmC digestion, and can be applied to 5-hmC detection (eg, GmrSD, MspJI, PvuRts1I, Taq I, etc.). The principle of chemical labeling is to chemically modify the glucose of the enzyme reaction to UDP-6-N3-glucose, and transfer 6-N3-glucose to the position of hydroxymethyl to form N3-5ghmC. Subsequently, a single molecule of biotin was added to each 5-hmC by click chemistry, combined with next-generation high-throughput DNA sequencing technology or single-molecule sequencing technology to analyze the distribution of 5-hmC in genomic DNA. The precipitation method is to modify 5-hmC in a special way and then specifically capture it from genomic DNA and perform sequence analysis. The oxidized bisulfite sequencing method is the first method to quantitatively sequence 5-hmC with a single base resolution. First, 5-hmC was subjected to KRuO4 oxidation treatment to produce 5-mercaptocytosine (5fC), which was then subjected to bisulfite sequencing. In this process, 5-hmC is first oxidized to 5fC, and then deaminated to form U. Usually, quantitative detection of 5-hmC is performed simultaneously using a variety of detection methods.

在本發明的一個實施方案中,利用化學標記法結合高通量定序來測定本發明的基因標誌物的5-hmC含量。在該具體的實施方案中,測定本發明的基因標誌物的5-hmC含量的方法包括以下步驟:將來自腸癌或胃癌患者和正常人的樣品的DNA片段化;將所述片段化的DNA末端修復並末端補齊;將末端補齊的DNA與定序接頭連接,獲得連接產物;通過標記反應對連接產物中的5-羥甲基胞嘧啶進行標記;收集濃化(enrichment、或稱之為富集)含有5-羥甲基胞嘧啶標記的DNA片段,獲得富集產物;對富集產物進行PCR擴增,獲得定序基因庫;對定序基因庫進行高通量定序,獲得定序結果;根據定序結果確定5-羥甲基胞嘧啶在基因上的含量。其中,標記反應包括:i)利用糖基轉移酶將帶有修飾基團的糖共價連接到5-羥甲基胞嘧啶的羥甲基上,和ii) 將直接或間接連有生物素的點擊化學受質與帶有修飾基團的5-羥甲基胞嘧啶反應。其中,步驟i)和步驟ii)可以按順序進行,也可以在一個反應中同時進行。這種標記方法減少了定序所需的樣本量,且5-羥甲基胞嘧啶上的生物素標籤使其在定序中顯示出更高的動力學信號,提高了核苷酸識別的準確性。在該實施方案中,所述糖基轉移酶包括但不限於: T4噬菌體β-葡糖基轉移酶(β-GT)、 T4噬菌體α-葡糖基轉移酶(α-GT)及其具有相同或相似活性的衍生物、類似物、或重組酶;所述帶有修飾基團的糖包括但不限於:帶有疊氮修飾的糖類(例如6-N3-葡萄糖)或帶有其他化學修飾(例如羰基、巰基、羥基、羧基、碳-碳雙鍵、碳-碳三鍵、二硫鍵、胺基、醯胺基、雙烯等)的糖類,其中優選帶有疊氮修飾的糖類;所述用於間接連接生物素和點擊化學受質的化學基團包括但不限於:羰基、巰基、羥基、羧基、碳-碳雙鍵、碳-碳三鍵、二硫鍵、胺基、醯胺基、雙烯。在該實施方案中,優選通過固相材料來收集濃化含有5-hmC標記的DNA片段。具體地,可以通過固相親和反應或其他特異性結合反應將含有5-羥甲基胞嘧啶標記的DNA片段結合在固相材料上,然後通過多次洗滌去除未結合的DNA片段。固相材料包括但不限於帶有表面修飾的矽片或其他晶片,例如人工高分子小球(優選直徑為1 nm-100 mm)、磁性小球(優選直徑為1 nm-100 mm)、瓊脂糖小球等(優選直徑為1 nm-100 mm)。固相富集中所用的洗滌液是本領域技術人員熟知的緩衝液,包括但不限於:含有Tris-HCl、MOPS、HEPES (pH=6.0-10.0,濃度在1 mM到1 M之間)、NaCl (0-2M)或表面活性劑如Tween20 (0.01%-5%)的緩衝液。在該實施方案中,優選直接在固相上進行PCR擴增從而製備定序基因庫。如有需要,在固相上進行PCR擴增後,可以回收擴增產物後進行第二輪PCR擴增來製備定序基因庫。所述第二輪PCR擴增可用本領域技術人員已知的常規方法進行。任選地,在製備定序基因庫的過程中可進一步包括一個或多個純化步驟。本領域技術人員知曉的或可商購的任何純化試劑盒均可用於本發明。純化方法包括但不限於:凝膠電泳切膠回收、矽膠膜離心柱法、磁珠法、乙醇或異丙醇沉澱法或其組合。任選地,在高通量定序之前,對定序基因庫進行品質檢查。例如,對文庫進行片段大小分析並使用qPCR方法對文庫的濃度進行絕對定量。通過品質檢查的定序基因庫可用於高通量定序。然後將一定數量(1-96個)含有不同barcode的文庫按相同濃度混勻並根據二代定序儀的標準上機方法上機定序,獲得定序結果。本領域已知的各種二代定序平臺及其相關的試劑可用於本發明。In one embodiment of the invention, the 5-hmC content of the genetic markers of the invention is determined by chemical labeling in combination with high throughput sequencing. In this particular embodiment, the method of determining the 5-hmC content of a genetic marker of the present invention comprises the steps of: fragmenting DNA from a sample of a patient with a bowel or gastric cancer and a normal human; the fragmented DNA The ends are repaired and end-filled; the end-filled DNA is ligated to the sequencing linker to obtain a ligation product; the 5-hydroxymethylcytosine in the ligation product is labeled by a labeling reaction; enrichment or enrichment is collected. To enrich the DNA fragment containing 5-hydroxymethylcytosine labeling to obtain an enriched product; PCR amplification of the enriched product to obtain a sequenced gene pool; high-throughput sequencing of the sequenced gene pool Sequencing results; the genetic content of 5-hydroxymethylcytosine was determined based on the sequencing results. Wherein the labeling reaction comprises: i) covalent attachment of a sugar having a modifying group to a methylol group of 5-hydroxymethylcytosine using a glycosyltransferase, and ii) direct or indirect attachment of biotin Click on the chemical acceptor to react with 5-hydroxymethylcytosine with a modifying group. Wherein step i) and step ii) may be carried out sequentially or simultaneously in one reaction. This labeling method reduces the amount of sample required for sequencing, and the biotin tag on 5-hydroxymethylcytosine gives it a higher kinetic signal in sequencing, which improves the accuracy of nucleotide recognition. Sex. In this embodiment, the glycosyltransferases include, but are not limited to, T4 phage β-glucosyltransferase (β-GT), T4 phage α-glucosyltransferase (α-GT), and the same Or a similarly active derivative, analog, or recombinase; the saccharide with a modifying group includes, but is not limited to, a saccharide with an azide modification (eg, 6-N3-glucose) or with other chemical modifications ( a saccharide such as a carbonyl group, a thiol group, a hydroxyl group, a carboxyl group, a carbon-carbon double bond, a carbon-carbon triple bond, a disulfide bond, an amine group, a decylamino group, a diene or the like, wherein a saccharide having an azide modification is preferred; Chemical groups for indirect attachment of biotin and click chemical acceptors include, but are not limited to, carbonyl, sulfhydryl, hydroxy, carboxy, carbon-carbon double bonds, carbon-carbon triple bonds, disulfide bonds, amine groups, guanamine Base, diene. In this embodiment, the concentrated DNA fragment containing the 5-hmC label is preferably collected by a solid phase material. Specifically, a DNA fragment containing a 5-hydroxymethylcytosine label can be bound to a solid phase material by a solid phase affinity reaction or other specific binding reaction, and then the unbound DNA fragment can be removed by multiple washings. Solid phase materials include, but are not limited to, bracts or other wafers with surface modifications, such as artificial polymer beads (preferably 1 nm-100 mm in diameter), magnetic beads (preferably 1 nm-100 mm in diameter), agar Sugar globules, etc. (preferably 1 nm to 100 mm in diameter). The washing liquid used for solid phase enrichment is a buffer well known to those skilled in the art, including but not limited to: containing Tris-HCl, MOPS, HEPES (pH=6.0-10.0, concentration between 1 mM and 1 M), NaCl (0-2M) or a buffer such as Tween20 (0.01%-5%). In this embodiment, PCR amplification is preferably performed directly on the solid phase to prepare a sequencing gene pool. If necessary, after performing PCR amplification on the solid phase, the amplification product can be recovered and subjected to a second round of PCR amplification to prepare a sequencing gene pool. The second round of PCR amplification can be performed using conventional methods known to those skilled in the art. Optionally, one or more purification steps may be further included in the process of preparing the sequencing gene pool. Any purification kit known or commercially available to those skilled in the art can be used in the present invention. Purification methods include, but are not limited to, gel electrophoresis gel extraction, silicone film spin column method, magnetic bead method, ethanol or isopropanol precipitation method, or a combination thereof. Optionally, a quality check is performed on the sequencing gene pool prior to high throughput sequencing. For example, the library is subjected to fragment size analysis and the concentration of the library is absolutely quantified using the qPCR method. A sequenced gene pool that passes quality checks can be used for high-throughput sequencing. A certain number (1-96) of libraries containing different barcodes were then mixed at the same concentration and sequenced according to the standard method of the second generation sequencer to obtain sequencing results. Various second generation sequencing platforms and related reagents known in the art can be used in the present invention.

在本發明的一個實施方案中,優選將定序結果與標準人類基因組參考序列進行比對,挑選出其中比對到本發明基因標誌物上的序列,即選擇比對位元點與基因特徵(如組蛋白修飾位點、轉錄因子結合位點、基因外顯子內含子區域以及基因啟動子等)重合區域的讀段數量,以代表5-hmC在該基因上的修飾水準,從而測定5-hmC在該基因標誌物上的含量。優選在進行比對前,首先將定序結果清除低品質定序位點,其中衡量定序位點品質的因素包括但不限於:鹼基品質、reads品質、GC含量、重複序列和Overrepresented 序列數量等。該步驟中涉及的各種比對軟體和分析方法是本領域已知的。In one embodiment of the invention, the sequencing results are preferably aligned with a standard human genome reference sequence, and sequences aligned to the gene markers of the invention are selected, ie, aligned bit positions and gene features are selected ( The number of reads of the coincident region, such as histone modification site, transcription factor binding site, gene exon intron region, and gene promoter, etc., to represent the modification level of 5-hmC on the gene, thereby determining 5 -hmC content on the gene marker. Preferably, prior to performing the alignment, the sequencing result is first cleared of low quality sequencing sites, wherein factors determining the quality of the sequencing sites include, but are not limited to, base quality, reads quality, GC content, repeat sequences, and number of Overrepresented sequences. Wait. The various alignment software and analytical methods involved in this step are known in the art.

在本發明的一個實施方案中,測定基因標誌物的5-hmC含量是指測定該基因標誌物全長上的5-hmC含量或測定該基因標誌物上某一片段的5-hmC含量或其組合。In one embodiment of the invention, determining the 5-hmC content of the genetic marker means determining the 5-hmC content of the full length of the genetic marker or determining the 5-hmC content of a fragment of the genetic marker or a combination thereof .

根據本發明,在測定各基因標誌物上5-hmC含量之後,用正常樣品中所述基因標誌物的5-hmC含量作為參照,將受試者樣品中對應的基因標誌物的5-hmC含量標準化。舉例而言,正常樣品和受試者樣品中同一基因標誌物的5-hmC含量分別為X和Y,則受試者樣品中該基因標誌物的標準化5-hmC含量為Y/X。According to the present invention, after determining the 5-hmC content of each gene marker, the 5-hmC content of the corresponding gene marker in the sample of the subject is used as a reference with the 5-hmC content of the gene marker in the normal sample. standardization. For example, the 5-hmC content of the same gene marker in the normal sample and the subject sample is X and Y, respectively, and the normalized 5-hmC content of the gene marker in the subject sample is Y/X.

根據本發明,在資料標準化後,對各基因標誌物的標準化5-hmC含量進行數學關聯以獲得評分,從而根據所述評分獲得檢測結果。如本文所用,“數學關聯”是指將來自生物樣品的基因標誌物的5-hmC含量與腸癌或胃癌篩檢結果相關聯的任何計算方法或機器學習方法。本領域普通技術人員理解,可選擇不同的計算方法或工具用於提供本發明的數學關聯,例如彈性網路正則化、決策樹、廣義線性模型、邏輯回歸、最高分值對、神經網路、線性和二次判別式分析(LQA 和QDA)、樸素貝葉斯、隨機森林和支持向量機。According to the present invention, after the data is standardized, the standardized 5-hmC content of each gene marker is mathematically correlated to obtain a score, thereby obtaining a detection result based on the score. As used herein, "mathematical association" refers to any computational or machine learning method that correlates the 5-hmC content of a genetic marker from a biological sample with an intestinal cancer or gastric cancer screening result. One of ordinary skill in the art will appreciate that different computing methods or tools may be selected to provide the mathematical associations of the present invention, such as elastic network regularization, decision trees, generalized linear models, logistic regression, highest score pairs, neural networks, Linear and quadratic discriminant analysis (LQA and QDA), naive Bayes, random forests, and support vector machines.

在本發明的一個實施方案中,對各基因標誌物的標準化5-hmC含量進行數學關聯並獲得評分的具體步驟如下:將各基因標誌物的標準化5-hmC含量乘以加權係數,獲得該基因標誌物的預測因子t;將各基因標誌物的預測因子t相加,獲得總預測因子T;將總預測因子T經過Logistic轉換獲得評分P;若P>0.5,則該受試者樣品患有腸癌或胃癌;若P<0.5,則該受試者樣品為正常。本文所述的加權係數是指在考慮可能影響5-hmC含量的因素(例如受試者地域、年齡、性別、低於、吸煙史、飲酒史、家族史等)的情況下,通過各種高級統計分析方法獲得的係數。In one embodiment of the present invention, the specific steps for mathematically correlating the standardized 5-hmC content of each gene marker and obtaining a score are as follows: multiplying the normalized 5-hmC content of each gene marker by a weighting coefficient to obtain the gene The predictor of the marker t; the predictor t of each gene marker is added to obtain a total predictor T; the total predictor T is subjected to Logistic conversion to obtain a score P; if P>0.5, the subject sample suffers Intestinal or gastric cancer; if P < 0.5, the subject sample is normal. The weighting factors described herein refer to various advanced statistics in consideration of factors that may affect the 5-hmC content (eg, subject area, age, gender, below, smoking history, drinking history, family history, etc.). The coefficients obtained by the analytical method.

與現有技術相比,本發明中用於檢測腸癌和/或胃癌的方法是基於基因標誌物上的5-hmC含量,因此可以使用更為廣泛的DNA樣品來源。因此,本發明中用於檢測腸癌和/或胃癌的方法具有以下幾個優點:(1)安全無創,即使無症狀人群也對該檢測接受度高;(2)DNA來源廣泛,不存在影像學中的檢測盲區;(3)準確性高,有較高的靈敏度和特異性;(4)操作方便,用戶體驗好,容易進行疾病動態監測。The method for detecting intestinal cancer and/or gastric cancer in the present invention is based on the 5-hmC content on the gene marker as compared with the prior art, and thus a wider source of DNA samples can be used. Therefore, the method for detecting intestinal cancer and/or gastric cancer in the present invention has the following advantages: (1) safe and non-invasive, and high acceptance of the test even in asymptomatic people; (2) wide source of DNA, no image In-school detection blind zone; (3) high accuracy, high sensitivity and specificity; (4) easy to operate, user experience is good, easy to monitor disease dynamics.

下面將參考附圖並結合實施例來詳細說明本發明,以使本領域的技術人員可以更好的理解本發明並能予以實施。需要說明的是,本領域的技術人員應該理解本發明的附圖及其實施例僅僅是為了說明的目的,並不能對本發明構成任何限制。在不矛盾的情況下,本申請中的實施例及實施例中的特徵可以相互組合。The invention will be described in detail below with reference to the drawings, in order to provide a better understanding of the invention. It should be understood that the drawings and the embodiments of the present invention are intended to be illustrative only and not restrictive. The embodiments of the present application and the features of the embodiments may be combined with each other without contradiction.

實施例Example 1.1. 腸癌基因標誌物的篩選Screening of colon cancer gene markers

(1)抽提血漿DNA:(1) Extraction of plasma DNA:

從來自15位腸癌患者和18位正常人的樣品中分別抽提10 ng血漿DNA。可利用本領域技術人員所熟知的任何適用於抽提血漿DNA的方法、和試劑進行此步驟。10 ng of plasma DNA was extracted from samples from 15 colon cancer patients and 18 normal subjects, respectively. This step can be carried out using any method and reagent suitable for extracting plasma DNA well known to those skilled in the art.

(2)將血漿DNA進行末端補齊、懸A並與定序接頭連接:(2) The plasma DNA is end-filled, suspended A and connected to the sequencing linker:

根據Kapa Hyper Perp Kit說明書製備含有50 mL 血漿DNA、7 mL End Repair & A-Tailing Buffer和3 mL End Repair & A-Tailing Enzyme mix的反應混合液(總體積為60 mL),在20ºC溫浴30分鐘,然後在65ºC溫浴30分鐘。在1.5 mL低吸附EP管中配置以下連接反應混合物:5 mL Nuclease free water,30 mL Ligation Buffer以及10 mL DNA Ligase。向45 mL連接反應混合物中加入5 mL的定序接頭,混合,於20ºC加熱20分鐘,然後保持於4ºC。使用AmpureXP beads對反應產物進行純化,用20 mL含Tris-HCl (10 mM,pH=8.0)及EDTA(0.1 mM)的緩衝液進行沖提獲得最終的DNA連接樣品。Prepare a reaction mixture containing 50 mL of plasma DNA, 7 mL of End Repair & A-Tailing Buffer and 3 mL of End Repair & A-Tailing Enzyme mix according to the Kapa Hyper Perp Kit instructions (total volume 60 mL), at 20 °C in a warm bath 30 Minutes, then warm at 65oC for 30 minutes. The following ligation reaction mixture was placed in a 1.5 mL low adsorption EP tube: 5 mL Nuclease free water, 30 mL Ligation Buffer and 10 mL DNA Ligase. Add 5 mL of the sequencing link to the 45 mL ligation reaction mixture, mix, heat at 20 °C for 20 minutes, then hold at 4 °C. The reaction product was purified using Ampure XP beads and eluted with 20 mL of a buffer containing Tris-HCl (10 mM, pH = 8.0) and EDTA (0.1 mM) to obtain a final DNA-ligated sample.

(3)標記5-羥甲基胞嘧啶:(3) Labeling 5-hydroxymethylcytosine:

製備總體積為26 mL的標記反應混合液:疊氮修飾的二磷酸尿苷葡萄糖(即UDP-N3-Glu,終濃度為50 mM)、β-GT(終濃度為1 mM)、Mg2+ (終濃度為25mM)、HEPES(pH=8.0,終濃度為50mM)和來自上述步驟的20 mL DNA。將混合液在37ºC溫浴1小時。取出混合液,用AmpureXP beads純化,獲得純化的20 mL DNA。Prepare a total of 26 mL of labeled reaction mixture: azide-modified uridine diphosphate glucose (ie UDP-N3-Glu, final concentration 50 mM), β-GT (final concentration 1 mM), Mg 2+ (final concentration 25 mM), HEPES (pH = 8.0, final concentration 50 mM) and 20 mL of DNA from the above procedure. The mixture was incubated at 37 ° C for 1 hour. The mixture was taken out and purified with Ampure XP beads to obtain purified 20 mL of DNA.

然後在上述純化的20 mL DNA中加入1 mL連接有生物素的二苯基環辛炔(DBCO-Biotin),於37ºC反應2小時,接著用AmpureXP beads純化,獲得純化的標記產物。Then, 1 mL of biotin-containing diphenylcyclooctyne (DBCO-Biotin) was added to the purified 20 mL of the above DNA, and reacted at 37 ° C for 2 hours, followed by purification with Ampure XP beads to obtain a purified labeled product.

(4)固相收集濃化含有標記的5-羥甲基胞嘧啶的DNA片段:(4) Solid phase collection of concentrated DNA fragments containing labeled 5-hydroxymethylcytosine:

首先,按以下步驟準備磁珠:取出0.5 mL C1 streptadvin beads (life technology)並加入100 mL緩衝液(5mM Tris,pH=7.5,1M NaCl,0.02% Tween20),渦旋混合30秒,然後用100 mL洗滌液(5mM Tris,pH=7.5,1M NaCl,0.02% Tween20)洗滌磁珠3次,最後加入25 mL結合緩衝液(10mM Tris,pH=7.5,2M NaCl, 0.04% Tween20或其他表面活性劑),並混合均勻。First, prepare the magnetic beads as follows: Remove 0.5 mL of C1 streptadvin beads (life technology) and add 100 mL of buffer (5 mM Tris, pH=7.5, 1 M NaCl, 0.02% Tween 20), vortex for 30 seconds, then use 100 The magnetic beads were washed 3 times with mL Wash (5 mM Tris, pH = 7.5, 1 M NaCl, 0.02% Tween 20), and finally 25 mL of binding buffer (10 mM Tris, pH = 7.5, 2 M NaCl, 0.04% Tween 20 or other surfactant) was added. ), and mix evenly.

然後,在磁珠混合液中加入上述步驟獲得的純化的標記產物,並在旋轉混合器中混合15 min使其充分結合。Then, the purified labeled product obtained in the above procedure was added to the magnetic bead mixture, and mixed for 15 min in a rotary mixer to sufficiently bind.

最後,用100 mL洗滌液(5mM Tris,pH=7.5,1M NaCl,0.02% Tween20)洗滌磁珠3次,離心去掉上清液,加入23.75 mL不含核酸酶的水。Finally, the magnetic beads were washed 3 times with 100 mL of washing solution (5 mM Tris, pH = 7.5, 1 M NaCl, 0.02% Tween 20), the supernatant was removed by centrifugation, and 23.75 mL of nuclease-free water was added.

(5)PCR擴增:(5) PCR amplification:

向上述步驟的最終體系中加入25 mL的2 X PCR master mix和1.25 mL PCR引子(總體積為50 mL),按照下述PCR反應循環的溫度和條件進行擴增: Add 25 mL of 2 X PCR master mix and 1.25 mL PCR primer (total volume 50 mL) to the final system of the above procedure and amplify according to the temperature and conditions of the PCR reaction cycle described below:

將擴增產物用AmpureXP beads純化,得到最終定序基因庫。The amplified product was purified using Ampure XP beads to obtain a final sequencing gene pool.

(6)對定序基因庫進行質檢後進行高通量定序:(6) High-throughput sequencing after quality inspection of the sequencing gene bank:

將獲得的定序基因庫通過qPCR進行濃度測定,並用Agilent2100對文庫中DNA片段大小含量進行確定。將通過質檢的定序基因庫以相同濃度混合,用Illumina Hiseq 4000進行定序。The obtained sequencing gene pool was subjected to concentration determination by qPCR, and the DNA fragment size content in the library was determined using Agilent 2100. The sequencing gene pools passed through the QC were mixed at the same concentration and sequenced with an Illumina Hiseq 4000.

(7)確定各基因標誌物的5-hmC含量和加權係數(7) Determine the 5-hmC content and weighting coefficient of each gene marker

將獲得的定序結果進行初步質控評估,清除低品質定序位點後,將達到定序品質標準的讀段利用Bowtie2工具與人類標準基因組參考序列進行比較。然後利用featureCounts和HtSeq-Count工具來統計讀段數量以確定各基因標誌物的5-hmC含量。同時利用高通量定序結果,將可能影響5-hmC含量的因素作為共變數,通過邏輯回歸和彈性網路正則化獲得各基因標誌物的加權係數。結果如表1所示。 表1:本發明的腸癌基因標誌物的平均標準化5-hmC含量和加權係數 The obtained sequencing results were subjected to preliminary quality control evaluation, and after the low-quality sequencing sites were removed, the readings that met the sequencing quality criteria were compared with the human standard genome reference sequence using the Bowtie 2 tool. The feature counts and HtSeq-Count tools were then used to count the number of reads to determine the 5-hmC content of each gene marker. At the same time, using high-throughput sequencing results, the factors that may affect the 5-hmC content were used as covariates, and the weighting coefficients of each gene marker were obtained by logistic regression and elastic network regularization. The results are shown in Table 1. Table 1: Average normalized 5-hmC content and weighting coefficient of intestinal cancer gene markers of the present invention

如上所述,平均標準化5-hmC含量是指腸癌樣品中該基因標誌物的平均5-hmC含量與正常樣品中同一基因標誌物的平均5-hmC含量之比。從表1可以看出,本發明的腸癌基因標誌物的5-hmC含量在正常樣品中和腸癌樣品中存在顯著差異,並且除NME3之外,其餘基因標誌物的5-hmC含量相對於正常人均顯著增加。As described above, the average normalized 5-hmC content refers to the ratio of the average 5-hmC content of the gene marker in the intestinal cancer sample to the average 5-hmC content of the same gene marker in the normal sample. As can be seen from Table 1, the 5-hmC content of the intestinal cancer gene marker of the present invention is significantly different between the normal sample and the intestinal cancer sample, and the 5-hmC content of the remaining genetic markers except for NME3 is relative to Normal people have increased significantly.

實施例Example 2.2. 腸癌基因標誌物的有效性Effectiveness of gut cancer gene markers

本實施例驗證本發明的腸癌基因標誌物用於檢測腸癌的有效性。This example demonstrates the effectiveness of the intestinal cancer gene marker of the present invention for detecting intestinal cancer.

根據實施例1的方法測定第一批59個樣品(24例腸癌和35例對照)中本發明所述的10個腸癌基因標誌物的5-hmC含量,並確定各基因標誌物的加權係數。The 5-hmC content of the 10 gut cancer gene markers of the present invention in the first batch of 59 samples (24 intestinal cancers and 35 controls) was determined according to the method of Example 1, and the weighting of each gene marker was determined. coefficient.

將各基因標誌物的標準化5-hmC含量乘以與其對應的加權係數,獲得該基因標誌物的預測因子t後,將各基因標誌物的預測因子t相加,獲得總預測因子T,然後將總預測因子T根據以下公式經過Logistic轉換獲得評分P: Multiplying the normalized 5-hmC content of each gene marker by its corresponding weighting coefficient to obtain the predictive factor t of the gene marker, adding the predictor t of each gene marker to obtain the total predictor T, and then The total predictor T is scored by Logistic conversion according to the following formula:

若P>0.5,則該受試者樣品患有腸癌;若P<0.5,則該受試者樣品為正常。If P > 0.5, the subject sample has intestinal cancer; if P < 0.5, the subject sample is normal.

圖1示出了根據本發明的方法區分該批樣品的結果。如圖1所示,本發明的方法能夠達到83%的靈敏度和94%的特異性。發明人進一步用本發明的方法區分第二批69個樣品(32例腸癌和37例對照),其結果表明該方法能夠達到88%的靈敏度和89%的特異性(圖2)。Figure 1 shows the results of distinguishing the batch of samples in accordance with the method of the present invention. As shown in Figure 1, the method of the invention is capable of achieving a sensitivity of 83% and a specificity of 94%. The inventors further distinguished the second batch of 69 samples (32 intestinal cancers and 37 controls) using the method of the present invention, and the results showed that the method was able to achieve 88% sensitivity and 89% specificity (Fig. 2).

這些結果表明,與現有技術相比,根據本發明的方法能夠以更高的靈敏度和特異性檢測腸癌。These results indicate that the method according to the present invention is capable of detecting intestinal cancer with higher sensitivity and specificity than the prior art.

實施例Example 3.3. 胃癌基因標誌物的篩選Screening of gastric cancer gene markers

根據實施例1所述的方法篩選胃癌的基因標誌物,唯一區別在於所用樣品是來自7位胃癌患者和18位正常人的血漿游離DNA。篩選到的胃癌標誌物如表2所示。 表2:本發明的胃癌基因標誌物的平均標準化5-hmC含量和加權係數 The genetic markers of gastric cancer were screened according to the method described in Example 1, the only difference being that the samples used were plasma free DNA from 7 gastric cancer patients and 18 normal humans. The gastric cancer markers screened are shown in Table 2. Table 2: Average normalized 5-hmC content and weighting coefficient of gastric cancer gene markers of the present invention

如上所述,平均標準化5-hmC含量是指胃癌樣品中該基因標誌物的平均5-hmC含量與正常樣品中同一基因標誌物的平均5-hmC含量之比。從表2可以看出,本發明的胃癌基因標誌物的5-hmC含量在正常樣品中和腸癌樣品中存在顯著差異,其中在胃癌樣品中,3種基因標誌物顯示5-hmC含量降低:RDM1 ZNF714ZNF518B ,15種基因標誌物顯示5-hmC含量升高:ARHGAP28 BMPER C9orf92 CADPS2 CDH11 FBXL7 MEOX2 NOS1 OSMR PALM2 PDE10A RBMS3 SULF1 WLSWTIPAs described above, the average normalized 5-hmC content refers to the ratio of the average 5-hmC content of the gene marker in the gastric cancer sample to the average 5-hmC content of the same gene marker in the normal sample. As can be seen from Table 2, the 5-hmC content of the gastric cancer gene marker of the present invention is significantly different between the normal sample and the intestinal cancer sample, wherein among the gastric cancer samples, the three gene markers show a 5-hmC content decrease: RDM1, ZNF714 and ZNF518B, 15 marker genes show elevated levels of 5-hmC: ARHGAP28, BMPER, C9orf92, CADPS2 , CDH11, FBXL7, MEOX2, NOS1, OSMR, PALM2, PDE10A, RBMS3, SULF1, WLS and WTIP.

實施例Example 4.4. 胃癌基因標誌物的有效性測定Determination of the effectiveness of gastric cancer gene markers

本實施例驗證本發明的胃癌基因標誌物用於檢測胃癌的有效性。This example demonstrates the effectiveness of the gastric cancer gene marker of the present invention for detecting gastric cancer.

根據實施例1的方法測定第三批60個樣品(25例胃癌和35例對照)中本發明所述的18個胃癌基因標誌物的5-hmC含量,並確定各基因標誌物的加權係數。The 5-hmC content of the 18 gastric cancer gene markers of the present invention in the third batch of 60 samples (25 gastric cancers and 35 controls) was determined according to the method of Example 1, and the weighting coefficients of the respective gene markers were determined.

將各基因標誌物的標準化5-hmC含量乘以與其對應的加權係數,獲得該基因標誌物的預測因子t後,將各基因標誌物的預測因子t相加,獲得總預測因子T,然後將總預測因子T根據以下公式經過Logistic轉換獲得評分P: Multiplying the normalized 5-hmC content of each gene marker by its corresponding weighting coefficient to obtain the predictive factor t of the gene marker, adding the predictor t of each gene marker to obtain the total predictor T, and then The total predictor T is scored by Logistic conversion according to the following formula:

若P>0.5,則該受試者樣品患有胃癌;若P<0.5,則該受試者樣品為正常。If P > 0.5, the subject sample has gastric cancer; if P < 0.5, the subject sample is normal.

圖3示出了根據本發明的方法區分該批樣品的結果。如圖3所示,本發明的方法能夠達到92%的靈敏度和91%的特異性。發明人進一步用本發明的方法區分第四批63個樣品(29例胃癌和37例對照),其結果表明該方法能夠達到90%的靈敏度和97%的特異性(圖4)。Figure 3 shows the results of distinguishing the batch of samples in accordance with the method of the present invention. As shown in Figure 3, the method of the invention is capable of achieving a sensitivity of 92% and a specificity of 91%. The inventors further distinguished the fourth batch of 63 samples (29 gastric cancers and 37 controls) by the method of the present invention, and the results showed that the method was able to achieve 90% sensitivity and 97% specificity (Fig. 4).

這些結果表明,與現有技術相比,根據本發明的方法能夠以更高的靈敏度和特異性檢測腸癌。These results indicate that the method according to the present invention is capable of detecting intestinal cancer with higher sensitivity and specificity than the prior art.

根據本發明可作之不同修正及變化對於熟悉該項技術者而言均顯然不會偏離本發明的範圍與精神。雖然本發明已敘述特定的較佳具體事實,必須瞭解的是本發明不應被不當地限制於該等特定具體事實上。事實上,在實施本發明之已述模式方面,對於熟習該項技術者而言顯而易知之不同修正亦被涵蓋於下列申請專利範圍之內。It is apparent to those skilled in the art that various modifications and variations can be made without departing from the scope and spirit of the invention. Although the present invention has been described in terms of specific preferred embodiments, it should be understood that the invention should not be In fact, the various modifications that are apparent to those skilled in the art are also contemplated by the scope of the invention.

圖1:用本發明的腸癌基因標誌物區分第一批樣品的結果。 圖2:用本發明的腸癌基因標誌物區分第二批樣品的結果。 圖3:用本發明的胃癌基因標誌物區分第三批樣品的結果。 圖4:用本發明的胃癌基因標誌物區分第四批樣品的結果。Figure 1: Results of distinguishing the first batch of samples using the intestinal cancer gene markers of the present invention. Figure 2: Results of distinguishing a second batch of samples using the intestinal cancer gene markers of the present invention. Figure 3: Results of distinguishing a third batch of samples using the gastric cancer gene markers of the present invention. Figure 4: Results of distinguishing the fourth batch of samples using the gastric cancer gene markers of the present invention.

Claims (8)

一種用於檢測腸癌的基因標誌物,包括一個或多個選自以下的基因:ADAM金屬肽酶域20 (ADAM20 )、F盒和富亮氨酸重複蛋白7 (FBXL7 )、卵泡抑素(FST )、TP53凋亡效應器(PERP )、普列克底物蛋白同源相似域家族A成員3 (PHLDA3 )、Runt相關轉錄因子1移動伴侶1 (RUNX1T1 )、互養蛋白γ2 (SNTG2 )、精子相關抗原4 (SPAG4 )、硫酸酯酶1 (SULF1 )和NME/NM23核苷二磷酸激酶(NME3 )。A genetic marker for detecting intestinal cancer, comprising one or more genes selected from the group consisting of ADAM metal peptidase domain 20 ( ADAM20 ), F-box and leucine-rich repeat protein 7 ( FBXL7 ), follistatin ( FST ), TP53 apoptotic effector ( PERP ), pleckin protein homologous domain family A member 3 ( PHLDA3 ), Runt-related transcription factor 1 mobile partner 1 ( RUNX1T1 ), mutual protein γ2 ( SNTG2 ), sperm associated antigen 4 (SPAG4), sulfatase 1 (SULF1) and NME / NM23 nucleoside diphosphate kinase (NME3). 如請求項1所述之基因標誌物,包括ADAM20 FBXL7 FST PERP PHLDA3 RUNX1T1 SNTG2 SPAG4 SULF1 NME3 The gene markers according to claim 1 include ADAM20 , FBXL7 , FST , PERP , PHLDA3 , RUNX1T1 , SNTG2 , SPAG4 , SULF1 and NME3 . 一種如請求項1或2所述之基因標誌物在用於檢測腸癌的方法中的用途。Use of the genetic marker of claim 1 or 2 in a method for detecting intestinal cancer. 一種用於檢測腸癌的方法,包括以下步驟: (a)測定正常樣品和受試者樣品中如請求項1或2所述的基因標誌物的5-羥甲基胞嘧啶(5-hmC)含量; (b)用正常品中所述基因標誌物的5-hmC含量作為參照,將受試者樣品中對應的基因標誌物的5-hmC含量標準化; (c)對步驟(b)中經標準化的所述基因標誌物的5-hmC含量進行數學關聯,並獲得評分;和 (d)根據所述評分獲得檢測結果。A method for detecting intestinal cancer comprising the steps of: (a) determining 5-hydroxymethylcytosine (5-hmC) of the gene marker according to claim 1 or 2 in a normal sample and a subject sample. (b) normalize the 5-hmC content of the corresponding gene marker in the sample of the subject with the 5-hmC content of the gene marker as described in the normal product; (c) the step (b) The 5-hmC content of the standardized gene marker is mathematically correlated and scored; and (d) the test results are obtained according to the score. 如請求項4所述之方法,其中步驟(a)是測定所述基因標誌物全長或其片段上的5-hmC的含量。The method of claim 4, wherein the step (a) is to determine the content of 5-hmC on the entire length of the gene marker or a fragment thereof. 如請求項4所述之方法,其中所述樣品是來自正常人或受試者體液中游離的DNA片段,或來源於胞器、細胞以及組織中的完整基因組DNA。The method of claim 4, wherein the sample is a free DNA fragment from a normal human or a subject's body fluid, or a complete genomic DNA derived from a cell, a cell, and a tissue. 如請求項4所述之方法,其中所述體液是血液、尿液、汗液、痰液、糞便、腦脊液、腹水、胸水、膽汁或胰腺液。The method of claim 4, wherein the body fluid is blood, urine, sweat, sputum, feces, cerebrospinal fluid, ascites, pleural effusion, bile or pancreatic juice. 一種用於檢測腸癌的試劑盒,包括: (a)用於測定如請求項1或2所述的基因標誌物的5-hmC含量的試劑;和 (b)說明書。A kit for detecting intestinal cancer, comprising: (a) an agent for determining a 5-hmC content of the gene marker according to claim 1 or 2; and (b) a specification.
TW106146503A 2017-01-11 2017-12-29 Method for screening gene markers of intestinal cancer, gene markers screened by the method, and uses thereof TWI647312B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710020252.4A CN106755464A (en) 2017-01-11 2017-01-11 For the method for screening the gene marker of intestinal cancer and/or stomach cancer, the gene marker and application thereof that is screened with the method
??201710020252.4 2017-01-11

Publications (2)

Publication Number Publication Date
TW201829781A true TW201829781A (en) 2018-08-16
TWI647312B TWI647312B (en) 2019-01-11

Family

ID=58947635

Family Applications (1)

Application Number Title Priority Date Filing Date
TW106146503A TWI647312B (en) 2017-01-11 2017-12-29 Method for screening gene markers of intestinal cancer, gene markers screened by the method, and uses thereof

Country Status (3)

Country Link
CN (1) CN106755464A (en)
TW (1) TWI647312B (en)
WO (1) WO2018129989A1 (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106755464A (en) * 2017-01-11 2017-05-31 上海易毕恩基因科技有限公司 For the method for screening the gene marker of intestinal cancer and/or stomach cancer, the gene marker and application thereof that is screened with the method
CN108300783A (en) * 2017-01-11 2018-07-20 上海易毕恩基因科技有限公司 The method of gene marker for screening intestinal cancer and/or gastric cancer, the gene marker and application thereof screened with this method
CN107385051A (en) * 2017-08-04 2017-11-24 上海易毕恩生物技术有限公司 For detecting liver tumour good pernicious gene marker, kit and detection method
CN107365845A (en) * 2017-08-04 2017-11-21 上海易毕恩生物技术有限公司 For detecting the gene marker, kit and lung cancer detection method of lung cancer
CN107385050A (en) * 2017-08-04 2017-11-24 上海易毕恩生物技术有限公司 For detecting the gene marker, kit and cancer of pancreas detection method of cancer of pancreas
EP3688195A1 (en) * 2017-09-27 2020-08-05 Cambridge Epigenetix Limited Biomarkers for colorectal cancer detection
CN108410984B (en) * 2018-02-11 2021-08-31 中山大学 Application of RBMS3 as tumor drug resistance detection, treatment and prognosis molecular target
AU2020257982A1 (en) * 2019-04-15 2021-11-25 Fundación Instituto De Investigación Sanitaria De Santiago De Compostela In vitro method for the diagnosis or prognosis of colorectal cancer or a pre-cancerous stage thereof
CN111004848B (en) * 2019-12-11 2022-09-23 中国人民解放军陆军军医大学第一附属医院 Application of FBXL6 as target in preparation of antitumor drugs
CN112760371A (en) * 2021-03-09 2021-05-07 上海交通大学 Primer, kit and analysis method for detecting MUC1 gene mutation

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7838234B2 (en) * 2007-01-30 2010-11-23 Pharmacyclics, Inc. Methods for determining cancer resistance to histone deacetylase inhibitors
ES2685678T3 (en) * 2007-10-23 2018-10-10 Clinical Genomics Pty Ltd A method for the diagnosis of neoplasms - II
EP2549399A1 (en) * 2011-07-19 2013-01-23 Koninklijke Philips Electronics N.V. Assessment of Wnt pathway activity using probabilistic modeling of target gene expression
US20140030727A1 (en) * 2012-01-20 2014-01-30 Gerd PFEIFER Loss of 5-hydroxymethylcytosine as a biomarker for cancer
US9297806B2 (en) * 2012-08-01 2016-03-29 The Johns Hopkins University 5-hydroxymethylcytosine in human cancer
JP2016530232A (en) * 2013-06-24 2016-09-29 ミルナ セラピューティクス,インク. Biomarker of MIR-34 activity
CN104293938B (en) * 2014-09-30 2017-11-03 天津华大基因科技有限公司 Build the method and its application of sequencing library
CN106755464A (en) * 2017-01-11 2017-05-31 上海易毕恩基因科技有限公司 For the method for screening the gene marker of intestinal cancer and/or stomach cancer, the gene marker and application thereof that is screened with the method

Also Published As

Publication number Publication date
WO2018129989A1 (en) 2018-07-19
CN106755464A (en) 2017-05-31
TWI647312B (en) 2019-01-11

Similar Documents

Publication Publication Date Title
TWI647312B (en) Method for screening gene markers of intestinal cancer, gene markers screened by the method, and uses thereof
JP7002334B2 (en) How to Diagnose Bladder Cancer
KR20220015367A (en) Methods and Systems for Deep Sequencing of Methylated Nucleic Acids
JP6975807B2 (en) Diagnostic genetic marker panel for colorectal cancer
CN107847515B (en) Solid tumor methylation marker and application thereof
TWI680296B (en) Gene marker, reagent set for detecting pancreatic cancer, and method for detecting pancreatic cancer
TWI673496B (en) Gene marker, reagent group and lung cancer detection method for detecting lung cancer
CN106460046A (en) Detecting colorectal neoplasm
BR112013005929B1 (en) methods of screening for onset or predisposition to onset of large intestine neoplasm or monitoring of neoplasm progress in an individual
TW201905206A (en) Gene marker for use in detecting liver cancer and use thereof
WO2023226938A1 (en) Methylation biomarker, kit and use
TWI694152B (en) Gene marker for detecting liver cancer and its use
TWI667480B (en) Gene marker, reagent group and detection method for detecting benign and malignant liver tumors
WO2019149093A1 (en) Gene marker for detecting esophageal cancer, use thereof and detection method therefor
CN107868822B (en) Composition for detecting esophagus cancer, kit and application thereof
TWI653340B (en) Method for screening gene markers of gastric cancer, gene markers screened by the method, and uses thereof
WO2005021743A1 (en) Primers for nucleic acid amplification and method of examining colon cancer using the same
KR20240046525A (en) Compositions and methods associated with TET-assisted pyridine borane sequencing for cell-free DNA
WO2014160829A2 (en) Unbiased dna methylation markers define an extensive field defect in histologically normal porstate tissues associated with prostate cancer: new biomarkers for men with prostate cancer
WO2024008040A1 (en) Cancer-specific methylation marker and use thereof
JP7232438B2 (en) Methods for purifying, isolating or concentrating methyl-containing group-modified nucleic acids
CN117004720A (en) Composition for detecting thyroid cancer and application thereof
CN116798606A (en) System for detecting thyroid cancer
CN115772564A (en) Methylation biomarker for auxiliary detection of lung cancer somatic cell ATM gene fusion mutation and application thereof
CN115927644A (en) Novel marker combination for multi-target gastric cancer detection and application thereof

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees