CN103060348A - Genetic markers for detecting recurrence potential of primary liver cancer - Google Patents

Genetic markers for detecting recurrence potential of primary liver cancer Download PDF

Info

Publication number
CN103060348A
CN103060348A CN2011103263819A CN201110326381A CN103060348A CN 103060348 A CN103060348 A CN 103060348A CN 2011103263819 A CN2011103263819 A CN 2011103263819A CN 201110326381 A CN201110326381 A CN 201110326381A CN 103060348 A CN103060348 A CN 103060348A
Authority
CN
China
Prior art keywords
protein
albumen
recurrence
sample
polynucleotide
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2011103263819A
Other languages
Chinese (zh)
Inventor
何祥火
魏霖
梁琳慧
顾健人
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Cancer Institute
Original Assignee
Shanghai Cancer Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Cancer Institute filed Critical Shanghai Cancer Institute
Priority to CN2011103263819A priority Critical patent/CN103060348A/en
Publication of CN103060348A publication Critical patent/CN103060348A/en
Pending legal-status Critical Current

Links

Images

Abstract

The invention relates to genetic markers for detecting recurrence potential of primary liver cancer. By detecting the gene expression profile level of samples with recurring cancer tissues and samples without recurring cancer tissues of patients with primary liver cancer, people screen out 42 specific genes by a statistical method. The inspection proves that the specific genetic markers can be used for effectively predicting the liver cancer recurrence condition or recurrence potential of the patients with primary liver cancer.

Description

Gene marker for detection of the recurrence potentiality of primary hepatocarcinoma
Technical field
The invention belongs to the medicine and pharmacology field; More specifically, the present invention relates to gene marker for detection of the recurrence potentiality of primary hepatocarcinoma.
Background technology
The genesis of anything all is under the internal and external reasons acting in conjunction, and is interior because main, outer because auxiliary.Gene is as the hereditary medium of life, organism sick, old,, be in the status of basic internal cause in dead.Overwhelming majority gene is by transcribing product nucleus ribosomal ribonucleic acid mRNA, and translation generates protein and brings into play biological function again.Although the function of gene mainly is to embody by protein, but still the content of transcription product mRNA that can be by gene and part reflection.
Full genomic expression spectrum chip contains more than 20,000 probes, utilize the content of principle contained various mRNA in full genomic level detection sample of dna double chain homologous complementary, therefore can be at one time the transcriptional level of the gene in the full genome range in the sample to be tested be detected, and then infer gene function.
Primary hepatocarcinoma is one of Chinese modal malignant tumour, and annual new cases account for the whole world 50%.The grade malignancy of liver cancer is high, and development is treated difficulty rapidly, and case fatality rate is high.Therefore, the as early as possible diagnosis of liver cancer just more seems urgent.
Yet, up to now, this area is for very few with the closely-related gene understanding of primary hepatocarcinoma, so this area is found generation, transfer, the recurrence for diagnosing liver cancer or detected relevant gene marker in the urgent need to separating further various and the closely-related gene of primary hepatocarcinoma.
Summary of the invention
The object of the present invention is to provide the gene marker for detection of the recurrence potentiality of primary hepatocarcinoma.
In a first aspect of the present invention, a kind of polynucleotide collection is provided, described polynucleotide collection comprises the polynucleotide (preferably, the polynucleotide by the following albumen of coding form) of the following albumen of encoding:
Sulfo group glucosamine sulfhydrase; microtubulin fold cofactor D; Tyrosylprotein kinase 2; the adenosine deaminase that RNA is special; the ubiquitin sample is modified activating enzymes; Wd duplicate domain 13; Gamma glutamyl lytic enzyme; micronuclear ribonucleoprotein 40kDa; Protein Tyrosine Phosphatases 4A1; studies of interleukin-18 binding protein; coiled coil territory 92, closed protein 1, KIAA0831 albumen; glutathione-S-transferase 2; solubility transporter family 11 member 2 albumen, CD99 quasi-molecule 2, hydroxyalkyl coa dehydrogenase beta; coenzyme desaturase flavoprotein 2; ubiquitin protein ligase E3 assembly n recognin 4, the MLX interaction protein is with the acceptor 6 of the G albumen coupling that contains leucine enrichment repeat region; zinc finger protein 692; the special homologous protein of transcribing of mesoderm, inositol 1,3; 4-triphosphate 5/6 kinases; the CD5 quasi-molecule, talin 1, heat shock protein(HSP) 40 homology subfamily C members 3; with fruit bat FZ homology 5; Coxsackie virus and adenovirus receptor, angiogenesis factor sample albumen 3, microtubule associating albumen 1 light chain 3alpha; 3-hydroxy-3-methyl glutaryl-coenzyme A synthetase 1; the neurotrophic factor in neurone source, methylmalonic aciduria cblC type, the island purine suppresses polypeptide 2 in conjunction with the activity of albumen alpha; No. 22 chromosomal No. 32 open reading frame; nucleoporin 88kDa, hypertensin 2 receptor-like protein 2, sterol-C4-methyl oxidation enzyme sample albumen; the target proteins of Mybl sample albumen 2, arylacetamide deacetylase and ubiquitin fold modified protein 1.
In a preference; the described sulfo group glucosamine sulfhydrase of encoding; microtubulin fold cofactor D; Tyrosylprotein kinase 2; the adenosine deaminase that RNA is special; the ubiquitin sample is modified activating enzymes; Wd duplicate domain 13; Gamma glutamyl lytic enzyme; micronuclear ribonucleoprotein 40kDa; Protein Tyrosine Phosphatases 4A1; studies of interleukin-18 binding protein; coiled coil territory 92; closed protein 1; KIAA0831 albumen; glutathione-S-transferase 2; solubility transporter family 11 member 2 albumen; CD99 quasi-molecule 2; hydroxyalkyl coa dehydrogenase beta; coenzyme desaturase flavoprotein 2; ubiquitin protein ligase E3 assembly n recognin 4; the MLX interaction protein; acceptor 6 with the G albumen coupling that contains leucine enrichment repeat region; zinc finger protein 692; the special homologous protein of transcribing of mesoderm; inositol 1; 3; 4-triphosphate 5/6 kinases; the CD5 quasi-molecule, talin 1, heat shock protein(HSP) 40 homology subfamily C members 3; with fruit bat FZ homology 5; Coxsackie virus and adenovirus receptor, angiogenesis factor sample albumen 3, microtubule associating albumen 1 light chain 3alpha; 3-hydroxy-3-methyl glutaryl-coenzyme A synthetase 1; the neurotrophic factor in neurone source, methylmalonic aciduria cblC type, the island purine suppresses polypeptide 2 in conjunction with the activity of albumen alpha; No. 22 chromosomal No. 32 open reading frame; nucleoporin 88kDa, hypertensin 2 receptor-like protein 2, sterol-C4-methyl oxidation enzyme sample albumen; the target proteins of Myb1 sample albumen 2; the polynucleotide of the folding modified protein 1 of arylacetamide deacetylase and ubiquitin are respectively: sequence such as GenBank accession number NM_000199.3, NM_005993.4, NM_003331.4; NM_001111.3; NM_003334.3, NM_017883.3, NM_003878.2; NM_004814.2; NM_003463.3, NM_173042.2, NM_025140.1; NM_021101.3; NM_014924.3, NM_000854.3, NM_000617.2; NM_031462.2; NM_000183.2, NM_021074.3, NM_020765.2; NM_014938.3; NM_001017403.1, NM_001136036.1, NM_002402.2; NM_014216.4; NM_005894.2, NM_006289.3, NM_006260.3; NM_003468.3; NM_001338.3, NM_014495.2, NM_032514.2; NM_001098272.1; NM_013349.4, NM_015506.2, NM_002070.2; NM_033318.3; NM_002532.3, NM_005162.2, NM_006745.3; NM_001033551.2, the polynucleotide shown in NM_001086.2 and the NM_016617.2.
In another aspect of this invention, provide the purposes of described polynucleotide collection, for the preparation of detection reagent or the test kit of the recurrence of measuring (or prediction) primary hepatocarcinoma or recurrence potentiality.
In a preference, described reagent is the primer pair that the described polynucleotide of specific amplification are concentrated each polynucleotide; Or the described polynucleotide of specific recognition are concentrated the probe of each polynucleotide.
In another aspect of this invention, a kind of recurrence for mensuration (or prediction) primary hepatocarcinoma or the reagent of recurrence potentiality are provided, it is specific recognition sequence such as GenBank accession number NM_000199.3, NM_005993.4, NM_003331.4, NM_001111.3, NM_003334.3, NM_017883.3, NM_003878.2, NM_004814.2, NM_003463.3, NM_173042.2, NM_025140.1, NM_021101.3, NM_014924.3, NM_000854.3, NM_000617.2, NM_031462.2, NM_000183.2, NM_021074.3, NM_020765.2, NM_014938.3, NM_001017403.1, NM_001136036.1, NM_002402.2, NM_014216.4, NM_005894.2, NM_006289.3, NM_006260.3, NM_003468.3, NM_001338.3, NM_014495.2, NM_032514.2, NM_001098272.1, NM_013349.4, NM_015506.2, NM_002070.2, NM_033318.3, NM_002532.3, NM_005162.2, NM_006745.3, NM_001033551.2, the probe of the polynucleotide shown in NM_001086.2 or the NM_016617.2, the nucleotide sequence of described probe is shown in SEQ IDNO:1-42 is arbitrary.
In another aspect of this invention, provide a kind of gene chip, described gene chip comprises:
Solid phase carrier; And
Be fixed in order the probe on the described solid phase carrier, described probe specificity ground is corresponding to sequence such as GenBank accession number NM_000199.3, NM_005993.4, NM_003331.4, NM_001111.3, NM_003334.3, NM_017883.3, NM_003878.2, NM_004814.2, NM_003463.3, NM_173042.2, NM_025140.1, NM_021101.3, NM_014924.3, NM_000854.3, NM_000617.2, NM_031462.2, NM_000183.2, NM_021074.3, NM_020765.2, NM_014938.3, NM_001017403.1, NM_001136036.1, NM_002402.2, NM_014216.4, NM_005894.2, NM_006289.3, NM_006260.3, NM_003468.3, NM_001338.3, NM_014495.2, NM_032514.2, NM_001098272.1, NM_013349.4, NM_015506.2, NM_002070.2, NM_033318.3, NM_002532.3, NM_005162.2, NM_006745.3, NM_001033551.2, the polynucleotide shown in NM_001086.2 or the NM_016617.2.
In a preference, the nucleotide sequence of described probe is shown in SEQ ID NO:1-42 is arbitrary.
In another preference, described probe contains: complementary land and/or the joining region that links to each other with solid phase carrier.
In another aspect of this invention, provide the purposes of described gene chip, for the preparation of the test kit of the recurrence of measuring (or prediction) primary hepatocarcinoma or recurrence potentiality.
In another aspect of this invention, provide the recurrence of a kind of mensuration (or prediction) primary hepatocarcinoma or the test kit of recurrence potentiality, contain described gene chip in the described test kit.
In a preference, also comprise in the described test kit: nucleic acid extract, nitrite ion or hybridization solution.
Other side of the present invention is because the disclosure of this paper is apparent to those skilled in the art.
Description of drawings
Fig. 1, with the generation recurrent hepatic cancer sample after the MDS algorithm conversion (red point or light point) with the scatter diagram of recurrent hepatic cancer sample (blueness or dark point) in three-dimensional space do not occur.
Embodiment
The gene expression profile level of the cancerous tissue sample of the inventor's (be respectively behind the tumor resection within the observation period tumour " recurred " and " occuring to recur " patient) primary hepatocarcinoma by detecting two kinds of liver cancer patients sources, use statistical method, therefrom filter out first 42 specific genes.Through testing identity, these specific gene markers can be used for liver cancer recurrence situation or the recurrence potentiality of prediction Patients with Primary very effectively.
Gene marker and uses thereof
Primary hepatocarcinoma is one of modal malignant tumour of China.The inventor adopts the method for chip hybridization to obtain the recurrent cancer tissue samples of primary hepatic carcinoma and the full genomic expression spectrum of recurrent cancer tissue samples not, by comparing the express spectra of two kinds of tissues, obtain the gene of differential expression between high recurrence potentiality sample and low recurrence potentiality sample.In these differential genes, using the screening of V-SVC sorter (linear kernel function) algorithm to obtain classify accuracy is a classifiers (containing 42 genes) of 82%.By the sorter of these 42 genomic constitutions, whether measurable hepatocarcinoma patient postoperative recurs, and its prediction accuracy reaches 75%.Can become these 42 Data minings small-sized gene chip or RT-PCR test kit to be used for the prediction of liver cancer recurrence.
Described 42 genes are as follows: SGSH, TBCD, TYK2, ADAR, UBE1, WDR13, GGH, HPRP8BP, PTP4A1, IL18BP, FLJ22471, CLDN1, KIAA0831, GSTT2, SLC11A2, DKFZP761H2024, HADHB, NDUFV2, KIAA0462, MONDOA, LGR6, FLJ20531, MEST, ITPK1, CD5L, TLN1, DNAJC3, DKFZP434E2135, CXADR, ANGPTL3, MAP1LC3A, HMGCS1, SPUF, DKFZP564I122, GNAI2, LOC91689, NUP88, AGTRL2, SC4MOL, TOM1L2, AADAC, BM-002.
Based on the inventor's new discovery, can be with above-mentioned 42 genes as measuring (or differentiation) cancerous tissue of primary hepatocarcinoma and mark of cancer beside organism (marker).By analyzing the expression of these genes in the testing sample (sample), thereby whether the cancer of learning the experimenter recurs, or can assess in early days the possibility of experimenter's cancer return, for diagnosis or the prognosis of disease provides foundation.Described testing sample or sample to be tested are patient's cancerous tissues, for example are that the patient tumors resection operation excises the cancerous tissue sample that gets off.
Can adopt various technology to detect the expression of said gene, these technology all comprise in the present invention.On the gene level, detection can be for cDNA, also can be for genomic dna, or for mRNA; Available prior art is such as (but being not limited to): biochip technology, probe hybridization technology, polymerase chain reaction (PCR), Southern blotting, dna sequence analysis etc.
As a kind of selection mode of the present invention, come expression and the expression amount of said gene in the analytic sample by quantitative or semiquantitative polymerase chain reaction (PCR) method, thereby can judge.Preferably, realize detecting by RT-PCR.
Gene chip
As a kind of optimal way of the present invention, adopt biochip technology to detect the expression of said gene.Comprise the probe for said gene on the described gene chip; Preferred, comprise the probe for two or more described gene on the described gene chip; Most preferred, comprise the probe for described 42 kinds of genes on the described gene chip.
Gene chip of the present invention comprises solid phase carrier and is fixed in order oligonucleotide probe on the described solid phase carrier that described oligonucleotide probe (is more particularly 20-80 by 10-100; Be more particularly 30-70) the continuous nucleotide composition.Described probe can also comprise the poly-poly-deoxythymidylic acid of one section amido modified 1-30 (poly-dT) at its 5 ' end.
Described solid phase carrier can adopt the various common used materials in gene chip field, such as but not limited to nylon membrane, and the slide of the slide of modifying through active group (such as aldehyde radical, amino, the fine acidic group of different sulphur etc.) or silicon chip, unmodified, plastic sheet etc.
The preparation of gene chip of the present invention can be adopted the conventional manufacture method of biochip.For example, if what solid phase carrier adopted is to modify slide or silicon chip, 5 ' end of probe contains amido modified poly-dT string, oligonucleotide probe can be mixed with solution, then point sample instrument is being modified slide or silicon chip with its point, be arranged in predetermined sequence or array, then spending the night by placement fixes, and just can obtain gene chip of the present invention.If it is amido modified that oligonucleotide probe does not contain, then its preparation method also can reference: Wang Shenwu chief editor's " gene diagnosis technology-on-radiation operational manual "; J.L.erisi, V.R.Iyer, P.O.BROWN.Exploring the metabolic and genetic control of gene expression on a genomic scale.Science, 1997; 278:680 and Ma Li people, the Jiang Zhonghua chief editor. biochip. Beijing: Chemical Industry Press, 2000,1-130.
Another aspect, the present invention also provides a kind of method by gene expression profile described in the genechip detection people tissue, comprises step:
(1) provides the RNA sample that separates from people's tissue, at described RNA marker is set;
(2) RNA with (1) contacts with described chip, makes the oligonucleotide probe generation hybridization on described RNA and the solid phase carrier, thereby forms " oligonucleotide probe-RNA " binary complex at solid phase carrier;
(3) detect the marker of the binary complex of (2) formation, thereby determine the express spectra of corresponding gene in people's tissue.
Solid-phase hybridization between RNA of the present invention and the chip carries out according to the classical way of this area, and the general personnel in this area determine relevant damping fluid, probe and the optimum condition of concentration of specimens, prehybridization temperature, hybridization temperature and time etc. easily according to experience.Perhaps also can be with reference to described in " molecular cloning experiment guide ".
Then treat measurement information according to acquisition of informations such as the position of marking signal on chip, intensity.If amplified production fluorophor mark also can directly obtain and treat measurement information with fluorescence detection device (such as laser confocal scanning instrument Scanarray3000 etc.).
The labelling groups that sample of nucleic acid is carried out mark includes but not limited to: digoxin molecule (DIG), biotin molecule (Bio), fluorescein and the biomolecules of deriving (FITC etc.) thereof, other fluorescence molecules (such as Cy3, Cy5 etc.), alkaline phosphatase (AP), horseradish peroxidase (HRP) etc.These marks and marking method thereof all have been routine techniques well-known in the art, also can be with reference to Wang Shenwu chief editor's " gene diagnosis technology-on-radiation operational manual "; J. Pehanorm Brooker, D.W. Russell chief editor, " molecular cloning experiment guide ", Science Press, 2002; Horse stands the people, the Jiang Zhonghua chief editor. biochip. and Beijing: Chemical Industry Press, 2000,1-130.
When above-mentioned sample of nucleic acid and gene chip are hybridized, can first gene chip and prehybridization damping fluid be carried out prehybridization.Solid-phase hybridization between sample of nucleic acid and the gene chip carries out according to the classical way of this area, and the general personnel in this area determine relevant damping fluid, probe and the optimum condition of concentration of specimens, prehybridization temperature, hybridization temperature and time etc. easily according to experience.
Then treat measurement information according to acquisition of informations such as the position of marking signal on gene chip, intensity.If amplified production fluorophor mark also can directly obtain and treat measurement information with fluorescence detection device (such as laser confocal scanning instrument Scanarray3000 etc.).
Detection kit
The present invention also provides a kind of test kit of the expression for detection of described gene.Described test kit can be used for measuring liver cancer recurrence situation or the recurrence potentiality of (or prediction) Patients with Primary.Comprise chip of the present invention in the described test kit.
Preferred, also contain the marker that is useful on the labeled rna sample in the described test kit, and the substrate corresponding with described marker.
In addition, also can comprise in the described test kit for extracting the required all ingredients such as RNA, PCR, hybridization, colour developing, include but not limited to: extract, amplification liquid, hybridization solution, enzyme, contrast liquid, nitrite ion, washing lotion, antibody etc.
In addition, also can comprise working instructions and/or chip image analysis software in the described test kit.Described chip image analysis software is such as the BaiO ArrayDoctor 2.0 of BaiO company, the Arraypro 4.0 of Media Cybernetics company.
Below in conjunction with specific embodiment, further set forth the present invention.Should be understood that these embodiment only to be used for explanation the present invention and be not used in and limit the scope of the invention.The experimental technique of unreceipted actual conditions in the following example is write according to normal condition such as J. Pehanorm Brooker etc. usually, molecular cloning experiment guide, Science Press, the condition described in 2002, or the condition of advising according to manufacturer.Unless otherwise indicated, otherwise per-cent and umber calculate by weight.
Unless otherwise defined, the same meaning that employed all specialties and scientific words and one skilled in the art are familiar with in the literary composition.In addition, any method similar or impartial to described content and material all can be applicable among the present invention.The usefulness that better implementation method described in the literary composition and material only present a demonstration.
The preparation of embodiment 1, RNA sample
Tissue samples:
23 pairs of cancers come from (comprising that 10 examples have recurred sample and 13 examples recur sample) the excision sample of Patients with Primary, and these samples come from Qidong liver cancer research institute and the first affiliated hospital of Zhejiang University.10 routine normal hepatocytes come from unexpected death person.Above-mentioned all samples obtain all agreement by the Ethics Committee of WHO cooperative association of Shanghai City government authorization.The clinical data of tissue samples comprises: sex, age, tumor size, pathological grading (Edmonson), liver cirrhosis whether, whether shift, whether recur, HBV and HCV infection conditions etc.All do not accept chemotherapy before all operation in patients, Follow-up After reaches 60 months most.
Gene chip:
Full genomic expression spectrum chip adopts the brilliant core of Boao Biological Co., Ltd (http://www.capitalbio.com)
Figure BDA0000101615010000081
Human full genome oligonucleotide arrays V1.0 (two channels chip) contains more than 22000 gene probes.
RNA sample (total tissue RNA extraction):
1. the Collection and preservation of sample: in vitro tissue places the liquid nitrogen quick-frozen after cutting into fritter immediately, then moves to-80 ℃ of Refrigerator stores.
2. tissue block is broken: tissue block is put in the liquid nitrogen and pulverizes, and every gram tissue adds 1mlTrizol (Invitrogen company).
3. total RNA extracting: add the chloroform of 0.2ml, thermal agitation 15 seconds, incubated at room 2-3 minute by every milliliter of Trizol; 4 ℃, centrifugal 15 minutes of 10,000g; Colourless supernatant liquor is moved in the new centrifuge tube, add the 0.5ml Virahol, incubated at room 10 minutes, 4 ℃ of 10000g are centrifugal 10 minutes; Outwell supernatant, 0.75ml75% washing with alcohol, 4 ℃, centrifugal 5 minutes of 7,500g; Outwell supernatant, drying at room temperature RNA precipitates 5-10 minute (not making the RNA complete drying), after processing with DEPC without RNA enzyme H2O dissolution precipitation.
4. spectrophotometer standard measure RNA, and the Total RNA that takes a morsel carries out electrophoresis, checks whether RNA degrades.
The gene of embodiment 2, screening significant difference
Recurrent cancer sample and normal sample (10 examples are mixed), 13 examples have occured and the recurrent cancer sample does not occur have used respectively cy3 and two kinds of fluorochrome labels of cy5 with normal sample (10 routine the mixing) in 10 examples, with the probe competitive hybridization on the full genomic expression spectrum chip.The brilliant core of chip after the hybridization
Figure BDA0000101615010000091
LuxScan TMThe scanning of 10K micro-array chip scanner obtains result images, and is quantitative to results of hybridization by its LuxScan 3.0 general microarray image analysis softwares that attach at random again.Thus, obtaining 10 examples has the recurrence sample that normal sample (is had recurrence/normal, Recurrence/Normal) mRNA chip and 13 examples are without recurring sample to normal sample (nothing recurrence/normal, Non-Recurrence/Normal) the express spectra data of mRNA chip.
There are recurrence/normal and 13 examples at first to use LOWESS (locally weighted regression) normalization method without recurrence/normal chip results above-mentioned 10 examples.Afterwards, the chip results that recurrence/normal (Recurrence/Normal), nothing recurrence/normal (Non-Recurrence/Normal) are arranged after normalization method is got take the 2 logarithm (log the end of as 2X), use at random Tobin's mean variance model (RVM) screening after two groups of T methods of inspection (because of recurrence sample tissue being arranged with tissue-derived in different patients without the recurrence sample) are proofreaied and correct to obtain the significant difference gene that recurrence/nothing recurs (Recurrence/Non-Recurrence).
For the significance of verifying gene difference is not caused by coincidence, the inventor introduces after the above-mentioned T method of inspection data random rearrangement 1000 times again, detect a gene tested false positive rate (FDR, False Discovery Rate) that is decided to be the significant difference gene after screening by preceding method.
In above-mentioned statistical method, only have the gene of p value<0.05 just can screenedly be the significant difference gene.
Screening and the checking of sorter (can distinguish two histioid genes)
For searching is used for distinguishing two class samples (recurrence and nothing recurrence are arranged), the inventor uses the Lib-SVM tool kit of Matlab software, use C-SVC (the support vector sorter of parameters C) and two kinds of methods of V-SVC (the support vector sorter of V parameter), and adopt separately four kinds of kernel function (linear kernel, ploy kernel, gauss kernel, tanh kernel), amount to 8 kinds of methods and make up the svm classifier device.SVM (support vector machine, support vector machine) sorter is the nonlinear function of the logarithmic value of the difference multiple of differential gene between two class samples, its seek can maximize two class sample separation from lineoid, thereby can distinguish best two class samples.
Be the classify accuracy of inspection-classification device, the inventor selects in all crosscheck methods the most stable 10 to take advantage of 10 folding crosscheck methods.10 folding crosscheck methods are that sample totally is divided into 10 sons part, select 1 son part as test data set at every turn, and all the other 9 sons are part as training dataset, repeat 10 times (at every turn with different son parts as test data set).10 assays that so obtain combine and form a assessed value to the sorter classify accuracy.
Again by the highest a kind of as the optimal classification device of selection sort accuracy in 8 kinds of algorithms.
For showing intuitively on the same group similar between sample and not on the same group different between sample, the inventor looks in 3 dimensions and introduces multidimensional scaling (multidimensional scaling, MDS) in the effect.The MDS algorithm for the basis, is determined the position of each sample in lower dimensional space take similarity matrix between the sample-sample that is classified device (one group of gene) definition, and makes it to be adapted to 3 and tie up and look effect.In this 3 dimension space, two samples are more approaching, and then they are more similar; Otherwise, if two samples are at a distance of far away, then more different between them.
The result
Screening during differential gene take p value<0.05 as the boundary.The recurrence sample is arranged to without the other sample of recurrence (recurrence/nothing recurrence being arranged, Recurrence/Non-Recurrence, R/NR), have 203 significant difference genes.With these differential genes as the sorter candidate gene.Be inserted in 8 kinds of svm classifier device algorithms and take advantage of 10 crosschecks of rolling over to verify the classify accuracy of sorter with aforesaid 10.After 10 of 1000 secondary data displacement takes advantage of 10 folding cross validations to obtain sorter, for testing the predictive ability of this sorter, 5 samples of inventor's random choose are as unknown sample, with this sorter tissue-derived (have recurrence or without recurrence) of each sample predicted, observe its prediction accuracy.Comprehensive 8 kinds of algorithms, the classify accuracy that is obtained by this algorithm of V-SVC (linear kernel, linear kernel function) is the highest.This algorithm obtains be used for to distinguish recurrence and sorter without 42 genomic constitutions of recurrence sample, sees Table 1, and its classify accuracy can reach 82%, and the predictablity rate of unknown sample is reached 75%.
Table 1: 42 genes that form sorter
Figure BDA0000101615010000101
Figure BDA0000101615010000111
Figure BDA0000101615010000121
For ease of the classifying quality that shows sorter directly perceived, the inventor uses the MDS algorithm 42 gene signal values of each sample to be converted to the eigenvector of 3 dimensions, and they are located in 3 dimension spaces, be depicted as recurrence and without the three-dimensional scatter diagram of recurrence two class samples, see Fig. 1.Can be judged by figure whether the distance between any two samples of different group is larger than the distance between any two samples on the same group.
Embodiment 3, preparation gene detecting chip
The GenBank accession number (GenBank ID) of 42 gene orders that provide according to table 1, the design polynucleotide probes adds the catenation sequence of 0-20nt at the sequence two ends according to features such as the GC that produces sequence compare; Core sequence is different, and catenation sequence is also different.Catenation sequence can be produced at random by program, and the probe that catenation sequence and core sequence form meets the following conditions:
1) in the probe sequence, the quantity of same Nucleotide (A, C, G, T) can not surpass 50% of sequence sum;
2) any continuous A, T or the quantity of C, G can not surpass 25% of sequence sum;
3) G, C content account for the 40%-60% of sequence sum;
4) probe sequence can not be from hybridization, and namely the length of complementary fragment can not surpass 30% of probe length in the probe sequence.
The probe sequence of design sees Table 2.
Table 2
Figure BDA0000101615010000122
Figure BDA0000101615010000131
Figure BDA0000101615010000141
Figure BDA0000101615010000151
Figure BDA0000101615010000171
For making stable being combined on the slide of synthetic probe, adopt conventional method to carry out glycosyl modified at 5 ' end of synthetic rear probe.
The point system of chip: first the surface of slide is carried out alkylation and modify, to improve binding ability.Adopt conventional chip point sample method to carry out point sample, in order to detect the repeatability of cross experiment, each probe is at slide point 3-6 hybridization point.
Embodiment 4, test kit preparation
The Chip Packaging of preparation among the embodiment 3 is good, place a box with working instructions, consist of test kit.
The detection of embodiment 5, chip
To obtain from hospital 20 routine primary hepatocarcinoma the recurrent cancer tissue samples is arranged and without the recurrent cancer tissue samples, press the preparation of embodiment 1 method for the RNA sample that detects, then with the chip (including the probe shown in the SEQ ID NO:1-42 in the table 2 on the chip) of preparation among the embodiment 3, detect with double-blind method.According to the existence of 42 kinds of genes shown in the table 1 whether and the upper downward modulation situation that is in harmonious proportion judge sample to be tested.Wherein, positive control and negative control are respectively known liver cancer the recurrence sample are arranged and without the recurrence sample.
The result shows, the sorter that is consisted of by 42 species specific genes, its accuracy can reach 82%, and the predictablity rate of unknown sample is reached 75% (comparing the accuracy height with the recurrence that hospital is indicated), whether can very effectively distinguish the recurrence of liver cancer.
All quote in this application as a reference at all documents that the present invention mentions, just as each piece document is quoted separately as a reference.Should be understood that in addition those skilled in the art can make various changes or modifications the present invention after having read above-mentioned teachings of the present invention, these equivalent form of values fall within the application's appended claims limited range equally.
Figure IDA0000101615080000011
Figure IDA0000101615080000021
Figure IDA0000101615080000031
Figure IDA0000101615080000041
Figure IDA0000101615080000051
Figure IDA0000101615080000061
Figure IDA0000101615080000071
Figure IDA0000101615080000091
Figure IDA0000101615080000101

Claims (10)

1. a polynucleotide collection is characterized in that, described polynucleotide collection comprises the polynucleotide of the following albumen of encoding:
Sulfo group glucosamine sulfhydrase; microtubulin fold cofactor D; Tyrosylprotein kinase 2; the adenosine deaminase that RNA is special; the ubiquitin sample is modified activating enzymes; Wd duplicate domain 13; Gamma glutamyl lytic enzyme; micronuclear ribonucleoprotein 40kDa; Protein Tyrosine Phosphatases 4A1; studies of interleukin-18 binding protein; coiled coil territory 92, closed protein 1, KIAA0831 albumen; glutathione-S-transferase 2; solubility transporter family 11 member 2 albumen, CD99 quasi-molecule 2, hydroxyalkyl coa dehydrogenase beta; coenzyme desaturase flavoprotein 2; ubiquitin protein ligase E3 assembly n recognin 4, the MLX interaction protein is with the acceptor 6 of the G albumen coupling that contains leucine enrichment repeat region; zinc finger protein 692; the special homologous protein of transcribing of mesoderm, inositol 1,3; 4-triphosphate 5/6 kinases; the CD5 quasi-molecule, talin 1, heat shock protein(HSP) 40 homology subfamily C members 3; with fruit bat FZ homology 5; Coxsackie virus and adenovirus receptor, angiogenesis factor sample albumen 3, microtubule associating albumen 1 light chain 3alpha; 3-hydroxy-3-methyl glutaryl-coenzyme A synthetase 1; the neurotrophic factor in neurone source, methylmalonic aciduria cblC type, the island purine suppresses polypeptide 2 in conjunction with the activity of albumen alpha; No. 22 chromosomal No. 32 open reading frame; nucleoporin 88kDa, hypertensin 2 receptor-like protein 2, sterol-C4-methyl oxidation enzyme sample albumen; the target proteins of Myb1 sample albumen 2, arylacetamide deacetylase and ubiquitin fold modified protein 1.
2. polynucleotide collection as claimed in claim 1; it is characterized in that; the described sulfo group glucosamine sulfhydrase of encoding; microtubulin fold cofactor D; Tyrosylprotein kinase 2; the adenosine deaminase that RNA is special; the ubiquitin sample is modified activating enzymes; Wd duplicate domain 13; Gamma glutamyl lytic enzyme; micronuclear ribonucleoprotein 40kDa; Protein Tyrosine Phosphatases 4A1; studies of interleukin-18 binding protein; coiled coil territory 92; closed protein 1; KIAA0831 albumen; glutathione-S-transferase 2; solubility transporter family 11 member 2 albumen; CD99 quasi-molecule 2; hydroxyalkyl coa dehydrogenase beta; coenzyme desaturase flavoprotein 2; ubiquitin protein ligase E3 assembly n recognin 4; the MLX interaction protein; acceptor 6 with the G albumen coupling that contains leucine enrichment repeat region; zinc finger protein 692; the special homologous protein of transcribing of mesoderm; inositol 1; 3,4-triphosphate, 5/6 kinases, the CD5 quasi-molecule; talin 1; heat shock protein(HSP) 40 homology subfamily C members 3 are with fruit bat FZ homology 5, Coxsackie virus and adenovirus receptor; angiogenesis factor sample albumen 3; microtubule associating albumen 1 light chain 3alpha, 3-hydroxy-3-methyl glutaryl-coenzyme A synthetase 1, the neurotrophic factor in neurone source; methylmalonic aciduria cblC type; the island purine suppresses 2, No. 22 chromosomal No. 32 open reading frame of polypeptide, nucleoporin 88kDa in conjunction with the activity of albumen alpha; hypertensin 2 receptor-like protein 2; sterol-C4-methyl oxidation enzyme sample albumen, the target proteins of Myb1 sample albumen 2, the polynucleotide of the folding modified protein 1 of arylacetamide deacetylase and ubiquitin are respectively: sequence such as GenBank accession number NM_000199.3; NM_005993.4; NM_003331.4, NM_001111.3, NM_003334.3; NM_017883.3; NM_003878.2, NM_004814.2, NM_003463.3; NM_173042.2; NM_025140.1, NM_021101.3, NM_014924.3; NM_000854.3; NM_000617.2, NM_031462.2, NM_000183.2; NM_021074.3; NM_020765.2, NM_014938.3, NM_001017403.1; NM_001136036.1; NM_002402.2, NM_014216.4, NM_005894.2; NM_006289.3; NM_006260.3, NM_003468.3, NM_001338.3; NM_014495.2; NM_032514.2, NM_001098272.1, NM_013349.4; NM_015506.2; NM_002070.2, NM_033318.3, NM_002532.3; NM_005162.2; NM_006745.3, NM_001033551.2, the polynucleotide shown in NM_001086.2 and the NM_016617.2.
3. the purposes of claim 1 or 2 described polynucleotide collection is characterized in that, for the preparation of detection reagent or the test kit of the recurrence of measuring primary hepatocarcinoma or recurrence potentiality.
4. purposes as claimed in claim 3 is characterized in that, described reagent is the primer pair that specific amplification claim 1 or 2 described polynucleotide are concentrated each polynucleotide; Or the probe of specific recognition claim 1 or concentrated each polynucleotide of 2 described polynucleotide.
5. one kind is used for measuring the recurrence of primary hepatocarcinoma or the reagent of recurrence potentiality, and it is specific recognition sequence such as GenBank accession number NM_000199.3, NM_005993.4, NM_003331.4, NM_001111.3, NM_003334.3, NM_017883.3, NM_003878.2, NM_004814.2, NM_003463.3, NM_173042.2, NM_025140.1, NM_021101.3, NM_014924.3, NM_000854.3, NM_000617.2, NM_031462.2, NM_000183.2, NM_021074.3, NM_020765.2, NM_014938.3, NM_001017403.1, NM_001136036.1, NM_002402.2, NM_014216.4, NM_005894.2, NM_006289.3, NM_006260.3, NM_003468.3, NM_001338.3, NM_014495.2, NM_032514.2, NM_001098272.1, NM_013349.4, NM_015506.2, NM_002070.2, NM_033318.3, NM_002532.3, NM_005162.2, NM_006745.3, NM_001033551.2, the probe of the polynucleotide shown in NM_001086.2 or the NM_016617.2, the nucleotide sequence of described probe is shown in SEQ ID NO:1-42 is arbitrary.
6. a gene chip is characterized in that, described gene chip comprises:
Solid phase carrier; And
Be fixed in order the probe on the described solid phase carrier, described probe specificity ground is corresponding to sequence such as GenBank accession number NM_000199.3, NM_005993.4, NM_003331.4, NM_001111.3, NM_003334.3, NM_017883.3, NM_003878.2, NM_004814.2, NM_003463.3, NM_173042.2, NM_025140.1, NM_021101.3, NM_014924.3, NM_000854.3, NM_000617.2, NM_031462.2, NM_000183.2, NM_021074.3, NM_020765.2, NM_014938.3, NM_001017403.1, NM_001136036.1, NM_002402.2, NM_014216.4, NM_005894.2, NM_006289.3, NM_006260.3, NM_003468.3, NM_001338.3, NM_014495.2, NM_032514.2, NM_001098272.1, NM_013349.4, NM_015506.2, NM_002070.2, NM_033318.3, NM_002532.3, NM_005162.2, NM_006745.3, NM_001033551.2, the polynucleotide shown in NM_001086.2 or the NM_016617.2.
7. gene chip as claimed in claim 6 is characterized in that, the nucleotide sequence of described probe is shown in SEQ ID NO:1-42 is arbitrary.
8. the purposes of claim 6 or 7 described gene chips is for the preparation of the test kit of the recurrence of measuring primary hepatocarcinoma or recurrence potentiality.
9. measure the recurrence of primary hepatocarcinoma or the test kit of recurrence potentiality for one kind, it is characterized in that, contain claim 6 or 7 described gene chips in the described test kit.
10. test kit as claimed in claim 9 is characterized in that, also comprises in the described test kit: nucleic acid extract, nitrite ion or hybridization solution.
CN2011103263819A 2011-10-24 2011-10-24 Genetic markers for detecting recurrence potential of primary liver cancer Pending CN103060348A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2011103263819A CN103060348A (en) 2011-10-24 2011-10-24 Genetic markers for detecting recurrence potential of primary liver cancer

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2011103263819A CN103060348A (en) 2011-10-24 2011-10-24 Genetic markers for detecting recurrence potential of primary liver cancer

Publications (1)

Publication Number Publication Date
CN103060348A true CN103060348A (en) 2013-04-24

Family

ID=48103244

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2011103263819A Pending CN103060348A (en) 2011-10-24 2011-10-24 Genetic markers for detecting recurrence potential of primary liver cancer

Country Status (1)

Country Link
CN (1) CN103060348A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106995857A (en) * 2017-06-01 2017-08-01 北京泱深生物信息技术有限公司 Applications of the biomarker ENSG00000267416 in cancer
CN108853499A (en) * 2018-07-12 2018-11-23 苏州大学 Containing the hypoxic tumor cells radiosensitizer for striking low cZNF292 shRNA sequence
CN109801680A (en) * 2018-12-03 2019-05-24 广州中医药大学(广州中医药研究院) Tumour metastasis and recurrence prediction technique and system based on TCGA database

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101429552A (en) * 2007-11-30 2009-05-13 芮屈生物技术(上海)有限公司 Detection kit for early diagnosis, transfer and recrudescence (HCCR and RhoGDI2) gene of cancer, correlated detection method and uses thereof
CN101812507A (en) * 2009-12-04 2010-08-25 复旦大学附属中山医院 Gene chip for predicting liver cancer metastasis and recurrence risk and manufacturing method and using method thereof
CN102002490A (en) * 2009-08-31 2011-04-06 上海市肿瘤研究所 9 microRNA markers for predicting whether primary liver cancer relapses
CN102002491A (en) * 2009-08-31 2011-04-06 上海市肿瘤研究所 11 microRNA markers for distinguishing primary liver cancer tissues and para-cancer tissues

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101429552A (en) * 2007-11-30 2009-05-13 芮屈生物技术(上海)有限公司 Detection kit for early diagnosis, transfer and recrudescence (HCCR and RhoGDI2) gene of cancer, correlated detection method and uses thereof
CN102002490A (en) * 2009-08-31 2011-04-06 上海市肿瘤研究所 9 microRNA markers for predicting whether primary liver cancer relapses
CN102002491A (en) * 2009-08-31 2011-04-06 上海市肿瘤研究所 11 microRNA markers for distinguishing primary liver cancer tissues and para-cancer tissues
CN101812507A (en) * 2009-12-04 2010-08-25 复旦大学附属中山医院 Gene chip for predicting liver cancer metastasis and recurrence risk and manufacturing method and using method thereof

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106995857A (en) * 2017-06-01 2017-08-01 北京泱深生物信息技术有限公司 Applications of the biomarker ENSG00000267416 in cancer
CN108853499A (en) * 2018-07-12 2018-11-23 苏州大学 Containing the hypoxic tumor cells radiosensitizer for striking low cZNF292 shRNA sequence
CN109801680A (en) * 2018-12-03 2019-05-24 广州中医药大学(广州中医药研究院) Tumour metastasis and recurrence prediction technique and system based on TCGA database
CN109801680B (en) * 2018-12-03 2023-02-28 广州中医药大学(广州中医药研究院) Tumor metastasis and recurrence prediction method and system based on TCGA database

Similar Documents

Publication Publication Date Title
CN104937111B (en) For diagnosing the composition and method of thyroid tumors
Lu et al. Common human cancer genes discovered by integrated gene-expression analysis
ES2494843T3 (en) Methods and materials to identify the origin of a carcinoma of unknown primary origin
JP7452894B2 (en) Method for evaluating liver cancer prognosis or risk using gene CPG methylation changes
CN105431737B (en) System for predicting locally advanced gastric cancer prognosis
CN105087568B (en) One group of gene and its application for tumor cells parting
ES2647154T3 (en) Biomarker combinations for colorectal tumors
CN105102636B (en) For detecting and measuring the composition and method of prostate cancer prognosis
CN101454668A (en) Methods for prediction and prognosis of cancer, and monitoring cancer therapy
Norris et al. Genetic landscape of prostate cancer conspicuity on multiparametric magnetic resonance imaging: a systematic review and bioinformatic analysis
CN101988059B (en) Gastric cancer detection marker and detecting method thereof, kit and biochip
WO2012158780A2 (en) Lung cancer signature
CN105861724B (en) A kind of KRAS gene ultralow frequency mutation detection kit
CN102002490A (en) 9 microRNA markers for predicting whether primary liver cancer relapses
CN108977544A (en) For identifying kit and its application of gastric cancer and/or polyp of stomach
KR20170065027A (en) Early lung cancer detection by dna methylation phenotyping of sputum-derived cells
CN110229899B (en) Plasma marker combinations for early diagnosis or prognosis prediction of colorectal cancer
Guo et al. Screening and identification of specific markers for bladder transitional cell carcinoma from urine urothelial cells with suppressive subtractive hybridization and cDNA microarray
US20190024184A1 (en) Distinguishing metastatic-lethal prostate cancer from indolent prostate cancer using methylation status of epigenetic markers
CN103060348A (en) Genetic markers for detecting recurrence potential of primary liver cancer
CN102851283B (en) MicroRNA markers for discriminating metastatic and non-metastatic squamous cell lung carcinoma
CN103060351A (en) Gene marker for detecting cancer tissue and para-carcinoma tissue of primary liver cancer
JP2016086678A (en) Laboratory marker and laboratory method of malignancy of renal cancer
CN102159728A (en) Breast cancer metastasis determination method and blood serum evaluation method
CN103060312A (en) Gene marker of predicting primary hepatic carcinoma metastatic potential

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20130424