CN103390119B - A kind of Binding site for transcription factor recognition methods - Google Patents

A kind of Binding site for transcription factor recognition methods Download PDF

Info

Publication number
CN103390119B
CN103390119B CN201310277169.7A CN201310277169A CN103390119B CN 103390119 B CN103390119 B CN 103390119B CN 201310277169 A CN201310277169 A CN 201310277169A CN 103390119 B CN103390119 B CN 103390119B
Authority
CN
China
Prior art keywords
chip
binding site
base fragment
transcription factor
random field
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201310277169.7A
Other languages
Chinese (zh)
Other versions
CN103390119A (en
Inventor
冯伟兴
董彦生
贺波
陈若雷
王科俊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Harbin Engineering University
Original Assignee
Harbin Engineering University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Harbin Engineering University filed Critical Harbin Engineering University
Priority to CN201310277169.7A priority Critical patent/CN103390119B/en
Publication of CN103390119A publication Critical patent/CN103390119A/en
Application granted granted Critical
Publication of CN103390119B publication Critical patent/CN103390119B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

The invention belongs to molecular biosciences infomation detection field, be specifically related to a kind of based on condition random field technology, merge ChIP-chip microarray data and ChIP-seq? the Binding site for transcription factor recognition methods of DNA sequencing data.The present invention includes: set up the condition random field models; Obtain ChIP-chip laboratory values, identify corresponding states value; Obtain ChIP-seq laboratory values and identify corresponding states value; The accuracy of identification of test condition random field models; N-th DNA base fragment identification probability of Weighted Fusion recognition result; Relatively with identify Binding site for transcription factor.The present invention utilizes condition random field to merge the detection data identification Binding site for transcription factor of ChIP-chip and ChIP-seq experiment.Verify by experiment, in recognition accuracy, the inventive method is higher than the recognition methods adopting monotechnics.

Description

A kind of Binding site for transcription factor recognition methods
Technical field
The invention belongs to molecular biosciences infomation detection field, be specifically related to a kind of based on condition random field technology, merge the Binding site for transcription factor recognition methods of ChIP-chip microarray data and ChIP-seqDNA sequencing data.
Background technology
The functional protein that transcription factor is transcribed as controlling gene, the identification of its binding site on DNA gene promoter region plays great role for the research of gene transcription regulation mechanism, and it is the Focal point and difficult point in life science.Along with the progress of information science experimental technique, also make rapid progress for the laboratory facilities obtaining molecular biosciences information.Wherein, the standard test detecting region of DNA territory specified protein bonding state within the scope of full-length genome is chromatin imrnunoprecipitation reaction (Chromatinimmunoprecipitaion, ChIP), chromatin imrnunoprecipitation reaction (ChIP) ChIP-chip and the ChIP-seq technology produced that combines with genetic chip or DNA sequencing two kinds of detection techniques is respectively then detect two large major techniques of chromatin imrnunoprecipitation reaction experiment result.
Due to the difference of Cleaning Principle, the experimental data that ChIP-chip and ChIP-seq experimental technique produces also exists respective relative merits.Genetic chip directly measures biological information, and its susceptibility is higher, but compared to high flux DNA sequencing, its measuring accuracy is not high, and test specificity is lower; High flux DNA sequencing then has higher detection resolution, and its measuring accuracy is higher, and test specificity is better, but owing to being carry out indirect inspection to biological information, its susceptibility is lower.Visible, although the experimental data obtained respectively by the experimental approach that ChIP-chip with ChIP-seq is different has relative merits different separately, but have obvious complementarity, therefore, the detection data merging these two kinds of technology acquisitions can obtain more complete Detection Information.
As the pattern recognition problem for one-dimensional sequence, analyze the binding site whether current dna base fragment is specific function albumen, not only to consider whether the biochemical characteristic of current base fragment is applicable to specific function protein combination, also needs to consider that the biochemical characteristic of the multiple base fragment in its both sides is on the impact of specific function protein binding capacity.Specific to the identification problem of Binding site for transcription factor, should consider in identifying exactly simultaneously binding site base fragment to be identified and near the characteristic of multiple base fragment.Numerous in the mode identification technology of one-dimensional sequence, due to condition random field (ConditionalRandomFields, CRF) technology has the directionless constraint of feature selecting, and feature locations supports the characteristics such as long-range correlation, makes it be applicable to very much the identification of Binding site for transcription factor.
Summary of the invention
The object of this invention is to provide a kind of more high-precision based on the fusion chip data of condition random field and the Binding site for transcription factor recognition methods of DNA sequencing data.
The object of the present invention is achieved like this:
(1) set up the condition random field models:
p ( y | x ) = 1 Z ( x ) exp ( Σ i , k λ k t k ( y i - 1 , y i , x , i ) + Σ i , l μ l s l ( y i , x , i ) )
Z ( x ) = Σ y exp ( Σ i , k λ k t k ( y i - 1 , y i , x , i ) + Σ i , l μ l s l ( y i , x , i ) )
Wherein, x={x 1, x 2..., x nrepresent the laboratory values of DNA base fragment; Y={y 1, y 2..., y nthe corresponding states value of DNA base fragment, 1 represents it is Binding site for transcription factor, and 0 represents it is not Binding site for transcription factor; t k(y i-1, y i, x, i) and represent that i-th base fragment is under current experiment detected value sequence x, state is y ii-th base fragment and state be y i-1the i-th-1 base fragment between transfer characteristic function; s l(y i, x, i) represent under current experiment detected value sequence x, the state of i-th base fragment is y istatus flag function; λ kand μ lt respectively k(y i-1, y i, x, i) and s l(y i, x, i) and corresponding weights, represent the importance of each fundamental function; Z (x) is standardizing factor, makes p (y|x) be positioned between [0,1];
(2) the ChIP-chip laboratory values x={x of DNA base fragment is obtained 1, x 2..., x n, according to conditional random field models, identify the state value y={y of corresponding DNA base fragment 1, y 2..., y n;
(3) the ChIP-seq laboratory values x={x of DNA base fragment is obtained 1, x 2..., x n, according to conditional random field models, identify the state value y={y of corresponding DNA base fragment 1, y 2..., y n;
(3) accuracy of identification of test condition random field models:
S n = TP TP + FN ,
S P = TN TN + FP ,
A c = S n + S p 2 ,
Wherein, susceptibility S n, specificity S p, accuracy rate A c, TP represents the predicted correct number of Binding site for transcription factor; FN represents the number of the predicted mistake of Binding site for transcription factor; TN represents the predicted correct number of non-transcribed factor binding site; FP represents the number of the predicted mistake of non-transcribed factor binding site;
(4) to the n-th DNA base fragment, the probability being identified as Binding site for transcription factor by ChIP-chip laboratory values is used represent, the probability being identified as non-transcribed factor binding site is used represent, the probability being identified as Binding site for transcription factor by ChIP-seq laboratory values is used represent, the probability being identified as non-transcribed factor binding site is used represent, the n-th DNA base fragment identification probability of Weighted Fusion recognition result is expressed as:
p 1 ( n ) = p lchip ( n ) × w 1 + p 1 seq ( n ) × w 2 ,
p 0 ( n ) = p 0 chip ( n ) × w 1 + p 0 seq ( n ) × w 2 ,
Blending weight w1 and w2 is the recognition accuracy with recognition result, w 1+ w 2=1;
(5) compare with then this base fragment is identified as Binding site for transcription factor; then this base fragment is identified as the non-binding site of transcription factor.
Beneficial effect of the present invention is:
The present invention utilizes condition random field to merge the detection data identification Binding site for transcription factor of ChIP-chip and ChIP-seq experiment.Verify by experiment, in recognition accuracy, the inventive method is higher than the recognition methods adopting monotechnics.
Accompanying drawing explanation
Fig. 1 is Binding site for transcription factor recognition methods process flow diagram of the present invention;
Fig. 2 is the training process flow diagram of conditional random field models.
Embodiment
Below in conjunction with accompanying drawing, the present invention is described further:
Based on the fusion chip data of condition random field and the Transcription Factor Binding Sites Prediction method of DNA sequencing data, comprise the following steps:
(1) adopt condition random field recognition technology to adopt ChIP-chip microarray data, Binding site for transcription factor is tentatively identified;
(2) adopt condition random field recognition technology to adopt ChIP-seqDNA sequencing data, Binding site for transcription factor is tentatively identified;
(3) adopt the mode of decision level fusion, Binding site for transcription factor is finally identified.Namely to the base fragment of same position, Weighted Fusion respectively from the preliminary recognition result of Binding site for transcription factor of ChIP-chip microarray data and ChIP-seqDNA sequencing data, and draws final recognition result.Wherein, blending weight is directly proportional to the recognition accuracy of preliminary recognition result, and two blending weights and be 1.
The representation of the condition random field that this method adopts is as follows:
p ( y | x ) = 1 Z ( x ) exp ( Σ i , k λ k t k ( y i - 1 , y i , x , i ) + Σ i , l μ l s l ( y i , x , i ) )
Z ( x ) = Σ y exp ( Σ i , k λ k t k ( y i - 1 , y i , x , i ) + Σ i , l μ l s l ( y i , x , i ) )
Wherein, x={x 1, x 2..., x nrepresent the laboratory values of DNA base fragment; Y={y 1, y 2..., y nthe corresponding states value of DNA base fragment, only have two states value here, 1 represents it is Binding site for transcription factor, and 0 represents it is not Binding site for transcription factor; t k(y i-1, y i, x, i) and represent that i-th base fragment is under current experiment detected value sequence x, state is y ii-th base fragment and state be y i-1the i-th-1 base fragment between transfer characteristic function; s l(y i, x, i) represent under current experiment detected value sequence x, the state of i-th base fragment is y istatus flag function; λ kand μ lt respectively k(y i-1, y i, x, i) and s l(y i, x, i) and corresponding weights, represent the importance of each fundamental function; Z (x) is standardizing factor, and p (y|x) is positioned between [0,1].Condition random field fundamental function selected by the present invention comprises status flag function and transfer characteristic function.Wherein, status flag selection window is 1, namely in conjunction with the detected value constitutive characteristic of each base fragment of current base fragment and front and back.It is 1 that transfer characteristic is simply taken as adjacent segment characterizations, otherwise is 0.Fundamental function then elect as each feature self, quadratic sum evolution, finally decided the importance of each fundamental function by weight.Adopt above-mentioned model, when obtaining the laboratory values x={x of DNA base fragment 1, x 2..., x nafter, the state value y={y of corresponding DNA base fragment can be identified 1, y 2..., y n, for judging whether each base fragment is Binding site for transcription factor.
The inventive method embodiment is as shown in Figure 1, specific as follows:
1. the preliminary recognition result of base fragment ChIP-chip data CRF model Binding site for transcription factor
The condition random field CRF modelling that base fragment ChIP-chip data transcription factor binding site tentatively identifies is as follows:
p ( y | x ) = 1 Z ( x ) exp ( Σ i , k λ k t k ( y i - 1 , y i , x , i ) + Σ i , l μ l s l ( y i , x , i ) )
Z ( x ) = Σ y exp ( Σ i , k λ k t k ( y i - 1 , y i , x , i ) + Σ i , l μ l s l ( y i , x , i ) )
Wherein, x={x 1, x 2..., x nrepresent the ChIP-chip laboratory values of DNA base fragment; Y={y 1, y 2..., y nthe corresponding states value of DNA base fragment, only have two states value here, 1 represents it is Binding site for transcription factor, and 0 represents it is not Binding site for transcription factor; t k(y i-1, y i, x, i) and represent that i-th base fragment is under current experiment detected value sequence x, state is y ii-th base fragment and state be y i-1the i-th-1 base fragment between transfer characteristic function; s l(y i, x, i) represent under current experiment detected value sequence x, the state of i-th base fragment is y istatus flag function; λ kand μ lt respectively k(y i-1, y i, x, i) and s l(y i, x, i) and corresponding weights, represent the importance of each fundamental function; Z (x) is standardizing factor, and p (y|x) is positioned between [0,1].Condition random field fundamental function selected by the present invention comprises status flag function and transfer characteristic function.Wherein, status flag selection window is 1, namely in conjunction with the detected value constitutive characteristic of each base fragment of current base fragment and front and back.It is 1 that transfer characteristic is simply taken as adjacent segment characterizations, otherwise is 0.Fundamental function then elect as each characteristic self, quadratic sum evolution, finally decided the importance of each fundamental function by weight.
Adopt above-mentioned model, when obtaining the ChIP-chip laboratory values x={x of DNA base fragment 1, x 2..., x nafter, the state value y={y of corresponding DNA base fragment can be identified 1, y 2..., y n, as the preliminary recognition result of each base fragment Binding site for transcription factor.
2. the preliminary recognition result of base fragment ChIP-seq data CRF model Binding site for transcription factor
The condition random field CRF modelling that base fragment ChIP-seq data transcription factor binding site tentatively identifies is as follows:
p ( y | x ) = 1 Z ( x ) exp ( Σ i , k λ k t k ( y i - 1 , y i , x , i ) + Σ i , l μ l s l ( y i , x , i ) )
Z ( x ) = Σ y exp ( Σ i , k λ k t k ( y i - 1 , y i , x , i ) + Σ i , l μ l s l ( y i , x , i ) )
Wherein, x={x 1, x 2..., x nrepresent the ChIP-seq laboratory values of DNA base fragment; Y={y 1, y 2..., y nthe corresponding states value of DNA base fragment, only have two states value here, 1 represents it is Binding site for transcription factor, and 0 represents it is not Binding site for transcription factor; t k(y i-1, y i, x, i) and represent that i-th base fragment is under current experiment detected value sequence x, state is y ii-th base fragment and state be y i-1the i-th-1 base fragment between transfer characteristic function; s l(y i, x, i) represent under current experiment detected value sequence x, the state of i-th base fragment is y istatus flag function; λ kand μ lt respectively k(y i-1, y i, x, i) and s l(y i, x, i) and corresponding weights, represent the importance of each fundamental function; Z (x) is standardizing factor, and p (y|x) is positioned between [0,1].Condition random field fundamental function selected by the present invention comprises status flag function and transfer characteristic function.Wherein, status flag selection window is 1, namely in conjunction with the detected value constitutive characteristic of each base fragment of current base fragment and front and back.It is 1 that transfer characteristic is simply taken as adjacent segment characterizations, otherwise is 0.Fundamental function then elect as each characteristic self, quadratic sum evolution, finally decided the importance of each fundamental function by weight.
Adopt above-mentioned model, when obtaining the ChIP-seq laboratory values x={x of DNA base fragment 1, x 2..., x nafter, the state value y={y of corresponding DNA base fragment can be identified 1, y 2..., y n, as the preliminary recognition result of each base fragment Binding site for transcription factor.
3. the fusion recognition result of Binding site for transcription factor
The present invention adopts the mode of decision level fusion to obtain final Binding site for transcription factor recognition result.Namely to the base fragment of same position, the mode of employing weighting merges the preliminary recognition result of base fragment CRF model Binding site for transcription factor respectively from ChIP-chip microarray data and ChIP-seqDNA sequencing data, and draws final Binding site for transcription factor recognition result.Be specially:
First, adopt the mode of cross validation, the accuracy of identification of condition random field training pattern is tested.Wherein, test index adopts susceptibility S n(Sensitivity), specificity S p(Specificity), accuracy rate A c, they are defined as follows:
S n = TP TP + FN - - - ( 1 )
S P = TN TN + FP - - - ( 2 )
A c = S n + S p 2 - - - ( 3 )
In formula, TP represents the predicted correct number of Binding site for transcription factor; FN represents the number of the predicted mistake of Binding site for transcription factor; TN represents the predicted correct number of non-transcribed factor binding site; FP represents the number of the predicted mistake of non-transcribed factor binding site.The accuracy rate A of condition random field training pattern casking for of blending weight will be used for.
Secondly, for the n-th DNA base fragment, the probability being tentatively identified as Binding site for transcription factor based on ChIP-chip data is used represent, the probability being tentatively identified as non-transcribed factor binding site is used represent; The probability being tentatively identified as Binding site for transcription factor based on ChIP-seq data is used represent, the probability being tentatively identified as non-transcribed factor binding site is used represent.This base fragment identification probability then merging above-mentioned preliminary recognition result is expressed as:
p 1 ( n ) = p lchip ( n ) × w 1 + p 1 seq ( n ) × w 2 - - - ( 4 )
p 0 ( n ) = p 0 chip ( n ) × w 1 + p 0 seq ( n ) × w 2 ,
In formula, blending weight w 1and w 2be designed to the recognition accuracy with preliminary recognition result, i.e. the accuracy rate A of condition random field training pattern cbe directly proportional, and two blending weights and be 1.
Finally, compare with if then this base fragment is identified as Binding site for transcription factor; If then this base fragment is identified as the non-binding site of transcription factor.
4. experimental verification
4.1 data prediction
4.1.1ChIP-chip data prediction
The ChIP-chip microarray data that this experiment adopts derives from international large-scale public biomolecule information database GEO database (GSE6892), the bonding state of transcription factor STAT1 on this Data Detection human body 44 genes.Each data measured is that 50bp is long, but owing to have employed tile test format, exists and partly overlap between adjacent two data measureds.We again split data and combine, the base fragment be connected of the final 38bp of acquisition, and calculate the ChIP-chip detected value in each fragment.
4.1.2ChIP-seq data prediction
The ChIP-seq sequencing data that this experiment adopts derives from BCGSC database, is also the detection for human transcription factor STAT1 bonding state.When carrying out pretreated to the data of ChIP-seq, first raw data is converted to fastq form, then comparison is passed through, data measured is mapped in human genome, and calculate the data measured degree of covering of each base position on DNA, finally, the DNA base fragment that corresponding ChIP-chip data obtain, calculates the ChIP-seq data measured degree of covering value of each base fragment.
4.2 conditional random field models training
The fundamental function of conditional random field models comprises status flag function and transfer characteristic function.Wherein, status flag selection window is 1, namely in conjunction with the detected value constitutive characteristic of each base fragment of current base fragment and front and back.It is 1 that transfer characteristic is simply taken as adjacent segment characterizations, otherwise is 0.Fundamental function then elect as each feature self, quadratic sum evolution.Utilize STAT1 bonding state data on known DNA base fragment, train conditional random field models, training process flow diagram as shown in Figure 2.
The fusion of 4.3 conditional random field models recognition results
First, adopt the mode of 5 cross validations, the accuracy of identification of condition random field training pattern is tested.Test result is as shown in Table 1 and Table 2:
Table 1ChIP-chip data cross the result
Table 2ChIP-seq data cross the result
In table 1, the Average Accuracy recording the CRF model adopting the training of ChIP-chip data with five cross-validation methods is 57.68%., in table 2, the Average Accuracy recording the CRF model adopting the training of ChIP-seq data with five cross-validation methods is 63.76%.
Then, according to based on the CRF model training accuracy rate calculating blending weight of ChIP-chip data and ChIP-seq data being respectively: w1=0.475, w2=0.525.And to calculate the n-th DNA base fragment be accordingly the probability of Binding site for transcription factor with the probability for non-transcribed factor binding site
Finally, compare with if then the n-th DNA base fragment is identified as the binding site of transcription factor STAT1; If then the n-th DNA base fragment is identified as the non-binding site of transcription factor STAT1.Adopt the accuracy test of five cross-validation methods to fusion results as shown in table 3:
The cross validation results that table 3 merges
Visible, the Average Accuracy merging the recognition result that ChIP-chip data and ChIP-seq data obtain is 0.7697, significantly higher than the Average Accuracy being used alone the recognition result that ChIP-chip data and ChIP-seq data obtain.

Claims (1)

1. a Binding site for transcription factor recognition methods, is characterized in that:
(1) set up the condition random field models:
Wherein, x={x 1, x 2..., x nrepresent the laboratory values of DNA base fragment; Y={y 1, y 2..., y nthe corresponding states value of DNA base fragment, 1 represents it is Binding site for transcription factor, and 0 represents it is not Binding site for transcription factor; t k(y i-1, y i, x, i) and represent that i-th base fragment is under current experiment detected value sequence x, state is y ii-th base fragment and state be y i-1the i-th-1 base fragment between transfer characteristic function; s l(y i, x, i) represent under current experiment detected value sequence x, the state of i-th base fragment is the status flag function of yi; λ kand μ lt respectively k(y i-1, y i, x, i) and s l(y i, x, i) and corresponding weights, represent the importance of each fundamental function; Z (x) is standardizing factor, makes p (y|x) be positioned between [0,1];
(2) the ChIP-chip laboratory values x={x of DNA base fragment is obtained 1, x 2..., x n, according to conditional random field models, identify the state value y={y of corresponding DNA base fragment 1, y 2..., y n;
(3) the ChIP-seq laboratory values x={x of DNA base fragment is obtained 1, x 2..., x n, according to conditional random field models, identify the state value y={y of corresponding DNA base fragment 1, y 2..., y n;
(4) accuracy of identification of test condition random field models:
Wherein, susceptibility S n, specificity S p, accuracy rate A c, TP represents the predicted correct number of Binding site for transcription factor; FN represents the number of the predicted mistake of Binding site for transcription factor; TN represents the predicted correct number of non-transcribed factor binding site; FP represents the number of the predicted mistake of non-transcribed factor binding site;
(5) to the n-th DNA base fragment, the probability being identified as Binding site for transcription factor by ChIP-chip laboratory values is used represent, the probability being identified as non-transcribed factor binding site is used represent, the probability being identified as Binding site for transcription factor by ChIP-seq laboratory values is used represent, the probability being identified as non-transcribed factor binding site is used represent, the n-th DNA base fragment identification probability of Weighted Fusion recognition result is expressed as:
Blending weight w 1and w 2for the recognition accuracy with recognition result, w 1+ w 2=1;
(6) compare with then this base fragment is identified as Binding site for transcription factor; then this base fragment is identified as the non-binding site of transcription factor.
CN201310277169.7A 2013-07-03 2013-07-03 A kind of Binding site for transcription factor recognition methods Expired - Fee Related CN103390119B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310277169.7A CN103390119B (en) 2013-07-03 2013-07-03 A kind of Binding site for transcription factor recognition methods

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310277169.7A CN103390119B (en) 2013-07-03 2013-07-03 A kind of Binding site for transcription factor recognition methods

Publications (2)

Publication Number Publication Date
CN103390119A CN103390119A (en) 2013-11-13
CN103390119B true CN103390119B (en) 2016-01-27

Family

ID=49534388

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310277169.7A Expired - Fee Related CN103390119B (en) 2013-07-03 2013-07-03 A kind of Binding site for transcription factor recognition methods

Country Status (1)

Country Link
CN (1) CN103390119B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103898113A (en) * 2014-03-11 2014-07-02 北京理工大学 Method for adjusting promoter intensity by using transcription factor binding site
CN106650313B (en) * 2016-09-29 2019-10-18 哈尔滨工程大学 A method of it filtering out DNA base in DNase high-flux sequence data and is inclined to sexual deviation
CN108154008A (en) * 2017-12-25 2018-06-12 上海嘉因生物科技有限公司 Detection method applied to Binding site for transcription factor on chromosome in tissue samples
CN108733977A (en) * 2018-05-31 2018-11-02 中国人民解放军军事科学院军事医学研究院 Eucaryote guards the recognition methods and application of Binding site for transcription factor accumulation regions TFCR
CN110335639B (en) * 2019-06-13 2021-10-29 哈尔滨工业大学(深圳) Transcription factor binding site prediction algorithm and device of trans-transcription factor
CN111243674B (en) * 2020-01-08 2023-07-04 华南理工大学 Base sequence identification method, device and storage medium
CN113066527B (en) * 2021-04-14 2024-02-09 吉优诺(上海)基因科技有限公司 Target prediction method and system for siRNA knockdown mRNA

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1777686A (en) * 2003-03-28 2006-05-24 科根泰克股份有限公司 Statistical analysis of regulatory factor binding sites of differentially expressed genes

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020037519A1 (en) * 2000-05-11 2002-03-28 States David J. Identifying clusters of transcription factor binding sites

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1777686A (en) * 2003-03-28 2006-05-24 科根泰克股份有限公司 Statistical analysis of regulatory factor binding sites of differentially expressed genes

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
CTF: a CRF-based transcription factor binding sites finding system;Yupeng He1等;《BMC Genomics 2012》;20120424;1-9 *
Genome-Wide Analysis of Transcription Factor Binding Sites Based on ChIP-Seq Data;Anton Valouev等;《nature mathod》;20080930;1-14 *

Also Published As

Publication number Publication date
CN103390119A (en) 2013-11-13

Similar Documents

Publication Publication Date Title
CN103390119B (en) A kind of Binding site for transcription factor recognition methods
Qin et al. HPeak: an HMM-based algorithm for defining read-enriched regions in ChIP-Seq data
CN109671469B (en) Method for predicting binding relationship and binding affinity between polypeptide and HLA type I molecule based on circulating neural network
IL213245A (en) Method for predicting response to anti-tnfα therapy, a computer based system, a device and a kit therefor
Srivastava et al. HOME: a histogram based machine learning approach for effective identification of differentially methylated regions
CN105063192A (en) Molecular marker for identifying amur sturgeon germplasm and application of molecular marker
Kim et al. A short survey of computational analysis methods in analysing ChIP-seq data
Krstić et al. Physical model for recognition tunneling
CN108034724A (en) For predicting colorectal cancer prognosis and the circular RNA molecule label of mortality risk and its application
Brand et al. Identification of two novel mammographic density loci at 6Q25. 1
Franceschi et al. Multiple comparisons in mass-spectrometry-based-omics technologies
CN101914619A (en) RNA (Ribonucleic Acid) sequencing quality control method and device relating to gene expression
KR101888628B1 (en) Method and Media of Predicting protein-binding regions in RNA Using Nucleotide Profiles and Compositions
CN107391962B (en) Method for analyzing regulation and control relation of genes or loci to diseases based on multiple groups of theories
Price et al. Systems biology and systems medicine
US20230073973A1 (en) Deep learning based system and method for prediction of alternative polyadenylation site
CN104462817B (en) Gene selects and cancer classification method based on Monte Carlo and Nonnegative matrix factorization
WO2012149107A2 (en) Stratifying patient populations through characterization of disease-driving signaling
Simon Advances in clinical trial designs for predictive biomarker discovery and validation
CN105205350B (en) The length decision method of poly base in a kind of Ion Torrent sequencing datas
Matsui Statistical issues in clinical development and validation of genomic signatures
Hu et al. Sequential model selection-based segmentation to detect DNA copy number variation
EP3207133B1 (en) Cross-platform transformation of gene expression data
Myers Zen and the art of parameter estimation in systems biology
Ni et al. Development of a model for the prediction of biological age

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20160127

Termination date: 20210703