CN104131093B - The DNase high pass order-checking detection signal treatment process of DNA protein binding site - Google Patents
The DNase high pass order-checking detection signal treatment process of DNA protein binding site Download PDFInfo
- Publication number
- CN104131093B CN104131093B CN201410352942.6A CN201410352942A CN104131093B CN 104131093 B CN104131093 B CN 104131093B CN 201410352942 A CN201410352942 A CN 201410352942A CN 104131093 B CN104131093 B CN 104131093B
- Authority
- CN
- China
- Prior art keywords
- dnase
- seq
- binding site
- data
- dna
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B50/00—ICT programming tools or database systems specially adapted for bioinformatics
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Biophysics (AREA)
- Theoretical Computer Science (AREA)
- Medical Informatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Biotechnology (AREA)
- Evolutionary Biology (AREA)
- General Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Analytical Chemistry (AREA)
- Bioethics (AREA)
- Databases & Information Systems (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The invention discloses the DNase high pass order-checking detection signal pretreatment process of DNA protein binding site.Comprise following step: obtain gene essential information, and the DNase-Seq high pass order-checking of DNA protein binding site detects data and ChIP-Seq high pass order-checking monitoring data; Data quality accessment is detected to the order-checking of DNase-Seq high pass, filters out credible sequencing data; Credible for every bar sequencing data is only retained the order-checking zero position of directly reflection protein binding site; Obtain DNase-Seq and detect sample data sets; Detect sample data sets to DNase-Seq to be normalized; Detect sample data sets to DNase-Seq to segment; From front and back both direction, longitudinal summation is carried out, complete operation to two sub-intensive data respectively.The present invention significantly improves accuracy of identification and the recognition resolution of DNA protein binding site.
Description
Technical field
The invention belongs to the treatment process of the detection signal of DNA protein binding site, particularly relate to a kind of DNase high pass order-checking detection signal treatment process of DNA protein binding site.
Background technology
At present, DNA protein binding site detects main employing chromatin immune chemical coprecipitation technique (ChromatinImmunoprecipitation, ChIP).And the ChIP-Seq technology that ChIP experimental result is combined with high throughput sequencing technologies, then can efficiently in full-length genome range detection and the protein bound region of DNA section of specific function.The principle of ChIP-Seq is: first utilized the DNA fragmentation being combined with target protein with the enzyme enrichment of target protein specific binding by chromatin immune chemical coprecipitation technique (ChIP), and carry out purifying and library construction to it.Then high-flux sequence is carried out to the DNA fragmentation that enrichment obtains, again millions of the reading sequences that order-checking obtains accurately are navigated on genome, thus obtain the region of DNA segment information being combined with target protein within the scope of full-length genome, and then obtain target protein DNA binding site accurately by various analytical algorithm.But, although the DNA protein binding site analytical procedure detecting data for ChIP-Seq is very ripe, but this technology also has weak point, first be that the desmoenzyme of enrichment target protein has specificity, thus cause some albumen cannot detect because can not find suitable specific combination enzyme; Secondly, once experiment can only detect a kind of albumen, and take time and effort, cost is high, cannot use on a large scale; 3rd, what is more important, the DNA fragmentation be combined with target protein obtained due to experiment is longer, part order-checking can only be carried out to its two ends during order-checking, because order-checking region is not binding site itself, therefore, although the resolving power of sequencing data can reach single base, the positioning resolution of target protein binding site is the highest also can only reach tens bases.
For the problems referred to above, created a kind of new DNA protein binding site detection technique in recent years--based on the DNA protein binding site detection technique of DNase high pass order-checking information, i.e. DNase-Seq technology.This technology also claims DNase footprinting (DNasefootprinting), can the binding site of accurate identification DBP on DNA molecular.Its principle is: first utilize DNase nucleic acid shearing enzyme to carry out enzyme to DNA and cut process.Then do not have the protein bound region of DNA territory of DNA will be sheared enzyme by DNase nucleic acid to cut off equably at random, and have DNA protein bound region of DNA territory not to be cut off owing to being subject to protein-bonded obstruction specificity.Subsequently, the DNA fragmentation processed is cut to enzyme and carries out purifying and library construction, then check order, thus the enzyme obtaining DNase nucleic acid shearing enzyme within the scope of full-length genome cuts information.Cut in information at enzyme, the enzyme at protein binding site place cuts information by specific reduction, just as staying on DNA one by one footprint, thus can the binding site of accurate identification DBP on DNA molecular.Compared with ChIP-Seq technology, the advantage of the new DNase-Seq high pass sequencing technologies proposed is very outstanding.First, owing to not having specificity, DNase-Seq can disposablely detect the binding site of all DNA albumen within the scope of full-length genome; Secondly, due to the binding site of all DNA albumen of disposable detection, DNase-Seq technology significantly improves detection efficiency and reduces testing cost, and making to carry out the detection of DNA protein binding site on a large scale becomes possibility; 3rd, what is more important, the order-checking zero position due to DNase-Seq is exactly that enzyme cuts position, and therefore, DNase-Seq can reach single base to the detection resolution of DNA protein binding site, and so high resolving power is very helpful to follow-up study.Therefore, DNase-Seq signal is processed, and to carry out deep research and analysis be very necessary.
Since the technology accurately detecting DNA protein binding site since DNase is suggested, DNase technology is utilized for DNA binding site correlative study achievement more provides accurate experiment to verify.Also mostly to the process of its experimental data is carry out analysis by the form of simple count and illustrated mode directly perceived is stated.Until 2006, Crawford and Sabo proposes DNase-Chip technology at Naturemethods simultaneously, utilize microarray to carry out high-throughput measurement to DNase detection signal, thus open application DNase technology carries out extensive determination and analysis in genome range stage to protein binding site.2010, Crawford further provides can to the DNase-Seq technology that protein binding site detects within the scope of full-length genome.
After DNase-Seq technology is suggested, in succession create many analytical procedures.2010, Chen utilized DNase-Seq data to analyze DNA protein binding site based on dynamic bayesian network.2011, Fletez utilized DNase-Seq data to analyze DNA protein binding site based on SVMs.2012, Pique proposed CENTIPEDE method, by designing the statistical nature of rigorous statistical model analyzing DNA protein binding site place DNase-Seq data, and then identifies DNA protein binding site.2013, Jason proposed Wellington method, and DNase-Seq data characteristic when utilizing DNA protein binding site chromosomal region open, identifies DNA protein binding site.2014, Sherwood utilized the amplitude of DNA protein binding site region uniqueness and shape facility to identify DNA protein binding site.
But the pretreatment mode that the various analytical procedures proposed at present all adopt ChIP-Seq to detect data detects data to DNase-Seq and carries out pre-treatment.But in fact, due to the remarkable difference of Cleaning Principle, DNase-Seq data to the detection resolution of DNA protein binding site far above ChIP-Seq data.
Summary of the invention
The object of this invention is to provide a kind of DNA protein binding site can be provided accuracy of identification and recognition resolution, the DNase high pass order-checking detection signal treatment process of DNA protein binding site.
The present invention is achieved by the following technical solutions:
The DNase high pass order-checking detection signal pretreatment process of DNA protein binding site, comprises following step:
Step one: obtain gene essential information, gene essential information comprises the genomic base sequence of DNA and the positional information of gene on DNA, and the DNase-Seq high-flux sequence obtaining DNA protein binding site detects data and ChIP-Seq high-flux sequence detection data;
Step 2: detect data quality accessment to the order-checking of DNase-Seq high pass, filter out the credible sequencing data of quality score more than 20 of base position, finds the source of the credible predicted data of every bar in genome by mapping;
Step 3: the order-checking zero position credible for every bar sequencing data only being retained directly reflection protein binding site, obtains the DNase – Seq data after upgrading;
Step 4: the number of the DNase – Seq data point ask for renewal on each DNA base position after, as the DNase-Seq detected value of DNA base position; Utilize ChIP-Seq data acquisition to there is the region of the binding site of relevant DNA albumen, extract DNase-Seq detected value complete in region, obtain DNase-Seq and detect sample data sets;
Step 5: detect sample data sets to DNase-Seq and be normalized, detects the DNase-Seq detected value sum of all DNA base position in sample data sets divided by DNase-Seq by the DNase-Seq detected value of each DNA base position;
Step 6: sample data sets is detected to DNase-Seq and segments;
DNase-Seq being detected sample data sets is divided into just checking order subset, normal chain negative order-checking subset, minus strand of normal chain just checking order subset sums minus strand negative order-checking subset, the subset sums minus strand that normal chain just checked order is born order-checking subset and is merged by the mode of relevant alignment the front becoming DNA protein binding site and detect data subset, normal chain is born the order-checking subset sums minus strand subset that just checking order and is merged the back side detection data subset becoming DNA protein binding site by relevant mode of aliging;
Step 7: respectively from front and back both direction, detects data in data subset and back side detection data subset to front and carries out longitudinal summation, complete operation.
Beneficial effect of the present invention:
Due to the remarkable difference of Cleaning Principle, DNase-Seq data to the detection resolution of DNA protein binding site far above ChIP-Seq data, have studied the treatment process targetedly DNase-Seq being detected to data in the present invention, significantly improve accuracy of identification and the recognition resolution of DNA protein binding site.
Simultaneously by detecting the process of data to DNase-Seq, realize the Detection Information highlighting DNA protein binding site, for the high precision identification of subsequent protein binding site is layed foundation.
Accompanying drawing explanation
The DNase high pass order-checking detection signal pretreatment process block diagram of Fig. 1 DNA protein binding site;
The processed conventionally operating process of Fig. 2;
The operating process of Fig. 3 special processing;
The DNase-Seq normal chain of Fig. 4 DNA protein binding site is just checking order Detection Information, K562 cell ATF1 albumen;
The DNase-Seq minus strand negative order-checking Detection Information of Fig. 5 DNA protein binding site, K562 cell ATF1 albumen;
The DNase-Seq normal chain negative order-checking Detection Information of Fig. 6 DNA protein binding site, K562 cell ATF1 albumen;
The DNase-Seq minus strand of Fig. 7 DNA protein binding site is just checking order Detection Information, K562 cell ATF1 albumen;
The DNase-Seq front Detection Information of Fig. 8 DNA protein binding site, K562 cell ATF1 albumen;
The DNase-Seq back side Detection Information of Fig. 9 DNA protein binding site, K562 cell ATF1 albumen.
Embodiment
Below in conjunction with accompanying drawing, the present invention is described in further details.
The DNase high pass order-checking obtaining DNA protein binding site detects data, carries out in-depth analysis and processes targetedly, and finally reaching the object highlighting protein binding site Detection Information.As shown in Figure 1, specifically comprise the following steps:
1. data acquisition
We obtain genome base sequence essential information on international bio information site UCSC; DNA albumen essential information is obtained in the TRANSFAC database of BIOBASE company.It is all come from the data that the ENCODE announced UCSC website plans to generate that DNase-Seq and ChIP-Seq of DNA protein binding site detects data.Wherein DNase-Seq detects data is on ENCODE website, downloads in the data that the DUKE laboratory under hg19 condition and UW laboratory provide; It is download in the data that provide in SYDH laboratory that ChIP-Seq detects data.
2. conventional preprocessing part
As shown in Figure 2, the DNase-Seq high pass order-checking of DNA protein binding site detects data owing to adopting short data records order-checking mode, and the high-flux sequence platform of general Illumina company carries out detecting and generating.The data that this order-checking platform generates observe FASTQ form, and namely each reading (read) point four lines of sequencing data carries out information storage.Wherein, the first row is the relevant information of order-checking platform, starts with ""; Second row is sequence information; The third line starts with "+" number, after identical with the first row, sometimes can be omitted; Fourth line is the base Detection job score that sequencing sequence is corresponding.Wherein, each base position has a corresponding with it quality score.Here require that the quality score of all base position in credible sequencing data all should more than 20, its implication is that the probability of arbitrary base position sniffing is all below 1%.
The mapping link of following by sequencing data, namely finds the source of each sequencing data in genome by mapping.Illumina check order platform sequencing data due to length consistent, so be suitable for adopting BWA comparison software simulating sequencing data to genomic mapping.
Sequencing data also needs to analyze the mapping quality of each data after mapping.Require that the mapping mass M APQ score of believable sequencing data should more than 20 (its implication is that sequencing data maps wrong probability below 1%), and the number of base mismatch should below 2.
3. special pre-treatment part
As shown in Figure 3, first, order-checking zero position due to DNase-Seq is exactly that enzyme cuts position, and DNA protein binding site can be reflected in single base discrimination rate, so detect the pretreatment stage of data at DNase-Seq, each DNase-Seq can be detected data modified line is a little, namely only retains the order-checking zero position point directly reflecting protein binding site, thus effectively highlights the binding site information of DNA albumen.
Secondly, after DNase-Seq data being carried out to modified line and being pre-treatment a little, each DNA base position is asked for the number of DNase-Seq data point as the DNase-Seq detected value on this base position.There is the region of the binding site of relevant DNA albumen in recycling ChIP-Seq data acquisition.If exist (its resolving power only has tens bases), then extract the DNase-Seq detected value that this region is complete, and accurately determine DNA protein binding site by the base proneness of associated protein, its resolving power should reach single base.Like this, one group of DNase-Seq that certainly there is this DNA protein binding site can be obtained and detect sample data sets.
In addition, for avoiding the percentage contribution of different sample in subsequent analysis process inconsistent, also tackling sampled data and being normalized pre-treatment.Namely in sample, the DNase-Seq detected value of each base position should divided by the DNase-Seq detected value sum of base position all in this sample areas.
Subsequently, sample set is segmented.First by DNA protein binding site position, different sample is divided into DNA normal chain and minus strand two class.Secondly, each class sample is divided into DNA normal chain and minus strand two portions according to the direction of its order-checking reading again.Be divided into normal chain just to check order in samples all in sample set like this, the negative order-checking of normal chain, minus strand just checks order, and the negative order-checking of minus strand waits four parts.Wherein, the implication that normal chain is just checking order be DNA protein binding site when normal chain, from the DNA protein binding site detected value that normal chain detects it; The implication that normal chain bears order-checking be DNA protein binding site when normal chain, from the DNA protein binding site detected value that minus strand detects it; The implication that minus strand is just checking order be DNA protein binding site when minus strand, from the DNA protein binding site detected value that normal chain detects it.The implication that minus strand bears order-checking be DNA protein binding site when minus strand, from the DNA protein binding site detected value that minus strand detects it.According to the DNase-Seq biological detection mechanism of DNA protein binding site, normal chain is just checking order and minus strand to bear order-checking be all from just detecting in the face of DNA protein binding site, and the negative order-checking of normal chain and minus strand just to check order be all detect from the back side DNA protein binding site, therefore, normal chain is just checking order and minus strand is born order-checking and merged by the mode of relevant alignment the front becoming DNA protein binding site and detect data subset, and normal chain is born the mode that order-checking and minus strand just checking order by relevant alignment and merged the back side detection data subset becoming DNA protein binding site.
Finally, respectively from front and back both direction, longitudinal summation is carried out to all samples, thus in statistics except making an uproar on basis, highlight the DNase-Seq whole detection information on DNA protein binding site.This Information Availability in extracting and forming the relevant distinctive DNase-Seq signal mode of DNA albumen, and within the scope of full-length genome in subsequent experimental to the identification of the high-accuracy high-resolution of this DNA protein binding site and detection.
4. experimental verification
DNase-Seq and ChIP-Seq obtaining the DNA binding site of DBP ATF1 in K562 clone from the UCSC bioinformation website of International Publication detects data.The DNase-Seq for ATF1 albumen shown in Fig. 4 ~ Fig. 9 detects the result of data after Preprocessing method of the present invention process.Wherein, transverse axis is the base positions in region of DNA territory, DNA protein binding site place, and left side is the 5 ' end in region of DNA territory, and right side is 3 ' end, and the longitudinal axis is DNase-Seq detected value.In order to clear display, as shown in Figure 4 and Figure 5, the negative order-checking of minus strand and the negative order-checking of normal chain overturn about comparing with conventional display mode and having carried out level.Region in figure between two vertical lines is the binding site (DNA protein binding is directive, should hold from 5 ' end of DNA upstream toward 3 ' of downstream) of ATF1.
Can be clearly seen that in Fig. 4 ~ Fig. 7, from 5 ' of DNA end to 3 ' end, normal chain is just checking order and between the negative order-checking of minus strand, and between the negative order-checking of normal chain and minus strand just checking order, the DNase-Seq detection signal at its DNA binding site place is basically identical; But normal chain just checks order, the negative order-checking of minus strand bear with normal chain check order, minus strand just checks order) between, the DNase-Seq detection signal at its DNA binding site place is different.This shows, by the segmentation to DNase-Seq sample, effectively highlights the Detection Information of DNA protein binding site, reaches pretreated object.
Finally, be added after positive for normal chain sequencing data and minus strand being born sequencing data is relevant and aliging, to reflect that front for DNA binding site is in conjunction with the detection of situation, as shown in Figure 8; Correspondingly, be added after normal chain being born sequencing data is relevant to the positive sequencing data of minus strand and aliging, to reflect that the back side for DNA binding site is in conjunction with the detection of situation, as shown in Figure 9.Through this process, highlight the Detection Information of DNA protein binding site further.
Experimental result shows, the pretreatment process of the DNase-Seq signal that the present invention proposes has highlighted the Detection Information of DNA protein binding site effectively, for follow-up accurate extraction DNA protein binding site recognition mode, and good basis is laid in the identification realizing DNA protein binding site further.
Claims (1)
- The pretreatment process of the DNase high-flux sequence detection signal of 1.DNA protein binding site, is characterized in that, comprise following step:Step one: obtain gene essential information, gene essential information comprises the genomic base sequence of DNA and the positional information of gene on DNA, and the DNase-Seq high-flux sequence obtaining DNA protein binding site detects data and ChIP-Seq high-flux sequence detection data;Step 2: detect data quality accessment to DNase-Seq high-flux sequence, filter out the credible sequencing data of quality score more than 20 of base position, finds the source of the credible predicted data of every bar in genome by mapping;Step 3: the order-checking zero position credible for every bar sequencing data only being retained directly reflection protein binding site, obtains the DNase – Seq data after upgrading;Step 4: the number of the DNase – Seq data point ask for renewal on each DNA base position after, as the DNase-Seq detected value of DNA base position; Utilize ChIP-Seq data acquisition to there is the region of the binding site of relevant DNA albumen, extract DNase-Seq detected value complete in region, obtain DNase-Seq and detect sample data sets;Step 5: detect sample data sets to DNase-Seq and be normalized, detects the DNase-Seq detected value sum of all DNA base position in sample data sets divided by DNase-Seq by the DNase-Seq detected value of each DNA base position;Step 6: sample data sets is detected to DNase-Seq and segments;DNase-Seq being detected sample data sets is divided into just checking order subset, normal chain negative order-checking subset, minus strand of normal chain just checking order subset sums minus strand negative order-checking subset, the subset sums minus strand that normal chain just checked order is born order-checking subset and is merged by the mode of relevant alignment the front becoming DNA protein binding site and detect data subset, normal chain is born the order-checking subset sums minus strand subset that just checking order and is merged the back side detection data subset becoming DNA protein binding site by relevant mode of aliging;Step 7: respectively from front and back both direction, carries out longitudinal summation, complete operation to the data that front is detected in data subset and back side detection data subset.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410352942.6A CN104131093B (en) | 2014-07-23 | 2014-07-23 | The DNase high pass order-checking detection signal treatment process of DNA protein binding site |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410352942.6A CN104131093B (en) | 2014-07-23 | 2014-07-23 | The DNase high pass order-checking detection signal treatment process of DNA protein binding site |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104131093A CN104131093A (en) | 2014-11-05 |
CN104131093B true CN104131093B (en) | 2015-12-09 |
Family
ID=51803942
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410352942.6A Expired - Fee Related CN104131093B (en) | 2014-07-23 | 2014-07-23 | The DNase high pass order-checking detection signal treatment process of DNA protein binding site |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104131093B (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106650313B (en) * | 2016-09-29 | 2019-10-18 | 哈尔滨工程大学 | A method of it filtering out DNA base in DNase high-flux sequence data and is inclined to sexual deviation |
CN110335640B (en) * | 2019-07-09 | 2022-01-25 | 河南师范大学 | Prediction method of drug-DBPs binding sites |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1980618A2 (en) * | 1995-02-24 | 2008-10-15 | Genentech, Inc. | Human DNASE I variants |
-
2014
- 2014-07-23 CN CN201410352942.6A patent/CN104131093B/en not_active Expired - Fee Related
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1980618A2 (en) * | 1995-02-24 | 2008-10-15 | Genentech, Inc. | Human DNASE I variants |
Non-Patent Citations (2)
Title |
---|
ChIP–seq and beyond: new and improved methodologies to detect and characterize protein–DNA interactions;Terrence S. Furey;《Nature Reviews Genetics》;20121231;第13卷;第840-852页 * |
下一代测序技术在表观遗传学研究中的重要应用及进展;沈圣等;《遗传》;20140331;第36卷(第3期);第256―275页 * |
Also Published As
Publication number | Publication date |
---|---|
CN104131093A (en) | 2014-11-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10347365B2 (en) | Systems and methods for visualizing a pattern in a dataset | |
Oh et al. | Comparison of accuracy of whole-exome sequencing with formalin-fixed paraffin-embedded and fresh frozen tissue samples | |
Melsted et al. | Fusion detection and quantification by pseudoalignment | |
CN106021984A (en) | Whole-exome sequencing data analysis system | |
Ramachandran et al. | MaSC: mappability-sensitive cross-correlation for estimating mean fragment length of single-end short-read sequencing data | |
US20150142334A1 (en) | System, method and computer-accessible medium for genetic base calling and mapping | |
CN116312780B (en) | Method, terminal and medium for detecting somatic mutation of targeted gene second-generation sequencing data | |
CN107267613A (en) | Sequencing data processing system and SMN gene detection systems | |
RU2013135282A (en) | DNA SEQUENCE DATA ANALYSIS | |
CN101914619A (en) | RNA (Ribonucleic Acid) sequencing quality control method and device relating to gene expression | |
CN117947163A (en) | Method for evaluating background level of variant nucleic acid sample | |
Kearse et al. | The Geneious 6.0. 3 read mapper | |
CN104131093B (en) | The DNase high pass order-checking detection signal treatment process of DNA protein binding site | |
CN110875082A (en) | Microorganism detection method and device based on targeted amplification sequencing | |
CN115620812A (en) | Resampling-based feature selection method and device, electronic equipment and storage medium | |
US20150142328A1 (en) | Calculation method for interchromosomal translocation position | |
WO2014083018A1 (en) | Method and system for processing data for evaluating a quality level of a dataset | |
CN106936561B (en) | Side channel attack protection capability assessment method and system | |
AU2023261122A1 (en) | Construction method for model for analyzing variation detection result | |
CN108715891B (en) | Expression quantification method and system for transcriptome data | |
CN116612814A (en) | Regression model-based batch detection method, device, equipment and medium for gene sample pollution | |
CN103745135A (en) | Protein kinase specificity prediction method and device based on nearest neighbor algorithm | |
CN103853941A (en) | Method for enhancing high-throughput DNA (Deoxyribose Nucleic Acid) sequencing data matching | |
Kim et al. | A Universal Analysis Pipeline for Hybrid Capture-Based Targeted Sequencing Data with Unique Molecular Indexes | |
Chong et al. | SeqControl: process control for DNA sequencing |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20151209 Termination date: 20210723 |