CN104484750A - Method and system for automatically matching product parameters of biological information project - Google Patents
Method and system for automatically matching product parameters of biological information project Download PDFInfo
- Publication number
- CN104484750A CN104484750A CN201410742454.6A CN201410742454A CN104484750A CN 104484750 A CN104484750 A CN 104484750A CN 201410742454 A CN201410742454 A CN 201410742454A CN 104484750 A CN104484750 A CN 104484750A
- Authority
- CN
- China
- Prior art keywords
- project
- sub
- sample
- quality control
- analysis
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B45/00—ICT specially adapted for bioinformatics-related data visualisation, e.g. displaying of maps or networks
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B50/00—ICT programming tools or database systems specially adapted for bioinformatics
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
Landscapes
- Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Biophysics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- General Health & Medical Sciences (AREA)
- Evolutionary Biology (AREA)
- Biotechnology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Theoretical Computer Science (AREA)
- Bioethics (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Software Systems (AREA)
- Public Health (AREA)
- Evolutionary Computation (AREA)
- Epidemiology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Artificial Intelligence (AREA)
- Investigating Or Analysing Biological Materials (AREA)
Abstract
The invention discloses a method for automatically matching product parameters of a biological information project. The method comprises the following steps: automatically matching predetermined default parameter configuration according to the condition of samples and filtering and analyzing off-line data of samples which are sequenced by a sequencer by using a uniform filtering standard when the type of a sub-project is a only-filtering sub-project, so as to generate an analysis result; creating standard procedure analysis for the samples when the samples corresponding to the sub-project are sequenced by the sequencer when the type of the sub-project is a standardized sub-project, inputting corresponding comparison parameters and filtration parameters by a user according to the sample condition of the current sub-project during creating each standard procedure analyzing process; filtering and comparing each sample data according to the filtering parameters and the comparison parameters which are automatically matched and predetermined according to the sample condition so as to remove the sample data which does not conform to the comparison parameters; then analyzing each compared sample data which conforms to the filtering parameters by using the created standard procedure analysis so as to generate an analysis result.
Description
Technical field
The present invention relates to analysis of biological information field, particularly relate to a kind of product parameters automatic matching method and system of biological information project.
Background technology
Along with the fast development of life science experimental technique, robotization, the intelligent level of scientific instrument improve day by day, and data output capacity has had qualitative leap.Meanwhile, no matter life science is proposed higher standard and the request to the requirement of analytical test in sample size, analytical cycle, analysis project and data accuracy etc., and the information of biology laboratory output increases by geometric progression.
In traditional biology laboratory, because data type is various, form differs, the preservation of data, exchange, inquiry, analysis, maintenance are all very inconvenient, and the information seriously hindered between researchist is submitted to.(sequencing technologies of future generation, high-flux sequence is also in large-scale parallel order-checking, English: the multiple flow process related in experiment NGS, High-throughput Sequencing) and bioinformatic analysis, as DNA library builds, gene order-checking, data processing, interpretation of result, achievement output, multiple link such as data sharing, each link has different technician to participate in, and therefore there will be information dropout or inefficiency etc. problem in links transmission or in accepting.Especially the order-checking in bioinformatics, high-performance calculation link, specialty order-checking laboratory needs the sequencing result accepting a large amount of order-checking order items, arrangement order-checking experiment, in time process high speed output.
In link after order-checking in bioinformatics completes, all items has corresponding service line project team to do.Service line had both done and had only filtered, and did standardization again, did personalization again.Will confirm through 1. project managements project verifications, 2. information Man responsible official examination & approval, 3. information and executing people equally, 4. arrange to run flow process, 5. report is filled in, and 6. pays, 7. the process such as project management communication, therefore consumes the resource that the basis of service line is just limited.
Wherein, for the flow process of each type entry, by corresponding service line independence parameters, often run once, arrange once, run.There is following problem: 1, sample number difference causes optimum configurations different; 2, in parameter setting procedure because the factor such as sample ID, large scale computer nonrecognition, leads to errors.
Summary of the invention
Object of the present invention provides a kind of product parameters automatic matching method and system of biological information project, can directly and product type associate, the sample situation of machine data under Auto-matching, Automatic-searching path, thus decrease the artificial issuable error of setting and the various criterion problem of each service line.
The invention provides a kind of product parameters automatic matching method of biological information project, comprise step:
Step one: establishment project is also stored in business management system, and each project comprises multiple sub-project; And select the sub-project in described establishment project and mission bit stream; The type of described sub-project comprises only filters sub-project and normalizer project;
Step 2: when the type of described sub-project is for only to filter sub-project, then according to the sub-project type selected and mission bit stream, obtain corresponding and after sequencer sample data successively from lower machine data management system; And often obtaining a sample data, the default parameters that namely situation Auto-matching is preset per sample configures and uses unified filter criteria to carry out filtering and analyze, thus filters the sample data not meeting default parameters configuration; And all samples data filtering to be obtained with has analyzed after, generate analysis result, analysis result comprise sub-project information and correspondence sample message;
Step 3: when the type of described sub-project is normalizer project, the sample that then this sub-project is corresponding is while sequencer, this sample is created to one or more the normal process analysis comprised in filter analysis, express spectra quantitative test, comparison in difference analysis, Cluster cluster analysis, microRNA target prediction analysis, KOGO analysis and base editor analysis, and input corresponding alignment parameters and filtration parameter by user according to the sample situation of current sub-project in each normal process analytic process of establishment; After sequencer completes, the filtration parameter that situation Auto-matching is default per sample and alignment parameters filter and comparison each sample data, thus remove the sample data not meeting alignment parameters; And then use the normal process analysis that creates to meeting filtration parameter and each sample data after comparison is analyzed, thus the alignment parameters of default parameters and described input uses the filter criteria of the normal process analysis correspondence created carry out filtering to each sample data and analyze, thus filter the sample not meeting default parameters and alignment parameters; And until all samples data filtering and after having analyzed, generating analysis result, analysis result comprises sub-project information and corresponding sample message; Step 4: filtration/Quality Control parameter that situation Auto-matching is default per sample, to carry out contrast Quality Control to described analysis result, if Quality Control is passed through, then directly exports this analysis result; If Quality Control is not passed through, and the gap of described analysis result and quality control standard is in threshold range, then again carry out filtration and the analytic process of step 2 or step 3 after updating described sample data or filtration/Quality Control parameter, until analysis result passes through Quality Control; If Quality Control is not passed through, and the gap of described analysis result and quality control standard exceedes threshold value, then edit described sample and discarded relevant Lane, and again place an order in described business management system.
As the improvement of technique scheme, whether the summary info of every sub-project comprises sub-project code, sub-project title, sub-project type, is only filtration, total sample number, executor, start time and end time, sub-project state and sub-project associative operation.
As the improvement of technique scheme, described sample message comprises sample ID, library title, Lane ID, sequencing strategy, Flowcell ID, Raw data, Raw Reads, Read Length, GC%, Q20%, Q30%, Error Rate, base distribution figure and base Quality Control distribution plan.
As the improvement of technique scheme, also comprise:
Step 5: described analysis result is carried out storage backup.
As the improvement of technique scheme, in described step 4: if Quality Control not by and the gap of described analysis result and quality control standard in threshold range, update described sample data be can single sample data edition or sample batch editor.
The invention also discloses a kind of product parameters automatic patching system of biological information project, comprising:
Creating unit, for creating project and being stored in business management system, each project comprises multiple sub-project; And select the sub-project in described establishment project and mission bit stream; The type of described sub-project comprises only filters sub-project and normalizer project;
First filter analysis unit, for when the type of described sub-project is for only to filter sub-project, then according to the sub-project type selected and mission bit stream, obtain from lower machine data management system successively correspondence and sample data after sequencer; And often obtaining a sample data, the default parameters configuration that namely situation Auto-matching is preset per sample is used unified filter criteria to carry out filtering and analyzes, thus filters the sample data not meeting default parameters configuration; And all samples data filtering to be obtained with has analyzed after, generate analysis result, analysis result comprise sub-project information and correspondence sample message;
Second filter analysis unit, for when the type of described sub-project is normalizer project, the sample that then this sub-project is corresponding is while sequencer, this sample is created to one or more the normal process analysis comprised in filter analysis, express spectra quantitative test, comparison in difference analysis, Cluster cluster analysis, microRNA target prediction analysis, KOGO analysis and base editor analysis, and input corresponding alignment parameters and filtration parameter by user according to the sample situation of current sub-project in each normal process analytic process of establishment; After sequencer completes, the filtration parameter that situation Auto-matching is default per sample and alignment parameters filter and comparison each sample data, thus remove the sample data not meeting alignment parameters; And then use the normal process analysis that creates to meeting filtration parameter and each sample data after comparison is analyzed, thus generate analysis result, analysis result comprises sub-project information and corresponding sample message; Quality Control unit, the filtration/Quality Control parameter preset for situation Auto-matching per sample, to carry out contrast Quality Control to described analysis result, if Quality Control is passed through, then directly exports this analysis result; If Quality Control is not passed through, and the gap of described analysis result and quality control standard is in threshold range, then again carry out filtration and the analytic process of step 2 or step 3 after updating described sample data or filtration/Quality Control parameter, until analysis result passes through Quality Control; If Quality Control is not passed through, and the gap of described analysis result and quality control standard exceedes threshold value, then edit described sample and discarded relevant Lane, and again place an order in described business management system.
As the improvement of technique scheme, whether the summary info of every sub-project comprises sub-project code, sub-project title, sub-project type, is only filtration, total sample number, executor, start time and end time, sub-project state and sub-project associative operation.
As the improvement of technique scheme, described sample message comprises sample ID, library title, Lane ID, sequencing strategy, Flowcell ID, Raw data, Raw Reads, Read Length, GC%, Q20%, Q30%, Error Rate, base distribution figure and base Quality Control distribution plan.
As the improvement of technique scheme, also comprise:
Storage unit: for the described analysis result by Quality Control is carried out storage backup.
As the improvement of technique scheme, in described Quality Control unit: if Quality Control not by and the gap of described analysis result and quality control standard in threshold range, update described sample data be can single sample data edition or sample batch editor.
Compared with prior art, product parameters automatic matching method and the system of biological information project disclosed by the invention have following beneficial effect: associated by direct and product type, the sample situation of machine data under Auto-matching, Automatic-searching path.Thus summarize the parameter of all service lines, and reach standardization, unitized, decrease the artificial issuable error arranged, and the various criterion problem of each service line.
Accompanying drawing explanation
Fig. 1 is the schematic flow sheet of the product parameters automatic matching method of a kind of biological information project in the embodiment of the present invention.
Fig. 2 shows the idiographic flow of the step S2 in Fig. 1.
Fig. 3 shows the idiographic flow of the step S3 in Fig. 1.
Fig. 4 shows the idiographic flow of the step S4 in Fig. 1.
Fig. 5 shows the idiographic flow of the step S5 in Fig. 1.
Fig. 6 is the structural representation of the product parameters automatic patching system of a kind of biological information project in the embodiment of the present invention.
Fig. 7 shows the present invention and to grow directly from seeds the screenshot capture of the UI page of an embodiment of product parameters automatic patching system of thing information project, and this sectional drawing shows the selective listing of sub-project.
Fig. 8 shows the present invention and to grow directly from seeds the screenshot capture of the UI page of an embodiment of product parameters automatic patching system of thing information project, and this sectional drawing shows the summary info of every sub-project.
Fig. 9 shows the present invention and to grow directly from seeds the screenshot capture of the UI page of an embodiment of product parameters automatic patching system of thing information project, and this sectional drawing shows the optimum configurations interface of only filtering sub-project.
Figure 10 shows the present invention and to grow directly from seeds the screenshot capture of the UI page of an embodiment of product parameters automatic patching system of thing information project, and this sectional drawing shows the optimum configurations interface of normalizer project and interface is selected in normal process analysis.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, be clearly and completely described the technical scheme in the embodiment of the present invention, obviously, described embodiment is only the present invention's part embodiment, instead of whole embodiments.Based on the embodiment in the present invention, those of ordinary skill in the art, not making the every other embodiment obtained under creative work prerequisite, belong to the scope of protection of the invention.
See Fig. 1, it is the structural representation of the product parameters automatic matching method of a kind of biological information project that the embodiment of the present invention provides.The product parameters automatic matching method of this biological information project, comprises step:
S1: establishment project is also stored in business management system, and each project comprises multiple sub-project; And select the sub-project in described establishment project and mission bit stream; The type of described sub-project comprises only filters sub-project and normalizer project;
In this step, whether the summary info of selected every sub-project comprises sub-project code, sub-project title, sub-project type, is only filtration, total sample number, executor, start time and end time, sub-project state and sub-project associative operation.
S2: when the type of described sub-project is for only to filter sub-project, then according to the sub-project type selected and mission bit stream, obtain corresponding and after sequencer sample data successively from lower machine data management system; And often obtaining a sample data, the default parameters configuration that namely situation Auto-matching is preset per sample is used unified filter criteria to carry out filtering and analyzes, thus filters the sample data not meeting default parameters configuration; And all samples data filtering to be obtained with has analyzed after, generate analysis result, analysis result comprise sub-project information and correspondence sample message;
In this step, described sample message comprises sample ID, library title, Lane ID, sequencing strategy, Flowcell ID, Raw data, Raw Reads, Read Length, GC%, Q20%, Q30%, ErrorRate, base distribution figure and base Quality Control distribution plan.
S3: when the type of described sub-project is normalizer project, the sample that then this sub-project is corresponding is while sequencer, this sample is created to one or more the normal process analysis comprised in filter analysis, express spectra quantitative test, comparison in difference analysis, Cluster cluster analysis, microRNA target prediction analysis, KOGO analysis and base editor analysis, and input corresponding alignment parameters and filtration parameter by user according to the sample situation of current sub-project in each normal process analytic process of establishment; After sequencer completes, the filtration parameter that situation Auto-matching is default per sample and alignment parameters filter and comparison each sample data, thus remove the sample data not meeting alignment parameters; And then use the normal process analysis that creates to meeting filtration parameter and each sample data after comparison is analyzed, thus generate analysis result, analysis result comprises sub-project information and corresponding sample message;
S4: filtration/Quality Control parameter that situation Auto-matching is default per sample, to carry out contrast Quality Control to described analysis result, if Quality Control is passed through, then directly exports this analysis result; If Quality Control is not passed through, and the gap of described analysis result and quality control standard is in threshold range, then again carry out filtration and the analytic process of step S2 or step S3 after updating described sample data or filtration/Quality Control parameter, until analysis result passes through Quality Control; If Quality Control is not passed through, and the gap of described analysis result and quality control standard exceedes threshold value, then edit described sample and discarded relevant Lane, and again place an order in described business management system.
S5: described analysis result is carried out storage backup.
Filter analysis of the present invention is only filter sub-project or the difference of normalizer project is carried out according to the type of sub-project, is described in detail below respectively by Fig. 2 and Fig. 3.
As shown in Figure 2, when the type of sub-project is comprise step to the process that sample message carries out filter analysis when only filtering sub-project:
S201: machine under the sample (sample) that only checks order that detection is corresponding;
In this step, the sample data that obtains after referring to and sample data being completed order-checking by sequenator of lower machine.
S202: the default parameters configuration that situation Auto-matching is preset per sample is carried out filtering to this order-checking sample and analyzes (run);
In this step, use unified filter analysis standard (and the default parameters that Auto-matching is preset configures) the lower machine data to each sample that only checks order to carry out, thus filter out non-compliant lower machine data.S203: determine that all of this sub-project (project) order-checkings samples (sample) are filtered with to analyze (run) complete? if so, then carry out step S204, otherwise return step S202;
S204: generate analysis result.
As shown in Figure 3, when the type of sub-project is normalizer project, step is comprised to the process that sample message carries out filter analysis:
S301: detect the upper machine of a normalized sample (sample);
In this step, upper machine refers to and sample data is uploaded to sequenator to check order.
S302: one or more normal process analysis is created to this normalized sample, and while creating normal process analysis, corresponding alignment parameters is set by user; This normal process analysis includes but not limited to filter analysis, express spectra quantitative test, comparison in difference analysis, Cluster cluster analysis, microRNA target prediction analysis, KOGO analyzes and base editor analyzes
S303: machine under sample (sample) selected by detecting;
In this step, the sample data that obtains after referring to and sample data being completed order-checking by sequenator of lower machine.
S304: the filtration parameter that situation Auto-matching is default per sample and alignment parameters filter and comparison each sample data, thus remove the sample data not meeting alignment parameters; And then use the normal process analysis that creates to meeting filtration parameter and each sample data after comparison is analyzed;
S305: determine that all normalized sample (sample) of this sub-project (project) are filtered with to analyze (run) complete? if so, then carry out step S306, otherwise return step S304;
S306: generate analysis result.
With reference to figure 4, it is the process that the analysis result obtained after carrying out filter analysis to any one sample data in a sub-project carries out Quality Control, notice that Quality Control just to carry out afterwards the sample data analysis and filter of all samples of sub-project is complete, and successively Quality Control is carried out to each sample, specifically comprise step:
S401: detect that certain sample completes filter analysis, and generate analysis result;
S402: Quality Control is carried out to this analysis result;
Filtration/Quality Control parameter that specifically situation Auto-matching is default per sample and analysis result contrast, thus carry out Quality Control.
S403: judge whether Quality Control is passed through, if Quality Control is passed through, enter step S404, otherwise enter step S405;
S404: export this analysis result;
S405: judge that the gap of described analysis result and quality control standard is not whether in threshold range (namely gap is too large), then enters step S406 if not, otherwise proceeds to step S408;
S406: update this sample data or filtration/Quality Control parameter;
In this step, can single sample data edition or sample batch editor.
S407: again carry out filtering to this sample data according to sub-project type and analyze, again generating analysis result; And return step S402;
S408: edit described sample and discarded relevant Lane, and again place an order in BMS (Business ManagementSystem, business management system);
S409: to wait under new sample data machine and sub-project type described per sample and carry out corresponding filtration and analysis, generating analysis result, and return step S402;
Then, after all samples data of a sub-project carry out Quality Control, then generate a QC report.
With reference to figure 5, the process of the analysis result of sample data being carried out to storage backup specifically comprises step:
S501: sample data analysis;
S502: judge that this sample analysis completes? if so, then enter step S503, otherwise continue step S501;
S503: start up system device backup function is available;
S504: user confirms to back up, and clicks " backup ";
S505: system prompt backup request is submitted to;
S506: system is to delivery system copies data;
S507: judge that whether copy successful? if so, then enter step S509, otherwise enter step S508:
S508: prompting user ID is made mistakes, and returns step S504.
S509: prompting user ID success; And terminate.
Visible, the product parameters automatic matching method of biological information project disclosed in the present embodiment, is associated by direct and product type, the sample situation of machine data under Auto-matching, Automatic-searching path.Thus summarize the parameter of all service lines, and reach standardization, unitized, decrease the artificial issuable error arranged, and the various criterion problem of each service line.
Present invention also offers a kind of product parameters automatic patching system of biological information project, as shown in Figure 6, comprise creating unit 10, first filter analysis unit 20, second filter analysis unit 30, Quality Control unit 40 and storage unit 50, wherein creating unit 10, first filter analysis unit 20, second filter analysis unit 30, Quality Control unit 40 and storage unit 50 can be incorporated in a background server, and front end directly operates on webpage, undertaken operating and input parameter by user, concrete:
Creating unit 10, for creating project and being stored in business management system (Business ManagementSystem, BMS, order-checking and information analysis task matching and management system, contain the organizational informations such as sub-project, person liable, data) in, each project comprises multiple sub-project; And select the sub-project in described establishment project and mission bit stream; The type of described sub-project comprises only filters sub-project and normalizer project;
As shown in Figure 7, for the present invention is grown directly from seeds the screenshot capture of the UI page of an embodiment of product parameters automatic patching system of thing information project, this sectional drawing shows the selective listing of sub-project.Show multiple sub-project in this sub-project list, and every sub-project is labeled as a filtering items (Y) or standardization project (N).And Fig. 8 is the summary info that specifically show a sub-project.Whether the summary info of every sub-project comprises sub-project code, sub-project title, sub-project type, is only filtration, total sample number, executor, start time and end time, sub-project state and sub-project associative operation.
First filter analysis unit 20, for the type when described sub-project for only to filter sub-project, then according to the sub-project type selected and mission bit stream, successively from lower machine data management system (Data ManagementSystem, DMS, carries out quality monitoring and data management to the lower machine data checked order) in obtain corresponding and after sequencer sample data; And often obtaining a sample data, the default parameters configuration that namely situation Auto-matching is preset per sample is used unified filter criteria to carry out filtering and analyzes, thus filters the sample data not meeting default parameters configuration; And all samples data filtering to be obtained with has analyzed after, generate analysis result, analysis result comprise sub-project information and correspondence sample message;
With reference to figure 9, be that the present invention is grown directly from seeds the screenshot capture of the UI page of an embodiment of product parameters automatic patching system of thing information project, this sectional drawing shows the optimum configurations interface of only filtering sub-project.
Second filter analysis unit 30, for when the type of described sub-project is normalizer project, the sample that then this sub-project is corresponding is while sequencer, this sample is created to one or more the normal process analysis comprised in filter analysis, express spectra quantitative test, comparison in difference analysis, Cluster cluster analysis, microRNA target prediction analysis, KOGO analysis and base editor analysis, and input corresponding alignment parameters and filtration parameter by user according to the sample situation of current sub-project in each normal process analytic process of establishment; After sequencer completes, the filtration parameter that situation Auto-matching is default per sample and alignment parameters filter and comparison each sample data, thus remove the sample data not meeting alignment parameters; And then use the normal process analysis that creates to meeting filtration parameter and each sample data after comparison is analyzed, thus generate analysis result, analysis result comprises sub-project information and corresponding sample message; Wherein, described sample message comprises sample ID, library title, Lane ID, sequencing strategy, Flowcell ID, Rawdata, Raw Reads, Read Length, GC%, Q20%, Q30%, Error Rate, base distribution figure and base Quality Control distribution plan.
With reference to Figure 10, be that the present invention is grown directly from seeds the screenshot capture of the UI page of an embodiment of product parameters automatic patching system of thing information project, this sectional drawing shows the optimum configurations interface of normalizer project and interface is selected in normal process analysis.
Quality Control unit 40, the filtration/Quality Control parameter preset for situation Auto-matching per sample, to carry out contrast Quality Control to described analysis result, if Quality Control is passed through, then directly exports this analysis result; If Quality Control is not passed through, and the gap of described analysis result and quality control standard is in threshold range, filtration and the analytic process of the first filter analysis unit 20 or the second filter analysis unit 30 is again carried out, until analysis result passes through Quality Control after then updating (can single sample data edition or sample batch editor) described sample data or filtration/Quality Control parameter; If Quality Control is not passed through, and the gap of described analysis result and quality control standard exceedes threshold value, then edit described sample and discarded relevant Lane, and again place an order in described business management system; And
Storage unit 50: for analysis result described in storage backup.
Visible, the product parameters automatic patching system of biological information project disclosed in the present embodiment, is associated by direct and product type, the sample situation of machine data under Auto-matching, Automatic-searching path.Thus summarize the parameter of all service lines, and reach standardization, unitized, decrease the artificial issuable error arranged, and the various criterion problem of each service line.
The above is the preferred embodiment of the present invention; it should be pointed out that for those skilled in the art, under the premise without departing from the principles of the invention; can also make some improvements and modifications, these improvements and modifications are also considered as protection scope of the present invention.
Claims (10)
1. a product parameters automatic matching method for biological information project, is characterized in that, comprise step:
Step one: establishment project is also stored in business management system, and each project comprises multiple sub-project; And select the sub-project in described establishment project and mission bit stream; The type of described sub-project comprises only filters sub-project and normalizer project;
Step 2: when the type of described sub-project is for only to filter sub-project, then according to the sub-project type selected and mission bit stream, obtain corresponding and after sequencer sample data successively from lower machine data management system; And often obtaining a sample data, the default parameters that namely situation Auto-matching is preset per sample configures and uses unified filter criteria to carry out filtering and analyze, thus filters the sample data not meeting default parameters configuration; And all samples data filtering to be obtained with has analyzed after, generate analysis result, analysis result comprise sub-project information and correspondence sample message;
Step 3: when the type of described sub-project is normalizer project, the sample that then this sub-project is corresponding is while sequencer, this sample is created to one or more the normal process analysis comprised in filter analysis, express spectra quantitative test, comparison in difference analysis, Cluster cluster analysis, microRNA target prediction analysis, KOGO analysis and base editor analysis, and input corresponding alignment parameters and filtration parameter by user according to the sample situation of current sub-project in each normal process analytic process of establishment; After sequencer completes, the filtration parameter that situation Auto-matching is default per sample and alignment parameters filter and comparison each sample data, thus remove the sample data not meeting alignment parameters; And then use the normal process analysis that creates to meeting filtration parameter and each sample data after comparison is analyzed, thus generate analysis result, analysis result comprises sub-project information and corresponding sample message;
Step 4: filtration/Quality Control parameter that situation Auto-matching is default per sample, to carry out contrast Quality Control to described analysis result, if Quality Control is passed through, then directly exports this analysis result; If Quality Control is not passed through, and the gap of described analysis result and quality control standard is in threshold range, then again carry out filtration and the analytic process of step 2 or step 3 after updating described sample data or filtration/Quality Control parameter, until analysis result passes through Quality Control; If Quality Control is not passed through, and the gap of described analysis result and quality control standard exceedes threshold value, then edit described sample and discarded relevant Lane, and again place an order in described business management system.
2. the product parameters automatic matching method of biological information project as claimed in claim 1, it is characterized in that, whether the summary info of every sub-project comprises sub-project code, sub-project title, sub-project type, is only filtration, total sample number, executor, start time and end time, sub-project state and sub-project associative operation.
3. the product parameters automatic matching method of biological information project as claimed in claim 1, it is characterized in that, described sample message comprises sample ID, library title, Lane ID, sequencing strategy, Flowcell ID, Raw data, Raw Reads, Read Length, GC%, Q20%, Q30%, Error Rate, base distribution figure and base Quality Control distribution plan.
4. the product parameters automatic matching method of biological information project as claimed in claim 1, is characterized in that, also comprise:
Step 5: described analysis result is carried out storage backup.
5. the product parameters automatic matching method of biological information project as claimed in claim 1, it is characterized in that, in described step 4: if Quality Control not by and the gap of described analysis result and quality control standard in threshold range, update described sample data be can single sample data edition or sample batch editor.
6. a product parameters automatic patching system for biological information project, is characterized in that, comprising:
Creating unit, for creating project and being stored in business management system, each project comprises multiple sub-project; And select the sub-project in described establishment project and mission bit stream; The type of described sub-project comprises only filters sub-project and normalizer project;
First filter analysis unit, for when the type of described sub-project is for only to filter sub-project, then according to the sub-project type selected and mission bit stream, obtain from lower machine data management system successively correspondence and sample data after sequencer; And often obtaining a sample data, the default parameters configuration that namely situation Auto-matching is preset per sample is used unified filter criteria to carry out filtering and analyzes, thus filters the sample data not meeting default parameters configuration; And all samples data filtering to be obtained with has analyzed after, generate analysis result, analysis result comprise sub-project information and correspondence sample message;
Second filter analysis unit, for when the type of described sub-project is normalizer project, the sample that then this sub-project is corresponding is while sequencer, this sample is created to one or more the normal process analysis comprised in filter analysis, express spectra quantitative test, comparison in difference analysis, Cluster cluster analysis, microRNA target prediction analysis, KOGO analysis and base editor analysis, and input corresponding alignment parameters and filtration parameter by user according to the sample situation of current sub-project in each normal process analytic process of establishment; After sequencer completes, the filtration parameter that situation Auto-matching is default per sample and alignment parameters filter and comparison each sample data, thus remove the sample data not meeting alignment parameters; And then use the normal process analysis that creates to meeting filtration parameter and each sample data after comparison is analyzed, thus generate analysis result, analysis result comprises sub-project information and corresponding sample message;
Quality Control unit, the filtration/Quality Control parameter preset for situation Auto-matching per sample, to carry out contrast Quality Control to described analysis result, if Quality Control is passed through, then directly exports this analysis result; If Quality Control is not passed through, and the gap of described analysis result and quality control standard is in threshold range, then again carry out filtration and the analytic process of step 2 or step 3 after updating described sample data or filtration/Quality Control parameter, until analysis result passes through Quality Control; If Quality Control is not passed through, and the gap of described analysis result and quality control standard exceedes threshold value, then edit described sample and discarded relevant Lane, and again place an order in described business management system.
7. the product parameters automatic patching system of biological information project as claimed in claim 6, it is characterized in that, whether the summary info of every sub-project comprises sub-project code, sub-project title, sub-project type, is only filtration, total sample number, executor, start time and end time, sub-project state and sub-project associative operation.
8. the product parameters automatic patching system of biological information project as claimed in claim 61, it is characterized in that, described sample message comprises sample ID, library title, Lane ID, sequencing strategy, Flowcell ID, Raw data, Raw Reads, Read Length, GC%, Q20%, Q30%, Error Rate, base distribution figure and base Quality Control distribution plan.
9. the product parameters automatic matching method system of biological information project as claimed in claim 6, is characterized in that, also comprise:
Storage unit: for the described analysis result by Quality Control is carried out storage backup.
10. the product parameters automatic matching method system of biological information project as claimed in claim 6, it is characterized in that, in described Quality Control unit: if Quality Control not by and the gap of described analysis result and quality control standard in threshold range, update described sample data be can single sample data edition or sample batch editor.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410742454.6A CN104484750B (en) | 2014-12-08 | 2014-12-08 | The product parameters automatic matching method and system of biological information project |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410742454.6A CN104484750B (en) | 2014-12-08 | 2014-12-08 | The product parameters automatic matching method and system of biological information project |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104484750A true CN104484750A (en) | 2015-04-01 |
CN104484750B CN104484750B (en) | 2018-04-24 |
Family
ID=52759291
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410742454.6A Active CN104484750B (en) | 2014-12-08 | 2014-12-08 | The product parameters automatic matching method and system of biological information project |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104484750B (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106650319A (en) * | 2016-11-15 | 2017-05-10 | 上海派森诺生物科技股份有限公司 | Automatic filtering method for high-throughout Miseq sequencing data |
CN112365928A (en) * | 2020-11-16 | 2021-02-12 | 赛福解码(北京)基因科技有限公司 | Biological information data analysis and result quality control automation method and system |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050273272A1 (en) * | 2004-04-22 | 2005-12-08 | Applera Corporation, A Delaware Corporation | System and method for laboratory-wide information management |
CN102982409A (en) * | 2012-11-07 | 2013-03-20 | 浪潮电子信息产业股份有限公司 | Informationalized management design method for information biology high-performance computing platform |
CN103324866A (en) * | 2013-03-26 | 2013-09-25 | 张弘 | Ripple system |
CN103714180A (en) * | 2014-01-08 | 2014-04-09 | 浪潮(北京)电子信息产业有限公司 | Bioinformatics database system and data processing method |
CN103993069A (en) * | 2014-03-21 | 2014-08-20 | 深圳华大基因科技服务有限公司 | Virus integration site capture sequencing analysis method |
-
2014
- 2014-12-08 CN CN201410742454.6A patent/CN104484750B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050273272A1 (en) * | 2004-04-22 | 2005-12-08 | Applera Corporation, A Delaware Corporation | System and method for laboratory-wide information management |
CN102982409A (en) * | 2012-11-07 | 2013-03-20 | 浪潮电子信息产业股份有限公司 | Informationalized management design method for information biology high-performance computing platform |
CN103324866A (en) * | 2013-03-26 | 2013-09-25 | 张弘 | Ripple system |
CN103714180A (en) * | 2014-01-08 | 2014-04-09 | 浪潮(北京)电子信息产业有限公司 | Bioinformatics database system and data processing method |
CN103993069A (en) * | 2014-03-21 | 2014-08-20 | 深圳华大基因科技服务有限公司 | Virus integration site capture sequencing analysis method |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106650319A (en) * | 2016-11-15 | 2017-05-10 | 上海派森诺生物科技股份有限公司 | Automatic filtering method for high-throughout Miseq sequencing data |
CN112365928A (en) * | 2020-11-16 | 2021-02-12 | 赛福解码(北京)基因科技有限公司 | Biological information data analysis and result quality control automation method and system |
Also Published As
Publication number | Publication date |
---|---|
CN104484750B (en) | 2018-04-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104484558B (en) | The analysis report automatic generation method and system of biological information project | |
Nellore et al. | Rail-RNA: scalable analysis of RNA-seq splicing and coverage | |
CN104484582B (en) | The biological information project automatic analysis method and system realized by modularization selection | |
CN103714180A (en) | Bioinformatics database system and data processing method | |
CN105740650A (en) | Method for rapidly and accurately identifying high-throughput genome data pollution sources | |
JP5017176B2 (en) | Manufacturing instruction evaluation support system, manufacturing instruction evaluation support method, and manufacturing instruction evaluation support program | |
CN113313280A (en) | Cloud platform inspection method, electronic equipment and nonvolatile storage medium | |
US20180032549A1 (en) | Experimental data recording device, experimental data recording method and experimental data display device | |
CN104484375B (en) | Establish the method and system of database automatically in project analysis flow | |
Love et al. | Beginner’s guide to using the DESeq2 package | |
CN104484750A (en) | Method and system for automatically matching product parameters of biological information project | |
CN105653897B (en) | LncRNA analysis system and method based on biological cloud platform | |
CN111433610A (en) | Laboratory instrument selection and configuration | |
CN104484581B (en) | The automated analysis method and system of biological information project | |
CN107506614A (en) | A kind of bacterium ncRNA Forecasting Methodologies of transcript profile sequencing data and PeakCalling methods based on Illumina | |
CN112199376B (en) | Standard knowledge base management method and system based on cluster analysis | |
Dirihan et al. | Efficient analysis of ploidy levels in plant evolutionary ecology | |
CN105069712A (en) | Patent analysis system and analysis method | |
CN110176276B (en) | Biological information analysis process management method and system | |
CN104899216B (en) | A kind of discarded call bill processing method and device | |
Reyes et al. | Inferring differential exon usage in RNA-Seq data with the DEXSeq package | |
CN106610964A (en) | Abnormal reimbursement data analysis method and server | |
CN111400644B (en) | Calculation processing method for laboratory analysis sample | |
CN107609349A (en) | A kind of project implementation quality control system in bioanalysis platform | |
CN105740652A (en) | sRNA analysis system and method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |