CN104484558B - Method and system for automatically generating analysis reports for bioinformatics projects - Google Patents
- Publication number
- CN104484558B (application CN201410742360.9A)
- Authority
- CN
- China
- Prior art keywords
- project
- sub
- analysis
- sample
- quality control
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Landscapes
- Investigating Or Analysing Biological Materials (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Disclosed is a method for automatically generating analysis reports for a bioinformatics project, comprising the steps of: when a sub-project is a filter-only sub-project, filtering and analyzing each sample's off-machine data after sequencing on the sequencer is complete, using a unified filtering standard according to a preset parameter configuration, and generating an analysis result report in web-page and/or PDF form; when a sub-project is a standardized sub-project, creating standard-pipeline analyses for the corresponding sample while the sample is being sequenced, with the user entering the corresponding filtering parameters, according to the samples of the current sub-project, while each standard-pipeline analysis is created; and, after sequencing is complete, first filtering out part of the sample data according to the entered filtering parameters, then analyzing each remaining piece of sample data with the created standard-pipeline analyses to generate an analysis result report in web-page and/or PDF form.
Description
Technical field
The present invention relates to the field of bioinformatics analysis, and in particular to a method and system for automatically generating analysis reports for bioinformatics projects.
Background art
With the rapid development of experimental techniques in the life sciences, scientific instruments have become increasingly automated and intelligent, and data output capacity has made a qualitative leap. At the same time, the life sciences place higher standards and requirements on analytical testing in terms of sample size, analysis cycle, analysis items and data accuracy, and the information produced by biology laboratories is growing geometrically.
In a traditional biology laboratory, because data types are varied and formats are inconsistent, storing, exchanging, querying, analyzing and maintaining data are all inconvenient, which seriously hinders the exchange of information between researchers. Massively parallel sequencing (also called next-generation sequencing, NGS, or high-throughput sequencing) experiments and bioinformatics analysis involve many processes, such as DNA library construction, genome sequencing, data processing, result analysis, output of results and data sharing. Different technical staff take part in each stage, so information can be lost, or efficiency can suffer, when work is handed over between stages. In the sequencing and high-performance computing stages of bioinformatics in particular, a professional sequencing laboratory must accept a large number of sequencing orders, schedule sequencing experiments, and promptly process sequencing results that are produced at high speed.
In the bioinformatics stage that follows sequencing, every project is handled by the project team of the corresponding service line. A service line handles filter-only work, standardized work and personalized work alike. Each project also goes through (1) project initiation by project management, (2) approval by the person in charge of information analysis, (3) confirmation by the information analysis executor, (4) scheduling and running of the pipeline, (5) filling in of the report, (6) delivery, and (7) communication by project management, all of which consumes the service line's already limited resources.
In addition, in the prior art, once the analysis pipeline finishes it generates, under Linux, several files similar to txt files under Windows, which contain the data needed for the whole final report. Because the data are scattered across those files, staff must (1) open each file under its Linux path; (2) manually copy certain data, such as the data volume; (3) fill the data into a Word report template; (4) carry out quality control (QC); and (5) have project management send the report to the client.
This prior-art way of producing analysis result reports has the following drawbacks:
First, files under Linux are not as convenient to work with as Word documents, so reading the data is cumbersome.
Second, data are copied and reformatted by hand (for example, 3000000 must be rewritten as 3,000,000) and entered manually, and the Word template is easy to alter, so data errors arise easily. This adds pressure both to filling in the report and to the QC in step 4 above.
Third, the process is time-consuming and laborious: filling in and quality-checking a single final report often takes several hours.
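By way of illustration only, the manual reformatting described in the second drawback (rewriting 3000000 as 3,000,000) is the kind of step that can be done programmatically. The following minimal sketch uses Python's built-in format specification; the function name `format_count` is hypothetical and is not taken from the patent.

```python
def format_count(value: int) -> str:
    """Render an integer with thousands separators, as a report would show it."""
    return f"{value:,}"


if __name__ == "__main__":
    # Example: a raw read count copied from a pipeline output file.
    print(format_count(3000000))
```

Formatting the value at report-generation time, rather than by hand, removes one source of transcription error.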
Summary of the invention
An object of the present invention is to provide a method and system for automatically generating analysis reports for bioinformatics projects that can quickly and automatically generate analysis reports in web-page and/or PDF form. Because web-page and PDF reports are not easily altered, errors are reduced and the pressure of QC work is greatly relieved.
The present invention provides a method for automatically generating analysis reports for bioinformatics projects, comprising the steps of:
Step 1: creating a project and storing it in a business management system, each project comprising multiple sub-projects; and selecting a sub-project and task information in the created project, the type of the sub-project being either a filter-only sub-project or a standardized sub-project;
Step 2: when the sub-project is a filter-only sub-project, obtaining in turn, from an off-machine data management system and according to the selected sub-project type and task information, the corresponding sample data produced by the sequencer; each time a piece of sample data is obtained, filtering and analyzing it with a unified filtering standard according to a preset parameter configuration, so as to filter out sample data that do not satisfy the parameter configuration; and, once all the obtained sample data have been filtered and analyzed, directly generating an analysis result report in web-page and/or PDF form, the analysis result report comprising sub-project information and the corresponding sample information;
Step 3: when the sub-project is a standardized sub-project, creating for the corresponding sample, while it is being sequenced on the sequencer, one or more standard-pipeline analyses selected from filtering analysis, expression profile quantification analysis, differential comparison analysis, Cluster clustering analysis, microRNA target prediction analysis, KOGO analysis and base editing analysis, with the user entering the corresponding filtering parameters according to the samples of the current sub-project while each standard-pipeline analysis is created; after sequencing is complete, filtering each piece of sample data according to the entered filtering parameters, so as to remove sample data that do not satisfy the filtering parameters; then analyzing each piece of sample data that satisfies the filtering parameters with the created standard-pipeline analyses, and directly generating an analysis result report in web-page and/or PDF form, the analysis result report comprising sub-project information and the corresponding sample information;
Step 4: performing quality control (QC) on the analysis result report; if QC passes, outputting the analysis result report directly; if QC fails and the gap between the analysis result report and the QC standard is within a threshold range, updating the sample data or the filtering/QC parameters and then repeating the filtering and analysis of step 2 or step 3 until the analysis result report passes QC; if QC fails and the gap between the analysis result report and the QC standard exceeds the threshold, editing the sample, discarding the related Lane, and placing a new order in the business management system.
As an improvement of the above technical solution, the summary information of each sub-project includes the sub-project code, sub-project name, sub-project type, whether it is filter-only, total number of samples, executor, start and end times, sub-project status, and sub-project-related operations.
As an improvement of the above technical solution, the sample information includes the sample ID, library name, Lane ID, sequencing strategy, Flowcell ID, Raw data, Raw Reads, Read Length, GC%, Q20%, Q30%, Error Rate, a base distribution plot and a base quality distribution plot.
As an improvement of the above technical solution, the method further comprises:
Step 5: storing and backing up the analysis result report.
As an improvement of the above technical solution, in step 4, if QC fails and the gap between the analysis result report and the QC standard is within the threshold range, the sample data may be updated either by editing a single sample's data or by editing samples in batches.
The invention also discloses a system for automatically generating analysis reports for bioinformatics projects, comprising:
a creation unit, configured to create a project and store it in a business management system, each project comprising multiple sub-projects, and to select a sub-project and task information in the created project, the type of the sub-project being either a filter-only sub-project or a standardized sub-project;
a first filtering-analysis unit, configured, when the sub-project is a filter-only sub-project, to obtain in turn, from an off-machine data management system and according to the selected sub-project type and task information, the corresponding sample data produced by the sequencer; each time a piece of sample data is obtained, to filter and analyze it with a unified filtering standard according to a preset parameter configuration, so as to filter out sample data that do not satisfy the parameter configuration; and, once all the obtained sample data have been filtered and analyzed, to generate an analysis result report in web-page and/or PDF form, the analysis result report comprising sub-project information and the corresponding sample information;
a second filtering-analysis unit, configured, when the sub-project is a standardized sub-project, to create for the corresponding sample, while it is being sequenced on the sequencer, one or more standard-pipeline analyses selected from filtering analysis, expression profile quantification analysis, differential comparison analysis, Cluster clustering analysis, microRNA target prediction analysis, KOGO analysis and base editing analysis, with the user entering the corresponding filtering parameters according to the samples of the current sub-project while each standard-pipeline analysis is created; after sequencing is complete, to filter each piece of sample data according to the entered filtering parameters, so as to remove sample data that do not satisfy the filtering parameters; and then to analyze each piece of sample data that satisfies the filtering parameters with the created standard-pipeline analyses and generate an analysis result report in web-page and/or PDF form, the analysis result report comprising sub-project information and the corresponding sample information;
a QC unit, configured to perform QC on the analysis result report; if QC passes, to output the analysis result report directly; if QC fails and the gap between the analysis result report and the QC standard is within a threshold range, to update the sample data or the filtering/QC parameters and then repeat the filtering and analysis of step 2 or step 3 until the analysis result report passes QC; and, if QC fails and the gap between the analysis result report and the QC standard exceeds the threshold, to edit the sample, discard the related Lane, and place a new order in the business management system.
As an improvement of the above technical solution, the summary information of each sub-project includes the sub-project code, sub-project name, sub-project type, whether it is filter-only, total number of samples, executor, start and end times, sub-project status, and sub-project-related operations.
As an improvement of the above technical solution, the sample information includes the sample ID, library name, Lane ID, sequencing strategy, Flowcell ID, Raw data, Raw Reads, Read Length, GC%, Q20%, Q30%, Error Rate, a base distribution plot and a base quality distribution plot.
As an improvement of the above technical solution, the system further comprises:
a storage unit, configured to store and back up the analysis result report that has passed QC.
As an improvement of the above technical solution, in the QC unit, if QC fails and the gap between the analysis result report and the QC standard is within the threshold range, the sample data may be updated either by editing a single sample's data or by editing samples in batches.
Compared with the prior art, the method and system for automatically generating analysis reports for bioinformatics projects disclosed by the invention have the following advantages: they can quickly and automatically generate analysis reports in web-page and/or PDF form, and because web-page and PDF reports are not easily altered, errors are reduced and the pressure of QC work is greatly relieved. In addition, a PDF report is easier to distribute in a fixed form than a Word report, and the web-page version can be viewed online.
Brief description of the drawings
Fig. 1 is a schematic flowchart of a method for automatically generating analysis reports for a bioinformatics project according to an embodiment of the present invention.
Fig. 2 shows the detailed flow of step S2 in Fig. 1.
Fig. 3 shows the detailed flow of step S3 in Fig. 1.
Fig. 4 shows the detailed flow of step S4 in Fig. 1.
Fig. 5 shows the detailed flow of step S5 in Fig. 1.
Fig. 6 is a schematic structural diagram of a system for automatically generating analysis reports for a bioinformatics project according to an embodiment of the present invention.
Fig. 7 is a screenshot of a UI page of one embodiment of the system for automatically generating analysis reports for a bioinformatics project according to the present invention, showing the selection list of sub-projects.
Fig. 8 is a screenshot of a UI page of one embodiment of the system for automatically generating analysis reports for a bioinformatics project according to the present invention, showing the summary information of each sub-project.
Fig. 9 is a screenshot of a UI page of one embodiment of the system for automatically generating analysis reports for a bioinformatics project according to the present invention, showing the parameter-setting interface for a filter-only sub-project.
Fig. 10 is a screenshot of a UI page of one embodiment of the system for automatically generating analysis reports for a bioinformatics project according to the present invention, showing the parameter-setting interface and the standard-pipeline analysis selection interface for a standardized sub-project.
Detailed description of the embodiments
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are evidently only some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art on the basis of these embodiments without creative effort fall within the scope of protection of the present invention.
Fig. 1 is a schematic flowchart of a method for automatically generating analysis reports for a bioinformatics project according to an embodiment of the present invention. The method comprises the steps of:
S1: Creating a project and storing it in a business management system, each project comprising multiple sub-projects; and selecting a sub-project and task information in the created project, the type of the sub-project being either a filter-only sub-project or a standardized sub-project;
In this step, the summary information of each selected sub-project includes the sub-project code, sub-project name, sub-project type, whether it is filter-only, total number of samples, executor, start and end times, sub-project status, and sub-project-related operations.
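By way of illustration only, the sub-project summary listed in step S1 could be held in a small record type. The following is a minimal sketch; the class and field names (`SubProjectSummary`, `sub_type`, and so on) are hypothetical, the "sub-project-related operations" field is omitted because it is a UI element, and the Y/N encoding of the filter-only flag is an assumption.

```python
from dataclasses import dataclass
from datetime import datetime
from enum import Enum
from typing import Optional


class SubProjectType(Enum):
    FILTER_ONLY = "filter-only"
    STANDARDIZED = "standardized"


@dataclass
class SubProjectSummary:
    """Summary fields of one sub-project, as enumerated in step S1."""
    code: str
    name: str
    sub_type: SubProjectType
    total_samples: int
    executor: str
    start_time: datetime
    status: str
    end_time: Optional[datetime] = None  # unknown while the sub-project is running

    @property
    def filter_only_flag(self) -> str:
        # "Whether it is filter-only", encoded as Y/N (an assumed encoding).
        return "Y" if self.sub_type is SubProjectType.FILTER_ONLY else "N"
```

Deriving the filter-only flag from the type, rather than storing it separately, keeps the two fields from disagreeing.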
S2: When the sub-project is a filter-only sub-project, obtaining in turn, from the off-machine data management system and according to the selected sub-project type and task information, the corresponding sample data produced by the sequencer; each time a piece of sample data is obtained, filtering and analyzing it with a unified filtering standard according to a preset parameter configuration, so as to filter out sample data that do not satisfy the parameter configuration; and, once all the obtained sample data have been filtered and analyzed, directly generating an analysis result report in web-page and/or PDF form, the analysis result report comprising sub-project information and the corresponding sample information;
In this step, the sample information includes the sample ID, library name, Lane ID, sequencing strategy, Flowcell ID, Raw data, Raw Reads, Read Length, GC%, Q20%, Q30%, Error Rate, a base distribution plot and a base quality distribution plot.
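The patent does not state how the per-sample statistics are computed. By way of illustration only, the following minimal sketch derives Raw Reads, Raw data (in bases), GC%, Q20% and Q30% from a list of reads, assuming each read is a (sequence, quality string) pair with Phred+33 quality encoding; the function name `summarize_reads` and the returned key names are hypothetical.

```python
def summarize_reads(reads):
    """Compute basic statistics from (sequence, quality_string) pairs.

    Assumes Phred+33 encoding: quality score = ord(character) - 33.
    """
    n_reads = total_bases = gc = q20 = q30 = 0
    for seq, qual in reads:
        n_reads += 1
        total_bases += len(seq)
        gc += sum(1 for base in seq.upper() if base in "GC")
        scores = [ord(ch) - 33 for ch in qual]
        q20 += sum(1 for s in scores if s >= 20)
        q30 += sum(1 for s in scores if s >= 30)
    if total_bases == 0:
        raise ValueError("no bases to summarize")
    return {
        "raw_reads": n_reads,
        "raw_data_bases": total_bases,
        "gc_percent": 100.0 * gc / total_bases,
        "q20_percent": 100.0 * q20 / total_bases,
        "q30_percent": 100.0 * q30 / total_bases,
    }
```

Here Q20% and Q30% are taken to mean the percentage of bases whose quality score is at least 20 or 30 respectively, which is the usual convention but is not spelled out in the text.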
S3: When the sub-project is a standardized sub-project, creating for the corresponding sample, while it is being sequenced on the sequencer, one or more standard-pipeline analyses selected from filtering analysis, expression profile quantification analysis, differential comparison analysis, Cluster clustering analysis, microRNA target prediction analysis, KOGO analysis and base editing analysis, with the user entering the corresponding filtering parameters according to the samples of the current sub-project while each standard-pipeline analysis is created; after sequencing is complete, filtering each piece of sample data according to the entered filtering parameters, so as to remove sample data that do not satisfy the filtering parameters; then analyzing each piece of sample data that satisfies the filtering parameters with the created standard-pipeline analyses, and directly generating an analysis result report in web-page and/or PDF form, the analysis result report comprising sub-project information and the corresponding sample information;
S4: Performing QC on the analysis result report; if QC passes, outputting the analysis result report directly; if QC fails and the gap between the analysis result report and the QC standard is within a threshold range, updating the sample data or the filtering/QC parameters and then repeating the filtering and analysis of step S2 or step S3 until the analysis result report passes QC; if QC fails and the gap between the analysis result report and the QC standard exceeds the threshold, editing the sample, discarding the related Lane, and placing a new order in the business management system.
S5: Storing and backing up the analysis result report.
In the present invention, filtering and analysis are carried out differently depending on whether the sub-project is a filter-only sub-project or a standardized sub-project. The two cases are described in detail below with reference to Fig. 2 and Fig. 3 respectively.
As shown in Fig. 2, when the sub-project is a filter-only sub-project, filtering and analysis of the sample information comprise the steps of:
S201: Detecting that a corresponding filter-only sequencing sample has come off the machine;
In this step, "coming off the machine" refers to the sample data obtained once the sequencer has finished sequencing the sample.
S202: Filtering and analyzing (running) the sequencing sample according to the preset parameter configuration;
In this step, the off-machine data of every filter-only sequencing sample are processed with a unified filtering and analysis standard, so that off-machine data that do not meet the standard are filtered out and an analysis result is obtained.
S203: Determining whether filtering and analysis (run) are complete for all sequencing samples of the sub-project (project); if so, proceeding to step S204, otherwise returning to step S202;
S204: Directly generating an analysis result report in web-page and/or PDF form.
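By way of illustration only, steps S201 to S204 can be sketched as a loop that applies one preset configuration to every sample. Everything concrete here is an assumption made for the example: the configuration key `min_mean_quality`, the representation of off-machine data as a list of records with a `mean_quality` field, and the function names.

```python
# Hypothetical preset parameter configuration shared by all filter-only samples.
PRESET_CONFIG = {"min_mean_quality": 20}


def filter_and_analyze(records, config):
    """S202: keep records that satisfy the preset configuration, then summarize."""
    kept = [r for r in records if r["mean_quality"] >= config["min_mean_quality"]]
    return {"kept": len(kept), "dropped": len(records) - len(kept)}


def run_filter_only(offmachine_data, config=PRESET_CONFIG):
    """Process each sample as its off-machine data arrive (S201 to S203).

    `offmachine_data` maps a sample ID to that sample's list of records.
    The returned dict is what a report generator (S204) would consume.
    """
    results = {}
    for sample_id, records in offmachine_data.items():
        results[sample_id] = filter_and_analyze(records, config)
    return results
```

Because the configuration is fixed in advance, no user input is needed between sequencing and report generation, which is what distinguishes this path from the standardized one.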
As shown in Fig. 3, when the sub-project is a standardized sub-project, filtering and analysis of the sample information comprise the steps of:
S301: Detecting that a standardized sample has been loaded onto the machine;
In this step, "loading onto the machine" refers to loading the sample onto the sequencer for sequencing.
S302: Creating one or more standard-pipeline analyses for the standardized sample, with the user setting the corresponding filtering parameters while the standard-pipeline analyses are created; the standard-pipeline analyses include, but are not limited to, filtering analysis, expression profile quantification analysis, differential comparison analysis, Cluster clustering analysis, microRNA target prediction analysis, KOGO analysis and base editing analysis;
S303: Detecting that the selected sample has come off the machine;
In this step, "coming off the machine" refers to the sample data obtained once the sequencer has finished sequencing the sample.
S304: Filtering the standardized sample according to the filtering parameters set by the user, so as to filter out the off-machine data of the standardized sample that do not satisfy the filtering parameters, and then analyzing the sample data remaining after filtering with the created standard-pipeline analyses;
S305: Determining whether filtering and analysis (run) are complete for all standardized samples of the sub-project (project); if so, proceeding to step S204, otherwise returning to step S202;
S306: Directly generating an analysis result report in web-page and/or PDF form.
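By way of illustration only, the standardized flow differs from the filter-only flow in that the analyses and filtering parameters are chosen by the user before the data arrive (S302) and applied afterwards (S304). The sketch below assumes a registry of analysis functions keyed by name; the two registered analyses are trivial placeholders, not the analyses named in the patent, and the parameter `min_mean_quality` is hypothetical.

```python
def count_records(records):
    return len(records)


def mean_quality(records):
    if not records:
        return None
    return sum(r["mean_quality"] for r in records) / len(records)


# Placeholder registry; a real system would register its own pipeline analyses.
STANDARD_ANALYSES = {"record_count": count_records, "mean_quality": mean_quality}


def create_plan(selected, filter_params):
    """S302: record the chosen analyses and the user's filtering parameters."""
    unknown = [name for name in selected if name not in STANDARD_ANALYSES]
    if unknown:
        raise KeyError(f"unknown analyses: {unknown}")
    return {"analyses": list(selected), "filter_params": dict(filter_params)}


def run_plan(plan, records):
    """S304: filter with the user's parameters, then run each created analysis."""
    threshold = plan["filter_params"]["min_mean_quality"]
    kept = [r for r in records if r["mean_quality"] >= threshold]
    return {name: STANDARD_ANALYSES[name](kept) for name in plan["analyses"]}
```

Creating the plan while the sample is still being sequenced means the run can start as soon as the off-machine data are detected.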
Fig. 4 shows the process of performing QC on the analysis result report obtained after the data of any one sample in a sub-project have been filtered and analyzed. Note that QC is carried out only after the sample data of all samples of the sub-project have been filtered and analyzed, and that QC is performed on each sample in turn. The process comprises the steps of:
S401: Detecting that a sample has completed filtering and analysis and that an analysis result report in web-page and/or PDF form has been directly generated;
S402: Performing QC on the analysis result report;
Specifically, the analysis result report is compared with the QC standard.
S403: Judging whether QC passes; if so, proceeding to step S404, otherwise proceeding to step S405;
S404: Outputting the analysis result report;
S405: Judging whether the gap between the analysis result report and the QC standard falls outside the threshold range (that is, the gap is too large); if not, proceeding to step S406; if so, proceeding to step S408;
S406: Updating the sample data or the filtering/QC parameters;
In this step, either a single sample's data may be edited or samples may be edited in batches.
S407: Filtering and analyzing the sample data again according to the sub-project type, directly regenerating the analysis result report in web-page and/or PDF form, and returning to step S402;
S408: Editing the sample, discarding the related Lane, and placing a new order in the BMS (Business Management System);
S409: Waiting for the new sample data to come off the machine, performing the corresponding filtering and analysis according to the sample's sub-project type, directly generating an analysis result report in web-page and/or PDF form, and returning to step S402;
Then, once QC has been performed on all sample data of a sub-project, a QC report is generated.
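By way of illustration only, the three-way decision of steps S403 and S405 can be written as a single function. The patent does not specify which metric is compared or how the gap is measured; this sketch assumes a single "higher is better" metric (for example Q30%) and an absolute difference, and all names are hypothetical.

```python
from enum import Enum


class QCAction(Enum):
    OUTPUT_REPORT = "output the report"                  # S404
    UPDATE_AND_RERUN = "update data or parameters"       # S406, S407
    DISCARD_AND_REORDER = "discard Lane and re-order"    # S408, S409


def decide_qc_action(value, standard, threshold):
    """Return the next action for one sample's analysis result.

    value:     measured metric from the analysis result report
    standard:  minimum value required to pass QC
    threshold: largest shortfall that is still worth re-filtering
    """
    if value >= standard:                 # S403: QC passes
        return QCAction.OUTPUT_REPORT
    gap = standard - value
    if gap <= threshold:                  # S405: gap within the threshold range
        return QCAction.UPDATE_AND_RERUN
    return QCAction.DISCARD_AND_REORDER   # S405: gap too large
```

For example, with a standard of 80 and a threshold of 5, a value of 77 would lead to re-filtering, while a value of 60 would lead to re-sequencing.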
With reference to Fig. 5, storing and backing up the analysis result report for the sample data comprises the steps of:
S501: Analyzing the sample data;
S502: Judging whether analysis of the sample is complete; if so, proceeding to step S503, otherwise continuing with step S501;
S503: The system enables the backup function;
S504: The user confirms the backup and clicks "Backup";
S505: The system reports that the backup request has been submitted;
S506: The system copies the data to the delivery system;
S507: Judging whether the copy succeeded; if so, proceeding to step S509, otherwise proceeding to step S508;
S508: Prompting the user that the backup failed, and returning to step S504.
S509: Prompting the user that the backup succeeded; the process then ends.
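By way of illustration only, steps S504 to S509 can be sketched as a confirm, copy and retry loop. The sketch assumes the delivery system is reachable as a directory and uses `shutil.copy2` for the copy; the callbacks `confirm` and `notify`, the message strings, and the `max_attempts` limit are hypothetical (the flow in Fig. 5 simply returns to S504 on failure).

```python
import shutil
from pathlib import Path


def back_up_report(report, delivery_dir, confirm, notify, max_attempts=3):
    """Copy a finished report into the delivery directory.

    confirm: callable returning True when the user clicks "Backup" (S504)
    notify:  callable taking a message string to show the user
    Returns True on success, False if the user declines or attempts run out.
    """
    report = Path(report)
    delivery_dir = Path(delivery_dir)
    for _ in range(max_attempts):
        if not confirm():                                      # S504
            return False
        notify("backup request submitted")                     # S505
        try:
            delivery_dir.mkdir(parents=True, exist_ok=True)
            shutil.copy2(report, delivery_dir / report.name)   # S506
        except OSError:
            notify("backup failed")                            # S508, back to S504
            continue
        notify("backup succeeded")                             # S509
        return True
    return False
```

Bounding the number of attempts is a design choice made for the example so that a persistent copy failure cannot loop forever.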
It can be seen that the method for automatically generating analysis reports for bioinformatics projects disclosed in this embodiment can quickly and automatically generate analysis reports in web-page and/or PDF form. Because web-page and PDF reports are not easily altered, errors are reduced and the pressure of QC work is greatly relieved. In addition, a PDF report is easier to distribute in a fixed form than a Word report, and the web-page version can be viewed online.
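The patent does not describe how the web-page report is rendered. By way of illustration only, the following minimal sketch builds a static HTML page from sub-project information and a list of per-sample records, escaping every value so that sample names cannot break the markup; the function name `render_report` and the dictionary keys are hypothetical.

```python
from html import escape


def render_report(sub_project, samples):
    """Return a self-contained HTML page with one table row per sample.

    sub_project: dict with at least "code" and "name"
    samples:     list of dicts sharing the same keys (used as column headers)
    """
    columns = list(samples[0].keys()) if samples else []
    head = "".join(f"<th>{escape(str(c))}</th>" for c in columns)
    rows = "".join(
        "<tr>"
        + "".join(f"<td>{escape(str(s.get(c, '')))}</td>" for c in columns)
        + "</tr>"
        for s in samples
    )
    title = escape(f"{sub_project['code']} {sub_project['name']}")
    return (
        "<!DOCTYPE html><html><head><meta charset='utf-8'>"
        f"<title>{title}</title></head><body><h1>{title}</h1>"
        f"<table><tr>{head}</tr>{rows}</table></body></html>"
    )
```

Because the page is generated directly from the analysis results, no value is retyped by hand between the pipeline output and the report.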
The present invention also provides a system for automatically generating analysis reports for bioinformatics projects. As shown in Fig. 6, the system comprises a creation unit 10, a first filtering-analysis unit 20, a second filtering-analysis unit 30, a QC unit 40 and a storage unit 50. The creation unit 10, first filtering-analysis unit 20, second filtering-analysis unit 30, QC unit 40 and storage unit 50 may be integrated into a single back-end server, while the front end runs directly in a web page in which the user operates the system and enters parameters. Specifically:
The creation unit 10 is configured to create a project and store it in the business management system (Business Management System, BMS: a system for assigning and managing sequencing and information-analysis tasks, which holds organizational information such as sub-projects, responsible persons and data), each project comprising multiple sub-projects, and to select a sub-project and task information in the created project; the type of the sub-project is either a filter-only sub-project or a standardized sub-project;
Fig. 7 is a screenshot of a UI page of one embodiment of the system for automatically generating analysis reports for a bioinformatics project according to the present invention, showing the selection list of sub-projects. The list shows multiple sub-projects, each marked as a filter-only sub-project (Y) or a standardized sub-project (N). Fig. 8 shows the summary information of one sub-project in detail. The summary information of each sub-project includes the sub-project code, sub-project name, sub-project type, whether it is filter-only, total number of samples, executor, start and end times, sub-project status, and sub-project-related operations.
The first filtering-analysis unit 20 is configured, when the sub-project is a filter-only sub-project, to obtain in turn, from the off-machine data management system (Data Management System, DMS: which performs quality monitoring and data management on the off-machine data produced by completed sequencing runs) and according to the selected sub-project type and task information, the corresponding sample data produced by the sequencer; each time a piece of sample data is obtained, to filter and analyze it with a unified filtering standard according to a preset parameter configuration, so as to filter out sample data that do not satisfy the parameter configuration; and, once all the obtained sample data have been filtered and analyzed, to directly generate an analysis result report in web-page and/or PDF form, the analysis result report comprising sub-project information and the corresponding sample information;
Fig. 9 is a screenshot of a UI page of one embodiment of the system for automatically generating analysis reports for a bioinformatics project according to the present invention, showing the parameter-setting interface for a filter-only sub-project.
The second filtering-analysis unit 30 is configured, when the sub-project is a standardized sub-project, to create for the corresponding sample, while it is being sequenced on the sequencer, one or more standard-pipeline analyses selected from filtering analysis, expression profile quantification analysis, differential comparison analysis, Cluster clustering analysis, microRNA target prediction analysis, KOGO analysis and base editing analysis, with the user entering the corresponding filtering parameters according to the samples of the current sub-project while each standard-pipeline analysis is created; after sequencing is complete, to filter each piece of sample data according to the entered filtering parameters, so as to remove sample data that do not satisfy the filtering parameters; and then to analyze each piece of sample data that satisfies the filtering parameters with the created standard-pipeline analyses and directly generate an analysis result report in web-page and/or PDF form, the analysis result report comprising sub-project information and the corresponding sample information;
The sample information includes the sample ID, library name, Lane ID, sequencing strategy, Flowcell ID, Raw data, Raw Reads, Read Length, GC%, Q20%, Q30%, Error Rate, a base distribution plot and a base quality distribution plot.
Fig. 10 is a screenshot of a UI page of one embodiment of the system for automatically generating analysis reports for a bioinformatics project according to the present invention, showing the parameter-setting interface and the standard-pipeline analysis selection interface for a standardized sub-project.
Quality control unit 40, for performing quality control (QC) on the analysis result information: if QC passes, the
analysis result report is output directly; if QC fails and the gap between the analysis result information and the
QC standard is within the threshold range, the sample data or the filtering/QC parameters are re-edited (sample data
can be edited one sample at a time or in batches), and the filtering and analysis of the first filter analysis
unit 20 or the second filter analysis unit 30 is run again, until the analysis result information passes QC; if QC
fails and the gap between the analysis result information and the QC standard exceeds the threshold, the sample is
edited and the related Lane is discarded, and the order is placed again in the business management system; and
Storage unit 50, for storing and backing up the analysis result information.
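The three-way decision made by the quality control unit can be sketched as follows. This is only an illustration under stated assumptions: the patent does not say which metric is compared or how the gap is measured, so the sketch assumes a single numeric metric (for example Q30%) with a minimum standard, and the names `QCAction` and `decide_qc_action` are hypothetical.

```python
from enum import Enum


class QCAction(Enum):
    EXPORT = "output the report directly"
    RERUN = "edit data or parameters, then filter and analyze again"
    REORDER = "edit sample, discard the Lane, place the order again"


def decide_qc_action(metric_value, standard, threshold):
    """Pick the QC outcome for one metric.

    Assumption: QC passes when metric_value >= standard, and the
    'gap' is the shortfall below the standard.
    """
    gap = standard - metric_value
    if gap <= 0:
        return QCAction.EXPORT      # QC passed
    if gap <= threshold:
        return QCAction.RERUN       # failed, gap within threshold
    return QCAction.REORDER         # failed, gap exceeds threshold
```

For example, with a standard of 80 and a threshold of 5, a value of 78 leads to a re-run, while a value of 70 leads to discarding the Lane and re-ordering.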
It can be seen that the automatic analysis report generation system for bioinformatics projects disclosed in this
embodiment can automatically and quickly generate web-version and/or PDF-version analysis reports. Because web and
PDF reports are not easily altered, errors are reduced and the pressure of quality control work is greatly relieved.
In addition, PDF reports are easier than Word reports to distribute in a fixed form, and the web version can be
viewed online.
The above is a preferred embodiment of the present invention. It should be noted that those of ordinary skill in the
art may make various improvements and modifications without departing from the principle of the present invention,
and these improvements and modifications are also regarded as falling within the protection scope of the present
invention.
Claims (8)
1. A method for automatically generating an analysis report for a bioinformatics project, characterized by comprising the steps of:
Step 1: creating a project and storing it in a business management system, each project including multiple sub-projects; and selecting a sub-project and task information in the created project; the types of sub-project include filter-only sub-projects and standardized sub-projects; wherein the summary information of each sub-project includes the sub-project code, sub-project name, sub-project type, whether it is filter-only, total number of samples, executor, start and end times, sub-project status, and sub-project-related operations;
Step 2: when the type of the sub-project is filter-only, obtaining in turn from the off-machine data management system, according to the selected sub-project type and task information, the corresponding sample data that has been sequenced by the sequencer; each time one sample's data is obtained, filtering and analyzing it against a unified filtering standard according to the default parameter configuration, so that sample data not meeting the parameter configuration is filtered out; and, after all obtained sample data has been filtered and analyzed, directly generating a web-version and/or PDF-version analysis result report, the analysis result report including sub-project information and the corresponding sample information;
Step 3: when the type of the sub-project is a standardized sub-project, then, while the sample corresponding to the sub-project is being sequenced by the sequencer, creating for the sample one or more standard workflow analyses selected from filter analysis, expression profile quantitative analysis, differential comparison analysis, Cluster clustering analysis, microRNA target prediction analysis, KOGO analysis, and base editing analysis, the user entering the corresponding filtering parameters according to the sample conditions of the current sub-project while each standard workflow analysis is being created; after sequencing by the sequencer is complete, filtering each sample's data according to the entered filtering parameters, so that sample data not meeting the filtering parameters is removed; then applying the created standard workflow analyses to each sample's data that meets the filtering parameters, and directly generating web-version and/or PDF-version analysis result information, the analysis result information including sub-project information and the corresponding sample information;
Step 4: performing quality control (QC) on the analysis result information; if QC passes, directly outputting the analysis result information; if QC fails and the gap between the analysis result information and the QC standard is within the threshold range, updating the sample data or the filtering/QC parameters and then performing the filtering and analysis of step 2 or step 3 again, until the analysis result report passes QC; if QC fails and the gap between the analysis result information and the QC standard exceeds the threshold, editing the sample and discarding the related Lane, and placing the order again in the business management system.
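The branch between steps 2 and 3 of claim 1, where a filter-only sub-project uses a unified default configuration and a standardized sub-project uses parameters entered by the user, can be sketched as below. This is a hedged illustration only: the `SubProject` class, the `min_q30_pct` parameter, and the default value of 80.0 are hypothetical and do not appear in the patent.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical default configuration; the patent gives no concrete values.
DEFAULT_FILTER = {"min_q30_pct": 80.0}


@dataclass
class SubProject:
    code: str
    filter_only: bool                   # True: filter-only; False: standardized
    user_filter: Optional[dict] = None  # entered by the user when standardized


def select_filter_params(sub_project):
    """Filter-only: unified default. Standardized: user-entered parameters."""
    if sub_project.filter_only:
        return dict(DEFAULT_FILTER)
    if not sub_project.user_filter:
        raise ValueError("standardized sub-project needs user-entered parameters")
    return dict(sub_project.user_filter)


def filter_samples(samples, params):
    """Keep only samples whose Q30% meets the configured minimum."""
    return [s for s in samples if s["q30_pct"] >= params["min_q30_pct"]]
```

Keeping parameter selection separate from the filtering itself means both sub-project types share one filtering routine and differ only in where the parameters come from.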
2. The method for automatically generating an analysis report for a bioinformatics project according to claim 1, characterized in that the sample information includes sample ID, library name, Lane ID, sequencing strategy, Flowcell ID, Raw data, Raw Reads, Read Length, GC%, Q20%, Q30%, Error Rate, a base distribution plot, and a base quality control distribution plot.
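To show how the tabular sample fields of claim 2 might appear in a web-version report, here is a minimal Python sketch that renders them as an HTML table. The patent does not describe its report layout, so the function name `render_sample_table`, the column list, and the dictionary-based row format are assumptions; the two plots are omitted because they are images rather than table cells.

```python
from html import escape

# Column headers follow the sample fields named in the claim.
COLUMNS = ["Sample ID", "Library name", "Lane ID", "Sequencing strategy",
           "Flowcell ID", "Raw data", "Raw Reads", "Read Length",
           "GC%", "Q20%", "Q30%", "Error Rate"]


def render_sample_table(rows):
    """Render sample rows (dicts keyed by column name) as an HTML table.

    Missing fields become empty cells; all values are HTML-escaped.
    """
    head = "".join(f"<th>{escape(c)}</th>" for c in COLUMNS)
    body = []
    for row in rows:
        cells = "".join(
            f"<td>{escape(str(row.get(c, '')))}</td>" for c in COLUMNS
        )
        body.append(f"<tr>{cells}</tr>")
    return f"<table><tr>{head}</tr>{''.join(body)}</table>"
```

Escaping every value guards against sample or library names that contain characters such as `<` or `&`.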
3. The method for automatically generating an analysis report for a bioinformatics project according to claim 1, characterized by further comprising:
Step 5: storing and backing up the analysis result information.
4. The method for automatically generating an analysis report for a bioinformatics project according to claim 1, characterized in that, in step 4: if QC fails and the gap between the analysis result information and the QC standard is within the threshold range, the sample data may be updated by editing a single sample's data or by batch-editing samples.
5. A system for automatically generating an analysis report for a bioinformatics project, characterized by comprising:
Creating unit, for creating a project and storing it in a business management system, each project including multiple sub-projects; and for selecting a sub-project and task information in the created project; the types of sub-project include filter-only sub-projects and standardized sub-projects; the summary information of each sub-project includes the sub-project code, sub-project name, sub-project type, whether it is filter-only, total number of samples, executor, start and end times, sub-project status, and sub-project-related operations;
First filter analysis unit, for, when the type of the sub-project is filter-only, obtaining in turn from the off-machine data management system, according to the selected sub-project type and task information, the corresponding sample data that has been sequenced by the sequencer; each time one sample's data is obtained, filtering and analyzing it against a unified filtering standard according to the default parameter configuration, so that sample data not meeting the parameter configuration is filtered out; and, after all obtained sample data has been filtered and analyzed, generating web-version and/or PDF-version analysis result information, the analysis result information including sub-project information and the corresponding sample information;
Second filter analysis unit, for, when the type of the sub-project is a standardized sub-project, creating for the sample corresponding to the sub-project, while that sample is being sequenced by the sequencer, one or more standard workflow analyses selected from filter analysis, expression profile quantitative analysis, differential comparison analysis, Cluster clustering analysis, microRNA target prediction analysis, KOGO analysis, and base editing analysis, the user entering the corresponding filtering parameters according to the sample conditions of the current sub-project while each standard workflow analysis is being created; after sequencing by the sequencer is complete, filtering each sample's data according to the entered filtering parameters, so that sample data not meeting the filtering parameters is removed; then applying the created standard workflow analyses to each sample's data that meets the filtering parameters, and generating web-version and/or PDF-version analysis result information, the analysis result information including sub-project information and the corresponding sample information;
Quality control unit, for performing quality control (QC) on the analysis result information: if QC passes, directly outputting the analysis result report; if QC fails and the gap between the analysis result information and the QC standard is within the threshold range, updating the sample data or the filtering/QC parameters and then performing the filtering and analysis of step 2 or step 3 again, until the analysis result report passes QC; if QC fails and the gap between the analysis result information and the QC standard exceeds the threshold, editing the sample and discarding the related Lane, and placing the order again in the business management system.
6. The system for automatically generating an analysis report for a bioinformatics project according to claim 5, characterized in that the sample information includes sample ID, library name, Lane ID, sequencing strategy, Flowcell ID, Raw data, Raw Reads, Read Length, GC%, Q20%, Q30%, Error Rate, a base distribution plot, and a base quality control distribution plot.
7. The system for automatically generating an analysis report for a bioinformatics project according to claim 5, characterized by further comprising:
Storage unit: for storing and backing up the analysis result information that has passed QC.
8. The system for automatically generating an analysis report for a bioinformatics project according to claim 5, characterized in that, in the quality control unit: if QC fails and the gap between the analysis result information and the QC standard is within the threshold range, the sample data may be re-edited by editing a single sample's data or by batch-editing samples.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410742360.9A CN104484558B (en) | 2014-12-08 | 2014-12-08 | The analysis report automatic generation method and system of biological information project |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104484558A CN104484558A (en) | 2015-04-01 |
CN104484558B true CN104484558B (en) | 2018-04-24 |
Family
ID=52759099
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410742360.9A Active CN104484558B (en) | 2014-12-08 | 2014-12-08 | The analysis report automatic generation method and system of biological information project |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104484558B (en) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106775998A (en) * | 2016-11-09 | 2017-05-31 | 上海派森诺生物科技股份有限公司 | High flux 16S sequencing data automatic processing methods |
CN106557666A (en) * | 2016-11-15 | 2017-04-05 | 上海派森诺生物科技股份有限公司 | It is a kind of can be based on the data filtering method of Nextseq500 high-flux sequence platforms |
CN107329941B (en) * | 2017-07-05 | 2020-07-10 | 福建省特种设备检验研究院 | Special equipment inspection report based editing and previewing system |
CN108776748A (en) * | 2018-05-16 | 2018-11-09 | 成都奇恩生物科技有限公司 | A kind of gene detection system and its detection method |
CN109063959A (en) * | 2018-06-22 | 2018-12-21 | 深圳弘睿康生物科技有限公司 | A kind of sample quality control analysis method and system |
CN109712683A (en) * | 2018-12-17 | 2019-05-03 | 上海派森诺医学检验所有限公司 | A kind of examining report generation method of automation |
CN110176276B (en) * | 2019-04-12 | 2021-01-05 | 苏州赛美科基因科技有限公司 | Biological information analysis process management method and system |
CN110335638B (en) * | 2019-05-22 | 2021-11-23 | 北京安智因生物技术有限公司 | Automatic generation method and system for statin drug gene detection report |
CN112666360A (en) * | 2019-09-27 | 2021-04-16 | 深圳迈瑞生物医疗电子股份有限公司 | Sample information query method and sample analyzer |
CN111424075B (en) * | 2020-04-10 | 2021-01-15 | 西咸新区予果微码生物科技有限公司 | Third-generation sequencing technology-based microorganism detection method and system |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1711548A (en) * | 2002-11-18 | 2005-12-21 | 西门子公司 | Method for quality control of analytical processes and device for implementing the method |
CN101914628A (en) * | 2010-09-02 | 2010-12-15 | 深圳华大基因科技有限公司 | Method and system for detecting polymorphism locus of genome target region |
CN102206704A (en) * | 2011-03-02 | 2011-10-05 | 深圳华大基因科技有限公司 | Method and device for assembling genome sequence |
CN102477460A (en) * | 2010-11-24 | 2012-05-30 | 深圳华大基因科技有限公司 | Method for sequencing and clustering analysis of metagenome 16S hypervariable region V6 |
CN103324866A (en) * | 2013-03-26 | 2013-09-25 | 张弘 | Ripple system |
CN103714180A (en) * | 2014-01-08 | 2014-04-09 | 浪潮(北京)电子信息产业有限公司 | Bioinformatics database system and data processing method |
CN103993069A (en) * | 2014-03-21 | 2014-08-20 | 深圳华大基因科技服务有限公司 | Virus integration site capture sequencing analysis method |
Also Published As
Publication number | Publication date |
---|---|
CN104484558A (en) | 2015-04-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104484558B (en) | The analysis report automatic generation method and system of biological information project | |
CN104484582B (en) | The biological information project automatic analysis method and system realized by modularization selection | |
Nellore et al. | Rail-RNA: scalable analysis of RNA-seq splicing and coverage | |
CN107943463B (en) | Interactive mode automation big data analysis application development system | |
US20030218634A1 (en) | System and methods for visualizing diverse biological relationships | |
CN105894183B (en) | Project evaluation method and device | |
CN106708718A (en) | Service framework interface testing method and device | |
Ockree et al. | Integrating big data analytics into development planning optimization | |
CN104484750B (en) | The product parameters automatic matching method and system of biological information project | |
CN104484375B (en) | Establish the method and system of database automatically in project analysis flow | |
CN114493482A (en) | Enterprise archive data processing method and system | |
Ramos-Gutiérrez et al. | Discovering configuration workflows from existing logs using process mining | |
CN105653897B (en) | LncRNA analysis system and method based on biological cloud platform | |
CN104484581B (en) | The automated analysis method and system of biological information project | |
Bridges et al. | Towards a standard benchmark for variant and gene prioritisation algorithms: PhEval-Phenotypic inference Evaluation framework | |
CN112434032B (en) | Automatic feature generation system and method | |
WO2016039011A1 (en) | Simulation system, simulation method, and simulation program | |
Ranjan et al. | SNIKT: sequence-independent adapter identification and removal in long-read shotgun sequencing data | |
CN118072829A (en) | Protein acid-base stability prediction method, electronic equipment and storage medium | |
CN113436023B (en) | Financial product recommendation method and device based on block chain | |
Van der Blom et al. | Sparkle: Toward Accessible Meta-Algorithmics for Improving the State of the Art in Solving Challenging Problems | |
Baruti | Learning Alteryx: A beginner's guide to using Alteryx for self-service analytics and business intelligence | |
Chougule et al. | Improved RNA‐seq Workflows Using CyVerse Cyberinfrastructure | |
Pérez et al. | A new mathematical model for the workover rig scheduling problem | |
US20090070332A1 (en) | Information retrieval |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||