CN111885177A - Biological information analysis cloud computing method and system based on cloud computing technology - Google Patents

Biological information analysis cloud computing method and system based on cloud computing technology Download PDF

Info

Publication number
CN111885177A
CN111885177A CN202010734237.8A CN202010734237A CN111885177A CN 111885177 A CN111885177 A CN 111885177A CN 202010734237 A CN202010734237 A CN 202010734237A CN 111885177 A CN111885177 A CN 111885177A
Authority
CN
China
Prior art keywords
analysis
cloud computing
server
cloud
result
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010734237.8A
Other languages
Chinese (zh)
Other versions
CN111885177B (en
Inventor
余育超
朱晓文
陈浩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Shengwu Technology Co ltd
Original Assignee
Hangzhou Shengwu Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Shengwu Technology Co ltd filed Critical Hangzhou Shengwu Technology Co ltd
Priority to CN202010734237.8A priority Critical patent/CN111885177B/en
Publication of CN111885177A publication Critical patent/CN111885177A/en
Application granted granted Critical
Publication of CN111885177B publication Critical patent/CN111885177B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/02Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • G16B50/30Data warehousing; Computing architectures
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1095Replication or mirroring of data, e.g. scheduling or transport for data synchronisation between network nodes
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/12Protocols specially adapted for proprietary or special-purpose networking environments, e.g. medical networks, sensor networks, networks in vehicles or remote metering networks
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A90/00Technologies having an indirect contribution to adaptation to climate change
    • Y02A90/10Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Abstract

A biological information analysis cloud computing method based on a cloud computing technology comprises the following steps: a resident small server ECS is created to support front-end web interaction and send a control and mediation instruction; s2: after the front-end web interaction end submits an analysis task, the resident small server ECS issues an instruction to establish a cloud server which is adaptive to the operation analysis configuration of the analysis task; s3: computing and analyzing the analysis task by the cloud server created in the step S2, storing the analysis result, and finally returning the completion information to the resident mini server ECS and closing the cloud server; s4: and after receiving the analysis task completion information, the resident small server ECS delivers the download information of the result to the user at the front-end web interaction end. The scheme reduces the cost of single use to the minimum, the configuration and the number of the created elastic servers are only related to the upper limit of a cloud computing service provider, the multi-sample parallel computing can be met, and the time cost is saved with the maximum efficiency.

Description

Biological information analysis cloud computing method and system based on cloud computing technology
Technical Field
The invention relates to the technical field of bioinformatics analysis, in particular to a cloud computing method and system for bioinformatics analysis based on a cloud computing technology.
Background
The biological information analysis and calculation mainly refers to processing a large amount of original biological data generated by the current biological detection technology, including gene data, protein data and the like. The calculation of the big data needs to be performed by using a professional server, and the existing calculation technical scheme mainly comprises the following two types:
(1) and the local server is erected for analyzing and calculating the biological information data. According to different computing data requirements, various types of servers are purchased to build a local computing platform. Such as tower servers, rack servers, etc. The technical scheme has the problems of high single input cost, poor expandability, high daily maintenance cost, long time for returning the original and the like.
(2) And (3) purchasing a cloud server, batch computing and high-performance computing services provided by a cloud service provider to perform biological information data computing. The technical scheme is limited by the advanced development field of the industry, and the services provided by the cloud service providers have the problems of low industry adaptation degree and no great cost advantage caused by cloud computing resource waste.
For example, patent document No. CN109192248A discloses a biological information analysis system and method based on a cloud platform, and a cloud computing platform system, including a client, a web server, and a cloud platform computing system; information is transmitted and received between the client and the web server through a network, and data is exchanged between the web server and the cloud platform computing system through a Nginx webpage server; the cloud platform web server comprises a biological information analysis request interface; the cloud platform computing system comprises a biological information analysis application interface, a biological information analysis component, a storage server, a Mysql database, a Mongo database and a plurality of computing nodes; the biological information analysis component receives a biological information analysis request and parameters from the biological information analysis request interface, and analyzes different objects according to the parameter interpretation calculation types; the analysis result is stored in the storage server, the Mysql database stores the analysis records and the like, and the Mongo database stores chart data for the cloud platform client to display, so that the automatic analysis operation of the analysis system is realized.
The problems presented by the above patents and prior art are:
1. the high-performance server is configured locally or purchased at the cloud end, so that the cost investment is high and the universality is not realized.
2. The bioinformatics analysis includes a large number of types, and the system software configuration of the server is complex and is not easy to manage.
3. The gene data are huge data, and the storage cost and the circulation time cost of the data are high by adopting a local hard disk storage and ftp to carry out network transmission.
Disclosure of Invention
In order to solve the problems, the invention provides a biological information analysis cloud computing method and system based on a cloud computing technology, which can create elastic telescopic servers for computing when computing tasks occur, so that the cost for single use is reduced to the minimum, the configuration and the number of the created elastic servers are only related to the upper limit of a cloud computing service provider, the multi-sample parallel computing can be met, and the time cost is saved at the maximum efficiency.
The technical scheme of the invention is as follows:
a biological information analysis cloud computing method based on a cloud computing technology comprises the following steps:
s1: a user uploads original data required for biological information analysis on an interface of a front-end web;
s2: a resident small server is created to support front-end web interaction and send a management and control regulation instruction;
s3: submitting an analysis task on a front-end web, and issuing an instruction by a resident small server to create a cloud server which is adaptive to the operation analysis configuration of the analysis task;
s4: based on the original data in step S1, performing calculation analysis on the analysis task by the cloud server created in step S2, storing the analysis result and the original data, returning calculation completion information to the resident mini server, and closing the cloud server;
s5: the resident small server receives the result downloading address while receiving the calculation completion information and displays the result downloading address on the front-end web, and the front-end web is used for downloading the result information according to the downloading address and delivering the result information to the user.
Preferably, the configuration method of the cloud server in step S3 is: and according to hardware requirements required by data analysis, elastically and telescopically configuring a cloud server for computing based on a cloud computing technology, and releasing the cloud server after computing is completed.
Preferably, the step S3 further includes configuring the system software environment of the server: and using the corresponding analysis snapshot to carry out deployment so as to construct the environment state of the server system suitable for analysis.
Preferably, the step S4 further includes: the original data described in step S1 is copied to the mounted file storage by the cloud server using the file storage service of the cloud computing.
Preferably, the step S4 further includes storing the analysis result by using a storage service of object storage in cloud computing, and sending the result download address and the account password for extracting the analysis result to the resident small server by the object storage.
The invention also provides a biological information analysis cloud computing system based on the cloud computing technology, and a biological information analysis cloud computing method based on the cloud computing technology is used, and the method comprises the following steps:
a web interaction module: the system is used for inputting original data needing to be analyzed by a user and submitting an analysis task;
a management module: the management and control node is used for sending a deployment instruction, wherein the deployment instruction is specifically used for invoking corresponding storage, calculation and network cloud services according to an analysis product selected by a user and sending an analysis and calculation instruction;
a calculation module: the analysis task analysis system is used for carrying out analysis calculation according to original data input by a user and the content of an analysis task to obtain an analysis result;
a storage module: the system comprises a data storage module, a data processing module and a data processing module, wherein the data storage module is used for storing original data input by a user and an analysis result obtained by the calculation module;
a data delivery platform: and the cloud server is used for forming a report of the analysis result calculated by the cloud server and delivering the report to the user.
The invention has the beneficial effects that:
1. the resident server only needs one small server, the initial investment cost is low, the elastically telescopic server is established for calculation only when the calculation task occurs, and the cost of single use is reduced to the minimum.
2. The mirror image is managed by using the snapshot technology, so that the system software environments required by each biological information analysis product are mutually independent and convenient to manage, and the deployment is quicker.
3. The configuration and the number of the elastic servers created by the method are only related to the upper limit of a cloud computing service provider, so that the multi-sample parallel computing can be met, and the time cost can be saved with the maximum efficiency.
4. The invention uses file storage to store the data file during operation, and can be mounted in all the calculation servers, thereby releasing the network bandwidth, improving the analysis performance and reducing the operation calculation cost.
5. The invention uploads the original data and delivers the analysis result by using the storage service of the object storage, and has higher data transmission speed and data security protection.
Drawings
FIG. 1 is a schematic diagram of the components of an embodiment of the present invention.
Fig. 2 is a block diagram of a system according to an embodiment of the present invention.
Detailed Description
The embodiments of the present invention will be described in detail below with reference to the accompanying drawings.
As shown in fig. 1, an embodiment of the present invention provides a biological information analysis cloud computing method based on a cloud computing technology, including the following steps: a resident small server ECS is created to support front-end web interaction and manage and control the sending of mediation instructions.
After the analysis task is submitted by a front-end web, the resident server issues an instruction to create a cloud server which is adapted to the operation analysis configuration by using an elastic computing ECS of the cloud computing, and a system software environment is configured by using the pre-made snapshot. And the computing cloud server computes the input analysis task.
And according to hardware requirements required by data analysis, elastically and telescopically configuring a cloud server for computing based on a cloud computing technology, and releasing the cloud server after computing is completed. The foregoing is a scheme of the adaptive cloud server architecture based on the characteristics of gene data analysis in this embodiment. The gene data calculation is a sudden demand in a short period, has high requirements on a server, and needs to use a service mode which can be released after the calculation is completed. The idle cost needs to be shared for long-term leasing or self-purchase of the server, and the idle cost can be reduced by the configuration mode of the embodiment.
And copying data provided by a user to the mounted file storage by using a newly created analysis and calculation server by using cloud calculation object storage and file storage services, performing calculation and analysis, delivering a result to the object storage for storage after the result is completed, returning completion information, and closing the server.
The reference databases with large data volume need to be accessed in the analysis of the gene data, and the architecture scheme of the embodiment utilizes the file storage service of the cloud computing technology, so that only one database can be configured and mounted to a plurality of cloud computing servers for computing access.
The object storage OSS service is used for uploading user data and delivering results, and is suitable for the characteristics of large data, low access frequency, high privacy and the like of gene data.
And after receiving the completion information, the resident server delivers the downloading information of the result to the user at the web interaction end.
The technical scheme is realized on the basis of a web end of a cloud computing technology, so that various devices with browsers can access and issue analysis instructions.
As shown in fig. 2, the present invention further provides a cloud computing system for biological information analysis based on cloud computing technology, which is used for supporting a cloud computing method for biological information analysis based on cloud computing technology on hardware, and the cloud computing method comprises:
a web interaction module: the system is used for inputting relevant positions of data in the cloud end and accessing the certificate by a user and selecting needed analysis computing services.
A management module: the management and control node is used for sending a deployment instruction, wherein the deployment instruction is specifically used for invoking corresponding storage, calculation and network cloud services according to an analysis product selected by a user and sending an analysis and calculation instruction;
a calculation module: the analysis task analysis system is used for carrying out analysis calculation according to original data input by a user and the content of an analysis task to obtain an analysis result;
a storage module: the system comprises a data storage module, a data processing module and a data processing module, wherein the data storage module is used for storing original data input by a user and an analysis result obtained by the calculation module;
a data delivery platform: and the cloud server is used for forming a report of the analysis result calculated by the cloud server and delivering the report to the user.
Finally, it should be noted that: the above-mentioned embodiments are only specific embodiments of the present invention, which are used for illustrating the technical solutions of the present invention and not for limiting the same, and the protection scope of the present invention is not limited thereto, although the present invention is described in detail with reference to the foregoing embodiments, those skilled in the art should understand that: any person skilled in the art can modify or easily conceive the technical solutions described in the foregoing embodiments or equivalent substitutes for some technical features within the technical scope of the present disclosure; such modifications, changes or substitutions do not depart from the spirit and scope of the present invention in its spirit and scope. Are intended to be covered by the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (6)

1. A biological information analysis cloud computing method based on a cloud computing technology is characterized by comprising the following steps:
s1: a user uploads original data required for biological information analysis on an interface of a front-end web;
s2: a resident small server is created to support front-end web interaction and send a management and control regulation instruction;
s3: submitting an analysis task on a front-end web, and issuing an instruction by a resident small server to create a cloud server which is adaptive to the operation analysis configuration of the analysis task;
s4: based on the original data in step S1, performing calculation analysis on the analysis task by the cloud server created in step S2, storing the analysis result and the original data, returning calculation completion information to the resident mini server, and closing the cloud server;
s5: the resident small server receives the result downloading address while receiving the calculation completion information and displays the result downloading address on the front-end web, and the front-end web is used for downloading the result information according to the downloading address and delivering the result information to the user.
2. The cloud computing method for biological information analysis based on cloud computing technology according to claim 1, wherein the cloud server configuration method in step S3 is as follows: and according to hardware requirements required by data analysis, elastically and telescopically configuring a cloud server for computing based on a cloud computing technology, and releasing the cloud server after computing is completed.
3. The cloud computing method for bioinformatics analysis based on cloud computing technology according to claim 1, wherein the step S3 further includes configuring the system software environment of the server: and using the corresponding analysis snapshot to carry out deployment so as to construct the environment state of the server system suitable for analysis.
4. The cloud computing method for bioinformatics analysis based on cloud computing technology according to claim 1, wherein the step S4 further includes: the original data described in step S1 is copied to the mounted file storage by the cloud server using the file storage service of the cloud computing.
5. The cloud computing method for biological information analysis based on cloud computing technology according to claim 1, wherein step S4 further comprises storing the analysis result by using a storage service of a subject storage in the cloud computing, and sending a result download address and an account password for extracting the analysis result to the resident mini-server by the subject storage.
6. A biological information analysis cloud computing system based on cloud computing technology, which is used in the biological information analysis cloud computing method based on cloud computing technology according to any one of claims 1 to 5, and which comprises:
a web interaction module: the system is used for inputting original data needing to be analyzed by a user and submitting an analysis task;
a management module: the management and control node is used for sending a deployment instruction, wherein the deployment instruction is specifically used for invoking corresponding storage, calculation and network cloud services according to an analysis task selected by a user and sending an analysis and calculation instruction;
a calculation module: the analysis task analysis system is used for carrying out analysis calculation according to original data input by a user and the content of an analysis task to obtain an analysis result;
a storage module: the system comprises a data storage module, a data processing module and a data processing module, wherein the data storage module is used for storing original data input by a user and an analysis result obtained by the calculation module;
a data delivery platform: and the cloud server is used for forming a report of the analysis result calculated by the cloud server and delivering the report to the user.
CN202010734237.8A 2020-07-28 2020-07-28 Biological information analysis cloud computing method and system based on cloud computing technology Active CN111885177B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010734237.8A CN111885177B (en) 2020-07-28 2020-07-28 Biological information analysis cloud computing method and system based on cloud computing technology

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010734237.8A CN111885177B (en) 2020-07-28 2020-07-28 Biological information analysis cloud computing method and system based on cloud computing technology

Publications (2)

Publication Number Publication Date
CN111885177A true CN111885177A (en) 2020-11-03
CN111885177B CN111885177B (en) 2023-05-30

Family

ID=73201333

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010734237.8A Active CN111885177B (en) 2020-07-28 2020-07-28 Biological information analysis cloud computing method and system based on cloud computing technology

Country Status (1)

Country Link
CN (1) CN111885177B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113326123A (en) * 2021-04-30 2021-08-31 杭州绳武科技有限公司 Biological information analysis and calculation system and method based on container technology

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102880515A (en) * 2012-09-07 2013-01-16 浪潮电子信息产业股份有限公司 Method for deploying virtual machine rapidly in smart cloud OS (operating system)
CN104021029A (en) * 2014-06-13 2014-09-03 北京大学 Spatial information cloud computing system and implementing method thereof
CN106022007A (en) * 2016-06-14 2016-10-12 中国科学院北京基因组研究所 Cloud platform system and method oriented to biological omics big data calculation
CN107734035A (en) * 2017-10-17 2018-02-23 华南理工大学 A kind of Virtual Cluster automatic telescopic method under cloud computing environment
CN108537008A (en) * 2018-03-20 2018-09-14 常州大学 High-throughput gene sequencing big data analysis cloud platform system
CN108924217A (en) * 2018-06-29 2018-11-30 中山大学 A kind of distribution cloud system Automation arranging method
CN109192248A (en) * 2017-07-21 2019-01-11 上海桑格信息技术有限公司 Biological information analysis system, method and cloud computing platform system based on cloud platform
KR20200058757A (en) * 2018-11-20 2020-05-28 (주) 아이크로진 Service method and platform for analysing gene based on cloud computing system

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102880515A (en) * 2012-09-07 2013-01-16 浪潮电子信息产业股份有限公司 Method for deploying virtual machine rapidly in smart cloud OS (operating system)
CN104021029A (en) * 2014-06-13 2014-09-03 北京大学 Spatial information cloud computing system and implementing method thereof
CN106022007A (en) * 2016-06-14 2016-10-12 中国科学院北京基因组研究所 Cloud platform system and method oriented to biological omics big data calculation
CN109192248A (en) * 2017-07-21 2019-01-11 上海桑格信息技术有限公司 Biological information analysis system, method and cloud computing platform system based on cloud platform
CN107734035A (en) * 2017-10-17 2018-02-23 华南理工大学 A kind of Virtual Cluster automatic telescopic method under cloud computing environment
CN108537008A (en) * 2018-03-20 2018-09-14 常州大学 High-throughput gene sequencing big data analysis cloud platform system
CN108924217A (en) * 2018-06-29 2018-11-30 中山大学 A kind of distribution cloud system Automation arranging method
KR20200058757A (en) * 2018-11-20 2020-05-28 (주) 아이크로진 Service method and platform for analysing gene based on cloud computing system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
刘勤 等: "《XBRL知识体验 理论、方法与实践》", 30 November 2016, 上海:立信会计出版社 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113326123A (en) * 2021-04-30 2021-08-31 杭州绳武科技有限公司 Biological information analysis and calculation system and method based on container technology
CN113326123B (en) * 2021-04-30 2024-03-26 杭州绳武科技有限公司 Biological information analysis and calculation system and method based on container technology

Also Published As

Publication number Publication date
CN111885177B (en) 2023-05-30

Similar Documents

Publication Publication Date Title
CN109478266B (en) Resource allocation for database provisioning
US11157318B2 (en) Optimizing timeouts and polling intervals
US10623470B2 (en) Optimizing internet data transfers using an intelligent router agent
US9584372B2 (en) Discovering resources of a distributed computing environment
CN109564527A (en) The security configuration of cloud computing node
US20170318129A1 (en) Generation and distribution of named, definable, serialized tokens
US11165585B2 (en) Token repository and integration
US10657136B2 (en) Searching data on a synchronization data stream
CN110677307B (en) Service monitoring method, device, equipment and storage medium
US11693909B2 (en) Data sharing tool for facilitating real-time access to current or updated datasets
US11237889B1 (en) Application infrastructure configuration based on annotated API schemas
US10693939B2 (en) Providing modified protocol responses
CN110928594A (en) Service development method and platform
CN111800511B (en) Synchronous login state processing method, system, equipment and readable storage medium
CN111885177A (en) Biological information analysis cloud computing method and system based on cloud computing technology
CN111831503B (en) Monitoring method based on monitoring agent and monitoring agent device
US10554770B2 (en) Dynamic cognitive optimization of web applications
CN115174248A (en) Network access control method and device
CN113722007B (en) Configuration method, device and system of VPN branch equipment
CN107347024A (en) A kind of method and apparatus for storing Operation Log
CN104021027A (en) Method and equipment for providing virtual device
Satsyk et al. Reduction of server load by means of CMS Drupal
CN115485677A (en) Secure data replication in a distributed data storage environment
US20200089593A1 (en) Data collection in transaction problem diagnostic
CN109088913A (en) The method and load-balanced server of request data

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant