CN107977287A - One kind is using disaster tolerance implementation method, apparatus and system - Google Patents

One kind is using disaster tolerance implementation method, apparatus and system Download PDF

Info

Publication number
CN107977287A
CN107977287A CN201610921883.9A CN201610921883A CN107977287A CN 107977287 A CN107977287 A CN 107977287A CN 201610921883 A CN201610921883 A CN 201610921883A CN 107977287 A CN107977287 A CN 107977287A
Authority
CN
China
Prior art keywords
disaster tolerance
data center
application
service
level agreement
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN201610921883.9A
Other languages
Chinese (zh)
Inventor
卜继贤
赵培
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ZTE Corp
Original Assignee
ZTE Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZTE Corp filed Critical ZTE Corp
Priority to CN201610921883.9A priority Critical patent/CN107977287A/en
Publication of CN107977287A publication Critical patent/CN107977287A/en
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1415Saving, restoring, recovering or retrying at system level
    • G06F11/1438Restarting or rejuvenating
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1446Point-in-time backing up or restoration of persistent data
    • G06F11/1458Management of the backup or restore process
    • G06F11/1464Management of the backup or restore process for networked environments
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1446Point-in-time backing up or restoration of persistent data
    • G06F11/1458Management of the backup or restore process
    • G06F11/1469Backup restoration techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2201/00Indexing scheme relating to error detection, to error correction, and to monitoring
    • G06F2201/865Monitoring of software

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

An embodiment of the present invention provides one kind using disaster tolerance implementation method, apparatus and system;This method includes:Obtain the disaster tolerance service-level agreement parameter for treating disaster tolerance application, disaster tolerance service-level agreement parameter includes data duplication parameter and/or data recovery parameter, according to the disaster tolerance service-level agreement parameter and the resource occupying state of each data center for treating disaster tolerance application, determine target data center, the Internet resources of invocation target data center, disaster tolerance application is treated in deployment, and according to disaster tolerance service-level agreement parameter, disaster tolerance application is treated in management.An embodiment of the present invention provides one kind to apply disaster tolerance implementation method, user can be the application selection disaster tolerance grade of service, automatically dispose is carried out using according to the disaster tolerance grade of service, when disaster occurs for data center using progress intelligent linkage switching, so as to meet the continuity demand of customer service, the disaster tolerance of VDC applications is realized.

Description

One kind is using disaster tolerance implementation method, apparatus and system
Technical field
The present invention relates to disaster tolerance field, more particularly to one kind is using disaster tolerance implementation method, apparatus and system.
Background technology
With reaching its maturity for cloud computing technology and business model, more data centers selection VDC (Virtual Data Center, Visualized data centre) pattern builds, and on the one hand can reduce the IT costs of an asset of enterprise, and on the other hand can be with Reduce the IT O&M costs of enterprise.
Safety is that, by safety management and the measure of safe practice, have with a major issue of cloud computing all the time Effect strengthen using and data safety, once but disaster occurs for environment residing for data center, as war, terror are attacked Hit, city power failure, earthquake, flood, hurricane, large area equipment fault etc., great security threat will be brought to business, causes difficulty With the loss born.Therefore, in order to provide the guarantee of more high safety rank to business, data center needs to provide disaster tolerance service, When disaster occurs for creation data center, using can recover in disaster tolerance data center, and continue externally offer service, avoid calamity Hardly possible brings huge loss to user, and then the prior art is not provided with applying disaster recovery method based on VDC.
The content of the invention
It is a kind of based on VDC's to provide an embodiment of the present invention provides one kind using disaster tolerance implementation method, apparatus and system Using disaster recovery method.
On the one hand, there is provided one kind applies disaster tolerance implementation method, including:
The disaster tolerance service-level agreement parameter for treating disaster tolerance application is obtained, disaster tolerance service-level agreement parameter includes data duplication Parameter and/or data recovery parameter;
According to treat disaster tolerance apply disaster tolerance service-level agreement parameter and each data center resource occupying state, Determine target data center;
Disaster tolerance application is treated in the Internet resources of invocation target data center, deployment;
According to disaster tolerance service-level agreement parameter, disaster tolerance application is treated in management.
On the one hand, there is provided one kind applies disaster tolerance realization device, including:Setup module, deployment module and management module, its In,
Setup module is used to obtain the disaster tolerance service-level agreement parameter for treating disaster tolerance application, disaster tolerance service-level agreement parameter Including data duplication parameter and/or data recovery parameter;
Deployment module is used for the network money according to the disaster tolerance service-level agreement parameter and each data center for treating disaster tolerance application Source seizure condition, determines target data center, the Internet resources of invocation target data center, and disaster tolerance application is treated in deployment;
Management module is used to treat disaster tolerance application according to disaster tolerance service-level agreement parameter, management.
On the one hand, there is provided one kind realizes system using disaster tolerance, including:At least one data center, cloud application management system System, cloud platform management system, at least one data center are respectively arranged with independent cloud platform management system, at least one data Center shares a cloud application management system, wherein,
Data center is used to provide Internet resources for application;
Cloud platform management system is used for the Internet resources for managing corresponding data center;
Cloud application management system is used to obtain the disaster tolerance service-level agreement parameter for treating disaster tolerance application, disaster tolerance grade of service association View parameter include data duplication parameter and/or data recovery parameter, according to treat disaster tolerance application disaster tolerance service-level agreement parameter, And the resource occupying state of each data center, determine target data center, the cloud platform management at triggering target data center Disaster tolerance application is treated in the Internet resources of system invocation target data center, deployment;It is additionally operable to according to disaster tolerance service-level agreement parameter, Disaster tolerance application is treated in management.
On the other hand, there is provided a kind of computer-readable storage medium, is stored with computer in computer-readable storage medium and can perform Instruction, computer executable instructions are used to perform foregoing application disaster tolerance implementation method.
The beneficial effect of the embodiment of the present invention:
An embodiment of the present invention provides one kind to apply disaster tolerance implementation method, and user can be application selection disaster tolerance service etc. Level, automatically dispose is carried out using according to the disaster tolerance grade of service, is cut when disaster occurs for data center using progress intelligent linkage Change, so as to meet the continuity demand of customer service, realize the disaster tolerance of VDC applications, i.e., the present invention provides one kind to be based on VDC Apply disaster recovery method, solve the prior art be not based on VDC apply disaster recovery method.
Brief description of the drawings
Fig. 1 is the flow chart using disaster tolerance implementation method that first embodiment of the invention provides;
Fig. 2 is the structure diagram using disaster tolerance realization device that second embodiment of the invention provides;
Fig. 3 is the structure diagram that system is realized using disaster tolerance that third embodiment of the invention provides;
Fig. 4 is the schematic diagram for the data system that fourth embodiment of the invention is related to;
Fig. 5 is the flow chart using disaster recovery method that fourth embodiment of the invention is related to.
Embodiment
Below in conjunction with the attached drawing in the embodiment of the present invention, the technical solution in the embodiment of the present invention is carried out clear, complete Site preparation describes, it is clear that described embodiment is part of the embodiment in the present invention, instead of all the embodiments.It is based on Embodiment in the present invention, those of ordinary skill in the art are obtained every other without making creative work Embodiment, belongs to the scope of protection of the invention.
Further annotation explanation is now made to the present invention by way of embodiment combination attached drawing.
First embodiment:
Fig. 1 is the flow chart using disaster tolerance implementation method that first embodiment of the invention provides, as shown in Figure 1, this implementation What example provided includes using disaster tolerance implementation method:
S101:The disaster tolerance service-level agreement parameter for treating disaster tolerance application is obtained, disaster tolerance service-level agreement parameter includes number According to duplication parameter and/or data recovery parameter;
S102:According to the disaster tolerance service-level agreement parameter and the resource occupying of each data center for treating disaster tolerance application State, determines target data center;
S103:Disaster tolerance application is treated in the Internet resources of invocation target data center, deployment;
S104:According to disaster tolerance service-level agreement parameter, disaster tolerance application is treated in management.
In certain embodiments, the disaster tolerance service-level agreement parameter bag of disaster tolerance application is treated in the acquisition in above-described embodiment Include:
The list of application of each data center is obtained, generates and shows using catalogue;
According to selection operation of the user in application catalogue, determine to treat disaster tolerance application;
Displaying includes the disaster tolerance service-level agreement list of at least one disaster tolerance service-level agreement parameter;
According to selection operation of the user in disaster tolerance service-level agreement list, determine to treat disaster tolerance service of disaster tolerance application etc. Level protocol parameter.
In certain embodiments, the data center that sets the goal really in above-described embodiment includes:
According to the resource occupying state of each data center, the disaster tolerance service-level agreement that each data center supports is calculated Parameter;
The disaster tolerance service-level agreement parameter applied according to disaster tolerance is treated, in the disaster tolerance grade of service association that each data center supports View parameter is screened, and is met the data to be selected of the data center for the disaster tolerance service-level agreement parameter for treating disaster tolerance application Center List;
According to user in the selection operation of data center's list to be selected, target data center is determined.
In certain embodiments, being further included using disaster tolerance implementation method in above-described embodiment:
If the disaster tolerance service-level agreement parameter applied according to disaster tolerance is treated, in the disaster tolerance grade of service that each data center supports Protocol parameter is screened, and does not obtain data center's list to be selected, then prompts user to reselect the appearance for treating disaster tolerance application Calamity service-level agreement parameter, and determine to treat the new disaster tolerance service-level agreement parameter of disaster tolerance application;
The new disaster tolerance service-level agreement parameter applied according to disaster tolerance is treated, in the disaster tolerance grade of service that each data center supports Protocol parameter is screened, and the data center for being met the new disaster tolerance service-level agreement parameter for treating disaster tolerance application waits to select Data center's list;
According to user in the selection operation of data center's list to be selected, target data center is determined again.
In certain embodiments, the Internet resources of the invocation target data center in above-described embodiment, deployment treat that disaster tolerance should With including:
The applied topology for treating disaster tolerance application is obtained and shows, applied topology includes the application node at target data center;
According to user applied topology editing operation, complete treat disaster tolerance application apply layout;
Internet resources in invocation target data center, carry out establishment, the distribution of virtual machine node, container and special joint And configuration.
In certain embodiments, the management in above-described embodiment treat disaster tolerance apply including:
The operating status that disaster tolerance application is treated in target data center is monitored and shown, when an exception occurs, performs linkage plan Slightly, carry out applying disaster tolerance;
And/or
According to user's operation, the overall failure rehearsal at target data center is carried out, and/or the node failure of application node is drilled Practice.
Second embodiment:
Fig. 2 is the structure diagram using disaster tolerance realization device that second embodiment of the invention provides, as shown in Figure 2, this What embodiment provided includes using disaster tolerance realization device:Setup module 21, deployment module 22 and 23 pieces of mould of management, wherein,
Setup module 21 is used to obtain the disaster tolerance service-level agreement parameter for treating disaster tolerance application, disaster tolerance service-level agreement ginseng Number includes data duplication parameter and/or data recovery parameter;
Deployment module 22 is used for according to the disaster tolerance service-level agreement parameter and the network of each data center for treating disaster tolerance application Resource occupation state, determines target data center, the Internet resources of invocation target data center, and disaster tolerance application is treated in deployment;
Management module 23 is used to treat disaster tolerance application according to disaster tolerance service-level agreement parameter, management.
In certain embodiments, the setup module 21 in above-described embodiment is used for the list of application for obtaining each data center, Generate and show using catalogue;According to selection operation of the user in application catalogue, determine to treat disaster tolerance application;Displaying is included at least The disaster tolerance service-level agreement list of one disaster tolerance service-level agreement parameter;According to user in disaster tolerance service-level agreement list On selection operation, determine to treat the disaster tolerance service-level agreement parameter of disaster tolerance application.
In certain embodiments, the deployment module 22 in above-described embodiment is used to be accounted for according to the Internet resources of each data center With state, the disaster tolerance service-level agreement parameter that each data center supports is calculated;The disaster tolerance grade of service applied according to disaster tolerance is treated Protocol parameter, is screened in the disaster tolerance service-level agreement parameter that each data center supports, is met and is treated disaster tolerance application Data center's list to be selected of the data center of disaster tolerance service-level agreement parameter;Arranged according to user in data center to be selected The selection operation of table, determines target data center.
In certain embodiments, if the deployment module 22 in above-described embodiment is additionally operable to according to the disaster tolerance clothes for treating that disaster tolerance is applied Business level protocol parameter, is screened in the disaster tolerance service-level agreement parameter that each data center supports, is not obtained waiting to select Data center's list, then prompt user to reselect the disaster tolerance service-level agreement parameter for treating disaster tolerance application, and determines to treat disaster tolerance The new disaster tolerance service-level agreement parameter of application;The new disaster tolerance service-level agreement parameter applied according to disaster tolerance is treated, in each data The disaster tolerance service-level agreement parameter that center is supported is screened, and is met the new disaster tolerance service-level agreement for treating disaster tolerance application Data center's list to be selected of the data center of parameter;According to user data center's list to be selected selection operation, really Data center set the goal again.
In certain embodiments, the deployment module 22 in above-described embodiment is used to obtaining and showing the application for treating disaster tolerance application Topology, applied topology include the application node at target data center;According to user in the editing operation of applied topology, complete to wait to hold Layout is applied in calamity application;Internet resources in invocation target data center, carry out virtual machine node, container and special joint Create, distribute and configure.
In certain embodiments, the management module 23 in above-described embodiment is used to monitor and show in target data center to treat The operating status of disaster tolerance application, when an exception occurs, performs linkage strategy, carries out applying disaster tolerance;And/or according to user's operation, Carry out the overall failure rehearsal at target data center, and/or the node failure rehearsal of application node.
In practical applications, all functions module in embodiment illustrated in fig. 2, can use processor, editorial logic The modes such as device are realized.
3rd embodiment:
Fig. 3 is the structure diagram that system is realized using disaster tolerance that third embodiment of the invention provides, from the figure 3, it may be seen that this What embodiment provided realizes that system includes using disaster tolerance:At least one data center 31 (such as 31a and 31b as indicated at 3), Yun Ying With management system 32, cloud platform management system 33 (such as 33a and 33b as indicated at 3), at least one data center 31 is set respectively There is independent cloud platform management system 33, at least one data center 31 shares a cloud application management system 32, wherein,
Data center 31 is used to provide Internet resources for application;
Cloud platform management system 33 is used for the Internet resources for managing corresponding data center;
Cloud application management system 32 is used to obtain the disaster tolerance service-level agreement parameter for treating disaster tolerance application, the disaster tolerance grade of service Protocol parameter includes data duplication parameter and/or data recovery parameter, according to the disaster tolerance service-level agreement ginseng for treating that disaster tolerance is applied Number and the resource occupying state of each data center, determine target data center, the cloud platform at triggering target data center 31 Disaster tolerance application is treated in the Internet resources of 33 invocation target data center of management system, deployment;It is additionally operable to be assisted according to the disaster tolerance grade of service Parameter is discussed, disaster tolerance application is treated in management.
In certain embodiments, the cloud application management system 32 in above-described embodiment is used to put down by the cloud of each data center Platform management system obtains the list of application of each data center, generates and shows using catalogue;According to user in application catalogue Selection operation, determines to treat disaster tolerance application;Displaying includes the disaster tolerance grade of service association of at least one disaster tolerance service-level agreement parameter Discuss list;According to selection operation of the user in disaster tolerance service-level agreement list, determine to treat disaster tolerance service of disaster tolerance application etc. Level protocol parameter.
In certain embodiments, the cloud application management system 32 in above-described embodiment is used for the cloud by each data center 31 Platform management system 33 obtains the resource occupying state of each data center, according to the resource occupying shape of each data center State, calculates the disaster tolerance service-level agreement parameter that each data center supports;The disaster tolerance service-level agreement applied according to disaster tolerance is treated Parameter, is screened in the disaster tolerance service-level agreement parameter that each data center supports, is met the disaster tolerance for treating disaster tolerance application Data center's list to be selected of the data center of service-level agreement parameter;According to user in data center's list to be selected Selection operation, determines target data center.
In certain embodiments, if the cloud application management system 32 in above-described embodiment be additionally operable to according to treat disaster tolerance apply Disaster tolerance service-level agreement parameter, is screened in the disaster tolerance service-level agreement parameter that each data center supports, is not obtained Data center's list to be selected, then prompt user to reselect the disaster tolerance service-level agreement parameter for treating disaster tolerance application, and determines Treat the new disaster tolerance service-level agreement parameter of disaster tolerance application;The new disaster tolerance service-level agreement parameter applied according to disaster tolerance is treated, The disaster tolerance service-level agreement parameter that each data center supports is screened, and is met new disaster tolerance service for treating disaster tolerance application etc. Data center's list to be selected of the data center of level protocol parameter;Grasped according to user in the selection of data center's list to be selected Make, determine target data center again.
In certain embodiments, the cloud application management system 32 in above-described embodiment is used to obtain and show to treat disaster tolerance application Applied topology, applied topology includes the application node at target data center;According to user applied topology editing operation, it is complete Into the application layout for treating disaster tolerance application;Trigger the net in the cloud platform management system invocation target data center at target data center Network resource, carries out establishment, distribution and the configuration of virtual machine node, container and special joint.
In certain embodiments, the cloud application management system 32 in above-described embodiment is used for by target data center 31 Cloud platform management system 33 monitors and shows the operating status that disaster tolerance application is treated in target data center, when an exception occurs, holds Row linkage strategy, carries out applying disaster tolerance;And/or according to user's operation, the overall failure for carrying out target data center is drilled, and/ Or the node failure rehearsal of application node.
Fourth embodiment:
Further annotation explanation is done to the present invention in conjunction with concrete application scene.
The present embodiment is related to the automatically dispose technology in data center's disaster tolerance field, more particularly to VDC (virtualization numbers According to center) disaster tolerance application Automation arranging method.Disaster tolerance automatization of service method deployment provided in this embodiment is related to holding The whole process technology of calamity, including disaster tolerance service request, disaster tolerance application layout, disaster tolerance application automatically dispose, disaster tolerance application state Monitoring, disaster tolerance application switching are drilled, and disaster tolerance application intelligent linkage switching during disaster generation.
Specifically, the present embodiment provides one kind to be based on SLA (Service-Level Agreement, service-level agreement) Disaster tolerance automatization of service dispositions method.This method provides different grades of disaster tolerance service to the user, and the disaster tolerance grade of service can be with From the dimension of data copy method, or RPO (Recovery Point Objective, refer to from system and application data and Speech, realizing can recover to that can support all departments' business running, what kind of renewal journey system and creation data should return to Degree, this renewal degree can be the real time datas of the Backup Data of upper one week or last transaction) plus RTO (Recovery Time Objective, after it refers to that disaster occurs, since IT system when when machine causes service pause, are arrived IT system recover to can support all departments running, recover operation when, the period between this 2 points) dimension selected Select.When selecting the grade of service from the dimension of data copy method, the grade that can be selected includes data level backup services, data level Disaster tolerance service, application redundancy service, application layer dual-active service etc.;, can be with when selecting the grade of service from the dimension of RPO plus RTO The grade of selection is made of RPO values and RTO values, the actual construction situation by data center administrator according to distributive data center Different RPO values scopes and RTO values scope is provided to select for user.Further, user apply on creation data center and Automatically dispose is carried out in disaster tolerance data center.Further, user can monitor on administration interface and apply in creation data State on center and disaster tolerance data center, and simulate disaster and carry out DR test.Further, when creation data center occurs During disaster, switch using by intelligent linkage.
By the present invention, user can be the application selection disaster tolerance grade of service, be carried out certainly using according to the disaster tolerance grade of service Dynamicization is disposed, when disaster occurs for data center using intelligent linkage switching is carried out, so as to meet the continuity need of customer service Ask.The present invention is not only applicable in VDC, is equally applicable to other types data center;The present invention is not only applicable to two data Center, is equally applicable to distributive data center.
First, the present embodiment designs a kind of distributive data center framework of two layer-managements, as shown in Figure 4:Upper strata is A Domain domain is shared by Domain domains, all data centers, the main preparation system of the cloud application management system in Domain domains Different data centers is deployed in, to avoid Single Point of Faliure.Cloud application management system belongs to the management system of application layer, unified pipe The application in all data centers is managed, includes the management of the dual-active and disaster tolerance to application;Lower floor is Region domains, in each data The heart has a Region domain of oneself, and the cloud platform management system in Region domains carries out the resource layer-management at notebook data center.
Secondly, medium cloud application management system of the present invention provides disaster tolerance service and the arranging service function of application.Specifically include Following functions:
Disaster tolerance service:User selects the application to be disposed in cloud application management system is using catalogue, for needing to hold The application of calamity service, further selects disaster tolerance SLA ranks.Disaster tolerance SLA ranks from during the selection of the dimension of data copy method, including Data level backup services, data redundancy service, application redundancy service, application layer dual-active service.Wherein, data level backup clothes Business includes two service class of data local backup and data remote backup, further each service class offer full backup, Differential backup, the function of incremental backup are selected for user;Data redundancy service includes data asynchronous replication and data are synchronously multiple Make two service class;Application redundancy service equally synchronously replicates two service class comprising data asynchronous replication and data; Application layer dual-active is the highest service of the grade of service, and data demand replicates to be synchronous.Disaster tolerance SLA ranks add RTO values from RPO values When dimension selects, different RPO value models are provided according to the actual construction situation of distributive data center by data center administrator Enclose and selected with RTO values scope for user, the different brackets that value scope may be referred to relevant criterion is set.When according to national standard When " GB/T 20988-2007 " is set, the highest grade of service is RTO several minutes, RPO 0.
Data center selects:After corresponding disaster tolerance SLA ranks being selected for application, it is desirable to provide meet the number of the SLA ranks Selected according to center for user.Cloud application management system analyzes application, and obtains each data by cloud platform management system The resource situation at center, so as to judge which data center can meet disaster tolerance SLA ranks, by these data centers with list It is supplied to user to select etc. form.When in the case of all data centers can not all meet disaster tolerance SLA ranks, it is necessary to error Prompting, it is necessary to prompt which of SLA entry to meet in detail by mistake.
Using layout:It is raw on the layout interface of cloud application management system after user completes the selection of Region numeric field datas center Into the topology of the application, the node applied in Liang Ge data centers is shown in topology.Can the application node of layout not only include Virtual machine, can also be negative by container, physical application server, physical database server, server load balancing equipment, the overall situation Carry balancing equipment, network equipment etc. and be programmed into application.User can increase, delete in the layout applied in topology Except node, configuration node parameter, configuration node linkage strategy.
Linkage strategy layout:Described in follow-up " applying intelligent linkage function ".
Again, cloud application management system provides the automatically dispose function of business.After user completes layout, deployment choosing is clicked on Xiang Hou, business carry out automatically dispose in both sides data center.Automatically dispose includes below scheme:
Virtual machine node creates and configuration:Cloud application management system calls the cloud platform management system interface in Region domains, Virtual machine and container are created in data center according to the application template of layout, and support that script mode is soft to the application in virtual machine Part carries out personalizing parameters configuration.
Other nodes distribute and configuration:Cloud application management system calls the cloud platform management system interface in Region domains, presses Correlate and carry out resource allocation and configuration with the result of layout, in addition to virtual machine node, further include container, physical server, The nodes such as storage resource, database resource, Internet resources, global load balancing resource, server load balancing resource it is automatic Change distribution and configuration.
Part of nodes manual assignment and configuration:The special joint not yet managed for part cloud platform management system, can These nodes are configured with craft, to ensure the node integrality of whole application system.All nodes are completed with postponing, should With can externally provide service, the data on both sides carry out data synchronization according to the grade of service, so as to complete whole disaster tolerance application Deployment.
4th, cloud application management system provides business monitoring and DR test function.
Monitored using health status:, should in the data center of cloud application management system monitors both sides after completing automatically dispose Operating status, includes the health status of each node, and analyzes the path of Business Stream, in cloud application management system monitors circle The health status and service flow path of each node of real-time display on face.Wherein, the color that the monitor state of node passes through node Or icon is prompted, and the means such as node further can be clicked on by mouse to obtain more health and fitness informations.
The display of application parameter and performance:On the monitoring interface of cloud application management system, mouse is moved to application When on some node, the configuration parameter of the node and relevant performance indicator will be ejected on monitoring interface.
Operation exception alerts:When an exception occurs, except showing egress on the monitoring interface of cloud application management system Health status exception outside, additionally it is possible to mail or short massage notice application administrative staff., can if being provided with linkage strategy There occurs application linkage for the linkage of initiation application, equally meeting mail and short massage notice administrative staff.
Data center's overall failure rehearsal:User can carry out disaster tolerance on the monitoring interface of cloud application management system and drill Practice.Rehearsal includes data center's overall failure, and applies individual node failure.When wherein data center's overall failure is drilled, Application monitoring interface on select overall failure rehearsal, cloud application management system by each node of the application of primary data center into Row power down etc. operates, so as to simulate the scene of whole data center's failure.
Application node failure is drilled:Except above-mentioned data center's overall failure DR test, node can also be carried out Failure is drilled.DR test, cloud application management are selected on some node of primary data center application on application monitoring interface The node is carried out the operation such as power down by system, so as to simulate the scene of the node failure.The failure rehearsal of node, can apply The failure rehearsal of multiple nodes is carried out on monitoring interface one by one, to simulate the effect of multiple node failures.
Finally, cloud application management system of the invention provides and applies intelligent linkage function.
Linkage strategy layout:The layout of linkage strategy, including the layout of condition and action are carried out in the application layout stage, i.e., Which type of condition will trigger which type of action.Condition can be the combination of single condition or multiple conditions, support The logical combination such as the "AND" of condition, "or", " non-".Action can also be the combination of multiple actions, and support the order of action, prolong When etc. set.
Linkage strategy performs:The state of each node of cloud application management system monitors application, when linkage condition meets, Cloud application management system calls the interface of interdependent node, linkage command is issued to node according to linkage strategy, so as to complete to apply Linkage.And the result for the execution that links is sent to administrator by way of mail or short message.
As shown in figure 5, for the overall flow of disaster tolerance automatization of service of the present invention deployment, the particular content of its each step is upper Text has been described above, and repeats no more.
Redundancy ability is supplied to user by the present invention in a manner of servicing, and realizes the automatic deployment and Intelligent joint of disaster tolerance application It is dynamic, efficiency is disposed using disaster tolerance so as to be lifted, ensures the data safety and business continuance of user.
In summary, by the implementation of the embodiment of the present invention, at least there are following beneficial effect:
An embodiment of the present invention provides one kind to apply disaster tolerance implementation method, and user can be application selection disaster tolerance service etc. Level, automatically dispose is carried out using according to the disaster tolerance grade of service, is cut when disaster occurs for data center using progress intelligent linkage Change, so as to meet the continuity demand of customer service, realize the disaster tolerance of VDC applications, i.e., the present invention provides one kind to be based on VDC Apply disaster recovery method, solve the prior art be not based on VDC apply disaster recovery method.
It should be understood by those skilled in the art that, the embodiment of the present invention can be provided as method, system or computer program Product.Therefore, the shape of the embodiment in terms of the present invention can use hardware embodiment, software implementation or combination software and hardware Formula.Moreover, the present invention can use the computer for wherein including computer usable program code in one or more to use storage The form for the computer program product that medium is implemented on (including but not limited to magnetic disk storage and optical memory etc.).
The present invention be with reference to according to the method for the embodiment of the present invention, the flow of equipment (system) and computer program product Figure and/or block diagram describe.It should be understood that it can be realized by computer program instructions every first-class in flowchart and/or the block diagram The combination of flow and/or square frame in journey and/or square frame and flowchart and/or the block diagram.These computer programs can be provided The processors of all-purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices is instructed to produce A raw machine so that the instruction performed by computer or the processor of other programmable data processing devices, which produces, to be used in fact The device for the function of being specified in present one flow of flow chart or one square frame of multiple flows and/or block diagram or multiple square frames.
These computer program instructions, which may also be stored in, can guide computer or other programmable data processing devices with spy Determine in the computer-readable memory that mode works so that the instruction being stored in the computer-readable memory, which produces, to be included referring to Make the manufacture of device, the command device realize in one flow of flow chart or multiple flows and/or one square frame of block diagram or The function of being specified in multiple square frames.
These computer program instructions can be also loaded into computer or other programmable data processing devices so that counted Series of operation steps is performed on calculation machine or other programmable devices to produce computer implemented processing, thus in computer or The instruction performed on other programmable devices is provided and is used for realization in one flow of flow chart or multiple flows and/or block diagram one The step of function of being specified in a square frame or multiple square frames.
It the above is only the embodiment of the present invention, limitation in any form not done to the present invention, it is every Any simple modification, equivalent variations, combination or the modification that technical spirit according to the present invention makes embodiment of above, still Belong to the protection domain of technical solution of the present invention.

Claims (18)

1. one kind applies disaster tolerance implementation method, including:
The disaster tolerance service-level agreement parameter for treating disaster tolerance application is obtained, the disaster tolerance service-level agreement parameter includes data duplication Parameter and/or data recovery parameter;
According to it is described treat disaster tolerance apply disaster tolerance service-level agreement parameter and each data center resource occupying state, Determine target data center;
The Internet resources at the target data center are called, disaster tolerance application is treated described in deployment;
According to the disaster tolerance service-level agreement parameter, disaster tolerance application is treated described in management.
2. apply disaster tolerance implementation method as claimed in claim 1, it is characterised in that the disaster tolerance clothes of disaster tolerance application are treated in the acquisition Business level protocol parameter includes:
The list of application of each data center is obtained, generates and shows using catalogue;
According to user in the selection operation using in catalogue, determine described to treat disaster tolerance application;
Displaying includes the disaster tolerance service-level agreement list of at least one disaster tolerance service-level agreement parameter;
According to selection operation of the user in the disaster tolerance service-level agreement list, the disaster tolerance clothes for treating disaster tolerance application are determined Business level protocol parameter.
3. apply disaster tolerance implementation method as claimed in claim 1, it is characterised in that the definite target data center includes:
According to the resource occupying state of each data center, the disaster tolerance service-level agreement ginseng that each data center supports is calculated Number;
According to the disaster tolerance service-level agreement parameter treated disaster tolerance and applied, in the disaster tolerance grade of service association that each data center supports View parameter is screened, and the data center for being met the disaster tolerance service-level agreement parameter for treating disaster tolerance application waits to select Data center's list;
According to user in the selection operation of data center's list to be selected, the target data center is determined.
4. apply disaster tolerance implementation method as claimed in claim 3, it is characterised in that further include:
If according to the disaster tolerance service-level agreement parameter treated disaster tolerance and applied, in the disaster tolerance grade of service that each data center supports Protocol parameter is screened, and does not obtain data center's list select, then prompt user reselect described in treat disaster tolerance The disaster tolerance service-level agreement parameter of application, and determine the new disaster tolerance service-level agreement parameter for treating disaster tolerance application;
According to the new disaster tolerance service-level agreement parameter treated disaster tolerance and applied, in the disaster tolerance grade of service that each data center supports Protocol parameter is screened, and the data center for being met the new disaster tolerance service-level agreement parameter for treating disaster tolerance application treats Select data center's list;
According to user in the selection operation of data center's list to be selected, the target data center is determined again.
5. apply disaster tolerance implementation method as claimed in claim 1, it is characterised in that the calling target data center Internet resources, treat described in deployment disaster tolerance apply including:
Obtain and show the applied topology for treating disaster tolerance application, the applied topology includes the application section at target data center Point;
According to user in the editing operation of the applied topology, that disaster tolerance application is treated described in completion applies layout;
The Internet resources in the target data center are called, carry out establishment, the distribution of virtual machine node, container and special joint And configuration.
6. such as claim 1 to 5 any one of them application disaster tolerance implementation method, it is characterised in that wait to hold described in the management Calamity apply including:
The operating status that disaster tolerance application is treated described in target data center is monitored and shown, when an exception occurs, performs linkage plan Slightly, carry out applying disaster tolerance;
And/or
According to user's operation, the overall failure rehearsal at target data center, and/or the node failure rehearsal of application node are carried out.
7. one kind applies disaster tolerance realization device, including:Setup module, deployment module and management module, wherein,
The setup module is used to obtain the disaster tolerance service-level agreement parameter for treating disaster tolerance application, the disaster tolerance service-level agreement Parameter includes data duplication parameter and/or data recovery parameter;
The deployment module is used to treat the disaster tolerance service-level agreement parameter of disaster tolerance application and the net of each data center according to Network resource occupation state, determines target data center, calls the Internet resources at the target data center, disaster tolerance is treated described in deployment Using;
The management module is used for according to the disaster tolerance service-level agreement parameter, and disaster tolerance application is treated described in management.
8. apply disaster tolerance realization device as claimed in claim 7, it is characterised in that the setup module is used to obtain each data The list of application at center, generates and shows using catalogue;According to user in the selection operation using in catalogue, determine described Treat disaster tolerance application;Displaying includes the disaster tolerance service-level agreement list of at least one disaster tolerance service-level agreement parameter;According to Selection operation of the family in the disaster tolerance service-level agreement list, determines the disaster tolerance service-level agreement for treating disaster tolerance application Parameter.
9. apply disaster tolerance realization device as claimed in claim 7, it is characterised in that the deployment module is used for according to each data The resource occupying state at center, calculates the disaster tolerance service-level agreement parameter that each data center supports;Wait to hold according to described The disaster tolerance service-level agreement parameter of calamity application, is screened in the disaster tolerance service-level agreement parameter that each data center supports, It is met data center's list to be selected of the data center of the disaster tolerance service-level agreement parameter for treating disaster tolerance application;Root According to user in the selection operation of data center's list to be selected, the target data center is determined.
10. apply disaster tolerance realization device as claimed in claim 9, it is characterised in that if the deployment module be additionally operable to according to It is described treat disaster tolerance application disaster tolerance service-level agreement parameter, each data center support disaster tolerance service-level agreement parameter into Row screening, does not obtain data center's list to be selected, then prompts user to reselect the disaster tolerance for treating disaster tolerance application Service-level agreement parameter, and determine the new disaster tolerance service-level agreement parameter for treating disaster tolerance application;Disaster tolerance is treated according to described The new disaster tolerance service-level agreement parameter of application, is screened in the disaster tolerance service-level agreement parameter that each data center supports, It is met data center's list to be selected of the data center of the new disaster tolerance service-level agreement parameter for treating disaster tolerance application; According to user in the selection operation of data center's list to be selected, the target data center is determined again.
11. apply disaster tolerance realization device as claimed in claim 7, it is characterised in that the deployment module is used to obtain and open up Show the applied topology for treating disaster tolerance application, the applied topology includes the application node at target data center;Existed according to user The editing operation of the applied topology, that disaster tolerance application is treated described in completion applies layout;Call in the target data center Internet resources, carry out establishment, distribution and the configuration of virtual machine node, container and special joint.
12. such as claim 7 to 11 any one of them application disaster tolerance realization device, it is characterised in that the management module is used In monitor and show treated described in target data center disaster tolerance application operating status, when an exception occurs, perform linkage strategy, Carry out applying disaster tolerance;And/or according to user's operation, carry out the overall failure rehearsal at target data center, and/or application node Node failure rehearsal.
13. one kind realizes system using disaster tolerance, including:At least one data center, cloud application management system, cloud platform management system System, at least one data center are respectively arranged with independent cloud platform management system, and at least one data center is total to With a cloud application management system, wherein,
The data center is used to provide Internet resources for application;
The cloud platform management system is used for the Internet resources for managing corresponding data center;
The cloud application management system is used to obtain the disaster tolerance service-level agreement parameter for treating disaster tolerance application, described disaster tolerance service etc. Level protocol parameter includes data duplication parameter and/or data recovery parameter, according to the disaster tolerance grade of service treated disaster tolerance and applied Protocol parameter and the resource occupying state of each data center, determine target data center, trigger the target data center Cloud platform management system call the Internet resources at the target data center, treat disaster tolerance application described in deployment;It is additionally operable to basis The disaster tolerance service-level agreement parameter, treats disaster tolerance application described in management.
14. system is realized using disaster tolerance, it is characterised in that the cloud application management system is used to lead to as claimed in claim 13 The cloud platform management system of Guo Ge data centers obtains the list of application of each data center, generates and shows using catalogue;According to User determines described to treat disaster tolerance application in the selection operation using in catalogue;Displaying includes at least one disaster tolerance service etc. The disaster tolerance service-level agreement list of level protocol parameter;Grasped according to selection of the user in the disaster tolerance service-level agreement list Make, determine the disaster tolerance service-level agreement parameter for treating disaster tolerance application.
15. system is realized using disaster tolerance, it is characterised in that the cloud application management system is used to lead to as claimed in claim 13 The cloud platform management system of Guo Ge data centers obtains the resource occupying state of each data center, according to each data center Resource occupying state, calculates the disaster tolerance service-level agreement parameter that each data center supports;Disaster tolerance application is treated according to described Disaster tolerance service-level agreement parameter, each data center support disaster tolerance service-level agreement parameter screened, expired Data center's list to be selected of the data center of the disaster tolerance service-level agreement parameter of disaster tolerance application is treated described in foot;According to user In the selection operation of data center's list to be selected, the target data center is determined.
16. system is realized using disaster tolerance, it is characterised in that the cloud application management system is additionally operable to as claimed in claim 15 If according to the disaster tolerance service-level agreement parameter treated disaster tolerance and applied, in the disaster tolerance service-level agreement that each data center supports Parameter is screened, and does not obtain data center's list select, then prompt user reselect described in treat disaster tolerance application Disaster tolerance service-level agreement parameter, and determine the new disaster tolerance service-level agreement parameter for treating disaster tolerance application;According to described Treat the new disaster tolerance service-level agreement parameter of disaster tolerance application, carried out in the disaster tolerance service-level agreement parameter that each data center supports Screening, is met the data center to be selected of the data center of the new disaster tolerance service-level agreement parameter for treating disaster tolerance application List;According to user in the selection operation of data center's list to be selected, the target data center is determined again.
17. system is realized using disaster tolerance, it is characterised in that the cloud application management system is used to obtain as claimed in claim 13 Take and show the applied topology for treating disaster tolerance application, the applied topology includes the application node at target data center;According to For user in the editing operation of the applied topology, that disaster tolerance application is treated described in completion applies layout;Trigger in the target data The cloud platform management system of the heart calls the Internet resources in the target data center, carries out virtual machine node, container and special Establishment, distribution and the configuration of node.
18. as claim 13 to 17 any one of them application disaster tolerance realizes system, it is characterised in that the cloud application management System is used to the cloud platform management system monitors by target data center and shows treat that disaster tolerance should described in target data center Operating status, when an exception occurs, performs linkage strategy, carries out applying disaster tolerance;And/or according to user's operation, carry out mesh Mark the overall failure rehearsal of data center, and/or the node failure rehearsal of application node.
CN201610921883.9A 2016-10-21 2016-10-21 One kind is using disaster tolerance implementation method, apparatus and system Withdrawn CN107977287A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610921883.9A CN107977287A (en) 2016-10-21 2016-10-21 One kind is using disaster tolerance implementation method, apparatus and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610921883.9A CN107977287A (en) 2016-10-21 2016-10-21 One kind is using disaster tolerance implementation method, apparatus and system

Publications (1)

Publication Number Publication Date
CN107977287A true CN107977287A (en) 2018-05-01

Family

ID=62004697

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610921883.9A Withdrawn CN107977287A (en) 2016-10-21 2016-10-21 One kind is using disaster tolerance implementation method, apparatus and system

Country Status (1)

Country Link
CN (1) CN107977287A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110321250A (en) * 2019-06-03 2019-10-11 阿里巴巴集团控股有限公司 A kind of disaster recovery method and device for application
CN110971464A (en) * 2019-12-10 2020-04-07 国网信通亿力科技有限责任公司 Operation and maintenance automatic system suitable for disaster recovery center
CN111030945A (en) * 2019-12-06 2020-04-17 深信服科技股份有限公司 Disaster recovery method, disaster recovery gateway, storage medium, device and system
CN112732490A (en) * 2021-01-14 2021-04-30 国网上海市电力公司 Information determination method, device, equipment and storage medium
CN114780301A (en) * 2022-06-22 2022-07-22 深圳市木浪云科技有限公司 Disaster recovery method and system supporting multi-cloud production environment
CN115599606A (en) * 2022-11-16 2023-01-13 恒丰银行股份有限公司(Cn) Method, device and medium for generating disaster recovery switching scheme

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080313242A1 (en) * 2007-06-15 2008-12-18 Savvis, Inc. Shared data center disaster recovery systems and methods
CN102035798A (en) * 2009-09-25 2011-04-27 中兴通讯股份有限公司 Service processing method, system and device for realizing disaster tolerance
JP2012027587A (en) * 2010-07-21 2012-02-09 Tokyo Denki Univ Data distribution storage, method, program and storage medium
CN103647849A (en) * 2013-12-24 2014-03-19 华为技术有限公司 Method and device for migrating businesses and disaster recovery system
CN104115447A (en) * 2014-04-14 2014-10-22 华为技术有限公司 Allowing destroy scheme configuration method and device under cloud computing architecture
CN104794028A (en) * 2014-01-16 2015-07-22 中国移动通信集团浙江有限公司 Disaster tolerance processing method and device, main data center and backup data center
US9274903B1 (en) * 2013-09-23 2016-03-01 Amazon Technologies, Inc. Disaster recovery service

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080313242A1 (en) * 2007-06-15 2008-12-18 Savvis, Inc. Shared data center disaster recovery systems and methods
CN102035798A (en) * 2009-09-25 2011-04-27 中兴通讯股份有限公司 Service processing method, system and device for realizing disaster tolerance
JP2012027587A (en) * 2010-07-21 2012-02-09 Tokyo Denki Univ Data distribution storage, method, program and storage medium
US9274903B1 (en) * 2013-09-23 2016-03-01 Amazon Technologies, Inc. Disaster recovery service
CN103647849A (en) * 2013-12-24 2014-03-19 华为技术有限公司 Method and device for migrating businesses and disaster recovery system
CN104794028A (en) * 2014-01-16 2015-07-22 中国移动通信集团浙江有限公司 Disaster tolerance processing method and device, main data center and backup data center
CN104115447A (en) * 2014-04-14 2014-10-22 华为技术有限公司 Allowing destroy scheme configuration method and device under cloud computing architecture

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110321250A (en) * 2019-06-03 2019-10-11 阿里巴巴集团控股有限公司 A kind of disaster recovery method and device for application
CN110321250B (en) * 2019-06-03 2023-05-09 创新先进技术有限公司 Disaster recovery method and device for application
CN111030945A (en) * 2019-12-06 2020-04-17 深信服科技股份有限公司 Disaster recovery method, disaster recovery gateway, storage medium, device and system
CN111030945B (en) * 2019-12-06 2023-05-16 深信服科技股份有限公司 Disaster recovery method, disaster recovery gateway, storage medium, device and system
CN110971464A (en) * 2019-12-10 2020-04-07 国网信通亿力科技有限责任公司 Operation and maintenance automatic system suitable for disaster recovery center
CN112732490A (en) * 2021-01-14 2021-04-30 国网上海市电力公司 Information determination method, device, equipment and storage medium
CN114780301A (en) * 2022-06-22 2022-07-22 深圳市木浪云科技有限公司 Disaster recovery method and system supporting multi-cloud production environment
CN114780301B (en) * 2022-06-22 2022-09-13 深圳市木浪云科技有限公司 Disaster recovery method and system supporting multi-cloud production environment
CN115599606A (en) * 2022-11-16 2023-01-13 恒丰银行股份有限公司(Cn) Method, device and medium for generating disaster recovery switching scheme

Similar Documents

Publication Publication Date Title
CN107977287A (en) One kind is using disaster tolerance implementation method, apparatus and system
CN105429780B (en) A method of virtualization network service business automatically generates and dynamic monitors
da Cunha Rodrigues et al. Monitoring of cloud computing environments: concepts, solutions, trends, and future directions
CN103197952B (en) The management system and method disposed for application system maintenance based on cloud infrastructure
WO2016101638A1 (en) Operation management method for electric power system cloud simulation platform
CN109214704A (en) A kind of distributed intelligence operation platform, method, apparatus and readable storage medium storing program for executing
Koslovski et al. Reliability support in virtual infrastructures
CN104040503A (en) An open resilience framework for simplified and coordinated orchestration of multiple availability managers
CN107005421A (en) Utilize the management based on topology of stage and version policy
CN106209482A (en) A kind of data center monitoring method and system
CN105938443A (en) Method and system used for performing diagnosis in computating environment
CN108270726A (en) Application example dispositions method and device
US9854002B1 (en) Application centric compliance management system and method for a multi-level computing environment
CN103595572B (en) A kind of method of cloud computing cluster interior joint selfreparing
Veeraraghavan et al. Maelstrom: Mitigating datacenter-level disasters by draining interdependent traffic safely and efficiently
CN104850394B (en) The management method and distributed system of distributed application program
Yao et al. A hybrid fault-tolerant scheduling for deadline-constrained tasks in cloud systems
CN108062231A (en) A kind of cloud application automatic configuration method based on correlation analysis
CN108628716A (en) Information receives guard system, method and device
CN109254859A (en) Multilayer-control self-adaptive micro-service system
WO2023138014A1 (en) Intelligent operation and maintenance system oriented to computing-network integration scenario and use method thereof
CN108052371A (en) Railway TDCS/CTC systems and its application based on virtualization technology
CN108199901A (en) Hardware reports method, system, equipment, hardware management server and storage medium for repairment
CN108696373B (en) Virtual resource allocation method, NFVO and system
Zhou et al. FTCloudSim: support for cloud service reliability enhancement simulation

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication
WW01 Invention patent application withdrawn after publication

Application publication date: 20180501