CN108932180A - A kind of disaster tolerance management method, device, storage medium and computer equipment matter - Google Patents

A kind of disaster tolerance management method, device, storage medium and computer equipment matter Download PDF

Info

Publication number
CN108932180A
CN108932180A CN201810644529.5A CN201810644529A CN108932180A CN 108932180 A CN108932180 A CN 108932180A CN 201810644529 A CN201810644529 A CN 201810644529A CN 108932180 A CN108932180 A CN 108932180A
Authority
CN
China
Prior art keywords
openstack
managing device
disaster tolerance
resource
standby
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810644529.5A
Other languages
Chinese (zh)
Inventor
郭春庭
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhengzhou Yunhai Information Technology Co Ltd
Original Assignee
Zhengzhou Yunhai Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhengzhou Yunhai Information Technology Co Ltd filed Critical Zhengzhou Yunhai Information Technology Co Ltd
Priority to CN201810644529.5A priority Critical patent/CN108932180A/en
Publication of CN108932180A publication Critical patent/CN108932180A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1446Point-in-time backing up or restoration of persistent data
    • G06F11/1458Management of the backup or restore process
    • G06F11/1469Backup restoration techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • Hardware Redundancy (AREA)

Abstract

The invention discloses a kind of disaster tolerance management methods, applied to the system comprising the first cloud computing platform OpenStack and the 2nd OpenStack, first OpenStack is main OpenStack, 2nd OpenStack is standby OpenStack, first OpenStack is equipped with the first disaster tolerance managing device, 2nd OpenStack is equipped with the second disaster tolerance managing device, the following steps are included: the first disaster tolerance managing device monitors the resource change of the first OpenStack, based on preset Disaster Recovery Strategy, the resource change is synchronized to the 2nd OpenStack by the first disaster tolerance managing device and the second disaster tolerance managing device;When the first OpenStack collapse, the second disaster tolerance managing device is switched the 2nd OpenStack for main OpenStack based on disaster recovery strategy and the synchronous resource, and the first OpenStack is switched to standby OpenStack by the first disaster tolerance managing device.The recovery process complicated compared to manual operation can be protected according to tenant's classification between the resource of disaster tolerance two centers, can also flexibly configure various Disaster Recovery Strategies, avoid manual operation fault bring disaster.

Description

A kind of disaster tolerance management method, device, storage medium and computer equipment matter
Technical field
The present invention relates to computer technology, espespecially a kind of disaster tolerance management method, device, storage medium and computer equipment Matter.
Background technique
Cloud computing is in fast-developing and landing stage, and OpenStack is by its open source, open and stable spy Property, the fact that have become private clound standard.OpenStack landing at present most will urgently solve the problems, such as to be exactly data center Disaster tolerance is substantially in the Disaster Tolerant Scheme used in OpenStack cloud platform actual production environment based on traditional data center Disaster Tolerant Scheme realizes storage by means of technologies such as the remote copies of OpenStack bottom storage between active and standby Liang Ge data center The either synchronously or asynchronously duplication of volume.Then, corresponding resource is created in Disaster Preparation Center manually, or manages work by upper layer disaster tolerance Tool, maps the resource of Liang Ge data center, is also only limitted to the other mapping of virtual machine-level here.Although such manager Method can achieve the disaster tolerance of virtual machine, still, manually than relatively time-consuming, also be easy error, can not be easily the money in tenant Source carries out unified protection.
Summary of the invention
In order to solve the above-mentioned technical problems, the present invention provides a kind of disaster tolerance management method, device, storage medium and calculating Machine equipment matter can be realized automatic backup and the disaster tolerance of data center.
In order to reach the object of the invention, the present invention provides a kind of disaster tolerance management method, it is applied to include the first cloud computing The system of platform OpenStack and the 2nd OpenStack, the first OpenStack be main OpenStack, described second OpenStack is that the first disaster tolerance managing device is equipped with for OpenStack, the first OpenStack, described second OpenStack is equipped with the second disaster tolerance managing device, the described method comprises the following steps:
First disaster tolerance managing device monitors the resource change of the first OpenStack, is based on preset Disaster Recovery Strategy, and described the The resource change is synchronized to the 2nd OpenStack by one disaster tolerance managing device and the second disaster tolerance managing device;
When the first OpenStack collapse when, the second disaster tolerance managing device be based on disaster recovery strategy with it is described synchronous Resource the 2nd OpenStack is switched as main OpenStack, the first disaster tolerance managing device is by described first OpenStack is switched to standby OpenStack.
Further, further include, after the first OpenStack restores, the first disaster tolerance managing device and the second disaster tolerance The resource information of 2nd OpenStack is synchronized to the first OpenStack, the first disaster tolerance management by managing device First OpenStack is switched back into main OpenStack by device, and the second disaster tolerance managing device is by described second OpenStack switches back into standby OpenStack.
Further, described to be based on preset Disaster Recovery Strategy, the first disaster tolerance managing device and the second disaster tolerance management dress It sets and the resource change is synchronized to the 2nd OpenStack, comprising:
Based on preset Disaster Recovery Strategy, the first disaster tolerance managing device is by the resource change of the first OpenStack It is converted into resource information and is sent to the second disaster tolerance managing device;
The second disaster tolerance managing device receives and converts the resource information as the action command of establishing resource, is sent to 2nd OpenStack.
Further, the preset Disaster Recovery Strategy, including following one:
The resource change of all resources of all tenants is protected in monitoring;
Monitoring protection portion sublease family all resources resource change;
The resource change of the part resource of all tenants is protected in monitoring;
Monitoring protection portion sublease family part resource resource change.
Further, the disaster recovery strategy, including one or more of:
Start some or all of default tenant resource;
Successively starting resource is carried out according to priority;
Start resource according to the incidence relation of tenant and resource.
In order to reach the object of the invention, the present invention also provides a kind of disaster tolerance managing device, disaster tolerance managing device positions In the first cloud computing platform OpenStack, and when the first OpenStack is main OpenStack, for monitoring first The resource change of OpenStack, cooperating with the disaster tolerance managing device for being located at standby OpenStack, it is standby to be synchronized to the resource change OpenStack;And in the first OpenStack collapse, the first OpenStack to be switched to standby OpenStack;
The disaster tolerance managing device is located at the 2nd OpenStack, and when the 2nd OpenStack is standby OpenStack, The resource change is synchronized to the 2nd OpenStack for cooperating with the disaster tolerance managing device positioned at main OpenStack, and For the 2nd OpenStack being cut with the synchronous resource based on disaster recovery strategy in main OpenStack collapse It is changed to main OpenStack.
Further, it is also used to,
The disaster tolerance managing device is located at the 2nd OpenStack, and when the 2nd OpenStack is main OpenStack, The resource change for monitoring the 2nd OpenStack, after the first OpenStack restores, with the disaster tolerance management for being located at standby OpenStack The resource change is synchronized to the first OpenStack by device cooperation, the 2nd OpenStack is switched back into standby OpenStack;
The disaster tolerance managing device is located at the first OpenStack, and when the first OpenStack is standby OpenStack, After the first OpenStack restores, cooperates with the disaster tolerance managing device positioned at main OpenStack and be synchronized to the resource change First OpenStack is switched back into main OpenStack by the first OpenStack.
Further, the disaster tolerance managing device cooperates with the disaster tolerance managing device for being located at standby OpenStack by the money Source, which changes, is synchronized to standby OpenStack, comprising: the disaster tolerance managing device is based on preset Disaster Recovery Strategy, and the resource is become Turn changes resource information into and is sent to standby OpenStack;
The disaster tolerance managing device and the disaster tolerance managing device cooperation positioned at main OpenStack are synchronous by the resource change To the 2nd OpenStack, comprising: the disaster tolerance managing device is based on receiving and convert the resource information as the dynamic of establishing resource It instructs, is sent to the standby OpenStack.
In order to reach the object of the invention, the present invention also provides a kind of computer readable storage mediums, are stored thereon with meter Calculation machine program, when which is executed by processor the step of the realization above method.
In order to reach the object of the invention, the present invention also provides a kind of computer equipment, including memory, processor and The computer program that can be run on a memory and on a processor is stored, the processor is realized above-mentioned when executing described program Step in method.
Compared with prior art, the present invention includes the resource change that the first disaster tolerance managing device monitors the first OpenStack, Based on preset Disaster Recovery Strategy, the resource change is synchronized to by the first disaster tolerance managing device and the second disaster tolerance managing device 2nd OpenStack;When the first OpenStack collapse, the second disaster tolerance managing device is based on disaster recovery strategy The 2nd OpenStack is switched as main OpenStack with the synchronous resource, the first disaster tolerance managing device is by institute It states the first OpenStack and is switched to standby OpenStack.Can the resource to data center flexibly back up and automatic disaster tolerance.
Other features and advantages of the present invention will be illustrated in the following description, also, partly becomes from specification It obtains it is clear that understand through the implementation of the invention.The objectives and other advantages of the invention can be by specification, right Specifically noted structure is achieved and obtained in claim and attached drawing.
Detailed description of the invention
Attached drawing is used to provide to further understand technical solution of the present invention, and constitutes part of specification, with this The embodiment of application technical solution for explaining the present invention together, does not constitute the limitation to technical solution of the present invention.
Fig. 1 is the flow chart of one disaster tolerance management method of the embodiment of the present invention;
Fig. 2 is the integrated stand composition of two disaster tolerance management method of the embodiment of the present invention;
Fig. 3 is two disaster tolerance managing device architecture diagram of the embodiment of the present invention;
Fig. 4 is the structural schematic diagram of the computer equipment of the embodiment of the present invention three.
Specific embodiment
To make the objectives, technical solutions, and advantages of the present invention clearer, below in conjunction with attached drawing to the present invention Embodiment be described in detail.It should be noted that in the absence of conflict, in the embodiment and embodiment in the application Feature can mutual any combination.
Step shown in the flowchart of the accompanying drawings can be in a computer system such as a set of computer executable instructions It executes.Also, although logical order is shown in flow charts, and it in some cases, can be to be different from herein suitable Sequence executes shown or described step.
Embodiment one
A kind of disaster tolerance management method is present embodiments provided, is applied to include the first OpenStack and the 2nd OpenStack System, the first OpenStack is main OpenStack, and the 2nd OpenStack is standby OpenStack, the first OpenStack peace Equipped with the first disaster tolerance managing device, the 2nd OpenStack is equipped with the second disaster tolerance managing device, as shown in Figure 1, this method includes S11-S12:
S11, the first disaster tolerance managing device monitor the resource change of the first OpenStack, are based on preset Disaster Recovery Strategy, the Resource change is synchronized to the 2nd OpenStack by one disaster tolerance managing device and the second disaster tolerance managing device;
S12, when the first OpenStack collapse when, the second disaster tolerance managing device is based on disaster recovery strategy and synchronous money Source switches the 2nd OpenStack for main OpenStack, and the first OpenStack is switched to standby by the first disaster tolerance managing device OpenStack。
In the embodiment of the present invention, it is based on openstack cloud platform, by the way that the resource change of main OpenStack to be synchronized to Standby OpenStack is restored when main OpenStack collapse based on disaster recovery strategy.It can be to the resource of data center Flexibly backup and automatic disaster tolerance.
Calamity managing device monitors while connecting the OpenStack of other side, is detected, so when the first OpenStack collapses When bursting, it can be detected simultaneously by the first OpenStack collapse.
Resource change refers to the other variation of resource change object level, such as creation virtual machine etc., then according to the behaviour of event All information that the resource object is collected as object ID, are converted into resource information.Business datum rank is by the remote of storage Journey duplication carrys out real-time synchronization;
Resource information do not include the data of virtual machine, roll up the data of rank, the system disk data of virtual machine, virtual machine volume The business datums such as data;Above-mentioned business datum is still by the remote copy of storage come real-time synchronization.
The resource information includes: tenant's information, resource id information, resource specification information, resource network information, metadata Id information, the volume information of information, the storage volume of corresponding bottom;
When the resource change is main OpenStack initialization procedure, all resources of the main OpenStack are inquired, It is converted into resource information and is sent to the standby OpenStack.The standby OpenStack is equipped with standby disaster tolerance managing device for DR- Manager;The main OpenStack initialization procedure inquires all resources of the main OpenStack, is converted into resource information It is sent to the standby OpenStack, comprising: main DR-Manager inquires all resources of the main OpenStack, by the institute There is resource to be converted to resource information and is sent to the standby DR-Manager;The standby DR-Manager converts the resource information The standby OpenStack, which is sent to, at the action command of establishing resource carries out asset creation.
Optionally, the above method further include: after the first OpenStack restores, the first disaster tolerance managing device and second is held The resource information of 2nd OpenStack is synchronized to the first OpenStack by calamity managing device, and the first disaster tolerance managing device is by first OpenStack switches back into main OpenStack, and the 2nd OpenStack is switched back into standby OpenStack by the second disaster tolerance managing device.
According to concrete scene difference, the information for the process preamble that the former main OpenStack of switching is main OpenStack is not yet Together;If it is the electricity went off by main OpenStack, after re-powering, it is only necessary to which standby OpenStack resource becomes in synchronous power-off time It is dynamic;If it is main OpenStack data caused by the natural disasters such as earthquake are lost completely, then need to synchronize all standby OpenStack resource.
When main OpenStack collapse, standby data center becomes main according to the recovery policy that DR-Manager is configured OpenStack externally provides service.After the main OpenStack of original restores, the master of the synchronous distal end local DR-Manager OpenStack resource information to local, can choose business it is less when, switch back to former main OpenStack and service be externally provided.
In one alternate embodiment, described to be based on preset Disaster Recovery Strategy, the first disaster tolerance managing device and the second disaster tolerance Resource change is synchronized to the 2nd OpenStack by managing device, comprising:
Based on preset Disaster Recovery Strategy, the resource change of the first OpenStack is converted into providing by the first disaster tolerance managing device Source information is sent to the second disaster tolerance managing device;
Second disaster tolerance managing device receives and Transition Resource Information is the action command of establishing resource, is sent to second OpenStack。
In one alternate embodiment, the preset Disaster Recovery Strategy, including following one:
The resource change of all resources of all tenants is protected in monitoring;
Monitoring protection portion sublease family all resources resource change;
The resource change of the part resource of all tenants is protected in monitoring;
Monitoring protection portion sublease family part resource resource change.
In one alternate embodiment, the disaster recovery strategy, including one or more of:
Start some or all of default tenant resource;
Successively starting resource is carried out according to priority;
Start resource according to the incidence relation of tenant and resource.
Wherein, presetting some or all of tenant resource includes following one:
Start whole resources of whole tenants;
Whole resources of actuating section tenant;
Start the part resource in whole tenants;
The part resource of actuating section tenant
Herein in whole or in part refer to, configured in Disaster Recovery Strategy monitoring resource change on the basis of whole or Person part.
In the embodiment of the present invention, the first disaster tolerance managing device monitors the resource change of the first OpenStack, based on preset The resource change is synchronized to described second by Disaster Recovery Strategy, the first disaster tolerance managing device and the second disaster tolerance managing device OpenStack;When the first OpenStack collapse, the second disaster tolerance managing device is based on disaster recovery strategy and described same The resource of step switches the 2nd OpenStack for main OpenStack, and the first disaster tolerance managing device is by described first OpenStack is switched to standby OpenStack.The recovery process complicated compared to manual operation, can be between disaster tolerance two centers Resource is protected according to tenant's classification, can also flexibly configure various Disaster Recovery Strategies, and manual operation fault bring is avoided Disaster.
Embodiment two
Above-described embodiment method is specifically described in the present embodiment, provides a kind of based on OpenStack cloud platform The disaster tolerance management method of tenant's granularity, as shown in Fig. 2, disaster tolerance system overall architecture include in primary data center and standby data The heart.
One disaster tolerance managing device of each self installation, referred to as DR- in the active and standby Liang Ge data center of OpenStack disaster tolerance Manager, as shown in Fig. 3, the DR-Manager include: Event Collector, source synchronous device, actuator, disaster recovery Device carries out the real-time duplication of storage volume between Liang Ge data center by storage remote copy technology.
The configuration work of OpenStack, comprising: it is completed in active and standby data center deployment, and after having configured bottom disaster tolerance, Each self installation DR-Manager, receive the OpenStack of pipe local, and two DR_Manager are attached, configure as main DR_ Manager, standby DR_Manager.DR-Manager device includes: Event Collector, heartbeat detector, source synchronous device, executes Device, disaster recovery device.
Data backup when main OpenStack is worked normally, between main OpenStack and standby OpenStack, comprising: when It is all that the DR-Manager led is monitored when having new movement in main OpenStack, main DR-Manager collects master Resource information in OpenStack then synchronizes in standby DR-Manager, sends instructions under standby DR-Manager to standby OpenStack, standby OpenStack just have and identical tenant and resource in main OpenStack.Virtual machine VM and volume grade Other data, which change, to be realized by the remote copy of storage, host virtual machine and standby virtual machine in active and standby OpenStack System disk and volume are also real-time synchronization.
Resource after creation determines starting state or closed state according to the scheme of disaster tolerance, if being set as cold Standby, then resource is in off state, and is starting state if it is hot standby then resource.It can also support dual-active, it is only necessary to upper layer application Load balancing is accessed, the virtual machine of Liang Ge data center is accessed load balancing, network, which is got through, can be realized.
Start for the first time if it is DR-Manager, initialized, main DR-Manager inquires the master All resources are converted to resource information and are sent to the standby DR-Manager by all resources of OpenStack;It is described standby The action command that the resource information is converted into establishing resource is sent to the standby OpenStack and carries out resource by DR-Manager Creation.
When main OpenStack collapse, standby DR-Manager prison executes under disaster recovery device according to disaster recovery strategy It says the word to the order of actuator deactivation resource, standby OpenStack becomes main OpenStack, externally provides service, DR- Manager is oneself being arranged as main DR-Manager.
After former main OpenStack restores, run standby OpenStack oneself is regarded, local DR-Manager is set as standby. At this point, former standby OpenStack is main OpenStack, former main OpenStack is standby OpenStack.Former main OpenStack can be same The resource information of the former standby OpenStack of step.When business is not busy at that time, switchback is carried out manually, i.e. original OpenStack becomes Main, former standby OpenStack continues as standby center.
The concrete configuration of DR-Manager includes:
(1) basis instrument
Local DR-Manager is respectively disposed at the active and standby double centers of the disaster tolerance of OpenStack, and receive pipe local Then OpenStack is attached mutually by network, be able to carry out communication, main DR-Manager based on oneself being arranged, and It informs that other side DR-Manager allows it to be set as standby, detects whether other side deposits by beat detector between two DR-Manager Living, the generation of fissure in order to prevent, the third-party arbitration of configuration one is arranged, and DR-Manager connects other side's simultaneously OpenStack is detected, and the phenomenon that DR-Manager delay machine and OpenStack are survived is prevented.
(2) Disaster Recovery Strategy is configured on main DR-Manager, comprising:
A. all resources of all tenants are protected;
B. all resources of certain a part of tenant are protected;
C. the part resource of all tenants is protected;
D. protection portion is subleased the part resource at family;
Disaster Recovery Strategy is configured by tenant, relatively more flexible.
(3) main DR-Manager Event Collector carries out the event-monitoring of tenant according to the Disaster Recovery Strategy of configuration, such as creates Virtual machine etc. is built, all information of the resource object are then collected according to the operation object ID of event, are converted into resource information.
Resource information includes:
A. tenant's information
B. resource ID, specification, the network information
C. metadata, such as: secure group, code key key
D. the ID of the storage volume of bottom is corresponded to
E. the resource of relationship associated therewith, such as: volume
(4) recovery policy:
A. start whole resources of whole tenants;
B. whole resources of actuating section tenant;
C. start the part resource in whole tenants;
D. the part resource of actuating section tenant;
E. successively starting resource is carried out according to priority;
F. according to incidence relation, starting resource is successively carried out;
Based on above-mentioned setting, when main OpenStack is worked normally, the source synchronous at master/slave data center includes: the master The Event Collector of DR-Manager monitors the main OpenStack and resource change occurs, and the resource change is converted to Resource information is sent to the source synchronous device of the main DR-Manager;The source synchronous device of the main DR-Manager will be described Resource information is sent to the source synchronous device of the standby DR-Manager;The source synchronous device of the standby DR-Manager will be described Resource information is sent to the actuator of the standby DR-Manager;The actuator of the standby DR-Manager is by the resource information The action command for being converted into establishing resource is sent to the standby OpenStack and carries out asset creation.
Initialization procedure includes: to be when the Event Collector of the main DR-Manager monitors the main OpenStack All resources of the main OpenStack are converted to resource information and are sent to the main DR-Manager's by initialization procedure Source synchronous device;The resource information is sent to the standby DR-Manager's by the source synchronous device of the main DR-Manager Source synchronous device;The resource information is sent to the standby DR-Manager's by the source synchronous device of the standby DR-Manager Actuator;The action command that the resource information is converted into establishing resource is sent to institute by the actuator of the standby DR-Manager It states standby OpenStack and carries out asset creation.
Embodiment three
A kind of disaster tolerance managing device is present embodiments provided, description is also applied for the present embodiment in above method embodiment In, the disaster tolerance managing device is located at the first cloud computing platform OpenStack, and the first OpenStack is main OpenStack When, for monitoring the resource change of the first OpenStack, cooperate with the disaster tolerance managing device for being located at standby OpenStack by resource Variation is synchronized to standby OpenStack;And for the first OpenStack being switched to standby in the first OpenStack collapse OpenStack;
The disaster tolerance managing device is located at the 2nd OpenStack, and when the 2nd OpenStack is standby OpenStack, is used for It cooperates with the disaster tolerance managing device positioned at main OpenStack and resource change is synchronized to the 2nd OpenStack, and in master When OpenStack collapses, the 2nd OpenStack is switched as main OpenStack with synchronous resource based on disaster recovery strategy.
Optionally, above-mentioned apparatus is also used to: the disaster tolerance managing device is located at the 2nd OpenStack, and second When OpenStack is main OpenStack, the resource change of the 2nd OpenStack is monitored, after the first OpenStack restores, with It cooperates positioned at the disaster tolerance managing device of standby OpenStack and resource change is synchronized to the first OpenStack, by second OpenStack switches back into standby OpenStack;
The disaster tolerance managing device is located at the first OpenStack, and when the first OpenStack is standby OpenStack, when the After one OpenStack restores, cooperates with the disaster tolerance managing device positioned at main OpenStack and resource change is synchronized to first First OpenStack is switched back into main OpenStack by OpenStack;
In one alternate embodiment, disaster tolerance managing device cooperates with the disaster tolerance managing device for being located at standby OpenStack and incites somebody to action Resource change is synchronized to standby OpenStack, comprising: disaster tolerance managing device is based on preset Disaster Recovery Strategy, and resource change is converted Standby OpenStack is sent at resource information;
Disaster tolerance managing device cooperates with the disaster tolerance managing device positioned at main OpenStack resource change being synchronized to second OpenStack, comprising: disaster tolerance managing device is sent to based on the action command that simultaneously Transition Resource Information is establishing resource is received Standby OpenStack.
The embodiment of the present invention also provides a kind of computer storage medium, and the computer storage medium is stored with computer journey Sequence;After the computer program is performed, the attack evidence collecting method of previous embodiment offer can be realized, for example, executing such as Fig. 1 In shown method.
The embodiment of the present invention also provides a kind of computer equipment, can be structure as shown in Figure 4, including processor 41, memory 42 and storage on a memory and the computer program that can run on a processor, processor execution institute Part or Overall Steps in embodiment one or embodiment two can be realized when stating program.
Although disclosed herein embodiment it is as above, the content only for ease of understanding the present invention and use Embodiment is not intended to limit the invention.Technical staff in any fields of the present invention is taken off not departing from the present invention Under the premise of the spirit and scope of dew, any modification and variation, but the present invention can be carried out in the form and details of implementation Scope of patent protection, still should be subject to the scope of the claims as defined in the appended claims.
It will appreciated by the skilled person that whole or certain steps, system, dress in method disclosed hereinabove Functional module/unit in setting may be implemented as software, firmware, hardware and its combination appropriate.In hardware embodiment, Division between the functional module/unit referred in the above description not necessarily corresponds to the division of physical assemblies;For example, one Physical assemblies can have multiple functions or a function or step and can be executed by several physical assemblies cooperations.Certain groups Part or all components may be implemented as by processor, such as the software that digital signal processor or microprocessor execute, or by It is embodied as hardware, or is implemented as integrated circuit, such as specific integrated circuit.Such software can be distributed in computer-readable On medium, computer-readable medium may include computer storage medium (or non-transitory medium) and communication media (or temporarily Property medium).As known to a person of ordinary skill in the art, term computer storage medium is included in for storing information (such as Computer readable instructions, data structure, program module or other data) any method or technique in the volatibility implemented and non- Volatibility, removable and nonremovable medium.Computer storage medium include but is not limited to RAM, ROM, EEPROM, flash memory or its His memory technology, CD-ROM, digital versatile disc (DVD) or other optical disc storages, magnetic holder, tape, disk storage or other Magnetic memory apparatus or any other medium that can be used for storing desired information and can be accessed by a computer.This Outside, known to a person of ordinary skill in the art to be, communication media generally comprises computer readable instructions, data structure, program mould Other data in the modulated data signal of block or such as carrier wave or other transmission mechanisms etc, and may include any information Delivery media.

Claims (10)

1. a kind of disaster tolerance management method is applied to comprising the first cloud computing platform OpenStack and the 2nd OpenStack System, which is characterized in that the first OpenStack is main OpenStack, and the 2nd OpenStack is standby OpenStack, First OpenStack is equipped with the first disaster tolerance managing device, and the 2nd OpenStack is equipped with the second disaster tolerance management dress It sets, the described method comprises the following steps:
First disaster tolerance managing device monitors the resource change of the first OpenStack, is based on preset Disaster Recovery Strategy, and described first holds The resource change is synchronized to the 2nd OpenStack by calamity managing device and the second disaster tolerance managing device;
When the first OpenStack collapse, the second disaster tolerance managing device is based on disaster recovery strategy and the synchronous money Source switches the 2nd OpenStack for main OpenStack, and the first disaster tolerance managing device is by described first OpenStack is switched to standby OpenStack.
2. the method according to claim 1, wherein further include, after the first OpenStack restores, described the The resource information of 2nd OpenStack is synchronized to described first by one disaster tolerance managing device and the second disaster tolerance managing device First OpenStack is switched back into main OpenStack by OpenStack, the first disaster tolerance managing device, and described second holds 2nd OpenStack is switched back into standby OpenStack by calamity managing device.
3. the method according to claim 1, wherein described be based on preset Disaster Recovery Strategy, first disaster tolerance The resource change is synchronized to the 2nd OpenStack by managing device and the second disaster tolerance managing device, comprising:
Based on preset Disaster Recovery Strategy, the first disaster tolerance managing device converts the resource change of the first OpenStack The second disaster tolerance managing device is sent at resource information;
The second disaster tolerance managing device receives and converts the resource information as the action command of establishing resource, is sent to described 2nd OpenStack.
4. method according to claim 1 or 3, which is characterized in that the preset Disaster Recovery Strategy, including following one:
The resource change of all resources of all tenants is protected in monitoring;
Monitoring protection portion sublease family all resources resource change;
The resource change of the part resource of all tenants is protected in monitoring;
Monitoring protection portion sublease family part resource resource change.
5. the method according to claim 1, wherein the disaster recovery strategy, including one or more of:
Start some or all of default tenant resource;
Successively starting resource is carried out according to priority;
Start resource according to the incidence relation of tenant and resource.
6. a kind of disaster tolerance managing device, which is characterized in that
The disaster tolerance managing device is located at the first cloud computing platform OpenStack, and based on the first OpenStack When OpenStack, for monitoring the resource change of the first OpenStack, assisted with the disaster tolerance managing device for being located at standby OpenStack Make for the resource change to be synchronized to standby OpenStack;And it is used in the first OpenStack collapse, by described first OpenStack is switched to standby OpenStack;
The disaster tolerance managing device is located at the 2nd OpenStack, and when the 2nd OpenStack is standby OpenStack, is used for It cooperates with the disaster tolerance managing device positioned at main OpenStack and the resource change is synchronized to the 2nd OpenStack, and be used for In main OpenStack collapse, the 2nd OpenStack is switched to the synchronous resource based on disaster recovery strategy Main OpenStack.
7. device according to claim 6, which is characterized in that it is also used to,
The disaster tolerance managing device is located at the 2nd OpenStack, and when the 2nd OpenStack is main OpenStack, monitoring The resource change of 2nd OpenStack, after the first OpenStack restores, with the disaster tolerance managing device for being located at standby OpenStack The resource change is synchronized to the first OpenStack by cooperation, and the 2nd OpenStack is switched back into standby OpenStack;
The disaster tolerance managing device is located at the first OpenStack, and when the first OpenStack is standby OpenStack, when the After one OpenStack restores, cooperates with the disaster tolerance managing device positioned at main OpenStack and the resource change is synchronized to first First OpenStack is switched back into main OpenStack by OpenStack.
8. device according to claim 6, which is characterized in that
The disaster tolerance managing device cooperate with the disaster tolerance managing device for being located at standby OpenStack the resource change is synchronized to it is standby OpenStack, comprising: the disaster tolerance managing device is based on preset Disaster Recovery Strategy, and the resource change is converted into resource letter Breath is sent to standby OpenStack;
The disaster tolerance managing device cooperates with the disaster tolerance managing device positioned at main OpenStack the resource change being synchronized to the Two OpenStack, comprising: the disaster tolerance managing device is referred into the movement of establishing resource based on receiving and convert the resource information It enables, is sent to the standby OpenStack.
9. a kind of computer readable storage medium, is stored thereon with computer program, which is characterized in that the program is held by processor The step of any one of claim 1-5 the method is realized when row.
10. a kind of computer equipment, can run on a memory and on a processor including memory, processor and storage Computer program, which is characterized in that realize that any one of claim 1-5 right such as is wanted when the processor executes described program Seek the step in the method.
CN201810644529.5A 2018-06-21 2018-06-21 A kind of disaster tolerance management method, device, storage medium and computer equipment matter Pending CN108932180A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810644529.5A CN108932180A (en) 2018-06-21 2018-06-21 A kind of disaster tolerance management method, device, storage medium and computer equipment matter

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810644529.5A CN108932180A (en) 2018-06-21 2018-06-21 A kind of disaster tolerance management method, device, storage medium and computer equipment matter

Publications (1)

Publication Number Publication Date
CN108932180A true CN108932180A (en) 2018-12-04

Family

ID=64446136

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810644529.5A Pending CN108932180A (en) 2018-06-21 2018-06-21 A kind of disaster tolerance management method, device, storage medium and computer equipment matter

Country Status (1)

Country Link
CN (1) CN108932180A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112099835A (en) * 2020-09-07 2020-12-18 紫光云(南京)数字技术有限公司 Soft load balancing mirror image dynamic upgrading method and system
CN112929221A (en) * 2021-03-02 2021-06-08 浪潮云信息技术股份公司 Method for realizing disaster tolerance of main and standby cloud service products
CN113691342A (en) * 2021-09-13 2021-11-23 上海云轴信息科技有限公司 Method and system for time synchronization of virtualization platform
CN113821384A (en) * 2021-08-20 2021-12-21 济南浪潮数据技术有限公司 Cross-data-center and same-city disaster recovery method, device and equipment based on cloud platform
CN114745557A (en) * 2022-03-22 2022-07-12 浙江大华技术股份有限公司 Disaster recovery operation execution method and device, storage medium and electronic device
WO2023093379A1 (en) * 2021-11-26 2023-06-01 中兴通讯股份有限公司 Disaster recovery switching method and system, electronic device, and storage medium

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1851657A (en) * 2005-07-21 2006-10-25 上海华为技术有限公司 Dual-machine back-up realizing method and system
CN102629224A (en) * 2012-04-26 2012-08-08 广东电子工业研究院有限公司 Method and device of integrated data disaster recovery based on cloud platform
US20140280961A1 (en) * 2013-03-15 2014-09-18 Frank Martinez System and method for a cloud computing abstraction with multi-tier deployment policy
CN104794012A (en) * 2014-03-19 2015-07-22 北京中科同向信息技术有限公司 Switching technique based on disaster recovery
CN104794028A (en) * 2014-01-16 2015-07-22 中国移动通信集团浙江有限公司 Disaster tolerance processing method and device, main data center and backup data center
US9582379B2 (en) * 2014-06-16 2017-02-28 International Business Machines Corporation Multi-site disaster recovery mechanism for distributed cloud orchestration software
CN106506636A (en) * 2016-11-04 2017-03-15 武汉噢易云计算股份有限公司 A kind of cloud platform cluster method and system based on OpenStack
CN106502823A (en) * 2016-09-29 2017-03-15 北京许继电气有限公司 data cloud backup method and system
CN106776140A (en) * 2016-12-21 2017-05-31 博飞信息科技(上海)有限公司 The standby device and method for recovering all-in-one of super disaster tolerance
CN106844180A (en) * 2017-02-07 2017-06-13 山东浪潮云服务信息科技有限公司 A kind of monitoring and controlling forecast method of OpenStack platforms computing resource
CN107241430A (en) * 2017-07-03 2017-10-10 国家电网公司 A kind of enterprise-level disaster tolerance system and disaster tolerant control method based on distributed storage
CN107256184A (en) * 2017-06-05 2017-10-17 郑州云海信息技术有限公司 A kind of data disaster backup method and device based on storage pool
CN107995043A (en) * 2017-12-15 2018-05-04 南京南瑞信息通信科技有限公司 Application disaster recovery and backup systems and calamity based on mixing cloud platform are for collocation method

Patent Citations (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1851657A (en) * 2005-07-21 2006-10-25 上海华为技术有限公司 Dual-machine back-up realizing method and system
CN102629224A (en) * 2012-04-26 2012-08-08 广东电子工业研究院有限公司 Method and device of integrated data disaster recovery based on cloud platform
CN102629224B (en) * 2012-04-26 2014-11-26 广东电子工业研究院有限公司 Method and device of integrated data disaster recovery based on cloud platform
US20140280961A1 (en) * 2013-03-15 2014-09-18 Frank Martinez System and method for a cloud computing abstraction with multi-tier deployment policy
CN104794028A (en) * 2014-01-16 2015-07-22 中国移动通信集团浙江有限公司 Disaster tolerance processing method and device, main data center and backup data center
CN104794012A (en) * 2014-03-19 2015-07-22 北京中科同向信息技术有限公司 Switching technique based on disaster recovery
US9582379B2 (en) * 2014-06-16 2017-02-28 International Business Machines Corporation Multi-site disaster recovery mechanism for distributed cloud orchestration software
CN106502823A (en) * 2016-09-29 2017-03-15 北京许继电气有限公司 data cloud backup method and system
CN106506636A (en) * 2016-11-04 2017-03-15 武汉噢易云计算股份有限公司 A kind of cloud platform cluster method and system based on OpenStack
CN106776140A (en) * 2016-12-21 2017-05-31 博飞信息科技(上海)有限公司 The standby device and method for recovering all-in-one of super disaster tolerance
CN106844180A (en) * 2017-02-07 2017-06-13 山东浪潮云服务信息科技有限公司 A kind of monitoring and controlling forecast method of OpenStack platforms computing resource
CN107256184A (en) * 2017-06-05 2017-10-17 郑州云海信息技术有限公司 A kind of data disaster backup method and device based on storage pool
CN107241430A (en) * 2017-07-03 2017-10-10 国家电网公司 A kind of enterprise-level disaster tolerance system and disaster tolerant control method based on distributed storage
CN107995043A (en) * 2017-12-15 2018-05-04 南京南瑞信息通信科技有限公司 Application disaster recovery and backup systems and calamity based on mixing cloud platform are for collocation method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
MANISH POKHAREL 等: "Disaster Recovery for System Architecture Using Cloud Computing", 《2010 10TH IEEE/IPSJ INTERNATIONAL SYMPOSIUM ON APPLICATIONS AND THE INTERNET》 *
于洪涛 等: "灾备场景下OpenStack虚机实时迁移网络研究", 《天津科技》 *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112099835A (en) * 2020-09-07 2020-12-18 紫光云(南京)数字技术有限公司 Soft load balancing mirror image dynamic upgrading method and system
CN112099835B (en) * 2020-09-07 2022-10-28 紫光云(南京)数字技术有限公司 Soft load balancing mirror image dynamic upgrading method and system
CN112929221A (en) * 2021-03-02 2021-06-08 浪潮云信息技术股份公司 Method for realizing disaster tolerance of main and standby cloud service products
CN113821384A (en) * 2021-08-20 2021-12-21 济南浪潮数据技术有限公司 Cross-data-center and same-city disaster recovery method, device and equipment based on cloud platform
CN113691342A (en) * 2021-09-13 2021-11-23 上海云轴信息科技有限公司 Method and system for time synchronization of virtualization platform
WO2023093379A1 (en) * 2021-11-26 2023-06-01 中兴通讯股份有限公司 Disaster recovery switching method and system, electronic device, and storage medium
CN114745557A (en) * 2022-03-22 2022-07-12 浙江大华技术股份有限公司 Disaster recovery operation execution method and device, storage medium and electronic device
CN114745557B (en) * 2022-03-22 2024-05-24 浙江大华技术股份有限公司 Disaster recovery operation execution method and device, storage medium and electronic device

Similar Documents

Publication Publication Date Title
CN108932180A (en) A kind of disaster tolerance management method, device, storage medium and computer equipment matter
WO2018103318A1 (en) Distributed transaction handling method and system
US7865680B2 (en) Remote copy system
CN100489796C (en) Methods and system for implementing shared disk array management functions
US7594138B2 (en) System and method of error recovery for backup applications
CN105471622B (en) A kind of high availability method and system of the control node active-standby switch based on Galera
CN103414712B (en) A kind of distributed virtual desktop management system and method
CN106850260A (en) A kind of dispositions method and device of virtual resources management platform
US20040153719A1 (en) Method for controlling information processing system, information processing system and information processing program
JP2003076592A (en) Data storage system
US20170255528A1 (en) Smart data replication recoverer
CN103294701B (en) A kind of method that distributed file system and data process
WO2014101424A1 (en) Method and system for synchronizing distributed database
CN105760519A (en) Cluster file system and file lock allocation method thereof
EP2545467A1 (en) Data synchronization between a data center environment and a cloud computing environment
CN101441582A (en) Backup and recovery method of computer data object and system and program product for implementing the same
CN107111530B (en) A kind of disaster recovery method, system and device
CN108984107A (en) Improve the availability of storage system
CN104486438B (en) The disaster recovery method and device of distributed memory system
JP2008186142A (en) Remote backup method and storage system
EP2869199A2 (en) Distributed reservation systems and methods
US20090063486A1 (en) Data replication using a shared resource
CN115794499B (en) Method and system for dual-activity replication data among distributed block storage clusters
CN110175089A (en) A kind of dual-active disaster recovery and backup systems with read and write abruption function
US20110167235A1 (en) Remote copy system and remote copy control method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20181204