CN106502835A - Disaster-tolerant backup method and device - Google Patents


Info

Publication number
CN106502835A
Authority
CN
China
Prior art keywords
backup
node
disaster
time
tolerant
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610943435.9A
Other languages
Chinese (zh)
Other versions
CN106502835B (en)
Inventor
韩笑
郝建明
宋泽锋
伍福生
简超
潘星明
李兴锋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Unionpay Co Ltd
Original Assignee
China Unionpay Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Unionpay Co Ltd
Priority to CN201610943435.9A
Publication of CN106502835A
Application granted
Publication of CN106502835B
Legal status: Active
Anticipated expiration


Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14 Error detection or correction of the data by redundancy in operation
    • G06F11/1402 Saving, restoring, recovering or retrying
    • G06F11/1446 Point-in-time backing up or restoration of persistent data
    • G06F11/1458 Management of the backup or restore process
    • G06F11/1464 Management of the backup or restore process for networked environments
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14 Error detection or correction of the data by redundancy in operation
    • G06F11/1479 Generic software techniques for error detection or fault masking
    • G06F11/1489 Generic software techniques for error detection or fault masking through recovery blocks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Hardware Redundancy (AREA)

Abstract

The invention provides a disaster-tolerant backup method and device. The method includes: determining whether a backup node is in the data-synchronization-interrupted state; when the backup node is in that state, determining whether the difference between the time of the last data update on the host node and the system time is less than a first time threshold; if that difference is less than the first time threshold, starting the backup node so that it synchronizes data from the host node; and polling the backup node and, when it has completed data synchronization, stopping the backup node's synchronization. With the method and device provided by the embodiments of the invention, each controlled node can be controlled remotely, so that off-site data replication and backup become process-driven and automated.

Description

Disaster-tolerant backup method and device
Technical field
The present invention relates to the field of communication technology, and in particular to a disaster-tolerant backup method and device.
Background
With the rapid development of information technology, information security has become a widely shared concern in every industry. Computers now provide more efficient and more timely information services than ever before and have automated the storage and management of information, but their widespread use also carries considerable risk: once a computer system suffers irreversible damage, the losses can be enormous. To restore data quickly after a disaster, the prior art generally relies on data backup, including manual backup and automated backup. NetApp SnapMirror software offers proven efficiency, simplicity, and reasonable cost, and has therefore long been the preferred technology for replication and disaster recovery in NetApp storage environments. However, although NetApp SnapMirror can replicate and back up data efficiently to a remote site, the remote transfer of the data still has to be controlled manually.
Summary of the invention
To solve the above technical problem, the invention provides a disaster-tolerant backup method and device.
One aspect of the present invention provides a disaster-tolerant backup method, which includes:
determining whether the backup node is in the data-synchronization-interrupted state;
when the backup node is in the data-synchronization-interrupted state, determining whether the difference between the time of the last data update on the host node and the system time is less than a first time threshold;
if that difference is less than the first time threshold, starting the backup node so that it synchronizes data from the host node;
polling the backup node and, when the backup node has completed data synchronization, stopping the backup node's synchronization.
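The four steps of the method can be sketched as a small control loop. This is only an illustration under stated assumptions, not the patented implementation: the state names, the `BackupControl` callables, the poll interval, and the `max_polls` bound are all hypothetical and do not come from the text.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical state names, chosen for this sketch only.
INTERRUPTED = "interrupted"
SYNCING = "syncing"
COMPLETED = "completed"


class DisasterToleranceError(RuntimeError):
    """Raised where the method would issue an alarm and report an error."""


@dataclass
class BackupControl:
    """Callables the controller needs; every name here is illustrative."""
    backup_state: Callable[[], str]        # current state of the backup node
    host_last_update: Callable[[], float]  # epoch seconds of the host's last update
    start_backup: Callable[[], None]       # start synchronization from the host
    stop_backup: Callable[[], None]        # stop synchronization
    now: Callable[[], float]               # system time in epoch seconds
    sleep: Callable[[float], None]         # wait between polls


def run_backup(ctl: BackupControl, first_threshold_s: float,
               poll_interval_s: float = 60.0, max_polls: int = 100) -> None:
    # Step 1: the backup node must be in the interrupted state.
    if ctl.backup_state() != INTERRUPTED:
        raise DisasterToleranceError("backup node is not in the interrupted state")
    # Step 2: the host node's last update must be recent enough.
    if ctl.now() - ctl.host_last_update() >= first_threshold_s:
        raise DisasterToleranceError("host node data is stale")
    # Step 3: start synchronization from the host node.
    ctl.start_backup()
    # Step 4: poll until synchronization completes, then stop it.
    for _ in range(max_polls):
        if ctl.backup_state() == COMPLETED:
            ctl.stop_backup()
            return
        ctl.sleep(poll_interval_s)
    raise DisasterToleranceError("synchronization did not complete")
```

Passing the clock and the sleep function in as callables keeps the loop testable without real nodes or real waiting.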
In one embodiment, if the backup node cannot be stopped when it has completed data synchronization, the method further includes: polling the host node for a preset time and, if the host node stops receiving external data within the preset time, stopping the backup node's synchronization.
In one embodiment, the method further includes: if the host node does not stop receiving external data within the preset time, issuing an alarm and reporting a disaster-tolerance error.
In one embodiment, when the backup node has completed data synchronization, the method further includes:
determining whether the difference between the time of the last synchronization on the backup node and the time of the last data update on the host node is less than a second time threshold;
when that difference is less than the second time threshold, stopping the backup node's synchronization.
In one embodiment, after the difference between the time of the last data update on the host node and the system time has been found to be less than the first time threshold and the backup node has been started to synchronize data from the host node, the method further includes: determining whether the backup node is still in the data-synchronization-interrupted state and, if it is, restarting the backup node to perform data synchronization.
In one embodiment, the method further includes: starting a timer once the backup node has been started, and checking whether the total time the backup node has spent on the current data synchronization exceeds a third time threshold; if so, issuing an alarm and reporting a disaster-tolerance error.
In one embodiment, the method further includes: when the result of determining whether the backup node is in the data-synchronization-interrupted state is negative, issuing an alarm and reporting a disaster-tolerance error.
In one embodiment, the method further includes: when the difference between the time of the last synchronization on the backup node and the time of the last data update on the host node is greater than the second time threshold, having the backup node continue synchronizing data.
In another aspect, an embodiment of the present invention provides a disaster-tolerant backup device, which includes:
a backup node access unit, configured to determine whether the backup node is in the data-synchronization-interrupted state;
a host node access unit, configured to determine whether the difference between the time of the last data update on the host node and the system time is less than a first time threshold;
a backup node start unit, configured to start the backup node so that it synchronizes data from the host node when the difference between the time of the last data update on the host node and the system time is less than the first time threshold;
a polling control unit, configured to poll the backup node and, when the backup node has completed data synchronization, stop the backup node's synchronization.
In one embodiment, if the backup node cannot be stopped when it has completed data synchronization, the polling control unit is further configured to: poll the host node for a preset time and, if the host node stops receiving external data within the preset time, stop the backup node's synchronization.
In one embodiment, the device further includes an error reporting unit, configured to issue an alarm and report a disaster-tolerance error when the host node does not stop receiving external data within the preset time.
In one embodiment, the polling control unit further includes:
a time difference judgment module, configured to determine whether the difference between the time of the last synchronization on the backup node and the time of the last data update on the host node is less than a second time threshold;
a node interrupt module, configured to stop the backup node's synchronization when the difference between the time of the last synchronization on the backup node and the time of the last data update on the host node is less than the second time threshold.
In one embodiment, the polling control unit further includes:
a judgment module, configured to determine, after the backup node start unit has started the backup node, whether the backup node is still in the data-synchronization-interrupted state;
a restart module, configured to restart the backup node to perform data synchronization when the backup node is still in the data-synchronization-interrupted state.
In one embodiment, the device further includes a timing unit, configured to start a timer after the backup node is started and to check whether the total time the backup node has spent on the current data synchronization exceeds a third time threshold; when it does, the error reporting unit issues an alarm and reports a disaster-tolerance error.
In one embodiment, the error reporting unit is further configured to issue an alarm and report a disaster-tolerance error when the output of the backup node access unit is negative.
In one embodiment, the polling control unit further includes an update module, configured to have the backup node continue synchronizing data when the output of the time difference judgment module is negative.
In one embodiment, the disaster-tolerant backup device is located in the backup node and accesses the host node remotely over SSH, where SSH denotes the Secure Shell protocol.
In one embodiment, the disaster-tolerant backup device is located in the host node and accesses the backup node remotely over SSH.
In one embodiment, the disaster-tolerant backup device is deployed independently, outside both the backup node and the host node.
With the disaster-tolerant backup method and device provided by the embodiments of the present invention, each controlled node can be controlled remotely, so that off-site data replication and backup become process-driven and automated.
Brief description of the drawings
To describe the technical solutions of the embodiments of the present invention or of the prior art more clearly, the drawings needed for that description are briefly introduced below. The drawings described below show only some embodiments of the invention; a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a schematic flowchart of a disaster-tolerant backup method according to an embodiment of the present invention;
Fig. 2 is a schematic flowchart of another disaster-tolerant backup method according to an embodiment of the present invention;
Fig. 3 is a schematic structural diagram of a disaster-tolerant backup device according to an embodiment of the present invention;
Fig. 4 is a schematic structural diagram of polling control unit 4 according to an embodiment of the present invention;
Fig. 5 is an architecture diagram of a disaster-tolerant backup device according to an embodiment of the present invention;
Fig. 6 is a control structure diagram of another disaster-tolerant system according to an embodiment of the present invention.
Detailed description
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some, not all, of the embodiments of the invention. All other embodiments obtained by a person of ordinary skill in the art from the embodiments of the present invention without creative effort fall within the scope of protection of the invention.
The present invention provides an embodiment of a disaster-tolerant backup method. As shown in Fig. 1, the method mainly includes the following steps:
Step S1: determine whether the backup node is in the data-synchronization-interrupted state.
If the backup node is currently in the data-synchronization-interrupted state, it meets the requirement for continuing to synchronize data. If the result of step S1 is negative, the backup node does not meet that requirement; an alarm is issued and a disaster-tolerance error is reported (step S6), and the following steps are not executed.
Step S2: when the backup node is in the data-synchronization-interrupted state, further determine whether the difference between the time of the last data update on the host node and the system time is less than a first time threshold.
In the embodiments of the present invention, external data is first received by the host node and the backup node replicates data from the host node, so the host node is the backup node's data source. Step S2 accesses the host node and determines whether the difference between the time of its last data update and the system time is less than a preset time threshold. If it is, the host node's last update is essentially in step with the external data source, and the subsequent steps can be executed. The main purpose of this step is therefore to determine whether the external data held by the host node is the latest data.
In one embodiment, this time threshold is typically 15 minutes: the time of the last data update on the host node may differ from the current system time by up to 15 minutes. If the difference exceeds 15 minutes, an alarm is issued and a disaster-tolerance error is reported (step S6).
Step S3: if the difference between the time of the last data update on the host node and the system time is less than the first time threshold, start the backup node so that it synchronizes data from the host node.
If the time of the last data update on the host node is less than 15 minutes behind the current system time, the external data last written to the host node can be treated as the latest data, and the backup node is started to synchronize data from the host node.
Step S4: poll the backup node and determine whether it has completed data synchronization.
Step S5: when the backup node has completed data synchronization, stop the backup node's synchronization.
In the embodiments of the present invention, the backup node is controlled remotely over SSH to fetch the external data held by the host node, which ensures that the data is synchronized to the backup node. In operation only a single backup node has to be managed, so the managed object is simple, and off-site data replication and backup become process-driven and automated.
In one embodiment, when the backup node has completed data synchronization, step S5 can normally stop it, which puts it into the data-synchronization-interrupted state. However, if the host node is receiving external data at that moment, that is, if the host node is in the data-transfer state, it may not be possible to stop the backup node even though it has completed synchronization. To avoid repeated error reports, a buffer time is preset for the host node, and the host node is polled for that buffer time. If the host node becomes idle (stops receiving external data) within the buffer time, the backup node can then be stopped successfully. If the host node does not stop receiving external data within the preset time, an alarm is issued and a disaster-tolerance error is reported.
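The buffer-time behaviour can be illustrated with a bounded polling loop. This is a minimal sketch, assuming the controller can ask whether the host node is still receiving data; the function name, its parameters, and the poll interval are hypothetical, and the text does not give a value for the buffer time.

```python
from typing import Callable


def stop_backup_when_host_idle(
    host_is_receiving: Callable[[], bool],
    stop_backup: Callable[[], None],
    buffer_s: float,
    poll_interval_s: float,
    now: Callable[[], float],
    sleep: Callable[[float], None],
) -> bool:
    """Poll the host for up to `buffer_s` seconds and stop the backup once it is idle.

    Returns True if the backup node was stopped, or False if the buffer time
    ran out, in which case the caller would issue the alarm.
    """
    deadline = now() + buffer_s
    while True:
        if not host_is_receiving():
            stop_backup()
            return True
        if now() >= deadline:
            return False
        sleep(poll_interval_s)
```

Returning a boolean rather than raising lets the caller decide how the alarm is reported.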
Fig. 2 is a schematic flowchart of another disaster-tolerant backup method provided by an embodiment of the present invention. As shown in Fig. 2, a judgment step S7 is added between polling the backup node and stopping its synchronization, to determine whether the backup node is still in the data-synchronization-interrupted state. If it is, step S3 failed to start the backup node, and the backup node must be restarted to perform data synchronization.
As shown in Fig. 2, when the backup node has completed data synchronization, the method further includes step S8: determine whether the difference between the time of the last synchronization on the backup node and the time of the last data update on the host node is less than a time threshold, and stop the backup node's synchronization when it is. This step verifies the result after the backup node has finished replicating, ensuring that the data last synchronized to the backup node is the latest external data on the host node. When the difference is greater than the time threshold, the backup node continues synchronizing data. The threshold here may be 5 minutes or 15 minutes, but is generally not less than 5 minutes.
The disaster-tolerant backup method provided by the embodiments of the present invention also includes a timing process (not shown in Fig. 2). To guard against the backup node hanging while it synchronizes data from the host node, a time threshold is set: a timer is started when the backup node is started, and the method checks whether the total time the backup node has spent on the current data synchronization exceeds the threshold; if it does, a disaster-tolerance error is reported. This threshold is typically 40 minutes, that is, the whole process of synchronizing data from the host node should finish within 40 minutes. Exceeding the threshold indicates that something abnormal has happened during synchronization and that some step may have hung. The check is the last line of defense that keeps the whole synchronization process under control.
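The timing process amounts to a watchdog that is started together with the backup node and consulted on every poll. A minimal sketch, assuming a monotonic clock is acceptable for measuring elapsed time; the class name and its interface are hypothetical.

```python
import time
from typing import Callable


class SyncWatchdog:
    """Tracks how long the current synchronization has been running."""

    def __init__(self, limit_s: float = 40 * 60,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self._limit_s = limit_s      # 40 minutes by default, as in the embodiment
        self._clock = clock
        self._started_at = clock()   # timing starts when the backup node is started

    def elapsed(self) -> float:
        """Seconds since the watchdog was created."""
        return self._clock() - self._started_at

    def expired(self) -> bool:
        """True once the total synchronization time exceeds the limit."""
        return self.elapsed() > self._limit_s
```

A controller would create the watchdog immediately after starting the backup node and check `expired()` before each state poll, reporting an error as soon as it returns True.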
Based on the same inventive concept as the disaster-tolerant backup method shown in Fig. 1 and Fig. 2, an embodiment of the invention also provides a disaster-tolerant backup device, described in the following embodiments. Because the device solves the problem on a principle similar to that of the method, its implementation may refer to the implementation of the method, and the overlapping details are not repeated.
Fig. 3 is a schematic structural diagram of a disaster-tolerant backup device according to an embodiment of the present invention. As shown in Fig. 3, the device includes a backup node access unit 1, a host node access unit 2, a backup node start unit 3, and a polling control unit 4. The backup node access unit 1 determines whether the backup node is in the data-synchronization-interrupted state. The host node access unit 2 determines whether the difference between the time of the last data update on the host node and the system time is less than a first time threshold. The backup node start unit 3 starts the backup node so that it synchronizes data from the host node when that difference is less than the first time threshold. The polling control unit 4 polls the backup node and, when the backup node has completed data synchronization, stops the backup node's synchronization.
In one embodiment, if the backup node cannot be stopped when it has completed data synchronization, the polling control unit 4 may also poll the host node for a preset time; if the host node stops receiving external data within the preset time, the backup node's synchronization can be stopped.
In one embodiment, the device further includes an error reporting unit 5, configured to issue an alarm and report a disaster-tolerance error when the host node does not stop receiving external data within the preset time.
As shown in Fig. 4, the polling control unit 4 includes a time difference judgment module 41 and a node interrupt module 42. The time difference judgment module 41 determines whether the difference between the time of the last synchronization on the backup node and the time of the last data update on the host node is less than a second time threshold. The node interrupt module 42 stops the backup node's synchronization when that difference is less than the second time threshold. The second time threshold may be set to no less than 5 minutes.
In one embodiment, the polling control unit 4 further includes a judgment module 43 and a restart module 44. The judgment module 43 determines, after the backup node start unit 3 has started the backup node, whether the backup node is still in the data-synchronization-interrupted state. The restart module 44 restarts the backup node to perform data synchronization when the backup node is still in that state. In other words, if the judgment module 43 finds that the backup node is still in the data-synchronization-interrupted state after the backup node start unit 3 has started it, the backup node failed to start, and the restart module 44 must restart it to perform data synchronization.
In one embodiment, the disaster-tolerant backup device further includes a timing unit 6, configured to start a timer after the backup node is started and to check whether the total time the backup node has spent on the current data synchronization exceeds a third time threshold. When it exceeds the third time threshold, the error reporting unit 5 issues an alarm and reports a disaster-tolerance error.
In one embodiment, when the output of the backup node access unit 1 is negative (the initial state of the backup node is not the data-synchronization-interrupted state), the error reporting unit 5 can issue an alarm and report a disaster-tolerance error.
In one embodiment, the polling control unit 4 further includes an update module 45, configured, when the output of the time difference judgment module 41 is negative, to jump to the backup node start unit 3 so that the backup node is started and continues synchronizing data from the host node.
With the disaster-tolerant backup method and device provided by the embodiments of the present invention, each controlled node can be controlled remotely, so that off-site data replication and backup become process-driven and automated.
The disaster-tolerant backup device may be located in the backup node, in the host node, or independently outside both. When it is located in the backup node, it can access the host node remotely over SSH (Secure Shell). When it is located in the host node, it can access the backup node remotely over SSH. When it is deployed independently outside the backup node and the host node, it can access both of them remotely over SSH.
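Remote access over SSH could be done by invoking the standard `ssh` client. The sketch below is an assumption about how a controller might do this and is not taken from the patent: the helper names are hypothetical, the user, host, and remote command are supplied by the caller, and `BatchMode=yes` is used only so that an unattended run fails instead of prompting for a password.

```python
import subprocess
from typing import List


def build_ssh_command(user: str, host: str, remote_command: str) -> List[str]:
    """Build the argument list for running one command on a remote node."""
    return ["ssh", "-o", "BatchMode=yes", f"{user}@{host}", remote_command]


def run_remote(user: str, host: str, remote_command: str,
               timeout_s: float = 30.0) -> str:
    """Run the command remotely and return its standard output.

    Raises subprocess.CalledProcessError on a non-zero exit status and
    subprocess.TimeoutExpired if the call takes longer than `timeout_s`.
    """
    result = subprocess.run(
        build_ssh_command(user, host, remote_command),
        capture_output=True,
        text=True,
        timeout=timeout_s,
        check=True,
    )
    return result.stdout
```

Passing the arguments as a list avoids local shell interpretation; the remote command string is still interpreted by the shell on the remote node.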
Fig. 5 shows one possible deployment of the disaster-tolerant backup device provided by an embodiment of the present invention. The invention is illustrated only with the device deployed independently of the host node and the backup node; this does not limit the invention. As shown in Fig. 5, the host node obtains data from the external source, and the backup node replicates data from the host node to perform data synchronization. The disaster-tolerant backup device is deployed independently of both nodes, is connected to each of them over the network, and can access each of them over SSH in order to control the backup node's replication of data from the host node.
For a better understanding of the disaster-tolerant backup method, server, and system provided by the embodiments of the present invention, a specific example is described below.
NetApp SnapMirror software offers proven efficiency, simplicity, and reasonable cost, and has therefore long been the preferred technology for replication and disaster recovery in NetApp storage environments. The backup node and host node of a disaster-tolerant backup system can be built with NetApp SnapMirror, which Network Appliance built on top of the Data ONTAP operating system. The disaster-tolerant backup device monitors and remotely controls the backup node and host node in NetApp SnapMirror, and achieves remote data synchronization according to the control structure diagram shown in Fig. 6. In this embodiment, NetApp SnapMirror is deployed on two nodes in Beijing, node A and node B, which serve as the host node and the backup node respectively.
First, the relevant parameters of the disaster-tolerant backup device are checked. If the parameter check passes, the state of node B is checked to see whether it is the data-synchronization-interrupted (broken_off) state. If node B is not in the broken_off state, an error is reported and the procedure exits without executing the following operations.
If node B is in the broken_off state, it meets the requirement for continuing to synchronize data, and the next check is whether the difference between the time of the last data update on node A and the system time is less than 15 minutes. If node A's last update is more than 15 minutes behind the system time, node A is polled a further 20 times at 30-second intervals. If after these ten minutes node A's last update is still not within 15 minutes of the current time, an error is reported and the procedure exits.
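The retry on node A (20 polls, 30 seconds apart, ten minutes in total) can be sketched as follows. The threshold, retry count, and interval are the values given in the example; the function name and the injected `host_last_update`, `now`, and `sleep` callables are hypothetical.

```python
from typing import Callable


def wait_until_fresh(
    host_last_update: Callable[[], float],
    now: Callable[[], float],
    sleep: Callable[[float], None],
    threshold_s: float = 15 * 60,
    retries: int = 20,
    interval_s: float = 30.0,
) -> bool:
    """Return True as soon as the host's last update is within the threshold.

    Checks once immediately, then up to `retries` more times with
    `interval_s` seconds between checks. Returns False if the data is
    still stale after the last retry.
    """
    if now() - host_last_update() < threshold_s:
        return True
    for _ in range(retries):
        sleep(interval_s)
        if now() - host_last_update() < threshold_s:
            return True
    return False
```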
If the difference between the time of the last data update on node A and the system time is less than 15 minutes, either at the first check or after repeated polling of node A, node A has obtained the latest external data. The subsequent steps are then executed: node B is started and begins synchronizing data from node A (that is, SnapMirror replication starts).
A timer is started when node B is started, and the total time node B has spent on data synchronization is checked against 40 minutes. If it exceeds 40 minutes, some module of the system may have hung during synchronization, and an error is reported and the procedure exits.
If B node carried out the overall time of data syn-chronization less than 40 minutes, proceed subsequent step, poll is examined Look into the state of B node and A nodes.Judge B node and A nodes be whether " snapmirrored&transferring " state (i.e. B node completes (snapmirrored) state in data syn-chronization and A nodes are in data transfer (transferring) shape State), if it is, wait the state for going to check A nodes and B node for 1 minute again, this be in inspection B node and A node states it Before, need whether the overall time of data syn-chronization again to B node was once judged again less than 40 minutes, only when B is saved The state of B node and A node is just checked again for when the overall data lock in time of point was still less than 40 minutes.A nodes are in Transferring states refer to that A nodes are currently being received external data, if now stopping B node, it is possible to will not be successful, In order to ensure successfully to stop B node, the step of A nodes are checked in the embodiment, is with the addition of, this mode is to present invention side Further supplementing for method and device related embodiment, is not intended as the restriction of the present invention.
If B node and A nodes are not " snapmirrored&transferring " states, B node is determined whether Whether it is that (i.e. B node is in snapmirrored states and A nodes are in for " snapmirrored&idle " state with A nodes Idle (idle) state).In idle states, A nodes refer to that A nodes neither receive external data, while also not passing to B node Transmission of data, if now stop B node, situation about will not stop.
If the state of B node and A nodes is not " snapmirrored&idle " state, determine whether B node and Whether A nodes are that (i.e. B node is in data syn-chronization interrupt status and A nodes are in the free time for " broken_off&idle " state State), if B node and A nodes are " broken_off&idle " state, the not actuated success of B node are described, need to open again Dynamic B node is carrying out data syn-chronization.In the specific implementation, it is also possible to only B node is determined whether, when B node is data During sync break state, illustrate that B node fails startup, need to restart B node and carry out synchrodata.
If node B and node A are in the "snapmirrored&idle" state, data synchronization has finished. It is then further checked whether the difference between the time of the last data synchronization on node B and the time of the last data update on node A is less than 5 minutes. If it is less than 5 minutes, data synchronization succeeded; if it is greater than 5 minutes, the update command is executed so that node B continues to synchronize data from node A. When the difference between the time of the last data synchronization on node B and the time of the last data update on node A is less than 5 minutes, node B is generally considered to have successfully synchronized the latest data on node A.
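The polling decisions described above can be summarized as a single decision function. The following is a minimal Python sketch under stated assumptions, not the patented implementation: the function name `decide`, the `Action` enum, and the use of plain strings for node states are hypothetical. The text does not say what happens at exactly 40 or exactly 5 minutes, nor for state pairs it does not mention, so the handling of those cases here is an assumption.

```python
from enum import Enum

SYNC_TIMEOUT_MIN = 40    # total synchronization time limit given in the text
FRESHNESS_LIMIT_MIN = 5  # allowed gap between B's last sync and A's last update


class Action(Enum):
    ALARM = "alarm"          # time limit reached (boundary handling is an assumption)
    WAIT = "wait"            # wait 1 minute, then poll both nodes again
    RESTART_SYNC = "restart" # node B failed to start; start it again
    UPDATE = "update"        # run the update command so B keeps synchronizing
    STOP_BACKUP = "stop"     # synchronization succeeded; B can be stopped
    UNKNOWN = "unknown"      # state pair not covered by this passage (assumption)


def decide(b_state, a_state, elapsed_min, lag_min):
    """Pick the next action for one polling round.

    b_state, a_state: states of node B and node A, e.g. "snapmirrored", "idle".
    elapsed_min: total minutes node B has spent synchronizing so far.
    lag_min: minutes between B's last sync and A's last data update.
    """
    # The time limit is re-checked before every state check.
    if elapsed_min >= SYNC_TIMEOUT_MIN:
        return Action.ALARM
    if b_state == "snapmirrored" and a_state == "transferring":
        return Action.WAIT
    if b_state == "snapmirrored" and a_state == "idle":
        if lag_min < FRESHNESS_LIMIT_MIN:
            return Action.STOP_BACKUP
        return Action.UPDATE
    if b_state == "broken_off" and a_state == "idle":
        return Action.RESTART_SYNC
    return Action.UNKNOWN
```

A caller would invoke `decide` once per polling round, sleeping for one minute whenever it returns `Action.WAIT`. Keeping the decision free of I/O makes each branch easy to check without access to real nodes.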
By combining a disaster-tolerance application server with NetApp SnapMirror software, the embodiments of the present invention realize remote control by the management center over each controlled end node, with a standardized and automated process, and automatically monitor the data synchronization state of the NetApp SnapMirror software. This achieves efficient, automated remote data synchronization, makes remote SSH data interaction controllable, and realizes the automated execution of remote commands.
Those skilled in the art should understand that embodiments of the present invention may be provided as a method, a system, or a computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Moreover, the present invention may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) containing computer-usable program code.
The present invention is described with reference to flowcharts and/or block diagrams of methods, devices (systems), and computer program products according to embodiments of the present invention. It should be understood that each flow and/or block in the flowcharts and/or block diagrams, and combinations of flows and/or blocks in the flowcharts and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general-purpose computer, a special-purpose computer, an embedded processor, or another programmable data processing device to produce a machine, such that the instructions executed by the processor of the computer or other programmable data processing device produce a means for implementing the functions specified in one or more flows of the flowchart and/or one or more blocks of the block diagram.
These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing device to operate in a specific manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including an instruction means, the instruction means implementing the functions specified in one or more flows of the flowchart and/or one or more blocks of the block diagram.
These computer program instructions may also be loaded onto a computer or other programmable data processing device, such that a series of operational steps is executed on the computer or other programmable device to produce a computer-implemented process, whereby the instructions executed on the computer or other programmable device provide steps for implementing the functions specified in one or more flows of the flowchart and/or one or more blocks of the block diagram.
Specific embodiments are used herein to explain the principles and implementations of the present invention. The description of the above embodiments is intended only to help in understanding the method of the present invention and its core idea. For those of ordinary skill in the art, changes may be made to the specific implementations and the scope of application in accordance with the idea of the present invention. In summary, the content of this specification should not be construed as limiting the present invention.

Claims (19)

1. A disaster-tolerant backup method, characterized in that the disaster-tolerant backup method comprises:
determining whether a backup node is in a data synchronization interrupted state;
when the backup node is in the data synchronization interrupted state, determining whether the time difference between the time of the last data update on a host node and the system time is less than a first time threshold;
if the time difference between the time of the last data update on the host node and the system time is less than the first time threshold, starting the backup node to synchronize data from the host node; and
polling the backup node and, when the backup node has completed data synchronization, stopping the data synchronization of the backup node.
2. The disaster-tolerant backup method according to claim 1, characterized in that, when the backup node has completed data synchronization, if the backup node cannot be stopped, the disaster-tolerant backup method further comprises: polling the host node for a preset time and, if the host node stops receiving external data within the preset time, stopping the data synchronization of the backup node.
3. The disaster-tolerant backup method according to claim 2, characterized in that the disaster-tolerant backup method further comprises: if the host node does not stop receiving external data within the preset time, sending alarm information to report a disaster-tolerance error.
4. The disaster-tolerant backup method according to claim 1 or 2, characterized in that, when the backup node has completed data synchronization, the disaster-tolerant backup method further comprises:
determining whether the time difference between the time of the last data synchronization on the backup node and the time of the last data update on the host node is less than a second time threshold; and
when the time difference between the time of the last data synchronization on the backup node and the time of the last data update on the host node is less than the second time threshold, stopping the data synchronization of the backup node.
5. The disaster-tolerant backup method according to claim 1, characterized in that, after the backup node is started to synchronize data from the host node because the time difference between the time of the last data update on the host node and the system time is less than the first time threshold, the disaster-tolerant backup method further comprises: determining whether the backup node is still in the data synchronization interrupted state and, if the backup node is still in the data synchronization interrupted state, restarting the backup node to perform data synchronization.
6. The disaster-tolerant backup method according to claim 1 or 5, characterized in that the disaster-tolerant backup method further comprises: starting timing after the backup node is started, and detecting whether the total time the backup node has currently spent on data synchronization is greater than a third time threshold; if so, sending alarm information to report a disaster-tolerance error.
7. The disaster-tolerant backup method according to claim 1, characterized in that the disaster-tolerant backup method further comprises: when the result of determining whether the backup node is in the data synchronization interrupted state is negative, sending alarm information to report a disaster-tolerance error.
8. The disaster-tolerant backup method according to claim 4, characterized in that the disaster-tolerant backup method further comprises: when the time difference between the time of the last data synchronization on the backup node and the time of the last data update on the host node is greater than the second time threshold, causing the backup node to continue synchronizing data.
9. A disaster-tolerant backup device, characterized in that the disaster-tolerant backup device comprises:
a backup node access unit, configured to determine whether a backup node is in a data synchronization interrupted state;
a host node access unit, configured to determine whether the time difference between the time of the last data update on a host node and the system time is less than a first time threshold;
a backup node start unit, configured to start the backup node to synchronize data from the host node when the time difference between the time of the last data update on the host node and the system time is less than the first time threshold; and
a polling system unit, configured to poll the backup node and, when the backup node has completed data synchronization, stop the data synchronization of the backup node.
10. The disaster-tolerant backup device according to claim 9, characterized in that, when the backup node has completed data synchronization, if the backup node cannot be stopped, the polling system unit is further configured to: poll the host node for a preset time and, if the host node stops receiving external data within the preset time, stop the data synchronization of the backup node.
11. The disaster-tolerant backup device according to claim 10, characterized in that the disaster-tolerant backup device further comprises: an error reporting unit, configured to send alarm information to report a disaster-tolerance error when the host node does not stop receiving external data within the preset time.
12. The disaster-tolerant backup device according to claim 9 or 10, characterized in that the polling system unit further comprises:
a time difference judgment module, configured to determine whether the time difference between the time of the last data synchronization on the backup node and the time of the last data update on the host node is less than a second time threshold; and
a node interruption module, configured to stop the data synchronization of the backup node when the time difference between the time of the last data synchronization on the backup node and the time of the last data update on the host node is less than the second time threshold.
13. The disaster-tolerant backup device according to claim 9, characterized in that the polling system unit further comprises:
a judgment module, configured to determine, after the backup node start unit starts the backup node, whether the backup node is still in the data synchronization interrupted state; and
a restart module, configured to restart the backup node to perform data synchronization when the backup node is still in the data synchronization interrupted state.
14. The disaster-tolerant backup device according to claim 11 or 13, characterized in that the disaster-tolerant backup device further comprises a timing unit, configured to start timing after the backup node is started and to detect whether the total time the backup node has currently spent on data synchronization is greater than a third time threshold; when the total time the backup node has currently spent on data synchronization is greater than the third time threshold, the error reporting unit sends alarm information to report a disaster-tolerance error.
15. The disaster-tolerant backup device according to claim 11, characterized in that the error reporting unit is further configured to send alarm information to report a disaster-tolerance error when the output of the backup node access unit is negative.
16. The disaster-tolerant backup device according to claim 12, characterized in that the polling system unit further comprises: an update module, configured to cause the backup node to continue synchronizing data when the output of the time difference judgment module is negative.
17. The disaster-tolerant backup device according to claim 9, characterized in that the disaster-tolerant backup device is arranged in the backup node and remotely accesses the host node via SSH, where SSH denotes the Secure Shell protocol.
18. The disaster-tolerant backup device according to claim 9, characterized in that the disaster-tolerant backup device is arranged in the host node and remotely accesses the backup node via SSH.
19. The disaster-tolerant backup device according to claim 9, characterized in that the disaster-tolerant backup device is arranged independently, outside the backup node and the host node.
CN201610943435.9A 2016-10-26 2016-10-26 A kind of disaster-tolerant backup method and device Active CN106502835B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610943435.9A CN106502835B (en) 2016-10-26 2016-10-26 A kind of disaster-tolerant backup method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610943435.9A CN106502835B (en) 2016-10-26 2016-10-26 A kind of disaster-tolerant backup method and device

Publications (2)

Publication Number Publication Date
CN106502835A true CN106502835A (en) 2017-03-15
CN106502835B CN106502835B (en) 2018-10-16

Family

ID=58321821

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610943435.9A Active CN106502835B (en) 2016-10-26 2016-10-26 A kind of disaster-tolerant backup method and device

Country Status (1)

Country Link
CN (1) CN106502835B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107995282A (en) * 2017-11-29 2018-05-04 北京悦畅科技有限公司 Method of data synchronization, device, system and server
CN108632067A (en) * 2017-03-21 2018-10-09 华为技术有限公司 Disaster tolerance dispositions method, apparatus and system
CN109005246A (en) * 2018-09-12 2018-12-14 北京中电普华信息技术有限公司 A kind of synchronous method of data, apparatus and system
CN110830591A (en) * 2019-11-22 2020-02-21 北京浪潮数据技术有限公司 Information synchronization method, system, electronic equipment and storage medium
CN111176888A (en) * 2018-11-13 2020-05-19 浙江宇视科技有限公司 Cloud storage disaster recovery method, device and system
CN112506707A (en) * 2020-12-10 2021-03-16 深信服科技股份有限公司 Disaster recovery method, system, equipment and computer readable storage medium
CN113849328A (en) * 2018-05-22 2021-12-28 华为技术有限公司 Management method and device of disaster recovery system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101388759A (en) * 2007-09-10 2009-03-18 中兴通讯股份有限公司 Method and system for implementing from asynchronous copy to synchronous copy by data
CN103902617A (en) * 2012-12-28 2014-07-02 华为技术有限公司 Distributed database synchronization method and system
CN104504062A (en) * 2014-12-22 2015-04-08 浙江宇视科技有限公司 Data synchronization method and device of primary and standby databases
US20150339366A1 (en) * 2013-06-25 2015-11-26 International Business Machines Corporation Replication for on-line hot-standby database
CN105138441A (en) * 2015-06-30 2015-12-09 中标软件有限公司 HAC system, warning method based on HAC system and warning system based on HAC system

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108632067A (en) * 2017-03-21 2018-10-09 华为技术有限公司 Disaster tolerance dispositions method, apparatus and system
CN108632067B (en) * 2017-03-21 2020-12-08 华为技术有限公司 Disaster recovery deployment method, device and system
US11307943B2 (en) 2017-03-21 2022-04-19 Huawei Technologies Co., Ltd. Disaster recovery deployment method, apparatus, and system
CN107995282A (en) * 2017-11-29 2018-05-04 北京悦畅科技有限公司 Method of data synchronization, device, system and server
CN113849328B (en) * 2018-05-22 2024-04-12 华为技术有限公司 Management method and device of disaster recovery system
CN113849328A (en) * 2018-05-22 2021-12-28 华为技术有限公司 Management method and device of disaster recovery system
CN109005246A (en) * 2018-09-12 2018-12-14 北京中电普华信息技术有限公司 A kind of synchronous method of data, apparatus and system
CN109005246B (en) * 2018-09-12 2021-10-01 北京国电通网络技术有限公司 Data synchronization method, device and system
CN111176888B (en) * 2018-11-13 2023-09-15 浙江宇视科技有限公司 Disaster recovery method, device and system for cloud storage
CN111176888A (en) * 2018-11-13 2020-05-19 浙江宇视科技有限公司 Cloud storage disaster recovery method, device and system
CN110830591A (en) * 2019-11-22 2020-02-21 北京浪潮数据技术有限公司 Information synchronization method, system, electronic equipment and storage medium
CN110830591B (en) * 2019-11-22 2022-07-29 北京浪潮数据技术有限公司 Information synchronization method, system, electronic equipment and storage medium
CN112506707B (en) * 2020-12-10 2023-12-29 深信服科技股份有限公司 Disaster recovery returning method, system, equipment and computer readable storage medium
CN112506707A (en) * 2020-12-10 2021-03-16 深信服科技股份有限公司 Disaster recovery method, system, equipment and computer readable storage medium

Also Published As

Publication number Publication date
CN106502835B (en) 2018-10-16

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant