CN107241430A - A kind of enterprise-level disaster tolerance system and disaster tolerant control method based on distributed storage - Google Patents

A kind of enterprise-level disaster tolerance system and disaster tolerant control method based on distributed storage Download PDF

Info

Publication number
CN107241430A
CN107241430A CN201710533133.9A CN201710533133A CN107241430A CN 107241430 A CN107241430 A CN 107241430A CN 201710533133 A CN201710533133 A CN 201710533133A CN 107241430 A CN107241430 A CN 107241430A
Authority
CN
China
Prior art keywords
disaster
data center
data
information
strange land
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710533133.9A
Other languages
Chinese (zh)
Inventor
王继业
魏晓菁
曾楠
王晋雄
郝悍勇
李云
孙磊
王思宁
冷曼
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
State Grid Corp of China SGCC
Beijing Guodiantong Network Technology Co Ltd
Original Assignee
State Grid Corp of China SGCC
Beijing Guodiantong Network Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by State Grid Corp of China SGCC, Beijing Guodiantong Network Technology Co Ltd filed Critical State Grid Corp of China SGCC
Priority to CN201710533133.9A priority Critical patent/CN107241430A/en
Publication of CN107241430A publication Critical patent/CN107241430A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1097Protocols in which an application is distributed across nodes in the network for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0654Management of faults, events, alarms or notifications using network fault recovery
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1095Replication or mirroring of data, e.g. scheduling or transport for data synchronisation between network nodes

Abstract

The invention discloses a kind of enterprise-level disaster tolerance system based on distributed storage both disaster tolerant control method, the system includes:Primary data center and with the strange land data center with primary data center identical data;Database asynchronous replication module, is consistent for the database data by asynchronous replication mode Shi Liangge data centers;Distributed storage mirror module, the asynchronous replication for realizing distributed storage data in Liang Ge data centers by mirror image mechanism ensures the uniformity of asynchronous data by log mechanism;Disaster monitoring modular, for monitoring the information of primary data center and sending disaster information to disaster recovery module after disaster generation;Disaster recovery module, for receiving disaster information and carrying out data recovery to primary data center in Internet using strange land data center.The enterprise-level disaster tolerance system and disaster tolerant control method based on distributed storage, it is possible to increase the security of system data and be able to maintain that system is effectively run.

Description

A kind of enterprise-level disaster tolerance system and disaster tolerant control method based on distributed storage
Technical field
The present invention relates to system disaster tolerance correlative technology field, a kind of enterprise-level disaster tolerance based on distributed storage is particularly related to System and disaster tolerant control method.
Background technology
Distributed memory system is widely used in IPTV, video monitoring, search etc. and needs to use magnanimity to deposit The field of storage.Increase for business data storage and virtualization applications demand magnanimity under cloud computing and big data trend, based on industry Boundary mark is accurate, using full distributed framework there is provided high-performance, highly reliable, high extension, disaster tolerance distributed memory system, Neng Gouyou Effect solves the IT business demand under complex environment.
Distributed memory system is generally made up of rack, and multiple machine frames are placed with each rack, conduct is each placed with The storage server of memory node.In the prior art, to ensure the security of data storage, it will usually deposited using certain data Storage strategy makes a backup store to same data in distributed memory system but is that these measures are difficult to when disaster tolerance occurs for system Ensure the security of data.In particular in enterprise-level disaster tolerance system, it usually needs up to ten million investments, and it is only capable of protection A small amount of core data.And cloud data center environment common at present is no longer applicable enterprise-level disaster tolerance system, it is difficult to ensure enterprise's number According to safety.
During the application is realized, inventor has found at least there is problems with the prior art:Current disaster tolerance System especially enterprise-level disaster tolerance system is difficult to effectively realize security and guarantee of the system data in disaster generating process The validity of system service.
The content of the invention
In view of this, it is an object of the invention to propose a kind of enterprise-level disaster tolerance system based on distributed storage and disaster tolerance Control method, it is possible to increase the security of system data and be able to maintain that system is effectively run.
A kind of enterprise-level disaster tolerance system based on distributed storage provided based on the above-mentioned purpose present invention, including:
Primary data center, storage and inquiry service for realizing data under normal circumstances;
Strange land data center, for as the corresponding Remote Switched Port Analyzer of primary data center, having in the strange land data center With the identical data of primary data center;
Database asynchronous replication module, for causing primary data center and strange land data center by way of asynchronous replication Both database datas are consistent;
Distributed storage mirror module, for realizing primary data center with being distributed in the data center of strange land by mirror image mechanism The asynchronous replication of formula data storage, passes through the uniformity of data during log mechanism guarantee asynchronous replication;
Disaster monitoring modular, the relevant information for monitoring primary data center, and to disaster recovery after disaster generation Module sends disaster information;
Disaster recovery module, the disaster information for receiving the transmission of disaster monitoring modular, using strange land data center in net Network layers carry out data recovery to primary data center.
Optionally, carried out data transmission between the primary data center and strange land data center using dedicated data line.
Optionally, the disaster monitoring modular is additionally operable to according to default rule and related letter of the algorithm to primary data center Breath is handled, and user or keeper is given a warning or prompt message before disaster generation.
Optionally, the disaster monitoring modular is additionally operable to after disaster occurs, detect and judge the phase in primary data center Close whether information can use, if unavailable ability sends disaster information to disaster recovery module;
The disaster recovery module is additionally operable to send disaster recovery condition information to disaster monitoring modular;
The disaster monitoring modular is additionally operable to the disaster recovery condition information sent according to disaster recovery module, by correlation-like State feeds back to user or attendant.
Optionally, the system also include network handover module, for after disaster occurs by the access of application and service Address is switched to strange land data center.
Present invention also provides a kind of enterprise-level disaster tolerant control method based on distributed storage, including:
Monitor related data and information in primary data center and judge whether primary data center data are abnormal;
If monitoring data exception, disaster recovery module is called to be at activating upstate;
Verify the uniformity of strange land data center and primary data center distributed storage data, and strange land data center with The uniformity of primary data center database;
Network switching is carried out, the reference address of application and service is switched to strange land data center;
Data recovery is carried out to primary data center using disaster recovery module, rear return information is successfully recovered to user or dimension Shield personnel;
According to user instruction or feedback information is successfully recovered, the network address is again switched into primary data center.
Optionally, it is described to judge whether abnormal step also includes primary data center data:
According to preset strategy or algorithm, judge whether data are abnormal;
If data exception, further determine whether to cause data or service unavailable, if so, then calling disaster recovery mould Block;Otherwise, abnormal information is fed back into user or attendant.
Optionally, it is described also to include the step of call disaster recovery module:
Disaster recovery module is persistently called according to preset times, until calling success, otherwise, malloc failure malloc information is fed back.
Optionally, verification uniformity or carry out network switching during, if find do not meet uniformity or Network switching fails, then returns to corresponding failure information to user or attendant.
From the above it can be seen that enterprise-level disaster tolerance system based on distributed storage that the present invention is provided and disaster tolerance control Method processed is by setting the remote mirroring data center of a primary data center, i.e. strange land data center so that even if local number Occurs disaster according to center, strange land data also ensure that safe enough.Deposited by database asynchronous replication module and distribution Storage mirror module causes database data and distributed storage data in Liang Ge data centers to keep real-time update, namely Being consistent property of data.Whether occur disaster by disaster monitoring module monitors primary data center and lead in time after disaster generation Know disaster recovery module so that disaster recovery module can enter line number based on the data in the data center of strange land to primary data center According to recovery.So so that whole system has higher redundancy ability, can ensure the safety and integrality of data.In addition, institute State system also by after disaster occurs and before data recovery by the access of the corresponding application and service of primary data center Location is switched to strange land data center so that system can at once based on strange land data center realize system normal use access and Related Data Services.Therefore, herein described enterprise-level disaster tolerance system and disaster tolerant control method based on distributed storage can Improve the security of system data and be able to maintain that system is effectively run.
Brief description of the drawings
The structural representation of the one embodiment for the enterprise-level disaster tolerance system based on distributed storage that Fig. 1 provides for the present invention Figure;
The structure of another embodiment of the enterprise-level disaster tolerance system based on distributed storage that Fig. 2 provides for the present invention is shown It is intended to;
The design principle of the one embodiment for the enterprise-level disaster tolerance system based on distributed storage that Fig. 3 provides for the present invention Schematic diagram;
The distributed storage distributed mirror image replicating principle schematic diagram that Fig. 4 provides for the present invention;
Fig. 5 accesses principle schematic for the normal data that the present invention is provided;
Data access principle schematic during the disaster recovery that Fig. 6 provides for the present invention;
The flow of the one embodiment for the enterprise-level disaster tolerant control method based on distributed storage that Fig. 7 provides for the present invention Figure;
The distributed storage data trnascription read-write theory design diagram that Fig. 8 provides for the present invention.
Embodiment
For the object, technical solutions and advantages of the present invention are more clearly understood, below in conjunction with specific embodiment, and reference Accompanying drawing, the present invention is described in more detail.
It should be noted that all statements for using " first " and " second " are for differentiation two in the embodiment of the present invention The entity of individual same names non-equal or the parameter of non-equal, it is seen that " first " " second " should not only for the convenience of statement The restriction to the embodiment of the present invention is interpreted as, subsequent embodiment no longer illustrates one by one to this.
The problem of being existed based on current enterprise level disaster tolerance system, enterprise-level disaster tolerance technology of the application based on distributed storage Devise a kind of brand-new thinking.In data storage, the data to distributed memory system carry out Remote Switched Port Analyzer, to ensure number According to real-time.Simultaneously in operation layer, ensure the real-time synchronization of business datum by duplicating remote data.When generation data center When abnormal, the situation according to business datum that disaster monitoring system can be automated is quick by the operation system of primary data center Switching is completed in disaster-tolerant backup data center's pull-up, and in Internet.
Specifically, herein described enterprise-level disaster tolerance system is generally divided into primary data center and strange land data center, i.e., it is standby Calamity data center or remote mirroring data center.Carried out data transmission between Liang Ge data centers by special line.The application bottom Layer distributed storage ensures the consistent of distributed storage data between Liang Ge data centers by mirror image (Mirroring) mechanism Property;Database ensures the uniformity of database data between Liang Ge data centers by asynchronous system.Mould is monitored by disaster Block carrys out all situations at monitoring data center, it is ensured that when occurring disaster, the very first time lets the user know that, and notifies disaster extensive Multiple module, carries out disaster recovery, in the case of user's unaware, is switched in Internet;It ensure that the continuity of business.
It is shown referring to Figures 1 and 2, two of the enterprise-level disaster tolerance system based on distributed storage provided for the present invention The structural representation of embodiment.The enterprise-level disaster tolerance system based on distributed storage includes:
Primary data center 1, storage and inquiry service for realizing data under normal circumstances;That is, primary data center It is the important component in local system for local data service center, when not having disaster, all data are deposited Storage, service are realized in primary data center 1.
Strange land data center 2, for as the corresponding Remote Switched Port Analyzer of primary data center 1, having in the strange land data center 2 Have and the identical data of primary data center 1;Liang Ge data centers will not be had influence on when occurring for disaster simultaneously, generally will Strange land data center 2 is arranged on strange land, it is ensured that Liang Ge data centers have certain independence.
Database asynchronous replication module 3, for being caused by way of asynchronous replication in primary data center 1 and strange land data Both database datas of the heart 2 are consistent;Wherein, asynchronous replication uses cluster copy mode, based on binary log (bin Log) and it is globally unique numbering (GTID, Global Transaction ID), only successful transaction can just write Bin log, and the duplication of Slave nodes does not interfere with the issued transaction of Master nodes;Tieed up between Master and Slave nodes Shield heart hop-information, when Master nodes have renewal, can to Slave nodes send notify, provide newest bin log and GTID, Slave node processes are updated according to latest news to local data base.
Distributed storage mirror module 4, for realizing primary data center and in the data center of strange land points by mirror image mechanism The asynchronous replication of cloth data storage, passes through the uniformity of data during log mechanism guarantee asynchronous replication;Shown in reference picture 4, it is The distributed storage distributed mirror image replicating principle schematic diagram that the present invention is provided.Mirror image (Mirroring) mechanism can make RBD Images by asynchronous replication, is come between two clusters (Cluster) using RBD Image daily record (Journaling) mechanism Ensure data consistency during asynchronous replication.
Disaster monitoring modular 5, the relevant information for monitoring primary data center, and to disaster recovery after disaster generation Module sends disaster information;Wherein, disaster monitoring modular 5 could be arranged to disaster monitor client, can monitor in master data Application message, database information, distributed storage information, the network information, physical context information, operation system information of the heart 1 etc., According to the regular and corresponding algorithm pre-set, alerted or pointed out before disaster generation, after disaster confirms, If primary data center is unavailable, disaster information is sent to disaster recovery module 6.And receive the return of disaster recovery module 6 Message and related news are returned into user or attendant.
Disaster recovery module 6, the disaster information for receiving the transmission of disaster monitoring modular 5, is existed using strange land data center 2 Internet carries out data recovery to primary data center 1.Wherein, the disaster recovery module 6 could be arranged to when occurring disaster, Disaster monitor client will send disaster recovery instruction, and monitoring data, and disaster recovery client is according to monitoring data and calamity Difficult recovery policy is recovered, including switching distributed storage, switch data storehouse, switching application layer, and these recoveries are all in net Network layers are completed.
Shown in reference picture 3, one embodiment of the enterprise-level disaster tolerance system based on distributed storage provided for the present invention Design principle schematic diagram.As seen from the figure, the data in Liang Ge data centers are realized by asynchronous replication and mirror image mechanism The uniformity of two aspects of database and distributed data.Wherein, OSD (Object-based Storage Device) is base In the object storage device of object storage technology.
From above-described embodiment, the herein described enterprise-level disaster tolerance system based on distributed storage is by setting one The remote mirroring data center of primary data center, i.e. strange land data center so that even if disaster, strange land occur for local data center Data also ensure that safe enough.Two numbers are caused by database asynchronous replication module and distributed storage mirror module Real-time update, namely being consistent property of data are kept according to the database data in center and distributed storage data.Pass through Whether disaster monitoring module monitors primary data center occurs disaster and notifies disaster recovery module in time after disaster generation so that Disaster recovery module can carry out data recovery based on the data in the data center of strange land to primary data center.So so that whole Individual system has higher redundancy ability, can ensure the safety and integrality of data.In addition, the system is also by disaster Reference address after generation and before data recovery by the corresponding application and service of primary data center is switched in the data of strange land The heart so that the normal use that system can realize system based on strange land data center at once is accessed and related Data Services.Therefore, The herein described enterprise-level disaster tolerance system based on distributed storage can improve the security of system data and be able to maintain that System is effectively run.
In the application some optional embodiments, using special between the primary data center 1 and strange land data center 2 Data wire carries out data transmission.In such manner, it is possible to ensure the stability and security of data transfer.
In the application some optional embodiments, the disaster monitoring modular is additionally operable to according to default rule and algorithm Relevant information to primary data center is handled, and user or keeper is given a warning or pointed out letter before disaster generation Breath.That is, the prediction of some disaster relevant informations or signal, the disaster that sensed in advance may occur, so as to point out can be passed through User or attendant prepare in advance.
In the application some optional embodiments, the disaster monitoring modular 5 is additionally operable to after disaster occurs, and detection is simultaneously Judge whether the relevant information in primary data center 1 can use, if unavailable ability sends disaster information to disaster recovery module; That is, after having the generation of some disasters not the data that have influence in primary data center or related service in use, can be with Disaster recovery is not needed.
The disaster recovery module is additionally operable to send disaster recovery condition information to disaster monitoring modular;
The disaster monitoring modular is additionally operable to the disaster recovery condition information sent according to disaster recovery module, by correlation-like State feeds back to user or attendant.So, user or attendant can be caused to know data recovery situation, be conducive to Follow-up associative operation.
In the application some optional embodiments, system also includes network handover module, for will after disaster occurs The reference address of application and service is switched to strange land data center.So so that during disaster recovery, user can also Carry out normal data access service.
Shown in reference picture 5, principle schematic is accessed for the normal data that the present invention is provided.Before not occurring disaster, application Kimonos normal access path of doing honest work is as follows:User is applied or serviced by network access, and this application or service are in master data Center portion is affixed one's name to.Then application accesses database, obtains all data that user needs to access, and this database is deployed in master data The heart.Using distributed storage data are accessed, such as block storage, object are stored, and secondary distributed storage is also to be deployed in primary data center. The service of all primary data centers is all good for use.
Shown in reference picture 6, data access principle schematic during the disaster recovery provided for the present invention.Occur in disaster Afterwards, the normal access path of application and service has been switched to strange land data center, and access path is as follows:User passes through network access Using or service, this application or service in strange land data center deployment.Then application accesses database, and obtaining user needs All data accessed, this database is deployed in strange land data center.This application access distributed storage data, such as block storage, Object is stored, and secondary distributed storage is also to be deployed in strange land data center.In the data and master data of all strange land data centers The data of the heart are consistent.
Shown in reference picture 7, a reality of the enterprise-level disaster tolerant control method based on distributed storage provided for the present invention Apply the flow chart of example.The enterprise-level disaster tolerant control method based on distributed storage includes:
Monitor related data and information in primary data center and judge whether primary data center data are abnormal;
If monitoring data exception, disaster recovery module is called to be at activating upstate;
Verify the uniformity of strange land data center and primary data center distributed storage data, and strange land data center with The uniformity of primary data center database;
Network switching is carried out, the reference address of application and service is switched to strange land data center;
Data recovery is carried out to primary data center using disaster recovery module, rear return information is successfully recovered to user or dimension Shield personnel;
According to user instruction or feedback information is successfully recovered, the network address is again switched into primary data center.
Optionally, it is described to judge whether abnormal step also includes primary data center data:
According to preset strategy or algorithm, judge whether data are abnormal;
If data exception, further determine whether to cause data or service unavailable, if so, then calling disaster recovery mould Block;Otherwise, abnormal information is fed back into user or attendant.
Optionally, it is described also to include the step of call disaster recovery module:Disaster recovery is persistently called according to preset times Module, until calling success, otherwise, feeds back malloc failure malloc information.
Optionally, verification uniformity or carry out network switching during, if find do not meet uniformity or Network switching fails, then returns to corresponding failure information to user or attendant.
In the application other optional embodiments, the disaster control method or handling process are as follows:
(1) user or attendant call monitor client, in real time monitoring primary data center service.
(2) monitor client is monitored in real time to primary data center, and monitoring content includes network, storage, using, physics The information such as environment.
(3) according to corresponding strategy or algorithm, whether Monitoring Data occurs exception, if no exceptions, continues Monitoring, if exception occurs for data, causes environment or services unavailable, then trigger next step flow, call disaster recovery mould Block.
(4) if calling disaster recovery module to fail, progress re-calls disaster recovery module, if calling success, Then enter next step flow.
(5) call after the success of disaster recovery module, disaster recovery module prepare before disaster recovery.
(6) strange land distributed storage uniformity is verified, if verification failure, failure information is returned to client or use Family, if verified successfully, into next step.
(7) calibration database uniformity, if verification failure, returns to failure information to client or user, if Verify successfully, then into next step.
(8) network switching, when distributed storage and the success of database consistency desired result, then carries out network layer handoff, including Reference address, database address, the distributed storage addresses of application.If handoff failure, failure information is returned to client Or user, if switched successfully, user accesses has been transferred into strange land data center using unaware.
(9) after disaster recovery success, monitor client or user are return success.
Primary data center is switched back to finally according to user's control or automatically by strange land data center.
Pass through above-mentioned control or flow so that primary data center can not only be restored in time after disaster occurs, and And during restoration can also the normal access based on strange land data center digital display data.
From above-described embodiment, the application at least includes herein below:(1) the enterprise-level storage of distributed storage is more secondary Proprietary design (3) distributed storage of the enterprise-level disaster tolerance technology data duplication network of present mechanism design (2) distributed storage Enterprise-level disaster tolerance technology data duplication interrupt after resume design (4) distributed storage the daily record of enterprise-level disaster tolerance technology spy Property, to act on the enterprise-level disaster tolerance technology daily record characteristic of each affairs log enable (5) distributed storage in storage volume Storage volume, passes through the storage replication group process replication storage volume.
In the application some optional embodiments, distributed storage distributed mirror image process is as follows:(1) IO is stored into block Storage volume daily record;(2) service of storage replication group is synchronized, and storage volume log information in local distributed storage is synchronous Into the storage volume of long-range (strange land) distributed storage cluster.(3) storage replication group service support breakpoint transmission, support are multigroup multiple System, it is ensured that the integrality and high efficiency of data.
In the application some optional embodiments, shown in reference picture 8, the distributed storage data pair provided for the present invention This read-write theory design diagram.Many copy mechanism are stored by the enterprise-level of distributed storage, exploitation is completed can for guarantee The data stored to it by property free of errors hold capacity, obtains high reliability to data storage, will be used by multi-duplicate technology The data at family deposit many parts in memory bank.In this case, as long as being not all of losing in data, the data of user would not Lose.
Calculated at user interface end after three data disks, directly communicated with master data disk, initiate write operation.Master data Disk is received after request, initiates write operation to from data disks respectively.After each write operation is completed from data disks, will respectively to Master data disk sends confirmation.After master data disk, which receives other two write-ins from data disks, to be confirmed, and oneself also complete Data write, then confirm that data write operation is completed to user interface.
In addition herein described system also includes:
Storage replication group:The Pools selections of the storage replication group of correspondence High Availabitity group website are set, storage replication group is set Replicate direction, the parameter for setting related storage replication group to replicate, for example:Replicate block size, breakpoint transmission, network bandwidth ginseng Number setting etc..Many storage replication groups can be set.
Remote cluster management:Each of storage replication group function needs in companion's cluster (Peer Clusters) is right Configured on the pool answered, all storage volume in automatic disaster tolerance some storage pool can be set and also support to specify that disaster tolerance is single to be deposited Store up a particular subset of volume.
The strong consistency of storage replication group:Using uniformity hash algorithm or data syn-chronization algorithm, it is ensured that two ends cluster The uniformity of data, it is ensured that availability of data and security.Wherein, uniformity hash algorithm is to each node distribution one in system Individual random token, these token constitute a Hash ring.When performing data deposit operation, Key cryptographic Hash is first calculated, then It is stored in the node that clockwise direction first is more than or equal to where the token of the cryptographic Hash.
Storage replication group is monitored:State, flow, progress, the information of time in the process of storage replication are supervised Control ensures the reliability of data.
Breakpoint transmission:During the entire process of breakpoint transmission is realized, it is ensured that the safety of whole distributed storage disaster recovery and backup systems It is stable, while ensureing the uniformity and integrality of data.Wait after network recovery.The biography of original vol data need not be restarted It is defeated, but from the suspension moment, continue to transmit remaining data, overall process no data overflows, and no data is lost.
The status information creation data of storage system of the primary storage server in station in production website and disaster tolerance website The information assurance of safe distribution, information is mutually synchronized.
When main website point failure, home site storage server corresponds to the N subregions in former data safety distributed intelligence respectively It is updated from storage dish Synchronization Status Message, generates new data distributed intelligence on schedule.
When from station failure, the N subregions in former data safety distributed intelligence are corresponded to master by slave site storage server respectively Website storage dish Synchronization Status Message is updated, recording synchronism information, after after slave site recovery, according to last synchronizing information, Continue more home site and carry out data syn-chronization.
Duplicate network optimizes, and bandwidth is optimized, the multicast between support site, reduces unnecessary duplication, and optimization data are passed It is defeated, efficiency of transmission is improved, Optimized Replication network strategy can be changed flexibly there is provided a variety of replication strategies to different requirements, improved Network utilization, reduces propagation delay time.
Those of ordinary skills in the art should understand that:The discussion of any of the above embodiment is exemplary only, not It is intended to imply that the scope of the present disclosure (including claim) is limited to these examples;Under the thinking of the present invention, above example Or can also not be combined between the technical characteristic in be the same as Example, step can be realized with random order, and be existed such as Many other changes of upper described different aspect of the invention, for simplicity, they are provided not in details.
In addition, to simplify explanation and discussing, and in order to obscure the invention, can in the accompanying drawing provided To show or can not show that the known power ground with integrated circuit (IC) chip and other parts is connected.Furthermore, it is possible to Device is shown in block diagram form, to avoid obscuring the invention, and this have also contemplated that following facts, i.e., on this The details of the embodiment of a little block diagram arrangements be depend highly on the platform that will implement the present invention (that is, these details should It is completely in the range of the understanding of those skilled in the art).Elaborating detail (for example, circuit) with describe the present invention In the case of exemplary embodiment, it will be apparent to those skilled in the art that can be in these no details In the case of or implement the present invention in the case that these details are changed.Therefore, these descriptions are considered as explanation It is property rather than restricted.
Although having been incorporated with specific embodiment of the invention, invention has been described, according to retouching above State, many replacements of these embodiments, modifications and variations will be apparent for those of ordinary skills.Example Such as, other memory architectures (for example, dynamic ram (DRAM)) can use discussed embodiment.
Embodiments of the invention be intended to fall within the broad range of appended claims it is all it is such replace, Modifications and variations.Therefore, within the spirit and principles of the invention, any omission, modification, equivalent substitution, the improvement made Deng should be included in the scope of the protection.

Claims (9)

1. a kind of enterprise-level disaster tolerance system based on distributed storage, it is characterised in that including:
Primary data center, storage and inquiry service for realizing data under normal circumstances;
Strange land data center, for as the corresponding Remote Switched Port Analyzer of primary data center, having and master in the strange land data center The identical data of data center;
Database asynchronous replication module, for causing both primary data center and strange land data center by way of asynchronous replication Database data be consistent;
Distributed storage mirror module, for realizing that primary data center is deposited with distribution in the data center of strange land by mirror image mechanism The asynchronous replication of data is stored up, passes through the uniformity of data during log mechanism guarantee asynchronous replication;
Disaster monitoring modular, the relevant information for monitoring primary data center, and to disaster recovery module after disaster generation Send disaster information;
Disaster recovery module, the disaster information for receiving the transmission of disaster monitoring modular, using strange land data center in Internet Data recovery is carried out to primary data center.
2. system according to claim 1, it is characterised in that used between the primary data center and strange land data center Dedicated data line carries out data transmission.
3. system according to claim 1, it is characterised in that the disaster monitoring modular is additionally operable to according to default rule The relevant information of primary data center is handled with algorithm, user or keeper are given a warning or carried before disaster generation Show information.
4. system according to claim 1, it is characterised in that the disaster monitoring modular is additionally operable to after disaster occurs, Detect and judge whether the relevant information in primary data center can use, believe if unavailable ability sends disaster to disaster recovery module Breath;
The disaster recovery module is additionally operable to send disaster recovery condition information to disaster monitoring modular;
The disaster monitoring modular is additionally operable to the disaster recovery condition information sent according to disaster recovery module, and correlation behavior is anti- Feed user or attendant.
5. system according to claim 1, it is characterised in that also including network handover module, for after disaster occurs The reference address of application and service is switched to strange land data center.
6. a kind of enterprise-level disaster tolerant control method based on distributed storage, it is characterised in that including:
Monitor related data and information in primary data center and judge whether primary data center data are abnormal;
If monitoring data exception, disaster recovery module is called to be at activating upstate;
Verify strange land data center and the uniformity of primary data center distributed storage data, and strange land data center and main number According to the uniformity of central database;
Network switching is carried out, the reference address of application and service is switched to strange land data center;
Data recovery is carried out to primary data center using disaster recovery module, rear return information is successfully recovered to user or safeguards people Member;
According to user instruction or feedback information is successfully recovered, the network address is again switched into primary data center.
7. method according to claim 6, it is characterised in that it is described judge primary data center data whether abnormal step Also include:
According to preset strategy or algorithm, judge whether data are abnormal;
If data exception, further determine whether to cause data or service unavailable, if so, then calling disaster recovery module; Otherwise, abnormal information is fed back into user or attendant.
8. method according to claim 6, it is characterised in that described also to include the step of call disaster recovery module:
Disaster recovery module is persistently called according to preset times, until calling success, otherwise, malloc failure malloc information is fed back.
9. method according to claim 6, it is characterised in that in verification uniformity or the process of progress network switching In, if finding not meeting uniformity or network switching failure, then corresponding failure information is returned to user or people is safeguarded Member.
CN201710533133.9A 2017-07-03 2017-07-03 A kind of enterprise-level disaster tolerance system and disaster tolerant control method based on distributed storage Pending CN107241430A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710533133.9A CN107241430A (en) 2017-07-03 2017-07-03 A kind of enterprise-level disaster tolerance system and disaster tolerant control method based on distributed storage

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710533133.9A CN107241430A (en) 2017-07-03 2017-07-03 A kind of enterprise-level disaster tolerance system and disaster tolerant control method based on distributed storage

Publications (1)

Publication Number Publication Date
CN107241430A true CN107241430A (en) 2017-10-10

Family

ID=59991406

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710533133.9A Pending CN107241430A (en) 2017-07-03 2017-07-03 A kind of enterprise-level disaster tolerance system and disaster tolerant control method based on distributed storage

Country Status (1)

Country Link
CN (1) CN107241430A (en)

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108365990A (en) * 2018-02-12 2018-08-03 中国电力工程顾问集团中南电力设计院有限公司 Electric energy metered system application fusion architecture and fusion method
CN108512693A (en) * 2018-02-24 2018-09-07 国家计算机网络与信息安全管理中心 A kind of trans-regional disaster recovery method and device
CN108710550A (en) * 2018-08-16 2018-10-26 北京易华录信息技术股份有限公司 A kind of Double Data center disaster recovery system for system of deploying to ensure effective monitoring and control of illegal activities for public security traffic control inspection
CN108932180A (en) * 2018-06-21 2018-12-04 郑州云海信息技术有限公司 A kind of disaster tolerance management method, device, storage medium and computer equipment matter
CN109558267A (en) * 2018-11-16 2019-04-02 郑州云海信息技术有限公司 A kind of storage cluster data restore verification method and device
CN109672551A (en) * 2018-09-25 2019-04-23 平安科技(深圳)有限公司 Across data-center applications dissemination methods, equipment, storage medium and device
CN109947593A (en) * 2017-12-21 2019-06-28 中国电信股份有限公司 Data disaster tolerance method, system, tactful arbitration device and storage medium
CN110162153A (en) * 2019-04-16 2019-08-23 上海马小修智能科技有限公司 A kind of data disaster tolerance switching system
CN111158949A (en) * 2018-11-07 2020-05-15 中国移动通信集团重庆有限公司 Configuration method, switching method and device of disaster recovery architecture, equipment and storage medium
CN111340414A (en) * 2020-02-14 2020-06-26 上海东普信息科技有限公司 Cloud bin big data processing method, cloud bin system, computer equipment and storage medium
CN113111143A (en) * 2021-04-09 2021-07-13 河南交通发展研究院有限公司 Road multi-source heterogeneous data reconstruction integration and support sharing complete method and system
CN113157660A (en) * 2021-01-22 2021-07-23 淘宝(中国)软件有限公司 Data unit copy placement method and device, electronic equipment and system
CN114461438A (en) * 2022-04-12 2022-05-10 北京易鲸捷信息技术有限公司 Distributed database disaster recovery system and method of asymmetric center mode
CN114520811A (en) * 2022-04-20 2022-05-20 柏科数据技术(深圳)股份有限公司 Production center data recovery method, system, terminal equipment and storage medium
CN115086150A (en) * 2022-05-31 2022-09-20 阿里巴巴(中国)有限公司 Disaster recovery control system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6823349B1 (en) * 2001-09-21 2004-11-23 Emc Corporation Method and system for establishing, maintaining, and using a persistent fracture log
CN104243195A (en) * 2013-06-19 2014-12-24 国家电网公司 Remote disaster recovery processing method and device
CN104239164A (en) * 2013-06-19 2014-12-24 国家电网公司 Cloud storage based disaster recovery backup switching system
CN105516365A (en) * 2016-01-22 2016-04-20 浪潮电子信息产业股份有限公司 Method for managing a distributed type mirror image storage block device based on network

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6823349B1 (en) * 2001-09-21 2004-11-23 Emc Corporation Method and system for establishing, maintaining, and using a persistent fracture log
CN104243195A (en) * 2013-06-19 2014-12-24 国家电网公司 Remote disaster recovery processing method and device
CN104239164A (en) * 2013-06-19 2014-12-24 国家电网公司 Cloud storage based disaster recovery backup switching system
CN105516365A (en) * 2016-01-22 2016-04-20 浪潮电子信息产业股份有限公司 Method for managing a distributed type mirror image storage block device based on network

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109947593A (en) * 2017-12-21 2019-06-28 中国电信股份有限公司 Data disaster tolerance method, system, tactful arbitration device and storage medium
CN109947593B (en) * 2017-12-21 2021-06-04 中国电信股份有限公司 Data disaster tolerance method, system, strategy arbitration device and storage medium
CN108365990B (en) * 2018-02-12 2021-03-09 中国电力工程顾问集团中南电力设计院有限公司 Electric energy metering system application fusion framework and fusion method
CN108365990A (en) * 2018-02-12 2018-08-03 中国电力工程顾问集团中南电力设计院有限公司 Electric energy metered system application fusion architecture and fusion method
CN108512693A (en) * 2018-02-24 2018-09-07 国家计算机网络与信息安全管理中心 A kind of trans-regional disaster recovery method and device
CN108932180A (en) * 2018-06-21 2018-12-04 郑州云海信息技术有限公司 A kind of disaster tolerance management method, device, storage medium and computer equipment matter
CN108710550A (en) * 2018-08-16 2018-10-26 北京易华录信息技术股份有限公司 A kind of Double Data center disaster recovery system for system of deploying to ensure effective monitoring and control of illegal activities for public security traffic control inspection
CN108710550B (en) * 2018-08-16 2021-09-28 北京易华录信息技术股份有限公司 Double-data-center disaster tolerance system for public security traffic management inspection and control system
CN109672551A (en) * 2018-09-25 2019-04-23 平安科技(深圳)有限公司 Across data-center applications dissemination methods, equipment, storage medium and device
CN111158949A (en) * 2018-11-07 2020-05-15 中国移动通信集团重庆有限公司 Configuration method, switching method and device of disaster recovery architecture, equipment and storage medium
CN109558267A (en) * 2018-11-16 2019-04-02 郑州云海信息技术有限公司 A kind of storage cluster data restore verification method and device
CN109558267B (en) * 2018-11-16 2021-10-29 郑州云海信息技术有限公司 Storage cluster data recovery verification method and device
CN110162153A (en) * 2019-04-16 2019-08-23 上海马小修智能科技有限公司 A kind of data disaster tolerance switching system
CN111340414A (en) * 2020-02-14 2020-06-26 上海东普信息科技有限公司 Cloud bin big data processing method, cloud bin system, computer equipment and storage medium
CN113157660A (en) * 2021-01-22 2021-07-23 淘宝(中国)软件有限公司 Data unit copy placement method and device, electronic equipment and system
CN113111143A (en) * 2021-04-09 2021-07-13 河南交通发展研究院有限公司 Road multi-source heterogeneous data reconstruction integration and support sharing complete method and system
CN114461438A (en) * 2022-04-12 2022-05-10 北京易鲸捷信息技术有限公司 Distributed database disaster recovery system and method of asymmetric center mode
CN114520811A (en) * 2022-04-20 2022-05-20 柏科数据技术(深圳)股份有限公司 Production center data recovery method, system, terminal equipment and storage medium
CN115086150A (en) * 2022-05-31 2022-09-20 阿里巴巴(中国)有限公司 Disaster recovery control system
CN115086150B (en) * 2022-05-31 2023-12-29 阿里巴巴(中国)有限公司 Disaster recovery control system

Similar Documents

Publication Publication Date Title
CN107241430A (en) A kind of enterprise-level disaster tolerance system and disaster tolerant control method based on distributed storage
US7120769B2 (en) Point in time remote copy for multiple sites
AU2017282817B2 (en) Data processing method and device
EP1533701B1 (en) System and method for failover
CN101136728A (en) Cluster system and method for backing up a replica in a cluster system
US10430290B2 (en) Method and system for star replication using multiple replication technologies
CN102890716B (en) The data back up method of distributed file system and distributed file system
CN105069160A (en) Autonomous controllable database based high-availability method and architecture
CN101763321B (en) Disaster-tolerant method, device and system
TWI677797B (en) Management method, system and equipment of master and backup database
CN108810150B (en) Data replication method of application-level disaster recovery backup system of cooperative office system
US20160062854A1 (en) Failover system and method
CN106339278A (en) Data backup and recovery method for network file system
CN105988894A (en) Disaster tolerance technique of active-active mode
CN106789180A (en) The service control method and device of a kind of meta data server
CN112181723A (en) Financial disaster recovery method and device, storage medium and electronic equipment
EP3896571B1 (en) Data backup method, apparatus and system
CN114900532A (en) Power data disaster tolerance method, system, device, computer equipment and storage medium
WO2021115043A1 (en) Distributed database system and data disaster backup drilling method
WO2015196692A1 (en) Cloud computing system and processing method and apparatus for cloud computing system
WO2017122060A1 (en) Parallel recovery for shared-disk databases
CN111488247B (en) High availability method and equipment for managing and controlling multiple fault tolerance of nodes
CN104850628A (en) Data synchronization method and apparatus in database
CN116389233A (en) Container cloud management platform active-standby switching system, method and device and computer equipment
CN107404511B (en) Method and device for replacing servers in cluster

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20171010