WO2014110994A1

WO2014110994A1 - Integrated platform for disaster recovery of IT system

Info

Publication number: WO2014110994A1
Application number: PCT/CN2014/070331
Authority: WO
Inventors: 戚跃民; 郝建明; 伍福生; 简超
Original assignee: 中国银联股份有限公司
Priority date: 2013-01-15
Filing date: 2014-01-08
Publication date: 2014-07-24
Also published as: CN103929320A

Abstract

The present invention relates to an integrated platform for disaster recovery of an IT system, used for the integrated management of a remote distributed IT system. The integrated platform comprises a system managing module (100), a system communicating module (200), a data synchronizing module (300), a data comparing and analyzing module (400), a data storing module (500), a service flow module (600), a service recovering module (700), and a security auditing module (800). The integrated platform for disaster recovery of the IT system according to the present invention can implement real-time communication between service hosts and can integrate data recovery and service switching into a service flow. The present invention can therefore provide an integrated platform for disaster recovery of an IT system that effectively combines hardware recovery, data recovery, and service recovery when a disaster occurs in the remote distributed IT system.

Description

An integrated platform for IT system disaster recovery

Technical field

[0001] The present invention relates to disaster tolerance technologies in network communications, and in particular to an integrated platform capable of recovering data and maintaining data consistency in the event of a disaster in an IT system.

Background art

[0002] With the advent of the information age, data has become increasingly prominent as the core of the normal operation of society. For a company, data is the key to its survival and development. Users and enterprises in various industries are increasingly dependent on network applications and data. Sudden disasters such as fires, floods, earthquakes, or terrorist incidents can all have a major impact on a company's data and business production. How to ensure that enterprise data is not lost in the event of a disaster, and that system services resume operation as soon as possible, has therefore become a focus of attention, and disaster recovery technology is receiving growing attention across industries.

[0003] Disaster tolerance is generally divided into three levels: the data level, the system level, and the business level.

[0004] Data-level disaster recovery focuses on data: the user's original data must not be lost or destroyed after a disaster occurs. Data-level disaster recovery differs from ordinary backup in that it requires the backup data to be stored off-site.

[0005] System-level disaster recovery replicates the application processing capability (the service server area) on top of data-level disaster recovery, that is, it builds a supporting system at the backup site. System-level disaster recovery can provide uninterrupted application services, so that user application service requests continue to run transparently without being affected by a disaster.

[0006] Data-level disaster tolerance and system-level disaster tolerance both fall within the IT category, but for normal business operation, protecting the IT systems alone is not enough. Some users need to build the highest level, service-level disaster recovery.

[0007] In the prior art, when a disaster strikes a large off-site distributed system locally, recovery focuses on only one of hardware (system) level recovery, data level recovery, or service level recovery, and there is no integrated platform that effectively combines the three.

Summary of the invention

[0009] In view of the above problems, the present invention is directed to an integrated platform for IT system disaster recovery that is capable of effectively controlling the service hosts in an off-site distributed system and of maintaining data consistency and business continuity.

[0010] The integrated platform for disaster recovery of the IT system of the present invention can effectively solve the problem of centrally managing the hosts of an off-site distributed IT system, implement effective control of the service hosts, and ensure business continuity through flexible business process functions. The integrated platform of the present invention realizes real-time communication with the various service hosts and unifies data recovery and business switching into business processes.

[0011] The integrated platform for disaster recovery of an IT system of the present invention is used for centralized management of an off-site distributed IT system, which has multiple local service hosts, multiple off-site service hosts, and two management centers that manage the local service hosts and the off-site service hosts. The integrated platform includes:

a system management module, configured to monitor and manage the remote distributed IT system in real time, so that the management center can obtain information of each service host in real time;

a system communication module, deployed on each of the local service hosts, the off-site service hosts, and the management centers, and configured to implement communication between the local service hosts and the management centers, communication between the off-site service hosts and the management centers, and communication between the management centers;

a data synchronization module, configured to implement real-time data synchronization in the remote distributed IT system;

a data comparison analysis module, configured to implement data consistency verification in the remote distributed IT system;

a data storage module, configured to implement data storage in the remote distributed IT system;

a business process module, configured to implement various types of business processes in the remote distributed IT system;

a service recovery module, configured to take over off-site business processes in the event of a disaster in the remote distributed IT system; and

a security audit module, configured to encrypt and decrypt messages received and sent between each service host and each management center.

[0012] Preferably, the system management module, the data synchronization module, the data comparison analysis module, the data storage module, the business process module, the service recovery module, and the security audit module are all associated with the system communication module, and the data synchronization module, the data comparison analysis module, the data storage module, and the business process module are all associated with the service recovery module.

[0013] Preferably, the system communication module is configured to implement message sending, message parsing, command execution, and result feedback between each service host and each management center.

[0014] Preferably, each management center and each unit module in each service host are connected to a Tuxedo/Q service through a WTC interface.

[0015] Preferably, the security audit module encrypts the sending and receiving of messages between the management center and each unit module in the service host by using a WSL insertion message.

[0016] Preferably, the security audit module configures RSA encryption by using the -Z parameter in the process of inserting a message through the WSL.

[0017] Preferably, the data comparison analysis module is capable of locating and analyzing data according to the differences found in the data and of performing the related data recovery.

[0018] Preferably, the service recovery module includes: a first sub-module for starting the remote business system application and database; a second sub-module for acquiring the time of the disaster switchover; and a third sub-module for performing network switching.

[0019] Preferably, the data synchronization module is configured to deploy mirror storage in the local business system to implement synchronous data replication between local business systems.

[0020] Preferably, the data synchronization module is configured to implement asynchronous replication of data between local mirror storage and offsite storage.

[0021] Preferably, the data synchronization module is further configured to replenish the off-site data into the local database after the local system recovers the service.

[0022] Preferably, the two management centers are functionally identical and are backups of each other.

[0023] Preferably, the management center uses an LDAP server to perform identity authentication.

[0024] The technical problems mainly solved by the present invention are as follows: (1) How to centrally manage and control distributed systems in different places; (2) How to realize fast service switching of distributed systems in different places, and ensure the continuity of services when disasters occur; (3) How to realize the automatic processing of the business; (4) How to monitor the status of the controlled terminal; (5) How to compare the consistency of the business database between the two places.

[0025] With respect to the above technical problem (1), the technical means adopted is: control of all service hosts is implemented through a messaging mechanism (using the Tuxedo /Q reliable message queue) between the management center and each controlled terminal.

[0026] For the above technical problem (2), the technical means adopted is: a corresponding off-site switchover process and switchback process are established for each business system in operation and are combined with the data replenishment module. The former ensures business continuity in the event of a disaster, while the latter guarantees the integrity of the data in the event of a disaster.

[0027] For the above technical problem (3), the technical means adopted is: since the management center can manage and control all service hosts, the daily operations of the business system can be automated, that is, implemented as fixed processes in which the management center sends the job instructions. In addition, for particular business needs, business personnel can also define a set of arbitrary processes to meet those needs in a flexible manner.

[0028] For the above technical problem (4), the technical means adopted is: the corresponding Tuxedo WSL service is deployed on each of the two management centers, and the WSNADDR environment variable is set on each controlled end. The value of the environment variable is the address of the WSL service published by the Tuxedo server (IP address:port number) and is used by the Tuxedo client program (the controlled-end application) to connect to the Tuxedo server. If the connection fails, the client reconnects after 30 seconds. At the same time, the controlled end periodically sends a heartbeat message to the management center, and the management center determines from it whether the state of the controlled end is normal.
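The reconnect-and-heartbeat behavior described above can be illustrated with a minimal Python sketch. It does not use the Tuxedo client API: `connect_with_retry`, `HeartbeatMonitor`, the attempt limit, and the heartbeat timeout are all hypothetical names and values introduced for illustration. Only the 30-second retry delay comes from the text.

```python
import time
from typing import Callable, Dict

RETRY_DELAY_SECONDS = 30  # from the text: reconnect 30 seconds after a failure


def connect_with_retry(connect: Callable[[], bool],
                       sleep: Callable[[float], None] = time.sleep,
                       max_attempts: int = 5) -> int:
    """Call connect() until it succeeds; return the number of attempts used.

    max_attempts is an assumption; the text does not state a retry limit.
    """
    for attempt in range(1, max_attempts + 1):
        if connect():
            return attempt
        if attempt < max_attempts:
            sleep(RETRY_DELAY_SECONDS)
    raise ConnectionError(f"no connection after {max_attempts} attempts")


class HeartbeatMonitor:
    """Management-center side: remember when each controlled end last reported."""

    def __init__(self, timeout_seconds: float) -> None:
        # timeout_seconds is an assumed threshold, not a value from the text.
        self.timeout_seconds = timeout_seconds
        self.last_seen: Dict[str, float] = {}

    def record(self, host: str, now: float) -> None:
        """Store the arrival time of a heartbeat from the given host."""
        self.last_seen[host] = now

    def is_alive(self, host: str, now: float) -> bool:
        """A host is considered normal if its last heartbeat is recent enough."""
        seen = self.last_seen.get(host)
        return seen is not None and now - seen <= self.timeout_seconds
```

Passing the `sleep` function as a parameter keeps the retry loop testable without actually waiting 30 seconds between attempts.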

[0029] With respect to the above technical problem (5), the technical means adopted is: through the data comparison module, any single table or any table set (multiple tables) in the two business databases can be compared, and several comparison modes are available: 1) comparing the number of records in the table; 2) comparing selected fields in the table; 3) comparing the table by means of the MD5 algorithm. With these comparison modes it is possible to determine accurately whether the local and remote business databases are consistent and, if they are not, to tell the user where the differences are.
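The three comparison modes can be sketched as follows. This is an illustrative example only, using SQLite from the Python standard library as a stand-in for the business databases. The function names, the `key` ordering column, and the way rows are serialized before hashing are assumptions, not details given in the text.

```python
import hashlib
import sqlite3
from typing import Dict, List, Sequence, Tuple


def row_count(conn: sqlite3.Connection, table: str) -> int:
    """Mode 1: number of records in the table."""
    return conn.execute(f"SELECT COUNT(*) FROM {table}").fetchone()[0]


def field_rows(conn: sqlite3.Connection, table: str,
               fields: Sequence[str], key: str) -> List[Tuple]:
    """Mode 2: values of the selected fields, read in key order."""
    cols = ", ".join(fields)
    return conn.execute(f"SELECT {cols} FROM {table} ORDER BY {key}").fetchall()


def table_md5(conn: sqlite3.Connection, table: str, key: str) -> str:
    """Mode 3: MD5 digest over all rows, read in key order."""
    digest = hashlib.md5()
    for row in conn.execute(f"SELECT * FROM {table} ORDER BY {key}"):
        digest.update(repr(row).encode("utf-8"))
    return digest.hexdigest()


def compare_tables(local: sqlite3.Connection, remote: sqlite3.Connection,
                   table: str, fields: Sequence[str], key: str) -> Dict[str, object]:
    """Run all three modes and report the rows that differ.

    Table and column names are interpolated into SQL directly, so they
    must come from trusted configuration, never from user input.
    """
    local_rows = field_rows(local, table, fields, key)
    remote_rows = field_rows(remote, table, fields, key)
    return {
        "count_match": row_count(local, table) == row_count(remote, table),
        "fields_match": local_rows == remote_rows,
        "md5_match": table_md5(local, table, key) == table_md5(remote, table, key),
        "differing_rows": sorted(set(local_rows) ^ set(remote_rows)),
    }
```

The record count is the cheapest check, the digest detects any difference without transferring rows between sites, and the field comparison is what allows the differing rows to be reported to the user.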

[0030] In summary, the integrated platform for disaster recovery of an IT system of the present invention can realize real-time communication between the various service hosts and can integrate data recovery and service switching into a business process. The present invention can therefore provide an integrated platform for disaster recovery of IT systems that effectively combines hardware recovery, data recovery, and business recovery when a disaster occurs in a remote distributed IT system.

Drawings

FIG. 1 is a schematic structural diagram showing centralized management of an off-site distributed IT system by the integrated platform for IT system disaster recovery according to the present invention.

FIG. 2 is a schematic diagram showing the construction of the integrated platform for disaster recovery of an IT system of the present invention.

FIG. 3 is a schematic diagram showing data storage and data synchronization processing in the integrated platform for IT system disaster recovery according to the present invention.

FIG. 4 is a flow chart showing the processing of a unit module, that is, a controlled end, under the management of the integrated platform for IT system disaster recovery according to the present invention.

Detailed description

[0037] The following describes some of the various embodiments of the invention and is intended to provide a basic understanding of the invention. It is not intended to identify key or critical elements of the invention or to delimit the scope of the invention.

FIG. 1 is a schematic structural diagram showing centralized management of an off-site distributed IT system by the integrated platform for IT system disaster recovery according to the present invention. As shown in FIG. 1, the remote distributed IT system has multiple local service hosts (service host 1, service host 2, service host 3, ...), multiple off-site service hosts (service host 4, service host 5, service host 6, ...), and two management centers that manage the local service hosts and the off-site service hosts. The local service hosts, the off-site service hosts, and the two management centers are connected through communication lines. The two management centers have the same management functions and are backups of each other. The off-site distributed IT system includes all of the above service hosts and management centers. The "controlled end" mentioned in the present invention is a module deployed on all service hosts. Thus, under the integrated platform for disaster recovery of IT systems of the present invention, all service hosts can accept instructions from the management center and perform the related operations (a host on which the unit module is deployed may therefore also be referred to herein simply as a "controlled end").

[0040] As shown in FIG. 2, the integrated platform for disaster recovery of an IT system of the present invention includes: a system management module 100 for monitoring and managing the remote distributed IT system in real time, so that the management center can acquire information on each service host in real time; a system communication module 200, deployed on each local service host, each off-site service host, and each management center, for implementing communication between the local service hosts and the management centers, between the off-site service hosts and the management centers, and between the management centers; a data synchronization module 300 for realizing real-time synchronization of data in the remote distributed IT system; a data comparison analysis module 400 for realizing consistency verification of data in the remote distributed IT system; a data storage module 500 for implementing data storage in the remote distributed IT system; a business process module 600 for implementing various types of business processes in the remote distributed IT system; a service recovery module 700 for taking over off-site business processes in the event of a disaster in the remote distributed IT system; and a security audit module 800 for encrypting and decrypting messages sent and received between each service host and each management center.

[0041] The system management module 100, the data synchronization module 300, the data comparison analysis module 400, the data storage module 500, the business process module 600, the service recovery module 700, and the security audit module 800 are all associated with the system communication module 200. The data synchronization module 300, the data comparison analysis module 400, the data storage module 500, and the business process module 600 are all associated with the service recovery module 700.

[0042] The system management module 100 can monitor and manage the remote distributed IT system in real time, so that the management center can obtain information on the service hosts in real time through the reliable message communication mechanism between each associated unit module and the management center in the system.

[0043] The system communication module 200 is deployed in each unit module and each management center in the remote distributed IT system. It is used for real-time message sending, message parsing, command execution, and result feedback, and is the basis for realizing disaster recovery of the IT system. The management center and each unit module are connected to the Tuxedo /Q service through the WTC interface. WTC, whose full name is WebLogic Tuxedo Connector, is the connection tool between BEA's web platform product WebLogic and its middleware product Tuxedo, and provides two-way access between WebLogic and Tuxedo. Tuxedo is also a middleware product from BEA. The Tuxedo /Q component queues messages in a reliable manner, allowing them to be stored in persistent media such as disk, or in non-persistent media such as memory, for later use. In the present invention, the management center is deployed on the WebLogic platform as a Java application, and Tuxedo's /Q reliable message queue is deployed on each unit module (that is, each controlled terminal), which receives the message command sent by the management center, executes it, and stores the result message in the response queue. Communication between WebLogic in the management center and Tuxedo in each unit module uses the WTC interface.
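The receive, execute, and store-result cycle on a controlled end can be sketched in Python as follows. This is not Tuxedo /Q code: the standard-library `queue.Queue` is an in-memory stand-in and, unlike a disk-backed /Q queue, is not persistent. The `Command` and `Result` types, `run_controlled_end`, and the handler table are hypothetical. The convention that a return code of 0 means success follows the description of function scripts later in the text.

```python
import queue
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class Command:
    message_id: str   # unique identifier of the command message
    func_id: str      # which function the controlled end should run
    params: dict


@dataclass
class Result:
    message_id: str   # echoes the command it answers
    return_code: int  # 0 = success, non-zero = failure


def run_controlled_end(request_q: "queue.Queue[Command]",
                       reply_q: "queue.Queue[Result]",
                       handlers: Dict[str, Callable[[dict], int]]) -> int:
    """Drain the request queue: execute each command and enqueue its result.

    Returns the number of commands processed. Unknown function numbers are
    answered with return code 1 (an assumed error convention).
    """
    processed = 0
    while True:
        try:
            cmd = request_q.get_nowait()
        except queue.Empty:
            return processed
        handler = handlers.get(cmd.func_id)
        code = handler(cmd.params) if handler else 1
        reply_q.put(Result(cmd.message_id, code))
        processed += 1
```

Because the sender only enqueues a command and later reads the reply queue, a slow command or a temporary network fault delays the result rather than losing it, which is the property the text attributes to the queue-based design.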

[0044] The management center sends the relevant command messages and receives the result messages returned after execution. Each unit module receives the command messages from the management center and sends back the result messages of execution. When a command takes too long to execute or the network is faulty, Tuxedo /Q provides a reliable message service and ensures the integrity of message delivery. This mechanism provides a more flexible and reliable asynchronous execution method than tpacall() and meets the needs of a remote distributed system. Therefore, in the present invention, by adopting Tuxedo /Q between the management center and each controlled terminal, centralized management and control of the off-site distributed system can be sustained.

[0045] The data synchronization module 300, the data comparison analysis module 400, and the data storage module 500 jointly guarantee data consistency in the remote distributed system. The data synchronization module 300 and the data storage module 500 are used to synchronize, in real time, the data of the business systems distributed across sites. The data comparison analysis module 400 is used to verify the consistency of the data of those business systems, to locate and analyze data according to the differences found, and to perform the related data recovery. The data synchronization module 300, the data comparison analysis module 400, and the data storage module 500 provide the basis for the service recovery module 700.

[0046] The business process module 600 is used to implement the various business processes of the off-site distributed system effectively. Process information is defined in the database in the form of ordered functional steps, based on basic elements such as processes, steps, functions, and combined functions, and can be customized through scripts. The management center program reads the process information, interprets and executes it, and thereby completes the business functions of the process and implements the daily business processes fixed in the system. These business processes are collectively referred to as fixed processes. In addition, to cope with temporary system requirements such as equipment replacement, line maintenance, and fault handling, a series of necessary business functions must be performed on an ad hoc basis. This gives rise to the arbitrary process function, which supports selective execution of the relevant defined functions. It supplements the fixed business processes and is a very flexible way of controlling the business system.
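A process defined as ordered functional steps, and a program that interprets it, can be sketched as follows. The `Step` tuple layout, `run_process`, and the rule of stopping at the first non-zero return code are assumptions made for illustration. The text specifies only that steps are ordered, stored in a database, and interpreted by the management center program.

```python
from typing import Callable, Dict, List, Tuple

# (order, function name, parameters): a hypothetical row layout for a step
Step = Tuple[int, str, dict]


def run_process(steps: List[Step],
                functions: Dict[str, Callable[[dict], int]]) -> Tuple[bool, List[str]]:
    """Execute steps in ascending order; stop at the first non-zero return code.

    Returns (succeeded, names of the functions that were executed).
    """
    executed: List[str] = []
    for _order, name, params in sorted(steps, key=lambda s: s[0]):
        code = functions[name](params)
        executed.append(name)
        if code != 0:
            return False, executed
    return True, executed
```

Under this sketch a fixed process is a step list that is stored once and reused, while an arbitrary process is a step list that an operator assembles from the same function table for a one-off task such as line maintenance.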

[0047] On the basis of guaranteed data consistency, the service recovery module 700 ensures that the remote distributed system can quickly take over business processes off-site in the event of a disaster or similar event, thereby ensuring the continuity of data and services. The service recovery module 700 includes the following sub-modules: a first sub-module for starting the remote business system application and database; a second sub-module for obtaining the time of the disaster switchover; and a third sub-module for performing network switching. For example, if the transit service system of the Shanghai Center fails, it must be switched to the Beijing Center immediately. The first sub-module of the service recovery module 700 then starts the remote transit system application and database and determines whether the switching conditions are met, the second sub-module acquires the time point of the disaster switchover (for subsequent data recovery), and the third sub-module performs network switching and related operations.

[0048] Broken down, each step in these processes is essentially the control of a particular service host, which automatically carries out the operation instructions issued by the management center. The execution of the process is transparent to the user: once it completes, the actual transaction processing location has changed from Shanghai to Beijing, ensuring business continuity. When the Shanghai Center's business system is restored, a corresponding switchback process is executed. After the switchback, transactions are again sent to Shanghai for processing, and the data recovery process replenishes the Shanghai Center with the transactions processed in Beijing during the intervening period.

[0049] To avoid plaintext transmission of messages between the management center and each unit module, the security audit module 800 adds encryption settings to the messages, using the -Z parameter to configure RSA encryption when messages are inserted through the WSL. Messages are received and sent in encrypted form and are decrypted automatically on receipt, which ensures the security of data transmission. At the same time, the management center uses a unified LDAP (Lightweight Directory Access Protocol) server for identity authentication. The operator's permission settings are also read from the LDAP server, and the relevant authorization is checked before any function operation is performed. Only authorized users can execute the functions of each business process. In addition, the security audit module 800 records and audits login information, operation logs, and process execution.

[0050] When the integrated platform for disaster recovery of the IT system of the present invention shown in FIG. 2 centrally manages the remote distributed system shown in FIG. 1, controlled ends are deployed on the local and off-site service hosts, a management center is deployed at each of the local and off-site locations, and each management center communicates with all service hosts, local and off-site, to achieve system management, service implementation, and recovery.

[0051] FIG. 3 is a schematic diagram showing data storage and data synchronization processing in the integrated platform for IT system disaster recovery according to the present invention. As shown in FIG. 3, a set of mirror storage is deployed in the local business system of the integrated platform, so that data is replicated synchronously and bidirectionally between the local main business systems. Data is replicated asynchronously and in one direction between the local mirror storage and the off-site storage. This data synchronization mechanism ensures that when a disaster affects the local business system or its data, service can be recovered quickly at the off-site location without data loss. After the local system resumes service, the off-site data can be replenished into the local database.
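The difference between the synchronous local replication and the asynchronous, one-way off-site replication described above can be illustrated with a small in-memory sketch. `ReplicatedStore` and its method names are hypothetical, and the sketch simplifies the bidirectional local mirroring to a single write path that updates both local copies.

```python
from collections import deque
from typing import Deque, Dict, Optional, Tuple


class ReplicatedStore:
    """Toy model: two local copies kept in step, one off-site copy that lags."""

    def __init__(self) -> None:
        self.primary: Dict[str, int] = {}
        self.mirror: Dict[str, int] = {}
        self.offsite: Dict[str, int] = {}
        self._pending: Deque[Tuple[str, int]] = deque()

    def write(self, key: str, value: int) -> None:
        # Synchronous step: the write completes only after both local
        # copies hold the new value.
        self.primary[key] = value
        self.mirror[key] = value
        # Asynchronous step: the off-site update is queued and applied later.
        self._pending.append((key, value))

    def ship_offsite(self, max_items: Optional[int] = None) -> int:
        """Apply queued updates to the off-site copy, in order, one direction only."""
        shipped = 0
        while self._pending and (max_items is None or shipped < max_items):
            key, value = self._pending.popleft()
            self.offsite[key] = value
            shipped += 1
        return shipped

    def offsite_lag(self) -> int:
        """Number of local writes not yet applied off-site."""
        return len(self._pending)
```

In this model the two local copies are identical after every write, while the off-site copy catches up whenever `ship_offsite` runs.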

[0052] "Data replenishment" can be understood through the example used earlier in describing the service recovery module 700. When the local business system (Shanghai) needs to switch to Beijing, the time point of the switchover is recorded as T0. After the switchover, all transactions are in fact transferred to the Beijing Center for processing. When the Shanghai Center resumes business, it executes the switchback process of the corresponding business system and records the switchback time point T1. The data replenishment process is executed afterward. The interval T1 - T0 is the period during which transactions were processed at the Beijing Center. The data recovery process therefore starts at this point: the Beijing management center issues instructions to read the data from the Beijing Center's transaction database (that is, the off-site data) and passes the data over the optical network from Beijing to the Shanghai management center, which then inserts it into the corresponding business database. In this way, the transaction data is complete for both the business system and the user, just as if no switchover had occurred.
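The replenishment step can be sketched as a query over the switchover window followed by an insert into the local database. SQLite stands in for the transaction databases, and the `txn` table, its columns, the integer timestamps, and the half-open window from T0 up to but not including T1 are all assumptions made for illustration.

```python
import sqlite3


def replenish(offsite: sqlite3.Connection, local: sqlite3.Connection,
              t0: int, t1: int) -> int:
    """Copy transactions handled off-site during [t0, t1) into the local database.

    Returns the number of rows read from the off-site database.
    INSERT OR IGNORE makes a repeated run harmless for rows already copied.
    """
    rows = offsite.execute(
        "SELECT id, ts, amount FROM txn WHERE ts >= ? AND ts < ? ORDER BY ts",
        (t0, t1),
    ).fetchall()
    with local:  # commits on success, rolls back on error
        local.executemany(
            "INSERT OR IGNORE INTO txn (id, ts, amount) VALUES (?, ?, ?)", rows
        )
    return len(rows)
```

Recording T0 at switchover and T1 at switchback is what bounds the query, so only the transactions processed off-site during the outage are transferred.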

[0053] FIG. 4 is a flow chart showing how a unit module, that is, a controlled end, handles message transmission and reception under the management of the integrated platform for IT system disaster recovery of the present invention (that is, the specific flow of the system communication module 200). As shown in FIG. 4, in a unit module the process first initializes, allocates space, and generates a linked list with a head node. The corresponding Tuxedo WSL service is deployed on the servers of the two management centers, and the WSNADDR environment variable is set on each controlled end. The value of the environment variable is the address of the WSL service published by the Tuxedo server (IP address:port number) and is used by the Tuxedo client program (the controlled-end application) to connect to the Tuxedo server. If the connection fails, the client reconnects after 30 seconds.

[0054] The linked list is mainly used to store state information about the execution processes that are currently running. Each node contains a process number, a message function number, a message uniqueness flag, a parameter value set, the time at which the process started executing, and a flag indicating whether the node is available (0 is available, 1 is not available). After an execution process completes, the main process clears the information in the linked-list node corresponding to that execution process and sets the availability flag to 0 for later reuse.
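The node layout and the reuse of cleared nodes can be sketched as follows. The class and method names are hypothetical, and the original implementation is presumably not written in Python. The fields and the 0/1 availability flag follow the description above.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Node:
    pid: int = 0                 # process number of the execution process
    func_id: str = ""            # message function number
    message_id: str = ""         # message uniqueness flag
    params: dict = field(default_factory=dict)
    start_time: float = 0.0      # when the execution process started
    in_use: int = 0              # 0 = available, 1 = not available
    next: Optional["Node"] = None


class ProcessList:
    """Singly linked list with a head node that carries no process data."""

    def __init__(self) -> None:
        self.head = Node()

    def acquire(self, pid: int, func_id: str, message_id: str,
                params: dict, start_time: float) -> Node:
        """Reuse the first available node, or append a new one at the tail."""
        last, node = self.head, self.head.next
        while node is not None and node.in_use == 1:
            last, node = node, node.next
        if node is None:
            node = Node()
            last.next = node
        node.pid, node.func_id, node.message_id = pid, func_id, message_id
        node.params, node.start_time, node.in_use = params, start_time, 1
        return node

    def release(self, pid: int) -> bool:
        """Clear the node of a finished execution process and mark it available."""
        node = self.head.next
        while node is not None:
            if node.in_use == 1 and node.pid == pid:
                node.pid, node.func_id, node.message_id = 0, "", ""
                node.params, node.start_time, node.in_use = {}, 0.0, 0
                return True
            node = node.next
        return False
```

Clearing and flagging a node instead of unlinking it means the list never shrinks, so a long-running main process does not need to free and reallocate space for every command.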

[0055] The validity of a message is determined mainly by checking the values in the command message, such as the application system identifier (system), the function number (func_id), the IP address (ip), the time (time), and the message type (type).
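A validity check over those fields might look like the sketch below. The text lists the fields but not the rules applied to them, so every rule here (known system names, an IP allow-list, a set of permitted types, a maximum message age) is an assumption. The field names follow the identifiers in the paragraph above, with the function number written as `func_id`.

```python
from typing import Dict, List, Set

REQUIRED_FIELDS = ("system", "func_id", "ip", "time", "type")


def validate_message(msg: Dict[str, object],
                     known_systems: Set[str],
                     allowed_ips: Set[str],
                     allowed_types: Set[str],
                     now: float,
                     max_age_seconds: float = 300.0) -> List[str]:
    """Return a list of problems found; an empty list means the message is valid."""
    errors = [f"missing {name}" for name in REQUIRED_FIELDS if name not in msg]
    if errors:
        return errors
    if msg["system"] not in known_systems:
        errors.append("unknown system")
    if not str(msg["func_id"]).strip():
        errors.append("empty func_id")
    if msg["ip"] not in allowed_ips:
        errors.append("unexpected ip")
    if msg["type"] not in allowed_types:
        errors.append("unknown type")
    if abs(now - float(msg["time"])) > max_age_seconds:
        errors.append("stale time")
    return errors
```

Returning the list of problems rather than a single boolean lets the controlled end report in its result message why a command was rejected.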

[0056] When a function processing script performs a function operation, it determines, according to the situation, whether processing is needed and how to handle it, so as to avoid unnecessary operations and erroneous returns, and returns the corresponding value. A return value of 0 indicates that the function operation succeeded, and a non-zero value indicates failure.

[0057] When the received message is an interrupt message, the main process sends an interrupt signal to the corresponding execution process, and after the execution process receives the interrupt signal, the loop is stopped, and the subsequent operations are not performed.

[0058] In FIG. 4, the left part is the execution flow of the main process. The main process is a loop that mainly sends heartbeat information, accepts messages and judges their validity, generates a corresponding execution process according to the content of each message, and manages the execution processes that are currently running.

[0059] The integrated platform for IT system disaster recovery described above provides a fast, simple, and effective disaster recovery mechanism and achieves the design goals of RPO = 0 and RTO = 0, so that in an actual disaster it can provide continuous business services in the shortest possible time. In disaster recovery, the industry currently recognizes three goals worth pursuing: first, the recovery time, that is, how long the enterprise can tolerate being without IT while operations are suspended; second, how long the network takes to recover; and third, recovery at the business level. Two metrics are most critical throughout the recovery process: RTO and RPO. RTO (Recovery Time Objective) is the period between two points in time after a disaster occurs: the point at which the IT system goes down and business stops, and the point at which the IT system is restored to the extent that it can support the operation of each department and business resumes. RPO (Recovery Point Objective) is the point to which the system and production data must be restored after a disaster so that they can support the business operations of each department. This point may be the backup data from the previous week or the real-time data from the previous transaction. It can be seen that the integrated platform for IT system disaster recovery of the present invention can provide continuous business services in the shortest possible time in the event of a disaster.

[0060] Moreover, according to the integrated platform for IT system disaster recovery of the present invention, the controlled end (that is, each unit module) can automatically maintain a reliable connection with the server side and maintain sufficient operational robustness.

[0061] Moreover, according to the integrated platform for disaster recovery of the IT system of the present invention, the running state of all deployed controlled ends (that is, each unit module) can be effectively monitored, the running states of the various business processes can be effectively monitored, and, at the same time, means are provided for managing and maintaining the configuration parameters.

[0062] Moreover, according to the integrated platform for disaster recovery of IT systems of the present invention, business processes can be flexibly configured and combined, for example by supporting general changes in business functions through parameterized configuration. For errors, an exception-handling function is provided in the process so that exceptions are handled effectively.

[0063] Moreover, to meet the RTO and RPO performance requirements of the remote distributed system, the integrated platform for disaster recovery of the IT system of the present invention is designed to implement various business processes and to control those processes in daily, planned, and disaster situations. The implementation of fixed business processes and arbitrary functional processes is a core feature provided by the disaster recovery application. To implement the various business processes effectively, process information is defined in the database in the form of ordered functional steps, based on basic elements such as processes, steps, functions, and combined functions, and can be customized through scripts. The management center program reads the process information, interprets and executes it, and thereby completes the business functions of the process. Such processes are collectively referred to as fixed processes. In addition, to cope with temporary business system requirements such as equipment replacement, line maintenance, and fault handling, a series of necessary business functions needs to be performed on an ad hoc basis.

[0064] The above examples mainly illustrate the integrated platform for IT system disaster recovery of the present invention. Although only some specific embodiments of the present invention have been described, it will be understood by those skilled in the art that the invention may be practiced in many other forms without departing from its spirit and scope. Accordingly, the examples and embodiments shown are to be regarded as illustrative and not restrictive, and the invention may cover various modifications and replacements without departing from the spirit and scope of the invention as defined by the appended claims.

Claims

1. An integrated platform for disaster recovery of an IT system, used for the integrated management of a remote distributed IT system having a plurality of local service hosts, a plurality of off-site service hosts, and two management centers that manage the local service hosts and the off-site service hosts, the platform comprising:

a system management module (100), configured to monitor and manage the remote distributed IT system in real time, so that the management center can obtain information of each service host in real time;

a system communication module (200), deployed in each of the local service hosts, each of the off-site service hosts, and the management centers, and used to implement communication between the local service hosts and the management centers, communication between the off-site service hosts and the management centers, and communication between the management centers;

a data synchronization module (300), configured to implement real-time data synchronization in the remote distributed IT system;

a data comparison analysis module (400), configured to implement consistency verification of data in the remote distributed IT system;

a data storage module (500), configured to implement data storage in the remote distributed IT system;

a business process module (600), configured to implement various types of business processes in the remote distributed IT system;

a service recovery module (700), configured to implement the takeover of the off-site business process in the event of a disaster in the remote distributed IT system; and

a security audit module (800), configured to encrypt and decrypt the messages received and sent between the service hosts and the management centers.

2. The integrated platform for disaster recovery of an IT system according to claim 1, characterized in that

The system management module (100), the data synchronization module (300), the data comparison analysis module (400), the data storage module (500), the business process module (600), the service recovery module (700), and the security audit module (800) are all associated with the system communication module (200), and

The data synchronization module (300), the data comparison analysis module (400), the data storage module (500), and the business process module (600) are all associated with the service recovery module (700).

3. The integrated platform for IT system disaster recovery according to claim 2, wherein

The system communication module is configured to implement message sending and receiving, message parsing, command execution, and result feedback between each service host and each management center.

4. The integrated platform for IT system disaster recovery according to claim 3, wherein

Each management center and each unit module in each service host are connected to a Tuxedo/Q service through a WTC interface.

5. The integrated platform for IT system disaster recovery according to claim 4, wherein

The security audit module encrypts the messages sent and received between the management center and each unit module in the service host by inserting the messages through a WSL.

6. The integrated platform for IT system disaster recovery according to claim 5, wherein

The security audit module uses -z to configure RSA encryption in the process of inserting a message through the WSL.

7. The integrated platform for IT system disaster recovery according to claim 6, wherein:

The data comparison analysis module (400) is capable of comparing the data of the local and remote business system databases for consistency, locating any differences, and performing data matching for those differences.
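By way of illustration only, a consistency comparison that locates differences between a local and a remote table might be sketched as follows. The use of per-row SHA-256 digests, the dictionary representation of a table keyed by primary key, and all names are assumptions made for this sketch and are not taken from the present disclosure.

```python
import hashlib
from typing import Any, Dict, List

Row = Dict[str, Any]
Table = Dict[Any, Row]   # primary key -> row (assumed representation)


def row_digest(row: Row) -> str:
    """Digest of a row, independent of the order of its columns."""
    canonical = "|".join(f"{col}={row[col]!r}" for col in sorted(row))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def compare_tables(local: Table, remote: Table) -> Dict[str, List[Any]]:
    """Locate keys that are missing on either side or whose rows differ."""
    shared = local.keys() & remote.keys()
    return {
        "missing_remote": sorted(local.keys() - remote.keys()),
        "missing_local": sorted(remote.keys() - local.keys()),
        "mismatched": sorted(
            key for key in shared
            if row_digest(local[key]) != row_digest(remote[key])
        ),
    }
```

The returned key lists identify where the two databases diverge; how the differing rows are then matched is left open in this sketch.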

8. The integrated platform for disaster recovery of an IT system according to claim 7, characterized in that

The service recovery module (700) includes:

a first sub-module for starting the business system application and database at the off-site location and determining whether the switching condition is satisfied;

a second sub-module for obtaining the point in time of the disaster switch; and

a third sub-module for performing network switching.

9. The integrated platform for disaster recovery of an IT system according to any one of claims 1 to 8, characterized in that

The data synchronization module (300) is configured to deploy mirror storage in a local service system to implement synchronous data replication between local service systems.

10. The integrated platform for disaster recovery of an IT system according to claim 9, characterized in that

The data synchronization module (300) is configured to implement asynchronous replication of data between local mirror storage and offsite storage.

11. The integrated platform for disaster recovery of an IT system according to claim 10, wherein:

The data synchronization module (300) is further configured to supplement the off-site data back into the local database after the local system resumes service.
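By way of illustration only, the combination of synchronous replication to local mirror storage, asynchronous replication to off-site storage, and supplementing off-site data back into the local database after recovery might be sketched as follows. The class, its method names, and the use of in-memory dictionaries in place of real storage are assumptions made for this sketch; it further assumes that the off-site copy is authoritative once it has taken over.

```python
from collections import deque
from typing import Any, Deque, Dict, List, Tuple


class ReplicatedStore:
    """Toy model of local, local-mirror, and off-site storage."""

    def __init__(self) -> None:
        self.local: Dict[str, Any] = {}
        self.mirror: Dict[str, Any] = {}
        self.offsite: Dict[str, Any] = {}
        self._pending: Deque[Tuple[str, Any]] = deque()

    def write(self, key: str, value: Any) -> None:
        """Local write: the mirror is updated synchronously, while the
        off-site update is only queued (asynchronous replication)."""
        self.local[key] = value
        self.mirror[key] = value
        self._pending.append((key, value))

    def ship_pending(self) -> int:
        """Apply queued updates to off-site storage; return how many."""
        shipped = 0
        while self._pending:
            key, value = self._pending.popleft()
            self.offsite[key] = value
            shipped += 1
        return shipped

    def offsite_write(self, key: str, value: Any) -> None:
        """Write made at the off-site location while it has taken over."""
        self.offsite[key] = value

    def backfill_local(self) -> List[str]:
        """After local recovery, copy off-site records that are missing
        or different locally into the local store and its mirror."""
        changed: List[str] = []
        for key, value in self.offsite.items():
            if key not in self.local or self.local[key] != value:
                self.local[key] = value
                self.mirror[key] = value
                changed.append(key)
        return sorted(changed)
```

In this sketch the gap between `write` and `ship_pending` corresponds to the data that asynchronous replication has not yet delivered off-site.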

12. The integrated platform for IT system disaster recovery according to claim 11, wherein:

The two management centers are functionally identical and are backups of each other.

13. The integrated platform for IT system disaster recovery according to claim 12, wherein:

The management center uses an LDAP server to mirror identity authentication.