CN113986610A - Disaster tolerance system, method for realizing disaster tolerance of business equipment and management equipment - Google Patents

Info

Publication number
CN113986610A
Authority
CN
China
Prior art keywords
service
task
disaster
disaster recovery
unit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111141991.1A
Other languages
Chinese (zh)
Inventor
陈靓
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
New H3C Big Data Technologies Co Ltd
Original Assignee
New H3C Big Data Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2021-09-28
Filing date
2021-09-28
Publication date
2022-01-28
Application filed by New H3C Big Data Technologies Co Ltd
Priority to CN202111141991.1A
Publication of CN113986610A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14 Error detection or correction of the data by redundancy in operation
    • G06F11/1402 Saving, restoring, recovering or retrying
    • G06F11/1471 Saving, restoring, recovering or retrying involving logging of persistent data for recovery

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The present specification provides a disaster recovery system, a method for implementing disaster recovery of service devices, and a management device. The disaster recovery system includes a disaster recovery business layer, which in turn includes a disaster recovery service unit, a task center unit, and a scheduling center unit. The system provides an architecture for flexibly docking disaster recovery services with service devices.

Description

Disaster tolerance system, method for realizing disaster tolerance of business equipment and management equipment
Technical Field
The present disclosure relates to the field of communications technologies, and in particular, to a disaster recovery system, a method for implementing disaster recovery of a service device, and a management device.
Background
With the development of microservices and deployment management platforms, more and more products can be deployed on a unified platform, and a disaster recovery system serves as the basic disaster recovery solution provided by the platform. At the user level, different users may differ in the sensitivity and granularity with which they expect the disaster recovery service to detect faults, in how aggressively disaster recovery switching should be performed, and so on. The disaster recovery system therefore needs a flexible service docking architecture.
Disclosure of Invention
The present disclosure provides a disaster recovery system, a method for implementing disaster recovery of service devices, and a management device. The disaster recovery system provides an architecture for flexibly docking disaster recovery services with service devices.
The present disclosure provides a disaster recovery system comprising a disaster recovery business layer, the disaster recovery business layer further comprising a disaster recovery service unit, a task center unit, and a scheduling center unit;
the disaster recovery service unit is configured to receive a disaster recovery instruction sent by a user and to determine, according to the disaster recovery instruction, the disaster recovery module to be called;
the task center unit is configured to retrieve the corresponding target service task from its task list according to the disaster recovery module and to send the target service task to the scheduling center unit;
and the scheduling center unit is configured to call back the corresponding service device, according to the target service task, to execute the target service task.
Optionally, the disaster recovery system further includes a user layer connected to the disaster recovery service unit of the disaster recovery business layer;
the user layer is configured to receive an operation instruction from a user, convert the operation instruction into a disaster recovery instruction, and send the disaster recovery instruction to the disaster recovery service unit.
Optionally, the task center unit further includes an interface module;
the interface module is configured to connect to each service device and to enable communication between the disaster recovery system and each service device.
Optionally, the task list includes one or more list items, each list item includes one or more service tasks, and each service task includes at least a corresponding service device identifier.
Optionally, the system is further configured to: receive, through the interface module of the task center unit, a registration request sent by a first service device;
obtain, from the registration request, a first service device identifier and a first service task supported by the first service device;
and determine the corresponding list item according to the first service task, and store the first service task and the first service device identifier in that list item.
With this system, the scheduling center unit can call back the corresponding service device according to the user's disaster recovery instruction, thereby implementing a flexible disaster recovery function.
The present disclosure also provides a method for implementing disaster recovery of service devices. The method is applied to a management device in which a disaster recovery system is configured, the disaster recovery system comprising a disaster recovery business layer, and the disaster recovery business layer further comprising a disaster recovery service unit, a task center unit, and a scheduling center unit. The method comprises the following steps:
receiving a first disaster recovery instruction input by a user and sending it to the disaster recovery service unit, so that the disaster recovery service unit determines, according to the first disaster recovery instruction, a first disaster recovery module to be called;
the task center unit retrieving, from a task list, the target service task corresponding to the first disaster recovery module determined by the disaster recovery service unit, and sending the target service task to the scheduling center unit;
and the scheduling center unit calling back, according to the target service task, the corresponding service device to execute the target service task.
Optionally, receiving the first disaster recovery instruction input by the user comprises:
receiving, through a user layer, the first disaster recovery instruction input by the user, and sending the first disaster recovery instruction to the disaster recovery service unit.
Optionally, the scheduling center unit calling back, according to the target service task, the corresponding service device to execute the target service task comprises:
the scheduling center unit sending a callback instruction to the corresponding service device through a scheduling interface, so that the corresponding service device executes the target service task.
The present disclosure also provides a management device, which includes: a memory, a processor, and a program stored in the memory and executable on the processor, where the program, when executed by the processor, implements the steps of any of the embodiments described above.
The present disclosure also provides a computer-readable storage medium having a program stored thereon, which when executed by a processor, performs the steps of any of the embodiments described above.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present specification and together with the description, serve to explain the principles of the specification.
Fig. 1 is a logic diagram of a disaster recovery system according to an embodiment of the present disclosure.
Fig. 2 is a logic diagram of a method for implementing disaster tolerance of a service device according to an embodiment of the present disclosure.
Fig. 3 is a logic diagram of a method for implementing disaster tolerance of a service device according to an embodiment of the present disclosure.
Detailed Description
Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The embodiments described in the following exemplary embodiments do not represent all embodiments consistent with the present specification. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the specification, as detailed in the appended claims.
The terminology used in the description herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the description. As used in this specification and the appended claims, the singular forms "a", "an", and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. It should also be understood that the term "and/or" as used herein refers to and encompasses any and all possible combinations of one or more of the associated listed items.
It should be understood that although the terms first, second, third, etc. may be used herein to describe various information, such information should not be limited by these terms. These terms are only used to distinguish one type of information from another. For example, the first information may also be referred to as second information, and similarly, the second information may also be referred to as first information, without departing from the scope of the present specification. The word "if" as used herein may be interpreted as "at … …" or "when … …" or "in response to a determination", depending on the context.
As shown in fig. 1, the present disclosure provides a disaster recovery system comprising a disaster recovery business layer, the disaster recovery business layer further comprising a disaster recovery service unit, a task center unit, and a scheduling center unit;
the disaster recovery service unit is configured to receive a disaster recovery instruction sent by a user and to determine, according to the disaster recovery instruction, the disaster recovery module to be called;
the task center unit is configured to retrieve the corresponding target service task from its task list according to the disaster recovery module and to send the target service task to the scheduling center unit;
and the scheduling center unit is configured to call back the corresponding service device, according to the target service task, to execute the target service task.
In this embodiment, the disaster recovery system may further include a user layer, which handles management interaction with the user through a management interface. For example, the user layer obtains a disaster recovery instruction input by the user through the management interface.
Generally, disaster recovery instructions include instructions for disaster recovery operations, queries, and the like, for example: an active/standby switching instruction, a status query instruction, and a custom display instruction.
The user layer sends the disaster recovery instruction input by the user to the disaster recovery service unit, so that the disaster recovery service unit determines the disaster recovery module to be called according to the instruction.
Generally, the disaster recovery service unit may include the following disaster recovery modules: an active/standby switching module, a cold standby recovery module, a fault query module, and a content acquisition module. Note that this embodiment lists only the modules related to disaster recovery; in other embodiments, the disaster recovery service unit may include further modules.
In this embodiment, after the disaster recovery service unit receives a disaster recovery instruction from the user layer, it matches the corresponding module according to the content of the instruction. For example, when the disaster recovery service unit receives an active/standby switching instruction from the user layer, it calls the active/standby switching module and the cold standby recovery module. As another example, when the disaster recovery service unit receives a status query instruction from the user layer, it calls the fault detection module.
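By way of illustration only, the matching step performed by the disaster recovery service unit might be sketched in Python as follows. The instruction strings, the `Module` enumeration, and `match_modules` are hypothetical names that do not appear in the disclosure. The sketch also assumes that the status query instruction maps to the fault query module listed above, although the example in the text names a fault detection module.

```python
from __future__ import annotations

from enum import Enum


class Module(Enum):
    """Disaster recovery modules listed in the description."""
    ACTIVE_STANDBY_SWITCH = "active/standby switching"
    COLD_STANDBY_RECOVERY = "cold standby recovery"
    FAULT_QUERY = "fault query"
    CONTENT_ACQUISITION = "content acquisition"


# Hypothetical instruction names. The two mappings follow the examples in the
# text; mapping "status_query" to FAULT_QUERY is an assumption.
INSTRUCTION_TO_MODULES = {
    "active_standby_switch": [Module.ACTIVE_STANDBY_SWITCH,
                              Module.COLD_STANDBY_RECOVERY],
    "status_query": [Module.FAULT_QUERY],
}


def match_modules(instruction: str) -> list[Module]:
    """Return the modules to call for an instruction; reject unknown ones."""
    try:
        return INSTRUCTION_TO_MODULES[instruction]
    except KeyError:
        raise ValueError(
            f"unsupported disaster recovery instruction: {instruction!r}"
        ) from None
```

A table-driven mapping of this kind keeps the set of supported instructions in one place, so adding a module does not require touching the matching logic.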
In this embodiment, after the disaster recovery service unit determines the disaster recovery module to be called, the target service task corresponding to that module can be retrieved from the task center unit.
In this embodiment, the task center unit may include a task list that stores one or more list items. Each list item may include one or more service tasks, each service task carries a corresponding service device identifier, and each list item corresponds to a disaster recovery module in the disaster recovery service unit. For example, the list items may include: an active/standby switching item, a cold standby recovery item, a status monitoring item, a key service item, a custom display item, an extensible item, etc. (this is only an example; in practice there may be more or fewer list items than those listed here).
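The task list described above could be modelled, as one possible sketch, by a mapping from list item names to the service tasks stored under them. The class names `ServiceTask` and `TaskList`, their fields, and the task names used below are assumptions made for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass(frozen=True)
class ServiceTask:
    name: str       # hypothetical task name
    device_id: str  # service device identifier carried by the task


@dataclass
class TaskList:
    # list item name -> service tasks stored under that list item
    items: dict[str, list[ServiceTask]] = field(default_factory=dict)

    def add(self, item: str, task: ServiceTask) -> None:
        """Store a task under a list item, creating the item if needed."""
        self.items.setdefault(item, []).append(task)

    def tasks_for(self, item: str, device_id: str) -> list[ServiceTask]:
        """Return the tasks in a list item registered for one service device."""
        return [t for t in self.items.get(item, []) if t.device_id == device_id]


task_list = TaskList()
task_list.add("active_standby_switch", ServiceTask("promote_standby", "1"))
task_list.add("active_standby_switch", ServiceTask("promote_standby", "2"))
selected = task_list.tasks_for("active_standby_switch", "1")
```

Here `selected` contains only the task registered for service device 1, which mirrors how a target service task is picked by list item and service device identifier.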
In practice, after receiving a disaster recovery instruction sent by a user, the disaster recovery service unit can obtain from the instruction the identifier of the service device that needs to execute the disaster recovery action. The task center unit can then determine the corresponding list item in the task list according to the disaster recovery instruction, and determine the target service task within that list item according to the service device identifier carried in the instruction.
In this embodiment, the task center unit may further include an interface module, which is connected to each service device and receives the registration messages sent by the service devices. For example, when a first service device accesses the disaster recovery system, it may send, through the interface module, a first service task that it supports together with its first service device identifier. On receiving the first service task, the task center unit determines the list item to which the task belongs and stores the first service task and the first service device identifier in that list item.
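The registration path might look like the following minimal sketch. The request layout (`device_id`, `task`, `type`, `name`), the `TASK_TYPE_TO_ITEM` table, and the fallback to the extensible item for unrecognized task types are all assumptions; the disclosure does not specify how a list item is derived from a service task.

```python
from collections import defaultdict

# Hypothetical mapping from a task's type to the list item it belongs to.
TASK_TYPE_TO_ITEM = {
    "switch": "active_standby_switch",
    "restore": "cold_standby_recovery",
    "monitor": "status_monitoring",
}


class TaskCenter:
    def __init__(self) -> None:
        # list item name -> registered tasks, each with its device identifier
        self.task_list = defaultdict(list)

    def register(self, request: dict) -> str:
        """Handle a registration request and return the list item used."""
        device_id = request["device_id"]
        task = request["task"]
        # Assumption: unknown task types are filed under the extensible item.
        item = TASK_TYPE_TO_ITEM.get(task["type"], "extensible")
        self.task_list[item].append(
            {"name": task["name"], "device_id": device_id}
        )
        return item


center = TaskCenter()
item = center.register(
    {"device_id": "1", "task": {"type": "switch", "name": "promote_standby"}}
)
```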
In this embodiment, the disaster recovery service unit and the task center unit may interact directly, or their interaction may be relayed through the scheduling center unit.
In this embodiment, the scheduling center unit is connected to each service device through a scheduling port. After the task center unit determines the target service task to be called, the scheduling center unit calls back the corresponding target service device, according to the target service device identifier carried in the target service task, to execute the target service task.
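The callback step can be illustrated with a small sketch in which each connected service device is represented by a Python callable. The `SchedulingCenter` class, its `connect` and `dispatch` methods, and the task dictionary layout are hypothetical; a real system would presumably issue the callback over a network interface.

```python
from typing import Callable, Dict

# A callback takes the task name and returns a result string.
Callback = Callable[[str], str]


class SchedulingCenter:
    def __init__(self) -> None:
        # service device identifier -> callback standing in for the device
        self._ports: Dict[str, Callback] = {}

    def connect(self, device_id: str, callback: Callback) -> None:
        """Attach a service device to the scheduling center."""
        self._ports[device_id] = callback

    def dispatch(self, task: dict) -> str:
        """Call back the device named in the task so that it executes it."""
        device_id = task["device_id"]
        callback = self._ports.get(device_id)
        if callback is None:
            raise LookupError(
                f"no service device connected with identifier {device_id!r}"
            )
        return callback(task["name"])


center = SchedulingCenter()
center.connect("1", lambda name: f"device 1 ran {name}")
outcome = center.dispatch({"name": "promote_standby", "device_id": "1"})
```

Because the scheduling center only looks up a device identifier and invokes a callback, it needs no knowledge of what the task does on the device.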
It should be further noted that the service device in this embodiment may be a network device having switching and processing functions, such as a router, a switch, and a server in a network.
As the above embodiment shows, the disaster recovery system divides its disaster recovery services into task types, so that other service components can register their own custom tasks with the disaster recovery system according to its task arrangement structure. Each service can thereby customize the individual function points of the disaster recovery system, and the disaster recovery system can be tailored to the service tasks.
In addition, this architecture decouples the disaster recovery system from the services: the system does not depend on any particular service and is concerned mainly with task management and scheduling.
An embodiment of the present disclosure further provides a method for implementing disaster recovery of a service device, as shown in fig. 2. The method is applied to a management device in which a disaster recovery system is configured, the disaster recovery system comprising a disaster recovery business layer, and the disaster recovery business layer further comprising a disaster recovery service unit, a task center unit, and a scheduling center unit. The method comprises the following steps:
S201, receiving a first disaster recovery instruction input by a user and sending it to the disaster recovery service unit, so that the disaster recovery service unit determines, according to the first disaster recovery instruction, a first disaster recovery module to be called;
S202, the task center unit retrieves, from a task list, the target service task corresponding to the first disaster recovery module determined by the disaster recovery service unit, and sends the target service task to the scheduling center unit;
S203, the scheduling center unit calls back, according to the target service task, the corresponding service device to execute the target service task.
Receiving the first disaster recovery instruction input by the user comprises: receiving, through a user layer, the first disaster recovery instruction input by the user, and sending the first disaster recovery instruction to the disaster recovery service unit.
The scheduling center unit calling back, according to the target service task, the corresponding service device to execute the target service task comprises:
the scheduling center unit sending a callback instruction to the corresponding service device through a scheduling interface, so that the corresponding service device executes the target service task.
To explain the scheme in the present disclosure in detail, the present disclosure further provides an embodiment of a method for implementing disaster recovery of a service device, as shown in fig. 3.
The user inputs a disaster recovery instruction through the user interface of the user layer, for example an active/standby switching instruction carrying service device identifier 1.
The disaster recovery service unit of the disaster recovery business layer receives the active/standby switching instruction from the user layer and, according to that instruction, determines the disaster recovery modules to be called: the active/standby switching module and the cold standby recovery module.
The disaster recovery service unit sends the active/standby switching module and the cold standby recovery module to be called, together with service device identifier 1, to the scheduling center unit.
According to this information, the scheduling center unit retrieves the service 1 tasks from the active/standby switching list item and the cold standby recovery list item in the task center unit (a service 1 task is a task to be executed by the service device whose identifier is 1).
The scheduling center unit then calls back service device 1 with the service 1 tasks from the active/standby switching list item and the cold standby recovery list item, and service device 1 executes them, completing the disaster recovery task.
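The flow of fig. 3 can be traced end to end with a compact sketch. Everything named here (the function `run_disaster_recovery`, the dictionary layouts, the task names `promote_standby` and `restore_backup`) is assumed for illustration, and list items are assumed to be keyed by the name of the module they correspond to.

```python
def run_disaster_recovery(instruction, device_id, modules_by_instruction,
                          task_list, devices):
    """Walk one instruction through the three units; return callback results."""
    # Disaster recovery service unit: pick the modules for the instruction.
    modules = modules_by_instruction[instruction]
    results = []
    for module in modules:
        # Task center unit: look in the list item for this module and keep
        # only the tasks registered for the requested service device.
        for task_name, task_device in task_list.get(module, []):
            if task_device == device_id:
                # Scheduling center unit: call back the device to run the task.
                results.append(devices[device_id](task_name))
    return results


modules_by_instruction = {
    "active_standby_switch": ["active_standby_switch", "cold_standby_recovery"],
}
task_list = {
    "active_standby_switch": [("promote_standby", "1"), ("promote_standby", "2")],
    "cold_standby_recovery": [("restore_backup", "1")],
}
devices = {
    "1": lambda task: f"device 1 ran {task}",
    "2": lambda task: f"device 2 ran {task}",
}

results = run_disaster_recovery(
    "active_standby_switch", "1", modules_by_instruction, task_list, devices
)
print(results)  # ['device 1 ran promote_standby', 'device 1 ran restore_backup']
```

Only the tasks carrying identifier 1 are executed, even though service device 2 has also registered a task under the active/standby switching item.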
As can be seen from the above embodiments, the disaster recovery system provided in this embodiment arranges and executes disaster recovery tasks and turns the disaster recovery services it provides into tasks, so that complex and diverse services can be managed uniformly as tasks, which simplifies the disaster recovery services.
Based on the foregoing embodiments, an embodiment of the present disclosure further provides a management device, where the management device includes: a memory, a processor, and a program stored in the memory and executable on the processor, where the program, when executed by the processor, implements the steps of any of the embodiments described above.
The present disclosure also provides a computer-readable storage medium having a program stored thereon, which when executed by a processor, performs the steps of any of the embodiments described above.
The foregoing description has been directed to specific embodiments of this disclosure. Other embodiments are within the scope of the following claims. In some cases, the actions or steps recited in the claims may be performed in a different order than in the embodiments and still achieve desirable results. In addition, the processes depicted in the accompanying figures do not necessarily require the particular order shown, or sequential order, to achieve desirable results. In some embodiments, multitasking and parallel processing may also be possible or may be advantageous.
Other embodiments of the present description will be apparent to those skilled in the art from consideration of the specification and practice of the invention disclosed herein. This specification is intended to cover any variations, uses, or adaptations of the specification following, in general, the principles of the specification and including such departures from the present disclosure as come within known or customary practice within the art to which the specification pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the specification being indicated by the following claims.
It will be understood that the present description is not limited to the precise arrangements described above and shown in the drawings, and that various modifications and changes may be made without departing from the scope thereof. The scope of the present description is limited only by the appended claims.
The above description is only a preferred embodiment of the present disclosure, and should not be taken as limiting the present disclosure, and any modifications, equivalents, improvements, etc. made within the spirit and principle of the present disclosure should be included in the scope of the present disclosure.

Claims (10)

1. A disaster recovery system, the system comprising a disaster recovery business layer, the disaster recovery business layer further comprising a disaster recovery service unit, a task center unit, and a scheduling center unit;
the disaster recovery service unit is configured to receive a disaster recovery instruction sent by a user and to determine, according to the disaster recovery instruction, the disaster recovery module to be called;
the task center unit is configured to retrieve the corresponding target service task from its task list according to the disaster recovery module and to send the target service task to the scheduling center unit;
and the scheduling center unit is configured to call back the corresponding service device, according to the target service task, to execute the target service task.
2. The system of claim 1, wherein the disaster recovery system further comprises a user layer connected to the disaster recovery service unit of the disaster recovery business layer;
the user layer is configured to receive an operation instruction from a user, convert the operation instruction into a disaster recovery instruction, and send the disaster recovery instruction to the disaster recovery service unit.
3. The system of claim 1, wherein the task center unit further comprises an interface module;
the interface module is configured to connect to each service device and to enable communication between the disaster recovery system and each service device.
4. The system of claim 1, wherein the task list comprises one or more list items, each of the list items comprises one or more service tasks, and each of the service tasks comprises at least a corresponding service device identifier.
5. The system of claim 4, wherein the system is further configured to:
receive, through an interface module of the task center unit, a registration request sent by a first service device;
obtain, from the registration request, a first service device identifier and a first service task supported by the first service device;
and determine the corresponding list item according to the first service task, and store the first service task and the first service device identifier in that list item.
6. A method for implementing disaster recovery of service devices, characterized in that the method is applied to a management device in which a disaster recovery system is configured, the disaster recovery system comprising a disaster recovery business layer, and the disaster recovery business layer further comprising a disaster recovery service unit, a task center unit, and a scheduling center unit, the method comprising the following steps:
receiving a first disaster recovery instruction input by a user and sending it to the disaster recovery service unit, so that the disaster recovery service unit determines, according to the first disaster recovery instruction, a first disaster recovery module to be called;
the task center unit retrieving, from a task list, the target service task corresponding to the first disaster recovery module determined by the disaster recovery service unit, and sending the target service task to the scheduling center unit;
and the scheduling center unit calling back, according to the target service task, the corresponding service device to execute the target service task.
7. The method of claim 6, wherein receiving the first disaster recovery instruction input by the user comprises:
receiving, through a user layer, the first disaster recovery instruction input by the user, and sending the first disaster recovery instruction to the disaster recovery service unit.
8. The method according to claim 6, wherein the scheduling center unit calling back, according to the target service task, the corresponding service device to execute the target service task comprises:
the scheduling center unit sending a callback instruction to the corresponding service device through a scheduling interface, so that the corresponding service device executes the target service task.
9. A management device, characterized in that the management device comprises: memory, a processor and a program stored on the memory and executable on the processor, which when executed by the processor implements the steps of any of claims 1 to 8.
10. A computer-readable storage medium, characterized in that the computer-readable storage medium has stored thereon a program which, when being executed by a processor, carries out the steps of any one of claims 1 to 8.
CN202111141991.1A 2021-09-28 2021-09-28 Disaster tolerance system, method for realizing disaster tolerance of business equipment and management equipment Pending CN113986610A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111141991.1A CN113986610A (en) 2021-09-28 2021-09-28 Disaster tolerance system, method for realizing disaster tolerance of business equipment and management equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111141991.1A CN113986610A (en) 2021-09-28 2021-09-28 Disaster tolerance system, method for realizing disaster tolerance of business equipment and management equipment

Publications (1)

Publication Number Publication Date
CN113986610A true CN113986610A (en) 2022-01-28

Family

ID=79737022

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111141991.1A Pending CN113986610A (en) 2021-09-28 2021-09-28 Disaster tolerance system, method for realizing disaster tolerance of business equipment and management equipment

Country Status (1)

Country Link
CN (1) CN113986610A (en)

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination