CN116089197A - Fault equipment switching exercise method, device, equipment and storage medium - Google Patents

Fault equipment switching exercise method, device, equipment and storage medium Download PDF

Info

Publication number
CN116089197A
CN116089197A CN202310286985.8A CN202310286985A CN116089197A CN 116089197 A CN116089197 A CN 116089197A CN 202310286985 A CN202310286985 A CN 202310286985A CN 116089197 A CN116089197 A CN 116089197A
Authority
CN
China
Prior art keywords
fault
equipment
target system
switching
template
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310286985.8A
Other languages
Chinese (zh)
Inventor
李爱平
杜俊
成鲲
敖浩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Agricultural Bank of China
Original Assignee
Agricultural Bank of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Agricultural Bank of China filed Critical Agricultural Bank of China
Priority to CN202310286985.8A priority Critical patent/CN116089197A/en
Publication of CN116089197A publication Critical patent/CN116089197A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/22Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/2273Test methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/202Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
    • G06F11/2023Failover techniques
    • G06F11/203Failover techniques using migration
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/22Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/26Functional testing
    • G06F11/263Generation of test inputs, e.g. test vectors, patterns or sequences ; with adaptation of the tested hardware for testability with external testers
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/32Monitoring with visual or acoustical indication of the functioning of the machine
    • G06F11/324Display of status information
    • G06F11/327Alarm or error message display
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/02Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Hardware Design (AREA)
  • Hardware Redundancy (AREA)

Abstract

The invention discloses a fault equipment switching exercise method, a fault equipment switching exercise device, fault equipment switching exercise equipment and a storage medium. The method comprises the following steps: receiving a fault parameter configuration operation input for a device fault template, and generating a fault drilling template based on the fault parameter configuration operation; responding to a device switching exercise instruction, and performing fault injection on target system devices based on the fault exercise template; acquiring fault alarm information corresponding to target system equipment based on a fault detection index, and performing equipment switching processing on the target system equipment based on the fault alarm information; and under the condition of eliminating the faults of the target system equipment, calling a preset back-cut script to carry out equipment back-cut processing on the target system equipment so as to enable the target system equipment to normally operate, thereby improving the accuracy of equipment switching and reducing the equipment switching time and the data loss risk.

Description

Fault equipment switching exercise method, device, equipment and storage medium
Technical Field
The present invention relates to the field of device emergency switching technologies, and in particular, to a fault device switching exercise method, apparatus, device, and storage medium.
Background
Under the wave of digital transformation, enterprises enjoy convenient service and convenient management brought by the digital transformation, and meanwhile, risks of downtime and data loss of the system caused by accidents (such as fire, earthquake, server fault, hard disk damage and the like) are also faced.
The traditional unexpected fault exercise is to arrange system-level, park-level and regional-level disaster-tolerant switching treatment scripts, directly start from executing the switching treatment scripts, verify whether the switching treatment flow accords with expectations, judge whether the switching treatment duration and the data loss accord with expectations, and execute the switching treatment scripts manually.
However, the traditional unexpected fault exercise adopts a device switching method fixed by a switching treatment script to switch devices, so that the capabilities of monitoring alarm, fault diagnosis or positioning and the like cannot be effectively verified, the capability of accurately switching devices in a real scene cannot be verified, the problems of longer switching time of actual devices and larger risk of data loss exist, and the switching requirement of an information system in the actual scene cannot be met.
Disclosure of Invention
The invention provides a fault equipment switching exercise method, a fault equipment switching exercise device, fault equipment switching exercise equipment and a storage medium, so that the accuracy of equipment switching during unexpected fault exercise is improved, and the equipment exercise switching time and the data loss risk are reduced.
According to an aspect of the present invention, there is provided a fault device switching exercise method. The method comprises the following steps:
receiving a fault parameter configuration operation input for a device fault template, and generating a fault drilling template based on the fault parameter configuration operation;
responding to the device switching exercise instruction, and carrying out fault injection on the target system device based on the fault exercise template;
acquiring fault alarm information corresponding to target system equipment based on a fault detection index, and performing equipment switching processing on the target system equipment based on the fault alarm information;
and under the condition of eliminating the fault of the target system equipment, calling a preset back cut script to carry out equipment back cut processing on the target system equipment so as to enable the target system equipment to normally operate.
According to another aspect of the present invention, there is provided a fault device switching exercise apparatus, the apparatus comprising:
the fault drilling template generation module is used for receiving fault parameter configuration operation input for the equipment fault template and generating a fault drilling template based on the fault parameter configuration operation;
the fault drilling template injection module is used for responding to the device switching drilling instruction and performing fault injection on the target system device based on the fault drilling template;
the fault equipment switching processing module is used for acquiring fault alarm information corresponding to target system equipment based on the fault detection index and performing equipment switching processing on the target system equipment based on the fault alarm information;
and the fault equipment back-cut processing module is used for calling a preset back-cut script to carry out equipment back-cut processing on the target system equipment under the condition of eliminating the fault of the target system equipment so as to enable the target system equipment to normally operate.
According to another aspect of the present invention, there is provided an electronic apparatus including:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein,,
the memory stores a computer program executable by the at least one processor to enable the at least one processor to perform the method of failsafe device switching exercise of any embodiment of the invention.
According to another aspect of the present invention, there is provided a computer readable storage medium storing computer instructions for causing a processor to implement the fault device switching exercise method according to any one of the embodiments of the present invention when executed.
According to the technical scheme, the fault drilling template is generated based on the fault parameter configuration operation by receiving the fault parameter configuration operation input for the equipment fault template; responding to a device switching exercise instruction, and performing fault injection on target system devices based on the fault exercise template; acquiring fault alarm information corresponding to target system equipment based on a fault detection index, and performing equipment switching processing on the target system equipment based on the fault alarm information; under the condition of eliminating faults of the target system equipment, a preset back-cut script is called to carry out equipment back-cut processing on the target system equipment so that the target system equipment can normally operate, the problem that the traditional fault switching exercise cannot effectively verify the capabilities of monitoring alarm, fault diagnosis or positioning and the like is solved, the equipment switching accuracy during accidental fault exercise can be effectively improved, and the equipment exercise switching duration and the data loss risk are reduced.
It should be understood that the description in this section is not intended to identify key or critical features of the embodiments of the invention or to delineate the scope of the invention. Other features of the present invention will become apparent from the description that follows.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings required for the description of the embodiments will be briefly described below, and it is apparent that the drawings in the following description are only some embodiments of the present invention, and other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
Fig. 1 is a flowchart of a fault device switching exercise method according to a first embodiment of the present invention;
fig. 2 is a flowchart of a fault device switching exercise method according to a second embodiment of the present invention;
fig. 3 is a block diagram of a fault equipment switching exercise device according to a third embodiment of the present invention;
fig. 4 is a schematic structural diagram of an electronic device implementing the failure device switching exercise method according to the embodiment of the present invention.
Detailed Description
In order that those skilled in the art will better understand the present invention, a technical solution in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in which it is apparent that the described embodiments are only some embodiments of the present invention, not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the present invention without making any inventive effort, shall fall within the scope of the present invention.
It should be noted that the terms "first," "second," and the like in the description and the claims of the present invention and the above figures are used for distinguishing between similar objects and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used may be interchanged where appropriate such that the embodiments of the invention described herein may be implemented in sequences other than those illustrated or otherwise described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
Example 1
Fig. 1 is a flowchart of a method for performing a device switching operation on a faulty device according to an embodiment of the present invention, where the method may be performed by a faulty device switching operation device, and the faulty device switching operation device may be implemented in hardware and/or software, and the faulty device switching operation device may be configured in an electronic device. As shown in fig. 1, the method includes:
s101, receiving a fault parameter configuration operation input for a device fault template, and generating a fault drilling template based on the fault parameter configuration operation.
The invention creates a device fault scene based on chaotic engineering. Various fault scene templates are stored in a fault scene library of the chaotic platform so as to simulate equipment faults generated under the actual scene of faults or disasters.
The equipment fault template can be obtained from chaotic engineering. The fault simulation parameters common to various devices are recorded in the device fault template. The fault drill template may refer to a device fault template that is parameter configured for use with the device to be drill.
Specifically, a fault parameter configuration operation input for the equipment fault template is received, parameter configuration is carried out on the downloaded equipment fault template, and a fault drilling template is generated, so that the fault drilling template can be suitable for equipment to be drilled.
S102, responding to the device switching exercise instruction, and performing fault injection on the target system device based on the fault exercise template.
The device switching exercise instruction may refer to an exercise start instruction. The target system device may refer to a malfunctioning device to be exercised.
Specifically, responding to a device switching exercise instruction sent by a user, and performing fault injection on the target system device according to the fault exercise template so as to enable the target system device to generate a device fault corresponding to the fault exercise template.
S103, acquiring fault alarm information corresponding to target system equipment based on the fault detection index, and performing equipment switching processing on the target system equipment based on the fault alarm information.
The fault detection index may refer to index data for judging that the target system device fails. Illustratively, the fault detection indicators may include, but are not limited to, response time, success rate, and throughput. The fault alert information may refer to alert information that detects a device fault in the target system device. Illustratively, the fault alert information may include, but is not limited to, system number information, server address information, monitoring index information, affiliated center information, system type information, and the like. The target system devices can be divided into single system devices, system group devices, park-level devices, regional-level devices and the like according to the belonging center information. Illustratively, the fault alert information may be as shown in table one:
Figure BDA0004140140810000051
/>
Figure BDA0004140140810000061
list one
The fault major class, the fault sub-class and the detection index information of the target system equipment are determined according to the fault detection index corresponding to the target system equipment, and then fault alarm information corresponding to the target system equipment is generated. And determining a device switching scheme corresponding to the target system device based on the fault alarm information, and further performing device switching processing on the target system device to realize targeted device switching processing and reduce the device switching cost.
Illustratively, the performing device switching processing on the target system device based on the fault alarm information includes: determining fault handling information corresponding to the fault alarm information based on a fault handling matrix, wherein the fault handling matrix at least comprises switching threshold conditions corresponding to each type of equipment fault; and performing device switching processing on the target system device based on the fault handling information.
The fault handling matrix may include a fault class, a fault subclass, monitoring index information, a monitoring index, system type information, a fault attribute, a switching type, and a switching threshold condition corresponding to each type of device fault. Illustratively, the fault handling matrix is as shown in table two:
field name Chinese name Sample example
DisClass General class of faults System class
DisSubClass Failure subclass CPU
MonIndic Monitoring index information Response time |success rate|throughput
Threshold_1 System level handover threshold condition Null|20%|Null
Threshold_2 Park level handover threshold condition Null|40%|Null
Threshold_3 Zone level switching threshold condition Null|80%|Null
SysKind System type information In general
Attribute Fault attributes Application layer
ChangeKind Handover type Automation of
Specifically, the fault handling matrix is compared with the fault alarm information, so that fault handling information corresponding to the fault alarm information can be determined. And performing device switching processing on the target system device based on the fault handling information.
Illustratively, the determining, based on the fault handling matrix, fault handling information corresponding to the fault alert information includes: determining the equipment fault type based on the fault alarm information; determining a switching threshold condition corresponding to the equipment fault based on the type of the equipment fault and a fault handling matrix; and determining the fault handling information according to the fault alarm information and the switching threshold condition.
The equipment fault types can include single system class faults, system group class faults, park class faults and regional class faults. The fault handling information may be a switching type for device switching for the faulty device. The fault handling information may be specifically traded based on a handover threshold condition. Preferably, the fault handling information is divided into an overall handover and a partial handover as a whole.
Specifically, the equipment fault type of the target system equipment is determined according to the central information in the fault alarm information. And comparing in a fault handling matrix according to the fault type, and determining a switching threshold condition corresponding to the equipment fault. And carrying out matching processing according to the monitoring index information in the fault alarm information and the switching threshold condition, and determining fault handling information corresponding to the target system equipment. When the fault alarm information does not reach the switching threshold condition, specific fault position information of the target system equipment is determined, and a specific fault part of the target system equipment is subjected to switching processing. And when the fault alarm information reaches a switching threshold condition, switching the whole target system equipment.
Illustratively, the performing device switching processing on the target system device based on the fault handling information includes: determining a switching type corresponding to the target system equipment based on the fault handling information; and performing equipment switching processing on the target system equipment based on the switching type.
The handover types may include single system handover, system group handover, park handover, and zone handover, among others. Specifically, determining a switching type corresponding to the target system equipment based on the fault handling information; and performing device switching processing on the target system device based on the fault handling information when the switching type of the target system device is single system switching, system group switching or park switching. Under the condition that the switching type of the target system equipment is regional switching, equipment switching request information is sent to a user, equipment switching processing is carried out on the target system equipment based on fault handling information according to the determined switching information fed back by the user, and therefore whether regional equipment switching is carried out or not can be judged based on manual actual conditions, rationalization of equipment switching is achieved, and huge cost loss caused by blind switching is avoided.
It should be noted that, after the device switching process is performed on the target system device, the method further includes: and performing equipment checking and verifying processing on the switching equipment so as to enable the switching equipment to normally operate.
The switching device may refer to a new device that replaces the target system device to operate after performing device switching processing on the target system device, or may refer to a standby device corresponding to the target system device. For example, the device check verification may include a system level health check, an application level health check, and a business level availability verification. Specifically, after the device switching process is performed on the target system device, the device checking and verifying process is further performed on the switching device, so that the switching device is ensured to normally operate, and each service work is recovered to be normal.
S104, under the condition that the fault of the target system equipment is eliminated, calling a preset back cut script to carry out equipment back cut processing on the target system equipment so as to enable the target system equipment to normally operate.
Specifically, after the target system equipment performs the equipment switching process, equipment maintenance is required to be performed on the target system equipment, and under the condition that the fault of the target system equipment is eliminated, according to a preset back-cut script, automatic equipment back-cut processing can be performed on the target system equipment, so that the target system equipment performs service operation processing.
For example, after performing the device switching process on the target system device, it is further required to perform a device check and verification process on the target system device, so as to ensure that the target system device operates normally, so as to restore each service to be normal.
According to the technical scheme, the fault drilling template is generated based on the fault parameter configuration operation by receiving the fault parameter configuration operation input for the equipment fault template; responding to a device switching exercise instruction, and performing fault injection on target system devices based on the fault exercise template; acquiring fault alarm information corresponding to target system equipment based on a fault detection index, and performing equipment switching processing on the target system equipment based on the fault alarm information; under the condition of eliminating faults of the target system equipment, a preset back-cut script is called to carry out equipment back-cut processing on the target system equipment so that the target system equipment can normally operate, the problem that the traditional fault switching exercise cannot effectively verify the capabilities of monitoring alarm, fault diagnosis or positioning and the like is solved, the equipment switching accuracy during accidental fault exercise can be effectively improved, and the equipment exercise switching duration and the data loss risk are reduced.
Example two
Fig. 2 is a flowchart of a fault equipment switching exercise method according to a second embodiment of the present invention, where, based on the foregoing embodiments, fault injection is further refined for a target system device based on the fault exercise template. As shown in fig. 2, the method includes:
s201, receiving a fault parameter configuration operation input for a device fault template, and generating a fault drilling template based on the fault parameter configuration operation.
S202, responding to the device switching exercise instruction, and generating a target system device fault script according to the fault exercise template.
The target system device fault script may refer to an automatic operation script that causes the target system device to generate a device fault.
Specifically, after the relevant parameter configuration is completed on the equipment fault template based on the client, trigger operation is generated based on the script, and a target system equipment fault script is generated according to the relevant configuration parameters of the equipment fault template. For example, if the specific usage rate fault is specific to the CPU, and if the specific usage rate of the CPU of a database server reaches 95% in the drilling scenario, the script generation module may be invoked, and the customized fault script of the target system device may be formed by simply configuring the system number, the machine IP address, the CPU usage rate, and other parameter information of the target system.
S203, transmitting the fault script of the target system equipment to the target system equipment so as to perform fault injection on the target system equipment based on the fault script of the target system equipment.
Specifically, the fault script of the target system equipment is transmitted to the target system equipment, so that the technical effect of fault injection of the fault script of the target system equipment on the target system equipment is achieved, and the target system equipment generates equipment faults corresponding to the equipment fault templates.
S204, acquiring fault alarm information corresponding to the target system equipment based on the fault detection index, and performing equipment switching processing on the target system equipment based on the fault alarm information.
S205, under the condition that the fault of the target system equipment is eliminated, calling a preset back cut script to carry out equipment back cut processing on the target system equipment so as to enable the target system equipment to normally operate.
Illustratively, prior to said receiving a fault parameter configuration operation for a device fault template input, further comprising: and acquiring a preset equipment fault template based on the pre-constructed equipment fault index.
The equipment fault index is used for indicating the corresponding relation between the fault classification information and the equipment fault template. It should be noted that, in order to facilitate quick searching of various equipment fault templates in the chaotic platform, an index of equipment faults should be established. Illustratively, the index may include a failure major class, a failure sub-class, a failure attribute, a failure injection template address. The fault major class mainly comprises a system class and an application class; the fault subclasses are subclasses corresponding to the fault major classes, for example, the fault subclasses corresponding to the system class fault major classes include CPU, disk, network, memory and the like; the failure attributes include an access layer, an application layer, and a data layer. The fault attribute is mainly used for locating the fault position and the possible influence range, and is convenient for subsequent fault diagnosis and location. Illustratively, the index information may be as shown in Table three:
Figure BDA0004140140810000101
specifically, an equipment fault index is built in the chaotic platform template library, and a preset equipment fault template can be quickly obtained based on the equipment fault index.
According to the technical scheme of the embodiment of the invention, the fault script of the target system equipment is generated according to the fault drilling template; and transmitting the target system equipment fault script to the target system equipment so as to perform fault injection on the target system equipment based on the target system equipment fault script, thereby realizing automatic operation of switching the target system equipment without manual participation, improving the equipment switching efficiency and improving the equipment switching accuracy.
Example III
Fig. 3 is a schematic structural diagram of a switching exercise device for fault equipment according to a third embodiment of the present invention. As shown in fig. 3, the apparatus includes: the system comprises a fault drilling template generation module 301, a fault drilling template injection module 302, a fault device switching processing module 303 and a fault device switching processing module 304.
Wherein,,
the fault drilling template generation module 301 is configured to receive a fault parameter configuration operation input for a device fault template, and generate a fault drilling template based on the fault parameter configuration operation;
the fault drilling template injection module 302 is configured to perform fault injection on a target system device based on the fault drilling template in response to a device switching drilling instruction;
the fault equipment switching processing module 303 is configured to obtain fault alarm information corresponding to a target system device based on a fault detection indicator, and perform equipment switching processing on the target system device based on the fault alarm information;
and the fault equipment back-cut processing module 304 is configured to call a preset back-cut script to perform equipment back-cut processing on the target system equipment under the condition that the fault of the target system equipment is eliminated, so that the target system equipment performs normal operation.
According to the technical scheme, the fault drilling template is generated based on the fault parameter configuration operation by receiving the fault parameter configuration operation input for the equipment fault template; responding to a device switching exercise instruction, and performing fault injection on target system devices based on the fault exercise template; acquiring fault alarm information corresponding to target system equipment based on a fault detection index, and performing equipment switching processing on the target system equipment based on the fault alarm information; under the condition of eliminating faults of the target system equipment, a preset back-cut script is called to carry out equipment back-cut processing on the target system equipment so that the target system equipment can normally operate, the problem that the traditional fault switching exercise cannot effectively verify the capabilities of monitoring alarm, fault diagnosis or positioning and the like is solved, the equipment switching accuracy during accidental fault exercise can be effectively improved, and the equipment exercise switching duration and the data loss risk are reduced.
Optionally, the fault device switching processing module 303 includes: the fault handling information determining unit and the device switching processing unit. Wherein,,
a failure handling information determining unit configured to determine failure handling information corresponding to the failure alarm information based on a failure handling matrix, wherein the failure handling matrix includes at least a switching threshold condition corresponding to each type of equipment failure;
and the device switching processing unit is used for performing device switching processing on the target system device based on the fault handling information.
Alternatively, the fault handling information determination unit may be specifically configured to:
determining the equipment fault type based on the fault alarm information;
determining a switching threshold condition corresponding to the equipment fault based on the type of the equipment fault and a fault handling matrix;
and determining the fault handling information according to the fault alarm information and the switching threshold condition.
Alternatively, the device switching processing unit may be specifically configured to:
determining a switching type corresponding to the target system equipment based on the fault handling information;
and performing equipment switching processing on the target system equipment based on the switching type.
Optionally, the fault drill template injection module 302 includes:
the fault script generation subunit is used for generating a fault script of the target system equipment according to the fault drilling template;
and the fault script transmission subunit is used for transmitting the fault script of the target system equipment to the target system equipment so as to perform fault injection on the target system equipment based on the fault script of the target system equipment.
Optionally, the apparatus further comprises: and a device fault index construction module. Wherein,,
the equipment fault index construction module is used for: and acquiring a preset equipment fault template based on a pre-constructed equipment fault index, wherein the equipment fault index is used for indicating the corresponding relation between the fault classification information and the equipment fault template.
Optionally, the apparatus further comprises: the device checks the authentication module. Wherein,,
the equipment checking and verifying module is used for performing equipment checking and verifying processing on the switching equipment so as to enable the switching equipment to normally operate.
The fault equipment switching exercise device provided by the embodiment of the invention can execute the fault equipment switching exercise method provided by any embodiment of the invention, and has the corresponding functional modules and beneficial effects of the execution method.
Example IV
Fig. 4 shows a schematic diagram of the structure of an electronic device 10 that may be used to implement an embodiment of the invention. Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other appropriate computers. Electronic equipment may also represent various forms of mobile devices, such as personal digital processing, cellular telephones, smartphones, wearable devices (e.g., helmets, glasses, watches, etc.), and other similar computing devices. The components shown herein, their connections and relationships, and their functions, are meant to be exemplary only, and are not meant to limit implementations of the inventions described and/or claimed herein.
As shown in fig. 4, the electronic device 10 includes at least one processor 11, and a memory, such as a Read Only Memory (ROM) 12, a Random Access Memory (RAM) 13, etc., communicatively connected to the at least one processor 11, in which the memory stores a computer program executable by the at least one processor, and the processor 11 may perform various appropriate actions and processes according to the computer program stored in the Read Only Memory (ROM) 12 or the computer program loaded from the storage unit 18 into the Random Access Memory (RAM) 13. In the RAM 13, various programs and data required for the operation of the electronic device 10 may also be stored. The processor 11, the ROM 12 and the RAM 13 are connected to each other via a bus 14. An input/output (I/O) interface 15 is also connected to bus 14.
Various components in the electronic device 10 are connected to the I/O interface 15, including: an input unit 16 such as a keyboard, a mouse, etc.; an output unit 17 such as various types of displays, speakers, and the like; a storage unit 18 such as a magnetic disk, an optical disk, or the like; and a communication unit 19 such as a network card, modem, wireless communication transceiver, etc. The communication unit 19 allows the electronic device 10 to exchange information/data with other devices via a computer network, such as the internet, and/or various telecommunication networks.
The processor 11 may be a variety of general and/or special purpose processing components having processing and computing capabilities. Some examples of processor 11 include, but are not limited to, a Central Processing Unit (CPU), a Graphics Processing Unit (GPU), various specialized Artificial Intelligence (AI) computing chips, various processors running machine learning model algorithms, digital Signal Processors (DSPs), and any suitable processor, controller, microcontroller, etc. The processor 11 performs the various methods and processes described above, such as a method failsafe switching exercise.
In some embodiments, the method failsafe device switching maneuver may be implemented as a computer program tangibly embodied on a computer readable storage medium, such as storage unit 18. In some embodiments, part or all of the computer program may be loaded and/or installed onto the electronic device 10 via the ROM 12 and/or the communication unit 19. When the computer program is loaded into RAM 13 and executed by processor 11, one or more of the steps of the method of fault device switching exercise described above may be performed. Alternatively, in other embodiments, processor 11 may be configured to perform the method failsafe device switching exercise in any other suitable manner (e.g., by means of firmware).
Various implementations of the systems and techniques described here above may be implemented in digital electronic circuitry, integrated circuit systems, field Programmable Gate Arrays (FPGAs), application Specific Integrated Circuits (ASICs), application Specific Standard Products (ASSPs), systems On Chip (SOCs), load programmable logic devices (CPLDs), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may include: implemented in one or more computer programs, the one or more computer programs may be executed and/or interpreted on a programmable system including at least one programmable processor, which may be a special purpose or general-purpose programmable processor, that may receive data and instructions from, and transmit data and instructions to, a storage system, at least one input device, and at least one output device.
A computer program for carrying out methods of the present invention may be written in any combination of one or more programming languages. These computer programs may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus, such that the computer programs, when executed by the processor, cause the functions/acts specified in the flowchart and/or block diagram block or blocks to be implemented. The computer program may execute entirely on the machine, partly on the machine, as a stand-alone software package, partly on the machine and partly on a remote machine or entirely on the remote machine or server.
In the context of the present invention, a computer-readable storage medium may be a tangible medium that can contain, or store a computer program for use by or in connection with an instruction execution system, apparatus, or device. The computer readable storage medium may include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. Alternatively, the computer readable storage medium may be a machine readable signal medium. More specific examples of a machine-readable storage medium would include an electrical connection based on one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
To provide for interaction with a user, the systems and techniques described here can be implemented on an electronic device having: a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to a user; and a keyboard and a pointing device (e.g., a mouse or a trackball) through which a user can provide input to the electronic device. Other kinds of devices may also be used to provide for interaction with a user; for example, feedback provided to the user may be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and input from the user may be received in any form, including acoustic input, speech input, or tactile input.
The systems and techniques described here can be implemented in a computing system that includes a background component (e.g., as a data server), or that includes a middleware component (e.g., an application server), or that includes a front-end component (e.g., a user computer having a graphical user interface or a web browser through which a user can interact with an implementation of the systems and techniques described here), or any combination of such background, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include: local Area Networks (LANs), wide Area Networks (WANs), blockchain networks, and the internet.
The computing system may include clients and servers. The client and server are typically remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other. The server can be a cloud server, also called a cloud computing server or a cloud host, and is a host product in a cloud computing service system, so that the defects of high management difficulty and weak service expansibility in the traditional physical hosts and VPS service are overcome.
It should be appreciated that various forms of the flows shown above may be used to reorder, add, or delete steps. For example, the steps described in the present invention may be performed in parallel, sequentially, or in a different order, so long as the desired results of the technical solution of the present invention are achieved, and the present invention is not limited herein.
The above embodiments do not limit the scope of the present invention. It will be apparent to those skilled in the art that various modifications, combinations, sub-combinations and alternatives are possible, depending on design requirements and other factors. Any modifications, equivalent substitutions and improvements made within the spirit and principles of the present invention should be included in the scope of the present invention.

Claims (10)

1. A method of performing a failover exercise, comprising:
receiving a fault parameter configuration operation input for a device fault template, and generating a fault drilling template based on the fault parameter configuration operation;
responding to a device switching exercise instruction, and performing fault injection on target system devices based on the fault exercise template;
acquiring fault alarm information corresponding to target system equipment based on a fault detection index, and performing equipment switching processing on the target system equipment based on the fault alarm information;
and under the condition of eliminating the fault of the target system equipment, calling a preset back cut script to carry out equipment back cut processing on the target system equipment so as to enable the target system equipment to normally operate.
2. The method according to claim 1, wherein performing a device switching process on the target system device based on the fault alert information comprises:
determining fault handling information corresponding to the fault alarm information based on a fault handling matrix, wherein the fault handling matrix at least comprises switching threshold conditions corresponding to each type of equipment fault;
and performing device switching processing on the target system device based on the fault handling information.
3. The method of claim 2, wherein the determining, based on the fault handling matrix, fault handling information corresponding to the fault alert information comprises:
determining the equipment fault type based on the fault alarm information;
determining a switching threshold condition corresponding to the equipment fault based on the type of the equipment fault and a fault handling matrix;
and determining the fault handling information according to the fault alarm information and the switching threshold condition.
4. The method of claim 2, wherein performing a device switching process on the target system device based on the fault handling information comprises:
determining a switching type corresponding to the target system equipment based on the fault handling information;
and performing equipment switching processing on the target system equipment based on the switching type.
5. The method of claim 1, wherein the performing fault injection on the target system device based on the fault drill template comprises:
generating a target system equipment fault script according to the fault drilling template;
transmitting the target system equipment fault script to the target system equipment so as to perform fault injection on the target system equipment based on the target system equipment fault script.
6. The method of claim 5, further comprising, prior to said receiving a fault parameter configuration operation for a device fault template input:
and acquiring a preset equipment fault template based on a pre-constructed equipment fault index, wherein the equipment fault index is used for indicating the corresponding relation between the fault classification information and the equipment fault template.
7. The method of claim 1, further comprising, after said performing a device handoff process on said target system device:
and performing equipment checking and verifying processing on the switching equipment so as to enable the switching equipment to normally operate.
8. A fault device switching exercise apparatus, comprising:
the fault drilling template generation module is used for receiving fault parameter configuration operation input for the equipment fault template and generating a fault drilling template based on the fault parameter configuration operation;
the fault drilling template injection module is used for responding to the device switching drilling instruction and performing fault injection on the target system device based on the fault drilling template;
the fault equipment switching processing module is used for acquiring fault alarm information corresponding to target system equipment based on the fault detection index and performing equipment switching processing on the target system equipment based on the fault alarm information;
and the fault equipment back-cut processing module is used for calling a preset back-cut script to carry out equipment back-cut processing on the target system equipment under the condition of eliminating the fault of the target system equipment so as to enable the target system equipment to normally operate.
9. An electronic device, the electronic device comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein,,
the memory stores a computer program executable by the at least one processor to enable the at least one processor to perform the method of failover exercise of any of claims 1-7.
10. A computer readable storage medium storing computer instructions for causing a processor to perform the method of switching between malfunctioning devices as claimed in any of claims 1 to 7 when executed.
CN202310286985.8A 2023-03-22 2023-03-22 Fault equipment switching exercise method, device, equipment and storage medium Pending CN116089197A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310286985.8A CN116089197A (en) 2023-03-22 2023-03-22 Fault equipment switching exercise method, device, equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310286985.8A CN116089197A (en) 2023-03-22 2023-03-22 Fault equipment switching exercise method, device, equipment and storage medium

Publications (1)

Publication Number Publication Date
CN116089197A true CN116089197A (en) 2023-05-09

Family

ID=86201051

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310286985.8A Pending CN116089197A (en) 2023-03-22 2023-03-22 Fault equipment switching exercise method, device, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN116089197A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116302711A (en) * 2023-05-19 2023-06-23 山东捷瑞数字科技股份有限公司 Disaster recovery method and device based on cloud platform mirror image, electronic equipment and storage medium

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116302711A (en) * 2023-05-19 2023-06-23 山东捷瑞数字科技股份有限公司 Disaster recovery method and device based on cloud platform mirror image, electronic equipment and storage medium
CN116302711B (en) * 2023-05-19 2023-10-03 山东捷瑞数字科技股份有限公司 Disaster recovery method and device based on cloud platform mirror image, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
CN114844768B (en) Information analysis method and device and electronic equipment
CN116089231B (en) Fault alarm method and device, electronic equipment and storage medium
CN116089197A (en) Fault equipment switching exercise method, device, equipment and storage medium
CN116010220A (en) Alarm diagnosis method, device, equipment and storage medium
CN113656252B (en) Fault positioning method, device, electronic equipment and storage medium
CN114116288A (en) Fault processing method, device and computer program product
CN116467161A (en) Application testing method and device, electronic equipment and storage medium
CN116541224A (en) Performance test method, device, electronic equipment and readable storage medium
CN116226644A (en) Method and device for determining equipment fault type, electronic equipment and storage medium
CN114444087A (en) Unauthorized vulnerability detection method and device, electronic equipment and storage medium
CN114647531A (en) Failure solving method, failure solving system, electronic device, and storage medium
CN114095394A (en) Network node fault detection method and device, electronic equipment and storage medium
CN116112339B (en) Root cause alarm positioning method, device, equipment and medium
CN117827647A (en) Execution method, device, equipment and medium of test script
CN117632670A (en) Fault warning method and device for test environment, electronic equipment and storage medium
CN117714515A (en) Remote debugging method, device, equipment and storage medium
CN117609054A (en) Automatic test method, device, equipment and storage medium
CN116431499A (en) Automatic test method and device, electronic equipment and storage medium
CN114238075A (en) Method, device, equipment, medium and product for testing program component
CN116719719A (en) Test method, test device, electronic equipment and storage medium
CN116980320A (en) Website operation test method, device, equipment and medium
CN118277275A (en) Interface testing method and device, electronic equipment and storage medium
CN118131734A (en) Risk verification method, device and equipment for control loop and storage medium
CN118363585A (en) Tool component integration method, device, equipment and storage medium based on DevOps platform
CN118611267A (en) Distribution network automation terminal online rate monitoring method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination