CN109144829A - Fault processing method and device, computer equipment and storage medium - Google Patents

Fault processing method and device, computer equipment and storage medium Download PDF

Info

Publication number
CN109144829A
CN109144829A CN201811002316.9A CN201811002316A CN109144829A CN 109144829 A CN109144829 A CN 109144829A CN 201811002316 A CN201811002316 A CN 201811002316A CN 109144829 A CN109144829 A CN 109144829A
Authority
CN
China
Prior art keywords
fault warning
database
fault
troubleshooting
rule
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201811002316.9A
Other languages
Chinese (zh)
Other versions
CN109144829B (en
Inventor
冷迪
陈瑞
黄建华
庞宁
吕志宁
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Power Supply Co ltd
Original Assignee
Shenzhen Power Supply Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Power Supply Co ltd filed Critical Shenzhen Power Supply Co ltd
Priority to CN201811002316.9A priority Critical patent/CN109144829B/en
Publication of CN109144829A publication Critical patent/CN109144829A/en
Application granted granted Critical
Publication of CN109144829B publication Critical patent/CN109144829B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/32Monitoring with visual or acoustical indication of the functioning of the machine
    • G06F11/324Display of status information
    • G06F11/327Alarm or error message display

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The application relates to a fault processing method, a fault processing device, computer equipment and a storage medium. The method comprises the following steps: acquiring fault alarm information; carrying out convergence processing on the fault alarm information and converting the fault alarm information into a fault alarm problem; acquiring a corresponding fault processing rule from a database according to the fault alarm problem; and performing fault processing according to the fault processing rule. The method can avoid repeated receiving of the alarm information, save the fault processing time and the occupied storage resources, solve the problem of insufficient automation and automatically process the fault alarm information.

Description

Fault handling method, device, computer equipment and storage medium
Technical field
This application involves computer application field, more particularly to a kind of fault handling method, device, computer equipment and Computer storage medium.
Background technique
With the development of computer technology, there is O&M technology.Operation and maintenance are supervised to the state of service operation Control, convenient for find in time service operation exception and resource consumption the case where.When an error occurs, O&M engineer is to servicing out Existing any exception is handled in time, and the extension of problem is avoided even to stop to service as far as possible.And O&M engineer needs The prediction scheme that formulate processing extremely for all kinds of services, can manually perform prediction scheme and reach the mesh stopped loss when occurring so as to problem 's.
However, there is automation deficiency in current O&M mode.
Summary of the invention
Based on this, it is necessary to for the technical problem that automation is insufficient, provide one kind can reduce alarm windstorm and from Fault handling method, device, computer equipment and the computer storage medium of dynamic handling failure warning information.
A kind of fault handling method, this method comprises: obtaining fault warning information;The fault warning information is restrained Processing, and it is converted to fault warning problem;Corresponding troubleshooting rule is obtained from database according to the fault warning problem;It presses Troubleshooting is carried out according to the troubleshooting rule.
In one of the embodiments, before obtaining fault warning information, further includes: obtain the reconciliation of fault warning problem The certainly script of the fault warning problem;The script is split into subprocessing operation;The subprocessing is recombinated to operate to form troubleshooting Rule;The fault warning problem and the troubleshooting rule are stored in the database.
The fault warning information is subjected to convergence process in one of the embodiments, and is converted to fault warning problem, It include: that the same fault warning information that will occur within a preset time converges a fault warning information;By the fault warning Information is converted to corresponding fault warning problem.
Corresponding troubleshooting rule packet is obtained from database according to the fault warning problem in one of the embodiments, It includes: according to the fault warning problem in the fault warning problem matching database;When in the fault warning problem and the database Fault warning problem successful match when, call the troubleshooting in the corresponding database of fault warning problem in the database Rule.
This method in one of the embodiments, further include: when the failure in the fault warning problem and the database is accused Alert problem reports the fault warning problem when it fails to match;The troubleshooting rule of input is obtained according to the fault warning problem; It is stored in the fault warning problem is corresponding with the troubleshooting rule of the input in the database.
Corresponding troubleshooting rule is obtained from database according to the fault warning problem in one of the embodiments, Further include: when it fails to match for the fault warning problem in the fault warning problem and the database, identify that the fault warning is asked Topic obtains failure subproblem;Corresponding troubleshooting rule is matched according to the failure subproblem;The failure subproblem is corresponding Troubleshooting rule forms the corresponding troubleshooting rule of the fault warning problem.
The data in the database are stored using Redis mode in one of the embodiments,.
A kind of fault treating apparatus, which includes: acquisition module, for obtaining fault warning information;It should for basis Fault warning problem obtains corresponding troubleshooting rule from database;Conversion module, for carrying out the fault warning information Convergence process, and it is converted to fault warning problem;Processing module, for carrying out troubleshooting according to the troubleshooting rule.
A kind of computer equipment, including memory and processor, the memory are stored with computer program, the processing Device performs the steps of when executing the computer program obtains fault warning information;The fault warning information is restrained Processing, and it is converted to fault warning problem;Corresponding troubleshooting rule is obtained from database according to the fault warning problem;It presses Troubleshooting is carried out according to the troubleshooting rule.
A kind of computer readable storage medium, is stored thereon with computer program, and the computer program is held by processor It is performed the steps of when row and obtains fault warning information;The fault warning information is subjected to convergence process, and is converted to failure announcement Alert problem;Corresponding troubleshooting rule is obtained from database according to the fault warning problem;According to the troubleshooting rule into Row troubleshooting.
Above-mentioned fault handling method, device, computer equipment and storage medium, by obtaining fault warning information and to this Information carries out convergence process, can effectively reduce alarm windstorm, avoids repeating to receive warning information, saves fault handling time and account for Storage resource;Fault warning information is converted to fault warning problem, troubleshooting is obtained according to the fault warning problem Rule, and troubleshooting is carried out, it can solve the insufficient problem of automation, automatically process fault warning information.
Detailed description of the invention
Fig. 1 is the applied environment figure of fault handling method in one embodiment;
Fig. 2 is the flow diagram of fault handling method in one embodiment;
Fig. 3 is the flow diagram that troubleshooting rule generates in one embodiment;
Fig. 4 is the flow diagram of fault handling method in another embodiment;
Fig. 5 is the flow diagram of fault handling method in another embodiment;
Fig. 6 is the flow diagram of fault handling method in further embodiment;
Fig. 7 is the application scenario diagram of fault handling method in another embodiment;
Fig. 8 is the structural block diagram of fault treating apparatus in one embodiment;
Fig. 9 is the structural block diagram of fault treating apparatus in another embodiment;
Figure 10 is the internal structure chart of computer equipment in one embodiment.
Specific embodiment
It is with reference to the accompanying drawings and embodiments, right in order to which the objects, technical solutions and advantages of the application are more clearly understood The application is further elaborated.It should be appreciated that specific embodiment described herein is only used to explain the application, not For limiting the application.
Fault handling method provided by the present application can be applied in application environment as shown in Figure 1.Wherein, the first clothes Business device 102 is communicated with second server 104 by network by network.Wherein, first server 102 and second server 104 can be realized with the server cluster of the either multiple server compositions of independent server, and first server 102 It can be heterogeneous database system, which is the set of relevant multiple Database Systems, may be implemented to count According to shared and transparent access.Each component part of heterogeneous database has the autonomy of itself, realizes the same of data sharing When, each Database Systems still possess application characteristic, integrity control and the safety control of oneself.
In one embodiment, as shown in Fig. 2, providing a kind of fault handling method, it is applied in Fig. 1 in this way It is illustrated for second server 104, comprising the following steps:
Step 202, fault warning information is obtained.
Wherein, fault warning information refers to the fault warning information generated when first server 102 breaks down.The event Barrier warning information may include server delay machine, service abend, disk space is inadequate and server ping is obstructed etc..
Specifically, second server 104 obtains the generation when first server 102 breaks down from first server 102 Fault warning information.Second server 104 includes front-end receiver, which provides interface, for pulling automatically Or other warning information resources are received, the fault warning information in warning information resource is received, and fault warning information is pushed into event Hinder converter.
Wherein, which includes Zabbix, open-falcon, Naglos and CMDB (Configuration Management Database, configuration management database) etc., the warning information resource is for monitoring network system, terminal, data Library, service and process etc..
Zabbix is a kind of open source solution of enterprise-level for providing distributed system monitoring and network monitoring function, It can monitor various network parameters, guarantee the safe operation of server system.
Open-falcon is a enterprise-level, High Availabitity, expansible open source monitoring solution.
Naglos is the computer system and network monitor tools of a kind of open source, can effective monitoring Windows system, Linux The Host Status of system and Unix system.
Wherein, linux system is a set of class Unix operating system, be a multi-user end, multitask, support multithreading and The operating system of multi -CPU (Central Processing Unit, central processing unit).
Unix operating system is powerful multi-user end, a multiple task operating system, supports various processor framework, According to the classification of operating system, belong to time sharing operating system.
CMDB (Configuration Management Database, configuration management database) is used for storage and management The various configuration informations of equipment in enterprise architecture, it and all service supports and service delivery process are all closely linked to, and support this The operating of a little processes, the value for playing configuration information.
By from different warning information resources obtain fault warning information, can guarantee the consistency of fault warning information with And accuracy, the redundancy of data is reduced, does not need to deposit in the fault warning information in second server 104 differently Side reduces management cost.
Step 204, which is subjected to convergence process, and is converted to fault warning problem.
Wherein, convergence process refers to identical fault warning information converging a fault warning information.
Specifically, second server 104 converges the identical fault warning information obtained from first server 102 together One fault warning information, and it is converted to fault warning problem corresponding with fault warning information.
Step 206, corresponding troubleshooting rule is obtained from database according to the fault warning problem.
Wherein, troubleshooting rule refers to solving the problems, such as the scheme of the fault warning.
Specifically, second server 104 obtains stored solution failure according to fault warning problem from database The scheme of alarm problem, the database can be located in second server 104, can also be independently of second server 104.
Step 208, troubleshooting is carried out according to the troubleshooting rule.
Specifically, second server 104 is according to the troubleshooting rule obtained from database to the troubleshooting problem Carry out troubleshooting.
In the present embodiment, when front-end receiver receives " there is the case where application server delay machine in zabbix warning system, The fault warning information is converted " server ping by the fault warning information of the unreachable message of a large amount of ping ", failure converter Failure " calls the troubleshooting rule in database, executes " restarting server " script, carry out according to the troubleshooting rule Troubleshooting.
In the present embodiment, when second server 104 receives the fault warning problem that service abends, data are called Troubleshooting rule in library, executes " service of restarting " script, carries out troubleshooting according to the troubleshooting rule.
In the present embodiment, when second server 104 receives the fault warning problem of Insufficient disk space or performance issue When, then the troubleshooting rule in database is called, " clear up journal file and enter journey " script is executed, according to the troubleshooting Rule carries out troubleshooting.
In the present embodiment, after completing troubleshooting, second server 104 can be with Real-time Feedback troubleshooting result to use Family end facilitates and carries out secondary collation process.
It, can be effective by obtaining fault warning information and carrying out convergence process to the information in above-mentioned fault handling method Alarm windstorm is reduced, avoids repeating to receive warning information, saves the storage resource of fault handling time and occupancy;By fault warning Information is converted to fault warning problem, obtains troubleshooting rule according to the fault warning problem, and carry out troubleshooting, can solve Insufficient problem is certainly automated, fault warning information is automatically processed.
In one embodiment, as shown in figure 3, before obtaining fault warning information, the fault handling method further include:
Step 302, it obtains fault warning problem and solves the problems, such as the script of the fault warning.
Wherein, the script for solving the problems, such as the fault warning is to solve the fault warning solution to the problem.
Specifically, second server 104 is obtained that user terminal inputs or is accused from other terminals or the failure of system introducing Alert problem and the script for solving the problems, such as the fault warning.
Step 304, which is split into subprocessing operation.
Wherein, subprocessing operation refers to the processing step in the script, and one basis of the corresponding solution of the processing step is asked Topic and the smallest troubleshooting unit.
Specifically, this is solved the problems, such as that the script of fault warning splits into single subprocessing and operates by second server 104.
In the present embodiment, in O&M technical field, most of operation can be completed by perform script.Second server 104 regard the subprocessing operation of each script as atom, the operation of a large amount of subprocessing are stored in atom, the atom warehouse compartment In second server 104.
Step 306, the subprocessing is recombinated to operate to form troubleshooting rule.
Specifically, second server 104 solves the problems, such as the fault warning by the processing step formation recombinated in the script Scheme.
In the present embodiment, second server 104 can obtain subprocessing operation from atom, form script processing Step, by corresponding troubleshooting problem, the operation of subprocessing required for dispatching, layout is at troubleshooting rule.In entire mistake Cheng Zhong, many atoms can reuse, thus be directed to specific fault warning problem, can in atom will required for Atom is combined into troubleshooting rule.
Step 308, it is stored in the fault warning problem is corresponding with the troubleshooting rule in the database.
Specifically, second server 104 by the fault warning problem and solves the problems, such as that the troubleshooting of the fault warning is advised It then corresponds and is stored in the database.The database is also used to save configuration information, in execution journal of troubleshooting etc. Hold.
In the present embodiment, which is stored as the form of fault warning table but unlimited by second server 104 In this, while failure definition handles rule list, after the corresponding relationship for having configured fault warning problem and troubleshooting rule, by this Fault warning problem is corresponding with the troubleshooting rule to be stored in the database.
In above-mentioned fault handling method, by obtaining fault warning problem and solving the problems, such as this scheme, script is split It is operated at subprocessing to solve underlying issue, and subprocessing operation may be reused;Again by recombinating subprocessing operation Form different troubleshooting rules, it is possible to reduce occupied space, and solve the problems, such as more fault warnings, when saving is write Between and the response time, rewrite a script without each fault warning problem;By fault warning problem and failure Processing is regular to correspond to storage in the database, so that calling more rapidly, intuitively.
In one embodiment, which is subjected to convergence process, and is converted to fault warning problem, wrapped It includes: the same fault warning information occurred within a preset time is converged into a fault warning information;The fault warning is believed Breath is converted to corresponding fault warning problem.
Wherein, fault warning information refers to that preset time is 0 to 24 hour but not limited to this, with 5 minutes in the present embodiment As preset time.
Specifically, second server 104 will be in 5 minutes but to be not limited to 5 minutes identical failures within a preset time Warning information convergence is merged into same fault warning information, and by failure converter converted the fault warning information to and The corresponding fault warning problem of the fault warning information.
In the present embodiment, when preposition alarm device receive first server 102 transmission " warning system occur using clothes The fault warning information of business device delay machine situation, a large amount of unreachable message of ping " occurs several hundred alarms in 5 minutes, then makes At alarm windstorm.Actually the alarm windstorm is the same problem, by the convergence process to the warning information, then second is taken Business device 104 can receive only a fault warning information.The fault warning information is converted " server by failure converter Ping failure ", and calling from database should " server ping failure " corresponding fault warning rule such as " service of restarting The script of device " is to solve the problems, such as " the server ping failure ".
In the present embodiment, there are also fault category libraries in failure converter, comprising: can directly handle class, remind post-processing class With need manpower intervention handle class.Specifically, when the fault warning problem has stored in the database, then the fault warning problem is returned Class can directly be handled by entering;When the fault warning problem be more complex fault warning problem, cannot solve at once, then by the failure Alarm problem is included into prompting post-processing class;When the fault warning problem is novel fault warning problem and can not be solved immediately When, which, which is included into, needs manpower intervention to handle class.
In above-mentioned fault handling method, by the way that identical fault warning information is converged same fault warning information, Reduce alarm windstorm, avoids repeating to receive warning information, the storage resource of fault handling time and occupancy can be saved.
In one embodiment, obtaining corresponding troubleshooting rule from database according to the fault warning problem includes: According to the fault warning problem in the fault warning problem matching database;When the event in the fault warning problem and the database When hindering alarm problem successful match, the troubleshooting in the corresponding database of fault warning problem in the database is called to advise Then.
Specifically, second server 104 is matched from the fault warning problem table in database according to fault warning problem Fault warning problem calls the troubleshooting problem corresponding when successful match from the troubleshooting rule list in database Troubleshooting rule.
In above-mentioned fault handling method, by according to the troubleshooting problem in troubleshooting trouble-shoots database, and Troubleshooting rule is called, can achieve the effect that fault self-recovery, and failure response is rapid, reduce manually-operated fault Rate.
In one embodiment, as shown in figure 4, the fault handling method further include:
Step 402, when it fails to match for the fault warning problem in the fault warning problem and the database, the event is reported Hinder alarm problem.
Specifically, when the fault warning problem in second server 104 can not be with the fault warning problem in database Timing, second server 104 report the fault warning problem.
Step 404, the troubleshooting rule of input is obtained according to the fault warning problem.
Specifically, user terminal creates the corresponding troubleshooting rule of the fault warning problem according to the fault warning problem, And the troubleshooting rule is uploaded to second server 104, second server 104 obtains the troubleshooting of user terminal input Rule.
Step 406, it is stored in the fault warning problem is corresponding with the troubleshooting rule of the input in the database.
Specifically, which according to the form of fault warning problem table but is not limited to by second server 104 This storage corresponding with the troubleshooting rule of the input is in the database.
In above-mentioned fault handling method, by obtaining the troubleshooting rule of input, can effectively solve can not in database Fault warning problem and corresponding troubleshooting rule are stored in the database, expand data by the fault warning problem of solution The amount of storage in library, and database problem-solving ability can be reinforced.
In one embodiment, as shown in figure 5, obtaining corresponding troubleshooting from database according to the fault warning problem Rule, further includes:
Step 502, when it fails to match for the fault warning problem in the fault warning problem and the database, the event is identified Barrier alarm problem obtains failure subproblem.
Wherein, failure subproblem refers to the relatively subsection in the fault warning problem, i.e. the fault warning problem can be with It is split as multiple failure subproblems.
Specifically, when it fails to match for the fault warning problem in the fault warning problem and the database, i.e. database In when not storing the fault warning problem, second server 104 identifies the fault warning problem, obtains two or more failures Problem.
Step 504, corresponding troubleshooting rule is matched according to the failure subproblem.
Specifically, in the failure subproblem and database that second server 104 is split out according to the fault warning problem Fault warning problem is matched, and the corresponding troubleshooting rule of the failure subproblem is called if successful match.
Step 506, the corresponding troubleshooting rule of the failure subproblem is formed into the corresponding failure of fault warning problem Processing rule.
Specifically, second server 104 is by the corresponding troubleshooting rule of the failure subproblem according to failure subproblem point Corresponding sequence is regularly arranged neat by the corresponding troubleshooting of failure subproblem one by one when solution, obtains the fault warning problem Corresponding troubleshooting rule.
In above-mentioned fault handling method, asked by the way that fault warning problem is split as two or more corresponding failure Topic can refine the step of solving the problems, such as fault warning, while not have to write the corresponding fault warning of fault warning problem again Rule saves the time and reduces the resource of occupancy.
In one embodiment, the data in the database are stored using Redis mode.
Wherein, Redis is a storage system, and the data of update periodically can be written disk or modification is grasped by it Make the additional record file of write-in, and realizes master-slave synchronisation on this basis.
Specifically, it due to fault warning information and unstructured data, needs to store by unstructured storage engines, and And need cleaning and calculation processing at fault warning problem, then it needs to cache high speed processing, therefore Redis is used to store.
In one embodiment, which uses plug-in management.The corresponding failure of each warning information resource Alarm plug-in then when warning information resource changes, such as from Zabbix warning information resource becomes Open-falcon warning information Source then can carry out troubleshooting by the adaptor interface reconfigured in front-end receiver.Above-mentioned fault handling method leads to It crosses from a variety of warning information resources by the way of plug-in management, can be adapted to, enhances scalability.
In one embodiment, which further includes providing system management interface by control console, is led to Cross the configuration that the system management interface carries out system regulation and plug-in unit.
In one embodiment, as shown in fig. 6, providing a kind of fault handling method, comprising the following steps:
Step 602, it obtains fault warning problem and solves the problems, such as the script of the fault warning.
Specifically, second server 104 is obtained that user terminal inputs or is accused from other terminals or the failure of system introducing Alert problem and the script for solving the problems, such as the fault warning.
Step 604, which is split into subprocessing operation.
Specifically, this is solved the problems, such as that the script of fault warning splits into single subprocessing and operates by second server 104.
In the present embodiment, the subprocessing operation of each script is refined into atom by second server 104, at a large amount of son Reason operation is stored in atom, which is located in the database in second server 104.
Step 606, the subprocessing is recombinated to operate to form troubleshooting rule.
Specifically, second server 104 solves the problems, such as the fault warning by the processing step formation recombinated in the script Scheme.
In the present embodiment, second server 104 can obtain subprocessing operation from atom, form script processing Step, by corresponding troubleshooting problem, the operation of subprocessing required for dispatching, layout is at troubleshooting rule.In entire mistake Cheng Zhong, many atoms can reuse, thus be directed to specific fault warning problem, can in atom will required for Atom is combined into troubleshooting rule.
Step 608, it is stored in the fault warning problem is corresponding with the troubleshooting rule in the database.
Specifically, second server 104 by the fault warning problem and solves the problems, such as that the troubleshooting of the fault warning is advised It then corresponds and is stored in the database.The database is also used to save configuration information, in execution journal of troubleshooting etc. Hold.
In the present embodiment, which is stored as the form of fault warning table but unlimited by second server 104 In this, while failure definition handles rule list, after the corresponding relationship for having configured fault warning problem and troubleshooting rule, by this Fault warning problem is corresponding with the troubleshooting rule to be stored in the database.
Step 610, fault warning information is obtained from warning information resource.
Specifically, second server 104 obtains the generation when first server 102 breaks down from first server 102 Fault warning information.Second server 104 includes front-end receiver, which provides interface, automatic for receiving Other warning information resources are pulled or receive, which includes Zabbix, open-falcon, Naglos and CMDB etc., is somebody's turn to do Warning information resource can be used for monitoring network system, terminal, database, service and process etc..Second server 104 receives alarm letter Fault warning information in breath source, and fault warning information is pushed into failure converter.
Step 612, which is subjected to convergence process, and is converted to fault warning problem.
Specifically, second server 104 converges the identical fault warning information obtained from first server 102 together One fault warning information, and fault warning problem corresponding with fault warning information is converted to by failure converter.
In the present embodiment, when preposition alarm device receive first server 102 transmission " warning system occur using clothes The fault warning information of business device delay machine situation, a large amount of unreachable message of ping " occurs several hundred alarms in 5 minutes, then makes At alarm windstorm.Actually the alarm windstorm is the same problem, by the convergence process to the warning information, then second is taken Business device 104 can receive only a fault warning information.The fault warning information is converted " server by failure converter Ping failure ", and calling from database should " server ping failure " corresponding fault warning rule such as " service of restarting The script of device " is to solve the problems, such as " the server ping failure ".
In the present embodiment, there are also fault category libraries in failure converter, comprising: can directly handle class, remind post-processing class With need manpower intervention handle class.Specifically, when the fault warning problem has stored in the database, then the fault warning problem is returned Class can directly be handled by entering;When the fault warning problem be more complex fault warning problem, cannot solve at once, then by the failure Alarm problem is included into prompting post-processing class;When the fault warning problem is novel fault warning problem and can not be solved immediately When, which, which is included into, needs manpower intervention to handle class.
Step 614, corresponding troubleshooting rule is obtained from database according to the fault warning problem.
Specifically, second server 104 is matched from the fault warning problem table in database according to fault warning problem Fault warning problem calls the troubleshooting problem corresponding when successful match from the troubleshooting rule list in database Troubleshooting rule.
In the present embodiment, when the fault warning problem in second server 104 can not be with the fault warning problem in data When matching, second server 104 reports the fault warning problem.User terminal creates the fault warning according to the fault warning problem The corresponding troubleshooting rule of problem, and the troubleshooting rule is uploaded to second server 104, second server 104 obtains Take the user terminal input troubleshooting rule, and the fault warning problem according to fault warning problem table form but be not limited to This storage corresponding with the troubleshooting rule of the input is in the database.
In the present embodiment, when it fails to match for the fault warning problem in the fault warning problem and the database, that is, count When according to not storing the fault warning problem in library, second server 104 identifies the fault warning problem, obtains two or more events Hedge problem.The failure in failure subproblem and database that second server 104 is split out according to the fault warning problem is accused Alert problem is matched, and the corresponding troubleshooting rule of the failure subproblem is called if successful match.Second server 104 will Corresponding sequence one by one asks failure when the corresponding troubleshooting rule of the failure subproblem is according to the decomposition of failure subproblem It is regularly arranged neat to inscribe corresponding troubleshooting, obtains the corresponding troubleshooting rule of the fault warning problem.
Step 616, troubleshooting is carried out according to the troubleshooting rule.
Specifically, second server 104 is according to the troubleshooting rule obtained from database to the troubleshooting problem Carry out troubleshooting.
In the present embodiment, when front-end receiver receives " there is the case where application server delay machine in zabbix warning system, The fault warning information is converted " server ping by the fault warning information of the unreachable message of a large amount of ping ", failure converter Failure " calls the troubleshooting rule in database, executes " restarting server " script, carry out according to the troubleshooting rule Troubleshooting.
In the present embodiment, when second server 104 receives the fault warning problem that service abends, data are called Troubleshooting rule in library, executes " service of restarting " script, carries out troubleshooting according to the troubleshooting rule.
In the present embodiment, when second server 104 receives the fault warning problem of Insufficient disk space or performance issue When, then the troubleshooting rule in database is called, " clear up journal file and enter journey " script is executed, according to the troubleshooting Rule carries out troubleshooting.
In the present embodiment, after completing troubleshooting, second server 104 can be with Real-time Feedback troubleshooting result to use Family end facilitates and carries out secondary collation process.
In the present embodiment, which further includes providing system management interface by control console, by this The configuration of system management interface progress system regulation and plug-in unit.
In one embodiment, it is illustrated by taking the application scenarios that the fault handling method is applied in Fig. 7 as an example.Second Server 104 includes front-end receiver, failure interpreter, processor and database, further includes atom in the database.
Specifically, second server 104 pulls from front-end receiver or receives warning information resource, connects from warning information resource Fault warning information is received, which is pushed into failure converter in queue.Failure converter believes fault warning Breath converges same fault warning information, and is converted to fault warning problem corresponding with fault warning information.Processor root The fault warning problem is handled according to the troubleshooting rule obtained from database.There is atom in the database, it is former Word bank is for storing subprocessing operation.The database is also used to save template content, configuration information, execution journal etc..In addition, pipe Control console passes through the interface adjusting system and plug-in card configuration for providing system management interface.
In the present embodiment, when front-end receiver receives " there is the case where application server delay machine in zabbix warning system, The fault warning information is converted " server ping by the fault warning information of the unreachable message of a large amount of ping ", failure converter Failure ", processor call the troubleshooting rule in database, execute " restarting server " script, advise according to the troubleshooting Then carry out troubleshooting.
In the present embodiment, become Open-falcon alarm when warning information resource changes, such as from Zabbix alarm source Source then can carry out troubleshooting by the adaptor interface reconfigured in front-end receiver.
It should be understood that although each step in the flow chart of Fig. 2-6 is successively shown according to the instruction of arrow, These steps are not that the inevitable sequence according to arrow instruction successively executes.Unless expressly stating otherwise herein, these steps Execution there is no stringent sequences to limit, these steps can execute in other order.Moreover, at least one in Fig. 2-6 Part steps may include that perhaps these sub-steps of multiple stages or stage are not necessarily in synchronization to multiple sub-steps Completion is executed, but can be executed at different times, the execution sequence in these sub-steps or stage is also not necessarily successively It carries out, but can be at least part of the sub-step or stage of other steps or other steps in turn or alternately It executes.
In one embodiment, as shown in figure 8, providing a kind of fault treating apparatus, comprising: obtain module 802, conversion Module 804 and processing module 806, in which:
Module 802 is obtained, for obtaining fault warning information;It is also used to be obtained according to the fault warning problem from database Take corresponding troubleshooting rule.
Specifically, module 802 is obtained to be used to obtain the production when first server 102 breaks down from first server 102 Raw fault warning information pulls automatically or receives other warning information resources, receives the fault warning information in warning information resource, And fault warning information is pushed into failure converter.Wherein, the warning information resource include Zabbix, open-falcon, Naglos and CMDB etc., the warning information resource is for monitoring network system, terminal, database, service and process etc..
Specifically, acquisition module 802 is also used to obtain stored solution from database according to fault warning problem and is somebody's turn to do The scheme of fault warning problem.
Conversion module 804 for the fault warning information to be carried out convergence process, and is converted to fault warning problem.
Specifically, conversion module 804 is for converging the identical fault warning information obtained from first server 102 Same fault warning information, and it is converted to fault warning problem corresponding with fault warning information.
Processing module 806, for carrying out troubleshooting according to the troubleshooting rule.
Specifically, processing module 806 is for asking the troubleshooting according to the troubleshooting rule obtained from database Topic carries out troubleshooting.
In the present embodiment, when front-end receiver receives " there is the case where application server delay machine in zabbix warning system, The fault warning information is converted " server ping by the fault warning information of the unreachable message of a large amount of ping ", failure converter Failure ", processing module 806 are used to call the troubleshooting rule in database, " restarting server " script are executed, according to the event Barrier processing rule carries out troubleshooting.
In the present embodiment, when second server 104 receives the fault warning problem that service abends, processing module 806, for calling the troubleshooting rule in database, execute " service of restarting " script, carry out event according to the troubleshooting rule Barrier processing.
In the present embodiment, when second server 104 receives the fault warning problem of Insufficient disk space or performance issue When, then processing module 806 is used to call the troubleshooting rule in database, executes " clear up journal file and enter journey " foot This, carries out troubleshooting according to the troubleshooting rule.
In the present embodiment, after completing troubleshooting, processing module 806 can be used for Real-time Feedback troubleshooting result to use Family end facilitates and carries out secondary collation process.
In one embodiment, as shown in figure 9, the fault treating apparatus further includes splitting module 808, recombination module 810 It is executed with memory module 812, each intermodule and not in strict accordance with the sequence of Fig. 9.Wherein, the acquisition failure announcement of module 802 is being obtained Before alert problem information, it is also used to obtain fault warning problem and solves the problems, such as the script of the fault warning.
Specifically, obtain module 802 be used to obtain user terminal input or from other terminals or the failure of system introducing Alarm problem and the script for solving the problems, such as the fault warning.
Module 808 is split, for the script to be split into subprocessing operation.Specifically, module 808 is split to be used for the solution Certainly the script of fault warning problem splits into single subprocessing operation.In the present embodiment, splits module 808 and be also used to each foot This subprocessing operation is refined into atom, and the operation of a large amount of subprocessing is stored in atom.
Recombination module 810 operates to form troubleshooting rule for recombinating the subprocessing.Specifically, recombination module 810 is used The scheme for solving the problems, such as the fault warning is formed in the processing step by recombinating in the script.
In the present embodiment, recombination module 810 forms the step of script processing for obtaining subprocessing operation from atom Suddenly, by corresponding troubleshooting problem, the operation of subprocessing required for dispatching, layout is at troubleshooting rule.
Memory module 812, for being stored in the fault warning problem is corresponding with the troubleshooting rule in the database. Specifically, memory module 812 is used for the fault warning problem and solves the problems, such as the troubleshooting rule of the fault warning one by one Correspondence is stored in the database.Data in the database use Redis storage device.Specifically, since fault warning is believed Simultaneously unstructured data is ceased, needs to store by unstructured storage engines, and cleaning and calculation processing is needed to accuse at failure Alert problem is then needed to cache high speed processing, therefore is stored using Redis storage device.In the present embodiment, memory module 812 For the fault warning problem to be stored as to the form of fault warning table but not limited to this, while failure definition handles rule list, After the corresponding relationship for having configured fault warning problem and troubleshooting rule, by the fault warning problem and troubleshooting rule Correspondence is stored in the database.
In one embodiment, conversion module 804 is also used to the same fault warning information that will occur within a preset time Converge a fault warning information;The fault warning information is converted to corresponding fault warning problem.
Specifically, conversion module 804 is also used to that in i.e. 5 minutes within a preset time but 5 minutes identical will be not limited to The convergence of fault warning information is merged into same fault warning information, and is converted the fault warning information by failure converter For fault warning problem corresponding with the fault warning information.
In the present embodiment, when obtain module 802 receive first server 102 transmission " warning system occur application There are several hundred alarms, then in 5 minutes in the fault warning information of server delay machine situation, a large amount of unreachable message of ping " Cause alarm windstorm.Actually the alarm windstorm is the same problem, and conversion module 804 is also used to by a plurality of failure The convergence process of warning information is converted into a fault warning information.Conversion module 804 is also used to turn the fault warning information Turn to " server ping failure ".
In the present embodiment, conversion module 804 is also used to failure modes, which includes: after can directly handling class, prompting It handles class and manpower intervention is needed to handle class.Specifically, when the fault warning problem has stored in the database, then conversion module 804 are also used to for the fault warning problem being included into and can directly handle class;When the fault warning problem is more complex fault warning Problem cannot solve at once, then conversion module 804 is also used to for the fault warning problem being included into prompting post-processing class;When the event When barrier alarm problem is novel fault warning problem and can not solve immediately, conversion module 804 is also used to the fault warning Problem, which is included into, needs manpower intervention to handle class.
In one embodiment, module 802 is obtained to be also used to according to the failure in the fault warning problem matching database Alarm problem;When the fault warning problem successful match in the fault warning problem and the database, call in the database The corresponding database of fault warning problem in troubleshooting rule.
Specifically, module 802 is obtained to be also used to according to fault warning problem from the fault warning problem table in database Matching fault warning problem calls the troubleshooting problem when successful match from the troubleshooting rule list in database Corresponding troubleshooting rule list.
In one embodiment, module 802 is obtained to be also used to accuse when the failure in the fault warning problem and the database Alert problem reports the fault warning problem when it fails to match;The troubleshooting rule of input is obtained according to the fault warning problem; It is stored in the fault warning problem is corresponding with the troubleshooting rule of the input in the database.
Specifically, when fault warning problem can not be matched with the fault warning problem in database, module 802 is obtained also For reporting the fault warning problem, the corresponding troubleshooting rule of the fault warning problem of user terminal input is obtained, and will The corresponding storage of the troubleshooting rule is in the database.
In one embodiment, module 802 is obtained to be also used to accuse when the failure in the fault warning problem and the database Alert problem identifies that the fault warning problem obtains failure subproblem when it fails to match;It is matched according to the failure subproblem corresponding Troubleshooting rule;The corresponding troubleshooting rule of the failure subproblem is formed into the corresponding troubleshooting of fault warning problem Rule.
Specifically, when it fails to match for the fault warning problem in the fault warning problem and the database, i.e. database In when not storing the fault warning problem, obtain module 802 and be also used to identify the fault warning problem, obtain it is two or more therefore Hedge problem and the failure subproblem split out according to the fault warning problem and the fault warning problem progress in database Match, the corresponding troubleshooting rule of the failure subproblem is called if successful match.Module 802 is obtained to be also used to failure The corresponding troubleshooting rule of problem one by one advises the corresponding troubleshooting of the failure subproblem according to sequence corresponding when decomposing Then marshalling obtains the corresponding troubleshooting rule of the fault warning problem.
Specific about fault treating apparatus limits the restriction that may refer to above for fault handling method, herein not It repeats again.Modules in above-mentioned fault treating apparatus can be realized fully or partially through software, hardware and combinations thereof.On Stating each module can be embedded in the form of hardware or independently of in the processor in computer equipment, can also store in a software form In memory in computer equipment, the corresponding operation of the above modules is executed in order to which processor calls.
In one embodiment, a kind of computer equipment is provided, which can be server, internal junction Composition can be as shown in Figure 10.The computer equipment include by system bus connect processor, memory, network interface and Database.Wherein, the processor of the computer equipment is for providing calculating and control ability.The memory packet of the computer equipment Include non-volatile memory medium, built-in storage.The non-volatile memory medium is stored with operating system, computer program and data Library.The built-in storage provides environment for the operation of operating system and computer program in non-volatile memory medium.The calculating The database of machine equipment is for storing failure handling data.The network interface of the computer equipment is used to pass through with external terminal Network connection communication.To realize a kind of fault handling method when the computer program is executed by processor.
It will be understood by those skilled in the art that structure shown in Figure 10, only part relevant to application scheme The block diagram of structure, does not constitute the restriction for the computer equipment being applied thereon to application scheme, and specific computer is set Standby may include perhaps combining certain components or with different component layouts than more or fewer components as shown in the figure.
In one embodiment, a kind of computer equipment, including memory and processor are provided, is stored in memory Computer program, which performs the steps of when executing computer program obtains fault warning information;By the fault warning Information carries out convergence process, and is converted to fault warning problem;Corresponding event is obtained from database according to the fault warning problem Barrier processing rule;Troubleshooting is carried out according to the troubleshooting rule.
In one embodiment, it is also performed the steps of when processor executes computer program and obtains fault warning problem With the script for solving the problems, such as the fault warning;The script is split into subprocessing operation;The subprocessing is recombinated to operate to form failure Processing rule;The fault warning problem and the troubleshooting rule are stored in the database.
In one embodiment, it also performs the steps of and will go out within a preset time when processor executes computer program Existing same fault warning information converges a fault warning information;The fault warning information is converted to corresponding failure to accuse Alert problem.
In one embodiment, it also performs the steps of when processor executes computer program and is asked according to the fault warning Inscribe the fault warning problem in matching database;When the fault warning problem is matched into the fault warning problem in the database When function, the troubleshooting rule in the corresponding database of fault warning problem in the database is called.
In one embodiment, it also performs the steps of when processor executes computer program when the fault warning problem When it fails to match with the fault warning problem in the database, the fault warning problem is reported;It is obtained according to the fault warning problem Take the troubleshooting rule of input;The database is stored in by the fault warning problem is corresponding with the troubleshooting rule of the input In.
In one embodiment, it also performs the steps of when processor executes computer program when the fault warning problem When it fails to match with the fault warning problem in the database, identify that the fault warning problem obtains failure subproblem;According to this Failure subproblem matches corresponding troubleshooting rule;The corresponding troubleshooting rule of the failure subproblem is formed the failure to accuse The corresponding troubleshooting rule of police's problem.
In one embodiment, the data in the database are stored using Redis mode.
In one embodiment, a kind of computer readable storage medium is provided, computer program is stored thereon with, is calculated Machine program performs the steps of when being executed by processor carries out convergence process for the fault warning information, and is converted to failure announcement Alert problem;Corresponding troubleshooting rule is obtained from database according to the fault warning problem;According to the troubleshooting rule into Row troubleshooting.
In one embodiment, acquisition fault warning is also performed the steps of when computer program is executed by processor to ask Inscribe and solve the problems, such as the script of the fault warning;The script is split into subprocessing operation;The subprocessing is recombinated to operate to form event Barrier processing rule;The fault warning problem and the troubleshooting rule are stored in the database.
In one embodiment, also performing the steps of when computer program is executed by processor will within a preset time The same fault warning information of appearance converges a fault warning information;The fault warning information is converted to corresponding failure Alarm problem.
In one embodiment, it also performs the steps of when computer program is executed by processor according to the fault warning Fault warning problem in problem matching database;When the fault warning problem is matched with the fault warning problem in the database When success, the troubleshooting rule in the corresponding database of fault warning problem in the database is called.
In one embodiment, it is also performed the steps of when computer program is executed by processor when the fault warning is asked Topic with the database in fault warning problem it fails to match when, report the fault warning problem;According to the fault warning problem Obtain the troubleshooting rule of input;The data are stored in by the fault warning problem is corresponding with the troubleshooting rule of the input In library.
In one embodiment, it is also performed the steps of when computer program is executed by processor when the fault warning is asked Topic with the database in fault warning problem it fails to match when, identify that the fault warning problem obtains failure subproblem;According to The failure subproblem matches corresponding troubleshooting rule;The corresponding troubleshooting rule of the failure subproblem is formed into the failure The corresponding troubleshooting rule of alarm problem.
In one embodiment, the data in the database are stored using Redis mode.
Those of ordinary skill in the art will appreciate that realizing all or part of the process in above-described embodiment method, being can be with Relevant hardware is instructed to complete by computer program, the computer program can be stored in a non-volatile computer In read/write memory medium, the computer program is when being executed, it may include such as the process of the embodiment of above-mentioned each method.Wherein, To any reference of memory, storage, database or other media used in each embodiment provided herein, Including non-volatile and/or volatile memory.Nonvolatile memory may include read-only memory (ROM), programming ROM (PROM), electrically programmable ROM (EPROM), electrically erasable ROM (EEPROM) or flash memory.Volatile memory may include Random access memory (RAM) or external cache.By way of illustration and not limitation, RAM is available in many forms, Such as static state RAM (SRAM), dynamic ram (DRAM), synchronous dram (SDRAM), double data rate sdram (DDRSDRAM), enhancing Type SDRAM (ESDRAM), synchronization link (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic ram (DRDRAM) and memory bus dynamic ram (RDRAM) etc..
Each technical characteristic of above embodiments can be combined arbitrarily, for simplicity of description, not to above-described embodiment In each technical characteristic it is all possible combination be all described, as long as however, the combination of these technical characteristics be not present lance Shield all should be considered as described in this specification.
The several embodiments of the application above described embodiment only expresses, the description thereof is more specific and detailed, but simultaneously It cannot therefore be construed as limiting the scope of the patent.It should be pointed out that coming for those of ordinary skill in the art It says, without departing from the concept of this application, various modifications and improvements can be made, these belong to the protection of the application Range.Therefore, the scope of protection shall be subject to the appended claims for the application patent.

Claims (10)

1. a kind of fault handling method, which comprises
Obtain fault warning information;
The fault warning information is subjected to convergence process, and is converted to fault warning problem;
Corresponding troubleshooting rule is obtained from database according to the fault warning problem;
Troubleshooting is carried out according to the troubleshooting rule.
2. the method according to claim 1, wherein before the acquisition fault warning information, further includes:
It obtains fault warning problem and solves the problems, such as the script of the fault warning;
The script is split into subprocessing operation;
The subprocessing is recombinated to operate to form troubleshooting rule;
In the database by the fault warning problem and the corresponding storage of the troubleshooting rule.
3. the method according to claim 1, wherein it is described by the fault warning information carry out convergence process, And it is converted to fault warning problem, comprising:
The same fault warning information occurred within a preset time is converged into a fault warning information;
The fault warning information is converted to corresponding fault warning problem.
4. being corresponded to the method according to claim 1, wherein being obtained according to the fault warning problem from database Troubleshooting rule include:
According to the fault warning problem in the fault warning problem matching database;
When the fault warning problem successful match in the fault warning problem and the database, call in the database The corresponding database of fault warning problem in troubleshooting rule.
5. according to the method described in claim 4, it is characterized in that, the method also includes:
When it fails to match for the fault warning problem in the fault warning problem and the database, the fault warning is reported Problem;
The troubleshooting rule of input is obtained according to the fault warning problem;
In the database by the fault warning problem and the corresponding storage of the troubleshooting rule of the input.
6. according to the method described in claim 4, it is characterized in that, described obtain according to the fault warning problem from database Corresponding troubleshooting rule, further includes:
When it fails to match for the fault warning problem in the fault warning problem and the database, the fault warning is identified Problem obtains failure subproblem;
Corresponding troubleshooting rule is matched according to the failure subproblem;
The corresponding troubleshooting rule of the failure subproblem is formed into the corresponding troubleshooting rule of the fault warning problem.
7. the method according to claim 1, wherein the data in the database are stored using Redis mode.
8. a kind of fault treating apparatus, which is characterized in that described device includes:
Module is obtained, for obtaining fault warning information;It is corresponding for being obtained according to the fault warning problem from database Troubleshooting rule;
Conversion module for the fault warning information to be carried out convergence process, and is converted to fault warning problem;
Processing module, for carrying out troubleshooting according to the troubleshooting rule.
9. a kind of computer equipment, including memory and processor, the memory are stored with computer program, feature exists In the step of processor realizes any one of claims 1 to 7 the method when executing the computer program.
10. a kind of computer readable storage medium, is stored thereon with computer program, which is characterized in that the computer program The step of method described in any one of claims 1 to 7 is realized when being executed by processor.
CN201811002316.9A 2018-08-30 2018-08-30 Fault processing method and device, computer equipment and storage medium Active CN109144829B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811002316.9A CN109144829B (en) 2018-08-30 2018-08-30 Fault processing method and device, computer equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811002316.9A CN109144829B (en) 2018-08-30 2018-08-30 Fault processing method and device, computer equipment and storage medium

Publications (2)

Publication Number Publication Date
CN109144829A true CN109144829A (en) 2019-01-04
CN109144829B CN109144829B (en) 2022-03-22

Family

ID=64829393

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811002316.9A Active CN109144829B (en) 2018-08-30 2018-08-30 Fault processing method and device, computer equipment and storage medium

Country Status (1)

Country Link
CN (1) CN109144829B (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109757343A (en) * 2019-03-22 2019-05-17 厦门鑫天兴科技发展有限公司 A kind of irrigation method and system
CN109871305A (en) * 2019-01-18 2019-06-11 深圳壹账通智能科技有限公司 Processing method, device, computer equipment and the storage medium of warning information
CN110086682A (en) * 2019-05-22 2019-08-02 四川新网银行股份有限公司 Service link call relation view and failure root based on TCP are because of localization method
CN110728498A (en) * 2019-10-21 2020-01-24 北京百度网讯科技有限公司 Information interaction method and device
CN111769977A (en) * 2020-06-17 2020-10-13 广州嘉为科技有限公司 Processing method based on enterprise monitoring alarm event
CN111835760A (en) * 2020-07-10 2020-10-27 广州博冠信息科技有限公司 Alarm information processing method and device, computer storage medium and electronic equipment
CN112306794A (en) * 2020-09-28 2021-02-02 国网吉林省电力有限公司信息通信公司 Automatic processing method and device for typical fault scene of database
CN113434327A (en) * 2021-07-13 2021-09-24 上海浦东发展银行股份有限公司 Fault processing system, method, equipment and storage medium
CN114879636A (en) * 2022-05-15 2022-08-09 浙江工业大学 Chemical process fault isolation method based on fault interpreter

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101425924A (en) * 2008-06-12 2009-05-06 广东高新兴通信股份有限公司 Centralized monitoring system alarm data processing method
CN101605346A (en) * 2008-06-10 2009-12-16 中兴通讯股份有限公司 The fault restoration method and apparatus
CN105095523A (en) * 2015-09-28 2015-11-25 浪潮(北京)电子信息产业有限公司 Alarm event handling method and system
CN105262616A (en) * 2015-09-21 2016-01-20 浪潮集团有限公司 Failure repository-based automated failure processing system and method
CN107562556A (en) * 2017-08-14 2018-01-09 腾讯科技(深圳)有限公司 Restoration methods, recovery device and the storage medium of failure

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101605346A (en) * 2008-06-10 2009-12-16 中兴通讯股份有限公司 The fault restoration method and apparatus
CN101425924A (en) * 2008-06-12 2009-05-06 广东高新兴通信股份有限公司 Centralized monitoring system alarm data processing method
CN105262616A (en) * 2015-09-21 2016-01-20 浪潮集团有限公司 Failure repository-based automated failure processing system and method
CN105095523A (en) * 2015-09-28 2015-11-25 浪潮(北京)电子信息产业有限公司 Alarm event handling method and system
CN107562556A (en) * 2017-08-14 2018-01-09 腾讯科技(深圳)有限公司 Restoration methods, recovery device and the storage medium of failure

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109871305A (en) * 2019-01-18 2019-06-11 深圳壹账通智能科技有限公司 Processing method, device, computer equipment and the storage medium of warning information
CN109871305B (en) * 2019-01-18 2022-11-04 深圳壹账通智能科技有限公司 Alarm information processing method and device, computer equipment and storage medium
CN109757343A (en) * 2019-03-22 2019-05-17 厦门鑫天兴科技发展有限公司 A kind of irrigation method and system
CN110086682A (en) * 2019-05-22 2019-08-02 四川新网银行股份有限公司 Service link call relation view and failure root based on TCP are because of localization method
CN110086682B (en) * 2019-05-22 2022-06-24 四川新网银行股份有限公司 Service link calling relation view and fault root cause positioning method based on TCP
CN110728498A (en) * 2019-10-21 2020-01-24 北京百度网讯科技有限公司 Information interaction method and device
CN111769977A (en) * 2020-06-17 2020-10-13 广州嘉为科技有限公司 Processing method based on enterprise monitoring alarm event
CN111835760A (en) * 2020-07-10 2020-10-27 广州博冠信息科技有限公司 Alarm information processing method and device, computer storage medium and electronic equipment
CN111835760B (en) * 2020-07-10 2023-03-24 广州博冠信息科技有限公司 Alarm information processing method and device, computer storage medium and electronic equipment
CN112306794A (en) * 2020-09-28 2021-02-02 国网吉林省电力有限公司信息通信公司 Automatic processing method and device for typical fault scene of database
CN113434327A (en) * 2021-07-13 2021-09-24 上海浦东发展银行股份有限公司 Fault processing system, method, equipment and storage medium
CN114879636A (en) * 2022-05-15 2022-08-09 浙江工业大学 Chemical process fault isolation method based on fault interpreter

Also Published As

Publication number Publication date
CN109144829B (en) 2022-03-22

Similar Documents

Publication Publication Date Title
CN109144829A (en) Fault processing method and device, computer equipment and storage medium
CN110069572B (en) HIVE task scheduling method, device, equipment and storage medium based on big data platform
CN111274052A (en) Data distribution method, server, and computer-readable storage medium
CN109656782A (en) Visual scheduling monitoring method, device and server
CN110611707B (en) Task scheduling method and device
CN111026602A (en) Health inspection scheduling management method and device of cloud platform and electronic equipment
CN101719097A (en) Debug method and system of automation test script and automation application server
CN112035233A (en) Big data batch job task scheduling method and device
DE102008044018A1 (en) Method for determining a security level and security manager
CN114536339B (en) Control method and device for cooperative robot, cooperative robot and storage medium
CN110569113A (en) Method and system for scheduling distributed tasks and computer readable storage medium
CN110619014A (en) ETL-based data extraction method
CN111381940B (en) Distributed data processing method and device
CN113658351B (en) Method and device for producing product, electronic equipment and storage medium
CN111522630B (en) Method and system for executing planned tasks based on batch dispatching center
CN117407201A (en) Distributed cloud platform fault self-healing method and system
CN115687019A (en) Database cluster fault processing method, intelligent monitoring platform, equipment and medium
CN115686594A (en) Terminal automatic operation and maintenance management system and method
CN113010210A (en) Automatic operation and maintenance operation management method and system
CN116450305B (en) SOAR platform assembly execution method and device based on distributed task scheduling
CN112825044B (en) Task execution method, device and computer storage medium
CN117873691B (en) Data processing method, device, equipment and readable storage medium
CN117076508B (en) Method for supporting batch data processing by stream data processing system
CN116038687A (en) Mobile robot scheduling method and device, robot system and computer equipment
CN117896232A (en) Method and device for equipment operation and maintenance

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant