CN109144829A - Fault processing method and device, computer equipment and storage medium - Google Patents
Fault processing method and device, computer equipment and storage medium Download PDFInfo
- Publication number
- CN109144829A CN109144829A CN201811002316.9A CN201811002316A CN109144829A CN 109144829 A CN109144829 A CN 109144829A CN 201811002316 A CN201811002316 A CN 201811002316A CN 109144829 A CN109144829 A CN 109144829A
- Authority
- CN
- China
- Prior art keywords
- fault warning
- database
- fault
- troubleshooting
- rule
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000003860 storage Methods 0.000 title claims abstract description 29
- 238000003672 processing method Methods 0.000 title abstract 2
- 238000000034 method Methods 0.000 claims abstract description 73
- 238000012545 processing Methods 0.000 claims abstract description 40
- 230000008569 process Effects 0.000 claims abstract description 30
- 238000013024 troubleshooting Methods 0.000 claims description 194
- 238000004590 computer program Methods 0.000 claims description 27
- 238000006243 chemical reaction Methods 0.000 claims description 13
- 238000007726 management method Methods 0.000 description 13
- 238000010586 diagram Methods 0.000 description 9
- 230000004888 barrier function Effects 0.000 description 7
- 238000012544 monitoring process Methods 0.000 description 7
- 238000012805 post-processing Methods 0.000 description 5
- 230000006798 recombination Effects 0.000 description 4
- 238000005215 recombination Methods 0.000 description 4
- 230000005540 biological transmission Effects 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 238000004140 cleaning Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 238000004891 communication Methods 0.000 description 1
- 238000004883 computer application Methods 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
- 238000012384 transportation and delivery Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/32—Monitoring with visual or acoustical indication of the functioning of the machine
- G06F11/324—Display of status information
- G06F11/327—Alarm or error message display
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Quality & Reliability (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Debugging And Monitoring (AREA)
Abstract
The application relates to a fault processing method, a fault processing device, computer equipment and a storage medium. The method comprises the following steps: acquiring fault alarm information; carrying out convergence processing on the fault alarm information and converting the fault alarm information into a fault alarm problem; acquiring a corresponding fault processing rule from a database according to the fault alarm problem; and performing fault processing according to the fault processing rule. The method can avoid repeated receiving of the alarm information, save the fault processing time and the occupied storage resources, solve the problem of insufficient automation and automatically process the fault alarm information.
Description
Technical field
This application involves computer application field, more particularly to a kind of fault handling method, device, computer equipment and
Computer storage medium.
Background technique
With the development of computer technology, there is O&M technology.Operation and maintenance are supervised to the state of service operation
Control, convenient for find in time service operation exception and resource consumption the case where.When an error occurs, O&M engineer is to servicing out
Existing any exception is handled in time, and the extension of problem is avoided even to stop to service as far as possible.And O&M engineer needs
The prediction scheme that formulate processing extremely for all kinds of services, can manually perform prediction scheme and reach the mesh stopped loss when occurring so as to problem
's.
However, there is automation deficiency in current O&M mode.
Summary of the invention
Based on this, it is necessary to for the technical problem that automation is insufficient, provide one kind can reduce alarm windstorm and from
Fault handling method, device, computer equipment and the computer storage medium of dynamic handling failure warning information.
A kind of fault handling method, this method comprises: obtaining fault warning information;The fault warning information is restrained
Processing, and it is converted to fault warning problem;Corresponding troubleshooting rule is obtained from database according to the fault warning problem;It presses
Troubleshooting is carried out according to the troubleshooting rule.
In one of the embodiments, before obtaining fault warning information, further includes: obtain the reconciliation of fault warning problem
The certainly script of the fault warning problem;The script is split into subprocessing operation;The subprocessing is recombinated to operate to form troubleshooting
Rule;The fault warning problem and the troubleshooting rule are stored in the database.
The fault warning information is subjected to convergence process in one of the embodiments, and is converted to fault warning problem,
It include: that the same fault warning information that will occur within a preset time converges a fault warning information;By the fault warning
Information is converted to corresponding fault warning problem.
Corresponding troubleshooting rule packet is obtained from database according to the fault warning problem in one of the embodiments,
It includes: according to the fault warning problem in the fault warning problem matching database;When in the fault warning problem and the database
Fault warning problem successful match when, call the troubleshooting in the corresponding database of fault warning problem in the database
Rule.
This method in one of the embodiments, further include: when the failure in the fault warning problem and the database is accused
Alert problem reports the fault warning problem when it fails to match;The troubleshooting rule of input is obtained according to the fault warning problem;
It is stored in the fault warning problem is corresponding with the troubleshooting rule of the input in the database.
Corresponding troubleshooting rule is obtained from database according to the fault warning problem in one of the embodiments,
Further include: when it fails to match for the fault warning problem in the fault warning problem and the database, identify that the fault warning is asked
Topic obtains failure subproblem;Corresponding troubleshooting rule is matched according to the failure subproblem;The failure subproblem is corresponding
Troubleshooting rule forms the corresponding troubleshooting rule of the fault warning problem.
The data in the database are stored using Redis mode in one of the embodiments,.
A kind of fault treating apparatus, which includes: acquisition module, for obtaining fault warning information;It should for basis
Fault warning problem obtains corresponding troubleshooting rule from database;Conversion module, for carrying out the fault warning information
Convergence process, and it is converted to fault warning problem;Processing module, for carrying out troubleshooting according to the troubleshooting rule.
A kind of computer equipment, including memory and processor, the memory are stored with computer program, the processing
Device performs the steps of when executing the computer program obtains fault warning information;The fault warning information is restrained
Processing, and it is converted to fault warning problem;Corresponding troubleshooting rule is obtained from database according to the fault warning problem;It presses
Troubleshooting is carried out according to the troubleshooting rule.
A kind of computer readable storage medium, is stored thereon with computer program, and the computer program is held by processor
It is performed the steps of when row and obtains fault warning information;The fault warning information is subjected to convergence process, and is converted to failure announcement
Alert problem;Corresponding troubleshooting rule is obtained from database according to the fault warning problem;According to the troubleshooting rule into
Row troubleshooting.
Above-mentioned fault handling method, device, computer equipment and storage medium, by obtaining fault warning information and to this
Information carries out convergence process, can effectively reduce alarm windstorm, avoids repeating to receive warning information, saves fault handling time and account for
Storage resource;Fault warning information is converted to fault warning problem, troubleshooting is obtained according to the fault warning problem
Rule, and troubleshooting is carried out, it can solve the insufficient problem of automation, automatically process fault warning information.
Detailed description of the invention
Fig. 1 is the applied environment figure of fault handling method in one embodiment;
Fig. 2 is the flow diagram of fault handling method in one embodiment;
Fig. 3 is the flow diagram that troubleshooting rule generates in one embodiment;
Fig. 4 is the flow diagram of fault handling method in another embodiment;
Fig. 5 is the flow diagram of fault handling method in another embodiment;
Fig. 6 is the flow diagram of fault handling method in further embodiment;
Fig. 7 is the application scenario diagram of fault handling method in another embodiment;
Fig. 8 is the structural block diagram of fault treating apparatus in one embodiment;
Fig. 9 is the structural block diagram of fault treating apparatus in another embodiment;
Figure 10 is the internal structure chart of computer equipment in one embodiment.
Specific embodiment
It is with reference to the accompanying drawings and embodiments, right in order to which the objects, technical solutions and advantages of the application are more clearly understood
The application is further elaborated.It should be appreciated that specific embodiment described herein is only used to explain the application, not
For limiting the application.
Fault handling method provided by the present application can be applied in application environment as shown in Figure 1.Wherein, the first clothes
Business device 102 is communicated with second server 104 by network by network.Wherein, first server 102 and second server
104 can be realized with the server cluster of the either multiple server compositions of independent server, and first server 102
It can be heterogeneous database system, which is the set of relevant multiple Database Systems, may be implemented to count
According to shared and transparent access.Each component part of heterogeneous database has the autonomy of itself, realizes the same of data sharing
When, each Database Systems still possess application characteristic, integrity control and the safety control of oneself.
In one embodiment, as shown in Fig. 2, providing a kind of fault handling method, it is applied in Fig. 1 in this way
It is illustrated for second server 104, comprising the following steps:
Step 202, fault warning information is obtained.
Wherein, fault warning information refers to the fault warning information generated when first server 102 breaks down.The event
Barrier warning information may include server delay machine, service abend, disk space is inadequate and server ping is obstructed etc..
Specifically, second server 104 obtains the generation when first server 102 breaks down from first server 102
Fault warning information.Second server 104 includes front-end receiver, which provides interface, for pulling automatically
Or other warning information resources are received, the fault warning information in warning information resource is received, and fault warning information is pushed into event
Hinder converter.
Wherein, which includes Zabbix, open-falcon, Naglos and CMDB (Configuration
Management Database, configuration management database) etc., the warning information resource is for monitoring network system, terminal, data
Library, service and process etc..
Zabbix is a kind of open source solution of enterprise-level for providing distributed system monitoring and network monitoring function,
It can monitor various network parameters, guarantee the safe operation of server system.
Open-falcon is a enterprise-level, High Availabitity, expansible open source monitoring solution.
Naglos is the computer system and network monitor tools of a kind of open source, can effective monitoring Windows system, Linux
The Host Status of system and Unix system.
Wherein, linux system is a set of class Unix operating system, be a multi-user end, multitask, support multithreading and
The operating system of multi -CPU (Central Processing Unit, central processing unit).
Unix operating system is powerful multi-user end, a multiple task operating system, supports various processor framework,
According to the classification of operating system, belong to time sharing operating system.
CMDB (Configuration Management Database, configuration management database) is used for storage and management
The various configuration informations of equipment in enterprise architecture, it and all service supports and service delivery process are all closely linked to, and support this
The operating of a little processes, the value for playing configuration information.
By from different warning information resources obtain fault warning information, can guarantee the consistency of fault warning information with
And accuracy, the redundancy of data is reduced, does not need to deposit in the fault warning information in second server 104 differently
Side reduces management cost.
Step 204, which is subjected to convergence process, and is converted to fault warning problem.
Wherein, convergence process refers to identical fault warning information converging a fault warning information.
Specifically, second server 104 converges the identical fault warning information obtained from first server 102 together
One fault warning information, and it is converted to fault warning problem corresponding with fault warning information.
Step 206, corresponding troubleshooting rule is obtained from database according to the fault warning problem.
Wherein, troubleshooting rule refers to solving the problems, such as the scheme of the fault warning.
Specifically, second server 104 obtains stored solution failure according to fault warning problem from database
The scheme of alarm problem, the database can be located in second server 104, can also be independently of second server 104.
Step 208, troubleshooting is carried out according to the troubleshooting rule.
Specifically, second server 104 is according to the troubleshooting rule obtained from database to the troubleshooting problem
Carry out troubleshooting.
In the present embodiment, when front-end receiver receives " there is the case where application server delay machine in zabbix warning system,
The fault warning information is converted " server ping by the fault warning information of the unreachable message of a large amount of ping ", failure converter
Failure " calls the troubleshooting rule in database, executes " restarting server " script, carry out according to the troubleshooting rule
Troubleshooting.
In the present embodiment, when second server 104 receives the fault warning problem that service abends, data are called
Troubleshooting rule in library, executes " service of restarting " script, carries out troubleshooting according to the troubleshooting rule.
In the present embodiment, when second server 104 receives the fault warning problem of Insufficient disk space or performance issue
When, then the troubleshooting rule in database is called, " clear up journal file and enter journey " script is executed, according to the troubleshooting
Rule carries out troubleshooting.
In the present embodiment, after completing troubleshooting, second server 104 can be with Real-time Feedback troubleshooting result to use
Family end facilitates and carries out secondary collation process.
It, can be effective by obtaining fault warning information and carrying out convergence process to the information in above-mentioned fault handling method
Alarm windstorm is reduced, avoids repeating to receive warning information, saves the storage resource of fault handling time and occupancy;By fault warning
Information is converted to fault warning problem, obtains troubleshooting rule according to the fault warning problem, and carry out troubleshooting, can solve
Insufficient problem is certainly automated, fault warning information is automatically processed.
In one embodiment, as shown in figure 3, before obtaining fault warning information, the fault handling method further include:
Step 302, it obtains fault warning problem and solves the problems, such as the script of the fault warning.
Wherein, the script for solving the problems, such as the fault warning is to solve the fault warning solution to the problem.
Specifically, second server 104 is obtained that user terminal inputs or is accused from other terminals or the failure of system introducing
Alert problem and the script for solving the problems, such as the fault warning.
Step 304, which is split into subprocessing operation.
Wherein, subprocessing operation refers to the processing step in the script, and one basis of the corresponding solution of the processing step is asked
Topic and the smallest troubleshooting unit.
Specifically, this is solved the problems, such as that the script of fault warning splits into single subprocessing and operates by second server 104.
In the present embodiment, in O&M technical field, most of operation can be completed by perform script.Second server
104 regard the subprocessing operation of each script as atom, the operation of a large amount of subprocessing are stored in atom, the atom warehouse compartment
In second server 104.
Step 306, the subprocessing is recombinated to operate to form troubleshooting rule.
Specifically, second server 104 solves the problems, such as the fault warning by the processing step formation recombinated in the script
Scheme.
In the present embodiment, second server 104 can obtain subprocessing operation from atom, form script processing
Step, by corresponding troubleshooting problem, the operation of subprocessing required for dispatching, layout is at troubleshooting rule.In entire mistake
Cheng Zhong, many atoms can reuse, thus be directed to specific fault warning problem, can in atom will required for
Atom is combined into troubleshooting rule.
Step 308, it is stored in the fault warning problem is corresponding with the troubleshooting rule in the database.
Specifically, second server 104 by the fault warning problem and solves the problems, such as that the troubleshooting of the fault warning is advised
It then corresponds and is stored in the database.The database is also used to save configuration information, in execution journal of troubleshooting etc.
Hold.
In the present embodiment, which is stored as the form of fault warning table but unlimited by second server 104
In this, while failure definition handles rule list, after the corresponding relationship for having configured fault warning problem and troubleshooting rule, by this
Fault warning problem is corresponding with the troubleshooting rule to be stored in the database.
In above-mentioned fault handling method, by obtaining fault warning problem and solving the problems, such as this scheme, script is split
It is operated at subprocessing to solve underlying issue, and subprocessing operation may be reused;Again by recombinating subprocessing operation
Form different troubleshooting rules, it is possible to reduce occupied space, and solve the problems, such as more fault warnings, when saving is write
Between and the response time, rewrite a script without each fault warning problem;By fault warning problem and failure
Processing is regular to correspond to storage in the database, so that calling more rapidly, intuitively.
In one embodiment, which is subjected to convergence process, and is converted to fault warning problem, wrapped
It includes: the same fault warning information occurred within a preset time is converged into a fault warning information;The fault warning is believed
Breath is converted to corresponding fault warning problem.
Wherein, fault warning information refers to that preset time is 0 to 24 hour but not limited to this, with 5 minutes in the present embodiment
As preset time.
Specifically, second server 104 will be in 5 minutes but to be not limited to 5 minutes identical failures within a preset time
Warning information convergence is merged into same fault warning information, and by failure converter converted the fault warning information to and
The corresponding fault warning problem of the fault warning information.
In the present embodiment, when preposition alarm device receive first server 102 transmission " warning system occur using clothes
The fault warning information of business device delay machine situation, a large amount of unreachable message of ping " occurs several hundred alarms in 5 minutes, then makes
At alarm windstorm.Actually the alarm windstorm is the same problem, by the convergence process to the warning information, then second is taken
Business device 104 can receive only a fault warning information.The fault warning information is converted " server by failure converter
Ping failure ", and calling from database should " server ping failure " corresponding fault warning rule such as " service of restarting
The script of device " is to solve the problems, such as " the server ping failure ".
In the present embodiment, there are also fault category libraries in failure converter, comprising: can directly handle class, remind post-processing class
With need manpower intervention handle class.Specifically, when the fault warning problem has stored in the database, then the fault warning problem is returned
Class can directly be handled by entering;When the fault warning problem be more complex fault warning problem, cannot solve at once, then by the failure
Alarm problem is included into prompting post-processing class;When the fault warning problem is novel fault warning problem and can not be solved immediately
When, which, which is included into, needs manpower intervention to handle class.
In above-mentioned fault handling method, by the way that identical fault warning information is converged same fault warning information,
Reduce alarm windstorm, avoids repeating to receive warning information, the storage resource of fault handling time and occupancy can be saved.
In one embodiment, obtaining corresponding troubleshooting rule from database according to the fault warning problem includes:
According to the fault warning problem in the fault warning problem matching database;When the event in the fault warning problem and the database
When hindering alarm problem successful match, the troubleshooting in the corresponding database of fault warning problem in the database is called to advise
Then.
Specifically, second server 104 is matched from the fault warning problem table in database according to fault warning problem
Fault warning problem calls the troubleshooting problem corresponding when successful match from the troubleshooting rule list in database
Troubleshooting rule.
In above-mentioned fault handling method, by according to the troubleshooting problem in troubleshooting trouble-shoots database, and
Troubleshooting rule is called, can achieve the effect that fault self-recovery, and failure response is rapid, reduce manually-operated fault
Rate.
In one embodiment, as shown in figure 4, the fault handling method further include:
Step 402, when it fails to match for the fault warning problem in the fault warning problem and the database, the event is reported
Hinder alarm problem.
Specifically, when the fault warning problem in second server 104 can not be with the fault warning problem in database
Timing, second server 104 report the fault warning problem.
Step 404, the troubleshooting rule of input is obtained according to the fault warning problem.
Specifically, user terminal creates the corresponding troubleshooting rule of the fault warning problem according to the fault warning problem,
And the troubleshooting rule is uploaded to second server 104, second server 104 obtains the troubleshooting of user terminal input
Rule.
Step 406, it is stored in the fault warning problem is corresponding with the troubleshooting rule of the input in the database.
Specifically, which according to the form of fault warning problem table but is not limited to by second server 104
This storage corresponding with the troubleshooting rule of the input is in the database.
In above-mentioned fault handling method, by obtaining the troubleshooting rule of input, can effectively solve can not in database
Fault warning problem and corresponding troubleshooting rule are stored in the database, expand data by the fault warning problem of solution
The amount of storage in library, and database problem-solving ability can be reinforced.
In one embodiment, as shown in figure 5, obtaining corresponding troubleshooting from database according to the fault warning problem
Rule, further includes:
Step 502, when it fails to match for the fault warning problem in the fault warning problem and the database, the event is identified
Barrier alarm problem obtains failure subproblem.
Wherein, failure subproblem refers to the relatively subsection in the fault warning problem, i.e. the fault warning problem can be with
It is split as multiple failure subproblems.
Specifically, when it fails to match for the fault warning problem in the fault warning problem and the database, i.e. database
In when not storing the fault warning problem, second server 104 identifies the fault warning problem, obtains two or more failures
Problem.
Step 504, corresponding troubleshooting rule is matched according to the failure subproblem.
Specifically, in the failure subproblem and database that second server 104 is split out according to the fault warning problem
Fault warning problem is matched, and the corresponding troubleshooting rule of the failure subproblem is called if successful match.
Step 506, the corresponding troubleshooting rule of the failure subproblem is formed into the corresponding failure of fault warning problem
Processing rule.
Specifically, second server 104 is by the corresponding troubleshooting rule of the failure subproblem according to failure subproblem point
Corresponding sequence is regularly arranged neat by the corresponding troubleshooting of failure subproblem one by one when solution, obtains the fault warning problem
Corresponding troubleshooting rule.
In above-mentioned fault handling method, asked by the way that fault warning problem is split as two or more corresponding failure
Topic can refine the step of solving the problems, such as fault warning, while not have to write the corresponding fault warning of fault warning problem again
Rule saves the time and reduces the resource of occupancy.
In one embodiment, the data in the database are stored using Redis mode.
Wherein, Redis is a storage system, and the data of update periodically can be written disk or modification is grasped by it
Make the additional record file of write-in, and realizes master-slave synchronisation on this basis.
Specifically, it due to fault warning information and unstructured data, needs to store by unstructured storage engines, and
And need cleaning and calculation processing at fault warning problem, then it needs to cache high speed processing, therefore Redis is used to store.
In one embodiment, which uses plug-in management.The corresponding failure of each warning information resource
Alarm plug-in then when warning information resource changes, such as from Zabbix warning information resource becomes Open-falcon warning information
Source then can carry out troubleshooting by the adaptor interface reconfigured in front-end receiver.Above-mentioned fault handling method leads to
It crosses from a variety of warning information resources by the way of plug-in management, can be adapted to, enhances scalability.
In one embodiment, which further includes providing system management interface by control console, is led to
Cross the configuration that the system management interface carries out system regulation and plug-in unit.
In one embodiment, as shown in fig. 6, providing a kind of fault handling method, comprising the following steps:
Step 602, it obtains fault warning problem and solves the problems, such as the script of the fault warning.
Specifically, second server 104 is obtained that user terminal inputs or is accused from other terminals or the failure of system introducing
Alert problem and the script for solving the problems, such as the fault warning.
Step 604, which is split into subprocessing operation.
Specifically, this is solved the problems, such as that the script of fault warning splits into single subprocessing and operates by second server 104.
In the present embodiment, the subprocessing operation of each script is refined into atom by second server 104, at a large amount of son
Reason operation is stored in atom, which is located in the database in second server 104.
Step 606, the subprocessing is recombinated to operate to form troubleshooting rule.
Specifically, second server 104 solves the problems, such as the fault warning by the processing step formation recombinated in the script
Scheme.
In the present embodiment, second server 104 can obtain subprocessing operation from atom, form script processing
Step, by corresponding troubleshooting problem, the operation of subprocessing required for dispatching, layout is at troubleshooting rule.In entire mistake
Cheng Zhong, many atoms can reuse, thus be directed to specific fault warning problem, can in atom will required for
Atom is combined into troubleshooting rule.
Step 608, it is stored in the fault warning problem is corresponding with the troubleshooting rule in the database.
Specifically, second server 104 by the fault warning problem and solves the problems, such as that the troubleshooting of the fault warning is advised
It then corresponds and is stored in the database.The database is also used to save configuration information, in execution journal of troubleshooting etc.
Hold.
In the present embodiment, which is stored as the form of fault warning table but unlimited by second server 104
In this, while failure definition handles rule list, after the corresponding relationship for having configured fault warning problem and troubleshooting rule, by this
Fault warning problem is corresponding with the troubleshooting rule to be stored in the database.
Step 610, fault warning information is obtained from warning information resource.
Specifically, second server 104 obtains the generation when first server 102 breaks down from first server 102
Fault warning information.Second server 104 includes front-end receiver, which provides interface, automatic for receiving
Other warning information resources are pulled or receive, which includes Zabbix, open-falcon, Naglos and CMDB etc., is somebody's turn to do
Warning information resource can be used for monitoring network system, terminal, database, service and process etc..Second server 104 receives alarm letter
Fault warning information in breath source, and fault warning information is pushed into failure converter.
Step 612, which is subjected to convergence process, and is converted to fault warning problem.
Specifically, second server 104 converges the identical fault warning information obtained from first server 102 together
One fault warning information, and fault warning problem corresponding with fault warning information is converted to by failure converter.
In the present embodiment, when preposition alarm device receive first server 102 transmission " warning system occur using clothes
The fault warning information of business device delay machine situation, a large amount of unreachable message of ping " occurs several hundred alarms in 5 minutes, then makes
At alarm windstorm.Actually the alarm windstorm is the same problem, by the convergence process to the warning information, then second is taken
Business device 104 can receive only a fault warning information.The fault warning information is converted " server by failure converter
Ping failure ", and calling from database should " server ping failure " corresponding fault warning rule such as " service of restarting
The script of device " is to solve the problems, such as " the server ping failure ".
In the present embodiment, there are also fault category libraries in failure converter, comprising: can directly handle class, remind post-processing class
With need manpower intervention handle class.Specifically, when the fault warning problem has stored in the database, then the fault warning problem is returned
Class can directly be handled by entering;When the fault warning problem be more complex fault warning problem, cannot solve at once, then by the failure
Alarm problem is included into prompting post-processing class;When the fault warning problem is novel fault warning problem and can not be solved immediately
When, which, which is included into, needs manpower intervention to handle class.
Step 614, corresponding troubleshooting rule is obtained from database according to the fault warning problem.
Specifically, second server 104 is matched from the fault warning problem table in database according to fault warning problem
Fault warning problem calls the troubleshooting problem corresponding when successful match from the troubleshooting rule list in database
Troubleshooting rule.
In the present embodiment, when the fault warning problem in second server 104 can not be with the fault warning problem in data
When matching, second server 104 reports the fault warning problem.User terminal creates the fault warning according to the fault warning problem
The corresponding troubleshooting rule of problem, and the troubleshooting rule is uploaded to second server 104, second server 104 obtains
Take the user terminal input troubleshooting rule, and the fault warning problem according to fault warning problem table form but be not limited to
This storage corresponding with the troubleshooting rule of the input is in the database.
In the present embodiment, when it fails to match for the fault warning problem in the fault warning problem and the database, that is, count
When according to not storing the fault warning problem in library, second server 104 identifies the fault warning problem, obtains two or more events
Hedge problem.The failure in failure subproblem and database that second server 104 is split out according to the fault warning problem is accused
Alert problem is matched, and the corresponding troubleshooting rule of the failure subproblem is called if successful match.Second server 104 will
Corresponding sequence one by one asks failure when the corresponding troubleshooting rule of the failure subproblem is according to the decomposition of failure subproblem
It is regularly arranged neat to inscribe corresponding troubleshooting, obtains the corresponding troubleshooting rule of the fault warning problem.
Step 616, troubleshooting is carried out according to the troubleshooting rule.
Specifically, second server 104 is according to the troubleshooting rule obtained from database to the troubleshooting problem
Carry out troubleshooting.
In the present embodiment, when front-end receiver receives " there is the case where application server delay machine in zabbix warning system,
The fault warning information is converted " server ping by the fault warning information of the unreachable message of a large amount of ping ", failure converter
Failure " calls the troubleshooting rule in database, executes " restarting server " script, carry out according to the troubleshooting rule
Troubleshooting.
In the present embodiment, when second server 104 receives the fault warning problem that service abends, data are called
Troubleshooting rule in library, executes " service of restarting " script, carries out troubleshooting according to the troubleshooting rule.
In the present embodiment, when second server 104 receives the fault warning problem of Insufficient disk space or performance issue
When, then the troubleshooting rule in database is called, " clear up journal file and enter journey " script is executed, according to the troubleshooting
Rule carries out troubleshooting.
In the present embodiment, after completing troubleshooting, second server 104 can be with Real-time Feedback troubleshooting result to use
Family end facilitates and carries out secondary collation process.
In the present embodiment, which further includes providing system management interface by control console, by this
The configuration of system management interface progress system regulation and plug-in unit.
In one embodiment, it is illustrated by taking the application scenarios that the fault handling method is applied in Fig. 7 as an example.Second
Server 104 includes front-end receiver, failure interpreter, processor and database, further includes atom in the database.
Specifically, second server 104 pulls from front-end receiver or receives warning information resource, connects from warning information resource
Fault warning information is received, which is pushed into failure converter in queue.Failure converter believes fault warning
Breath converges same fault warning information, and is converted to fault warning problem corresponding with fault warning information.Processor root
The fault warning problem is handled according to the troubleshooting rule obtained from database.There is atom in the database, it is former
Word bank is for storing subprocessing operation.The database is also used to save template content, configuration information, execution journal etc..In addition, pipe
Control console passes through the interface adjusting system and plug-in card configuration for providing system management interface.
In the present embodiment, when front-end receiver receives " there is the case where application server delay machine in zabbix warning system,
The fault warning information is converted " server ping by the fault warning information of the unreachable message of a large amount of ping ", failure converter
Failure ", processor call the troubleshooting rule in database, execute " restarting server " script, advise according to the troubleshooting
Then carry out troubleshooting.
In the present embodiment, become Open-falcon alarm when warning information resource changes, such as from Zabbix alarm source
Source then can carry out troubleshooting by the adaptor interface reconfigured in front-end receiver.
It should be understood that although each step in the flow chart of Fig. 2-6 is successively shown according to the instruction of arrow,
These steps are not that the inevitable sequence according to arrow instruction successively executes.Unless expressly stating otherwise herein, these steps
Execution there is no stringent sequences to limit, these steps can execute in other order.Moreover, at least one in Fig. 2-6
Part steps may include that perhaps these sub-steps of multiple stages or stage are not necessarily in synchronization to multiple sub-steps
Completion is executed, but can be executed at different times, the execution sequence in these sub-steps or stage is also not necessarily successively
It carries out, but can be at least part of the sub-step or stage of other steps or other steps in turn or alternately
It executes.
In one embodiment, as shown in figure 8, providing a kind of fault treating apparatus, comprising: obtain module 802, conversion
Module 804 and processing module 806, in which:
Module 802 is obtained, for obtaining fault warning information;It is also used to be obtained according to the fault warning problem from database
Take corresponding troubleshooting rule.
Specifically, module 802 is obtained to be used to obtain the production when first server 102 breaks down from first server 102
Raw fault warning information pulls automatically or receives other warning information resources, receives the fault warning information in warning information resource,
And fault warning information is pushed into failure converter.Wherein, the warning information resource include Zabbix, open-falcon,
Naglos and CMDB etc., the warning information resource is for monitoring network system, terminal, database, service and process etc..
Specifically, acquisition module 802 is also used to obtain stored solution from database according to fault warning problem and is somebody's turn to do
The scheme of fault warning problem.
Conversion module 804 for the fault warning information to be carried out convergence process, and is converted to fault warning problem.
Specifically, conversion module 804 is for converging the identical fault warning information obtained from first server 102
Same fault warning information, and it is converted to fault warning problem corresponding with fault warning information.
Processing module 806, for carrying out troubleshooting according to the troubleshooting rule.
Specifically, processing module 806 is for asking the troubleshooting according to the troubleshooting rule obtained from database
Topic carries out troubleshooting.
In the present embodiment, when front-end receiver receives " there is the case where application server delay machine in zabbix warning system,
The fault warning information is converted " server ping by the fault warning information of the unreachable message of a large amount of ping ", failure converter
Failure ", processing module 806 are used to call the troubleshooting rule in database, " restarting server " script are executed, according to the event
Barrier processing rule carries out troubleshooting.
In the present embodiment, when second server 104 receives the fault warning problem that service abends, processing module
806, for calling the troubleshooting rule in database, execute " service of restarting " script, carry out event according to the troubleshooting rule
Barrier processing.
In the present embodiment, when second server 104 receives the fault warning problem of Insufficient disk space or performance issue
When, then processing module 806 is used to call the troubleshooting rule in database, executes " clear up journal file and enter journey " foot
This, carries out troubleshooting according to the troubleshooting rule.
In the present embodiment, after completing troubleshooting, processing module 806 can be used for Real-time Feedback troubleshooting result to use
Family end facilitates and carries out secondary collation process.
In one embodiment, as shown in figure 9, the fault treating apparatus further includes splitting module 808, recombination module 810
It is executed with memory module 812, each intermodule and not in strict accordance with the sequence of Fig. 9.Wherein, the acquisition failure announcement of module 802 is being obtained
Before alert problem information, it is also used to obtain fault warning problem and solves the problems, such as the script of the fault warning.
Specifically, obtain module 802 be used to obtain user terminal input or from other terminals or the failure of system introducing
Alarm problem and the script for solving the problems, such as the fault warning.
Module 808 is split, for the script to be split into subprocessing operation.Specifically, module 808 is split to be used for the solution
Certainly the script of fault warning problem splits into single subprocessing operation.In the present embodiment, splits module 808 and be also used to each foot
This subprocessing operation is refined into atom, and the operation of a large amount of subprocessing is stored in atom.
Recombination module 810 operates to form troubleshooting rule for recombinating the subprocessing.Specifically, recombination module 810 is used
The scheme for solving the problems, such as the fault warning is formed in the processing step by recombinating in the script.
In the present embodiment, recombination module 810 forms the step of script processing for obtaining subprocessing operation from atom
Suddenly, by corresponding troubleshooting problem, the operation of subprocessing required for dispatching, layout is at troubleshooting rule.
Memory module 812, for being stored in the fault warning problem is corresponding with the troubleshooting rule in the database.
Specifically, memory module 812 is used for the fault warning problem and solves the problems, such as the troubleshooting rule of the fault warning one by one
Correspondence is stored in the database.Data in the database use Redis storage device.Specifically, since fault warning is believed
Simultaneously unstructured data is ceased, needs to store by unstructured storage engines, and cleaning and calculation processing is needed to accuse at failure
Alert problem is then needed to cache high speed processing, therefore is stored using Redis storage device.In the present embodiment, memory module 812
For the fault warning problem to be stored as to the form of fault warning table but not limited to this, while failure definition handles rule list,
After the corresponding relationship for having configured fault warning problem and troubleshooting rule, by the fault warning problem and troubleshooting rule
Correspondence is stored in the database.
In one embodiment, conversion module 804 is also used to the same fault warning information that will occur within a preset time
Converge a fault warning information;The fault warning information is converted to corresponding fault warning problem.
Specifically, conversion module 804 is also used to that in i.e. 5 minutes within a preset time but 5 minutes identical will be not limited to
The convergence of fault warning information is merged into same fault warning information, and is converted the fault warning information by failure converter
For fault warning problem corresponding with the fault warning information.
In the present embodiment, when obtain module 802 receive first server 102 transmission " warning system occur application
There are several hundred alarms, then in 5 minutes in the fault warning information of server delay machine situation, a large amount of unreachable message of ping "
Cause alarm windstorm.Actually the alarm windstorm is the same problem, and conversion module 804 is also used to by a plurality of failure
The convergence process of warning information is converted into a fault warning information.Conversion module 804 is also used to turn the fault warning information
Turn to " server ping failure ".
In the present embodiment, conversion module 804 is also used to failure modes, which includes: after can directly handling class, prompting
It handles class and manpower intervention is needed to handle class.Specifically, when the fault warning problem has stored in the database, then conversion module
804 are also used to for the fault warning problem being included into and can directly handle class;When the fault warning problem is more complex fault warning
Problem cannot solve at once, then conversion module 804 is also used to for the fault warning problem being included into prompting post-processing class;When the event
When barrier alarm problem is novel fault warning problem and can not solve immediately, conversion module 804 is also used to the fault warning
Problem, which is included into, needs manpower intervention to handle class.
In one embodiment, module 802 is obtained to be also used to according to the failure in the fault warning problem matching database
Alarm problem;When the fault warning problem successful match in the fault warning problem and the database, call in the database
The corresponding database of fault warning problem in troubleshooting rule.
Specifically, module 802 is obtained to be also used to according to fault warning problem from the fault warning problem table in database
Matching fault warning problem calls the troubleshooting problem when successful match from the troubleshooting rule list in database
Corresponding troubleshooting rule list.
In one embodiment, module 802 is obtained to be also used to accuse when the failure in the fault warning problem and the database
Alert problem reports the fault warning problem when it fails to match;The troubleshooting rule of input is obtained according to the fault warning problem;
It is stored in the fault warning problem is corresponding with the troubleshooting rule of the input in the database.
Specifically, when fault warning problem can not be matched with the fault warning problem in database, module 802 is obtained also
For reporting the fault warning problem, the corresponding troubleshooting rule of the fault warning problem of user terminal input is obtained, and will
The corresponding storage of the troubleshooting rule is in the database.
In one embodiment, module 802 is obtained to be also used to accuse when the failure in the fault warning problem and the database
Alert problem identifies that the fault warning problem obtains failure subproblem when it fails to match;It is matched according to the failure subproblem corresponding
Troubleshooting rule;The corresponding troubleshooting rule of the failure subproblem is formed into the corresponding troubleshooting of fault warning problem
Rule.
Specifically, when it fails to match for the fault warning problem in the fault warning problem and the database, i.e. database
In when not storing the fault warning problem, obtain module 802 and be also used to identify the fault warning problem, obtain it is two or more therefore
Hedge problem and the failure subproblem split out according to the fault warning problem and the fault warning problem progress in database
Match, the corresponding troubleshooting rule of the failure subproblem is called if successful match.Module 802 is obtained to be also used to failure
The corresponding troubleshooting rule of problem one by one advises the corresponding troubleshooting of the failure subproblem according to sequence corresponding when decomposing
Then marshalling obtains the corresponding troubleshooting rule of the fault warning problem.
Specific about fault treating apparatus limits the restriction that may refer to above for fault handling method, herein not
It repeats again.Modules in above-mentioned fault treating apparatus can be realized fully or partially through software, hardware and combinations thereof.On
Stating each module can be embedded in the form of hardware or independently of in the processor in computer equipment, can also store in a software form
In memory in computer equipment, the corresponding operation of the above modules is executed in order to which processor calls.
In one embodiment, a kind of computer equipment is provided, which can be server, internal junction
Composition can be as shown in Figure 10.The computer equipment include by system bus connect processor, memory, network interface and
Database.Wherein, the processor of the computer equipment is for providing calculating and control ability.The memory packet of the computer equipment
Include non-volatile memory medium, built-in storage.The non-volatile memory medium is stored with operating system, computer program and data
Library.The built-in storage provides environment for the operation of operating system and computer program in non-volatile memory medium.The calculating
The database of machine equipment is for storing failure handling data.The network interface of the computer equipment is used to pass through with external terminal
Network connection communication.To realize a kind of fault handling method when the computer program is executed by processor.
It will be understood by those skilled in the art that structure shown in Figure 10, only part relevant to application scheme
The block diagram of structure, does not constitute the restriction for the computer equipment being applied thereon to application scheme, and specific computer is set
Standby may include perhaps combining certain components or with different component layouts than more or fewer components as shown in the figure.
In one embodiment, a kind of computer equipment, including memory and processor are provided, is stored in memory
Computer program, which performs the steps of when executing computer program obtains fault warning information;By the fault warning
Information carries out convergence process, and is converted to fault warning problem;Corresponding event is obtained from database according to the fault warning problem
Barrier processing rule;Troubleshooting is carried out according to the troubleshooting rule.
In one embodiment, it is also performed the steps of when processor executes computer program and obtains fault warning problem
With the script for solving the problems, such as the fault warning;The script is split into subprocessing operation;The subprocessing is recombinated to operate to form failure
Processing rule;The fault warning problem and the troubleshooting rule are stored in the database.
In one embodiment, it also performs the steps of and will go out within a preset time when processor executes computer program
Existing same fault warning information converges a fault warning information;The fault warning information is converted to corresponding failure to accuse
Alert problem.
In one embodiment, it also performs the steps of when processor executes computer program and is asked according to the fault warning
Inscribe the fault warning problem in matching database;When the fault warning problem is matched into the fault warning problem in the database
When function, the troubleshooting rule in the corresponding database of fault warning problem in the database is called.
In one embodiment, it also performs the steps of when processor executes computer program when the fault warning problem
When it fails to match with the fault warning problem in the database, the fault warning problem is reported;It is obtained according to the fault warning problem
Take the troubleshooting rule of input;The database is stored in by the fault warning problem is corresponding with the troubleshooting rule of the input
In.
In one embodiment, it also performs the steps of when processor executes computer program when the fault warning problem
When it fails to match with the fault warning problem in the database, identify that the fault warning problem obtains failure subproblem;According to this
Failure subproblem matches corresponding troubleshooting rule;The corresponding troubleshooting rule of the failure subproblem is formed the failure to accuse
The corresponding troubleshooting rule of police's problem.
In one embodiment, the data in the database are stored using Redis mode.
In one embodiment, a kind of computer readable storage medium is provided, computer program is stored thereon with, is calculated
Machine program performs the steps of when being executed by processor carries out convergence process for the fault warning information, and is converted to failure announcement
Alert problem;Corresponding troubleshooting rule is obtained from database according to the fault warning problem;According to the troubleshooting rule into
Row troubleshooting.
In one embodiment, acquisition fault warning is also performed the steps of when computer program is executed by processor to ask
Inscribe and solve the problems, such as the script of the fault warning;The script is split into subprocessing operation;The subprocessing is recombinated to operate to form event
Barrier processing rule;The fault warning problem and the troubleshooting rule are stored in the database.
In one embodiment, also performing the steps of when computer program is executed by processor will within a preset time
The same fault warning information of appearance converges a fault warning information;The fault warning information is converted to corresponding failure
Alarm problem.
In one embodiment, it also performs the steps of when computer program is executed by processor according to the fault warning
Fault warning problem in problem matching database;When the fault warning problem is matched with the fault warning problem in the database
When success, the troubleshooting rule in the corresponding database of fault warning problem in the database is called.
In one embodiment, it is also performed the steps of when computer program is executed by processor when the fault warning is asked
Topic with the database in fault warning problem it fails to match when, report the fault warning problem;According to the fault warning problem
Obtain the troubleshooting rule of input;The data are stored in by the fault warning problem is corresponding with the troubleshooting rule of the input
In library.
In one embodiment, it is also performed the steps of when computer program is executed by processor when the fault warning is asked
Topic with the database in fault warning problem it fails to match when, identify that the fault warning problem obtains failure subproblem;According to
The failure subproblem matches corresponding troubleshooting rule;The corresponding troubleshooting rule of the failure subproblem is formed into the failure
The corresponding troubleshooting rule of alarm problem.
In one embodiment, the data in the database are stored using Redis mode.
Those of ordinary skill in the art will appreciate that realizing all or part of the process in above-described embodiment method, being can be with
Relevant hardware is instructed to complete by computer program, the computer program can be stored in a non-volatile computer
In read/write memory medium, the computer program is when being executed, it may include such as the process of the embodiment of above-mentioned each method.Wherein,
To any reference of memory, storage, database or other media used in each embodiment provided herein,
Including non-volatile and/or volatile memory.Nonvolatile memory may include read-only memory (ROM), programming ROM
(PROM), electrically programmable ROM (EPROM), electrically erasable ROM (EEPROM) or flash memory.Volatile memory may include
Random access memory (RAM) or external cache.By way of illustration and not limitation, RAM is available in many forms,
Such as static state RAM (SRAM), dynamic ram (DRAM), synchronous dram (SDRAM), double data rate sdram (DDRSDRAM), enhancing
Type SDRAM (ESDRAM), synchronization link (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM
(RDRAM), direct memory bus dynamic ram (DRDRAM) and memory bus dynamic ram (RDRAM) etc..
Each technical characteristic of above embodiments can be combined arbitrarily, for simplicity of description, not to above-described embodiment
In each technical characteristic it is all possible combination be all described, as long as however, the combination of these technical characteristics be not present lance
Shield all should be considered as described in this specification.
The several embodiments of the application above described embodiment only expresses, the description thereof is more specific and detailed, but simultaneously
It cannot therefore be construed as limiting the scope of the patent.It should be pointed out that coming for those of ordinary skill in the art
It says, without departing from the concept of this application, various modifications and improvements can be made, these belong to the protection of the application
Range.Therefore, the scope of protection shall be subject to the appended claims for the application patent.
Claims (10)
1. a kind of fault handling method, which comprises
Obtain fault warning information;
The fault warning information is subjected to convergence process, and is converted to fault warning problem;
Corresponding troubleshooting rule is obtained from database according to the fault warning problem;
Troubleshooting is carried out according to the troubleshooting rule.
2. the method according to claim 1, wherein before the acquisition fault warning information, further includes:
It obtains fault warning problem and solves the problems, such as the script of the fault warning;
The script is split into subprocessing operation;
The subprocessing is recombinated to operate to form troubleshooting rule;
In the database by the fault warning problem and the corresponding storage of the troubleshooting rule.
3. the method according to claim 1, wherein it is described by the fault warning information carry out convergence process,
And it is converted to fault warning problem, comprising:
The same fault warning information occurred within a preset time is converged into a fault warning information;
The fault warning information is converted to corresponding fault warning problem.
4. being corresponded to the method according to claim 1, wherein being obtained according to the fault warning problem from database
Troubleshooting rule include:
According to the fault warning problem in the fault warning problem matching database;
When the fault warning problem successful match in the fault warning problem and the database, call in the database
The corresponding database of fault warning problem in troubleshooting rule.
5. according to the method described in claim 4, it is characterized in that, the method also includes:
When it fails to match for the fault warning problem in the fault warning problem and the database, the fault warning is reported
Problem;
The troubleshooting rule of input is obtained according to the fault warning problem;
In the database by the fault warning problem and the corresponding storage of the troubleshooting rule of the input.
6. according to the method described in claim 4, it is characterized in that, described obtain according to the fault warning problem from database
Corresponding troubleshooting rule, further includes:
When it fails to match for the fault warning problem in the fault warning problem and the database, the fault warning is identified
Problem obtains failure subproblem;
Corresponding troubleshooting rule is matched according to the failure subproblem;
The corresponding troubleshooting rule of the failure subproblem is formed into the corresponding troubleshooting rule of the fault warning problem.
7. the method according to claim 1, wherein the data in the database are stored using Redis mode.
8. a kind of fault treating apparatus, which is characterized in that described device includes:
Module is obtained, for obtaining fault warning information;It is corresponding for being obtained according to the fault warning problem from database
Troubleshooting rule;
Conversion module for the fault warning information to be carried out convergence process, and is converted to fault warning problem;
Processing module, for carrying out troubleshooting according to the troubleshooting rule.
9. a kind of computer equipment, including memory and processor, the memory are stored with computer program, feature exists
In the step of processor realizes any one of claims 1 to 7 the method when executing the computer program.
10. a kind of computer readable storage medium, is stored thereon with computer program, which is characterized in that the computer program
The step of method described in any one of claims 1 to 7 is realized when being executed by processor.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811002316.9A CN109144829B (en) | 2018-08-30 | 2018-08-30 | Fault processing method and device, computer equipment and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811002316.9A CN109144829B (en) | 2018-08-30 | 2018-08-30 | Fault processing method and device, computer equipment and storage medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109144829A true CN109144829A (en) | 2019-01-04 |
CN109144829B CN109144829B (en) | 2022-03-22 |
Family
ID=64829393
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811002316.9A Active CN109144829B (en) | 2018-08-30 | 2018-08-30 | Fault processing method and device, computer equipment and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109144829B (en) |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109757343A (en) * | 2019-03-22 | 2019-05-17 | 厦门鑫天兴科技发展有限公司 | A kind of irrigation method and system |
CN109871305A (en) * | 2019-01-18 | 2019-06-11 | 深圳壹账通智能科技有限公司 | Processing method, device, computer equipment and the storage medium of warning information |
CN110086682A (en) * | 2019-05-22 | 2019-08-02 | 四川新网银行股份有限公司 | Service link call relation view and failure root based on TCP are because of localization method |
CN110728498A (en) * | 2019-10-21 | 2020-01-24 | 北京百度网讯科技有限公司 | Information interaction method and device |
CN111769977A (en) * | 2020-06-17 | 2020-10-13 | 广州嘉为科技有限公司 | Processing method based on enterprise monitoring alarm event |
CN111835760A (en) * | 2020-07-10 | 2020-10-27 | 广州博冠信息科技有限公司 | Alarm information processing method and device, computer storage medium and electronic equipment |
CN112306794A (en) * | 2020-09-28 | 2021-02-02 | 国网吉林省电力有限公司信息通信公司 | Automatic processing method and device for typical fault scene of database |
CN113434327A (en) * | 2021-07-13 | 2021-09-24 | 上海浦东发展银行股份有限公司 | Fault processing system, method, equipment and storage medium |
CN114879636A (en) * | 2022-05-15 | 2022-08-09 | 浙江工业大学 | Chemical process fault isolation method based on fault interpreter |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101425924A (en) * | 2008-06-12 | 2009-05-06 | 广东高新兴通信股份有限公司 | Centralized monitoring system alarm data processing method |
CN101605346A (en) * | 2008-06-10 | 2009-12-16 | 中兴通讯股份有限公司 | The fault restoration method and apparatus |
CN105095523A (en) * | 2015-09-28 | 2015-11-25 | 浪潮(北京)电子信息产业有限公司 | Alarm event handling method and system |
CN105262616A (en) * | 2015-09-21 | 2016-01-20 | 浪潮集团有限公司 | Failure repository-based automated failure processing system and method |
CN107562556A (en) * | 2017-08-14 | 2018-01-09 | 腾讯科技(深圳)有限公司 | Restoration methods, recovery device and the storage medium of failure |
-
2018
- 2018-08-30 CN CN201811002316.9A patent/CN109144829B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101605346A (en) * | 2008-06-10 | 2009-12-16 | 中兴通讯股份有限公司 | The fault restoration method and apparatus |
CN101425924A (en) * | 2008-06-12 | 2009-05-06 | 广东高新兴通信股份有限公司 | Centralized monitoring system alarm data processing method |
CN105262616A (en) * | 2015-09-21 | 2016-01-20 | 浪潮集团有限公司 | Failure repository-based automated failure processing system and method |
CN105095523A (en) * | 2015-09-28 | 2015-11-25 | 浪潮(北京)电子信息产业有限公司 | Alarm event handling method and system |
CN107562556A (en) * | 2017-08-14 | 2018-01-09 | 腾讯科技(深圳)有限公司 | Restoration methods, recovery device and the storage medium of failure |
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109871305A (en) * | 2019-01-18 | 2019-06-11 | 深圳壹账通智能科技有限公司 | Processing method, device, computer equipment and the storage medium of warning information |
CN109871305B (en) * | 2019-01-18 | 2022-11-04 | 深圳壹账通智能科技有限公司 | Alarm information processing method and device, computer equipment and storage medium |
CN109757343A (en) * | 2019-03-22 | 2019-05-17 | 厦门鑫天兴科技发展有限公司 | A kind of irrigation method and system |
CN110086682A (en) * | 2019-05-22 | 2019-08-02 | 四川新网银行股份有限公司 | Service link call relation view and failure root based on TCP are because of localization method |
CN110086682B (en) * | 2019-05-22 | 2022-06-24 | 四川新网银行股份有限公司 | Service link calling relation view and fault root cause positioning method based on TCP |
CN110728498A (en) * | 2019-10-21 | 2020-01-24 | 北京百度网讯科技有限公司 | Information interaction method and device |
CN111769977A (en) * | 2020-06-17 | 2020-10-13 | 广州嘉为科技有限公司 | Processing method based on enterprise monitoring alarm event |
CN111835760A (en) * | 2020-07-10 | 2020-10-27 | 广州博冠信息科技有限公司 | Alarm information processing method and device, computer storage medium and electronic equipment |
CN111835760B (en) * | 2020-07-10 | 2023-03-24 | 广州博冠信息科技有限公司 | Alarm information processing method and device, computer storage medium and electronic equipment |
CN112306794A (en) * | 2020-09-28 | 2021-02-02 | 国网吉林省电力有限公司信息通信公司 | Automatic processing method and device for typical fault scene of database |
CN113434327A (en) * | 2021-07-13 | 2021-09-24 | 上海浦东发展银行股份有限公司 | Fault processing system, method, equipment and storage medium |
CN114879636A (en) * | 2022-05-15 | 2022-08-09 | 浙江工业大学 | Chemical process fault isolation method based on fault interpreter |
Also Published As
Publication number | Publication date |
---|---|
CN109144829B (en) | 2022-03-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109144829A (en) | Fault processing method and device, computer equipment and storage medium | |
CN110069572B (en) | HIVE task scheduling method, device, equipment and storage medium based on big data platform | |
CN111274052A (en) | Data distribution method, server, and computer-readable storage medium | |
CN109656782A (en) | Visual scheduling monitoring method, device and server | |
CN110611707B (en) | Task scheduling method and device | |
CN111026602A (en) | Health inspection scheduling management method and device of cloud platform and electronic equipment | |
CN101719097A (en) | Debug method and system of automation test script and automation application server | |
CN112035233A (en) | Big data batch job task scheduling method and device | |
DE102008044018A1 (en) | Method for determining a security level and security manager | |
CN114536339B (en) | Control method and device for cooperative robot, cooperative robot and storage medium | |
CN110569113A (en) | Method and system for scheduling distributed tasks and computer readable storage medium | |
CN110619014A (en) | ETL-based data extraction method | |
CN111381940B (en) | Distributed data processing method and device | |
CN113658351B (en) | Method and device for producing product, electronic equipment and storage medium | |
CN111522630B (en) | Method and system for executing planned tasks based on batch dispatching center | |
CN117407201A (en) | Distributed cloud platform fault self-healing method and system | |
CN115687019A (en) | Database cluster fault processing method, intelligent monitoring platform, equipment and medium | |
CN115686594A (en) | Terminal automatic operation and maintenance management system and method | |
CN113010210A (en) | Automatic operation and maintenance operation management method and system | |
CN116450305B (en) | SOAR platform assembly execution method and device based on distributed task scheduling | |
CN112825044B (en) | Task execution method, device and computer storage medium | |
CN117873691B (en) | Data processing method, device, equipment and readable storage medium | |
CN117076508B (en) | Method for supporting batch data processing by stream data processing system | |
CN116038687A (en) | Mobile robot scheduling method and device, robot system and computer equipment | |
CN117896232A (en) | Method and device for equipment operation and maintenance |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |